Design and Analysis of a Neural Network Inference Engine Based on Adaptive Weight Compression.

IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (2019)

Abstract
Neural networks generally require significant memory capacity and bandwidth to store and access a large number of synaptic weights. This paper presents the design of an energy-efficient neural network inference engine based on adaptive weight compression using a JPEG image encoding algorithm. To maximize compression ratio with minimum accuracy loss, the quality factor of the JPEG encoder is adaptively control...
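To make the role of the quality factor concrete, the following is a minimal sketch of JPEG-style lossy coding applied to one 8x8 block of weights. It is an illustration under stated assumptions, not the paper's engine: the min/max mapping of weights to the 0..255 range, the function names (`scaled_table`, `compress_block`, `decompress_block`), and the sample weights are all hypothetical. The quantization table is the standard JPEG luminance table, scaled with the quality formula used by the libjpeg reference encoder. Entropy coding and the adaptive quality-factor selection mentioned in the abstract are left out.

```python
import math

N = 8
# Standard JPEG luminance quantization table (ITU-T T.81, Annex K).
BASE_Q = [
    [16, 11, 10, 16, 24, 40, 51, 61],
    [12, 12, 14, 19, 26, 58, 60, 55],
    [14, 13, 16, 24, 40, 57, 69, 56],
    [14, 17, 22, 29, 51, 87, 80, 62],
    [18, 22, 37, 56, 68, 109, 103, 77],
    [24, 35, 55, 64, 81, 104, 113, 92],
    [49, 64, 78, 87, 103, 121, 120, 101],
    [72, 92, 95, 98, 112, 100, 103, 99],
]

def scaled_table(quality):
    """Scale BASE_Q by a quality factor in 1..100 (libjpeg-style scaling)."""
    quality = max(1, min(100, quality))
    scale = 5000 / quality if quality < 50 else 200 - 2 * quality
    return [[max(1, min(255, int((q * scale + 50) / 100))) for q in row]
            for row in BASE_Q]

def _c(k):
    # Normalization that makes the 8-point DCT-II orthonormal.
    return math.sqrt(1 / N) if k == 0 else math.sqrt(2 / N)

COS = [[math.cos((2 * x + 1) * u * math.pi / (2 * N)) for x in range(N)]
       for u in range(N)]

def dct2(block):
    """2-D DCT-II of an 8x8 block (direct O(N^4) form, for clarity)."""
    return [[_c(u) * _c(v) * sum(block[x][y] * COS[u][x] * COS[v][y]
                                 for x in range(N) for y in range(N))
             for v in range(N)] for u in range(N)]

def idct2(coef):
    """Inverse of dct2."""
    return [[sum(_c(u) * _c(v) * coef[u][v] * COS[u][x] * COS[v][y]
                 for u in range(N) for v in range(N))
             for y in range(N)] for x in range(N)]

def compress_block(weights, quality):
    """Map weights to a pixel-like range, transform, and quantize.

    Assumption: weights are mapped linearly from [min, max] to [0, 255]
    and kept as floats; the level shift by 128 follows JPEG.
    """
    flat = [w for row in weights for w in row]
    lo = min(flat)
    span = (max(flat) - lo) or 1.0  # avoid dividing by zero on flat blocks
    pixels = [[(w - lo) / span * 255.0 - 128.0 for w in row] for row in weights]
    table = scaled_table(quality)
    coef = dct2(pixels)
    quant = [[round(coef[u][v] / table[u][v]) for v in range(N)]
             for u in range(N)]
    return quant, lo, span

def decompress_block(quant, lo, span, quality):
    """Dequantize, inverse-transform, and map back to the weight range."""
    table = scaled_table(quality)
    coef = [[quant[u][v] * table[u][v] for v in range(N)] for u in range(N)]
    pixels = idct2(coef)
    return [[(p + 128.0) / 255.0 * span + lo for p in row] for row in pixels]

def nonzeros(quant):
    return sum(1 for row in quant for q in row if q != 0)

def max_abs_error(a, b):
    return max(abs(a[i][j] - b[i][j]) for i in range(N) for j in range(N))

# Hypothetical smooth weight block, used only for demonstration.
weights = [[0.1 * math.sin(0.3 * i) * math.cos(0.2 * j) for j in range(N)]
           for i in range(N)]

if __name__ == "__main__":
    for quality in (10, 50, 90):
        quant, lo, span = compress_block(weights, quality)
        restored = decompress_block(quant, lo, span, quality)
        print(quality, nonzeros(quant), max_abs_error(weights, restored))
```

A lower quality factor enlarges the quantization steps, so more coefficients round to zero and fewer values remain to be stored, at the cost of a larger reconstruction error in the weights. That is the trade-off a per-block or per-layer quality setting would have to balance.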
Keywords
Image coding, Transform coding, Training, Engines, Entropy, Memory management, Discrete cosine transforms