Approach to Improve the Performance Using Bit-level Sparsity in Neural Networks

DATE (2021)

Abstract
This paper presents a convolutional neural network (CNN) accelerator that achieves speedup and higher energy efficiency by skipping zero weights and handling outliers, which are few in number but have a significant impact on CNN accuracy. We propose an offline weight-scheduling algorithm that skips zero weights and combines two non-outlier weights simultaneously by exploiting the bit-level sparsity of CNNs. We use a reconfigurable multiplier-and-accumulator (MAC) unit for two purposes: it usually computes two combined non-outlier weights, and occasionally computes outliers. We further improve the speedup of our accelerator by clipping some of the outliers with negligible accuracy loss. Compared to the DaDianNao [7] and Bit-Tactical [16] architectures, our CNN accelerator improves speed by 3.34x and 2.31x and reduces energy consumption by 29.3% and 30.2%, respectively.
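To make the scheduling idea concrete, below is a minimal Python sketch of how such an offline pass might work. The 8-bit weight range, the 4-bit non-outlier threshold (OUTLIER_BITS), the pairing policy, and the clip_below cutoff are illustrative assumptions, not the authors' exact algorithm: zero weights are skipped, weights with few significant bits are paired two per reconfigurable MAC slot, weights with many significant bits are treated as outliers, and some small outliers can be clipped into the pairable range.

```python
# A minimal sketch of the offline weight-scheduling idea from the abstract.
# The 8-bit weights, the 4-bit non-outlier threshold, the pairing policy,
# and the clipping cutoff are all illustrative assumptions, not the
# authors' exact algorithm.

import random

OUTLIER_BITS = 4  # assumed: a weight needing more than 4 significant bits is an outlier


def significant_bits(w: int) -> int:
    """Bits needed to represent the magnitude of a quantized weight."""
    return abs(w).bit_length()


def schedule_weights(weights):
    """Partition weights into paired non-outliers (two per reconfigurable
    MAC slot) and outliers (one full-precision slot each); zeros are skipped."""
    nonoutliers, outliers = [], []
    for idx, w in enumerate(weights):
        if w == 0:
            continue                      # zero weights are skipped entirely
        if significant_bits(w) <= OUTLIER_BITS:
            nonoutliers.append((idx, w))  # few significant bits: can be combined
        else:
            outliers.append((idx, w))     # rare but accuracy-critical
    # two non-outliers share one MAC slot; a leftover weight gets its own slot
    pairs = [tuple(nonoutliers[i:i + 2]) for i in range(0, len(nonoutliers), 2)]
    return pairs, outliers


def clip_outliers(weights, clip_below=32):
    """Clip small outliers into the non-outlier range so they become
    pairable (assumed policy; the paper reports negligible accuracy loss)."""
    limit = (1 << OUTLIER_BITS) - 1       # largest non-outlier magnitude
    return [max(-limit, min(limit, w)) if limit < abs(w) < clip_below else w
            for w in weights]


if __name__ == "__main__":
    random.seed(0)
    w = [random.randint(-128, 127) if random.random() < 0.5 else 0
         for _ in range(16)]
    pairs, outliers = schedule_weights(w)
    print(f"{len(pairs)} paired MAC slots, {len(outliers)} outlier slots")
    pairs2, outliers2 = schedule_weights(clip_outliers(w))
    print(f"after clipping: {len(pairs2)} paired slots, {len(outliers2)} outlier slots")
```

Clipping moves borderline outliers into the non-outlier range, so more weights can share a MAC slot; the paper reports this costs negligible accuracy while improving speedup.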
Keywords
bit-level sparsity, convolutional neural network accelerator, zero weights, outlier handling, offline weight-scheduling algorithm, non-outlier weights, multiplier-and-accumulator unit, combined non-outliers, negligible accuracy loss, CNN accelerator