Proteus: Exploiting precision variability in deep neural networks.

Parallel Computing(2018)

引用 22|浏览210
暂无评分
摘要
•Analyses a set of deep neural networks in terms of required precision per layer.•Describes a method of finding the Pareto frontier in accuracy vs. memory bandwidth.•Shows that required precision varies significantly between layers and networks.•Proposes a variable precision hardware compression method.•Reduces a neural network accelerator’s memory traffic by 43% and energy by 15%.
更多
查看译文
关键词
Machine learning,Neural networks,Deep learning,Accelerators,Approximate computing,Reduced precision
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要