CASSANN-v2: A high-performance CNN accelerator architecture with on-chip memory self-adaptive tuning

IEICE ELECTRONICS EXPRESS(2022)

引用 5|浏览25
暂无评分
摘要
This work proposes a high-performance reconfigurable CNN accelerator architecture, called CASSANN-v2, which can achieve 1TOPS peak performance at 1 GHz. CASSANN-v2 provides the function of on-chip SRAM memory real-time adaptive tuning by parameter configuration to reduce the intermediate output data transmission to further exploit the acceleration performance. The system simulation results show that CASSANN-v2 exhibits excellent performance on VGG-16 and ResNet-18 inference, with a throughput of 1009.54GOPS and 923.24GOPS at 1 GHz, which achieved 98.59% and 90.20% average processing element utilization, respectively. Compared with state-of-the-art accelerator works, CASSANN-v2 improves the resource utilization by 2.02x in VGG-16 and 2.35x in ResNet-18.
更多
查看译文
关键词
SoC design, convolutional neural network (CNN) accelerator, high-performance accelerator, architecture optimization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要