A 3D Implementation of Convolutional Neural Network for Fast Inference.

ISCAS(2023)

引用 0|浏览16
暂无评分
摘要
Low latency inference has many applications in edge machine learning. In this paper, we present a run-time configurable convolutional neural network (CNN) inference ASIC design for low-latency edge machine learning. By implementing a 5-stage pipelined CNN inference model in a 3D ASIC technology, we demonstrate that the model distributed on two dies utilizing face-to-face (F2F) 3D integration achieves superior performance. Our experimental results show that the design based on 3D integration achieves 43% better energy-delay product when compared to the traditional 2D technology.
更多
查看译文
关键词
3D ASIC technology,3D implementation,3D integration achieves superior performance,5-stage pipelined CNN inference model,edge machine learning,fast inference,low latency inference,run-time configurable convolutional neural network inference ASIC design
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要