End to End Framework for CNN Acceleration on FPGAs with Dynamic Algorithm Mapping.

International Conference on Contemporary Computing (IC3)(2022)

引用 0|浏览4
暂无评分
摘要
As CNNs are becoming more diverse with respect to per-layer computation characteristics, per-layer strategy selection and fine-grained tuning are required to achieve low end-to-end latency. DYNAMAP is a framework for efficiently selecting algorithms for different layers and re-using a unified accelerator. However, the end-to-end deployment of DYNAMAP faces some challenges in terms of productivity and portability. In this work, we develop an API for automatically extracting the ONNX graph from high-level programming libraries such as Pytorch and Tensorflow. We further integrate this with the front-end of DYNAMAP to increase its portability. We then define an HLS template of the accelerator supporting three parallel algorithm choices for convolution operations to increase the productivity of generating hardware. Using three state-of-the-art CNNs, we demonstrate that our framework is able to automatically deploy CNN models to achieve minimum latency which incurs very low overhead in the deployment process.
更多
查看译文
关键词
cnn acceleration,fpgas,end framework,mapping
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要