A Scalable Architecture for CNN Accelerators Leveraging High-Performance Memories

2020 IEEE High Performance Extreme Computing Conference (HPEC)(2020)

引用 0|浏览16
暂无评分
摘要
As FPGA-based accelerators become ubiquitous and more powerful, the demand for integration with High-Performance Memory (HPM) grows. Although HPMs offer a much greater bandwidth than standard DDR4 DRAM, they introduce new design challenges such as increased latency and higher bandwidth mismatch between memory and FPGA cores. This paper presents a scalable architecture for convolutional neural network accelerators conceived specifically to address these challenges and make full use of the memory's high bandwidth. The accelerator, which was designed using high-level synthesis, is highly configurable. The intrinsic parallelism of its architecture allows near-perfect scaling up to saturating the available memory bandwidth.
更多
查看译文
关键词
scalable architecture,CNN accelerators leveraging High-Performance memories,FPGA-based accelerators,High-Performance Memory,greater bandwidth,standard DDR4 DRAM,design challenges,higher bandwidth mismatch,FPGA cores,convolutional neural network accelerators,high-level synthesis,available memory bandwidth
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要