Sensitivity Analysis of Memory Bandwidth on Column-superposed Versatile Linear CGRA

Tomoya Akabe, Ryotaro Funai,Yasuhiko Nakashima

2023 21st IEEE Interregional NEWCAS Conference (NEWCAS)(2023)

引用 0|浏览1
暂无评分
摘要
The time required for DMA between IMAX's cache memory and ARM's main memory using AXI transactions was a bottleneck for IMAX, a CGRA-type accelerator hosted by an ARM processor and connected via an AXI interface. Therefore, we modified the IMAX accelerator to have eight DMA channels and take full advantage of the AXI interface. Evaluation of a 7200x7200 matrix multiplication without pre-transposition showed that the improved IMAX reduced the time to write input data from ARM main memory to cache memory by about 45% and the overall time by about 20% compared to the conventional IMAX.
更多
查看译文
关键词
DMA,CGRA,Memory Bandwidth
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要