PIM-HLS: An Automatic Hardware Generation Tool for Heterogeneous Processing-In-Memory-based Neural Network Accelerators.

DAC(2023)

引用 1|浏览4
暂无评分
摘要
Processing-in-memory (PIM) architectures have shown great abilities for neural network (NN) acceleration on edge devices that demand low latency under severe area constraints. Heterogeneous PIM architectures with different PIM implementation approaches such as RRAM-based PIM and SRAMbased PIM can further improve the performance. However, the automatic generation of heterogeneous PIM architectures faces the following two unresolved problems. First, existing work has not considered the design for heterogeneous PIM-based NN accelerators with multiple memory technologies. Second, for PIM with insufficient memory on edge devices, it is challenging to find the optimal runtime weight scheduling strategy in an O(L!) optimization space for the NN with L layers. In this paper, we propose PIM-HLS, an automatic hardware generation tool for heterogeneous PIM-based NN accelerators. Aiming at the problems above, we first point out that heterogeneous PIM can improve the performance under severe area constraints. Then we optimize the architectures for each NN layer by taking the advantage of different memory technologies. We also define the optimization problem of runtime weight scheduling and mapping for the first time, and propose a dynamic-programming-based weight scheduling algorithm to reduce the optimization space to O(L-2). We implement PIM-HLS to automatically generate the hardware code and the instructions. Results show that we achieve an averagely 5.9x speedup with 72.8% less area compared with state-of-the-art PIM designs.
更多
查看译文
关键词
automatic generation,automatic hardware generation tool,different memory technologies,dynamic-programming-based weight,edge devices,heterogeneous PIM architectures,heterogeneous PIM-based NN accelerators,Heterogeneous Processing-In-Memory-based Neural Network Accelerators,multiple memory technologies,neural network acceleration,optimal runtime weight scheduling strategy,optimization space,PIM-HLS,processing-in-memory architectures,RRAM-based PIM,severe area constraints,SRAM-based PIM,state-of-the-art PIM designs
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要