NeuroPIM: Felxible Neural Accelerator for Processing-in-Memory Architectures

Ali Monavari Bidgoli, Sepideh Fattahi,Seyyed Hossein Seyyedaghaei Rezaei,Mehdi Modarressi,Masoud Daneshtalab

2023 26th International Symposium on Design and Diagnostics of Electronic Circuits and Systems (DDECS)（2023）

引用 0|浏览3

暂无评分

摘要

The performance of microprocessors under many modern workloads is mainly limited by the off-chip memory bandwidth. The emerging process-in-memory paradigm present a unique opportunity to reduce data movement overheads by moving computation closer to memory. State-of-the-art processing-in-memory proposals stack a logic layer on top of one or multiple memory layers in a 3D fashion and leverage the logic layer to build near-memory processing units. Such processing units are either application-specific accelerators or general-purpose cores. In this paper, we present NeuroPIM, a new processing-in-memory architecture that uses a neural network as the memory-side general-purpose accelerator. This design is mainly motivated by the observation that in many real-world applications, some program regions, or even the entire program, can be replaced by a neural network that is learned to approximate the program’s output. NeuroPIM benefits from both the flexibility of general-purpose processors and superior performance of application-specific accelerators. Experimental results show that NeuroPIM provides up to 41% speedup over a processor-side neural network accelerator and up to 8x speedup over a general-purpose processor.

查看译文

关键词

Processing-in-memory, Hardware acceleration, Neural network

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要