Parallel Architecture With Resistive Crosspoint Array for Dictionary Learning Acceleration

IEEE Journal of Emerging and Selected Topics in Circuits and Systems(2015)

引用 68|浏览62
暂无评分
摘要
This paper proposes a parallel architecture with resistive crosspoint array. The design of its two essential operations, read and write, is inspired by the biophysical behavior of a neural system, such as integrate-and-fire and local synapse weight update. The proposed hardware consists of an array with resistive random access memory (RRAM) and CMOS peripheral circuits, which perform matrix-vector multiplication and dictionary update in a fully parallel fashion, at the speed that is independent of the matrix dimension. The read and write circuits are implemented in 65 nm CMOS technology and verified together with an array of RRAM device model built from experimental data. The overall system exploits array-level parallelism and is demonstrated for accelerated dictionary learning tasks. As compared to software implementation running on a 8-core CPU, the proposed hardware achieves more than 3000 speedup, enabling high-speed feature extraction on a single chip.
更多
查看译文
关键词
CMOS integration, dictionary learning, memristive device, parallel computing, resistive crosspoint array
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要