Memristive Boltzmann Machine: A Hardware Accelerator For Combinatorial Optimization And Deep Learning

PROCEEDINGS OF THE 2016 IEEE INTERNATIONAL SYMPOSIUM ON HIGH-PERFORMANCE COMPUTER ARCHITECTURE (HPCA-22)(2016)

引用 252|浏览76
暂无评分
摘要
The Boltzmann machine is a massively parallel computational model capable of solving a broad class of combinatorial optimization problems. In recent years, it has been successfully applied to training deep machine learning models on massive datasets. High performance implementations of the Boltzmann machine using GPUs, MPI-based HPC clusters, and FPGAs have been proposed in the literature. Regrettably, the required all-to-all communication among the processing units limits the performance of these efforts.This paper examines a new class of hardware accelerators for large-scale combinatorial optimization and deep learning based on memristive Boltzmann machines. A massively parallel, memory-centric hardware accelerator is proposed based on recently developed resistive RAM (RRAM) technology. The proposed accelerator exploits the electrical properties of RRAM to realize in situ, fine-grained parallel computation within memory arrays, thereby eliminating the need for exchanging data between the memory cells and the computational units. Two classical optimization problems, graph partitioning and boolean satisfiability, and a deep belief network application are mapped onto the proposed hardware. As compared to a multicore system, the proposed accelerator achieves 57x higher performance and 25x lower energy with virtually no loss in the quality of the solution to the optimization problems. The memristive accelerator is also compared against an RRAM based processing-in-memory (PIM) system, with respective performance and energy improvements of 6.89x and 5.2x.
更多
查看译文
关键词
parallel computational model,combinatorial optimization problems,deep machine learning models,high performance implementations,GPU,MPI-based HPC clusters,FPGAs,all-to-all communication,hardware accelerators,large-scale combinatorial optimization,memristive Boltzmann machines,massively parallel memory-centric hardware accelerator,resistive RAM technology,RRAM technology,electrical properties,fine-grained parallel computation,memory arrays,graph partitioning,boolean satisfiability,deep belief network application,multicore system,memristive accelerator,RRAM based processing-in-memory system,RRAM PIM system
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要