E-LSTM: Efficient Inference of Sparse LSTM on Embedded Heterogeneous System

Proceedings of the 56th Annual Design Automation Conference 2019(2019)

引用 20|浏览47
暂无评分
摘要
Various models with Long Short-Term Memory (LSTM) network have demonstrated prior art performances in sequential information processing. Previous LSTM-specific architectures set large on-chip memory for weight storage to alleviate the memory-bound issue and facilitate the LSTM inference in cloud computing. In this paper, E-LSTM is proposed for embedded scenarios with the consideration of the chip-area and limited data-access bandwidth. The heterogeneous hardware in E-LSTM tightly couples an LSTM co-processor with an embedded RISC-V CPU. The eSELL format is developed to represent the sparse weight matrix. With the proposed cell fusion optimization based on the inherent sparsity in computation, E-LSTM achieves up to 2.2× speedup of processing throughput.
更多
查看译文
关键词
sparse LSTM,embedded heterogeneous system,sequential information processing,on-chip memory,memory-bound issue,LSTM inference,limited data-access bandwidth,heterogeneous hardware,LSTM co-processor,embedded RISC-V CPU,sparse weight matrix,long short-term memory network,E-LSTM,processing throughput
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要