Dynamic Rate Neural Acceleration Using Multiprocessing Mode Support

IEEE Transactions on Very Large Scale Integration (VLSI) Systems (2022)

Abstract
Multiobject detection has become an integral component of neural applications such as autonomous driving and augmented reality. Such a system must recognize and process multiple objects simultaneously, and its performance requirements change dynamically with the number of regions of interest (ROIs) in each frame. The processing unit (PU) of the neural acceleration system should therefore support multiple inference rates. We present MultiLockOn, a field-programmable gate array (FPGA)-based dynamic rate neural acceleration system that changes its inference performance according to the number of ROIs per frame. It supports multiprocessing modes with different speeds through novel multi-mode processing engines (PEs), which use a minimal set of reconfigurable interconnections across inference modes to limit hardware overhead. By supporting these multiprocessing modes, MultiLockOn improves inference performance by up to $4\times$ over DNNWeaver and $5.7\times$ over the ARM Cortex-A53 with minimal accuracy loss.
Keywords
Accelerator, approximation, neural networks, region of interest (ROI), weight quantization
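
The abstract describes choosing an inference rate from the number of ROIs in a frame. The following is a minimal sketch of that idea only, not the paper's implementation: the mode names, the `rois_per_frame` capacities, and the `select_mode` helper are all hypothetical, and the sketch assumes that faster modes cost some accuracy, so the slowest mode that still covers the frame is preferred.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Mode:
    """A hypothetical processing mode (names and numbers are illustrative)."""
    name: str
    rois_per_frame: int  # assumed max ROIs this mode handles in one frame period


# Hypothetical modes, ordered from slowest to fastest.
MODES = (
    Mode("base", 1),
    Mode("fast", 2),
    Mode("fastest", 4),
)


def select_mode(num_rois: int, modes=MODES) -> Mode:
    """Return the slowest mode whose capacity covers num_rois.

    Falls back to the fastest mode when no mode has enough capacity.
    """
    if num_rois < 0:
        raise ValueError("num_rois must be non-negative")
    for mode in modes:
        if num_rois <= mode.rois_per_frame:
            return mode
    return modes[-1]


if __name__ == "__main__":
    for n in (0, 1, 2, 3, 10):
        print(n, select_mode(n).name)
```

In this sketch the per-frame decision is a simple threshold lookup; a real system would derive the thresholds from measured latency and accuracy for each mode.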