A 409.6 GOPS and 204.8 GFLOPS Mixed-Precision Vector Processor System for General-Purpose Machine Learning Acceleration

2022 IEEE Asian Solid-State Circuits Conference (A-SSCC)(2022)

引用 1|浏览8
暂无评分
摘要
As machine learning (ML) technology has become critical in various artificial intelligence (AI) applications such as image classification, object detection, and natural language processing, the latest ML models require massive computations and various operations to attain high accuracy, as shown in Fig.1. In order to cope with the computational challenges, many hardware accelerators have been proposed [1-4]. However, most of them have focused on a specific or a few models in the target domain, as shown in the table, only to achieve high performance without considering generic usages for various applications. With little programmability, the previous accelerators are often unable to adapt to model updates and algorithmic changes or suffer from low utilization if they adopt a new model. In addition, many support only fixed-point data types and arithmetic units, which limits their usage to emerging ML models.
更多
查看译文
关键词
vector,gops,mixed-precision,general-purpose
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要