An optimized FPGA architecture for machine learning applications

AEU-INTERNATIONAL JOURNAL OF ELECTRONICS AND COMMUNICATIONS(2024)

引用 1|浏览0
暂无评分
摘要
FPGAs are currently the most suitable hardware accelerators to implement and accommodate the non-stop growth of machine learning applications. This paper presents an FPGA architecture with added posit multipliers that outweigh the current IEEE-754 multipliers in terms of delay and area. Since machine learning algorithms involve a lot of expensive mathematical operations, having such powerful multipliers in the proposed FPGA architecture will execute the needed operations with high efficiency, which will stand out for machine learning applications without compromising other FPGA applications. Experimental results using Verilog to Routing (VTR) on both machine learning and non-machine learning benchmarks have demonstrated that our proposed architecture consumes 10% less area than the Stratix IV FPGA. Furthermore, it consumes less power compared to both the Stratix IV and Stratix 10 FPGAs, with reductions of 22% and 10%, respectively. Additionally, our architecture outperforms the Stratix IV and Stratix 10 FPGAs in terms of critical path delay, with improvements of 25.5% and 43%, respectively.
更多
查看译文
关键词
FPGA architectures,Hardware accelerators,IEEE-754 multipliers,Machine learning applications,Posit multipliers,Verilog to Routing (VTR)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要