Lupulus: A Flexible Hardware Accelerator for Neural Networks

ICASSP(2020)

引用 1|浏览52
暂无评分
摘要
Neural networks have become indispensable for a wide range of applications, but they suffer from high computational- and memory-requirements, requiring optimizations from the algorithmic description of the network to the hardware implementation. Moreover, the high rate of innovation in machine learning makes it important that hardware implementations provide a high level of programmability to support current and future requirements of neural networks. In this work, we present a flexible hardware accelerator for neural networks, called Lupulus, supporting various methods for scheduling and mapping of operations onto the accelerator. Lupulus was implemented in a 28nm FD-SOI technology and demonstrates a peak performance of 380 GOPS/GHz with latencies of 21.4ms and 183.6ms for the convolutional layers of AlexNet and VGG-16, respectively.
更多
查看译文
关键词
machine learning,VCG-16,AlexNet,28nm FD-SOI technology,future requirements,hardware implementation,high computationaland memory-requirements,neural networks,flexible hardware accelerator,Lupulus
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要