Compilation and Optimizations for Efficient Machine Learning on Embedded Systems

arXiv (2022)

Abstract
Deep Neural Networks (DNNs) have achieved great success in a variety of machine learning (ML) applications, delivering high-quality inference solutions in fields such as computer vision, natural language processing, and virtual reality. However, DNN-based ML applications also bring greatly increased computational and storage demands, which are particularly challenging for embedded systems with limited compute and storage resources, tight power budgets, and small form factors. Further challenges arise from diverse application-specific requirements, including real-time responsiveness, high-throughput performance, and reliable inference accuracy. To address these challenges, we introduce a series of effective design methodologies, including efficient ML model designs, customized hardware accelerator designs, and hardware/software co-design strategies, to enable efficient ML applications on embedded systems.
Keywords
efficient machine learning, compilation, machine learning, optimizations