Lego: Dynamic Tensor-Splitting Multi-Tenant DNN Models on Multi-Chip-Module Architecture

Zhou Yu Xuan, Ching-Jui Lee, Tsung Tai Yeh

2022 19th International SoC Design Conference (ISOCC) (2022)

Abstract
Modern deep neural network (DNN) accelerators are designed to accelerate a single DNN model, which limits throughput for multi-tenant DNN data center applications. The multi-chip-module (MCM) architecture breaks a monolithic accelerator into multiple small chiplets. MCM is a promising approach that dispatches DNN models across chiplets, each with the same number of processing elements (PEs). However, it is challenging to distribute the data of DNN model layers with different parameters across chiplets while maximizing chiplet utilization. This work proposes Lego, an MCM architecture that dynamically adapts to the size of DNN model layers and improves the throughput of multi-tenant DNN applications by increasing chiplet utilization. Lego's dynamic scheduler achieves a geometric-mean speedup of 1.51× over a monolithic DNN accelerator.
Keywords
architecture, tensor-splitting, multi-tenant, multi-chip-module