GTuner: tuning DNN computations on GPU via graph attention network

Design Automation Conference (DAC), 2022

Abstract
Compiling DNN models into high-performance GPU code remains an open problem. A novel framework, GTuner, is proposed to jointly learn from the structures of computational graphs and the statistical features of codes to find the optimal code implementations. A Graph ATtention network (GAT) is designed as the performance estimator in GTuner. In GAT, graph neural layers propagate information across the computational graph, and a multi-head self-attention module learns the complicated relationships between the features. Under the guidance of GAT, GPU codes are generated through auto-tuning. Experimental results demonstrate that our method outperforms the prior art remarkably.
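The abstract names the two ingredients of the GAT estimator: graph neural layers that propagate information along the computational graph, and a multi-head self-attention module over the resulting node features. The paper's exact layer definitions, dimensions, and training objective are not given here, so the following is only a minimal NumPy sketch of that general architecture; all layer formulations, parameter names, and the final pooled-regression head are assumptions for illustration.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gnn_layer(H, A, W):
    """One message-passing layer: mean-aggregate neighbor features via the
    adjacency matrix A, then apply a linear transform with ReLU.
    (A common GNN formulation; the paper's exact layer is not specified.)"""
    deg = A.sum(axis=1, keepdims=True)
    deg[deg == 0] = 1.0
    return np.maximum((A @ H) / deg @ W, 0.0)

def multi_head_self_attention(H, Wq, Wk, Wv, n_heads):
    """Scaled dot-product self-attention over node features, split into
    n_heads heads and concatenated back (standard multi-head attention)."""
    n, d = H.shape
    Q, K, V = H @ Wq, H @ Wk, H @ Wv
    dh = d // n_heads
    outs = []
    for h in range(n_heads):
        s = slice(h * dh, (h + 1) * dh)
        scores = Q[:, s] @ K[:, s].T / np.sqrt(dh)
        outs.append(softmax(scores, axis=-1) @ V[:, s])
    return np.concatenate(outs, axis=1)

def estimate_performance(node_feats, adj, code_feats, params):
    """Hypothetical estimator head: propagate graph features, attend over
    nodes, mean-pool, fuse with statistical code features, emit a score."""
    H = gnn_layer(node_feats, adj, params["W1"])
    H = multi_head_self_attention(H, params["Wq"], params["Wk"],
                                  params["Wv"], n_heads=2)
    g = H.mean(axis=0)                        # pooled graph embedding
    z = np.concatenate([g, code_feats])       # fuse with code statistics
    return float(z @ params["w_out"])         # scalar performance score

# Toy demo: 5 graph nodes with 3 raw features, hidden width 4,
# plus 2 statistical code features. All weights are random placeholders.
rng = np.random.default_rng(0)
n, d_in, d, d_code = 5, 3, 4, 2
adj = (rng.random((n, n)) < 0.4).astype(float)
params = {
    "W1": rng.normal(size=(d_in, d)),
    "Wq": rng.normal(size=(d, d)),
    "Wk": rng.normal(size=(d, d)),
    "Wv": rng.normal(size=(d, d)),
    "w_out": rng.normal(size=(d + d_code,)),
}
score = estimate_performance(rng.normal(size=(n, d_in)), adj,
                             rng.normal(size=(d_code,)), params)
```

In an auto-tuning loop, such an estimator would rank candidate code implementations so that only the most promising ones are compiled and measured on the real GPU.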