Harnessing the Crowd for Autotuning High-Performance Computing Applications.

IPDPS(2023)

引用 0|浏览22
暂无评分
摘要
This paper presents GPTuneCrowd, a crowd-based autotuning framework for tuning high-performance computing applications. GPTuneCrowd collects performance data from various users using a user-friendly tuner interface. GPTuneCrowd then presents novel autotuning techniques, based on transfer learning and parameter sensitivity analysis, to maximize tuning quality using collected data from the crowd. This paper shows several real-world case studies of GPTuneCrowd. Our evaluation shows that GPTuneCrowd's transfer learning improves the tuned performance of ScaLAPACK's PDGEQRF by 1.57x and a plasma fusion code NIMROD by 2.97x, over a non-transfer learning autotuner. We use GPTuneCrowd's sensitivity analysis to reduce the search space of SuperLU DIST and Hypre. Tuning on the reduced search space achieves 1.17x and 1.35x better tuned performance of SuperLU DIST and Hypre, respectively, compared to the original search space.
更多
查看译文
关键词
autotuning,crowd-based autotuning,transfer learning,sensitivity analysis,Exascale Computing Project
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要