A Novel Multi-Level Integrated Roofline Model Approach For Performance Characterization

High Performance Computing, ISC High Performance 2018 (2018)

Abstract
With energy-efficient architectures, including accelerators and many-core processors, gaining traction, application developers face the challenge of optimizing their applications for multiple hardware features, including many-core parallelism, wide vector processing units, and on-chip high-bandwidth memory. In this paper, we discuss the development and use of a new application performance tool based on an extension of the classical Roofline model that simultaneously profiles multiple levels of the cache-memory hierarchy. This tool provides a powerful visual aid for the developer and can be used to frame the many-dimensional optimization problem in a tractable way. We present case studies of real scientific applications for which the Integrated Roofline Model has yielded insights.
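As a rough illustration of the idea, the classical Roofline bound for one memory level is min(peak compute, arithmetic intensity × bandwidth), and a multi-level view applies that bound to each level of the hierarchy using the traffic observed at that level. The sketch below is not the tool described in the paper; the function names, level names, peak rate, bandwidths, and traffic volumes are all hypothetical values chosen for illustration, not measurements.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, gflop, gbytes):
    """Classical Roofline bound for one memory level.

    Arithmetic intensity (FLOP/byte) is work divided by the data
    moved through this level; the bound is capped by peak compute.
    """
    intensity = gflop / gbytes
    return min(peak_gflops, intensity * bandwidth_gbs)


def multi_level_roofline(peak_gflops, levels, gflop):
    """Apply the Roofline bound to every level of the hierarchy.

    levels maps a level name to (bandwidth in GB/s, traffic in GB).
    Returns the per-level bounds and the name of the tightest one.
    """
    bounds = {
        name: attainable_gflops(peak_gflops, bw, gflop, gbytes)
        for name, (bw, gbytes) in levels.items()
    }
    limiter = min(bounds, key=bounds.get)
    return bounds, limiter


# Hypothetical kernel: 10 GFLOP of work, with less traffic reaching
# each successive level because of cache reuse (assumed numbers).
PEAK_GFLOPS = 2000.0
LEVELS = {
    "L1": (8000.0, 40.0),
    "L2": (2000.0, 20.0),
    "MCDRAM": (400.0, 8.0),
    "DRAM": (90.0, 4.0),
}

bounds, limiter = multi_level_roofline(PEAK_GFLOPS, LEVELS, gflop=10.0)
for name, value in bounds.items():
    print(f"{name}: at most {value:.0f} GFLOP/s")
print("Tightest bound:", limiter)
```

With these assumed numbers the kernel has a different arithmetic intensity at each level, so a single run yields one point per level; the level with the lowest bound indicates where optimization effort is most likely to pay off.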
Keywords
Performance models, Roofline, Knights Landing, Application performance measurement