Performance Tuning and Analysis for Stencil-Based Applications on POWER8 Processor.

TACO (2019)

Abstract
This article demonstrates an approach for combining general tuning techniques with the POWER8 hardware architecture through optimizing three representative stencil benchmarks. Two typical real-world applications, with kernels similar to those of the winning programs of the Gordon Bell Prize 2016 and 2017, are employed to illustrate algorithm modifications and a combination of hardware-oriented tuning strategies with the application algorithms. This work fills the gap between hardware capability and software performance of the POWER8 processor, and provides useful guidance for optimizing stencil-based scientific applications on POWER systems.
Keywords
POWER CPU, atmospheric simulation, performance optimization, scientific applications, stencil