Performance Analysis of SIMD Vectorization of High-Order Finite-Element Kernels

2018 International Conference on High Performance Computing & Simulation (HPCS), 2018

Abstract
Physics-based three-dimensional numerical simulations are becoming more predictive and are already essential for improving the understanding of natural phenomena such as earthquakes, tsunamis, flooding, climate change, and global warming. Among the numerical methods available to support these simulations, finite-element formulations have been implemented in several major software packages. The efficiency of these algorithms remains a challenge because their irregular memory accesses prevent them from reaching the maximum performance of current architectures. This is particularly true at the shared-memory level, which combines several levels of parallelism with complex memory hierarchies. Despite significant efforts, the automatic optimizations provided by compilers and high-level frameworks often fall well short of the performance obtained from hand-tuned implementations. In this paper, we extract a kernel from the EFISPEC software package developed at BRGM (the French Geological Survey). This application implements a high-order finite-element method to solve the elastodynamic equation. We characterize the performance of the extracted mini-app with respect to key parameters such as the order of the approximation, the memory access pattern, and the vector length. Based on this study, we detail specific optimizations and discuss the measured results with respect to the roofline performance model on Intel Broadwell and Skylake architectures.
Keywords
physics-based three-dimensional numerical simulations, natural phenomena, earthquakes, flooding, global warming, numerical methods, software packages, irregular memory access, shared-memory level, complex memory hierarchies, automatic optimizations, hand-tuned implementations, EFISPEC software package, French Geological Survey, high-order finite-element method, memory access pattern, vector length, roofline performance model, SIMD vectorization, finite-element formulations, high-order finite-element kernels
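
The abstract evaluates its results against the roofline performance model. As a rough illustration of what that model bounds, the sketch below computes the attainable performance of a kernel as the minimum of the compute peak and the product of memory bandwidth and arithmetic intensity. The function names and the machine figures (PEAK_GFLOPS, BANDWIDTH_GBS) are hypothetical placeholders chosen for round numbers; they are not values from the paper and do not describe Broadwell or Skylake.

```python
def attainable_gflops(peak_gflops, bandwidth_gbs, arithmetic_intensity):
    """Roofline bound: a kernel is limited either by the compute peak
    or by memory bandwidth times its arithmetic intensity (flop/byte)."""
    return min(peak_gflops, bandwidth_gbs * arithmetic_intensity)


def ridge_point(peak_gflops, bandwidth_gbs):
    """Arithmetic intensity (flop/byte) at which a kernel stops being
    memory-bound and becomes compute-bound."""
    return peak_gflops / bandwidth_gbs


# Hypothetical machine figures, for illustration only (not measured values).
PEAK_GFLOPS = 1000.0
BANDWIDTH_GBS = 100.0

if __name__ == "__main__":
    print("ridge point (flop/byte):", ridge_point(PEAK_GFLOPS, BANDWIDTH_GBS))
    for intensity in (0.5, 10.0, 50.0):
        bound = attainable_gflops(PEAK_GFLOPS, BANDWIDTH_GBS, intensity)
        regime = "memory-bound" if intensity < ridge_point(PEAK_GFLOPS, BANDWIDTH_GBS) else "compute-bound"
        print(f"intensity={intensity} flop/byte -> bound={bound} GFLOP/s ({regime})")
```

Under this model, a kernel whose arithmetic intensity lies below the ridge point cannot benefit from wider vectors unless its memory traffic is also reduced, which is one reason the memory access pattern and the vector length are studied together.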