Distributed OpenMP Offloading of OpenMC on Intel GPU MAX Accelerators
arxiv(2024)
摘要
Monte Carlo (MC) simulations play a pivotal role in diverse scientific and
engineering domains, with applications ranging from nuclear physics to
materials science. Harnessing the computational power of high-performance
computing (HPC) systems, especially Graphics Processing Units (GPUs), has
become essential for accelerating MC simulations. This paper focuses on the
adaptation and optimization of the OpenMC neutron and photon transport Monte
Carlo code for Intel GPUs, specifically the Intel Data Center Max 1100 GPU
(codename Ponte Vecchio, PVC), through distributed OpenMP offloading. Building
upon prior work by Tramm J.R., et al. (2022), which laid the groundwork for GPU
adaptation, our study meticulously extends the OpenMC code's capabilities to
Intel GPUs. We present a comprehensive benchmarking and scaling analysis,
comparing performance on Intel MAX GPUs to state-of-the-art CPU execution
(Intel Xeon Platinum 8480+ Processor, codename 4th generation Sapphire Rapids).
The results demonstrate a remarkable acceleration factor compared to CPU
execution, showcasing the GPU-adapted code's superiority over its CPU
counterpart as computational load increases.
更多查看译文
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要