A hybrid CPU/GPU method for Hartree-Fock self-consistent-field calculation.

The Journal of chemical physics(2023)

引用 0|浏览2
暂无评分
摘要
The calculation of two-electron repulsion integrals (ERIs) is a crucial aspect of Hartree-Fock calculations. In computing the ERIs of varying angular momentum, both the central processing unit (CPU) and the graphics processing unit (GPU) have their respective advantages. To accelerate the ERI evaluation and Fock matrix generation, a hybrid CPU/GPU method has been proposed to maximize the computational power of both CPU and GPU while overlapping the CPU and GPU computations. This method employs a task queue where each task corresponds to ERIs with the same angular momentum. The queue begins with ERIs of low angular momentum, which are computationally efficient on GPUs, and ends with ERIs of high angular momentum, which are better suited for CPU computation. CPUs and GPUs dynamically grab and complete tasks from the start and end of the queue using OpenMP dynamic scheduling until all tasks are finished. The hybrid CPU/GPU computation offers the advantage of enabling calculations with arbitrary angular momentum. Test calculations showed that the hybrid CPU/GPU algorithm is more efficient than "GPU-only" when using a single GPU. However, as more GPUs are involved, the advantage diminishes or disappears. The scaling exponents of the hybrid method were slightly higher than "GPU-only," but the pre-exponent factor was significantly lower, making the hybrid method more effective overall.
更多
查看译文
关键词
cpu/gpu method,hybrid cpu/gpu,hartree–fock,self-consistent-field
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要