A Computation- and Communication-Optimal Parallel Direct 3-Body Algorithm

New Orleans, LA(2014)

引用 12|浏览24
暂无评分
摘要
Traditional particle simulation methods are used to calculate pair wise potentials, but some problems require 3-body potentials that calculate over triplets of particles. A direct calculation of 3-body interactions involves O(n3) interactions, but has significant redundant computations that occur in a nested loop formulation. In this paper we explore algorithms for 3-body computations that simultaneously optimize three criteria: computation minimization through symmetries, communication optimality, and load balancing. We present a new 3-body algorithm that is both communication and computation optimal. Its optional replication factor, c, saves c3 in latency (number of messages) and c2 in bandwidth (volume), with bounded load imbalance. We also consider the k-body case and discuss an algorithm that is optimal if there is a cut off distance of less than 1/3 of the domain. The 3-body algorithm demonstrates 99% efficiency on tens of thousands of cores, showing strong scaling properties with order of magnitude speedups over the nïve algorithm.
更多
查看译文
关键词
parallel algorithms,resource allocation,3-body computations,3-body interactions,bounded load imbalance,communication optimality,communication-optimal parallel direct 3-body algorithm,computation minimization,computation-optimal parallel direct 3-body algorithm,k-body case,load balancing,nested loop formulation,optional replication factor,particle simulation methods,communication-avoiding algorithms,n-body,parallel algorithms,particle methods
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要