Designing scalable b-Matching algorithms on distributed memory multiprocessors by approximation.

SC(2016)

Abstract
A b-Matching is a subset of edges M such that at most b(v) edges in M are incident on each vertex v, where b(v) is specified. We present a distributed-memory parallel algorithm, b-Suitor, that computes a b-Matching with more than half the maximum weight in a graph with weights on the edges. The approximation algorithm is designed to have high concurrency and low time complexity. We organize the implementation of the algorithm in terms of asynchronous supersteps that combine computation and communication, and balance the computational work and frequency of communication to obtain high performance. Since the performance of the b-Suitor algorithm is strongly influenced by communication, we present several strategies to reduce the communication volume. We implement the algorithm using a hybrid strategy where inter-node communication uses MPI and intra-node computation is done with OpenMP threads. We demonstrate strong and weak scaling of b-Suitor up to 16K processors on two supercomputers at NERSC. We compute a b-Matching in a graph with 2 billion edges in under 4 seconds using 16K processors.
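To make the definition concrete, here is a minimal sequential sketch in Python. It is not the b-Suitor algorithm and involves no MPI or OpenMP. It is the classical greedy heuristic, which scans edges in order of decreasing weight and is known to return a b-Matching with at least half the maximum weight. The function names, the edge-tuple layout, and the capacity dictionary `b` are assumptions chosen for illustration.

```python
from collections import defaultdict


def greedy_b_matching(edges, b):
    """Greedy half-approximation sketch (NOT the paper's b-Suitor).

    edges: iterable of (u, v, weight) tuples (hypothetical layout).
    b:     dict mapping each vertex to its capacity b(v).
    """
    load = defaultdict(int)  # number of matched edges incident on each vertex
    matching = []
    # Heaviest edges first; ties broken by endpoints so the result is deterministic.
    for u, v, w in sorted(edges, key=lambda e: (-e[2], e[0], e[1])):
        # Keep the edge only if both endpoints still have spare capacity.
        if u != v and load[u] < b.get(u, 0) and load[v] < b.get(v, 0):
            matching.append((u, v, w))
            load[u] += 1
            load[v] += 1
    return matching


def is_b_matching(matching, b):
    """Check the defining constraint: at most b(v) chosen edges touch v."""
    load = defaultdict(int)
    for u, v, _ in matching:
        load[u] += 1
        load[v] += 1
    return all(load[x] <= b.get(x, 0) for x in load)


def total_weight(matching):
    return sum(w for _, _, w in matching)
```

For example, with edges `[("a", "b", 5), ("b", "c", 4), ("c", "d", 3), ("a", "c", 2)]` and b(v) = 1 for every vertex, the sketch keeps the edges a-b and c-d. Raising b(b) and b(c) to 2 also admits the edge b-c.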
Keywords
scalable b-matching algorithm, distributed memory multiprocessors, distributed memory parallel algorithm, graph, approximation algorithm, high performance computing, b-suitor algorithm, MPI, intra-node computation, OpenMP threads