Truss decomposition on shared-memory parallel systems
2017 IEEE High Performance Extreme Computing Conference (HPEC)(2017)
Abstract
The scale of data used in graph analytics grows at an unprecedented rate. More than ever, domain experts require efficient and parallel algorithms for tasks in graph analytics. One such task is the truss decomposition, which is a hierarchical decomposition of the edges of a graph and is closely related to the task of triangle enumeration. As evidenced by the recent GraphChallenge, existing algorithms and implementations for truss decomposition are insufficient for the scale of modern datasets. In this work, we propose a parallel algorithm for computing the truss decomposition of massive graphs on a shared-memory system. Our algorithm breaks a computation-efficient serial algorithm into several bulk-synchronous parallel steps that do not rely on atomics or other fine-grained synchronization. We evaluate our algorithm across a variety of synthetic and real-world datasets on a 56-core Intel Xeon system. Our serial implementation achieves a speedup of over 1400× over the provided GraphChallenge serial benchmark implementation and is up to 28× faster than the state-of-the-art shared-memory parallel algorithm.
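To make the link between truss decomposition and triangle enumeration concrete, here is a minimal serial sketch of the classic edge-peeling approach. It is an illustration only, not the bulk-synchronous parallel algorithm proposed in the paper. It assumes the common convention that every edge of a k-truss lies in at least k - 2 triangles of that subgraph, so an edge's truss number is the largest such k. The function name `truss_decomposition`, the helper `key`, and the edge-list input with integer vertex ids are hypothetical choices made for this example.

```python
def truss_decomposition(edges):
    """Return {(u, v): truss number} for an undirected simple graph.

    Assumes integer vertex ids; each edge is reported with u < v.
    """
    def key(a, b):
        # Canonical orientation so each undirected edge has one dict key.
        return (a, b) if a < b else (b, a)

    # Build adjacency sets, skipping self-loops; duplicates collapse.
    adj = {}
    for u, v in edges:
        if u == v:
            continue
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)

    # Support of an edge = number of triangles that contain it,
    # i.e. the number of common neighbours of its endpoints.
    support = {}
    for u in adj:
        for v in adj[u]:
            if u < v:
                support[(u, v)] = len(adj[u] & adj[v])

    truss = {}
    k = 2
    while support:
        # Edges that cannot belong to a (k + 1)-truss.
        peel = [e for e, s in support.items() if s <= k - 2]
        if not peel:
            k += 1
            continue
        for u, v in peel:
            # Removing (u, v) destroys every triangle (u, v, w);
            # the two other edges of each triangle lose one support.
            for w in adj[u] & adj[v]:
                support[key(u, w)] -= 1
                support[key(v, w)] -= 1
            adj[u].discard(v)
            adj[v].discard(u)
            truss[(u, v)] = k
            del support[(u, v)]
    return truss


if __name__ == "__main__":
    # A 4-clique on vertices 0..3 plus a pendant edge (3, 4).
    g = [(0, 1), (0, 2), (0, 3), (1, 2), (1, 3), (2, 3), (3, 4)]
    for edge, t in sorted(truss_decomposition(g).items()):
        print(edge, t)
```

In this sketch the support counts are updated one edge at a time. Splitting that update phase into parallel steps without atomics is the kind of problem the paper addresses; the rescan of all remaining edges in each round is kept here for brevity rather than efficiency.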
Keywords
computation-efficient serial algorithm, shared-memory parallel algorithm, bulk-synchronous parallel steps, truss decomposition, graph analytics, shared-memory parallel systems