A fast, energy-efficient abstraction for simultaneous breadth-first searches

2015 IEEE High Performance Extreme Computing Conference (HPEC)(2015)

引用 4|浏览42
暂无评分
摘要
Optimized GPU kernels are sufficiently complicated to write that they often are specialized to input data, target architectures, or applications. This paper presents a multi-search abstraction for computing multiple breadth-first searches in parallel and demonstrates a high-performance, general implementation. Our abstraction removes the burden of orchestrating graph traversal from the user while providing high performance and low energy usage, an often overlooked component of algorithm design. Energy consumption has become a first-class hardware design constraint for both massive and embedded computing platforms. Our abstraction can be applied to such problems as the all-pairs shortest-path problem, community detection, reachability querying, and others. To map graph traversal efficiently to the GPU, our hybrid implementation chooses between processing active vertices with a single thread or an entire warp based on vertex outdegree. For a set of twelve varied graphs, the implementation of our abstraction saves 42% time and 62% energy on average compared to representative implementations of specific applications from existing literature.
更多
查看译文
关键词
energy-efficient abstraction,simultaneous breadth-first searches,optimized GPU kernels,multisearch abstraction,multiple breadth-first searches,graph traversal,energy consumption,first-class hardware design constraint,embedded computing platforms,massive computing platforms,all-pairs shortest-path problem,vertex outdegree
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要