Efficient sparse-matrix multi-vector product on GPUs.

Changwan Hong, Aravind Sukumaran-Rajam, Bortik Bandyopadhyay, Jinsung Kim, Süreyya Emre Kurt, Israt Nisa, Shivani Sabhlok, Ümit V. Çatalyürek, Srinivasan Parthasarathy, P. Sadayappan

HPDC (2018)

Abstract

Sparse Matrix-Vector (SpMV) and Sparse Matrix-Multivector (SpMM) products are key kernels for computational science and data science. While GPUs offer significantly higher peak performance and memory bandwidth than multicore CPUs, achieving high performance on sparse computations on GPUs is very challenging. A tremendous amount of recent research has focused on various GPU implementations of the SpMV kernel. But the multi-vector SpMM kernel has received much less attention. In this paper, we present an in-depth analysis to contrast SpMV and SpMM, and develop a new sparse-matrix representation and computation approach suited to achieving high data-movement efficiency and effective GPU parallelization of SpMM. Experimental evaluation using the entire SuiteSparse matrix suite demonstrates significant performance improvement over existing SpMM implementations from vendor libraries.
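
To make the contrast between the two kernels concrete, here is a minimal CPU-side sketch in plain Python. It assumes the standard CSR (compressed sparse row) layout, which is not the new representation the paper proposes, and the function names `csr_spmv` and `csr_spmm` are hypothetical. The point is only that SpMM reuses each nonzero of the sparse matrix across all dense vectors, whereas SpMV uses it once.

```python
def csr_spmv(row_ptr, col_idx, vals, x):
    """Compute y = A @ x for a CSR matrix A and a dense vector x."""
    n_rows = len(row_ptr) - 1
    y = [0.0] * n_rows
    for i in range(n_rows):
        # Nonzeros of row i live in vals[row_ptr[i]:row_ptr[i + 1]].
        for k in range(row_ptr[i], row_ptr[i + 1]):
            y[i] += vals[k] * x[col_idx[k]]
    return y


def csr_spmm(row_ptr, col_idx, vals, X):
    """Compute Y = A @ X for a CSR matrix A and a dense matrix X (list of rows)."""
    n_rows = len(row_ptr) - 1
    n_vec = len(X[0])
    Y = [[0.0] * n_vec for _ in range(n_rows)]
    for i in range(n_rows):
        for k in range(row_ptr[i], row_ptr[i + 1]):
            # Each nonzero is loaded once and reused for every dense column.
            a = vals[k]
            x_row = X[col_idx[k]]
            for j in range(n_vec):
                Y[i][j] += a * x_row[j]
    return Y


# Illustrative 2x3 matrix A = [[1, 0, 2], [0, 3, 0]] in CSR form.
row_ptr = [0, 2, 3]
col_idx = [0, 2, 1]
vals = [1.0, 2.0, 3.0]

y = csr_spmv(row_ptr, col_idx, vals, [1.0, 1.0, 1.0])
Y = csr_spmm(row_ptr, col_idx, vals, [[1.0, 2.0], [1.0, 0.0], [1.0, 1.0]])
```

The first column of `Y` equals `y` because the first column of the dense matrix is the same all-ones vector. A GPU implementation would distribute the row and column loops across threads; this sketch makes no attempt to model that.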

Keywords

Sparse Matrix-Vector Multiplication, Sparse Matrix-Matrix Multiplication, Sparse Matrix Multi-Vector Multiplication, GPU
