Performance Issues of SYRK Implementations in Shared Memory Environments for Edge Cases

2018 21st International Conference of Computer and Information Technology (ICCIT)(2018)

引用 0|浏览26
暂无评分
摘要
The symmetric rank-k update (SYRK) is a level-3 BLAS routine commonly used by many Data Mining/Machine Learning(DM/ML) algorithms such as regression, dimensionality reduction algorithms like PCA, matrix factorization and k-mean clustering. This paper presents a comprehensive analysis of the SYRK routine under popular dense linear algebra libraries such as OpenBLAS, Intel MKL, and BLIS particularly focusing on edge cases of dense matrices (thin or fat shapes) that are common in DM/ML applications. Our work identifies some performance issues of the SYRK routine in multi-threaded shared memory environments for edge cases and discuss matrix dependent modifications for performance improvement.
更多
查看译文
关键词
Fats,Symmetric matrices,Shape,Libraries,Scalability,Approximation algorithms,Computer science
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要