Scalable Hybrid Loop- And Task-Parallel Matrix Inversion For Multicore Processors

2021 IEEE INTERNATIONAL PARALLEL AND DISTRIBUTED PROCESSING SYMPOSIUM WORKSHOPS (IPDPSW)(2021)

引用 2|浏览8
暂无评分
摘要
We propose a hybrid parallelization scheme for matrix inversion on multicore processors that combines a look-ahead technique to extract task-parallelism, at a high level, with loop-level parallelism to ensure an efficient utilization of the processor memory subsystem. As a result, our scheme outperforms the conventional approach for dense linear algebra operations, which simply extracts parallelism from a multi-threaded instance of the BLAS (Basic Linear Algebra Subprograms), but also the alternative based on OpenMP task-parallelism only, which is supposed to offer higher scalability. We provide an extensive collection of experiments supporting our remarks on two recent Intel- and ARM-based architectures, with a very large count of cores.
更多
查看译文
关键词
Loop-parallelism, task-parallelism, OpenMP, matrix inversion, high performance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要