A Hybrid Parallel Delaunay Image-To-Mesh Conversion Algorithm Scalable On Distributed-Memory Clusters

Computer-Aided Design(2016)

引用 7|浏览9
暂无评分
摘要
In this paper, we present a scalable three dimensional hybrid MPI+ Threads parallel Delaunay image-to-mesh conversion algorithm. A nested master-worker communication model for parallel mesh generation is implemented which simultaneously explores process-level parallelization and thread-level parallelization: inter-node communication using MPI and inter-core communication inside one node using threads. In order to overlap the communication (task request and data movement) and computation (parallel mesh refinement), the inter-node MPI communication and intra-node local mesh refinement is separated. The master thread that initializes the MPI environment is in charge of the inter-node MPI communication while the worker threads of each process are only responsible for the local mesh refinement within the node. We conducted a set of experiments to test the performance of the algorithm on Turing, a distributed memory cluster at Old Dominion University High Performance Computing Center and observed that the granularity of coarse level data decomposition, which affects the coarse level concurrency, has a significant influence on the performance of the algorithm. With the proper value of granularity, the algorithm expresses impressive performance potential and is scalable to 30 distributed memory compute nodes with 20 cores each (the maximum number of nodes available for us in the experiments). (C) 2016 The Authors. Published by Elsevier Ltd.
更多
查看译文
关键词
Hybrid Programming,Parallel Mesh Generation,Nested Master-Worker Model,Two-Level Parallelization
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要