RACECAR: a heuristic for automatic function specialization on multi-core heterogeneous systems

PPOPP(2012)

引用 12|浏览7
暂无评分
摘要
High-performance computing systems increasingly combine multi-core processors and heterogeneous resources such as graphics-processing units and field-programmable gate arrays. However, significant application design complexity for such systems has often led to untapped performance potential. Application designers targeting such systems currently must determine how to parallelize computation, create device-specialized implementations for each heterogeneous resource, and determine how to partition work for each resource. In this paper, we present the RACECAR heuristic to automate the optimization of applications for multi-core heterogeneous systems by automatically exploring implementation alternatives that include different algorithms, parallelization strategies, and work distributions. Experimental results show RACECAR-specialized implementations achieve speedups up to 117x and average 11x compared to a single CPU thread when parallelizing computation across multiple cores, graphics-processing units, and field-programmable gate arrays.
更多
查看译文
关键词
parallelizing computation,field-programmable gate array,heterogeneous resource,graphics-processing unit,work distribution,high-performance computing system,multi-core heterogeneous system,automatic function specialization,significant application design complexity,application designer,multi-core processor,optimization,multi core processor,speedup,parallel computer,field programmable gate array
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要