HPCC RandomAccess benchmark for next generation supercomputers

Rome(2009)

引用 26|浏览0
暂无评分
摘要
In this paper we examine the key elements determining the performance of the HPC Challenge RandomAccess benchmark on next generation supercomputers. We find that the performance of this benchmark is closely related to the bisection bandwidth of the underlying communication network, performance of integer divide operation and details of benchmark specifications such as error tolerance and permissible multi-core mapping strategies. We demonstrate that seemingly small and innocuous changes in the benchmark can lead to significantly different system performance. We also present an algorithm to optimize RandomAccess benchmark for multi-core systems. Our algorithm uses aggregation and software routing and balances the load on the cores by specializing each of the cores for one specific routing or update function. This algorithm gives approximately a factor of 3 speedup on the Blue Gene/P system which is based on quad-core nodes.
更多
查看译文
关键词
specific routing,multi-core system,blue gene,software routing,next generation supercomputers,hpcc randomaccess benchmark,randomaccess benchmark,p system,benchmark specification,permissible multi-core mapping strategy,hpc challenge randomaccess benchmark,different system performance,communication networks,routing,random access,benchmark testing,bandwidth,system performance
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要