Performance Analysis Of Coarse-Grained Parallel Genetic Algorithms On The Multicore Sun Ultrasparc T1

PROCEEDINGS OF THE IEEE SOUTHEASTCON 2009, TECHNICAL PROCEEDINGS(2009)

引用 14|浏览15
暂无评分
摘要
The new generation of shared memory multicore processors with multiple parallel execution paths provides a promising hardware platform for applications with high degree of task-level parallelism (TLP). Genetic Algorithm (GA), a widely-used evolutionary meta-heuristic optimization method, Is a unique candidate In this class of applications and demonstrates significant amount of explicit and implicit parallelism. In this paper, we present the performance characteristics of a GA optimizing a placement problem on a Sun UltraSPARC T1 processor. To Investigate the behavior of the benchmark, we vary both algorithm-specific parameters as well as the size of the target problem. The system performance is evaluated by monitoring throughput, cycle-per-instruction (CPI) and, the memory access patterns for different core and thread combinations. Our experiments show that for a constant data size, as the number of threads per core Increase from 1 to 4, the throughput of the system Increases by 84% keeping all cores active. Similarly, as we increase the number of cores in the system, the throughput of the system increases by a factor of 3. The average memory bandwidth Is seen to scale In proportion to throughput for both core-scaling and thread-scaling. The overall Increase in throughput, either by core-scaling or thread-scaling, in spite of growing memory bandwidth, shows the ability of the multi-threaded multicore processor to hide long latency memory accesses for the targeted benchmark.
更多
查看译文
关键词
hardware,gallium,throughput,sun,genetic algorithms,benchmark testing,system performance,cycles per instruction,multicore processing,memory bandwidth,parallel processing,multi core processor,bandwidth,scalability,genetic algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要