Mars: Accelerating MapReduce with Graphics Processors

IEEE Transactions on Parallel and Distributed Systems(2011)

引用 190|浏览0
暂无评分
摘要
We design and implement Mars, a MapReduce runtime system accelerated with graphics processing units (GPUs). MapReduce is a simple and flexible parallel programming paradigm originally proposed by Google, for the ease of large-scale data processing on thousands of CPUs. Compared with CPUs, GPUs have an order of magnitude higher computation power and memory bandwidth. However, GPUs are designed as special-purpose coprocessors and their programming interfaces are less familiar than those on the CPUs to MapReduce programmers. To harness GPUs' power for MapReduce, we developed Mars to run on NVIDIA GPUs, AMD GPUs as well as multicore CPUs. Furthermore, we integrated Mars into Hadoop, an open-source CPU-based MapReduce system. Mars hides the programming complexity of GPUs behind the simple and familiar MapReduce interface, and automatically manages task partitioning, data distribution, and parallelization on the processors. We have implemented six representative applications on Mars and evaluated their performance on PCs equipped with GPUs as well as multicore CPUs. The experimental results show that, the GPU-CPU coprocessing of Mars on an NVIDIA GTX280 GPU and an Intel quad-core CPU outperformed Phoenix, the state-of-the-art MapReduce on the multicore CPU with a speedup of up to 72 times and 24 times on average, depending on the applications. Additionally, integrating Mars into Hadoop enabled GPU acceleration for a network of PCs.
更多
查看译文
关键词
large-scale data processing,public domain software,mapreduce,nvidia gpus,state-of-the-art mapreduce,computer graphic equipment,accelerating mapreduce,mars hide,parallel programming,mars,graphics processing units,integrated mars,mapreduce programmer,multicore processor,google,flexible parallel programming,many-core architecture.,harness gpus,multicore cpus,parallel computing,graphics processor,runtime system,hadoop,graphics processors,mapreduce runtime system,familiar mapreduce interface,open-source cpu-based mapreduce system,mapreduce system,acceleration,programming,indexing terms,memory bandwidth,multicore processors,bandwidth,memory management,data processing,parallel computer
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要