Parallel lossless data compression on the GPU

Innovative Parallel Computing (2012)

Abstract
We present parallel algorithms and implementations of a bzip2-like lossless data compression scheme for GPU architectures. Our approach parallelizes three main stages in the bzip2 compression pipeline: Burrows-Wheeler transform (BWT), move-to-front transform (MTF), and Huffman coding. In particular, we utilize a two-level hierarchical sort for BWT, design a novel scan-based parallel MTF algorithm, and implement a parallel reduction scheme to build the Huffman tree. For each algorithm, we perform detailed performance analysis, discuss its strengths and weaknesses, and suggest future directions for improvements. Overall, our GPU implementation is dominated by BWT performance and is 2.78× slower than bzip2, with BWT and MTF-Huffman respectively 2.89× and 1.34× slower on average.
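
For orientation, the three stages named in the abstract can be sketched serially. This is a minimal CPU reference under stated assumptions, not the paper's GPU method: the BWT sorts rotations naively instead of using a two-level hierarchical sort, the MTF is the plain sequential version rather than a scan-based one, and the Huffman code lengths come from a heap-based serial build rather than a parallel reduction. The function names `bwt`, `mtf`, and `huffman_code_lengths` are hypothetical and do not come from the paper or from bzip2.

```python
import heapq
from collections import Counter


def bwt(data: bytes) -> tuple[bytes, int]:
    """Burrows-Wheeler transform by naively sorting all rotations.

    Returns the last column of the sorted rotation matrix and the row
    index of the original string (needed to invert the transform).
    """
    n = len(data)
    if n == 0:
        return b"", 0
    doubled = data + data  # rotation i is doubled[i:i + n]
    order = sorted(range(n), key=lambda i: doubled[i:i + n])
    last_column = bytes(doubled[i + n - 1] for i in order)
    return last_column, order.index(0)


def mtf(data: bytes) -> list[int]:
    """Sequential move-to-front transform over the 256-byte alphabet."""
    alphabet = list(range(256))
    out = []
    for byte in data:
        rank = alphabet.index(byte)  # current position of the symbol
        out.append(rank)
        alphabet.pop(rank)           # move the symbol to the front
        alphabet.insert(0, byte)
    return out


def huffman_code_lengths(symbols) -> dict[int, int]:
    """Huffman code length per symbol, built serially with a min-heap."""
    freq = Counter(symbols)
    if not freq:
        return {}
    if len(freq) == 1:
        return {next(iter(freq)): 1}  # a lone symbol still needs one bit
    # Heap entries: (count, unique tie-breaker, symbols in this subtree).
    heap = [(count, tie, [sym])
            for tie, (sym, count) in enumerate(sorted(freq.items()))]
    heapq.heapify(heap)
    lengths = dict.fromkeys(freq, 0)
    tie = len(heap)
    while len(heap) > 1:
        c1, _, s1 = heapq.heappop(heap)
        c2, _, s2 = heapq.heappop(heap)
        for sym in s1 + s2:          # merged symbols sit one level deeper
            lengths[sym] += 1
        heapq.heappush(heap, (c1 + c2, tie, s1 + s2))
        tie += 1
    return lengths
```

Chaining the stages on a small input shows why they are combined: `bwt(b"banana")` gives `(b"nnbaaa", 3)`, grouping equal bytes together; `mtf` turns that into `[110, 0, 99, 99, 0, 0]`, where the runs become zeros; and `huffman_code_lengths` then assigns the frequent zero a 1-bit code and the other two ranks 2-bit codes.
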
Keywords
huffman codes, data compression, graphics processing units, parallel algorithms, telecommunication computing, trees (mathematics), bwt, burrows-wheeler transform, gpu architectures, huffman coding, huffman tree, mtf, bzip2-like lossless data compression scheme, move-to-front transform, parallel lossless data compression, parallel reduction scheme, scan-based parallel mtf algorithm, instruction sets, encoding, histograms