Compression of Whole Genome Alignments

IEEE Transactions on Information Theory(2010)

引用 18|浏览0
暂无评分
摘要
Recent advances in DNA sequencing technology have caused an exponential growth of publicly available genomic sequence data. A particularly voluminous, frequently used static data set are whole genome alignments. The first lossless compression algorithm for such data sets based on well-established statistical evolutionary models and prediction techniques from lossless binary image compression is introduced. The compression rate is improved by a factor of 1.6 compared to the currently used Lempel-Ziv (LZ) compression.
更多
查看译文
关键词
DNA,biological techniques,biology computing,data compression,evolution (biological),genetics,genomics,molecular biophysics,statistical analysis,DNA sequencing,Lempel-Ziv compression,genomic sequence data,lossless binary image compression,lossless compression algorithm,statistical evolutionary models,whole genome alignment compression,Compression,genetics,lossless binary image compression,multiple sequence alignment,probabilistic models of evolution,whole genome alignment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要