System For Random Access Dna Sequence Compression

2010 IEEE INTERNATIONAL CONFERENCE ON BIOINFORMATICS AND BIOMEDICINE WORKSHOPS (BIBMW)(2010)

引用 4|浏览21
暂无评分
摘要
DNA sequences are generally compressed by algorithms using approximate repeats that are found in most DNA sequences. The regions of DNA that are not part of a repeat are encoded by using arithmetic coder which estimates the probabilities for each symbol using a Markov model. Since arithmetic coding is used for compressing the bitstream, random access is very difficult in these methods as bthe itstream is tightly packed. Random access is a desirable feature as it enable to decompress only interesting regions in the sequence. This paper presents a system which uses the approximate repeats based compression algorithm and provides random access capability.
更多
查看译文
关键词
DNA Sequence Compression, Genomic Data Storage
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要