Contact Matrix Compressor

2022 Data Compression Conference (DCC)(2022)

引用 0|浏览9
暂无评分
摘要
The study of three-dimensional folding of chromosomes is important to understand genomics processes. This is done through techniques, such as Hi-C, that analyze the spatial organization of chromosomes in a cell. The data coming from the study is a 2-dimensional quantitative maps with genomic coordinate systems. We present a novel approach called Contact Matrix Compressor(CMC) for the efficient compression of Hi-C data. By exploiting the properties of the data, such as diagonally dominant and symmetrical, CMC achieves a much higher compression. CMC outperforms the existing method Cooler, and also the generic compression methods LZMA as well as BZip2.
更多
查看译文
关键词
genomics processes,Hi-C,genomic coordinate systems,efficient compression,higher compression,generic compression methods,BZip2,contact matrix compressor,chromosome 3D folding,2D quantitative maps,CMC,generic compression method
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要