Increasing the Efficiency of Data Storage and Analysis Using Indexed Compression

Oxford(2009)

引用 5|浏览0
暂无评分
摘要
The massive data sets produced by the high- throughput, multidimensional mass spectrometry instruments used in proteomics create challenges in data acquisition, storage and analysis. Data compression can help mitigate some of these problems but at the cost of less efficient data access, which directly impacts the computational time of data analysis. We have developed a compression methodology that 1) is optimized for a targeted mass spectrometry proteomics data set and 2) provides the benefits of size and speed from compression while increasing analysis efficiency by allowing extraction of segments of uncompressed data from a file without having to uncompress the entire file. This paper describes our compression algorithm, presents comparative metrics of compression size and speed, and explores approaches for applying the algorithm to a generalized data set.
更多
查看译文
关键词
data analysis,compression algorithm,indexed compression,compression methodology,uncompressed data,data acquisition,data storage,data compression,generalized data,compression size,massive data,efficient data access,high throughput,spectroscopy,data mining,proteins,compression algorithms,mass spectrometry,proteomics,data access,algorithm design and analysis,encoding
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要