XML Lossy Text Compression: A Preliminary Study

DATABASE AND XML TECHNOLOGIES, PROCEEDINGS(2009)

Abstract
Lossy compression techniques have been applied to image and text compression, yielding compression factors far higher than those of lossless compression schemes. In this paper, we present a preliminary study of a set of lossy transformations for XML documents that preserve their semantics. Inspired by earlier techniques such as lossy text compression and literate programming, we apply a simple algorithm to XML syntactic constructs to lose superfluous layout information and redundant text. The resulting XML remains both human-readable and machine-readable. It can also occupy considerably less space and improve the effectiveness of conventional text compressors, making it a promising technology for several data management tasks.
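As a rough illustration of what losing "superfluous layout information" might look like, the following minimal Python sketch removes whitespace-only text between elements, trims the whitespace around the remaining text, and drops comments. The abstract does not specify the paper's actual transformations, so the function `strip_layout`, the helper `compressed_size`, and the `SAMPLE` document are all hypothetical and only show the general idea. Note that trimming text is itself lossy and would not be safe for mixed-content documents where whitespace is significant.

```python
import xml.etree.ElementTree as ET
import zlib


def strip_layout(xml_text: str) -> str:
    """Hypothetical lossy transformation (not the paper's algorithm):
    drop whitespace-only text nodes and trim the remaining text."""
    # The default ElementTree parser discards comments while parsing.
    root = ET.fromstring(xml_text)
    for elem in root.iter():
        if elem.text is not None:
            elem.text = elem.text.strip() or None
        if elem.tail is not None:
            elem.tail = elem.tail.strip() or None
    return ET.tostring(root, encoding="unicode")


def compressed_size(text: str) -> int:
    """Size in bytes after a conventional lossless compressor (zlib)."""
    return len(zlib.compress(text.encode("utf-8"), 9))


# Hypothetical sample document with indentation and a layout-only comment.
SAMPLE = """<library>
    <!-- layout-only comment -->
    <book id="1">
        <title>  XML Basics  </title>
        <year>2009</year>
    </book>
</library>"""

if __name__ == "__main__":
    compact = strip_layout(SAMPLE)
    print(compact)
    print(len(SAMPLE), "->", len(compact), "characters")
    print(compressed_size(SAMPLE), "->", compressed_size(compact), "bytes after zlib")
```

Running the script prints the compacted document together with its size before and after the transformation, both raw and after zlib, which gives a quick way to check on real data whether such a preprocessing step actually helps a conventional compressor.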
Keywords
XML lossy text compression, compression factor, lossy transformation, preliminary study, lossy compression technique, XML document, lossless compression scheme, considerable reduction, text compression, lossy text compression, redundant text, conventional text compressor, data management, lossless compression, lossy compression