Multiresolution Approach Based On Adaptive Superpixels For Administrative Documents Segmentation Into Color Layers

2015 13th International Conference on Document Analysis and Recognition (ICDAR)(2015)

引用 3|浏览13
暂无评分
摘要
Administrative document images are usually processed in black and white what generates many problems due to the errors related to the binarization. Besides all semantic information provided by the color is lost. Document images have a rich and highly variable content. The presence of false colors and artefacts introduced by the scanning and the compression alter the segmentation of the regions. Problems arise when there is no correspondence between the point clouds which are detected in a color space and the real regions of an image. In order to help the segmentation, we propose the extraction of the main colors of an image as a set of binary layers. Due to the industrial context, our approach has to run unsupervised on a generic dataset of color administrative documents. The originality of this approach is the use of a multiresolution analysis to detect the number of colors automatically. At a low resolution, a set of local regions is obtained thanks to a SLIC-based approach which takes into account the structure of documents and which combines both colorimetric information and spatial information. Then, a merging stage is applied on each resolution separately based on the colors which have been extracted at a lower resolution. This contribution can both feed the traditional process and exploit colorimetric information.
更多
查看译文
关键词
adaptive superpixels,administrative documents segmentation,color layers,administrative document images,binarization,compression,region segmentation,binary layers,main color extraction,color administrative documents,multiresolution analysis,SLIC-based approach,colorimetric information,spatial information,merging stage,colors extraction,simple linear iterative clustering
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要