Page Segmentation for Historical Handwritten Documents Using Fully Convolutional Networks

2017 14th IAPR International Conference on Document Analysis and Recognition (ICDAR)(2017)

引用 41|浏览13
暂无评分
摘要
Page segmentation is a fundamental and challenging task in document image analysis due to the layout diversity. In this work, we propose a pixel-wise segmentation method for historical handwritten documents using fully convolutional network (FCN). The document image is segmented into different regions by classifying pixels into different categories: background, main text body, comments, and decorations. By supervised learning on document images with pixel-wise labels, the FCN can extract discriminative features and perform pixel-wise segmentation accurately. After pixel-wise classification, post-processing steps are taken to reduce noises, correct wrong segmentations and find out overlapping regions. Experimental results on the public dataset DIVA-HisDB containing challenging medieval manuscripts demonstrate the effectiveness and superiority of the proposed method, which yields pixel-level accuracy of above 99%.
更多
查看译文
关键词
page segmentation,layout analysis,fully convolutional network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要