Script-Free Text Line Segmentation Using Interline Space Model for Printed Document Images

Document Analysis and Recognition (2011)

Abstract
This paper proposes a model-based text line segmentation algorithm for machine-printed document images. The model describes the geometric configuration of the interline spaces rather than of the text lines themselves, and the paper defines an objective function whose maximization yields the optimal segmentation. The primary advantage of the interline space model is that it is script-free. The model is also versatile: it handles both horizontally and vertically written documents and infers the reading order. Experiments on document images in Latin, Korean, Chinese, and Japanese scripts support these advantages and show that the method tolerates noise.
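
The abstract does not state the objective function, so the following is only a toy sketch of the general idea of segmenting by interline spaces rather than by text. It assumes a binarized page (1 = ink, 0 = background), uses plain projection profiles as a stand-in for the paper's geometric model, and scores each writing direction by the total thickness of its ink-free bands. The names `interior_gaps`, `segment`, and `PAGE` are hypothetical and do not come from the paper.

```python
def interior_gaps(profile):
    """Return (start, end) pairs of ink-free runs that lie between inked regions.

    Runs touching either end of the profile are page margins, not interline
    spaces, so they are skipped.
    """
    gaps, start = [], None
    for i, ink in enumerate(profile):
        if ink == 0 and start is None:
            start = i                      # a blank run begins
        elif ink != 0 and start is not None:
            if start > 0:                  # ignore the leading margin
                gaps.append((start, i))
            start = None
    return gaps                            # a trailing blank run is never appended


def segment(image):
    """Pick the writing direction whose interline gaps score highest.

    Toy objective (an assumption, not the paper's): total gap thickness.
    Ties fall back to "horizontal".
    """
    row_profile = [sum(row) for row in image]        # blank rows separate horizontal lines
    col_profile = [sum(col) for col in zip(*image)]  # blank columns separate vertical lines
    candidates = {
        "horizontal": interior_gaps(row_profile),
        "vertical": interior_gaps(col_profile),
    }
    score = lambda gaps: sum(end - start for start, end in gaps)
    orientation = max(candidates, key=lambda k: score(candidates[k]))
    return orientation, candidates[orientation]


# A tiny synthetic page: three horizontal text lines separated by two blank rows.
PAGE = [
    [1, 1, 0, 1, 1, 1],
    [1, 0, 1, 1, 0, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 1, 1, 0, 1, 1],
    [0, 0, 0, 0, 0, 0],
    [1, 0, 1, 1, 1, 0],
]

if __name__ == "__main__":
    print(segment(PAGE))
```

Because the sketch looks only at where ink is absent, it never inspects glyph shapes, which is the sense in which a space-based approach can be script-independent. Transposing `PAGE` makes the same code report a vertical layout.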
Keywords
japanese script,latin scripts,optimisation,text line segmentation,korean scripts,optimal solution,reading order,machine-printed document image,various document image,objective function,noise tolerance,proposed interline space model,image segmentation,chinese scripts,japanese scripts,machine printed document image processing,written document processing,interline space,maximization,model based text line segmentation algorithm,printed document images,geometric matching,geometric configuration,script-free text line segmentation,aforementioned advantage,text analysis,interline space model,document image processing,script free text line segmentation,model-based approach,text line,model-based text line segmentation,noise,algorithm design and analysis,pattern analysis