A New Method For Curvilinear Text Line Extraction And Straightening Of Arabic Handwritten Text
INTERNATIONAL ARAB JOURNAL OF INFORMATION TECHNOLOGY (2018)
Abstract
Text line extraction is a critical step in layout analysis, one of the main subtasks of document image analysis. This paper presents a new method for curvilinear text line extraction and straightening in Arabic handwritten documents. The proposed method consists of two distinct steps. First, each text line is extracted using a morphological dilation operation. Second, the extracted text line is straightened in two sub-steps: coarse tuning of the text line orientation based on the Hough transform, then fine tuning based on centroid alignment of the connected components that form the text line. The proposed approach has been tested extensively on samples from the benchmark datasets KFUPM Handwritten Arabic TexT (KHATT) and Arabic Handwriting DataBase (AHDB). Experimental results show that the proposed method is capable of detecting and straightening curvilinear text lines even in challenging Arabic handwritten documents.
Keywords
Document image analysis, Arabic handwriting, text line extraction, Hough transform
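The fine-tuning sub-step named in the abstract, centroid alignment of the connected components of a text line, can be illustrated with a minimal sketch. This is not the authors' implementation: the abstract does not say which reference row the centroids are aligned to, so the median centroid row used here is an assumption, as are the 8-connectivity, the purely vertical shift, and the function names `connected_components` and `align_centroids`. The dilation-based extraction and the Hough-based coarse tuning are not shown.

```python
from collections import deque
from statistics import median


def connected_components(img):
    """Return the 8-connected foreground (1) components as lists of (row, col)."""
    rows, cols = len(img), len(img[0])
    seen = [[False] * cols for _ in range(rows)]
    components = []
    for r in range(rows):
        for c in range(cols):
            if img[r][c] != 1 or seen[r][c]:
                continue
            # Breadth-first flood fill starting from an unvisited foreground pixel.
            queue, pixels = deque([(r, c)]), []
            seen[r][c] = True
            while queue:
                y, x = queue.popleft()
                pixels.append((y, x))
                for dy in (-1, 0, 1):
                    for dx in (-1, 0, 1):
                        ny, nx = y + dy, x + dx
                        if (0 <= ny < rows and 0 <= nx < cols
                                and img[ny][nx] == 1 and not seen[ny][nx]):
                            seen[ny][nx] = True
                            queue.append((ny, nx))
            components.append(pixels)
    return components


def align_centroids(img):
    """Shift each component vertically so its centroid lands on a common row.

    Assumption: the common row is the median of the component centroid rows.
    """
    rows, cols = len(img), len(img[0])
    components = connected_components(img)
    if not components:
        return [row[:] for row in img]
    centroid_rows = [sum(y for y, _ in p) / len(p) for p in components]
    target = median(centroid_rows)
    out = [[0] * cols for _ in range(rows)]
    for pixels, cy in zip(components, centroid_rows):
        shift = round(target - cy)
        for y, x in pixels:
            ny = y + shift
            if 0 <= ny < rows:  # pixels shifted outside the image are dropped
                out[ny][x] = 1
    return out


# Toy "text line": three 2x2 blobs that drift downward from left to right.
toy = [[0] * 9 for _ in range(7)]
for top, left in ((0, 0), (2, 3), (4, 6)):
    for y in (top, top + 1):
        for x in (left, left + 1):
            toy[y][x] = 1

straight = align_centroids(toy)
for row in straight:
    print("".join("#" if v else "." for v in row))
```

On the toy input, the three blobs start at rows 0-1, 2-3, and 4-5, and after alignment they all occupy rows 2-3. A real line image would first need the coarse rotation step, since a vertical shift alone does not correct the slant of individual components.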