Arabic Hand-Written Text-Line Extraction

ICDAR-1(2001)

Abstract: This paper describes a method for text-line extraction. Segmentation of a printed binary document is typically based on horizontal projection analysis followed by regrouping of the connected components. These techniques cannot be used for unconstrained handwritten text, because the data frequently contain undulations and shifts in the baseline, as well as variability in baseline skew and inter-line distance. We therefore consider that the border line between text lines in an unconstrained handwritten document should be a collection of horizontal line segments. From this point of view, we use a method based on partial contour following to detect the separating lines. In the current version of our algorithm, we first perform text slant detection and evaluate the number of text lines using partial projection. We then carry out a partial contour following of every line, first in the direction of the writing and then in the opposite direction. After this processing, adjacent lines are separated. In the experimental section, we describe the application of the algorithm to text-line extraction. The image database contains about one hundred handwritten Arabic texts written by different writers. Results on the assignment of diacritical points are also reported.
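The partial projection step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the strip width, the ink threshold, and the majority vote over strips are all assumptions made for illustration. The idea shown is only that horizontal projections computed per vertical strip can still separate lines whose baseline shifts across the page, where a single global projection would merge them.

```python
from collections import Counter


def partial_projections(image, strip_width):
    """Horizontal projection profile of each vertical strip.

    image: list of rows, each a list of 0/1 values (1 = ink pixel).
    Returns one profile per strip; profile[y] is the number of ink
    pixels found in row y inside that strip.
    (Hypothetical helper, not taken from the paper.)
    """
    if strip_width <= 0:
        raise ValueError("strip_width must be positive")
    width = len(image[0]) if image else 0
    return [
        [sum(row[x0:x0 + strip_width]) for row in image]
        for x0 in range(0, width, strip_width)
    ]


def count_text_bands(profile, min_ink=1):
    """Count maximal runs of consecutive rows with at least min_ink pixels."""
    bands = 0
    inside = False
    for value in profile:
        if value >= min_ink:
            if not inside:
                bands += 1
                inside = True
        else:
            inside = False
    return bands


def estimate_line_count(image, strip_width, min_ink=1):
    """Estimate the number of text lines as the most frequent band count
    over all non-empty strips (an assumed voting rule; ties go to the
    count seen first, scanning strips from left to right)."""
    counts = [
        count_text_bands(profile, min_ink)
        for profile in partial_projections(image, strip_width)
    ]
    counts = [c for c in counts if c > 0]
    if not counts:
        return 0
    return Counter(counts).most_common(1)[0][0]
```

For example, on a toy 6 x 4 image in which two lines drop by one row in the right half, `estimate_line_count(image, 2)` sees two bands in each strip, whereas a single strip covering the full width (`strip_width=4`) sees one merged band.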
Keywords
adjacent line,partial contour,border line,arabic hand-written text-line extraction,handwritten arabic text,horizontal line segment,handwritten unconstrained document,text slant detection,handwritten unconstrained text,text line,text line number evaluation,image analysis,autocorrelation,writing,data mining,histograms,strips,connected component,feature extraction,image segmentation