KHATT: Arabic Offline Handwritten Text Database

ICFHR(2012)

引用 79|浏览23
暂无评分
摘要
In this paper, we report our comprehensive Arabic offline Handwritten Text database (KHATT) after completion of the collection of 1000 handwritten forms written by 1000 writers from different countries. It is composed of an image database containing images of the written text at 200, 300, and 600 dpi resolutions, a manually verified ground truth database that contains meta-data describing the written text at the page, paragraph, and line levels. A formal verification procedure is implemented to align the handwritten text with its ground truth at the form, paragraph and line levels. Tools to extract paragraphs from pages and segment paragraphs into lines are developed. Preliminary experiments on Arabic handwritten text recognition are conducted using sample data from the database and the results are reported. The database will be made freely available to researchers world-wide for research in various handwritten-related problems such as text recognition, writer identification and verification, etc.
更多
查看译文
关键词
line level,comprehensive arabic,image database,handwritten text,ground truth database,offline handwritten text database,text recognition,handwritten form,written text,arabic offline handwritten text,arabic handwritten text recognition,formal verification,hmm,natural languages,handwriting recognition,feature extraction,data verification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要