Ancient printed documents indexation: a new approach

ICAPR'05: Proceedings of the Third International Conference on Advances in Pattern Recognition, Volume Part I (2005)

Abstract
Based on a study of the specific characteristics of historical printed books and of the main error sources in classical page layout analysis methods, this paper presents a new way to index ancient printed documents. We have developed an approach based on extracting and quantifying the various orientations present in printed document images. The documents are first split into homogeneous areas, in which we analyze the significant orientations with a directional rose. Each kind of information (textual or graphical) is identified and labelled according to its orientation distribution. This choice of characterization allows us to separate textual regions from graphical ones while minimizing a priori knowledge. We evaluate our proposal through document image retrieval based on layout extraction criteria; the approach can also be used to precisely localize graphical parts in various types of documents. The system has been tested successfully on several ancient printed books of the Renaissance.
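The abstract does not say how the directional rose is computed, so the following is only a minimal sketch of the general idea, assuming a magnitude-weighted histogram of local edge orientations as a stand-in. The function names `directional_rose` and `dominant_share`, the bin count, and the use of central-difference gradients are all assumptions made for illustration, not the authors' method.

```python
import math

def directional_rose(block, bins=8):
    """Normalized, magnitude-weighted histogram of edge orientations.

    `block` is a 2D list of grey levels for one homogeneous area.
    Orientations are folded into [0, 180) degrees; bin k is centred on
    k * 180 / bins degrees. (Hypothetical helper, not from the paper.)
    """
    rose = [0.0] * bins
    width = 180.0 / bins
    rows, cols = len(block), len(block[0])
    for y in range(1, rows - 1):
        for x in range(1, cols - 1):
            # Central-difference gradient at the interior pixel (x, y).
            gx = block[y][x + 1] - block[y][x - 1]
            gy = block[y + 1][x] - block[y - 1][x]
            mag = math.hypot(gx, gy)
            if mag == 0:
                continue
            # An edge runs perpendicular to the gradient direction.
            angle = (math.degrees(math.atan2(gy, gx)) + 90.0) % 180.0
            rose[int(angle / width + 0.5) % bins] += mag
    total = sum(rose)
    return [v / total for v in rose] if total else rose

def dominant_share(rose):
    """Fraction of the orientation energy held by the strongest bin."""
    return max(rose) if rose else 0.0

# Synthetic area with horizontal stripes, loosely imitating text lines.
stripes = [[255 if (y // 2) % 2 else 0 for _ in range(8)] for y in range(8)]
rose = directional_rose(stripes)
print(rose.index(max(rose)), dominant_share(rose))
```

On this synthetic striped area all of the orientation energy falls into the bin centred on 0 degrees (horizontal), whereas an area with edges in many directions would spread its energy across bins. How such distributions are actually mapped to textual or graphical labels is not specified in the abstract.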
Keywords
ancient printed book, ancient printed document, historical printed book, printed document image, localize graphical part, document image retrieval, layout extraction criterion, page layout analysis, textual region, various orientation, ancient printed documents indexation, new approach