Information Extraction from Handwritten Tables in Historical Documents

DOCUMENT ANALYSIS SYSTEMS, DAS 2022(2022)

引用 2|浏览9
暂无评分
摘要
Recently, significant advances have been made in Document Understanding in structured historical documents. However, not much research has been done in information extraction from handwritten structured historical documents. In this paper, we compare two Machine Learning approaches and another approach that is based on heuristic rules to extract information in historical pre-printed forms with handwritten information. We analyze how each approach performs at each step of the extraction process. The proposed approaches improve the heuristic-rule baseline by up to 0.14 F-measure points throughout the information extraction pipeline.
更多
查看译文
关键词
Structured handwritten documents, Information extraction, Neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要