Error Correcting HTR’ed Byzantine Text

John Pavlopoulos, Vasiliki Kougia, Paraskevi Platanou, Stepan Shabalin, Konstantina Liagkou, Emmanouil Papadatos, Holger Essler, Jean-Baptiste Camps, Franz Fischer

Research Square (Research Square)(2023)

引用 0|浏览1
暂无评分
摘要
Abstract The automated correction of errors in the Handwritten Text Recognition (HTR) output can be challenging and is far from solved. To address this challenge, we set up a shared task on AIcrowd that received 271 submissions, of which very few succeed. This paper presents the datasets, the best methods, and experimental analysis in error-correcting HTRed manuscripts and papyri in Byzantine Greek, the language that followed Classical and preceded Modern Greek. By using recognised and transcribed data from seven centuries, the two best-performing methods are compared, one based on a neural encoded-decoder architecture and the other based on linguistic knowledge. We show that the recognition error rate can be reduced by both, up to 2.5 points at the level of characters and up to 15 at the level of words, also highlighting the weak and strong points of each.
更多
查看译文
关键词
text
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要