An iterative multimodal framework for the transcription of handwritten historical documents

Pattern Recognition Letters(2014)

引用 20|浏览7
暂无评分
摘要
The transcription of historical documents is one of the most interesting tasks in which Handwritten Text Recognition can be applied, due to its interest in humanities research. One alternative for transcribing the ancient manuscripts is the use of speech dictation by using Automatic Speech Recognition techniques. In the two alternatives similar models (Hidden Markov Models and n-grams) and decoding processes (Viterbi decoding) are employed, which allows a possible combination of the two modalities with little difficulties. In this work, we explore the possibility of using recognition results of one modality to restrict the decoding process of the other modality, and apply this process iteratively. Results of these multimodal iterative alternatives are significantly better than the baseline uni-modal systems and better than the non-iterative alternatives.
更多
查看译文
关键词
hidden markov models,ancient manuscript,baseline uni-modal system,alternatives similar model,automatic speech recognition technique,viterbi decoding,process iteratively,iterative multimodal framework,historical document,handwritten text recognition,decoding process,handwritten historical document
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要