Towards audio-video based handwritten mathematical content recognition in classroom videos

Communications, Computers and Signal Processing (2011)

Abstract
Recognizing handwritten mathematical content in classroom videos poses a range of interesting challenges. In this paper, we focus on improving character recognition accuracy in such videos by combining video-based and audio-based text recognizers. We propose a two-step assembly consisting of a video text recognizer (VTR) as the primary character recognizer and an audio text recognizer (ATR) that disambiguates the output of the VTR when needed. We propose techniques for (1) detecting ambiguity in the output of the VTR, so that combination with the ATR is triggered only for ambiguous characters, (2) synchronizing the output of the two recognizers to enable combination, and (3) combining the options generated by the two recognizers using measurement-based and rank-based methods. We have implemented the system using an open-source character recognizer and a commercially available phonetic word-spotter. Through experiments on video recorded in a classroom-like environment, we demonstrate the improvement in character recognition accuracy that our approach can achieve.
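
The abstract does not specify how ambiguity is detected or how the two recognizers' options are fused, so the following is only a minimal sketch of the general idea under stated assumptions. The score-margin test, the weighted-sum rule (standing in for a measurement-based method), the Borda count (standing in for a rank-based method), and all function names and parameters are hypothetical illustrations, not the paper's method. Synchronization of the two recognizers is omitted: the sketch assumes the VTR and ATR candidates already refer to the same character.

```python
from typing import Dict

# Candidate character -> confidence score in [0, 1] (assumed representation).
Scores = Dict[str, float]


def is_ambiguous(vtr: Scores, margin: float = 0.2) -> bool:
    """Hypothetical ambiguity test: the VTR's top two scores are closer than `margin`."""
    ranked = sorted(vtr.values(), reverse=True)
    return len(ranked) > 1 and (ranked[0] - ranked[1]) < margin


def combine_measurement(vtr: Scores, atr: Scores, w_vtr: float = 0.5) -> str:
    """Measurement-level fusion (assumed form): weighted sum of the two scores."""
    candidates = sorted(set(vtr) | set(atr))
    fused = {
        c: w_vtr * vtr.get(c, 0.0) + (1.0 - w_vtr) * atr.get(c, 0.0)
        for c in candidates
    }
    return max(candidates, key=fused.get)


def combine_rank(vtr: Scores, atr: Scores) -> str:
    """Rank-level fusion (assumed form): Borda count over both candidate rankings."""
    candidates = sorted(set(vtr) | set(atr))
    n = len(candidates)
    points = {c: 0 for c in candidates}
    for scores in (vtr, atr):
        ordered = sorted(candidates, key=lambda c: scores.get(c, 0.0), reverse=True)
        for rank, c in enumerate(ordered):
            points[c] += n - 1 - rank  # best rank earns the most points
    # Ties resolve to the first candidate in sorted order.
    return max(candidates, key=points.get)


def recognize(vtr: Scores, atr: Scores, margin: float = 0.2) -> str:
    """Use the VTR alone unless its output is ambiguous; then consult the ATR."""
    if not is_ambiguous(vtr, margin):
        return max(sorted(vtr), key=vtr.get)
    return combine_measurement(vtr, atr)
```

In this sketch the ATR is consulted only when the VTR's two best candidates are nearly tied, for example a handwritten "2" versus "z", which mirrors the triggering behavior described in the abstract.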
Keywords
computer aided instruction, handwritten character recognition, text analysis, video signal processing, ATR, VTR, audio text recognizer, audio-video based handwritten mathematical content recognition, character recognition, classroom videos, open source implementation, primary character recognizer, text recognizers, video text recognizer