Text Alignment from Bimodal Mathematical Expression Sources.

ICFHR (2014)

Abstract
In this paper we propose a new approach to merging mathematical expression recognition results from the handwriting and speech modalities. A bimodal description of mathematical expressions takes advantage of the complementarity of the two signals and can disambiguate situations where a single modality would not be clear enough. To combine the signals from both modalities, we propose to represent them in the same space, as textual descriptions. First, from the handwriting signal, we generate the N-best mathematical expressions, each of which is then translated into several possible strings. From the audio signal, an automatic speech recognition system provides a transcript, which is also available as a string. A string comparison algorithm is then applied to select the best mathematical expressions. The bimodal system is evaluated on real bimodal data from the HAMEX dataset, and the results are compared with those of a single-modality (handwriting-only) system.
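
The selection step described in the abstract can be illustrated with a minimal sketch. The abstract does not say which string comparison algorithm is used, so the word-level Levenshtein distance below is an assumption chosen for illustration, not the authors' method. The function names, the candidate dictionary layout, and the example verbalizations are all hypothetical.

```python
from typing import Dict, List, Optional, Tuple


def word_edit_distance(a: List[str], b: List[str]) -> int:
    """Levenshtein distance between two word sequences (unit costs)."""
    prev = list(range(len(b) + 1))
    for i, word_a in enumerate(a, start=1):
        curr = [i]
        for j, word_b in enumerate(b, start=1):
            cost = 0 if word_a == word_b else 1
            curr.append(min(prev[j] + 1,          # delete word_a
                            curr[j - 1] + 1,      # insert word_b
                            prev[j - 1] + cost))  # match or substitute
        prev = curr
    return prev[-1]


def select_best_expression(
    candidates: Dict[str, List[str]], transcript: str
) -> Tuple[Optional[str], Optional[int]]:
    """Return the candidate whose closest verbalization best matches the transcript.

    `candidates` maps each N-best expression (hypothetical layout) to the
    strings it could be read aloud as. Returns (None, None) if it is empty.
    """
    spoken = transcript.lower().split()
    best_expr: Optional[str] = None
    best_dist: Optional[int] = None
    for expr, verbalizations in candidates.items():
        for text in verbalizations:
            dist = word_edit_distance(text.lower().split(), spoken)
            if best_dist is None or dist < best_dist:
                best_expr, best_dist = expr, dist
    return best_expr, best_dist


# Hypothetical N-best list from the handwriting recognizer.
candidates = {
    "x^2 + 1": ["x squared plus one", "x to the power two plus one"],
    "x_2 + 1": ["x sub two plus one", "x two plus one"],
}
best, distance = select_best_expression(candidates, "x squared plus one")
print(best, distance)
```

Here the speech transcript resolves the superscript/subscript ambiguity that the handwriting signal alone leaves open. A real system would likely also weight the comparison by the recognizers' confidence scores, which this sketch ignores.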
Keywords
handwritten character recognition, speech recognition, string matching, text analysis, HAMEX dataset, MER, N-best mathematical expressions, audio signal, automatic speech recognition system, bimodal mathematical expression sources, handwriting based mathematical expression recognition, string comparison algorithm, text alignment