A Dynamic Term Discovery Strategy For Automatic Speech Recognizers With Evolving Dictionaries

EXPERT SYSTEMS WITH APPLICATIONS(2021)

引用 3|浏览24
暂无评分
摘要
We present a dynamic term discovery (TD) strategy that is capable of automatically adapting the dictionaries managed by ASR systems to the input speech, in terms of lexicon and language model (LM). The adaptation tries to solve the problem of out-of-vocabulary (OOV) words that are likely to appear in most realistic scenarios and uses external knowledge sources for extending the capabilities of the LMs present in the systems. The handling of the OOV words is made by existing TD strategies that are able to detect and solve OOVs, plus special word selection processes that decide which words are to be added or deleted, so as to update the vocabulary constantly. We also propose a mathematical model for controlling the vocabulary size of the ASR system as well as the word addition and deletion rates that are involved. Then, the update of the overall LM is based on an interpolation scheme with smaller LMs built with external language knowledge that depends on the current speech and the words to be added at each time. We designed a realistic experimental framework for evaluating the strategy, employing ASR systems with moderated vocabulary sizes and a couple of test speech corpora with very distinct features. The results show that the dynamic TD strategy is able to offer a general positive tendency in WER improvement over systems without it, being able indeed to reach a significant difference after few hours of speech processing.
更多
查看译文
关键词
Dynamic term discovery, Out-of-vocabulary, Language model adaptation, Automatic speech recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要