Learning Phrase Patterns For Asr Name Error Detection Using Semantic Similarity

16TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION (INTERSPEECH 2015), VOLS 1-5(2015)

引用 23|浏览20
暂无评分
摘要
This paper addresses the problem of detecting name errors in automatic speech recognition (ASR) output, when available ASR output training data is sparse and covers only a limited subset of scenarios from a broad-domain task. We reduce the dimensionality of the lexical feature set using a related task (sentence-level name detection) and improve feature coverage by expanding single words to classes using semantic similarity measures derived from both WordNet and neural network word embeddings. Phrase patterns are learned over the selected word features using a frequency-based pattern selection algorithm. In experiments on English dialogs, we find that adding sentence level features performs better than using local n-gram context. Automatically-learned seed features perform better than hand-crafted patterns, and their class expansions generalize better. Finally, the embedding-based class expansions yield better results than the corresponding WordNet-based configurations.
更多
查看译文
关键词
speech recognition, name error detection, word embeddings, word class features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要