Improving ASR Error Correction Using N-Best Hypotheses

2021 IEEE Automatic Speech Recognition and Understanding Workshop (ASRU)(2021)

引用 2|浏览2
暂无评分
摘要
In the field of Automatic Speech Recognition (ASR), Grammatical Error Correction (GEC) can be used to correct errors in recognition results of ASR systems and whereby it further reduces the word error rate (WER). Most conventional GEC approaches make correction merely based on a single recognition hypothesis from the ASR system, i.e., the best path in the decoding lattice. However, with such limited information, it's common that GEC fails to detect errors in a sentence or even makes wrong corrections. In this paper we propose two novel GEC approaches utilizing N-best hypotheses instead, that's, N-best plain approach and N-best alignment approach. In this way, our novel approaches can access more information than conventional approaches with 1-best-only hypothesis, and thus more accurate corrections can be made. Experiments on an industrial-scale internal dataset and the open-source dataset AISHELL-1 demonstrate the effectiveness of N-best approaches: 1) both N-best plain and N-best alignment approaches achieve remarkable WER reduction. N-best alignment approach achieves the best performance on both datasets, 12.54% WERR on an internal dataset and 27.71% WERR on AISHELL-1. 2) They also outperform the baseline 1-best GEC approach by a large margin.
更多
查看译文
关键词
speech recognition,grammatical error correction,End-to-End,Transformer,language modeling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要