Toward Human Parity in Conversational Speech Recognition.
IEEE/ACM Transactions on Audio, Speech, and Language Processing(2017)
摘要
Conversational speech recognition has served as a flagship speech recognition task since the release of the Switchboard corpus in the 1990s. In this paper, we measure a human error rate on the widely used NIST 2000 test set for commercial bulk transcription. The error rate of professional transcribers is 5.9% for the Switchboard portion of the data, in which newly acquainted pairs of people discus...
更多查看译文
关键词
Speech recognition,Error analysis,Spatial analysis,Recurrent neural networks,NIST,Acoustics
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络