Multichannel Audio Front-End for Far-Field Automatic Speech Recognition

2018 26th European Signal Processing Conference (EUSIPCO), 2018

Abstract
Far-field automatic speech recognition (ASR) is a key enabling technology that allows untethered, natural voice interaction between users and the Amazon Echo family of products. A key component in realizing far-field ASR on these products is the suite of audio front-end (AFE) algorithms, which helps mitigate acoustic environmental challenges and thereby improves ASR performance. In this paper, we discuss the key algorithms within the AFE and provide insights into how they help mitigate the various acoustic challenges of far-field processing. We also provide insights into the audio algorithm architecture adopted for the AFE, and we discuss ongoing and future research.
Keywords
Beamforming, far-field, AFE, deep neural networks, ASR, Amazon Echo