Speech enhancement using beamforming and non negative matrix factorization for robust speech recognition in the CHiME-3 challenge

2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU)(2015)

引用 14|浏览16
暂无评分
摘要
In this paper we present our contribution to the third CHiME challenge on speech separation and recognition for noisy multi-channel recordings. The use-case of the challenge consists in single speaker utterances recorded in highly non-stationary noisy environments using a 6-microphone array mounted on a tablet computer. The front-end of our system is performing speech enhancement by cascading a cross-correlation-based channel selection, Signal Dependent MVDR beamforming and online source separation based on sparse NMF. The back-end module is a state-of-the-art speech recognition system with DNN acoustic models trained on fMLLR features and a RNN Language Model. Our system reaches an overall WER of 11.94% on real test recordings, achieving a relative improvement of 65% compared to the baseline system.
更多
查看译文
关键词
Speech Enhancement,Automatic Speech Recognition,MVDR Beamforming,Non Negative Matrix Factorization,CHiME challenge
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要