Improvements To The Ibm Speech Activity Detection System For The Darpa Rats Program

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2015)

引用 64|浏览35
暂无评分
摘要
In this paper we describe improvements to the IBM speech activity detection (SAD) system for the third phase of the DARPA RATS program. The progress during this final phase comes from jointly training convolutional and regular deep neural networks with rich time-frequency representations of speech. With these additions, the phase 3 system reduces the equal error rate (EER) significantly on both of the program's development sets (relative improvements of 20% on dev1 and 7% on dev2) compared to an earlier phase 2 system. For the final program evaluation, the newly developed system also performs well past the program target of 3% P-miss at 1% P-fa with a performance of 1.2% P-miss at 1% P-fa and 0.3% P-fa at 3% P-miss.
更多
查看译文
关键词
Speech activity detection,acoustic features,robust speech recognition,deep neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要