A Speech Enhancement Front-End for Intent Classification in Noisy Environments

29th European Signal Processing Conference (EUSIPCO 2021)

Abstract
Several neural time-domain approaches to speech denoising and speech separation have recently been investigated in the literature, considerably advancing the state of the art in the field. Among these methods, Wave-U-Net is particularly appealing because it models phase information jointly with the rest of the signal and can handle large temporal contexts. In this paper, we present an evolution of the original Wave-U-Net architecture: a deeper model whose dilation rate increases exponentially from layer to layer in the downsampling blocks. Experiments on a noise-contaminated version of LibriSpeech show that the proposed architecture outperforms the original one in terms of intelligibility metrics. In addition, we evaluate the proposed enhancement scheme on a simple intent classification task based on a noisy version of the Fluent Speech Commands dataset. Results show that, here too, the proposed method outperforms the baseline and substantially improves classification accuracy in noisy conditions.
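To illustrate why an exponentially increasing dilation rate helps with large temporal contexts, the following minimal sketch compares the receptive field of a stack of 1-D convolutions with and without dilation. It is not the authors' implementation: the function names, the kernel size of 3, the four-layer depth, and the base of 2 are assumptions chosen for illustration, since the abstract does not state these values.

```python
def dilation_schedule(num_layers, base=2):
    """Dilation rate per layer, growing exponentially: 1, base, base**2, ...

    The base of 2 is an assumed value; the abstract only says "exponentially".
    """
    return [base ** i for i in range(num_layers)]


def receptive_field(kernel_size, dilations, stride=1):
    """Receptive field (in input samples) of stacked 1-D convolutions.

    Each layer adds (kernel_size - 1) * dilation samples, scaled by the
    cumulative stride of all preceding layers.
    """
    rf = 1
    jump = 1  # spacing, in input samples, between adjacent layer inputs
    for d in dilations:
        rf += (kernel_size - 1) * d * jump
        jump *= stride
    return rf


if __name__ == "__main__":
    layers = 4       # assumed depth, for illustration only
    kernel_size = 3  # assumed kernel size, for illustration only

    plain = [1] * layers
    dilated = dilation_schedule(layers)

    print("dilations:", dilated)
    print("receptive field without dilation:", receptive_field(kernel_size, plain))
    print("receptive field with dilation:   ", receptive_field(kernel_size, dilated))
```

Under these assumed values, the dilated stack covers a wider span of input samples than the undilated one with the same number of layers and parameters, which is the usual motivation for dilated convolutions in time-domain models.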
Keywords
Intent classification, Speech Enhancement, Deep Learning