Transient-based speech transmission index for predicting intelligibility in nonlinear speech enhancement processors

Acoustics, Speech and Signal Processing(2012)

引用 5|浏览5
暂无评分
摘要
A new speech intelligibility metric is proposed for the assessment of speech enhancement processors. These processors usually affect the fine structure in speech that is of fundamental importance to speech intelligibility. Classical metrics analyze the entire signal and thereby generally overestimate intelligibility. The measure presented here, therefore, isolates speech-transients by a cepstral smoothing technique and subsequently calculates speech intelligibility using an efficient version of the speech transmission index. By means of a genetic optimization of adjustable parameters, the proposed transition-based speech transmission index (TB STI) is adapted to the subjective data of linearly and nonlinearly processed speech. The method was assessed on untrained subjective data and showed a considerable improvement over other well-established measures.
更多
查看译文
关键词
genetic algorithms,speech enhancement,TB STI,cepstral smoothing technique,genetic optimization,intelligibility prediction,nonlinear speech enhancement processors,nonlinearly processed speech,speech intelligibility metric,transient-based speech transmission index,transition-based speech transmission index,untrained subjective data,Cepstrum,intelligibility,speech enhancement,speech perception,transients
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要