Simple and artefact-free spectral modifications for enhancing the intelligibility of casual speech

Acoustics, Speech and Signal Processing (2014)

Abstract
In this paper, the problem of modifying casual speech to reach the intelligibility level of clear speech is addressed. Unlike other studies, the modifications applied to casual speech in this work take both intelligibility and speech quality into account. To achieve this, the authors focus on human-like modifications inspired by clear speech. An acoustic analysis performed on clear and casual speech reveals energy differences in specific frequency bands between the two speaking styles. A simple method is then used to boost these frequency regions in casual speech. The proposed method, called mix-filtering, uses a multi-band filtering scheme to isolate the information in these frequency bands and then adds this information to the original signal. The method is compared, in terms of intelligibility and quality, with unmodified casual speech and with a spectral modification technique known to yield high intelligibility, namely Spectral Shaping and Dynamic Range Compression (SSDRC). Two different objective measures that are highly correlated with subjective intelligibility scores are used to estimate intelligibility, whereas quality is evaluated with preference listening tests. Results show that the mix-filtering technique increases the intelligibility of casual speech while maintaining its quality. SSDRC, on the other hand, achieves higher intelligibility but significantly degrades the quality of casual speech.
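The "isolate selected bands, then add them back to the original signal" idea can be illustrated with a minimal sketch. The abstract does not give the filter design, the band edges, or the mixing gain, so everything below is an assumption: the band-pass stage is a standard second-order (biquad) filter from the widely used Audio EQ Cookbook formulas, and the function names (`bandpass_coeffs`, `biquad`, `mix_filter`), the band centres, the Q values, and the gain are hypothetical placeholders rather than values from the paper.

```python
import math


def bandpass_coeffs(center_hz, q, fs):
    """Biquad band-pass (unity gain at center_hz), Audio EQ Cookbook form.

    Assumption: the paper's actual filters are not described in the abstract.
    """
    w0 = 2.0 * math.pi * center_hz / fs
    alpha = math.sin(w0) / (2.0 * q)
    a0 = 1.0 + alpha
    b = (alpha / a0, 0.0, -alpha / a0)
    a = (1.0, -2.0 * math.cos(w0) / a0, (1.0 - alpha) / a0)
    return b, a


def biquad(x, b, a):
    """Apply a normalized biquad (a[0] == 1) in direct form I."""
    y = []
    x1 = x2 = y1 = y2 = 0.0
    for xn in x:
        yn = b[0] * xn + b[1] * x1 + b[2] * x2 - a[1] * y1 - a[2] * y2
        x2, x1 = x1, xn
        y2, y1 = y1, yn
        y.append(yn)
    return y


def mix_filter(x, fs, bands, gain=1.0):
    """Add band-passed copies of x back onto x.

    bands: list of (center_hz, q) pairs; hypothetical values, not from the paper.
    gain:  hypothetical mixing weight applied to each isolated band.
    """
    out = list(x)
    for center_hz, q in bands:
        b, a = bandpass_coeffs(center_hz, q, fs)
        band = biquad(x, b, a)
        out = [o + gain * v for o, v in zip(out, band)]
    return out


def rms(x):
    """Root-mean-square level of a sequence of samples."""
    return math.sqrt(sum(v * v for v in x) / len(x))


if __name__ == "__main__":
    fs = 16000
    # Placeholder test signal: a pure tone inside the boosted band.
    tone = [math.sin(2.0 * math.pi * 2000.0 * i / fs) for i in range(8000)]
    boosted = mix_filter(tone, fs, bands=[(2000.0, 2.0)], gain=1.0)
    print("level ratio (boosted / original):", rms(boosted) / rms(tone))
```

Because the original signal is passed through unchanged and only band-limited copies are added, components outside the chosen bands are left almost untouched. This is one plausible reading of why such a scheme could preserve quality, but it is not a claim about the authors' implementation.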
Keywords
filtering theory, speech coding, speech enhancement, SSDRC, acoustic analysis, artefact-free spectral modifications, casual speech intelligibility enhancement, clear speech, mix-filtering technique, multiband filtering scheme, spectral shaping and dynamic range compression, speech quality, casual speech, intelligibility, spectral modifications