The Effects Of Whispered Speech On State-Of-The-Art Voice Based Biometrics Systems

2015 IEEE 28th Canadian Conference on Electrical and Computer Engineering (CCECE)(2015)

引用 35|浏览33
暂无评分
摘要
In this paper, automatic speaker verification using whispered speech is explored. In the past, whispered speech has been shown to convey relevant speaker identity and gender information, nevertheless it is not clear how to efficiently use this information in speech-based biometric systems. This study compares the performance of three different speaker verification systems trained and tested under different scenarios and with two different feature representations. First, we show the benefits of using AM-FM based features as well as their effectiveness for i-vectors extraction. Second, for the classical mel-frequency cepstral coefficient (MFCC) features we show that gains of up to 40% could be achieved with the fusion of traditional Gaussian mixture model (GMM) based systems and more recent i-vector based ones, relative to using either system alone for normal speech. Additionally, for MFCC, fusion schemes were shown to be more robust to addition of whispered speech data during training or enrollment. Overall, AM-FM based features were shown to be more robust to varying training/testing conditions and to improve speaker verification performance for both normal and whispered speech by using the GMM based system alone.
更多
查看译文
关键词
Whispered speech,i-vectors,speaker verification,GMM,modulation features
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要