On Using Voice Source Measures In Automatic Gender Classification Of Children'S Speech

11TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2010 (INTERSPEECH 2010), VOLS 1-2(2010)

引用 31|浏览33
暂无评分
摘要
Acoustic characteristics of speech signals differ with gender due to physiological differences of the glottis and the vocal tract. Previous research [1] showed that adding the voice-source related measures H-1* - H-2* and H-1* - A(3)* improved gender classification accuracy compared to using only the fundamental frequency (F-0) and formant frequencies. H-i* refers to the i-th source spectral harmonic magnitude, and A(i)* refers to the magnitude of the source spectrum at the i-th formant. In this paper, three other voice source related measures: CPP, HNR and H-2* - H-4* are used in gender classification of children's voices. CPP refers to the Cepstral Peak Prominence [2], HNR refers to the harmonic-to-noise ratio [3], and H-2* - H-4* refers to the difference between the 2nd and the 4th source spectral harmonic magnitudes. Results show that using these three features improves gender classification accuracy compared with [1].
更多
查看译文
关键词
gender classification,gender identification,voice source
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要