Contribution of the glottal flow residual in affect-related voice transformation

Zihan Wang, Christer Gobl

Conference of the International Speech Communication Association (INTERSPEECH) (2022)

Abstract
This paper explores the contribution of the glottal flow residual in affect-related voice transformation. This signal, which is defined as the difference between the output of the inverse filter estimating the glottal flow signal and the modelled source signal, was analysed using multiple regression analysis. Results show that the strength of the residual varies as a function of the source parameters and this variation is frequency dependent: low frequency energy in the residual is mainly determined by the glottal excitation strength, whereas mid to high frequencies are more influenced by the glottal pulse shape. A method for modelling the residual is presented, which enables modifications based on the changes in source parameters used for voice transformation. This method makes it possible to use the residual as part of the voice source signal when transforming the voice quality in expressive speech synthesis. The result of a listening test, involving the transformation of a neutral voice to an angry or a sad voice, shows that including the glottal flow residual can improve the perceived naturalness of the synthesis. However, the fact that the transformed utterances are still relatively degraded indicates that other factors also need to be considered.
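The definition of the residual given in the abstract can be illustrated with a minimal sketch. It assumes the inverse-filter output and the modelled source signal are already available as equal-length, time-aligned arrays. The function names, the band edges and the FFT-based energy measure are illustrative assumptions and are not taken from the paper.

```python
import numpy as np


def glottal_flow_residual(inverse_filtered, modelled_source):
    """Residual = inverse-filter output minus modelled source signal.

    Both inputs are assumed (hypothetically) to be time-aligned and of
    equal length; the paper's own alignment procedure is not shown here.
    """
    inverse_filtered = np.asarray(inverse_filtered, dtype=float)
    modelled_source = np.asarray(modelled_source, dtype=float)
    if inverse_filtered.shape != modelled_source.shape:
        raise ValueError("signals must have the same shape")
    return inverse_filtered - modelled_source


def band_energy_db(signal, sample_rate, f_lo, f_hi):
    """Energy in dB of `signal` within [f_lo, f_hi) Hz (illustrative measure)."""
    spectrum = np.fft.rfft(signal)
    freqs = np.fft.rfftfreq(len(signal), d=1.0 / sample_rate)
    mask = (freqs >= f_lo) & (freqs < f_hi)
    energy = np.sum(np.abs(spectrum[mask]) ** 2)
    # Small constant avoids log(0) for empty or silent bands.
    return 10.0 * np.log10(energy + 1e-12)


# Toy usage with synthetic signals (not real speech data).
sample_rate = 16000
t = np.arange(1600) / sample_rate
modelled = np.sin(2 * np.pi * 100 * t)
inverse_filtered = modelled + 0.1 * np.sin(2 * np.pi * 3000 * t)

residual = glottal_flow_residual(inverse_filtered, modelled)
low_band = band_energy_db(residual, sample_rate, 0, 1000)
mid_high_band = band_energy_db(residual, sample_rate, 2000, 4000)
```

Band energies computed this way, per frame or per glottal cycle, are the kind of quantity that could serve as the dependent variable in a multiple regression against source parameters. The regression setup actually used in the paper is not specified in the abstract.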
Keywords
glottal flow residual, voice, transformation, affect-related