An improved method for voice conversion based on Gaussian mixture model

ICCASM), 2010 International Conference(2010)

引用 5|浏览3
暂无评分
摘要
Voice conversion is a technology by modifying the source personality to mock the voice of target. This technology has a wide prospect and potential technical value both in technical field and entertainment, such as text to speech (TTS) and toys. This paper develops an improved voice conversion method. For the voiced frames, the conversion is implemented by Gaussian mixture model (GMM) based on speech transformation and representation using adaptive interpolation of weighted spectral contour (STRAIGHT) algorithm. For the unvoiced frames, the envelope is stretched or compressed according to the ratio of the vocal tract length (VTL) of the source and the target. The subjective experiment shows that the proposed method indeed improve the quality of the converted voice with the introduction of VTL.
更多
查看译文
关键词
voice conversion,vocal tract length (vtl),gmm,straight,speech processing,vtl,speech transformation,spectral analysis,weighted spectral contour algorithm,straight algorithm,speech representation,voiced frame,gaussian processes,vocal tract length,gaussian mixture model,text to speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要