Parametric representation of speech employing multi-component AFM signal model

Avinash Shrikant Hood, Ram Bilas Pachori, Varuna Kumar Reddy, Pradip Sircar

International Journal of Speech Technology (2015)

Abstract
In this paper, we propose a parametric representation of speech signals that employs a novel multi-component amplitude and frequency modulated (AFM) sinusoidal signal model. The Fourier–Bessel (FB) series expansion is used to separate the multi-component speech signal into a set of mono-component signals. It is shown that the first (low-frequency) component can be modeled with a single set of parameters over the complete signal length. Because speech is a non-stationary signal, the other components must be segmented before the AFM signal model can be applied. We propose modeling the second, third, and higher components with the AFM model using time-varying parameters. The signal is therefore modeled in segments, with the segment length chosen so that the AFM signal model is admissible. The Itakura–Saito distance and the root mean square log-spectral measure are used to quantify the distortion between the actual and modeled speech signals. Simulation results demonstrate the suitability of the AFM signal model for speech signal representation.
Keywords
Speech signal modeling, Fourier–Bessel series expansion, Amplitude and frequency modulated signal model, Non-stationary signal analysis
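
The component-separation step mentioned in the abstract can be illustrated with a minimal sketch of a zero-order FB series expansion. This is not the authors' implementation: the function names `fb_coefficients` and `reconstruct_band`, the choice of interval length equal to the number of samples, and the simple Riemann-sum integration are all assumptions made for illustration. The idea is that each coefficient weights one Bessel basis function, and since the zeros of J0 grow roughly linearly, a contiguous range of coefficient indices behaves like a frequency band from which one component can be rebuilt.

```python
import numpy as np
from scipy.special import j0, j1, jn_zeros


def fb_coefficients(x, num_coeffs):
    """Approximate zero-order Fourier-Bessel series coefficients of x.

    Assumption: x is sampled at t = 0, 1, ..., n-1 and the expansion
    interval is (0, a) with a = n. Returns (coeffs, basis), where
    basis[m] is J0(root_m * t / a) evaluated on the sample grid.
    """
    x = np.asarray(x, dtype=float)
    n = len(x)
    a = float(n)
    t = np.arange(n)
    roots = jn_zeros(0, num_coeffs)          # first positive zeros of J0
    basis = j0(np.outer(roots, t) / a)       # shape (num_coeffs, n)
    # C_m = 2 / (a^2 J1(root_m)^2) * integral_0^a t x(t) J0(root_m t / a) dt,
    # with the integral approximated by a unit-step Riemann sum.
    coeffs = 2.0 * (basis @ (t * x)) / (a ** 2 * j1(roots) ** 2)
    return coeffs, basis


def reconstruct_band(coeffs, basis, lo, hi):
    """Rebuild one component from coefficient indices lo..hi-1 (hypothetical band)."""
    return coeffs[lo:hi] @ basis[lo:hi]


if __name__ == "__main__":
    n = 512
    t = np.arange(n)
    roots = jn_zeros(0, 20)
    # Toy two-component signal built from two basis functions.
    x = j0(roots[2] * t / n) + 0.5 * j0(roots[12] * t / n)
    coeffs, basis = fb_coefficients(x, 20)
    low = reconstruct_band(coeffs, basis, 0, 8)     # captures the first term
    high = reconstruct_band(coeffs, basis, 8, 20)   # captures the second term
    print(np.round(coeffs[[2, 12]], 3))
```

In this toy case the two components occupy disjoint index ranges, so splitting the coefficients at an index between them separates the signal. For real speech the band boundaries would have to be chosen from the coefficient magnitudes, which this sketch does not attempt.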