Perturbation analysis of mel-frequency cepstrum coefficients

Audio Language and Image Processing(2010)

引用 3|浏览2
暂无评分
摘要
Mel-frequency cepstrum coefficient (MFCC) is a widely used feature vector in speech signal precessing. Its feature extraction procedure can be seen as a mapping function which transfers the input speech signals to output MFCC feature vectors. However, this function is too complex to analyze and even a simple approximation is not easy to obtain. This paper studies the effects of each MFCC feature extraction step and obtains the relation between the input signal-to-noise ratio (SNR) and the output perturbation bound of MFCC feature vectors. Experimental results show that the obtained bounds are ”tight” and nearly full covered. This analysis method may help us to find new clue of MFCC and may has potential applications in speech recognition.
更多
查看译文
关键词
speech processing,perturbation analysis,mfcc feature extraction,speech signal processing,mel-frequency cepstrum coefficients,feature extraction,mfcc feature vectors,mapping function,signal-to-noise ratio,feature vector,speech recognition,mel frequency cepstral coefficient,speech,upper bound,signal to noise ratio
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要