Modulation frequency features for phoneme recognition in noisy speech.

JOURNAL OF THE ACOUSTICAL SOCIETY OF AMERICA(2009)

引用 41|浏览25
暂无评分
摘要
In this letter, a new feature extraction technique based on modulation spectrum derived from syllable-length segments of subband temporal envelopes is proposed. These subband envelopes are derived from autoregressive modeling of Hilbert envelopes of the signal in critical bands, processed by both a static (logarithmic) and a dynamic (adaptive loops) compression. These features are then used for machine recognition of phonemes in telephone speech. Without degrading the performance in clean conditions, the proposed features show significant improvements compared to other state-of-the-art speech analysis techniques. In addition to the overall phoneme recognition rates, the performance with broad phonetic classes is reported.
更多
查看译文
关键词
autoregressive processes,feature extraction,Hilbert transforms,speech processing,speech recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要