Discriminative speaker adaptation in Persian continuous speech recognition systems
Procedia - Social and Behavioral Sciences(2012)
摘要
In this paper, the use of discriminative criteria such as minimum phone error (MPE) and maximum mutual information (MMI) is investigated for discriminative training HMM models for Persian speech recognition system. Discriminative training criteria have been successfully used to train acoustic models, so these criteria are expected to improve the estimation of linear transforms for speaker adaptation. MPE criterion is used to estimate the discriminative linear transforms (DLTs) for mean transforms. Experiments on Farsdat corpus show considerable improvements of discriminative training against ML trained models and MPE training outperforms MMI training on test data. Furthermore, MPE-based DLT reduces the word error rate in comparison to MLLR adaptation.
更多查看译文
关键词
Speech recognition,discriminative training,speaker adaptation,discrminative linear transform,minimum phone error
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要