Data-driven Arabic phoneme recognition using varying number of HMM states

Communications, Signal Processing, and their Applications(2013)

引用 6|浏览9
暂无评分
摘要
Continuous Arabic Speech Recognition, appears in many real life applications. Its speed, accuracy and improvement are highly dependent on the accuracy of the language phonemes set. The main goal of this research is to recognize and transcribe the Arabic phonemes based on a data-driven approach. We built a phoneme recognizer based on a data driven approach using HTK tool. Different numbers of Gaussian mixtures with different numbers of HMM states were used in modeling the Arabic phonemes in order to reach the best configuration. The corpus used consists of about 4000 files, representing 5 recorded hours of modern standard Arabic of TV-News. The maximum phoneme recognition accuracy reached was 56.79%. This result is very encouraging and shows the viability of our approach as compared to using a fixed number of HMM states.
更多
查看译文
关键词
gaussian processes,hidden markov models,natural language processing,speech processing,speech recognition,television broadcasting,gaussian mixtures,hmm state varying number,htk tool,tv-news,continuous arabic speech recognition,data-driven arabic phoneme recognition,hidden markov model,language phonemes set,maximum phoneme recognition accuracy,phoneme recognizer,arabic speech recognition,kfupm arabic speech,phoneme recognition,corpus hmm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要