Linear Predictive Coefficients-Based Feature To Identify Top-Seven Spoken Languages

International Journal of Pattern Recognition and Artificial Intelligence(2020)

引用 10|浏览28
暂无评分
摘要
Speech recognition in multilingual scenario is not trivial in the case when multiple languages are used in one conversation. Language must be identified before we process speech recognition as such tools are language-dependent. We present a language identification system (or AI tool) to distinguish top-seven world languages namely Chinese, Spanish, English, Hindi, Arabic, Bangla and Portuguese [G. F. Simons and C. D. Fennig (eds.), Ethnologue: Laguage of the Americas and the Pacific, Twentieth Edn. (SIL Internatinal, 2017)]. The system uses linear predictive coefficients-based feature, i.e. the line spectral pair-grade ratio (LSP-GR) feature, and ensemble learning for classification. Experiments were performed on more than 200 h of real-world YouTube data and the highest possible accuracy of 96.95% was received. The results can be compared with other machine learning classifiers.
更多
查看译文
关键词
Automatic language identification, line spectral pair, ensemble learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要