A Comprehensive Study on Bangla Automatic Speech Recognition Systems

Prajat Paul, Mohamed Mehfoud Bouh,Forhad Hossain, Ashir Ahmed

2023 2nd International Conference on Frontiers of Communications, Information System and Data Science (CISDS)(2023)

引用 0|浏览0
暂无评分
摘要
This article examines the current state of speech recognition technologies in Bangla, which is one of the Low Resource Languages (LRLs) spoken by a population of almost 185 million in Bangladesh and Western India. Out of the 12,338 articles on Bangla Speech Recognition Systems, 15 have been selected based on the fine-tuning approach of deep-learning algorithms, speech corpus development, gender bias mitigation, and unique acoustic modeling. However, the datasets and evaluation metrics used in these articles vary, making it difficult to compare their performance. To address this, a comparative analysis of the selected papers is provided, summarizing the technological approaches, adapted methods, and achieved performance. Additionally, a prospective application of speech-based data collection in the healthcare domain is introduced, highlighting its potential.
更多
查看译文
关键词
Automatic Speech Recognition (ASR),Low-resource language,Deep Learning,Bangla ASR
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要