Evaluating a Fine-Tuned Whisper Model on Underrepresented Romanian Speech

2023 International Conference on Speech Technology and Human-Computer Dialogue (SpeD)(2023)

引用 0|浏览5
暂无评分
摘要
Speech datasets available for training Romanian automatic speech recognition (ASR) systems are constructed around similar demographics (male voices, age between 19-29 years). In this paper, we present a dataset of underrepresented Romanian speech (USPDATRO), constructed from open data. We fine-tune a state-of-the-art Whisper model using existing datasets and evaluate the resulting model on the dataset of underrepresented speech. Results indicate that more such data is needed to improve the performance of Romanian ASR sytems.
更多
查看译文
关键词
automatic speech recognition,Romanian,underrepresented speech dataset
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要