Development and Analysis of a Versatile Dataset of Speech, Real and Synthesized, of Arabic Learners

2023 3rd International Conference on Computing and Information Technology (ICCIT)(2023)

引用 0|浏览0
暂无评分
摘要
Computer-Aided Pronunciation Training (CAPT) systems are gaining popularity recently due to the advancements in deep neural networks (DNN) and machine learning and the availability of databases of speech of language learners. Unfortunately, research in Arabic CAPT systems suffer from the lack of CAPT datasets compared to other languages. In this paper, we present the details of and the ideas used in the development of a versatile dataset of speech, real and synthesized, of Arabic Learners. To develop the dataset, we utilized an existing Arabic speech corpus, King Saud University Speech Database (KSU-SD). KSU-SD's main application was a speaker recognition system, but it was designed to be useful in other applications such as CAPT systems. KSU-SD includes a large number of speakers from diverse nationalities (Saudis, Arabs, and Non-Arabs). KSU-SD contains the recording of about 60 non-native speakers from more than 20 nationalities, hence we selected it to build the non-native Arabic-CAPT corpus. The developed corpus consists of transcribed, segmented, and annotated speech, which makes it suitable for building Arabic CAPT systems. In addition to presenting the details of developing the dataset, we also present an analysis of the text and speech errors of the dataset. The dataset was verified by many CAPT systems.
更多
查看译文
关键词
Arabic CAPT,MDD,Synthesis Speech
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要