Sell-Corpus: An Open Source Multiple Accented Chinese-English Speech Corpus For L2 English Learning Assessment

2019 IEEE INTERNATIONAL CONFERENCE ON ACOUSTICS, SPEECH AND SIGNAL PROCESSING (ICASSP)(2019)

引用 7|浏览13
暂无评分
摘要
We present SELL-CORPUS, a multiple accented speech corpus for L2 English learning in China, aiming at the potential research of multiple accented acoustic model, mispronunciation detection and pronunciation assessment for future nationwide oral English tests. Our corpus contains 31.6 hour speech recordings contributed by 389 volunteer speakers, including 186 males and 203 females. Our corpus covers seven major regional dialects and provides a baseline for Chinese multiple accented automatic speech recognition system. We released our speech corpus to the public for academic research. To the best of our knowledge, it is the first open-source English speech corpus that accounts for the accents of all major Chinese regional dialects.
更多
查看译文
关键词
English speech corpus, Chinese dialects, Automatic speech recognition, Second language learning, English pronunciation assessment
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要