Unsupervised Pronunciation Fluency Scoring by infoGan

Asia-Pacific Signal and Information Processing Association Annual Summit and Conference（2019）

引用 2|浏览3

暂无评分

摘要

Pronunciation fluency scoring (PFS) is a primary task in computer-aided second language (L2) learning. Most of existing PFS algorithms are based on supervised learning, where human-labeled scores are used to train the scoring model. However, the human labeling is rather costly and tends to be biased. In order to tackle this problem, we propose an unsupervised learning approach, where an infoGan model is constructed to infer latent speech codes, and then these codes are used to build a classifier that distinguishes native and foreign speech. We found that this native-foreign classifier can generate good utterance-based fluency scores.

查看译文

关键词

human-labeled scores,scoring model,unsupervised learning approach,infoGan model,latent speech codes,native-foreign classifier,unsupervised pronunciation fluency scoring,computer-aided second language learning,PFS algorithms,utterance-based fluency scores

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要