Semi-Supervised Training In Low-Resource Asr And Kws

Florian Metze,Ankur Gandhe, Yajie Miao,Zaid Sheikh,Yun Wang,Di Xu,Hao Zhang,Jungsuk Kim,Ian Lane,Won Kyum Lee,Sebastian Stueker,Markus Mueller

2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)（2015）

引用 14|浏览153

暂无评分

摘要

In particular for "low resource" Keyword Search (KWS) and Speech-to-Text (STT) tasks, more untranscribed test data may be available than training data. Several approaches have been proposed to make this data useful during system development, even when initial systems have Word Error Rates (WER) above 70%. In this paper, we present a set of experiments on low-resource languages in telephony speech quality in Assamese, Bengali, Lao, Haitian, Zulu, and Tamil, demonstrating the impact that such techniques can have, in particular learning robust bottle-neck features on the test data. In the case of Tamil, when significantly more test data than training data is available, we integrated semi-supervised training and speaker adaptation on the test data, and achieved significant additional improvements in STT and KWS.

查看译文

关键词

spoken term detection,automatic speech recognition,low-resource LTs,semi-supervised training

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要