Pseudo-Labeling for Massively Multilingual Speech Recognition

Loren Lugosch, Tatiana Likhomanenko, Gabriel Synnaeve, Ronan Collobert

IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (2022)

Abstract

Semi-supervised learning through pseudo-labeling has become a staple of state-of-the-art monolingual speech recognition systems. In this work, we extend pseudo-labeling to massively multilingual speech recognition with 60 languages. We propose a simple pseudo-labeling recipe that works well even with low-resource languages: train a supervised multilingual model, fine-tune it with semi-supervised learning on a target language, generate pseudo-labels for that language, and train a final model using pseudo-labels for all languages, either from scratch or by fine-tuning. Experiments on the labeled Common Voice and unlabeled VoxPopuli datasets show that our recipe can yield a model with better performance for many languages that also transfers well to LibriSpeech.
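The four steps of the recipe can be sketched as a small driver function. This is only an illustrative outline, not the authors' implementation: the function name `pseudo_label_recipe` and the four callables passed into it are hypothetical placeholders for real training and decoding code, utterances are represented as plain strings, and merging the pseudo-labels with the original labeled data before the final training step is an assumption the abstract does not spell out.

```python
from typing import Callable, Dict, List, Tuple

Utterance = str                        # stand-in for an audio clip (assumption)
Labeled = List[Tuple[Utterance, str]]  # (audio, transcript) pairs


def pseudo_label_recipe(
    labeled: Dict[str, Labeled],
    unlabeled: Dict[str, List[Utterance]],
    train_supervised: Callable[[Dict[str, Labeled]], object],
    finetune_semi_supervised: Callable[[object, Labeled, List[Utterance]], object],
    transcribe: Callable[[object, Utterance], str],
    train_final: Callable[[Dict[str, Labeled]], object],
):
    """Hypothetical outline of the recipe; all callables are placeholders."""
    # Step 1: train one supervised multilingual model on all labeled data.
    base = train_supervised(labeled)

    # Copy the labeled data so the caller's dictionary is not mutated.
    combined = {lang: list(pairs) for lang, pairs in labeled.items()}

    for lang, clips in unlabeled.items():
        # Step 2: fine-tune the multilingual model on one target language
        # with semi-supervised learning.
        adapted = finetune_semi_supervised(base, labeled.get(lang, []), clips)
        # Step 3: generate pseudo-labels for that language's unlabeled audio.
        pseudo = [(clip, transcribe(adapted, clip)) for clip in clips]
        combined.setdefault(lang, []).extend(pseudo)

    # Step 4: train the final model with pseudo-labels for all languages.
    # Whether this starts from scratch or fine-tunes an existing model is
    # left to the train_final callable.
    return train_final(combined), combined
```

Passing the training steps in as callables keeps the sketch independent of any particular speech toolkit; only the order of the steps is taken from the abstract.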

Keywords

speech recognition, massively multilingual models, semi-supervised learning, pseudo-labeling
