Speaker diarization with unsupervised training framework

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2016)

引用 1|浏览0
暂无评分
摘要
This paper investigates single and cross-show diarization based on an unsupervised i-vector framework, on French TV and Radio corpora. This framework uses speaker clustering as a way to automatically select data from unlabeled corpora to train i-vector PLDA models. Performances between supervised and unsupervised models are compared. The experimental results on two distinct test corpora (one TV, one Radio) show that unsupervised models perform as good as supervised models for both tasks. Such results indicate that performing an effective cross-show diarization on new language or new domain data in the future should not depend on the availability of manually annotated data.
更多
查看译文
关键词
Speaker diarization,speaker linking,unsupervised training
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要