Two Step Speaker Segmentation Method Using Bayesian Information Criterion And Adapted Gaussian Mixtures Models

INTERSPEECH 2008: 9TH ANNUAL CONFERENCE OF THE INTERNATIONAL SPEECH COMMUNICATION ASSOCIATION 2008, VOLS 1-5(2008)

引用 23|浏览1
暂无评分
摘要
This paper addresses the topic of online unsupervised speaker segmentation in a complex audio environment as it is present in the Broadcast News databases. A new two stage speaker change detection algorithm is proposed, which combines the Bayesian Information Criterion with an ABLS-SCD statistical framework where adapted Gaussian mixture models are used to achieve higher accuracy. To enhance the performance of the proposed method a sub-window dependent threshold selection strategy for the ABLS-SCD is introduced. Also an additional window selection strategy for the proposed method is presented. Experimental design and test evaluation were carried out on the Slovenian BNSI Broadcast News database.
更多
查看译文
关键词
speaker segmentation, Bayesian Information Criterion, statistical speaker modeling, Universal Background Model
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要