Ivec-PLDA-AHC priors for VB-HMM speaker diarization system

2017 IEEE International Workshop on Signal Processing Systems (SiPS) (2017)

Abstract
This paper proposes a hybrid speaker diarization system. Its main body is a variational Bayes hidden Markov model (VB-HMM) speaker diarization system, which avoids making premature hard decisions and exploits soft speaker information iteratively. As a result, it outperforms most mainstream speaker diarization systems. Unfortunately, this system is sensitive to its prior in some cases: either a uniform prior or a flat Dirichlet prior may fail and lead to poor results, so a more robust and informative prior is desired. The second branch is an i-vector, probabilistic linear discriminant analysis, agglomerative hierarchical clustering (Ivec-PLDA-AHC) system. Benefiting from the excellent performance of the Ivec-PLDA system in the speaker recognition field, the Ivec-PLDA-AHC speaker diarization system is believed to be more powerful at clustering segmental i-vectors by speaker. Inspired by this, we take the output of the Ivec-PLDA-AHC system as the VB-HMM's prior. Experiments on our collected database show that the proposed system is significantly better than both of the mentioned systems.
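
The idea of using a clustering output as a prior can be illustrated with a small sketch. The abstract does not say how the Ivec-PLDA-AHC output is converted into a prior, so everything below is an assumption made for illustration: the function names `labels_to_soft_prior` and `speaker_weights`, the `confidence` smoothing parameter, and the choice to spread the remaining probability mass evenly over the other speakers are all hypothetical and are not taken from the paper.

```python
from typing import List


def labels_to_soft_prior(labels: List[int], num_speakers: int,
                         confidence: float = 0.9) -> List[List[float]]:
    """Turn hard per-segment cluster labels into soft speaker assignments.

    Assumption (not from the paper): each segment gives `confidence` to its
    labelled speaker and splits the rest evenly among the other speakers,
    so later iterations can still revise the clustering decision.
    """
    if num_speakers < 2:
        raise ValueError("need at least two speakers")
    if not 0.0 < confidence < 1.0:
        raise ValueError("confidence must lie strictly between 0 and 1")
    rest = (1.0 - confidence) / (num_speakers - 1)
    prior = []
    for label in labels:
        if not 0 <= label < num_speakers:
            raise ValueError(f"label {label} out of range")
        row = [rest] * num_speakers
        row[label] = confidence
        prior.append(row)
    return prior


def speaker_weights(prior: List[List[float]]) -> List[float]:
    """Average the rows to obtain prior speaker proportions."""
    n = len(prior)
    return [sum(col) / n for col in zip(*prior)]


# Hypothetical clustering output for six segments and two speakers.
ahc_labels = [0, 0, 1, 1, 1, 0]
q_init = labels_to_soft_prior(ahc_labels, num_speakers=2)
weights = speaker_weights(q_init)
```

Here `q_init` would stand in for a uniform or flat initialization of the per-segment speaker assignments, and `weights` for the initial speaker proportions. A smoothed assignment rather than a one-hot one keeps the soft-decision character that the abstract credits to the VB-HMM system.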
Keywords
I-vector (Ivec), probabilistic linear discriminant analysis (PLDA), agglomerative hierarchical clustering (AHC), variational Bayes (VB), hidden Markov model (HMM), speaker diarization