ESTIMATING THE DOMINANT PERSON IN MULTI-PARTY CONVERSATIO NS USING SPEAKER DIARIZATION STRATEGIES

International Conference on Acoustics, Speech, and Signal Processing(2008)

引用 40|浏览5
暂无评分
摘要
In this paper, we apply speaker diarization strategies from a single source to the task of estimating the dominant person in a group meet- ing. Previous work has shown that speaking length is strongly cor- related with perceived dominance. Here we investigate this in more depth by considering two dominance tasks where there is full and majority agreement amongst ground-truth annotators. In addition, we investigate how 24 different speed-up and algorithmic strategies, and source types lead to interesting outcomes when applied to dom- inance estimation. We obtained the best performance of 77% using our slowest scheme and a single distant microphone (SDM). Within the top 3 out of 24 performing experiments in both dominance tasks, we show that we can use the furthest SDM, with no prior knowledge of the number of speakers and the fastest diarization scheme, which performs 1.3 times faster than real-time.
更多
查看译文
关键词
index terms— speaker diarization,dominance modelling
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要