Automatic Detection of Laughter and Fillers in Spontaneous Mobile Phone Conversations

Systems, Man, and Cybernetics(2013)

引用 49|浏览0
暂无评分
摘要
This article presents experiments on automatic detection of laughter and fillers, two of the most important nonverbal behavioral cues observed in spoken conversations. The proposed approach is fully automatic and segments audio recordings captured with mobile phones into four types of interval: laughter, filler, speech and silence. The segmentation methods rely not only on probabilistic sequential models (in particular Hidden Markov Models), but also on Statistical Language Models aimed at estimating the a-priori probability of observing a given sequence of the four classes above. The experiments are speaker independent and performed over a total of 8 hours and 25 minutes of data (120 people in total). The results show that F1 scores up to 0.64 for laughter and 0.58 for fillers can be achieved.
更多
查看译文
关键词
segmentation method,important nonverbal behavioral cue,statistical language,spontaneous mobile phone conversations,automatic detection,mobile phone,probabilistic sequential model,a-priori probability,f1 score,particular hidden markov models,hidden markov models,speech recognition,audio signal processing
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要