Inter-Speaker Variability In Forensic Voice Comparison: A Preliminary Evaluation

2016 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)(2016)

引用 13|浏览44
暂无评分
摘要
In forensic voice comparison, it is strongly recommended to follow Bayesian paradigm. In this paradigm, the strength of the forensic evidence is summarized by a likelihood ratio (LR). The LR magnitude quantifies the strength of the evidence: far from unity for a meaningful LR (a LR which supports strongly one of the hypothesis); close to unity when the evidence is next to useless. Despite this nice theoretical aspect, the LR does not embed the reliability of its estimation process itself. And, in various cases, a lack in reliability inside the estimation process is able to destroy the reliability of the resulting LR. It is particularly true when voice comparison is considered, as Speaker Recognition (SR) systems are outputting a score in all situations regardless of the case specific conditions. Furthermore, SR systems use different normalization steps to see their scores as LR and these normalization steps are clearly a potential source of bias. Consequently, a complete view of reliability should be taken into account for forensic voice comparison. This article focuses on one part of this question, the "speaker factor", the characteristics and the behaviors of the two speakers involved in a voice comparison trial.
更多
查看译文
关键词
forensic voice comparison,inter-speaker variability,speaker profile,speaker recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要