Fusion of speech, faces and text for person identification in TV broadcast

COMPUTER VISION - ECCV 2012, PT III(2012)

引用 21|浏览0
暂无评分
摘要
The Repere challenge is a project aiming at the evaluation of systems for supervised and unsupervised multimodal recognition of people in TV broadcast. In this paper, we describe, evaluate and discuss QCompere consortium submissions to the 2012 Repere evaluation campaign dry-run. Speaker identification (and face recognition) can be greatly improved when combined with name detection through video optical character recognition. Moreover, we show that unsupervised multimodal person recognition systems can achieve performance nearly as good as supervised monomodal ones (with several hundreds of identity models).
更多
查看译文
关键词
qcompere consortium submission,video optical character recognition,person identification,face recognition,supervised monomodal,unsupervised multimodal recognition,identity model,tv broadcast,unsupervised multimodal person recognition,repere challenge,repere evaluation campaign dry-run
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要