Names and Faces
The Physician and sportsmedicine(2004)
摘要
We show that a large and realistic face dataset can be built from news photographs and their associated captions. Our automatically constructed face dataset consists of 30,281 face im- ages, obtained by applying a face nder to approximately half a million captioned news images and labeled using image information from the photographs and word information extracted from the corresponding caption. This dataset is more realistic than usual face recognition datasets, because it contains faces captured ìin the wildî in a variety of cong- urations with respect to the camera, taking a variety of expressions, and under illumination of widely varying color. Faces are extracted from the images and names with context are extracted from the associated caption. Our system uses a clustering procedure to nd the correspondence between faces and associated names in news picture-caption pairs. The context in which a name appears in a caption provides powerful cues as to whether it is depicted in the associated image. By incorporating simple natural language techniques, we are able to improve our name assignment signicantly . We use two models of word context, a Naive Bayes model and a Maximum Entropy model. Once our procedure is complete, we have an accurately labeled set of faces, an appearance model for each indi- vidual depicted, and a natural language model that can produce accurate results on captions in isolation.
更多查看译文
关键词
faces,words,news,names,pictures
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络