Love Thy Neighbors: Image Annotation by Exploiting Image Metadata

2015 IEEE International Conference on Computer Vision (ICCV), 2015

Abstract
Some images that are difficult to recognize on their own may become clearer in the context of a neighborhood of related images with similar social-network metadata. We build on this intuition to improve multilabel image annotation. Our model uses image metadata nonparametrically to generate neighborhoods of related images using Jaccard similarities, then uses a deep neural network to blend visual information from the image and its neighbors. Prior work typically models image metadata parametrically; in contrast, our nonparametric treatment allows our model to perform well even when the vocabulary of metadata changes between training and testing. We perform comprehensive experiments on the NUS-WIDE dataset, where we show that our model outperforms state-of-the-art methods for multilabel image annotation even when our model is forced to generalize to new types of metadata.
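The neighborhood step described in the abstract can be illustrated with a minimal sketch. It assumes that each image's metadata is a set of user tags and that a neighborhood is simply the k images with the highest Jaccard similarity to the query. The function names, the toy tags, and the choice of k are hypothetical and are not taken from the paper, and the deep network that blends visual information is not shown.

```python
from typing import Dict, List, Set


def jaccard(a: Set[str], b: Set[str]) -> float:
    """Jaccard similarity |a & b| / |a | b|; taken to be 0.0 when both sets are empty."""
    union = a | b
    if not union:
        return 0.0
    return len(a & b) / len(union)


def nearest_neighbors(query: Set[str], corpus: Dict[str, Set[str]], k: int) -> List[str]:
    """Return the ids of the k corpus images whose metadata is most similar to the query."""
    ranked = sorted(
        corpus,
        key=lambda image_id: jaccard(query, corpus[image_id]),
        reverse=True,
    )
    return ranked[:k]


# Hypothetical metadata: each image id maps to a set of user tags.
corpus = {
    "img_a": {"beach", "sunset", "ocean"},
    "img_b": {"beach", "ocean", "waves", "surf"},
    "img_c": {"city", "night", "skyline"},
}
query_tags = {"beach", "ocean", "waves"}

# Neighborhood of the query image under the assumptions stated above.
neighbors = nearest_neighbors(query_tags, corpus, k=2)
```

Because the similarity is computed directly on tag sets rather than through learned per-tag parameters, a tag never seen during training can still contribute to the overlap, which is consistent with the abstract's claim about changing metadata vocabularies.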
Keywords
image recognition, images neighborhood, social-network metadata, multilabel image annotation, image metadata, Jaccard similarities, deep neural network, visual information, nonparametric treatment, NUS-WIDE dataset