User generated video annotation using geo-tagged image databases

ICME(2009)

引用 14|浏览6
暂无评分
摘要
In this paper we propose a system that annotates a user generated video based on the associated location metadata, by exploiting user-tagged image databases. An example of such a database is a photo sharing website such as Flickr [1] where users upload their images and annotate them with various tags. The goal is to find the tags that have high probability of being relevant to the video without any complex object or action recognition being done to the video sequence. A video is first segmented into camera views and a set of keyframes are selected to represent the video. We will describe the concept of camera view as the basic element of user generated videos which has special properties suitable for the video annotation application. The keyframes are used to retrieve the most relevant images in the database. A "tag processing" step is then used to tag the video.
更多
查看译文
关键词
video annotation application,tag processing,action recognition,various tag,camera view,geo-tagged image databases,complex object,associated location metadata,relevant image,basic element,video sequence,data mining,meta data,object recognition,probability,image segmentation,histograms,geo location
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要