Visual Recognition With Humans In The Loop

ECCV'10: Proceedings of the 11th European conference on Computer vision: Part IV(2010)

引用 575|浏览302
暂无评分
摘要
We present an interactive, hybrid human-computer method for object classification. The method applies to classes of objects that are recognizable by people with appropriate expertise (e.g., animal species or airplane model); but not (in general) by people without such expertise. It can be seen as a visual version of the 20 questions game, where questions based on simple visual attributes are posed interactively. The goal is to identify the true class while minimizing the number of questions asked; using the visual content of the image. We introduce a general framework for incorporating almost any off-the-shelf multi-class object; recognition algorithm into the visual 20 questions game, and provide methodologies to account for imperfect user responses and unreliable computer vision algorithms. We evaluate our methods on Birds-200, a difficult dataset of 200 tightly-related bird species, and on the Animals With Attributes dataset. Our results demonstrate that incorporating user input drives up recognition accuracy to levels that are good enough for practical applications, while at the same time, computer vision reduces the amount of human interaction required.
更多
查看译文
关键词
questions game,simple visual attribute,visual content,visual version,Attributes dataset,animal species,appropriate expertise,computer vision,difficult dataset,general framework,visual recognition
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要