Improving Bag Of Visual Words Representations With Genetic Programming

2015 International Joint Conference on Neural Networks (IJCNN) (2015)

Abstract
The bag of visual words is a well-established representation in diverse computer vision problems. Taking inspiration from the fields of text mining and retrieval, this representation has proved to be very effective in a large number of domains. In most cases, a standard term-frequency weighting scheme is used to represent images and videos in computer vision. This is somewhat surprising, as the text processing community offers many alternative ways of generating bag of words representations. This paper explores the use of alternative weighting schemes for landmark tasks in computer vision: image categorization and gesture recognition. We study the suitability of well-known supervised and unsupervised weighting schemes for such tasks. More importantly, we devise a genetic program that learns new ways of representing images and videos under the bag of visual words representation. The proposed method learns to combine term-weighting primitives so as to maximize classification performance. Experimental results on standard image and video data sets show the effectiveness of the proposed evolutionary algorithm.
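To make the contrast between weighting schemes concrete, the following minimal sketch compares plain term-frequency weighting with TF-IDF, one well-known unsupervised alternative from text processing, on a toy matrix of visual-word counts. The function names, the toy counts, and the choice of TF-IDF are illustrative assumptions; the abstract does not list the specific schemes or primitives the paper uses, and this is not the authors' implementation.

```python
import math

def tf_weights(counts):
    """Term frequency: normalize each image's visual-word counts to sum to 1."""
    weighted = []
    for row in counts:
        total = sum(row)
        weighted.append([c / total if total else 0.0 for c in row])
    return weighted

def idf(counts):
    """Inverse document frequency per visual word: log(N / df)."""
    n_docs = len(counts)
    n_words = len(counts[0])
    values = []
    for j in range(n_words):
        df = sum(1 for row in counts if row[j] > 0)  # images containing word j
        values.append(math.log(n_docs / df) if df else 0.0)
    return values

def tf_idf_weights(counts):
    """TF-IDF: down-weight visual words that occur in most images."""
    idf_values = idf(counts)
    return [[tf * w for tf, w in zip(row, idf_values)]
            for row in tf_weights(counts)]

# Hypothetical toy data: 3 images, vocabulary of 3 visual words.
counts = [
    [4, 0, 1],
    [2, 2, 0],
    [1, 0, 0],
]
tf = tf_weights(counts)
tfidf = tf_idf_weights(counts)
```

In this toy example, visual word 0 occurs in every image, so its IDF is zero and TF-IDF discards it entirely, while plain term frequency gives it the largest weight. A genetic program of the kind the abstract describes would search over combinations of such primitives rather than fixing one formula in advance.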
Keywords
bag of visual words representation, genetic programming, computer vision problems, text mining, text retrieval, standard term-frequency weighting scheme, image representation, video representation, text processing, landmark tasks, image categorization, gesture recognition, unsupervised weighting schemes, image classification, evolutionary algorithm, supervised weighting schemes