Compact Bilinear Pooling

2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2016)

引用 976|浏览148
暂无评分
摘要
Bilinear models has been shown to achieve impressive performance on a wide range of visual tasks, such as semantic segmentation, fine grained recognition and face recognition. However, bilinear features are high dimensional, typically on the order of hundreds of thousands to a few million, which makes them impractical for subsequent analysis. We propose two compact bilinear representations with the same discriminative power as the full bilinear representation but with only a few thousand dimensions. Our compact representations allow back-propagation of classification errors enabling an end-to-end optimization of the visual recognition system. The compact bilinear representations are derived through a novel kernelized analysis of bilinear pooling which provide insights into the discriminative power of bilinear pooling, and a platform for further research in compact pooling methods. Experimentation illustrate the utility of the proposed representations for image classification and few-shot learning across several datasets.
更多
查看译文
关键词
compact bilinear pooling,visual tasks,semantic segmentation,fine grained recognition,face recognition,bilinear features,bilinear representations,backpropagation,classification errors,end-to-end optimization,visual recognition system,image classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要