Switchable Deep Network for Pedestrian Detection

CVPR(2014)

引用 309|浏览495
暂无评分
摘要
In this paper, we propose a Switchable Deep Network (SDN) for pedestrian detection. The SDN automatically learns hierarchical features, salience maps, and mixture representations of different body parts. Pedestrian detection faces the challenges of background clutter and large variations of pedestrian appearance due to pose and viewpoint changes and other factors. One of our key contributions is to propose a Switchable Restricted Boltzmann Machine (SRBM) to explicitly model the complex mixture of visual variations at multiple levels. At the feature levels, it automatically estimates saliency maps for each test sample in order to separate background clutters from discriminative regions for pedestrian detection. At the part and body levels, it is able to infer the most appropriate template for the mixture models of each part and the whole body. We have devised a new generative algorithm to effectively pretrain the SDN and then fine-tune it with back-propagation. Our approach is evaluated on the Caltech and ETH datasets and achieves the state-of-the-art detection performance.
更多
查看译文
关键词
pedestrians,image representation,back-propagation,sdn,pedestrian detection,complex visual variation mixture,switchable deep network,pedestrian appearance,mixture models,boltzmann machines,salience map estimation,background clutter,object detection,hierarchical feature learning,discriminative regions,clutter,caltech datasets,switchable restricted boltzmann machine,eth datasets,mixture body part representations,srbm,logistics,switches,detectors,back propagation,feature extraction,vectors
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要