VikingDet: A Real-time Person and Face Detector for Surveillance Cameras

2019 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS)(2019)

引用 1|浏览9
暂无评分
摘要
In this paper, we propose a novel one-stage detector that can simultaneously detect both pedestrians and their faces. The framework is named as VikingDet for its simple but effective two-headed architecture. To tackle the challenges of person and face detection especially under surveillance cameras (e.g. low data quality, complex environments, requirements for efficiency, etc.), we make contributions in the following several aspects: 1) integrating both person and face detection into one network which current leading object detection algorithms are seldomly able to handle; 2) emphasizing detection in low-quality images. we introduce multiple thresholds for matching different sized positive samples, and set proper hyper-parameters, hence our VikingDet is able to locate small objects in surveillance cameras even of low-quality; 3) introducing a training strategy to utilize datasets on hand. Since most available public datasets annotate only people without their faces or faces without bodies, we use multi-step training and an integrated loss function to train VikingDet with these partly annotated data. As a consequence, our detector achieves satisfactory performances in several relative benchmarks with a speed at more than 60 FPS on NVIDIA TITAN X GPU, and can be further deployed on an embedded device such as NVIDIA Jetson TX1 or TX2 with a real-time speed of over 28 FPS.
更多
查看译文
关键词
VikingDet,surveillance cameras,one-stage detector,face detection,data quality,object detection algorithms,quality images,person detector,two-headed architecture,pedestrians,NVIDIA TITAN X GPU,embedded device,integrated loss function,multiple thresholds
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要