One-Shot Video Object Segmentation
2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2017)
摘要
This paper tackles the task of semi-supervised video object segmentation, i.e., the separation of an object from the background in a video, given the mask of the first frame. We present One-Shot Video Object Segmentation (OSVOS), based on a fully-convolutional neural network architecture that is able to successively transfer generic semantic information, learned on ImageNet, to the task of foreground segmentation, and finally to learning the appearance of a single annotated object of the test sequence (hence one-shot). Although all frames are processed independently, the results are temporally coherent and stable. We perform experiments on two annotated video segmentation databases, which show that OSVOS is fast and improves the state of the art by a significant margin (79.8% vs 68.0%).
更多查看译文
关键词
generic semantic information,foreground segmentation,single annotated object,annotated video segmentation databases,OSVOS,semisupervised video object segmentation,One-Shot Video Object Segmentation,fully-convolutional neural network architecture,one-shot video object segmentation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络