Video Instance Segmentation 2019: A Winning Approach for Combined Detection, Segmentation, Classification and Tracking
2019 IEEE/CVF International Conference on Computer Vision Workshop (ICCVW)(2019)
摘要
Video Instance Segmentation (VIS) is the task of localizing all objects in a video, segmenting them, tracking them throughout the video and classifying them into a set of predefined classes. In this work, divide VIS into these four parts: detection, segmentation, tracking and classification. We then develop algorithms for performing each of these four sub tasks individually, and combine these into a complete solution for VIS. Our solution is an adaptation of UnOVOST, the current best performing algorithm for Unsupervised Video Object Segmentation, to this VIS task. We benchmark our algorithm on the 2019 YouTube-VIS Challenge, where we obtain first place with an mAP score of 46.7%.
更多查看译文
关键词
Video Instance Segmentation,Detection,Tracking,Segmentation,Classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络