Data Distillation: Towards Omni-Supervised Learning

2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR 2018)

Citations: 472 | Views: 635
Abstract
We investigate omni-supervised learning, a special regime of semi-supervised learning in which the learner exploits all available labeled data plus internet-scale sources of unlabeled data. Omni-supervised learning is lower-bounded by performance on existing labeled datasets, offering the potential to surpass state-of-the-art fully supervised methods. To exploit the omni-supervised setting, we propose data distillation, a method that ensembles predictions from multiple transformations of unlabeled data, using a single model, to automatically generate new training annotations. We argue that visual recognition models have recently become accurate enough that it is now possible to apply classic ideas about self-training to challenging real-world data. Our experimental results show that in the cases of human keypoint detection and general object detection, state-of-the-art models trained with data distillation surpass the performance of using labeled data from the COCO dataset alone.
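To make the ensembling step concrete, below is a minimal NumPy sketch of one data-distillation pass for box-style detections. The `model(image) -> (boxes, scores)` callable, the flip-only transform set, the greedy-NMS fusion, and the 0.9 score threshold are all illustrative assumptions, not the authors' implementation; the paper ensembles over multiple scales as well as flips and uses task-appropriate aggregation.

```python
import numpy as np

def flip_boxes(boxes, width):
    """Map (x1, y1, x2, y2) boxes predicted on a horizontally flipped
    image back to the original coordinate frame."""
    flipped = boxes.copy()
    flipped[:, 0] = width - boxes[:, 2]
    flipped[:, 2] = width - boxes[:, 0]
    return flipped

def nms(boxes, scores, iou_thresh=0.5):
    """Greedy non-maximum suppression used here as a simple way to fuse
    the per-transform predictions into a single set of boxes."""
    order = scores.argsort()[::-1]
    keep = []
    while order.size > 0:
        i = order[0]
        keep.append(i)
        if order.size == 1:
            break
        rest = order[1:]
        xx1 = np.maximum(boxes[i, 0], boxes[rest, 0])
        yy1 = np.maximum(boxes[i, 1], boxes[rest, 1])
        xx2 = np.minimum(boxes[i, 2], boxes[rest, 2])
        yy2 = np.minimum(boxes[i, 3], boxes[rest, 3])
        inter = np.maximum(0.0, xx2 - xx1) * np.maximum(0.0, yy2 - yy1)
        area_i = (boxes[i, 2] - boxes[i, 0]) * (boxes[i, 3] - boxes[i, 1])
        areas = (boxes[rest, 2] - boxes[rest, 0]) * (boxes[rest, 3] - boxes[rest, 1])
        iou = inter / (area_i + areas - inter)
        order = rest[iou < iou_thresh]
    return np.array(keep, dtype=int)

def distill_annotations(model, image, score_thresh=0.9):
    """Run a single trained model on transformed copies of one unlabeled
    image, merge predictions in the original frame, and keep only the
    confident detections as automatically generated annotations.
    `model` is a hypothetical callable returning an (N, 4) box array
    and an (N,) score array for an HWC image."""
    h, w = image.shape[:2]
    all_boxes, all_scores = [], []

    # Identity and horizontal flip; the paper also ensembles over scales.
    for flip in (False, True):
        inp = image[:, ::-1] if flip else image
        boxes, scores = model(inp)
        if flip:
            boxes = flip_boxes(boxes, w)
        all_boxes.append(boxes)
        all_scores.append(scores)

    boxes = np.concatenate(all_boxes)
    scores = np.concatenate(all_scores)

    # Fuse the multi-transform predictions, then keep only confident ones
    # to serve as pseudo-ground-truth for retraining.
    keep = nms(boxes, scores)
    boxes, scores = boxes[keep], scores[keep]
    confident = scores >= score_thresh
    return boxes[confident], scores[confident]
```

In the omni-supervised loop the abstract describes, the surviving boxes would be treated as ground truth for the unlabeled image and mixed with the original COCO labels when retraining the model.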
Keywords
omni-supervised learning, semi-supervised learning, unlabeled data, fully supervised methods, data distillation, internet-scale sources of unlabeled data, visual recognition models, human keypoint detection, general object detection, COCO dataset