The Mapillary Vistas Dataset For Semantic Understanding Of Street Scenes

2017 IEEE INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV)(2017)

引用 1314|浏览137
暂无评分
摘要
The Mapillary Vistas Dataset is a novel, large-scale street-level image dataset containing 25 000 high-resolution images annotated into 66 object categories with additional, instance-specific labels for 37 classes. Annotation is performed in a dense and fine-grained style by using polygons for delineating individual objects. Our dataset is 5 x larger than the total amount of fine annotations for Cityscapes and contains images from all around the world, captured at various conditions regarding weather, season and daytime. Images come from different imaging devices (mobile phones, tablets, action cameras, professional capturing rigs) and differently experienced photographers. In such a way, our dataset has been designed and compiled to cover diversity, richness of detail and geographic extent. As default benchmark tasks, we define semantic image segmentation and instance-specific image segmentation, aiming to significantly further the development of state-of-theart methods for visual road-scene understanding.
更多
查看译文
关键词
Mapillary Vistas Dataset,large-scale street-level image dataset,semantic image segmentation,instance-specific image segmentation,visual road-scene understanding,image resolution,street scene understanding,image annotation
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要