The Cityscapes Dataset

user-5bd69975530c70d56f390249(2015)

引用 256|浏览25
暂无评分
摘要
Semantic understanding of urban street scenes through visual perception has been widely studied due to many possible practical applications. Key challenges arise from the high visual complexity of such scenes. In this paper, we present ongoing work on a new large-scale dataset for (1) assessing the performance of vision algorithms for different tasks of semantic urban scene understanding, including scene labeling, instance-level scene labeling, and object detection; (2) supporting research that aims to exploit large volumes of (weakly) annotated data, e.g. for training deep neural networks. We aim to provide a large and diverse set of stereo video sequences recorded in street scenes from 50 different cities, with high quality pixel-level annotations of 5000 frames in addition to a larger set of weakly annotated frames. The dataset is thus an order of magnitude larger than similar previous attempts. Several aspects are still up for discussion, and timely feedback from the community would be greatly appreciated. Details on annotated classes and examples will be available at www. cityscapes-dataset.net. Moreover, we will use this website to collect remarks and suggestions.
更多
查看译文
关键词
Object detection,Visual perception,Machine learning,Exploit,Artificial intelligence,Computer science,Deep neural networks,Scene labeling,Video sequence,Vision algorithms,Visual complexity
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要