Matterport3D: Learning from RGB-D Data in Indoor Environments

2017 International Conference on 3D Vision (3DV)(2017)

引用 1672|浏览209
暂无评分
摘要
Access to large, diverse RGB-D datasets is critical for training RGB-D scene understanding algorithms. However, existing datasets still cover only a limited number of views or a restricted scale of spaces. In this paper, we introduce Matterport3D, a large-scale RGB-D dataset containing 10,800 panoramic views from 194,400 RGB-D images of 90 building-scale scenes. Annotations are provided with surface reconstructions, camera poses, and 2D and 3D semantic segmentations. The precise global alignment and comprehensive, diverse panoramic set of views over entire buildings enable a variety of supervised and self-supervised computer vision tasks, including keypoint matching, view overlap prediction, normal prediction from color, semantic segmentation, and region classification.
更多
查看译文
关键词
Matterport3D,400 RGB-D images,90 building-scale scenes,3D semantic segmentations,supervised self-supervised computer vision tasks,view overlap prediction,RGB-D data,indoor environments,diverse RGB-D datasets,training RGB-D scene understanding algorithms,existing datasets,panoramic views,RGB-D images
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要