3d Scene Reconstruction With Multi-Layer Depth And Epipolar Transformers

2019 IEEE/CVF INTERNATIONAL CONFERENCE ON COMPUTER VISION (ICCV 2019)(2019)

引用 53|浏览208
暂无评分
摘要
We tackle the problem of automatically reconstructing a complete 3D model of a scene from a single RGB image. This challenging task requires inferring the shape of both visible and occluded surfaces. Our approach utilizes viewer-centered, multi-layer representation of scene geometry adapted from recent methods for single object shape completion. To improve the accuracy of view-centered representations for complex scenes, we introduce a novel "Epipolar Feature Transformer" that transfers convolutional network features from an input view to other virtual camera viewpoints, and thus better covers the 3D scene geometry. Unlike existing approaches that first detect and localize objects in 3D, and then infer object shape using category-specific models, our approach is fully convolutional, end-to-end differentiable, and avoids the resolution and memory limitations of voxel representations. We demonstrate the advantages of multi-layer depth representations and epipolar feature transformers on the reconstruction of a large database of indoor scenes.
更多
查看译文
关键词
epipolar feature transformers,indoor scenes,epipolar transformers,3D model,RGB image,visible surfaces,occluded surfaces,multilayer representation,3D scene geometry,view-centered representations,complex scenes,convolutional network features,virtual camera viewpoints,category-specific models,voxel representations,multilayer depth representations,object shape completion,3D scene reconstruction,multilayer depth
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要