A Point Set Generation Network for 3D Object Reconstruction from a Single Image

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2016)

引用 1234|浏览268
暂无评分
摘要
Generation of 3D data by deep neural network has been attracting increasing attention in the research community. The majority of extant works resort to regular representations such as volumetric grids or collection of images; however, these representations obscure the natural invariance of 3D shapes under geometric transformations and also suffer from a number of other issues. In this paper we address the problem of 3D reconstruction from a single image, generating a straight-forward form of output -- point cloud coordinates. Along with this problem arises a unique and interesting issue, that the groundtruth shape for an input image may be ambiguous. Driven by this unorthodox output form and the inherent ambiguity in groundtruth, we design architecture, loss function and learning paradigm that are novel and effective. Our final solution is a conditional shape sampler, capable of predicting multiple plausible 3D point clouds from an input image. In experiments not only can our system outperform state-of-the-art methods on single image based 3d reconstruction benchmarks; but it also shows a strong performance for 3d shape completion and promising ability in making multiple plausible predictions.
更多
查看译文
关键词
loss function,learning paradigm,single image based 3d reconstruction benchmarks,geometric transformations,research community,extant works,regular representations,volumetric grids,natural invariance,single image,groundtruth shape,input image,unorthodox output form,conditional shape sampler,multiple plausible 3D point clouds,3D shape completion,point set generation network,3D object reconstruction,deep neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要