Unsupervised Learning of Depth and Ego-Motion from Video

2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR)(2017)

引用 2827|浏览535
暂无评分
摘要
We present an unsupervised learning framework for the task of monocular depth and camera motion estimation from unstructured video sequences. We achieve this by simultaneously training depth and camera pose estimation networks using the task of view synthesis as the supervisory signal. The networks are thus coupled via the view synthesis objective during training, but can be applied independently at test time. Empirical evaluation on the KITTI dataset demonstrates the effectiveness of our approach: 1) monocular depth performing comparably with supervised methods that use either ground-truth pose or depth for training, and 2) pose estimation performing favorably with established SLAM systems under comparable input settings.
更多
查看译文
关键词
ego-motion,unsupervised learning framework,monocular depth,camera motion estimation,unstructured video sequences,end-to-end learning approach,supervisory signal,monocular video sequences,pose estimation,KITTI dataset,ground-truth pose
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要