Distill Knowledge From NRSfM for Weakly Supervised 3D Pose Learning

2019 IEEE/CVF International Conference on Computer Vision (ICCV)(2019)

引用 52|浏览0
暂无评分
摘要
We propose to learn a 3D pose estimator by distilling knowledge from Non-Rigid Structure from Motion (NRSfM). Our method uses solely 2D landmark annotations. No 3D data, multi-view/temporal footage, or object specific prior is required. This alleviates the data bottleneck, which is one of the major concern for supervised methods. The challenge for using NRSfM as teacher is that they often make poor depth reconstruction when the 2D projections have strong ambiguity. Directly using those wrong depth as hard target would negatively impact the student. Instead, we propose a novel loss that ties depth prediction to the cost function used in NRSfM. This gives the student pose estimator freedom to reduce depth error by associating with image features. Validated on H3.6M dataset, our learned 3D pose estimation network achieves more accurate reconstruction compared to NRSfM methods. It also outperforms other weakly supervised methods, in spite of using significantly less supervision.
更多
查看译文
关键词
weakly supervised 3D pose learning,solely 2D landmark annotations,data bottleneck,poor depth reconstruction,ties depth prediction,depth error,H3.6M dataset,3D pose estimation network,NRSfM methods,weakly supervised methods,image features,NonRigid Structure from Motion
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要