Weakly-Supervised 3d Human Pose Learning Via Multi-View Images In The Wild

2020 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)(2020)

引用 121|浏览176
暂无评分
摘要
One major challenge for monocular 3D human pose estimation in-the-wild is the acquisition of training data that contains unconstrained images annotated with accurate 3D poses. In this paper, we address this challenge by proposing a weakly-supervised approach that does not require 3D annotations and learns to estimate 3D poses from unlabeled multi-view data, which can be acquired easily in in-the-wild environments. We propose a novel end-to-end learning framework that enables weakly-supervised training using multi-view consistency. Since multi-view consistency is prone to degenerated solutions, we adopt a 2.5D pose representation and propose a novel objective function that can only be minimized when the predictions of the trained model are consistent and plausible across all camera views. We evaluate our proposed approach on two large scale datasets (Human3.6M and MPII-INF-3DHP) where it achieves state-of-the-art performance among semi-/weakly-supervised methods.
更多
查看译文
关键词
in-the-wild environments,end-to-end learning framework,multiview consistency,2.5D pose representation,camera views,MPII-INF-3DHP,multiview images,unconstrained images,accurate 3D poses,unlabeled multiview data,weakly-supervised training approach,monocular 3D human pose estimation in-the-wild,training data acquisition,objective function,trained model prediction,semisupervised methods
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要