Mixing Body-Part Sequences for Human Pose Estimation

CVPR(2014)

引用 133|浏览131
暂无评分
摘要
In this paper, we present a method for estimating articulated human poses in videos. We cast this as an optimization problem defined on body parts with spatio-temporal links between them. The resulting formulation is unfortunately intractable and previous approaches only provide approximate solutions. Although such methods perform well on certain body parts, e.g., head, their performance on lower arms, i.e., elbows and wrists, remains poor. We present a new approximate scheme with two steps dedicated to pose estimation. First, our approach takes into account temporal links with subsequent frames for the less-certain parts, namely elbows and wrists. Second, our method decomposes poses into limbs, generates limb sequences across time, and recomposes poses by mixing these body part sequences. We introduce a new dataset \"Poses in the Wild\", which is more challenging than the existing ones, with sequences containing background clutter, occlusions, and severe camera motion. We experimentally compare our method with recent approaches on this new dataset as well as on two other benchmark datasets, and show significant improvement.
更多
查看译文
关键词
clutter,estimation theory,optimisation,pose estimation,spatiotemporal phenomena,video signal processing,approximate scheme,background clutter,body-part sequences,camera motion,human pose estimation,limb sequences,occlusions,poses in the wild,spatiotemporal links,Human pose estimation,Mixing body-parts,Optical flow for pose estimation,Poses in videos
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要