Multi-Level Fusion Net for hand pose estimation in hand-object interaction

Signal Processing: Image Communication (2021)

Abstract
This work addresses the challenging problem of estimating the full 3D hand pose while the hand interacts with an unknown object. Compared with pose estimation for an isolated hand, the task is harder because the manipulated object and the cluttered background introduce occlusion and interference. Our proposed Multi-Level Fusion Net focuses on extracting more effective features to overcome these difficulties through a multi-level fusion design within a new end-to-end Convolutional Neural Network (CNN) framework. It takes cropped RGBD data from a single RGBD camera at a free viewpoint as input and requires no additional hand-object pre-segmentation and no pre-modeling of the object or hand. Through extensive evaluations on a public hand-object interaction dataset, we demonstrate the state-of-the-art performance of our method.
Keywords
Hand pose estimation, Hand-object interaction, Occlusion, RGBD, Convolutional Neural Networks