2D Fingertip Localization on Depth Videos Using Paired Video-to-Video Translation.

ISVC (2)(2022)

引用 0|浏览10
暂无评分
摘要
We propose a two-stage pipeline and formulate 2D hand keypoint localization as a problem of conditional video generation. The goal is to learn a mapping function from an input depth video in the source domain to an output depth video along with 5 color marks on each fingertip by enforcing temporal consistency constraints. Next, by applying color segmentation techniques in HSV domain, we extract the center of each segmented part as 2D coordinates of fingertips on the translated video. To the best of our knowledge, this is the first work on fingertip localization on depth videos through domain adaptation. Our comparative experimental results with the state-of-the-art single-frame hand pose estimation on the challenging NYU dataset demonstrates that by exploiting temporal information, our model manifests better hand appearance consistency in video-to-video synthesis stage which leads to accurate estimations of 2D hand poses under motion blur by fast hand motion.
更多
查看译文
关键词
depth videos,localization,video-to-video
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要