CFAM: Estimating 3D Hand Poses from a Single RGB Image with Attention
APPLIED SCIENCES-BASEL(2020)
摘要
Precise 3D hand pose estimation can be used to improve the performance of human-computer interaction (HCI). Specifically, computer-vision-based hand pose estimation can make this process more natural. Most traditional computer-vision-based hand pose estimation methods use depth images as the input, which requires complicated and expensive acquisition equipment. Estimation through a single RGB image is more convenient and less expensive. Previous methods based on RGB images utilize only 2D keypoint score maps to recover 3D hand poses but ignore the hand texture features and the underlying spatial information in the RGB image, which leads to a relatively low accuracy. To address this issue, we propose a channel fusion attention mechanism that combines 2D keypoint features and RGB image features at the channel level. In particular, the proposed method replans weights by using cascading RGB images and 2D keypoint features, which enables rational planning and the utilization of various features. Moreover, our method improves the fusion performance of different types of feature maps. Multiple contrast experiments on public datasets demonstrate that the accuracy of our proposed method is comparable to the state-of-the-art accuracy.
更多查看译文
关键词
hand pose estimation,CFAM,3D keypoint,RGB image,attention
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络