Multimodal Gesture Recognition Based on the ResC3D Network

2017 IEEE International Conference on Computer Vision Workshops (ICCVW 2017)

Abstract
Gesture recognition is an important problem in computer vision. Recognizing gestures from videos remains challenging because of interference from gesture-irrelevant factors. In this paper, we propose a multimodal gesture recognition method based on a ResC3D network. One key idea is to find a compact and effective representation of video sequences. To that end, video enhancement techniques such as Retinex and median filtering are applied to reduce illumination variation and noise in the input video, and a weighted frame unification strategy is used to sample key frames. On top of these representations, a ResC3D network, which combines the advantages of the residual and C3D models, is developed to extract features, together with a fusion scheme based on canonical correlation analysis for blending features. The method is evaluated on the ChaLearn LAP isolated gesture recognition challenge, where it reaches 67.71% accuracy and ranks first.
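The abstract names canonical correlation analysis (CCA) as the basis of the feature fusion step but gives no details. The following is a minimal sketch of one common way to fuse two feature sets with CCA, assuming NumPy, two already-extracted feature matrices (for example, one per modality), and fusion by concatenating the projected features. The function name `cca_fuse`, the regularization term `reg`, and the concatenation choice are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def cca_fuse(X, Y, k, reg=1e-6):
    """Hypothetical helper: project X (n, p) and Y (n, q) onto their top-k
    canonical directions and concatenate the projections.

    Returns (fused, corr): fused has shape (n, 2k), corr holds the k
    canonical correlations in descending order.
    """
    n = X.shape[0]
    # Center both views.
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)

    # Sample covariances; a small ridge term (assumption) keeps them invertible.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    # Whiten each view with its Cholesky factor: T = Lx^{-1} Sxy Ly^{-T}.
    Lx = np.linalg.cholesky(Sxx)
    Ly = np.linalg.cholesky(Syy)
    T = np.linalg.solve(Lx, Sxy)
    T = np.linalg.solve(Ly, T.T).T

    # Singular values of T are the canonical correlations.
    U, s, Vt = np.linalg.svd(T, full_matrices=False)

    # Map the singular vectors back to the original feature spaces.
    Wx = np.linalg.solve(Lx.T, U[:, :k])
    Wy = np.linalg.solve(Ly.T, Vt.T[:, :k])

    # Fuse by concatenation (summing the projections is another common choice).
    fused = np.concatenate([Xc @ Wx, Yc @ Wy], axis=1)
    return fused, s[:k]


if __name__ == "__main__":
    # Synthetic stand-ins for two modalities that share a 2-D latent signal.
    rng = np.random.default_rng(0)
    z = rng.normal(size=(200, 2))
    X = z @ rng.normal(size=(2, 5)) + 0.1 * rng.normal(size=(200, 5))
    Y = z @ rng.normal(size=(2, 4)) + 0.1 * rng.normal(size=(200, 4))
    fused, corr = cca_fuse(X, Y, k=2)
    print(fused.shape, corr)
```

In a pipeline like the one described, `X` and `Y` would be the features extracted from each modality, and the fused vectors would be passed to a classifier; those stages are outside this sketch.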
Keywords
ResC3D network, gesture-irrelevant factors, multimodal gesture recognition method, video sequences, video enhancement techniques, input video, weighted frame unification strategy, sample key frames, residual C3D model, canonical correlation analysis based fusion scheme, gesture recognition challenge, illumination variation, illumination noise, feature extraction, ChaLearn LAP isolated gesture recognition