Semi-Coupled Two-Stream Fusion ConvNets for Action Recognition at Extremely Low Resolutions

2017 IEEE Winter Conference on Applications of Computer Vision (WACV)(2017)

引用 34|浏览30
暂无评分
摘要
Deep convolutional neural networks (ConvNets) have been recently shown to attain state-of-the-art performance for action recognition on standard-resolution videos. However, less attention has been paid to recognition performance at extremely low resolutions (eLR) (e.g., 16 12 pixels). Reliable action recognition using eLR cameras would address privacy concerns in various application environments such as private homes, hospitals, nursing/rehabilitation facilities, etc. In this paper, we propose a semi-coupled, filter-sharing network that leverages high-resolution (HR) videos during training in order to assist an eLR ConvNet. We also study methods for fusing spatial and temporal ConvNets customized for eLR videos in order to take advantage of appearance and motion information. Our method outperforms state-of-the-art methods at extremely low resolutions on IXMAS (93:7%) and HMDB (29:2%) datasets.
更多
查看译文
关键词
semicoupled two-stream fusion ConvNets,action recognition,filter-sharing network,high-resolution videos,HR videos,eLR ConvNet,motion information,appearance information,extremely low resolutions,deep convolutional neural networks
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要