Extreme Low-Resolution Action Recognition with Confident Spatial-Temporal Attention Transfer

International Journal of Computer Vision(2023)

引用 3|浏览25
暂无评分
摘要
ion recognition on extreme low-resolution videos, e.g., a resolution of 12 × 16 pixels, plays a vital role in far-view surveillance and privacy-preserving multimedia analysis. As low-resolution videos often only contain limited information, it is difficult for us to perform action recognition in them. Given the fact that one same action may be represented by videos in both high resolution (HR) and extreme low resolution (eLR), it is worth studying to utilize the relevant HR data to improve the eLR action recognition. In this work, we propose a novel Confident Spatial-Temporal Attention Transfer (CSTAT) for eLR action recognition. CSTAT acquires information from HR data by reducing the attention differences with a transfer-learning strategy. Besides, the confidence of the supervisory signal is also taken into consideration for a more reliable transferring process. Experimental results demonstrate that, the proposed method can effectively improve the accuracy of eLR action recognition and achieve state-of-the-art performances on 12× 16 HMDB51, 12× 16 Kinects-400, and 12× 16 Something-Something v2.
更多
查看译文
关键词
Action recognition,Attention transfer,Extreme low-resolution vision
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要