TMN: Temporal-guided Multiattention Network for Action Recognition.

ICPR(2022)

引用 1|浏览14
暂无评分
摘要
2D convolutional neural network, due to its low computational complexity and fast recognition speed, has attracted more and more attention from researchers in the field of video action recognition. Temporal shift and temporal differential, have made tremendous progress, but the lack of crucial spatio-temporal attention mechanism has led to huge performance loss. To address this issue, we propose a Temporal-guided Multiattention Network (TMN), which fully excavate and fuse spatio-temporal attention information for effective video action recognition. Concretely, the multi-attention module squeezes and expands spatio-temporal features to achieve weighting of corresponding regions for video in spatio-temporal dimensions, while the adaptive temporal guidance module imports temporal guiding signal to the spatial attention and re-weight the global temporal attention to accomplish the accurate temporal modeling. Extensive experiments and analyses show that our proposed temporal-guided multiattention network can achieve state-of-the-art promising video action recognition performance on the widely used benchmarks (HMDB51, UCF101 and Something-Something V1).
更多
查看译文
关键词
2D convolutional neural network,accurate temporal modeling,adaptive temporal guidance module imports,crucial spatiotemporal attention mechanism,effective video action recognition,expands spatio-temporal features,global temporal attention,recognition speed,spatio-temporal attention information,spatio-temporal dimensions,state-of-the-art promising video action recognition performance,Temporal shift,temporal-guided multiattention network
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要