Nonlocal Spatial Attention Module For Image Classification
INTERNATIONAL JOURNAL OF ADVANCED ROBOTIC SYSTEMS (2020)
Abstract
To enhance the capability of neural networks, research on attention mechanisms has intensified. In this area, attention modules perform forward inference along the channel and spatial dimensions sequentially, in parallel, or simultaneously. However, we have found that spatial attention modules mainly apply convolution layers to generate attention maps, which aggregate feature responses based only on local receptive fields. In this article, we take advantage of this finding to create a nonlocal spatial attention module (NL-SAM), which collects context information from all pixels to adaptively recalibrate spatial responses in a convolutional feature map. NL-SAM overcomes the limitations of repeated local operations and outputs a 2D spatial attention map that emphasizes or suppresses responses at different locations. Experiments on three benchmark datasets show improvements of at least 0.58% on ResNet variants. Furthermore, the module is simple and can be easily integrated with existing channel attention modules, such as squeeze-and-excitation and gather-excite, to surpass these prominent models at a minimal additional computational cost (0.196%).
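The abstract does not specify NL-SAM's internal layers, so the following is only a minimal sketch of the general idea it describes: each spatial position gathers context from all pixels, and the result is squashed into a 2D map that rescales every channel. The function name `nonlocal_spatial_attention`, the dot-product affinity, the channel-mean reduction, and the sigmoid gate are all assumptions made for illustration, not the authors' design.

```python
import math


def _sigmoid(v):
    # Numerically stable logistic function.
    if v >= 0:
        return 1.0 / (1.0 + math.exp(-v))
    e = math.exp(v)
    return e / (1.0 + e)


def nonlocal_spatial_attention(x):
    """Hypothetical sketch of a nonlocal spatial attention step.

    x is a feature map given as nested lists with shape [C][H][W].
    Returns (attention_map with shape [H][W], recalibrated map [C][H][W]).
    """
    C, H, W = len(x), len(x[0]), len(x[0][0])
    n = H * W
    # One C-dimensional feature vector per spatial position.
    feats = [[x[c][i // W][i % W] for c in range(C)] for i in range(n)]
    scale = math.sqrt(C)

    attn = []
    for i in range(n):
        # Affinity between position i and every position (assumed: scaled dot product).
        logits = [sum(a * b for a, b in zip(feats[i], feats[j])) / scale
                  for j in range(n)]
        m = max(logits)
        weights = [math.exp(l - m) for l in logits]  # softmax numerators
        z = sum(weights)
        # Context vector: affinity-weighted average over ALL positions.
        context = [sum(w * feats[j][c] for j, w in enumerate(weights)) / z
                   for c in range(C)]
        # Assumed reduction: channel mean, then a sigmoid gate in (0, 1).
        attn.append(_sigmoid(sum(context) / C))

    attn_map = [attn[r * W:(r + 1) * W] for r in range(H)]
    # The single 2D map rescales every channel at each location.
    out = [[[x[c][r][k] * attn_map[r][k] for k in range(W)]
            for r in range(H)] for c in range(C)]
    return attn_map, out


if __name__ == "__main__":
    fmap = [[[0.5, -1.0, 2.0], [0.0, 1.5, -0.5]],
            [[1.0, 0.2, -0.3], [0.7, -1.2, 0.4]]]
    amap, recalibrated = nonlocal_spatial_attention(fmap)
    for row in amap:
        print([round(v, 3) for v in row])
```

Unlike a convolution, whose output at a location depends only on a small neighborhood, every entry of the map here depends on all H×W positions. The pairwise affinity costs O((HW)²) in this naive form, which is why a practical module would need a cheaper formulation to keep the overhead small.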
Keywords
Convolutional neural network, nonlocal, attention module, image classification, computer vision, object recognition and classification, vision systems