Global and Multi-Scale Feature Learning for Remote Sensing Scene Classification.

Ziying Xia, Guolong Gan,Siyu Liu, Wei Cao,Jian Cheng

IEEE International Geoscience and Remote Sensing Symposium (IGARSS)(2022)

引用 0|浏览0
暂无评分
摘要
Although the convolutional neural network (CNN)-based and vision transformer (ViT)-based methods have achieved effective remote sensing scene classification results in the past few years, the CNN's inductive bias and the ViT's single spatial scale limit the further improvement of accuracy. To address these problems, in this paper, we proposed a novel multiscale vision transformer (MS-ViT) for remote sensing scene image classification, consists of two different spatial scale information streams that extract spatial multi-scale information, a hybrid attention module that fuses the information and a classifier to predict the scene category. We evaluated the effectiveness of our proposed method on two different remote sensing datasets, namely NWPU-RESISC45 and AID. The experimental results also show that our method outperforms CNN-based methods and the original ViT-based method in performance.
更多
查看译文
关键词
remote sensing,classification,global,feature,learning,multi-scale
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要