Boosting Few-Shot Remote Sensing Image Scene Classification with Language-Guided Multimodal Prompt Tuning

Haixia Bi, Zhangwei Gao, Kang Liu, Qian Song,Xiaotian Wang

2023 International Conference on New Trends in Computational Intelligence (NTCI)(2023)

引用 0|浏览0
暂无评分
摘要
Remote sensing image Scene classification is an important research topic in remote sensing community and has evoked a growing concern with the recent development of deep learning techniques. However, the requirement of a large amount of annotations brings great challenges to deep learning-based scene classification approaches. Visual-linguistic pretraining models, which improve the transferability of visual models using the supervision information of text, create a new way for the task under label scarcity scenario. In this paper, we explore the novel approach of prompt engineering, aiming to achieve satisfactory performance of multi-modal pretraining models on downstream remote sensing image scene classification task with minimal amounts of training data. Experiments were conducted on multiple publicly available datasets. The results indicate that training the learnable prompts with a small number of samples can yield impressive results, surpassing the few-shot transfer learning results of the best-performing pre-trained models.
更多
查看译文
关键词
Remote sensing image scene classification,Prompt tuning,Few-shot learning,Multi-modal pretraining
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要