Grouped Bidirectional LSTM Network and Multi-Stage Fusion Convolutional Transformer for Hyperspectral Image Classification

IEEE Transactions on Geoscience and Remote Sensing(2022)

引用 6|浏览1
暂无评分
摘要
The efficiently and effectively discriminative spectral-spatial feature representation is essential for hyperspectral image (HSI) classification. However, most of the existing methods rely on patch-based convolutional neural networks (CNNs) whose ability to extract the global spatial information is very limited. To address this issue, in this article, we propose a two-branch network consisting of a grouped bidirectional long short-term memory (GBiLSTM) network and multistage fusion convolutional transformer (MFCT) for HSI classification. In the proposed GBiLSTM-MFCT, to extract the spectral features of HSI efficiently, a GBiLSTM network is designed by dividing the sequence features and hidden units of the BiLSTM network into several separate groups. To simultaneously extract the global and local spatial features of HSI, an MFCT is proposed by fusing the features of different levels obtained from the multiple phases of the convolutional vision transformer (CVT). Moreover, in the multiheaded attention module of each stage, the blueprint separable convolution-based self-attention (BSCA) module is designed, which is able to model the global and local spatial information effectively. The outputs of the GBiLSTM network and MFCT are fused to generate discriminative and robust spectral-spatial features for HSI classification. Experiments on three benchmark datasets of Indian Pine (IN), University of Pavia (UP), and Kennedy Space Center (KSC) demonstrate that the proposed GBiLSTM-MFCT exhibits higher classification performance with very limited labeled samples than eight state-of-the-art methods.
更多
查看译文
关键词
Bidirectional long short-term memory (LSTM),convolutional vision transformer (CVT),hyperspectral image (HSI) classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要