Permuted KPCA and SMOTE to Guide GAN-Based Oversampling for Imbalanced HSI Classification

IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING(2024)

引用 0|浏览1
暂无评分
摘要
Lack of sufficient and balanced data is one of the major challenges in hyperspectral image classification. This problem can cause poor classification performance, especially in detecting or classifying samples of minority classes. The easiest way to overcome the problem is by resampling or creating synthetic samples to balance the class distributions. As the most advanced generative method, generative adversarial networks (GANs) have been used for generating synthetic data. However, GANs need a large amount or sufficient minority class data to train. In this article, we propose to leverage the synthetic minority oversampling technique (SMOTE) in GANs for creating high quality synthetic data to tackle the imbalance problem. The main idea is to train the generator of the GAN to synthesize data from pattern vectors instead of random noise vectors so to guide the GAN to produce data that can expand the minority class data on the decision boundaries. We used kernel principal component analysis and SMOTE to create the pattern vectors and used a silhouette score to control and prevent overlapping issues. In addition, we applied a self-attention module and an automatic data filter to further minimize potentially wrongly labeled or overlapping samples before being added into the training set. Experimental results on both hyperspectral and remote sensing datasets show that the proposed technique can generate more realistic, diverse, and unambiguous synthetic data, resulting in significantly improved classification performances over the existing oversampling techniques.
更多
查看译文
关键词
Generative adversarial network (GAN),hyperspectral image (HSI),imbalance classification,kernel principal component analysis (kernel PCA),synthetic minority oversampling technique (SMOTE)
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要