An evolutionary computation classification method for high-dimensional mixed missing variables data

ELECTRONICS LETTERS(2023)

引用 0|浏览2
暂无评分
摘要
Data missing is a prevalent issue in various real-world systems. It may deteriorate the performance of classification algorithms running on these platforms. Numerous effective imputation methods exist to address this problem. However, traditional data imputation approaches mainly focus on low-dimensional missing data, and in addition, they do not make use of the randomness of the missing values and the information of labels simultaneously. To solve these problems, the authors propose a novel data imputation algorithm, named Particle Swarm Optimization for High-dimensional mixed Missing variables data (PSOHM). PSOHM introduces a feature filtering algorithm for feature selection on missing data, followed by a feature discrimination method to categorize chosen features. PSOHM then employs particle swarm optimization to optimize imputation functions for both continuous and discrete features. Continuous features are modelled as Gaussian distributions, with the mean and standard deviation encoded into particles. Additionally, the probabilities of values for discrete features are also encoded. Moreover, accuracy serves as the optimization objective, utilizing both the randomness of missing values and the label information to improve the algorithm's performance. Six typical algorithms are employed to make a comparison. The results demonstrate that the authors' method is superior to the compared approaches on the six different kinds of classical datasets. In order to solve the classification problem of high-dimensional mixed missing variables data, this paper proposes an imputation method based on the evolutionary algorithm, called PSOHM. In PSOHM, the authors propose feature filtering algorithm for feature selection and feature discriminate method for the feature type discrimination of continuous and discrete features, respectively. Then the authors design imputation functions based on the feature types, and use particle swarm optimization to optimize the parameters of the imputation functions. image
更多
查看译文
关键词
evolutionary computation,feature selection,pattern classification
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要