Heuristic Feature Selection with Classification Efficiency Using Soft Cluster Analysis for Biological Datasets

J. Inf. Sci. Eng.(2023)

引用 0|浏览1
暂无评分
摘要
With a deeper investigation to deciphering the sophisticated relations among input and output variables of multi-class classification problems, the goal of this paper is to propose a new model of variable selection which maximizes the discrimination and minimizes the size of the selected feature subsets. For molecular datasets with a tremendous amount of input variables, the proposed heuristic algorithm is capable of exploring the essential factors of classification problems. Our model devotes to three accomplishments of multi class classification tasks. Feature discretization using fuzzy clustering analysis for the improvement of feature discrimination is the first. Multivariate analysis for the investigation of information relevance and redundancy is the second achievement in this study. The third is a novel heuristic feature selection algorithm with effectiveness but without overfitting problem. Experimental results convince our model acquires significant discrimination improvement for microarray classification problems.
更多
查看译文
关键词
feature discretization,fuzzy c-means,feature selection,feature evaluation,dis-crimination power
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要