Feature clustering and feature discretization assisting gene selection for molecular classification using fuzzy c-means and expectation–maximization algorithm

The Journal of Supercomputing(2020)

引用 8|浏览23
暂无评分
摘要
In this paper, a novel gene selection benefiting from feature clustering and feature discretization is developed. In large numbers of genes, unsupervised fuzzy clustering algorithm facilitates the analysis of both similarities and dissimilarities. The supervised process, adopting information gain and statistical Chi-square test, is applied to approve the relevant gene clusters. Then, expectation–maximization algorithm discretizes the candidate genes and helps to recognize distinguishability. In our previously proposed selection criterion, we finalized gene selection and generated the gene subsets for molecular classification. For high-dimensional datasets congested with erroneous or ambiguous information, the current scheme is particularly suitable in its own right. The efficiency and effectiveness are verified by our experimental results.
更多
查看译文
关键词
Feature clustering, Feature discretization, Gene selection, Fuzzy cluster analysis, Expectation–maximization algorithm
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要