The Effect of Different Classifiers on Recursive Cluster Elimination in the Analysis of Transcriptomic Data

Nurten Bulut,Burcu Bakir-Gungor, Bahjat F. Qaqish,Malik Yousef

2023 Innovations in Intelligent Systems and Applications Conference (ASYU)(2023)

引用 0|浏览3
暂无评分
摘要
Gene expression data with limited sample size and a large number of genes are frequently encountered in genetic studies. In such high-dimensional data, identification of genes that distinguish between disease states is a challenging task. Feature selection (FS) is a useful approach in dealing with high dimensionality. Support Vector Machines Recursive Cluster Elimination (SVM-RCE) is a technique for FS in high-dimensional data. The SVM-RCE approach has been utilized for identification of clusters of genes whose expression levels correlate with pathological state. A key step in SVM-RCE is the use of an SVM classifier to assign an area under the curve (AUC) score to each gene cluster based on its ability to predict class labels. In this study, we investigate the use of alternative classifiers in the cluster-scoring step. Specifically, we compare Support Vector Machines, Random Forest, XgBoost, Naive Bayes, and linear logistic regression. In addition to AUC score performance evaluation, the algorithms are compared in terms of the number of selected genes at different levels of clustering and in terms of the running time.
更多
查看译文
关键词
Recursive Cluster Elimination,Feature Selection,Clustering,Gene Expression Data Analysis
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要