An Efficient Framework for classifying Cancer diseases using Ensemble machine learning over Cancer Gene Expression and Sequence Based Protein Interactions.

2023 2nd International Conference for Innovation in Technology (INOCON)(2023)

Abstract
In recent years, cancer has accounted for a significant number of deaths worldwide. Analysis of microarray gene expression and protein interaction data facilitates early cancer identification. DNA microarray technology makes it possible to accurately predict information for thousands of genes. Protein-Protein Interactions (PPIs) are crucial protein activities involved in the cell cycle, which replicates DNA, and in cellular signaling. Determining whether a pair of proteins interacts is therefore important for diagnosing illness in molecular biology. Existing machine learning classifiers are often limited to binary (two-class) problems. They can also be prone to overfitting, as the classification framework may become too specialized to the training data and fail to generalize to varied data. To overcome these problems, this paper proposes an ensemble machine learning technique; ensembling combines the strengths of two classifiers to yield a more robust and accurate framework. An ensemble of Support Vector Machine (SVM) and Naïve Bayes (NB) provides better performance across various performance parameters. The proposed SVM-NB ensemble classifier outperforms the existing classifiers by 15-20% on performance parameters such as classification accuracy, time taken for classification, precision, recall, and F-measure. The results were obtained by comparing the proposed ensemble (SVM+NB) classifier with the most widely applied existing classifiers: Logistic Regression (LR), Support Vector Machine, and Naïve Bayes.
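
The abstract does not state how the SVM and NB outputs are combined, so the following is only a minimal sketch under one common assumption: soft voting, where the per-class probabilities of the two classifiers are averaged and the highest-scoring class is chosen. The function names (`soft_vote`, `macro_scores`), the equal weights, the class labels, and the probability values are all hypothetical illustrations and are not taken from the paper. The second function shows how the multi-class accuracy, precision, recall, and F-measure named above could be computed with macro averaging, which is likewise an assumption.

```python
from typing import Dict, List, Sequence


def soft_vote(prob_svm: Sequence[Dict[str, float]],
              prob_nb: Sequence[Dict[str, float]],
              w_svm: float = 0.5,
              w_nb: float = 0.5) -> List[str]:
    """Weighted average of two classifiers' class probabilities (soft voting).

    Each input holds one dict per sample, mapping class label -> probability.
    Equal weights are an assumption, not a value from the paper.
    """
    if len(prob_svm) != len(prob_nb):
        raise ValueError("both classifiers must score the same samples")
    predictions = []
    for p_s, p_n in zip(prob_svm, prob_nb):
        labels = sorted(set(p_s) | set(p_n))
        combined = {c: w_svm * p_s.get(c, 0.0) + w_nb * p_n.get(c, 0.0)
                    for c in labels}
        # Ties resolve to the alphabetically first label.
        predictions.append(max(labels, key=lambda c: combined[c]))
    return predictions


def macro_scores(y_true: Sequence[str], y_pred: Sequence[str]) -> Dict[str, float]:
    """Accuracy plus macro-averaged precision, recall, and F-measure."""
    labels = sorted(set(y_true) | set(y_pred))
    pairs = list(zip(y_true, y_pred))
    precisions, recalls, f_measures = [], [], []
    for c in labels:
        tp = sum(1 for t, p in pairs if t == c and p == c)
        fp = sum(1 for t, p in pairs if t != c and p == c)
        fn = sum(1 for t, p in pairs if t == c and p != c)
        prec = tp / (tp + fp) if tp + fp else 0.0
        rec = tp / (tp + fn) if tp + fn else 0.0
        f = 2 * prec * rec / (prec + rec) if prec + rec else 0.0
        precisions.append(prec)
        recalls.append(rec)
        f_measures.append(f)
    n = len(labels)
    return {
        "accuracy": sum(1 for t, p in pairs if t == p) / len(pairs),
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f_measure": sum(f_measures) / n,
    }


# Hypothetical probability outputs for four samples and three cancer classes.
demo_svm = [
    {"type_A": 0.6, "type_B": 0.3, "type_C": 0.1},
    {"type_A": 0.2, "type_B": 0.5, "type_C": 0.3},
    {"type_A": 0.4, "type_B": 0.35, "type_C": 0.25},
    {"type_A": 0.1, "type_B": 0.2, "type_C": 0.7},
]
demo_nb = [
    {"type_A": 0.5, "type_B": 0.4, "type_C": 0.1},
    {"type_A": 0.1, "type_B": 0.3, "type_C": 0.6},
    {"type_A": 0.2, "type_B": 0.6, "type_C": 0.2},
    {"type_A": 0.3, "type_B": 0.3, "type_C": 0.4},
]
demo_true = ["type_A", "type_B", "type_B", "type_C"]

demo_pred = soft_vote(demo_svm, demo_nb)
demo_scores = macro_scores(demo_true, demo_pred)
```

Because the vote is taken over per-class probabilities, the same combination step works for any number of classes, which is one way an ensemble could address the two-class limitation described above. In the third sample the two classifiers disagree and the averaged scores decide the label.
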
Keywords
Ensemble Machine Learning, Support Vector Machine, Naïve Bayes, Sequence Based Proteins, Cancer Gene Data