An ensemble method of the machine learning to prognosticate the gastric cancer

ANNALS OF OPERATIONS RESEARCH (2022)

Abstract
Gastric Cancer (GC) is the most common malignancy of the digestive tract and the third leading cause of cancer-related mortality worldwide. Early prognosis methods, especially Machine Learning (ML)-based approaches, are one of the main strategies against GC and have become necessary for identifying and prognosticating the factors that affect it. They enable specialists to accelerate the subsequent clinical management of patients who suffer from GC. This paper creates an Ensemble Method (EM), inspired by ML, to predict the most significant factors of GC occurrence. The main objective of this research is to predict the probabilities of GC occurrence and its associated deaths. To achieve this goal, the created EM draws on several ML-based methods, including Least Absolute Shrinkage and Selection Operator (LASSO)/Ridge Regression, Elastic Net, Logistic Regression (LR), Random Forest (RF), Gradient Boosting Decision Trees (GBDTs), and Deep Neural Network (DNN). The purpose of the EM is to reduce prediction errors when the number of patient features is large. The main novelties of this research are: (i) a sequential EM created by a Stacking method to predict the probability of GC and associated deaths; (ii) use of the significance level to make an accurate prediction; (iii) two Chi-square tests to select the influential features; (iv) tuning of the parameters of the applied ML models to avoid over-fitting and error amplification; (v) different kinds of regression methods to treat high-dimensional cases; (vi) a new model for weighting the applied ML models. The outcomes of implementing the created EM in seven leading hospitals in the field of GC show that it generates more precise predictions, with an accuracy of 97.9% and 76.3% in predicting GC and its associated deaths, respectively. Moreover, the results obtained from the Area Under Curve (AUC) confirm the capability of the created EM to predict the probability of GC and its related deaths, with values of 98% and 90%, respectively.
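As a rough illustration of two ideas named in the abstract, weighting several base models and validating with AUC, the following minimal sketch may help. It is not the paper's method: the abstract does not specify its weighting model, so this sketch substitutes a simple assumption (weights proportional to each model's validation AUC) and combines predictions by weighted averaging rather than a trained Stacking meta-learner. The function names `auc`, `auc_weights`, and `combine`, and all numbers in the toy data, are invented for illustration.

```python
def auc(labels, scores):
    """AUC as the fraction of (positive, negative) pairs ranked correctly.

    Ties between a positive and a negative score count as half a win.
    """
    pos = [s for y, s in zip(labels, scores) if y == 1]
    neg = [s for y, s in zip(labels, scores) if y == 0]
    if not pos or not neg:
        raise ValueError("AUC needs at least one positive and one negative label")
    wins = sum(1.0 if p > n else 0.5 if p == n else 0.0 for p in pos for n in neg)
    return wins / (len(pos) * len(neg))


def auc_weights(labels, model_scores):
    """Assumed weighting rule: validation AUC of each model, normalised to sum to 1."""
    aucs = {name: auc(labels, s) for name, s in model_scores.items()}
    total = sum(aucs.values())
    return {name: a / total for name, a in aucs.items()}


def combine(model_scores, weights):
    """Weighted average of the base models' predicted probabilities."""
    n = len(next(iter(model_scores.values())))
    return [sum(weights[m] * model_scores[m][i] for m in model_scores) for i in range(n)]


# Toy validation set: 1 = GC, 0 = no GC. All values are made up.
labels = [1, 0, 1, 0, 1, 0]
model_scores = {
    "logistic_regression": [0.9, 0.2, 0.6, 0.4, 0.7, 0.1],
    "random_forest": [0.8, 0.3, 0.4, 0.5, 0.9, 0.2],
}

weights = auc_weights(labels, model_scores)
ensemble_scores = combine(model_scores, weights)
print(weights)
print(auc(labels, ensemble_scores))
```

In this toy data the averaged scores rank every positive case above every negative one, even though one base model misranks a pair on its own. In practice the weights would be estimated on data held out from the final evaluation to avoid an optimistic AUC.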
Keywords
Gastric cancer, Ensemble learning, Machine learning, Classification, Mutual information, Stacking