FEATURE SELECTION FOR TWO-YEAR PROGNOSIS IN ADVANCED STAGE HIGH GRADE SEROUS OVARIAN CANCER USING MACHINE LEARNING METHODS

International Journal of Gynecological Cancer (2021)

Abstract
Introduction/Background* The prognosis of patients with advanced-stage high-grade serous ovarian cancer (HGSOC) is multifactorial and could be predicted accurately using machine learning (ML) algorithms. We designed a study to support feature selection among clinical variables and to define their relative impact on two-year prognosis prediction in HGSOC patients who received surgical treatment.
Methodology This was a retrospective analysis of 209 women with FIGO stage III-IV HGSOC who were scheduled for cytoreductive surgery with curative or life-prolonging intent at SJUH, Leeds, between January 2015 and December 2018. Two-year prognosis estimation was formulated as a binary classification problem. The dataset was split into training (80%) and test (20%) cohorts by repeated random sampling until there was no significant difference (p=0.20) between the two cohorts. Ten-fold cross-validation was applied. Several state-of-the-art supervised ML classifiers were tested, including Support Vector Machines (SVMs), K-Nearest Neighbors (KNN), Ensemble Classifiers, and Naive Bayes, and evaluated on a set of performance metrics. The results were compared directly with conventional Logistic Regression (LR). For feature selection, multivariate feature ranking was carried out using the minimum redundancy maximum relevance (MRMR) method.
Result(s)* Two hundred nine patients were identified. The model’s mean prediction accuracy reached 73%. SVM and Ensemble Discriminant algorithms outperformed Logistic Regression on accuracy indices. The probability of achieving a cancer-free state was maximized with a combination of primary cytoreduction, good performance status, and maximal surgical effort (AUC 0.63). Standard chemotherapy, performance status, tumor load, and residual disease were consistently predictive of two-year overall survival (AUC 0.63-0.66) (figure 1). Model recall and precision were greater than 80%.
Conclusion* Appropriate feature selection is required when building an HGSOC model for two-year prognosis prediction. For HGSOC prognosis, one should consider not only the patient’s disease burden but also their overall medical status and ability to undergo extensive surgery, which results in survival benefits alongside standard chemotherapy.
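To make the MRMR feature ranking mentioned in the methodology more concrete, the following is a minimal sketch of a greedy minimum-redundancy maximum-relevance ranking for discrete variables. It is not the authors' implementation: the abstract does not state which software or which variant of the criterion was used, so the difference form (relevance minus mean redundancy) is an assumption, as are the function names `mutual_information` and `mrmr_rank`. The toy data are invented for illustration only and are not patient data; the feature names merely echo variables named in the abstract.

```python
import math
from collections import Counter


def mutual_information(xs, ys):
    """Mutual information (in nats) between two equal-length discrete sequences."""
    n = len(xs)
    count_x = Counter(xs)
    count_y = Counter(ys)
    count_xy = Counter(zip(xs, ys))
    mi = 0.0
    for (x, y), c in count_xy.items():
        # p(x, y) * log( p(x, y) / (p(x) * p(y)) ), written with raw counts
        mi += (c / n) * math.log(c * n / (count_x[x] * count_y[y]))
    return mi


def mrmr_rank(features, target):
    """Greedy mRMR ranking (assumed difference form).

    At each step, pick the feature that maximizes
    relevance(feature, target) - mean redundancy(feature, already selected).
    """
    relevance = {name: mutual_information(values, target)
                 for name, values in features.items()}
    remaining = list(features)  # keeps insertion order, so ties break predictably
    selected = []

    def score(name):
        if not selected:
            return relevance[name]
        redundancy = sum(mutual_information(features[name], features[s])
                         for s in selected) / len(selected)
        return relevance[name] - redundancy

    while remaining:
        best = max(remaining, key=score)
        selected.append(best)
        remaining.remove(best)
    return selected


# Hypothetical toy data (NOT from the study): 8 patients, binary-coded variables.
toy_target = [1, 1, 1, 1, 0, 0, 0, 0]  # e.g. alive at two years
toy_features = {
    "residual_disease":      [1, 1, 1, 1, 0, 0, 0, 1],
    "residual_disease_copy": [1, 1, 1, 1, 0, 0, 0, 1],  # fully redundant duplicate
    "performance_status":    [1, 1, 1, 0, 0, 0, 1, 0],
    "noise":                 [1, 0, 1, 0, 1, 0, 1, 0],  # unrelated to the target
}

ranking = mrmr_rank(toy_features, toy_target)
print(ranking)
# ['residual_disease', 'performance_status', 'residual_disease_copy', 'noise']
```

In this toy example the duplicated variable is as relevant to the outcome as the original, yet it is ranked below the less relevant but less redundant `performance_status`. That is the behavior that distinguishes a multivariate ranking such as MRMR from ranking each variable on its own.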