Machine Learning for Prediction of Immunotherapy Efficacy in Non-Small Cell Lung Cancer from Simple Clinical and Biological Data

CANCERS(2021)

引用 14|浏览15
暂无评分
摘要
Simple Summary We studied determinants of response to immune-checkpoint inhibition in advanced non-small cell lung cancer patients. Specifically, we evaluated the association with response of multiple simple pre-treatment blood markers available from routine examination. We first used classical statistical tools and then developed a machine learning algorithm for individual predictions. We obtained a 69% accuracy. Hemoglobin levels and performance status were the strongest predictors. Neutrophil-to-lymphocyte ratio was also associated with outcome. A benchmark of 8 machine learning models also evidenced that the best model performed almost equally well than a logistic regression (basic statistical learning model). Background: Immune checkpoint inhibitors (ICIs) are now a therapeutic standard in advanced non-small cell lung cancer (NSCLC), but strong predictive markers for ICIs efficacy are still lacking. We evaluated machine learning models built on simple clinical and biological data to individually predict response to ICIs. Methods: Patients with metastatic NSCLC who received ICI in second line or later were included. We collected clinical and hematological data and studied the association of this data with disease control rate (DCR), progression free survival (PFS) and overall survival (OS). Multiple machine learning (ML) algorithms were assessed for their ability to predict response. Results: Overall, 298 patients were enrolled. The overall response rate and DCR were 15.3% and 53%, respectively. Median PFS and OS were 3.3 and 11.4 months, respectively. In multivariable analysis, DCR was significantly associated with performance status (PS) and hemoglobin level (OR 0.58, p < 0.0001; OR 1.8, p < 0.001). These variables were also associated with PFS and OS and ranked top in random forest-based feature importance. Neutrophil-to-lymphocyte ratio was also associated with DCR, PFS and OS. The best ML algorithm was a random forest. It could predict DCR with satisfactory efficacy based on these three variables. Ten-fold cross-validated performances were: accuracy 0.68 +/- 0.04, sensitivity 0.58 +/- 0.08; specificity 0.78 +/- 0.06; positive predictive value 0.70 +/- 0.08; negative predictive value 0.68 +/- 0.06; AUC 0.74 +/- 0.03. Conclusion: Combination of simple clinical and biological data could accurately predict disease control rate at the individual level.
更多
查看译文
关键词
blood counts, lung cancer, response, survival, prediction, machine learning
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要