Oversampling in Machine Learning for Patient Risk Stratification for Acute Lower Gastrointestinal Bleeding

Jonathan Ho, George M. Hanna,Amber Charoen,Fadlallah Habr

American Journal of Gastroenterology（2022）

引用 0|浏览2

暂无评分

摘要

Introduction: Lower gastrointestinal bleeding (LGIB) is a common cause of hospital admissions and can lead to hospital-based interventions that consume a significant amount of medical resources. However, only a minority of cases are high-risk and result in significant morbidity and mortality. We present an oversampling method to help with rebalancing for machine learning modeling for triaging in LGIB when there is significant imbalance between high risk (HR) and low risk (LR) patients. Methods: From retrospective data, hemodynamically stable patients with suspected LGIB were labeled into HR or LR groups (Figure). Risk factors associated with LGIB (e.g. age, sex, blood pressure, hemoglobin) were included as predictors. The dataset was divided into 80% for training and 20% for testing. Two machine learning models (stepwise logistic regression and decision trees) were applied to the training data to create predictive models. Then, the training and testing performances were evaluated using standard performance metrics (e.g. sensitivity, specificity, and F1). Results: 1414 records were reviewed. On average, patients were 61 years old and 48.8% were male. The average systolic blood pressure was 138 mmHg and diastolic was 78.0 mmHg with an average pulse of 82.0. The average laboratory values were 13.2 g/dL for hemoglobin (Hb), 16.0 mg/dL for BUN, 0.83 mg/dL for creatinine, 1.1 for INR, and 227.0 × 10e9/L for platelets. Among these patients, 14% were on anticoagulants, 4.3% were on antiplatelet agents, and 14.9% took NSAIDs. There were 69 HR patients and 1345 LR patients. Among the included factors, age, blood pressure, pulse, BUN, Hb, INR, quartile of transfusions, and being on antiplatelet agents were statistically different between the 2 risk groups. During training the decision tree model showed excellent specificity (0.985) and negative predictive value (NPV) (0.9088) among 586 cases. In the testing phase, specificity was 0.982 and NPV was 0.960 among 12 cases. Sensitivity dropped from 0.908 in the training phase to 0.083 in the testing phase; similarly, the F1 dropped from 0.945 to 0.111. The logistic regression model had a sensitivity of 0.691 that dropped to 0.583 and specificity from 0.723 to 0.752; the F1 dropped from 0.709 to 0.163. Conclusion: Logistic regression did not perform as well as decision trees in training; however, it can generalize better to unseen data. A larger dataset with more HR cases would potentially reduce the overfitting issue and provide a more accurate predictive model.Figure 1.: Algorithm for classifying patients with low-risk or high-risk LGIB BRBPR = Bright red blood per rectum, ER = Emergency room, LGIB = Lower gastrointestinal bleeding.

查看译文

关键词

patient risk stratification,risk stratification,machine learning,acute

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要