A Prediction Algorithim for Neoplasia in First Time Screening Colonoscopy

Erica Duh, Robert Nisbet,Amelie Tiritilli, David L. Cheung, Christopher Rombaoa, Mary Kate Roccato,William Karnes

The American Journal of Gastroenterology(2023)

引用 0|浏览0
暂无评分
摘要
Introduction: The timing of the first colonoscopy for average-risk individuals based on age has resulted in a 30% reduction in colorectal cancer (CRC) mortality among those above the screening age but an alarming rise in CRC among those under the screening age. By ignoring risk factors other than age, the timing of cost-effective preventative screening colonoscopy will remain flawed. We hypothesized that a personalized risk model for precancerous colorectal lesions could be developed with potential to optimize timing of first colonoscopy using decision tree-based machine learning. Methods: Outcome data from first colonoscopies in the University of California, Irvine Colonoscopy Quality Database (UCICQD) between 2012 and 2023 were combined with pre-colonoscopy data from the EHR. Colonoscopy outcome was defined by the presence or absence of one or more neoplastic (Type 1) lesions (adenoma, sessile serrated polyp, hyperplastic polyps >1cm, or carcinoma). Patients were excluded if EHR or UCICQD data indicated inflammatory bowel disease, positive FIT or DNA fecal test, family history of colorectal cancer, genetic syndrome, or prior history of colonoscopy, bowel surgery, or colonic neoplasm. Following these exclusions and including only first-time colonoscopies and EHR data that predated the colonoscopy, 3,994 patients/colonoscopies were available for analysis. A 10-fold cross-validation process was applied to three models (Tree Ensemble, Gradient Boosted Tree, and XGBoosted Tree) using the open-source Konstanz Information Miner (KNIME). Results: An ensemble of the 3 models utilizing just 137 modeling variables predicted the presence of Type 1 colonic lesions with an overall accuracy of 67% (sensitivity = 79%; specificity = 51%) and an AUC = 0.7. The model was able to increase the average number of projected type 1 lesions per colonoscopy from 1.39 to 1.787 when including positive predictors, a 27.7% increase (Table 1, Figure 1). Conclusion: The resulting model performs well enough in this population to begin prospective validation as a screening tool to determine the optimal starting time for average-risk screening colonoscopy by incorporating variables in addition to age.Figure 1.: Flow Chart Containing Variables That Fed Into Our Final Ensemble of Algorithims to Create Our Polyp Prediction Model. Table 1. - A Binary Target Variable Was Defined as 1 if the Pathology From That Initial Colonoscopy Yielded Any of the Lesions Listed on the Left of the Chart Binary Target Variable Target = 1 Target = 0 Adenomas (Tubular/ Tubulovillous/ Villous) Other Sessile serrated polyps Cancer Traditional Serrated Adenomas
更多
查看译文
关键词
screening,neoplasia,prediction
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要