Enhancing Cardiovascular Disease Prediction: A Domain Knowledge-Based Feature Selection and Stacked Ensemble Machine Learning Approach

Zahiriddin Rustamov, Jaloliddin Rustamov, Nazar Zaki, Sherzod Turaev, Most Sarmin Sultana, Jerry Tan, Vimala Balakrishnan

Research Square (2023)

Abstract Cardiovascular diseases (CVDs) are prevalent disorders affecting the heart or blood vessels. Early detection significantly improves survival prospects, which makes accurate prediction methods essential. Emerging technologies such as machine learning (ML) offer promising avenues for more precise prediction of CVDs. However, a critical challenge lies in developing models that not only achieve strong predictive performance but also conform to well-established domain knowledge, thereby enhancing their credibility. Single classifiers often fall short due to issues such as overfitting and bias. In response, this study proposes a domain knowledge-based feature selection method integrated with a stacking ensemble classifier. The Framingham Heart Study, UCI Heart Disease, and UAE retrospective cohort study datasets were used to train and evaluate the ML algorithms. The results indicate that the proposed domain knowledge-based feature selection performs on par with frequently adopted feature selection techniques. Moreover, the proposed stacked ensemble, in conjunction with domain knowledge-based feature selection, achieved the highest metrics on the Framingham dataset, with 89.66% accuracy and an 89.16% F1-score. Similarly, the proposed method achieved F1-scores of 85.26% and 96.23% on the UCI Heart Disease and UAE datasets, respectively. Furthermore, this study employs explainable AI techniques to illuminate the decision-making process of the predictive models. Thus, the study establishes that domain knowledge-based feature selection promotes the credibility of ML models without compromising predictive performance.
Keywords
cardiovascular disease prediction, feature selection, cardiovascular disease, ensemble, knowledge-based