Machine Learning Based Diabetes Detection Model for False Negative Reduction

Biomedical Materials & Devices(2024)

引用 2|浏览1
暂无评分
摘要
Diabetes is a chronic disease characterized by the inability of the pancreas to produce enough insulin or the body’s inability to use insulin efficiently. This disease is becoming increasingly prevalent worldwide and can result in severe complications such as blindness, kidney failure, and stroke. Early detection of diabetes can potentially save millions of lives globally, making it a crucial focus of research. In this study, we propose a machine learning model to aid in predicting diabetes. The model comprises several machine learning methods: Linear Regression (LnR), Logistic Regression (LR), k-nearest neighbor (KNN), Naive Bayes (NB), Random Forest (RF), Support Vector Machine (SVM), and Decision Tree (DT). Prior to feeding the pre-processed data into the machine learning model for evaluation, we conducted several pre-processing steps, such as removing null values, standardizing data using normalization, and labeling data using the label encoding process. Imbalanced datasets can adversely affect the accuracy of machine learning algorithms, and we address this problem by balancing the datasets using the Synthetic Minority Oversampling Technique (SMOTE) method. We assessed the model’s performance on two datasets and found that the random forest algorithm produced optimal results, with 97% accuracy on the diabetes dataset 2019 and 80% accuracy on the Pima Indian dataset. However, using a balanced dataset, we can significantly reduce the number of false-negative detections.
更多
查看译文
关键词
Machine learning,Pre-processed data,SMOTE,Random forest,Balance data,Features selection
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络
Chat Paper
正在生成论文摘要