Data to Diagnosis: Evaluating Machine Learning Algorithms for Predictive Healthcare in Diabetes


Article Information

Title: Data to Diagnosis: Evaluating Machine Learning Algorithms for Predictive Healthcare in Diabetes

Authors: Musharaf Ali Talpur, Manal A. Asiri, Umme Laila, Samar Raza Talpur, Abdul Khaliq, Muhammad Noman Saeed

Journal: VFAST Transactions on Software Engineering

HEC Recognition History
Category  From        To
Y         2024-10-01  2025-12-31
Y         2023-07-01  2024-09-30
Y         2022-07-01  2023-06-30
Y         2021-07-01  2022-06-30

Publisher: VFAST-Research Platform

Country: Pakistan

Year: 2025

Volume: 13

Issue: 3

Language: en

DOI: 10.21015/vtse.v13i3.2141

Abstract

Diabetes mellitus, a chronic metabolic disease, poses a serious challenge to global health, and early diagnosis is vital to preventing severe complications. This study applies eight machine learning algorithms (SVM, XGBoost, Naive Bayes, Logistic Regression, Gradient Boosting, KNN, Decision Tree, and Random Forest) to a formatted dataset of clinical and demographic attributes. Preprocessing consisted of normalization and categorical encoding. No class-balancing methods (e.g., SMOTE or weighting) were applied, and no hyperparameter tuning was performed. Models were evaluated using accuracy, precision, recall, F1-score, and confusion matrices. Notably, the dataset is highly imbalanced (~10% diabetic cases), which may affect sensitivity. Ensemble models, particularly Gradient Boosting and XGBoost, achieved more than 91% accuracy. Despite these limitations, the findings suggest the promise of ML for early prediction of diabetes.
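The abstract's caveat about class imbalance can be made concrete with a small worked example. The sketch below is illustrative only: the label counts and the two sets of predictions are hypothetical and are not taken from the paper's dataset or results, and the helper names (`confusion_counts`, `evaluate`) are invented for this example. It computes the metrics named in the abstract from a binary confusion matrix and shows that, with roughly 10% positive cases, a classifier that never predicts diabetes already reaches 90% accuracy while its recall (sensitivity) is zero.

```python
from dataclasses import dataclass


@dataclass
class Metrics:
    accuracy: float
    precision: float
    recall: float
    f1: float


def confusion_counts(y_true, y_pred):
    """Return (tn, fp, fn, tp) for binary labels, where 1 = diabetic."""
    tn = fp = fn = tp = 0
    for truth, pred in zip(y_true, y_pred):
        if truth == 1 and pred == 1:
            tp += 1
        elif truth == 1 and pred == 0:
            fn += 1
        elif truth == 0 and pred == 1:
            fp += 1
        else:
            tn += 1
    return tn, fp, fn, tp


def evaluate(y_true, y_pred):
    """Compute accuracy, precision, recall, and F1 for the positive class."""
    tn, fp, fn, tp = confusion_counts(y_true, y_pred)
    total = tn + fp + fn + tp
    accuracy = (tp + tn) / total if total else 0.0
    # Guard against division by zero when a class is never predicted or absent.
    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return Metrics(accuracy, precision, recall, f1)


# Hypothetical cohort: 1000 patients, 10% diabetic (assumed, not the paper's data).
y_true = [1] * 100 + [0] * 900

# Baseline that always predicts "not diabetic".
always_negative = [0] * 1000

# Hypothetical model: detects 30 of the 100 diabetic cases, with 10 false alarms.
model_pred = [1] * 30 + [0] * 70 + [1] * 10 + [0] * 890

baseline = evaluate(y_true, always_negative)
model = evaluate(y_true, model_pred)

print("baseline:", baseline)
print("model:   ", model)
```

In this made-up scenario the second classifier scores 92% accuracy yet misses 70% of the diabetic cases, which is why recall, F1-score, and the confusion matrix are worth reading alongside accuracy when no class balancing is applied.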

