Enhancing Preventive Healthcare: Developing a Robust ML-Based Model for Diabetes Prediction

Authors

  • Kavya Markapuram, Narasaraopeta Engineering College
  • Veema Rao
  • Bobbepalli Meera Mohiddin Shaik
  • Chakrapani Sai Manikanta Badigunchala
  • Rajasekhar Boddu
  • Krishna Jyothi Nannapaneni

DOI:

https://doi.org/10.51485/ajss.v10i4.290

Keywords:

Diabetes prediction, Machine Learning, Random Forest, Cross-Validation, Pima Indians Dataset, Healthcare Analytics, Preprocessing, Predictive Modeling

Abstract

Diabetes mellitus is a significant global health challenge, and early detection is crucial for mitigating severe complications. This study presents a comparative analysis of machine learning models for diabetes prediction using the Pima Indians Diabetes Dataset. We applied a preprocessing protocol to address the dataset’s inherent challenges, including missing data encoded as zero values in key clinical features. Four algorithms, namely Support Vector Machine (SVM), Random Forest, Decision Tree, and Naïve Bayes, were optimized and evaluated using stratified 10-fold cross-validation. Because every fold preserves the class balance and each sample is held out exactly once, this procedure gives a more stable estimate of generalization performance than a single train/test split. The Random Forest classifier outperformed the other three models, achieving a mean cross-validation accuracy of 84.2%, a precision of 0.80, a recall of 0.82, an F1-score of 0.81, and an AUC of 0.90. The study demonstrates the efficacy of ensemble methods in medical diagnostics and provides a transparent, reproducible benchmark for future research. These results underscore the potential of ML-based tools to augment traditional diagnostic methods and to support accessible prescreening in diverse clinical environments.
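
The two procedural steps named in the abstract, treating zeros as missing values and splitting the data into stratified folds, can be illustrated with a minimal standard-library Python sketch. It is not the authors' implementation: the abstract does not say which features were cleaned or how missing values were filled, so the column names, the median imputation strategy, and the helper names `impute_zeros_with_median` and `stratified_k_folds` are all assumptions made for illustration.

```python
import random
from statistics import median


def impute_zeros_with_median(rows, columns):
    """Return a copy of `rows` in which zeros in `columns` are replaced by
    the median of that column's non-zero values.

    Assumption: median imputation is one common choice; the abstract does
    not state which strategy the study used.
    """
    cleaned = [dict(row) for row in rows]  # leave the caller's data untouched
    for col in columns:
        observed = [row[col] for row in rows if row[col] != 0]
        if not observed:
            continue  # no non-zero values to impute from
        fill = median(observed)
        for row in cleaned:
            if row[col] == 0:
                row[col] = fill
    return cleaned


def stratified_k_folds(labels, k=10, seed=0):
    """Yield (train_indices, test_indices) pairs for k folds, keeping the
    class proportions of `labels` roughly equal in every fold."""
    rng = random.Random(seed)
    by_class = {}
    for idx, label in enumerate(labels):
        by_class.setdefault(label, []).append(idx)

    folds = [[] for _ in range(k)]
    offset = 0
    for indices in by_class.values():
        rng.shuffle(indices)
        # Deal each class round-robin, continuing where the previous class
        # stopped so that fold sizes stay balanced.
        for pos, idx in enumerate(indices):
            folds[(pos + offset) % k].append(idx)
        offset += len(indices)

    for i in range(k):
        test_idx = sorted(folds[i])
        train_idx = sorted(
            idx for j, fold in enumerate(folds) if j != i for idx in fold
        )
        yield train_idx, test_idx


if __name__ == "__main__":
    # Hypothetical toy records; the feature names are illustrative only.
    records = [
        {"Glucose": 0, "BMI": 30.0},
        {"Glucose": 100, "BMI": 0},
        {"Glucose": 120, "BMI": 20.0},
    ]
    print(impute_zeros_with_median(records, ["Glucose", "BMI"]))

    toy_labels = [0] * 70 + [1] * 30
    for train_idx, test_idx in stratified_k_folds(toy_labels, k=10):
        positives = sum(toy_labels[i] for i in test_idx)
        print(len(train_idx), len(test_idx), positives)
```

In a full evaluation, the fill value would normally be computed on each training fold alone and then applied to the matching test fold, so that no information from held-out samples leaks into preprocessing.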

Published

2025-12-31

How to Cite

[1]
Markapuram, K., Alluri, V.R., Shaik, B.M.M., Badigunchala, C.S.M., Boddu, R. and Nannapaneni, K.J. 2025. Enhancing Preventive Healthcare: Developing a Robust ML-Based Model for Diabetes Prediction. Algerian Journal of Signals and Systems. 10, 4 (Dec. 2025), 220-225. DOI: https://doi.org/10.51485/ajss.v10i4.290.

Section

Articles