Combining SVM and Gradient Boosting for Enhanced Accuracy in Diabetes Diagnosis
DOI:
https://doi.org/10.69968/ijisem.2025v4i108-16Keywords:
Diabetes detection, machine learning, Support Vector Machine, Gradient Boosting, voting ensemble, early diagnosisAbstract
To improve prediction accuracy, this work introduces a voting ensemble model that combines Support Vector Machine (SVM) and Gradient Boosting to enable early identification of diabetes. For the first round of testing, we used 89.4–98.0% accuracy with more conventional machine learning models like Decision Tree, Random Forest, and K-Nearest Neighbours. The intricacies of diabetes risk assessment in its early stages, however, were beyond the capabilities of these models. With an impressive 99.0% accuracy, the suggested ensemble model proved to be a powerful tool for combining the best features of SVM and Gradient Boosting. Compared to other models in the literature, our ensemble technique is more effective in detecting diabetes at an early stage. The research emphasises the significance of using sophisticated machine learning methods to improve clinical outcomes. These approaches may help diagnose issues associated to diabetes more quickly and accurately, which might reduce the burden on patients.
References
[1] International Diabetes Federation. IDF Diabetes Atlas. 10th ed., 2021.
[2] World Health Organization. Global Report on Diabetes. 2016.
[3] Kavakiotis, Ioannis, et al. Machine Learning and Data Mining Methods in Diabetes Research. Computational and Structural Biotechnology Journal, vol. 15, 2017, pp. 104-116.https://doi.org/10.1016/j.csbj.2016.12.005
[4] Chen, Ching-Hsueh, et al. Diabetes Prediction by Logistic Regression, Support Vector Machine, and Neural Networks. Journal of Diabetes Research, vol. 2017, 2017.
[5] Friedman, Jerome H. Greedy Function Approximation: A Gradient Boosting Machine. Annals of Statistics, vol. 29, no. 5, 2001, pp. 11891232.https://doi.org/10.1214/aos/1013203451
[6] Natekin, Alexey, and Alois Knoll. Gradient Boosting Machines, a Tutorial. Frontiers in Neurorobotics, vol. 7, 2013, article 21.https://doi.org/10.3389/fnbot.2013.00021
[7] Vapnik, Vladimir N. The Nature of Statistical Learning Theory. Springer, 1995.https://doi.org/10.1007/978-1-4757-2440-0
[8] Cortes, Corinna, and Vladimir Vapnik. Support-Vector Networks. Machine Learning, vol. 20, 1995, pp. 273297.https://doi.org/10.1007/BF00994018
[9] Kuncheva, Ludmila I. Combining Pattern Classifiers: Methods and Algorithms. Wiley-Interscience, 2004.https://doi.org/10.1002/0471660264
[10] Lai, Hang, et al. "Predictive models for diabetes mellitus using machine learning techniques." BMC endocrine disorders 19.1 (2019): 1-9.https://doi.org/10.1186/s12902-019-0436-6
[11] Ali, Berina, Lejla Gurbeta, and Almir Badnjevi. "Machine learning techniques for classification of diabetes and cardiovascular diseases." 2017 6th mediterranean conference on embedded computing (MECO). IEEE, 2017.https://doi.org/10.1109/MECO.2017.7977152
[12] Kavakiotis, Ioannis, et al. "Machine learning and data mining methods in diabetes research." Computational and structural biotechnology journal 15 (2017): 104-116.https://doi.org/10.1016/j.csbj.2016.12.005
[13] Kaur, Harleen, and Vinita Kumari. "Predictive modelling and analytics for diabetes using a machine learning approach." Applied computing and informatics 18.1/2 (2022): 90-100.https://doi.org/10.1016/j.aci.2018.12.004
[14] Mujumdar, Aishwarya, and V. Vaidehi. "Diabetes prediction using machine learning algorithms." Procedia Computer Science 165 (2019): 292-299.https://doi.org/10.1016/j.procs.2020.01.047
[15] Ghosh, Pronab, et al. "A comparative study of different machine learning tools in detecting diabetes." Procedia Computer Science 192 (2021): 467-477.https://doi.org/10.1016/j.procs.2021.08.048
[16] Plis, Kevin, et al. "A machine learning approach to predicting blood glucose levels for diabetes management." Workshops at the Twenty-Eighth AAAI conference on artificial intelligence. 2014.
[17] Ma, Juncheng. "Machine learning in predicting diabetes in the early stage." 2020 2nd International Conference on Machine Learning, Big Data and Business Intelligence (MLBDBI). IEEE, 2020https://doi.org/10.1109/MLBDBI51377.2020.00037
Downloads
Published
Issue
Section
License
Copyright (c) 2025 Shivani, Jayshree Boaddh, Meena Deshbhratar

This work is licensed under a Creative Commons Attribution-ShareAlike 4.0 International License.
Re-users must give appropriate credit, provide a link to the license, and indicate if changes were made. You may do so in any reasonable manner, but not in any way that suggests the licensor endorses you or your use. This license allows for redistribution, commercial and non-commercial, as long as the original work is properly credited.