Skip to main content
Log in

Analytical model to predict diabetic patients using an optimized hybrid classifier

  • Foundation, algebraic, and analytical methods in soft computing
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

Diabetes is the most common disease and is a major cause for blindness, kidney failure, heart attacks, stroke and lower limb amputation. Thus, early prediction of diabetes is very crucial to initiating proper treatment to avoid further serious complications of the disease. The performance of recent diabetes detection schemes based on clinical data is highly influenced by low feature distinctiveness and unwanted features such as dermatologic manifestations. Different machine learning classifiers need tedious hyper-parameter tuning, which fails to assure a better diabetes detection rate. This article presents an analytical model to detect diabetes based on an optimized Support Vector Machine (SVM), K Nearest Neighbor (KNN) and Random Forest (RF) using decision level fusion to improve the diabetes detection rate. The hyper-parameters SVM, KNN, and RF are optimized using a multi-objective function-based Particle Swarm Optimization (PSO) algorithm, which considers various clinical entities for the diabetes detection, such as age, body mass index (BMI), blood pressure (BP), glucose, insulin, number of pregnancies, skin thickness, and diabetes pedigree function. The extensive experiments on the Indian Pima diabetes dataset confirmed that the diabetes detection using hybrid classifiers can provide a better prediction rate (94.27%) compared with single classifiers and the previous state of arts.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

Data availability statement

All data presented in this work have been obtained from PIMA Indian diabetes dataset repository at Kaggle. The dataset used in the analysis of the work carried out is available with the authors which can be obtained upon request.

References

  • Abbas H, Alic L, Rios M, Abdul-Ghani M, Qaraqe K (2019) Predicting diabetes in healthy population through machine learning. In: 2019 IEEE 32nd international symposium on computer-based medical systems (CBMS). IEEE, Cordoba, Spain, pp 567–570

  • Alshamlan H, Taleb HB, Al Sahow A (2020) A gene prediction function for type 2 diabetes mellitus using logistic regression. In: 2020 11th International conference on information and communication systems (ICICS). IEEE, Irbid, Jordan, pp 1–4

  • Bakkouri I, Afdel K (2023) MLCA2F: multi-level context attentional feature fusion for COVID-19 lesion segmentation from CT scans. Signal Image Video Process 17(4):1181–1188. https://doi.org/10.1007/s11760-022-02325-w

    Article  Google Scholar 

  • Bakkouri I, Afdel K, Benois-Pineau J, GCFADN Initiative (2022) BG-3DM2F: bidirectional gated 3D multi-scale feature fusion for Alzheimer’s disease diagnosis. Multimed Tools Appl 81(8):10743–10776. https://doi.org/10.1007/s11042-022-12242-2

    Article  Google Scholar 

  • Besiru JM, Kehinde AM, Martin AO, Ayodeji OS (2020) Diagnosis of diabetes mellitus type II using a feed forward back propagation neural network. J Comput Sci Control Syst 13(1):39–43

    Google Scholar 

  • Bhangale KB, Mohanaprasad K (2021) A review on speech processing using machine learning paradigm. Int J Speech Technol 24(2):367–388

    Article  Google Scholar 

  • Bhangale KB, Shekokar RU (2014) Human body detection in static images using hog & piecewise linear SVM. Int J Innov Res Dev 3(6):179–184

  • Bhangale KB, Titare P, Pawar R, Bhavsar S (2018) Synthetic speech spoofing detection using MFCC and radial basis function SVM. IOSR J Eng 8(6):55–62

    Google Scholar 

  • Bhangale K, Ingle P, Kanase R, Desale D (2022) Multi-view multi-pose robust face recognition based on VGGNet. In: Chen JIZ, Tavares JMRS, Iliyasu AM, Du KL (eds) Second international conference on image processing and capsule networks. ICIPCN 2021. Lecture notes in networks and systems, vol 300. Springer, Cham. https://doi.org/10.1007/978-3-030-84760-9_36

  • Chen M, Malook T, Rehman AU, Muhammad Y, Alshehri MD, Akbar A et al (2021) Blockchain-enabled healthcare system for detection of diabetes. J Inf Secur Appl 58:102771

    Google Scholar 

  • Elbes M, Alzubi S, Kanan T, Al-Fuqaha A, Hawashin B (2019) A survey on particle swarm optimization with emphasis on engineering and network applications. Evol Intell 12(2):113–129

    Article  Google Scholar 

  • Habibi N, Harouni M (2018) Estimation of re-hospitalization risk of diabetic patients based on radial base function (RBF) neural network method combined with colonial competition optimization algorithm. Majlesi J Electr Eng 12(1):109–116

    Google Scholar 

  • Haq AU, Li JP, Khan J, Memon MH, Nazir S, Ahmad S, Ali A (2020) Intelligent machine learning approach for effective recognition of diabetes in E-healthcare using clinical data. Sensors 20(9):2649

    Article  Google Scholar 

  • Haritha R, Babu DS, Sammulal P (2018) A hybrid approach for prediction of type-1 and type-2 diabetes using firefly and cuckoo search algorithms. Int J Appl Eng Res 13(2):896–907

    Google Scholar 

  • Hassan AS, Malaserene I, Leema AA (2020) Diabetes mellitus prediction using classification techniques. Int J Innov Technol Explor Eng 9(5):2080–2084

    Article  Google Scholar 

  • Ibrahim RA, Ewees AA, Oliva D, AbdElaziz M, Lu S (2019) Improved SALP swarm algorithm based on particle swarm optimization for feature selection. J Ambient Intell Humaniz Comput 10(8):3155–3169

    Article  Google Scholar 

  • Indoria P, Rathore YK (2018) A survey: detection and prediction of diabetes using machine learning techniques. Int J Eng Res Technol 7(3):287–291

    Google Scholar 

  • Kantawong K, Tongphet S, Bhrommalee P, Rachata N, Pravesjit S (2020) The methodology for diabetes complications prediction model. In: 2020 Joint international conference on digital arts, media and technology with ECTI northern section conference on electrical, electronics, computer and telecommunications engineering (ECTI DAMT and NCON). IEEE, Pattaya, Thailand, pp 110–113

  • Kaur H, Kumari V (2020) Predictive modelling and analytics for diabetes using a machine learning approach. Appl Comput Inform 18(1/2):90–100

  • Li Z, Nie F, Chang X, Nie L, Zhang H, Yang Y (2018a) Rank-constrained spectral clustering with flexible embedding. IEEE Trans Neural Netw Learn Syst 29(12):6073–6082

    Article  MathSciNet  Google Scholar 

  • Li Z, Nie F, Chang X, Yang Y, Zhang C, Sebe N (2018b) Dynamic affinity graph construction for spectral clustering using multiple features. IEEE Trans Neural Netw Learn Syst 29(12):6323–6332

    Article  MathSciNet  Google Scholar 

  • Li K, Daniels J, Liu C, Herrero P, Georgiou P (2019) Convolutional recurrent neural networks for glucose prediction. IEEE J Biomed Health Inform 24(2):603–613

    Article  Google Scholar 

  • Machine learning: Pima Indians diabetes. https://www.andreagrandi.it/2018/04/14/machine-learning-pima-indians-diabetes/. 14 April 2018

  • Manna S, Maity S, Munshi S, Adhikari M (2018) Diabetes prediction model using cloud analytics. In: 2018 International conference on advances in computing, communications and informatics (ICACCI). IEEE, Bangalore, India, pp 30–36

  • Montañez CAC, Fergus P, Hussain A, Al-Jumeily D, Abdulaimma B, Hind J, Radi N (2017) Machine learning approaches for the prediction of obesity using publicly available genetic profiles. In: 2017 International joint conference on neural networks (IJCNN). IEEE, Anchorage, AK, USA, pp 2743–2750

  • Montaser E, Díez JL, Rossetti P, Rashid M, Cinar A, Bondia J (2019) Seasonal local models for glucose prediction in type 1 diabetes. IEEE J Biomed Health Inform 24(7):2064–2072

    Article  Google Scholar 

  • Nilamyani N, Lawi A, Thamrin SA (2018) A preliminary study on identifying probable biomarker of type 2 diabetes using recursive feature extraction. In: 2018 2nd East Indonesia conference on computer and information technology (EIConCIT). IEEE, Makassar, Indonesia, pp 267–270

  • Nilashi M, bin Ibrahim O, Ahmadi H, Shahmoradi L (2017) An analytical method for diseases prediction using machine learning techniques. Comput Chem Eng 106:212–223

    Article  Google Scholar 

  • Prajwala TR (2015) A comparative study on decision tree and random forest using R tool. Int J Adv Res Comput Commun Eng 4(1):196–199

    Google Scholar 

  • Rashid TA, Abdullah SM (2018) A hybrid of artificial bee colony, genetic algorithm, and neural network for diabetic mellitus diagnosing. ARO Sci J Koya Univ 6(1):55–64

    Google Scholar 

  • VijiyaKumar K, Lavanya B, Nirmala I, Caroline SS (2019) Random forest algorithm for the prediction of diabetes. In: 2019 IEEE international conference on system, computation, automation and networking (ICSCAN). IEEE, Pondicherry, India, pp 1–5

  • WHO (2022) https://www.who.int/news-room/fact-sheets/detail/diabetes. Published 16 September 2022. Accessed 27 Feb 2023

  • Xu Z, Wang Z (2019) A risk prediction model for type 2 diabetes based on weighted feature selection of random forest and xgboost ensemble classifier. In: 2019 Eleventh international conference on advanced computational intelligence (ICACI). IEEE, Guilin, China, pp 278–283

  • Yu E, Sun J, Li J, Chang X, Han XH, Hauptmann AG (2018) Adaptive semi-supervised feature selection for cross-modal retrieval. IEEE Trans Multimed 21(5):1276–1288

    Article  Google Scholar 

  • Yuan D, Chang X, Huang PY, Liu Q, He Z (2020) Self-supervised deep correlation tracking. IEEE Trans Image Process 30:976–985

    Article  Google Scholar 

  • Zhang Y, Lin Z, Kang Y, Ning R, Meng Y (2018) A feed-forward neural network model for the accurate prediction of diabetes mellitus. Int J Sci Technol Res 7(8):151–155

    Google Scholar 

  • Zhou R, Chang X, Shi L, Shen YD, Yang Y, Nie F (2019) Person reidentification via multi-feature fusion with adaptive graph learning. IEEE Trans Neural Netw Learn Syst 31(5):1592–1601

    Article  Google Scholar 

  • Zhu C, Idemudia CU, Feng W (2019) Improved logistic regression model for diabetes prediction by integrating PCA and K-means techniques. Inform Med Unlocked 17:100179

    Article  Google Scholar 

Download references

Acknowledgements

Prof. (Dr.) Poonkuntran Shanmugam, Professor & Dean VIT Bhopal University, Sehore, India. Supervises this research work. Author thanks her for the continuous guidance and timely help. Author also thanks Management, Director and Head of Institution and other technical staff of VIT Bhopal University, Sehore, India, for providing infrastructure and other necessary support for this research work.

Funding

The authors did not receive support from any organization for the submitted work.

Author information

Authors and Affiliations

Authors

Contributions

All the authors equally contributed in the formulation of manuscript and the work involved.

Corresponding author

Correspondence to Albert Alexander Stonier.

Ethics declarations

Conflict of interest

All the authors declare that we have no conflict of interest. On behalf of all authors, the corresponding author states that there is no conflict of interest.

Ethical and informed consent for data used

This article does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shimpi, J.K., Shanmugam, P. & Stonier, A.A. Analytical model to predict diabetic patients using an optimized hybrid classifier. Soft Comput 28, 1883–1892 (2024). https://doi.org/10.1007/s00500-023-09487-w

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-023-09487-w

Keywords

Navigation