Skip to main content
Log in

Explainable and transparency machine learning approach to predict diabetes develop

  • Original Paper
  • Published:
Health and Technology Aims and scope Submit manuscript

Abstract

Purpose

This study aims to address the problem of type 1 diabetes by utilizing machine learning techniques and developing a decision support system based on Explainable Artificial Intelligence (XAI). The main research question is to predict the risk of developing type 1 diabetes in a population using different machine learning algorithms, while ensuring interpretability and transparency of the decision support system. The study builds upon a case-control study conducted by previous researchers, who approached the problem from a statistical-parametric perspective.

Method

In this work, various machine learning algorithms, including Decision Trees (DT), Deep Neural Networks (DNN), XGBoost (XGB), Logistic Regression (LR), K-Nearest Neighbors (KNN), and Support Vector Classifier (SVC), are employed. The algorithms are evaluated based on their ability to predict the disease risk accurately and consistently on both the training and validation datasets. Additionally, Explainable AI techniques such as LIME (Local interpretable model-agnostic explanations) are employed to contextualize and interpret each prediction and assess the importance of various characteristics influencing the probability of developing the disease.

Results

The results obtained from the application of machine learning algorithms show promising outcomes on both the training and validation datasets. However, the best-performing algorithms are not necessarily those with the highest accuracy, as they may suffer from overfitting. Instead, algorithms such as DNNs (97%) or KNNs (93%) exhibit similar behavior on both training and test datasets, making them more reliable, LR and SVC both around (98.3%). The adoption of Explainable AI techniques enables the measurement of each characteristic’s importance and the analysis of factors influencing the disease’s development probability. This allows the development of a clinical decision support system (CDSS) that is immediately understandable, transparent, and interpretable. By leveraging machine learning techniques and Explainable AI, this study addresses the challenge of type 1 diabetes prediction and decision support.

Conclusion

The results indicate that algorithms like DNNs and KNNs offer reliable performance in predicting the risk of developing type 1 diabetes. The integration of Explainable AI techniques, specifically LIME, enhances the interpretability of predictions and provides insights into the factors influencing the disease. The developed CDSS based on XAI can potentially assist healthcare professionals in making informed clinical decisions, thereby improving patient care and management of type 1 diabetes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

References

  1. O’Connor PJ, Sperl-Hillen JM, Rush WA, Johnson PE, Amundson GH, Asche SE, Ekstrom HL, Gilmer TP. Impact of electronic health record clinical decision support on diabetes care: a randomized trial. Ann Fam Med. 2011.

  2. Georga E, Protopappas V, Arvaniti E, Fotiadis D. The Diabino System: Temporal Pattern Mining from Diabetes Healthcare and Daily Self-monitoring Data; 2019.

  3. Rung-Ching C, Hui Qin J, Chung-Yi H, Cho-Tsan B. Clinical Decision Support System for Diabetes Based on Ontology Reasoning and TOPSIS Analysis. Artif Intell Med Appl. 2017.

  4. Amatul Z, Asmawaty AK, Aznan MAM. A comparative study on the pre-processing and mining of Pima Indian diabetes dataset. 2013.

  5. International Diabetes Federation. 2021. https://idf.org/news/diabetes-now-affects-one-in-10-adults-worldwide/.

  6. Pima Indian Diabetes Database, Schulz LO, Bennett PH, Ravussin E, Kidd JR, Kidd KK, Esparza J, Valencia ME. Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the US. Diabetes Care. 2006;29(8):1866–71. https://doi.org/10.2337/dc06-0138.

    Article  Google Scholar 

  7. Deepti S, Dilip SS. Prediction of Diabetes using Classification Algorithms. Procedia Comput Sci. 2018;132:1578–85.

    Article  Google Scholar 

  8. Han W, Shengqi Y, Zhangqin H, Jian H, Xiaoyi W. Type 2 diabetes mellitus prediction model based on data mining. Inform Med Unlocked. 2018;10:100–7.

    Article  Google Scholar 

  9. Arrieta B, Rodriguez ADN, Del Ser J, Bennetot A, Tabik SB, González A, García S, Gil-López S, Molina D, Benjamins VR, Chatila RH, Francisco. Explainable Artificial Intelligence (XAI): Concepts. Opportunities and Challenges toward Responsible AI: Taxonomies; 2019.

  10. Patil BM, Joshi RC, Durga T. Hybrid prediction model for Type-2 diabetic patients. Expert Syst Appl. 2010;37(12):8102–8.

    Article  Google Scholar 

  11. Dagliati A, Marini S, Sacchi L, et al. Machine Learning Methods to Predict Diabetes Complications. J Diabetes Sci Technol. 2018;12(2):295–302. https://doi.org/10.1177/1932296817706375.

    Article  Google Scholar 

  12. Zou Q, Qu K, Luo Y, Yin D, Ju Y, Tang H. Predicting Diabetes Mellitus With Machine Learning Techniques. Front Genet. 2018;9:515. https://www.frontiersin.org/article/10.3389/fgene.2018.00515.

  13. Butt UM, Letchmunan S, Ali M, et al. Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications. J Healthc Eng Hindawi. 2021.

  14. Asaduzzaman S, Al Masud F, Bhuiyan T, Ahmed K, Paul BK, Matiur Rahman SAM. Dataset on significant risk factors for Type 1 Diabetes: a Bangladeshi perspective. Data Brief. 2018;21:700–8 ISSN 2352-3409.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Francesco Curia.

Ethics declarations

Conflicts of interest

The author declares under his own responsibility that there are no conflicts of interest in the realization of this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Curia, F. Explainable and transparency machine learning approach to predict diabetes develop. Health Technol. 13, 769–780 (2023). https://doi.org/10.1007/s12553-023-00781-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12553-023-00781-z

Keywords

Navigation