Explainable and transparency machine learning approach to predict diabetes develop

Curia, Francesco

doi:10.1007/s12553-023-00781-z

Explainable and transparency machine learning approach to predict diabetes develop

Original Paper
Published: 27 September 2023

Volume 13, pages 769–780, (2023)
Cite this article

Health and Technology Aims and scope Submit manuscript

Francesco Curia ORCID: orcid.org/0000-0003-4341-1101¹

227 Accesses
1 Citation
Explore all metrics

Abstract

Purpose

This study aims to address the problem of type 1 diabetes by utilizing machine learning techniques and developing a decision support system based on Explainable Artificial Intelligence (XAI). The main research question is to predict the risk of developing type 1 diabetes in a population using different machine learning algorithms, while ensuring interpretability and transparency of the decision support system. The study builds upon a case-control study conducted by previous researchers, who approached the problem from a statistical-parametric perspective.

Method

In this work, various machine learning algorithms, including Decision Trees (DT), Deep Neural Networks (DNN), XGBoost (XGB), Logistic Regression (LR), K-Nearest Neighbors (KNN), and Support Vector Classifier (SVC), are employed. The algorithms are evaluated based on their ability to predict the disease risk accurately and consistently on both the training and validation datasets. Additionally, Explainable AI techniques such as LIME (Local interpretable model-agnostic explanations) are employed to contextualize and interpret each prediction and assess the importance of various characteristics influencing the probability of developing the disease.

Results

The results obtained from the application of machine learning algorithms show promising outcomes on both the training and validation datasets. However, the best-performing algorithms are not necessarily those with the highest accuracy, as they may suffer from overfitting. Instead, algorithms such as DNNs (97%) or KNNs (93%) exhibit similar behavior on both training and test datasets, making them more reliable, LR and SVC both around (98.3%). The adoption of Explainable AI techniques enables the measurement of each characteristic’s importance and the analysis of factors influencing the disease’s development probability. This allows the development of a clinical decision support system (CDSS) that is immediately understandable, transparent, and interpretable. By leveraging machine learning techniques and Explainable AI, this study addresses the challenge of type 1 diabetes prediction and decision support.

Conclusion

The results indicate that algorithms like DNNs and KNNs offer reliable performance in predicting the risk of developing type 1 diabetes. The integration of Explainable AI techniques, specifically LIME, enhances the interpretability of predictions and provides insights into the factors influencing the disease. The developed CDSS based on XAI can potentially assist healthcare professionals in making informed clinical decisions, thereby improving patient care and management of type 1 diabetes.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

A Diabetes Prediction Model with Visualized Explainable Artificial Intelligence (XAI) Technology

Explainable AI for Healthcare: A Study for Interpreting Diabetes Prediction

An Explainable AI Approach for Diabetes Prediction

References

O’Connor PJ, Sperl-Hillen JM, Rush WA, Johnson PE, Amundson GH, Asche SE, Ekstrom HL, Gilmer TP. Impact of electronic health record clinical decision support on diabetes care: a randomized trial. Ann Fam Med. 2011.
Georga E, Protopappas V, Arvaniti E, Fotiadis D. The Diabino System: Temporal Pattern Mining from Diabetes Healthcare and Daily Self-monitoring Data; 2019.
Rung-Ching C, Hui Qin J, Chung-Yi H, Cho-Tsan B. Clinical Decision Support System for Diabetes Based on Ontology Reasoning and TOPSIS Analysis. Artif Intell Med Appl. 2017.
Amatul Z, Asmawaty AK, Aznan MAM. A comparative study on the pre-processing and mining of Pima Indian diabetes dataset. 2013.
International Diabetes Federation. 2021. https://idf.org/news/diabetes-now-affects-one-in-10-adults-worldwide/.
Pima Indian Diabetes Database, Schulz LO, Bennett PH, Ravussin E, Kidd JR, Kidd KK, Esparza J, Valencia ME. Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the US. Diabetes Care. 2006;29(8):1866–71. https://doi.org/10.2337/dc06-0138.
Article Google Scholar
Deepti S, Dilip SS. Prediction of Diabetes using Classification Algorithms. Procedia Comput Sci. 2018;132:1578–85.
Article Google Scholar
Han W, Shengqi Y, Zhangqin H, Jian H, Xiaoyi W. Type 2 diabetes mellitus prediction model based on data mining. Inform Med Unlocked. 2018;10:100–7.
Article Google Scholar
Arrieta B, Rodriguez ADN, Del Ser J, Bennetot A, Tabik SB, González A, García S, Gil-López S, Molina D, Benjamins VR, Chatila RH, Francisco. Explainable Artificial Intelligence (XAI): Concepts. Opportunities and Challenges toward Responsible AI: Taxonomies; 2019.
Patil BM, Joshi RC, Durga T. Hybrid prediction model for Type-2 diabetic patients. Expert Syst Appl. 2010;37(12):8102–8.
Article Google Scholar
Dagliati A, Marini S, Sacchi L, et al. Machine Learning Methods to Predict Diabetes Complications. J Diabetes Sci Technol. 2018;12(2):295–302. https://doi.org/10.1177/1932296817706375.
Article Google Scholar
Zou Q, Qu K, Luo Y, Yin D, Ju Y, Tang H. Predicting Diabetes Mellitus With Machine Learning Techniques. Front Genet. 2018;9:515. https://www.frontiersin.org/article/10.3389/fgene.2018.00515.
Butt UM, Letchmunan S, Ali M, et al. Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications. J Healthc Eng Hindawi. 2021.
Asaduzzaman S, Al Masud F, Bhuiyan T, Ahmed K, Paul BK, Matiur Rahman SAM. Dataset on significant risk factors for Type 1 Diabetes: a Bangladeshi perspective. Data Brief. 2018;21:700–8 ISSN 2352-3409.
Article Google Scholar

Download references

Author information

Authors and Affiliations

Rome, Italy
Francesco Curia

Authors

Francesco Curia
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Francesco Curia.

Ethics declarations

Conflicts of interest

The author declares under his own responsibility that there are no conflicts of interest in the realization of this work.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (CSV 45 KB)

Supplementary file1 (CSV 6 KB)

Supplementary file1 (CSV 8 KB)

Supplementary file1 (DOCX 17 KB)

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Curia, F. Explainable and transparency machine learning approach to predict diabetes develop. Health Technol. 13, 769–780 (2023). https://doi.org/10.1007/s12553-023-00781-z

Download citation

Received: 03 January 2023
Accepted: 22 August 2023
Published: 27 September 2023
Issue Date: September 2023
DOI: https://doi.org/10.1007/s12553-023-00781-z

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Explainable and transparency machine learning approach to predict diabetes develop