Abstract
Purpose
This study aims to address the problem of type 1 diabetes by utilizing machine learning techniques and developing a decision support system based on Explainable Artificial Intelligence (XAI). The main research question is to predict the risk of developing type 1 diabetes in a population using different machine learning algorithms, while ensuring interpretability and transparency of the decision support system. The study builds upon a case-control study conducted by previous researchers, who approached the problem from a statistical-parametric perspective.
Method
In this work, various machine learning algorithms, including Decision Trees (DT), Deep Neural Networks (DNN), XGBoost (XGB), Logistic Regression (LR), K-Nearest Neighbors (KNN), and Support Vector Classifier (SVC), are employed. The algorithms are evaluated based on their ability to predict the disease risk accurately and consistently on both the training and validation datasets. Additionally, Explainable AI techniques such as LIME (Local interpretable model-agnostic explanations) are employed to contextualize and interpret each prediction and assess the importance of various characteristics influencing the probability of developing the disease.
Results
The results obtained from the application of machine learning algorithms show promising outcomes on both the training and validation datasets. However, the best-performing algorithms are not necessarily those with the highest accuracy, as they may suffer from overfitting. Instead, algorithms such as DNNs (97%) or KNNs (93%) exhibit similar behavior on both training and test datasets, making them more reliable, LR and SVC both around (98.3%). The adoption of Explainable AI techniques enables the measurement of each characteristic’s importance and the analysis of factors influencing the disease’s development probability. This allows the development of a clinical decision support system (CDSS) that is immediately understandable, transparent, and interpretable. By leveraging machine learning techniques and Explainable AI, this study addresses the challenge of type 1 diabetes prediction and decision support.
Conclusion
The results indicate that algorithms like DNNs and KNNs offer reliable performance in predicting the risk of developing type 1 diabetes. The integration of Explainable AI techniques, specifically LIME, enhances the interpretability of predictions and provides insights into the factors influencing the disease. The developed CDSS based on XAI can potentially assist healthcare professionals in making informed clinical decisions, thereby improving patient care and management of type 1 diabetes.
Similar content being viewed by others
References
O’Connor PJ, Sperl-Hillen JM, Rush WA, Johnson PE, Amundson GH, Asche SE, Ekstrom HL, Gilmer TP. Impact of electronic health record clinical decision support on diabetes care: a randomized trial. Ann Fam Med. 2011.
Georga E, Protopappas V, Arvaniti E, Fotiadis D. The Diabino System: Temporal Pattern Mining from Diabetes Healthcare and Daily Self-monitoring Data; 2019.
Rung-Ching C, Hui Qin J, Chung-Yi H, Cho-Tsan B. Clinical Decision Support System for Diabetes Based on Ontology Reasoning and TOPSIS Analysis. Artif Intell Med Appl. 2017.
Amatul Z, Asmawaty AK, Aznan MAM. A comparative study on the pre-processing and mining of Pima Indian diabetes dataset. 2013.
International Diabetes Federation. 2021. https://idf.org/news/diabetes-now-affects-one-in-10-adults-worldwide/.
Pima Indian Diabetes Database, Schulz LO, Bennett PH, Ravussin E, Kidd JR, Kidd KK, Esparza J, Valencia ME. Effects of traditional and western environments on prevalence of type 2 diabetes in Pima Indians in Mexico and the US. Diabetes Care. 2006;29(8):1866–71. https://doi.org/10.2337/dc06-0138.
Deepti S, Dilip SS. Prediction of Diabetes using Classification Algorithms. Procedia Comput Sci. 2018;132:1578–85.
Han W, Shengqi Y, Zhangqin H, Jian H, Xiaoyi W. Type 2 diabetes mellitus prediction model based on data mining. Inform Med Unlocked. 2018;10:100–7.
Arrieta B, Rodriguez ADN, Del Ser J, Bennetot A, Tabik SB, González A, García S, Gil-López S, Molina D, Benjamins VR, Chatila RH, Francisco. Explainable Artificial Intelligence (XAI): Concepts. Opportunities and Challenges toward Responsible AI: Taxonomies; 2019.
Patil BM, Joshi RC, Durga T. Hybrid prediction model for Type-2 diabetic patients. Expert Syst Appl. 2010;37(12):8102–8.
Dagliati A, Marini S, Sacchi L, et al. Machine Learning Methods to Predict Diabetes Complications. J Diabetes Sci Technol. 2018;12(2):295–302. https://doi.org/10.1177/1932296817706375.
Zou Q, Qu K, Luo Y, Yin D, Ju Y, Tang H. Predicting Diabetes Mellitus With Machine Learning Techniques. Front Genet. 2018;9:515. https://www.frontiersin.org/article/10.3389/fgene.2018.00515.
Butt UM, Letchmunan S, Ali M, et al. Machine Learning Based Diabetes Classification and Prediction for Healthcare Applications. J Healthc Eng Hindawi. 2021.
Asaduzzaman S, Al Masud F, Bhuiyan T, Ahmed K, Paul BK, Matiur Rahman SAM. Dataset on significant risk factors for Type 1 Diabetes: a Bangladeshi perspective. Data Brief. 2018;21:700–8 ISSN 2352-3409.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The author declares under his own responsibility that there are no conflicts of interest in the realization of this work.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Curia, F. Explainable and transparency machine learning approach to predict diabetes develop. Health Technol. 13, 769–780 (2023). https://doi.org/10.1007/s12553-023-00781-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12553-023-00781-z