Abstract
Diabetes is one of the high-risk medical diseases, in which, blood sugar levels gets higher. It is one of the leading causes of increase in deaths, worldwide. In 2040, the world's diabetic patients will hit 642 million approximately, according to the rising morbidity in recent years. This leads to an observation that one of the ten adults in the future will suffer from diabetes. This motivates researchers to adapt both machine learning and deep learning for early diagnosis of diabetic patients. Significant number of data mining and machine learning techniques has applied on diabetes dataset for risk prediction of disease. The objective of this paper is to analyze all the famous machine learning techniques, namely, Random Forest, Decision Trees, K-Nearest Neighbor, Gradient Boosting, Support Vector Machine, and Extra Trees on well-known diabetic patient's dataset PIMA. Thereafter, deep learning model ANN is also applied for comparative analysis. From the obtained results, it is observed that extra tree classifier outperforms other algorithms having an accuracy of 81.16% along with a good AUC score of 81%. In addition, artificial neural network (ANN) obtains an accuracy of 73.58% on test dataset, which is quite low as the dataset is small.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lonappan A, Bindu G, Thomas V, Jacob J, Rajasekaran C, Mathew KT (2007) Diagnosis of diabetes mellitus using microwaves. J Electromagn Waves Appl 21(10):1393–1401
Krasteva A, Panov V, Krasteva A, Kisselova A, Krastev Z (2011) Oral cavity and systemic diseases—diabetes mellitus. Biotechnol Biotechnol Equip 25(1):2183–2186
Sorin BUZURA, Vasile DADARLAT, Bogdan IANCU, Adrian PECULEA, Emil CEBUC, Rudolf KOVACS et al. (2020) Logical, masking internal node. In: 2020 IEEE International conference on automation, quality and testing, robotics.
Mujumdar, Aishwarya, Vaidehi V (2019) Diabetes prediction using machine learning algorithms. Procedia Comput Sci 165:292–299
Cox ME, Edelman D (2009) Tests for screening and diagnosis of type 2 diabetes. Clin Diabetes 27(4):132–138
Polat K, GĂ¼neÅŸ S (2007) An expert system approach based on principal component analysis and adaptive neuro-fuzzy inference system to diagnosis of diabetes disease. Digit Signal Process 17(4):702–710
ÇaliÅŸir D, DoÄŸantekin E (2011) An automatic diabetes diagnosis system based on LDA-wavelet support vector machine classifier. Expert Syst Appl 38(7):8311–8315
Kavakiotis I, Tsave O, Salifoglou A, Maglaveras N, Vlahavas I, Chouvarda I (2017) Machine learning and data mining methods in diabetes research. Comput Struct Biotechnol J 15:104–116
Lee, Bum Ju, Jong Yeol Kim (2015) Identification of type 2 diabetes risk factors using phenotypes consisting of anthropometry and triglycerides based on machine learning. IEEE J Biomed Health Inform 20(1):39–46
Ozcift A, Gulten A (2011) Classifier ensemble construction with rotation forest to improve medical diagnosis performance of machine learning algorithms. Comput Methods Programs Biomed 104(3):443–451
Han L, Luo S, Jianmin Yu, Pan L, Chen S (2014) Rule extraction from support vector machines using ensemble learning approach: an application for diagnosis of diabetes. IEEE J Biomed Health Inform 19(2):728–734
Kumari V Anuja, Chitra R (2013) Classification of diabetes disease using support vector machine. Int J Eng Res Appl 3(2):1797–1801
.Zehra, Amatul, Tuty Asmawaty, Aznan M (2014) A comparative study on the pre-processing and mining of Pima Indian diabetes dataset. technical report. 80, 98, 99, 102, 106, 138, 141, 142
Marom, Nadav David, Lior Rokach, Armin Shmilovici (2010) Using the confusion matrix for improving ensemble classifiers. In 2010 IEEE 26th Convention of electrical and electronics engineers in Israel. 000555–000559. IEEE
Bradley AP (1997) The use of the area under the ROC curve in the evaluation of machine learning algorithms. Pattern Recogn 30(7):1145–1159
Alyoubi Wejdan L, Shalash Wafaa M, Abulkhair Maysoon F (2020) Diabetic retinopathy detection through deep learning techniques: a review. Inform Med Unlocked 100377
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2021 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Malviya, L., Mal, S., Lalwani, P., Chadha, J.S. (2021). Diabetes Classification Using Machine Learning and Deep Learning Models. In: Bajpai, M.K., Kumar Singh, K., Giakos, G. (eds) Machine Vision and Augmented Intelligence—Theory and Applications. Lecture Notes in Electrical Engineering, vol 796. Springer, Singapore. https://doi.org/10.1007/978-981-16-5078-9_40
Download citation
DOI: https://doi.org/10.1007/978-981-16-5078-9_40
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-16-5077-2
Online ISBN: 978-981-16-5078-9
eBook Packages: Computer ScienceComputer Science (R0)