Diagnosis of diabetes type-II using hybrid machine learning based ensemble model

  • Abid Sarwar
  • Mehbob Ali
  • Jatinder Manhas
  • Vinod Sharma
Original Research


This paper presents an expert-system-based ensemble model for diagnosing type-II diabetes. Diabetes mellitus is a disease with a high mortality rate that affects more than 60% of the population. The aim of this work is to analyze various machine learning techniques for binary classification of the illness, i.e. to diagnose whether or not a subject is suffering from the disease. In total, fifteen classifiers are considered, and of these, five major techniques are used: ANN, SVM, KNN, Naive Bayes and Ensemble. The tools employed were Matrix Laboratory (MATLAB) and WEKA 3.6.13. In the ensemble method, the predictive potentials of various individual classifiers are fused together. Combining the classifying ability of the individual classifiers improves performance and significantly reduces the chance of misclassifying a particular instance, which gives the overall classification process greater accuracy. The ensemble takes a majority vote over the individual predictions and returns the resulting filtered decision. The medical database analysed in this study covers about 400 people from across a wide geographical region and ten physiological attributes. Furthermore, the diagnostic tool is evaluated using ten-fold cross-validation, and its output has been compared with the actual diagnoses of the cases. A GUI-based diagnostic tool founded upon the ensemble classifier has been developed so that it can predict whether or not a patient is suffering from the disease when the user supplies all 10 attributes through a user-friendly GUI (Graphical User Interface). The diagnostic tool was developed using MATLAB 2013a. Of the 10 parameters that the user needs to enter in the GUI-based diagnostic tool, five are numeric and the rest are nominal values.
The diagnostic tool in execution is shown in Fig. 3. The main objective of this manuscript is to propose an intelligent framework that will act as a useful aid for doctors, so that a correct and timely diagnosis can be made at an early stage. The results indicate that the ensemble technique, which combines the predictive performance of multiple AI-based algorithms, achieved an accuracy of 98.60% and is superior to all of the individual classifiers. The next most accurate algorithms, in order, were Artificial Neural Network (ANN), Naïve Bayes, Support Vector Machine (SVM) and K-Nearest Neighbor (K-NN).
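
The majority-voting and ten-fold cross-validation steps mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' MATLAB/WEKA implementation: the function names, the toy threshold classifiers, the attribute keys `x1` and `x2`, the tie-breaking rule and the unshuffled contiguous folds are all assumptions made purely for illustration.

```python
from collections import Counter


def majority_vote(predictions):
    """Return the label predicted by most classifiers.

    Ties go to the tied label that appears first in `predictions`
    (an arbitrary rule assumed for this sketch).
    """
    counts = Counter(predictions)
    best = max(counts.values())
    for label in predictions:
        if counts[label] == best:
            return label


def ensemble_predict(classifiers, sample):
    """Collect one label per base classifier and fuse them by majority vote."""
    return majority_vote([clf(sample) for clf in classifiers])


def k_fold_indices(n_samples, k=10):
    """Split range(n_samples) into k contiguous test folds of near-equal size.

    No shuffling or stratification is applied (a simplifying assumption).
    """
    folds = []
    start = 0
    for i in range(k):
        # The first (n_samples % k) folds receive one extra sample.
        size = n_samples // k + (1 if i < n_samples % k else 0)
        folds.append(list(range(start, start + size)))
        start += size
    return folds


# Hypothetical stand-ins for trained base classifiers (e.g. ANN, SVM, K-NN).
def clf_a(s):
    return "diabetic" if s["x1"] > 0.5 else "non-diabetic"


def clf_b(s):
    return "diabetic" if s["x2"] > 0.5 else "non-diabetic"


def clf_c(s):
    return "diabetic" if s["x1"] + s["x2"] > 1.2 else "non-diabetic"


sample = {"x1": 0.9, "x2": 0.2}
print(ensemble_predict([clf_a, clf_b, clf_c], sample))  # two of three vote "non-diabetic"

folds = k_fold_indices(400, k=10)
print(len(folds), len(folds[0]))  # ten test folds of 40 samples each
```

In each cross-validation round, one fold would serve as the test set and the remaining nine as training data; accuracy is then averaged over the ten rounds.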


Keywords: GUI based diagnostic tool; Ensemble method; MATLAB 2013a; Diabetes; WEKA 3.6.13; Classifiers and expert systems


Copyright information

© Bharati Vidyapeeth's Institute of Computer Applications and Management 2018

Authors and Affiliations

  • Abid Sarwar (1), Email author
  • Mehbob Ali (2)
  • Jatinder Manhas (1)
  • Vinod Sharma (1)

  1. University of Jammu, Jammu, India
  2. University of Kashmir, Kashmir, India
