Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study

  • Anurag Kumar Verma
  • Saurabh PalEmail author
  • Surjeet Kumar


Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.


Skin disease Dermatology Extra tree classifier Radius neighbors classifier Passive aggressive classifier 



passive aggressive classifier


linear discriminant analysis


radius neighbors classifier


Bernoulli naïve Bayesian


Gaussian naïve Bayesian


extra tree classifier


feature selection


Compliance with Ethical Standards

Conflict of Interest

The authors declare that they have no conflict of interest.

Research Involving Human Participants and/or Animals

This paper does not contain any studies with human participants or animals performed by any of the authors.


  1. 1.
    Chaurasia, V., Pal, S., & Tiwari, B. B. (2018). Chronic kidney disease: a predictive model using decision tree. International Journal of Engineering Research and Technology, 11(11), 1781–1794.Google Scholar
  2. 2.
    Chaurasia, V., Pal, S., & Tiwari, B. B. (2018). Prediction of benign and malignant breast cancer using data mining techniques. Journal of Algorithms & Computational Technology, 12(2), 119–126.CrossRefGoogle Scholar
  3. 3.
    Polat, K., & Güneş, S. (2009). A novel hybrid intelligent method based on C4. 5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications, 36(2), 1587–1592.CrossRefGoogle Scholar
  4. 4.
    Immagulate, I., & Vijaya, M. S. (2015). Categorization of non-melanoma skin lesion diseases using support vector machine and its variants. International Journal of Medical Imaging, 3(2), 34–40.CrossRefGoogle Scholar
  5. 5.
    Chang, C. L., & Chen, C. H. (2009). Applying decision tree and neural network to increase quality of dermatologic diagnosis. Expert Systems with Applications, 36(2), 4035–4041.CrossRefGoogle Scholar
  6. 6.
    Ramya, G., & Rajeshkumar, J. (2015). A novel method for segmentation of skin lesions from digital images. International Research Journal of Engineering and Technology, 2(8), 1544–1547.Google Scholar
  7. 7.
    Übeyli, E. D., & Doğdu, E. (2010). Automatic detection of erythemato-squamous diseases using k-means clustering. Journal of Medical Systems, 34(2), 179–184.CrossRefPubMedGoogle Scholar
  8. 8.
    Güvenir, H. A., Demiröz, G., & Ilter, N. (1998). Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals. Artificial Intelligence in Medicine, 13(3), 147–165.CrossRefPubMedGoogle Scholar
  9. 9.
    Ahmed, K., Jesmin, T., & Rahman, M. Z. (2013). Early prevention and detection of skin cancer risk using data mining. International Journal of Computer Applications, 62(4), 1–6.CrossRefGoogle Scholar
  10. 10.
    Fernando, Z. T., Trivedi, P., & Patni, A. (2013). DOCAID: predictive healthcare analytics using naive Bayes classification. In Second Student Research Symposium (SRS), International Conference on Advances in Computing, Communications and Informatics (ICACCI’13), 1–5.Google Scholar
  11. 11.
    Jaleel, J. A., Salim, S., & Aswin, R. B. (2012). Artificial neural network based detection of skin cancer. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 1(3), 200–205.Google Scholar
  12. 12.
    Theodoraki, E. M., Katsaragakis, S., Koukouvinos, C., & Parpoula, C. (2010). Innovative data mining approaches for outcome prediction of trauma patients. Journal of Biomedical Science and Engineering, 3(08), 791–798.CrossRefGoogle Scholar
  13. 13.
    Sharma, D. K., & Hota, H. S. (2013). Data mining techniques for prediction of different categories of dermatology diseases. Journal of Management Information and Decision Sciences, 16(2), 103.Google Scholar
  14. 14.
    Rambhajani, M., Deepanker, W., & Pathak, N. (2015). Classification of dermatology diseases through Bayes net and best first search. International Journal of Advanced Research in Computer and Communication Engineering, 4(5), 275–86.Google Scholar
  15. 15.
    Bakpo, F. S., & Kabari, L. G. (2011). Diagnosing skin diseases using an artificial neural network. In Artificial Neural Networks-Methodological Advances and Biomedical Applications, Suzuki K (ed.), intech. Available from:
  16. 16.
    Manjusha, K. K., Sankaranarayanan, K., & Seena, P. (2014). Prediction of different dermatological conditions using naive Bayesian classification. International Journal of Advanced Research in Computer Science and Software Engineering, 4(1), 864–868.Google Scholar
  17. 17.
    Yadav, D. C., & Pal, S. (2019). To generate an ensemble model for women thyroid prediction using data mining techniques. Asian Pacific Journal of Cancer Prevention, 20(4), 1275–1281.CrossRefPubMedGoogle Scholar
  18. 18.
    Tuba, E., Ribic, I., Capor-Hrosik, R., & Tuba, M. (2017). Support vector machine optimized by elephant herding algorithm for erythemato-squamous diseases detection. Procedia Computer Science, 122, 916–923.CrossRefGoogle Scholar
  19. 19.
    Zhang, X., Wang, S., Liu, J., & Tao, C. (2018). Towards improving diagnosis of skin diseases by combining deep neural network and human knowledge. BMC Medical Informatics and Decision Making, 18(2), 59.CrossRefPubMedPubMedCentralGoogle Scholar
  20. 20.
    Übeyli, E. D. (2009). Combined neural networks for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 36(3), 5107–5112.CrossRefGoogle Scholar
  21. 21.
    Lekkas, S., & Mikhailov, L. (2010). Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases. Artificial Intelligence in Medicine, 50(2), 117–126.CrossRefPubMedGoogle Scholar
  22. 22.
    Xie, J., & Wang, C. (2011). Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 38(5), 5809–5815.CrossRefGoogle Scholar
  23. 23.
    Cataloluk, H., & Kesler, M. (2012). A diagnostic software tool for skin diseases with basic and weighted K-NN in International Symposium on Innovations in Intelligent Systems and Applications, IEEE (2012), 1-4.Google Scholar
  24. 24.
    Olatunji, S. O., & Arif, H. (2013). Identification of erythemato-squamous skin diseases using extreme learning machine and artificial neural network. ICTACT Journal of Softw Computing, 4(1), 627–632.CrossRefGoogle Scholar
  25. 25.
    Ravichandran, K. S., Narayanamurthy, B., Ganapathy, G., Ravalli, S., & Sindhura, J. (2014). An efficient approach to an automatic detection of erythemato-squamous diseases. Neural Computing and Applications, 25(1), 105–114.CrossRefGoogle Scholar
  26. 26.
    Amarathunga, A. A. L. C., Ellawala, E. P. W. C., Abeysekara, G. N., & Amalraj, C. R. J. (2015). Expert system for diagnosis of skin diseases. International Journal of Scientific & Technology Research, 4(01), 174–178.Google Scholar
  27. 27.
    Parikh, K. S., Shah, T. P., Kota, R., & Vora, R. (2015). Diagnosing common skin diseases using soft computing techniques. International Journal of Bio-Science and Bio-Technology, 7(6), 275–286.CrossRefGoogle Scholar
  28. 28.
    Maghooli, K., Langarizadeh, M., Shahmoradi, L., Habibi-koolaee, M., Jebraeily, M., & Bouraghi, H. (2016). Differential diagnosis of erythemato-squamous diseases using classification and regression tree. Acta Informatica Medica, 24(5), 338.CrossRefPubMedPubMedCentralGoogle Scholar
  29. 29.
    Pravin, S. R., & Jafar, O. A. M. (2017). Prediction of skin disease using data mining techniques. IJARCCE, 6(7), 313–318.Google Scholar
  30. 30.
    Zhou, H., Xie, F., Jiang, Z., Liu, J., Wang, S., & Zhu, C. (2017). Multi-classification of skin diseases for dermoscopy images using deep learning. In 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (pp. 1-5). IEEE.Google Scholar
  31. 31.
    Idoko, J. B., Arslan, M., & Abiyev, R. (2018). Fuzzy neural system application to differential diagnosis of Erythemato-squamous diseases. Cyprus Journal of Medical Sciences, 3(2), 90–97.CrossRefGoogle Scholar

Copyright information

© Springer Science+Business Media, LLC, part of Springer Nature 2019

Authors and Affiliations

  1. 1.MCA DepartmentVBS Purvanchal UniversityJaunpurIndia

Personalised recommendations