Skip to main content

Advertisement

Log in

Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study

  • Published:
Applied Biochemistry and Biotechnology Aims and scope Submit manuscript

Abstract

Nowadays, skin disease is a major problem among peoples worldwide. Different machine learning techniques are applied to predict the various classes of skin disease. In this research paper, we have applied six different machine learning algorithm to categorize different classes of skin disease using three ensemble techniques and then a feature selection method to compare the results obtained from different machine learning techniques. In the proposed study, we present a new method, which applies six different data mining classification techniques and then developed an ensemble approach using bagging, AdaBoost, and gradient boosting classifiers techniques to predict the different classes of skin disease. Further, the feature importance method is used to select important 15 features which play a major role in prediction. A subset of the original dataset is obtained after selecting only 15 features to compare the results of used six machine learning techniques and ensemble approach as on the whole dataset. The ensemble method used on skin disease dataset is compared with the new subset of the original dataset obtained from feature selection method. The outcome shows that the dermatological prediction accuracy of the test dataset is increased compared with an individual classifier and a better accuracy is obtained as compared with subset obtained from feature selection method. The ensemble method and feature selection used on dermatology datasets give better performance as compared with individual classifier algorithms. Ensemble method gives more accurate and effective skin disease prediction.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9

Similar content being viewed by others

Abbreviations

PAC:

passive aggressive classifier

LDA:

linear discriminant analysis

RNC:

radius neighbors classifier

BNB:

Bernoulli naïve Bayesian

NB:

Gaussian naïve Bayesian

ETC:

extra tree classifier

FS:

feature selection

References

  1. Chaurasia, V., Pal, S., & Tiwari, B. B. (2018). Chronic kidney disease: a predictive model using decision tree. International Journal of Engineering Research and Technology, 11(11), 1781–1794.

    Google Scholar 

  2. Chaurasia, V., Pal, S., & Tiwari, B. B. (2018). Prediction of benign and malignant breast cancer using data mining techniques. Journal of Algorithms & Computational Technology, 12(2), 119–126.

    Article  Google Scholar 

  3. Polat, K., & Güneş, S. (2009). A novel hybrid intelligent method based on C4. 5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications, 36(2), 1587–1592.

    Article  Google Scholar 

  4. Immagulate, I., & Vijaya, M. S. (2015). Categorization of non-melanoma skin lesion diseases using support vector machine and its variants. International Journal of Medical Imaging, 3(2), 34–40.

    Article  Google Scholar 

  5. Chang, C. L., & Chen, C. H. (2009). Applying decision tree and neural network to increase quality of dermatologic diagnosis. Expert Systems with Applications, 36(2), 4035–4041.

    Article  Google Scholar 

  6. Ramya, G., & Rajeshkumar, J. (2015). A novel method for segmentation of skin lesions from digital images. International Research Journal of Engineering and Technology, 2(8), 1544–1547.

    Google Scholar 

  7. Übeyli, E. D., & Doğdu, E. (2010). Automatic detection of erythemato-squamous diseases using k-means clustering. Journal of Medical Systems, 34(2), 179–184.

    Article  Google Scholar 

  8. Güvenir, H. A., Demiröz, G., & Ilter, N. (1998). Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals. Artificial Intelligence in Medicine, 13(3), 147–165.

    Article  Google Scholar 

  9. Ahmed, K., Jesmin, T., & Rahman, M. Z. (2013). Early prevention and detection of skin cancer risk using data mining. International Journal of Computer Applications, 62(4), 1–6.

    Article  Google Scholar 

  10. Fernando, Z. T., Trivedi, P., & Patni, A. (2013). DOCAID: predictive healthcare analytics using naive Bayes classification. In Second Student Research Symposium (SRS), International Conference on Advances in Computing, Communications and Informatics (ICACCI’13), 1–5.

  11. Jaleel, J. A., Salim, S., & Aswin, R. B. (2012). Artificial neural network based detection of skin cancer. International Journal of Advanced Research in Electrical, Electronics and Instrumentation Engineering, 1(3), 200–205.

    Google Scholar 

  12. Theodoraki, E. M., Katsaragakis, S., Koukouvinos, C., & Parpoula, C. (2010). Innovative data mining approaches for outcome prediction of trauma patients. Journal of Biomedical Science and Engineering, 3(08), 791–798.

    Article  Google Scholar 

  13. Sharma, D. K., & Hota, H. S. (2013). Data mining techniques for prediction of different categories of dermatology diseases. Journal of Management Information and Decision Sciences, 16(2), 103.

    Google Scholar 

  14. Rambhajani, M., Deepanker, W., & Pathak, N. (2015). Classification of dermatology diseases through Bayes net and best first search. International Journal of Advanced Research in Computer and Communication Engineering, 4(5), 275–86.

  15. Bakpo, F. S., & Kabari, L. G. (2011). Diagnosing skin diseases using an artificial neural network. In Artificial Neural Networks-Methodological Advances and Biomedical Applications, Suzuki K (ed.), intech. Available from: http://www.intechopen.com/articles/show/title/diagnosing-skin-diseases-using-an-artificial-neural-network.

  16. Manjusha, K. K., Sankaranarayanan, K., & Seena, P. (2014). Prediction of different dermatological conditions using naive Bayesian classification. International Journal of Advanced Research in Computer Science and Software Engineering, 4(1), 864–868.

    Google Scholar 

  17. Yadav, D. C., & Pal, S. (2019). To generate an ensemble model for women thyroid prediction using data mining techniques. Asian Pacific Journal of Cancer Prevention, 20(4), 1275–1281.

    Article  Google Scholar 

  18. Tuba, E., Ribic, I., Capor-Hrosik, R., & Tuba, M. (2017). Support vector machine optimized by elephant herding algorithm for erythemato-squamous diseases detection. Procedia Computer Science, 122, 916–923.

    Article  Google Scholar 

  19. Zhang, X., Wang, S., Liu, J., & Tao, C. (2018). Towards improving diagnosis of skin diseases by combining deep neural network and human knowledge. BMC Medical Informatics and Decision Making, 18(2), 59.

    Article  Google Scholar 

  20. Übeyli, E. D. (2009). Combined neural networks for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 36(3), 5107–5112.

    Article  Google Scholar 

  21. Lekkas, S., & Mikhailov, L. (2010). Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases. Artificial Intelligence in Medicine, 50(2), 117–126.

    Article  Google Scholar 

  22. Xie, J., & Wang, C. (2011). Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 38(5), 5809–5815.

    Article  Google Scholar 

  23. Cataloluk, H., & Kesler, M. (2012). A diagnostic software tool for skin diseases with basic and weighted K-NN in International Symposium on Innovations in Intelligent Systems and Applications, IEEE (2012), 1-4.

  24. Olatunji, S. O., & Arif, H. (2013). Identification of erythemato-squamous skin diseases using extreme learning machine and artificial neural network. ICTACT Journal of Softw Computing, 4(1), 627–632.

    Article  Google Scholar 

  25. Ravichandran, K. S., Narayanamurthy, B., Ganapathy, G., Ravalli, S., & Sindhura, J. (2014). An efficient approach to an automatic detection of erythemato-squamous diseases. Neural Computing and Applications, 25(1), 105–114.

    Article  Google Scholar 

  26. Amarathunga, A. A. L. C., Ellawala, E. P. W. C., Abeysekara, G. N., & Amalraj, C. R. J. (2015). Expert system for diagnosis of skin diseases. International Journal of Scientific & Technology Research, 4(01), 174–178.

    Google Scholar 

  27. Parikh, K. S., Shah, T. P., Kota, R., & Vora, R. (2015). Diagnosing common skin diseases using soft computing techniques. International Journal of Bio-Science and Bio-Technology, 7(6), 275–286.

    Article  Google Scholar 

  28. Maghooli, K., Langarizadeh, M., Shahmoradi, L., Habibi-koolaee, M., Jebraeily, M., & Bouraghi, H. (2016). Differential diagnosis of erythemato-squamous diseases using classification and regression tree. Acta Informatica Medica, 24(5), 338.

    Article  Google Scholar 

  29. Pravin, S. R., & Jafar, O. A. M. (2017). Prediction of skin disease using data mining techniques. IJARCCE, 6(7), 313–318.

    Google Scholar 

  30. Zhou, H., Xie, F., Jiang, Z., Liu, J., Wang, S., & Zhu, C. (2017). Multi-classification of skin diseases for dermoscopy images using deep learning. In 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (pp. 1-5). IEEE.

  31. Idoko, J. B., Arslan, M., & Abiyev, R. (2018). Fuzzy neural system application to differential diagnosis of Erythemato-squamous diseases. Cyprus Journal of Medical Sciences, 3(2), 90–97.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Saurabh Pal.

Ethics declarations

Conflict of Interest

The authors declare that they have no conflict of interest.

Research Involving Human Participants and/or Animals

This paper does not contain any studies with human participants or animals performed by any of the authors.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Verma, A.K., Pal, S. & Kumar, S. Prediction of Skin Disease Using Ensemble Data Mining Techniques and Feature Selection Method—a Comparative Study. Appl Biochem Biotechnol 190, 341–359 (2020). https://doi.org/10.1007/s12010-019-03093-z

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12010-019-03093-z

Keywords

Navigation