Skip to main content

Prediction of Skin Disease with Three Different Feature Selection Techniques Using Stacking Ensemble Method

Abstract

Skin disease is the most common problem between people. Due to pollution and deployment of ozone layer, harmful UV rays of sun burn the skin and develop various types of skin diseases. Nowadays, machine learning and deep learning algorithms are generally used for diagnosis for various kinds of diseases. In this study, we have applied three feature extraction techniques univariate feature selection, feature importance, and correlation matrix with heat map to find the optimum data subset of erythemato-squamous disease. Four classification techniques Gaussian Naïve Bayesian (NB), decision tree (DT), support vector machine (SVM), and random forest are used for measuring the performance of model. Stacking ensemble technique is then applied to enhance the prediction performance of the model. The proposed method used for measuring the performance of the model. It is finding that the optimal subset of the erythemato-squamous disease is performed well in the case of correlation and heat map feature selection techniques. The mean value, slandered deviation, root mean square error, kappa statistical error, and area under receiver operating characteristics and accuracy are calculated for demonstrating the effectiveness of the proposed model. The feature selection techniques applied with staking ensemble technique gives the better result as compared to individual machine learning techniques. The obtained results show that the performance of proposed model is higher than previous results obtained by researchers.

This is a preview of subscription content, access via your institution.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6

Data Availability

The datasets used in this paper are publicly online available as described in “Dataset Analysis” section.

References

  1. Güvenir, H. A., Demiröz, G., & Ilter, N. (1998). Learning differential diagnosis of erythemato-squamous diseases using voting feature intervals. Artificial Intelligence in Medicine, 13(3), 147–165. https://doi.org/10.1016/S0933-3657(98)00028-1.

    Article  PubMed  Google Scholar 

  2. Barati, E., Saraee, M., Mohammadi, A., Adibi, N., & Ahamadzadeh, M. R. (2011). A survey on utilization of data mining approaches for dermatological (skin) diseases prediction. Journal of Selected Areas in Health Informatics, 2(3), 1–11.

    Google Scholar 

  3. Ravichandran, K. S., Narayanamurthy, B., Ganapathy, G., Ravalli, S., & Sindhura, J. (2014). An efficient approach to an automatic detection of erythemato-squamous diseases. Neural Computing and Applications, 25(1), 105–114. https://doi.org/10.1007/s00521-013-1452-5.

    Article  Google Scholar 

  4. Badrinath, N., Gopinath, G., Ravichandran, K. S., & Soundhar, R. G. (2016). Estimation of automatic detection of erythemato-squamous diseases through AdaBoost and its hybrid classifiers. Artificial Intelligence Review, 45(4), 471–488. https://doi.org/10.1007/s10462-015-9436-8.

    Article  Google Scholar 

  5. Verma, A. K., Pal, S., & Kumar, S. (2019). Prediction of skin disease using ensemble data mining techniques and feature selection method—a comparative study. Applied Biochemistry and Biotechnology, 1–19. https://doi.org/10.1007/s12010-019-03093-z.

  6. Xie, J., Lei, J., Xie, W., Shi, Y., & Liu, X. (2013). Two-stage hybrid feature selection algorithms for diagnosing erythemato-squamous diseases. Health Information Science and Systems, 1(1), 1–14. https://doi.org/10.1186/2047-2501-1-10.

    Article  Google Scholar 

  7. Übeyli, E. D., & Doǧdu, E. (2010). Automatic detection of erythemato-squamous diseases using κ-means clustering. Journal of Medical Systems, 34(2), 179–184. https://doi.org/10.1007/s10916-008-9229-6.

    Article  PubMed  Google Scholar 

  8. Manjusha, K. K., Sankaranarayanan, K., & Seena, P. (2015). Data mining in dermatological diagnosis: a method for severity prediction. International Journal of Computer Applications, 117(11), 11–14. https://doi.org/10.5120/20597-3102.

    Article  Google Scholar 

  9. Ozcift, A., & Gulten, A. (2012). A robust multi-class feature selection strategy based on rotation forest ensemble algorithm for diagnosis of erythemato-squamous diseases. Journal of Medical Systems, 36(2), 941–949. https://doi.org/10.1007/s10916-010-9558-0.

    Article  PubMed  Google Scholar 

  10. Giveki, D. (2012). Detection of erythemato-squamous diseases using AR-CatfishBPSO-KSVM. Signal & Image Processing : An International Journal, 2(4), 57–72. https://doi.org/10.5121/sipij.2011.2406.

    Article  Google Scholar 

  11. Kabari, L. G., & Bakpo, F. S. (2009). Diagnosing skin diseases using an artificial neural network. ICAST 2009 - 2nd International Conference on Adaptive Science and Technology, 187–191. https://doi.org/10.1109/ICASTECH.2009.5409725.

  12. Aruna, S., Nandakishore, V. L., & Rajagopalan, P. S. (2012). A hybrid feature selection method based on IGSBFS and naive Bayes for the diagnosis of erythemato-squamous diseases. International Journal of Computer Applications, 41(7), 13–18. https://doi.org/10.5120/5552-7623.

    Article  Google Scholar 

  13. Almarabeh, H., & Ehab, F. E. (2017). A study of data mining techniques accuracy for healthcare. International Journal of Computer Applications, 168(3), 12–17. https://doi.org/10.5120/ijca2017914338.

    Article  Google Scholar 

  14. Maryam, Setiawan, N. A., & Wahyunggoro, O. (2017). A hybrid feature selection method using multiclass SVM for diagnosis of erythemato-squamous disease. AIP Conference Proceedings, 1867(August). https://doi.org/10.1063/1.4994451.

  15. Verma, A. K., Pal, S., & Kumar, S. (2019). Classification of skin disease using ensemble data mining techniques. Asian Pacific Journal of Cancer Prevention : APJCP, 20(6), 1887–1894. https://doi.org/10.31557/APJCP.2019.20.6.1887.

    Article  PubMed  Google Scholar 

  16. Chaurasia, V., & Pal, S. (2019). Skin diseases prediction: binary classification machine learning and multi model ensemble techniques. Research Journal of Pharmacy and Technology, 12(August), 3829–3832. https://doi.org/10.5958/0974-360X.2019.00656.5.

    Article  Google Scholar 

  17. Wang, N., Xu, H. L., Zhao, X., Wen, X., Wang, F. T., Wang, S. Y., Fu, L. L., Liu, B., & Bao, J. K. (2012). Network-based identification of novel connections among apoptotic signaling pathways in cancer. Applied Biochemistry and Biotechnology, 167(3), 621–631. https://doi.org/10.1007/s12010-012-9704-x.

    CAS  Article  PubMed  Google Scholar 

  18. Banerjee, A. K., Ravi, V., Murty, U. S. N., Sengupta, N., & Karuna, B. (2013). Application of intelligent techniques for classification of bacteria using protein sequence-derived features. Applied Biochemistry and Biotechnology, 170(6), 1263–1281. https://doi.org/10.1007/s12010-013-0268-1.

    CAS  Article  PubMed  Google Scholar 

  19. Behbahani, M., Nosrati, M., & Moradi, M. (2019). Using Chou’s general pseudo amino acid composition to classify laccases from bacterial and fungal sources via Chou’s five-step rule. Applied Biochemistry and Biotechnology, 1–14. https://doi.org/10.1007/s12010-019-03141-8.

  20. Polat, K., & Güneş, S. (2009). A novel hybrid intelligent method based on C4.5 decision tree classifier and one-against-all approach for multi-class classification problems. Expert Systems with Applications, 36(2), 1587–1592. https://doi.org/10.1016/J.ESWA.2007.11.051.

    Article  Google Scholar 

  21. Übeyli, E. D. (2009). Combined neural networks for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 36(3), 5107–5112. https://doi.org/10.1016/J.ESWA.2008.06.002.

    Article  Google Scholar 

  22. Chang, C. L., & Chen, C. H. (2009). Applying decision tree and neural network to increase quality of dermatologic diagnosis. Expert Systems with Applications, 36(2), 4035–4041. https://doi.org/10.1016/J.ESWA.2008.03.007.

    Article  Google Scholar 

  23. Lekkas, S., & Mikhailov, L. (2010). Evolving fuzzy medical diagnosis of Pima Indians diabetes and of dermatological diseases. Artificial Intelligence in Medicine, 50(2), 117–126. https://doi.org/10.1016/J.ARTMED.2010.05.007.

    Article  PubMed  Google Scholar 

  24. Xie, J., & Wang, C. (2011). Using support vector machines with a novel hybrid feature selection method for diagnosis of erythemato-squamous diseases. Expert Systems with Applications, 38(5), 5809–5815. https://doi.org/10.1016/J.ESWA.2010.10.050.

    Article  Google Scholar 

  25. Çataloluk, H., & Kesler, M. (2012). A diagnostic software tool for skin diseases with basic and weighted K-NN. INISTA 2012 - International Symposium on INnovations in Intelligent SysTems and Applications, 0–3. https://doi.org/10.1109/INISTA.2012.6246999.

  26. Olatunji, S. O., & Arif, H. (2013). Identification of erythemato-squamous skin diseases using extreme learning machine and artificial neural network. ICTACT Journal on Soft Computing, 4(1), 627–632. https://doi.org/10.21917/ijsc.2013.0090.

    Article  Google Scholar 

  27. Sharma, D., & Hota, H. (2013). Data mining techniques for prediction of different categories of dermatology diseases. Journal of Management Information and Decision Sciences, 16(2), 103.

    Google Scholar 

  28. Olatunji, S., & Arif, H. (2014). Identification of erythemato-squamous skin diseases using support vector machines and extreme learning machines: a comparative study towards effective diagnosis. Transactions on Machine Learning and Artificial Intelligence, 2(6). https://doi.org/10.14738/tmlai.26.812.

  29. Amarathunga, A. A. L. C., Ellawala, E. P. W. C., Abeysekara, G. N., & Amalraj, C. R. J. (2015). Expert system for diagnosis of skin diseases. International Journal of Scientific & Technology Research, 4(1), 174–178.

    Google Scholar 

  30. Parikh, K. S., Shah, T. P., Kota, R. K., & Vora, R. (2015). Diagnosing common skin diseases using soft computing techniques. International Journal of Bio-Science and Bio-Technology, 7(6), 275–286. https://doi.org/10.14257/ijbsbt.2015.7.6.28.

    Article  Google Scholar 

  31. Maghooli, K., Langarizadeh, M., Shahmoradi, L., Habibi-Koolaee, M., Jebraeily, M., & Bouraghi, H. (2016). Differential diagnosis of erythemato-squamous diseases using classification and regression tree. Acta informatica medica : AIM : journal of the Society for Medical Informatics of Bosnia & Herzegovina : casopis Drustva za medicinsku informatiku BiH, 24(5), 338–342. https://doi.org/10.5455/aim.2016.24.338-342.

    Article  Google Scholar 

  32. Zhou, H., Xie, F., Jiang, Z., Liu, J., Wang, S., & Zhu, C. (2017). Multi-classification of skin diseases for dermoscopy images using deep learning. In 2017 IEEE International Conference on Imaging Systems and Techniques (IST) (pp. 1–5). IEEE. https://doi.org/10.1109/IST.2017.8261543.

  33. Idoko, J. B., Arslan, M., & Abiyev, R. (2018). Fuzzy neural system application to differential diagnosis of erythemato-squamous diseases. Cyprus Journal of Medical Sciences, 90–97. https://doi.org/10.5152/cjms.2018.576.

  34. Zhang, X., Wang, S., Liu, J., & Tao, C. (2018). Towards improving diagnosis of skin diseases by combining deep neural network and human knowledge. BMC Medical Informatics and Decision Making, 18(S2), 59. https://doi.org/10.1186/s12911-018-0631-9.

    Article  PubMed  PubMed Central  Google Scholar 

Download references

Author information

Affiliations

Authors

Contributions

All authors contributed both the concepts and contents of this study. AKV provided the manuscript under supervised by SP. All authors also performed discussion intensively for contents improvement. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Saurabh Pal.

Ethics declarations

Conflict of Interest

The authors declare that they have no competing interests.

Additional information

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Verma, A.K., Pal, S. Prediction of Skin Disease with Three Different Feature Selection Techniques Using Stacking Ensemble Method. Appl Biochem Biotechnol 191, 637–656 (2020). https://doi.org/10.1007/s12010-019-03222-8

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s12010-019-03222-8

Keywords

  • Erythemato-squamous disease
  • SVM
  • Stacking
  • RMSE
  • KSE