An Empirical Analysis of Machine Learning Classifiers for Clinical Decision Making in Asthma

Conference paper
Part of the Communications in Computer and Information Science book series (CCIS, volume 801)

Abstract

An empirical analysis of the various machine learning classifiers on Asthma data and their performance assessment are presented. As an integral part of preprocessing the data, feature reduction and outlier detection are carried out, following which classifiers are applied to the same to distinguish asthmatic subjects from those of non-asthmatic ones. In cases where the classifiers perform poorly, their performance is boosted via adaptive classifiers. Feature ranking is adopted to reduce the number of features that are considered for the final process of classification while support vector machine with radial basis function is deployed to eliminate outliers. The performance is evaluated via cross validation and random sampling with variable training sizes. The various performance metrics evaluated for the system including precision, recall, F1-measure and classification accuracy indicate good results. The study can be used as a guideline for the effective application of knowledge discovery techniques on clinical data in order to explore hidden knowledge representations.

Keywords

Sampling Cross validation Bootstrap Outliers Knowledge discovery 

References

  1. 1.
    Tu, M.C., Shin, D., Shin, D.: A comparative study of medical data classification methods based on decision tree and bagging algorithms. In: Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing. IEEE (2009)Google Scholar
  2. 2.
    Prasadl, B.D.C.N., Krishna Prasad, P.N., Sagar, Y.: An approach to develop expert systems in medical diagnosis using machine learning algorithms (Asthma) and a performance study. Int. J. Soft Comput. (IJSC) 2, 26–33 (2011)CrossRefGoogle Scholar
  3. 3.
    Rahman, R.M., Hasan, F.R.M.: Using and comparing different decision tree classification techniques for mining ICDDR, B Hospital surveillance data. Expert Syst. Appl. 38(9), 11421–11436 (2011)CrossRefGoogle Scholar
  4. 4.
    Jensen, P.B., Jensen, L.J., Brunak, S.: Mining electronic health records: towards better research applications and clinical care. Nat. Rev. Genet. 13, 395–405 (2012)CrossRefGoogle Scholar
  5. 5.
    Saunes, M., et al.: Early eczema and the risk of childhood asthma a prospective population-based study. BMC Pediatr. 12, 168 (2012)CrossRefGoogle Scholar
  6. 6.
    Chen, X., Ishwaran, H.: Random forests for genomic data analysis. Genomics 99, 323–329 (2012)CrossRefGoogle Scholar
  7. 7.
    Bright, T.J., et al.: Effect of clinical decision-support systems systematic review. Ann. Intern. Med. 157, 29–43 (2012)CrossRefGoogle Scholar
  8. 8.
    Lomotan, E.A., et al.: Evaluating the use of a computerized clinical decision support system for asthma by pediatric pulmonologists. Int. J. Med. Inf. 81, 157–165 (2012)CrossRefGoogle Scholar
  9. 9.
    Amaral, J.L.M., et al.: Machine learning algorithms and forced oscillation measurements applied to the automatic identification of chronic obstructive pulmonary disease. Comput. Methods Programs Biomed. 105, 183–193 (2012)CrossRefGoogle Scholar
  10. 10.
    Pathak, J., Kho, A.N., Denny, J.C.: Electronic health records-driven phenotyping: challenges, recent advances, and perspectives. J. Am. Med. Inf. Assoc. 20, 206–211 (2013)CrossRefGoogle Scholar
  11. 11.
    Shivade, C., et al.: A review of approaches to identifying patient phenotype cohorts using electronic health record. J. Am. Med. Inf. Assoc. 21, 221–230 (2013)CrossRefGoogle Scholar
  12. 12.
    Tomar, D., Agarwal, S.: A survey on data mining approaches for healthcare. Int. J. Bio-Sci. Bio-Technol. 5, 241–266 (2013)CrossRefGoogle Scholar
  13. 13.
    Nambiar, R., et al.: A look at challenges and opportunities of big data analytics in healthcare. In: 2013 IEEE International Conference on Big Data (2013)Google Scholar
  14. 14.
    Zięba, M., et al.: Boosted SVM for extracting rules from imbalanced data in application to prediction of the post-operative life expectancy in the lung cancer patients. Appl. Soft Comput. 14, 99–108 (2014)CrossRefGoogle Scholar
  15. 15.
    Velickovski, F., et al.: Clinical decision support systems (CDSS) for preventive management of COPD patients. J. Transl. Med. 12, S9 (2014)CrossRefGoogle Scholar
  16. 16.
    Emanet, N., et al.: A comparative analysis of machine learning methods for classification type decision problems in healthcare. Decis. Anal. 1, 6 (2014)CrossRefGoogle Scholar
  17. 17.
    Wu, W., et al.: Unsupervised phenotyping of severe asthma research program participants using expanded lung data. J. Allergy Clin. Immunol. 133(5), 1280–1288 (2014)CrossRefGoogle Scholar
  18. 18.
    Wang, X., Sontag, D., Wang, F.: Unsupervised learning of disease progression models. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining. ACM (2014)Google Scholar
  19. 19.
    Schneeweiss, S.: Learning from big health care data. New Engl. J. Med. 370, 2161–2163 (2014)CrossRefGoogle Scholar
  20. 20.
    Prosperi, M.C.F., et al.: Predicting phenotypes of asthma and eczema with machine learning. BMC Med. Genom. 7, S7 (2014)CrossRefGoogle Scholar
  21. 21.
    Liang, Z., et al.: Deep learning for healthcare decision making with EMRs. In: IEEE International Conference on Bioinformatics and Biomedicine (BIBM) (2014)Google Scholar
  22. 22.
    Luo, G.: MLBCD: a machine learning tool for big clinical data. Health Inf. Sci. Syst. 3, 3 (2015)CrossRefGoogle Scholar
  23. 23.
    Caban, J.J., Gotz, D.: Visual analytics in healthcare–opportunities and research challenges, pp. 260–262 (2015)Google Scholar
  24. 24.
    Amaral, J.L.M., et al.: Machine learning algorithms and forced oscillation measurements to categorize the airway obstruction severity in chronic obstructive pulmonary disease. Comput. Methods Programs Biomed. 11, 186–197 (2015)CrossRefGoogle Scholar
  25. 25.
    Bateman, E.D., et al.: Development and validation of a novel risk score for asthma exacerbations: the risk score for exacerbations. J. Allergy Clin. Immunol. 135, 1457–1464 (2015)CrossRefGoogle Scholar
  26. 26.
    Deo, R.C.: Machine learning in medicine. Circulation 132, 1920–1930 (2015)CrossRefGoogle Scholar
  27. 27.
    Howard, R., et al.: Distinguishing asthma phenotypes using machine learning approaches. Curr. Allergy Asthma Reports, 1–10 (2015)Google Scholar
  28. 28.
    Berner, Eta S., La Lande, Tonya J.: Overview of clinical decision support systems. In: Berner, Eta S. (ed.) Clinical Decision Support Systems. HI, pp. 1–17. Springer, Cham (2016).  https://doi.org/10.1007/978-3-319-31913-1_1Google Scholar
  29. 29.
    Lefeuvre, D., Delmas, M., Marguet, C., Chauvin, P., Vandentorren, S.: Asthma-like symptoms in homeless children in the greater Paris area in, prevalence, associated factors and utilization of healthcare services in the ENFAMS survey. PLOS ONE 11(4), e0153872 (2013).  https://doi.org/10.1371/journal.pone.0153872CrossRefGoogle Scholar

Copyright information

© Springer Nature Singapore Pte Ltd. 2018

Authors and Affiliations

  1. 1.Vidyavardhaka College of EngineeringMysuruIndia
  2. 2.Sri Jayachamarajendra College of EngineeringMysuruIndia

Personalised recommendations