Skip to main content

Research and Application on Ensemble Learning Methods

  • Conference paper
  • First Online:
Proceedings of 2019 Chinese Intelligent Automation Conference (CIAC 2019)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 586))

Included in the following conference series:

  • 1423 Accesses

Abstract

As shown in previous data, diabetes has led to the increasing mortality and considerable financial expenditure in the US. It is necessary to find out how to making correct diagnosis and prescription of diabetes plays an important role in helping patients. That is why we choose the dataset of diabetic inpatients having diagnosis at hospitals in the US, and predict how different treatments and medications influence patient outcomes. We use the class attribute of readmission number to obtain the results.

Because of the large and biased dataset, we firstly remove attributes with high missing value rate, and reduce the imbalance classes of instances by over-sampling and under-sampling, then followed by the attribute selection through various methods, such as the Correlation-based feature selection, the Chi-Squared Attribute Evaluator, the Information Gain Attribute Evaluator, etc.

Three classification methods C4.5, RIPPER, and Random Forests are used to predict the classification in Weka. In addition, we also use the ensemble learning methods including bagging and boosting to improve the stability and accuracy. From the analysing results, we can see that C4.5 and Ripper perform better, and both bagging and boosting increase the accuracy rate to differing degrees because both algorithms are somewhat unstable. There is no doubt that Random Forests is the best performer among all classification methods we use, and after using boosting, we see big increases in the values of the evaluation metrics we use. The final outcome is much better than random guess.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Centers for Disease Control and Prevention (2014) National diabetes statistics report: estimates of diabetes and its burden in the United States, 2014. US Department of Health and Human Services, Atlanta

    Google Scholar 

  2. Gale EAM, Gillespie KM (2001) Diabetes and gender. Diabetologia 44(1):3–15 Web May 2016

    Article  Google Scholar 

  3. Gebel E (2001) How diabetes differs for men and women. Diabetes Forecast, October 2011. N.p., Web May 2016

    Google Scholar 

  4. (2011) “Men ‘develop diabetes more easily’”. Men ‘get diabetes more easily than women’, 05 October 2011. N.p., Web May 2016

    Google Scholar 

  5. Frese T, Sandholzer H (2013) The epidemiology of type 1 diabetes mellitus. Type 1 diabetes. In: Escher A (ed) InTech. https://doi.org/10.5772/52893. http://www.intechopen.com/books/type-1-diabetes/the-epidemiology-of-type-1-diabetes-mellitus

    Google Scholar 

  6. Cherney K (2013) Age of onset for type 2 diabetes: know your risk. Healthline, 13 August 2013. N.p., Web May 2016

    Google Scholar 

  7. (2016) “Type 1 diabetes”. Type 1 diabetes. N.p., n.d., Web May 2016

    Google Scholar 

  8. Ingraham C (2015) The average american woman now weighs as much as the average 1960s man. Washington post. The Washington post, 12 June 2015. Web May 2016

    Google Scholar 

  9. “Unexplained weight loss”. - Reasons, symptoms causes. N.p., n.d., Web May 2016

    Google Scholar 

  10. Brownlee J (2015) 8 tactics to combat imbalanced classes in your machine learning dataset-machine learning mastery. Mach Learn Mastery, 19 August 2015. N.p., Web May 2016

    Google Scholar 

  11. Brownlee J (2014) An introduction to feature selection - machine learning mastery. Mach Learn Mastery, 06 October 2014. N.p., Web May 2016

    Google Scholar 

  12. “CfsSubsetEval” CfsSubsetEval. N.p, n.d., Web May 2016

    Google Scholar 

  13. Hall MA (1999) Correlation-based feature selection for machine learning. Thesis, The University of Waikato, N.p: n.p, n.d. Print

    Google Scholar 

  14. Ladha L (2011) Feature selection methods and algorithms. Int J Comput Sci Eng (IJCSE) 3(5):1787–1797 Web May 2016

    Google Scholar 

  15. “Decision forest – ALGLIB”. Decision forest – ALGLIB. N.p., n.d., Web May 2016

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yuzhong Wang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Wang, Y. (2020). Research and Application on Ensemble Learning Methods. In: Deng, Z. (eds) Proceedings of 2019 Chinese Intelligent Automation Conference. CIAC 2019. Lecture Notes in Electrical Engineering, vol 586. Springer, Singapore. https://doi.org/10.1007/978-981-32-9050-1_17

Download citation

Publish with us

Policies and ethics