Skip to main content

Prediction of COVID-19 Severity Level Using XGBoost Algorithm: A Machine Learning Approach Based on SIR Epidemiological Model

  • Conference paper
  • First Online:
Intelligent Systems and Sustainable Computing

Part of the book series: Smart Innovation, Systems and Technologies ((SIST,volume 289))

Abstract

COVID-19 formally termed as “2019 novel coronavirus,” is disrupting socioeconomic conditions throughout the world. Due to the unavailability of efficient ways to predict the severity level of COVID-19, governmental officials and policymakers of different countries are facing difficulties to take precautionary measures for minimizing risks. This paper presents a model trained to predict COVID-19 situation severity level using XGBoost which is a gradient boosting algorithm. To categorize severity level, SIR epidemiological method has been employed which can express the current condition of any area affected by contagious diseases like COVID-19, analyzing the number of susceptible people stands for S, the number of infected people stands for I, and the number of recovered people stands for R. By comparing the evaluation metrics of the selected machine learning model with other machine learning algorithms, it is deduced that the model performs better for less training time(speed), gives an accuracy rate of 95%, and has the ability to reduce over-fitting.

These authors contributed equally to this work.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 299.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    The Dataset is downloaded from: https://www.kaggle.com/vignesh1694/Covid-19-coronavirus.

References

  1. Gupta, V.K., Gupta, A., Kumar, D., Sardana, A.: Prediction of Covid-19 confirmed, death, and cured cases in India using random forest model. Big Data Mining Anal. 4(2), 116–123 (2021). https://doi.org/10.26599/BDMA.2020.9020016

    Article  Google Scholar 

  2. Kolla, B.: Analysis, prediction and evaluation of Covid-19 datasets using machine learning algorithms, vol. 8, No. 5 (2021). https://doi.org/10.30534/ijeter/2020/117852020

  3. Coronavirus: (2021). Retrieved from https://www.who.int/health-topics/coronavirus#tab=tab_1

  4. Pinter, G., Felde, I., Mosavi, A., et al.: Covid-19 pandemic prediction for Hungary. Hybrid Mach. Learn. Approach Math. 8, 890 (2020). https://doi.org/10.3390/math8060890

    Article  Google Scholar 

  5. Rustam, F., et al.: Covid-19 future forecasting using supervised machine learning models. IEEE Access 8, 101489–101499 (2020). https://doi.org/10.1109/ACCESS.2020.2997311

    Article  Google Scholar 

  6. Ardabili, S.F., Mosavi, A., Ghamisi, P., Ferdinand, F., Varkonyi-Koczy, A.R., Reuter, U., Rabczuk, T., Atkinson, P.M.: Covid-19 outbreak prediction with machine learning. Algorithms (2020). https://doi.org/10.3390/a13100249

  7. Zoabi, Y., Deri-Rozov, S., Shomron, N.: Machine learning-based prediction of Covid-19 diagnosis based on symptoms. npj Digit. Med. 4(3). https://doi.org/10.1038/s41746-020-00372-6

  8. Yan, L., Zhang, H., Goncalves, J., et al.: A machine learning-based model for survival prediction in patients with severe Covid-19 infection (2020)). https://doi.org/10.1101/2020.02.27.20028027

  9. Iqbal, N., Islam, M.: Machine learning for dengue outbreak prediction: a performance evaluation of different prominent classifiers. Informatica 43, 363–371 (2019). https://doi.org/10.31449/inf.v43i3.1548

  10. The SIR epidemic model: (2021). Retrieved from https://scipython.com/book/chapter-8

  11. Rokach, L., Maimon, O.: Decision trees. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA (2005). https://doi.org/10.1007/0-387-25465-X_9

  12. Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324

  13. Sperandei, S.: Understanding logistic regression analysis. Biochemia Medica 24(1), 12–18 (2014). https://doi.org/10.11613/BM.2014.003

  14. Noh, B., Youm, C., Goh, E., Lee, M., Park, H., Jeon, H., Kim, O.: XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes. Sci. Rep. 11(1). https://doi.org/10.1038/s41598-021-91797-w

  15. Shmilovici, A.: Support vector machines. In: Data Mining and Knowledge Discovery Handbook, pp. 257–276. https://doi.org/10.1007/0-387-25465-x_12

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Labeba Tahsin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tahsin, L., Roy, S. (2022). Prediction of COVID-19 Severity Level Using XGBoost Algorithm: A Machine Learning Approach Based on SIR Epidemiological Model. In: Reddy, V.S., Prasad, V.K., Mallikarjuna Rao, D.N., Satapathy, S.C. (eds) Intelligent Systems and Sustainable Computing. Smart Innovation, Systems and Technologies, vol 289. Springer, Singapore. https://doi.org/10.1007/978-981-19-0011-2_7

Download citation

Publish with us

Policies and ethics