Abstract
COVID-19 formally termed as “2019 novel coronavirus,” is disrupting socioeconomic conditions throughout the world. Due to the unavailability of efficient ways to predict the severity level of COVID-19, governmental officials and policymakers of different countries are facing difficulties to take precautionary measures for minimizing risks. This paper presents a model trained to predict COVID-19 situation severity level using XGBoost which is a gradient boosting algorithm. To categorize severity level, SIR epidemiological method has been employed which can express the current condition of any area affected by contagious diseases like COVID-19, analyzing the number of susceptible people stands for S, the number of infected people stands for I, and the number of recovered people stands for R. By comparing the evaluation metrics of the selected machine learning model with other machine learning algorithms, it is deduced that the model performs better for less training time(speed), gives an accuracy rate of 95%, and has the ability to reduce over-fitting.
These authors contributed equally to this work.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
The Dataset is downloaded from: https://www.kaggle.com/vignesh1694/Covid-19-coronavirus.
References
Gupta, V.K., Gupta, A., Kumar, D., Sardana, A.: Prediction of Covid-19 confirmed, death, and cured cases in India using random forest model. Big Data Mining Anal. 4(2), 116–123 (2021). https://doi.org/10.26599/BDMA.2020.9020016
Kolla, B.: Analysis, prediction and evaluation of Covid-19 datasets using machine learning algorithms, vol. 8, No. 5 (2021). https://doi.org/10.30534/ijeter/2020/117852020
Coronavirus: (2021). Retrieved from https://www.who.int/health-topics/coronavirus#tab=tab_1
Pinter, G., Felde, I., Mosavi, A., et al.: Covid-19 pandemic prediction for Hungary. Hybrid Mach. Learn. Approach Math. 8, 890 (2020). https://doi.org/10.3390/math8060890
Rustam, F., et al.: Covid-19 future forecasting using supervised machine learning models. IEEE Access 8, 101489–101499 (2020). https://doi.org/10.1109/ACCESS.2020.2997311
Ardabili, S.F., Mosavi, A., Ghamisi, P., Ferdinand, F., Varkonyi-Koczy, A.R., Reuter, U., Rabczuk, T., Atkinson, P.M.: Covid-19 outbreak prediction with machine learning. Algorithms (2020). https://doi.org/10.3390/a13100249
Zoabi, Y., Deri-Rozov, S., Shomron, N.: Machine learning-based prediction of Covid-19 diagnosis based on symptoms. npj Digit. Med. 4(3). https://doi.org/10.1038/s41746-020-00372-6
Yan, L., Zhang, H., Goncalves, J., et al.: A machine learning-based model for survival prediction in patients with severe Covid-19 infection (2020)). https://doi.org/10.1101/2020.02.27.20028027
Iqbal, N., Islam, M.: Machine learning for dengue outbreak prediction: a performance evaluation of different prominent classifiers. Informatica 43, 363–371 (2019). https://doi.org/10.31449/inf.v43i3.1548
The SIR epidemic model: (2021). Retrieved from https://scipython.com/book/chapter-8
Rokach, L., Maimon, O.: Decision trees. In: Maimon, O., Rokach, L. (eds.) Data Mining and Knowledge Discovery Handbook. Springer, Boston, MA (2005). https://doi.org/10.1007/0-387-25465-X_9
Breiman, L.: Random forests. Mach. Learn. 45, 5–32 (2001). https://doi.org/10.1023/A:1010933404324
Sperandei, S.: Understanding logistic regression analysis. Biochemia Medica 24(1), 12–18 (2014). https://doi.org/10.11613/BM.2014.003
Noh, B., Youm, C., Goh, E., Lee, M., Park, H., Jeon, H., Kim, O.: XGBoost based machine learning approach to predict the risk of fall in older adults using gait outcomes. Sci. Rep. 11(1). https://doi.org/10.1038/s41598-021-91797-w
Shmilovici, A.: Support vector machines. In: Data Mining and Knowledge Discovery Handbook, pp. 257–276. https://doi.org/10.1007/0-387-25465-x_12
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Tahsin, L., Roy, S. (2022). Prediction of COVID-19 Severity Level Using XGBoost Algorithm: A Machine Learning Approach Based on SIR Epidemiological Model. In: Reddy, V.S., Prasad, V.K., Mallikarjuna Rao, D.N., Satapathy, S.C. (eds) Intelligent Systems and Sustainable Computing. Smart Innovation, Systems and Technologies, vol 289. Springer, Singapore. https://doi.org/10.1007/978-981-19-0011-2_7
Download citation
DOI: https://doi.org/10.1007/978-981-19-0011-2_7
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0010-5
Online ISBN: 978-981-19-0011-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)