Abstract
In this paper, we present an approach to predicting the spread of the coronavirus per country from country-specific socio-economic indicators. To this end, we first describe in detail how the growth of COVID-19 cases can be represented with a parameterized exponential curve. Then, having collected and pre-processed various country rankings, statistics, and indicators of countries' socio-economic circumstances, we constructed a dataset of 116 countries. To predict the behavior of the coronavirus spread, we employed machine learning algorithms, namely regression and classification approaches. Since the dataset is unlabelled, we also made use of clustering methods. The results of the regression analysis indicate a strong relationship between countries’ socio-economic indicators and the behavior of the number of new coronavirus cases. In contrast, owing to the lack of a labeled dataset, the classification method performs rather poorly.
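As a rough illustration of representing case growth with a parameterized exponential curve, the sketch below fits a two-parameter model with SciPy. The functional form `a * exp(b * t)`, the names `exp_growth` and `fit_growth`, the initial guess, and the synthetic data are all assumptions made for this example; the abstract does not state the exact parameterization or fitting procedure used in the paper.

```python
import numpy as np
from scipy.optimize import curve_fit


def exp_growth(t, a, b):
    """Assumed parameterization: cases(t) = a * exp(b * t)."""
    return a * np.exp(b * t)


def fit_growth(days, cases):
    """Fit (a, b) by nonlinear least squares.

    method="lm" selects Levenberg-Marquardt, which SciPy supports
    for unconstrained problems such as this one.
    """
    params, _ = curve_fit(exp_growth, days, cases, p0=(1.0, 0.1), method="lm")
    return params


# Synthetic, noise-free data for illustration only (not real case counts).
days = np.arange(30, dtype=float)
cases = exp_growth(days, 5.0, 0.15)

a_hat, b_hat = fit_growth(days, cases)
print(f"a = {a_hat:.3f}, b = {b_hat:.3f}")
```

Under this assumed form, the fitted pair `(a, b)` summarizes a country's growth curve and could serve as a regression target for the socio-economic indicators.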
Notes
1. At the moment of writing this paper.
Copyright information
© 2021 Springer Nature Switzerland AG
About this paper
Cite this paper
Altwlkany, K., Ražanica, E., Mijatović, N., Delić, A. (2021). Predicting the Coronavirus Spread Based on Countries’ Long-Term Socio-Economic Indicators. In: Hasic Telalovic, J., Kantardzic, M. (eds) Mediterranean Forum – Data Science Conference. MeFDATA 2020. Communications in Computer and Information Science, vol 1343. Springer, Cham. https://doi.org/10.1007/978-3-030-72805-2_1
DOI: https://doi.org/10.1007/978-3-030-72805-2_1
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-72804-5
Online ISBN: 978-3-030-72805-2
eBook Packages: Computer Science, Computer Science (R0)