Abstract
Coronavirus these days becomes an alarming topic as the loss of lives is increasing every day. Coronavirus was born in China (Wuhan) and has spread globally within no time. Machine learning is used to forecast the future outcomes almost in all areas like computers, medical, business, security, and many more. In this study, various machine learning techniques such as Linear Regression, SVM, Random Forest, Logistic Regression, and KNN are used to forecast the future tendency of SARS-CoV-2. The polynomial feature with degree 3 is used to enhance the accuracy of the model. Dataset is collected from Johns Hopkins University dashboard. It contains 3 separate files of Covid-19 such as confirmed case, recovered case, and death case dataset. This article has four phases, i.e., data preprocessing, data classification, training, testing, and parametric evaluation such as RMSE, MSE, MAE, and R2-score. In this study, Linear Regression (LR) shows excellent outcome in contrast to other approaches. LR has a minimum mean squared error value and an obtained accuracy of 99.92% for Covid-19 confirmed cases, 99.73% for recovered cases, and 99.60% for death cases. Results show that if humans become unsocial and the administration imposes strict lockdown, there is a great chance to overcome this disease within a small period.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Ahmed MH (2020) Dexamethasone for the treatment of coronavirus disease (COVID-19): a review. http://doi.org/10.1007/s42399-020-00610-8
WHO | World Health Organization. https://www.who.int/. Accessed 24 Sept 2021
Pikoulis E et al (2021) The effect of the COVID pandemic lockdown measures on surgical emergencies: experience and lessons learned from a Greek Tertiary Hospital. World J Emerg Surg 16(1):1–8. https://doi.org/10.1186/s13017-021-00364-1
How to protect yourself & others | CDC. https://www.cdc.gov/coronavirus/2019-ncov/prevent-getting-sick/prevention.html. Accessed 19 Jan 2022
Khan MA, Abbas S, Khan KM, Ghamdi MAA, Rehman A (2020) Intelligent forecasting model of covid-19 novel coronavirus outbreak empowered with deep extreme learning machine. Comput Mater Contin 64(3):1329–1342. https://doi.org/10.32604/cmc.2020.011155
Caso V, Federico A (2020) No lockdown for neurological diseases during COVID19 pandemic infection. Neurol Sci 41(5):999–1001. https://doi.org/10.1007/s10072-020-04389-3
Rameshrad M, Ghafoori M, Mohammadpour AH, Nayeri MJD, Hosseinzadeh H (2020) A comprehensive review on drug repositioning against coronavirus disease 2019 (COVID19). Naunyn Schmiedebergs Arch Pharmacol 393(7):1137–1152. https://doi.org/10.1007/s00210-020-01901-6
SARS-CoV-2 resources—NCBI. https://www.ncbi.nlm.nih.gov/sars-cov-2/. Accessed 19 Jan 2022
Bajracharya T, Kumar RS (2020) COVID-19 (novel coronavirus) a global disease, vol 19
Biancalana E, Parolini F, Mengozzi A, Solini A (2021) Short-term impact of COVID-19 lockdown on metabolic control of patients with well-controlled type 2 diabetes: a single-centre observational study. Acta Diabetol 58(4):431–436. https://doi.org/10.1007/s00592-020-01637-y
Ndwandwe D, Wiysonge CS (2021) COVID-19 vaccines. Curr Opin Immunol 71:111–116. https://doi.org/10.1016/j.coi.2021.07.003
Nemati M, Ansary J, Nemati N (2020) Machine-learning approaches in COVID-19 survival analysis and discharge-time likelihood prediction using clinical data. Patterns 1(5):100074. https://doi.org/10.1016/j.patter.2020.100074
Lauer SA et al (2020) The incubation period of coronavirus disease 2019 (CoVID-19) from publicly reported confirmed cases: estimation and application. Ann Intern Med 172(9):577–582. https://doi.org/10.7326/M20-0504
Sharma DK, Subramanian M, Malyadri P, Reddy BS, Sharma M, Tahreem M (2021) Classification of COVID-19 by using supervised optimized machine learning technique. Mater Today Proc. http://doi.org/10.1016/j.matpr.2021.11.388
Ghafouri-Fard S, Mohammad-Rahimi H, Motie P, Minabi MAS, Taheri M, Nateghinia S (2021) Application of machine learning in the prediction of COVID-19 daily new cases: a scoping review. Heliyon 7(10):e08143. https://doi.org/10.1016/j.heliyon.2021.e08143
Pahar M, Klopper M, Warren R, Niesler T (2021) COVID-19 cough classification using machine learning and global smartphone recordings. Comput Biol Med 135:104572. http://doi.org/10.1016/j.compbiomed.2021.104572
Yadav M, Perumal M, Srinivas M (2020) Analysis on novel coronavirus (COVID-19) using machine learning methods. Chaos Solitons Fractals 139:110050. https://doi.org/10.1016/j.chaos.2020.110050
Dairi A, Harrou F, Zeroual A, Hittawe MM, Sun Y (2021) Comparative study of machine learning methods for COVID-19 transmission forecasting. J Biomed Inform 118:103791. http://doi.org/10.1016/j.jbi.2021.103791
Gothai E, Thamilselvan R, Rajalaxmi RR, Sadana RM, Ragavi A, Sakthivel R (2021) Materials today: proceedings prediction of COVID-19 growth and trend using machine learning approach. Mater Today Proc. http://doi.org/10.1016/j.matpr.2021.04.051
Punn NS, Sonbhadra SK, Agarwal S (2020) COVID-19 epidemic analysis using machine learning and deep learning algorithms. medRxiv. http://doi.org/10.1101/2020.04.08.20057679
Bhadana V, Jalal AS, Pathak P (2020) A comparative study of machine learning models for COVID-19 prediction in India. In: 4th IEEE conference on information and communication technology CICT 2020, Dec 2020. http://doi.org/10.1109/CICT51604.2020.9312112
Rustam F et al (2020) COVID-19 future forecasting using supervised machine learning models. IEEE Access 8:101489–101499. https://doi.org/10.1109/ACCESS.2020.2997311
Kumari P, Toshniwal D (2020) Real-time estimation of COVID-19 cases using machine learning and mathematical models—the case of India. In: 2020 IEEE 15th international conference on industrial and information systems, ICIIS 2020, pp 369–374. http://doi.org/10.1109/ICIIS51140.2020.9342735
Yao H et al (2020) Severity detection for the coronavirus disease 2019 (COVID-19) patients using a machine learning model based on the blood and urine tests. Front Cell Dev Biol 8:1–10. https://doi.org/10.3389/fcell.2020.00683
Hossen MS, Karmoker D (2020) Predicting the probability of Covid-19 recovered in south Asian countries based on healthy diet pattern using a machine learning approach. In: 2020 2nd international conference on sustainable technologies for industry 4.0, STI 2020, pp 19–20. http://doi.org/10.1109/STI50764.2020.9350439
Liu Y, Xiao Y (2020) Analysis and prediction of COVID-19 in Xinjiang based on machine learning. In: Proceedings 2020 5th international conference on information science, computer technology and transportation, ISCTT 2020, pp 382–385. http://doi.org/10.1109/ISCTT51595.2020.00072
Khanday AMUD, Rabani ST, Khan QR, Rouf N, Mohi Ud Din M (2020) Machine learning based approaches for detecting COVID-19 using clinical text data. Int J Inf Technol 12(3):731–739. http://doi.org/10.1007/s41870-020-00495-9
Aljameel SS, Khan IU, Aslam N, Aljabri M, Alsulmi ES (2021) Machine learning-based model to predict the disease severity and outcome in COVID-19 patients. Sci Program 2021. http://doi.org/10.1155/2021/5587188
De Souza FSH, Hojo-Souza NS, Dos Santos EB, Da Silva CM, Guidoni DL (2021) Predicting the disease outcome in COVID-19 positive patients through machine learning: a retrospective cohort study with Brazilian data. Front Artif Intell 4:1–13. https://doi.org/10.3389/frai.2021.579931
Pourhomayoun M, Shakibi M (2021) Predicting mortality risk in patients with COVID-19 using machine learning to help medical decision-making. Smart Health 20:100178. http://doi.org/10.1016/j.smhl.2020.100178
Abirami RS, Kumar GS (2022) Comparative study based on analysis of coronavirus disease (COVID-19) detection and prediction using machine learning models. SN Comput Sci 3(1). http://doi.org/10.1007/s42979-021-00965-2
Anupam A, Mohan NJ, Sahoo S, Chakraborty S (2021) Preliminary diagnosis of COVID-19 based on cough sounds using machine learning algorithms. In: Proceedings of 5th international conference on intelligent computing and control systems, ICICCS 2021, pp 1391–1397. http://doi.org/10.1109/ICICCS51141.2021.9432324
Chowdhury AA, Hasan KT, Hoque KKS (2021) Analysis and prediction of COVID-19 pandemic in Bangladesh by using ANFIS and LSTM network. Cognit Comput 13(3):761–770. https://doi.org/10.1007/s12559-021-09859-0
Gupta JP, Singh A, Kumar RK (2021) A computer-based disease prediction and medicine recommendation system using machine learning approach. Academia.Edu 12(3):673–683. https://doi.org/10.34218/IJARET.12.3.2021.0
Kwekha-Rashid AS, Abduljabbar HN, Alhayani B (2021) Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Appl Nanosci (0123456789). http://doi.org/10.1007/s13204-021-01868-7
COVID-19 data from John Hopkins University | Kaggle. https://www.kaggle.com/antgoldbloom/covid19-data-from-john-hopkins-university. Accessed 23 Jan 2022
CSSEGISandData/COVID-19: novel coronavirus (COVID-19) cases, provided by JHU CSSE. https://github.com/CSSEGISandData/COVID-19. Accessed 06 Jan 2022
Hota HS, Handa R, Shrivas AK (2021) 27—COVID-19 pandemic in India: forecasting using machine learning techniques. Elsevier Inc., Amsterdam
Manco L, Maffei N, Strolin S, Vichi S, Bottazzi L, Strigari L (2021) Basic of machine learning and deep learning in imaging for medical physicists. Phys Medica 83:194–205. https://doi.org/10.1016/j.ejmp.2021.03.026
Vrindavanam J, Srinath R, Shankar HH, Nagesh G (2021) Machine learning based COVID-19 cough classification models—a comparative analysis. In: Proceedings 5th international conference on computing methodologies and communication, ICCMC 2021, pp 420–426. http://doi.org/10.1109/ICCMC51019.2021.9418358
Date P, Potok T (2021) Adiabatic quantum linear regression. Sci Rep 11(1):1–11. https://doi.org/10.1038/s41598-021-01445-6
Pandey G, Chaudhary P, Gupta R, Pal S (2020) SEIR and regression model based COVID-19 outbreak predictions in India, pp 1–10. http://doi.org/10.1101/2020.04.01.20049825
Arslan H, Arslan H (2021) A new COVID-19 detection method from human genome sequences using CpG island features and KNN classifier. Eng Sci Technol Int J 24(4):839–847. https://doi.org/10.1016/j.jestch.2020.12.026
Fan Z, Xie JK, Wang ZY, Liu PC, Qu SJ, Huo L (1930) Image classification method based on improved KNN algorithm. J Phys Conf Ser 1:2021. https://doi.org/10.1088/1742-6596/1930/1/012009
Albagmi FM, Alansari A, Al Shawan DS, AlNujaidi HY, Olatunji SO (2022) Prediction of generalized anxiety levels during the Covid-19 pandemic: a machine learning-based modeling approach. Inform Med Unlocked 28:100854. http://doi.org/10.1016/j.imu.2022.100854
Tigga NP, Garg S (2020) ScienceDirect prediction of type 2 diabetes using machine learning classification methods. Procedia Comput Sci 167(2019):706–716. https://doi.org/10.1016/j.procs.2020.03.336
Utami NA, Maharani W, Atastina I (2021) Personality classification of facebook users according to big five personality using SVM (Support Vector Machine) method. Procedia Comput Sci 179(2020):177–184. https://doi.org/10.1016/j.procs.2020.12.023
Majumder S, Aich A, Das S (2021) Sentiment analysis of people during lockdown period of COVID-19 using SVM and logistic regression analysis. SSRN Electron J. http://doi.org/10.2139/ssrn.3801039
Faris H, Habib M, Faris M, Alomari M, Alomari A (2020) Medical speciality classification system based on binary particle swarms and ensemble of one vs. rest support vector machines. J Biomed Inform 109:103525. http://doi.org/10.1016/j.jbi.2020.103525
YeÅŸilkanat CM (2020) Spatio-temporal estimation of the daily cases of COVID-19 in worldwide using random forest machine learning algorithm. Chaos Solitons Fractals 140. http://doi.org/10.1016/j.chaos.2020.110210
Zhu L, Zhou X, Zhang C (2021) Rapid identification of high-quality marine shale gas reservoirs based on the oversampling method and random forest algorithm. Artif Intell Geosci 2:76–81. https://doi.org/10.1016/j.aiig.2021.12.001
Rustam Z, Saragih G (2021) Prediction insolvency of insurance companies using random forest. J Phys Conf Ser 1752(1). http://doi.org/10.1088/1742-6596/1752/1/012036
Wibowo FW, Wihayati (2021) Prediction modelling of COVID-19 outbreak in Indonesia using a logistic regression model. J Phys Conf Ser 1803(1). http://doi.org/10.1088/1742-6596/1803/1/012015
Jain H, Khunteta A, Srivastava S (2020) Churn prediction in telecommunication using logistic regression and logit boost. Procedia Comput Sci 167(2019):101–112. https://doi.org/10.1016/j.procs.2020.03.187
Moses KP, Devadas MD (2012) An approach to reduce root mean square error in toposheets. Eur J Sci Res 91(2):268–274
Wang W, Lu Y (2018) Analysis of the mean absolute error (MAE) and the root mean square error (RMSE) in assessing rounding model. IOP Conf Ser Mater Sci Eng 324(1). http://doi.org/10.1088/1757-899X/324/1/012049
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Kaur, G., Kaur, P., Kaur, N., Kaur, P. (2023). Forecasting Prediction of Covid-19 Outbreak Using Linear Regression. In: Jacob, I.J., Kolandapalayam Shanmugam, S., Izonin, I. (eds) Data Intelligence and Cognitive Informatics. Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-6004-8_17
Download citation
DOI: https://doi.org/10.1007/978-981-19-6004-8_17
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-6003-1
Online ISBN: 978-981-19-6004-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)