Abstract
Unanticipated information in December 2019 changed the world around us. A relatively contagious disease unfolded through the SARS-CoV-2 virus that travelled throughout the globe and was declared an epidemic by WHO in March 2020. The need of examining the scenario became the inducement behind this research. The assessment of COVID-19 in India is performed from 1 April 2020 to 20 May 2021 which amassed a total of 415 instances. Further, preprocessing of the dataset is executed with the use of normalization. The experimentation is executed through the use of four ensemble strategies which are bagging, boosting, stacking and voting with four distinct machine learning algorithms linear regression, sequential minimal optimizer for regression, multilayer perceptron and Gaussian process. The splitting of the dataset is completed at 75%, and machine learning algorithms with ensemble techniques are applied. Linear regression with the bagging ensemble method gives satisfactory outcomes with the correlation coefficient of 0.935 and 0.919 for confirmed cases and recovered cases, respectively, and Gaussian process presented the best results for deceased cases. In the case of ensemble strategies, bagging indicates the best correlation coefficient in each case. Therefore, with the help of the three best algorithms, confirmed cases, recovered cases and deceased cases predictions are performed. The paper has potential implementations that can foresee the COVID-19 confirmed cases, recovered cases and deceased cases based on historic data and subsequently structure the plan for the future.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Sohrabi C et al (2020) World Health Organization declares global emergency: a review of the 2019 novel coronavirus (COVID-19). Int J Surg 76:71–76
World Health Organization (2020) Origin of SARS-CoV-2, 26 March 2020. No. WHO/2019-nCoV/FAQ/Virus_origin/2020.1. World Health Organization
Huang C et al (2020) Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395:10223:497–506
Chan JF-W et al (2020) A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet 395(10223):514–523
Kar SK et al (2021) Second wave of COVID-19 pandemic in India: barriers to the effective governmental response. EClinicalMedicine 36
Jain VK, Iyengar KP, Vaishya R (2021) Differences between first wave and second wave of COVID-19 in India. Diabetes Metabolic Syndr
Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real-time. Lancet Inf Dis 20(5):533–534. https://doi.org/10.1016/S1473-3099(20)30120-1
O’Neill A (2021) India—total population 2016–2026. Statista, 5 May 2021. www.statista.com/statistics/263766/total-population-of-india/
Bagcchi S (2021) The world’s largest COVID-19 vaccination campaign. Lancet Infect Dis 21(3):323
Majhi R et al (2020) Analysis and prediction of COVID‐19 trajectory: a machine learning approach. J Public Aff 21:e2537
Zu ZY et al (2020) Coronavirus disease 2019 (COVID-19): a perspective from China. Radiology 296(2):E15–E25
Nishiura H, Linton NM, Akhmetzhanov AR (2020) Serial interval of novel coronavirus (COVID-19) infections. Int J Infect Dis 93:284–286
Gupta R, Pal SK, Pandey G (2020) A comprehensive analysis of COVID-19 outbreak situation in India. MedRxiv
Kwekha-Rashid AS, Abduljabbar HN, Alhayani B (2021) Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Appl Nanosci 2021:1–13
Barstugan M, Ozkaya U, Ozturk S (2020) Coronavirus (covid-19) classification using CT images by machine learning methods. arXiv:2003.09424
Tiwari D et al (2021) Pandemic coronavirus disease (Covid‐19): world effects analysis and prediction using machine‐learning techniques. Expert Syst
Mahfouz A et al (2020) Ensemble classifiers for network intrusion detection using a novel network attack dataset. Future Internet 12(11):180
Tiwari A, Prakash A (2014) Improving classification of J48 algorithm using bagging, boosting and blending ensemble methods on SONAR dataset using WEKA. Int J Eng Tech Res 2(9):207–209
Kotsiantis S, Pintelas P (2004) Combining bagging and boosting. Int J Comput Intell 1(4):324–333
Abdar M et al (2020) A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recogn Lett 132:123–131
Guidotti E, Ardia D (2020) COVID-19 data hub. J Open Sour Softw 5(51):2376. https://doi.org/10.21105/joss.02376
Wan X (2019) Influence of feature scaling on the convergence of gradient iterative algorithm. J Phys Conf Ser 1213(3)
Bhandari A (2020) Feature scaling: standardization vs normalization. Analytics Vidhya, 31 July 2020. www.analyticsvidhya.com/blog/2020/04/feature-scaling-machine-learning-normalization-standardization/
Smith TC, Frank E (2016) Introducing machine learning concepts with WEKA.Statistical genomics. Humana Press, New York, NY, pp 353–378
Jo J-M (2019) Effectiveness of normalization pre-processing of big data to the machine learning performance. J Korea Inst Electron Commun Sci 14(3):547–552
Frank E, Hall MA, Witten IH (2016) The WEKA workbench. Online appendix for “data mining: practical machine learning tools and techniques, 4th edn. Morgan Kaufmann
Baskin II et al (2017) Bagging and boosting of regression models. In: Tutorials in chemoinformatics, pp 249–255
Džeroski S, Ženko B (2004) Is combining classifiers with stacking better than selecting the best one? Mach Learn 54(3):255–273
Cheng M, Hsieh C-J, Dhillon I (2020) Voting based ensemble improves the robustness of defensive models. arXiv:2011.14031
Almarabeh H (2017) Analysis of students’ performance by using different data mining classifiers. Int J Mod Educ Comput Sci 9(8):9
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Chauhan, N.K., Goel, C., Singh, P. (2023). Comparative Analysis of Machine Learning Algorithms with Ensemble Techniques and Forecasting COVID-19 Cases in India. In: Gupta, M., Ghatak, S., Gupta, A., Mukherjee, A.L. (eds) Artificial Intelligence on Medical Data. Lecture Notes in Computational Vision and Biomechanics, vol 37. Springer, Singapore. https://doi.org/10.1007/978-981-19-0151-5_6
Download citation
DOI: https://doi.org/10.1007/978-981-19-0151-5_6
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-0150-8
Online ISBN: 978-981-19-0151-5
eBook Packages: EngineeringEngineering (R0)