Skip to main content

Comparative Analysis of Machine Learning Algorithms with Ensemble Techniques and Forecasting COVID-19 Cases in India

  • Conference paper
  • First Online:
Artificial Intelligence on Medical Data

Abstract

Unanticipated information in December 2019 changed the world around us. A relatively contagious disease unfolded through the SARS-CoV-2 virus that travelled throughout the globe and was declared an epidemic by WHO in March 2020. The need of examining the scenario became the inducement behind this research. The assessment of COVID-19 in India is performed from 1 April 2020 to 20 May 2021 which amassed a total of 415 instances. Further, preprocessing of the dataset is executed with the use of normalization. The experimentation is executed through the use of four ensemble strategies which are bagging, boosting, stacking and voting with four distinct machine learning algorithms linear regression, sequential minimal optimizer for regression, multilayer perceptron and Gaussian process. The splitting of the dataset is completed at 75%, and machine learning algorithms with ensemble techniques are applied. Linear regression with the bagging ensemble method gives satisfactory outcomes with the correlation coefficient of 0.935 and 0.919 for confirmed cases and recovered cases, respectively, and Gaussian process presented the best results for deceased cases. In the case of ensemble strategies, bagging indicates the best correlation coefficient in each case. Therefore, with the help of the three best algorithms, confirmed cases, recovered cases and deceased cases predictions are performed. The paper has potential implementations that can foresee the COVID-19 confirmed cases, recovered cases and deceased cases based on historic data and subsequently structure the plan for the future.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 139.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 179.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 179.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Notes

  1. 1.

    https://www.cs.waikato.ac.nz/ml/weka.

References

  1. Sohrabi C et al (2020) World Health Organization declares global emergency: a review of the 2019 novel coronavirus (COVID-19). Int J Surg 76:71–76

    Google Scholar 

  2. World Health Organization (2020) Origin of SARS-CoV-2, 26 March 2020. No. WHO/2019-nCoV/FAQ/Virus_origin/2020.1. World Health Organization

    Google Scholar 

  3. Huang C et al (2020) Clinical features of patients infected with 2019 novel coronavirus in Wuhan, China. Lancet 395:10223:497–506

    Google Scholar 

  4. Chan JF-W et al (2020) A familial cluster of pneumonia associated with the 2019 novel coronavirus indicating person-to-person transmission: a study of a family cluster. Lancet 395(10223):514–523

    Google Scholar 

  5. Kar SK et al (2021) Second wave of COVID-19 pandemic in India: barriers to the effective governmental response. EClinicalMedicine 36

    Google Scholar 

  6. Jain VK, Iyengar KP, Vaishya R (2021) Differences between first wave and second wave of COVID-19 in India. Diabetes Metabolic Syndr

    Google Scholar 

  7. Dong E, Du H, Gardner L. An interactive web-based dashboard to track COVID-19 in real-time. Lancet Inf Dis 20(5):533–534. https://doi.org/10.1016/S1473-3099(20)30120-1

  8. O’Neill A (2021) India—total population 2016–2026. Statista, 5 May 2021. www.statista.com/statistics/263766/total-population-of-india/

  9. Bagcchi S (2021) The world’s largest COVID-19 vaccination campaign. Lancet Infect Dis 21(3):323

    Article  Google Scholar 

  10. Majhi R et al (2020) Analysis and prediction of COVID‐19 trajectory: a machine learning approach. J Public Aff 21:e2537

    Google Scholar 

  11. Zu ZY et al (2020) Coronavirus disease 2019 (COVID-19): a perspective from China. Radiology 296(2):E15–E25

    Google Scholar 

  12. Nishiura H, Linton NM, Akhmetzhanov AR (2020) Serial interval of novel coronavirus (COVID-19) infections. Int J Infect Dis 93:284–286

    Article  Google Scholar 

  13. Gupta R, Pal SK, Pandey G (2020) A comprehensive analysis of COVID-19 outbreak situation in India. MedRxiv

    Google Scholar 

  14. Kwekha-Rashid AS, Abduljabbar HN, Alhayani B (2021) Coronavirus disease (COVID-19) cases analysis using machine-learning applications. Appl Nanosci 2021:1–13

    Google Scholar 

  15. Barstugan M, Ozkaya U, Ozturk S (2020) Coronavirus (covid-19) classification using CT images by machine learning methods. arXiv:2003.09424

  16. Tiwari D et al (2021) Pandemic coronavirus disease (Covid‐19): world effects analysis and prediction using machine‐learning techniques. Expert Syst

    Google Scholar 

  17. Mahfouz A et al (2020) Ensemble classifiers for network intrusion detection using a novel network attack dataset. Future Internet 12(11):180

    Google Scholar 

  18. Tiwari A, Prakash A (2014) Improving classification of J48 algorithm using bagging, boosting and blending ensemble methods on SONAR dataset using WEKA. Int J Eng Tech Res 2(9):207–209

    Google Scholar 

  19. Kotsiantis S, Pintelas P (2004) Combining bagging and boosting. Int J Comput Intell 1(4):324–333

    Google Scholar 

  20. Abdar M et al (2020) A new nested ensemble technique for automated diagnosis of breast cancer. Pattern Recogn Lett 132:123–131

    Google Scholar 

  21. Guidotti E, Ardia D (2020) COVID-19 data hub. J Open Sour Softw 5(51):2376. https://doi.org/10.21105/joss.02376

    Article  Google Scholar 

  22. Wan X (2019) Influence of feature scaling on the convergence of gradient iterative algorithm. J Phys Conf Ser 1213(3)

    Google Scholar 

  23. Bhandari A (2020) Feature scaling: standardization vs normalization. Analytics Vidhya, 31 July 2020. www.analyticsvidhya.com/blog/2020/04/feature-scaling-machine-learning-normalization-standardization/

  24. Smith TC, Frank E (2016) Introducing machine learning concepts with WEKA.Statistical genomics. Humana Press, New York, NY, pp 353–378

    Google Scholar 

  25. Jo J-M (2019) Effectiveness of normalization pre-processing of big data to the machine learning performance. J Korea Inst Electron Commun Sci 14(3):547–552

    Google Scholar 

  26. Frank E, Hall MA, Witten IH (2016) The WEKA workbench. Online appendix for “data mining: practical machine learning tools and techniques, 4th edn. Morgan Kaufmann

    Google Scholar 

  27. Baskin II et al (2017) Bagging and boosting of regression models. In: Tutorials in chemoinformatics, pp 249–255

    Google Scholar 

  28. Džeroski S, Ženko B (2004) Is combining classifiers with stacking better than selecting the best one? Mach Learn 54(3):255–273

    Article  Google Scholar 

  29. Cheng M, Hsieh C-J, Dhillon I (2020) Voting based ensemble improves the robustness of defensive models. arXiv:2011.14031

  30. Almarabeh H (2017) Analysis of students’ performance by using different data mining classifiers. Int J Mod Educ Comput Sci 9(8):9

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Nidhi Kumari Chauhan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Chauhan, N.K., Goel, C., Singh, P. (2023). Comparative Analysis of Machine Learning Algorithms with Ensemble Techniques and Forecasting COVID-19 Cases in India. In: Gupta, M., Ghatak, S., Gupta, A., Mukherjee, A.L. (eds) Artificial Intelligence on Medical Data. Lecture Notes in Computational Vision and Biomechanics, vol 37. Springer, Singapore. https://doi.org/10.1007/978-981-19-0151-5_6

Download citation

  • DOI: https://doi.org/10.1007/978-981-19-0151-5_6

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-19-0150-8

  • Online ISBN: 978-981-19-0151-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics