An Efficient K-Means Clustering Algorithm for Analysing COVID-19

Zubair, Md.; Asif Iqbal, MD.; Shil, Avijeet; Haque, Enamul; Moshiul Hoque, Mohammed; Sarker, Iqbal H.

doi:10.1007/978-3-030-73050-5_43

Md. Zubair,
MD. Asif Iqbal,
Avijeet Shil,
Enamul Haque,
Mohammed Moshiul Hoque (ORCID: orcid.org/0000-0001-8806-708X) &
Iqbal H. Sarker (ORCID: orcid.org/0000-0003-1740-5517)

Part of the book series: Advances in Intelligent Systems and Computing (AISC, volume 1375)

Included in the following conference series:

International Conference on Hybrid Intelligent Systems

Abstract

COVID-19 has hit the world like a storm, creating pandemic conditions in most countries. The whole world is trying to overcome this pandemic, and better health care quality may help a country tackle it. Clustering countries with similar health care quality provides insight into the quality of health care across countries. In machine learning and data science, the K-means clustering algorithm is typically used to create clusters based on similarity. In this paper, we propose an efficient K-means clustering method that determines the initial centroids of the clusters efficiently. Based on this proposed method, we determine health care quality clusters of countries using COVID-19 datasets. Experimental results show that, compared with the traditional K-means clustering algorithm, our proposed method reduces the number of iterations and the execution time needed to analyze COVID-19 data.
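
To illustrate why the choice of initial centroids affects the iteration count, the following is a minimal sketch of standard K-means (Lloyd's algorithm) that accepts caller-supplied starting centroids and reports how many iterations it ran. It is not the initialization method proposed in the paper, which the abstract does not describe; the function name `kmeans`, the toy `points`, and both sets of seeds are assumptions made only for this example.

```python
import math


def kmeans(points, init_centroids, max_iter=100):
    """Standard Lloyd's algorithm with caller-supplied initial centroids.

    Returns (centroids, labels, iterations), where iterations is the pass
    on which the assignments stopped changing.
    """
    centroids = [tuple(c) for c in init_centroids]
    labels = None
    for iteration in range(1, max_iter + 1):
        # Assignment step: attach each point to its nearest centroid.
        new_labels = [
            min(range(len(centroids)), key=lambda j: math.dist(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            return centroids, labels, iteration
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(len(centroids)):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # keep the old centroid if a cluster is empty
                centroids[j] = tuple(sum(x) / len(members) for x in zip(*members))
    return centroids, labels, max_iter


# Hypothetical toy data: two well-separated groups of four points each.
points = [(0, 0), (0, 1), (1, 0), (1, 1),
          (10, 10), (10, 11), (11, 10), (11, 11)]

# Seeds placed one per group versus seeds both placed in the same group.
cent_good, labels_good, iters_good = kmeans(points, [(0, 0), (10, 10)])
cent_poor, labels_poor, iters_poor = kmeans(points, [(0, 0), (0, 1)])

print("well-placed seeds:", iters_good, "iterations")
print("poorly placed seeds:", iters_poor, "iterations")
```

On this toy data both runs reach the same clusters, but the run seeded with one centroid per group stops earlier. That is the kind of saving a better initialization strategy aims for.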

Notes

1. It is one of the metrics used by the Oxford COVID-19 Government Response Tracker [25]. It gives a picture of the strongest measures a country has enforced.

References

World Health Organization: Coronavirus disease 2019 (COVID-19): situation report, 82 (2020)
Sarker, I.H., Hoque, M.M., Uddin, M.K., Alsanoosy, T.: Mobile data science and intelligent apps: concepts, AI-based modeling and research directions. Mobile Netw. Appl. 1–19 (2020)
Sarker, I.H., Kayes, A., Badsha, S., Alqahtani, H., Watters, P., Ng, A.: Cybersecurity data science: an overview from machine learning perspective. J. Big Data 7(1), 1–29 (2020)
Sarker, I.H.: Context-aware rule learning from smartphone data: survey, challenges and future directions. J. Big Data 6(1), 95 (2019)
Han, J., Pei, J., Kamber, M.: Data Mining: Concepts and Techniques. Elsevier, Amsterdam (2011)
Sarker, I.H., Colman, A., Kabir, M.A., Han, J.: Individualized time-series segmentation for mining mobile phone user behavior. Comput. J. 61(3), 349–368 (2018)
Vattani, A.: The hardness of k-means clustering in the plane (2009). http://cseweb.ucsd.edu/avattani/papers/kmeans_hardness.pdf 617
Rahim, M.S., Ahmed, T.: An initial centroid selection method based on radial and angular coordinates for k-means algorithm. In: 2017 20th International Conference of Computer and Information Technology (ICCIT), pp. 1–6. IEEE (2017)
Kumar, A., Gupta, S.C.: A new initial centroid finding method based on dissimilarity tree for k-means algorithm. arXiv preprint arXiv:1509.03200 (2015)
Mahmud, M.S., Rahman, M.M., Akhtar, M.N.: Improvement of k-means clustering algorithm with better initial centroids based on weighted average. In: 2012 7th International Conference on Electrical and Computer Engineering, pp. 647–650. IEEE (2012)
Goyal, M., Kumar, S.: Improving the initial centroids of k-means clustering algorithm to generalize its applicability. J. Inst. Eng. (India) Ser. B 95(4), 345–350 (2014)
Na, S., Xumin, L., Yong, G.: Research on k-means clustering algorithm: an improved k-means clustering algorithm. In: 2010 Third International Symposium on Intelligent Information Technology and Security Informatics, pp. 63–67. IEEE (2010)
Lakshmi, M.A., Daniel, G.V., Rao, D.S.: Initial centroids for k-means using nearest neighbors and feature means. In: Soft Computing and Signal Processing, pp. 27–34. Springer (2019)
Vadyala, S.R., Betgeri, S.N., Sherer, E.A., Amritphale, A.: Prediction of the number of covid-19 confirmed cases based on k-means-LSTM. arXiv preprint arXiv:2006.14752 (2020)
Poompaavai, A., Manimannan, G.: Clustering study of Indian states and union territories affected by coronavirus (COVID-19) using k-means algorithm. Int. J. Data Mining Emerg. Technol. 9(2), 43–51 (2019)
Sonbhadra, S.K., Agarwal, S., Nagabhushan, P.: Target specific mining of covid-19 scholarly articles using one-class approach. arXiv preprint arXiv:2004.11706 (2020)
Berkhin, P.: A survey of clustering data mining techniques. In: Grouping Multidimensional Data, pp. 25–71. Springer (2006)
Sarker, I.H., Abushark, Y.B., Khan, A.I.: ContextPCA: predicting context-aware smartphone apps usage based on machine learning techniques. Symmetry 12(4), 499 (2020)
Total covid-19 tests performed by country - humanitarian data exchange. https://data.humdata.org/dataset/total-covid-19-tests-performed-by-country. Accessed 09 Mar 2020
Covid-19 testing policies, Sep 3, 2020. https://ourworldindata.org/grapher/covid-19-testing-policy?region=Asia. Accessed 09 Mar 2020
Uncover covid-19 challenge | kaggle. https://www.kaggle.com/roche-data-science-coalition/uncover. Accessed 09 Mar 2020
Abdi, H., Williams, L.J.: Principal component analysis. Wiley Interdisc. Rev. Comput. Stat. 2(4), 433–459 (2010)
Altman, D.G., Bland, J.M.: Statistics notes: quartiles, quintiles, centiles, and other quantiles. BMJ 309(6960), 996–996 (1994)
Arthur, D., Vassilvitskii, S.: k-means++: the advantages of careful seeding. Technical report, Stanford (2006)
Coronavirus government response tracker | blavatnik school of government. https://www.bsg.ox.ac.uk/research/research-projects/coronavirus-government-response-tracker. Accessed 09 June 2020
Kodinariya, T.M., Makwana, P.R.: Review on determining number of cluster in k-means clustering. Int. J. 1(6), 90–95 (2013)

Author information

Authors and Affiliations

Department of Computer Science and Engineering, Chittagong University of Engineering and Technology, Chittagong, 4349, Bangladesh
Md. Zubair, MD. Asif Iqbal, Avijeet Shil, Mohammed Moshiul Hoque & Iqbal H. Sarker
McMaster University, Hamilton, Canada
Enamul Haque

Corresponding author

Correspondence to Iqbal H. Sarker.

Editor information

Editors and Affiliations

Scientific Network for Innovation and Research Excellence, Machine Intelligence Research Labs (MIR Labs), Auburn, WA, USA
Ajith Abraham
Institute for Information Systems, University of Applied Sciences and Arts Northwestern Switzerland, Olten, Solothurn, Switzerland
Thomas Hanne
Division of Graduate Studies, Tijuana Institute of Technology, Tijuana, Mexico
Oscar Castillo
Scientific Network for Innovation and Research Excellence, Machine Intelligence Research Labs (MIR Labs), Auburn, AL, USA
Niketa Gandhi
Universidade Federal da Bahia, Salvador, Bahia, Brazil
Tatiane Nogueira Rios
Computer Science and Information Engineering, National University of Kaohsiung, Kaohsiung, Taiwan
Tzung-Pei Hong

About this paper

Cite this paper

Zubair, M., Asif Iqbal, M., Shil, A., Haque, E., Moshiul Hoque, M., Sarker, I.H. (2021). An Efficient K-Means Clustering Algorithm for Analysing COVID-19. In: Abraham, A., Hanne, T., Castillo, O., Gandhi, N., Nogueira Rios, T., Hong, TP. (eds) Hybrid Intelligent Systems. HIS 2020. Advances in Intelligent Systems and Computing, vol 1375. Springer, Cham. https://doi.org/10.1007/978-3-030-73050-5_43

DOI: https://doi.org/10.1007/978-3-030-73050-5_43
Published: 17 April 2021
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-73049-9
Online ISBN: 978-3-030-73050-5
eBook Packages: Intelligent Technologies and Robotics (R0)
