A Comprehensive Analysis of the Most Common Hard Clustering Algorithms

Vardhan, Aditya; Sarmah, Priyanshu; Das, Arunav

doi:10.1007/978-3-030-33846-6_6

Aditya Vardhan¹²,
Priyanshu Sarmah¹² &
Arunav Das¹²

Part of the book series: Lecture Notes in Networks and Systems ((LNNS,volume 98))

Included in the following conference series:

International Conference on Inventive Computation Technologies

917 Accesses

Abstract

From past decades, Clustering is the process of observation that set assignment into subsets called clusters. It is an unsupervised method and can be grouped as hard and soft clustering. Hard clustering methods assign the sample point to a specific cluster whereas soft clustering methods give a probability of assignment to all clusters. In this paper, we have tried to give intuition to some of the popular hard clustering methods with their associated algorithms.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 259.00; Price excludes VAT (USA)

Softcover Book: USD 329.99; Price excludes VAT (USA)

Hardcover Book: USD 329.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Esteves, R.M., Hacker, T., Rong, C.: Competitive K-Means, a new accurate and distributed K-Means algorithm for large datasets. In: 2013 IEEE 5th International Conference on Cloud Computing Technology and Science, Bristol, pp. 17–24 (2013). https://doi.org/10.1109/cloudcom.2013.89
Shi, N., Liu, X., Guan, Y.: Research on k-means clustering algorithm: an improved k-means clustering algorithm. In: 2010 Third International Symposium on Intelligent Information Technology and Security Informatics, Jinggangshan, pp. 63–67 (2010)
Google Scholar
https://www.coursera.org/learn/machine-learning
Advantages & Disadvantages of k-Means and Hierarchical Clustering (Unsupervised Learning). Machine Learning for Language Technology ML4LT (2016). Marina San, Department of Linguistics and Philology, Uppsala University
Google Scholar
Smiti, A., Elouedi, Z.: Dynamic DBSCAN-GM clustering algorithm. In: 2015 16th IEEE International Symposium on Computational Intelligence and Informatics (CINTI), pp. 311–316 (2015)
Google Scholar
Bansal, K., Bansal, M.: Dynamic data clustering and visualization using FDClust algorithm. In: 2017 International Conference on Computer Communication and Informatics (ICCCI), pp. 1–5 (2017)
Google Scholar
Zhang, L., Deng, S., Li, S.: Analysis of power consumer behaviour based on the complementation of K-means and DBSCAN. In: 2017 IEEE Conference on Energy Internet and Energy System Integration (EI2), Beijing, pp. 1–5 (2017). https://doi.org/10.1109/ei2.2017.8245490
Ghuman, S.S.: Clustering techniques - a review. Int. J. Comput. Sci. Mob. Comput. 5, 524–530 (2016)
Google Scholar
Xu, R., Wunsch II, D.: Survey of clustering algorithms (2005)
Google Scholar
Wang, L., Li, M., Han, X., Zheng, K.: An improved density-based spatial clustering of application with noise. Int. J. Comput. Appl. (2018)
Google Scholar
Yu, Y., Zhao, J., Wang, X., Wang, Q., Zhang, Y.: Cludoop: an efficient distributed density-based clustering for big data using hadoop. Int. J. Distrib. Sens. Netw. 11, 579391 (2015)
Article Google Scholar
Smiti, A., Elouedi, Z.: DBSCAN-GM: an improved clustering method based on Gaussian Means and DBSCAN techniques. In: 2012 IEEE 16th International Conference on Intelligent Engineering Systems (INES) (2012)
Google Scholar
Jain, A.K., Narasimha Murty, M., Flynn, P.J.: Data clustering: a review. ACM Comput. Surv. (CSUR) 31(3), 264–323 (1999)
Article Google Scholar
Steinbach, M., Karypis, G., Kumar, V.: A comparison of document clustering techniques. In: KDD Workshop on Text Mining, vol. 400, no. 1 (2000)
Google Scholar

Download references

Author information

Authors and Affiliations

Odisha, India
Aditya Vardhan, Priyanshu Sarmah & Arunav Das

Authors

Aditya Vardhan
View author publications
You can also search for this author in PubMed Google Scholar
Priyanshu Sarmah
View author publications
You can also search for this author in PubMed Google Scholar
Arunav Das
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Aditya Vardhan .

Editor information

Editors and Affiliations

Computer Science and Engineering, RVS Technical Campus, Coimbatore, Tamil Nadu, India
S. Smys
Faculty of Electrical Engineering, Czech Technical University in Prague, Prague 6, Czech Republic
Robert Bestak
Departamento de Engenharia Informática, Universidade de Coimbra, Coimbra, Portugal
Álvaro Rocha

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Vardhan, A., Sarmah, P., Das, A. (2020). A Comprehensive Analysis of the Most Common Hard Clustering Algorithms. In: Smys, S., Bestak, R., Rocha, Á. (eds) Inventive Computation Technologies. ICICIT 2019. Lecture Notes in Networks and Systems, vol 98. Springer, Cham. https://doi.org/10.1007/978-3-030-33846-6_6

Download citation

DOI: https://doi.org/10.1007/978-3-030-33846-6_6
Published: 03 November 2019
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-33845-9
Online ISBN: 978-3-030-33846-6
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics