Abstract
A new clustering algorithm based on the concept of graph connectivity is introduced. The idea is to develop a meaningful graph representation for data, where each resulting sub-graph corresponds to a cluster with highly similar objects connected by edge. The proposed algorithm has a fairly strong theoretical basis that supports its originality and computational efficiency. Further, some useful guidelines are provided so that the algorithm can be tuned to optimize the well-designed quality indices. Numerical evidences show that the proposed algorithm can provide a very good clustering accuracy for a number of benchmark data and has a relatively low computational complexity compared to some sophisticated clustering methods.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Hansen, P., Jaumard, B.: Cluster analysis and mathematical programming. Math. Program. 79, 191–215 (1997)
Sneath, P.: The application of computers to taxonomy. J. Gen. Microbiol. 17, 201–226 (1957)
Sorensen, T.: A method of establishing groups of equal amplitude in plant sociology based on similarity of species content and its application to analyzes of the vegetation on danish commons. Biologiske Skrifter 5, 1–34 (1948)
Kaufman, L., Rousseeuw, P.: Finding Groups in Data: An Introduction to Cluster Analysis. Wiley, New York (1990)
Lloyd, S.: Least squares quantization in PCM. IEEE Trans. Inf. Theory 28, 129–137 (1982)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning. Springer, New York (2001)
Jain, A.K., Dubes, R.C.: Algorithms for Clustering Data. Prentice-Hall, New Jersey (1988)
Everitt, B., Landau, S.: Cluster Analysis. Amold, London (2001)
Harary, F.: Graph Theory. Addison-Wesley, Boston (1969)
Karypis, G., Han, E., Kumar, V.: Chameleon: hierarchical clustering using dynamic modeling. IEEE Comput. 32, 68–75 (1999)
Zadeh, L.: Fuzzy sets. Inf. Control 8, 338–353 (1965)
Kohonen, T.: Self-organizing Maps. Springer, New York (2001)
Schölkopf, B., Smola, A., Müller, K.: Nonlinear component analysis as a kernel eigenvalue problem. Neural Comput. 10, 1299–1319 (1998)
Matula, D.W.: The cohesive strength of graphs. In: Chartrand, G., Kapoor, S.F. (eds.) The Many Facets of Graph Theory. Lecture Notes in Mathematics, vol. 110, pp. 215–221. Springer, Berlin (1969)
Matula, D.W.: k-components, clusters and slicings in graphs. SIAM J. Appl. Math. 22, 459–480 (1972)
Cherng, J., Lo, M.: A hypergraph based clustering algorithm for spatial data sets. In: Proceedings of the IEEE International Conference on Data Mining, pp. 83–90 (2001)
Estivill-Castro, V., Lee, I.: AMOEBA: hierarchical clustering based on spatial proximity using Delaunay diagram. In: Proceedings of the 9th Symposium on Spatial Data Handling, pp. 7a.26–7a.41 (1999)
Hartuv, E., Shamir, R.: A clustering algorithm based on graph connectivity. Inf. Process. Lett. 76, 175–181 (2000)
Sharan, R., Shamir, R.: CLICK: a clustering algorithm with applications to gene expression analysis. In: Proceedings of the 8th International Conference on Intelligent Systems for Molecular Biology, pp. 307–316 (2000)
Ben-Dor, A., Shamir, R., Yakhini, Z.: Clustering gene expression patterns. J. Comput. Biol. 6, 281–297 (1999)
Zhou, Y., Cheng, H., Yu, J.X.: Graph clustering based on structural/attribute similarities. Proc. VLDB Endow. 2, 718–729 (2009)
Parimala, M., Lopez, D.: Graph clustering based on structural attribute neighborhood similarity (SANS). In: Proceedings of the IEEE International Conference on Electrical, Computer and Communication Technologies (ICECCT), pp. 1–4 (2015)
Milligan, G., Cooper, M.: An examination of procedures for determining the number of clusters in a data set. Psychometrika 50, 159–179 (1985)
Blum, M., Floyd, R.W., Pratt, V.R., Rivest, R.L., Tarjan, R.E.: The bounds for selection. J. Comput. Syst. Sci. 7, 448–461 (1973)
Moore, E.F.: The shortest path through a maze. In: Proceedings of the International Symposium on the Theory of Switching, pp. 285–292 (1959)
Stoer, M., Wagner, F.: A simple min-cut algorithm. J. ACM 44, 585–591 (1997)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2019 Springer Nature Switzerland AG
About this paper
Cite this paper
Li, YF., Lu, LH., Hung, YC. (2019). A New Clustering Algorithm Based on Graph Connectivity. In: Arai, K., Kapoor, S., Bhatia, R. (eds) Intelligent Computing. SAI 2018. Advances in Intelligent Systems and Computing, vol 858. Springer, Cham. https://doi.org/10.1007/978-3-030-01174-1_33
Download citation
DOI: https://doi.org/10.1007/978-3-030-01174-1_33
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-01173-4
Online ISBN: 978-3-030-01174-1
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)