Abstract
Node embeddings present a powerful method of embedding graph-structured data into a low dimensional space while preserving local node information. Clustering is a common preprocessing task on unsupervised data utilized to get the best insight into the input dataset. The most prominent clustering algorithm is the K-Means algorithm. In this paper, we formulate clustering as an optimization problem using different objective functions following the idea of searching for the best fit centroid-based cluster exemplars. We also apply several nature-inspired optimization algorithms since the K-Means algorithm is trapped in local optima during its execution. We demonstrate our cluster frameworks’ capability on several graph clustering datasets used in node embeddings and node clustering tasks. Performance evaluation and comparison of our frameworks with the K-Means algorithm are demonstrated and discussed in detail. We end this paper with a discussion on the impact of the objective function’s choice on the clustering results.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Alihodzic, A., Tuba, E., Tuba, M.: An upgraded bat algorithm for tuning extreme learning machines for data classification. In: Proceedings of the Genetic and Evolutionary Computation Conference Companion, GECCO 2017, pp. 125–126. Association for Computing Machinery, New York (2017). https://doi.org/10.1145/3067695.3076088
Alihodzic, A., Tuba, M.: Improved bat algorithm applied to multilevel image thresholding. Sci. World J. 2014, 16 (2014). Article ID 176718. https://doi.org/10.1155/2014/176718
Dorigo, M., Blum, C.: Ant colony optimization theory: a survey. Theor. Comput. Sci. 344(2–3), 243–278 (2005). https://doi.org/10.1016/j.tcs.2005.05.020
Grover, A., Leskovec, J.: Node2vec: scalable feature learning for networks. In: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2016, pp. 855–864. Association for Computing Machinery, New York (2016). https://doi.org/10.1145/2939672.2939754
Li, J., Tan, Y.: The bare bones fireworks algorithm: a minimalist global optimizer. Appl. Soft Comput. 62, 454–462 (2018). https://doi.org/10.1016/j.asoc.2017.10.046
Maulik, U., Bandyopadhyay, S.: Genetic algorithm-based clustering technique. Pattern Recognit. 33(9), 1455–1465 (2000). https://doi.org/10.1016/S0031-3203(99)00137-5
Perozzi, B., Al-Rfou, R., Skiena, S.: Deepwalk: Online learning of social representations. In: Proceedings of the 20th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, KDD 2014, pp. 701–710. Association for Computing Machinery, New York (2014). https://doi.org/10.1145/2623330.2623732
Rozemberczki, B., Davies, R., Sarkar, R., Sutton, C.: GEMSEC: graph embedding with self clustering. In: Proceedings of the 2019 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2019, pp. 65–72. ACM (2019)
Storn, R., Price, K.: Differential evolution - a simple and efficient heuristic for global optimization over continuous spaces. J. Glob. Optim. 11(4), 341–359 (1997). https://doi.org/10.1023/A:1008202821328
Tang, R., Fong, S., Yang, X., Deb, S.: Integrating nature-inspired optimization algorithms to k-means clustering. In: Seventh International Conference on Digital Information Management (ICDIM 2012), pp. 116–123 (2012). https://doi.org/10.1109/ICDIM.2012.6360145
Tuba, E., Dolicanin-Djekic, D., Jovanovic, R., Simian, D., Tuba, M.: Combined elephant herding optimization algorithm with k-means for data clustering. In: Satapathy, S.C., Joshi, A. (eds.) Information and Communication Technology for Intelligent Systems. SIST, vol. 107, pp. 665–673. Springer, Singapore (2019). https://doi.org/10.1007/978-981-13-1747-7_65
Tuba, E., Jovanovic, R., Hrosik, R.C., Alihodzic, A., Tuba, M.: Web intelligence data clustering by bare bone fireworks algorithm combined with k-means. In: Proceedings of the 8th International Conference on Web Intelligence, Mining and Semantics, WIMS 2018. Association for Computing Machinery, New York (2018). https://doi.org/10.1145/3227609.3227650
Tuba, M., Alihodzic, A., Bacanin, N.: Cuckoo search and bat algorithm applied to training feed-forward neural networks. In: Yang, X.-S. (ed.) Recent Advances in Swarm Intelligence and Evolutionary Computation. SCI, vol. 585, pp. 139–162. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-13826-8_8
van der Merwe, D.W., Engelbrecht, A.P.: Data clustering using particle swarm optimization. In: The 2003 Congress on Evolutionary Computation, CEC 2003, vol. 1, pp. 215–220 (2003). https://doi.org/10.1109/CEC.2003.1299577
Wang, G.G., Deb, S., Gao, X.Z., Coelho, L.D.S.: A new metaheuristic optimisation algorithm motivated by elephant herding behaviour. Int. J. Bio-Inspired Comput. 8(6), 394–409 (2017). https://doi.org/10.1504/IJBIC.2016.081335
Yang, X.-S.: Firefly algorithms for multimodal optimization. In: Watanabe, O., Zeugmann, T. (eds.) SAGA 2009. LNCS, vol. 5792, pp. 169–178. Springer, Heidelberg (2009). https://doi.org/10.1007/978-3-642-04944-6_14
Yang, X.S.: A new metaheurisitic bat-inspired algorithm. Stud. Comput. Intell. 284, 65–74 (2010). https://doi.org/10.1007/978-3-642-12538-6_6
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2022 Springer Nature Switzerland AG
About this paper
Cite this paper
Alihodžić, A., Chahin, M., Čunjalo, F. (2022). New Clustering Techniques of Node Embeddings Based on Metaheuristic Optimization Algorithms. In: Lirkov, I., Margenov, S. (eds) Large-Scale Scientific Computing. LSSC 2021. Lecture Notes in Computer Science, vol 13127. Springer, Cham. https://doi.org/10.1007/978-3-030-97549-4_23
Download citation
DOI: https://doi.org/10.1007/978-3-030-97549-4_23
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-97548-7
Online ISBN: 978-3-030-97549-4
eBook Packages: Computer ScienceComputer Science (R0)