Abstract
Graph clustering, also often referred to as network community detection, is the process of assigning common labels to vertices that are densely connected to each other but sparsely connected to the rest of the graph. There are many different approaches to clustering in the literature. However, in this article, we formulate the clustering problem as a combinatorial optimization problem. Our main contribution is a novel problem formulation that maximizes intra-cluster density, a statistically meaningful quantity. It requires the number of clusters, a softbound on cluster size and a penalty coefficient as parameter inputs. More importantly, it is designed to prevent common degeneracies, like the so-called “mega-clusters”. We end with some suggestions on numerical solution techniques and note that an ensemble-like optimization routine seems promising.
Note on vocabulary: Although there are subtle differences between the concepts of graph clustering and network community detection, in this document we use the two interchangeably.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Aloise, D., Caporossi, G., Hansen, P., Liberti, L., Perron, S., Ruiz, M.: Modularity maximization in networks by variable neighborhood search. In: Bader, D.A., Meyerhenke, H., Sanders, P., Wagner, D. (eds.) Graph Partitioning and Graph Clustering, 10th DIMACS Implementation Challenge Workshop, Georgia Institute of Technology, Atlanta, GA, USA, 13–14 Feb 2012, pp. 113–128 (2012). http://www.ams.org/books/conm/588/11705
Bertsimas, D., Tsitsiklis, J.: Simulated annealing. Stat. Sci. 8(1), 10–15 (1993)
Brownlee, J.: Clever Algorithms: Nature-Inspired Programming Recipes, 1st edn. Lulu.com (2011)
Creusefond, J., Largillier, T., Peyronnet, S.: Finding compact communities in large graphs. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, ASONAM ’15, pp. 1457–1464. ACM, New York, NY, USA (2015). https://doi.org/10.1145/2808797.2808868
van Dongen, S.: Graph clustering by flow simulation. Ph.D. thesis, Faculteit Wiskunde en Informatica, Universiteit Utrecht (2000)
Fan, N., Pardalos, P.M.: Linear and quadratic programming approaches for the general graph partitioning problem. J. Global Optim. 48(1), 57–71 (2010). https://doi.org/10.1007/s10898-009-9520-1
Fan, N., Pardalos, P.M.: Robust optimization of graph partitioning and critical node detection in analyzing networks. In: Proceedings of the 4th International Conference on Combinatorial Optimization and Applications—Volume Part I, COCOA’10, pp. 170–183. Springer, Berlin, Heidelberg (2010). http://dl.acm.org/citation.cfm?id=1940390.1940405
Fortunato, S.: Community detection in graphs. Phys. Rep. 486, 75–174 (2010). https://doi.org/10.1016/j.physrep.2009.11.002
Fortunato, S., Barthélemy, M.: Resolution limit in community detection. Proc. Natl. Acad. Sci. 104(1), 36–41 (2007). http://www.pnas.org/content/104/1/36.abstract
Fortunato, S., Hric, D.: Community detection in networks: a user guide. ArXiv e-prints (2016)
Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning, Second Edition: Data Mining, Inference, and Prediction, 2nd ed. Springer Series in Statistics. Springer (2009)
Holland, P.W., Laskey, K.B., Leinhardt, S.: Stochastic blockmodels: First steps. Soc. Netw. 5(2), 109–137 (1983). https://doi.org/10.1016/0378-8733(83)90021-7. http://www.sciencedirect.com/science/article/pii/0378873383900217
Jin, J.: Fast community detection by score. Ann. Stat. 43 (2015)
Kazakovtsev, L., Antamoshkin, A.: Genetic algorithm with fast greedy heuristic for clustering and location problems. Informatica (Slovenia) 38(3) (2014). http://www.informatica.si/index.php/informatica/article/view/704
Lancichinetti, A., Radicchi, F., Ramasco, J.J., Fortunato, S.: Finding statistically significant communities in networks. PLoS ONE 6, e18,961 (2011). https://doi.org/10.1371/journal.pone.0018961
von Luxburg, U.: A Tutorial on Spectral Clustering. CoRR abs/0711.0189 (2007). http://arxiv.org/abs/0711.0189
Miasnikof, P., Shestopaloff, A., Bonner, A., Lawryshyn, Y.: A statistical performance analysis of graph clustering algorithms. In: Lecture Notes in Computer Science. Springer (2018)
Nascimento, M., Pitsoulis, L.: Community detection by modularity maximization using GRASP with path relinking. Comput. Oper. Res. 40, 3121–3131 (2013)
Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E, Stat. Nonlinear, Soft Matter Phys. 69, 026,113 (2004)
Ovelgönne, M., Geyer-Schulz, A.: An ensemble learning strategy for graph clustering. In: Bader, D.A., Meyerhenke, H., Sanders, P., Wagner, D. (eds.) Graph Partitioning and Graph Clustering, 10th DIMACS Implementation Challenge Workshop, Georgia Institute of Technology, Atlanta, GA, USA, 13–14 Feb 2012, pp. 113–128 (2012). http://www.ams.org/books/conm/588/11705
Papadimitriou, C., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Dover Books on Computer Science. Dover Publications (1998). https://books.google.ca/books?id=u1RmDoJqkF4C
Prokhorenkova, L.O., Prałat, P., Raigorodskii, A.: Modularity of complex networks models. In: Bonato, A., Graham, F., Prałat, P. (eds.) Algorithms and Models for the Web Graph, pp. 115–126. Springer International Publishing, Cham (2016)
Prokhorenkova, L.O., Prałat, P., Raigorodskii, A.: Modularity in several random graph models. In: The European Conference on Combinatorics, Graph Theory and Applications (EUROCOMB’17), Electronic Notes in Discrete Mathematics 61, 947–953 (2017). https://doi.org/10.1016/j.endm.2017.07.058. http://www.sciencedirect.com/science/article/pii/S1571065317302238
Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach, pp. 92–115. Prentice-Hall Inc. (1995)
Schaeffer, S.E.: Survey: graph clustering. Comput. Sci. Rev. 1(1), 27–64 (2007). https://doi.org/10.1016/j.cosrev.2007.05.001
Tasgin, M., Herdagdelen, A., Bingol, H.: Community Detection in Complex Networks Using Genetic Algorithms. ArXiv e-prints (2007)
Weisstein, E.: Clique. MathWorld–A Wolfram Web Resource (2018). http://mathworld.wolfram.com/Clique.html
Yang, J., Leskovec, J.: Defining and Evaluating Network Communities Based on Ground-truth. CoRR abs/1205.6233 (2012). http://arxiv.org/abs/1205.6233
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2020 Springer Nature Switzerland AG
About this paper
Cite this paper
Miasnikof, P., Pitsoulis, L., Bonner, A.J., Lawryshyn, Y., Pardalos, P.M. (2020). Graph Clustering Via Intra-Cluster Density Maximization. In: Bychkov, I., Kalyagin, V., Pardalos, P., Prokopyev, O. (eds) Network Algorithms, Data Mining, and Applications. NET 2018. Springer Proceedings in Mathematics & Statistics, vol 315. Springer, Cham. https://doi.org/10.1007/978-3-030-37157-9_3
Download citation
DOI: https://doi.org/10.1007/978-3-030-37157-9_3
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-030-37156-2
Online ISBN: 978-3-030-37157-9
eBook Packages: Mathematics and StatisticsMathematics and Statistics (R0)