Skip to main content

Graph Clustering Via Intra-Cluster Density Maximization

  • Conference paper
  • First Online:
Network Algorithms, Data Mining, and Applications (NET 2018)

Part of the book series: Springer Proceedings in Mathematics & Statistics ((PROMS,volume 315))

Included in the following conference series:

  • 726 Accesses

Abstract

Graph clustering, also often referred to as network community detection, is the process of assigning common labels to vertices that are densely connected to each other but sparsely connected to the rest of the graph. There are many different approaches to clustering in the literature. However, in this article, we formulate the clustering problem as a combinatorial optimization problem. Our main contribution is a novel problem formulation that maximizes intra-cluster density, a statistically meaningful quantity. It requires the number of clusters, a softbound on cluster size and a penalty coefficient as parameter inputs. More importantly, it is designed to prevent common degeneracies, like the so-called “mega-clusters”. We end with some suggestions on numerical solution techniques and note that an ensemble-like optimization routine seems promising.

Note on vocabulary: Although there are subtle differences between the concepts of graph clustering and network community detection, in this document we use the two interchangeably.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 109.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Aloise, D., Caporossi, G., Hansen, P., Liberti, L., Perron, S., Ruiz, M.: Modularity maximization in networks by variable neighborhood search. In: Bader, D.A., Meyerhenke, H., Sanders, P., Wagner, D. (eds.) Graph Partitioning and Graph Clustering, 10th DIMACS Implementation Challenge Workshop, Georgia Institute of Technology, Atlanta, GA, USA, 13–14 Feb 2012, pp. 113–128 (2012). http://www.ams.org/books/conm/588/11705

  2. Bertsimas, D., Tsitsiklis, J.: Simulated annealing. Stat. Sci. 8(1), 10–15 (1993)

    Article  Google Scholar 

  3. Brownlee, J.: Clever Algorithms: Nature-Inspired Programming Recipes, 1st edn. Lulu.com (2011)

    Google Scholar 

  4. Creusefond, J., Largillier, T., Peyronnet, S.: Finding compact communities in large graphs. In: Proceedings of the 2015 IEEE/ACM International Conference on Advances in Social Networks Analysis and Mining 2015, ASONAM ’15, pp. 1457–1464. ACM, New York, NY, USA (2015). https://doi.org/10.1145/2808797.2808868

  5. van Dongen, S.: Graph clustering by flow simulation. Ph.D. thesis, Faculteit Wiskunde en Informatica, Universiteit Utrecht (2000)

    Google Scholar 

  6. Fan, N., Pardalos, P.M.: Linear and quadratic programming approaches for the general graph partitioning problem. J. Global Optim. 48(1), 57–71 (2010). https://doi.org/10.1007/s10898-009-9520-1

  7. Fan, N., Pardalos, P.M.: Robust optimization of graph partitioning and critical node detection in analyzing networks. In: Proceedings of the 4th International Conference on Combinatorial Optimization and Applications—Volume Part I, COCOA’10, pp. 170–183. Springer, Berlin, Heidelberg (2010). http://dl.acm.org/citation.cfm?id=1940390.1940405

  8. Fortunato, S.: Community detection in graphs. Phys. Rep. 486, 75–174 (2010). https://doi.org/10.1016/j.physrep.2009.11.002

    Article  MathSciNet  Google Scholar 

  9. Fortunato, S., Barthélemy, M.: Resolution limit in community detection. Proc. Natl. Acad. Sci. 104(1), 36–41 (2007). http://www.pnas.org/content/104/1/36.abstract

  10. Fortunato, S., Hric, D.: Community detection in networks: a user guide. ArXiv e-prints (2016)

    Google Scholar 

  11. Hastie, T., Tibshirani, R., Friedman, J.: The Elements of Statistical Learning, Second Edition: Data Mining, Inference, and Prediction, 2nd ed. Springer Series in Statistics. Springer (2009)

    Google Scholar 

  12. Holland, P.W., Laskey, K.B., Leinhardt, S.: Stochastic blockmodels: First steps. Soc. Netw. 5(2), 109–137 (1983). https://doi.org/10.1016/0378-8733(83)90021-7. http://www.sciencedirect.com/science/article/pii/0378873383900217

  13. Jin, J.: Fast community detection by score. Ann. Stat. 43 (2015)

    Google Scholar 

  14. Kazakovtsev, L., Antamoshkin, A.: Genetic algorithm with fast greedy heuristic for clustering and location problems. Informatica (Slovenia) 38(3) (2014). http://www.informatica.si/index.php/informatica/article/view/704

  15. Lancichinetti, A., Radicchi, F., Ramasco, J.J., Fortunato, S.: Finding statistically significant communities in networks. PLoS ONE 6, e18,961 (2011). https://doi.org/10.1371/journal.pone.0018961

  16. von Luxburg, U.: A Tutorial on Spectral Clustering. CoRR abs/0711.0189 (2007). http://arxiv.org/abs/0711.0189

  17. Miasnikof, P., Shestopaloff, A., Bonner, A., Lawryshyn, Y.: A statistical performance analysis of graph clustering algorithms. In: Lecture Notes in Computer Science. Springer (2018)

    Google Scholar 

  18. Nascimento, M., Pitsoulis, L.: Community detection by modularity maximization using GRASP with path relinking. Comput. Oper. Res. 40, 3121–3131 (2013)

    Article  MathSciNet  Google Scholar 

  19. Newman, M.E.J., Girvan, M.: Finding and evaluating community structure in networks. Phys. Rev. E, Stat. Nonlinear, Soft Matter Phys. 69, 026,113 (2004)

    Google Scholar 

  20. Ovelgönne, M., Geyer-Schulz, A.: An ensemble learning strategy for graph clustering. In: Bader, D.A., Meyerhenke, H., Sanders, P., Wagner, D. (eds.) Graph Partitioning and Graph Clustering, 10th DIMACS Implementation Challenge Workshop, Georgia Institute of Technology, Atlanta, GA, USA, 13–14 Feb 2012, pp. 113–128 (2012). http://www.ams.org/books/conm/588/11705

  21. Papadimitriou, C., Steiglitz, K.: Combinatorial Optimization: Algorithms and Complexity. Dover Books on Computer Science. Dover Publications (1998). https://books.google.ca/books?id=u1RmDoJqkF4C

  22. Prokhorenkova, L.O., Prałat, P., Raigorodskii, A.: Modularity of complex networks models. In: Bonato, A., Graham, F., Prałat, P. (eds.) Algorithms and Models for the Web Graph, pp. 115–126. Springer International Publishing, Cham (2016)

    Google Scholar 

  23. Prokhorenkova, L.O., Prałat, P., Raigorodskii, A.: Modularity in several random graph models. In: The European Conference on Combinatorics, Graph Theory and Applications (EUROCOMB’17), Electronic Notes in Discrete Mathematics 61, 947–953 (2017). https://doi.org/10.1016/j.endm.2017.07.058. http://www.sciencedirect.com/science/article/pii/S1571065317302238

  24. Russell, S., Norvig, P.: Artificial Intelligence: A Modern Approach, pp. 92–115. Prentice-Hall Inc. (1995)

    Google Scholar 

  25. Schaeffer, S.E.: Survey: graph clustering. Comput. Sci. Rev. 1(1), 27–64 (2007). https://doi.org/10.1016/j.cosrev.2007.05.001

  26. Tasgin, M., Herdagdelen, A., Bingol, H.: Community Detection in Complex Networks Using Genetic Algorithms. ArXiv e-prints (2007)

    Google Scholar 

  27. Weisstein, E.: Clique. MathWorld–A Wolfram Web Resource (2018). http://mathworld.wolfram.com/Clique.html

  28. Yang, J., Leskovec, J.: Defining and Evaluating Network Communities Based on Ground-truth. CoRR abs/1205.6233 (2012). http://arxiv.org/abs/1205.6233

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Pierre Miasnikof .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2020 Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Miasnikof, P., Pitsoulis, L., Bonner, A.J., Lawryshyn, Y., Pardalos, P.M. (2020). Graph Clustering Via Intra-Cluster Density Maximization. In: Bychkov, I., Kalyagin, V., Pardalos, P., Prokopyev, O. (eds) Network Algorithms, Data Mining, and Applications. NET 2018. Springer Proceedings in Mathematics & Statistics, vol 315. Springer, Cham. https://doi.org/10.1007/978-3-030-37157-9_3

Download citation

Publish with us

Policies and ethics