Exact algorithms for the minimum s-club partitioning problem
Graph clustering (partitioning) is a helpful tool in understanding complex systems and analyzing their structure and internal properties. One approach for graph clustering is based on partitioning the graph into cliques. However, clique models are too restrictive and prone to errors given imperfect data. Thus, using clique relaxations instead may provide a more reasonable and applicable partitioning of the graph. An s-club is a distance-based relaxation of a clique and is formally defined as a subset of vertices inducing a subgraph with a diameter of at most s. In this work, we study the minimum s-club partitioning problem, which is to partition the graph into a minimum number of non-overlapping s-club clusters. Integer programming techniques and combinatorial branch-and-bound framework are employed to develop exact algorithms to solve this problem. We also study and compare the computational performance of the proposed algorithms for the special cases of \(s=2\) and \(s=3\) on a test-bed of randomly generated instances and real-life graphs.
Keywordss-Clubs Partitioning Graph-based clustering Integer programming Combinatorial branch-and-bound
This material is based upon work supported by the AFRL Mathematical Modeling and Optimization Institute. Partial support by AFOSR under Grant FA8651-14-2-0005, and NSF Grant CMMI-1538493 is also gratefully acknowledged.
- Berry, N., Ko, T., Moy, T., Smrcka, J., Turnley, J., & Wu, B. (2004) Emergent clique formation in terrorist recruitment. The AAAI-04 workshop on agent organizations: Theory and practice, July 25, 2004, San Jose, California. http://www.cs.uu.nl/~virginia/aotp/papers.htm.
- Chang, J. M., Yang, J. S., & Peng, S. L. (2014). On the complexity of graph clustering with bounded diameter. In 2014 international computer science and engineering conference (ICSEC) (pp. 18–22). IEEE.Google Scholar
- Csardi, G., & Nepusz, T. (2006). The igraph software package for complex network research. InterJournal Complex Systems, 1695(5), 1–9.Google Scholar
- DIMACS (2012) Algorithm implementation challenge: Graph partitioning and graph clustering. The tenth DIMACS implementation challenge. http://dimacs.rutgers.edu/Challenges/.
- European Bioinformatics Institute. (2017). IntAct molecular interaction database. http://www.ebi.ac.uk/intact/. Accessed May, 2017.
- Fernandess, Y., & Malkhi, D. (2002). K-clustering in wireless ad hoc networks. In Proceedings of the second ACM international workshop on principles of mobile computing (pp. 31–37). ACM.Google Scholar
- Garey, M. R., & Johnson, D. S. (1979). Computers and intractability: A guide to the theory of NP-completeness. New York: WH Freeman.Google Scholar
- González, R. C., & Tou, J. T. (1974). Pattern recognition principles. Applied mathematics and computation. Reading, MA: Addison-Wesley.Google Scholar
- GUROBI. (2016). Gurobi optimizer. http://www.gurobi.com.
- Kershenbaum, A. (1993). Telecommunications network design algorithms. New York: McGraw-Hill.Google Scholar
- Li, Y., Lao, L., & Cui, J. H. (2006). Sdc: A distributed clustering protocol for peer-to-peer networks. In International conference on research in networking (pp. 1234–1239). Springer.Google Scholar
- Pasupuleti, S. (2008). Detection of protein complexes in protein interaction networks using \(n\)-clubs. In Proceedings of the 6th European conference on evolutionary computation, machine learning and data mining in bioinformatics. Lecture notes in computer science (Vol. 4973, pp. 153–164). Springer.Google Scholar
- Sageman, M. (2004). Understanding terrorist networks. Philadelphia, PA: University of Pennsylvania Press.Google Scholar
- Shahinpour, S., & Butenko, S. (2013b). Dinstance-based clique relaxations in networks: \(s\)-cliques and \(s\)-clubs. In B. I. Goldengorin, V. A. Kalyagin, & P. M. Pardalos (Eds.), Models, algorithms, and technologies for network analysis. Springer proceedings in mathematics and statistics (Vol. 59, pp. 149–174). New York: Springer.Google Scholar