Partitioning Biological Networks into Highly Connected Clusters with Maximum Edge Coverage
We introduce the combinatorial optimization problem Highly Connected Deletion, which asks for removing as few edges as possible from a graph such that the resulting graph consists of highly connected components. We show that Highly Connected Deletion is NP-hard and provide a fixed-parameter algorithm and a kernelization. We propose exact and heuristic solution strategies, based on polynomial-time data reduction rules and integer linear programming with column generation. The data reduction typically identifies 85 % of the edges that need to be deleted for an optimal solution; the column generation method can then optimally solve protein interaction networks with up to 5 000 vertices and 12 000 edges.
KeywordsColumn Generation Protein Interaction Network Annotation Term Large Connected Component Edge Deletion
Unable to display preview. Download preview PDF.
- 1.Aloise, D., Cafieri, S., Caporossi, G., Hansen, P., Perron, S., Liberti, L.: Column generation algorithms for exact modularity maximization in networks. Physical Review E 82, 046112 (2010)Google Scholar
- 3.Boyle, E.I., Weng, S., Gollub, J., Jin, H., Botstein, D., Cherry, J.M., Sherlock, G.: GO:TermFinder–open source software for accessing gene ontology information and finding significantly enriched gene ontology terms associated with a list of genes. Bioinformatics 20(18), 3710–3715 (2004)CrossRefGoogle Scholar
- 4.Chang, W.-C., Vakati, S., Krause, R., Eulenstein, O.: Exploring biological interaction networks with tailored weighted quasi-bicliques. BMC Bioinformatics 13(S-10), S16 (2012)Google Scholar
- 6.Chekuri, C., Goldberg, A.V., Karger, D.R., Levine, M.S., Stein, C.: Experimental study of minimum cut algorithms. In: Proc. 8th SODA, pp. 324–333 (1997)Google Scholar
- 7.van Dongen, S.: Graph Clustering by Flow Simulation. PhD thesis, University of Utrecht (2000)Google Scholar
- 10.Hayes, W., Sun, K., Pržulj, N.: Graphlet-based measures are suitable for biological network comparison. Bioinformatics (to appear, 2013)Google Scholar
- 11.Jiang, D., Pei, J.: Mining frequent cross-graph quasi-cliques. ACM Transactions on Knowledge Discovery from Data 2(4), 16:1–16:42 (2009)Google Scholar
- 14.Niedermeier, R.: Invitation to Fixed-Parameter Algorithms. OUP (2006)Google Scholar
- 16.Ronhovde, P., Nussinov, Z.: Local resolution-limit-free Potts model for community detection. Physical Review E 81(4), 046114 (2010)Google Scholar
- 19.van Rooij, J.M.M., van Kooten Niekerk, M.E., Bodlaender, H.L.: Partition into triangles on bounded degree graphs. Theory of Computing Systems (to appear, 2013)Google Scholar