Abstract
Identifying highly connected subgraphs in biological networks has become a powerful tool in computational biology. By definition a highly connected graph with n vertices can only be disconnected by removing more than \(\frac{n}{2}\) of its edges. This definition, however, is not suitable for bipartite graphs, which have various applications in biology, since such graphs cannot contain highly connected subgraphs. Here, we introduce a natural modification of highly connected graphs for bipartite graphs, and prove that the problem of finding such subgraphs with the maximum number of vertices in bipartite graphs is NP-hard. To address this problem, we provide an integer linear programming solution, as well as a local search heuristic. Finally, we demonstrate the applicability of our heuristic to predict protein function by identifying highly connected subgraphs in bipartite networks that connect proteins with their experimentally established functionality.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Andreopoulos, B., An, A., Wang, X., Faloutsos, M., Schroeder, M.: Clustering by common friends finds locally significant proteins mediating modules. Bioinformatics 23(9), 1124–1131 (2007)
Ashburner, M., Ball, C.A., Blake, J.A., Botstein, D., Butler, H., Cherry, J.M., Davis, A.P., Dolinski, K., et al.: Gene ontology: tool for the unification of biology. Nat. Genet. 25(1), 25–29 (2000)
Dongbo, B., Zhao, Y., Cai, L., Xue, H., Zhu, X., Hongchao, L., Zhang, J., et al.: Topological structure analysis of the protein-protein interaction network in budding yeast. Nucleic Acids Res. 31(9), 2443–2450 (2003)
Chang, W.-C., Vakati, S., Krause, R., Eulenstein, O.: Exploring biological interaction networks with tailored weighted quasi-bicliques. BMC Bioinform. 13(10), S16 (2012)
UniProt Consortium, et al.: Uniprot: a hub for protein information. Nucleic Acids Res. 43, 989 (2014)
Dankelmann, P., Volkmann, L.: New sufficient conditions for equality of minimum degree and edge-connectivity. Ars Combinatoria 40, 270–278 (1995)
Friedberg, I.: Automated protein function prediction the genomic challenge. Briefings Bioinform. 7(3), 225–242 (2006)
Geva, G., Sharan, R.: Identification of protein complexes from co-immunoprecipitation data. Bioinformatics 27(1), 111–117 (2011)
Hartuv, E., Schmitt, A.O., Lange, J., Meier-Ewert, S., Lehrach, H., Shamir, R.: An algorithm for clustering cDNA fingerprints. Genomics 66(3), 249–256 (2000)
Hartuv, E., Shamir, R.: A clustering algorithm based on graph connectivity. Inf. Process. Lett. 76(4), 175–181 (2000)
Hüffner, F., Komusiewicz, C., Liebtrau, A., Niedermeier, R.: Partitioning biological networks into highly connected clusters with maximum edge coverage. IEEE/ACM Trans. Comput. Biol. Bioinf. (TCBB) 11(3), 455–467 (2014)
Hüffner, F., Komusiewicz, C., Sorge, M.: Finding highly connected subgraphs. In: Italiano, G.F., Margaria-Steffen, T., Pokorný, J., Quisquater, J.-J., Wattenhofer, R. (eds.) SOFSEM 2015-Testing. LNCS, vol. 8939, pp. 254–265. Springer, Heidelberg (2015)
Karp, R.M.: Reducibility among combinatorial problems. In: Miller, R.E., Thatcher, J.W., Bohlinger, J.D. (eds.) Complexity of Computer Computations, pp. 85–103. Springer, New York (1972)
Pržulj, N., Wigle, D.A., Jurisica, I.: Functional topology in a network of protein interactions. Bioinformatics 20(3), 340–348 (2004)
Radivojac, P., Clark, W.T., Oron, T.R., Schnoes, A.M., Wittkop, T., Sokolov, A., Graim, K., Funk, C., Verspoor, K., Ben-Hur, A., et al.: A large-scale evaluation of computational protein function prediction. Nat. Methods 10(3), 221–227 (2013)
Rentzsch, R., Orengo, C.A.: Protein function prediction-the power of multiplicity. Trends Biotechnol. 27(4), 210–219 (2009)
Sharan, R., Ulitsky, I., Shamir, R.: Network-based prediction of protein function. Mol. Syst. Biol. 3(1), 88 (2007)
Wang, L.: Near optimal solutions for maximum quasi-bicliques. J. Comb. Optim. 25(3), 481–497 (2013)
Acknowledgments
IF and OE acknowledge support from the National Science Foundation award # DBI 1458359 and #GS 133814.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2016 Springer International Publishing Switzerland
About this paper
Cite this paper
Moon, J., Friedberg, I., Eulenstein, O. (2016). Highly Bi-Connected Subgraphs for Computational Protein Function Annotation. In: Dinh, T., Thai, M. (eds) Computing and Combinatorics . COCOON 2016. Lecture Notes in Computer Science(), vol 9797. Springer, Cham. https://doi.org/10.1007/978-3-319-42634-1_46
Download citation
DOI: https://doi.org/10.1007/978-3-319-42634-1_46
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-42633-4
Online ISBN: 978-3-319-42634-1
eBook Packages: Computer ScienceComputer Science (R0)