Abstract
Biological databases have been developed with a special focus on the efficient retrieval of single records or the efficient computation of specialized bioinformatics algorithms against the overall database, such as in sequence alignment. The continuos production of biological knowledge spread on several biological databases and ontologies, such as Gene Ontology, and the availability of efficient techniques to handle such knowledge, such as annotation and semantic similarity measures, enable the development on novel bioinformatics applications that explicitly use and integrate such knowledge. After introducing the annotation process and the main semantic similarity measures, this paper shows how annotations and semantic similarity can be exploited to improve the extraction and analysis of biologically relevant data from protein interaction databases. As case studies, the paper presents two novel software tools, OntoPIN and CytoSeVis, both based on the use of Gene Ontology annotations, for the advanced querying of protein interaction databases and for the enhanced visualization of protein interaction networks.
Similar content being viewed by others
References
Mario Cannataro, Pietro Hiram Guzzi, Giuseppe Tradigo, Pierangelo Veltri, Biological Databases, in Springer Handbook of Bio-/Neuro-Informatics, edited by Nikola Kasabov (Springer-Verlag, 2014) pp. 431–440 ISBN 978-3-642-30573-3.
Mario Cannataro, Pietro Hiram Guzzi, Data Management of Protein Interaction Networks (Wiley-IEEE Computer Society Press, USA, 2011) ISBN-10: 0470770406, ISBN-13: 978-0470770405.
Tero Aittokallio, Benno Schwikowski, Brief. Bioinform. 7, 243 (2006).
Evelyn Camon, Michele Magrane, Daniel Barrell, Vivian Lee, Emily Dimmer, John Maslen, David Binns, Nicola Harte, Rodrigo Lopez, Rolf Apweiler, Nucl. Acids Res. 32, D262 (2004).
M. Cannataro, P.H. Guzzi, P. Veltri, ACM Comput. Surv. 43, 1 (2010) 10.1145/1824795.1824796.
G. Ciriello, M. Mina, P.H. Guzzi, M. Cannataro, C. Guerra, PloS ONE 7, e38107 (2012) DOI: 10.1371/journal.pone.0038107.
Mario Cannataro, Pietro H. Guzzi, Pierangelo Veltri, ACM Comput. Surv. 43, (2010).
Pietro H. Guzzi, Marco Mina, Concettina Guerra, Mario Cannataro, Brief. Bioinform. 13, 569 (2012) DOI: 10.1093/bib/bbr066.
Lukasz Salwinski, Christopher S. Miller, Adam J. Smith, Frank K. Pettit, James U. Bowie, David Eisenberg, Nucl. Acids Res. 32, D449 (2004).
Mario Cannataro, Pietro H. Guzzi, Pierangelo Veltri, Using ontologies for annotating and retrieving protein-protein interactions data, in Proceedings of the 22nd IEEE International Symposium on Computer-Based Medical Systems (CBMS 2009) (IEEE Press, 2009) pp. 1–5.
Evelyn Camon, Michele Magrane, Daniel Barrell, Vivian Lee, Emily Dimmer, John Maslen, David Binns, Nicola Harte, Rodrigo Lopez, Rolf Apweiler, Nucl. Acids Res. 32, D262 (2004).
Pietro H. Guzzi, Mario Cannataro, EMBnet J. 18, 32 (2012).
Paul Shannon, Andrew Markiel, Owen Ozier, Nitin S. Baliga, Jonathan T. Wang, Daniel Ramage, Nada Amin, Benno Schwikowski, Trey Ideker, Genome Res. 13, 2498 (2003).
B. Aranda, P. Achuthan, Y. Alam-Faruque, I. Armean, A. Bridge, C. Derow, M. Feuermann, A.T. Ghanbarian, S. Kerrien, J. Khadake, J. Kerssemakers, C. Leroy, M. Menden, M. Michaut, L. Montecchi-Palazzi, S.N. Neuhauser, S. Orchard, V. Perreau, B. Roechert, K. van Eijk, H. Hermjakob, Nucl. Acids Res. 38, gkp878 (2010).
A. Zanzoni, L. Montecchi-Palazzi, M. Quondam, G. Ausiello, M. Helmer-Citterich, G. Cesareni, FEBS Lett. 513, 135 (2002).
P. Vlkel, P. Le Faou, P.O. Angrand, Biochem. Soc. Trans. 38, 883 (2010) DOI: 10.1042/BST0380883.
Allison Doerr, Nature Methods 10, 38 (2013) DOI: 10.1038/nmeth.2298.
Bioproximity web cite, www.bioproximity.com.
E.D. Levy, J.B. Pereira-Leal, C. Chothia, S.A. Teichmann, PLoS Comput. Biol. 17, e155 (2006).
S. Pu, J. Wong, B. Turner, E. Cho, S.J. Wodak, Nucl. Acids Res. 37, 825 (2008).
Louis du Plessis, Nives Skunca, Christophe Dessimoz, Brief. Bioinform. 12, 723 (2011).
Catia Pesquita, Daniel Faria, Andr O. Falco, Phillip Lord, Francisco M. Couto, PLoS Comput. Biol. 5, 07 (2009).
Pietro H. Guzzi, Marco Mina, Concettina Guerra, Mario Cannataro, Brief. Bioinform. 13, 569 (2012) DOI: 10.1093/bib/bbr066.
Philip Resnik, Using information content to evaluate semantic similarity in a taxonomy, in IJCAI, pages 448–453.
Dekang Lin, An information-theoretic definition of similarity, in ICML, edited by Jude W. Shavlik, Jude W. Shavlik, (Morgan Kaufmann, 1998) pp. 296–304.
J.J. Jiang, D.W. Conrath, Semantic similarity based on corpus statistics and lexical taxonomy, in International Conference Research on Computational Linguistics (ROCLING X) (1997) p. 9008.
Mihail Popescu, James M. Keller, Joyce A. Mitchell, IEEE/ACM Trans. Comput. Biol. Bioinform. 3, 263 (2006).
Guangchuang Yu, Fei Li, Yide Qin, Xiaochen Bo, Yibo Wu, Shengqi Wang, Bioinformatics 26, btq064 (2010).
Y.R. Cho, M. Mina, Y. Lu, N. Kwon, P.H. Guzzi, Proteome Sci. 11, S3 (2013).
Da Wei Huang, Brad T. Sherman, Richard A. Lempicki, Nucl. Acids Res. 37, 113 (2009).
P. Erdös, A. Rényi, Publ. Math. Inst. Hung. Acad. Sci. 5, 17 (1960).
Albert-Laszlo Barabasi, Reka Albert, Science 286, 509 (1999).
Albert-László Barabási, Science 325, 412 (2009).
N. Przulj, D. Higham, J. R. Soc. Interface 3, 711 (2006).
George Michailidis, Data visualization through their graph representations (2008) pp. 103–120.
Georgios Pavlopoulos, Anna L. Wegener, Reinhard Schneider, BioData Mining 1, 12 (2008).
Matthew Suderman, Michael Hallett, Bioinformatics 23, 2651 (2007).
Giuseppe Agapito, Pietro H. Guzzi, Mario Cannataro, BMC Bioinform. 14, S1 (2013).
Kevin R. Brown, David Otasek, Muhammad Ali, Michael J. McGuffin, Wing Xie, Baiju Devani, Ian Lawson van Toch, Jurisica Igor, Bioinformatics 25, 3327 (2009).
S. Maere, K. Heymans, M. Kuiper, Bioinformatics 21, 3448 (2005).
O. Garcia, C. Saveanu, M. Cline, M. Fromont-Racine, A. Jacquier, B. Schwikowski, T. Aittokallio, Bioinformatics 23, 394 (2007).
Author information
Authors and Affiliations
Corresponding author
Additional information
Contribution to the Focus Point on “Pattern Recognition Tools for Proteomics” edited by V. Cantoni.
Rights and permissions
About this article
Cite this article
Cannataro, M., Hiram Guzzi, P. & Veltri, P. Annotation and retrieval in protein interaction databases. Eur. Phys. J. Plus 129, 135 (2014). https://doi.org/10.1140/epjp/i2014-14135-x
Received:
Accepted:
Published:
DOI: https://doi.org/10.1140/epjp/i2014-14135-x