Methods of Gene Ontology Term Similarity Analysis in Graph Database Environment

  • Łukasz Stypka
  • Michał Kozielski
Part of the Communications in Computer and Information Science book series (CCIS, volume 424)


The article presents and analyses three graph processing issues that arise in three methods of GO term similarity evaluation. Solutions to these problems are implemented in the Neo4j graph database environment. Each issue can be solved directly by a single Cypher query, or divided into several queries whose results have to be merged. The introduced solutions are compared in terms of time and memory efficiency. The results show how to implement effective solutions for this class of issues.
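The abstract does not name the three issues, so the following is only a minimal sketch of the "single query versus several merged queries" trade-off, using an assumed example task: finding the common ancestors of two terms in a directed acyclic graph. The toy graph, the term names, and both function names are hypothetical, and plain in-memory Python stands in for Cypher queries against Neo4j.

```python
from typing import Dict, Set

# Hypothetical toy DAG (not real GO data): child term -> set of parent terms.
PARENTS: Dict[str, Set[str]] = {
    "root": set(),
    "A": {"root"},
    "B": {"root"},
    "C": {"A"},
    "D": {"A", "B"},
}


def ancestors(term: str) -> Set[str]:
    """Return the term itself plus every term reachable via parent edges."""
    seen: Set[str] = set()
    stack = [term]
    while stack:
        current = stack.pop()
        if current not in seen:
            seen.add(current)
            stack.extend(PARENTS[current])
    return seen


def common_ancestors_merged(t1: str, t2: str) -> Set[str]:
    """Split strategy: two independent lookups, merged on the client side."""
    return ancestors(t1) & ancestors(t2)


def common_ancestors_single(t1: str, t2: str) -> Set[str]:
    """Single-pass strategy: one traversal that tracks which start term
    reached each node, keeping the nodes reached from both."""
    reached: Dict[str, Set[str]] = {}
    stack = [(t1, t1), (t2, t2)]
    while stack:
        term, source = stack.pop()
        sources = reached.setdefault(term, set())
        if source not in sources:
            sources.add(source)
            stack.extend((parent, source) for parent in PARENTS[term])
    return {term for term, sources in reached.items() if sources == {t1, t2}}


if __name__ == "__main__":
    print(sorted(common_ancestors_merged("C", "D")))
    print(sorted(common_ancestors_single("C", "D")))
```

Both strategies return the same set. Which one is cheaper in time and memory depends on the graph and the database engine, which is the kind of comparison the article reports.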


Keywords: graph database, Neo4j, Gene Ontology, GO term similarity





Copyright information

© Springer International Publishing Switzerland 2014

Authors and Affiliations

  1. Institute of Informatics, Silesian University of Technology, Gliwice, Poland
  2. Future Processing, Gliwice, Poland
