Abstract
Entries in biomolecular databases are often annotated with concepts from different ontologies, and each such annotation establishes links between pairs of concepts. These links may reveal meaningful relationships between the linked concepts, but they may equally relate concepts purely by chance.
In this work we present InterOnto, a methodology for ranking concept pairs to identify the most meaningful associations. The novelty of our approach compared to previous work is that it takes the entire structure of the involved ontologies into account. As a result, our method even finds links that are not present in the annotated data but can be inferred through subsumed concept pairs.
We have evaluated our methodology both quantitatively and qualitatively. Using real-life data from TAIR, we show that our proposed scoring function identifies the most representative concept pairs while preventing overgeneralization. In comparison to prior work, our method generally yields rankings of equivalent or better quality.
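The idea of inferring a link through subsumed concept pairs can be illustrated with a minimal sketch. It is not the InterOnto scoring function, which the abstract does not specify. The two toy ontologies, the concept names, the entry names, and the helpers `with_ancestors` and `pair_support` are all hypothetical, and the only assumption is that an annotation with a concept also counts as evidence for that concept's ancestors.

```python
from collections import defaultdict
from itertools import product

# Hypothetical toy ontologies: child -> parents (is-a edges).
PARENTS_A = {"a_leaf1": ["a_mid"], "a_leaf2": ["a_mid"], "a_mid": ["a_root"], "a_root": []}
PARENTS_B = {"b_leaf1": ["b_mid"], "b_leaf2": ["b_mid"], "b_mid": ["b_root"], "b_root": []}

# Hypothetical annotations: entry -> (concepts from ontology A, concepts from ontology B).
ANNOTATIONS = {
    "gene1": ({"a_leaf1"}, {"b_leaf1"}),
    "gene2": ({"a_leaf2"}, {"b_leaf2"}),
    "gene3": ({"a_leaf1"}, {"b_leaf2"}),
}


def with_ancestors(concepts, parents):
    """Return the given concepts together with all of their ancestors."""
    seen = set()
    stack = list(concepts)
    while stack:
        concept = stack.pop()
        if concept not in seen:
            seen.add(concept)
            stack.extend(parents.get(concept, []))
    return seen


def pair_support(annotations, parents_a, parents_b, propagate):
    """Count, for every concept pair, the entries that link both concepts."""
    support = defaultdict(set)
    for entry, (concepts_a, concepts_b) in annotations.items():
        if propagate:
            # Assumption: an annotation also supports every ancestor concept.
            concepts_a = with_ancestors(concepts_a, parents_a)
            concepts_b = with_ancestors(concepts_b, parents_b)
        for pair in product(concepts_a, concepts_b):
            support[pair].add(entry)
    return {pair: len(entries) for pair, entries in support.items()}


direct = pair_support(ANNOTATIONS, PARENTS_A, PARENTS_B, propagate=False)
inferred = pair_support(ANNOTATIONS, PARENTS_A, PARENTS_B, propagate=True)

# No entry is annotated with both a_mid and b_mid, so the pair has no direct support.
print(("a_mid", "b_mid") in direct)
# With propagation, the pair gains support from its subsumed leaf pairs.
print(inferred[("a_mid", "b_mid")])
# The root pair collects exactly the same raw support.
print(inferred[("a_root", "b_root")])
```

In this toy setting the pair of the two root concepts receives the same raw support as the more specific pair (`a_mid`, `b_mid`). A raw count therefore cannot tell an informative pair from an overly general one, which is the kind of overgeneralization a ranking function has to guard against.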
Copyright information
© 2012 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Trißl, S., Hussels, P., Leser, U. (2012). InterOnto – Ranking Inter-Ontology Links. In: Bodenreider, O., Rance, B. (eds) Data Integration in the Life Sciences. DILS 2012. Lecture Notes in Computer Science, vol 7348. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-31040-9_2
DOI: https://doi.org/10.1007/978-3-642-31040-9_2
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-31039-3
Online ISBN: 978-3-642-31040-9
eBook Packages: Computer Science (R0)