An Empirical Study of Instance-Based Ontology Matching

  • Antoine Isaac
  • Lourens van der Meij
  • Stefan Schlobach
  • Shenghui Wang
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4825)


Instance-based ontology mapping is a promising family of solutions to a class of ontology alignment problems. It crucially depends on measuring the similarity between sets of annotated instances. In this paper we study how the choice of co-occurrence measures affects the performance of instance-based mapping.

To this end, we have implemented a number of different statistical co-occurrence measures. We have prepared an extensive test case using vocabularies of thousands of terms, millions of instances, and hundreds of thousands of co-annotated items. We have obtained a human Gold Standard judgement for part of the mapping-space. We then study how the different co-occurrence measures and a number of algorithmic variations perform on our benchmark dataset as compared against the Gold Standard.

Our systematic study shows excellent results of instance-based matching in general, where the more simple measures often outperform more sophisticated statistical co-occurrence measures.


Information Gain Mapping Index Ontology Mapping Pointwise Mutual Information Equivalent Concept 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Euzenat, J., Shvaiko, P.: Ontology Matching. Springer, Heidelberg (2007)zbMATHGoogle Scholar
  2. 2.
    Vizine-Goetz, D.: Popular LCSH with Dewey Numbers: Subject headings for everyone. Annual Review of OCLC Research (1997)Google Scholar
  3. 3.
    Avesani, P., Giunchiglia, F., Yatskevich, M.: A large scale taxonomy mapping evaluation. In: Gil, Y., Motta, E., Benjamins, V.R., Musen, M.A. (eds.) ISWC 2005. LNCS, vol. 3729, Springer, Heidelberg (2005)CrossRefGoogle Scholar
  4. 4.
    Isaac, A., Matthezing, H., van der Meij, L., Schlobach, S., Wang, S., Zinn, C.: The value of usage scenarios for thesaurus alignment in cultural heritage context. Under submissionGoogle Scholar
  5. 5.
    Shvaiko, P., Euzenat, J.: A survey of schema-based matching approaches. Journal of data semantics 4, 146–171 (2005)CrossRefGoogle Scholar
  6. 6.
    Doerr, M.: Semantic problems of thesaurus mapping. Journal of Digital Information 1(8) (2004)Google Scholar
  7. 7.
    Manning, C., Schütze, H.: Foundations of Statistical Natural Language Processing. MIT Press, Cambridge, MA (1999)zbMATHGoogle Scholar
  8. 8.
    van Gendt, M., Isaac, A., van der Meij, L., Schlobach, S.: Semantic Web Techniques for Multiple Views on Heterogeneous Collections: a Case Study. In: 10th European Conference on Digital Libraries (ECDL), Alicante, Spain (2006)Google Scholar
  9. 9.
    Euzenat, J., Mochol, M., Shvaiko, P., Stuckenschmidt, H., Šváb, O., Svátek, V., van Hage, W.R., Yatskevich, M.: Results of the ontology alignment evaluation initiative 2006. In: Cruz, I., Decker, S., Allemang, D., Preist, C., Schwabe, D., Mika, P., Uschold, M., Aroyo, L. (eds.) ISWC 2006. LNCS, vol. 4273, Springer, Heidelberg (2006)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2007

Authors and Affiliations

  • Antoine Isaac
    • 1
    • 2
  • Lourens van der Meij
    • 1
    • 2
  • Stefan Schlobach
    • 1
  • Shenghui Wang
    • 1
    • 2
  1. 1.Vrije Universiteit Amsterdam 
  2. 2.Koninklijke Bibliotheek, Den Haag 

Personalised recommendations