Abstract
Ontology matching is one of the key solutions for solving the heterogeneity problem in the Semantic Web. Nowadays, the increasing amount of multilingual data on the Web and the consequent development of ontologies in different natural languages have pushed the need for multilingual and cross-lingual ontology matching. This chapter provides an overview of multilingual and cross-lingual ontology matching. We formally define the problem of matching multilingual and cross-lingual ontologies and provide a classification of different techniques and approaches. Systematic evaluations of these techniques are discussed with an emphasis on standard and freely available data sets and systems.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsNotes
- 1.
The ontology is available at http://dati.camera.it/ocd/reference_document/.
- 2.
ocd: standing for http://dati.camera.it/ocd/ and dbpedia-owl: for http://dbpedia.org/ontology/.
- 3.
In order to get parliament members and their political party membership from the DBpedia.
- 4.
An example of comparable natural languages is English and German, both belonging to the Germanic language family. Comparable natural languages can also be languages that are not from the same language family. For example, Italian belonging to the Romance language family and German belonging to the Germanic language family can still be compared using string comparison techniques such as edit distance, as they are both alphabetic letter based with comparable graphemes. An example of natural languages that are not comparable in this context can be Chinese and English, where the former is logogram based and the latter is alphabetic letter based. In this chapter, we consider natural languages to be comparable when they contain graphemes that can be analysed using automated string comparison techniques.
- 5.
For example, in the case of 329 “Linked Open Vocabularies” at http://lov.okfn.org/, there are 271 vocabularies in English, 23 in French, 17 in German, 17 in Spanish, etc. Growing trend of multilinguality is documented by Vila-Suero et al. (this volume).
- 6.
These approaches are not exclusive and might overlap. This classification takes into account the kind of technique used (manual, translation, learning, etc.) and resources involved (dictionaries, corpora, etc.).
- 7.
- 8.
- 9.
- 10.
- 11.
- 12.
- 13.
- 14.
In this chapter, we have revised the definitions of multilingual and cross-lingual terms. Contrary to what is reported in Meilicke et al. (2012) MultiFarm is a benchmark for cross-lingual ontology matching.
- 15.
- 16.
- 17.
References
Aguirre, J. L., Eckert, K., Euzenat, J., Ferrara, A., van Hage, W. R., Hollink, L., et al. (2012). Results of the ontology alignment evaluation initiative 2012. In Proceedings of the 7th International Workshop on Ontology Matching (pp. 73–115).
Beisswanger, E. (2010). Exploiting relation extraction for ontology alignment. In Proceedings of the 9th International Semantic Web Conference (pp. 289–296).
Bouma, G. (2010). Cross-lingual ontology alignment using EuroWordNet and Wikipedia. In Proceedings of the 7th International Conference on Language Resources and Evaluation. European Language Resources Association.
Buitelaar, P., Cimiano, P., Haase, P., & Sintek, M. (2009). Towards linguistically grounded ontologies. In Proceedings of the 6th European Semantic Web Conference (pp. 111–125).
Cheng, C., Lau, G., Pan, J., Law, K., & Jones, A. (2008). Domain-specific ontology mapping by corpus-based semantic similarity. In Proceedings of 2008 Engineering Research and Innovation Conference.
Cimiano, P., Buitelaar, P., McCrae, J., & Sintek, M. (2011). LexInfo: A declarative model for the lexicon-ontology interface. Web Semantics: Science, Services and Agents on the World Wide Web, 9(1), 29–51.
Cimiano, P., Montiel-Ponsoda, E., Buitelaar, P., Espinoza, M., & Gómez-Pérez, A. (2010). A note on ontology localization. Applied Ontology, 5(2), 127–137.
Eger, S., & Sejane, I. (2010). Computing semantic similarity from bilingual dictionaries. In Proceedings of the 10th International Conference on the Statistical Analysis of Textual Data (pp. 1217–1225).
Espinoza, M., Gómez-Pérez, A., & Mena, E. (2008). LabelTranslator: A tool to automatically localize an ontology. In Proceedings of the 5th European Semantic Web Conference (pp. 792–796).
Euzenat, J., Ferrara, A., Hollink, L., Isaac, A., Joslyn, C., Malaisé, V., et al. (2009). Results of the ontology alignment evaluation initiative 2009. In Proceedings of the 4th International Workshop on Ontology Matching (pp. 73–126).
Euzenat, J., Meilicke, C., Stuckenschmidt, H., Shvaiko, P., & Trojahn, C. (2011). Ontology alignment evaluation initiative: Six years of experience. Journal on Data Semantics, XV, 158–192.
Euzenat, J., & Shvaiko, P. (2007). Ontology matching. New York: Springer.
Fu, B., Brennan, R., & O’Sullivan, D. (2009). Cross-lingual ontology mapping: An investigation of the impact of machine translation. In Proceedings of the 4th Annual Asian Semantic Web Conference (pp. 1–15).
Fu, B., Brennan, R., & O’Sullivan, D. (2010). Cross-lingual ontology mapping and its use on the multilingual semantic web. In Proceedings of the 1st International Workshop on the Multilingual Semantic Web (pp. 13–20).
Fu, B., Brennan, R., & O’Sullivan, D. (2011). Using pseudo feedback to improve cross-lingual ontology mapping. In Proceedings of the 8th Extended Semantic Web Conference (pp. 336–351). Berlin: Springer.
Fu, B., Brennan, R., & O’Sullivan, D. (2012). A configurable translation-based cross-lingual ontology mapping system to adjust mapping outcomes. Web Semantics: Science, Services and Agents on the World Wide Web, 15, 15–36.
Gracia, J., Montiel-Ponsoda, E., Cimiano, P., Gómez-Pérez, A., Buitelaar, P., & McCrae, J. (2012). Challenges for the multilingual web of data. Web Semantics: Science, Services and Agents on the World Wide Web, 11, 63–71.
Hassan, S., & Mihalcea, R. (2009). Cross-lingual semantic relatedness using encyclopedic knowledge. In Proceedings of the 2009 Conference on Empirical Methods in Natural Language Processing (pp. 1192–1201). Stroudsburg, PA, USA: ACL.
Hertling, S., & Paulheim, H. (2012). WikiMatch: Using wikipedia for ontology matching. In Proceedings of the 7th International Workshop on Ontology Matching (pp. 37–48).
Jung, J. J., Håkansson, A., & Hartung, R. (2009). Indirect alignment between multilingual ontologies: A case study of Korean and Swedish ontologies. In Proceedings of the 3rd KES International Symposium on Agent and Multi-Agent Systems (pp. 233–241). Berlin: Springer.
Kalfoglou, Y., & Schorlemmer, M. (2003). Ontology mapping: The state of the art. Knowledge Engineering Review, 18(1), 1–31.
Landry, P. (2009). Multilingualism and subject heading languages: How the MACS project is providing multilingual subject access in Europe. Catalogue & Index, 157, 9–11.
Li, J., Tang, J., Li, Y., & Luo, Q. (2009). RiMOM: A dynamic multi-strategy ontology alignment framework. IEEE Transactions on Knowledge and Data Engineering, 21(8), 1218–1232.
Liang, A. C., & Sini, M. (2006). Mapping agrovoc and the chinese agricultural thesaurus: Definitions, tools, procedures. The New Review of Hypermedia and Multimedia, 12(1), 51–62.
Lin, F., & Krizhanovsky, A. (2011). Multilingual ontology matching based on Wiktionary data accessible via SPARQL endpoint. In Proceedings of the 13th All-Russian Conference Digital Libraries: Advanced Methods and Technologies, Digital Collections (pp. 19–22).
Mayr, P., & Petras, V. (2008). Building a terminology network for search: The KoMoHe project. In Proceedings of the Conference on Dublin Core and Metadata Applications (pp. 177–182).
McCrae, J., Spohr, D., & Cimiano, P. (2011). Linking lexical resources and ontologies on the semantic web with lemon. In The semantic web: Research and applications (pp. 245–259). Heidelberg: Springer.
Meilicke, C., Garcia-Castro, R., Freitas, F., van Hage, W. R., Montiel-Ponsoda, E., de Azevedo, R. R., et al. (2012). MultiFarm: A benchmark for multilingual ontology matching. Journal on Web Semantics, 15, 62–68.
Mihic, S., & Ivetic, D. (2012). Multilingual ontology alignment based on visual representations of ontology concepts. In Proceedings of the 5th International Conference on Advances in Computer-Human Interaction (pp. 101–105).
Mohammad, S., Gurevych, I., Hirst, G., & Zesch, T. (2007). Cross-lingual distributional profiles of concepts for measuring semantic distance. In Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (pp. 571–580).
Montiel-Ponsoda, E., de Cea, G. ;A., Gómez-Pérez, A., & Peters, W. (2011). Enriching ontologies with multilingual information. Natural Language Engineering, 17(3), 283–309.
Ngai, G., Carpuat, M., & Fung, P. (2002). Identifying concepts across languages: A first step towards a corpus-based approach to automatic ontology alignment. In Proceedings of the 19th International Conference on Computational Linguistics (pp. 1–7).
Pazienza, M. T., & Stellato, A. (2005). Linguistically motivated ontology mapping for the semantic web. In Proceedings of the 2nd Italian Semantic Web Workshop.
Pazienza, M. T., & Stellato, O. (2006). Linguistic enrichment of ontologies: A methodological framework. In Proceedings of the 2nd Workshop on Interfacing Ontologies and Lexical Resources for Semantic Web Technologies.
Peters, C., Braschler, M., & Clough, P. (2012). Multilingual information retrieval: From research to practice. New York: Springer.
Rahm, E., & Bernstein, P A. (2001). A survey of approaches to automatic schema matching. The VLDB Journal, 10(4), 334–350.
Shvaiko, P., & Euzenat, J. (2005). A survey of schema-based matching approaches. Journal on Data Semantics, 4, 146–171.
Spohr, D., Hollink, L., & Cimiano, P. (2011). A machine learning approach to multilingual and cross-lingual ontology matching. In Proceedings of the 10th International Semantic Web Conference (pp. 665–680). Berlin: Springer.
Sváb-Zamazal, O., Svátek, V., Berka, P., Rak, D., & Tomášek, P. (2005). OntoFarm: Towards an experimental collection of parallel ontologies. In Poster Proceedings of the 4th International Semantic Web Conference.
Trojahn, C., Quaresma, P., & Vieira, R. (2008). A framework for multilingual ontology mapping. In Proceedings of the 6th International Conference on Language Resources and Evaluation, 1034–1037.
Wang, S., Isaac, A., Schopman, B., Schlobach, S., & Meij, L. (2009). Matching multi-lingual subject vocabularies. In Research and advanced technology for digital libraries (pp. 125–137). Heidelberg: Springer.
Acknowledgements
Cassia Trojahn is partially supported by the CAPES-COFECUB Cameleon project number 707-11. Ondřej Zamazal has been supported by the CSF grant no. 14-14076P.
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2014 Springer-Verlag Berlin Heidelberg
About this chapter
Cite this chapter
Trojahn, C., Fu, B., Zamazal, O., Ritze, D. (2014). State-of-the-Art in Multilingual and Cross-Lingual Ontology Matching. In: Buitelaar, P., Cimiano, P. (eds) Towards the Multilingual Semantic Web. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-662-43585-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-662-43585-4_8
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-662-43584-7
Online ISBN: 978-3-662-43585-4
eBook Packages: Computer ScienceComputer Science (R0)