WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia

  • Dong Nguyen
  • Arnold Overwijk
  • Claudia Hauff
  • Dolf R. B. Trieschnigg
  • Djoerd Hiemstra
  • Franciska de Jong
Conference paper

DOI: 10.1007/978-3-642-04447-2_6

Part of the Lecture Notes in Computer Science book series (LNCS, volume 5706)
Cite this paper as:
Nguyen D., Overwijk A., Hauff C., Trieschnigg D.R.B., Hiemstra D., de Jong F. (2009) WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia. In: Peters C. et al. (eds) Evaluating Systems for Multilingual and Multimodal Information Access. CLEF 2008. Lecture Notes in Computer Science, vol 5706. Springer, Berlin, Heidelberg

Abstract

This paper presents WikiTranslate, a system which performs query translation for cross-lingual information retrieval (CLIR) using only Wikipedia to obtain translations. Queries are mapped to Wikipedia concepts and the corresponding translations of these concepts in the target language are used to create the final query. WikiTranslate is evaluated by searching with topics formulated in Dutch, French and Spanish in an English data collection. The system achieved a performance of 67% compared to the monolingual baseline.

Keywords

Cross-lingual information retrieval query translation word sense disambiguation Wikipedia comparable corpus 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2009

Authors and Affiliations

  • Dong Nguyen
    • 1
  • Arnold Overwijk
    • 1
  • Claudia Hauff
    • 1
  • Dolf R. B. Trieschnigg
    • 1
  • Djoerd Hiemstra
    • 1
  • Franciska de Jong
    • 1
  1. 1.University of TwenteThe Netherlands

Personalised recommendations