WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia
- Cite this paper as:
- Nguyen D., Overwijk A., Hauff C., Trieschnigg D.R.B., Hiemstra D., de Jong F. (2009) WikiTranslate: Query Translation for Cross-Lingual Information Retrieval Using Only Wikipedia. In: Peters C. et al. (eds) Evaluating Systems for Multilingual and Multimodal Information Access. CLEF 2008. Lecture Notes in Computer Science, vol 5706. Springer, Berlin, Heidelberg
This paper presents WikiTranslate, a system which performs query translation for cross-lingual information retrieval (CLIR) using only Wikipedia to obtain translations. Queries are mapped to Wikipedia concepts and the corresponding translations of these concepts in the target language are used to create the final query. WikiTranslate is evaluated by searching with topics formulated in Dutch, French and Spanish in an English data collection. The system achieved a performance of 67% compared to the monolingual baseline.
KeywordsCross-lingual information retrieval query translation word sense disambiguation Wikipedia comparable corpus
Unable to display preview. Download preview PDF.