Multi-language Models and Meta-dictionary Adaptation for Accessing Multilingual Digital Libraries
Accessing digital libraries raises the important issue of how to handle multilingual documents. Within a target collection, documents can be written in very different languages, and the record associated with a particular document often contains field descriptors in several languages. This paper proposes a principled way to address this issue: a multi-language model approach to information retrieval, together with an extension of the dictionary adaptation mechanism to cover multiple languages (including the source language). In experiments on the TEL task of the CLEF 2008 Ad-hoc track, runs based on a purely bilingual approach, which translates the query only into the official language of the collection, achieved performance (mean average precision) greater than or equal to that of the other participants. Contrary to our initial intuition, however, the experiments showed that, for the TEL task, exploiting information in languages other than the official language of the collection offers no advantage.
Keywords: Language Model, Digital Library, Average Precision, Source Language, Late Fusion