Monolingual, Bilingual, and GIRT Information Retrieval at CLEF-2005
For our fifth participation in the CLEF evaluation campaigns, our first objective was to propose an effective and general stopword list as well as a light stemming procedure for the Hungarian, Bulgarian and Portuguese (Brazilian) languages. Our second objective was to obtain a better picture of the relative merit of various search engines when processing documents in those languages. To do so, we evaluated our scheme using two probabilistic models and five vector-processing approaches. In the bilingual track, we evaluated both machine translation and bilingual dictionary approaches for automatically translating a query submitted in English into various target languages. Finally, using the GIRT corpora (available in English, German and Russian), we investigated how retrieval effectiveness varied when we included or excluded the manually assigned keywords attached to the bibliographic records (which mainly comprise a title and an abstract).
Keywords: Machine Translation, Average Precision, Retrieval Performance, Bibliographic Record, Translation Tool