Dublin City University at CLEF 2004: Experiments in Monolingual, Bilingual and Multilingual Retrieval
The Dublin City University group participated in the monolingual, bilingual and multilingual retrieval tasks. The main focus of our investigation for CLEF 2004 was extending our information retrieval system to document languages other than English, and completing the multilingual task comprising four languages: English, French, Russian and Finnish. Our retrieval system is based on the City University Okapi BM25 system, with document preprocessing using the Snowball stemming software and stopword lists. Our French monolingual experiments compare retrieval using French documents and topics with retrieval using documents and topics translated into English. Our results indicate that working directly in French is more effective for retrieval than adopting document and topic translation. A breakdown of our multilingual retrieval results by individual language shows that similar overall average precision can be achieved even when there is significant underlying variation in performance across the individual languages.
Keywords: Relevant Document, Data Fusion, Average Precision, Pseudo Relevance Feedback, English Topic
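The ranking approach named in the abstract, BM25 term weighting applied after stopword removal, can be illustrated with a minimal sketch. This is not the Okapi system or the authors' configuration: the stopword list is a hypothetical toy set, stemming is omitted, the idf form is one common smoothed variant, and the values of `k1` and `b` are conventional defaults assumed for illustration.

```python
import math
from collections import Counter

# Hypothetical toy stopword list; the lists used in the paper are not given here.
STOPWORDS = {"the", "a", "of", "in", "and", "is"}


def tokenize(text):
    """Lowercase, split on whitespace, drop stopwords (no stemming in this sketch)."""
    return [t for t in text.lower().split() if t not in STOPWORDS]


def bm25_scores(query, docs, k1=1.2, b=0.75):
    """Score each document against the query with a common BM25 formulation.

    k1 and b are conventional defaults, assumed here rather than taken from the paper.
    """
    if not docs:
        return []
    tokenized = [tokenize(d) for d in docs]
    n_docs = len(tokenized)
    avg_len = sum(len(d) for d in tokenized) / n_docs
    # Document frequency: number of documents containing each term.
    df = Counter(term for d in tokenized for term in set(d))
    query_terms = tokenize(query)
    scores = []
    for doc in tokenized:
        tf = Counter(doc)
        score = 0.0
        for term in query_terms:
            if term not in tf:
                continue
            # Smoothed idf variant that stays non-negative.
            idf = math.log(1 + (n_docs - df[term] + 0.5) / (df[term] + 0.5))
            # Term frequency saturation with document length normalization.
            norm = tf[term] + k1 * (1 - b + b * len(doc) / avg_len)
            score += idf * tf[term] * (k1 + 1) / norm
        scores.append(score)
    return scores


docs = [
    "the cat sat in the garden",
    "dogs bark in the park",
    "a cat and a dog",
]
scores = bm25_scores("cat", docs)
```

In this toy collection the second document does not contain the query term and scores zero, while the third outranks the first because length normalization favours the shorter of two documents with the same term frequency.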
- 2. Snowball toolkit, http://snowball.tartarus.org/
- 3. Porter, M.F.: An algorithm for suffix stripping. Program 14(3), 130–137 (1980)
- 4. Lam-Adesina, A.M., Jones, G.J.F.: Applying Summarization Techniques for Term Selection in Relevance Feedback. In: Proceedings of the 24th Annual International ACM SIGIR Conference, New Orleans, pp. 1–9. ACM, New York (2001)