Abstract
We applied a Cross-Lingual PRF (Pseudo-Relevance Feedback) system to both the monolingual task and the German->English task. We focused on the effects of extracting a comparable corpus from the given newspaper data; our corpus doubled the average precision when used together with a parallel corpus made available to participants. The PRF performance was lower for the queries with few relevant documents. We also examined the effects of the PRF first-step retrieval in the parallel corpus vs. the entire document collection.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
M. Franz et al. Ad hoc and Multilingual Information Retrieval at IBM. In The Seventh Text REtrieval Conference (TREC-8)
G. Neumann. Morphix Software Package. http://www.dfki.de/~neumann/morphix/morphix.html
Y. Yang et al. Translingual Information Retrieval: Learning from Bilingual Corpora. In AI Journal Special Issue: Best of IJCAI 1997
J. Xu and W.B. Croft. Query Expansion Using Local and Global Document Analysis. In Proceedings of the Nineteenth Annual International ACM SIGIR Conference on Research and Development in Information Retrieval
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Rogati, M., Yang, Y. (2002). Cross-Lingual Pseudo-Relevance Feedback Using a Comparable Corpus. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds) Evaluation of Cross-Language Information Retrieval Systems. CLEF 2001. Lecture Notes in Computer Science, vol 2406. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45691-0_12
Download citation
DOI: https://doi.org/10.1007/3-540-45691-0_12
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44042-0
Online ISBN: 978-3-540-45691-9
eBook Packages: Springer Book Archive