Abstract
In the CLEF 2005 Ad-Hoc Track we addressed the problem of retrieving information in morphologically rich languages by experimenting with language-specific morphosyntactic processing and light Natural Language Processing (NLP). The diversity of the languages processed, namely Bulgarian, French, Italian, English, and Greek, allowed us to measure the effect of system-specific features on retrieval in these languages, and to compare that effect with the role of language resources in Cross-Language Information Retrieval (CLIR) in general.
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Lioma, C., Macdonald, C., He, B., Plachouras, V., Ounis, I. (2006). Applying Light Natural Language Processing to Ad-Hoc Cross Language Information Retrieval. In: Peters, C., et al. Accessing Multilingual Information Repositories. CLEF 2005. Lecture Notes in Computer Science, vol 4022. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11878773_19
DOI: https://doi.org/10.1007/11878773_19
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45697-1
Online ISBN: 978-3-540-45700-8
eBook Packages: Computer Science (R0)