Scalable Multilingual Information Access

  • Paul McNamee
  • James Mayfield
Conference paper

DOI: 10.1007/978-3-540-45237-9_17

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2785)
Cite this paper as:
McNamee P., Mayfield J. (2003) Scalable Multilingual Information Access. In: Peters C., Braschler M., Gonzalo J., Kluck M. (eds) Advances in Cross-Language Information Retrieval. CLEF 2002. Lecture Notes in Computer Science, vol 2785. Springer, Berlin, Heidelberg

Abstract

The third Cross-Language Evaluation Forum workshop (CLEF-2002) provides an unprecedented opportunity to evaluate retrieval in eight different languages using a common set of topics and a uniform assessment methodology. This year the Johns Hopkins University Applied Physics Laboratory participated in the monolingual, bilingual, and multilingual retrieval tasks. We contend that information access in a plethora of languages requires approaches that are inexpensive in both development and run-time costs. In this paper we describe a simplified approach that seems suitable for retrieval in many languages; we also show that good retrieval is possible over many languages, even when translation resources are scarce or when query-time translation is infeasible. In particular, we investigate the use of character n-grams for monolingual retrieval, CLIR between related languages using partial morphological matches, and translation of document representations to an interlingua for computationally efficient retrieval against multiple languages.
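
As an illustration of the character n-gram idea mentioned in the abstract, the following is a minimal sketch and not the authors' implementation. The function names `char_ngrams` and `ngram_overlap`, the choice of n = 4, the lowercasing and whitespace normalization, and the use of a Dice coefficient are all assumptions made for this example. It shows how overlapping n-grams can be extracted without any language-specific resources, and how morphologically related words end up sharing n-grams, which is one plausible reading of "partial morphological matches".

```python
def char_ngrams(text, n=4):
    """Return overlapping character n-grams of `text`.

    Assumptions for this sketch: text is lowercased, runs of whitespace
    are collapsed to one space, and a single space pads each end so that
    word-initial and word-final n-grams are marked.
    """
    normalized = " ".join(text.lower().split())
    if not normalized:
        return []
    padded = f" {normalized} "
    if len(padded) < n:
        # Text shorter than n: keep it as a single short token.
        return [padded]
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def ngram_overlap(a, b, n=4):
    """Dice coefficient over the n-gram sets of two strings (hypothetical
    similarity measure, chosen here only for illustration)."""
    grams_a = set(char_ngrams(a, n))
    grams_b = set(char_ngrams(b, n))
    if not grams_a and not grams_b:
        return 0.0
    return 2 * len(grams_a & grams_b) / (len(grams_a) + len(grams_b))


if __name__ == "__main__":
    print(char_ngrams("ab cd", n=3))
    # Related word forms share leading n-grams even without stemming.
    print(ngram_overlap("juggling", "juggler"))
```

Because tokenization of this kind needs no stemmer, stopword list, or lexicon, the same indexing code can be reused across languages, which fits the abstract's emphasis on low developer cost.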

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Paul McNamee (1)
  • James Mayfield (1)
  1. Applied Physics Lab, Johns Hopkins University, Laurel, USA
