Advertisement

Conceptual Indexing for Multilingual Information Retrieval

  • Jacques Guyot
  • Saïd Radhouani
  • Gilles Falquet
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4022)

Abstract

We present a translation-free technique for multilingual information retrieval. This technique is based on an ontological representation of documents and queries. For each language, we use a dictionary (set of lexical reference for concepts) to map a term to its corresponding concept. The same mapping is applied to each document and each query. Then, we use a classic vector space model based on concept for indexing and querying the document corpus. The main advantages of our approach are: no merging phase is required; no dependency on automatic translators between all pairs of languages; and adding a new language only requires a new mapping dictionary to be added into the multilingual ontology. Experimental results on the CLEF 2005 multi8 collection show that this approach is efficient, even with relatively small and low fidelity dictionaries and without word sense disambiguation.

Keywords

Query Language Mean Average Precision Vector Space Model Word Sense Disambiguation Original Query 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Bozsak, E., et al.: KAON - Towards a Large Scale Semantic Web. In: Bauknecht, K., Tjoa, A.M., Quirchmayr, G. (eds.) EC-Web 2002. LNCS, vol. 2455, pp. 304–313. Springer, Heidelberg (2002)CrossRefGoogle Scholar
  2. 2.
    Chen, A., Gey, F.: Combining Query Translation and Document Translation in Cross-Language Retrieval. In: Peters, C., Gonzalo, J., Braschler, M., Kluck, M. (eds.) CLEF 2003. LNCS, vol. 3237, pp. 108–121. Springer, Heidelberg (2004)CrossRefGoogle Scholar
  3. 3.
    Ergane: http://download.travlang.com/, see also http://www.majstro.com/ [visited on August 2005]
  4. 4.
    Gruber, T.R.: A translation Approach to Portable Ontology Specifications. Knowledge Acquisition 5, 199–220 (1993)CrossRefGoogle Scholar
  5. 5.
    Guyot, J.: yaaa: yet another alignment algorithm - Alignement ontologique bi-texte pour un corpus multilingue. Cahier du CUI – Université de Genève (2005)Google Scholar
  6. 6.
    Guyot, J., Falquet, G.: Construire un moteur d’indexation. Cahier du CUI - Université de Genève (2005)Google Scholar
  7. 7.
    Salton, G.: Automatic Text Processing: The Transformation, Analysis and Retrieval of Information by Computer. Addison-Wesley, Reading (1989)Google Scholar
  8. 8.
    SnowBall [visited on August 2005], http://snowball.tartarus.org/
  9. 9.
    Uchida, H., Zhu, M., Della Senta, T.G.: Universal Networking Language. UNDL Foundation (2005)Google Scholar
  10. 10.
    UNL: Universal Networking Language, http://cui.unige.ch/isi/unl

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Jacques Guyot
    • 1
  • Saïd Radhouani
    • 1
  • Gilles Falquet
    • 1
  1. 1.Centre Universitaire d’InformatiqueUniversity of GenevaGenève 4Switzerland

Personalised recommendations