A Hybrid Approach to Query and Document Translation Using a Pivot Language for Cross-Language Information Retrieval

  • Kazuaki Kishida
  • Noriko Kando
Part of the Lecture Notes in Computer Science book series (LNCS, volume 4022)

Abstract

This paper reports experimental results for cross-language information retrieval (CLIR) from German to French, in which a hybrid approach to query and document translation was attempted, i.e., the results of query translation (German to French) were combined with those of document translation (French to German). To reduce the computational cost of translating a large volume of text, documents were pseudo-translated, i.e., terms were simply replaced using a bilingual dictionary, whereas queries were translated with a machine translation system. Because English was used as an intermediary (pivot) language for both translation directions between German and French, the English translations produced at the intermediate stage were employed as document representations in order to reduce the number of translation steps. Omitting the English-to-German translation step improved performance. However, the hybrid approach did not outperform simple query translation. This may be due to the low performance of document translation, which was carried out by simple dictionary-based term replacement with no term disambiguation.
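
The two ideas named in the abstract, dictionary-based pseudo-translation and the merging of query-translation and document-translation results, can be illustrated with a minimal Python sketch. Everything concrete here is an assumption made for illustration: the toy dictionary `FR_EN`, the function names `pseudo_translate` and `merge_scores`, the choice to keep out-of-dictionary terms unchanged, and the linear weighting used to merge scores. The abstract does not state how the two result lists were actually combined.

```python
from collections import defaultdict

# Hypothetical toy French -> English dictionary (not taken from the paper).
FR_EN = {
    "chat": ["cat"],
    "avocat": ["lawyer", "avocado"],  # ambiguous entry, left unresolved
    "noir": ["black"],
}


def pseudo_translate(tokens, dictionary):
    """Replace each term with all of its dictionary translations.

    No term disambiguation is attempted. Terms missing from the
    dictionary are kept as-is (an assumption of this sketch).
    """
    translated = []
    for token in tokens:
        translated.extend(dictionary.get(token.lower(), [token]))
    return translated


def merge_scores(query_run, document_run, weight=0.5):
    """Linearly combine two {doc_id: score} result sets.

    `weight` applies to the query-translation run and `1 - weight` to
    the document-translation run. This fusion rule is assumed, not the
    authors' documented method.
    """
    merged = defaultdict(float)
    for doc_id, score in query_run.items():
        merged[doc_id] += weight * score
    for doc_id, score in document_run.items():
        merged[doc_id] += (1.0 - weight) * score
    # Highest combined score first.
    return sorted(merged.items(), key=lambda item: item[1], reverse=True)


if __name__ == "__main__":
    print(pseudo_translate(["Avocat", "noir", "Paris"], FR_EN))
    print(merge_scores({"d1": 1.0, "d2": 0.5}, {"d2": 1.0, "d3": 0.5}))
```

Because every translation of an ambiguous term such as "avocat" is kept, the pseudo-translated document gains spurious terms, which is consistent with the abstract's explanation for the weak document-translation results. In practice the two runs would also need scores on a comparable scale before being merged.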

Keywords

Hybrid Approach, Machine Translation, Average Precision, Mean Average Precision, English Document
These keywords were added by machine and not by the authors.

Copyright information

© Springer-Verlag Berlin Heidelberg 2006

Authors and Affiliations

  • Kazuaki Kishida (1)
  • Noriko Kando (2)
  1. Surugadai University, Hanno, Saitama, Japan
  2. National Institute of Informatics (NII), Tokyo, Japan
