Resolving Translation Ambiguity Using Monolingual Corpora

  • Conference paper
Advances in Cross-Language Information Retrieval (CLEF 2002)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2785)

Abstract

In the bilingual track of CLEF 2002, we focused on word translation ambiguity and experimented with several techniques for choosing the best target translation for each source query word, using co-occurrence statistics from a reference corpus of target-language documents. Each technique yields one best translation per source query word. We also experimented with combining these word-choice results in the final translated query. The source query languages were Spanish, Chinese, and Japanese; the target-language documents were in English. For Spanish-to-English retrieval, the best recall and average precision reached 95% and 97%, respectively, of those of an English monolingual retrieval run. For Chinese-to-English retrieval, the recall and average precision reached 89% and 60%, respectively, of the English run. For Japanese-to-English retrieval, the best recall and average precision reached 82% and 69%, respectively, of the English run.
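
The abstract does not spell out the selection techniques, so the following is only a minimal sketch of the general idea, under stated assumptions: each source query word has a list of candidate English translations, and the candidate that co-occurs most often (at document level) with the candidates of the other query words is kept. The scoring rule, the function names, the toy corpus, and the toy candidate lists are all hypothetical illustrations, not the authors' method or data.

```python
from collections import Counter
from itertools import combinations


def cooccurrence_counts(documents):
    """Count, for each unordered word pair, how many documents contain both."""
    counts = Counter()
    for tokens in documents:
        # sorted(set(...)) gives each pair once per document, in a fixed order
        for a, b in combinations(sorted(set(tokens)), 2):
            counts[(a, b)] += 1
    return counts


def pair_count(counts, a, b):
    """Look up a pair regardless of argument order (0 if never seen together)."""
    return counts[(a, b) if a <= b else (b, a)]


def choose_translations(candidates, counts):
    """Pick one translation per source word.

    candidates: one list of candidate target words per source query word.
    Assumed scoring rule: sum of co-occurrence counts with every candidate
    of every *other* source word; ties go to the first-listed candidate.
    """
    chosen = []
    for i, options in enumerate(candidates):
        def support(option):
            return sum(
                pair_count(counts, option, other)
                for j, others in enumerate(candidates) if j != i
                for other in others
            )
        chosen.append(max(options, key=support))
    return chosen


# Hypothetical toy target-language (English) reference corpus.
documents = [
    ["bank", "interest", "rate", "loan"],
    ["bank", "interest", "account"],
    ["bench", "park", "tree"],
    ["interest", "rate", "rise"],
]

# Hypothetical candidate lists for a two-word Spanish query ("banco", "interés").
candidates = [["bank", "bench"], ["interest", "curiosity"]]

chosen = choose_translations(candidates, cooccurrence_counts(documents))
print(chosen)  # ['bank', 'interest']
```

In this toy run, "bank" is preferred over "bench" because it shares two documents with "interest" and "bench" shares none with any candidate of the other query word. A real system would need a much larger reference corpus and, most likely, a normalized association measure rather than raw counts.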


Copyright information

© 2003 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Qu, Y., Grefenstette, G., Evans, D.A. (2003). Resolving Translation Ambiguity Using Monolingual Corpora. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds) Advances in Cross-Language Information Retrieval. CLEF 2002. Lecture Notes in Computer Science, vol 2785. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-45237-9_19

  • DOI: https://doi.org/10.1007/978-3-540-45237-9_19

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-40830-7

  • Online ISBN: 978-3-540-45237-9

  • eBook Packages: Springer Book Archive
