Skip to main content

A Progressive Algorithm for Cross-Language Information Retrieval Based on Dictionary Translation

  • Conference paper

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 4993))

Abstract

Query translation is the mainstream in cross-language information retrieval, but ambiguity must be resolved by methods based on dictionary translation. In this paper, we propose a progressive algorithm for disambiguation which is derived from another algorithm we propose called the max-sum model. The new algorithm take a strategy called weighted-average probability distribution to redistribute the probabilities. Moreover, the new algorithm can be computed in a more direct way by solving an equation system. All the resource our method requires is a bilingual dictionary and a monolingual corpus. Experiments show it outperforms four other methods.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD   109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Federico, M., Bertoldi, N.: Statistical cross-language information retrieval using N-best query translations. In: SIGIR 2002: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 167–174. ACM Press, New York (2002)

    Chapter  Google Scholar 

  2. Xu, J., Weischedel, R.: TREC-9 cross-lingual retrieval at BBN. In: Proceedings of the TREC-9 Conference (2001)

    Google Scholar 

  3. Lavrenko, V., Choquette, M., Croft, W.B.: Cross-lingual relevance models. In: SIGIR 2002: Proceedings of the 25th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 175–182. ACM Press, New York (2002)

    Chapter  Google Scholar 

  4. Nie, J.-Y., Simard, M.: Using statistical translation models for bilingual ir. In: Peters, C., Braschler, M., Gonzalo, J., Kluck, M. (eds.) CLEF 2001. LNCS, vol. 2406, pp. 137–150. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  5. Gao, J., Nie, J.-Y., Xun, E., Zhang, J., Zhou, M., Huang, C.: Improving query translation for cross-language information retrieval using statistical models. In: SIGIR 2001: Proceedings of the 24th annual international ACM SIGIR conference on Research and development in information retrieval, pp. 96–104. ACM Press, New York (2001)

    Chapter  Google Scholar 

  6. Kraaij, W., Pohlmann, R., Hiemstra, D.: Twenty-one at TREC-8: using language technology for information retrieval. In: Voorhees, E.M., Harman, D.K. (eds.) The Eigth Text REtrieval Conference (TREC-8). National Institute of Standards and Technology, NIST, vol. 8, pp. 285–300. NIST Special Publication 500-246 (2000)

    Google Scholar 

  7. Maeda, A., Sadat, F., Yoshikawa, M., Uemura, S.: Query term disambiguation for web cross-language information retrieval using a search engine. In: IRAL 2000: Proceedings of the fifth international workshop on Information retrieval with Asian languages, pp. 25–32. ACM Press, New York (2000)

    Chapter  Google Scholar 

  8. Liu, Y., Jin, R., Chai, J.Y.: A maximum coherence model for dictionary-based cross-language information retrieval. In: Proceedings of the 28th annual international ACM SIGIR conference on Research and development in information retrieval, Salvador, Brazil, August 15–19, 2005, pp. 536–543 (2005)

    Google Scholar 

  9. Adriani, M.: Dictionary-based CLIR for the CLEF multilingual track. In: Working Notes of the Workshop in Cross-Language Evaluation Forum (CLEF), September 2000, Lisbon (2000)

    Google Scholar 

  10. Mathews, J.H., Fink, K.D.: Numerical Methods Using MATLAB, 3rd edn. Prentice-Hall, Englewood Cliffs (2002)

    Google Scholar 

  11. Xu, S.: Computer Algorithms for Common Use, 2nd edn. Tsinghua University Press (1995)

    Google Scholar 

  12. Daelemans, W., Sima’an, K., Veenstra, J., Zavrel, J. (eds.): Different Approaches to Cross Language Information Retrieval, Language and Computers: Studies in Practical Linguistics, vol. 37. Rodopi, Amsterdam (2001)

    Google Scholar 

  13. Li, X., Yan, H., Wang, J.: Search Engine: Principle, Technology and Systems. Science Press (2004)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Hang Li Ting Liu Wei-Ying Ma Tetsuya Sakai Kam-Fai Wong Guodong Zhou

Rights and permissions

Reprints and permissions

Copyright information

© 2008 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Yuan, S.A., Yu, S.N. (2008). A Progressive Algorithm for Cross-Language Information Retrieval Based on Dictionary Translation. In: Li, H., Liu, T., Ma, WY., Sakai, T., Wong, KF., Zhou, G. (eds) Information Retrieval Technology. AIRS 2008. Lecture Notes in Computer Science, vol 4993. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-68636-1_24

Download citation

  • DOI: https://doi.org/10.1007/978-3-540-68636-1_24

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-68633-0

  • Online ISBN: 978-3-540-68636-1

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics