Skip to main content

Research on Lesk-C-Based WSD and Its Application in English-Chinese Bi-directional CLIR

  • Conference paper
Information Retrieval Technology (AIRS 2009)

Part of the book series: Lecture Notes in Computer Science ((LNISA,volume 5839))

Included in the following conference series:

  • 829 Accesses

Abstract

Cross-Language Information Retrieval (CLIR) combines the traditional Information Retrieval technique and Machine Translation technique. There are many aspects related to the problem of polysemy, which are good cut-in points for the application of WSD in CLIR. Therefore, an attempt in this paper is to apply WSD in English-Chinese Bi-Directional CLIR. The query expansion and the proposed Lesk-C WSD strategy are explored. Although limited improvement on WSD can be obtained, query expansion and disambiguation based on the related strategies of WSD are beneficial to CLIR, and can improve the whole retrieval performance. Specially, by considering the “Coordinate Terms”, the Lesk-C algorithm shows the better performance and has more extensive applicability on CLIR.

This paper is supported by National Natural Science Foundation of China (No. 60773124), National Science and Technology Pillar Program of China (No. 2007BAH09B03) and Shanghai Municipal R&D Foundation (No. 08dz1500109). Tao Zhang is the corresponding author.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Peters, C.: Cross-Language Information Retrieval and Evaluation. LNCS, vol. 2069, pp. 261–272. Springer, Germany (2001)

    Google Scholar 

  2. Oard, W., Ertunc, F.: Translation-Based Indexing for Cross-Language Retrieval. In: Crestani, F., Girolami, M., van Rijsbergen, C.J.K. (eds.) ECIR 2002. LNCS, vol. 2291, pp. 324–333. Springer, Heidelberg (2002)

    Chapter  Google Scholar 

  3. Gao, J., Nie, J.-Y., He, H., Chen, W., Zhou, M.: Resolving Query Translation Ambiguity Using a Decaying Co-Occurrence Model and Syntactic Dependence Relations. In: Proc. of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), pp. 183–190. ACM Press, New York (2002)

    Chapter  Google Scholar 

  4. Stokoe, C., Oakes, M.P., Tait, J.: Word Sense Disambiguation in Information Retrieval Revisited. In: Proc. of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), pp. 159–165. ACM Press, New York (2003)

    Google Scholar 

  5. Monz, C., Dorr, B.J.: Iterative Translation Disambiguation for Cross-Language Information Retrieval. In: Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pp. 520–527. ACM Press, New York (2005)

    Google Scholar 

  6. Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)

    MATH  Google Scholar 

  7. Liu, Y., Jin, R., Chai, J.Y.: A Statistical Framework for Query Translation Disambiguation. ACM Transaction on Asian Language Information Processing (TALIP) 5(4), 360–387 (2006)

    Article  Google Scholar 

  8. EKEDAHL, J., GOLUB, K.: Word Sense Disambiguation using WordNet and the Lesk Algorithm. Projektarbeten 2004. Institutionen för Datavetenskap, Lunds University (2004)

    Google Scholar 

  9. Pedersen, T., Banerjee, S., Patwardhan, S.: Maximizing Semantic Relatedness to Perform Word Sense Disambiguation. University of Minnesota Supercomputing Institute Research Report UMSI 2005/25 (2005)

    Google Scholar 

  10. Wu, Z., Palmer, M.: Verb Semantics and Lexical Selection. In: Proc. of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL 1994), Las Cruces, New Mexico, pp. 133–138 (1994)

    Google Scholar 

  11. Jiang, J.J., Conrath, D.W.: Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. In: Proc. of International Conference on Research in Computational Linguistics (ROCLING X), pp. 19–33. Scandinavian University Press, Taiwan (1997)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Zhang, Y., Zhang, T. (2009). Research on Lesk-C-Based WSD and Its Application in English-Chinese Bi-directional CLIR. In: Lee, G.G., et al. Information Retrieval Technology. AIRS 2009. Lecture Notes in Computer Science, vol 5839. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04769-5_33

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-04769-5_33

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-04768-8

  • Online ISBN: 978-3-642-04769-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics