Abstract
Cross-Language Information Retrieval (CLIR) combines the traditional Information Retrieval technique and Machine Translation technique. There are many aspects related to the problem of polysemy, which are good cut-in points for the application of WSD in CLIR. Therefore, an attempt in this paper is to apply WSD in English-Chinese Bi-Directional CLIR. The query expansion and the proposed Lesk-C WSD strategy are explored. Although limited improvement on WSD can be obtained, query expansion and disambiguation based on the related strategies of WSD are beneficial to CLIR, and can improve the whole retrieval performance. Specially, by considering the “Coordinate Terms”, the Lesk-C algorithm shows the better performance and has more extensive applicability on CLIR.
This paper is supported by National Natural Science Foundation of China (No. 60773124), National Science and Technology Pillar Program of China (No. 2007BAH09B03) and Shanghai Municipal R&D Foundation (No. 08dz1500109). Tao Zhang is the corresponding author.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Peters, C.: Cross-Language Information Retrieval and Evaluation. LNCS, vol. 2069, pp. 261–272. Springer, Germany (2001)
Oard, W., Ertunc, F.: Translation-Based Indexing for Cross-Language Retrieval. In: Crestani, F., Girolami, M., van Rijsbergen, C.J.K. (eds.) ECIR 2002. LNCS, vol. 2291, pp. 324–333. Springer, Heidelberg (2002)
Gao, J., Nie, J.-Y., He, H., Chen, W., Zhou, M.: Resolving Query Translation Ambiguity Using a Decaying Co-Occurrence Model and Syntactic Dependence Relations. In: Proc. of the 25th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2002), pp. 183–190. ACM Press, New York (2002)
Stokoe, C., Oakes, M.P., Tait, J.: Word Sense Disambiguation in Information Retrieval Revisited. In: Proc. of the 26th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2003), pp. 159–165. ACM Press, New York (2003)
Monz, C., Dorr, B.J.: Iterative Translation Disambiguation for Cross-Language Information Retrieval. In: Proc. of the 28th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval (SIGIR 2005), pp. 520–527. ACM Press, New York (2005)
Witten, I.H., Frank, E.: Data Mining: Practical Machine Learning Tools and Techniques, 2nd edn. Morgan Kaufmann, San Francisco (2005)
Liu, Y., Jin, R., Chai, J.Y.: A Statistical Framework for Query Translation Disambiguation. ACM Transaction on Asian Language Information Processing (TALIP) 5(4), 360–387 (2006)
EKEDAHL, J., GOLUB, K.: Word Sense Disambiguation using WordNet and the Lesk Algorithm. Projektarbeten 2004. Institutionen för Datavetenskap, Lunds University (2004)
Pedersen, T., Banerjee, S., Patwardhan, S.: Maximizing Semantic Relatedness to Perform Word Sense Disambiguation. University of Minnesota Supercomputing Institute Research Report UMSI 2005/25 (2005)
Wu, Z., Palmer, M.: Verb Semantics and Lexical Selection. In: Proc. of the 32nd Annual Meeting of the Association for Computational Linguistics (ACL 1994), Las Cruces, New Mexico, pp. 133–138 (1994)
Jiang, J.J., Conrath, D.W.: Semantic Similarity based on Corpus Statistics and Lexical Taxonomy. In: Proc. of International Conference on Research in Computational Linguistics (ROCLING X), pp. 19–33. Scandinavian University Press, Taiwan (1997)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Zhang, Y., Zhang, T. (2009). Research on Lesk-C-Based WSD and Its Application in English-Chinese Bi-directional CLIR. In: Lee, G.G., et al. Information Retrieval Technology. AIRS 2009. Lecture Notes in Computer Science, vol 5839. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-04769-5_33
Download citation
DOI: https://doi.org/10.1007/978-3-642-04769-5_33
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-04768-8
Online ISBN: 978-3-642-04769-5
eBook Packages: Computer ScienceComputer Science (R0)