This chapter explores the different sources of linguistic knowledge that can be employed by WSD systems. These are more abstract than the features used by WSD algorithms, which are encoded at the algorithmic level and normally extracted from a lexical resource or corpora. The chapter begins by listing a comprehensive set of knowledge sources with examples of their application and then explains whether this linguistic knowledge may be found in corpora, lexical knowledge bases or machine readable dictionaries. An analysis of knowledge sources used in actual WSD systems is then presented. It has been observed that the best results are often obtained by combining knowledge sources and the chapter concludes by analyzing experiments on the effect of different knowledge sources which have implications about the effectiveness of each.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, Eneko & David MartÃnez. 2001a. Knowledge sources for word sense disambiguation. Proceedings of the Fourth International Conference on Text Speech and Dialogue (TSD), Plzen, Czech Republic.
Agirre, Eneko & David MartÃnez. 2001b. Learning class-to-class selectional preferences. Proceedings of the ACL/EACL Workshop on Computational Natural Language Learning (CoNLL), Toulouse, France.
Agirre, Eneko & German Rigau. 1996. Word sense disambiguation using conceptual density. Proceedings of the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark.
ALPAC. 1966. Languages and Machines: Computers in Translation and Linguistics. National Research Council Publication 1416, Washington, USA.
Bar-Hillel, Yehoshua. 1964. Language and Information Addison-Wesley, New York, USA.
Bateman, John A., Robert Kasper, Johanna Moore & Richard A. Whitney. 1990. A General Organization of Knowledge for Natural Language Processing: The PENMAN Upper Model. Technical report, USC/Information Sciences Institute, Marina del Rey, USA.
Boguraev, Branimir. 1979. Automatic Resolution of Linguistic Ambiguities. Ph.D. Thesis, Computer Laboratory, University of Cambridge, Cambridge, UK.
Brill, Eric. 1995. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21 (4):543-566.
Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra & Robert L. Mercer. 1991. Word sense disambiguation using statistical methods. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL), Berkeley, USA, 264-270.
Bruce, Rebecca & Louise Guthrie. 1992. Genus disambiguation: A study in weighted preference. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 1187-1191.
Carroll, John & Ted Briscoe. 2001. High precision extraction of grammatical relations. Proceedings of the 7th ACL/SIGPARSE International Workshop on Parsing Technologies, Beijing, China, 78-89.
Charniak, Eugene. 1983. Marker Passing: A Theory of Contextual Influence in Language Comprehension. Cognitive Science 7.
Chapman, Robert L. 1977. Roget’s International Thesaurus, Fourth Edition. Harper and Row, New York, USA.
Cowie, Jim, Louise Guthrie & Joe Guthrie. 1992. Lexical disambiguation using simulated annealing. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 359-365.
Cruse, David. 1998. Lexical Semantics. Cambridge University Press, Cambridge, UK.
Daelemans, Walter, Jakub Zavrel, Ko van der Sloot & Antal van den Bosch. 1999. TiMBL: Tilburg Memory Based Learner, Version 2.0, Reference Guide. ILK Technical Report 99-01, Tilburg University, The Netherlands.
Decadt, Bart, Véronique Hoste, Walter Daelemans, & Antal van den Bosch. 2004. GAMBL, Genetic Algorithm Optimization of Memory-Based WSD. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 108-112.
Dang, Hoa Trang & Martha Palmer. 2002. Combining contextual features for word sense disambiguation. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, USA.
Duda, Richard & Peter E. Hart. 1973. Pattern Classification and Scene Analysis. New York: Wiley.
Elworthy, David. 1994. Does Baum-Welch re-estimation help taggers? Proceedings of the 4th Conference on Applied Natural Language Processing, Stuttgart, Germany, 53-58.
Fellbaum, Christiane. 1998. WordNet: An Electronic Lexical Database. Massachusetts and London: The MIT Press.
Fernández, David, Julio Gonzalo & Felisa Verdejo. 2001. The UNED systems at Senseval-2. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 75-78.
Fillmore, Charles. 1971. Types of lexical information. Semantics: An interdisciplinary reader in philosophy, linguistics, and psychology. Cambridge: Cambridge University Press, 370-392.
Florian, Radu, Silviu Cucerzan, Charles Schafer & David Yarowsky. 2002. Classifier combination for word sense disambiguation. Journal of Natural Language Engineering, 8(4): 327-341.
Freund, Yoav & Robert E. Schapire. 1996. Experiments with a new boosting algorithm. Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, 148-156.
Gale, William, Kenneth W. Church & David Yarowsky. 1993. A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26 (5): 415-439.
Hirst, Graeme. 1987. Semantic Interpretation and the Resolution of Ambiguity. Cambridge, UK: Cambridge University Press.
Kilgarriff, Adam. 1997. Putting frequencies in the dictionary. International Journal of Lexicography, 10(2): 135-155
Kriedler, Charles. 1998. Introducing English Semantics. London and New York: Routledge.
Lee, Yoong K. & Hwee Tou Ng. 2002. An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, USA, 41-48.
Lee, Yoong K., Hwee Tou Ng & Tee Kiah Chia. 2004. Supervised word sense disambiguation with support vector machines and multiple knowledge sources. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 137-140.
Lesk, Michael. 1986. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proceedings of SIGDOC-86: 5th International Conference on Systems Documentation, Toronto, Canada, 24-26.
Lin, Dekang. 1993. Principle based parsing without overgeneration. Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, USA, 112-120.
Lin, Dekang. 1997. Using syntactic dependency as local context to resolve word sense ambiguity. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), Madrid, 64-71.
Magnini, Bernardo, Carlo Strapparava, Giovani Pezzulo & Alfio Gliozzo. 2001. Using domain information for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, France, 111-114.
MartÃnez, David, Eneko Agirre & Lluis Márquez. 2002. Syntactic features for high precision word sense disambiguation. Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan.
McCarthy, Diana, John Carroll & Judita Preiss. 2001. Disambiguating noun and verb senses using automatically acquired selectional preferences. Proceedings of the ACL/EACL Senseval-2 Workshop, Toulouse, France.
McCarthy, Diana, Rob Koeling, Julie Weeds & John Carroll. 2004. Using automatically acquired predominant senses for word sense disambiguation. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 151-158.
McRoy, Susan W. 1992. Using multiple knowledge sources for word sense disambiguation. Computational Linguistics, 18(1): 1-30.
Mihalcea, Rada & Dan Moldovan. 2001. Pattern learning and active feature selection for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.
Mihalcea, Rada & Ehsanul Faruque. 2004. SenseLearner: Minimally supervised word sense disambiguation for all words in open text. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 155-158.
Miller, George. A., Claudia Leacock, Randee Tengi & Ross. T. Bunker. 1993. A semantic concordance. Proceedings of the ARPA Workshop on Human Language Technology, 303-308.
Montoyo, Andres & Armando Suárez. 2001. The University of Alicante word sense disambiguation system. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.
Ng, Hwee Tou & Hiang B. Lee. 1996. Integrating multiple knowledge sources for word sense disambiguation: An exemplar-based approach. Proceedings of the 34th Meeting of the Association for Computational Linguistics (ACL), Santa Cruz, CA, USA, 40-47.
Patwardhan, Siddharth, Satanjeev Banerjee & Ted Pedersen. 2003. Using measures of semantic relatedness for word sense disambiguation. Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico.
Pedersen, Ted. 2002. Assessing system agreement and instance difficulty in the lexical sample tasks of Senseval-2. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, PA, USA.
Preiss, Judita. 2001. Anaphora resolution with word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 143-146.
Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. Harlow, UK: Longman Group.
Quinlan, J. Ross. 1993. C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann.
Resnik, Philip. 1997. Selectional preferences and word sense disambiguation. Proceedings of the ACL/SIGLEX Workshop on Tagging Text with Lexical Semantics: What, Why and How?, Washington, DC, USA, 52-57.
Resnik, Philip & David Yarowsky. 1997. A perspective on word sense disambiguation algorithms and their evaluation. Proceedings of the ACL/SIGLEX Workshop Tagging Texts with Lexical Semantics: What, Why and How?, Washingtonn, DC, USA, 79-86.
Sag, Ivan A., Timothy Baldwin, Francis Bond, Ann Copestake & Dan Flickinger. 2002. Multiword expressions: A pain in the neck for NLP. Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico, 1-15.
Small, Steven L. 1980. Word Expert Parsing: A Theory of Distributed Wordbased Natural Language Understanding. Ph.D. Thesis, Department of Computer Science, University of Maryland, USA.
Strapparava, Carlo, Alfio Gliozzo, & Claudio Giuliano. 2004. Pattern abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 229-233.
Stevenson, Mark. 2003. Word Sense Disambiguation: The Case for Combination of Knowledge Sources. Stanford, USA: CSLI Publications.
Stevenson, Mark & Yorick Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3): 321-349.
Vapnik, Vladimir. 1995. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag.
Wilks, Yorick. 1975. A preferential pattern-seeking semantics for natural language inference. Artificial Intelligence, 6: 53-74.
Wilks, Yorick. 1978. Making preferences more active. Artificial Intelligence 11(3): 197-223.
Wilks, Yorick & Mark Stevenson. 1998. The grammar of sense: Using part of speech tags as a first step in semantic disambiguation. Journal of Natural Language Engineering, 4(2): 135-144.
Yarowsky, David. 1992. Word-sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 454-460.
Yarowsky, David. 1996. Three Algorithms for Lexical Ambiguity Resolution, Ph.D. Thesis, School of Computer and Information Science, University of Pennsylvania, USA.
Yarowsky, David, Silviu Cucerzan, Radu Florian, Charles Schafer & Richard Wicentowski. 2001. The Johns Hopkins Senseval-2 system description. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 163-166.
Yarowsky, David & Radu Florian. 2002. Evaluating sense disambiguation across diverse parameter spaces. Journal of Natural Language Engineering, 8(2): 293-310.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Agirre, E., Stevenson, M. (2007). Knowledge Sources for WSD. In: Agirre, E., Edmonds, P. (eds) Word Sense Disambiguation. Text, Speech and Language Technology, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4809-8_8
Download citation
DOI: https://doi.org/10.1007/978-1-4020-4809-8_8
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4808-1
Online ISBN: 978-1-4020-4809-8
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)