Knowledge Sources for WSD

  • Eneko Agirre
  • Mark Stevenson
Part of the Text, Speech and Language Technology book series (TLTB, volume 33)

This chapter explores the different sources of linguistic knowledge that can be employed by WSD systems. These are more abstract than the features used by WSD algorithms, which are encoded at the algorithmic level and normally extracted from a lexical resource or corpora. The chapter begins by listing a comprehensive set of knowledge sources with examples of their application and then explains whether this linguistic knowledge may be found in corpora, lexical knowledge bases or machine readable dictionaries. An analysis of knowledge sources used in actual WSD systems is then presented. It has been observed that the best results are often obtained by combining knowledge sources and the chapter concludes by analyzing experiments on the effect of different knowledge sources which have implications about the effectiveness of each.

Keywords

Target Word Knowledge Source Ambiguous Word Word Sense Grammatical Category 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Agirre, Eneko & David Martínez. 2001a. Knowledge sources for word sense disambiguation. Proceedings of the Fourth International Conference on Text Speech and Dialogue (TSD), Plzen, Czech Republic.Google Scholar
  2. Agirre, Eneko & David Martínez. 2001b. Learning class-to-class selectional preferences. Proceedings of the ACL/EACL Workshop on Computational Natural Language Learning (CoNLL), Toulouse, France.Google Scholar
  3. Agirre, Eneko & German Rigau. 1996. Word sense disambiguation using conceptual density. Proceedings of the 16th International Conference on Computational Linguistics (COLING), Copenhagen, Denmark.Google Scholar
  4. ALPAC. 1966. Languages and Machines: Computers in Translation and Linguistics. National Research Council Publication 1416, Washington, USA.Google Scholar
  5. Bar-Hillel, Yehoshua. 1964. Language and Information Addison-Wesley, New York, USA.Google Scholar
  6. Bateman, John A., Robert Kasper, Johanna Moore & Richard A. Whitney. 1990. A General Organization of Knowledge for Natural Language Processing: The PENMAN Upper Model. Technical report, USC/Information Sciences Institute, Marina del Rey, USA.Google Scholar
  7. Boguraev, Branimir. 1979. Automatic Resolution of Linguistic Ambiguities. Ph.D. Thesis, Computer Laboratory, University of Cambridge, Cambridge, UK.Google Scholar
  8. Brill, Eric. 1995. Transformation-based error-driven learning and natural language processing: A case study in part of speech tagging. Computational Linguistics, 21 (4):543-566.Google Scholar
  9. Brown, Peter F., Stephen A. Della Pietra, Vincent J. Della Pietra & Robert L. Mercer. 1991. Word sense disambiguation using statistical methods. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics (ACL), Berkeley, USA, 264-270.CrossRefGoogle Scholar
  10. Bruce, Rebecca & Louise Guthrie. 1992. Genus disambiguation: A study in weighted preference. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 1187-1191.CrossRefGoogle Scholar
  11. Carroll, John & Ted Briscoe. 2001. High precision extraction of grammatical relations. Proceedings of the 7th ACL/SIGPARSE International Workshop on Parsing Technologies, Beijing, China, 78-89.Google Scholar
  12. Charniak, Eugene. 1983. Marker Passing: A Theory of Contextual Influence in Language Comprehension. Cognitive Science 7.Google Scholar
  13. Chapman, Robert L. 1977. Roget’s International Thesaurus, Fourth Edition. Harper and Row, New York, USA.Google Scholar
  14. Cowie, Jim, Louise Guthrie & Joe Guthrie. 1992. Lexical disambiguation using simulated annealing. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 359-365.CrossRefGoogle Scholar
  15. Cruse, David. 1998. Lexical Semantics. Cambridge University Press, Cambridge, UK.Google Scholar
  16. Daelemans, Walter, Jakub Zavrel, Ko van der Sloot & Antal van den Bosch. 1999. TiMBL: Tilburg Memory Based Learner, Version 2.0, Reference Guide. ILK Technical Report 99-01, Tilburg University, The Netherlands.Google Scholar
  17. Decadt, Bart, Véronique Hoste, Walter Daelemans, & Antal van den Bosch. 2004. GAMBL, Genetic Algorithm Optimization of Memory-Based WSD. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 108-112.Google Scholar
  18. Dang, Hoa Trang & Martha Palmer. 2002. Combining contextual features for word sense disambiguation. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, USA.Google Scholar
  19. Duda, Richard & Peter E. Hart. 1973. Pattern Classification and Scene Analysis. New York: Wiley.Google Scholar
  20. Elworthy, David. 1994. Does Baum-Welch re-estimation help taggers? Proceedings of the 4th Conference on Applied Natural Language Processing, Stuttgart, Germany, 53-58.CrossRefGoogle Scholar
  21. Fellbaum, Christiane. 1998. WordNet: An Electronic Lexical Database. Massachusetts and London: The MIT Press.Google Scholar
  22. Fernández, David, Julio Gonzalo & Felisa Verdejo. 2001. The UNED systems at Senseval-2. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 75-78.Google Scholar
  23. Fillmore, Charles. 1971. Types of lexical information. Semantics: An interdisciplinary reader in philosophy, linguistics, and psychology. Cambridge: Cambridge University Press, 370-392.Google Scholar
  24. Florian, Radu, Silviu Cucerzan, Charles Schafer & David Yarowsky. 2002. Classifier combination for word sense disambiguation. Journal of Natural Language Engineering, 8(4): 327-341.CrossRefGoogle Scholar
  25. Freund, Yoav & Robert E. Schapire. 1996. Experiments with a new boosting algorithm. Proceedings of the 13th International Conference on Machine Learning, Bari, Italy, 148-156.Google Scholar
  26. Gale, William, Kenneth W. Church & David Yarowsky. 1993. A method for disambiguating word senses in a large corpus. Computers and the Humanities, 26 (5): 415-439.Google Scholar
  27. Hirst, Graeme. 1987. Semantic Interpretation and the Resolution of Ambiguity. Cambridge, UK: Cambridge University Press.Google Scholar
  28. Kilgarriff, Adam. 1997. Putting frequencies in the dictionary. International Journal of Lexicography, 10(2): 135-155CrossRefGoogle Scholar
  29. Kriedler, Charles. 1998. Introducing English Semantics. London and New York: Routledge.Google Scholar
  30. Lee, Yoong K. & Hwee Tou Ng. 2002. An empirical evaluation of knowledge sources and learning algorithms for word sense disambiguation. Proceedings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), Philadelphia, USA, 41-48.CrossRefGoogle Scholar
  31. Lee, Yoong K., Hwee Tou Ng & Tee Kiah Chia. 2004. Supervised word sense disambiguation with support vector machines and multiple knowledge sources. Proceedings of the ACL/EACL Senseval-3 Workshop, Barcelona, Spain, 137-140.Google Scholar
  32. Lesk, Michael. 1986. Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proceedings of SIGDOC-86: 5th International Conference on Systems Documentation, Toronto, Canada, 24-26.CrossRefGoogle Scholar
  33. Lin, Dekang. 1993. Principle based parsing without overgeneration. Proceedings of the 31st Annual Meeting of the Association for Computational Linguistics (ACL), Columbus, USA, 112-120.CrossRefGoogle Scholar
  34. Lin, Dekang. 1997. Using syntactic dependency as local context to resolve word sense ambiguity. Proceedings of the 35th Annual Meeting of the Association for Computational Linguistics (ACL), Madrid, 64-71.Google Scholar
  35. Magnini, Bernardo, Carlo Strapparava, Giovani Pezzulo & Alfio Gliozzo. 2001. Using domain information for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, France, 111-114.Google Scholar
  36. Martínez, David, Eneko Agirre & Lluis Márquez. 2002. Syntactic features for high precision word sense disambiguation. Proceedings of the 19th International Conference on Computational Linguistics (COLING), Taipei, Taiwan.Google Scholar
  37. McCarthy, Diana, John Carroll & Judita Preiss. 2001. Disambiguating noun and verb senses using automatically acquired selectional preferences. Proceedings of the ACL/EACL Senseval-2 Workshop, Toulouse, France.Google Scholar
  38. McCarthy, Diana, Rob Koeling, Julie Weeds & John Carroll. 2004. Using automatically acquired predominant senses for word sense disambiguation. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 151-158.Google Scholar
  39. McRoy, Susan W. 1992. Using multiple knowledge sources for word sense disambiguation. Computational Linguistics, 18(1): 1-30.Google Scholar
  40. Mihalcea, Rada & Dan Moldovan. 2001. Pattern learning and active feature selection for word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.Google Scholar
  41. Mihalcea, Rada & Ehsanul Faruque. 2004. SenseLearner: Minimally supervised word sense disambiguation for all words in open text. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 155-158.Google Scholar
  42. Miller, George. A., Claudia Leacock, Randee Tengi & Ross. T. Bunker. 1993. A semantic concordance. Proceedings of the ARPA Workshop on Human Language Technology, 303-308.Google Scholar
  43. Montoyo, Andres & Armando Suárez. 2001. The University of Alicante word sense disambiguation system. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France.Google Scholar
  44. Ng, Hwee Tou & Hiang B. Lee. 1996. Integrating multiple knowledge sources for word sense disambiguation: An exemplar-based approach. Proceedings of the 34th Meeting of the Association for Computational Linguistics (ACL), Santa Cruz, CA, USA, 40-47.CrossRefGoogle Scholar
  45. Patwardhan, Siddharth, Satanjeev Banerjee & Ted Pedersen. 2003. Using measures of semantic relatedness for word sense disambiguation. Proceedings of the Fourth International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico.Google Scholar
  46. Pedersen, Ted. 2002. Assessing system agreement and instance difficulty in the lexical sample tasks of Senseval-2. Proceedings of the ACL Workshop on Word Sense Disambiguation: Recent Successes and Future Directions, Philadelphia, PA, USA.Google Scholar
  47. Preiss, Judita. 2001. Anaphora resolution with word sense disambiguation. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 143-146.Google Scholar
  48. Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. Harlow, UK: Longman Group.Google Scholar
  49. Quinlan, J. Ross. 1993. C4.5: Programs for Machine Learning. San Francisco: Morgan Kaufmann.Google Scholar
  50. Resnik, Philip. 1997. Selectional preferences and word sense disambiguation. Proceedings of the ACL/SIGLEX Workshop on Tagging Text with Lexical Semantics: What, Why and How?, Washington, DC, USA, 52-57.Google Scholar
  51. Resnik, Philip & David Yarowsky. 1997. A perspective on word sense disambiguation algorithms and their evaluation. Proceedings of the ACL/SIGLEX Workshop Tagging Texts with Lexical Semantics: What, Why and How?, Washingtonn, DC, USA, 79-86.Google Scholar
  52. Sag, Ivan A., Timothy Baldwin, Francis Bond, Ann Copestake & Dan Flickinger. 2002. Multiword expressions: A pain in the neck for NLP. Proceedings of the 3rd International Conference on Intelligent Text Processing and Computational Linguistics (CICLing), Mexico City, Mexico, 1-15.Google Scholar
  53. Small, Steven L. 1980. Word Expert Parsing: A Theory of Distributed Wordbased Natural Language Understanding. Ph.D. Thesis, Department of Computer Science, University of Maryland, USA.Google Scholar
  54. Strapparava, Carlo, Alfio Gliozzo, & Claudio Giuliano. 2004. Pattern abstraction and term similarity for word sense disambiguation: IRST at Senseval-3. Proceedings of Senseval-3: Third International Workshop on the Evaluation of Systems for the Semantic Analysis of Text, Barcelona, Spain, 229-233.Google Scholar
  55. Stevenson, Mark. 2003. Word Sense Disambiguation: The Case for Combination of Knowledge Sources. Stanford, USA: CSLI Publications.Google Scholar
  56. Stevenson, Mark & Yorick Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3): 321-349.CrossRefGoogle Scholar
  57. Vapnik, Vladimir. 1995. The Nature of Statistical Learning Theory. New York, USA: Springer-Verlag.Google Scholar
  58. Wilks, Yorick. 1975. A preferential pattern-seeking semantics for natural language inference. Artificial Intelligence, 6: 53-74.CrossRefGoogle Scholar
  59. Wilks, Yorick. 1978. Making preferences more active. Artificial Intelligence 11(3): 197-223.CrossRefGoogle Scholar
  60. Wilks, Yorick & Mark Stevenson. 1998. The grammar of sense: Using part of speech tags as a first step in semantic disambiguation. Journal of Natural Language Engineering, 4(2): 135-144.CrossRefGoogle Scholar
  61. Yarowsky, David. 1992. Word-sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proceedings of the 14th International Conference on Computational Linguistics (COLING), Nantes, France, 454-460.Google Scholar
  62. Yarowsky, David. 1996. Three Algorithms for Lexical Ambiguity Resolution, Ph.D. Thesis, School of Computer and Information Science, University of Pennsylvania, USA.Google Scholar
  63. Yarowsky, David, Silviu Cucerzan, Radu Florian, Charles Schafer & Richard Wicentowski. 2001. The Johns Hopkins Senseval-2 system description. Proceedings of Senseval-2: Second International Workshop on Evaluating Word Sense Disambiguation Systems, Toulouse, France, 163-166.Google Scholar
  64. Yarowsky, David & Radu Florian. 2002. Evaluating sense disambiguation across diverse parameter spaces. Journal of Natural Language Engineering, 8(2): 293-310.CrossRefGoogle Scholar

Copyright information

© Springer 2007

Authors and Affiliations

  • Eneko Agirre
    • 1
  • Mark Stevenson
    • 2
  1. 1.Department of Computer ScienceUniversity of the Basque CountryDonostiaSpain
  2. 2.Department of Computer ScienceUniversity of SheffieldUK

Personalised recommendations