WSD Algorithm Applied to a NLP System

  • Conference paper
Natural Language Processing and Information Systems (NLDB 2000)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 1959)

Abstract

The need for advanced free-text filtering is increasing. When searching for specific keywords, it is desirable to eliminate occurrences in which the word or words are used in an inappropriate sense. Such filtering could be exploited in internet browsers and resource discovery systems, relational databases containing free-text fields, electronic document management systems, data warehouse and data mining systems, etc. To address this problem, this paper presents a method for the automatic disambiguation of nouns that uses the notion of Specification Marks and the noun taxonomy of the WordNet lexical knowledge base [8]. The method is applied to a Natural Language Processing (NLP) system. It resolves the lexical ambiguity of nouns in any sort of text. Although it relies on the semantic relations (hypernymy/hyponymy) and the hierarchical organization of WordNet, it requires no training process, no hand-coding of lexical entries, and no hand-tagging of texts. The method was evaluated on both the Semantic Concordance Corpus (Semcor) [9] and Microsoft’s electronic encyclopaedia ("Microsoft 98 Encarta Encyclopaedia Deluxe"). The percentages of correct resolutions achieved on these two corpora were 65.8% for Semcor and 65.6% for Encarta. These percentages show that comparable results were obtained on corpora from different domains, so the proposed method can be applied successfully to any corpus.
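
The abstract does not spell out the Specification Marks procedure, so the following is only a minimal sketch of the general idea of disambiguating a noun through hypernymy links, not the authors' algorithm. The taxonomy, the sense labels (such as `plant#flora`), and the function names `ancestors` and `disambiguate` are all assumptions made up for illustration; they are not real WordNet identifiers. Each candidate sense of the target noun is scored by how many context nouns have some sense that shares a non-root hypernym with it.

```python
# Toy hypernym links (child -> parent). All labels are hypothetical and
# chosen for illustration; they are not real WordNet synset identifiers.
ROOT = "entity"
HYPERNYM = {
    "plant#factory": "building",
    "building": "artifact",
    "artifact": ROOT,
    "plant#flora": "organism",
    "organism": ROOT,
    "tree#woody": "plant#flora",
    "tree#diagram": "figure",
    "figure": "shape",
    "shape": ROOT,
    "leaf#foliage": "plant_part",
    "plant_part": "organism",
    "leaf#page": "sheet",
    "sheet": "artifact",
}

# Candidate senses for each noun (again, an assumed toy inventory).
SENSES = {
    "plant": ["plant#factory", "plant#flora"],
    "tree": ["tree#woody", "tree#diagram"],
    "leaf": ["leaf#foliage", "leaf#page"],
}


def ancestors(sense):
    """Return the sense plus all of its hypernyms, excluding the root."""
    chain = set()
    node = sense
    while node is not None and node != ROOT:
        chain.add(node)
        node = HYPERNYM.get(node)
    return chain


def disambiguate(target, context):
    """Score each sense of `target` by the number of context nouns that
    have at least one sense sharing a non-root hypernym with it."""
    scores = {}
    for sense in SENSES[target]:
        own = ancestors(sense)
        scores[sense] = sum(
            1
            for word in context
            if any(own & ancestors(s) for s in SENSES.get(word, []))
        )
    best = max(scores, key=scores.get)
    return best, scores


best, scores = disambiguate("plant", ["tree", "leaf"])
print(best, scores)
```

In this toy setting the "flora" sense of "plant" is supported by both context nouns, while the "factory" sense is supported only by the "page" sense of "leaf". Like the method described in the abstract, the sketch needs no training data or hand-tagged text, only the taxonomy itself.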

References

  1. Agirre E. and Rigau G. (1996) Word Sense Disambiguation using Conceptual Density. Proc. 16th International Conference on Computational Linguistics (COLING). Copenhagen.

  2. Cowie J., Guthrie J. and Guthrie L. (1992) Lexical disambiguation using simulated annealing. Proc. DARPA Workshop on Speech and Natural Language. 238–242. New York.

  3. McHale M. L. A comparison of WordNet and Roget’s taxonomy for measuring semantic similarity.

  4. Ide N. and Véronis J. (1998) Introduction to the Special Issue on Word Sense Disambiguation: The State of the Art. Computational Linguistics. 24 (1), 1–40.

  5. Lesk, M. (1986) Automatic sense disambiguation using machine readable dictionaries: How to tell a pine cone from an ice cream cone. Proc. 1986 SIGDOC Conference, ACM 24–26, New York.

  6. McRoy S. (1992) Using Multiple Knowledge Sources for Word Sense Discrimination. Computational Linguistics 18 (1).

  7. Mihalcea R. and Moldovan D. (1999) A Method for word sense disambiguation of unrestricted text. Proc. 37th Annual Meeting of the ACL 152–158, Maryland, USA.

  8. Miller G. A., Beckwith R., Fellbaum C., Gross D., and Miller K. J. (1990) WordNet: An online lexical database. International Journal of Lexicography, 3(4): 235–244.

  9. Miller G., Leacock C., Tengi R. and Bunker R. (1993) A Semantic Concordance. Proc. 3rd DARPA Workshop on Human Language Technology, 303–308, Plainsboro, New Jersey.

  10. Resnik P. (1995) Disambiguating noun groupings with respect to WordNet senses. Proc. Third Workshop on Very Large Corpora. 54–68. Cambridge, MA.

  11. Resnik P. and Yarowsky D. (1997) A perspective on word sense disambiguation methods and their evaluation. Proc. ACL SIGLEX Workshop on Tagging Text with Lexical Semantics: Why, What and How?, Washington DC.

  12. Resnik P. (1999) Semantic similarity in a taxonomy: an information-based measure and its application to problems of ambiguity in natural language. Journal of Artificial Intelligence Research 11, 95–130.

  13. Rigau G., Atserias J. and Agirre E. (1997) Combining Unsupervised Lexical Knowledge Methods for Word Sense Disambiguation. Proc. 35th Annual Meeting of the ACL, 48–55, Madrid, Spain.

  14. Slator B. and Wilks Y. (1987) Towards semantic structures from dictionary entries. Proc. 2nd Annual Rocky Mountain Conference on Artificial Intelligence, 85–96. Boulder, CO.

  15. Stetina J., Kurohashi S. and Nagao M. (1998) General word sense disambiguation method based on full sentential context. In Usage of WordNet in Natural Language Processing. COLING-ACL Workshop, Montreal, Canada.

  16. Sussna M. (1993) Word sense disambiguation for free-text indexing using a massive semantic network. Proc. Second International CIKM, 67–74, Arlington, VA.

  17. Voorhees E. (1993) Using WordNet to disambiguate word senses for text retrieval. Proc. 16th Annual International ACM SIGIR Conference on Research and Development in Information Retrieval. 171–180, Pittsburgh, PA.

  18. Wilks Y., Fass D., Guo C., McDonald J., Plate T. and Slator B. (1993) Providing Machine Tractable Dictionary Tools. In Semantics and the Lexicon (Pustejovsky J. Ed.) 341–401.

  19. Wilks Y. and Stevenson M. (1996) The grammar of sense: Is word sense tagging much more than part-of-speech tagging? Technical Report CS-96-05, University of Sheffield, UK.

  20. Yarowsky D. (1992) Word sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proc. 14th COLING, 454–460, Nantes, France.

  21. Yarowsky D. (1995) Unsupervised word sense disambiguation rivaling supervised methods. Proc. 33rd Annual Meeting of the ACL.

Copyright information

© 2001 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Montoyo, A., Palomar, M. (2001). WSD Algorithm Applied to a NLP System. In: Bouzeghoub, M., Kedad, Z., Métais, E. (eds) Natural Language Processing and Information Systems. NLDB 2000. Lecture Notes in Computer Science, vol 1959. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45399-7_5

  • DOI: https://doi.org/10.1007/3-540-45399-7_5

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-41943-3

  • Online ISBN: 978-3-540-45399-4

  • eBook Packages: Springer Book Archive
