This chapter describes a number of approaches to word sense disambiguation, which take the wider “semantic space” of ambiguous words into account.Semantic space may be instantiated by a specific domain, task, or application. Approaches discussed include the use of subject codes as specified in dictionaries or manually added to WordNet and similar semantic resources, the extraction of topic signatures through a combined use of a semantic resource and domain-specific corpora, and domainspecific tuning of semantic resources in a top-down or bottom-up fashion.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Agirre, Eneko, Olatz Ansa, Eduard Hovy & David Martínez. 2000. Enriching very large ontologies using the WWW. Proceedings of the Ontology Learning Workshop, European Conference on Artificial Intelligence (ECAI), Berlin, Germany.
Agirre, Eneko, Olatz Ansa, David Martínez & Eduard Hovy. 2001. Enriching WordNet concepts with topic signatures. Proceedings of the NAACL Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA.
Agirre, Eneko & Oier Lopez de Lacalle. 2004. Publicly available topic signatures for all WordNet nominal senses. Proceedings of the 4rd International Conference on Language Resources and Evaluations (LREC). Lisbon, Portugal.
Basili, Roberto, Michelangelo Della Rocca & Maria-Theresa Pazienza. 1997. Contextual word sense tuning and disambiguation. Applied Artificial Intelligence, 11:235-262.
Buitelaar, Paul. 1998. CoreLex: Systematic Polysemy and Underspecification. Ph.D. Thesis, Brandeis University.
Buitelaar, Paul & Bogdan Sacaleanu. 2001. Ranking and selecting synsets by domain relevance. Proceedings of the Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA.
Buitelaar, Paul & Bogdan Sacaleanu. 2002. Extending synsets with medical terms. Proceedings of the First International WordNet Conference, Mysore, India.
Cucchiarelli, Alessandro & Paola Velardi. 1998. Finding a domain-appropriate sense inventory for semantically tagging a corpus. Natural Language Engineering, 4(4): 325-344.
Escudero, Gerard, Lluis Màrquez & German Rigau. 2000. An empirical study of the domain dependence of supervised word sense disambiguation systems. Proceedings of Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC), Hong Kong, China.
Gale, William, Kenneth Church & David Yarowsky. 1992. One sense per discourse. Proceedings of the 4th DARPA Speech and Natural Language Workshop, 233-237.
Gliozzo, Alfio, Bernardo Magnini & Carlo Strapparava. 2004a. Unsupervised domain relevance for word sense disambiguation. Proceedings of the 2004 Conference on Empirical Methods in Natural Language Processing (EMNLP), Barcelona, Spain, 380-387.
Gliozzo, Alfio, Carlo Strapparava & Ido Dagan. 2004b. Unsupervised and supervised exploitation of semantic domains in lexical disambiguation. Computer Speech and Language, 18(3): 275-299.
Gliozzo, Alfio, Claudio Giuliano & Carlo Strapparava, 2005. Domain kernels for word sense disambiguation, Proceedings of the 43rd Annual Meeting of the Association for Computational Linguistics (ACL), Ann Arbor, Michigan, 403-410.
Hearst, Marti & Hinrich Schütze. 1993. Customizing a lexicon to better suit a computational task. Proceedings of the ACL SIGLEX Workshop on the Acquisition of Lexical Knowledge from Text.
Guthrie, Joe A., Louise Guthrie, Yorick Wilks & Homa Aidinejad. 1991. Subject dependent co-occurrence and word sense disambiguation. Proceedings of the 29th Annual Meeting of the Association for Computational Linguistics, 146-152.
Kilgarriff, Adam. 1998. Bridging the gap between lexicon and corpus: Convergence of formalisms. Proceedings of LREC Workshop on Adapting Lexical Resources to Sublanguages and Applications, Granada, Spain.
Koeling, Rob, Diana McCarthy & John Carroll. 2005. Domain-specific sense distributions and predominant sense acquisition. Proceedings of the Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (HLT/EMNLP), 419-426.
Krovetz, Robert. 1998. More than one sense per discourse. Research Memorandum, NEC Labs America, Princeton, NJ.
Lesk, Michael. 1986. Automated sense disambiguation using machine-readable dictionaries: How to tell a pine cone from an ice cream cone. Proceedings of the 1986 ACM SIGDOC Conference, Toronto, Canada, 24-26.
Liu, H, Teller, V. & Friedman, C. 2004. A multi-aspect comparison study of supervised word sense disambiguation. Journal of the American Medical Informatics Association, 11(4): 320-31.
Magnini, Bernardo & Gabriela Cavaglià. 2000. Integrating subject field codes into WordNet. Proceedings of the Second International Conference Language Resources and Evaluation Conference (LREC), Athens, Greece, 1413-1418.
Magnini, Bernardo, Carlo Strapparava, Giovanni Pezzulo & Alfio Gliozzo. 2002. The role of domain information in word sense disambiguation. Natural Language Engineering, 8(4): 359-373.
Magnini, Bernardo & Carlo Strapparava. 2004. User modeling for news web sites with word sense based techniques. User Modeling and User-Adapted Interaction, 14: (2-3): 239-257.
Martínez, David & Eneko Agirre. 2000. One sense per collocation and genre/topic variations. Proceedings of Joint SIGDAT Conference on Empirical Methods in Natural Language Processing and Very Large Corpora (EMNLP/VLC), Hong Kong, China.
McCarthy, Diana, Rob Koeling, Julie Weeds & John Carroll. 2004. Finding predominant senses in untagged text. Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics. Barcelona, Spain, 280-287.
Montoyo, Andres, Armando Suarez, German Rigau, Manuel Palomar. 2005. Combining knowledge- and corpus-based word sense disambiguation methods. Journal of Artificial Intelligence Research, 23: 299-330.
Novischi, Adrian 2004. Combining methods for word sense disambiguation of WordNet glosses. Proceedings of FLAIRS 2004, Florida.
Peh, Li Shiuan & Hwee Tou Ng. 1997. Domain-specific semantic class disambiguation using wordNet. Proceedings of the Fifth Workshop on Very Large Corpora. Beijing & Hong Kong, 56-64.
Procter, Paul, ed. 1978. Longman Dictionary of Contemporary English. London: Longman Group.
Redner, Richard A. & Homer F. Walker, 1984. Mixture densities, maximum likelihood and the EM algorithm. SIAM Review, 26(2): 195-236.
Salton, Gerard & Chris Buckley. 1988. Term-weighting approaches in automatic text retrieval. Information Processing and Management, 24(5): 513-523.
Schuemie, M. J., Kors, J. A. & Mons, B. 2005. Word sense disambiguation in the biomedical domain: An overview. Journal Computational Biology, 12(5): 554-65.
Stevenson, Mark & Yorick Wilks. 2001. The interaction of knowledge sources in word sense disambiguation. Computational Linguistics, 27(3): 321-349.
Turcato, David, Fred Popowich, Janine Toole, Dan Fass, Devlan Nicholson & Gordon Tisher. 2002. Adapting a synonym database to specific domains. Proceedings of the ACL Workshop on Recent Advances in Natural Language Processing and Information Retrieval, Hong Kong.
Vossen, Piek. 2001. Extending, trimming and fusing WordNet for technical documents. Proceedings of the Workshop on WordNet and Other Lexical Resources, Pittsburgh, PA.
Vossen, Piek, German Rigau, Iñaki Alegria, Eneko Agirre, David Farwell & Manuel Fuentes. 2006. Meaningful results for information retrieval in the MEANING project. Proceedings of the 3rd Global Wordnet Conference, Jeju Island, Korea.
Yarowsky, David. 1992. Word sense disambiguation using statistical models of Roget’s categories trained on large corpora. Proceedings of the 14th International Conference on Computational Linguistic (COLING), Nantes, France, 454-460.
Walker, Don & Robert Amsler. 1986. The use of machine readable dictionaries in sublanguage analysis. Analyzing Language in Restricted Domains, ed. by Ralph Grishman & Richard Kittredge, 69-83, Hillsdale, NJ: Erlbaum.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2007 Springer
About this chapter
Cite this chapter
Buitelaar, P., Magnini, B., Strapparava, C., Vossen, P. (2007). Domain-Specific WSD. In: Agirre, E., Edmonds, P. (eds) Word Sense Disambiguation. Text, Speech and Language Technology, vol 33. Springer, Dordrecht. https://doi.org/10.1007/978-1-4020-4809-8_10
Download citation
DOI: https://doi.org/10.1007/978-1-4020-4809-8_10
Publisher Name: Springer, Dordrecht
Print ISBN: 978-1-4020-4808-1
Online ISBN: 978-1-4020-4809-8
eBook Packages: Humanities, Social Sciences and LawSocial Sciences (R0)