Word Sense Representations

White, Lyndon; Togneri, Roberto; Liu, Wei; Bennamoun, Mohammed

doi:10.1007/978-981-13-0062-2_4

Lyndon White⁶,
Roberto Togneri⁶,
Wei Liu⁷ &
…
Mohammed Bennamoun⁷

Part of the book series: Studies in Computational Intelligence ((SCI,volume 783))

677 Accesses

Abstract

In this chapter, techniques for representing the multiple meanings of a single word are discussed. This is a growing area, and is particularly important in languages where polysemous and homonymous words are common. This includes English, but it is even more prevalent in Mandarin for example. The techniques discussed can broadly be classified as lexical word sense representation, and as word sense induction. The inductive techniques can be sub-classified as clustering -based or as prediction-based.

1a. In a literal, exact, or actual sense; not figuratively, allegorically, etc

1b. Used to indicate that the following word or phrase must be taken in its literal sense, usually to add emphasis

1c. colloq. Used to indicate that some (frequently conventional) metaphorical or hyperbolical expression is to be taken in the strongest admissible sense: ‘virtually, as good as’; (also) ‘completely, utterly, absolutely’ ...

2a With reference to a version of something, as a transcription, translation, etc.: in the very words, word for word

2b. In extended use. With exact fidelity of representation; faithfully

3a. With or by the letters (of a word). Obs. rare

3b. In or with regard to letters or literature. Obs. rare

The seven senses of literally, Oxford English

Dictionary, 3rd ed., 2011

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 79.99; Price excludes VAT (USA)

Softcover Book: USD 99.99; Price excludes VAT (USA)

Hardcover Book: USD 99.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Agirre, Eneko and Aitor Soroa. 2007. Semeval-2007 task 02: Evaluating word sense induction and discrimination systems. In Proceedings of the 4th international workshop on semantic evaluations. SemEval ’07 , 7–12. Prague, Czech Republic: Association for Computational Linguistics.
Google Scholar
Agirre, Eneko, David Martínez, Oier López De Lacalle, and Aitor Soroa. 2006. Evaluating and optimizing the parameters of an unsupervised graph-based WSD algorithm. In Proceedings of the first workshop on graph based methods for natural language processing, 89–96. Association for Computational Linguistics.
Google Scholar
Bartunov, Sergey, Dmitry Kondrashkin, Anton Osokin, and Dmitry P. Vetrov. 2015. Breaking sticks and ambiguities with adaptive skip-gram. In CoRR. arXiv:1502.07257.
Basile, Pierpaolo, Annalina Caputo, and Giovanni Semeraro. 2014. An Enhanced lesk word sense disambiguation algorithm through a distributional semantic model. In Proceedings of COLING 2014, the 25th international conference on computational linguistics: Technical papers. Dublin, 1591–1600. Ireland: Dublin City University and Association for Computational Linguistics.
Google Scholar
Bird, Steven, Ewan Klein, and Edward Loper. 2009. Natural language processing with Python. O’Reilly Media, Inc.
Google Scholar
Chen, Xinxiong, Zhiyuan Liu, and Maosong Sun. 2014. A unified model for word sense representation and disambiguation. In EMNLP (Citeseer), 1025–1035.
Google Scholar
De Smedt, Tom and Walter Daelemans. 2012. Pattern for python. The Journal of Machine Learning Research 13 (1): 2063–2067.
Google Scholar
Deng, J., W. Dong, R. Socher, L.-J. Li, K. Li, and L. Fei-Fei. 2009. ImageNet: A large-scale hierarchical image database. In CVPR09.
Google Scholar
Finkelstein, Lev, Evgeniy Gabrilovich, Yossi Matias, Ehud Rivlin, Zach Solan, Gadi Wolfman, and Eytan Ruppin. 2001. Placing search in context: The concept revisited. In Proceedings of the 10th international conference on World Wide Web, 406–414. ACM.
Google Scholar
Frey, Brendan J., and Delbert Dueck. 2007. Clustering by passing messages between data points. Science 315 (5814): 972–976.
Article MathSciNet Google Scholar
Huang, Eric H., Richard Socher, Christopher D. Manning, and Andrew Y. Ng. 2012. Improving word representations via global context and multiple word proto-types. In Proceedings of the 50th annual meeting of the association for computational linguistics: Long papers, vol. 1, 873–882. Association for Computational Linguistics.
Google Scholar
Iacobacci, Ignacio, Mohammad Taher Pilehvar, and Roberto Navigli. 2015. SensEmbed: Learning sense embeddings for word and relational similarity. In Proceedings of ACL, 95–105.
Google Scholar
Kågebäck, Mikael, Fredrik Johansson, Richard Johansson, and Devdatt Dubhashi. 2015. Neural context embeddings for automatic discovery of word senses. In Proceedings of NAACL-HLT, 25–32.
Google Scholar
Kilgarriff, Adam. 2004. How dominant is the commonest sense of a word? In Text, speech and dialogue: 7th international conference, TSD 2004, Brno, Czech Republic, September 8–11, 2004. Proceedings, eds. Petr Sojka, Ivan Kopecek, and Karel Pala, 103–111. Berlin, Heidelberg: Springer. ISBN: 978-3-540-30120-2. https://doi.org/10.1007/978-3-540-30120-2_14.
Kleinberg, Jon M. 2003. An impossibility theorem for clustering. In Advances in neural information processing systems, 463–470.
Google Scholar
Levy, Omer and Yoav Goldberg. 2014. Neural word embedding as implicit matrix factorization. In Advances in neural information processing systems, 2177–2185.
Google Scholar
Mihalcea, Rada, Timothy Anatolievich Chklovski, and Adam Kilgarriff. 2004. The senseval-3 english lexical sample task. In Association for computational linguistics.
Google Scholar
Mikolov, Tomas, Kai Chen, Greg Corrado, and Jeffrey Dean. 2013. Efficient estimation of word representations in vector space. arXiv:1301.3781.
Miller, George A. 1995. WordNet: A lexical database for English. Communications of the ACM 38 (11): 39–41.
Article Google Scholar
Moro, Andrea and Roberto Navigli. 2015. SemEval-2015 task 13: Multilingual all-words sense disambiguation and entity linking. In Proceedings of SemEval- 2015.
Google Scholar
Moro, Andrea, Alessandro Raganato, and Roberto Navigli. 2014. Entity linking meets word sense disambiguation: A unified approach. Transactions of the Association for Computational Linguistics (TACL) 2: 231–244.
Google Scholar
Navigli, Roberto and Simone Paolo Ponzetto. 2010. BabelNet: Building a very large multilingual semantic network. In Proceedings of the 48th annual meeting of the association for computational linguistics, 216–225. Association for Computational Linguistics.
Google Scholar
Navigli, Roberto, Kenneth C. Litkowski, and Orin Hargraves. 2007. SemEval- 2007 task 07: Coarse-grained english all-words task. In Proceedings of the 4th international workshop on semantic evaluations. SemEval ’07, 30–35. Prague, Czech Republic: Association for Computational Linguistics.
Google Scholar
Neelakantan, Arvind, Jeevan Shankar, Alexandre Passos, and Andrew McCallum. 2015. Efficient non-parametric estimation of multiple embeddings per word in vector space. arXiv:1504.06654.
Pantel, Patrick and Dekang Lin. 2002. Discovering word senses from text. In Proceedings of the eighth ACM SIGKDD international conference on knowledge discovery and data mining, 613–619. ACM.
Google Scholar
Reisinger, Joseph and Raymond J. Mooney. 2010. Multi-prototype vector-space models of word meaning. In Human language technologies: The 2010 annual conference of the north american chapter of the association for computational linguistics, 109–117. Association for Computational Linguistics.
Google Scholar
Schütze, Hinrich. 1998. Automatic word sense discrimination. Computational Linguistics. 24 (1): 97–123. ISSN: 0891-2017.
Google Scholar
Schwenk, Holger. 2004. Efficient training of large neural networks for language modeling. In 2004 IEEE international joint conference on neural networks, 2004. Proceedings, 4, 3059–3064. IEEE.
Google Scholar
Tengi, Randee I. 1998. Design and implementation of the WordNet lexical database and searching software. WordNet: An electronic lexical database, ed. Christiane (réd.) Fellbaum, 105. Cambridge: The MIT Press.
Google Scholar
Tian, Fei, Hanjun Dai, Jiang Bian, Bin Gao, Rui Zhang, Enhong Chen, and Tie- Yan Liu. 2014. A probabilistic model for learning multi-prototype word embeddings. In COLING, 151–160.
Google Scholar
White, Lyndon, Roberto Togneri, Wei Liu, and Mohammed Bennamoun. 2018. Finding word sense embeddings of known meaning. In 19th international conference on intelligent text processing and computational linguistics (CICLing).
Google Scholar
Zipf, George Kingsley. 1945. The meaning-frequency relationship of words. The Journal of General Psychology 33 (2): 251–256.
Article Google Scholar
Zipf, G.K. 1949. Human behavior and the principle of least effort: An introduction to human ecology. Cambridge: Addison-Wesley Press.
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical, Electronic and Computer Engineering, School of Engineering, Faculty of Engineering and Mathematical Sciences, The University of Western Australia, Perth, WA, Australia
Lyndon White & Roberto Togneri
Department of Computer Science and Software Engineering, School of Physics, Mathematics and Computing, Faculty of Engineering and Mathematical Sciences, The University of Western Australia, Perth, WA, Australia
Wei Liu & Mohammed Bennamoun

Authors

Lyndon White
View author publications
You can also search for this author in PubMed Google Scholar
Roberto Togneri
View author publications
You can also search for this author in PubMed Google Scholar
Wei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Mohammed Bennamoun
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Lyndon White .

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

White, L., Togneri, R., Liu, W., Bennamoun, M. (2019). Word Sense Representations. In: Neural Representations of Natural Language. Studies in Computational Intelligence, vol 783. Springer, Singapore. https://doi.org/10.1007/978-981-13-0062-2_4

Download citation

DOI: https://doi.org/10.1007/978-981-13-0062-2_4
Published: 30 August 2018
Publisher Name: Springer, Singapore
Print ISBN: 978-981-13-0061-5
Online ISBN: 978-981-13-0062-2
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)

Publish with us

Policies and ethics