Skip to main content

Concrete Sentence Spaces for Compositional Distributional Models of Meaning

  • Chapter

Part of the book series: Text, Speech and Language Technology ((TLTB,volume 47))

Abstract

Coecke et al. (2010) developed a compositional model of meaning for distributional semantics, in which each word in a sentence has a meaning vector and the distributional meaning of the sentence is a function of the tensor products of the word vectors. Abstractly speaking, this function is the morphism corresponding to the grammatical structure of the sentence in the category of finite dimensional vector spaces. In this chapter, we provide a concrete method for implementing this linear meaning map by presenting an algorithm for computing representations for various syntactic classes which have functional types; this algorithm results in assigning concrete corpus-based vector spaces to the abstract type of ‘sentence’. Our construction method is based on structured vector spaces whose basis vectors are pairs of words and grammatical roles. The concrete sentence spaces only depend on the types of the verbs of sentences; we use an embedding of these spaces and compare meanings of sentences with different grammatical structures by simply taking the inner product of their vectors in the bigger space. Our constructions are exemplified on a toy corpus.

This is a preview of subscription content, log in via an institution.

Buying options

Chapter
USD   29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD   99.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD   129.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Notes

  1. 1.

    Intransitive and ditransitive verbs are interpreted in an analogous fashion; see Sect. 5.

References

  • Baroni, M., & Zamparelli, R. (2010). Nouns are vectors, adjectives are matrices: Representing adjective-noun constructions in semantic space. In Conference on empirical methods in natural language processing (EMNLP-10), Cambridge.

    Google Scholar 

  • Clark, S., & Pulman, S. (2007). Combining symbolic and distributional models of meaning. In Proceedings of AAAI spring symposium on quantum interaction. Menlo Park: AAAI Press.

    Google Scholar 

  • Coecke, B., Sadrzadeh, M., & Clark, S. (2010). Mathematical foundations for a compositional distributional model of meaning. Linguistic Analysis, 36, 345–384 (Lambek Festschrift). arXiv:1003.4394v1 [cs.CL].

    Google Scholar 

  • Erk, K., & Padó, S. (2008). A structured vector space model for word meaning in context. In Conference on empirical methods in natural language processing (EMNLP-08), Honolulu, Hawaii (pp. 897–906).

    Chapter  Google Scholar 

  • Firth, J. (1957). Papers in linguistics, 1934–1951. London: Oxford University Press.

    Google Scholar 

  • Frege, G. (1892). Über Sinn und Bedeutung. Zeitschrift fuer Philosophie un philosophische Kritik, 100, 25–50.

    Google Scholar 

  • Grefenstette, G. (1992). Use of syntactic context to produce term association lists for text retrieval. In N. J. Belkin, P. Ingwersen, & A. M. Pejtersen (Eds.), SIGIR (pp. 89–97). New York: ACM.

    Google Scholar 

  • Grefenstette, G. (1994). Explorations in automatic thesaurus discovery. Boston: Kluwer Academic.

    Book  MATH  Google Scholar 

  • Harris, Z. (1968). Mathematical structures of language. New York: Wiley-Interscience.

    MATH  Google Scholar 

  • Jurafsky, D., & Martin, J. H. (2000). Speech and language processing: An introduction to natural language processing, computational linguistics, and speech recognition. New York: Prentice Hall.

    Google Scholar 

  • Lambek, J. (2008). From word to sentence. Monza: Polimetrica.

    MATH  Google Scholar 

  • Landauer, T., & Dumais, S. (1997). A solution to Plato’s problem: The latent semantic analysis theory of acquisition, induction, and representation of knowledge. Psychological Review, 104(2), 211–240.

    Article  Google Scholar 

  • Manning, C., Raghavan, P., & Schutze, H. (2008). Introduction to information retrieval (Vol. 1). Cambridge: Cambridge University Press.

    Book  MATH  Google Scholar 

  • Mitchell, J., & Lapata, M. (2008). Vector-based models of semantic composition. In Proceedings of the 46th annual meeting of the association for computational linguistics, Columbus (pp. 236–244).

    Google Scholar 

  • Montague, R. (1970). English as a formal language. In Linguaggi nella societae nella tecnica (pp. 189–224).

    Google Scholar 

  • Schuetze, H. (1998). Automatic word sense discrimination. Computational Linguistics, 24(1), 97–123.

    Google Scholar 

  • Smolensky, P., & Legendre, G. (2005). The harmonic mind: From neural computation to optimality-theoretic grammar. Vol. I: Cognitive architecture. Vol. II: Linguistic and philosophical implications. Cambridge: MIT Press.

    Google Scholar 

  • Widdows, D. (2004). Geometry and meaning. Stanford: CSLI.

    MATH  Google Scholar 

  • Wittgenstein, L. (1953). Philosophical investigations (trans: Anscombe, G. E. M.). Oxford: Blackwell Sci.

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Edward Grefenstette .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer Science+Business Media Dordrecht

About this chapter

Cite this chapter

Grefenstette, E., Sadrzadeh, M., Clark, S., Coecke, B., Pulman, S. (2014). Concrete Sentence Spaces for Compositional Distributional Models of Meaning. In: Bunt, H., Bos, J., Pulman, S. (eds) Computing Meaning. Text, Speech and Language Technology, vol 47. Springer, Dordrecht. https://doi.org/10.1007/978-94-007-7284-7_5

Download citation

  • DOI: https://doi.org/10.1007/978-94-007-7284-7_5

  • Publisher Name: Springer, Dordrecht

  • Print ISBN: 978-94-007-7283-0

  • Online ISBN: 978-94-007-7284-7

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics