Detection of Semantic Compositionality Using Semantic Spaces

  • Lubomír Krčmář
  • Karel Ježek
  • Massimo Poesio
Conference paper

DOI: 10.1007/978-3-642-32790-2_43

Part of the Lecture Notes in Computer Science book series (LNCS, volume 7499)
Cite this paper as:
Krčmář L., Ježek K., Poesio M. (2012) Detection of Semantic Compositionality Using Semantic Spaces. In: Sojka P., Horák A., Kopeček I., Pala K. (eds) Text, Speech and Dialogue. TSD 2012. Lecture Notes in Computer Science, vol 7499. Springer, Berlin, Heidelberg

Abstract

Any Natural Language Processing (NLP) system that does semantic processing relies on the assumption of semantic compositionality: the meaning of a compound is determined by the meaning of its parts and their combination. However, the compositionality assumption does not hold for many idiomatic expressions such as “blue chip”. This paper focuses on the fully automatic detection of these, further referred to as non-compositional compounds.

We have proposed and tested an intuitive approach based on replacing the parts of compounds by semantically related words. Our models determining the compositionality combine simple statistic ideas with the COALS semantic space. For the evaluation, the shared dataset for the Distributional Semantics and Compositionality 2011 workshop (DISCO 2011) is used. A comparison of our approach with the traditionally used Pointwise Mutual Information (PMI) is also presented. Our best models outperform all the systems competing in DISCO 2011.

Keywords

DISCO 2011 compositionality semantic space collocations COALS PMI 

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2012

Authors and Affiliations

  • Lubomír Krčmář
    • 1
  • Karel Ježek
    • 1
  • Massimo Poesio
    • 2
  1. 1.Faculty of Applied SciencesUniversity of West BohemiaCzech Republic
  2. 2.University of EssexUnited Kingdom

Personalised recommendations