Multiword Expressions: A Pain in the Neck for NLP

  • Ivan A. Sag
  • Timothy Baldwin
  • Francis Bond
  • Ann Copestake
  • Dan Flickinger
Conference paper

DOI: 10.1007/3-540-45715-1_1

Part of the Lecture Notes in Computer Science book series (LNCS, volume 2276)
Cite this paper as:
Sag I.A., Baldwin T., Bond F., Copestake A., Flickinger D. (2002) Multiword Expressions: A Pain in the Neck for NLP. In: Gelbukh A. (eds) Computational Linguistics and Intelligent Text Processing. CICLing 2002. Lecture Notes in Computer Science, vol 2276. Springer, Berlin, Heidelberg

Abstract

Multiword expressions are a key problem for the development of large-scale, linguistically sound natural language processing technology. This paper surveys the problem and some currently available analytic techniques. The various kinds of multiword expressions should be analyzed in distinct ways, including listing “words with spaces”, hierarchically organized lexicons, restricted combinatoric rules, lexical selection, “idiomatic constructions” and simple statistical affinity. An adequate comprehensive analysis of multiword expressions must employ both symbolic and statistical techniques.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

Copyright information

© Springer-Verlag Berlin Heidelberg 2002

Authors and Affiliations

  • Ivan A. Sag
    • 1
  • Timothy Baldwin
    • 1
  • Francis Bond
    • 2
  • Ann Copestake
    • 3
  • Dan Flickinger
    • 1
  1. 1.CSLI, Ventura HallStanford University StanfordUSA
  2. 2.NTT Communication Science Labs.KyotoJapan
  3. 3.Computer LaboratoryUniversity of CambridgeCambridgeUK

Personalised recommendations