Date: 25 May 2000

Parsing and Collocations

Abstract

Proper treatment of collocations constitutes a serious challenge for NLP systems in general. This paper describes how Fips, a “Principles and Parameters” grammar-based parser developed at LATL, handles multi-word expressions. To obtain more precise and more reliable collocation data, the Fips parser is used to extract collocations from large text corpora. It will be shown that collocational information can help rank the alternative analyses computed by the parser, thereby improving the quality of its results.

Thanks to Paola Merlo, Luka Nerima, Juri Mengon, and Stephanie Durrleman for comments on an earlier version of this paper. This research was supported in part by a grant from the Swiss Commission for Technology and Innovation (CTI).