Chapter

Computational Linguistics and Intelligent Text Processing

Volume 7816 of the series Lecture Notes in Computer Science, pp. 559-571

Facilitating the Analysis of Discourse Phenomena in an Interoperable NLP Platform

  • Riza Theresa Batista-Navarro, The National Centre for Text Mining, The University of Manchester
  • Georgios Kontonatsios, The National Centre for Text Mining, The University of Manchester
  • Claudiu Mihăilă, The National Centre for Text Mining, The University of Manchester
  • Paul Thompson, The National Centre for Text Mining, The University of Manchester
  • Rafal Rak, The National Centre for Text Mining, The University of Manchester
  • Raheel Nawaz, The National Centre for Text Mining, The University of Manchester
  • Ioannis Korkontzelos, The National Centre for Text Mining, The University of Manchester
  • Sophia Ananiadou, The National Centre for Text Mining, The University of Manchester

Abstract

The analysis of discourse phenomena is essential in many natural language processing (NLP) applications. The growing diversity of available corpora and NLP tools brings with it a multitude of representation formats. To alleviate the problem of incompatible formats when constructing complex text mining pipelines, the Unstructured Information Management Architecture (UIMA) provides a standard means of communication between tools and resources. U-Compare, a text mining workflow construction platform based on UIMA, further enhances interoperability through a shared system of data types, allowing compliant components to be freely combined into workflows. Although U-Compare and its type system already support syntactic and semantic analyses, support for the analysis of discourse phenomena was previously lacking. In response, we have extended the U-Compare type system with new discourse-level types. We illustrate the processing and visualisation of discourse information in U-Compare by providing several new deserialisation components for corpora containing discourse annotations. The new U-Compare is downloadable from http://nactem.ac.uk/ucompare.

Keywords

UIMA, interoperability, U-Compare, discourse, causality, coreference, meta-knowledge