SFU ReviewSP-NEG: a Spanish corpus annotated with negation for sentiment analysis. A typology of negation patterns

  • Salud María Jiménez-Zafra
  • Mariona Taulé
  • M. Teresa Martín-Valdivia
  • L. Alfonso Ureña-López
  • M. Antónia Martí
Original Paper

Abstract

In this paper, we present SFU ReviewSP-NEG, the first Spanish corpus annotated with negation with a wide coverage freely available. We describe the methodology applied in the annotation of the corpus including the tagset, the linguistic criteria and the inter-annotator agreement tests. We also include a complete typology of negation patterns in Spanish. This typology has the advantage that it is easy to express in terms of a tagset for corpus annotation: the types are clearly defined, which avoids ambiguity in the annotation process, and they provide wide coverage (i.e. they resolved all the cases occurring in the corpus). We use the SFU ReviewSP as a base in order to make the annotations. The corpus consists of 400 reviews, 221,866 words and 9455 sentences, out of which 3022 sentences contain at least one negation structure.

Keywords

Annotation of negation Scope of negation Polarity annotation Sentiment analysis 

Copyright information

© Springer Science+Business Media Dordrecht 2017

Authors and Affiliations

  1. 1.Department of Computer ScienceUniversidad de JaénJaénSpain
  2. 2.CLiC, Centre de Llenguatge i Computació, Department of LinguisticsUniversity of BarcelonaBarcelonaSpain

Personalised recommendations