Language Resources and Evaluation

, Volume 47, Issue 4, pp 945–971

Glissando: a corpus for multidisciplinary prosodic studies in Spanish and Catalan

  • Juan María Garrido
  • David Escudero
  • Lourdes Aguilar
  • Valentín Cardeñoso
  • Emma Rodero
  • Carme de-la-Mota
  • César González
  • Carlos Vivaracho
  • Sílvia Rustullet
  • Olatz Larrea
  • Yesika Laplaza
  • Francisco Vizcaíno
  • Eva Estebas
  • Mercedes Cabrera
  • Antonio Bonafonte
Original Paper

DOI: 10.1007/s10579-012-9213-0

Cite this article as:
Garrido, J.M., Escudero, D., Aguilar, L. et al. Lang Resources & Evaluation (2013) 47: 945. doi:10.1007/s10579-012-9213-0

Abstract

Literature review on prosody reveals the lack of corpora for prosodic studies in Catalan and Spanish. In this paper, we present a corpus intended to fill this gap. The corpus comprises two distinct data-sets, a news subcorpus and a dialogue subcorpus, the latter containing either conversational or task-oriented speech. More than 25 h were recorded by twenty eight speakers per language. Among these speakers, eight were professional (four radio news broadcasters and four advertising actors). The entire material presented here has been transcribed, aligned with the acoustic signal and prosodically annotated. Two major objectives have guided the design of this project: (i) to offer a wide coverage of representative real-life communicative situations which allow for the characterization of prosody in these two languages; and (ii) to conduct research studies which enable us to contrast the speakers different speaking styles and discursive practices. All material contained in the corpus is provided under a Creative Commons Attribution 3.0 Unported License.

Keywords

Prosodic corpus Radio news corpus Dialogue corpus Spanish corpus Catalan corpus 

Copyright information

© Springer Science+Business Media Dordrecht 2013

Authors and Affiliations

  • Juan María Garrido
    • 1
  • David Escudero
    • 2
  • Lourdes Aguilar
    • 3
  • Valentín Cardeñoso
    • 2
  • Emma Rodero
    • 4
  • Carme de-la-Mota
    • 3
  • César González
    • 2
  • Carlos Vivaracho
    • 2
  • Sílvia Rustullet
    • 1
  • Olatz Larrea
    • 4
  • Yesika Laplaza
    • 1
  • Francisco Vizcaíno
    • 5
  • Eva Estebas
    • 6
  • Mercedes Cabrera
    • 5
  • Antonio Bonafonte
    • 7
  1. 1.Computational Linguistics Group (GLiCom), Department of Translation and Language SciencesUniversitat Pompeu FabraBarcelonaSpain
  2. 2.Department of Computer SciencesUniversidad de ValladolidValladolidSpain
  3. 3.Department of Spanish PhilologyUniversitat Autònoma de BarcelonaBarcelonaSpain
  4. 4.Department of CommunicationUniversitat Pompeu FabraBarcelonaSpain
  5. 5.Department of Modern LanguagesUniversidad de las Palmas de Gran CanariaLas Palmas de Gran CanariaSpain
  6. 6.Department of Modern LanguagesUniversidad Nacional de Educación a DistanciaMadridSpain
  7. 7.Department of Signal Theory and CommunicationsUniversitat Politècnica de CatalunyaBarcelonaSpain