Language Resources and Evaluation

, Volume 44, Issue 3, pp 221–261

The MATCH corpus: a corpus of older and younger users’ interactions with spoken dialogue systems

  • Kallirroi Georgila
  • Maria Wolters
  • Johanna D. Moore
  • Robert H. Logie
Article

DOI: 10.1007/s10579-010-9118-8

Cite this article as:
Georgila, K., Wolters, M., Moore, J.D. et al. Lang Resources & Evaluation (2010) 44: 221. doi:10.1007/s10579-010-9118-8

Abstract

We present the MATCH corpus, a unique data set of 447 dialogues in which 26 older and 24 younger adults interact with nine different spoken dialogue systems. The systems varied in the number of options presented and the confirmation strategy used. The corpus also contains information about the users’ cognitive abilities and detailed usability assessments of each dialogue system. The corpus, which was collected using a Wizard-of-Oz methodology, has been fully transcribed and annotated with dialogue acts and “Information State Update” (ISU) representations of dialogue context. Dialogue act and ISU annotations were performed semi-automatically. In addition to describing the corpus collection and annotation, we present a quantitative analysis of the interaction behaviour of older and younger users and discuss further applications of the corpus. We expect that the corpus will provide a key resource for modelling older people’s interaction with spoken dialogue systems.

Keywords

Spoken dialogue corporaSpoken dialogue systemsCognitive ageingAnnotationInformation statesSpeech actsUser simulationsSpeech recognition

Copyright information

© Springer Science+Business Media B.V. 2010

Authors and Affiliations

  • Kallirroi Georgila
    • 1
  • Maria Wolters
    • 2
  • Johanna D. Moore
    • 2
  • Robert H. Logie
    • 3
  1. 1.Institute for Creative TechnologiesUniversity of Southern CaliforniaMarina del ReyUSA
  2. 2.Human Communication Research CentreUniversity of EdinburghEdinburghUK
  3. 3.Centre for Cognitive Ageing and Cognitive EpidemiologyUniversity of EdinburghEdinburghUK