Advertisement

The JASMIN Speech Corpus: Recordings of Children, Non-natives and Elderly People

  • Catia Cucchiarini
  • Hugo Van hamme
Chapter
Part of the Theory and Applications of Natural Language Processing book series (NLP)

Abstract

Large speech corpora (LSC) constitute an indispensable resource for conducting research in speech processing and for developing real-life speech applications. In 2004 the Spoken Dutch Corpus (Corpus Gesproken Nederlands - CGN) became available, a corpus of standard Dutch as spoken by adult natives in the Netherlands and Flanders. CGN does not include speech of children, non-natives, elderly people and recordings of speech produced in human-machine interactions. Since such recordings would be extremely useful for conducting research and for developing HLT applications for these specific groups of speakers of Dutch, the JASMIN-CGN was started with the aim of extending CGN in three dimensions: age, mother tongue and interaction mode. First, by collecting a corpus of contemporary Dutch as spoken by children of different age groups, non-natives with different mother tongues and elderly people in the Netherlands and Flanders (JASMIN-CGN), we aimed at an extension along the age and mother tongue dimensions. In addition, we collected speech material in a communication setting that was not envisaged in CGN: human-machine interaction. One third of the data was collected in Flanders and two thirds in the Netherlands. The corpus has already been used in different ways within the STEVIN programme. In addition, it turned out to be useful for different lines of research. Since 2008 the JASMIN speech corpus has been available through the Dutch-Flemish HLT Agency. We hope that other researchers will make use of the data and the knowledge gathered in this project for further research and development.

Notes

Acknowledgements

We are indebted to the publishers Thieme-Meulenhoff and Zwijsen who allowed us to use their texts for the recordings, to A. van den Bosch who allowed us to use the POS tagger, to all the speakers as well as institutions that participated and thus made it possible to collect this corpus and to the people who, at different stages and for different periods, were part of the JASMIN team: Leontine Aul, Andrea Diersen, Joris Driesen, Olga van Herwijnen, Chantal Mülders, August Oostens, Eric Sanders, Maarten Van Segbroeck, Alain Sips, Felix Smits, Koen Snijders, Erik Stegeman and Barry van der Veen.

References

  1. 1.
  2. 2.
  3. 3.
    Anderson, S., Liberman, N., Bernstein, E., Foster, S., Cate, E., Levin, B., Hudson, R.: Recognition of elderly speech and voice-driven document retrieval. In: Proceedings of the ICASSP, Phoenix, USA (1999). Paper 2060Google Scholar
  4. 4.
    Cucchiarini, C., van Doremalen, J., Strik, H.: Fluency in non-native read and spontaneous speech. In: Proceedings of DiSS-LPSS Joint Workshop, Tokyo, Japan, pp. 15–18 (2010)Google Scholar
  5. 5.
    Dagevos, J., Gijsberts, M., van Praag, C.: Rapportage minderheden 2003; Onderwijs, arbeid en sociaal-culturele integratie. SCP-publicatie 2003–13, Sociaal en Cultureel Planbureau, The Hague (2003)Google Scholar
  6. 6.
    D’Arcy, S.M., Wong, L., Russell, M.J.: Recognition of read and spontaneous children’s speech using two new corpora. In: Proceedings of ICSLP, Korea pp. 588–591 (2004)Google Scholar
  7. 7.
    Duchateau, J., Wigham, M., Demuynck, K., Van hamme, H.: A flexible recogniser architecture in a reading tutor for children. In: Proceedings of ITRW on Speech Recognition and Intrinsic Variation, Toulouse, France, pp. 59–64 (2006)Google Scholar
  8. 8.
    Ferguson, G., et al.: The medication advisor project: preliminary report. Technical Report 776, CS Department, U. Rochester (2002)Google Scholar
  9. 9.
    Hagen, A., Pellom, B., Cole, R.: Children’s speech recognition with application to interactive books and tutors. In: Proceedings of ASRU, St. Thomas, USA, pp. 265–293 (2003)Google Scholar
  10. 10.
    Hans, M., Graf, B., Schraft, R.: Robotic home assistant care-o-bot: past-present-future. In: Proceedings of the IEEE ROMAN, Berlin, pp. 380–385 (2002)Google Scholar
  11. 11.
    Mommers, M., Verhoeven, L., Van der Linden, S.: Veilig Leren Lezen. Zwijsen, Tilburg (1990)Google Scholar
  12. 12.
    Müller, C., Wasinger, R.: Adapting multimodal dialog for the elderly. In: Proceedings of the ABIS-Workshop 2002 on Personalization for the Mobile World, Hannover, Germany, pp. 31–34 (2002)Google Scholar
  13. 13.
    Narayanan, S., Potamianos, A.: Creating conversational interfaces for children. IEEE Trans. Speech Audio Process. 10 (2), 65–78 (2002)CrossRefGoogle Scholar
  14. 14.
    Nederlandse Taalunie: Woordenlijst nederlandse taal (2005). http://woordenlijst.org/
  15. 15.
    Raux, A., Langner, B., Black, A., Eskenazi, M.: LET’S GO: improving spoken dialog systems for the elderly and non-natives. In: Proceedings of Eurospeech, Geneva, Switzerland, pp. 753–756 (2003)Google Scholar
  16. 16.
    Strik, H., van de Loo, J., van Doremalen, J., Cucchiarini, C.: Practicing syntax in spoken interaction: automatic detection of syntactic errors in non-native utterances. In: Proceedings of the SLaTE-2010 Workshop, Tokyo, Japan (2010)Google Scholar
  17. 17.
    Strik, H., van Doremalen, J., van de Loo, J., Cucchiarini, C.: Improving asr processing of ungrammatical utterances through grammatical error modeling. In: Proceedings of the SLaTE-2010 Workshop, Venice, Italy (2011)Google Scholar
  18. 18.
    Takahashi, S., Morimoto, T., Maeda, S., Tsuruta, S.: Spoken dialogue system for home health care. In: Proceedings of ICSLP, Denver, USA (2002), pp. 2709–2712Google Scholar
  19. 19.
    Van Bael, C., Binnenpoorte, D., Strik, H., van den Heuvel, H.: Validation of phonetic transcriptions based on recognition performance. In: Proceedings of Eurospeech, Geneva, Switzerland (2003), pp. 1545–1548Google Scholar
  20. 20.
    Van Compernolle, D.: Recognizing speech of goats, wolves, sheep and …non-natives. Speech Commun. 35 (1–2), 71–79 (2001)CrossRefGoogle Scholar
  21. 21.
    Van den Bosch, A., Schuurman, I., Vandeghinste, V.: Transferring pos-tagging and lemmatzation tools from spoken to written dutch corpus development. In: Proceedings of LREC, Genoa, Italy (2006)Google Scholar
  22. 22.
    van Doremalen, J., Cucchiarini, C., Strik, H.: Automatic detection of vowel pronunciation errors using multiple information sources. In: Proceedings of ASRU, Merano, Italy, (2009), pp. 80–585Google Scholar
  23. 23.
    van Doremalen, J., Cucchiarini, C., Strik, H.: Optimizing automatic speech recognition for low-proficient non-native speakers. EURASIP J. Audio, Speech, Music Process. (2010). http://www.hindawi.com/journals/asmp/2010/973954/
  24. 24.
    van Doremalen, J., Cucchiarini, C., Strik, H.: Phoneme errors in read and spontaneous non-native speech: relevance for capt system development. In: Proceedings of the SLaTE-2010 Workshop, Tokyo, Japan (2010b)Google Scholar
  25. 25.
    Van Eynde, F.: Part of speech tagging en lemmatisering van het corpus gesproken nederlands (2004). http://lands.let.ru.nl/cgn/doc_Dutch/topics/version_.0/annot/pos_tagging/tg_prot.pdf
  26. 26.
    Wentink, H.: From graphemes to syllables. Ph.D. thesis, Nijmegen (1997)Google Scholar

Copyright information

© The Author(s) 2013

Open Access. This chapter is distributed under the terms of the Creative Commons Attribution Noncommercial License, which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

Authors and Affiliations

  1. 1.CLST, Radboud UniversityNijmegenThe Netherlands
  2. 2.ESAT DepartmentKatholieke Universiteit LeuvenLeuvenBelgium

Personalised recommendations