Advertisement

Automatic Detection of Prosodic Boundaries in Brazilian Portuguese Spontaneous Speech

  • Bárbara TeixeiraEmail author
  • Plínio Barbosa
  • Tommaso Raso
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 11122)

Abstract

This paper presents some models based on multiple phonetic-acoustic parameters for the automatic detection of prosodic boundaries in spontaneous speech. A sample with seven excerpts of monologic Brazilian Portuguese spontaneous speech was segmented into prosodic units by 14 trained annotators. The perceived prosodic boundaries were annotated as terminal or non-terminal prosodic boundaries. A Praat script was prepared in order to extract a set of acoustic parameters during the speech signal. Two statistical classifiers, namely Random Forest e Linear Discriminant Analysis, were used to generate models of subgroups of acoustic parameters that could work as predictors of prosodic boundaries in comparison with the human annotators. The initial evaluation of the classifiers showed that both present relative success in detecting boundaries. The LDA performed better in predicting boundaries and therefore its models were refined. The final model for terminal boundaries showed 80% of agreement with human annotators. As for non-terminal boundaries, three models were obtained. The sum of boundaries identified by the three models together corresponds to an agreement of 98% with the human annotators.

Keywords

Automatic detection Prosodic boundaries Spontaneous speech 

References

  1. 1.
    Schubiger, M.: English Intonation: Its Form and Function. Niemeyer, Tübingen (1958)Google Scholar
  2. 2.
    Chafe, W.: The deployment of consciousness in the production of a narrative. In: Chafe, W. (ed.) The Pear Stories: Cognitive, Cultural, and Linguistic Aspects of Narrative Production, pp. 9–50. Ablex, Norwood (1980). (Org.)Google Scholar
  3. 3.
    Schuetze-Coburn, S.: Prosody, syntax, and discourse pragmatics: assessing information flow in German conversation. Ph.D. University of California, Los Angeles (1994)Google Scholar
  4. 4.
    Ladd, R.: Intonational Phonology, 2nd edn. CUP, Cambridge (2008)CrossRefGoogle Scholar
  5. 5.
    Cooper, W., Paccia Cooper, J.: Syntax and Speech. Harvard Universty Press, Cambridge (1980)CrossRefGoogle Scholar
  6. 6.
    Selkirk, E. Comments on Intonational phrasing in English. In: Frota, S., Vigário, M., Freitas, M.J. (eds.) Prosodies, pp. 11–58. Mouton de Gruyter, Berlin (2005)Google Scholar
  7. 7.
    Halliday, M.A.K.: Speech and Situation. University College, London (1965)Google Scholar
  8. 8.
    Cresti, E.: Corpus di Italiano parlato, vol. 1. Accademia della Crusca, Firenze (2000)Google Scholar
  9. 9.
    Szczepek Reed, B.: Prosody, syntax and action formation: intonation phrases and action components. In: Bergmann, P. et al. (eds.), Prosody and Embodiment in Interactional Grammar, pp. 142–169. Mouton de Gruyter, Berlin (2012)Google Scholar
  10. 10.
    Chafe, W.: Discourse, Consciousness and Time: The Flow and Dsiplacement of Conscious Experience in Speaking and Writing. University of Chicago Press, Chicago (1994)Google Scholar
  11. 11.
    Croft, W.: Intonation Units and grammatical structure. Linguistics 33(5), 839–882 (1995)CrossRefGoogle Scholar
  12. 12.
    Bybee, J.: Language, Usage and Cognition. CUP, Cambridge (2010)CrossRefGoogle Scholar
  13. 13.
    Barth-Weingarten, D.: Intonation Units Revised: Cesuras in Talk-in-Interaction. John Benjamins Publishing Company, Philadelphia (2016)CrossRefGoogle Scholar
  14. 14.
    Pike, L.: The Intonation of American English. University of Michigan Press, Ann Arbor (1945)Google Scholar
  15. 15.
    Pierrehumbert, J. Phonetics and phonology of English intonation. Ph.D. Massachusetts Institute of Technology (1980)Google Scholar
  16. 16.
    Schegloff, E.: Reflections on studying prosody in talk-in-interaction. Lang. Speech 41(3–4), 235–263 (1998)CrossRefGoogle Scholar
  17. 17.
    Szczepek Reed, B.: Turn-final intonation in English. In: Couper-Kuhlen, E., Ford, C. (eds.), Sound Patterns in Interaction, pp. 97–117. Benjamins, Amsterdam (2004)Google Scholar
  18. 18.
    Avanzi, M., Lacheret-Dujour, A., Victorri, B.: A tool for semi-automatic annotation of french prosodic structure. In: ANALOR, pp. 119–122, Campinas, Brazil, (2008)Google Scholar
  19. 19.
    Ni, C.J., Zhang, A.Y., Liu, W.J., Xu, B.: Automatic prosodic break detection and feature analysis. J. Comput. Sci. Tchol. 27, 1184–1196 (2012)CrossRefGoogle Scholar
  20. 20.
    Kim, J.: Automatic detection of sentence boundaries, disfluencies, and conversational fillers in spontaneous speech. 103 f. Ph.D. University of Washington (2004)Google Scholar
  21. 21.
    Raso, T., Mello, H. (Org.): C-ORAL-BRASIL I: corpus de referência do português brasileiro falado informal, 1 edn. UFMG, Belo Horizonte (2012)Google Scholar
  22. 22.
    Raso, T., Mello, H. (Org.): C-ORAL-BRASIL II: corpus de referência do português brasileiro falado informal (forthcoming)Google Scholar
  23. 23.
    Mello, H.R. et al.: Transcrição e segmentação prosódica do corpus C-ORAL-BRASIL: critérios de implementação e validação. In: Raso, T., Mello, H.R. (eds.) C-ORAL-Brasil I: Corpus de referência do português brasileiro falado informal, pp. 125–176. Editora UFMG, Belo Horizonte (2012)Google Scholar
  24. 24.
    Fleiss, J.L.: Measuring nominal scale agreement among many raters. Psychol. Bull. 76(5), 378–382 (1971)CrossRefGoogle Scholar
  25. 25.
    Boersma, P., Weenink, D. Praat: doing phonetics by computer. 2015. Software http://www.praat.org/. Accessed 16 Jan 2015
  26. 26.
    Barbosa, P.: Semi-automatic and automatic tools for generating prosodic descriptors for prosody research. In: Bigi, B., Hirst, D. (eds.), Proceedings of the Tools and Resources for the Analysis of Speech Prosody, vol. 13, pp. 86–89. Aix-en-Provence: Laboratoire Parole et Language (2013). http://www.lpl-aix.fr/~trasp/Proceedings/19874-trasp2013.pdf
  27. 27.
    Barbosa, P.: BreakDescriptor (Versão 1.0) [Programa de computador] (2016). Available with the authorGoogle Scholar
  28. 28.
    R Development Core Team: R a language and environment for statistical computing. R Foundation for Statistical Computing, Vienna, Austria (2017)Google Scholar

Copyright information

© Springer Nature Switzerland AG 2018

Authors and Affiliations

  1. 1.Federal University of Minas GeraisBelo HorizonteBrazil
  2. 2.University of CampinasCampinas, São PauloBrazil

Personalised recommendations