Skip to main content

Investigation of Normalised Time of Increasing Vocal Fold Contact as a Discriminator of Emotional Voice Type

  • Conference paper
Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 5641))

Abstract

To date, descriptions of thecategorisation ofemotional voice type have mostly been provided in terms of fundamental frequency (f0), amplitude and duration. It is of interest to seek additional cues that may help to improve recognition of emotional colouring in speech, and, expressiveness in speech synthesis. The present contribution examines a specific laryngeal measure - the normalised time of increasing contact of the vocal folds (NTIC) i.e. increasing contact time divided by cycle duration - as estimated from the electroglottogram signal. This preliminary study, using a single female speaker, analyses the sustained vowel [a:], produced when simulating the emotional states anger, joy, neutral, sad and tender. The results suggest that NTIC may not be ideally suited for emotional voice discrimination. Additional measures are suggested to further characterise the emotional portrayals.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 39.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Airas, M., Alku, P.: Emotions in vowel segments of continuous speech: analysis of the glottal flow using the normalized amplitude quotient. Phonetica 63, 26–46 (2006)

    Article  Google Scholar 

  2. McGilloway, S., Cowie, R., Douglas-Cowie, E., Gielen, S., Westerdijk, M., Stroeve, S.: Approaching automatic recognition of emotion from voice: a rough benchmark. In: Proceedings of the ISCA work-shop on Speech and Emotion (Belfast), pp. 207–212 (2000)

    Google Scholar 

  3. Toivanen, J., Waaramaa, T., Alku, P., Laukkanen, A.-M., Seppänen, T., Väyrynen, E., Airas, M.: Emotions in [a]: A perceptual and acoustic study. Logopedics Phoniatrics Vocology 31, 43–48 (2006)

    Article  Google Scholar 

  4. Gobl, C., Ní Chasaide, A.: The role of voice quality in communicating emotion, mood and attitude. Speech Communication 40, 189–212 (2003)

    Article  MATH  Google Scholar 

  5. Laukkanen, A.-M., Vilkman, E., Alku, P., Oksanen, H.: Physical variations related to stress and emotional state: a preliminary study. J. Phonetics 24, 313–335 (1996)

    Article  Google Scholar 

  6. Cummings, K.E., Clements, M.A.: Analysis of the glottal excitation of emotionally styled and stressed speech. J. Acoust. Soc. Am. 98, 88–98 (1995)

    Article  Google Scholar 

  7. Rothenberg, M., Mashie, J.J.: Monitoring vocal fold abduction through vocal fold contact area. J. Speech Hear Res. 31, 338–351 (1988)

    Article  Google Scholar 

  8. Titze, I.: Interpretation of the electroglottographic signal. J. Voice 4, 1–9 (1990)

    Article  Google Scholar 

  9. Alku, P., Bäckström, T., Vilkman, E.: Normalised amplitude quotient for parameterization of the glottal flow. J. Acoust. Soc. Am. 112, 701–710 (2002)

    Article  Google Scholar 

  10. Murphy, P.: Voice source change during fundamental frequency variation. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 165–173. Springer, Heidelberg (2007)

    Chapter  Google Scholar 

  11. Murphy, P., Laukkanen, A.-M.: Electroglottogram analysis of emotionally styled phonation. In: Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds.) Multimodel Signals. LNCS (LNAI), vol. 5398, pp. 264–270. Springer, Heidelberg (2008)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Murphy, P.J., Laukkanen, AM. (2009). Investigation of Normalised Time of Increasing Vocal Fold Contact as a Discriminator of Emotional Voice Type. In: Esposito, A., Vích, R. (eds) Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. Lecture Notes in Computer Science(), vol 5641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03320-9_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03320-9_9

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03319-3

  • Online ISBN: 978-3-642-03320-9

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics