Abstract
To date, descriptions of thecategorisation ofemotional voice type have mostly been provided in terms of fundamental frequency (f0), amplitude and duration. It is of interest to seek additional cues that may help to improve recognition of emotional colouring in speech, and, expressiveness in speech synthesis. The present contribution examines a specific laryngeal measure - the normalised time of increasing contact of the vocal folds (NTIC) i.e. increasing contact time divided by cycle duration - as estimated from the electroglottogram signal. This preliminary study, using a single female speaker, analyses the sustained vowel [a:], produced when simulating the emotional states anger, joy, neutral, sad and tender. The results suggest that NTIC may not be ideally suited for emotional voice discrimination. Additional measures are suggested to further characterise the emotional portrayals.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
Airas, M., Alku, P.: Emotions in vowel segments of continuous speech: analysis of the glottal flow using the normalized amplitude quotient. Phonetica 63, 26–46 (2006)
McGilloway, S., Cowie, R., Douglas-Cowie, E., Gielen, S., Westerdijk, M., Stroeve, S.: Approaching automatic recognition of emotion from voice: a rough benchmark. In: Proceedings of the ISCA work-shop on Speech and Emotion (Belfast), pp. 207–212 (2000)
Toivanen, J., Waaramaa, T., Alku, P., Laukkanen, A.-M., Seppänen, T., Väyrynen, E., Airas, M.: Emotions in [a]: A perceptual and acoustic study. Logopedics Phoniatrics Vocology 31, 43–48 (2006)
Gobl, C., Nà Chasaide, A.: The role of voice quality in communicating emotion, mood and attitude. Speech Communication 40, 189–212 (2003)
Laukkanen, A.-M., Vilkman, E., Alku, P., Oksanen, H.: Physical variations related to stress and emotional state: a preliminary study. J. Phonetics 24, 313–335 (1996)
Cummings, K.E., Clements, M.A.: Analysis of the glottal excitation of emotionally styled and stressed speech. J. Acoust. Soc. Am. 98, 88–98 (1995)
Rothenberg, M., Mashie, J.J.: Monitoring vocal fold abduction through vocal fold contact area. J. Speech Hear Res. 31, 338–351 (1988)
Titze, I.: Interpretation of the electroglottographic signal. J. Voice 4, 1–9 (1990)
Alku, P., Bäckström, T., Vilkman, E.: Normalised amplitude quotient for parameterization of the glottal flow. J. Acoust. Soc. Am. 112, 701–710 (2002)
Murphy, P.: Voice source change during fundamental frequency variation. In: Esposito, A., Faundez-Zanuy, M., Keller, E., Marinaro, M. (eds.) COST Action 2102. LNCS (LNAI), vol. 4775, pp. 165–173. Springer, Heidelberg (2007)
Murphy, P., Laukkanen, A.-M.: Electroglottogram analysis of emotionally styled phonation. In: Esposito, A., Hussain, A., Marinaro, M., Martone, R. (eds.) Multimodel Signals. LNCS (LNAI), vol. 5398, pp. 264–270. Springer, Heidelberg (2008)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2009 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Murphy, P.J., Laukkanen, AM. (2009). Investigation of Normalised Time of Increasing Vocal Fold Contact as a Discriminator of Emotional Voice Type. In: Esposito, A., VÃch, R. (eds) Cross-Modal Analysis of Speech, Gestures, Gaze and Facial Expressions. Lecture Notes in Computer Science(), vol 5641. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03320-9_9
Download citation
DOI: https://doi.org/10.1007/978-3-642-03320-9_9
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-03319-3
Online ISBN: 978-3-642-03320-9
eBook Packages: Computer ScienceComputer Science (R0)