Emotional Speech Synthesis in Spanish for Natural Interaction

Baldassarri, Sandra; Cerezo, Eva; Anaya, David

doi:10.1007/978-1-84882-352-5_15



Abstract

New trends in Human–Computer Interaction (HCI) focus on developing techniques that favor natural communication between users and machines. Among these techniques, natural language plays a fundamental role. For communication to be natural, however, language alone is not enough: emotional aspects must also be included in speech synthesis. This chapter describes a conversational interface that supports real-time communication between a user and a virtual character, using natural and emotional language in Spanish. During the interaction, the emotional state of the virtual character may change depending on how the conversation develops. Emotions are expressed through the choice of different answers and through the modulation of the voice.
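
The behavior the abstract outlines (an emotional state that shifts during the conversation and then shapes both the answer chosen and the voice modulation) can be illustrated with a minimal sketch. This is not the chapter's implementation: the emotion labels, the trigger events, the Spanish answer variants, and the prosody offsets below are all hypothetical values chosen only for illustration.

```python
from dataclasses import dataclass
from typing import Dict, Tuple

# Hypothetical mapping from a conversational event to the emotion it triggers.
TRANSITIONS: Dict[str, str] = {
    "compliment": "joy",
    "insult": "anger",
    "bad_news": "sadness",
}

# Hypothetical answer variants per topic, one per emotional state.
ANSWERS: Dict[str, Dict[str, str]] = {
    "greeting": {
        "neutral": "Hola.",
        "joy": "¡Hola! ¡Qué alegría verte!",
        "anger": "¿Qué quieres ahora?",
        "sadness": "Hola... no es un buen día.",
    },
}

# Hypothetical relative prosody offsets per emotion (illustrative numbers only).
PROSODY: Dict[str, Dict[str, int]] = {
    "neutral": {"rate": 0, "pitch": 0, "volume": 0},
    "joy": {"rate": 1, "pitch": 2, "volume": 1},
    "anger": {"rate": 1, "pitch": -1, "volume": 2},
    "sadness": {"rate": -2, "pitch": -2, "volume": -1},
}


@dataclass
class Character:
    """Virtual character holding a single current emotion."""

    emotion: str = "neutral"

    def update(self, event: str) -> None:
        # Switch emotion if the event is a known trigger; otherwise keep it.
        self.emotion = TRANSITIONS.get(event, self.emotion)

    def respond(self, topic: str) -> Tuple[str, Dict[str, int]]:
        # Pick the answer variant for the current emotion, falling back to neutral,
        # and return it with the prosody settings a synthesizer could apply.
        variants = ANSWERS[topic]
        text = variants.get(self.emotion, variants["neutral"])
        return text, PROSODY[self.emotion]
```

A caller would feed each user turn through `update` and then pass the text and prosody returned by `respond` to whatever speech synthesizer is in use; how those offsets map to real synthesizer controls is left open here.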



Acknowledgments

This work has been partly financed by the Spanish government through the TIN2007-63025 project and by the Aragon regional government through the Walqa agreement (Ref. 2004/04/86) and the CTPP02/2006 project.

Author information

Authors and Affiliations

Advanced Computer Graphics Group (GIGA), Computer Science Department, University of Zaragoza, Aragon Institute for Engineering Research (I3A), Spain
Sandra Baldassarri



About this chapter

Cite this chapter

Baldassarri, S., Cerezo, E., Anaya, D. (2009). Emotional Speech Synthesis in Spanish for Natural Interaction. In: Macías, J., Granollers Saltiveri, A., Latorre, P. (eds) New Trends on Human–Computer Interaction. Springer, London. https://doi.org/10.1007/978-1-84882-352-5_15


DOI: https://doi.org/10.1007/978-1-84882-352-5_15
Published: 27 February 2009
Publisher Name: Springer, London
Print ISBN: 978-1-84882-351-8
Online ISBN: 978-1-84882-352-5
eBook Packages: Computer Science (R0)
