Abstract
New trends in Human–Computer Interaction (HCI) focus on developing techniques that favor natural communication between users and machines. Among these techniques, natural language plays a fundamental role. However, language alone is not enough for good, natural communication: emotional aspects must also be included in speech synthesis. This chapter describes a conversational interface that supports real-time communication between a user and a virtual character, using natural, emotional language in Spanish. During the interaction, the emotional state of the virtual character may change depending on how the conversation develops. Emotions are expressed through the choice of different answers and through the modulation of the voice.
Acknowledgments
This work has been partly financed by the Spanish government through the TIN2007-63025 project and by the Aragon regional government through the Walqa agreement (Ref. 2004/04/86) and the CTPP02/2006 project.
Copyright information
© 2009 Springer-Verlag London Limited
About this chapter
Cite this chapter
Baldassarri, S., Cerezo, E., Anaya, D. (2009). Emotional Speech Synthesis in Spanish for Natural Interaction. In: Macías, J., Granollers Saltiveri, A., Latorre, P. (eds) New Trends on Human–Computer Interaction. Springer, London. https://doi.org/10.1007/978-1-84882-352-5_15
DOI: https://doi.org/10.1007/978-1-84882-352-5_15
Publisher Name: Springer, London
Print ISBN: 978-1-84882-351-8
Online ISBN: 978-1-84882-352-5
eBook Packages: Computer Science, Computer Science (R0)