Abstract
This paper proposes a technique to enhance emotion classification in spoken dialogue systems by means of two fusion modules. The first module combines emotion predictions generated by a set of classifiers, each dealing with a different kind of information about the sentences uttered by the user. To do so, it employs several fusion methods, each of which produces a further prediction about the user’s emotional state. These predictions are the input to the second fusion module, where they are combined to deduce the user’s emotional state. Experiments have been carried out considering two emotion categories (‘Non-negative’ and ‘Negative’) and classifiers that deal with prosodic, acoustic, lexical and dialogue-act information. The results show that the first fusion module significantly increases the classification rates of a baseline and of the classifiers working separately, as has been observed previously in the literature. The novelty of the technique is the inclusion of the second fusion module, which enhances the classification rate by 2.25% absolute.
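The two-level scheme described above can be sketched in code. The numbers, classifier set and fusion rules below are illustrative assumptions (the paper does not specify its fusion methods in the abstract): four classifiers emit posterior probabilities over the two emotion categories, a first level combines them with three standard fusion rules (average, normalised product, majority vote), and a second level combines those three predictions into the final decision.

```python
import numpy as np

# Hypothetical posterior probabilities over ('Non-negative', 'Negative')
# from four classifiers using prosodic, acoustic, lexical and
# dialogue-act information, respectively (illustrative values only).
classifier_posteriors = np.array([
    [0.70, 0.30],   # prosodic
    [0.55, 0.45],   # acoustic
    [0.40, 0.60],   # lexical
    [0.65, 0.35],   # dialogue acts
])

def first_level_fusion(posteriors):
    """Combine the classifiers with several fusion methods, each of
    which yields one prediction about the user's emotional state."""
    avg = posteriors.mean(axis=0)                 # average rule
    prod = posteriors.prod(axis=0)
    prod = prod / prod.sum()                      # normalised product rule
    votes = np.bincount(posteriors.argmax(axis=1), minlength=2)
    vote = votes / votes.sum()                    # majority-vote rule
    return np.stack([avg, prod, vote])

def second_level_fusion(predictions):
    """Combine the first-level predictions into the final decision."""
    combined = predictions.mean(axis=0)
    return int(combined.argmax())                 # 0 = Non-negative, 1 = Negative

labels = ('Non-negative', 'Negative')
predictions = first_level_fusion(classifier_posteriors)
print(labels[second_level_fusion(predictions)])   # prints "Non-negative"
```

In this sketch all three first-level rules favour the ‘Non-negative’ class, so the second level agrees; the point of the second module, as the abstract reports, is to resolve the cases where the first-level fusion methods disagree.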
This work has been funded by the Spanish project HADA TIN2007-64718, and the grant no. 1QS108040569 of the Agency of Sciences of the Czech Republic.
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
Cite this paper
López-Cózar, R., Callejas, Z., Kroul, M., Nouza, J., Silovský, J. (2008). Two-Level Fusion to Improve Emotion Classification in Spoken Dialogue Systems. In: Sojka, P., Horák, A., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2008. Lecture Notes in Computer Science(), vol 5246. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-87391-4_78
DOI: https://doi.org/10.1007/978-3-540-87391-4_78
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-87390-7
Online ISBN: 978-3-540-87391-4