Abstract
We propose a novel bimodal emotion recognition approach based on a boosting framework, in which adaptive weights for audio and visual features are determined automatically. In this way, the relative dominance of the audio and visual features is balanced dynamically at the feature level, yielding better recognition performance.
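The weighting idea can be illustrated with a small sketch. The snippet below is a hypothetical, simplified illustration (not the authors' algorithm): it derives an AdaBoost-style confidence weight for each modality from its weak classifier's error rate, then fuses the audio and visual scores with those weights, so the more reliable modality dominates. The function names and error rates are assumptions for illustration only.

```python
import math

def boosting_weight(error):
    """AdaBoost-style confidence weight: 0.5 * ln((1 - err) / err).

    Lower error -> larger weight. Clamped to avoid division by zero.
    """
    error = min(max(error, 1e-10), 1.0 - 1e-10)
    return 0.5 * math.log((1.0 - error) / error)

def fuse_scores(audio_score, visual_score, audio_error, visual_error):
    """Fuse two modality scores, weighting each by its boosting confidence.

    The modality with the lower error rate contributes more to the
    combined score, balancing dominance adaptively.
    """
    w_audio = boosting_weight(audio_error)
    w_visual = boosting_weight(visual_error)
    return (w_audio * audio_score + w_visual * visual_score) / (w_audio + w_visual)
```

For example, if the audio classifier is more reliable (error 0.1 vs. 0.4), the fused score leans toward the audio prediction; with equal scores the fusion is a no-op regardless of the weights.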
The work is supported by the National Natural Science Foundation of China (No. 60575032) and the 863 Program (No. 2006AA01Z138).
Copyright information
© 2007 Springer-Verlag Berlin Heidelberg
Cite this paper
Huang, L., Xin, L., Zhao, L., Tao, J. (2007). Combining Audio and Video by Dominance in Bimodal Emotion Recognition. In: Paiva, A.C.R., Prada, R., Picard, R.W. (eds) Affective Computing and Intelligent Interaction. ACII 2007. Lecture Notes in Computer Science, vol 4738. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-74889-2_71
DOI: https://doi.org/10.1007/978-3-540-74889-2_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-74888-5
Online ISBN: 978-3-540-74889-2
eBook Packages: Computer Science, Computer Science (R0)