Advertisement

EmoEars: An Emotion Recognition System for Mandarin Speech

  • Bo Xie
  • Ling Chen
  • Gen-Cai Chen
  • Chun Chen
Part of the Lecture Notes in Computer Science book series (LNCS, volume 3801)

Abstract

In this paper, an emotion recognition system for mandarin speech is presented. Five basic human emotions including angry, fear, happy, neutral and sad are investigated. The recognizer is based on neural network with OCON and ACON architecture. Some novel feature selection methods are also added as optional tool to enhance the efficiency and classification accuracy. The system can train speaker dependent emotion speech model through online emotional utterance recording. Experiment results show that emotion can be recognized through neural network model, the best mean accuracy is 86.7%. In addition, the feature selection module is effective to reduce the compute load and increase the generalization ability of the recognizer.

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. 1.
    Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., et al.: Emotion recognition in human-computer interaction. IEEE Signal Processing Magazine 18(1), 32–80 (2001)CrossRefGoogle Scholar
  2. 2.
    Picard, R.W.: Affective computing. MIT Press, Cambridge (1997)Google Scholar
  3. 3.
    McGilloway, S., Cowie, R., Douglas-Cowie, E., et al.: Approaching automatic recognition of emotion from voice: A rough benchmark. In: ISCA Workshop on Speech and Emotion, Belfast (2000)Google Scholar
  4. 4.
    Dellaert, F., Polzin, T., Waibel, A.: Recognizing emotion in speech. In: Proceedings of International Conference on Spoken Language Processing, pp. 1970–1973 (1996)Google Scholar
  5. 5.
    Murray, I.R., Arnott, J.L.: Toward the simulation of emotion in synthetic speech: A review of the literature on human vocal emotion. Journal of the Acoustical Society of America 93(2), 1097–1108 (1933)CrossRefGoogle Scholar
  6. 6.
    Lee, C.M., Narayanan, S., Pieraccini, R.: Recognition of negative emotions from the speech signal. In: Proceedings of IEEE Workshop on Automatic Speech Recognition and Understanding, pp. 240–243 (2001)Google Scholar
  7. 7.
    Kwon, O.W., Chan, K., Hao, J., Lee, T.W.: Emotion recognition by speech signals. In: Proceedings of EUROSPEECH, pp. 125–128 (2003)Google Scholar
  8. 8.
    Nicholson, J., Takahashi, K., Nakatsu, R.: Emotion recognition in speech using neural networks. Neural Comput. & Applic., 290–296 (2000)Google Scholar
  9. 9.
    Marker, J.M., Gray, A.H.: Linear prediction of speech. Springer, Heidelberg (1976)Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2005

Authors and Affiliations

  • Bo Xie
    • 1
  • Ling Chen
    • 1
  • Gen-Cai Chen
    • 1
  • Chun Chen
    • 1
  1. 1.College of Computer ScienceZhejiang UniversityHangzhouP.R. China

Personalised recommendations