EmoEars: An Emotion Recognition System for Mandarin Speech
This paper presents an emotion recognition system for Mandarin speech. Five basic human emotions are investigated: anger, fear, happiness, neutral, and sadness. The recognizer is based on neural networks with OCON and ACON architectures. Several novel feature selection methods are also provided as optional tools to improve efficiency and classification accuracy. The system can train a speaker-dependent emotional speech model from emotional utterances recorded online. Experimental results show that emotion can be recognized with a neural network model; the best mean accuracy is 86.7%. In addition, the feature selection module effectively reduces the computational load and increases the generalization ability of the recognizer.
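The difference between the two architectures named in the abstract can be illustrated with a minimal sketch. It assumes the usual expansions of the acronyms (ACON as "all classes in one network", OCON as "one class in one network"); the layer sizes, the feature-vector length, the omission of bias terms, and every function name below are hypothetical choices for illustration, not details taken from the paper. The networks are untrained, so only the decision structure is shown.

```python
import math
import random

# Emotion labels as listed in the abstract.
EMOTIONS = ["angry", "fear", "happy", "neutral", "sad"]

def init_mlp(n_in, n_hidden, n_out, rng):
    """Random weights for a one-hidden-layer network (untrained, illustration only)."""
    def layer(rows, cols):
        return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]
    return {"W1": layer(n_hidden, n_in), "W2": layer(n_out, n_hidden)}

def forward(net, x):
    """Forward pass: tanh hidden units, sigmoid output units (biases omitted)."""
    h = [math.tanh(sum(w * xi for w, xi in zip(row, x))) for row in net["W1"]]
    z = [sum(w * hi for w, hi in zip(row, h)) for row in net["W2"]]
    return [1.0 / (1.0 + math.exp(-zi)) for zi in z]

def argmax(values):
    """Index of the largest value."""
    return max(range(len(values)), key=values.__getitem__)

def acon_predict(net, x):
    """ACON: a single network with one output unit per emotion."""
    return EMOTIONS[argmax(forward(net, x))]

def ocon_predict(nets, x):
    """OCON: one single-output sub-network per emotion; the highest score wins."""
    scores = [forward(net, x)[0] for net in nets]
    return EMOTIONS[argmax(scores)]

rng = random.Random(0)
n_features = 12  # hypothetical length of the acoustic feature vector
acon_net = init_mlp(n_features, 8, len(EMOTIONS), rng)
ocon_nets = [init_mlp(n_features, 8, 1, rng) for _ in EMOTIONS]

x = [rng.gauss(0.0, 1.0) for _ in range(n_features)]
print(acon_predict(acon_net, x), ocon_predict(ocon_nets, x))
```

In this sketch each OCON sub-network could be trained or retrained independently for its own emotion, whereas the ACON network learns all five classes jointly.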