Abstract
The study of emotions in human-computer interaction is a growing research area. Focusing on automatic emotion recognition, work is being performed in order to achieve good results particularly in speech and facial gesture recognition. In this paper we present a study performed to analyze different Machine Learning techniques validity in automatic speech emotion recognition area. Using a bilingual affective database, different speech parameters have been calculated for each audio recording. Then, several Machine Learning techniques have been applied to evaluate their usefulness in speech emotion recognition. In this particular case, techniques based on evolutive algorithms (EDA) have been used to select speech feature subsets that optimize automatic emotion recognition success rate. Achieved experimental results show a representative increase in the abovementioned success rate.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Aha, D., Kibler, D., Albert, M.K.: Instance-Based learning algorithms. Machine Learning 6, 37–66 (1991)
Bachorowski, J.A., Owren, M.J.: Vocal expression of emotion: Acoustic properties of speech are associated with emotional intensity and context. Psychological Science 6, 219–224 (1995)
Casacuberta, D.: La mente humana: Diez Enigmas y 100 preguntas (The human mind: Ten Enigmas and 100 questions). In: Océano (ed). Barcelona, Spain (2001) ISBN: 84-7556-122-5
Cowie, R., Douglas-Cowie, E., Cox, C.: Beyond emotion archetypes: Databases for emotion modelling using neural networks. Neural Networks 18, 371–388 (2005)
Cowie, R., Douglas-Cowie, E., Tsapatsoulis, N., Votsis, G., Kollias, S., Fellenz, W., Taylor, J.: Emotion recognition in human-computer interaction (2001)
Dasarathy, B.V.: Nearest Neighbor (NN) Norms: NN Pattern Recognition Classification Techniques. IEEE Computer Society Press, Los Alamitos (1991)
Dellaert, F., Polzin, T., Waibel, A.: Recognizing Emotion in Speech. In: Proc. of ICSLP (1996)
Ekman, P., Friesen, W.: Pictures of facial affect. Consulting Psychologist Press, Palo Alto (1976)
Fernández, R.: A Computational Model for the Automatic Recognition of Affect in Speech. Massachusetts Institute of Technology (2004)
Gunes, V., Menard, M., Loonis, P., Petit-Renaud, S.: Combination, cooperation and selection of classiers: A state of the art. International Journal of Pattern Recognition 17, 1303–1324 (2003)
Huber, R., Batliner, A., Buckow, J., Noth, E., Warnke, V., Niemann, H.: Recognition of emotion in a realistic dialogue scenario. In: Proc. ICSLP, pp. 665–668 (2000)
Humaine (retrieved March 10, 2006), http://emotion-research.net/
Inza, I., Larrañaga, P., Etxeberria, R., Sierra, B.: Feature subsetselection by Bayesian network-based optimization. Artificial Intelligence 123, 157–184 (2000)
Iriondo, I., Guaus, R., Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernadas, D., Oliver, J.M., Tena, D., Longhi, L.: Validation of an acoustical modelling of emotional expression in Spanish using speech synthesis techniques. In: SpeechEmotion, pp. 161–166 (2000)
Kazemzadeh, A., Lee, S., Narayanan, S.: Acoustic correlates of user response to errors in human-computer dialogues. In: Proc. IEEE ASRU (St. Thomas, U.S. Virgin Islands) (December 2003)
Kohavi, R., Sommerfield, D., Dougherty, J.: Data mining using MLC++, a Machine Learning Library in C++. International Journal of Artificial Intelligence Tools 6(4), 537–566 (1997), http://www.sgi.com/Technology/mlc/
Laukka, P.: Vocal Expression of Emotion. Discrete-emotions and Dimensional Accounts. Acta Universitatis Upsaliensis. Comprehensive Summaries of Uppsala Dissertations from the Faculty of Social Sciences, 141, p. 80, Uppsala (2004) ISBN 91-554-6091-7
Liu, H., Motoda, H.: Feature Selection for Knowledge Discovery and Data Mining. Kluwer Academic Publishers, Dordrecht (1998)
López, J.M., Cearreta, I., Fajardo, I., Garay, N.: Evaluating the validity of RekEmozio affective multimodal database with experimental subjects. Technical Report EHU-KAT-IK-04-06. Computer Architecture and Technology department, University of the Basque Country (2006)
López, J.M., Cearreta, I., Garay, N., López de Ipiña, K., Beristain, A.: RekEmozio project: bilingual and multimodal affective database. Technical Report EHU-KAT-IK-03-06. Computer Architecture and Technology department, University of the Basque Country (2006)
Martin, J.K.: An exact probability metric for Decision Tree splitting and stopping. Machine Learning 28(2/3) (1997)
Mingers, J.: A comparison of methods of pruning induced Rule Trees, Technical Report. Coventry, England: University of Warwick, School of Indutrial and Business Studies (1988)
Minsky, M.: Steps towards artificial intelligence. Proceedings of the IRE 49, 8–30 (1961)
Montero, J.M., Gutiérrez-Arriola, J., Palazuelos, S., Enríquez, E., Aguilera, S., Pardo, J.M.: Emotional speech synthesis: from speech database to tts. In: Proceedings of the 5th International Conference of Spoken Language Processing, Sydney, Australia, pp. 923–926 (1998)
Navas, E., Hernáez, I., Castelruiz, A., Luengo, I.: Obtaining and Evaluating an Emotional Database for Prosody Modelling in Standard Basque. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS (LNAI), vol. 3206, pp. 393–400. Springer, Heidelberg (2004)
Pelikan, M., Goldberg, D.E., Lobo, F.: A Survey of Optimization by Building and Using Probabilistic Models. Technical Report 99018, IlliGAL (1999)
Picard, R.W.: Affective Computing. MIT Press, Cambridge (1997)
Quinlan, J.R.: Induction of Decision Trees. Machine Learning 1, 81–106 (1986)
Quinlan, J.R.: C4.5: Programs for Machine Learning. Morgan Kaufmann Publishers, Los Altos (1993)
Rodríguez, A., Lázaro, P., Montoya, N., Blanco, J.M., Bernadas, D., Oliver, J.M., Longhi, L.: Modelización acústica de la expresión emocional en el español. Procesamiento del Lenguaje Natural, No. 25, Lérida, España, 159–166 (1999) ISSN: 1135-5948
Rothkrantz, L.J.M., Wiggers, P., van Wees, J.W.A., van Vark, R.J.: Voice stress analysis. In: Sojka, P., Kopeček, I., Pala, K. (eds.) TSD 2004. LNCS, vol. 3206, pp. 449–456. Springer, Heidelberg (2004)
Schröder, M.: Speech and Emotion Research: An overview of research frameworks and a dimensional approach to emotional speech synthesis. Ph.D. thesis, PHONUS 7, Research Report of the Institute of Phonetics, Saarland University (2004)
Stone, M.: Cross-validation choice and assessment of statistical procedures. Journal Royal of Statistical Society 36, 111–147 (1974)
Sun, X.: Pitch determination and voice quality analysis using subharmonic-to-harmonic ratio (2002), http://mel.speech.nwu.edu/sunxj/pda.htm
Tao, J., Tan, T.: Affective computing: A review. In: Tao, J., Tan, T., Picard, R.W. (eds.) ACII 2005. LNCS, vol. 3784, pp. 981–995. Springer, Heidelberg (2005)
Taylor, J.G., Scherer, K., Cowie, R.: Neural Networks. special issue on Emotion and Brain 18(4), 313–455 (2005)
Ting, K.M.: Common issues in Instance-Based and Naive-Bayesian classifiers, Ph.D. Thesis, Basser Department of Computer Science. The Univesity of Sydney, Australia (1995)
Wettschereck, D.: A study of distance-based Machine Learning Algorithms, Ph.D. Thesis, Oregon State University (1994)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2006 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Álvarez, A. et al. (2006). Feature Subset Selection Based on Evolutionary Algorithms for Automatic Emotion Recognition in Spoken Spanish and Standard Basque Language. In: Sojka, P., Kopeček, I., Pala, K. (eds) Text, Speech and Dialogue. TSD 2006. Lecture Notes in Computer Science(), vol 4188. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11846406_71
Download citation
DOI: https://doi.org/10.1007/11846406_71
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-39090-9
Online ISBN: 978-3-540-39091-6
eBook Packages: Computer ScienceComputer Science (R0)