Speech Emotion Recognition Using Spiking Neural Networks

Buscicchio, Cosimo A.; Górecki, Przemysław; Caponetti, Laura

doi:10.1007/11875604_6

Cosimo A. Buscicchio²²,
Przemysław Górecki²² &
Laura Caponetti²²

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 4203))

Included in the following conference series:

International Symposium on Methodologies for Intelligent Systems

1236 Accesses
7 Citations

Abstract

Human social communication depends largely on exchanges of non-verbal signals, including non-lexical expression of emotions in speech. In this work, we propose a biologically plausible methodology for the problem of emotion recognition, based on the extraction of vowel information from an input speech signal and on the classification of extracted information by a spiking neural network. Initially, a speech signal is segmented into vowel parts which are represented with a set of salient features, related to the Mel-frequency cesptrum. Different emotion classes are then recognized by a spiking neural network and classified into five different emotion classes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Shallow over Deep Neural Networks: A Empirical Analysis for Human Emotion Classification Using Audio Data

Speech Emotions Recognition Using 2-D Neural Classifier

Recognizing Emotional States Using Speech Information

References

McCauley, L., Gholson, B., Hu, X., Graesser, A.: Delivering smooth tutorial dialogue using a talking head. In: Proc. Of WECC 1998, Workshop on Embodied Conversational Characters, Tahoe City, California, AAAI, ACM/SIGCHI (1998)
Google Scholar
Reeves, B., Nass, C.: The Media Equation. Cambridge University Press, Cambridge (1996)
Google Scholar
Petrushin, V.A.: Emotion in speech: Recognition and Application to call centers. In: Accenture 3773 Willow Rd. Northbrook, IL 60062 - Proceedings of the 1999 Conference on Artificial Neural Networks in Engineering (ANNIE 1999), ASME Press (1999)
Google Scholar
Sagisaka, Y., Campbell, N., Higuch, I.N.: Computing Prosody. Springer, New York (1997)
Google Scholar
Chiu, C.C., Chang, Y.L., Lai, Y.J.: The analysis and recognition of human vocal emotions. In: Proc. International Computer Symposium, pp. 83–88 (1994)
Google Scholar
Dellaert, F., Polzin, T., Waibel, A.: Recognizing emotion in speech. In: Proc. International Conf. on Spoken Language Processing, pp. 1970–1973 (1996)
Google Scholar
Von Brandt, A.: Detecting and estimating parameters jumps using adder algorithms and likelihood ratio test. In: Proc. ICASSP, Boston, MA, pp. 1017–1020 (1983)
Google Scholar
Rabiner, J.: Fundamentals of speech recognition. Prentice-Hall, Englewood Cliffs (1993)
Google Scholar
Ferster, D., Spruston, N.: Cracking the neural code. Science (270), 756–757 (1995)
Google Scholar
Horn, D., Opher, I.: Collective Exitation Phenomena and Their Apllications. In: Maass, W., Bishop, C.M. (eds.) Pulsed Neural Networks, MIT Press, Cambridge (1999)
Google Scholar
Gerstner, W.: Spiking Neurons. In: Maass, W., Bishop, C.M. (eds.) Pulsed Neural Networks, MIT Press, Cambridge (1999)
Google Scholar
Gerstner, W., Kempter, R., Leo van Hammen, J., Wagner, H.: Hebbian Learning of Pulse Timing in the Brain Owl Auditory Mass. In: Maass, W., Bishop, C.M. (eds.) Pulsed Neural Networks, MIT Press, Cambridge (1999)
Google Scholar
Hopfield, J., Brody, C.D.: What is a moment? Transient synchrony as a collective mechanism for spatiotemporal integration. PNAS 98(3), 1282–1287 (2001)
Article Google Scholar
Maass, W.: Computation with spiking neurons. In: Arbib, M.A. (ed.) The Handbook of Brain Theory and Neural Networks, 2nd edn., MIT Press, Cambridge (2001)
Google Scholar
Steeneken, H., Hansen, J.: Speech Under Stress Conditions: Overview of the Effect of Speech Production and on System Performance. In: IEEE ICASSP-1999: Inter. Conf. on Acoustics, Speech, and Signal Processing, Phoenix, Arizona, March 1999, vol. 4, pp. 2079–2082 (1999)
Google Scholar
Praat homepage: http://www.fon.hum.uva.nl/praat
Yacoub, S., Simske, S., Lin, X., Burns, J.: Recognition of emotions in interactive voice response systems. In: Proc. Eurospeech, Geneva (2003)
Google Scholar
Kwon, O.-W., Chan, K.-L., Hao, J., Lee, T.-W.: Emotion Recognition by Speech Signals. In: Eurospeech 2003, September 2003, pp. 125–128 (2003)
Google Scholar

Download references

Author information

Authors and Affiliations

Dipartimento di Informatica, Universita degli Studi di Bari, Via E. Orabona 4, 70126, Bari, Italy
Cosimo A. Buscicchio, Przemysław Górecki & Laura Caponetti

Authors

Cosimo A. Buscicchio
View author publications
You can also search for this author in PubMed Google Scholar
Przemysław Górecki
View author publications
You can also search for this author in PubMed Google Scholar
Laura Caponetti
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Dipartimento di Informatica, Università degli Studi di Bari,
Floriana Esposito
Department of Computer Science, University of North Carolina, NC 28223, Charlotte, USA
Zbigniew W. Raś
Dipartimento di Informatica, Università degli Studi di Bari, via Orabona, 4, 70126, Bari, Italy
Donato Malerba
Dipartimento di Informatica, Università di Bari, Via E. Orabona, 4, 70125, Bari, Italia
Giovanni Semeraro

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Buscicchio, C.A., Górecki, P., Caponetti, L. (2006). Speech Emotion Recognition Using Spiking Neural Networks. In: Esposito, F., Raś, Z.W., Malerba, D., Semeraro, G. (eds) Foundations of Intelligent Systems. ISMIS 2006. Lecture Notes in Computer Science(), vol 4203. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11875604_6

Download citation

DOI: https://doi.org/10.1007/11875604_6
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-45764-0
Online ISBN: 978-3-540-45766-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Speech Emotion Recognition Using Spiking Neural Networks

Abstract

Access this chapter

Preview

Similar content being viewed by others

Shallow over Deep Neural Networks: A Empirical Analysis for Human Emotion Classification Using Audio Data

Speech Emotions Recognition Using 2-D Neural Classifier

Recognizing Emotional States Using Speech Information

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Speech Emotion Recognition Using Spiking Neural Networks

Abstract

Access this chapter

Preview

Similar content being viewed by others

Shallow over Deep Neural Networks: A Empirical Analysis for Human Emotion Classification Using Audio Data

Speech Emotions Recognition Using 2-D Neural Classifier

Recognizing Emotional States Using Speech Information

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation