Abstract
This paper proposes a wavelet-packet-based method for constructing an adaptive filter bank, using the additive Fisher ratio as the criterion for pruning the wavelet packet tree. From the resulting filter bank we derive a novel acoustic feature, discriminative band wavelet packet power coefficients (db-WPPC), and build a speech emotion recognition system on it. Experimental results show that the proposed feature improves emotion recognition performance over the conventional MFCC feature.
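The pruning idea can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract gives no algorithmic details, so the Haar filter, the log band power feature, the bottom-up split rule (keep a split only if the children's Fisher ratios sum to more than the parent's), and all function names below are assumptions made for illustration.

```python
import numpy as np

def haar_split(x):
    """One Haar analysis step: return (low-pass, high-pass) half-length bands."""
    x = x[: len(x) // 2 * 2]
    even, odd = x[0::2], x[1::2]
    return (even + odd) / np.sqrt(2), (even - odd) / np.sqrt(2)

def wp_log_powers(x, max_level):
    """Log power of every node (level, index) in a full Haar wavelet packet tree."""
    nodes = {(0, 0): np.asarray(x, dtype=float)}
    for level in range(max_level):
        for idx in range(2 ** level):
            lo, hi = haar_split(nodes[(level, idx)])
            nodes[(level + 1, 2 * idx)] = lo
            nodes[(level + 1, 2 * idx + 1)] = hi
    return {k: np.log(np.mean(v ** 2) + 1e-12) for k, v in nodes.items()}

def fisher_ratio(values, labels):
    """Between-class scatter divided by within-class scatter of a scalar feature."""
    values, labels = np.asarray(values, dtype=float), np.asarray(labels)
    overall = values.mean()
    between, within = 0.0, 0.0
    for c in np.unique(labels):
        v = values[labels == c]
        between += len(v) * (v.mean() - overall) ** 2
        within += np.sum((v - v.mean()) ** 2)
    return between / (within + 1e-12)

def prune(scores, max_level, node=(0, 0)):
    """Return (leaves, total score); split a node only if its children score more in total."""
    level, idx = node
    if level == max_level:
        return [node], scores[node]
    left, left_score = prune(scores, max_level, (level + 1, 2 * idx))
    right, right_score = prune(scores, max_level, (level + 1, 2 * idx + 1))
    if left_score + right_score > scores[node]:
        return left + right, left_score + right_score
    return [node], scores[node]

def select_bands(frames, labels, max_level=3):
    """Pick the leaf bands of the pruned tree from labelled training frames."""
    per_frame = [wp_log_powers(f, max_level) for f in frames]
    scores = {k: fisher_ratio([p[k] for p in per_frame], labels) for k in per_frame[0]}
    leaves, _ = prune(scores, max_level)
    return leaves

def band_power_features(frame, leaves, max_level=3):
    """Log band powers of one frame over the selected bands (a db-WPPC-like vector)."""
    powers = wp_log_powers(frame, max_level)
    return np.array([powers[k] for k in leaves])
```

Because the score is additive over bands, the bottom-up comparison can be made independently at each node, and the selected leaves always tile the full frequency range.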
© 2013 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Li, Y., Zhang, G., Huang, Y. (2013). Adaptive Wavelet Packet Filter-Bank Based Acoustic Feature for Speech Emotion Recognition. In: Sun, Z., Deng, Z. (eds) Proceedings of 2013 Chinese Intelligent Automation Conference. Lecture Notes in Electrical Engineering, vol 256. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-38466-0_40
DOI: https://doi.org/10.1007/978-3-642-38466-0_40
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-38465-3
Online ISBN: 978-3-642-38466-0
eBook Packages: Engineering, Engineering (R0)