Abstract
This work demonstrates the effectiveness of a Deep Belief Network (DBN) over a Gaussian mixture model (GMM) for speech emotion recognition. The proposed GMM-DBN system first models a GMM for each emotion independently, using Mel-frequency cepstral coefficient (MFCC) features extracted from speech. For each utterance, the minimum distance between the distribution of its features and each emotion model is derived as a bag of acoustic features (BoF) and plotted as a histogram, in which each count is the number of feature distributions that are close to a given emotion model. The BoF is then passed to the DBN to train the models. The effectiveness of DBN-based emotion recognition is observed empirically by increasing the number of restricted Boltzmann machine (RBM) layers and then tuning the available parameters. The work is motivated by tests on the classical German speech emotion database (EmodB), where the proposed GMM-DBN system empirically improves the recognition rate by 5% over the conventional MFCC-GMM system. Further tests on a recently developed simulated speech emotion database for the Tamil language give comparable emotion recognition results.
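The BoF step described above can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not confirm: the "distance" is taken to be the negative log-likelihood of a feature frame under a diagonal-covariance GMM, each frame votes for its single closest emotion model, and the names `diag_gmm_neg_log_likelihood`, `bag_of_features`, `toy_gmms` and `toy_frames` are hypothetical. The toy values stand in for real MFCC frames and trained models, and the DBN stage is not shown.

```python
import math


def diag_gmm_neg_log_likelihood(frame, gmm):
    """Negative log-likelihood of one feature frame under a diagonal GMM.

    `gmm` is a list of (weight, means, variances) tuples, one per component.
    Using this quantity as the "distance" is an assumption of this sketch.
    """
    log_terms = []
    for weight, means, variances in gmm:
        log_p = math.log(weight)
        for x, mu, var in zip(frame, means, variances):
            log_p += -0.5 * (math.log(2.0 * math.pi * var) + (x - mu) ** 2 / var)
        log_terms.append(log_p)
    peak = max(log_terms)  # log-sum-exp over components for numerical stability
    return -(peak + math.log(sum(math.exp(t - peak) for t in log_terms)))


def bag_of_features(frames, emotion_gmms):
    """Histogram of frames per emotion, each frame counted for its closest model."""
    counts = {emotion: 0 for emotion in emotion_gmms}
    for frame in frames:
        closest = min(
            emotion_gmms,
            key=lambda e: diag_gmm_neg_log_likelihood(frame, emotion_gmms[e]),
        )
        counts[closest] += 1
    return [counts[emotion] for emotion in emotion_gmms]


# Toy stand-ins (hypothetical): two emotions, 2-dimensional "MFCC" frames.
toy_gmms = {
    "anger": [(1.0, (0.0, 0.0), (1.0, 1.0))],
    "neutral": [(1.0, (5.0, 5.0), (1.0, 1.0))],
}
toy_frames = [(0.1, -0.2), (4.8, 5.1), (5.2, 4.9), (0.3, 0.1), (5.0, 5.0)]

bof = bag_of_features(toy_frames, toy_gmms)
print(dict(zip(toy_gmms, bof)))  # {'anger': 2, 'neutral': 3}
```

In a full pipeline, one such histogram per utterance would form the input vector to the DBN. Whether the counts should be normalised first is a design choice the abstract leaves open.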
© 2018 Springer International Publishing AG
About this paper
Cite this paper
Srikanth, M., Pravena, D., Govind, D. (2018). Tamil Speech Emotion Recognition Using Deep Belief Network(DBN). In: Thampi, S., Krishnan, S., Corchado Rodriguez, J., Das, S., Wozniak, M., Al-Jumeily, D. (eds) Advances in Signal Processing and Intelligent Recognition Systems. SIRS 2017. Advances in Intelligent Systems and Computing, vol 678. Springer, Cham. https://doi.org/10.1007/978-3-319-67934-1_29
DOI: https://doi.org/10.1007/978-3-319-67934-1_29
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-67933-4
Online ISBN: 978-3-319-67934-1
eBook Packages: Engineering, Engineering (R0)