Summary and Conclusions
This chapter summarizes the research work presented in this book, highlights the contributions of the work and discusses the scope for future work. In this book, main attention was given to emotion specific spectral and prosodic features for performing the robust emotion recognition. The book is organized into 7 chapters. The first chapter introduces speech emotion recognition as the contemporary research area. In Chap. 2 the spectral features extracted from sub-syllabic regions such as vowels, consonants and CV transition regions are proposed for robust emotion recognition from speech. Pitch synchronously extracted spectral features are also used in Chap. 2 for recognizing the emotions. Chapter 3 proposes use of dynamic prosodic features for recognition of emotions. These dynamic features along with the static prosodic features derived from sentence, word and syllable levels are used for characterizing the emotions. Emotion specific information present in different positions (initial, middle and final) of the speech utterances is used for emotion classification. In Chap. 4, combinations of various emotion specific speech features are explored for developing the robust emotion recognition systems. Chapter 5 deals with the method of multistage emotion classification using combination of features. In this chapter two stage emotion recognition system is developed using spectral and prosodic features. Chapter 6 introduces real life emotion recognition approach using different features. Chapter 7 concludes the present work and flashes light on the directions for further research.
- 1.S.G. Koolagudi, K.S. Rao, Emotion recognition from speech using sub-syllabic and pitch synchronous spectral features. Int. J. Speech Technol. 15, 495–511, (Springer, Dec 2012). doi: 10.1007/s10772-012-9150-8
- 2.K.S. Rao, S.G. Koolagudi, R. R. Vempada, Emotion recognition from speech using global and local prosodic features. Int. J. Speech Technol. 15, (Springer, Aug 2012). doi: 10.1007/s10772-012-9172-2
- 3.S.G. Koolagudi, K.S. Rao, Emotion recognition from speech using source, system and prosodic features. Int. J. Speech Technol. 15(3), 265–289, (Springer , 2012)Google Scholar
- 5.S.G. Koolagudi, A. Barthwal, S. Devliyal, K.S. Rao, Real life emotion classification from speech using gaussian mixture models, in Communications in Computer and Information Science: Contemporary Computing, ed. by M. Parashar, D. Kaushik, O. F. Rana, R. Samtaney, Y. Yang, A. Zomaya, vol. 306 (Springer, New York, Aug 6–8 2012) pp. 250–261Google Scholar
- 6.S.G. Koolagudi, S. Devliyal, A. Barthwal, K.S. Rao, Emotion recognition from semi natural speech using artificial neural networks and excitation source features, in Communications in Computer and Information Science: Contemporary Computing, ed. by M. Parashar, D. Kaushik, O. F. Rana, R. Samtaney, Y. Yang, A. Zomaya, vol. 306 (Springer, New York, Aug 6–8 2012), pp. 273–282Google Scholar