Skip to main content

IITKGP-SESC: Speech Database for Emotion Analysis

  • Conference paper
Contemporary Computing (IC3 2009)

Part of the book series: Communications in Computer and Information Science ((CCIS,volume 40))

Included in the following conference series:

Abstract

In this paper, we are introducing the speech database for analyzing the emotions present in speech signals. The proposed database is recorded in Telugu language using the professional artists from All India Radio (AIR), Vijayawada, India. The speech corpus is collected by simulating eight different emotions using the neutral (emotion free) statements. The database is named as Indian Institute of Technology Kharagpur Simulated Emotion Speech Corpus (IITKGP-SESC). The proposed database will be useful for characterizing the emotions present in speech. Further, the emotion specific knowledge present in speech at different levels can be acquired by developing the emotion specific models using the features from vocal tract system, excitation source and prosody. This paper describes the design, acquisition, post processing and evaluation of the proposed speech database (IITKGP-SESC). The quality of the emotions present in the database is evaluated using subjective listening tests. Finally, statistical models are developed using prosodic features, and the discrimination of the emotions is carried out by performing the classification of emotions using the developed statistical models.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 84.99
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 109.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. Database for Indian languages. Speech and Vision lab, Indian Institute of Technology Madras, India (2001)

    Google Scholar 

  2. Ramamohan, S., Dandapat, S.: Sinusoidal model-based analysis and classification of stressed speech. IEEE Trans. Speech and Audio Processing 14, 737–746 (2006)

    Article  Google Scholar 

  3. Sagar, T.V.: Characterisation and synthesis of emotionsin speech using prosodic features. Master’s thesis, Dept. of Electronics and communications Engineering, Indian Institute of Technology Guwahati (May 2007)

    Google Scholar 

  4. Lee, C.M., Narayanan, S.: Toward detecting emotions in spoken dialogs. IEEEAUP 13(2), 293–303 (2005)

    Google Scholar 

  5. Ververidis, D., Kotropoulos, C.: A state of the art review on emotional speech databases. In: Eleventh Australasian International Conference on Speech Science and Technology, Auckland, New Zealand (December 2006)

    Google Scholar 

  6. Yang, L.: The expression and recognition of emotions through prosody. In: Proc. Int. Conf. Spoken Language Processing, pp. 74–77 (2000)

    Google Scholar 

  7. Cowie, R., Cornelius, R.R.: Describing the emotional states that are expressed in speech. Speech Communication 40, 5–32 (2003)

    Article  Google Scholar 

  8. Prasanna, S.R.M., Yegnanarayana, B.: Extraction of pitch in adverse conditions. In: Proc. IEEE Int. Conf. Acoust., Speech, Signal Processing, Montreal, Canada (May 2004)

    Google Scholar 

  9. Haykin, S.: Neural Networks: A Comprehensive Foundation. Pearson Education Aisa, Inc., New Delhi (1999)

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2009 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

Koolagudi, S.G., Maity, S., Kumar, V.A., Chakrabarti, S., Rao, K.S. (2009). IITKGP-SESC: Speech Database for Emotion Analysis. In: Ranka, S., et al. Contemporary Computing. IC3 2009. Communications in Computer and Information Science, vol 40. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-03547-0_46

Download citation

  • DOI: https://doi.org/10.1007/978-3-642-03547-0_46

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-642-03546-3

  • Online ISBN: 978-3-642-03547-0

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics