Skip to main content

Robust Emotion Recognition using Sentence, Word and Syllable Level Prosodic Features

  • Chapter
  • First Online:
Robust Emotion Recognition using Spectral and Prosodic Features

Part of the book series: SpringerBriefs in Electrical and Computer Engineering ((BRIEFSSPEECHTECH))

  • 948 Accesses

Abstract

This chapter discuss about the use of prosodic information in discriminating the emotions. The motivation for exploring the prosodic features to recognize the emotions is illustrated using the gross statistics and time varying patterns of prosodic parameters. Prosodic correlates of speech such as energy, duration and pitch parameters are computed from the emotional utterances. Global prosodic features representing the gross statistics of prosody and local prosodic features representing the finer variations in prosody are introduced in this chapter for discriminating the emotions. These parameters are further extracted separately from different levels such as entire utterances, words and syllables. The analysis of contribution of emotional information by the initial, middle and final portions of sentences, words and syllables are studied. Use of support vector machines for classifying emotional utterances based on prosodic features has been demonstrated. Chapter ends with discussion on emotion recognition results and important conclusions.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

eBook
USD 16.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 16.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. K. Murty, B. Yegnanarayana, Epoch extraction from speech signals. IEEE Trans. Audio Speech Lang. Process. 16, 1602–1613 (2008)

    Article  Google Scholar 

  2. S.R.M. Prasanna, B.V.S. Reddy, P. Krishnamoorthy, Vowel onset point detection using source, spectral peaks, and modulation spectrum energies. IEEE Trans Audio Speech Lang Process 17, 556–565 (May 2009)

    Article  Google Scholar 

  3. S.R.M. Prasanna, J.M. Zachariah, Detection of vowel onset point in speech, in Proceedings of IEEE International Conference Acoustics Speech, Signal Processing Orlando, Florida, USA May 2002

    Google Scholar 

  4. S.R.M. Prasanna, Event-Based Analysis of Speech. PhD thesis, Department of Computer Science and Engineering, Indian Institute of Technology Madras, Chennai, India, Mar 2004

    Google Scholar 

  5. http://web.nickshanks.com/fonts/telugu/

  6. K.S. Rao, S.G. Koolagudi, R.R. Vempada, Emotion recognition from speech using global and local prosodic features. Int. J. Speech Technol. 15, 265–289 (2012). doi:10.1007/s10772-012-9172-2

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to K. Sreenivasa Rao .

Rights and permissions

Reprints and permissions

Copyright information

© 2013 The Author(s)

About this chapter

Cite this chapter

Rao, K.S., Koolagudi, S.G. (2013). Robust Emotion Recognition using Sentence, Word and Syllable Level Prosodic Features. In: Robust Emotion Recognition using Spectral and Prosodic Features. SpringerBriefs in Electrical and Computer Engineering(). Springer, New York, NY. https://doi.org/10.1007/978-1-4614-6360-3_3

Download citation

  • DOI: https://doi.org/10.1007/978-1-4614-6360-3_3

  • Published:

  • Publisher Name: Springer, New York, NY

  • Print ISBN: 978-1-4614-6359-7

  • Online ISBN: 978-1-4614-6360-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics