Basic Speech Processing Concepts

Abstract

Before we explore the algorithms and techniques used to process speech signals in an embedded application, we need to understand some fundamental principles behind the nature of speech signals. Of particular importance are the temporal and spectral characteristics of the different types of vocal sounds produced by humans, and the role the human speech production system itself plays in determining the properties of these sounds. This knowledge enables us to model the generated sounds efficiently, which provides the foundation for sophisticated speech compression techniques. Moreover, any spoken language is based on a combination and sequence of such sounds; hence, understanding their salient features is useful for the design and implementation of effective speech recognition and synthesis techniques. In this chapter, we will learn how to classify the basic types of sounds generated by the human voice and examine the underlying time-domain and frequency-domain characteristics of these different types of sounds. Finally, and most importantly, we will explore some popular speech processing building-block techniques that enable us to extract critical pieces of information from the speech signal, such as which category a speech segment belongs to, the pitch of the sound, and the energy it contains.
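As a rough illustration of the kind of building-block measurements mentioned above, the following Python sketch computes three common per-frame features: short-time energy, zero-crossing rate, and an autocorrelation-based pitch estimate. It is not taken from the chapter; the function names, the 60 to 400 Hz pitch search range, and the synthetic 200 Hz test tone are all assumptions chosen for illustration, and a real embedded implementation would typically use fixed-point arithmetic and windowing.

```python
import math


def frame_energy(frame):
    """Short-time energy: sum of squared samples in the frame."""
    return sum(x * x for x in frame)


def zero_crossing_rate(frame):
    """Fraction of adjacent sample pairs whose signs differ."""
    crossings = sum(
        1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0)
    )
    return crossings / (len(frame) - 1)


def autocorr_pitch(frame, sample_rate, f_min=60.0, f_max=400.0):
    """Estimate pitch in Hz from the largest autocorrelation value.

    The search is limited to lags corresponding to f_min..f_max
    (an assumed range, not one specified by the chapter).
    """
    lag_min = int(sample_rate / f_max)
    lag_max = min(int(sample_rate / f_min), len(frame) - 1)
    best_lag, best_val = lag_min, float("-inf")
    for lag in range(lag_min, lag_max + 1):
        val = sum(frame[n] * frame[n + lag] for n in range(len(frame) - lag))
        if val > best_val:
            best_lag, best_val = lag, val
    return sample_rate / best_lag


# Hypothetical test input: a 30 ms frame of a 200 Hz tone sampled at 8 kHz.
sample_rate = 8000
frame = [math.sin(2 * math.pi * 200 * n / sample_rate) for n in range(240)]

energy = frame_energy(frame)
zcr = zero_crossing_rate(frame)
pitch = autocorr_pitch(frame, sample_rate)
print(energy, zcr, pitch)
```

Voiced sounds tend to show high energy and a low zero-crossing rate, while unvoiced sounds tend to show the opposite, so simple thresholds on these two features are one common way to assign a frame to a category before attempting a pitch estimate.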

Author information

Correspondence to Priyabrata Sinha.

Copyright information

© 2010 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Sinha, P. (2010). Basic Speech Processing Concepts. In: Speech Processing in Embedded Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-75581-6_3

  • DOI: https://doi.org/10.1007/978-0-387-75581-6_3

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-0-387-75580-9

  • Online ISBN: 978-0-387-75581-6

  • eBook Packages: Engineering (R0)
