Abstract
Before we explore the algorithms and techniques used to process speech signals to accomplish various objectives in an embedded application, we need to understand some fundamental principles behind the nature of speech signals. Of particular importance are the temporal and spectral characteristics of different types of vocal sounds produced by humans and what role the human speech production system itself plays in determining the properties of these sounds. This knowledge enables us to efficiently model the sounds generated, thereby providing the foundation of sophisticated techniques for compressing speech. Moreover, any spoken language is based on a combination and sequence of such sounds; hence understanding their salient features is useful for the design and implementation of effective speech recognition and synthesis techniques. In this section, we will learn how to classify the basic types of sounds generated by human voice and the underlying time-domain and frequency-domain characteristics behind these different types of sounds. Finally, and most importantly, we will explore some popular speech processing building-block techniques that enable us to extract critical pieces of information from the speech signal, such as which category a speech segment belongs to, the pitch of the sound, and the energy contained therein.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsReferences
Rabiner LR, Schafer RW Digital processing of speech signals, Prentice Hall, 1998.
Chau WC Speech coding algorithms, Wiley-Interscience, 2003.
Holmes J, Holmes W Speech synthesis and recognition, CRC Press, 2001.
Proakis JG, Manolakis DG Digital Signal Processing – Principles, Algorithms and Applications, Prentice Hall, 1995.
Flanagan JL, Speech Analysis and Perception, Springer Science+Business Media B.V., 1965.
Rubin P, Vatikiotis-Bateson E Measuring and Modeling Speech Production, Animal Acoustic Communication, Springer Science+Business Media B.V., 1998.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2010 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Sinha, P. (2010). Basic Speech Processing Concepts. In: Speech Processing in Embedded Systems. Springer, Boston, MA. https://doi.org/10.1007/978-0-387-75581-6_3
Download citation
DOI: https://doi.org/10.1007/978-0-387-75581-6_3
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-0-387-75580-9
Online ISBN: 978-0-387-75581-6
eBook Packages: EngineeringEngineering (R0)