Abstract
The initial stages in speech processing, discussed in Chapter 4, are commonly performed using a short-time Fourier transformation (STFT) of the digitally-sampled acoustic time series. Several representations of the STFT have been employed for automatic speech recognition, including linear, logarithmic scale, logarithmic mel-scale, cepstral and differenced-cepstral coefficients. However, recent investigations of mammalian auditory processing have determined that the cochlea is a time-domain analyzer, and that the STFT representation is not always the most appropriate method of signal analysis. Therefore, this chapter reviews the properties and behavior of cochlear models and their importance to ASR. It emphasizes the benefits gained from better models of “early” signal processing in mammals. A discussion of artificial neural network applications for conventional signal processing problems follows. The remainder of this chapter discusses how low-level “feature maps” may be created and used in ASR applications.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Rights and permissions
Copyright information
© 1991 Springer Science+Business Media New York
About this chapter
Cite this chapter
Morgan, D.P., Scofield, C.L. (1991). Signal Processing and Feature Extraction. In: Neural Networks and Speech Processing. The Springer International Series in Engineering and Computer Science, vol 130. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-3950-6_6
Download citation
DOI: https://doi.org/10.1007/978-1-4615-3950-6_6
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-6763-5
Online ISBN: 978-1-4615-3950-6
eBook Packages: Springer Book Archive