Abstract
Speech is an acoustic representation of a word or sequence of words, characterized by a slowly changing spectral envelope. Humans perceive this spectral envelope, and convert it into the underlying word string and its associated meaning. The ultimate goal of speech and language processing is to mimic this process so that a machine can hold a natural conversation with a human. Speech and language processing has a far wider role to play, however, in performing less complex tasks such as transcription, language identification, or audio document retrieval, all of which are feasible to a certain extent now. The basic step in all of these systems is to perform the inverse mapping of the speech into the underlying sequence of symbols, usually words, that produced it, as shown in Fig. 2.1. This chapter describes a statistical approach to solving the automatic speech recognition problem, based on stochastic Markov process models.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media Dordrecht
About this chapter
Cite this chapter
Knill, K., Young, S. (1997). Hidden Markov Models in Speech and Language Processing. In: Young, S., Bloothooft, G. (eds) Corpus-Based Methods in Language and Speech Processing. Text, Speech and Language Technology, vol 2. Springer, Dordrecht. https://doi.org/10.1007/978-94-017-1183-8_2
Download citation
DOI: https://doi.org/10.1007/978-94-017-1183-8_2
Publisher Name: Springer, Dordrecht
Print ISBN: 978-90-481-4813-4
Online ISBN: 978-94-017-1183-8
eBook Packages: Springer Book Archive