Abstract
In this paper, we describe and evaluate several features for distributed speech recognition systems. These systems are based on a client-server architecture in which the recognizer has access only to the coded parameters of the speech coder employed in the communication network (e.g., cellular mobile and IP networks). The recognition features considered here are obtained from transformations of the codec parameters. In particular, features generated from LPC and LSF parameters, at intervals of 10 ms and 20 ms, are analyzed in a continuous-observation, HMM-based, speaker-independent recognizer.
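The abstract does not say which transformations are evaluated. As an illustration only, the sketch below shows one widely used way to turn LPC parameters into recognition features: the recursion that converts LPC coefficients into LPC cepstral coefficients. The function name `lpc_to_cepstrum` and the sign convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p (all-pole model 1/A(z)) are assumptions made for this example and are not taken from the paper.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to LPC cepstral coefficients.

    Assumed convention (not from the paper): a = [a_1, ..., a_p] for
    A(z) = 1 + sum_k a_k z^-k, with the all-pole model H(z) = 1 / A(z).
    Returns [c_1, ..., c_n_ceps]; the gain term c_0 is omitted.
    """
    p = len(a)
    c = []
    for n in range(1, n_ceps + 1):
        # Recursive term: sum over k of k * c_k * a_(n-k), limited to
        # indices where a_(n-k) exists (1 <= n - k <= p).
        acc = sum(k * c[k - 1] * a[n - k - 1] for k in range(max(1, n - p), n))
        # Direct term -a_n is only present while n <= p.
        direct = -a[n - 1] if n <= p else 0.0
        c.append(direct - acc / n)
    return c


if __name__ == "__main__":
    # Single pole at z = 0.5, i.e. A(z) = 1 - 0.5 z^-1.
    print(lpc_to_cepstrum([-0.5], 3))
```

For a single real pole at z = r the cepstrum has the closed form c_n = r^n / n, so the call above should return values close to 0.5, 0.125 and 0.0417, which gives a quick sanity check of the recursion.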
Copyright information
© 2005 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
de Alencar, V.F.S., Alcaim, A. (2005). Transformations of LPC and LSF Parameters to Speech Recognition Features. In: Singh, S., Singh, M., Apte, C., Perner, P. (eds) Pattern Recognition and Data Mining. ICAPR 2005. Lecture Notes in Computer Science, vol 3686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551188_57
DOI: https://doi.org/10.1007/11551188_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28757-5
Online ISBN: 978-3-540-28758-2
eBook Packages: Computer Science, Computer Science (R0)