Transformations of LPC and LSF Parameters to Speech Recognition Features

de Alencar, Vladimir Fabregas Surigué; Alcaim, Abraham

doi:10.1007/11551188_57


Part of the book series: Lecture Notes in Computer Science (LNIP, volume 3686)

Included in the following conference series:

International Conference on Pattern Recognition and Image Analysis


Abstract

In this paper, we describe and evaluate several features for distributed speech recognition systems. These systems follow a client-server architecture in which the recognizer has access only to the coded parameters produced by the speech coder used in the communication network (e.g., cellular mobile and IP networks). The recognition features considered here are therefore obtained from transformations of the codec parameters. In particular, features generated from LPC and LSF parameters, at intervals of 10 ms and 20 ms, are analyzed in a continuous-observation, HMM-based, speaker-independent recognizer.
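
The abstract does not say which transformations are evaluated. As a general illustration of how codec LPC parameters can be turned into recognition features, the sketch below applies the standard LPC-to-cepstrum recursion. It is not claimed to be the authors' method. The function name `lpc_to_cepstrum` is hypothetical, and the sign convention A(z) = 1 + a_1 z^-1 + ... + a_p z^-p for the inverse filter is an assumption; many codecs use the opposite sign.

```python
def lpc_to_cepstrum(a, n_ceps):
    """Convert LPC coefficients to LPC cepstral coefficients.

    Assumption: `a` holds [a_1, ..., a_p] of the inverse filter
    A(z) = 1 + a_1 z^-1 + ... + a_p z^-p, so the all-pole model is 1 / A(z).
    Returns [c_1, ..., c_n_ceps]; the gain term c_0 is not computed.
    """
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] is a placeholder for the gain term
    for n in range(1, n_ceps + 1):
        # Sum of k * c_k * a_(n-k) over the k for which 1 <= n - k <= p.
        acc = 0.0
        for k in range(max(1, n - p), n):
            acc += k * c[k] * a[n - k - 1]
        a_n = a[n - 1] if n <= p else 0.0  # a_n is zero beyond the LPC order
        c[n] = -a_n - acc / n
    return c[1:]
```

For a first-order filter A(z) = 1 - 0.5 z^-1, calling `lpc_to_cepstrum([-0.5], 3)` gives the closed-form values 0.5^n / n for n = 1, 2, 3. In a distributed recognizer, a transform of this kind would be applied to each decoded frame (for example every 10 ms or 20 ms) before the features are passed to the HMM.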



Author information

Authors and Affiliations

Centro de Estudos em Telecomunicações – CETUC, Pontifícia Universidade Católica do Rio de Janeiro – PUC-RIO, Rua Marquês de São Vicente, 225, 22453-900, Rio de Janeiro, RJ, Brazil
Vladimir Fabregas Surigué de Alencar & Abraham Alcaim


Editor information

Editors and Affiliations

Research School of Informatics, Loughborough, UK
Sameer Singh
ATR Lab, Research School of Informatics, University of Loughborough, Loughborough, UK
Maneesha Singh
IBM Corporation, 1133 Westchester Avenue, White Plains, 10604, New York, United States
Chid Apte
Institute of Computer Vision and applied Computer Sciences, IBaI, Germany
Petra Perner


Cite this paper

de Alencar, V.F.S., Alcaim, A. (2005). Transformations of LPC and LSF Parameters to Speech Recognition Features. In: Singh, S., Singh, M., Apte, C., Perner, P. (eds) Pattern Recognition and Data Mining. ICAPR 2005. Lecture Notes in Computer Science, vol 3686. Springer, Berlin, Heidelberg. https://doi.org/10.1007/11551188_57


DOI: https://doi.org/10.1007/11551188_57
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-28757-5
Online ISBN: 978-3-540-28758-2
eBook Packages: Computer Science, Computer Science (R0)
