Abstract
After a short summary of some basic properties of speech signals and of speech signal models the effect of linear prediction and vector quantization for data compression in speech coding is outlined. Some well-known coding schemes are reviewed. The recently developed RELP-S schemes based on speech analysis by synthesis are discussed in more detail. In particular a scheme using stochastic excitation sequences is expected to guarantee high speech quality at data rates far below 8 kb/s.
This is a preview of subscription content, log in via an institution.
Buying options
Tax calculation will be finalised at checkout
Purchases are for personal use only
Learn about institutional subscriptionsPreview
Unable to display preview. Download preview PDF.
References
Atal, B.S., “Predictive Coding of Speech at Low Bit Rates”, IEEE Trans. on Communications, COM-30, (1986) pp. 600–614.
Atal, B.S., and Rabiner, L.R., “Speech Research Directions”, AT&T Techn. Journal 65, (1986) pp. 75–88.
Atal, R., and Remde, J.R., “A New Model of LPC Excitation for Producing Natural-Sounding Speech at Low Bit Rates”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Paris 1982, pp. 614–617.
Brehm, H., and Stammler, W., “Description and Generation of Spherically Invariant Speech-Model Signals”, Signal Processing 12, (1987) pp. 119–141.
Buzo, A., Gray, H., Gray, R.M.,and Markel, J.D., “Speech Coding Based upon VectorQuantization”, IEEE A., “Adaptive Differential Conference Record Globe-Com
Caspers, B., and Atal, B.S., “Role of Multi-Pulse Excitation in Synthesis of Natural-Sounding Voiced Speech”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Dallas 1987, pp. 2388–2394.
Cheng, D.Y., Gersho, A., Ramamurthi, B., and Shoham, Y., “Fast Search Algorithms for Vector Quantization and Pattern Matching”, Proc. Int. Conf. Acoust., Speech, and Signal Processing, San Diego (CA) 1984, pp. 9.11.1–9.11.4.
Cuperman, V., and Gersho, A Adaptive Differential Vector Coding of Speech”,Conference Record Globe-com 82, (1982) pp. 1092–1096.
Flanagan, J.L.f “Speech Analysis, Synthesis, and Perception”, Springer-Verlag Berlin, Heidelberg, New York 1972.
Flanagan, J.L., et al., “Speech Coding”, IEEE Trans, on Communications, COM-27, (1979) pp. 710–736.
Gersho, A., “On the Structure of Vector Quantizers”, IEEE Trans. Inform. Theory, IT-28, (1982) pp.157–166.
Gray, R.M., and Karnin, E.D., “Multiple Local Optima in Vector Quantizers”, IEEE Trans. Inform. Theory, IT-28, (1982) pp. 256–261.
Gray, R.M., “Vector Quantization”, IEEE ASSP Magazine 1, (1984) pp. 4–29.
Guth, P., Reininger, H., und Wolf, D., “Zur Vektorquantisierung der Pradiktorparameter”, Kleinheubacher Berichte 29, (1986) pp. 91–94.
Itakura, F., and Saito, S., “Analysis Synthesis Telephony Based upon the Maximum Likelihood Method”, Reports on the 6th Int. Cong. Acoust., ed. by Y. Kohasi, Tokyo, (1968) pp. C-5-5 – C17-20.
Jayant, N.S., “Coding Speech at Low Bit Rates”, IEEE Spectrum 23, (1986) pp. 58–63.
Jayant, N.S., and Noll, P., “Digital Coding of Waveforms”, Prentice Hall, Inc., Englewood Cliffs, New Jersey 1984.
Kroon, P., and Atal, B.S., “Quantization Procedures for the Excitation in CELP Coders”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Dallas 1987, pp. 1649–1652.22
Kroon, P., Deprettere, E.F., and Sluyter, R.J., “Regu- lar-Pulse Excitation - A Novel Approach to Effective and Efficient Coding of Speech”, IEEE Trans, on Acoust., Speech, and Signal Processing, ASSP 34, (1986) pp. 1054–1063.
Linde, Y., Buzo, A., and Gray, R.M., “An Algorithm for Vector Quantizer Design”, IEEE Trans, on Communications, COM-28, (1980) pp.84–95.
Makhoul, J., Roucos, S., and Gish, H., “Vector Quantization in Speech Coding”, Proc. IEEE, 73, (1985) pp. 1551–1588.
Makhoul, J., “Linear Prediction: A Tutorial Review”, Proc. IEEE, 63, (1975) pp. 561–580.
Marke1, J.D., and Gray Jr., A.H., “Linear Prediction of Speech”, Springer-Verlag, Berlin, Heidelberg, New York 1976.
Rabiner, L.R., and Schafer, R.W., “Digital Processing of Speech Signals”, Prentice-Hall, Inc., Englewood Cliffs, New Jersey 1978.
Ramachandran, R.P., and Kabal, P., “Stability and Performance Analysis of Pitch Filters in Speech Coders”, IEEE Trans. Acoust., Speech, and Signal Processing, ASSP-35, (1987) pp. 937–946.
Reininger, H., “Prinzipien der digitalen Sprachcodierung und ihre Anwendung zur Sprachübertragung über Fadingkanäle bei mittleren Datenraten”, Dissertation, Institut für Angewandte Physik, Universität Frankfurt am Main, 1987.
Reininger, H., and Wolf, D., “Fast Search Algorithms for Speech Coding Schemes Using Vector Quantization”, Signal Processing III: Theories and Applications, North Holland, Amsterdam 1986, pp. 453–456.
Schroeder, M.R., and Atal, B.S., “Code-Excited Linear Prediction (CELP): High-Quality Speech at Very Low Bit Rates”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Tampa 1985, pp. 937–940.
Wolf, D., “Speech Coding”, Proc. Zurich Seminar on Digital Communications, (1984) pp. 1–5.
Wolf, D., “Statistical Models of Speech”, NTG-FachBerichte 65, (1978) pp. 1–9.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1988 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Wolf, D., Reininger, H. (1988). Recent Advances in Speech Coding. In: Niemann, H., Lang, M., Sagerer, G. (eds) Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series, vol 46. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83476-9_1
Download citation
DOI: https://doi.org/10.1007/978-3-642-83476-9_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-83478-3
Online ISBN: 978-3-642-83476-9
eBook Packages: Springer Book Archive