Recent Advances in Speech Coding

Wolf, D.; Reininger, H.

doi:10.1007/978-3-642-83476-9_1

Recent Advances in Speech Coding

D. Wolf³ &
H. Reininger³

Conference paper

97 Accesses
2 Citations

Part of the book series: NATO ASI Series ((NATO ASI F,volume 46))

Abstract

After a short summary of some basic properties of speech signals and of speech signal models the effect of linear prediction and vector quantization for data compression in speech coding is outlined. Some well-known coding schemes are reviewed. The recently developed RELP-S schemes based on speech analysis by synthesis are discussed in more detail. In particular a scheme using stochastic excitation sequences is expected to guarantee high speech quality at data rates far below 8 kb/s.

This is a preview of subscription content, log in via an institution.

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 84.99; Price excludes VAT (USA)

Softcover Book: USD 109.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Learn about institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

References

Atal, B.S., “Predictive Coding of Speech at Low Bit Rates”, IEEE Trans. on Communications, COM-30, (1986) pp. 600–614.
Google Scholar
Atal, B.S., and Rabiner, L.R., “Speech Research Directions”, AT&T Techn. Journal 65, (1986) pp. 75–88.
Google Scholar
Atal, R., and Remde, J.R., “A New Model of LPC Excitation for Producing Natural-Sounding Speech at Low Bit Rates”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Paris 1982, pp. 614–617.
Google Scholar
Brehm, H., and Stammler, W., “Description and Generation of Spherically Invariant Speech-Model Signals”, Signal Processing 12, (1987) pp. 119–141.
Article Google Scholar
Buzo, A., Gray, H., Gray, R.M.,and Markel, J.D., “Speech Coding Based upon VectorQuantization”, IEEE A., “Adaptive Differential Conference Record Globe-Com
Google Scholar
Caspers, B., and Atal, B.S., “Role of Multi-Pulse Excitation in Synthesis of Natural-Sounding Voiced Speech”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Dallas 1987, pp. 2388–2394.
Google Scholar
Cheng, D.Y., Gersho, A., Ramamurthi, B., and Shoham, Y., “Fast Search Algorithms for Vector Quantization and Pattern Matching”, Proc. Int. Conf. Acoust., Speech, and Signal Processing, San Diego (CA) 1984, pp. 9.11.1–9.11.4.
Google Scholar
Cuperman, V., and Gersho, A Adaptive Differential Vector Coding of Speech”,Conference Record Globe-com 82, (1982) pp. 1092–1096.
Google Scholar
Flanagan, J.L.f “Speech Analysis, Synthesis, and Perception”, Springer-Verlag Berlin, Heidelberg, New York 1972.
Google Scholar
Flanagan, J.L., et al., “Speech Coding”, IEEE Trans, on Communications, COM-27, (1979) pp. 710–736.
Google Scholar
Gersho, A., “On the Structure of Vector Quantizers”, IEEE Trans. Inform. Theory, IT-28, (1982) pp.157–166.
Google Scholar
Gray, R.M., and Karnin, E.D., “Multiple Local Optima in Vector Quantizers”, IEEE Trans. Inform. Theory, IT-28, (1982) pp. 256–261.
Google Scholar
Gray, R.M., “Vector Quantization”, IEEE ASSP Magazine 1, (1984) pp. 4–29.
Article Google Scholar
Guth, P., Reininger, H., und Wolf, D., “Zur Vektorquantisierung der Pradiktorparameter”, Kleinheubacher Berichte 29, (1986) pp. 91–94.
Google Scholar
Itakura, F., and Saito, S., “Analysis Synthesis Telephony Based upon the Maximum Likelihood Method”, Reports on the 6th Int. Cong. Acoust., ed. by Y. Kohasi, Tokyo, (1968) pp. C-5-5 – C17-20.
Google Scholar
Jayant, N.S., “Coding Speech at Low Bit Rates”, IEEE Spectrum 23, (1986) pp. 58–63.
Google Scholar
Jayant, N.S., and Noll, P., “Digital Coding of Waveforms”, Prentice Hall, Inc., Englewood Cliffs, New Jersey 1984.
Google Scholar
Kroon, P., and Atal, B.S., “Quantization Procedures for the Excitation in CELP Coders”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Dallas 1987, pp. 1649–1652.22
Google Scholar
Kroon, P., Deprettere, E.F., and Sluyter, R.J., “Regu- lar-Pulse Excitation - A Novel Approach to Effective and Efficient Coding of Speech”, IEEE Trans, on Acoust., Speech, and Signal Processing, ASSP 34, (1986) pp. 1054–1063.
Article Google Scholar
Linde, Y., Buzo, A., and Gray, R.M., “An Algorithm for Vector Quantizer Design”, IEEE Trans, on Communications, COM-28, (1980) pp.84–95.
Google Scholar
Makhoul, J., Roucos, S., and Gish, H., “Vector Quantization in Speech Coding”, Proc. IEEE, 73, (1985) pp. 1551–1588.
Article Google Scholar
Makhoul, J., “Linear Prediction: A Tutorial Review”, Proc. IEEE, 63, (1975) pp. 561–580.
Article Google Scholar
Marke1, J.D., and Gray Jr., A.H., “Linear Prediction of Speech”, Springer-Verlag, Berlin, Heidelberg, New York 1976.
Google Scholar
Rabiner, L.R., and Schafer, R.W., “Digital Processing of Speech Signals”, Prentice-Hall, Inc., Englewood Cliffs, New Jersey 1978.
Google Scholar
Ramachandran, R.P., and Kabal, P., “Stability and Performance Analysis of Pitch Filters in Speech Coders”, IEEE Trans. Acoust., Speech, and Signal Processing, ASSP-35, (1987) pp. 937–946.
Article Google Scholar
Reininger, H., “Prinzipien der digitalen Sprachcodierung und ihre Anwendung zur Sprachübertragung über Fadingkanäle bei mittleren Datenraten”, Dissertation, Institut für Angewandte Physik, Universität Frankfurt am Main, 1987.
Google Scholar
Reininger, H., and Wolf, D., “Fast Search Algorithms for Speech Coding Schemes Using Vector Quantization”, Signal Processing III: Theories and Applications, North Holland, Amsterdam 1986, pp. 453–456.
Google Scholar
Schroeder, M.R., and Atal, B.S., “Code-Excited Linear Prediction (CELP): High-Quality Speech at Very Low Bit Rates”, Proc. IEEE Int. Conf. Acoust., Speech, and Signal Processing, Tampa 1985, pp. 937–940.
Google Scholar
Wolf, D., “Speech Coding”, Proc. Zurich Seminar on Digital Communications, (1984) pp. 1–5.
Google Scholar
Wolf, D., “Statistical Models of Speech”, NTG-FachBerichte 65, (1978) pp. 1–9.
Google Scholar

Download references

Author information

Authors and Affiliations

Institut für Angewandte Physik, Universität Frankfurt a.M., Robert-Mayer-Straße 2-4, D-6000, Frankfurt a. M., Germany
D. Wolf & H. Reininger

Authors

D. Wolf
View author publications
You can also search for this author in PubMed Google Scholar
H. Reininger
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Universität Erlangen-Nürnberg, Martensstr. 3, D-8520, Erlangen, Germany
H. Niemann & G. Sagerer &
ZT ZTI SYS 5, Siemens AG, Otto-Hahn-Ring 6, D-8000, München 83, Germany
M. Lang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wolf, D., Reininger, H. (1988). Recent Advances in Speech Coding. In: Niemann, H., Lang, M., Sagerer, G. (eds) Recent Advances in Speech Understanding and Dialog Systems. NATO ASI Series, vol 46. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-83476-9_1

Download citation

DOI: https://doi.org/10.1007/978-3-642-83476-9_1
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-83478-3
Online ISBN: 978-3-642-83476-9
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics