Digital Audio

Stanković, Srdjan; Orović, Irena; Sejdić, Ervin

doi:10.1007/978-1-4614-4208-0_2

Abstract

This chapter considers audio signals, including speech, as one of the basic types of multimedia data. It presents the basic properties of music and speech signals, together with the concepts of human speech production and the human hearing system. The main focus is on lossless and lossy audio compression algorithms (MUSICAM; MPEG Layers I, II, and III; ASPEC) and on masking based on the psychoacoustic model. Special-purpose techniques for characterizing audio signals are also considered: voice activity indicators, word endpoint detectors, time-frequency analysis of audio signals, and singular value decomposition of audio signals.

Notes

1. For a signal with spectrum bandwidth B, the sampling frequency is f_s = 2B if (2f_c + B)/(2B) is an integer (f_c is the central frequency).
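The condition in the note can be checked numerically. The following is a minimal Python sketch, not taken from the chapter: the function names are hypothetical, the signal is assumed to occupy the band from f_c - B/2 to f_c + B/2, and the fallback rate for the non-integer case uses the standard bandpass sampling result 2*f_high/floor(f_high/B), which the note itself does not state.

```python
from fractions import Fraction


def band_ratio(f_c, B):
    """Return (2*f_c + B) / (2*B) exactly, i.e. f_high / B."""
    f_c, B = Fraction(f_c), Fraction(B)
    return (2 * f_c + B) / (2 * B)


def can_sample_at_2B(f_c, B):
    """True when the note's condition holds: the ratio is an integer."""
    return band_ratio(f_c, B).denominator == 1


def min_bandpass_rate(f_c, B):
    """Lowest alias-free uniform sampling rate for the band (assumed
    standard bandpass sampling result, not stated in the note)."""
    f_c, B = Fraction(f_c), Fraction(B)
    f_high = f_c + B / 2
    if f_high < B:
        raise ValueError("band must lie at non-negative frequencies")
    n = f_high // B  # number of whole bandwidths that fit below f_high
    return 2 * f_high / n


if __name__ == "__main__":
    # Band 20..30 (f_c = 25, B = 10): ratio is 3, so f_s = 2B suffices.
    print(can_sample_at_2B(25, 10), min_bandpass_rate(25, 10))
    # Band 17..27 (f_c = 22, B = 10): ratio is 2.7, so f_s must exceed 2B.
    print(can_sample_at_2B(22, 10), min_bandpass_rate(22, 10))
```

Exact fractions are used so that the integer test is not affected by floating-point rounding; frequencies can be given in any unit as long as f_c and B share it.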

Author information

Authors and Affiliations

University of Montenegro, Džordža Vašingtona bb, Podgorica, Montenegro
Srdjan Stanković & Irena Orović
University of Pittsburgh, Benedum Hall, Pittsburgh, PA, USA
Ervin Sejdić

Corresponding author

Correspondence to Srdjan Stanković.

About this chapter

Cite this chapter

Stanković, S., Orović, I., Sejdić, E. (2012). Digital Audio. In: Multimedia Signals and Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-4208-0_2

DOI: https://doi.org/10.1007/978-1-4614-4208-0_2
Published: 17 July 2012
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4614-4207-3
Online ISBN: 978-1-4614-4208-0
eBook Packages: Computer Science, Computer Science (R0)