Abstract
As one of the basic types of multimedia data, the audio signals (including speech analysis) have been considered in this chapter. The basic properties of music and speech signals are presented, together with the concepts of human speech production and hearing system. The main focus is on the lossless and lossy audio compression algorithms (MUSICAM; MPEG layer I, II, and III; ASPEC) and masking, based on the psychoacoustic model. Furthermore, the special purpose techniques for audio signals characterization are considered as well: voice activity indicators, word endpoints detector, time-frequency analysis of audio signals, and singular value decomposition of audio signals.
Notes
- 1.
For the signal with spectrum bandwidth B, the sampling frequency is f s=2B if (2f c + B)/2B is an integer (f cis the central frequency).
References
Bosi M, Goldberg RE (2003) Introduction to digital audio coding and standards. Springer, New York
Chu WC (2003) Speech coding algorithms. Wiley, Hoboken
Gibson J, Berger T, Lookabaugh T, Baker R, Lindbergh D (1998) Digital compression for multimedia: principles and standards. Morgan Kaufmann, San Francisco
Hankersson D, Greg AH, Peter DJ (1997) Introduction to information theory and data compression. CRC Press, Boca Raton
Hassanpour H, Mesbah M, Boashash B (2004) Time-frequency feature extraction of newborn EEG seizure using SVD-based techniques. EURASIP J Appl Signal Process 16:2544–2554
Hoeg W, Lauterbach T (2003) Digital audio broadcasting: principles and applications of digital radio. Wiley, Chichester
Kaplan R (1997) Intelligent multimedia systems. Willey, New York
Kovačević B, Milosavljević M, Veinović M, Marković M (2000) Robustna Digitalna Obrada Signala. Akademska misao, Beograd
Maes J, Vercammen M, Baert L (2002) Digital audio technology, 4th edn. In association with Sony, Focal Press
Mataušek M, Batalov V (1980) A new approach to the determination of the glottal waveform. IEEE Trans Acoust Speech Signal Process ASSP-28(6):616–622
Painter T (2000) Perceptual coding of digital audio. Proc IEEE 88(4):451–513
Pan D (1995) A tutorial on MPEG/audio compression. IEEE Multimedia 2(2):60–74
Pohlmann KC (2005) Principles of digital audio. McGraw-Hill, New York
Salomon D, Motta G, Bryant D (2009) Handbook of data compression. Springer, London
Sayood K (2000) Introduction to data compression, 2nd edn. Morgan Kaufmann, San Francisco
Smith MT (1999) Audio engineer’s reference book, 2nd edn. Focal Press, Oxford
Spanias A, Painter T, Atti V (2007) Audio signal processing and coding. Wiley-Interscience, Hoboken
Stanković LJ (1994) A method for time-frequency signal analysis. IEEE Trans Signal Process 42(1):225–229
Stanković S, Orović I (2010) Time-frequency based speech regions characterization and eigenvalue decomposition applied to speech watermarking. EURASIP J Adv Signal Process, Special Issue on Time-Frequency Analysis and its Application to Multimedia signals, Article ID 572748, Pages(s) 10 pages
Steinmetz R, Nahrstedt K (2004) Multimedia systems. Springer-Verlag, Berlin Heidelberg
Vetterli M, Kovačević J (1995) Wavelets and subband coding. Prentice-Hall, Englewood Cliffs
Watkinson J (2001) The art of digital audio, 3rd edn. Focal Press, London
Watkinson J (2001) The MPEG handbook. Focal Press, Oxford
Wong DY, Markel JD, Gray AH (1979) Least squares glottal inverse filtering from the acoustic speech waveform. IEEE Trans Acoust Speech Signal Process ASSP-27(4):350–355
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Stanković, S., Orović, I., Sejdić, E. (2012). Digital Audio. In: Multimedia Signals and Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-4208-0_2
Download citation
DOI: https://doi.org/10.1007/978-1-4614-4208-0_2
Published:
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4614-4207-3
Online ISBN: 978-1-4614-4208-0
eBook Packages: Computer ScienceComputer Science (R0)