Digital Audio

Multimedia Signals and Systems

Abstract

This chapter considers audio signals, including speech, as one of the basic types of multimedia data. The basic properties of music and speech signals are presented, together with the concepts of human speech production and the human hearing system. The main focus is on lossless and lossy audio compression algorithms (MUSICAM; MPEG layers I, II, and III; ASPEC) and on masking, based on the psychoacoustic model. Special-purpose techniques for characterizing audio signals are considered as well: voice activity indicators, a word endpoint detector, time-frequency analysis of audio signals, and singular value decomposition of audio signals.

Notes

  1. For a signal with spectrum bandwidth B, the sampling frequency is f_s = 2B if (2f_c + B)/(2B) is an integer (f_c is the central frequency).
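The condition in the note above can be checked numerically. The following is a minimal sketch, not taken from the chapter: the function name `bandpass_rate` and the example frequencies are assumptions chosen for illustration. It returns 2B when (2f_c + B)/(2B) is an integer and `None` otherwise, since the note does not state which rate applies in that case.

```python
from fractions import Fraction


def bandpass_rate(f_c, bandwidth):
    """Return the sampling frequency 2*B if (2*f_c + B) / (2*B) is an integer.

    f_c is the central frequency and bandwidth is B, both in the same unit.
    Returns None when the ratio is not an integer (the note gives no rate
    for that case). The name and interface are hypothetical.
    """
    fc = Fraction(f_c)
    b = Fraction(bandwidth)
    if b <= 0:
        raise ValueError("bandwidth must be positive")
    if fc < b / 2:
        raise ValueError("band would extend below zero frequency")
    # Exact rational arithmetic avoids floating-point rounding in the test.
    ratio = (2 * fc + b) / (2 * b)
    return 2 * bandwidth if ratio.denominator == 1 else None


# Illustrative values (assumed, in Hz):
print(bandpass_rate(25_000, 10_000))  # ratio = 60000 / 20000 = 3
print(bandpass_rate(22_000, 10_000))  # ratio = 54000 / 20000 = 2.7
```

For a baseband signal, f_c = B/2, so the ratio equals 1 and the condition reduces to the familiar f_s = 2B.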

Corresponding author

Correspondence to Srdjan Stanković.

Copyright information

© 2012 Springer Science+Business Media, LLC

About this chapter

Cite this chapter

Stanković, S., Orović, I., Sejdić, E. (2012). Digital Audio. In: Multimedia Signals and Systems. Springer, Boston, MA. https://doi.org/10.1007/978-1-4614-4208-0_2

  • DOI: https://doi.org/10.1007/978-1-4614-4208-0_2

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4614-4207-3

  • Online ISBN: 978-1-4614-4208-0

  • eBook Packages: Computer Science (R0)
