Skip to main content

MPEG Audio Compression

  • Chapter
  • First Online:
Fundamentals of Multimedia

Part of the book series: Texts in Computer Science ((TCS))

  • 101k Accesses

Abstract

In this chapter, we consider the set of tools for audio compression applicable to general audio, such as music, as opposed to specifically speech compression developed under the aegis of the Motion Picture Experts Group, MPEG. Surprisingly, this subject has much to do with psychology, specifically within the field of aural sense perception–psychoacoustics. The phenomena of frequency masking and temporal masking are exploited in a waveform coding approach that makes use of a psychoacoustic model of hearing, with the result generally referred to as perceptual coding. We look in some detail at audio compression as it benefits from psychoacoustics, and how this plays out in MPEG-1 Audio Compression (mp3) and later MPEG audio developments: MPEG-2 and 4 including MPEG Advanced Audio Coding (AAC). We begin the study of psychoacoustics as it applies here with the determination of the equal-loudness relations, which leads to a discussion of frequency masking. Critical Bands are introduced as well as the Bark Unit. Temporal Masking is a familiar phenomenon from our own experience. MPEG Audio is introduced to make use of these properties, along with MPEG Audio Layers including MP3. MPEG-2 AAC (Advanced Audio Coding) is considered next and MPEG-4 Audio is also discussed.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 69.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 89.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. D.W. Robinson, R.S. Dadson, A re-determination of the equal-loudness relations for pure tones. British Journal of Applied Physics 7, 166–181 (1956)

    Article  Google Scholar 

  2. H. Fletcher, W.A. Munson, Loudness, its definition, measurement and calculation. Journal of the Acoustical Society of America 5, 82–107 (1933)

    Article  Google Scholar 

  3. T. Painter, A. Spanias, Perceptual coding of digital audio. Proceedings of the IEEE 88(4), 451–513 (2000)

    Article  Google Scholar 

  4. B. Truax, Handbook for Acoustic Ecology, 2nd edn. (Street Publishing, Cambridge, 1999)

    Google Scholar 

  5. D. O’Shaughnessy, Speech Communications: Human and Machine. (IEEE Press, New York, 1999)

    Google Scholar 

  6. A.J.M. Houtsma, Psychophysics and modern digital audio technology. Philips J. Res. 47, 3–14 (1992)

    Google Scholar 

  7. E. Zwicker, U. Tilmann, Psychoacoustics: matching signals to the final receiver. J. Audio Eng. Soc. 39, 115–126 (1991)

    Google Scholar 

  8. D. Lubman, Objective metrics for characterizing automotive interior sound quality. in Inter-Noise ’92, pp. 1067–1072, 1992

    Google Scholar 

  9. D. Pan, A tutorial on MPEG/Audio compression. IEEE Multimedia 2(2), 60–74 (1995)

    Article  Google Scholar 

  10. S. Shlien, Guide to MPEG-1 audio standard. IEEE Trans. Broadcast. 40, 206–218 (1994)

    Article  Google Scholar 

  11. P. Noll, Mpeg digital audio coding. IEEE Signal Process. Mag. 14(5), 59–81 (1997)

    Article  MathSciNet  Google Scholar 

  12. International Standard: ISO/IEC 13818-7. Information technology—Generic coding of moving pictures and associted audio information. in Part 7: Advanced Audio Coding (AAC), 1997

    Google Scholar 

  13. K. Brandenburg, MP3 and AAC explained. in 17th International Conference on High Quality Audio Coding, pp. 1–12 (1999)

    Google Scholar 

  14. International Standard: ISO/IEC 14496-3. Information technology—Coding of audio-visual objects. in Part 3: Audio, 1998

    Google Scholar 

  15. Vorbis audio compression, (2013), http://xiph.org/vorbis/

  16. J. Engdegård, B. Resch, C. Falch, O. Hellmuth, J. Hilpert, A. Hoelzer, L. Terentiev, J. Breebaart, J. Koppens, E. Schuijers, W. Oomen, Spatial Audio Object Coding (SAOC)—The Upcoming MPEG Standard on Parametric Object Based Audi Coding. In Audio Engineering Society 124th Convention, 2008

    Google Scholar 

  17. Information technology—Multimedia content description interface, Part 4: Audio. International Standard: ISO/IEC 15938-4, 2001

    Google Scholar 

  18. A.T. Lindsay, S. Srinivasan, J.P.A. Charlesworth, P.N. Garner, W. Kriechbaum, Representation and linking mechanisms for audio in MPEG-7. Signal Processing: Image Commun. 16, 193–209 (2000)

    Google Scholar 

  19. P. Philippe, Low-level musical descriptors for MPEG-7. Signal Processing: Image Commun. 16, 181–191 (2000)

    Google Scholar 

  20. M.I. Mandel, D.P.W. Ellis, Song-level features and support vector machines for music classification. In: The 6th International Conference on Music Information Retrieval

    Google Scholar 

  21. K. Brandenburg, O. Kunz, A. Sugiyama, MPEG-4 natural audio coding. Signal Processing: Image Commun. 15, 423–444 (2000)

    Google Scholar 

  22. E.D. Scheirer, Structured audio and effects processing in the MPEG-4 multimedia standard. Multimedia Syst. 7, 11–22 (1999)

    Article  Google Scholar 

  23. J.D. Johnston, S.R. Quackenbush, J. Herre, B. Grill, in Multimedia Systems, Standards, and Networks, eds. by A. Puri and T. Chen. Review of MPEG-4 general audio coding, (Marcel Dekker Inc, New York, 2000), pp. 131–155

    Google Scholar 

  24. E.D. Scheirer, Y. Lee, J.-W. Yang, in Multimedia Systems, Standards, and Networks eds. by A. Puri & T. Chen. Synthetic audio and SNHC audio in MPEG-4, (Marcel Dekker Inc, New York, 2000), pp. 157–177

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ze-Nian Li .

Rights and permissions

Reprints and permissions

Copyright information

© 2014 Springer International Publishing Switzerland

About this chapter

Cite this chapter

Li, ZN., Drew, M.S., Liu, J. (2014). MPEG Audio Compression. In: Fundamentals of Multimedia. Texts in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-05290-8_14

Download citation

  • DOI: https://doi.org/10.1007/978-3-319-05290-8_14

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-05289-2

  • Online ISBN: 978-3-319-05290-8

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics