Abstract
MPEG Audio was the first international standard for high quality audio coding and it opened the doors to a variety of applications in the world of digital music. In this chapter we review the basic ideas and features behind the general purpose, perceptual audio coders specified in the MPEG-1 and MPEG-2 audio standards which include the MP3 and AAC formats. The widely successful MP3 and AAC coders represent some of the most remarkable achievements of the MPEG committee that highly influenced not only the technology but also largely enabled different ways of digital media consumption.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Notes
- 1.
Although this notation may suggest that we are blending the input signals in the time domain, this approach is usually carried out in the frequency domain over a frequency range in which the power of the spectral lines is similar in the two channels. This is because there is not much correlation between stereo channels in the time domain. In general, stereo redundancies can be more easily exploited for systems with high frequency resolution.
References
Schroeder M R, Atal B S, Hall JL (1979), Optimizing Digital Speech Coders by Exploiting Masking Properties of the Human Ear. J Acoust Soc Am, 66:1647–1652
ISO/IEC 11172–3 (1993), Information Technology, Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s, Part 3: Audio
ITU-R BS.1115 (1994), Low Bitrate Audio Coding. ITU, Geneva
ISO/IEC 13818–3 (1994–1997), Information Technology - Generic Coding of Moving Pictures and Associated Audio, Part 3: Audio
Nussbaumer H J (1981), Pseudo-QMF Filter Bank”. IBM Tech Disclosure Bull, 24: 3081–3087
Rothweiler J H (1983), Polyphase Quadrature Filters - A new Subband Coding Technique. International Conference IEEE ASSP, Boston, 1280–1283
Princen J P, Johnson A, Bradley A B (1987), Subband/Transform Coding Using Filter Bank Designs Based on Time Domain Aliasing Cancellation. Proc of the ICASSP, 2161–2164
Soulodre G A, Grusec T, Lavoie M, Thibault L (1998), Subjective Evaluation of State-of-the Art Two-Channel Audio Codecs. J Audio Eng Soc, 46:164–177
ISO/IEC JTC 1/SC 29/WG 11 N7950 (2006), Performance of MPEG Surround Technology
J. D. Johnston and A. J. Ferreira (1992), Sum-Difference Stereo Transform Coding, Proc. ICASSP, pp. 569–571
vd Waal R G and Veldhuis R N J (1991), Subband Coding of Stereophonic Digital Audio Signals, Proc. ICASSP, pp. 3601 – 3604
Davies M (1993), The AC-3 Multichannel Coder, presented at the 95th AES Convention, New York, pre-print 3774
Thiede T, Treurniet W, Bitto R, Schmidmer C, Sporer T, Beerends J, Colomes C, Keyhl M, Stoll G, Brandenburg K, Feiten B (2000), PEAQ-The ITU Standard for Objective Measurement of Perceived Audio Quality. J Audio Eng Soc, 48:3–29
Bosi M, Goldberg R E (2003), Introduction to Digital Audio Coding and Standards. Springer, New York
Malvar H S (1990), Lapped transforms for efficient transform/sub-band coding. IEEE Transactions on Acoustics Speech and Signal Processing, 38:969 – 978
Fielder L, Bosi M, Davidson G, Davis M, Todd C, Vernon S (1996), AC-2 and AC-3: Low-Complexity Transform-Based Audio Coding. Collected Papers on Digital Audio Bit-Rate Reduction, Neil Gilchrist and Christer Grewin (ed), AES 54–72
Edler B (1989), Coding of Audio Signals with Overlapping Transform and Adaptive Window Shape (in German), Frequenz, 43:252–256
Bosi M, Brandenburg K, Quackenbush S, Fielder L, Akagiri K, Fuchs H, Dietz M, Herre J, Davidson G, Oikawa Y (1997), ISO/IEC MPEG-2 Advanced Audio Coding. J Audio Eng Soc, 45:789 – 812
Edler B (1992), Aliasing reduction in sub-bands of cascaded filter banks with decimation. Electronics Letters, 28:1104–1105
E. Zwicker E, Fastl H (1990), Psychoacoustics: Facts and Models. Springer-Verlag, Berlin
Hellman R (1972), Asymmetry of Masking Between Noise and Tone. Percep Psychphys, 11:241–246
Fletcher H (1940), Auditory Patterns. Rev Mod Phys, 12:47–55
EBU (1988), Tech 3253 - Sound Quality Assessment Material (SQAM). Tech Rep, European Broadcasting Union
K. Brandenburg K, Johnston JD (1990), Second Generation Perceptual Audio Coding: The Hybrid Coder. 88th AES Convention, Montreux
Johnston J D (1988), Estimation of Perceptual Entropy Using Noise Masking Criteria. Proc ICASSP, 2524–2527
Blauert J (1983), Spatial Hearing. MIT Press, Cambridge
Dehéry Y F, Stoll G, Kerkhof L vd (1991), MUSICAM Source Coding for Digital Sound. Symp Rec Broadcast Sessions, 612–617
Brandenburg K, Herre J, Johnston J D, Mahieux Y, Schroeder E F (1991), ASPEC-Adaptive Spectral Perceptual Entropy Coding of High Quality Music Signals. 90th AES Convention, 3011
Ryden T, Grewin C and Bergman S (1991), The SR report on the MPEG audio subjective listening tests in Stockholm April/May 1991, ISO/IEC JTC1/SC29/WG 11 MPEG 91/010
ITU-R BS.775-1 (1992–1994), Multichannel Stereophonic Sound System with and without Accompanying Picture
ten Kate W, Boers P, Maekivirta A, Kuusama J, Christensen K E, Soerensen E (1992), Matrixing of Bit-Rate Reduced Signals. Proc ICASSP, 2:205–208
Stoll G (1996), ISO-MPEG-2 Audio: A Generic Standard for the Coding of Two-Channel and Multichannel Sound. Gielchrist, Grewin (ed), Collected Papers on Digital Audio Bit-Rate Reduction, 43–53, AES 1996
ISO/IEC 13818–7 (1997), Information Technology - Generic Coding of Moving Pictures and Associated Audio, Part 7: Advanced Audio Coding
ISO/IEC JTC 1/SC 29/WG 11 N1420 (1996), Overview of the Report on the Formal Subjective Listening Tests of MPEG-2 AAC Multichannel Audio Coding
ISO/IEC 14496–3 (1999–2001), Information Technology – Coding of Audio Visual Objects, Part 3: Audio
Herre J, Johnston J D (1996), Enhancing the Performance of Perceptual Audio Coders by Using Temporal Noise Shaping (TNS). 101st AES Convention, 4384
Sinha D, Johnston J D, S. Dorward S, Quackenbush S R (1998), The Perceptual Audio Coder (PAC). The Digital Signal Processing Handbook, Madisetti, Williams (ed), CRC Press, 42.1-42.18
Herre J, Schulz D (1998), Extending the MPEG-4 AAC Codec by Perceptual Noise Substitution. 112th AES Convention, 4720
Dietz M, Liljeryd L, Kjoerling K, Kunz O (2002), Spectral Band Replication, a novel approach in audio coding. 112th AES Convention, 5553
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2012 Springer Science+Business Media, LLC
About this chapter
Cite this chapter
Bosi, M. (2012). MPEG Audio Compression Basics. In: Chiariglione, L. (eds) The MPEG Representation of Digital Media. Springer, New York, NY. https://doi.org/10.1007/978-1-4419-6184-6_6
Download citation
DOI: https://doi.org/10.1007/978-1-4419-6184-6_6
Published:
Publisher Name: Springer, New York, NY
Print ISBN: 978-1-4419-6183-9
Online ISBN: 978-1-4419-6184-6
eBook Packages: EngineeringEngineering (R0)