Abstract
In this chapter, we consider the set of tools for compressing general audio, such as music, developed under the aegis of the Moving Picture Experts Group (MPEG), as opposed to tools aimed specifically at speech compression. Perhaps surprisingly, this subject has much to do with psychology, specifically with psychoacoustics, the study of aural sense perception. The phenomena of frequency masking and temporal masking are exploited in a waveform coding approach that makes use of a psychoacoustic model of hearing; the result is generally referred to as perceptual coding. We look in some detail at how audio compression benefits from psychoacoustics, and at how this plays out in MPEG-1 Audio compression (including MP3) and in later MPEG audio developments: MPEG-2 and MPEG-4, including MPEG Advanced Audio Coding (AAC). We begin the study of psychoacoustics with the determination of the equal-loudness relations, which leads to a discussion of frequency masking. Critical bands are introduced, as well as the Bark unit. Temporal masking is a familiar phenomenon from our own experience. MPEG Audio is then introduced as making use of these properties, along with the MPEG Audio layers, including MP3. MPEG-2 AAC is considered next, and MPEG-4 Audio is also discussed.
© 2014 Springer International Publishing Switzerland
About this chapter
Cite this chapter
Li, ZN., Drew, M.S., Liu, J. (2014). MPEG Audio Compression. In: Fundamentals of Multimedia. Texts in Computer Science. Springer, Cham. https://doi.org/10.1007/978-3-319-05290-8_14
DOI: https://doi.org/10.1007/978-3-319-05290-8_14
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-05289-2
Online ISBN: 978-3-319-05290-8
eBook Packages: Computer Science (R0)