DRA Audio Coding Standard: An Overview

  • Yu-Li You
  • Wenhua Ma

20.1 Introduction

A lossy audio coding or compression algorithm explores statistical redundancy and perceptual irrelevance of an input audio signal, which may include multiple channels, to obtain a compact representation suitable for efficient transmission or storage. Figure 20.1 is a generic architecture designed to achieve this and is the basis for most audio coding algorithms or standards. The following is a brief description of its major components:
  • Time-Frequency Analysis: Frequently referred to as an analysis filter bank, it transforms each channel of the input audio signal into a set of time-frequency parameters suitable for quantization and encoding so that their statistical redundancy and perceptual irrelevance can be readily exploited. It may come in the form of Fourier transform, discrete cosine transform(DCT), linear prediction, or subband filter banks. Modified discrete cosine transform (MDCT) is a filter bank widely used in audio coding standards.

  • Joint Channel Coding:...


Filter Bank Audio Signal Window Function Current Frame Quantization Noise 
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.


  1. 1.
    Yu-Li You, Weixiong Zhang, Mao Xu, and Subin Zhang, Electronics Industry Standard: Multichannel Digital Audio Coding Technology, SJ/T11368-2006, Ministry of Information Industry, People's Republic of China, 2007.Google Scholar
  2. 2.
    J. P. Princen and A. B. Bradley, “Analysis/synthesis filter bank design based on time domain aliasing cancellation,” IEEE Transactions on ASSP, vol. 34, no. 5, pp. 1153–1161, 1986.CrossRefGoogle Scholar
  3. 3.
    T. Painter and A. Spanias, “Perceptual coding of digital audio,” Proceedings of the IEEE, vol. 88, no. 4, pp. 451–513, 2000.CrossRefGoogle Scholar
  4. 4.
    G. Davidson and M. Bosi, “AC-2: high quality audio coding for broadcasting and storage,” 46th Annual Broadcasting Engineering Conference, pp. 98–105, April 1992.Google Scholar
  5. 5.
    Dolby Laboratories, Digital Audio Compression Standard A/52B, Advanced Television Systems Committee (ATSC), 2005.Google Scholar
  6. 6.
    K. Tsutsui, “ATRAC (adaptive transform acoustic coding) and ATRAC 2,” The Digital Signal Processing Handbook, V. Madisetti and D. Williams, Editors, CRC Press, pp. 43.16– 43.20, 1998.Google Scholar
  7. 7.
    J. Johnston, D. Sinha, S. Dorward, and S. Quackenbush, “AT&T perceptual audio coding (PAC),” Collected Papers on Digital Audio Bit-Rate Reduction, pp. 73–81, 1996.Google Scholar
  8. 8.
    MPEG, Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s — Part 3: Audio, vol. 11172-3, ISO/IEC, 1992.Google Scholar
  9. 9.
    MPEG, Information technology: Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC), vol. 13818-7, ISO/IEC, 1997.Google Scholar
  10. 10.
    Vorbis I specification, Foundation, 2004.Google Scholar
  11. 11.
    Wikipedia, Windows Media Audio, Media Audio, October 2007.
  12. 12.
    J. Herre and J.D. Johnston, “Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS),” 101st AES Convention, 1996 Reprint #4384.Google Scholar
  13. 13.
    International Telecommunication Union, Recommendation ITU-R BS.1116 — Methods for the subjective assessment of small impairments in audio systems, including multichannel sound systems, 1994.Google Scholar
  14. 14.
    EBU, Sound Quality Assessment Material Recordings for Subjective Tests, Tech. 3253, April 1988.Google Scholar

Copyright information

© Springer Science+Business Media, LLC 2009

Authors and Affiliations

  • Yu-Li You
    • 1
  • Wenhua Ma
    • 1
  1. 1.Guangdong Provincial Key Lab for Digital Audio TechnologiesDigital Rise Technology Co. Ltd., South China University of TechnologyGuangzhouChina

Personalised recommendations