DRA Audio Coding Standard: An Overview
A lossy audio coding or compression algorithm explores statistical redundancy and perceptual irrelevance of an input audio signal, which may include multiple channels, to obtain a compact representation suitable for efficient transmission or storage. Figure 20.1 is a generic architecture designed to achieve this and is the basis for most audio coding algorithms or standards. The following is a brief description of its major components:
Time-Frequency Analysis: Frequently referred to as an analysis filter bank, it transforms each channel of the input audio signal into a set of time-frequency parameters suitable for quantization and encoding so that their statistical redundancy and perceptual irrelevance can be readily exploited. It may come in the form of Fourier transform, discrete cosine transform(DCT), linear prediction, or subband filter banks. Modified discrete cosine transform (MDCT) is a filter bank widely used in audio coding standards.
- Yu-Li You, Weixiong Zhang, Mao Xu, and Subin Zhang, Electronics Industry Standard: Multichannel Digital Audio Coding Technology, SJ/T11368-2006, Ministry of Information Industry, People's Republic of China, 2007.
- J. P. Princen and A. B. Bradley, “Analysis/synthesis filter bank design based on time domain aliasing cancellation,” IEEE Transactions on ASSP, vol. 34, no. 5, pp. 1153–1161, 1986. CrossRef
- T. Painter and A. Spanias, “Perceptual coding of digital audio,” Proceedings of the IEEE, vol. 88, no. 4, pp. 451–513, 2000. CrossRef
- G. Davidson and M. Bosi, “AC-2: high quality audio coding for broadcasting and storage,” 46th Annual Broadcasting Engineering Conference, pp. 98–105, April 1992.
- Dolby Laboratories, Digital Audio Compression Standard A/52B, Advanced Television Systems Committee (ATSC), 2005.
- K. Tsutsui, “ATRAC (adaptive transform acoustic coding) and ATRAC 2,” The Digital Signal Processing Handbook, V. Madisetti and D. Williams, Editors, CRC Press, pp. 43.16– 43.20, 1998.
- J. Johnston, D. Sinha, S. Dorward, and S. Quackenbush, “AT&T perceptual audio coding (PAC),” Collected Papers on Digital Audio Bit-Rate Reduction, pp. 73–81, 1996.
- MPEG, Coding of moving pictures and associated audio for digital storage media at up to about 1.5 Mbit/s — Part 3: Audio, vol. 11172-3, ISO/IEC, 1992.
- MPEG, Information technology: Generic coding of moving pictures and associated audio information Part 7: Advanced Audio Coding (AAC), vol. 13818-7, ISO/IEC, 1997.
- Vorbis I specification, Xiph.org Foundation, 2004.
- Wikipedia, Windows Media Audio, http://en.wikipedia.org/wiki/Windows Media Audio, October 2007.
- J. Herre and J.D. Johnston, “Enhancing the performance of perceptual audio coders by using temporal noise shaping (TNS),” 101st AES Convention, 1996 Reprint #4384.
- International Telecommunication Union, Recommendation ITU-R BS.1116 — Methods for the subjective assessment of small impairments in audio systems, including multichannel sound systems, 1994.
- EBU, Sound Quality Assessment Material Recordings for Subjective Tests, Tech. 3253, April 1988.
- DRA Audio Coding Standard: An Overview
- Book Title
- Mobile Multimedia Broadcasting Standards
- Book Subtitle
- Technology and Practice
- Book Part
- pp 587-606
- Print ISBN
- Online ISBN
- Springer US
- Copyright Holder
- Springer-Verlag US
- Additional Links
- Industry Sectors
- eBook Packages
- Fa-Long Luo (1)
- Editor Affiliations
- 1. Anyka, Inc. & Element CXI
- Author Affiliations
- 2. Guangdong Provincial Key Lab for Digital Audio Technologies, Digital Rise Technology Co. Ltd., South China University of Technology, 6th Floor, Bldg. 2, Science and Tech Park, Guangzhou, Guangdong, China
To view the rest of this content please follow the download PDF link above.