Abstract
The MPEG committee has recently completed development of a new audio coding standard, the MPEG-4 Advanced Audio Coding-Enhanced Low Delay (AAC-ELD). State-of-the-art MPEG audio coding standards, such as MPEG-4 AAC Low Complexity (AAC-LC), High Efficiency AAC (HE-AAC), and AAC Low Delay (AAC-LD), utilize the time-to-frequency transformation of an audio block and vice versa, the well-known time domain aliasing cancellation modified discrete cosine transform (TDAC-MDCT). In order to achieve low algorithmic delay, the AAC-ELD has adopted a perfect reconstruction low delay cosine-modulated filter bank, called the low delay MDCT (LD-MDCT). Although the use of LD-MDCT substantially reduces the algorithmic delays, the transform operations in the AAC-ELD codec are still computationally intensive and the LD-MDCT filter banks need to have fast algorithms. Therefore, this chapter is concentrated on the analysis/synthesis LD-MDCT filter banks used the AAC-ELD codec and mainly on their efficient implementations. This chapter presents: Definitions of the analysis/synthesis LD-MDCT (and TDAC-MDCT) filter banks, general symmetry properties of LD-MDCT transforms both in the time and frequency domains, relations between the LD-MDCT and TDAC-MDCT transforms in the analytical forms as well as in the equivalent matrix representations, and efficient implementations of the even-length analysis/synthesis LD-MDCT filter banks. For each fast LD-MDCT algorithm the complete formulae are derived. All the fast even-length LD-MDCT algorithms are investigated and compared in terms of arithmetic complexity and structural simplicity.
References
Information Technology – Coding of Audio-Visual Objects – Part 3: Audio, Amendment 9: Enhanced Low Delay AAC, ISO/IEC 14496-3:2005/FDAM 9:2007(E), N9499, Shenzhen, October 2007
MPEG-4 AAC-LC and HE-AAC Audio Coding Standards
Information Technology – Coding of Audio-Visual Objects – Part 3: Audio, Subpart 4: General Audio Coding (GA)-AAC, TwinVQ, BSAC, ISO/IEC 14496–3:2005(E) (2005)
M. Wolters, K. Kjörling, D. Homm, H. Purnhagen, A closer look into MPEG-4 high efficiency AAC, in 115th AES Convention, New York, NY, October 2003. Preprint #5871
MPEG-4 AAC-LD Audio Coding Standard
E. Allamanche, R. Geiger, J. Herre, T. Sporer, MPEG-4 low delay audio coding based on the AAC codec, in 106th AES Convention, Munich, May 1999. Preprint #4929
J. Hilpert et al., Real-time implementation of the MPEG-4 low delay advanced audio coding algorithm (AAC-LD) on Motorola DSP56300, in 108th AES Convention, Paris, February 2000. Preprint #5081
M. Lutzky, M. Schnell, M. Schmidt, R. Geiger, Structural analysis of low latency audio coding schemes, in 119th AES Convention, New York, NY, October 2005. Preprint #6601
MPEG-4 AAC-ELD Audio Coding Standard
M. Lutzky, M.L. Valero, M. Schnell, J. Hilpert, AAC-ELD v2 – the new state of the art in high quality communication audio coding, in 131st AES Convention, New York, NY, October 2011. Preprint #8516
M. Schnell et al., Enhanced MPEG-4 low delay AAC – low bitrate high quality communication, in 122nd AES Convention, Vienna, May 2007. Preprint #6998
M. Schnell et al., Low delay filter banks for enhanced low delay audio coding, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics, New Paltz, NY, October 2007, pp. 235–238
M. Schnell et al., MPEG-4 enhanced low delay AAC – a new standard for high quality communication, in 125th AES Convention, San Francisco, CA, October 2008. Preprint #7503
M.L. Valero et al., A new parametric stereo and multichannel extension for MPEG-4 enhanced low delay AAC (AAC-ELD), in 128th AES Convention, London, May 2010. Preprint #8099
Efficient Implementations of LD-MDCT Filter Banks
V. Britanak, New fast algorithms for the low delay MDCT computation in the MPEG-4 AAC enhanced low delay audio coding standard. Signal Process. 105(12), 410–418 (2014)
R.K. Chivukula, Y.A. Reznik, V. Devarajan, Efficient algorithms for MPEG-4 AAC-ELD, AAC-LD and AAC-LC filter banks, in Proceedings of the IEEE International Conference on Audio, Language and Image Processing, (ICALIP’2008), Shanghai, July 2008, pp. 1629–1634
R.K. Chivukula, Y.A. Reznik, Y. Hu, V. Devarajan, Fast algorithms for low-delay TDAC filter banks in MPEG-4 AAC-ELD. IEEE Trans. Audio Speech Lang. Process. 22(12), 1701–1712 (2014)
Z.G. Gui, Y. Ge, D.Y. Zhang, J.S. Wu, Generalized fast mixed-radix algorithm for the computation of forward and inverse MDCTs. Signal Process. 92(2), 363–373 (2012)
J. Wu, H. Shu, L. Senhadji, L. Luo, Mixed-radix algorithm for the computation of forward and inverse MDCTs. IEEE Trans. Circuits Syst. I: Regul. Pap. 56(4), 784–794 (2009)
Existing Audio Broadcasting and Speech Communication Codecs
3GGP2 C.S0014-C v1.0, Enhanced variable rate codec, speech service Option 3, 68 and 70 for wide-band spread spectrum digital systems, 2007
R.K. Chivukula, Y.A. Reznik, Efficient implementation of a class of MDCT/IMDCT filterbanks for speech and audio coding, in Proceedings of the IEEE ICASSP’2008, Las Vegas, NV, March–April 2008, pp. 213–216
Digital Radio Mondiale (DRM): System Specification, ETSI ES 201 980 v3.1.1 (2009–08), ETSI Standard, August 2009 (available at website www.drm.org)
W. Hoeg, T. Lauterbach (eds.), Digital Audio Broadcasting: Principles and Applications of DAB, DAB+ and DMB, chap. 3, 3rd edn. (Wiley, Chichester, 2009), pp. 93–165
ITU-T Recommendation G.722.1 Annex C, Low-complexity coding at 24 and 32 kbit/s for hands-free operation in systems with low frame loss, Annex C: 14 kHz at 24, 32 and 48 kbit/s, May 2005
ITU-T SG16 Q9 – Contribution 199: extended high-level description of the Q9 EV-VBR baseline codec, June 2007
L. Laaksonen et al., Super wide–band extension of G.718 and G.729.1 speech codec, in Proceedings of the 11th Annual Conference of the International Speech Communication Association, INTERSPEECH 2010, Makuhari, September 2010
T. Vaillancourt et al., ITU-T EV-VBR: a robust 8–32 kbit/s scalable coder for error prone telecommunication channels, in Proceedings of the 16th European Signal Processing Conference, Lausanne, August 2008
M. Xie, D. Lindbergh, P. Chu, From ITU-T G.722.1 to ITU-T G.722.1 Annex C: a new low-complexity 14 kHz bandwidth audio coding standard, in Proceedings of the IEEE ICASSP’2006, vol. 5, Toulouse, May 2006, pp. 173–176, also published in J. Multimed. 2(2), 65–76 (2007)
M. Xie, P. Chu, A. Taleb, M. Briand, ITU-T G.719: a new low-complexity full-band (20 kHz) audio coding standard for high quality conversational applications, in Proceedings of the IEEE Workshop on Applications of Signal Processing to Audio and Acoustics (WASPAA’2009), New Paltz, NY, October 2009, pp. 265–268
Supporting Literature
V. Britanak, Survey of efficient MDCT implementations in MP3 audio coding standard: retrospective and state-of-the-art. Signal Process. 91(4), 1379–1394 (2011)
V. Britanak, H.J. Lincklaen Arriëns, Fast computational structures for an efficient implementation of the complete TDAC analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(7), 1379–1394 (2009)
V. Britanak, P. Yip, K.R. Rao, Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations (Academic Press Inc., Elsevier Science, Amsterdam, 2007)
R. Gluth, Regular FFT-related transform kernels for DCT/DST-based polyphase filter banks, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 2205–2208
R. Gluth, A unified approach to transform-based FIR filter banks with special regard to perfect reconstruction systems, in Proceedings of the IEEE ICASSP’93, vol. III, Minneaopolis, MN, April 1993, pp. 157–160
C.W. Kok, Fast algorithm for computing discrete cosine transform. IEEE Trans. Signal Process. 45(3), 757–760 (1997)
H.S. Malvar, Signal Processing with Lapped Transforms, chap. 2 (Artech House, Norwood, MA, 1992), pp. 71–75
J.P. Princen, A.W. Johnson, A.B. Bradley, Subband/transform coding using filter bank designs based on time domain aliasing cancellation, in Proceedings of the IEEE ICASSP’87, Dallas, TX, April 1987, pp. 2161–2164
G.D.T. Schuller, M.J.T. Smith, New framework for modulated perfect reconstruction filter banks. IEEE Trans. Signal Process. 44(8), 1941–1954 (1996)
G.D.T. Schuller, T. Karp, Modulated filter banks with arbitrary system delay: efficient implementations and time-varying case. IEEE Trans. Signal Process. 48(3), 737–748 (2000)
Author information
Authors and Affiliations
Rights and permissions
Copyright information
© 2018 Springer International Publishing AG
About this chapter
Cite this chapter
Britanak, V., Rao, K.R. (2018). Efficient Implementations of Perfect Reconstruction Low Delay Cosine-Modulated Filter Banks in the MPEG-4 AAC-ELD. In: Cosine-/Sine-Modulated Filter Banks. Springer, Cham. https://doi.org/10.1007/978-3-319-61080-1_8
Download citation
DOI: https://doi.org/10.1007/978-3-319-61080-1_8
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-61078-8
Online ISBN: 978-3-319-61080-1
eBook Packages: EngineeringEngineering (R0)