Advertisement

MDCT/MDST, MLT, ELT, and MCLT Filter Banks: Definitions, General Properties, and Matrix Representations

  • Vladimir Britanak
  • K. R. Rao
Chapter

Abstract

The perfect reconstruction cosine/sine-modulated filter banks belonging to the class of modulated filter banks have been studied extensively due to their attractive features (simple structure, analysis and synthesis filters are of equal length, low computational complexity), and consequently, they have received a great interest in audio coding applications. In fact, they are employed in the international speech and audio coding standards and proprietary audio compression algorithms. The oddly and evenly stacked modified discrete cosine transform (MDCT) and the corresponding modified discrete sine transform (MDST), the modulated lapped transform (MLT), the extended lapped transforms (ELTs), and their biorthogonal versions are real-valued cosine/sine-modulated filter banks satisfying the perfect reconstruction property. The modulated complex lapped transform (MCLT) is the complex-valued filter bank whose real part is the MLT or equivalently, the oddly stacked MDCT, and the imaginary part is the oddly stacked MDST. In this chapter, definitions, general properties, and matrix representations of the MDCT/MDST, MLT, ELT, and MCLT filter banks are presented. In order to an analysis/synthesis filter bank be perfect reconstruction, the necessary and sufficient conditions imposed on the analysis and synthesis windowing functions play an important role. Therefore, additionally the windowing procedure and perfect reconstruction (biorthogonal) conditions in the case of identical and (nonidentical) analysis and synthesis windowing functions, design of a windowing function including definitions of commonly windowing functions used in audio coding applications, adaptive switching of transform block sizes and windowing functions, and general perfect reconstruction conditions for the ELT filter bank with multiple overlapping factor both for the orthogonal and biorthogonal cases are derived and/or discussed in detail.

References

  1. 1.
    R. Gluth, A unified approach to transform-based FIR filter banks with special regard to perfect reconstruction, in Proceedings of the IEEE ICASSP’93, Minneapolis, MN, vol. III, April 1993, pp. 157–160Google Scholar
  2. 2.
    R. Gluth, U. Heute, Analysis/synthesis filter banks based on generalized sinusiodal transforms with an application to speech coding, in Proceedings of the 6th European Signal Processing Conference (EUSIPCO’92), vol. 1, Brussels, August 1992, pp. 215–218Google Scholar
  3. 3.
    R.A. Gopinath, Modulated filter banks and wavelets – a general unified theory, in Proceedings of the IEEE ICASSP’96, Atlanta, GA, May 1996, pp. 1585–1588Google Scholar
  4. 4.
    R.A. Gopinath, C.S. Burrus, Theory of modulated filter banks and modulated wavelet tight frames, in Proceedings of the IEEE ICASSP’93, Minneapolis, MN, vol. III, April 1993, pp. 169–172Google Scholar
  5. 5.
    R.D. Koilpillai, P.P. Vaidyanathan, New results on cosine-modulated FIR filter banks satisfying perfect reconstruction, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 1793–1796Google Scholar
  6. 6.
    R.D. Koilpillai, P.P. Vaidyanathan, Cosine-modulated FIR filter banks satisfying perfect reconstruction. IEEE Trans. Signal Process. 40(4), 770–783 (1992)Google Scholar
  7. 7.
    Y.-P. Lin, P.P. Vaidyanathan, Linear phase cosine modulated maximally decimated filter banks with perfect reconstruction. IEEE Trans. Signal Process. 42(11), 2525–2539 (1995)Google Scholar
  8. 8.
    J. Mau, Perfect reconstruction modulated filter banks, in Proceedings of the IEEE ICASSP’92, vol. IV, San Francisco, CA, April 1992, pp. 273–276Google Scholar
  9. 9.
    J. Mau, Perfect reconstruction modulated filter banks: fast algorithms and attractive new properties, in Proceedings of the IEEE ICASSP’93, Minneapolis, MN, vol. III, April 1993, pp. 225–228Google Scholar
  10. 10.
    J. Mau, Regular M-band modulated orthogonal transforms, in Proceedings of the IEEE ICASSP’93, vol. III, Adelaide, April 1994, pp. 125–128Google Scholar
  11. 11.
    T.Q. Nguyen, R.D. Koilpillai, The theory and design of arbitrary-length cosine-modulated filter banks and wavelets, satisfying perfect reconstruction. IEEE Trans. Signal Process. 44(3), 473–483 (1996)Google Scholar
  12. 12.
    T.A. Ramstad, J.P. Tanem, Cosine-modulated analysis-synthesis filterbank with critical sampling and perfect reconstruction, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 1789–1792Google Scholar
  13. 13.
    G.D. Schuller, M.J. Smith, New framework for modulated perfect reconstruction filter banks. IEEE Trans. Signal Process. 44(8), 1941–1954 (1996)Google Scholar
  14. 14.
    M. Vetterli, D. Le Gall, Perfect Reconstruction FIR filter banks: some properties and factorizations. IEEE Trans. Acoust. Speech Signal Process. 37(7), 1057–1071 (1989)Google Scholar

Evenly Stacked MDCT/MDST Analysis and Synthesis Filter Banks

  1. 15.
    V. Britanak, A note on the MDCT/MDST and pseudoinverse matrix. Comput. Inform. 23(3), 205–214 (2004)MathSciNetzbMATHGoogle Scholar
  2. 16.
    V. Britanak, K.R. Rao, A unified fast MDCT/MDST computation in the evenly stacked analysis/synthesis system. Circuits, Syst. Signal Process. 21(4), 415–426 (2002)MathSciNetzbMATHGoogle Scholar
  3. 17.
    T.D. Lookabaugh, M.G. Perkins, Application of the Princen–Bradley filter bank to speech and image compression. IEEE Trans. Acoust. Speech Signal Process. ASSP-38(11), 1914–1926 (1990)Google Scholar
  4. 18.
    J.P. Princen, A.B. Bradley, Analysis/synthesis filter bank design based on time domain aliasing cancellation. IEEE Trans. Acoust. Speech Signal Process. ASSP-34(5), 1153–1161 (1986)Google Scholar

Oddly Stacked MDCT/MDST Analysis and Synthesis Filter Banks

  1. 19.
    V. Britanak, An efficient computing of oddly stacked MDCT/MDST computation via evenly stacked MDCT/MDST and vice versa. Signal Process. 85(7), 1353–1374 (2005)zbMATHGoogle Scholar
  2. 20.
    V. Britanak, H.J.L. Arriëns, Fast computational structures for an efficient implementation of the complete TDAC analysis/synthesis MDCT/MDST filter banks. Signal Process. 89(7), 1379–1394 (2009)zbMATHGoogle Scholar
  3. 21.
    V. Britanak, K.R. Rao, A new fast algorithm for the unified forward and inverse MDCT/MDST computation. Signal Process. 82(3), 433–459 (2002)zbMATHGoogle Scholar
  4. 22.
    S. Cramer, R. Gluth, Computationally efficient real-valued filter banks based on a modified O 2 DFT, in Proceedings of EUSIPCO’90, Signal Processing V: Theories and Applications (Elsevier Science Publishers B.V., Barcelona, 1990), pp. 585–588Google Scholar
  5. 23.
    A.W. Johnson, A.B. Bradley, Adaptive transform coding incorporating time domain aliasing cancellation. Speech Commun. 6(4), 299–308 (1987)Google Scholar
  6. 24.
    J.P. Princen, A.W. Johnson, A.B. Bradley, Sub-band/transform coding using filter bank designs based on time domain aliasing cancellation, in Proceedings of the IEEE ICASSP’87, Dallas, TX, April 1987, pp. 2161–2164Google Scholar
  7. 25.
    K. Suresh, T.V. Sreenivas, Direct MDCT domain psychoacoustic modeling, in Proceedings of the IEEE International Symposium on Signal Processing and Information Technology (ISSPIT’2007), Cairo, December 2007, pp. 742–747Google Scholar
  8. 26.
    K. Suresh, T.V. Sreenivas, Linear filtering in DCT-IV/DST-IV and MDCT/MDST domain. Signal Process. 89(6), 1081–1089 (2009)zbMATHGoogle Scholar
  9. 27.
    Y. Wang, M. Vilermo, Modified discrete cosine transform – its implications for audio coding and error concealment. J. Audio Eng. Soc. 51(1/2), 52–61 (2003)Google Scholar
  10. 28.
    Y. Wang, L. Yaroslavsky, M. Vilermo, On the relationship between MDCT, SDFT and DFT, in Proceedings of the 5th International Conference on Signal Processing (ICSP’2000), Beijing, August 2000, pp. 44–47Google Scholar
  11. 29.
    Y. Wang, L. Yaroslavsky, M. Vilermo, M. Väänänen, Some peculiar properties of the MDCT, in Proceedings of the 5th International Conference on Signal Processing (ICSP’2000), Beijing, August 2000, pp. 61–64Google Scholar
  12. 30.
    K. Wright, Notes on Ogg Vorbis and the MDCT. Draft document on web site: www.free-comp-shop.com/vorbis.html, May 2003, 7 pp.
  13. 31.
    S. Zhang, W. Dou, H. Yang, DFT spectrum estimation from critically sampled lapped transforms. Signal Process. 91(2), 300–310 (2011)zbMATHGoogle Scholar

Windowing Procedure, Perfect Reconstruction Conditions, and Design of Windowing Function

  1. 32.
    M. Bosi, R.E. Golberg, Introduction to Digital Audio Coding and Standards (Springer Science+Business Media, New York, NY, 2003)Google Scholar
  2. 33.
    M. Bosi, K. Brandenburg, S. Quackenbush, L. Fielder, K. Akagiri, H. Fuchs, M. Dietz, J. Herre, G. Davidson, Y. Oikawa, ISO/IEC MPEG-2 advanced audio coding, in 101st AES Convention, Los Angeles, CA, November 1996. Preprint #4382. Also published in J. Audio Eng. Soc. 45(10), 789–813 (1997)Google Scholar
  3. 34.
    B. Edler, Coding of audio signals with overlapping block transform and adaptive window functions. Frequenz 43(9), 252–256 (1989) (in German)Google Scholar
  4. 35.
    A.J. Ferreira, Convolutional effects in transform coding with TDAC: an optimal window. IEEE Trans. Speech Audio Process. 4(2), 104–114 (1996)Google Scholar
  5. 36.
    F.J. Harris, On the use of windows for harmonic analysis with the discrete Fourier transform. Proc. IEEE 66(1), 51–83 (1978)Google Scholar
  6. 37.
    T. Mochizuki, Perfect reconstruction conditions for adaptive blocksize MDCT. IEICE Trans. Fundam. E77-A(5), 894–899 (1994)Google Scholar
  7. 38.
    T. Painter, A. Spanias, Perceptual coding of digital audio. Proc. IEEE 88(4), 451–513 (2000)Google Scholar
  8. 39.
    J.R. Rice, The Approximation of Functions, vol. I (Addison-Wesley, Reading, MA, 1964), pp. 124–131Google Scholar
  9. 40.
    G. Smart, A.B. Bradley, Filter bank design based on time domain aliasing cancellation with non-identical windows, in Proceedings of the IEEE ICASSP’94, vol. III, Adelaide, April 1994, pp. 185–188Google Scholar
  10. 41.
    A. Spanias, T. Painter, V. Atti, Audio Signal Processing and Coding, chap. 6 (Wiley, Hoboken, NJ, 2007), pp. 145–210Google Scholar
  11. 42.
    G. Wolberg, Digital Image Warping (IEEE Computer Press, Los Alamitos, CA, 1990), pp. 137–144Google Scholar

Low (Reduced) Overlap Windowing Functions

  1. 43.
    E. Allamanche, R. Geiger, J. Herre, T. Sporer, MPEG-4 low delay audio coding based on the AAC codec, in 106th AES Convention, Munich, May 1999. Preprint #4929Google Scholar
  2. 44.
    M. Lutzky, M. Schnell, M. Schmidt, R. Geiger, Structural analysis of low latency audio coding schemes, in 119th AES Convention, New York, NY, October 2005. Preprint #6601Google Scholar
  3. 45.
    OPUS interactive speech and audio codec, Available on web site: www.opus-codec.org (2016)
  4. 46.
    J.-M. Valin, T.B. Terriberry, G. Maxwell, A full-band audio codec with low complexity and very low delay, in Proceedings of the 17th European Signal Processing Conference (EUSIPCO’2009), Glasgow, August 2009, pp. 1254–1258Google Scholar
  5. 47.
    J.-M. Valin, T.B. Terriberry, C. Montgomery, G. Maxwell, A high-quality speech and audio codec with less than 10 ms delay. IEEE Trans. Audio Speech Lang. Process. 18(1), 58–67 (2010)Google Scholar
  6. 48.
    J.-M. Valin, G. Maxwell, T.B. Terriberry, C. Montgomery, K. Vos, High-quality, low-delay music coding in the Opus codec, in 135th AES Convention, New York, NY, October 2013. Preprint #8942Google Scholar

(Generalized) Lapped Orthogonal Transforms

  1. 49.
    A.N. Akansu, F.E. Wadas, On lapped orthogonal transform. IEEE Trans. Signal Process. 40(2), 439–443 (1992)Google Scholar
  2. 50.
    R.L. de Queiroz, T.Q. Nguyen, K.R. Rao, Generalized lapped orthogonal transforms. Electron. Lett. 30(2), 107–108 (1994)Google Scholar
  3. 51.
    R.L. de Queiroz, T.Q. Nguyen, K.R. Rao, The GenLOT: generalized linear-phase lapped orthogonal transform. IEEE Trans. Signal Process. 44(3), 497–507 (1996)Google Scholar
  4. 52.
    H.S. Malvar, Reduction of blocking effects in image coding with a lapped orthogonal transform, in Proceedings of the IEEE ICASSP’88, New York, NY, April 1988, pp. 781–784Google Scholar
  5. 53.
    H.S. Malvar, The LOT: a link between block transform coding and multirate filter banks, in Proceedings of International Symposium on Circuits and Systems, Espoo, June 1988, pp. 835–838Google Scholar
  6. 54.
    H.S. Malvar, Pseudolapped orthogonal transforms. Electron. Lett. 25(5), 312–314 (1989)Google Scholar
  7. 55.
    H.S. Malvar, D.H. Staelin, The LOT: transform coding without blocking effects. IEEE Trans. Audio Speech Lang. Process. 37(4), 553–559 (1989)Google Scholar
  8. 56.
    V.K. Madisetti, D.B. Williams (eds.), The Digital Signal Processing Handbook, Part 38, Lapped Transforms (CRC & IEEE, Boca Raton, FL, 1998), pp. 1–7Google Scholar
  9. 57.
    K.R. Rao, P.C. Yip (eds.), Lapped transforms for image compression (Chapter 5), in The Transform and Data Compression Handbook (CRC, Boca Raton, FL, 2001), pp. 197–265Google Scholar

MLT and ELT (Biorthogonal, Nonuniform) Analysis and Synthesis Filter Banks

  1. 58.
    S. Cheung, J.S. Lim, Incorporation of biorthogonality into lapped transforms for audio compression, in Proceedings of the IEEE ICASSP’95, Detroit, MI, April 1995, pp. 3079–3082Google Scholar
  2. 59.
    H.S. Malvar, Lapped transforms for efficient transform/sub-band coding. IEEE Trans. Acoust. Speech Signal Process. 38(6), 969–978 (1990)Google Scholar
  3. 60.
    H.S. Malvar, Modulated QMF filter banks with perfect reconstruction. Electron. Lett. 26(13), 906–910 (1990)Google Scholar
  4. 61.
    H.S. Malvar, Extended lapped transforms: fast algorithms and applications, in Proceedings of the IEEE ICASSP’91, Toronto, May 1991, pp. 1797–1800Google Scholar
  5. 62.
    H.S. Malvar, Extended lapped transforms: properties, applications, and fast algorithms. IEEE Trans. Signal Process. 40(11), 2703–2714 (1992)zbMATHGoogle Scholar
  6. 63.
    H.S. Malvar, Signal Processing with Lapped Transforms (Artech House, Norwood, MA, 1992). Chapter 4: Lapped Orthogonal Transforms, pp. 143–173 and Chapter 5: Modulated Lapped Transforms, pp. 175–218Google Scholar
  7. 64.
    H.S. Malvar, Extended cosine bases and applications to audio coding. Comput. Appl. Math. 15(2), 111–123 (1996)zbMATHGoogle Scholar
  8. 65.
    H.S. Malvar, Biorthogonal and nonuniform lapped transforms for transform coding with reduced blocking and ringing artifacts. IEEE Trans. Signal Process. 46(4), 1043–1053 (1998)MathSciNetGoogle Scholar
  9. 66.
    M. Padmanabhan, K. Martin, Some further results on modulated/extended lapped transforms, in Proceedings of the IEEE ICASSP’92, vol. IV, San Francisco, CA, April 1992, pp. 265–268Google Scholar

MCLT Analysis and Synthesis Filter Banks

  1. 67.
    T. Aach, D. Kunz, A lapped directional transform for spectral image analysis and its application to restoration and enhancement. Signal Process. 80(11), 2347–2364 (2000)zbMATHGoogle Scholar
  2. 68.
    V. Britanak, New recursive fast radix-2 algorithm for the modulated complex lapped transform. IEEE Trans. Signal Process. 60(12), 6703–6708 (2012)MathSciNetzbMATHGoogle Scholar
  3. 69.
    S. Cheng, Z. Xiong, Audio coding and image denoising based on the nonuniform modulated complex lapped transform. IEEE Trans. Multimedia 7(10), 817–827 (2005)Google Scholar
  4. 70.
    D. Kunz, T. Aach, Lapped directional transform: a new transform for spectral image analysis, in Proceedings of the IEEE ICASSP’99, Phoenix, AR, May 1999, pp. 3433–3436Google Scholar
  5. 71.
    H. Malvar, A modulated complex lapped transform and its applications to audio processing, in Proceedings of the IEEE ICASSP’99, Phoenix, AR, May 1999, pp. 1421–1424Google Scholar
  6. 72.
    Z. Xiong, H.S. Malvar, A nonuniform modulated complex lapped transform. IEEE Signal Process. Lett. 8(9), 257–260 (2001)Google Scholar
  7. 73.
    B.-J. Yoon, H.S. Malvar, Coding over-complete representations of audio using the MCLT, in Proceedings of the IEEE Data Compression Conference, Snowbird, UT, March 2008, pp. 152–161Google Scholar

Supporting Literature

  1. 74.
    V. Britanak, P. Yip, K.R. Rao, Discrete Cosine and Sine Transforms: General Properties, Fast Algorithms and Integer Approximations (Academic, Elsevier Science, Amsterdam, 2007)Google Scholar
  2. 75.
    R.E. Crochiere, L.R. Rabiner, Multirate techniques in filter banks and spectrum analyzers and synthesizers (Chapter 7), in Multirate Digital Signal Processing (Prentice-Hall, Englewood Cliffs, NJ, 1983), pp. 289–400Google Scholar
  3. 76.
    M. Fiedler, Special Matrices and their using in Numerical Mathematics (SNTL, Prague, 1981) (in Czech)zbMATHGoogle Scholar
  4. 77.
    F.R. Gantmacher, The Theory of Matrices, 2nd edn. (Nauka, Moscow, 1966) (in Russian). English translation: Vol. 1 and 2, Chelsea, New York, 1959Google Scholar
  5. 78.
    G.H. Golub, C.F. Van Loan, Matrix Computations, 3rd edn. (Johns Hopkins University Press, Baltimore, MD, 1996)zbMATHGoogle Scholar
  6. 79.
    P.P. Vaidyanathan, Multirate Systems and Filter Banks (Prentice-Hall, Englewood Cliffs, NJ, 1992)zbMATHGoogle Scholar

Copyright information

© Springer International Publishing AG 2018

Authors and Affiliations

  • Vladimir Britanak
    • 1
  • K. R. Rao
    • 2
  1. 1.Institute of InformaticsSlovak Academy of SciencesBratislavaSlovakia
  2. 2.The University of Texas at ArlingtonArlingtonUSA

Personalised recommendations