MDCT/MDST, MLT, ELT, and MCLT Filter Banks: Definitions, General Properties, and Matrix Representations

  • Chapter
Cosine-/Sine-Modulated Filter Banks

Abstract

Perfect reconstruction cosine/sine-modulated filter banks, which belong to the class of modulated filter banks, have been studied extensively because of their attractive features (simple structure, analysis and synthesis filters of equal length, low computational complexity), and they have consequently attracted great interest in audio coding applications. In fact, they are employed in international speech and audio coding standards and in proprietary audio compression algorithms. The oddly and evenly stacked modified discrete cosine transform (MDCT) and the corresponding modified discrete sine transform (MDST), the modulated lapped transform (MLT), the extended lapped transforms (ELTs), and their biorthogonal versions are real-valued cosine/sine-modulated filter banks satisfying the perfect reconstruction property. The modulated complex lapped transform (MCLT) is a complex-valued filter bank whose real part is the MLT or, equivalently, the oddly stacked MDCT, and whose imaginary part is the oddly stacked MDST. In this chapter, definitions, general properties, and matrix representations of the MDCT/MDST, MLT, ELT, and MCLT filter banks are presented. For an analysis/synthesis filter bank to achieve perfect reconstruction, the necessary and sufficient conditions imposed on the analysis and synthesis windowing functions play an important role. Therefore, the following topics are additionally derived and/or discussed in detail: the windowing procedure and the perfect reconstruction (biorthogonal) conditions for identical (nonidentical) analysis and synthesis windowing functions; the design of a windowing function, including definitions of windowing functions commonly used in audio coding applications; adaptive switching of transform block sizes and windowing functions; and general perfect reconstruction conditions for the ELT filter bank with a multiple overlapping factor, for both the orthogonal and biorthogonal cases.
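The perfect reconstruction property summarized above can be illustrated with a minimal Python sketch. It is not taken from the chapter and rests on several assumptions: identical analysis and synthesis windows (a sine window), a normalization factor of sqrt(2/N), the usual oddly stacked phase shift of (N + 1)/2, and the sign convention MCLT = MDCT − j·MDST. All function names are hypothetical. The sketch computes the oddly stacked MDCT and MDST of 50%-overlapped blocks as plain matrix products and resynthesizes the signal by windowed overlap-add.

```python
import numpy as np

def sine_window(N):
    # Sine window of length 2N; it satisfies w[n]^2 + w[n+N]^2 = 1 (assumed choice).
    n = np.arange(2 * N)
    return np.sin(np.pi * (n + 0.5) / (2 * N))

def modulation_matrices(N):
    # N x 2N cosine and sine modulation matrices of the oddly stacked MDCT/MDST,
    # with an assumed sqrt(2/N) normalization.
    n = np.arange(2 * N)[None, :]
    k = np.arange(N)[:, None]
    phase = np.pi / N * (n + 0.5 + N / 2) * (k + 0.5)
    scale = np.sqrt(2.0 / N)
    return scale * np.cos(phase), scale * np.sin(phase)

def analysis(x, N, w, M):
    # Zero-pad by N on each side so every input sample lies in two blocks,
    # then window each 2N-sample block (hop size N) and apply the matrix M.
    xp = np.concatenate([np.zeros(N), x, np.zeros(N)])
    n_blocks = len(xp) // N - 1
    return np.array([M @ (w * xp[t * N : t * N + 2 * N]) for t in range(n_blocks)])

def synthesis(coeffs, N, w, M):
    # Inverse transform (transpose of M), window again, and overlap-add;
    # the time domain aliasing of adjacent blocks cancels in the sum.
    n_blocks = coeffs.shape[0]
    y = np.zeros((n_blocks + 1) * N)
    for t in range(n_blocks):
        y[t * N : t * N + 2 * N] += w * (M.T @ coeffs[t])
    return y[N:-N]  # drop the zero padding

if __name__ == "__main__":
    N = 8
    rng = np.random.default_rng(0)
    x = rng.standard_normal(6 * N)  # length must be a multiple of N
    w = sine_window(N)
    C, S = modulation_matrices(N)

    X_mdct = analysis(x, N, w, C)
    X_mdst = analysis(x, N, w, S)
    X_mclt = X_mdct - 1j * X_mdst  # assumed MCLT sign convention

    print("MDCT reconstruction error:", np.max(np.abs(synthesis(X_mdct, N, w, C) - x)))
    print("MDST reconstruction error:", np.max(np.abs(synthesis(X_mdst, N, w, S) - x)))
```

In this sketch each real-valued filter bank reconstructs the input on its own, so the MCLT coefficients carry twofold redundancy: either the real part or the imaginary part is sufficient for synthesis. The matrix products are used for clarity only; a practical implementation would rely on a fast algorithm.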

Change history

  • 08 January 2019

    On p. v, in the third line from top, “in” was inserted after comma to read “. . ., and vice versa, in many sub-band/transform-based schemes . . .”

Copyright information

© 2018 Springer International Publishing AG

About this chapter

Cite this chapter

Britanak, V., Rao, K.R. (2018). MDCT/MDST, MLT, ELT, and MCLT Filter Banks: Definitions, General Properties, and Matrix Representations. In: Cosine-/Sine-Modulated Filter Banks. Springer, Cham. https://doi.org/10.1007/978-3-319-61080-1_3

  • DOI: https://doi.org/10.1007/978-3-319-61080-1_3

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-319-61078-8

  • Online ISBN: 978-3-319-61080-1

  • eBook Packages: Engineering, Engineering (R0)