Skip to main content

Part of the book series: The Springer International Series in Engineering and Computer Science ((SECS,volume 378))

Abstract

A multiscale representation for video signals based on wavelet decomposition and multi-resolution motion estimation (MRME) is presented in this chapter. The multiresolution/multifrequency nature of the discrete wavelet transform lends itself as an ideal tool for representing images and video signals for a variety of scan formats and resolutions. The wavelet transform decomposes a video frame into a set of sub-frames with different resolutions corresponding to different frequency bands. These multiresolution frames also provide a representation of the global motion structure in the scene at different scales. The motion activities in different sub-frames are hence highly correlated since they actually specify the same motion structure at different scales.

In the proposed MRME the motion vectors in higher resolution are first predicted by the motion vectors in the lower resolution, and then refined for that scale. In particular, we propose a variable block-size multi-resolution motion compensation (MRMC) scheme in which the size of a block in a sub-frame is varied according to its level in the pyramid. This scheme not only considerably reduces the searching and matching time but also provides a meaningful characterization of the intrinsic motion structure. After wavelet decomposition, each scaled sub-frame or wavelet tends to have different statistical properties and treated independently of each other.

For quantization, an adaptive truncation process is implemented in our work. Another scheme for bit allocation is examined by adapting to the local variance distribution of the coefficients in each scaled wavelet. Based on the wavelet representation, variable-size MRMC approach and a uniform quantization scheme, four variations of the proposed motion-compensated wavelet video compression system are presented. It is shown that the motion-compensated wavelet transform coding approach provides a truly scalable representation for full-motion video information. Since all the wavelets are treated independent of each other except for the lowest resolution sub-frame, selective processing of different sub-frames for different formats and quality make this scheme very suitable for the broadcast environment where incompatible formats can coexist simultaneously.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 169.00
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 219.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 219.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.

References

  1. E. Adelson, Orthogonal Pyramid Transforms for Image Coding, SPIE Visual Communication and Image Processing II, vol. 845, 1987, pp. 50–58.

    Google Scholar 

  2. A. Akansu and R. Haddad and H. Caglar, for Multiresolution Signal Decomposition, Signal Processing, To appear in the journal, 1993.

    Google Scholar 

  3. M. Antonini and M. Barlaud and P. Mathieu and I. Daubechies, Image Coding Using Vector Quantization in the Wavelet Transform Domain, Proc. of ICASSP’90, 1990, pp. 2297-2300.

    Google Scholar 

  4. M. Antonini and M. Barloud and P. Mathieu and I. Doubechies, Image Coding Using Wavelet Transform, IEEE Trans. on Image Processing, Vol. 1, No. 2, April 1992, pp. 205–220.

    Article  Google Scholar 

  5. FCC ACATS Document SS/WP1-0184, Advanced Television Research Consortium (ATRC), System Description — Advanced Digital Television, 1991.

    Google Scholar 

  6. N. Baaziz and Claude Labit, Laplacian Pyramid Versus Wavelet Decomposition for Image Sequence Coding, Proc. of ICASSP’90, April, 1990.

    Google Scholar 

  7. S. Brofferio and F. Rocca, Interframe redundancy reduction of video signals generated by translating objects, IEEE Trans. on Communications, Vol. 25, April 1977, pp. 448–455.

    Article  Google Scholar 

  8. P. Burt and E. Adelson, The Laplacian Pyramid as a Compact Image Code, IEEE Trans. on Communications, Vol. 31, April 1983, pp. 532–540.

    Article  Google Scholar 

  9. P. Burt, Multi-resolution techniques for image representation, analysis, and “smart” transmission, SPIE Visual Communications and Image Processing IV, Vol. 1199, November 1989.

    Google Scholar 

  10. C. Cafforio and F. Rocca, Method for measuring small displacements of TV images, IEEE Trans, on Information Theory, Vol. 22, September 1976, pp. 573–579.

    Article  Google Scholar 

  11. Document 572, CCITT SG XV, Working Party XV/1, Spec. Grp. on Coding for Visual Telephony, Draft Revision of Recommendation H.261, 1990.

    Google Scholar 

  12. M. Liou, Overview of the px64 kbit/s Video Coding Standard, Communications of the ACM, Vol. 34, No. 4, April, 1991, pp. 59–63.

    Article  Google Scholar 

  13. W. Chen and W. Pratt, Scene Adaptive Coder, IEEE Trans. on Communications, March 1984, pp. 225–232.

    Google Scholar 

  14. R. R. Coifman and M. V. Wickerhauser, Entropy-based algorithms for best basis selection, IEEE Tran. on Information Theory, Vol. 38, 1992, pp. 713–718.

    Article  MATH  Google Scholar 

  15. I. Daubechies, Orthonormal Bases of Compactly Supported Wavelets, Communications on Pure and Applied Mathematics, Vol. XLI909-996, 1988.

    Google Scholar 

  16. I. Daubechies, The Wavelet Transform, Time-Frequency Localization and Signal Analysis, IEEE Trans. on Information Theory, Vol. 36, No. 6, September 1990, pp. 961–1005.

    Article  MathSciNet  MATH  Google Scholar 

  17. M. Antonini and M. Barloud and P. Mathieu and I. Doubechies, Image Coding Using Wavelet Transform, IEEE Trans. on Image Processing, Vol. 1, No. 2, April 1992, pp. 205–220.

    Article  Google Scholar 

  18. I. Daubechies, Ten Lectures on Wavelets, SIAM, 1992.

    Google Scholar 

  19. S. N. Efstratiadis and A. K. Katsaggeloso, Nonstationar AR Modeling and Constrained Recursive Estimation of the Displacement Field, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 2, No. 4, December 1992, pp. 334–346.

    Article  Google Scholar 

  20. F. Yegenoglu and B. Jabbari and Ya-Qin Zhang, Motion Classified Autoregressive Modeling for Variable Bit Rate, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 3, No. 1, February 1993, pp. 42–53.

    Article  Google Scholar 

  21. A. Gersho, Asymptotically optimal block quantization, IEEE Trans. Information Theory, Vol. IT-25, July 1979.

    Google Scholar 

  22. M. Ghanbari, The cross-search algorithm for motion-estimation, IEEE Trans. on Communications, Vol. 38, No. 7, July 1990, pp. 950–953.

    Article  Google Scholar 

  23. submitted on behalf of American Television Alliance, FCC ACTS Document SS/WP1-0193, DigiCipher HDTV System Description, General Instruments Corporation, 1991.

    Google Scholar 

  24. J. Jain and A. Jain, Displacement measurement and its application in interframe image coding, IEEE Trans. on Communications, Vol. 29, No. 12, December 1981.

    Google Scholar 

  25. G. Wallace, The JPEG Still Picture Compression Standardz, Communications of the ACM, Vol. 34, No. 4, April 1991, pp. 30–44.

    Article  Google Scholar 

  26. T. Koga and K. Iinuma and A. Hirano and Y. Iijima and T. Ishiguro, Motion-compensated interframe coding for video conferencing, Proc. NTC’ 81, 1981, pp. G5.3.1 — G5.3.5.

    Google Scholar 

  27. J. Limb and J. Murphy, Measuring the speed of moving objects from TV Signals, IEEE Trans. on Communications, Vol. 23, April 1975, pp. 474–478.

    Article  Google Scholar 

  28. V. Bove and A. Lippman, Scalable Open-Architecture Television, SMPTE Journal, January 1992, pp. 2-5.

    Google Scholar 

  29. S. Mallat, Multifrequency Channel Decompositions of Images and Wavelet Models, IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 17, No. 12, December 1989.

    Google Scholar 

  30. S. Mallat, A Theory for Multi-Resolution Signal Decomposition: The Wavelet Representation, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 11, No. 7, July 1989.

    Google Scholar 

  31. D. Marr, Vision, Freedman, New York, 1982.

    Google Scholar 

  32. F. Mintzer, Filters for Distortion-Free Two-Band Multirate Filter Banks, IEEE Trans. on Acoust., Speech, Signal Proa, Vol. 33, June 1985, pp. 626–630.

    Article  Google Scholar 

  33. Submitted on behalf of The American Television Alliance, FCC ACATS Document SS/WP1-0184, ATVA-Progressive System, Massachusetts Institute of Technology, 1991.

    Google Scholar 

  34. R. Moorhead and S. Rajala and L. Cook, Image Sequence Compression Using a Pel-Recursive Motion-Compensated Technique, IEEE Journal of Selected Areas in Communications, Vol. 5, No. 7, August 1987.

    Google Scholar 

  35. Motion Picture Experts Group, ISO/IEC JTC1/SC2/WG8, CCITT SGVIII, Coded representation of picture and audio information, MPEG video simulation model two, 1990.

    Google Scholar 

  36. International Organization for Standardization Coded Representation of Picture and Audio Information, MPEG Video Simulation Model Three, 1990.

    Google Scholar 

  37. D. Le Gall, MPEG: A Video Compression Standard for Multimedia Applications, Communications of the ACM, Vol. 34, No. 4, April 1991, pp. 46–58.

    Article  Google Scholar 

  38. H. Musmann and P. Pirsch and H. Grallert, Advances in picture coding, Proc. of IEEE, Vol. 73, No. 4, April 1985, pp. 523–548.

    Article  Google Scholar 

  39. T. Naveen and J. W. Woods, Motion Compensated Multiresolution Transmission of High Definition Video, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 4, No. 1, February 1994, pp. 29–41.

    Article  Google Scholar 

  40. A. Netravali and J. Robbins, Motion-Compensated Television Coding: Part I, Bell Technical Journal, Vol. 58, No. 3, March 1979, pp. 631–670.

    MATH  Google Scholar 

  41. A. Netravali and B. Haskell, Digital Pictures — Representation and Compression, Plenum Press, New York, 1989.

    Google Scholar 

  42. N. Ohta and M. Nomura and T. Fujii, Variable Rate Coding Using Motion-Compensated DCT for Asynchronous Transfer Mode Network, IEEE ICC’88, 1988, pp. 1257-1261.

    Google Scholar 

  43. W. Paik, Digicipher — All Digital, Channel Compatible, HDTV Broadcast System, IEEE Trans. on Broadcasting, Vol. 36, No. 4, December 1990, pp. 245–254.

    Article  Google Scholar 

  44. A. Puri, Multi-Frame Conditional Motion-Compensated Interpolation and Coding, Proc. Picture Coding Symposium, Cambridge, MA, March 1990, pp. 8.3-1-8.3-2.

    Google Scholar 

  45. O. Rioul and M. Vetterli, Wavelet and Signal Processing, IEEE Signal Processing Magazine, Vol. 8, No. 4, October 1991, pp. 14–38.

    Article  Google Scholar 

  46. O. Rioul, Simple regularity criteria for subdivision schemes, SIAM J. of Math. Anal., Vol. 23, 1992, pp. 1544–1576.

    Article  MathSciNet  MATH  Google Scholar 

  47. F. Rocca and S. Zanoletti, Bandwidth reduction via movement compensation on a model of the random video process, IEEE Trans. on Communications, Vol. 20, October 1972, pp. 960–965.

    Article  Google Scholar 

  48. M.J.T. Smith and T.P. Barnwell, Exact Reconstruction for Tree-Structured Subband Coders, IEEE Trans. on Acoust., Speech and Signal Proc, Vol. ASSP-34, June 1986, pp. 434–441.

    Article  Google Scholar 

  49. R. Srinivasan and K. Rao, Predictive Coding based on efficient motion estimation, IEEE ICC’84, 1984, pp. 521-526.

    Google Scholar 

  50. A. H. Tewfik and D. Sinha and P. Jorgensen, On the Optimal Choice of a Wavelet for Signal Representation, IEEE Tran. on Information Theory, Vol. 38, 1992, pp. 747–765.

    Article  MATH  Google Scholar 

  51. A. Wong and C. Chen and D. LeGall and F. Chang and M. Uz, MCPIC: A Video Coding Algorithm for Transmission and Storage Applications, IEEE Communications Magazine, November 1990, pp. 24-32.

    Google Scholar 

  52. K. Uz and M. Vetterli and D. LeGall, Interpolative Multi-resolution Coding of Advanced Television and Compatible Subchannels, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 1, No. 1, March 1991.

    Google Scholar 

  53. P.P. Vaidyanathan and P.Q. Hoang, Lattice Structures for Optimal Design and Robust Implementation of Two-Band Perfect Reconstruction QMF Banks, IEEE Trans. on Acoust., Speech and Signal Proc, Vol. ASSP-36, No. 1, January 1988, pp. 81–94.

    Article  Google Scholar 

  54. M. Vetterli, Multidimensional Subband Coding: Some Theory and Algorithms, Signal Processing, Vol. 6, April 1984, pp. 97–112.

    Article  MathSciNet  Google Scholar 

  55. M. Vetterli, Filter Banks Allowing perfect Reconstruction, Signal Processing, Vol. 10, No. 3, April 1986, pp. 219–244.

    Article  MathSciNet  Google Scholar 

  56. M. Vetterli and C. Herley, Wavelets and Filter Banks: Relationships and New Results, Proc. of ICASSP’90, April 3–6, 1990.

    Google Scholar 

  57. M. Vetterli and J. Kovacevic, Perfect Reconstruction Filter Banks for HDTV Representation and Coding, Signal Processing: Image Communication 2, 1990, pp. 349–363.

    Article  Google Scholar 

  58. M. Vetterli and C. Herley, Wavelets and Filter banks: Theory and design, IEEE Trans. on Signal Processing, Vol. 40, 1992, pp. 2207–2232.

    Article  MATH  Google Scholar 

  59. J. D. Villasenor and B. Beizer and J Liao, Filter Evaluation and Selection in Wavelet Image Compression, Proceedings Data Compression Conference, editor: J. A. Storer and M. Cohn, IEEE Computer Society Technical Committee on Computer Communications, March 29–31, Snowbird Utah, 1994, pp. 351-360.

    Google Scholar 

  60. D. Walker and K. Rao, New Techniques in Pel-Recursive Motion Estimation, Proceedings of ICC, 1984, pp. 703-706.

    Google Scholar 

  61. A. Watson, The Cortex Transform: Rapid Computation of Simulated Neural Images, Computer Vision, Graphics, Image Processing, Vol. 39, 1987, pp. 311–327.

    Article  Google Scholar 

  62. J. Woods and S. O’Neil, Subband Coding of Images, IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 34, No. 5, October 1986.

    Google Scholar 

  63. J. Woods, Subband Image Coding, Kluwer Academic, 1991.

    Google Scholar 

  64. J. Woods and T. Naveen, A Filter Based Bit Allocation Scheme for Subband Compression of HDTV, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 1, No. 3, July 1992, pp. 436–440.

    Google Scholar 

  65. K. Xie and L. Eycken and A. Oosterilinck, Some New Techniques in Displacement Estimation Algorithms, SPIE Visual Communications and Image Processing, Vol. 1199, 1989, pp. 18–29.

    Google Scholar 

  66. S. Zafar and Y. Zhang and J. Baras, Predictive Block-Matching Motion Estimation for TV Coding — Part I: Inter-block Prediction, IEEE Trans. on Broadcasting, Vol. 37, No. 3, September 1991, pp. 97–101.

    Article  Google Scholar 

  67. Y. Zhang and S. Zafar, Predictive Block-Matching Motion Estimation for TV Coding — Part II: Inter-Frame Prediction, IEEE Trans. on Broadcasting, Vol. 37, No. 3, September 1991, pp. 101–105.

    Article  Google Scholar 

  68. S. Zafar and Y. Zhang and J. Baras, Predictive Block-Matching Motion Estimation Schemes for Video Compression, Proceedings of IEEE Int’l Conference on Consumer Electronics, Chicago IL, June 6–8, 1991, pp. 300-301.

    Google Scholar 

  69. Y. Zhang and S. Zafar, Motion-Compensated Wavelet Transform Coding for Color Video Compression, SPIE Visual Communications and Image Processing’ 91: Visual Communication, Vol. 1605, November 1991, pp. 301–316.

    Google Scholar 

  70. Y. Zhang and S. Zafar, Motion-Compensated Wavelet Transform Coding for Color Video Compression, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 2, No. 3, September 1992, pp. 285–296.

    Article  Google Scholar 

  71. S. Zafar and Y. Zhang and B. Jabbari, Multiscale Video Representation Using Multi-Resolution Motion Compensation and Wavelet Decomposition, IEEE Journal of Selected Areas in Communications, January 1993.

    Google Scholar 

  72. B. Jabbari and F. Yegenoglu and Y. Kuo and S. Zafar and Y. Zhang, Statistical Characterization and Block-Based Modeling of Motion Adaptive Coded Video, IEEE Trans. on Circuit and Systems for Video Technology, April 1993.

    Google Scholar 

  73. Sohail Zafar, Motion Estimation and Encoding Algorithms for Hierarchical Representation of Digital Video, George Mason University 1994.

    Google Scholar 

  74. FCC ACATS Document SS/WP1-0193, Digital Spectrum-Compatible HDTV: Technical Details, Zenith Electronics Corporation and AT&T Bell Laboratories, 1991.

    Google Scholar 

  75. W. Zettler and J. Huffman and D. Linden, Application of Compactly Supported Wavelets to Image Compression, SPIE Image Processing Algorithms and Techniques, Santa Clara, CA, Vol. 1244, February 1990, pp. 150-160.

    Google Scholar 

  76. J. Joen and J. Kim, On the Hierarchical Edge-Based Block Motion Estimation for Video Subband Coding at Low Bit Rates, SPIE Visual Communications and Image Processing, Boston, MA, Vol. 2094, November 9-13, 1993.

    Google Scholar 

  77. S. Panchanathan, E. Chan, and X. Wang, Fast Multiresolution Motion Estimation Scheme for a Wavelet Transform Video Coder, SPIE Visual Communications and Image Processing,(Also IEEE Trans. Image Processing), Chicago, IL, Vol. 2308, September 25–29, 1994, pp. 671-681.

    Google Scholar 

  78. K. Goh, J. Soragham, and T. Durrani, Multiresolution Based Algorithms for Low Bit Rate Image Coding, IEEE Int. Conf. Image Processing, Austin, TX, Vol. 3, November 13-16, 1994, pp. 285–289.

    Google Scholar 

  79. R.J. Clarke and Y. Wang, Multiresolution Motion Estimation Scheme for Very Low Bit Rate Video Coding, Very Low Bit Rate Video Workshop, Colchester, UK, Vol. 1, May 25-29, 1994, pp. 671–681.

    Google Scholar 

  80. H. Oh, Y. Baek, G. Kim, G. Park, H. Lee, and J. Jeon, Very Low Bitrate Video Coding Wavelet Decomposition, ISO/IEC JTC1/SC29/WG11, MPEG94/392, Singapore, November 25-29, 1994, pp. 1-10.

    Google Scholar 

  81. W. Li and Y.-Q. Zhang, Vector-Based Signal Processing and Quantization for Image and Video Compression, Proceedings of IEEE, Vol. 83, February 1995, pp. 671–681.

    Article  Google Scholar 

  82. J. Shapiro, Embedded Image Coding Using Zerotrees of Wavelet Coefficients, IEEE Trans. Signal Processing, Vol. 41, December 1993, pp. 3443–3463.

    Article  Google Scholar 

  83. M. Bierling, Displacement Estimation by Hierarchical Block Matching, SPIE Visual Communications and Image Processing, Vol. 1001, November 1988, pp. 942–951.

    Article  Google Scholar 

  84. H. Gharavi, Subband Coding Algorithms for Video Applications: Videophone to HDTV Conferencing, IEEE Trans. Circuits and Systems for Video Tech., Vol. 1, June 1991, pp. 174–183.

    Article  Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 1997 Springer Science+Business Media New York

About this chapter

Cite this chapter

Zhang, YQ., Zafar, S. (1997). Wavelet-Based Video Compression. In: Li, H.H., Sun, S., Derin, H. (eds) Video Data Compression for Multimedia Computing. The Springer International Series in Engineering and Computer Science, vol 378. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6239-9_1

Download citation

  • DOI: https://doi.org/10.1007/978-1-4615-6239-9_1

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-7862-4

  • Online ISBN: 978-1-4615-6239-9

  • eBook Packages: Springer Book Archive

Publish with us

Policies and ethics