Abstract
A multiscale representation for video signals based on wavelet decomposition and multi-resolution motion estimation (MRME) is presented in this chapter. The multiresolution/multifrequency nature of the discrete wavelet transform lends itself as an ideal tool for representing images and video signals for a variety of scan formats and resolutions. The wavelet transform decomposes a video frame into a set of sub-frames with different resolutions corresponding to different frequency bands. These multiresolution frames also provide a representation of the global motion structure in the scene at different scales. The motion activities in different sub-frames are hence highly correlated since they actually specify the same motion structure at different scales.
In the proposed MRME the motion vectors in higher resolution are first predicted by the motion vectors in the lower resolution, and then refined for that scale. In particular, we propose a variable block-size multi-resolution motion compensation (MRMC) scheme in which the size of a block in a sub-frame is varied according to its level in the pyramid. This scheme not only considerably reduces the searching and matching time but also provides a meaningful characterization of the intrinsic motion structure. After wavelet decomposition, each scaled sub-frame or wavelet tends to have different statistical properties and treated independently of each other.
For quantization, an adaptive truncation process is implemented in our work. Another scheme for bit allocation is examined by adapting to the local variance distribution of the coefficients in each scaled wavelet. Based on the wavelet representation, variable-size MRMC approach and a uniform quantization scheme, four variations of the proposed motion-compensated wavelet video compression system are presented. It is shown that the motion-compensated wavelet transform coding approach provides a truly scalable representation for full-motion video information. Since all the wavelets are treated independent of each other except for the lowest resolution sub-frame, selective processing of different sub-frames for different formats and quality make this scheme very suitable for the broadcast environment where incompatible formats can coexist simultaneously.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
References
E. Adelson, Orthogonal Pyramid Transforms for Image Coding, SPIE Visual Communication and Image Processing II, vol. 845, 1987, pp. 50–58.
A. Akansu and R. Haddad and H. Caglar, for Multiresolution Signal Decomposition, Signal Processing, To appear in the journal, 1993.
M. Antonini and M. Barlaud and P. Mathieu and I. Daubechies, Image Coding Using Vector Quantization in the Wavelet Transform Domain, Proc. of ICASSP’90, 1990, pp. 2297-2300.
M. Antonini and M. Barloud and P. Mathieu and I. Doubechies, Image Coding Using Wavelet Transform, IEEE Trans. on Image Processing, Vol. 1, No. 2, April 1992, pp. 205–220.
FCC ACATS Document SS/WP1-0184, Advanced Television Research Consortium (ATRC), System Description — Advanced Digital Television, 1991.
N. Baaziz and Claude Labit, Laplacian Pyramid Versus Wavelet Decomposition for Image Sequence Coding, Proc. of ICASSP’90, April, 1990.
S. Brofferio and F. Rocca, Interframe redundancy reduction of video signals generated by translating objects, IEEE Trans. on Communications, Vol. 25, April 1977, pp. 448–455.
P. Burt and E. Adelson, The Laplacian Pyramid as a Compact Image Code, IEEE Trans. on Communications, Vol. 31, April 1983, pp. 532–540.
P. Burt, Multi-resolution techniques for image representation, analysis, and “smart” transmission, SPIE Visual Communications and Image Processing IV, Vol. 1199, November 1989.
C. Cafforio and F. Rocca, Method for measuring small displacements of TV images, IEEE Trans, on Information Theory, Vol. 22, September 1976, pp. 573–579.
Document 572, CCITT SG XV, Working Party XV/1, Spec. Grp. on Coding for Visual Telephony, Draft Revision of Recommendation H.261, 1990.
M. Liou, Overview of the px64 kbit/s Video Coding Standard, Communications of the ACM, Vol. 34, No. 4, April, 1991, pp. 59–63.
W. Chen and W. Pratt, Scene Adaptive Coder, IEEE Trans. on Communications, March 1984, pp. 225–232.
R. R. Coifman and M. V. Wickerhauser, Entropy-based algorithms for best basis selection, IEEE Tran. on Information Theory, Vol. 38, 1992, pp. 713–718.
I. Daubechies, Orthonormal Bases of Compactly Supported Wavelets, Communications on Pure and Applied Mathematics, Vol. XLI909-996, 1988.
I. Daubechies, The Wavelet Transform, Time-Frequency Localization and Signal Analysis, IEEE Trans. on Information Theory, Vol. 36, No. 6, September 1990, pp. 961–1005.
M. Antonini and M. Barloud and P. Mathieu and I. Doubechies, Image Coding Using Wavelet Transform, IEEE Trans. on Image Processing, Vol. 1, No. 2, April 1992, pp. 205–220.
I. Daubechies, Ten Lectures on Wavelets, SIAM, 1992.
S. N. Efstratiadis and A. K. Katsaggeloso, Nonstationar AR Modeling and Constrained Recursive Estimation of the Displacement Field, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 2, No. 4, December 1992, pp. 334–346.
F. Yegenoglu and B. Jabbari and Ya-Qin Zhang, Motion Classified Autoregressive Modeling for Variable Bit Rate, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 3, No. 1, February 1993, pp. 42–53.
A. Gersho, Asymptotically optimal block quantization, IEEE Trans. Information Theory, Vol. IT-25, July 1979.
M. Ghanbari, The cross-search algorithm for motion-estimation, IEEE Trans. on Communications, Vol. 38, No. 7, July 1990, pp. 950–953.
submitted on behalf of American Television Alliance, FCC ACTS Document SS/WP1-0193, DigiCipher HDTV System Description, General Instruments Corporation, 1991.
J. Jain and A. Jain, Displacement measurement and its application in interframe image coding, IEEE Trans. on Communications, Vol. 29, No. 12, December 1981.
G. Wallace, The JPEG Still Picture Compression Standardz, Communications of the ACM, Vol. 34, No. 4, April 1991, pp. 30–44.
T. Koga and K. Iinuma and A. Hirano and Y. Iijima and T. Ishiguro, Motion-compensated interframe coding for video conferencing, Proc. NTC’ 81, 1981, pp. G5.3.1 — G5.3.5.
J. Limb and J. Murphy, Measuring the speed of moving objects from TV Signals, IEEE Trans. on Communications, Vol. 23, April 1975, pp. 474–478.
V. Bove and A. Lippman, Scalable Open-Architecture Television, SMPTE Journal, January 1992, pp. 2-5.
S. Mallat, Multifrequency Channel Decompositions of Images and Wavelet Models, IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 17, No. 12, December 1989.
S. Mallat, A Theory for Multi-Resolution Signal Decomposition: The Wavelet Representation, IEEE Trans. on Pattern Analysis and Machine Intelligence, Vol. 11, No. 7, July 1989.
D. Marr, Vision, Freedman, New York, 1982.
F. Mintzer, Filters for Distortion-Free Two-Band Multirate Filter Banks, IEEE Trans. on Acoust., Speech, Signal Proa, Vol. 33, June 1985, pp. 626–630.
Submitted on behalf of The American Television Alliance, FCC ACATS Document SS/WP1-0184, ATVA-Progressive System, Massachusetts Institute of Technology, 1991.
R. Moorhead and S. Rajala and L. Cook, Image Sequence Compression Using a Pel-Recursive Motion-Compensated Technique, IEEE Journal of Selected Areas in Communications, Vol. 5, No. 7, August 1987.
Motion Picture Experts Group, ISO/IEC JTC1/SC2/WG8, CCITT SGVIII, Coded representation of picture and audio information, MPEG video simulation model two, 1990.
International Organization for Standardization Coded Representation of Picture and Audio Information, MPEG Video Simulation Model Three, 1990.
D. Le Gall, MPEG: A Video Compression Standard for Multimedia Applications, Communications of the ACM, Vol. 34, No. 4, April 1991, pp. 46–58.
H. Musmann and P. Pirsch and H. Grallert, Advances in picture coding, Proc. of IEEE, Vol. 73, No. 4, April 1985, pp. 523–548.
T. Naveen and J. W. Woods, Motion Compensated Multiresolution Transmission of High Definition Video, IEEE Trans. on Circuits and Systems for Video Technology, Vol. 4, No. 1, February 1994, pp. 29–41.
A. Netravali and J. Robbins, Motion-Compensated Television Coding: Part I, Bell Technical Journal, Vol. 58, No. 3, March 1979, pp. 631–670.
A. Netravali and B. Haskell, Digital Pictures — Representation and Compression, Plenum Press, New York, 1989.
N. Ohta and M. Nomura and T. Fujii, Variable Rate Coding Using Motion-Compensated DCT for Asynchronous Transfer Mode Network, IEEE ICC’88, 1988, pp. 1257-1261.
W. Paik, Digicipher — All Digital, Channel Compatible, HDTV Broadcast System, IEEE Trans. on Broadcasting, Vol. 36, No. 4, December 1990, pp. 245–254.
A. Puri, Multi-Frame Conditional Motion-Compensated Interpolation and Coding, Proc. Picture Coding Symposium, Cambridge, MA, March 1990, pp. 8.3-1-8.3-2.
O. Rioul and M. Vetterli, Wavelet and Signal Processing, IEEE Signal Processing Magazine, Vol. 8, No. 4, October 1991, pp. 14–38.
O. Rioul, Simple regularity criteria for subdivision schemes, SIAM J. of Math. Anal., Vol. 23, 1992, pp. 1544–1576.
F. Rocca and S. Zanoletti, Bandwidth reduction via movement compensation on a model of the random video process, IEEE Trans. on Communications, Vol. 20, October 1972, pp. 960–965.
M.J.T. Smith and T.P. Barnwell, Exact Reconstruction for Tree-Structured Subband Coders, IEEE Trans. on Acoust., Speech and Signal Proc, Vol. ASSP-34, June 1986, pp. 434–441.
R. Srinivasan and K. Rao, Predictive Coding based on efficient motion estimation, IEEE ICC’84, 1984, pp. 521-526.
A. H. Tewfik and D. Sinha and P. Jorgensen, On the Optimal Choice of a Wavelet for Signal Representation, IEEE Tran. on Information Theory, Vol. 38, 1992, pp. 747–765.
A. Wong and C. Chen and D. LeGall and F. Chang and M. Uz, MCPIC: A Video Coding Algorithm for Transmission and Storage Applications, IEEE Communications Magazine, November 1990, pp. 24-32.
K. Uz and M. Vetterli and D. LeGall, Interpolative Multi-resolution Coding of Advanced Television and Compatible Subchannels, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 1, No. 1, March 1991.
P.P. Vaidyanathan and P.Q. Hoang, Lattice Structures for Optimal Design and Robust Implementation of Two-Band Perfect Reconstruction QMF Banks, IEEE Trans. on Acoust., Speech and Signal Proc, Vol. ASSP-36, No. 1, January 1988, pp. 81–94.
M. Vetterli, Multidimensional Subband Coding: Some Theory and Algorithms, Signal Processing, Vol. 6, April 1984, pp. 97–112.
M. Vetterli, Filter Banks Allowing perfect Reconstruction, Signal Processing, Vol. 10, No. 3, April 1986, pp. 219–244.
M. Vetterli and C. Herley, Wavelets and Filter Banks: Relationships and New Results, Proc. of ICASSP’90, April 3–6, 1990.
M. Vetterli and J. Kovacevic, Perfect Reconstruction Filter Banks for HDTV Representation and Coding, Signal Processing: Image Communication 2, 1990, pp. 349–363.
M. Vetterli and C. Herley, Wavelets and Filter banks: Theory and design, IEEE Trans. on Signal Processing, Vol. 40, 1992, pp. 2207–2232.
J. D. Villasenor and B. Beizer and J Liao, Filter Evaluation and Selection in Wavelet Image Compression, Proceedings Data Compression Conference, editor: J. A. Storer and M. Cohn, IEEE Computer Society Technical Committee on Computer Communications, March 29–31, Snowbird Utah, 1994, pp. 351-360.
D. Walker and K. Rao, New Techniques in Pel-Recursive Motion Estimation, Proceedings of ICC, 1984, pp. 703-706.
A. Watson, The Cortex Transform: Rapid Computation of Simulated Neural Images, Computer Vision, Graphics, Image Processing, Vol. 39, 1987, pp. 311–327.
J. Woods and S. O’Neil, Subband Coding of Images, IEEE Trans. on Acoustics, Speech and Signal Processing, Vol. 34, No. 5, October 1986.
J. Woods, Subband Image Coding, Kluwer Academic, 1991.
J. Woods and T. Naveen, A Filter Based Bit Allocation Scheme for Subband Compression of HDTV, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 1, No. 3, July 1992, pp. 436–440.
K. Xie and L. Eycken and A. Oosterilinck, Some New Techniques in Displacement Estimation Algorithms, SPIE Visual Communications and Image Processing, Vol. 1199, 1989, pp. 18–29.
S. Zafar and Y. Zhang and J. Baras, Predictive Block-Matching Motion Estimation for TV Coding — Part I: Inter-block Prediction, IEEE Trans. on Broadcasting, Vol. 37, No. 3, September 1991, pp. 97–101.
Y. Zhang and S. Zafar, Predictive Block-Matching Motion Estimation for TV Coding — Part II: Inter-Frame Prediction, IEEE Trans. on Broadcasting, Vol. 37, No. 3, September 1991, pp. 101–105.
S. Zafar and Y. Zhang and J. Baras, Predictive Block-Matching Motion Estimation Schemes for Video Compression, Proceedings of IEEE Int’l Conference on Consumer Electronics, Chicago IL, June 6–8, 1991, pp. 300-301.
Y. Zhang and S. Zafar, Motion-Compensated Wavelet Transform Coding for Color Video Compression, SPIE Visual Communications and Image Processing’ 91: Visual Communication, Vol. 1605, November 1991, pp. 301–316.
Y. Zhang and S. Zafar, Motion-Compensated Wavelet Transform Coding for Color Video Compression, IEEE Trans. on Circuit and Systems for Video Technology, Vol. 2, No. 3, September 1992, pp. 285–296.
S. Zafar and Y. Zhang and B. Jabbari, Multiscale Video Representation Using Multi-Resolution Motion Compensation and Wavelet Decomposition, IEEE Journal of Selected Areas in Communications, January 1993.
B. Jabbari and F. Yegenoglu and Y. Kuo and S. Zafar and Y. Zhang, Statistical Characterization and Block-Based Modeling of Motion Adaptive Coded Video, IEEE Trans. on Circuit and Systems for Video Technology, April 1993.
Sohail Zafar, Motion Estimation and Encoding Algorithms for Hierarchical Representation of Digital Video, George Mason University 1994.
FCC ACATS Document SS/WP1-0193, Digital Spectrum-Compatible HDTV: Technical Details, Zenith Electronics Corporation and AT&T Bell Laboratories, 1991.
W. Zettler and J. Huffman and D. Linden, Application of Compactly Supported Wavelets to Image Compression, SPIE Image Processing Algorithms and Techniques, Santa Clara, CA, Vol. 1244, February 1990, pp. 150-160.
J. Joen and J. Kim, On the Hierarchical Edge-Based Block Motion Estimation for Video Subband Coding at Low Bit Rates, SPIE Visual Communications and Image Processing, Boston, MA, Vol. 2094, November 9-13, 1993.
S. Panchanathan, E. Chan, and X. Wang, Fast Multiresolution Motion Estimation Scheme for a Wavelet Transform Video Coder, SPIE Visual Communications and Image Processing,(Also IEEE Trans. Image Processing), Chicago, IL, Vol. 2308, September 25–29, 1994, pp. 671-681.
K. Goh, J. Soragham, and T. Durrani, Multiresolution Based Algorithms for Low Bit Rate Image Coding, IEEE Int. Conf. Image Processing, Austin, TX, Vol. 3, November 13-16, 1994, pp. 285–289.
R.J. Clarke and Y. Wang, Multiresolution Motion Estimation Scheme for Very Low Bit Rate Video Coding, Very Low Bit Rate Video Workshop, Colchester, UK, Vol. 1, May 25-29, 1994, pp. 671–681.
H. Oh, Y. Baek, G. Kim, G. Park, H. Lee, and J. Jeon, Very Low Bitrate Video Coding Wavelet Decomposition, ISO/IEC JTC1/SC29/WG11, MPEG94/392, Singapore, November 25-29, 1994, pp. 1-10.
W. Li and Y.-Q. Zhang, Vector-Based Signal Processing and Quantization for Image and Video Compression, Proceedings of IEEE, Vol. 83, February 1995, pp. 671–681.
J. Shapiro, Embedded Image Coding Using Zerotrees of Wavelet Coefficients, IEEE Trans. Signal Processing, Vol. 41, December 1993, pp. 3443–3463.
M. Bierling, Displacement Estimation by Hierarchical Block Matching, SPIE Visual Communications and Image Processing, Vol. 1001, November 1988, pp. 942–951.
H. Gharavi, Subband Coding Algorithms for Video Applications: Videophone to HDTV Conferencing, IEEE Trans. Circuits and Systems for Video Tech., Vol. 1, June 1991, pp. 174–183.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 1997 Springer Science+Business Media New York
About this chapter
Cite this chapter
Zhang, YQ., Zafar, S. (1997). Wavelet-Based Video Compression. In: Li, H.H., Sun, S., Derin, H. (eds) Video Data Compression for Multimedia Computing. The Springer International Series in Engineering and Computer Science, vol 378. Springer, Boston, MA. https://doi.org/10.1007/978-1-4615-6239-9_1
Download citation
DOI: https://doi.org/10.1007/978-1-4615-6239-9_1
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-7862-4
Online ISBN: 978-1-4615-6239-9
eBook Packages: Springer Book Archive