Abstract
Coding of stereoscopic video sources has received significant interest recently. The MPEG committee decided to form an ad hoc group to define a new profile, referred to as the Multiview Profile (MVP) [4]. The importance of multiview video representation is also recognized by the MPEG-4 committee as one of the eight functionalities to be addressed in the near future. In this paper, we first review the technical results obtained using temporal scalability (disparity analysis) in MPEG-2, as pioneered by [9] and [10]. Building on temporal scalability, the concept is then generalized to an affine transformation that accounts for the deformation and foreshortening caused by the change of viewpoint. Estimation of the affine parameters is crucial to the performance of the estimator. We propose a novel technique for finding a convergent solution that yields the least mean square error. Our results show that about 40 percent of the macroblocks in a picture benefit from the affine transformation. The additional computational complexity of our approach is minimal because a pyramidal scheme is used. In one of our experiments, only four iterations were necessary to find a convergent solution. The improvement in prediction gain is found to be around 0.77 dB.
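To make the idea of a least-squares affine fit concrete, the following is a minimal sketch and not the estimator proposed in the chapter, whose details the abstract does not give. It assumes a six-parameter model x' = a1·x + a2·y + a3, y' = a4·x + a5·y + a6 and assumes that point correspondences between the two views are already available; the function names `solve3`, `fit_affine`, and `apply_affine` are hypothetical. The iterative and pyramidal aspects mentioned in the abstract are not modelled here.

```python
def solve3(a, b):
    """Solve a 3x3 linear system a*x = b by Gaussian elimination with partial pivoting."""
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]  # augmented copy, inputs untouched
    for col in range(3):
        pivot = max(range(col, 3), key=lambda r: abs(m[r][col]))
        if abs(m[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration (collinear or too few points)")
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, 3):
            f = m[r][col] / m[col][col]
            for c in range(col, 4):
                m[r][c] -= f * m[col][c]
    x = [0.0, 0.0, 0.0]
    for i in (2, 1, 0):  # back substitution
        s = m[i][3] - sum(m[i][j] * x[j] for j in range(i + 1, 3))
        x[i] = s / m[i][i]
    return x


def fit_affine(src, dst):
    """Least-squares fit of [a1..a6] mapping src points onto dst points.

    Assumed model (illustrative): x' = a1*x + a2*y + a3, y' = a4*x + a5*y + a6.
    """
    # Normal equations (A^T A) p = A^T b, where each row of A is [x, y, 1].
    ata = [[0.0] * 3 for _ in range(3)]
    atb_x = [0.0] * 3
    atb_y = [0.0] * 3
    for (x, y), (u, v) in zip(src, dst):
        row = (x, y, 1.0)
        for i in range(3):
            for j in range(3):
                ata[i][j] += row[i] * row[j]
            atb_x[i] += row[i] * u
            atb_y[i] += row[i] * v
    return solve3(ata, atb_x) + solve3(ata, atb_y)


def apply_affine(p, point):
    """Map one (x, y) point through the affine parameters p = [a1..a6]."""
    x, y = point
    return (p[0] * x + p[1] * y + p[2], p[3] * x + p[4] * y + p[5])
```

As a usage example, passing the four corners of a 16×16 macroblock as `src` and their matched positions in the other view as `dst` returns the six parameters that minimize the squared position error over those correspondences.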
REFERENCES
[1] ISO, Information Technology - Generic Coding of Moving Pictures and Associated Audio Information: Video, Recommendation H.262, Paris, May 1994.
[2] F. Chassaing, B. Choquet, D. Pele, “A stereoscopic television system (3D-TV) and compatible transmission on a MAC channel (3D-MAC)”, Image Communication, Nov. 1991.
[3] International Organisation for Standardisation, “Report of the ad hoc group on MPEG-2 applications for multi-viewpoint pictures”, ISO/IEC JTC1/SC29/WG11 No. 861, March 1995.
[4] International Organisation for Standardisation, “Status Report on the study of Multi-viewpoint pictures”, ISO/IEC JTC1/SC29/WG11 No. 906, March 1995.
[5] A. Zakhor, F. Lari, “Edge-Based 3-D Camera Motion Estimation with Application to Video Coding”, IEEE Trans. on Image Processing, Vol. 2, No. 4, 1993.
[6] R. Y. Tsai, T. S. Huang, “Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces”, IEEE Trans. on PAMI, Vol. 6, No. 1, Jan. 1984.
[7] R. B. Perlow, “The Application of Stereoscopic Techniques to High Resolution Radar Images for Improved Detection of Targets in Clutter”, Ph.D. dissertation, University of Pennsylvania, 1994.
[8] T. Chiang, “Hierarchical Coding of Digital Television”, Ph.D. dissertation, Columbia University, 1995.
[9] A. Puri, V. Kollarits, B. G. Haskell, “Stereoscopic Video Compression using temporal scalability”, SPIE Visual Communications and Image Processing, Taipei, Taiwan, May 1995.
[10] B. L. Tseng, D. Anastassiou, “Compatible video coding of stereoscopic sequences using MPEG-2’s scalability and interlaced structure”, Workshop on HDTV’94, Torino, Oct. 1994.
© 1996 Plenum Press
About this chapter
Cite this chapter
Chiang, T., Zhang, YQ. (1996). Stereoscopic Video Coding. In: Wang, Y., Panwar, S., Kim, SP., Bertoni, H.L. (eds) Multimedia Communications and Video Coding. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0403-6_45
DOI: https://doi.org/10.1007/978-1-4613-0403-6_45
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-8036-8
Online ISBN: 978-1-4613-0403-6
eBook Packages: Springer Book Archive