Stereoscopic Video Coding

Chiang, Tihao; Zhang, Ya-Qin

doi:10.1007/978-1-4613-0403-6_45

Tihao Chiang² &
Ya-Qin Zhang²

97 Accesses
1 Citations

Abstract

Coding of the stereoscopic video source has received significant interest recently. The MPEG committee decided to form an ad hoc group to define a new profile which is referred to as Multiview Profile (MVP) [4]. The importance of multiview video representation is also recognized by the MPEG 4 committee as one of the eight functionalities to be addressed in the near future. In this paper, we will first review the technical results using temporal scalability (disparity analysis) in MPEG-2 as pioneering by [9] and [10]. Based on temporal scalability, the concept is further generalized to affine transformation to consider the deformation and foreshortening due to the change of view point. Estimation of the affine parameters is crucial for the performance of the estimator. In this paper we propose a novel technique to find a convergent solution which results in the least mean square errors. Our result shows that about 40 percent of the macroblocks in a picture has benefited by using the affine transformation. In our approach, the additional computational complexity is minimal since a pyramidal scheme is used. In one of our experiments, only four interations are necessary to find a convergent solution. The improvement in prediction gain is found to be around 0.77 dB.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 39.99; Price excludes VAT (USA)

Softcover Book: USD 54.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

REFERENCES

ISO, Information Technology-Generic Coding of Moving Pictures and Associated Audio Information: Video, Recommendation H.262, (Paris), May 1994.
Google Scholar
Chassaing F., Choquet B., Pele D., “A stereoscopic television system (3D-TV) and compatible transmission on a MAC channel (3D-MAC)”, Image Communication Nov, 1991.
Google Scholar
International Organisation for Standardisation, “Report of the ad hoc group on MPEG-2 applications for multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 861 March, 1995.
Google Scholar
International Organisation for Standardisation, “Status Report on the study of Multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 906 March, 1995.
Google Scholar
A. Zakhor, F. Lari, “Edge-Based 3-D Camera Motion Estimation with Application to Video Coding”, IEEE Trans. on Image Processing, Vol. 2, No. 4, E 1993.
Google Scholar
Roger Y. Tsai, Thomas S. Huang, “Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces”, IEEE Trans. on PAMI, Vol. 6, No. 1 Jan. 1984.
Google Scholar
Randall B. Perlow, Ph.D. Dissertation University of Pennsylvania, “The Application of Stereoscopic Techniques to High Resolution Radar Images for Improved Detection of Targets in Clutter”, 1994.
Google Scholar
Tihao Chiang, Ph.D. Dissertation Columbia University, “Hierarchical Coding of Digital Television”, 1995.
Google Scholar
A. Puri, V. Kollarits, B.G. Haskell, “Stereoscopic Video Compression using temporal scalability”, SPIE Visual Communications and Image Processing, Taipei, Taiwan, May 1995.
Google Scholar
B. L. Tseng, D. Anastassiou, “Compatible video coding of stereoscopic sequences using MPEG-2’s scalability and interlaced structure”, Workshop on HDTV’94, Torino, Oct. 1994.
Google Scholar

Download references

Author information

Authors and Affiliations

David Sarnoff Research Center, 201 Washington Road, Princeton, New Jersey, 08543, USA
Tihao Chiang & Ya-Qin Zhang

Authors

Tihao Chiang
View author publications
You can also search for this author in PubMed Google Scholar
Ya-Qin Zhang
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Polytechnic University, Brooklyn, New York, USA
Yao Wang , Shivendra Panwar , Seung-Pil Kim & Henry L. Bertoni , , &

Rights and permissions

Reprints and permissions

Copyright information

About this chapter

Cite this chapter

Chiang, T., Zhang, YQ. (1996). Stereoscopic Video Coding. In: Wang, Y., Panwar, S., Kim, SP., Bertoni, H.L. (eds) Multimedia Communications and Video Coding. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0403-6_45

Download citation

DOI: https://doi.org/10.1007/978-1-4613-0403-6_45
Publisher Name: Springer, Boston, MA
Print ISBN: 978-1-4613-8036-8
Online ISBN: 978-1-4613-0403-6
eBook Packages: Springer Book Archive

Publish with us

Policies and ethics