Abstract

Coding of stereoscopic video sources has received significant interest recently. The MPEG committee decided to form an ad hoc group to define a new profile, referred to as the Multiview Profile (MVP) [4]. The importance of multiview video representation is also recognized by the MPEG-4 committee as one of the eight functionalities to be addressed in the near future. In this paper, we first review the technical results obtained using temporal scalability (disparity analysis) in MPEG-2, as pioneered by [9] and [10]. Building on temporal scalability, the concept is then generalized to an affine transformation that accounts for the deformation and foreshortening caused by the change of viewpoint. Estimation of the affine parameters is crucial to the performance of the estimator. We propose a novel technique to find a convergent solution that yields the least mean square error. Our results show that about 40 percent of the macroblocks in a picture benefit from the affine transformation. In our approach, the additional computational complexity is minimal because a pyramidal scheme is used. In one of our experiments, only four iterations were necessary to find a convergent solution. The improvement in prediction gain is found to be around 0.77 dB.
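
To make the affine model concrete, the following minimal sketch fits the six parameters of a 2-D affine map to a set of point correspondences by ordinary least squares. It is an illustration only and not the estimator proposed in the chapter, which works on picture data with a pyramidal, iterative search. The function names `solve3` and `fit_affine`, the parameter ordering `a1..a6`, and the sample coordinates are all assumptions introduced here. A pure horizontal shift, as in block-based disparity compensation, is the special case a1 = a5 = 1, a2 = a4 = a6 = 0.

```python
"""Least-squares fit of a 2-D affine map to point correspondences (illustrative)."""


def solve3(m, b):
    """Solve the 3x3 system m x = b by Gauss-Jordan elimination with pivoting."""
    a = [row[:] + [rhs] for row, rhs in zip(m, b)]  # augmented matrix
    for col in range(3):
        # Pick the row with the largest entry in this column as the pivot.
        pivot = max(range(col, 3), key=lambda r: abs(a[r][col]))
        if abs(a[pivot][col]) < 1e-12:
            raise ValueError("degenerate point configuration (collinear points?)")
        a[col], a[pivot] = a[pivot], a[col]
        # Eliminate this column from every other row.
        for r in range(3):
            if r != col:
                f = a[r][col] / a[col][col]
                a[r] = [u - f * v for u, v in zip(a[r], a[col])]
    return [a[i][3] / a[i][i] for i in range(3)]


def fit_affine(src, dst):
    """Return (a1, ..., a6) minimising the summed squared error of
    x' = a1*x + a2*y + a3 and y' = a4*x + a5*y + a6 over all point pairs."""
    # Normal equations (P^T P) p = P^T t, with one row [x, y, 1] per point.
    ptp = [[0.0] * 3 for _ in range(3)]
    ptx = [0.0] * 3
    pty = [0.0] * 3
    for (x, y), (xd, yd) in zip(src, dst):
        row = (x, y, 1.0)
        for i in range(3):
            ptx[i] += row[i] * xd
            pty[i] += row[i] * yd
            for j in range(3):
                ptp[i][j] += row[i] * row[j]
    # The x' and y' equations share the same matrix and are solved separately.
    return tuple(solve3(ptp, ptx) + solve3(ptp, pty))


if __name__ == "__main__":
    # Hypothetical corners of a 16x16 macroblock and their matches in the other view.
    corners = [(0.0, 0.0), (16.0, 0.0), (0.0, 16.0), (16.0, 16.0)]
    matches = [(0.9 * x + 0.05 * y - 3.0, y + 0.5) for x, y in corners]
    print(fit_affine(corners, matches))
```

With exact correspondences such as the hypothetical ones above, the fit recovers the generating parameters up to rounding; with noisy correspondences it returns the parameters that minimise the squared position error.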

REFERENCES

  1. ISO, Information Technology-Generic Coding of Moving Pictures and Associated Audio Information: Video, Recommendation H.262, (Paris), May 1994.

  2. F. Chassaing, B. Choquet, D. Pele, “A stereoscopic television system (3D-TV) and compatible transmission on a MAC channel (3D-MAC)”, Image Communication, Nov. 1991.

  3. International Organisation for Standardisation, “Report of the ad hoc group on MPEG-2 applications for multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 861, March 1995.

  4. International Organisation for Standardisation, “Status Report on the study of Multi-viewpoint pictures”, ISO/IEC JTC/SC29/WG11 No. 906, March 1995.

  5. A. Zakhor, F. Lari, “Edge-Based 3-D Camera Motion Estimation with Application to Video Coding”, IEEE Trans. on Image Processing, Vol. 2, No. 4, E 1993.

  6. Roger Y. Tsai, Thomas S. Huang, “Uniqueness and Estimation of Three-Dimensional Motion Parameters of Rigid Objects with Curved Surfaces”, IEEE Trans. on PAMI, Vol. 6, No. 1, Jan. 1984.

  7. Randall B. Perlow, “The Application of Stereoscopic Techniques to High Resolution Radar Images for Improved Detection of Targets in Clutter”, Ph.D. Dissertation, University of Pennsylvania, 1994.

  8. Tihao Chiang, “Hierarchical Coding of Digital Television”, Ph.D. Dissertation, Columbia University, 1995.

  9. A. Puri, V. Kollarits, B.G. Haskell, “Stereoscopic Video Compression using temporal scalability”, SPIE Visual Communications and Image Processing, Taipei, Taiwan, May 1995.

  10. B. L. Tseng, D. Anastassiou, “Compatible video coding of stereoscopic sequences using MPEG-2’s scalability and interlaced structure”, Workshop on HDTV’94, Torino, Oct. 1994.

Copyright information

© 1996 Plenum Press

About this chapter

Cite this chapter

Chiang, T., Zhang, YQ. (1996). Stereoscopic Video Coding. In: Wang, Y., Panwar, S., Kim, SP., Bertoni, H.L. (eds) Multimedia Communications and Video Coding. Springer, Boston, MA. https://doi.org/10.1007/978-1-4613-0403-6_45

  • DOI: https://doi.org/10.1007/978-1-4613-0403-6_45

  • Publisher Name: Springer, Boston, MA

  • Print ISBN: 978-1-4613-8036-8

  • Online ISBN: 978-1-4613-0403-6

  • eBook Packages: Springer Book Archive
