Consistent depth maps estimation from binocular stereo video sequence

Journal of Shanghai Jiaotong University (Science)

Abstract

This paper proposes an approach to consistent depth map estimation from binocular stereo video sequences. The method enforces both temporal and spatial consistency in order to eliminate flickering artifacts and inaccurate smoothing in depth recovery. To this end, an improved global stereo matching method based on graph cuts and energy optimization is implemented. In the temporal domain, a penalty function with a coherence factor is introduced to enforce temporal consistency; the factor is determined by a Lucas-Kanade optical flow weighted histogram similarity constraint (LKWHSC). In the spatial domain, the joint bilateral truncated absolute difference (JBTAD) is proposed for segmentation smoothing. It smooths naturally and uniformly in low-gradient regions while avoiding over-smoothing and preserving edge sharpness at high-gradient discontinuities, thereby achieving spatial consistency. Experimental results show that the algorithm produces depth maps with better spatial and temporal consistency than existing algorithms.
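The abstract does not give the exact form of the energy, so the following is only a minimal sketch of the kind of energy such a method could minimize, under stated assumptions. It combines a data term, a spatial term built from truncated absolute disparity differences weighted by a joint bilateral kernel on a guide image, and a temporal penalty scaled by a per-pixel coherence factor. All function names, parameters (`tau`, `sigma_c`, `sigma_s`, `lam_s`, `lam_t`) and default values are hypothetical and are not taken from the paper. The previous-frame disparity is assumed to be already motion-compensated, and the coherence factor is passed in as an input rather than derived from LKWHSC. The code only evaluates the energy of a given labelling; it does not perform the graph-cut minimization.

```python
import math


def truncated_abs_diff(d_p, d_q, tau):
    """Absolute disparity difference, capped at tau (assumed truncation threshold)."""
    return min(abs(d_p - d_q), tau)


def bilateral_weight(i_p, i_q, dist, sigma_c, sigma_s):
    """Near 1 for nearby pixels with similar guide intensity, near 0 across strong edges."""
    return math.exp(-((i_p - i_q) ** 2) / (2.0 * sigma_c ** 2)
                    - (dist ** 2) / (2.0 * sigma_s ** 2))


def spatial_cost(disp, guide, tau=2.0, sigma_c=10.0, sigma_s=1.0):
    """Sum of bilateral-weighted truncated differences over 4-connected neighbour pairs."""
    h, w = len(disp), len(disp[0])
    total = 0.0
    for y in range(h):
        for x in range(w):
            # Visit only the right and lower neighbour so each pair is counted once.
            for dy, dx in ((0, 1), (1, 0)):
                ny, nx = y + dy, x + dx
                if ny < h and nx < w:
                    wgt = bilateral_weight(guide[y][x], guide[ny][nx],
                                           1.0, sigma_c, sigma_s)
                    total += wgt * truncated_abs_diff(disp[y][x], disp[ny][nx], tau)
    return total


def temporal_cost(disp, prev_disp, coherence, tau=2.0):
    """Penalty for deviating from the (assumed motion-compensated) previous frame.

    coherence[y][x] is an assumed per-pixel factor in [0, 1]; 0 disables the penalty.
    """
    total = 0.0
    for row, prev_row, coh_row in zip(disp, prev_disp, coherence):
        for d, d_prev, c in zip(row, prev_row, coh_row):
            total += c * truncated_abs_diff(d, d_prev, tau)
    return total


def total_energy(data_cost, disp, guide, prev_disp, coherence, lam_s=1.0, lam_t=1.0):
    """Data term plus weighted spatial and temporal terms.

    data_cost[y][x][d] is an assumed precomputed matching cost for integer label d.
    """
    h, w = len(disp), len(disp[0])
    data = sum(data_cost[y][x][disp[y][x]] for y in range(h) for x in range(w))
    return (data
            + lam_s * spatial_cost(disp, guide)
            + lam_t * temporal_cost(disp, prev_disp, coherence))
```

In this sketch the bilateral weight collapses toward zero where the guide image has a strong intensity edge, so a disparity jump aligned with that edge costs almost nothing, whereas the same jump inside a uniform region is penalized up to the truncation threshold. Setting a pixel's coherence factor to zero removes its temporal penalty, which is one plausible way to handle pixels whose appearance changed between frames.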

Author information

Corresponding author

Correspondence to Fengfeng Duan (段峰峰).

Additional information

Foundation item: the Science and Technology Innovation Project of Ministry of Culture of China (No. 2014KJCXXM08), the National Key Technology Research and Development Program of the Ministry of Science and Technology of China (No. 2012BAH37F02), and the National High Technology Research and Development Program (863) of China (No. 2011AA01A107)

About this article

Cite this article

Duan, F. Consistent depth maps estimation from binocular stereo video sequence. J. Shanghai Jiaotong Univ. (Sci.) 21, 184–191 (2016). https://doi.org/10.1007/s12204-016-1710-7
