Recovering Depth Map from Video with Moving Objects

Chen, Hsiao-Wei; Lai, Shang-Hong

doi:10.1007/978-3-642-25346-1_30

Part of the book series: Lecture Notes in Computer Science (LNIP, volume 7088)

Included in the following conference series:

Pacific-Rim Symposium on Image and Video Technology

Abstract

In this paper, we propose a novel approach to reconstructing a depth map from a video sequence that considers both geometric coherence and temporal coherence. Most previous methods for reconstructing depth maps from video assume rigid motion, so they cannot provide satisfactory depth estimates for regions containing moving objects. In this work, we develop a depth estimation algorithm that detects regions of moving objects and recovers the depth map in a Markov random field framework. We first apply SIFT matching across the frames of the video sequence and compute the camera parameters for all frames and the 3D positions of the SIFT feature points via structure from motion. The 3D depths at these SIFT points are then propagated to the whole image, based on image over-segmentation, to construct an initial depth map. Next, the depth values of segments with large re-projection errors are refined by minimizing the corresponding re-projection errors. In addition, we detect the areas of moving objects from the remaining pixels with large re-projection errors. In the final step, we optimize the depth map estimate in a Markov random field framework. Experimental results demonstrate the improved depth estimation of the proposed algorithm.
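As a rough illustration of two steps named in the abstract, the sketch below propagates sparse feature depths to over-segmentation segments and re-projects a pixel with known depth into another frame. It is not the authors' implementation: the function names, the use of the per-segment median, the shared intrinsic matrix `K`, and the convention that `R`, `t` map frame-i camera coordinates to frame-j camera coordinates are all assumptions made for this example.

```python
import numpy as np

def propagate_sparse_depth(labels, points_xy, depths):
    """Assign each segment the median depth of the sparse points inside it.

    labels:    (H, W) integer segment ids from any over-segmentation
    points_xy: (N, 2) integer pixel coordinates (x, y) of feature points
    depths:    (N,) depths of those points, e.g. from structure from motion
    Segments that contain no feature point are left as NaN.
    The median is an assumption; the abstract does not specify the rule.
    """
    depth_map = np.full(labels.shape, np.nan, dtype=float)
    # Segment id under each feature point (row index is y, column index is x).
    point_labels = labels[points_xy[:, 1], points_xy[:, 0]]
    for seg_id in np.unique(point_labels):
        seg_depth = np.median(depths[point_labels == seg_id])
        depth_map[labels == seg_id] = seg_depth
    return depth_map

def reproject(pixel_xy, depth, K, R, t):
    """Map a pixel with known depth in frame i to pixel coordinates in frame j.

    depth is the z-coordinate of the point in frame-i camera coordinates.
    K is a 3x3 intrinsic matrix, assumed identical for both frames.
    """
    x, y = pixel_xy
    ray = np.linalg.inv(K) @ np.array([x, y, 1.0])  # back-projected ray, z = 1
    point_i = depth * ray          # 3D point in frame-i camera coordinates
    point_j = R @ point_i + t      # same point in frame-j camera coordinates
    proj = K @ point_j
    return proj[:2] / proj[2]      # perspective division
```

The distance between `reproject(...)` and the position where the pixel is actually matched in frame j gives a re-projection error. Under the rigid-scene assumption this error should be small for static regions, which is why large residual errors are a plausible cue for moving objects.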

Author information

Authors and Affiliations

Computer Science, National Tsing Hua University, No. 101, Section 2, Kuang-Fu Road, Hsinchu, Taiwan, 30013, R.O.C.
Hsiao-Wei Chen & Shang-Hong Lai

Editor information

Editors and Affiliations

Gwangju Institute of Science and Technology (GIST), 1 Oryong-dong Buk-gu, 500-712, Gwangju, South Korea
Yo-Sung Ho

Cite this paper

Chen, HW., Lai, SH. (2011). Recovering Depth Map from Video with Moving Objects. In: Ho, YS. (eds) Advances in Image and Video Technology. PSIVT 2011. Lecture Notes in Computer Science, vol 7088. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-642-25346-1_30

DOI: https://doi.org/10.1007/978-3-642-25346-1_30
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-642-25345-4
Online ISBN: 978-3-642-25346-1
eBook Packages: Computer Science, Computer Science (R0)
