A Depth Map Generation Algorithm Based on Saliency Detection for 2D to 3D Conversion

Yang, Yizhong; Hu, Xionglou; Wu, Nengju; Wang, Pengfei; Xu, Dong; Rong, Shen

doi:10.1007/s13319-017-0138-7

3DR Express
Published: 02 August 2017

Volume 8, article number 29, (2017)

3D Research

Yizhong Yang ORCID: orcid.org/0000-0001-7115-2384¹,
Xionglou Hu¹,
Nengju Wu¹,
Pengfei Wang¹,
Dong Xu¹ &
Shen Rong¹

Abstract

In recent years, 3D movies have attracted increasing attention because of their immersive stereoscopic experience. However, the supply of 3D movies is still insufficient, so estimating depth information from a video for 2D to 3D conversion is increasingly important. In this paper, we present a novel algorithm that estimates depth information from a video by means of scene classification. To obtain perceptually reliable depth information for viewers, the algorithm first classifies images into three categories: landscape type, close-up type, and linear perspective type. For a landscape type image, a specific algorithm divides the image into many blocks and assigns depth values according to the relative height cue. For a close-up type image, a saliency-based method enhances the foreground, and the result is combined with the global depth gradient to generate the final depth map. For a linear perspective type image, vanishing line detection yields the vanishing point, which is regarded as the farthest point from the viewer and is assigned the deepest depth value; every other point is assigned a depth value according to its distance from the vanishing point. Finally, after bilateral filtering, depth image-based rendering is employed to generate stereoscopic virtual views. Experiments show that the proposed algorithm achieves realistic 3D effects and yields satisfactory results, with the perception scores of anaglyph images lying between 6.8 and 7.8.
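Two of the depth assignments described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation, since the abstract gives no formulas. The function names, the 0 (farthest) to 255 (nearest) depth convention, the linear mapping from distance to depth, the top-to-bottom gradient, and the blending parameter `weight` are all assumptions made for illustration, and the saliency map is assumed to be already computed and scaled to [0, 1].

```python
import numpy as np


def depth_from_vanishing_point(height, width, vp_row, vp_col):
    """Linear perspective case: depth grows with distance from the vanishing point.

    Assumed convention: 0 is farthest from the viewer, 255 is nearest.
    """
    rows, cols = np.mgrid[0:height, 0:width]
    # Euclidean distance of every pixel from the (assumed known) vanishing point.
    dist = np.hypot(rows - vp_row, cols - vp_col)
    max_dist = dist.max()
    if max_dist == 0:  # degenerate 1x1 image
        return np.zeros((height, width), dtype=np.uint8)
    # The vanishing point maps to 0; the most distant pixel maps to 255.
    return np.round(255.0 * dist / max_dist).astype(np.uint8)


def depth_from_saliency(saliency, weight=0.5):
    """Close-up case: blend a saliency map with a global depth gradient.

    `saliency` is a 2D array in [0, 1]; `weight` is a hypothetical blending
    factor, not a parameter taken from the paper.
    """
    saliency = np.asarray(saliency, dtype=np.float64)
    height, width = saliency.shape
    # Assumed global gradient: top rows far (0), bottom rows near (1).
    gradient = np.linspace(0.0, 1.0, height)[:, None] * np.ones((1, width))
    blended = weight * saliency + (1.0 - weight) * gradient
    return np.round(255.0 * np.clip(blended, 0.0, 1.0)).astype(np.uint8)


if __name__ == "__main__":
    perspective = depth_from_vanishing_point(5, 5, vp_row=2, vp_col=2)
    print(perspective)

    toy_saliency = np.zeros((4, 6))
    toy_saliency[1:3, 2:4] = 1.0  # a salient foreground block
    print(depth_from_saliency(toy_saliency))
```

In this sketch the salient block receives larger (nearer) depth values than its surroundings, while the gradient keeps the lower part of the frame closer to the viewer. A real pipeline would smooth the resulting map, for example with a bilateral filter, before rendering virtual views.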

Acknowledgements

This work was supported by the National Natural Science Foundation of China under Grants 61401137 and 61404043, and the Fundamental Research Funds for the Central Universities under Grant No. J2014HGXJ0083.

Author information

Authors and Affiliations

School of Electronic Science and Applied Physics, Hefei University of Technology, Hefei, 230009, China
Yizhong Yang, Xionglou Hu, Nengju Wu, Pengfei Wang, Dong Xu & Shen Rong

Corresponding author

Correspondence to Yizhong Yang.

Ethics declarations

Conflict of interest

The authors declare that there is no conflict of interest regarding the publication of this paper.

About this article

Cite this article

Yang, Y., Hu, X., Wu, N. et al. A Depth Map Generation Algorithm Based on Saliency Detection for 2D to 3D Conversion. 3D Res 8, 29 (2017). https://doi.org/10.1007/s13319-017-0138-7

Received: 12 May 2017
Revised: 17 July 2017
Accepted: 27 July 2017
Published: 02 August 2017
DOI: https://doi.org/10.1007/s13319-017-0138-7
