Abstract
Depth from defocus is a ranging method that provides a depth estimate at every pixel in an image, using a pair of defocused images from a conventional monocular camera. To enable video-rate processing, which is important for industrial applications, an established algorithm was implemented on a field-programmable gate array (FPGA). A bifurcating pipeline of 2-D filters performed the depth calculation. The multiplier and SRAM facilities on the FPGA were utilised efficiently by exploiting the symmetry of the filters, and the coefficient and data bit-widths were limited to improve efficiency further. The results compared favourably with the full-width calculation. Range images were processed within 14 ms.
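The effect of limiting coefficient bit-widths can be illustrated with a small fixed-point model. This is a hedged sketch in NumPy: the filter taps and word lengths below are illustrative assumptions, not the word lengths chosen in the paper.

```python
import numpy as np

def quantize(coeffs, frac_bits):
    # Round coefficients to signed fixed point with the given number of
    # fractional bits, as one would for FPGA multiplier inputs.
    scale = 1 << frac_bits
    return np.round(np.asarray(coeffs, dtype=float) * scale) / scale

# Compare a 1-D filter response at full precision and at a limited bit-width.
c = np.array([0.1, -0.25, 0.6, -0.25, 0.1])   # illustrative symmetric filter
x = np.linspace(-1.0, 1.0, 64)                # illustrative input signal
y_full = np.convolve(x, c, mode="same")
y_q = np.convolve(x, quantize(c, 8), mode="same")
err = np.max(np.abs(y_full - y_q))            # worst-case quantisation error
```

With 8 fractional bits the worst-case output error here stays well below one per cent of the signal range, which is the kind of trade-off the implementation exploits.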
Appendices
Appendix 1
See Table 4.
Appendix 2: General triangular convolution
The following algorithm can be used to generate the partial equations used by the triangular method. For an m × m convolution where m is odd, there is eightfold symmetry. With the central pixel denoted \(I_{0,0}\), the pixels in the window are indexed \(I_{-n,-n}\) to \(I_{+n,+n}\), where \(n = (m-1)/2\). The number of partial equations, coefficients and multipliers required is \(a = \sum\nolimits_{i = 1}^{n + 1} i = (n+1)(n+2)/2\). Let the partial results be \(P_b\) and the coefficients \(C_b\), where b is an integer from 1 to a. Then, if n > 1:
As algorithm:
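As a minimal illustrative sketch of the triangular method (in NumPy, not the authors' hardware description), each partial result \(P_b\) can be formed by summing the pixels that share a coefficient under the eightfold symmetry, so only \(a = (n+1)(n+2)/2\) multiplies are needed per window:

```python
import numpy as np

def octant_indices(n):
    # Unique coefficient positions in the triangular octant: 0 <= j <= i <= n.
    # There are a = (n + 1)(n + 2) / 2 of them.
    return [(i, j) for i in range(n + 1) for j in range(i + 1)]

def symmetric_mates(i, j):
    # All window offsets sharing the coefficient at (i, j) under
    # eightfold symmetry (reflections about both axes and the diagonal).
    return {(si * p, sj * q)
            for (p, q) in ((i, j), (j, i))
            for si in (1, -1) for sj in (1, -1)}

def triangular_convolve_at(window, coeffs):
    # window: (2n+1) x (2n+1) pixel block centred on I_{0,0};
    # coeffs: dict mapping octant position (i, j) -> C_b.
    n = (window.shape[0] - 1) // 2
    out = 0.0
    for (i, j) in octant_indices(n):
        # Partial result P_b: sum of all pixels sharing coefficient C_b.
        p = sum(window[n + r, n + c] for (r, c) in symmetric_mates(i, j))
        out += coeffs[(i, j)] * p   # one multiply per unique coefficient
    return out
```

For a 5 × 5 window (n = 2) this uses a = 6 multiplies instead of 25, matching the multiplier saving the triangular method provides on the FPGA.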
Cite this article
Joseph Raj, A.N., Staunton, R.C. Video-rate calculation of depth from defocus on a FPGA. J Real-Time Image Proc 14, 469–480 (2018). https://doi.org/10.1007/s11554-014-0480-4