Machine Vision and Applications

, Volume 9, Issue 2, pp 43–55 | Cite as

Depth extraction using a single moving camera: an integration of depth from motion and depth from stereo

  • Arun K. DalmiaEmail author
  • Mohan Trivedi


An integrated approach to extract depth, efficiently and accurately, from a sequence of images is presented in this paper. The method combines the ability of the stereo processing to acquire highly accurate depth measurements and the efficiency of spatial and temporal gradient analysis. As a result of this integration, depth measurements of high quality are obtained at a speed approximately ten times greater than that of stereo processing. Without any a priori information of the locations of the points in the scene, the correspondence problem in stereo processing is computationally expensive. In our approach, we use spatial and temporal gradient (STG) analysis, which has been shown to provide depth with great efficiency, but limited accuracy, to guide the matching process of stereo. The camera motion used in the approach can be either lateral or axial. Extensive experiments on real scenes have shown the ability of the integrated approach to acquire depth with a mean error of less than 3%.

Key words

Depth Structure Integration Motion Stereo 


Unable to display preview. Download preview PDF.

Unable to display preview. Download preview PDF.


  1. 1.
    Besl PJ (1988) Active, optical range imaging sensors. Machine Vision Appl 1:127–152Google Scholar
  2. 2.
    Mitiche A (1988) Motion understanding: robot and human vision, pp 81–99. Kluwer Academic, Boston, MAGoogle Scholar
  3. 3.
    Grosso E, Sandini G, Tistarelli M (1989) 3-D Object reconstruction using stereo and motion. IEEE Trans Syst Man Cybernetics 19:1465–1476Google Scholar
  4. 4.
    Symosek PF, Bhanu B, Snyder S, Roberts B (1990) Motion and binocular stereo for passive ranging. Proceedings, Image Understanding Workshop, Pittsburgh, PA, September 11–13, 1990, Morgan Kaufman, San Mateo, CA, pp 358–363Google Scholar
  5. 5.
    Vernon D, Tistarelli M (1990) Using camera motion to estimate range for robotic parts manipulation. IEEE Trans Robotics Automation 6:509–521Google Scholar
  6. 6.
    Sridhar B, Suorsa R (1990) Integration of motion and stereo sensors in passive ranging systems. The American Control ConferenceGoogle Scholar
  7. 7.
    Horn BKP (1986) Robot vision. MIT Press, Cambridge, MAGoogle Scholar
  8. 8.
    Dhond UR, Aggarwal JK (1989) Structure from stereo — a review. IEEE Trans Syst Man Cybernetics 19:1489–1510Google Scholar
  9. 9.
    Nevatia R (1976) Depth measurement by motion stereo. Comput Graph Image Processing 5:203–214Google Scholar
  10. 10.
    Bolles RC, Baker HH, Hannah MJ (1993) The JISCT stereo evaluation. Proceedings, [ARPA] Image Understanding Workshop, Washington, DC, April 18–21, 1993, Morgan Kaufman, San Mateo, CA, pp 263–274Google Scholar
  11. 11.
    Skifstad K, Jain R (1989) Range estimation from intensity gradient analysis. Machine Vision Appl 2:81–102Google Scholar
  12. 12.
    Horn BKP, Schunk BG (1981) Determining optical flow. Artif Intell 17:185–203Google Scholar
  13. 13.
    Nguyen TC, Huang TS (1992) Quantization errors in axial motion stereo on rectangular-tessellated image sensors. Proceedings of the International Conference on Pattern Recognition, pp 13–16Google Scholar
  14. 14.
    Press WH, Flannery BP, Teukolsky SA, Vetterling WI (1988) Numerical recipes in C. Cambridge University Press, CambridgeGoogle Scholar
  15. 15.
    Carmona C, Ayache A, Krey C (1990) A study of an axial stereovision system. Proceedings of International Conference on Automation, Robotics and Computer Vision, Singapore, pp 1101–1106Google Scholar
  16. 16.
    Marapane S, Trivedi MM (1994) Multi-primitive hierarchical (MPH) stereo analysis. IEEE Trans Patt Anal Machine Intell 16:227–240Google Scholar
  17. 17.
    Aggarwal JK, Davis LS, Martin WN (1981) Correspondence processes in dynamic scene analysis. Proceedings of the IEEE 69:562–572Google Scholar
  18. 18.
    Dalmia AK, Trivedi MM (1993) A real time implementation of structure estimation using image streams. Proceedings of the SPIE — Sensor Fusion VI 2059:248–259Google Scholar
  19. 19.
    Dalmia AK, Trivedi MM (1995) High-speed extraction of 3D structures using image streams obtained by a single camera. Computer Vision, Graphics and Image Processing: Image Understanding, 1994. To appear 1995Google Scholar
  20. 20.
    Dalmia AK, Trivedi MM (1994) On integrating depth from motion and stereo. Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics, San Antonio, Tex, pp 61–66Google Scholar
  21. 21.
    Dalmia AK, Trivedi MM (1994) Acquisition of 3D structure of selectable quality from image streams. Proceedings of the IEEE Workshop on Applications of Computer Vision, Sarasota, Fla., pp 289–296Google Scholar
  22. 22.
    Dalmia AK, Trivedi MM (1994) Active depth extraction using image streams. Proceedings of the SPIE — Sensor Fusion and Aerospace Application II, Orlando, Fla.Google Scholar

Copyright information

© Springer-Verlag 1996

Authors and Affiliations

  1. 1.Electrical and Computer Engineering Department, Computer Vision and Robotics Research LaboratoryThe University of TennesseeKnoxvilleUSA

Personalised recommendations