Skip to main content
Log in

Recursive 3-D Visual Motion Estimation Using Subspace Constraints

  • Published:
International Journal of Computer Vision Aims and scope Submit manuscript

Abstract

The 3-D motion of a camera within a static environment produces a sequence of time-varying images that can be used for reconstructing the relative motion between the scene and the viewer. The problem of reconstructing rigid motion from a sequence of perspective images may be characterized as the estimation of the state of a nonlinear dynamical system, which is defined by the rigidity constraint and the perspective measurement map. The time-derivative of the measured output of such a system, which is called the “2-D motion field” and is approximated by the “optical flow”, is bilinear in the motion parameters, and may be used to specify a subspace constraint on the direction of heading independent of rotation and depth, and a pseudo-measurement for the rotational velocity as a function of the estimated heading. The subspace constraint may be viewed as an implicit dynamical model with parameters on a differentiable manifold, and the visual motion estimation problem may be cast in a system-theoretic framework as the identification of such an implicit model. We use techniques which pertain to nonlinear estimation and identification theory to recursively estimate 3-D rigid motion from a sequence of images independent of the structure of the scene. Such independence from scene-structure allows us to deal with a variable number of visible feature-points and occlusions in a principled way. The further decoupling of the direction of heading from the rotational velocity generates a filter with a state that belongs to a two-dimensional and highly constrained state-space. As a result, the filter exhibits robustness properties which are highlighted in a series of experiments on real and noisy synthetic image sequences. While the position of feature-points is not part of the state of the model, the innovation process of the filter describes how each feature is compatible with a rigid motion interpretation, which allows us to test for outliers and makes the filter robust with respect to errors in the feature tracking/optical flow, reflections, T-junctions. Once motion has been estimated, the 3-D structure of the scene follows easily. By releasing the constraint that the visible points lie in front of the viewer, one may explain some psychophysical effects on the nonrigid percept of rigidly moving objects.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Similar content being viewed by others

References

  • Adiv, G. 1985. Determining three-dimensional motion and structure from optical flow generated by several moving objects. IEEE Trans.Pattern Anal.Mach.Intell.

  • Azarbayejani, A., Horowitz, B., and Pentland, A. 1993. Recursive estimation of structure and motion using relative orientation constraints. Proc.IEEE Conf.Comp.Vision Pattern Recognition, New York.

  • Bouguet, J.-Y. and Perona, P. 1995. A visual odometer and gyroscope. Proc.of the 5 IEEE Int.Conf.Comp.Vision.

  • Broida, T. and Chellappa, R. 1986. Estimation of object motion parameters from noisy images. IEEE Trans.PAMI.

  • Bryant, R. L., Chern, S. S., Gardner, R. B., Goldshmidt, H. L., and Griffith, P. A. 1991. Exterior Differential Systems. Mathematical Research Institute. Springer Verlag.

  • Bucy, R. S. 1965. Non-linear filtering theory. IEEE Trans.A.C. AC-10, 198.

  • Dickmanns, E. D. 1994. Historical development of use of dynamical models for the representation of knowledge about real-world processes in machine vision. Signal Processing, 35(3):305–306.

    Google Scholar 

  • Gennery, D. B. 1982. Tracking known 3-dimensional object. In Proc. AAAI 2nd Natl.Conf.Artif.Intell., Pittsburg, PA, pp. 13–17.

  • Gennery, D. B. 1992. Visual tracking of known 3-dimensional object. Int.J.of Computer Vision, 7(3):243–270.

    Google Scholar 

  • Gibson, E. J., Gibson, J. J., Smith, O. W., and Flock, H. 1959. Motion parallax as a determinant of perceived depth. J.Exp.Psych., Vol. 45.

  • Heeger, D. and Jepson, A. 1992. Subspace methods for recovering rigid motion i: Algorithm and implementation. Int.J.Computer Vision, 7(2).

  • Heel, J. 1990. Direct estimation of structure and motion from multiple frames. AI Memo 1190, MIT AI Lab.

  • Horn, B. K. P. 1990. Relative orientation. Int.J.of Computer Vision, 4:59–78.

    Google Scholar 

  • Isidori, A. 1989. Nonlinear Control Systems. Springer Verlag.

  • Jazwinski, A. H. 1970. Stochastic Processes and Filtering Theory. Academic Press.

  • Jepson, A. and Heeger, D. 1991. Subspace methods for recovering rigid motion ii: Theory. RBCV TR-90-35, University of Toronto.

  • Kalman, R. E. 1960. A new approach to linear filtering and prediction problems. Trans.of the ASME-Journal of Basic Engineering, pp. 35–45.

  • Kolb, C., Braun, J., and Perona, P. 1994. Object segmentation and 3d structure from motion. In Invest.Ophtalmol.Vis.Sci. (Supplement), p. 1275.

    Google Scholar 

  • Longuet-Higgins, H. C. 1981. A computer algorithm for reconstructing a scene from two projections. Nature, 293:133–135.

    Google Scholar 

  • Lucas, B. D. and Kanade, T. 1981. An iterative image registration technique with an application to stereo vision. Proc.7th Int.Joint Conf.on Art.Intell.

  • Matthies, L., Szeliski, R., and Kanade, T. 1989. Kalman filter-based algorithms for estimating depth from image sequences. Int.J.of Computer Vision.

  • McLauchlan, P., Reid, I., and Murray, D. 1994. Recursive affine structure and motion from image sequences. Proc.of the 3 ECCV.

  • Oliensis, J. and Inigo-Thomas, J. 1992. Recursive multi-frame structure from motion incorporating motion error. Proc.DARPA Image Understanding Workshop.

  • Soatto, S. 1997. 3-D Structure from visual motion: modeling, representation and observability. Automatica (in press).

  • Soatto, S., Perona, P., Frezza, R., and Picci, G. 1993. Recursive motion and structure estimation with complete error characterization. In Proceedings of the IEEE Conf.on Computer Vision and Pattern Recognition, CVPR, New York, pp. 428–433.

  • Soatto, S. and Perona, P. 1994. Three dimensional transparent structure segmentation and multiple 3d motion estimation from monocular perspective image sequences. In IEEE Workshop on Motion of Nonrigid and Articulated Objects, Austin, IEEE Computer Society, pp. 228–235.

    Google Scholar 

  • Soatto, S., Frezza, R., and Perona, P. 1994. Motion estimation on the essential manifold. In Proc.3rd Europ.Conf.Comput.Vision, J.-O. Eklundh (Ed.), LNCS-Series, Springer-Verlag: Stockholm, 800–801:II-61–II-72.

  • Soatto, S., Frezza, R., and Perona, P. 1995. Structure from visual motion as a nonlinear observation problem. In Proceedings of the IFAC Symposium on Nonlinear Control Systems NOLCOS, Tahoe City.

  • Soatto, S., Frezza, R., and Perona, P. 1996. Motion estimation via dynamic vision. IEEE Trans.on Automatic Control, 41(3):393–414.

    Google Scholar 

  • Soderstorm, T. and Stoica, P. 1989. System Identification. Prentice Hall.

  • Von Helmholtz, H. 1910. Treatise on Physiological Optics.

  • Young, G. S. and Chellappa, R. 1990. 3-d motion estimation using a sequence of noisy stereo images-models, estimation and uniqueness results. IEEE Trans.Pattern Anal.Mach.Intell.

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Stefano Soatto.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Soatto, S., Perona, P. Recursive 3-D Visual Motion Estimation Using Subspace Constraints. International Journal of Computer Vision 22, 235–259 (1997). https://doi.org/10.1023/A:1007930700152

Download citation

  • Issue Date:

  • DOI: https://doi.org/10.1023/A:1007930700152

Navigation