3-D Modeling of an Outdoor Scene from Multiple Image Sequences by Estimating Camera Motion Parameters

  • Tomokazu Sato
  • Masayuki Kanbara
  • Naokazu Yokoya
Conference paper
Part of the Lecture Notes in Computer Science book series (LNCS, volume 2749)


Three-dimensional (3-D) models of outdoor scenes can be widely used in a number of fields such as object recognition, navigation, scenic simulation, and mixed reality. Such models are often made manually with high costs, so that automatic 3-D reconstruction has been widely investigated. In related works, a dense 3-D model is generated by using a stereo method. However, such approaches cannot use several hundred images together for dense depth estimation of large constructs and urban environments because it is difficult to accurately calibrate a large number of cameras. This paper proposes a novel dense 3-D reconstruction method that uses multiple image sequences. First, our method estimates extrinsic camera parameters of each image sequence, and then reconstructs a dense 3-D model of a scene using an extended multi-baseline stereo and voxel voting techniques.


  1. 1.
    N. Yokoya, T. Shakunaga and M. Kanbara: “Passive Range Sensing Techniques: Depth from Images,” IEICE Trans. Inf. and Syst., Vol. E82-D, No. 3, pp. 523–533, 1999.Google Scholar
  2. 2.
    P. Beardsley, A. Zisserman and D. Murray: “Sequential Updating of Projective and Affine Structure from Motion,” Int. Jour. of Computer Vision, Vol. 23, No. 3, pp. 235–259, 1997.CrossRefGoogle Scholar
  3. 3.
    M. Pollefeys, R. Koch, M. Vergauwen, A. A. Deknuydt and L. J. V. Gool: “Three-dimentional Scene Reconstruction from Images,” Proc. SPIE, Vol. 3958, pp. 215–226, 2000.CrossRefGoogle Scholar
  4. 4.
    C. Tomasi and T. Kanade: “Shape and Motion from Image Streams under Orthography: A Factorization Method,” Int. Jour. of Computer Vision, Vol. 9, No. 2, pp. 137–154, 1992.CrossRefGoogle Scholar
  5. 5.
    T. Sato, M. Kanbara, N. Yokoya and H. Takemura: “Dense 3-D Reconstruction of an Outdoor Scene by Hundreds-baseline Stereo Using a Hand-held Video Camera,” Int. Jour. of Computer Vision, Vol. 47, No. 1–3, pp. 119–129, 2002.zbMATHCrossRefGoogle Scholar
  6. 6.
    T. Sato, M. Kanbara, H. Takemura and N. Yokoya: “3-D Reconstruction from a Monocular Image Sequence by Tracking Markers and Natural Features,” Proc. 14th Int. Conf. on Vision Interface, pp. 157–164, 2001.Google Scholar
  7. 7.
    M. Okutomi and T. Kanade: “A Multiple-baseline Stereo,” IEEE Trans. Pattern Analysis and Machine Intelligence, Vol. 15, No. 4, pp. 353–363, 1993.CrossRefGoogle Scholar
  8. 8.
    N. Yokoya: “Surface Reconstruction Directly from Binocular Stereo Images by Multiscale-multistage Regularization,” Proc. 11th Int. Conf. on Pattern Recognition, Vol. I, pp. 642–646, 1992.CrossRefGoogle Scholar
  9. 9.
    R. Y. Tsai: “An Efficient and Accurate Camera Calibration Technique for 3D Machine Vision,” Proc. IEEE Conf. on Computer Vision and Pattern Recognition, pp. 364–374, 1986.Google Scholar
  10. 10.
    S. Ikeda, T. Sato and N. Yokoya: “An Calibration Method for an Omnidirectional Multi-camera System,” Proc. SPIE, Vol. 5006, 2003.Google Scholar

Copyright information

© Springer-Verlag Berlin Heidelberg 2003

Authors and Affiliations

  • Tomokazu Sato
    • 1
  • Masayuki Kanbara
    • 1
  • Naokazu Yokoya
    • 1
  1. 1.Graduate School of Information ScienceNara Institute of Science and TechnologyNaraJapan

Personalised recommendations