Minimal Aspect Distortion (MAD) Mosaicing of Long Scenes
Long scenes can be imaged by mosaicing multiple images from cameras scanning the scene. We address the case of a video camera scanning a scene while moving in a long path, e.g. scanning a city street from a driving car, or scanning a terrain from a low flying aircraft.
A robust approach to this task is presented, which is applied successfully to sequences having thousands of frames even when using a hand-held camera. Examples are given on a few challenging sequences. The proposed system consists of two components: (i) Motion and depth computation. (ii) Mosaic rendering.
In the first part a “direct” method is presented for computing motion and dense depth. Robustness of motion computation has been increased by limiting the motion model for the scanning camera. An iterative graph-cuts approach, with planar labels and a flexible similarity measure, allows the computation of a dense depth for the entire sequence.
In the second part a new minimal aspect distortion (MAD) mosaicing uses depth to minimize the geometrical distortions of long panoramic images. In addition to MAD mosaicing, interactive visualization using X-Slits is also demonstrated.
KeywordsVideo mosaicing Ego motion Stereo Panorama X-slits Multi-perspective
Unable to display preview. Download preview PDF.
- Agarwala, A., Dontcheva, M., Agrawala, M., Drucker, S., Colburn, A., Curless, B., Salesin, D., & Cohen, M. (2004). Interactive digital photomontage. In SIGGRAPH (pp. 294–302). Google Scholar
- Agarwala, A., Agrawala, M., Cohen, M., Salesin, D., & Szeliski, R. (2006b). Photographing long scenes with multi-viewpoint panoramas. In SIGGRAPH’06 (pp. 853–861), July 2006. Google Scholar
- Bergen, J., Anandan, P., Hanna, K., & Hingorani, R. (1992). Hierarchical model-based motion estimation. In ECCV (pp. 237–252). Google Scholar
- Birchfield, S., & Tomasi, C. (1999). Multiway cut for stereo and motion with slanted surfaces. In ICCV (Vol. 1, pp. 489–495). Google Scholar
- Deng, Y., Yang, Q., Lin, X., & Tang, X. (2005). A symmetric patch-based correspondence model for occlusion handling. In ICCV (pp. 1316–1322), Washington, DC, USA. Google Scholar
- Feldman, D., & Zomet, A. (2004). Generating mosaics with minimum distortions. In Proceedings of the 2004 conference on computer vision and pattern recognition workshop (CVPRW’04) (Vol. 11, pp. 163–170), Washington, DC, USA. Google Scholar
- Hanna, K. (1991). Direct multi-resolution estimation of ego-motion and structure from motion. In MOTION’91 (pp. 156–162). Google Scholar
- Hong, L., & Chen, G. (2004). Segment-based stereo matching using graph cuts. In CVPR (Vol. 1, pp. 74–81), Los Alamitos, CA, USA. Google Scholar
- Irani, M., Rousso, B., & Peleg, S. (1992). Detecting and tracking multiple moving objects using temporal integration. In ECCV’92 (pp. 282–287). Google Scholar
- Kawasaki, H., Murao, M., Ikeuchi, K., & Sakauchi, M. (2001). Enhanced navigation system with real images and real-time information. In ITSWC’01, October 2001. Google Scholar
- Kolmogorov, V., & Zabih, R. (2001). Computing visual correspondence with occlusions via graph cuts. In ICCV (Vol. 2, pp. 508–515), July 2001. Google Scholar
- Kolmogorov, V., & Zabih, R. (2002). What energy functions can be minimized via graph cuts? In ECCV’02 (pp. 65–81), May 2002. Google Scholar
- Montoliu, R., & Pla, F. (2003). Robust techniques in least squares-based motion estimation problems. In Lecture notes in computer science : Vol. 2905. Progress in pattern recognition, speech and image analysis (pp. 62–70). Berlin: Springer. Google Scholar
- Ono, S., Kawasaki, H., Hirahara, K., Kagesawa, M., & Ikeuchi, K. (2003). Ego-motion estimation for efficient city modeling by using epipolar plane range image. In ITSWC’03, November 2003. Google Scholar
- Rav-Acha, A., & Peleg, S. (2004). A unified approach for motion analysis and view synthesis. In Second IEEE international symposium on 3D data processing, visualization, and transmission (3DPVT), Thessaloniki, Greece, September 2004. Google Scholar
- Rav-Acha, A., & Peleg, S. (2006). Lucas–Kanade without iterative warping. In ICIP’06 (pp. 1097–1100). Google Scholar
- Rav-Acha, A., Shor, Y., & Peleg, S. (2004). Mosaicing with parallax using time warping. In Second IEEE workshop on image and video registration, Washington, DC, July 2004. Google Scholar
- Román, A., & Lensch, H. P. A. (2006). Automatic multiperspective images. In Proceedings of eurographics symposium on rendering (pp. 161–171). Google Scholar
- Román, A., Garg, G., & Levoy, M. (2004). Interactive design of multi-perspective images for visualizing urban landscapes. In IEEE visualization 2004 (pp. 537–544), October 2004. Google Scholar
- Shi, M., & Zheng, J. Y. (2005). A slit scanning depth of route panorama from stationary blur. In CVPR’05 (Vol. 1, pp. 1047–1054). Google Scholar
- Wexler, Y., & Simakov, D. (2005). Space–time scene manifolds. In ICCV’05 (Vol. 1, pp. 858–863). Google Scholar
- Yang, Q., Wang, L., & Yang, R. (2006). Real-time global stereo matching using hierarchical belief propagation. In BMVC (pp. 989–998), Edinburgh, September 2006. Google Scholar