Abstract
We consider the problem of estimating 3-d structure from a single still image of an outdoor urban scene. Our goal is to efficiently create 3-d models which are visually pleasant. We chose an appropriate 3-d model structure and formulate the task of 3-d reconstruction as model fitting problem. Our 3-d models are composed of a number of vertical walls and a ground plane, where ground-vertical boundary is a continuous polyline. We achieve computational efficiency by special preprocessing together with stepwise search of 3-d model parameters dividing the problem into two smaller sub-problems on chain graphs. The use of Conditional Random Field models for both problems allows to various cues. We infer orientation of vertical walls of 3-d model vanishing points.
Chapter PDF
Similar content being viewed by others
Keywords
These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.
References
Hoiem, D., Efros, A., Hebert, M.: Automatic photo pop-up. In: Proc. of ACM SIGGRAPH (2005)
Hoiem, D., Efros, A., Hebert, M.: Geometric context from a single image. In: Proc. of ICCV (2005)
Delage, E., Lee, H., Ng, A.: Automatic single-image 3d reconstructions of indoor manhattan world scenes. In: Proc. of ISRR (2005)
Delage, E., Lee, H., Ng, A.: A dynamic bayesian network model for autonomous 3d reconstruction from a single indoor image. In: Proc. of CVPR (2006)
Criminisi, A., Reid, I., Zisserman, A.: Single view metrology. IJCV 40 (2000)
Hoiem, D., Sten, A.N., Efros, A.A., Hebert, M.: Recovering Occlusion Boundaries from a Single Image. In: Proc. of ICCV (2007)
Saxena, A., Sun, M., Ng, A.: Learning 3-D Scene Structure from a Single Still Image. In: Proc. of ICCV workshop on 3D representation for Recognition (2007)
Saxena, A., Chung, S., Ng, A.: Depth Reconstruction from a Single Still Image. In: IJCV (2007)
Saxena, A., Chung, S., Ng, A.: Learning Depth from Single Monocular Images. In: Proc. of NIPS, vol. 18 (2005)
Canny, J.: A Computational Approach To Edge Detection. PAMI 8 (1986)
Kosecka, J., Zhang, W.: Video Compass. In: Proc. of ECCV, vol. 7, pp. 476–491 (2002)
Vezhnevets, V., Konushin, A., Ignatenko, A.: Interactive image-based urban modeling. In: Proc. of PIA, pp. 63–68 (2007)
Barinova, O., Kuzmishkina, A., Vezhnevets, A., Vezhnevets, V.: Learning class specific edges for vanishing point estimation. In: Proc. of Graphicon, pp. 162–165 (2007)
Chum, O., Matas, J., Kittler, J.: Locally Optimized RANSAC. In: DAGM Symposium on Pattern Recognition, vol. 25, pp. 236–243 (2003)
Comaniciu, D., Meer, P.: Mean shift: A robust approach toward feature space analysis. IEEE Trans. on Pattern Analysis and Machine Intelligence 24, 5 (2002)
Lafferty, J., McCallum, A., Pereira, F.: Conditional random fields: Probabilistic models for segmenting and labeling sequence data. In: Proc. of ICML (2001)
Sutton, C., McCallum, A.: Piecewise training of undirected models. In: Proc. of UAI, vol. 21 (2005)
Shotton, J., Winn, J., Rother, C., Criminisi, A.: TextonBoost for Image Understanding: Multi-Class Object Recognition and Segmentation by Jointly Modeling Texture, Layout, and Context. In: IJCV (2007)
Niculescu-Mizil, A., Caruana, R.: Obtaining Calibrated Probabilities from Boosting. In: Proc. of UAI, pp. 413–420 (2005)
Wasserman, L.: All of Nonparametric Statistics. Springer, Heidelberg (2006)
Hulse, J., Khoshgoftaar, T., Napolitano, A.: Experimental perspectives on learning from imbalanced data. In: Proc. of ICML, pp. 935–942 (2007)
Viola, P., Jones, M.: Robust Real-time Object Detection. In: IJCV (2001)
Kang, H., Pyo, S., Anjyo, K., Shin, S.: Tour into the picture using a vanishing line and its extension to panoramic images. In: Proc. Eurographics, pp. 132–141 (2001)
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Electronic Supplementary Material
Rights and permissions
Copyright information
© 2008 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Barinova, O., Konushin, V., Yakubenko, A., Lee, K., Lim, H., Konushin, A. (2008). Fast Automatic Single-View 3-d Reconstruction of Urban Scenes. In: Forsyth, D., Torr, P., Zisserman, A. (eds) Computer Vision – ECCV 2008. ECCV 2008. Lecture Notes in Computer Science, vol 5303. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-88688-4_8
Download citation
DOI: https://doi.org/10.1007/978-3-540-88688-4_8
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-88685-3
Online ISBN: 978-3-540-88688-4
eBook Packages: Computer ScienceComputer Science (R0)