Unfolding an Indoor Origami World

Fouhey, David Ford; Gupta, Abhinav; Hebert, Martial

doi:10.1007/978-3-319-10599-4_44

David Ford Fouhey¹⁹,
Abhinav Gupta¹⁹ &
Martial Hebert¹⁹

Part of the book series: Lecture Notes in Computer Science ((LNIP,volume 8694))

Included in the following conference series:

European Conference on Computer Vision

Abstract

In this work, we present a method for single-view reasoning about 3D surfaces and their relationships. We propose the use of mid-level constraints for 3D scene understanding in the form of convex and concave edges and introduce a generic framework capable of incorporating these and other constraints. Our method takes a variety of cues and uses them to infer a consistent interpretation of the scene. We demonstrate improvements over the state-of-the art and produce interpretations of the scene that link large planar surfaces.

Download to read the full chapter text

Chapter PDF

PlaneFormers: From Sparse View Planes to 3D Reconstruction

3DNN: 3D Nearest Neighbor

Article 22 July 2014

Geometric Pose Affordance: Monocular 3D Human Pose Estimation with Scene Constraints

Keywords

These keywords were added by machine and not by the authors. This process is experimental and the keywords may be updated as the learning algorithm improves.

References

Choi, W., Chao, Y.W., Pantofaru, C., Savarese, S.: Understanding indoor scenes using 3D geometric phrases. In: CVPR (2013)
Google Scholar
Clowes, M.: On seeing things. Artificial Intelligence 2, 79–116 (1971)
Article Google Scholar
Coughlan, J., Yuille, A.: The Manhattan world assumption: Regularities in scene statistics which enable Bayesian inference. In: NIPS (2000)
Google Scholar
Del Pero, L., Bowdish, J., Fried, D., Kermgard, B., Hartley, E.L., Barnard, K.: Bayesian geometric modeling of indoor scenes. In: CVPR (2012)
Google Scholar
Delage, E., Lee, H., Ng, A.Y.: A dynamic Bayesian network model for autonomous 3D reconstruction from a single indoor image. In: CVPR (2006)
Google Scholar
Fouhey, D.F., Gupta, A., Hebert, M.: Data-driven 3D primitives for single image understanding. In: ICCV (2013)
Google Scholar
Guo, R., Hoiem, D.: Support surface prediction in indoor scenes. In: ICCV (2013)
Google Scholar
Gupta, A., Efros, A.A., Hebert, M.: Blocks world revisited: Image understanding using qualitative geometry and mechanics. In: Daniilidis, K., Maragos, P., Paragios, N. (eds.) ECCV 2010, Part IV. LNCS, vol. 6314, pp. 482–496. Springer, Heidelberg (2010)
Chapter Google Scholar
Gupta, S., Arbelaez, P., Malik, J.: Perceptual organization and recognition of indoor scenes from RGB-D images. In: CVPR (2013)
Google Scholar
Hedau, V., Hoiem, D., Forsyth, D.: Recovering the spatial layout of cluttered rooms. In: ICCV (2009)
Google Scholar
Hoiem, D., Efros, A., Hebert, M.: Automatic photo pop-up. In: SIGGRAPH (2005)
Google Scholar
Hoiem, D., Efros, A., Hebert, M.: Geometric context from a single image. In: ICCV (2005)
Google Scholar
Hoiem, D., Efros, A.A., Hebert, M.: Recovering occlusion boundaries from an image. IJCV 91(3), 328–346 (2011)
Article MATH MathSciNet Google Scholar
Huffman, D.: Impossible objects as nonsense sentences. Machine Intelligence 8, 475–492 (1971)
Google Scholar
Jia, Z., Gallagher, A., Chang, Y.J., Chen, T.: A learning based framework for depth ordering. In: CVPR (2012)
Google Scholar
Jia, Z., Gallagher, A., Saxena, A., Chen, T.: 3D-based reasoning with blocks, support, and stability. In: CVPR (2013)
Google Scholar
Jiang, H., Xiao, J.: A linear approach to matching cuboids in RGBD images. In: CVPR (2013)
Google Scholar
Kanade, T.: A theory of origami world. Artificial Intelligence 13(3) (1980)
Google Scholar
Karsch, K., Liao, Z., Rock, J., Barron, J.T., Hoiem, D.: Boundary cues for 3D object shape recovery. In: CVPR (2013)
Google Scholar
Karsch, K., Liu, C., Kang, S.B.: Depth extraction from video using non-parametric sampling. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 775–788. Springer, Heidelberg (2012)
Chapter Google Scholar
Ladický, L., Shi, J., Pollefeys, M.: Pulling things out of perspective. In: CVPR (2014)
Google Scholar
Lee, D.C., Gupta, A., Hebert, M., Kanade, T.: Estimating spatial layout of rooms using volumetric reasoning about objects and surfaces. In: NIPS (2010)
Google Scholar
Lee, D.C., Hebert, M., Kanade, T.: Geometric reasoning for single image structure recovery. In: CVPR (2009)
Google Scholar
Liu, M., Salzmann, M., He, X.: Discrete-continuous depth estimation from a single image. In: CVPR (2014)
Google Scholar
Nitzberg, M., Mumford, D.: The 2.1D sketch. In: ICCV (1990)
Google Scholar
Ramalingam, S., Kohli, P., Alahari, K., Torr, P.: Exact inference in multi-label CRFs with higher order cliques. In: CVPR (2008)
Google Scholar
Ramalingam, S., Pillai, J., Jain, A., Taguchi, Y.: Manhattan junction catalogue for spatial reasoning of indoor scenes. In: CVPR (2013)
Google Scholar
Roberts, L.: Machine perception of 3D solids. PhD Thesis (1965)
Google Scholar
Saxena, A., Chung, S.H., Ng, A.Y.: Learning depth from single monocular images. In: NIPS (2005)
Google Scholar
Saxena, A., Sun, M., Ng, A.Y.: Make3D: Learning 3D scene structure from a single still image. TPAMI 30(5), 824–840 (2008)
Google Scholar
Schwing, A.G., Fidler, S., Pollefeys, M., Urtasun, R.: Box In the Box: Joint 3D Layout and Object Reasoning from Single Images. In: ICCV (2013)
Google Scholar
Schwing, A.G., Urtasun, R.: Efficient Exact Inference for 3D Indoor Scene Understanding. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part VI. LNCS, vol. 7577, pp. 299–313. Springer, Heidelberg (2012)
Chapter Google Scholar
Silberman, N., Hoiem, D., Kohli, P., Fergus, R.: Indoor segmentation and support inference from RGBD images. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 746–760. Springer, Heidelberg (2012)
Chapter Google Scholar
Sugihara, K.: Machine Interpretation of Line Drawings. MIT Press (1986)
Google Scholar
Xiang, Y., Savarese, S.: Estimating the aspect layout of object categories. In: CVPR (2012)
Google Scholar
Xiao, J., Russell, B., Torralba, A.: Localizing 3D cuboids in single-view images. In: NIPS (2012)
Google Scholar
Yamaguchi, K., Hazan, T., McAllester, D., Urtasun, R.: Continuous markov random fields for robust stereo estimation. In: Fitzgibbon, A., Lazebnik, S., Perona, P., Sato, Y., Schmid, C. (eds.) ECCV 2012, Part V. LNCS, vol. 7576, pp. 45–58. Springer, Heidelberg (2012)
Chapter Google Scholar
Yu, S.X., Zhang, H., Malik, J.: Inferring spatial layout from a single image via depth-ordered grouping. In: Workshop on Perceptual Organization (2008)
Google Scholar
Zhang, J., Chen, K., Schwing, A.G., Urtasun, R.: Estimaing the 3D Layout of Indoor Scenes and its Clutter from Depth Sensors. In: ICCV (2013)
Google Scholar
Zhao, Y., Zhu, S.: Image parsing via stochastic scene grammar. In: NIPS (2011)
Google Scholar
Zhao, Y., Zhu, S.: Scene parsing by integrating function, geometry and appearance models. In: CVPR (2013)
Google Scholar

Download references

Author information

Authors and Affiliations

The Robotics Institute, Carnegie Mellon University, USA
David Ford Fouhey, Abhinav Gupta & Martial Hebert

Authors

David Ford Fouhey
View author publications
You can also search for this author in PubMed Google Scholar
Abhinav Gupta
View author publications
You can also search for this author in PubMed Google Scholar
Martial Hebert
View author publications
You can also search for this author in PubMed Google Scholar

Editor information

Editors and Affiliations

Department of Computer Science, University of Toronto, 6 King’s College Road, M5H 3S5, Toronto, ON, Canada
David Fleet
Faculty of Electrical Engineering, Department of Cybernetics, Czech Technical University in Prague, Technicka 2, 166 27, Prague 6, Czech Republic
Tomas Pajdla
Max-Planck-Institut für Informatik, Campus E1 4, 66123, Saarbrücken, Germany
Bernt Schiele
ESAT - PSI, iMinds, KU Leuven, Kasteelpark Arenberg 10, Bus 2441, 3001, Leuven, Belgium
Tinne Tuytelaars

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Fouhey, D.F., Gupta, A., Hebert, M. (2014). Unfolding an Indoor Origami World. In: Fleet, D., Pajdla, T., Schiele, B., Tuytelaars, T. (eds) Computer Vision – ECCV 2014. ECCV 2014. Lecture Notes in Computer Science, vol 8694. Springer, Cham. https://doi.org/10.1007/978-3-319-10599-4_44

Download citation

DOI: https://doi.org/10.1007/978-3-319-10599-4_44
Publisher Name: Springer, Cham
Print ISBN: 978-3-319-10598-7
Online ISBN: 978-3-319-10599-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Unfolding an Indoor Origami World

Abstract

Chapter PDF

Similar content being viewed by others

PlaneFormers: From Sparse View Planes to 3D Reconstruction

3DNN: 3D Nearest Neighbor

Geometric Pose Affordance: Monocular 3D Human Pose Estimation with Scene Constraints

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Publish with us

Navigation

Unfolding an Indoor Origami World

Abstract

Chapter PDF

Similar content being viewed by others

PlaneFormers: From Sparse View Planes to 3D Reconstruction

3DNN: 3D Nearest Neighbor

Geometric Pose Affordance: Monocular 3D Human Pose Estimation with Scene Constraints

Keywords

References

Author information

Authors and Affiliations

Editor information

Editors and Affiliations

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Search

Navigation