An interactive approach for functional prototype recovery from a single RGBD image

Rong, Yuliang; Zheng, Youyi; Shao, Tianjia; Yang, Yin; Zhou, Kun

doi:10.1007/s41095-016-0032-x

An interactive approach for functional prototype recovery from a single RGBD image

Research Article
Open access
Published: 29 January 2016

Volume 2, pages 87–96, (2016)
Cite this article

Download PDF

You have full access to this open access article

Computational Visual Media Aims and scope Submit manuscript

An interactive approach for functional prototype recovery from a single RGBD image

Download PDF

Yuliang Rong¹,
Youyi Zheng²,
Tianjia Shao¹,
Yin Yang³ &
…
Kun Zhou¹

866 Accesses
5 Citations
Explore all metrics

Abstract

Inferring the functionality of an object from a single RGBD image is difficult for two reasons: lack of semantic information about the object, and missing data due to occlusion. In this paper, we present an interactive framework to recover a 3D functional prototype from a single RGBD image. Instead of precisely reconstructing the object geometry for the prototype, we mainly focus on recovering the object’s functionality along with its geometry. Our system allows users to scribble on the image to create initial rough proxies for the parts. After user annotation of high-level relations between parts, our system automatically jointly optimizes detailed joint parameters (axis and position) and part geometry parameters (size, orientation, and position). Such prototype recovery enables a better understanding of the underlying image geometry and allows for further physically plausible manipulation. We demonstrate our framework on various indoor objects with simple or hybrid functions.

Article PDF

High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review

Article Open access 06 March 2022

Realistic surface geometry reconstruction using a hand-held RGB-D camera

Article 18 February 2016

Joint 3D Object and Layout Inference from a Single RGB-D Image

Discover the latest articles, news and stories from top researchers in related subjects.

Artificial Intelligence

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

References

Han, Y.; Lee, J.-Y.; Kweon, I. S. High quality shape from a single RGB-D image under uncalibrated natural illumination. In: Proceedings of IEEE International Conference on Computer Vision, 1617–1624, 2013.
Google Scholar
Izadi, S.; Kim, D.; Hilliges, O.; Molyneaux, D.; Newcombe, R.; Kohli, P.; Shotton, J.; Hodges, S.; Freeman, D.; Davison, A.; Fitzgibbon, A. KinectFusion: Real-time 3D reconstruction and interaction using a moving depth camera. In: Proceedings of the 24th Annual ACM Symposium on User Interface Software and Technology, 559–568, 2011.
Google Scholar
Shao, T.; Monszpart, A.; Zheng, Y.; Koo, B.; Xu, W.; Zhou, K.; Mitra, N. J. Imagining the unseen: Stabilitybased cuboid arrangements for scene understanding. ACM Transactions on Graphics Vol. 33, No. 6, Article No. 209, 2014.
Article Google Scholar
Shen, C.-H.; Fu, H.; Chen, K.; Hu, S.-M. Structure recovery by part assembly. ACM Transactions on Graphics Vol. 31, No. 6, Article No. 180, 2012.
Article Google Scholar
Zheng, Y.; Chen, X.; Cheng, M.-M.; Zhou, K.; Hu, S.-M.; Mitra, N. J. Interactive images: Cuboid proxies for smart image manipulation. ACM Transactions on Graphics Vol. 31, No. 4, Article No. 99, 2012.
Google Scholar
Sullivan, L. H. The tall office building artistically considered. Lippincott’s Magazine 57, 1896.
Google Scholar
Koo, B.; Li, W.; Yao, J.; Agrawala, M.; Mitra, N. J. Creating works-like prototypes of mechanical objects. ACM Transactions on Graphics Vol. 33, No. 6, Article No. 217, 2014.
Article Google Scholar
Li, Y.; Wu, X.; Chrysanthou, Y.; Sharf, A.; Cohen-Or, D.; Mitra, N. J. GlobFit: Consistently fitting primitives by discovering global relations. ACM Transactions on Graphics Vol. 30, No. 4, Article No. 52, 2011.
Google Scholar
Lafarge, F.; Alliez, P. Surface reconstruction through point set structuring. Computer Graphics Forum Vol. 32, No. 2pt2, 225–234, 2013.
Article Google Scholar
Arikan, M.; Schwärzler, M.; Flöry, S.; Wimmer, M.; Maierhofer, S. O-snap: Optimization-based snapping for modeling architecture. ACM Transactions on Graphics Vol. 32, No. 1, Article No. 6, 2013.
Article MATH Google Scholar
Gupta, A.; Efros, A. A.; Hebert, M. Blocks world revisited: Image understanding using qualitative geometry and mechanics. In: Lecture Notes in Computer Science, Vol. 6314. Daniilidis, K.; Maragos, P.; Paragios, N. Eds. Springer Berlin Heidelberg, 482–496, 2010.
Article Google Scholar
Gupta, A.; Hebert, M.; Kanade, T.; Blei, D. M. Estimating spatial layout of rooms using Volumetric reasoning about objects and surfaces. In: Advances in Neural Information Processing Systems 23. Lafferty, J.; Williams, C.; Shawe-Taylor, J.; Zemel, R.; Culotta, A. Eds. Curran Associates, Inc., 1288–1296, 2010.
Google Scholar
Del Pero, L.; Bowdish, J.; Fried, D.; Kermgard, B.; Hartley, E.; Barnard, K. Bayesian geometric modeling of indoor scenes. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2719–2726, 2012.
Google Scholar
Jia, Z.; Gallagher, A.; Saxena, A.; Chen, T. 3Dbased reasoning with blocks, support, and stability. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 1–8, 2013.
Google Scholar
Jiang, H.; Xiao, J. A linear approach to matching cuboids in RGBD images. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2171–2178, 2013.
Google Scholar
Zheng, B.; Zhao, Y.; Yu, J. C.; Ikeuchi, K.; Zhu, S.-C. Beyond point clouds: Scene understanding by reasoning geometry and physics. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 3127–3134, 2013.
Google Scholar
Umetani, N.; Igarashi, T.; Mitra, N. J. Guided exploration of physically valid shapes for furniture design. ACM Transactions on Graphics Vol. 31, No. 4, Article No. 86, 2012.
Article Google Scholar
Shao, T.; Li, W.; Zhou, K.; Xu, W.; Guo, B.; Mitra, N. J. Interpreting concept sketches. ACM Transactions on Graphics Vol. 32, No. 4, Article No. 56, 2013.
Article Google Scholar
Bokeloh, M.; Wand, M.; Seidel, H.-P.; Koltun, V. An algebraic model for parameterized shape editing. ACM Transactions on Graphics Vol. 31, No. 4, Article No. 78, 2012.
Article Google Scholar
Gal, R.; Sorkine, O.; Mitra, N. J.; Cohen-Or, D. iWIRES: An analyze-and-edit approach to shape manipulation. ACM Transactions on Graphics Vol. 28, No. 3, Article No. 33, 2009.
Article Google Scholar
Xu, W.; Wang, J.; Yin, K.; Zhou, K.; van de Panne, M.; Chen, F.; Guo, B. Joint-aware manipulation of deformable models. ACM Transactions on Graphic Vol. 28, No. 3, Article No. 35, 2009.
Article Google Scholar
Daniel, M.; Lucas, M. Towards declarative geometric modelling in mechanics. In: Integrated Design and Manufacturing in Mechanical Engineering. Chedmail, P.; Bocquet, J.-C.; Dornfeld, D. Eds. Springer Netherlands, 427–436, 1997.
Chapter Google Scholar
Yvars, P.-A. Using constraint satisfaction for designing mechanical systems. International Journal on Interactive Design and Manufacturing Vol. 2, No. 3, 161–167, 2008.
Article Google Scholar
Zhang, Q.; Ye, M.; Yang, R.; Matsushita, Y.; Wilburn, B.; Yu, H. Edge-preserving photometric stereo via depth fusion. In: Proceedings of IEEE Conference on Computer Vision and Pattern Recognition, 2472–2479, 2012.
Google Scholar
Rother, C.; Kolmogorov, V.; Blake, A. “GrabCut”: Interactive foreground extraction using iterated graph cuts. ACM Transactions on Graphics Vol. 23, No. 3, 309–314, 2004.
Article Google Scholar
Schnabel, R.; Wahl, R.; Klein, R. Efficient RANSAC for point-cloud shape detection. Computer Graphics Forum Vol. 26, No. 2, 214–226, 2007.
Article Google Scholar

Download references

Author information

Authors and Affiliations

State Key Lab of CAD&CG, Zhejiang University, Hangzhou, 310058, China
Yuliang Rong, Tianjia Shao & Kun Zhou
ShanghaiTech University, Shanghai, 200031, China
Youyi Zheng
The University of New Mexico, Albuquerque, NM, 87131, USA
Yin Yang

Authors

Yuliang Rong
View author publications
You can also search for this author in PubMed Google Scholar
Youyi Zheng
View author publications
You can also search for this author in PubMed Google Scholar
Tianjia Shao
View author publications
You can also search for this author in PubMed Google Scholar
Yin Yang
View author publications
You can also search for this author in PubMed Google Scholar
Kun Zhou
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Tianjia Shao.

Additional information

This article is published with open access at Springerlink.com

Yuliang Rong is currently a senior undergraduate student majoring in computer science at Zhejiang University. He will work towards a master degree at the State Key Lab of CAD&CG in Zhejiang University after graduation. His research interests include image analysis and geometric modeling.

Youyi Zheng is currently an assistant professor at the School of Information Science and Technology, ShanghaiTech University. He obtained his Ph.D. degree from the Department of Computer Science and Engineering at Hong Kong University of Science & Technology, and his M.S. and B.S. degrees from the Department of Mathematics, Zhejiang University. His research interests include geometric modeling, imaging, and human–computer interaction.

Tianjia Shao is currently an assistant researcher at the State Key Lab of CAD&CG, Zhejiang University. He received his Ph.D. degree in computer science from the Institute for Advanced Study, and his B.S. degree from the Department of Automation, both at Tsinghua University. His research interests include RGBD image processing, indoor scene modeling, structure analysis, and 3D model retrieval.

Yin Yang received his Ph.D. degree in computer science from the University of Texas at Dallas in 2013. He is an assistant professor in the Electrical Communication Engineering Department, University of New Mexico, Albuquerque, USA. His research interests include physics-based animation and simulation, visualization, and medical imaging analysis.

Kun Zhou is a Cheung Kong professor in the Computer Science Department of Zhejiang University, and the director of the State Key Lab of CAD&CG. Prior to joining Zhejiang University in 2008, he was a lead researcher in the Internet Graphics Group at Microsoft Research Asia. He received his B.S. and Ph.D. degrees in computer science from Zhejiang University in 1997 and 2002, respectively. His research interests are visual computing, parallel computing, human–computer interaction, and virtual reality. He currently serves on the editorial/advisory boards of ACM Transactions on Graphics and IEEE Spectrum. He is a Fellow of the IEEE.

Open Access The articles published in this journal are distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Other papers from this open access journal are available free of charge from http://www.springer.com/journal/41095. To submit a manuscript, please go to https://www. editorialmanager.com/cvmj.

Electronic supplementary material

Supplementary material, approximately 41628 KB.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (https://creativecommons.org/licenses/by/4.0), which permits use, duplication, adaptation, distribution, and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.

Reprints and permissions

About this article

Cite this article

Rong, Y., Zheng, Y., Shao, T. et al. An interactive approach for functional prototype recovery from a single RGBD image. Comp. Visual Media 2, 87–96 (2016). https://doi.org/10.1007/s41095-016-0032-x

Download citation

Received: 01 December 2015
Accepted: 09 December 2015
Published: 29 January 2016
Issue Date: March 2016
DOI: https://doi.org/10.1007/s41095-016-0032-x

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

An interactive approach for functional prototype recovery from a single RGBD image

Abstract

Article PDF

Similar content being viewed by others

High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review

Realistic surface geometry reconstruction using a hand-held RGB-D camera

Joint 3D Object and Layout Inference from a Single RGB-D Image

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material, approximately 41628 KB.

Rights and permissions

About this article

Cite this article

Keywords

Navigation

An interactive approach for functional prototype recovery from a single RGBD image

Abstract

Article PDF

Similar content being viewed by others

High-quality indoor scene 3D reconstruction with RGB-D cameras: A brief review

Realistic surface geometry reconstruction using a hand-held RGB-D camera

Joint 3D Object and Layout Inference from a Single RGB-D Image

Explore related subjects

References

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material, approximately 41628 KB.

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation