
Neural Scene Decoration from a Single Photograph

  • Conference paper
Computer Vision – ECCV 2022 (ECCV 2022)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 13683)

Abstract

Furnishing and rendering indoor scenes is a long-standing task in interior design: artists create a conceptual design for the space, build a 3D model of it, decorate it, and then render the result. Although important, this process is tedious and requires tremendous effort. In this paper, we introduce a new problem of domain-specific indoor scene image synthesis, namely neural scene decoration. Given a photograph of an empty indoor space and a list of decorations with a layout determined by the user, we aim to synthesize a new image of the same space with the desired furnishing and decorations. Neural scene decoration can be applied to create conceptual interior designs in a simple yet effective manner. Our approach to this problem is a novel scene generation architecture that transforms an empty scene and an object layout into a realistic photograph of the furnished scene. We demonstrate the performance of our proposed method by comparing it, both qualitatively and quantitatively, with conditional image synthesis baselines built upon prevailing image translation approaches. We conduct extensive experiments to further validate the plausibility and aesthetics of our generated scenes. Our implementation is available at https://github.com/hkust-vgd/neural_scene_decoration.
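
To make the problem setup concrete, below is a minimal PyTorch-style sketch of the interface the abstract describes: a conditional generator that consumes a photograph of an empty room together with a user-specified object layout (here rasterized from bounding boxes into per-class masks) and produces a furnished image. This is an illustrative toy under stated assumptions, not the authors' architecture; the names DecorationGenerator and rasterize_boxes are hypothetical, and the real implementation is in the linked repository.

    # Toy sketch of the neural scene decoration interface, NOT the paper's model.
    # All class and function names here are hypothetical.
    import torch
    import torch.nn as nn

    class DecorationGenerator(nn.Module):
        """Conditional generator: empty-room photo + layout map -> furnished image."""

        def __init__(self, num_classes: int, base_channels: int = 32):
            super().__init__()
            # The layout is rasterized as one binary mask per decoration class,
            # then concatenated channel-wise with the RGB photo of the empty room.
            in_channels = 3 + num_classes
            self.net = nn.Sequential(
                nn.Conv2d(in_channels, base_channels, 3, padding=1),
                nn.ReLU(inplace=True),
                nn.Conv2d(base_channels, base_channels, 3, padding=1),
                nn.ReLU(inplace=True),
                nn.Conv2d(base_channels, 3, 3, padding=1),
                nn.Tanh(),  # furnished image with values in [-1, 1]
            )

        def forward(self, empty_room: torch.Tensor, layout: torch.Tensor) -> torch.Tensor:
            return self.net(torch.cat([empty_room, layout], dim=1))

    def rasterize_boxes(boxes, num_classes, height, width):
        """Turn user-specified (class_id, x0, y0, x1, y1) boxes, given in
        normalized coordinates, into a per-class occupancy map."""
        layout = torch.zeros(1, num_classes, height, width)
        for cls, x0, y0, x1, y1 in boxes:
            layout[0, cls,
                   int(y0 * height):int(y1 * height),
                   int(x0 * width):int(x1 * width)] = 1.0
        return layout

    if __name__ == "__main__":
        G = DecorationGenerator(num_classes=5)
        room = torch.rand(1, 3, 256, 256) * 2 - 1  # placeholder empty-room photo
        boxes = [(0, 0.1, 0.5, 0.4, 0.9),          # e.g. a sofa region
                 (3, 0.6, 0.4, 0.9, 0.8)]          # e.g. a lamp region
        layout = rasterize_boxes(boxes, num_classes=5, height=256, width=256)
        furnished = G(room, layout)
        print(furnished.shape)  # torch.Size([1, 3, 256, 256])

In the paper's setting such a generator would be trained adversarially on pairs of empty and furnished photographs of the same spaces; the sketch only fixes the input/output contract that makes the user-controlled layout explicit.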



Acknowledgment

This paper was partially supported by an internal grant from HKUST (R9429) and the HKUST-WeBank Joint Lab.

Author information

Corresponding author

Correspondence to Hong-Wing Pang.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (PDF 7560 KB)


Copyright information

© 2022 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper


Cite this paper

Pang, HW., Chen, Y., Le, PH., Hua, BS., Nguyen, D.T., Yeung, SK. (2022). Neural Scene Decoration from a Single Photograph. In: Avidan, S., Brostow, G., Cissé, M., Farinella, G.M., Hassner, T. (eds) Computer Vision – ECCV 2022. ECCV 2022. Lecture Notes in Computer Science, vol 13683. Springer, Cham. https://doi.org/10.1007/978-3-031-20050-2_9

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-20050-2_9

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-20049-6

  • Online ISBN: 978-3-031-20050-2

  • eBook Packages: Computer Science (R0)
