Unsupervised Domain Adaptation for Semantic Segmentation with Global and Local Consistency

Shan, Xiangxuan; Yin, Zijin; Gao, Jiayi; Liang, Kongming; Ma, Zhanyu; Guo, Jun

doi:10.1007/978-3-031-20497-5_13

Part of the book series: Lecture Notes in Computer Science ((LNAI,volume 13604))

Included in the following conference series:

CAAI International Conference on Artificial Intelligence

1414 Accesses

Abstract

Unsupervised domain adaptation(UDA) for semantic segmentation aims to learn from labeled synthetic data to segment the unlabeled real data. Many recent methods use generative networks to acquire real-like images for mitigating domain shift. However, these methods only ensure global style consistency between two domains and fail to impose pixel-wise constraint which is also referred to as local content consistency. To address the above problem, we propose a global and local consistency network to reduce the domain gap in unsupervised domain adaptation for semantic segmentation. To this end, we first constrain global style consistency through a generative adversarial network to acquire real-like latent domain images. Then we enhance local content consistency based on pixel-wise entropy minimization. Experimental results show that our method has superiority over other competitive methods on GTA5 \(\rightarrow \) Cityscapes.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 99.00; Price excludes VAT (USA)

Softcover Book: USD 129.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Borse, S., Wang, Y., Zhang, Y., Porikli, F.: Inverseform: A loss function for structured boundary-aware segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 5901–5911 (2021)
Google Scholar
Cardace, A., Ramirez, P.Z., Salti, S., Di Stefano, L.: Shallow features guide unsupervised domain adaptation for semantic segmentation at class boundaries. In: Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision, pp. 1160–1170 (2022)
Google Scholar
Chen, L.C., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: Deeplab: Semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected crfs. IEEE Trans. Pattern Analysis Mach. Intell. 40(4), 834–848 (2017)
Article Google Scholar
Chen, M., Xue, H., Cai, D.: Domain adaptation for semantic segmentation with maximum squares loss. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 2090–2099 (2019)
Google Scholar
Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3213–3223 (2016)
Google Scholar
Fang, Y., Deng, W., Du, J., Hu, J.: Identity-aware cyclegan for face photo-sketch synthesis and recognition. Pattern Recogn. 102, 107249 (2020)
Article Google Scholar
Gong, R., Li, W., Chen, Y., Gool, L.V.: Dlow: domain flow for adaptation and generalization. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2477–2486 (2019)
Google Scholar
Hoffman, J., et al.: Cycada: cycle-consistent adversarial domain adaptation. In: International Conference on Machine Learning, pp. 1989–1998. PMLR (2018)
Google Scholar
Isola, P., Zhu, J.Y., Zhou, T., Efros, A.A.: Image-to-image translation with conditional adversarial networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1125–1134 (2017)
Google Scholar
Johnson, J., Alahi, A., Fei-Fei, L.: Perceptual losses for real-time style transfer and super-resolution. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 694–711. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_43
Chapter Google Scholar
Li, Q., Du, J., Song, F., Wang, C., Liu, H., Lu, C.: Region-based multi-focus image fusion using the local spatial frequency. In: 2013 25th Chinese Control and Decision Conference (CCDC), pp. 3792–3796. IEEE (2013)
Google Scholar
Liu, Y., Deng, J., Gao, X., Li, W., Duan, L.: Bapa-net: boundary adaptation and prototype alignment for cross-domain semantic segmentation. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 8801–8811 (2021)
Google Scholar
Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3431–3440 (2015)
Google Scholar
Luo, Y., Zheng, L., Guan, T., Yu, J., Yang, Y.: Taking a closer look at domain shift: category-level adversaries for semantics consistent domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2507–2516 (2019)
Google Scholar
Piva, F.J., Dubbelman, G.: Exploiting image translations via ensemble self-supervised learning for unsupervised domain adaptation. arXiv preprint arXiv:2107.06235 (2021)
Radford, A., Metz, L., Chintala, S.: Unsupervised representation learning with deep convolutional generative adversarial networks. arXiv preprint arXiv:1511.06434 (2015)
Richter, S.R., Vineet, V., Roth, S., Koltun, V.: Playing for data: ground truth from computer games. In: Leibe, B., Matas, J., Sebe, N., Welling, M. (eds.) ECCV 2016. LNCS, vol. 9906, pp. 102–118. Springer, Cham (2016). https://doi.org/10.1007/978-3-319-46475-6_7
Chapter Google Scholar
Ros, G., Sellart, L., Materzynska, J., Vazquez, D., Lopez, A.M.: The synthia dataset: a large collection of synthetic images for semantic segmentation of urban scenes. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 3234–3243 (2016)
Google Scholar
Shannon, C.E.: A mathematical theory of communication. Bell Syst. Technol. J 27(3), 379–423 (1948)
Article MathSciNet MATH Google Scholar
Tsai, Y.H., Hung, W.C., Schulter, S., Sohn, K., Yang, M.H., Chandraker, M.: Learning to adapt structured output space for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7472–7481 (2018)
Google Scholar
Vu, T.H., Jain, H., Bucher, M., Cord, M., Pérez, P.: Advent: adversarial entropy minimization for domain adaptation in semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 2517–2526 (2019)
Google Scholar
Wang, C., et al.: Active boundary loss for semantic segmentation. arXiv preprint arXiv:2102.02696 (2021)
Wang, Z., Li, Y., Wang, S.: Noisy boundaries: lemon or lemonade for semi-supervised instance segmentation? arXiv preprint arXiv:2203.13427 (2022)
Xu, L., Du, J., Li, Q.: Image fusion based on nonsubsampled contourlet transform and saliency-motivated pulse coupled neural networks. Math. Prob. Eng. 2013 (2013)
Google Scholar
Yang, Y., Lao, D., Sundaramoorthi, G., Soatto, S.: Phase consistent ecological domain adaptation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 9011–9020 (2020)
Google Scholar
Yang, Y., Soatto, S.: FDA: fourier domain adaptation for semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 4085–4095 (2020)
Google Scholar
Yin, Z., Liang, K., Ma, Z., Guo, J.: Duplex contextual relation network for polyp segmentation. In: 2022 IEEE 19th International Symposium on Biomedical Imaging (ISBI), pp. 1–5. IEEE (2022)
Google Scholar
Zhang, P., Zhang, B., Zhang, T., Chen, D., Wang, Y., Wen, F.: Prototypical pseudo label denoising and target structure learning for domain adaptive semantic segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 12414–12424 (2021)
Google Scholar
Zhang, X., Zhang, H., Lu, J., Shao, L., Yang, J.: Target-targeted domain adaptation for unsupervised semantic segmentation. In: 2021 IEEE International Conference on Robotics and Automation (ICRA), pp. 13560–13566. IEEE (2021)
Google Scholar
Zhu, C., Zhang, X., Li, Y., Qiu, L., Han, K., Han, X.: Sharpcontour: a contour-based boundary refinement approach for efficient and accurate instance segmentation. arXiv preprint arXiv:2203.13312 (2022)
Zhu, J.Y., Park, T., Isola, P., Efros, A.A.: Unpaired image-to-image translation using cycle-consistent adversarial networks. In: Proceedings of the IEEE International Conference on Computer Vision, pp. 2223–2232 (2017)
Google Scholar

Download references

Acknowledgements

This work was supported in part by National Natural Science Foundation of China (NSFC) No. 61922015, 62106022, U19B2036, 62225601, and in part by Beijing Natural Science Foundation Project No. Z200002.

Author information

Authors and Affiliations

School of Artificial Intelligence, Beijing University of Posts and Telecommunications, Beijing, 100876, China
Xiangxuan Shan, Zijin Yin, Jiayi Gao, Kongming Liang, Zhanyu Ma & Jun Guo

Authors

Xiangxuan Shan
View author publications
You can also search for this author in PubMed Google Scholar
Zijin Yin
View author publications
You can also search for this author in PubMed Google Scholar
Jiayi Gao
View author publications
You can also search for this author in PubMed Google Scholar
Kongming Liang
View author publications
You can also search for this author in PubMed Google Scholar
Zhanyu Ma
View author publications
You can also search for this author in PubMed Google Scholar
Jun Guo
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kongming Liang .

Editor information

Editors and Affiliations

Tsinghua University, Beijing, China
Lu Fang
Xiaomi Inc., Beijing, China
Daniel Povey
Shanghai Jiao Tong University, Shanghai, China
Guangtao Zhai
JD Explore Academy, Beijing, China
Tao Mei
Chinese Academy of Sciences, Beijing, China
Ruiping Wang

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Shan, X., Yin, Z., Gao, J., Liang, K., Ma, Z., Guo, J. (2022). Unsupervised Domain Adaptation for Semantic Segmentation with Global and Local Consistency. In: Fang, L., Povey, D., Zhai, G., Mei, T., Wang, R. (eds) Artificial Intelligence. CICAI 2022. Lecture Notes in Computer Science(), vol 13604. Springer, Cham. https://doi.org/10.1007/978-3-031-20497-5_13

Download citation

DOI: https://doi.org/10.1007/978-3-031-20497-5_13
Published: 17 December 2022
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-20496-8
Online ISBN: 978-3-031-20497-5
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Unsupervised Domain Adaptation for Semantic Segmentation with Global and Local Consistency