Semantic segmentation of high-resolution images

Wang, Juhong; Liu, Bin; Xu, Kun

doi:10.1007/s11432-017-9252-5

Semantic segmentation of high-resolution images

Moop
Published: 07 November 2017

Volume 60, article number 123101, (2017)
Cite this article

Science China Information Sciences Aims and scope Submit manuscript

Juhong Wang¹,
Bin Liu¹ &
Kun Xu¹

403 Accesses
13 Citations
Explore all metrics

Abstract

Image semantic segmentation is a research topic that has emerged recently. Although existing approaches have achieved satisfactory accuracy, they are limited to handling low-resolution images owing to their large memory consumption. In this paper, we present a semantic segmentation method for high-resolution images. First, we downsample the input image to a lower resolution and then obtain a low-resolution semantic segmentation image using state-of-the-art methods. Next, we use joint bilateral upsampling to upsample the low-resolution solution and obtain a high-resolution semantic segmentation image. To modify joint bilateral upsampling to handle discrete semantic segmentation data, we propose using voting instead of interpolation in filtering computation. Compared to state-of-the-art methods, our method significantly reduces memory cost without reducing result quality.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

High-dimensional features of adaptive superpixels for visually degraded images

Article 01 May 2019

Feng-feng Liao, Ke-ye Cao, … Sheng Liu

Superpixels with contour adherence via label expansion for image decomposition

Article 06 June 2022

Cheng Li, Wangpeng He, … Baolong Guo

Soft Cost Aggregation with Multi-resolution Fusion

References

Carneiro G, Chan A B, Moreno P J, et al. Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell, 2007, 29: 394–410
Article Google Scholar
Gould S, Fulton R, Koller D. Decomposing a scene into geometric and semantically consistent regions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Kyoto, 2009. 1–8
Google Scholar
Ren X, Bo L, Fox D. RGB-(D) scene labeling: features and algorithms. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, 2012. 2759–2766
Google Scholar
Farabet C, Couprie C, Najman L, et al. Learning hierarchical features for scene labeling. IEEE Trans Pattern Anal Mach Intell, 2013, 35: 1915–1929
Article Google Scholar
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 2015. 3431–3440
Google Scholar
Kopf J, Cohen M F, Lischinski D, et al. Joint bilateral upsampling. ACM Trans Graph, 2007, 26: 96
Article Google Scholar
Tomasi C, Manduchi R. Bilateral filtering for gray and color images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Bombay, 1998. 839–846
Google Scholar
Zhou B, Zhao H, Puig X, et al. Scene parsing through ADE20K dataset. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 2017
Google Scholar
Li X, Liu K, Dong Y. Superpixel-based foreground extraction with fast adaptive trimaps. IEEE Trans Cybern, 2017, doi: 10.1109/TCYB.2017.2747143
Google Scholar
Huang H, Fang X, Ye Y, et al. Practical automatic background substitution for live video. Comp Visual Media, 2017, 3: 273–284
Article Google Scholar
Li X, Liu K, Dong Y, et al. Patch alignment manifold matting. IEEE Trans Neural Netw Learn Syst, 2017, doi: 10.1109/TNNLS.2017.2727140
Google Scholar
Zheng Z H, Zhang H T, Zhang F L, et al. Image-based clothes changing system. Comput Vis Media, 2017, in press
Google Scholar
Maerki N, Perazzi F, Wang O, et al. Bilateral space video segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 2016. 743–751
Google Scholar

Download references

Acknowledgements

This work was supported by National Natural Science Foundation of China (Grant No. 61521002), a research grant from the Beijing Higher Institution Engineering Research Center, and the Tsinghua- Tencent Joint Laboratory for Internet Innovation Technology.

Author information

Authors and Affiliations

Department of Computer Science and Technology, Tsinghua University, Beijing, 100084, China
Juhong Wang, Bin Liu & Kun Xu

Authors

Juhong Wang
View author publications
You can also search for this author in PubMed Google Scholar
Bin Liu
View author publications
You can also search for this author in PubMed Google Scholar
Kun Xu
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Kun Xu.

Electronic supplementary material