Image semantic segmentation is a research topic that has emerged recently. Although existing approaches have achieved satisfactory accuracy, they are limited to handling low-resolution images owing to their large memory consumption. In this paper, we present a semantic segmentation method for high-resolution images. First, we downsample the input image to a lower resolution and then obtain a low-resolution semantic segmentation image using state-of-the-art methods. Next, we use joint bilateral upsampling to upsample the low-resolution solution and obtain a high-resolution semantic segmentation image. To modify joint bilateral upsampling to handle discrete semantic segmentation data, we propose using voting instead of interpolation in filtering computation. Compared to state-of-the-art methods, our method significantly reduces memory cost without reducing result quality.
This is a preview of subscription content, access via your institution.
Buy single article
Instant access to the full article PDF.
Tax calculation will be finalised during checkout.
Carneiro G, Chan A B, Moreno P J, et al. Supervised learning of semantic classes for image annotation and retrieval. IEEE Trans Pattern Anal Mach Intell, 2007, 29: 394–410
Gould S, Fulton R, Koller D. Decomposing a scene into geometric and semantically consistent regions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Kyoto, 2009. 1–8
Ren X, Bo L, Fox D. RGB-(D) scene labeling: features and algorithms. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Providence, 2012. 2759–2766
Farabet C, Couprie C, Najman L, et al. Learning hierarchical features for scene labeling. IEEE Trans Pattern Anal Mach Intell, 2013, 35: 1915–1929
Long J, Shelhamer E, Darrell T. Fully convolutional networks for semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Boston, 2015. 3431–3440
Kopf J, Cohen M F, Lischinski D, et al. Joint bilateral upsampling. ACM Trans Graph, 2007, 26: 96
Tomasi C, Manduchi R. Bilateral filtering for gray and color images. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Bombay, 1998. 839–846
Zhou B, Zhao H, Puig X, et al. Scene parsing through ADE20K dataset. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Honolulu, 2017
Li X, Liu K, Dong Y. Superpixel-based foreground extraction with fast adaptive trimaps. IEEE Trans Cybern, 2017, doi: 10.1109/TCYB.2017.2747143
Huang H, Fang X, Ye Y, et al. Practical automatic background substitution for live video. Comp Visual Media, 2017, 3: 273–284
Li X, Liu K, Dong Y, et al. Patch alignment manifold matting. IEEE Trans Neural Netw Learn Syst, 2017, doi: 10.1109/TNNLS.2017.2727140
Zheng Z H, Zhang H T, Zhang F L, et al. Image-based clothes changing system. Comput Vis Media, 2017, in press
Maerki N, Perazzi F, Wang O, et al. Bilateral space video segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Las Vegas, 2016. 743–751
This work was supported by National Natural Science Foundation of China (Grant No. 61521002), a research grant from the Beijing Higher Institution Engineering Research Center, and the Tsinghua- Tencent Joint Laboratory for Internet Innovation Technology.
About this article
Cite this article
Wang, J., Liu, B. & Xu, K. Semantic segmentation of high-resolution images. Sci. China Inf. Sci. 60, 123101 (2017). https://doi.org/10.1007/s11432-017-9252-5
- image semantic segmentation
- high-resolution images
- joint bilateral upsampling