AM-PSPNet: Pyramid Scene Parsing Network Based on Attentional Mechanism for Image Semantic Segmentation

Wu, Dikang; Zhao, Jiamei; Wang, Zhifang

doi:10.1007/978-981-19-5194-7_32

Part of the book series: Communications in Computer and Information Science (CCIS, volume 1628)

Included in the following conference series:

International Conference of Pioneering Computer Scientists, Engineers and Educators

Abstract

This paper proposes AM-PSPNet for image semantic segmentation. AM-PSPNet embeds the efficient channel attention (ECA) module in the feature extraction stage of the convolutional network so that, through end-to-end learning, the network pays more attention to channels with pronounced classification characteristics. To recognize object edges and small objects more effectively, AM-PSPNet introduces a deep guidance fusion (DGF) module that generates global contextual attention maps to guide the expression of shallow information. The mean intersection over union (mIoU) of the proposed algorithm reaches 78.8% on the Pascal VOC 2012 dataset and 69.1% on the Cityscapes dataset. Compared with four other network models, AM-PSPNet improves both accuracy and mIoU.
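
The overlap metric reported in the abstract is conventionally computed as the mean intersection over union (mIoU) from a confusion matrix. The following is a minimal sketch of that standard definition, not the authors' evaluation code. The function names, the toy label maps, and the `ignore_index=255` default (a common convention for unlabeled pixels in Pascal VOC and Cityscapes) are assumptions made for illustration.

```python
def confusion_matrix(pred, target, num_classes, ignore_index=255):
    """Count pixels per (ground-truth class, predicted class) pair.

    `pred` and `target` are flattened label maps of equal length.
    `ignore_index` is an assumed convention for unlabeled pixels.
    """
    matrix = [[0] * num_classes for _ in range(num_classes)]
    for p, t in zip(pred, target):
        if t == ignore_index:
            continue  # unlabeled pixels do not contribute to the score
        matrix[t][p] += 1
    return matrix


def mean_iou(matrix):
    """Average the per-class IoU = TP / (TP + FP + FN).

    Classes that appear in neither the prediction nor the ground truth
    are skipped so they do not distort the mean.
    """
    n = len(matrix)
    ious = []
    for c in range(n):
        tp = matrix[c][c]
        fn = sum(matrix[c]) - tp                       # missed pixels of class c
        fp = sum(matrix[r][c] for r in range(n)) - tp  # pixels wrongly labeled c
        denom = tp + fp + fn
        if denom > 0:
            ious.append(tp / denom)
    return sum(ious) / len(ious) if ious else 0.0


# Toy example: flattened 2x3 label maps with 3 classes (illustrative values only).
target = [0, 0, 1, 1, 2, 2]
pred = [0, 1, 1, 1, 2, 0]
miou = mean_iou(confusion_matrix(pred, target, num_classes=3))
print(f"mIoU = {miou:.3f}")
```

In this toy case the per-class IoU values are 1/3, 2/3, and 1/2. In practice the confusion matrix is accumulated over every image in the validation set before the per-class IoU is averaged, rather than averaging per-image scores.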

Author information

Authors and Affiliations

Department of Electronic Engineering, Heilongjiang University, Harbin, 150080, China
Dikang Wu, Jiamei Zhao & Zhifang Wang

Corresponding author

Correspondence to Zhifang Wang.

Editor information

Editors and Affiliations

Southwest Petroleum University, Chengdu, China
Yang Wang
University of Electronic Science and Technology of China, Chengdu, China
Guobin Zhu
Harbin Engineering University, Harbin, China
Qilong Han
Harbin Institute of Technology, Harbin, China
Hongzhi Wang
Harbin University of Science and Technology, Harbin, China
Xianhua Song
National Academy of Guo Ding Institute of Data Sciences, Beijing, China
Zeguang Lu

About this paper

Cite this paper

Wu, D., Zhao, J., Wang, Z. (2022). AM-PSPNet: Pyramid Scene Parsing Network Based on Attentional Mechanism for Image Semantic Segmentation. In: Wang, Y., Zhu, G., Han, Q., Wang, H., Song, X., Lu, Z. (eds) Data Science. ICPCSEE 2022. Communications in Computer and Information Science, vol 1628. Springer, Singapore. https://doi.org/10.1007/978-981-19-5194-7_32

DOI: https://doi.org/10.1007/978-981-19-5194-7_32
Published: 10 August 2022
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-5193-0
Online ISBN: 978-981-19-5194-7
eBook Packages: Computer Science, Computer Science (R0)
