Skip to main content

PFANet: A Network Improved by PSPNet for Semantic Segmentation of Street Scenes

  • Conference paper
  • First Online:
Communications, Signal Processing, and Systems (CSPS 2022)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 873))

Abstract

Streetscape scene understanding is an important task for semantic segmentation. In this paper, we propose a PFANet improved based on PSPNet for Street Semantic Segmentation. For further improving the feature fusion capability of our model, we added a feature fusion module to PSPNet to incorporate features of different dimensions. Meanwhile, we introduce an attention mechanism, using a combination of spatial attention mechanism and channel attention mechanism attention module to enhance the contextual dependencies of local features and the spatial interdependencies of features. We experimented on the Cityscapes dataset, and achieved a mIoU score of 80.38% on Cityscapes validation dataset which improved 0.68% than PSPNet.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 349.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 449.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 449.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

Similar content being viewed by others

References

  1. Davis, L.S., Rosenfeld, A., Weszka, J.S.: Region extraction by averaging and thresholding. IEEE Trans. Syst. Man Cybern. SMC-5(3), 383–388 (1975). https://doi.org/10.1109/TSMC.1975.5408419

  2. Senthilkumaran, N., Rajesh, R.: Edge detection techniques for image segmentation– a survey of soft computing approaches. Int. J. Recent Trends Eng. 1(2) (2009)

    Google Scholar 

  3. Ng, H.P., Ong, S.H., Foong, K.W.C., Goh, P.S., Nowinski, W.L.: Medical image segmentation using k-means clustering and improved watershed algorithm. In: 2006 IEEE Southwest Symposium on Image Analysis and Interpretation, Denver, CO, pp. 61–65 (2006)

    Google Scholar 

  4. Zhao, H., Shi, J., Qi, X., Wang, X., Jia, J.: Pyramid scene parsing network. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 2881–2890 (2017)

    Google Scholar 

  5. Chen, L., Papandreou, G., Kokkinos, I., Murphy, K., Yuille, A.L.: DeepLab: semantic image segmentation with deep convolutional nets, atrous convolution, and fully connected CRFs. CoRR, abs/1606.00915 (2016)

    Google Scholar 

  6. Chen, L.-C., Papandreou, G., Schroff, F., Adam, H.: Rethinking atrous convolution for semantic image segmentation. CoRR, abs/1706.05587 (2017)

    Google Scholar 

  7. Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49

    Chapter  Google Scholar 

  8. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

    Google Scholar 

  9. Wang, Q., Wu, B., Zhu, P., Li, P., Zuo, W., Hu, Q.: Eca-net: efficient channel attention for deep convolutional neural networks. In: IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2020)

    Google Scholar 

  10. Wang, X., Ross, G., Abhinav, G., He, K.: Non-local neural networks. In IEEE Conference on Computer Vision and Pattern Recognition (CVPR) (2018)

    Google Scholar 

  11. Cao, Y., Xu, J., Lin, S., Wei, F., Hu, H.: Gcnet: non-local networks meet squeeze-excitation networks and beyond. In: ICCV Workshops (2019)

    Google Scholar 

  12. Park, J., Woo, S., Lee, J.-Y., Kweon, I.S.: Bam: bottleneck attention module. In: British Machine Vision Conference (BMVC) (2018)

    Google Scholar 

  13. Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1

    Chapter  Google Scholar 

  14. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: CVPR (2016)

    Google Scholar 

  15. Long, J., Shelhamer, E., Darrell, T.: Fully convolutional networks for semantic segmentation. In: CVPR, pp. 3431–3440 (2015)

    Google Scholar 

  16. Ronneberger, O., Fischer, P., Brox, T.: U-Net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28

    Chapter  Google Scholar 

  17. Cordts, M., et al.: The cityscapes dataset for semantic urban scene understanding (2016)

    Google Scholar 

  18. MMSegmentation Contributors. MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark (2020). https://github.com/open-mmlab/mmsegmentation

  19. He, T., et al.: Bag of tricks for image classification with convolutional neural networks. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (2019)

    Google Scholar 

  20. Zhou, Z., RahmanSiddiquee, M.M., Tajbakhsh, N., Liang, J.: Unet++: a nested u-net architecture for medical image segmentation. In: Stoyanov, Danail, et al. (eds.) DLMIA/ML-CDS -2018. LNCS, vol. 11045, pp. 3–11. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-00889-5_1

  21. Sun, K., et al.: High-resolution representations for labeling pixels and regions arXiv:1904.04514 (2019)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yan Li .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Ge, J., Li, Y., Jiu, M., Cheng, Z., Zhang, J. (2023). PFANet: A Network Improved by PSPNet for Semantic Segmentation of Street Scenes. In: Liang, Q., Wang, W., Liu, X., Na, Z., Zhang, B. (eds) Communications, Signal Processing, and Systems. CSPS 2022. Lecture Notes in Electrical Engineering, vol 873. Springer, Singapore. https://doi.org/10.1007/978-981-99-1260-5_16

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-1260-5_16

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-1259-9

  • Online ISBN: 978-981-99-1260-5

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics