Abstract
Water body extraction is a significant researching fields in remote sensing image interpretation. However, due to the great differences of water bodies in shapes and sizes, how to improve the accuracy of water body extraction is a research issue in recent years. In this paper, a gated convolution neural network based on pyramid split attention (PSAGNet) is proposed for water body extraction. The network includes two main modules: pyramid split attention module and gated convolution module. PSAGNet can extract small water bodies accurately in complex remote sensing scenes, because the gated convolution module can extract the shape features of small water bodies from the encoder’s shallow feature map. The pyramid split attention module can extract and fuse the high-order features of multi-scale water bodies, and provide valuable feature encoding for improving water body extraction accuracy. In addition, in order to address the problems of fuzzy boundary and imbalanced water and background in the dataset, a novel loss function called FE loss is proposed to train network. FE loss significantly sharpens the segmentation boundary and improves the extraction accuracy. Our model achieves the best performance with the precision of 93.52%, recall of 94.66%, and Intersection over Union (IoU) of 86.13% on Gaofen Image Dataset (GID). In general, our method achieves the purpose of accurately extracting small water bodies from remote sensing images, and can be widely applied in water body extraction of high-resolution remote sensing images.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Similar content being viewed by others
References
Mantzafleri, N., Psilovikos, A., Blanta, A.: Water quality monitoring and modeling in lake Kastoria, using GIS assessment and management of pollution sources. Water Resour. Manag. 23(15), 3221–3254 (2009). https://doi.org/10.1007/s11269-009-9431-4
Pawełczyk, A.: Assessment of health hazard associated with nitrogen compounds in water. Water Sci. Technol. 66(3), 666–672 (2012)
Haibo, Y.: Water body extraction methods study based on RS and GIS. In: 3rd International Conference on Environmental Science and Information Application Technology (ESIAT), pp. 2619–2624. Elsevier, Amsterdam (2011)
Long, J.: Fully convolutional networks for semantic segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(4), 640–651 (2017)
Gautam, S.: Cosine-similarity watershed algorithm for water-body segmentation applying deep neural network classifier. Environ. Earth Sci. 81(9), 251 (2022)
Gasnier, N., et al.: Narrow river extraction from SAR images using exogenous information. IEEE J. Select. Topics Appl. Earth Observ. Remote Sens. 14, 5720–5734 (2021)
Yuan, K., et al.: Deep-learning-based multispectral satellite image segmentation for water body detection. IEEE J. Select. Top. Appl. Earth Observ. Remote Sens. 14, 7422–7434 (2021)
Zhang, Z.: Rich CNN features for water-body segmentation from very high resolution aerial and satellite imagery. Remote Sens. 13(10), 1912 (2021)
Lu, M., et al.: NFANet: A novel method for weakly supervised water extraction from high-resolution remote-sensing imagery. IEEE Trans. Geosci. Remote Sens. 60, 1–14 (2022)
Li, L.: Water body extraction from very high spatial resolution remote sensing data based on fully convolutional networks. Remote Sens. 11(10), 1162 (2019)
Duan, L.: Multiscale refinement network for water-body segmentation in high-resolution satellite imagery. IEEE Geosci. Remote Sens. Lett. 17(4), 686–690 (2019)
Guo, H.: A Multi-scale water extraction convolutional neural network (MWEN) method for GaoFen-1 remote sensing images. In: International Conference on Geo-Information Technology and its Applications (ICGITA). MDPI, Nanchang (2020)
Ronneberger, O., Fischer, P., Brox, T.: U-Net: Convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Badrinarayanan, V.: Segnet: A deep convolutional encoder-decoder architecture for image segmentation. IEEE Trans. Pattern Anal. Mach. Intell. 39(12), 2481–2495 (2017)
Wang, B.: SADA-Net: A shape feature optimization and multiscale context information-based water body extraction method for high-resolution remote sensing images. IEEE J. Select. Top. Appl. Earth Observ. Remote Sens. 15, 1744–1759 (2022)
Zhao, H.: Pyramid Scene Parsing Network. In: 30th IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 6230–6239. IEEE, Honolulu (2017)
Chen, L.-C., Zhu, Y., Papandreou, G., Schroff, F., Adam, H.: Encoder-decoder with atrous separable convolution for semantic image segmentation. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 833–851. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_49
Kruthiventi, S.S., et al.: DeepFix: A fully convolutional neural network for predicting human eye fixations. IEEE Trans. Image Process. 26(9), 4446–4456 (2017)
Fu, J.: Dual attention network for scene segmentation. In: 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 3141–3149. IEEE, Long Beach (2014)
Mnih, V.: Recurrent models of visual attention. In: Proceedings of the 28th Conference on Neural Information Processing Systems (NIPS). NIPS, Montreal (2014)
Hu, J.: Squeeze-and-excitation networks. IEEE Trans. Pattern Anal. Mach. Intell. 42(8), 2011–2023 (2020)
Woo, S., Park, J., Lee, J.-Y., Kweon, I.S.: CBAM: Convolutional block attention module. In: Ferrari, V., Hebert, M., Sminchisescu, C., Weiss, Y. (eds.) ECCV 2018. LNCS, vol. 11211, pp. 3–19. Springer, Cham (2018). https://doi.org/10.1007/978-3-030-01234-2_1
Park, J.: Bottleneck attention module. arXiv:1807.06514 (2018)
Li, X.: Selective kernel networks. In: 32nd IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp. 510–519. IEEE, Long Beach (2019)
Van den Oord, A.: Conditional image generation with pixelcnn decoders. In: 30th Conference on Neural Information Processing Systems (NIPS), pp. 4790–4798. NIPS, Barcelona (2016)
Takikawa, T.: Gated-SCNN: Gated shape CNNs for semantic segmentation. In: IEEE/CVF International Conference on Computer Vision (ICCV), pp. 5228–5237. IEEE, Seoul (2019)
Tong, X.Y.: Land-cover classification with high-resolution remote sensing images using transferable deep models. Remote Sens. Environ. 237 (2020)
Zhang, H.: EPSANet: An Efficient Pyramid Squeeze Attention Block on Convolutional Neural Network. arXiv:2105.14447 (2021)
Acknowledgements
This research was supported in part by The Fundamental Research Funds for the Central Universities (Grant No. B210202080), Project of Water Science & Technology of Jiangsu Province (Grant No. 2021080).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Fang, Y. et al. (2023). PSAGNet: A Water Body Extraction Method for High Resolution Remote Sensing Images. In: You, P., Li, H., Chen, Z. (eds) Proceedings of International Conference on Image, Vision and Intelligent Systems 2022 (ICIVIS 2022). ICIVIS 2022. Lecture Notes in Electrical Engineering, vol 1019. Springer, Singapore. https://doi.org/10.1007/978-981-99-0923-0_26
Download citation
DOI: https://doi.org/10.1007/978-981-99-0923-0_26
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-0922-3
Online ISBN: 978-981-99-0923-0
eBook Packages: EngineeringEngineering (R0)