Abstract
This study focuses on adding channel and spatial attention modules to a residual block for more effectively interpreting high-resolution remote sensing images. Three feature extraction networks are constructed through parallel and sequential combinations to select a network for high-resolution remote sensing images. Furthermore, these three networks are combined with classification, semantic segmentation, and object detection networks. The results of ablation and comparative experiments on different remote sensing data sets—NWPU-RESISC45, DeepGlobe, and DOTA—reveal that the three constructed methods have achieved good results in semantic segmentation and classification tasks. The construction method of sequential combination (spatial–channel) is the most improved. The effectiveness and general applicability of the proposed feature extraction networks are verified. These feature extraction networks can be used in classification, semantic segmentation, and object detection tasks for remote sensing images and can effectively improve the accuracy of the network.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Lecun, Y., Bottou, L., Bengio, Y., Haffner, P.: Gradient-based learning applied to document recognition. Proc. IEEE 86(11), 2278–2324 (1998). https://doi.org/10.1109/5.726791
Krizhevsky, A., Sutskever, I., Hinton, G.: ImageNet classification with deep convolutional neural networks. Adv. Neural Inform. Process. Syst. 25(2) (2012)
He, K., Zhang, X., Ren, S., et al.: Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 770–778 (2016)
Hu, J., Li, S., Sun, G., et al.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)
Wang, Q., Wu, B., Zhu, P., et al.: ECA-Net: efficient channel attention for deep convolutional neural networks. In: Proceedings of the CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, pp. 11531–11539 (2020)
Tao, A., Sapra, K., Catanzaro, B.: Hierarchical Multi-Scale Attention for Semantic Segmentation (2020). https://doi.org/10.48550/arXiv.2005.10821
Huang, H., Xu, K.-J., Shi, G.-Y.: Scene classification of high-resolution remote sensing image by multi-scale and multi-feature fusion. Acta Electron. Sin. 48(9), 1824–1833 (2020)
Che, S., Guo, R., Li, Z., Yang, J.: Attention mechanism combined with residual shrinkage network to classify remote sensing images. Appl. Res. Comput. 39(8), 2532–2537 (2022)
Cheng, X., Liu, L., Song, C.: A cyclic information-interaction model for remote sensing image segmentation. Remote Sens. 13(19), 3871 (2021)
Xia, G.-S., Bai, X., Ding, J., et al.: Dota: a large-scale dataset for object detection in aerial images. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition. IEEE (2018)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Zhang, X., Wang, Z., Wei, C., Zhang, J. (2024). SCANet: Spatial-Channel Attention Feature Extraction Network for Remote Sensing Images. In: Wang, Y., Zou, J., Xu, L., Ling, Z., Cheng, X. (eds) Signal and Information Processing, Networking and Computers. ICSINC 2023. Lecture Notes in Electrical Engineering, vol 1188. Springer, Singapore. https://doi.org/10.1007/978-981-97-2124-5_36
Download citation
DOI: https://doi.org/10.1007/978-981-97-2124-5_36
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-2123-8
Online ISBN: 978-981-97-2124-5
eBook Packages: EngineeringEngineering (R0)