SD-Net: Spatial Dual Network for Aerial Object Detection

Gao, Yangte; Bi, Fukun; Chen, Liang; Nie, Xiaoyu

doi:10.1007/s12524-023-01750-9

SD-Net: Spatial Dual Network for Aerial Object Detection

Research Article
Published: 03 September 2023

Volume 51, pages 2067–2076, (2023)
Cite this article

Journal of the Indian Society of Remote Sensing Aims and scope Submit manuscript

Yangte Gao¹,
Fukun Bi²,
Liang Chen¹ &
…
Xiaoyu Nie ORCID: orcid.org/0000-0001-8999-300X³

113 Accesses
Explore all metrics

Abstract

The distribution direction of aerial objects is arbitrary compared to objects in natural images. However, the existing detectors identify and locate the targets by relying on the shared features, which leads to the contradiction of regression and classification tasks. To be specific, the classifier suppresses rotation-sensitive features, while the regressor relies on rotation-variable features. To address the above contradictions, a Spatial Dual Network (SD-Net) is proposed, which consists of two modules: Polarization Dual Pyramid Module (PDPM) and Spatial Coordinate Attention Module (SCAM). In the SCAM module, to be able to capture channel-related features and global spatial features in different directions, an attention module is built with different convolution kernels that slide in both horizontal and vertical directions. In addition, the polarization function in the Polarization Dual Pyramid Module can split features into features suitable for classification and regression tasks for use in the classifier and regressor of the network, enabling more refined detection. The experimental results on three remote sensing datasets (i.e., DOTA, UCAS-AOD, and HRSC2016) demonstrate that the proposed method achieves higher performance on detection tasks while maintaining high efficiency.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Anchor Free remote sensing detector based on solving discrete polar coordinate equation

Article 18 September 2023

DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images

Article 28 September 2023

MS-YOLO: integration-based multi-subnets neural network for object detection in aerial images

Article 12 March 2024

References

Ding, J., Xue, N., Long, Y., Xia, G. S., & Lu, Q. (2019). Learning ROI transformer for oriented object detection in aerial images. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2019-June). https://doi.org/10.1109/CVPR.2019.00296
Fu, K., Chang, Z., Zhang, Y., Xu, G., Zhang, K., & Sun, X. (2020). Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images. ISPRS Journal of Photogrammetry and Remote Sensing, 161, 294–308. https://doi.org/10.1016/j.isprsjprs.2020.01.025
Article Google Scholar
Girshick, R. (2015). Fast R-CNN. In 2015 IEEE international conference on computer vision (ICCV) (pp. 1440–1448). IEEE. https://doi.org/10.1109/ICCV.2015.169
Han, J., Ding, J., Li, J., & Xia, G.-S. (2020). Align deep features for oriented object detection. http://arxiv.org/abs/2008.09397
Han, J., Ding, J., Xue, N., & Xia, G.-S. (2021). ReDet: A rotation-equivariant detector for aerial object detection. http://arxiv.org/abs/2103.07733
Huang, Z., Li, W., Xia, X. G., & Tao, R. (2022). A general gaussian heatmap label assignment for arbitrary-oriented object detection. IEEE Transactions on Image Processing, 31, 1895–1910. https://doi.org/10.1109/TIP.2022.3148874
Article Google Scholar
Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., et al. (2017). R2CNN: Rotational region CNN for orientation robust scene text detection. http://arxiv.org/abs/1706.09579
Liao, M., Zhu, Z., Shi, B., Xia, G. S., & Bai, X. (2018). Rotation-sensitive regression for oriented scene text detection. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2018.00619
Lin, T. Y., Goyal, P., Girshick, R., He, K., & Dollar, P. (2020). Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(2), 318–327. https://doi.org/10.1109/TPAMI.2018.2858826
Article Google Scholar
Liu, G., Zhang, Y., Zheng, X., Sun, X., Fu, K., & Wang, H. (2013). A new method on inshore ship detection in high-resolution satellite images using shape and context information. IEEE Geoscience and Remote Sensing Letters, 11(3), 617–621. https://doi.org/10.1109/LGRS.2013.2272492
Article Google Scholar
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016). SSD: Single shot multibox detector. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9905 LNCS). https://doi.org/10.1007/978-3-319-46448-0_2
Liu, Z., Yuan, L., Weng, L., & Yang, Y. (2017). A high resolution optical satellite image dataset for ship recognition and some new baselines. In ICPRAM 2017—Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods (Vol. 2017-January). https://doi.org/10.5220/0006120603240331
Ming, Q., Miao, L., Zhou, Z., & Dong, Y. (2021). CFC-Net: A critical feature capturing network for arbitrary-oriented object detection in remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–14. https://doi.org/10.1109/TGRS.2021.3095186
Article Google Scholar
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2016-December). https://doi.org/10.1109/CVPR.2016.91
Ren, S., He, K., Girshick, R., & Sun, J. (2017). Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031
Article Google Scholar
Wang, J., Chen, K., Yang, S., Loy, C. C., & Lin, D. (2019). Region proposal by guided anchoring. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2019-June, pp. 2960–2969). IEEE Computer Society https://doi.org/10.1109/CVPR.2019.00308
Xia, G.-S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., et al. (2017). DOTA: A large-scale dataset for object detection in aerial images. http://arxiv.org/abs/1711.10398
Xu, Y., Fu, M., Wang, Q., Wang, Y., Chen, K., Xia, G. S., & Bai, X. (2021). Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(4), 1452–1459. https://doi.org/10.1109/TPAMI.2020.2974745
Article Google Scholar
Yang, X., & Yan, J. (2020). On the arbitrary-oriented object detection: classification based approaches revisited. http://arxiv.org/abs/2003.05597
Yang, X., Yan, J., Feng, Z., & He, T. (2019a). R3Det: refined single-stage detector with feature refinement for rotating object. http://arxiv.org/abs/1908.05612
Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., et al. (2019b). SCRDet: Towards more robust detection for small, cluttered and rotated objects. In Proceedings of the IEEE International Conference on Computer Vision (Vol. 2019b-October, pp. 8231–8240). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCV.2019.00832
Zhang, G., Lu, S., & Zhang, W. (2019). CAD-net: A context-aware detection network for objects in remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing, 57(12), 10015–10024. https://doi.org/10.1109/TGRS.2019.2930982
Article Google Scholar
Zhang, Z., Guo, W., Zhu, S., & Yu, W. (2018). Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks. IEEE Geoscience and Remote Sensing Letters, 15(11), 1745–1749. https://doi.org/10.1109/LGRS.2018.2856921
Article Google Scholar
Zhu, C., Zhou, H., Wang, R., & Guo, J. (2010). A novel hierarchical method of ship detection from spaceborne optical image based on shape and texture features. IEEE Transactions on Geoscience and Remote Sensing, 48(9), 3446–3456. https://doi.org/10.1109/TGRS.2010.2046330
Article Google Scholar
Zhu, H., Chen, X., Dai, W., Fu, K., Ye, Q., & Jiao, J. (2015). Orientation robust object detection in aerial images using deep convolutional neural network. In Proceedings—international conference on image processing, ICIP (Vol. 2015-December). https://doi.org/10.1109/ICIP.2015.7351502

Download references

Funding

This research was funded by the National Natural Science Foundation of China (Grant No. 61971006).

Author information

Authors and Affiliations

School of Information and Electronics, Beijing Institute of Technology, Beijing, China
Yangte Gao & Liang Chen
School of Information, North China University of Technology, Beijing, China
Fukun Bi
Party School of Liaoning Provincial Party Committee, Liaoning Academy of Governance, Shenyang, China
Xiaoyu Nie

Authors

Yangte Gao
View author publications
You can also search for this author in PubMed Google Scholar
Fukun Bi
View author publications
You can also search for this author in PubMed Google Scholar
Liang Chen
View author publications
You can also search for this author in PubMed Google Scholar
Xiaoyu Nie
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Xiaoyu Nie.

Ethics declarations

Conflict of interest

The authors declare no conflicts of interest.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

About this article

Cite this article

Gao, Y., Bi, F., Chen, L. et al. SD-Net: Spatial Dual Network for Aerial Object Detection. J Indian Soc Remote Sens 51, 2067–2076 (2023). https://doi.org/10.1007/s12524-023-01750-9

Download citation

Received: 25 July 2022
Accepted: 14 August 2023
Published: 03 September 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s12524-023-01750-9

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

SD-Net: Spatial Dual Network for Aerial Object Detection

Abstract

Access this article

Similar content being viewed by others

Anchor Free remote sensing detector based on solving discrete polar coordinate equation

DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images

MS-YOLO: integration-based multi-subnets neural network for object detection in aerial images

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

About this article

Cite this article

Keywords

Navigation

SD-Net: Spatial Dual Network for Aerial Object Detection

Abstract

Access this article

Similar content being viewed by others

Anchor Free remote sensing detector based on solving discrete polar coordinate equation

DMA-YOLO: multi-scale object detection method with attention mechanism for aerial images

MS-YOLO: integration-based multi-subnets neural network for object detection in aerial images

References

Funding

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Additional information

Publisher's Note

About this article

Cite this article

Share this article

Keywords

Search

Navigation