Abstract
The distribution direction of aerial objects is arbitrary compared to objects in natural images. However, the existing detectors identify and locate the targets by relying on the shared features, which leads to the contradiction of regression and classification tasks. To be specific, the classifier suppresses rotation-sensitive features, while the regressor relies on rotation-variable features. To address the above contradictions, a Spatial Dual Network (SD-Net) is proposed, which consists of two modules: Polarization Dual Pyramid Module (PDPM) and Spatial Coordinate Attention Module (SCAM). In the SCAM module, to be able to capture channel-related features and global spatial features in different directions, an attention module is built with different convolution kernels that slide in both horizontal and vertical directions. In addition, the polarization function in the Polarization Dual Pyramid Module can split features into features suitable for classification and regression tasks for use in the classifier and regressor of the network, enabling more refined detection. The experimental results on three remote sensing datasets (i.e., DOTA, UCAS-AOD, and HRSC2016) demonstrate that the proposed method achieves higher performance on detection tasks while maintaining high efficiency.
Similar content being viewed by others
References
Ding, J., Xue, N., Long, Y., Xia, G. S., & Lu, Q. (2019). Learning ROI transformer for oriented object detection in aerial images. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2019-June). https://doi.org/10.1109/CVPR.2019.00296
Fu, K., Chang, Z., Zhang, Y., Xu, G., Zhang, K., & Sun, X. (2020). Rotation-aware and multi-scale convolutional neural network for object detection in remote sensing images. ISPRS Journal of Photogrammetry and Remote Sensing, 161, 294–308. https://doi.org/10.1016/j.isprsjprs.2020.01.025
Girshick, R. (2015). Fast R-CNN. In 2015 IEEE international conference on computer vision (ICCV) (pp. 1440–1448). IEEE. https://doi.org/10.1109/ICCV.2015.169
Han, J., Ding, J., Li, J., & Xia, G.-S. (2020). Align deep features for oriented object detection. http://arxiv.org/abs/2008.09397
Han, J., Ding, J., Xue, N., & Xia, G.-S. (2021). ReDet: A rotation-equivariant detector for aerial object detection. http://arxiv.org/abs/2103.07733
Huang, Z., Li, W., Xia, X. G., & Tao, R. (2022). A general gaussian heatmap label assignment for arbitrary-oriented object detection. IEEE Transactions on Image Processing, 31, 1895–1910. https://doi.org/10.1109/TIP.2022.3148874
Jiang, Y., Zhu, X., Wang, X., Yang, S., Li, W., Wang, H., et al. (2017). R2CNN: Rotational region CNN for orientation robust scene text detection. http://arxiv.org/abs/1706.09579
Liao, M., Zhu, Z., Shi, B., Xia, G. S., & Bai, X. (2018). Rotation-sensitive regression for oriented scene text detection. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2018.00619
Lin, T. Y., Goyal, P., Girshick, R., He, K., & Dollar, P. (2020). Focal loss for dense object detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 42(2), 318–327. https://doi.org/10.1109/TPAMI.2018.2858826
Liu, G., Zhang, Y., Zheng, X., Sun, X., Fu, K., & Wang, H. (2013). A new method on inshore ship detection in high-resolution satellite images using shape and context information. IEEE Geoscience and Remote Sensing Letters, 11(3), 617–621. https://doi.org/10.1109/LGRS.2013.2272492
Liu, W., Anguelov, D., Erhan, D., Szegedy, C., Reed, S., Fu, C. Y., & Berg, A. C. (2016). SSD: Single shot multibox detector. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) (Vol. 9905 LNCS). https://doi.org/10.1007/978-3-319-46448-0_2
Liu, Z., Yuan, L., Weng, L., & Yang, Y. (2017). A high resolution optical satellite image dataset for ship recognition and some new baselines. In ICPRAM 2017—Proceedings of the 6th International Conference on Pattern Recognition Applications and Methods (Vol. 2017-January). https://doi.org/10.5220/0006120603240331
Ming, Q., Miao, L., Zhou, Z., & Dong, Y. (2021). CFC-Net: A critical feature capturing network for arbitrary-oriented object detection in remote sensing images. IEEE Transactions on Geoscience and Remote Sensing, 60, 1–14. https://doi.org/10.1109/TGRS.2021.3095186
Redmon, J., Divvala, S., Girshick, R., & Farhadi, A. (2016). You only look once: Unified, real-time object detection. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2016-December). https://doi.org/10.1109/CVPR.2016.91
Ren, S., He, K., Girshick, R., & Sun, J. (2017). Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Transactions on Pattern Analysis and Machine Intelligence, 39(6), 1137–1149. https://doi.org/10.1109/TPAMI.2016.2577031
Wang, J., Chen, K., Yang, S., Loy, C. C., & Lin, D. (2019). Region proposal by guided anchoring. In Proceedings of the IEEE computer society conference on computer vision and pattern recognition (Vol. 2019-June, pp. 2960–2969). IEEE Computer Society https://doi.org/10.1109/CVPR.2019.00308
Xia, G.-S., Bai, X., Ding, J., Zhu, Z., Belongie, S., Luo, J., et al. (2017). DOTA: A large-scale dataset for object detection in aerial images. http://arxiv.org/abs/1711.10398
Xu, Y., Fu, M., Wang, Q., Wang, Y., Chen, K., Xia, G. S., & Bai, X. (2021). Gliding Vertex on the Horizontal Bounding Box for Multi-Oriented Object Detection. IEEE Transactions on Pattern Analysis and Machine Intelligence, 43(4), 1452–1459. https://doi.org/10.1109/TPAMI.2020.2974745
Yang, X., & Yan, J. (2020). On the arbitrary-oriented object detection: classification based approaches revisited. http://arxiv.org/abs/2003.05597
Yang, X., Yan, J., Feng, Z., & He, T. (2019a). R3Det: refined single-stage detector with feature refinement for rotating object. http://arxiv.org/abs/1908.05612
Yang, X., Yang, J., Yan, J., Zhang, Y., Zhang, T., Guo, Z., et al. (2019b). SCRDet: Towards more robust detection for small, cluttered and rotated objects. In Proceedings of the IEEE International Conference on Computer Vision (Vol. 2019b-October, pp. 8231–8240). Institute of Electrical and Electronics Engineers Inc. https://doi.org/10.1109/ICCV.2019.00832
Zhang, G., Lu, S., & Zhang, W. (2019). CAD-net: A context-aware detection network for objects in remote sensing imagery. IEEE Transactions on Geoscience and Remote Sensing, 57(12), 10015–10024. https://doi.org/10.1109/TGRS.2019.2930982
Zhang, Z., Guo, W., Zhu, S., & Yu, W. (2018). Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks. IEEE Geoscience and Remote Sensing Letters, 15(11), 1745–1749. https://doi.org/10.1109/LGRS.2018.2856921
Zhu, C., Zhou, H., Wang, R., & Guo, J. (2010). A novel hierarchical method of ship detection from spaceborne optical image based on shape and texture features. IEEE Transactions on Geoscience and Remote Sensing, 48(9), 3446–3456. https://doi.org/10.1109/TGRS.2010.2046330
Zhu, H., Chen, X., Dai, W., Fu, K., Ye, Q., & Jiao, J. (2015). Orientation robust object detection in aerial images using deep convolutional neural network. In Proceedings—international conference on image processing, ICIP (Vol. 2015-December). https://doi.org/10.1109/ICIP.2015.7351502
Funding
This research was funded by the National Natural Science Foundation of China (Grant No. 61971006).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare no conflicts of interest.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Gao, Y., Bi, F., Chen, L. et al. SD-Net: Spatial Dual Network for Aerial Object Detection. J Indian Soc Remote Sens 51, 2067–2076 (2023). https://doi.org/10.1007/s12524-023-01750-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12524-023-01750-9