Abstract
Ship detection is a meaningful and challenging task in processing of remote sensing image data. Specifically, the main challenges faced by ship detection in remote sensing images (RSIs) include the variable direction and dense arrangement of ships, as well as the complex nearshore scenes. To address the above challenges, this article proposes an arbitrary-oriented ship detection network based on Kullback-Leibler divergence (KLD) regression. Firstly, a coordinate attention module (CAM) is designed to capture direction-aware and position-sensitive features, which enhances the attention to ships in complex scenes. Meanwhile, a reinforced feature fusion network (RFF-Net) combined with CAM is constructed to obtain fusion features containing rich semantic and location information. Then, the orientation-invariant model (OIM) is introduced to generate depth rotation-invariant features, which improves the adaptability of the network to arbitrary-oriented ships. Finally, a regression loss function based on KLD is defined to solve the problem of boundary discontinuity and realize the accurate detection of densely arranged ships. The experimental results demonstrate that the proposed method reaches 89.87% and 83.62% average precision (AP) on the HRSC2016 and DOTA Ship data sets, respectively, achieving state-of-the-art ship detection performance.
Similar content being viewed by others
Availability of data and materials
Data sharing not applicable to this article as no datasets were generated or analysed during the current study.
References
Deng Z, Sun H, Zhou S, Zhao J (2019) Learning deep ship detector in sar images from scratch. IEEE Trans Geosci Remote Sens 57(6):4021–4039
Shuai T, Sun K, Wu X, Zhang X, Shi B (2016) A ship target automatic detection method for high-resolution remote sensing. In: 2016 IEEE international geoscience and remote sensing symposium (IGARSS), pp 1258–1261. IEEE
Song Z, Sui H, Wang Y (2014) Automatic ship detection for optical satellite images based on visual attention model and lbp. In: 2014 IEEE workshop on electronics, computer and applications, pp 722–725. IEEE
Li-Bing J, Zhuang W, Wei-Dong H (2011) An aiac-based inshore ship target detection approach. Remote Sens Technol Appl 22(1):88–94
Zhu C, Zhou H, Wang R, Guo J (2010) A novel hierarchical method of ship detection from space-borne optical image based on shape and texture features. IEEE Trans Geosci Remote Sens 48(9):3446–3456
Nie X, Duan M, Ding H, Hu B, Wong EK (2020) Attention mask r-cnn for ship detection and segmentation from remote sensing images. IEEE Access 8:9325–9334
Zhao J, Zhang X, Yang L, Ma S, Wang Y, Dong Y, Sun M, Chen C (2020) Ship detection in remote sensing images based on deep learning. Sci Surv Mapp 45(3):110–116
Shi W-X, Jiang J-H, Bao S-L (2020) Ship detection method in remote sensing image based on feature fusion. Acta Photo Sin 49(7):0710004
Chen L, Shi W, Deng D (2021) Improved yolov3 based on attention mechanism for fast and accurate ship detection in optical remote sensing images. Remote Sens 13(4):660
Zhong W, Guo F, Xiang S, Pan C (2019) Ship detection in remote sensing based with rotated rectangular region. J Comput-Aided Des Comput Graph 31(11):1935–1945
Yang X, Sun H, Fu K, Yang J, Sun X, Yan M, Guo Z (2018) Automatic ship detection in remote sensing images from google earth of complex scenes based on multiscale rotation dense feature pyramid networks. Remote Sens 10(1):132
Liu W, Ma L, Chen H (2018) Arbitrary-oriented ship detection framework in optical remote-sensing images. IEEE Geosci Remote Sens Lett 15(6):937–941
Li L, Zhou Z, Wang B, Miao L, Zong H (2020) A novel cnn-based method for accurate ship detection in hr optical remote sensing images via rotated bounding box. IEEE Trans Geosci Remote Sens 59(1):686–699
Han Y, Yang X, Pu T, Peng Z (2021) Fine-grained recognition for oriented ship against complex scenes in optical remote sensing images. IEEE Trans Geosci Remote Sens 60:1–18
Lin TY, Goyal P, Girshick R, He K, Dollár P (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision, pp 2980–2988
Girshick R, Donahue J, Darrell T, Malik J (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 580–587
He K, Zhang X, Ren S, Sun J (2015) Spatial pyramid pooling in deep convolutional networks for visual recognition. IEEE Trans Pattern Anal Mach Intell 37(9):1904–1916
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Adv Neural Inf Process Syst 28
Dai J, Li Y, He K, Sun J (2016) R-fcn: object detection via region-based fully convolutional networks. Adv Neural Inf Process Syst 29
Redmon J, Divvala S, Girshick R, Farhadi A (2016) You only look once: Unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 779–788
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: single shot multibox detector. In: Computer vision–ECCV 2016: 14th European conference, Amsterdam, The Netherlands, October 11–14, 2016, proceedings, Part I 14, pp 21–37. Springer
Zhang S, Wen L, Bian X, Lei Z, Li SZ (2018) Single-shot refinement neural network for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 4203–4212
Fu CY, Liu W, Ranga A, Tyagi A, Berg AC (2017) Dssd: deconvolutional single shot detector. arXiv preprint arXiv:1701.06659
Ma J, Shao W, Ye H, Wang L, Wang H, Zheng Y, Xue X (2018) Arbitrary-oriented scene text detection via rotation proposals. IEEE Trans Multimedia 20(11):3111–3122
Liu Z, Hu J, Weng L, Yang Y (2017) Rotated region based cnn for ship detection. In: 2017 IEEE international conference on image processing (ICIP), pp 900–904. IEEE
Ding J, Xue N, Long Y, Xia GS, Lu Q (2019) Learning roi transformer for oriented object detection in aerial images. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 2849–2858
Zhang G, Lu S, Zhang W (2019) Cad-net: a context-aware detection network for objects in remote sensing imagery. IEEE Trans Geosci Remote Sens 57(12):10015–10024
Zhou X, Yao C, Wen H, Wang Y, Zhou S, He W, Liang J (2017) East: an efficient and accurate scene text detector. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5551–5560
Liao M, Shi B, Bai X (2018) Textboxes++: a single-shot oriented scene text detector. IEEE Trans Image Process 27(8):3676–3690
Yang X, Yan J, Feng Z, He T (2021) R3det: refined single-stage detector with feature refinement for rotating object. Proc AAAI Conf Artif Intell 35:3163–3171
Yang X, Yan J (2020) Arbitrary-oriented object detection with circular smooth label. In: Computer vision–ECCV 2020: 16th European conference, Glasgow, UK, August 23–28, 2020, proceedings, Part VIII 16, pp 677–694. Springer
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 770–778
Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 2117–2125
Zhou Y, Ye Q, Qiu Q, Jiao J (2017) Oriented response networks. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 519–528
Ramachandran P, Zoph B, Le QV (2017) Searching for activation functions. arXiv preprint arXiv:1710.05941
Liu Z, Yuan L, Weng L, Yang Y (2017) A high resolution optical satellite image dataset for ship recognition and some new baselines. In: ICPRAM, pp 324–331
Xia GS, Bai X, Ding J, Zhu Z, Belongie S, Luo J, Datcu M, Pelillo M, Zhang L (2018) Dota: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 3974–3983
Jiang Y, Zhu X, Wang X, Yang S, Li W, Wang H, Fu P, Luo Z R2cnn: rotational region cnn for orientation robust scene text detection. arXiv:1706.09579
Lin Y, Feng P, Guan J, Wang W, Chambers J (2019) Ienet: interacting embranchment one stage anchor free detector for orientation aerial object detection. arXiv preprint arXiv:1912.00969
Zhang Z, Guo W, Zhu S, Yu W (2018) Toward arbitrary-oriented ship detection with rotated region proposal and discrimination networks. IEEE Geosci Remote Sens Lett 15(11):1745–1749
Liao M, Zhu Z, Shi B, Xia GS, Bai X (2018) Rotation-sensitive regression for oriented scene text detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp 5909–5918
Yang X, Yang J, Yan J, Zhang Y, Zhang T, Guo Z, Sun X, Fu K (2019) Scrdet: towards more robust detection for small, cluttered and rotated objects. In: Proceedings of the IEEE/CVF international conference on computer vision, pp 8232–8241
Funding
This work was supported by the National Natural Science Foundation of China [Grant No.: 61901081]; China Postdoctoral Science Foundation [Grant No.: 2020M680927]; Fundamental Research Funds for the Central Universities [Grant No.: 3132022237].
Author information
Ethics declarations
Conflict of interest
The authors have no relevant financial or non-financial interests to disclose.
Additional information
Communicated by: H. Babaie.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Chen, Y., Wang, J., Zhang, Y. et al. Arbitrary-oriented ship detection based on Kullback-Leibler divergence regression in remote sensing images. Earth Sci Inform 16, 3243–3255 (2023). https://doi.org/10.1007/s12145-023-01088-3
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s12145-023-01088-3