Abstract
As satellite remote sensing and aerial photography technologies continue to advance in recent years, there has been a noticeable increase in both the resolution and image quality of remote sensing images. Furthermore, an abundance of data sources has emerged, intensifying the challenges associated with detection. To address the challenges posed by small object size and dense distribution in remote sensing images, an innovative solution has been introduced. This solution entails an enhanced rotating object detection algorithm which leverages the power of vision Transformer technology. By utilizing this approach, the aim is to overcome the limitations of poor robustness and low detection accuracy commonly encountered in such scenarios.The enhancement of the feature extraction capability of the detection algorithm in YOLOv4’s feature fusion part is achieved through the introduction of the MS-Transformer module. This module, known for its self-attention mechanism, facilitates the acquisition of pertinent information among targets, thereby bolstering the algorithm’s ability to detect densely distributed targets. Moreover, the advancement of the five-coordinate YOLOv4 object detection framework enables the realization of multi-angle remote sensing object detection. To mitigate the issue of overlapping prediction frames on dense targets, the model incorporates the soft-NMS suppression method, ultimately refining the detection performance. The efficacy of the proposed algorithm in improving the model’s detection capability is substantiated through experimentation using the DOTA dataset.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Kou, Q., Cheng, D., Zhuang, H., Gao, R.: Cross-complementary local binary pattern for robust texture classification. IEEE Signal Process. Lett. 26(1), 129–133 (2019)
Cheng, D., Chen, L., Lv, C., Guo, L., Kou, Q.: Light-guided and cross-fusion U-Net for anti-illumination image super-resolution. IEEE Trans. Circuits Syst. Video Technol. 32(12), 8436–8449 (2022)
Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint: arXiv:2004.10934 (2020)
Xia, G., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)
Wei, H., et al.: Oriented objects as pairs of middle lines. ISPRS J. Photogramm. Remote. Sens. 169, 268–279 (2020)
Lin, T., et al.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2017)
Wang, J., et al.: Learning center probability mAP for detecting objects in aerial images. IEEE Trans. Geosci. Remote Sens. 59(5), 4307–4323 (2020)
Wang, J., Ding, J., Guo, H., Cheng, W., Pan, T., Yang, W.: Mask OBB: a semantic attention-based mask oriented bounding box representation for multi-category object detection in aerial images. Remote Sens. 11(24), 2930–2951 (2019)
Acknowledgement
Partly funded by the Jining Key Research and Development Program, this work received support. (2021JNZY013).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Hui, S., Wang, P., Luan, B., Zhao, X., Ma, S. (2024). Improved Remote Sensing Image Rotating Target Detection Algorithm Based on Transformer. In: Dong, J., Zhang, L., Cheng, D. (eds) Proceedings of the 2nd International Conference on Internet of Things, Communication and Intelligent Technology. IoTCIT 2023. Lecture Notes in Electrical Engineering, vol 1197. Springer, Singapore. https://doi.org/10.1007/978-981-97-2757-5_60
Download citation
DOI: https://doi.org/10.1007/978-981-97-2757-5_60
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-97-2756-8
Online ISBN: 978-981-97-2757-5
eBook Packages: Computer ScienceComputer Science (R0)