Skip to main content

Improved Remote Sensing Image Rotating Target Detection Algorithm Based on Transformer

  • Conference paper
  • First Online:
Proceedings of the 2nd International Conference on Internet of Things, Communication and Intelligent Technology (IoTCIT 2023)

Abstract

As satellite remote sensing and aerial photography technologies continue to advance in recent years, there has been a noticeable increase in both the resolution and image quality of remote sensing images. Furthermore, an abundance of data sources has emerged, intensifying the challenges associated with detection. To address the challenges posed by small object size and dense distribution in remote sensing images, an innovative solution has been introduced. This solution entails an enhanced rotating object detection algorithm which leverages the power of vision Transformer technology. By utilizing this approach, the aim is to overcome the limitations of poor robustness and low detection accuracy commonly encountered in such scenarios.The enhancement of the feature extraction capability of the detection algorithm in YOLOv4’s feature fusion part is achieved through the introduction of the MS-Transformer module. This module, known for its self-attention mechanism, facilitates the acquisition of pertinent information among targets, thereby bolstering the algorithm’s ability to detect densely distributed targets. Moreover, the advancement of the five-coordinate YOLOv4 object detection framework enables the realization of multi-angle remote sensing object detection. To mitigate the issue of overlapping prediction frames on dense targets, the model incorporates the soft-NMS suppression method, ultimately refining the detection performance. The efficacy of the proposed algorithm in improving the model’s detection capability is substantiated through experimentation using the DOTA dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 299.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 379.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Kou, Q., Cheng, D., Zhuang, H., Gao, R.: Cross-complementary local binary pattern for robust texture classification. IEEE Signal Process. Lett. 26(1), 129–133 (2019)

    Article  Google Scholar 

  2. Cheng, D., Chen, L., Lv, C., Guo, L., Kou, Q.: Light-guided and cross-fusion U-Net for anti-illumination image super-resolution. IEEE Trans. Circuits Syst. Video Technol. 32(12), 8436–8449 (2022)

    Article  Google Scholar 

  3. Bochkovskiy, A., Wang, C.Y., Liao, H.Y.M.: Yolov4: optimal speed and accuracy of object detection. arXiv preprint: arXiv:2004.10934 (2020)

    Google Scholar 

  4. Xia, G., et al.: DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE International Conference on Computer Vision and Pattern Recognition, pp. 3974–3983 (2018)

    Google Scholar 

  5. Wei, H., et al.: Oriented objects as pairs of middle lines. ISPRS J. Photogramm. Remote. Sens. 169, 268–279 (2020)

    Article  Google Scholar 

  6. Lin, T., et al.: Focal loss for dense object detection. IEEE Trans. Pattern Anal. Mach. Intell. 42(2), 318–327 (2017)

    Article  Google Scholar 

  7. Wang, J., et al.: Learning center probability mAP for detecting objects in aerial images. IEEE Trans. Geosci. Remote Sens. 59(5), 4307–4323 (2020)

    Article  Google Scholar 

  8. Wang, J., Ding, J., Guo, H., Cheng, W., Pan, T., Yang, W.: Mask OBB: a semantic attention-based mask oriented bounding box representation for multi-category object detection in aerial images. Remote Sens. 11(24), 2930–2951 (2019)

    Article  Google Scholar 

Download references

Acknowledgement

Partly funded by the Jining Key Research and Development Program, this work received support. (2021JNZY013).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Bin Luan .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Hui, S., Wang, P., Luan, B., Zhao, X., Ma, S. (2024). Improved Remote Sensing Image Rotating Target Detection Algorithm Based on Transformer. In: Dong, J., Zhang, L., Cheng, D. (eds) Proceedings of the 2nd International Conference on Internet of Things, Communication and Intelligent Technology. IoTCIT 2023. Lecture Notes in Electrical Engineering, vol 1197. Springer, Singapore. https://doi.org/10.1007/978-981-97-2757-5_60

Download citation

  • DOI: https://doi.org/10.1007/978-981-97-2757-5_60

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-97-2756-8

  • Online ISBN: 978-981-97-2757-5

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics