Skip to main content
Log in

Improved YOLOv5 network method for remote sensing image-based ground objects recognition

  • Focus
  • Published:
Soft Computing Aims and scope Submit manuscript

Abstract

High-resolution remote sensing images have the characteristics of complex background environment, clustering of objects, etc., the complex background makes the remote sensing image contain a large number of irrelevant ground objects with a high similarity or overlap, which makes the edge and texture of the objects not clear enough, and this leads to low recognition accuracy of ground objects such as airports, dams, and golf field, although the size of this object is large. Based on this problem, this paper proposes a remote sensing image object detection method based on the YOLOv5 network. By improving the backbone extraction network, the network structure can be deepened to get more information about large objects, and the detection effect can be improved by adding an attention mechanism and adding an output layer to enhance feature extraction and feature fusion. The pre-training weight is obtained by transfer learning and used as the training weight of the improved YOLOv5 to speed up the network convergence. The experiment is carried out on the DIOR dataset, the results show that the improved YOLOv5 network can significantly improve the accuracy of large object recognition compared with the YOLO series network and the EfficientDet model on DIOR dataset, and the mAP of the improved YOLOv5 network is 80.5%, which is 2% higher than the original YOLOv5 network.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

The RSOD datasets generated during and/or analyzed during the current study are available in GitHub—RSIA-LIESMARS-WHU/RSOD-Dataset—an open dataset for object detection in remote sensing images. The DIOR datasets and NWPU VHR-10 dataset during and/or analyzed during the current study are not publicly available due to the link failure but are available from the corresponding author on reasonable request.

References

Download references

Funding

The authors declare that no funds, grants, or other support were received during the preparation of this manuscript.

Author information

Authors and Affiliations

Authors

Contributions

All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by JX. The first draft of the manuscript was written by JX, and all authors commented on previous versions of the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Yongguo Zheng or Ping Wang.

Ethics declarations

Conflict of interest

The authors have no relevant financial or non-financial interests to disclose.

Additional information

Communicated by Shah Nazir.

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Xue, J., Zheng, Y., Dong-Ye, C. et al. Improved YOLOv5 network method for remote sensing image-based ground objects recognition. Soft Comput 26, 10879–10889 (2022). https://doi.org/10.1007/s00500-022-07106-8

Download citation

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00500-022-07106-8

Keywords

Navigation