Skip to main content
Log in

Aerial image detection and recognition system based on deep neural network

  • Original Paper
  • Published:
Aerospace Systems Aims and scope Submit manuscript

Abstract

Aerial image detection aims to find objects of interest and give their locations, and the goal of aerial image recognition is to classify the scenes or objects. Deep neural network is by far the best model for image detection and recognition. But still it fails to meet the requirements of high precision and low missing alarm. To solve this issue, we propose a two-step system including detection and fine-grained recognition modules. The first module is based on one-stage detection method YOLOv3. Furthermore, the metric-learning-based magnet loss is introduced to realize fine-grain recognition in the second step. The experiments prove the effectiveness of their combinations on improving precision and reducing missing alarm rate.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3

Similar content being viewed by others

References

  1. McKeown DM, Denlinger JL (1988) Cooperative methods for road tracking in aerial imagery. In: Proceedings CVPR'88: The Computer Society conference on computer vision and pattern recognition. IEEE Computer Society

  2. Kim T, Park SR, Kim MG, Jeong S, Kim KO (2004) Tracking road centerlines from high resolution remote sensing images by least squares correlation matching. Photogramm Eng Remote Sens 70(12):1417–1422

    Article  Google Scholar 

  3. Baatz M, Schäpe A (2000) Multiresolution segmentation: an optimization approach for high quality multi-scale image segmentation

  4. Drǎguţ L, Tiede D, Levick SR (2010) ESP: a tool to estimate scale parameter for multiresolution image segmentation of remotely sensed data. Int J Geogr Inf Sci 24(6):859–871

    Article  Google Scholar 

  5. Cheng G et al (2013) Object detection in remote sensing imagery using a discriminatively trained mixture model. ISPRS J Photogramm Remote Sens 85:32–43

    Article  Google Scholar 

  6. Shi Z, Yu X, Jiang Z, Li B (2014) Ship detection in high-resolution optical imagery based on anomaly detector and local shape feature. IEEE Trans Geosci Remote Sens 52(8):4511–4523

    Article  Google Scholar 

  7. Zhong P, Wang R (2007) A multiple conditional random fields ensemble model for urban area detection in remote sensing optical images. IEEE Trans Geosci Remote Sens 45(12):3978–3988

    Article  Google Scholar 

  8. Girshick R et al (2014) Rich feature hierarchies for accurate object detection and semantic segmentation. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  9. Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision

  10. Ren S et al (2015) Faster r-cnn: towards real-time object detection with region proposal networks. Advances in neural information processing systems

  11. Redmon J et al (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  12. Liu W et al (2016) Ssd: single shot multibox detector. In: European conference on computer vision. Springer, Cham

  13. Lin T-Y et al (2017) Focal loss for dense object detection. In: Proceedings of the IEEE international conference on computer vision

  14. Krizhevsky A, Sutskever I, Hinton GE (2017) Imagenet classification with deep convolutional neural networks. Commun ACM 60(6):84–90

    Article  Google Scholar 

  15. Simonyan K, Zisserman A (2014) Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556

  16. Szegedy C et al (2015) Going deeper with convolutions. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  17. Rippel O et al (2015) Metric learning with adaptive density discrimination. arXiv preprint arXiv:1511.05939

  18. Wang J et al (2014) Learning fine-grained image similarity with deep ranking. In: Proceedings of the IEEE conference on computer vision and pattern recognition

  19. Verma N et al (2012) Learning hierarchical similarity metrics. In: 2012 IEEE conference on computer vision and pattern recognition. IEEE

  20. Weinberger KQ, Saul LK (2009) Distance metric learning for large margin nearest neighbor classification. J Mach Learn Res 10(Feb):207–244

    MATH  Google Scholar 

  21. Hadsell R, Chopra S, LeCun Y (2006) Dimensionality reduction by learning an invariant mapping. In: 2006 IEEE Computer Society conference on computer vision and pattern recognition (CVPR’06), vol 2. IEEE

  22. Xia G et al (2018) DOTA: a large-scale dataset for object detection in aerial images. In: Proceedings of the IEEE conference on computer vision and pattern recognition

Download references

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Hongya Tuo or Zhongliang Jing.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zhang, S., Tuo, H., Zhong, H. et al. Aerial image detection and recognition system based on deep neural network. AS 4, 101–108 (2021). https://doi.org/10.1007/s42401-020-00077-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s42401-020-00077-4

Keywords

Navigation