Abstract
Detecting aircraft in remote sensing images becomes increasingly important in both military and civilian fields. However, the accuracy of existing detection approach is not high enough especially for the small-scale aircraft when considering the size and scenario of the remote sensing images. To improve the accuracy of detecting small-scale aircraft, this paper proposes a detection approach for aircraft based on Faster-RCNN, called MFRC. Firstly, the K-means algorithm is used to cluster aircraft data in remote sensing images. Anchors are improved based on clustering results. Secondly, to extract location features of small-scale aircraft, the layer of pooling in the VGG16 network is reduced from four to two. Finally, the Soft-NMS algorithm is used to optimize the aircraft bounding boxes. In the experimentation, MFRC is evaluated under different conditions and compared with other models. The experimental results show that MFRC can detect small-scale aircraft effectively and the accuracy is improved by 3% compared to existing methods.
Similar content being viewed by others
References
Bodla N, Singh B, Chellappa R, Davis LS (2017) Soft-NMS–improving object detection with one line of code. In: Proceedings of the IEEE international conference on computer vision. https://doi.org/10.1109/ICCV.2017.593, pp 5561–5569
Cao C, Wu J, Zeng X, Feng Z, Wang T, Yan X, Wu Z, Wu Q, Huang Z (2020) Research on airplane and ship detection of aerial remote sensing images based on convolutional neural network. Sensors 20:4696
Chen F, Ren R, Xu W, van de Voorde T (2020) Airplane recognition from remote sensing images with deep convolutional neural network. In: IGARSS 2020 - 2020 IEEE International Geoscience and Remote Sensing Symposium. https://doi.org/10.1109/IGARSS39084.2020.9323517, pp 264–267
Girshick R (2015) Fast R-CNN. In: Proceedings of the IEEE International Conference on Computer Vision (ICCV), vol 2, pp 1440–1448
Girshick Ross, Donahue Jeff, Darrell Trevor, Malik Jitendra (2014) Rich feature hierarchies for accurate object de-tection and semantic segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recog-nition (CVPR), pp 580–587, 2, 4
Guo X, Chen L, Shen C (2016) Hierarchical adaptive deep convolution neural network and its application to bearing fault diagnosis. Measurement 93:490–502. https://doi.org/10.1016/j.measurement.2016.07.054
He K, Zhang X, Ren S, Sun J (2016) Deep residual learning for image recognition. In: 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). https://doi.org/10.1109/CVPR.2016.90, pp 770–778
Krishna K, Murty MN (1999) Genetic K-means algorithm. IEEE Transactions on Systems, Man, and Cybernetics Part B (Cybernetics) 29(3):433–439. https://doi.org/10.1109/3477.764879
Laroca R, Severo E, Zanlorensi LA, Oliveira LS, Gonçalves G R, Schwartz WR, Menotti D (2018) A robust real-time automatic license plate recognition based on the YOLO detector. In: 2018 international joint conference on neural networks (ijcnn). https://doi.org/10.1109/IJCNN.2018.8489629, pp 1–10
Li Y, Huang H, Xie Q, Yao L, Chen Q (2018) Research on a surface defect detection algorithm based on mobilenet-SSD. Appl Sci 8(9):2018. https://doi.org/10.3390/app8091678
Li X, Wang S, Jiang B, Chan X (2017) Airplane detection using convolutional neural networks in a coarse-to-fine manner. In: 2017 IEEE 2nd Information Technology, Networking, Electronic and Automation Control Conference (ITNEC). https://doi.org/10.1109/ITNEC.2017.8284943, pp 235–239
Li W, Xiang S, Wang H, Pan C (2011) Robust airplane detection in satellite images. In: 2011 18th IEEE International Conference on Image Processing. https://doi.org/10.1109/ICIP.2011.6116259, pp 2821–2824
Li B, Yan J, Wu W, Zhu Z, Hu X (2018) High performance visual tracking with siamese region proposal network. In: Proceedings of the IEEE conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2018.00935, pp 8971–8980
Lin TY, Dollár P, Girshick R, He K, Hariharan B, Belongie S (2017) Feature pyramid networks for object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2017.106, pp 2117–2125
Liu G, Sun X, Fu K, Wang H (2012) Aircraft recognition in high-resolution satellite images using coarse-to-fine shape prior. IEEE Geosci Remote Sens Lett 10(3):573–577
Nencini F, Garzelli A, Baronti S, Alparone L (2007) Remote sensing image fusion using the curvelet transform. Information Fusion 8(2):143–156. https://doi.org/10.1016/j.inffus.2006.02.001
Pedregosa F, Varoquaux G, Gramfort A, Michel V, Thirion B, Grisel O, Vanderplas J (2011) Scikit-learn: Machine learning in Python. The Journal of Machine Learning Research 12:2825–2830. https://doi.org/10.1524/auto.2011.0951
Qassim H, Verma A, Feinzimer D (2018) Compressed residual-VGG16 CNN model for big data places image recognition. In: 2018 IEEE 8th Annual Computing and Communication Workshop and Conference (CCWC). https://doi.org/10.1109/CCWC.2018.8301729, pp 169–175
Qiu Z, Zhuang Y, Yan F, Hu H, Wang W (2018) RGB-DI Images and full convolution neural network-based outdoor scene understanding for mobile robots. IEEE Trans Instrum Meas 68(1):27–37. https://doi.org/10.1109/TIM.2018.2834085
Ravì D, Wong C, Deligianni F, Berthelot M, Andreu-Perez J, Lo B, Yang GZ (2016) Deep learning for health informatics. IEEE Journal of Biomedical and Health Informatics 21(1):4–21. https://doi.org/10.1109/JBHI.2016.2636665
Redmon J, Divvala S, Girshick R, Al F (2016) You only look once: unified, real-time object de-tection. In: Proceedings of the IEEE conference on computer vision and pattern recognition (CVPR), vol 2, pp 779–788
Redmon J, Farhadi A (2017) YOLO9000: Better, faster,stronger. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), vol 2, pp 7263–7271
Redmon J, Farhadi A (2018) YOLOv3: An incremental improvement. arXiv:1804.02767, 2,4, 7, 11
Ren S, He K, Girshick R, Sun J (2015) Faster r-CNN: Towards real-time object detection with region proposal networks. In: Advances in Neural Information Processing Systems (NIPS), vol 2, pp 91–99
Ren S, He K, Girshick R, Sun J (2015) Faster r-cnn: Towards real-time object detection with region proposal networks. In: Advances in neural information processing systems. https://doi.org/10.1109/TPAMI.2016.2577031, pp 91–99
Sachdeva P, Trofymov D (2019) Object identification and labeling tool for training autonomous vehicle controllers. U.S. Patent 10,169,678
Suykens JA, Vandewalle J (1999) Least squares support vector machine classifiers. Neural Process Lett 9(3):293–300. https://doi.org/10.1023/A:1018628609742
Szegedy C, Ioffe S, Vanhoucke V, Alemi A (2016) Inception-v4, inception-resnet and the impact of residual connections on learning. arXiv:1602.07261
Szegedy C, Vanhoucke V, Ioffe S, Shlens J, Wojna Z (2016) Rethinking the inception architecture for computer vision. In: Proceedings of the IEEE conference on computer vision and pattern recognition. https://doi.org/10.1109/CVPR.2016.308, pp 2818–2826
Wei W, Zhang J (2019) Remote sensing image aircraft detection technology based on deep learning. In: 2019 11th International Conference on Intelligent Human-Machine Systems and Cybernetics (IHMSC), vol 1, pp 173–177
Williams CK, Barber D (1998) Bayesian classification with Gaussian processes. IEEE Trans Pattern Anal Mach Intell 20(12):1342–1351. https://doi.org/10.1109/34.735807
Yang F, Yu H, Silamut K, Maude RJ, Jaeger S, Antani S (2019) Parasite Detection in Thick Blood Smears Based on Customized faster-RCNN on Smartphones. In: 2019 IEEE Applied Imagery Pattern Recognition Workshop (AIPR). https://doi.org/10.1109/AIPR47015.2019.9174565, pp 1–4
Yang Y, Zhuang Y, Bi F, Shi H, Xie Y (2017) M-FCN: Effective fully convolutional network-based airplane detection framework. IEEE Geosci Remote Sens Lett 14(8):1293–1297. https://doi.org/10.1109/LGRS.2017.2708722
Yuille AL, Hallinan PW, Cohen DS (1992) Feature extraction from faces using deformable templates. Int J Comput Vis 8 (2):99–111. https://doi.org/10.1007/BF00127169
Zhang Y, Dong C (2021) MARS: Detecting brain Class/Method code smell based on Metric-Attention mechanism and residual network. Journal of Software: Evolution and Process E2403:1–15. https://doi.org/10.1002/smr.2403
Zhang L, Zhang Y (2016) Airport detection and aircraft recognition based on two-layer saliency model in high spatial resolution remote-sensing images. IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing 10(4):1511–1524. https://doi.org/10.1109/JSTARS.2016.2620900
Zhou L, Yan H, Shan Y, Zheng C, Liu Y, Zou X, Qiao B (2021) Aircraft detection for remote sensing images based on deep convolutional neural networks. Journal of Electrical and Computer Engineering, 1–16. https://doi.org/10.1155/2021/4685644
Zhu X, Ma C (2011) The study of combined invariants optimization method on aircraft recognition. In: 2011 Symposium on Photonics and Optoelectronics (SOPO). https://doi.org/10.1109/SOPO.2011.5780562, pp 1–4
Acknowledgements
The authors also gratefully acknowledge the insightful comments and suggestions of the reviewers, which have improved the presentation. The authors would like to thank Mr. Feiyao Xu for participated in writing or technical editing of the manuscript.
Funding
The authors did not receive support from any organization for the submitted work. The authors have no relevant financial or non-financial interests to disclose.
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of Interests
The authors declare that they have no conflict of interest.
Additional information
Publisher’s note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
About this article
Cite this article
Zhang, Y., Song, C. & Zhang, D. Small-scale aircraft detection in remote sensing images based on Faster-RCNN. Multimed Tools Appl 81, 18091–18103 (2022). https://doi.org/10.1007/s11042-022-12609-5
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11042-022-12609-5