Skip to main content
Log in

Automatic object detection in aerial image using bent identity-convolutional neural network and fine tuning algorithm

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

In RS (Remote Sensing) imaging, one of the most important tasks is Object Detection (OD) which aims to extract the object location and class information. Object detection is termed the highly advanced approach, which is helpful in RS image analysis, scene and image content understanding. In recent years, the researchers have made extensive efforts to present an effective OD model with the limitation of detecting specific classes such as roads and buildings in the RS images. For capturing aerial images, satellite sensors are utilized, and these images are affected by various factors like background noise, variation of viewpoint and interferences etc. To provide accurate detection of objects, an effective method is needed in analysing aerial images. However, this work presents an automatic OD using BI-CNN_FTA (Bent Identity-Convolutional Neural Network Fine-Tuning Algorithm) to detect more object classes relating to urbanization applications. The overall framework of proposed method is divided into five phases-Pre-processing, Object Localization, Segmentation, Feature Extraction and Classification. The input image is pre-processed by Advanced Dual Domain (ADD) filtering. Next, the objects are localized with the help of Grid Guided Localization (GGL) which draw an imaginary box around each object in an image. The segmentation process is done with the help of an improved tree Markov (ITM) random field model. Further, various features like texture, geometric, color descriptors and Zernike moments are extracted. Finally, the object classification is performed by BI-CNN_FTA. During this stage, each object class are categorized based on urbanization applications and learning loss is optimized. The proposed method is executed in the PHYTON platform. It has obtained improved performance in terms of Mean Average Precision (mAP = 85.5%), F1-score (83.78%), Detection rate (97%) and Precision-Recall curve when compared with other deep learning networks on the DOTA dataset.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

References

  1. Alganci U, Soydas M, Sertel E (2020) Comparative research on deep learning approaches for airplane detection from very high-resolution satellite images. Remote Sens 12(3):458

    Article  Google Scholar 

  2. Avtar R, Komolafe AA, Kouser A, Singh D, Yunus AP, Dou J, Kumar P (2020) Assessing sustainable development prospects through remote sensing: a review. Remote Sens Appl: Soc Environ 100402:100402

    Google Scholar 

  3. Ayush K, Uzkent B, Burke M, Lobell D and Ermon S (2020) Generating interpretable poverty maps using object detection in satellite images arXiv preprint arXiv: 2002.01612.

  4. Azimi SM, Vig E, Bahmanyar R, Körner M and Reinartz P (2018) Towards multi-class object detection in unconstrained remote sensing imagery. In Asian Conference on Computer Vision, Springer, Cham 150-165.

  5. Bosquet B, Mucientes M, Brea VM (2021) STDnet-ST: Spatio-temporal ConvNet for small object detection. Pattern Recogn 107929:107929

    Article  Google Scholar 

  6. Campbell JB and Wynne RH (2011) Introduction to remote sensing. Guilford Press.

  7. Chen R, Li X, Li J (2018) Object-based features for house detection from RGB high-resolution images. Remote Sens 10(3):451

    Article  Google Scholar 

  8. Chen S, Zhan R, Zhang J (2018) Geospatial object detection in remote sensing imagery based on multiscale single-shot detector with activated semantics. Remote Sens 10(6):820

    Article  Google Scholar 

  9. Chen C, Gong W, Chen Y, Li W (2019) Object detection in remote sensing images based on a scene-contextual feature pyramid network. Remote Sens 11(3):339

    Article  Google Scholar 

  10. DOTA Dataset: https://captain-whu.github.io/DOTA/dataset.html

  11. Fan Q, Hou FH, Shi F (2020) Bent identity-based CNN for image Denoising. J Appl Sci Eng 23(3):547–554

    Google Scholar 

  12. Grinias I, Panagiotakis C, Tziritas G (2016) MRF-based segmentation and unsupervised classification for building and road detection in peri-urban areas of high-resolution satellite images. ISPRS J Photogramm Remote Sens 122:145–166

    Article  Google Scholar 

  13. Guo W, Yang W, Zhang H, Hua G (2018) Geospatial object detection in high resolution satellite images based on multi-scale convolutional neural network. Remote Sens 10(1):131

    Article  Google Scholar 

  14. Khoshelham K, Nardinocchi C, Frontoni E, Mancini A, Zingaretti P (2010) Performance evaluation of automated approaches to building detection in multi-source aerial data. ISPRS J Photogramm Remote Sens 65(1):123–133

    Article  Google Scholar 

  15. Li K, Cheng G, Bu S, You X (2017) Rotation-insensitive and context-augmented object detection in remote sensing images. IEEE Trans Geosci Remote Sens 56(4):2337–2348

    Article  Google Scholar 

  16. Li Y, Zhang Y, Huang X, Yuille AL (2018) Deep networks under scene-level supervision for multi-class geospatial object detection from remote sensing images. ISPRS J Photogramm Remote Sens 146:182–196

    Article  Google Scholar 

  17. Li X, Li Z, Feng R, Luo S, Zhang C, Jiang M, Shen H (2020) Generating high-quality and high-resolution seamless satellite imagery for large-scale urban regions. Remote Sens 12(1):81

    Article  Google Scholar 

  18. Li C, Yang T, Zhu S, Chen C and Guan S (2020) Density map guided object detection in aerial images. In proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops 190-191.

  19. Long Y, Gong Y, Xiao Z, Liu Q (2017) Accurate object localization in remote sensing images based on convolutional neural networks. IEEE Trans Geosci Remote Sens 55(5):2486–2498

    Article  Google Scholar 

  20. Ma L, Liu Y, Zhang X, Ye Y, Yin G, Johnson BA (2019) Deep learning in remote sensing applications: a meta-analysis and review. ISPRS J Photogramm Remote Sens 152:166–177

    Article  Google Scholar 

  21. Mandal M, Shah M, Meena P and Vipparthi SK (2019) Sssdet: simple short and shallow network for resource efficient vehicle detection in aerial scenes. In 2019 IEEE international conference on image processing (ICIP), 3098-3102.

  22. Mo N, Yan L (2020) Improved faster RCNN based on feature amplification and oversampling data augmentation for oriented vehicle detection in aerial images. Remote Sens 12(16):2558

    Article  Google Scholar 

  23. Pi Y, Nath ND, Behzadan AH (2020) Convolutional neural networks for object detection in aerial imagery for disaster response and recovery. Adv Eng Inform 43:101009

    Article  Google Scholar 

  24. Rasti B, Ghamisi P (2020) Remote sensing image classification using subspace sensor fusion. Inform Fusion 64:121–130

    Article  Google Scholar 

  25. Samanta S, Pal DK, Palsamanta B (2018) Flood susceptibility analysis through remote sensing, GIS and frequency ratio model. Appl Water Sci 8(2):66

    Article  Google Scholar 

  26. Ševo I, Avramović A (2016) Convolutional neural network based automatic object detection on aerial images. IEEE Geosci Remote Sens Lett 13(5):740–744

    Article  Google Scholar 

  27. Song Y, Gao L, Li X, Shen W (2020) A novel robotic grasp detection method based on region proposal networks. Robot Comput Integr Manuf 65:101963

    Article  Google Scholar 

  28. Twumasi NYD, Shao Z and Orhan A (2019) Remote sensing and GIS methods in urban disaster monitoring and management–an overview.

  29. Wellmann T, Lausch A, Andersson E, Knapp S, Cortinovis C, Jache J, Scheuer S (2020) Remote sensing in urban planning: contributions towards ecologically sound policies? Landsc Urban Plan 204:103921

    Article  Google Scholar 

  30. Xia G-S, Bai X, Ding J, Zhu Z, Belongie S, Luo J, Datcu M, Pelillo M and Zhang L (2018) DOTA: A large-scale dataset for object detection in aerial images. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3974–3983.

  31. Yan S, Shi K, Li Y, Liu J, Zhao H (2020) Integration of satellite remote sensing data in underground coal fire detection: a case study of the Fukang region, Xinjiang, China. Front Earth Sci 14(1):1–12

    Article  Google Scholar 

  32. Yang F, Fan H, Chu P, Blasch E and Ling H (2019) Clustered object detection in aerial images, in proceedings of the IEEE/CVF international conference on computer vision, 8311-8320.

  33. Yi Y, Zhang Z, Zhang W, Zhang C, Li W, Zhao T (2019) Semantic segmentation of urban buildings from VHR remote sensing imagery using a deep convolutional neural network. Remote Sens 11(15):1774

    Article  Google Scholar 

  34. Zhang C, Yue P, Tapete D, Shangguan B, Wang M, Wu ZA (2020) Multi-level context-guided classification method with object-based convolutional neural network for land cover classification using very high resolution remote sensing images. Int J Appl Earth Observ Geoinform 88:102086

    Article  Google Scholar 

  35. Zhang X, Han L, Han L, Zhu L (2020) How well do deep learning-based methods for land cover classification and object detection perform on high resolution remote sensing imagery? Remote Sens 12(3):417

    Article  Google Scholar 

  36. Zhong Y, Ma A, soon Ong Y, Zhu U and Zhang L (2018) Computational intelligence in optical remote sensing image processing. Appl Soft Comput 64: 75–93.

  37. Zhou L, Liu J, Chen L (2020) Vehicle detection based on remote sensing image of Yolov3. in 2020 IEEE 4th information technology, networking, electronic and automation control conference (ITNEC). IEEE 1:468–472

    Google Scholar 

Download references

Data availability statement

Data sharing is not applicable to this article as no new data were created or analyzed in this study.

Funding

No funding is provided for the preparation of manuscript.

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Lalitha V.P..

Ethics declarations

Ethical approval

This article does not contain any studies with human participants or animals performed by any of the authors.

Conflict of interest

Authors Lalitha V. P and Dr. Shanta Rangaswamy declared that they have no conflict of interest.

Additional information

Publisher’s note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Lalitha V.P., Rangaswamy, S. Automatic object detection in aerial image using bent identity-convolutional neural network and fine tuning algorithm. Multimed Tools Appl 81, 9713–9740 (2022). https://doi.org/10.1007/s11042-022-11948-7

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s11042-022-11948-7

Keywords

Navigation