Skip to main content

Miniscule Object Detection in Aerial Images Using YOLOR: A Review

  • Conference paper
  • First Online:
Proceedings of International Conference on Communication and Computational Technologies

Abstract

Object detection has made immense improvements in natural images during the last decade but not so much in aerial images. Detection of miniature objects in aerial images remains challenging as they contain only a few pixels and extremely large input sizes. Moreover, tiny objects are easily fooled by the backstory and increase the difficulty of accurate detection. Many algorithms are used for object detection purposes, and YOLOR is one of them. YOLOR “You Only Learn One Representation” is a one-stage detector. It is specially made for object detection, whereas other algorithms include object classification or analysis. In CNN, only, one task is carried out at a time, whereas YOLOR is a unified model useful for multitasking purposes. In this paper, we discussed tiny object detection in aerial images using YOLOR. Based on our research, we found that the AI-TOD dataset contains object instances in eight categories, with 86% of the objects being smaller than 16 pixels in size. The AI-TOD can be used to assess the performance of a variety of small objects. The mean size of objects is approximately 12.8 pixels, which is considerably smaller than the other datasets.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 259.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 329.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book
USD 329.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Yi J, Wu P, Liu B, Huang Q, Qu H, Metaxas AD (2021) Oriented object detection in aerial images with box boundary-aware vectors. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, USA

    Google Scholar 

  2. Yang F, Fan H, Chu P, Blasch E, Ling H (2019) Clustered object detection in aerial images. In: Proceedings of the IEEE/CVF international conference on computer vision, USA

    Google Scholar 

  3. Wang CY, Yeh IH, Liao HYM (2021) You only learn one representation: unified network for multiple tasks. In arXiv preprint arXiv:2105.04206, Taiwan

  4. Tang Y, Gu B, Jiang B (2021) Research on feature enhancement for small object detection. J Phys: Conf Series

    Google Scholar 

  5. Ming Q, Miao L, Zhou Z, Song J, Yang X (2021) Sparse label assignment for oriented object detection in aerial images. In Remote sensing, China

    Google Scholar 

  6. Pang J, Li C, Shi J, Xu Z, Feng H (2019) CNN-fast tiny object detection in large-scale remote sensing images. In: IEEE Trans Geosci Remote Sens

    Google Scholar 

  7. Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision

    Google Scholar 

  8. Ren, Shaoqing, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137–1149

    Google Scholar 

  9. Dai, Jifeng, Li Y, He K, Sun J (2016) R-fcn: object detection via region-based fully convolutional networks. In: Advances in neural information processing systems

    Google Scholar 

  10. Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: single shot multibox detector. In European conference on computer vision, USA

    Google Scholar 

  11. Redmon, Joseph, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition

    Google Scholar 

  12. Redmon J, Farhadi, Ali (2018) Yolov3: an incremental improvement. arXiv:1804.02767

  13. Wang, Chien-Yao, Bochkovskiy A, Liao HYM (2021) Scaled-yolov4: Scaling cross stage partial network. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Taiwan

    Google Scholar 

  14. Wang J, Yang W, Guo H, Zhang R, Xia, Gui-Song (2021) Tiny object detection in aerial images. In: 2020 25th international conference on pattern recognition (ICPR), Wuhan, Hubei, China

    Google Scholar 

  15. Kong T, Sun F, Liu H, Jiang Y, Li L, Shi, Jianbo (2020) Foveabox: beyound anchor-based object detection. IEEE Trans Image Process 29:7389–7398

    Google Scholar 

  16. Li Z, Peng C, Yu G, Zhang X, Deng Y, Sun, Jian (2018) Detnet: a backbone network for object detection. arXiv:1804.06215

  17. Han J, Ding J, Xue N, Xia, Gui-Song (2021) Redet: a rotation-equivariant detector for aerial object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition

    Google Scholar 

  18. Liang, Dong, Wei Z, Zhang D, Geng Q, Zhang L, Sun H, Zhou H, Wei M, Gao P (2021) Learning calibrated-guidance for object detection in aerial images. arXiv:2103.11399

  19. Adarsh P, Rathi P, Kumar, Manoj (2020) YOLO v3-tiny: object detection and recognition using one stage improved model. In: 2020 6th international conference on advanced computing and communication systems (ICACCS), Delhi

    Google Scholar 

  20. Zou Z, Shi Z, Guo Y, Ye, Jieping (2019) Object detection in 20 years: a survey. arXiv:1905.05055

  21. Zhao, Zhong-Qiu P, Zheng, Xu ST, Wu X (2019) Object detection with deep learning: a review. IEEE Trans Neural Netw Learn Syst 30:3212–3232

    Google Scholar 

  22. Sajat MAS, Hashim H, Tahir, Md N (2020) Detection of human bodies in lying position based on aggregate channel features. In: 2020 16th IEEE international colloquium on signal processing and its applications (CSPA), Malaysia

    Google Scholar 

  23. Dalal N, Triggs, Bill (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), France

    Google Scholar 

  24. Viola P, Jones M et al (2001) Robust real-time object detection. Int J Comput Vision 4:4

    Google Scholar 

  25. Reese K, Zheng Y, Elmaghraby, Adel (2012) A comparison of face detection algorithms in visible and thermal spectrums. In: International conference on advances in computer science and application, USA

    Google Scholar 

  26. Pavel FA, Wang Z, Feng, Dagan D (2009) Reliable object recognition using sift features. In: 2009 IEEE international workshop on multimedia signal processing, Hong Kong, China

    Google Scholar 

  27. Bay H, Tuytelaars T, Gool V, Luc (2006) Surf: speeded up robust features. In: European conference on computer vision

    Google Scholar 

  28. Felzenszwalb P, McAllester D, Ramanan, Deva (2008) A discriminatively trained, multiscale, deformable part model. In: 2008 IEEE conference on computer vision and pattern recognition, Chicago

    Google Scholar 

  29. Schneider S, Taylor GW, Kremer, Stefan (2018) Deep learning object detection methods for ecological camera trap data. In: 2018 15th Conference on computer and robot vision (CRV)

    Google Scholar 

  30. Li K, Cao, Lu (2020) A review of object detection techniques. In: 2020 5th international conference on electromechanical control technology and transportation (ICECTT), China

    Google Scholar 

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Neha Pawar .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Pawar, N. et al. (2023). Miniscule Object Detection in Aerial Images Using YOLOR: A Review. In: Kumar, S., Hiranwal, S., Purohit, S.D., Prasad, M. (eds) Proceedings of International Conference on Communication and Computational Technologies . Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-3951-8_52

Download citation

Publish with us

Policies and ethics