Abstract
Object detection has made immense improvements in natural images during the last decade but not so much in aerial images. Detection of miniature objects in aerial images remains challenging as they contain only a few pixels and extremely large input sizes. Moreover, tiny objects are easily fooled by the backstory and increase the difficulty of accurate detection. Many algorithms are used for object detection purposes, and YOLOR is one of them. YOLOR “You Only Learn One Representation” is a one-stage detector. It is specially made for object detection, whereas other algorithms include object classification or analysis. In CNN, only, one task is carried out at a time, whereas YOLOR is a unified model useful for multitasking purposes. In this paper, we discussed tiny object detection in aerial images using YOLOR. Based on our research, we found that the AI-TOD dataset contains object instances in eight categories, with 86% of the objects being smaller than 16 pixels in size. The AI-TOD can be used to assess the performance of a variety of small objects. The mean size of objects is approximately 12.8 pixels, which is considerably smaller than the other datasets.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Yi J, Wu P, Liu B, Huang Q, Qu H, Metaxas AD (2021) Oriented object detection in aerial images with box boundary-aware vectors. In: Proceedings of the IEEE/CVF winter conference on applications of computer vision, USA
Yang F, Fan H, Chu P, Blasch E, Ling H (2019) Clustered object detection in aerial images. In: Proceedings of the IEEE/CVF international conference on computer vision, USA
Wang CY, Yeh IH, Liao HYM (2021) You only learn one representation: unified network for multiple tasks. In arXiv preprint arXiv:2105.04206, Taiwan
Tang Y, Gu B, Jiang B (2021) Research on feature enhancement for small object detection. J Phys: Conf Series
Ming Q, Miao L, Zhou Z, Song J, Yang X (2021) Sparse label assignment for oriented object detection in aerial images. In Remote sensing, China
Pang J, Li C, Shi J, Xu Z, Feng H (2019) CNN-fast tiny object detection in large-scale remote sensing images. In: IEEE Trans Geosci Remote Sens
Girshick R (2015) Fast r-cnn. In: Proceedings of the IEEE international conference on computer vision
Ren, Shaoqing, He K, Girshick R, Sun J (2016) Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans Pattern Anal Mach Intell 39:1137–1149
Dai, Jifeng, Li Y, He K, Sun J (2016) R-fcn: object detection via region-based fully convolutional networks. In: Advances in neural information processing systems
Liu W, Anguelov D, Erhan D, Szegedy C, Reed S, Fu CY, Berg AC (2016) Ssd: single shot multibox detector. In European conference on computer vision, USA
Redmon, Joseph, Divvala S, Girshick R, Farhadi A (2016) You only look once: unified, real-time object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition
Redmon J, Farhadi, Ali (2018) Yolov3: an incremental improvement. arXiv:1804.02767
Wang, Chien-Yao, Bochkovskiy A, Liao HYM (2021) Scaled-yolov4: Scaling cross stage partial network. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, Taiwan
Wang J, Yang W, Guo H, Zhang R, Xia, Gui-Song (2021) Tiny object detection in aerial images. In: 2020 25th international conference on pattern recognition (ICPR), Wuhan, Hubei, China
Kong T, Sun F, Liu H, Jiang Y, Li L, Shi, Jianbo (2020) Foveabox: beyound anchor-based object detection. IEEE Trans Image Process 29:7389–7398
Li Z, Peng C, Yu G, Zhang X, Deng Y, Sun, Jian (2018) Detnet: a backbone network for object detection. arXiv:1804.06215
Han J, Ding J, Xue N, Xia, Gui-Song (2021) Redet: a rotation-equivariant detector for aerial object detection. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition
Liang, Dong, Wei Z, Zhang D, Geng Q, Zhang L, Sun H, Zhou H, Wei M, Gao P (2021) Learning calibrated-guidance for object detection in aerial images. arXiv:2103.11399
Adarsh P, Rathi P, Kumar, Manoj (2020) YOLO v3-tiny: object detection and recognition using one stage improved model. In: 2020 6th international conference on advanced computing and communication systems (ICACCS), Delhi
Zou Z, Shi Z, Guo Y, Ye, Jieping (2019) Object detection in 20 years: a survey. arXiv:1905.05055
Zhao, Zhong-Qiu P, Zheng, Xu ST, Wu X (2019) Object detection with deep learning: a review. IEEE Trans Neural Netw Learn Syst 30:3212–3232
Sajat MAS, Hashim H, Tahir, Md N (2020) Detection of human bodies in lying position based on aggregate channel features. In: 2020 16th IEEE international colloquium on signal processing and its applications (CSPA), Malaysia
Dalal N, Triggs, Bill (2005) Histograms of oriented gradients for human detection. In: 2005 IEEE computer society conference on computer vision and pattern recognition (CVPR'05), France
Viola P, Jones M et al (2001) Robust real-time object detection. Int J Comput Vision 4:4
Reese K, Zheng Y, Elmaghraby, Adel (2012) A comparison of face detection algorithms in visible and thermal spectrums. In: International conference on advances in computer science and application, USA
Pavel FA, Wang Z, Feng, Dagan D (2009) Reliable object recognition using sift features. In: 2009 IEEE international workshop on multimedia signal processing, Hong Kong, China
Bay H, Tuytelaars T, Gool V, Luc (2006) Surf: speeded up robust features. In: European conference on computer vision
Felzenszwalb P, McAllester D, Ramanan, Deva (2008) A discriminatively trained, multiscale, deformable part model. In: 2008 IEEE conference on computer vision and pattern recognition, Chicago
Schneider S, Taylor GW, Kremer, Stefan (2018) Deep learning object detection methods for ecological camera trap data. In: 2018 15th Conference on computer and robot vision (CRV)
Li K, Cao, Lu (2020) A review of object detection techniques. In: 2020 5th international conference on electromechanical control technology and transportation (ICECTT), China
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Pawar, N. et al. (2023). Miniscule Object Detection in Aerial Images Using YOLOR: A Review. In: Kumar, S., Hiranwal, S., Purohit, S.D., Prasad, M. (eds) Proceedings of International Conference on Communication and Computational Technologies . Algorithms for Intelligent Systems. Springer, Singapore. https://doi.org/10.1007/978-981-19-3951-8_52
Download citation
DOI: https://doi.org/10.1007/978-981-19-3951-8_52
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-19-3950-1
Online ISBN: 978-981-19-3951-8
eBook Packages: Intelligent Technologies and RoboticsIntelligent Technologies and Robotics (R0)