Abstract
Aiming at the difficulty of small object detection, a small object detection model combining coordinated attention mechanism and P2-BiFPN (P2 Bidirectional Feature Pyramid Network) structure is constructed based on YOLOv5. Firstly, we introduce the coordinated attention mechanism into the residual units of the backbone network to achieve more accurate localization of small objects. Secondly, to reduce the number of model parameters, we decompose the square convolution in the residual unit into parallel asymmetric convolutions. Then, the P2-BiFPN feature fusion network was constructed to enrich the information of small objects, so as to improve the small objects detection accuracy. Finally, we train and test the model on the WiderPerson dataset. The experimental results shows that compared with YOLOv5, our small object detection model has a 1.7% improvement in mAP and a 5.66 m reduction in the amount of parameters, with better detection performance for small-object pedestrians.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Evo, I., Avramovi, A.: Convolutional neural network based automatic object detection on aerial images. IEEE Geosci. Rem. Sens. Lett. 13(5), 740–744 (2017)
Erdelj, M., Natalizio, E., Chowdhury, K.R., et al.: Help from the sky: leveraging UAVs for disaster management. IEEE Perv. Comput. 16(1), 24–32 (2017)
Joy, A., Jayanthi, V.S., Baskar, D.: Automatic object detection in car-driving sequence using neural network and optical flow analysis. In: IEEE International Conference on Computational Intelligence and Computing Research. IEEE (2014)
Song, Z., Zhang, Y., Liu, Y., et al.: MSFYOLO: feature fusion-based detection for small objects. IEEE Lat. Am. Trans. 20(5), 823–830 (2022)
Liu, Z., Li, D., Ge, S.S., et al.: Small traffic sign detection from large image. Appl. Intell. 50, 1–13 (2020)
Guo, L., Wang, Q., Xue, W., et al.: A small object detection algorithm based on improved YOLOv5. Univ. Electron. Sci. Technol. J. 51(02), 251–258 (2022)
Liu, Y., Yang, F., Hu, P.: Small-object detection in UAV-captured images via multi-branch parallel feature pyramid networks. IEEE Access 8, 145740–145750 (2020)
Yi, H., Song, W., Huang, J.: UAV small target detection based on improved YOLOv5. Electromech. Eng. Technol. 52(02), 139–144 (2023)
Zhang, T., Chen, E., Xiao, W., et al.: Fast target detection method for improving MobileNet_YOLOv3 network. Minicomp. Syst. 42(5), 1008–1014 (2021)
Sun, B., Zuo, Z., Wu, P., et al.: Object detection for environment perception of unmanned surface vehicles based on the improved SSD. J. Instrument. 42(09), 52–61 (2021)
Zhao, P., Xie, L., Peng, L.: Deep small object detection algorithm integrating attention mechanism [J/OL]. Comp. Sci. Explor. (2021)
Hou, S., Wang, Z., Dong, Z., et al.: Self-supervised recalibration network for person re-identification. Defence Technology (2023)
Wei, W.A.N.G., Xiaogang, W.A.N.: A small target detection method combining attention mechanism and feature fusion. J. Xi’an Eng. Univ. 36(06), 115–123 (2022)
Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)
Liu, S., Qi, L., Qin, H., et al.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)
Hou, S., Yin, K., Liang, J., et al.: Gradient-supervised person re-identification based on dense feature pyramid network. Complex Intell. Syst. 8(6), 5329–5342
Ghiasi, G., Lin, T.Y., Le, Q.V.: NAS-FPN: learning scalable feature pyramid architecture for object detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2019)
Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection[C]. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)
Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)
Paszke, A., Chaurasia, A., Kim, S., et al.: Enet: a deep neural network architecture for realtime semantic segmentation. arXiv preprint arXiv:1606.02147 (2016)
Carion, N., Massa, F., Synnaeve, G., et al.: End-to-end object detection with transformers. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16. Springer International Publishing, pp. 213–229 (2020)
Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot multibox detector. In: Proceedings of the 14th European Conference on Computer Vision—ECCV 2016, Amsterdam, The Netherlands, October 11–14, 2016, Part I 14. Springer International Publishing, pp. 21–37 (2016)
Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Patt. Anal. Mach. Intell. 39(6), 1137–1149 (2017)
Ge, Z., Liu, S., Wang, F., et al.: Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)
Cai, Z., Vasconcelos, N.: Cascade R-CNN: Delving into high quality object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6154–6162 (2018)
Acknowledgements
This work was supported by the Shenzhen Science and Technology Program (No. JSGG20220301090405009).
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.
About this paper
Cite this paper
Juanjuan, Z., Xiaohan, H., Zebang, Q., Guangqiang, Y. (2024). Small Object Detection Algorithm Combining Coordinate Attention Mechanism and P2-BiFPN Structure. In: Zhang, Y., Qi, L., Liu, Q., Yin, G., Liu, X. (eds) Proceedings of the 13th International Conference on Computer Engineering and Networks. CENet 2023. Lecture Notes in Electrical Engineering, vol 1125. Springer, Singapore. https://doi.org/10.1007/978-981-99-9239-3_27
Download citation
DOI: https://doi.org/10.1007/978-981-99-9239-3_27
Published:
Publisher Name: Springer, Singapore
Print ISBN: 978-981-99-9238-6
Online ISBN: 978-981-99-9239-3
eBook Packages: EngineeringEngineering (R0)