Skip to main content

Small Object Detection Algorithm Combining Coordinate Attention Mechanism and P2-BiFPN Structure

  • Conference paper
  • First Online:
Proceedings of the 13th International Conference on Computer Engineering and Networks (CENet 2023)

Part of the book series: Lecture Notes in Electrical Engineering ((LNEE,volume 1125))

Included in the following conference series:

  • 199 Accesses

Abstract

Aiming at the difficulty of small object detection, a small object detection model combining coordinated attention mechanism and P2-BiFPN (P2 Bidirectional Feature Pyramid Network) structure is constructed based on YOLOv5. Firstly, we introduce the coordinated attention mechanism into the residual units of the backbone network to achieve more accurate localization of small objects. Secondly, to reduce the number of model parameters, we decompose the square convolution in the residual unit into parallel asymmetric convolutions. Then, the P2-BiFPN feature fusion network was constructed to enrich the information of small objects, so as to improve the small objects detection accuracy. Finally, we train and test the model on the WiderPerson dataset. The experimental results shows that compared with YOLOv5, our small object detection model has a 1.7% improvement in mAP and a 5.66 m reduction in the amount of parameters, with better detection performance for small-object pedestrians.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 229.00
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Hardcover Book
USD 299.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Evo, I., Avramovi, A.: Convolutional neural network based automatic object detection on aerial images. IEEE Geosci. Rem. Sens. Lett. 13(5), 740–744 (2017)

    Google Scholar 

  2. Erdelj, M., Natalizio, E., Chowdhury, K.R., et al.: Help from the sky: leveraging UAVs for disaster management. IEEE Perv. Comput. 16(1), 24–32 (2017)

    Article  Google Scholar 

  3. Joy, A., Jayanthi, V.S., Baskar, D.: Automatic object detection in car-driving sequence using neural network and optical flow analysis. In: IEEE International Conference on Computational Intelligence and Computing Research. IEEE (2014)

    Google Scholar 

  4. Song, Z., Zhang, Y., Liu, Y., et al.: MSFYOLO: feature fusion-based detection for small objects. IEEE Lat. Am. Trans. 20(5), 823–830 (2022)

    Article  Google Scholar 

  5. Liu, Z., Li, D., Ge, S.S., et al.: Small traffic sign detection from large image. Appl. Intell. 50, 1–13 (2020)

    Article  Google Scholar 

  6. Guo, L., Wang, Q., Xue, W., et al.: A small object detection algorithm based on improved YOLOv5. Univ. Electron. Sci. Technol. J. 51(02), 251–258 (2022)

    Google Scholar 

  7. Liu, Y., Yang, F., Hu, P.: Small-object detection in UAV-captured images via multi-branch parallel feature pyramid networks. IEEE Access 8, 145740–145750 (2020)

    Article  Google Scholar 

  8. Yi, H., Song, W., Huang, J.: UAV small target detection based on improved YOLOv5. Electromech. Eng. Technol. 52(02), 139–144 (2023)

    Google Scholar 

  9. Zhang, T., Chen, E., Xiao, W., et al.: Fast target detection method for improving MobileNet_YOLOv3 network. Minicomp. Syst. 42(5), 1008–1014 (2021)

    Google Scholar 

  10. Sun, B., Zuo, Z., Wu, P., et al.: Object detection for environment perception of unmanned surface vehicles based on the improved SSD. J. Instrument. 42(09), 52–61 (2021)

    Google Scholar 

  11. Zhao, P., Xie, L., Peng, L.: Deep small object detection algorithm integrating attention mechanism [J/OL]. Comp. Sci. Explor. (2021)

    Google Scholar 

  12. Hou, S., Wang, Z., Dong, Z., et al.: Self-supervised recalibration network for person re-identification. Defence Technology (2023)

    Google Scholar 

  13. Wei, W.A.N.G., Xiaogang, W.A.N.: A small target detection method combining attention mechanism and feature fusion. J. Xi’an Eng. Univ. 36(06), 115–123 (2022)

    Google Scholar 

  14. Lin, T.Y., Dollár, P., Girshick, R., et al.: Feature pyramid networks for object detection. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 2117–2125 (2017)

    Google Scholar 

  15. Liu, S., Qi, L., Qin, H., et al.: Path aggregation network for instance segmentation. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 8759–8768 (2018)

    Google Scholar 

  16. Hou, S., Yin, K., Liang, J., et al.: Gradient-supervised person re-identification based on dense feature pyramid network. Complex Intell. Syst. 8(6), 5329–5342

    Google Scholar 

  17. Ghiasi, G., Lin, T.Y., Le, Q.V.: NAS-FPN: learning scalable feature pyramid architecture for object detection. In: 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR). IEEE (2019)

    Google Scholar 

  18. Tan, M., Pang, R., Le, Q.V.: Efficientdet: scalable and efficient object detection[C]. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 10781–10790 (2020)

    Google Scholar 

  19. Hou, Q., Zhou, D., Feng, J.: Coordinate attention for efficient mobile network design. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 13713–13722 (2021)

    Google Scholar 

  20. Paszke, A., Chaurasia, A., Kim, S., et al.: Enet: a deep neural network architecture for realtime semantic segmentation. arXiv preprint arXiv:1606.02147 (2016)

  21. Carion, N., Massa, F., Synnaeve, G., et al.: End-to-end object detection with transformers. In: Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part I 16. Springer International Publishing, pp. 213–229 (2020)

    Google Scholar 

  22. Liu, W., Anguelov, D., Erhan, D., et al.: SSD: single shot multibox detector. In: Proceedings of the 14th European Conference on Computer Vision—ECCV 2016, Amsterdam, The Netherlands, October 11–14, 2016, Part I 14. Springer International Publishing, pp. 21–37 (2016)

    Google Scholar 

  23. Ren, S., He, K., Girshick, R., et al.: Faster R-CNN: towards real-time object detection with region proposal networks. IEEE Trans. Patt. Anal. Mach. Intell. 39(6), 1137–1149 (2017)

    Article  Google Scholar 

  24. Ge, Z., Liu, S., Wang, F., et al.: Yolox: Exceeding yolo series in 2021. arXiv preprint arXiv:2107.08430 (2021)

  25. Cai, Z., Vasconcelos, N.: Cascade R-CNN: Delving into high quality object detection. In: Proceedings of the IEEE conference on computer vision and pattern recognition, pp. 6154–6162 (2018)

    Google Scholar 

Download references

Acknowledgements

This work was supported by the Shenzhen Science and Technology Program (No. JSGG20220301090405009).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Yin Guangqiang .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2024 The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd.

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Juanjuan, Z., Xiaohan, H., Zebang, Q., Guangqiang, Y. (2024). Small Object Detection Algorithm Combining Coordinate Attention Mechanism and P2-BiFPN Structure. In: Zhang, Y., Qi, L., Liu, Q., Yin, G., Liu, X. (eds) Proceedings of the 13th International Conference on Computer Engineering and Networks. CENet 2023. Lecture Notes in Electrical Engineering, vol 1125. Springer, Singapore. https://doi.org/10.1007/978-981-99-9239-3_27

Download citation

  • DOI: https://doi.org/10.1007/978-981-99-9239-3_27

  • Published:

  • Publisher Name: Springer, Singapore

  • Print ISBN: 978-981-99-9238-6

  • Online ISBN: 978-981-99-9239-3

  • eBook Packages: EngineeringEngineering (R0)

Publish with us

Policies and ethics