Skip to main content
Log in

The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

  • Published:
Multimedia Tools and Applications Aims and scope Submit manuscript

Abstract

Addressing the challenges of high model complexity, low generalization capability, and suboptimal detection performance in most algorithms for crop leaf disease detection, the paper propose a lightweight enhanced YOLOv8 algorithm. First, by incorporating the advantages of GhostNet's feature redundancy reduction and MobileNet's ability to fuse diverse channel features using the concept of Group Convolution, the paper enhance the backbone network. This improves the network's ability to extract critical features from a multitude of similar redundant feature maps. Second, to improve detection accuracy while reducing model parameters and computational load, the paper introduce the slim-Neck module. Finally, addressing the issue where detection head parameters and computations account for over half of the model, the paper restructure the Head using the concept of shared parameters and integrate convolution blocks that enhance multi-scale information recognition. Results from multiple experiments, averaged for consistency, indicate that compared to the original YOLOv8 algorithm, the improved algorithm achieves an increase in mAP50 from 86% to 87.3% and mAP50:95 from 67% to 68.6%. The model's size is a mere 5.45 MB, and the computational parameter GFLOPs is reduced from 8.1 to 5.5, even lower than the most lightweight YOLOv5. In comparison to other large-model algorithms, this model also demonstrates strong competitiveness in detection accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7

Similar content being viewed by others

Data availability

Data will be made available on reasonable request.

References

  1. Orchi H, Sadik M, Khaldoun M, Sabir E (2023) Real-time detection of crop leaf diseases using enhanced YOLOv8 algorithm. In: 2023 International Wireless Communications and Mobile Computing (IWCMC). Marrakesh, Morocco 1690–1696

    Google Scholar 

  2. Terven J, Cordova-Esparza D (2023) A comprehensive review of YOLO: From YOLOv1 to YOLOv8 and beyond. arXiv preprint arXiv:2304.00501

  3. Bhosale YH, Zanwar SR, Ali SS, Vaidya NS, Auti RA, Patil DH (2023) Multi-plant and multi-crop leaf disease detection and classification using deep neural networks, machine learning, image processing with precision agriculture - A review. In: 2023 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, pp 1–7. https://doi.org/10.1109/ICCCI56745.2023.10128246

  4. Li Y, Fan Q, Huang H, Han Z, Gu Q (2023) a modified yolov8 detection network for UAV aerial image recognition. Drones 7(5):304

    Article  Google Scholar 

  5. Liu J, Wang X (2020) Tomato diseases and pests detection based on improved yolo v3 convolutional neural network. Front Plant Sci 11:898

    Article  Google Scholar 

  6. Li H, Li C, Li G, Chen L (2021) A real-time table grape detection method based on improved yolov4-tiny network in complex back-ground. Biosys Eng 212:347–359

    Article  Google Scholar 

  7. Mathew MP, Mahesh TY (2022) Leaf-based disease detection in bell pepper plant using yolo v5. Signal Image Video Process 16(3):841–847. https://doi.org/10.1007/s11760-021-02024-y

    Article  Google Scholar 

  8. Jia M, Tang L, Chen BC, Cardie C, Belongie S, Hariharan B, Lim SN (2022) Visual prompt tuning. In: European conference on computer vision, vol 13693. LNCS, pp 709–727. https://doi.org/10.1007/978-3-031-19827-4_41

    Chapter  Google Scholar 

  9. Han C, Wang Q, Cui Y, Cao Z, Wang W, Qi S, Liu D (2023) E2VPT: An effective and efficient approach for visual prompt tuning. arXiv preprint arXiv:2307.13770

  10. Yan L, Han C, Xu Z, Liu D, Wang Q (2023) Prompt learns prompt: exploring knowledge-aware generative prompt collaboration for video captioning. In: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI). International Joint Conferences on Artificial Intelligence Organization, vol 180. https://doi.org/10.24963/ijcai

    Chapter  Google Scholar 

  11. Wang X, Kan M, Shan S, Chen X (2019) Fully learnable group convolution for acceleration of deep neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9041–9050. https://doi.org/10.1109/CVPR.2019.00926

    Chapter  Google Scholar 

  12. Zhang T, Qi GJ, Xiao B, Wang J (2017) Interleaved group convolutions. In: Proceedings of the IEEE international conference on computer vision, pp 4383–4392. https://doi.org/10.1109/ICCV.2017.469

    Chapter  Google Scholar 

  13. Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: More features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1577–1586. https://doi.org/10.1109/CVPR42600.2020.00165

    Chapter  Google Scholar 

  14. Liu D, Liang J, Geng T, Loui A, Zhou T (2023) Tripartite feature enhanced pyramid network for dense prediction. IEEE Trans Image Processing 32:2678–2692. https://doi.org/10.1109/TIP.2023.3272826

    Article  Google Scholar 

  15. Liu D, Cui Y, Yan L, Mousas C, Yang B, Chen Y (2021) Densernet: Weakly supervised visual localization using multi-scale feature aggregation. In Proceedings of the AAAI Conference on Artificial Intelligence 35(7):6101–6109

    Google Scholar 

  16. Liu D, Cui Y, Tan W, Chen Y (2021) Sg-net: Spatial granularity network for one-stage video instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 9811–9820. https://doi.org/10.1109/CVPR46437.2021.00969

    Chapter  Google Scholar 

  17. Li H, Li J, Wei H, Liu Z, Zhan Z, Ren Q (2022) Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv preprint arXiv:2206.02424

  18. Cohen T, Welling M (2016) Group equivariant convolutional networks. In International conference on machine learning ICML 6:4375–4386

    Google Scholar 

  19. Dai X, Chen Y, Xiao B, Chen D, Liu M, Yuan L, Zhang L (2021) Dynamic head: Unifying object detection heads with attentions. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7369–7378. https://doi.org/10.1109/CVPR46437.2021.00729

    Chapter  Google Scholar 

  20. Moehrs S, Del Guerra A, Herbert DJ, Mandelkern MA (2006) A detector head design for small-animal PET with silicon photomultipliers (SiPM). Phys Med Biol 51(5):1113

    Article  Google Scholar 

  21. Albert PS, Follmann DA (2008) Shared-parameter models. In: Longitudinal data analysis. Chapman and Hall/CR, pp 447–466. https://doi.org/10.1201/9781420011579.CH19

    Chapter  Google Scholar 

  22. Aboah A, Wang B, Bagci U, Adu-Gyamfi Y (2023) Real-time multi-class helmet violation detection using few-shot data sampling technique and yolov8. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 5350–5358. https://doi.org/10.1109/CVPRW59228.2023.00564

    Chapter  Google Scholar 

  23. Talaat FM, ZainEldin H (2023) An improved fire detection approach based on YOLO-v8 for smart cities. Neural Comput Applications 35(28):20939–20954. https://doi.org/10.1007/s00521-023-08809-1

    Article  Google Scholar 

  24. Lou H, Duan X, Guo J, Liu H, Gu J, Bi L, Chen H (2023) DC-YOLOv8: Small-size object detection algorithm based on camera sensor. Electronics 12(10):2323

    Article  Google Scholar 

  25. Hussain M (2023) YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection. Machines 11(7):677

    Article  Google Scholar 

  26. Wang W, Cheng H, Zhou T et al (2023) Visual recognition with deep nearest centroids. arXiv:2209.07383 [cs.CV]

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Ming Li.

Ethics declarations

Conflict of interest

The authors declare that they do not have any conflicts of interest that influence the work reported in this paper.

Ethical approval

No animals were involved in this study. All applicable international, national, and/or institutional guidelines for the care and use of animals were followed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Wen, G., Li, M., Luo, Y. et al. The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-023-17957-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s11042-023-17957-4

Keywords

Navigation