The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

Wen, Guihao; Li, Ming; Luo, Yonghang; Shi, Chaoshan; Tan, Yunfei

doi:10.1007/s11042-023-17957-4

The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

Published: 02 January 2024

(2024)
Cite this article

Multimedia Tools and Applications Aims and scope Submit manuscript

Guihao Wen¹,
Ming Li¹,
Yonghang Luo¹,
Chaoshan Shi¹ &
…
Yunfei Tan¹

1034 Accesses
3 Citations
1 Altmetric
Explore all metrics

Abstract

Addressing the challenges of high model complexity, low generalization capability, and suboptimal detection performance in most algorithms for crop leaf disease detection, the paper propose a lightweight enhanced YOLOv8 algorithm. First, by incorporating the advantages of GhostNet's feature redundancy reduction and MobileNet's ability to fuse diverse channel features using the concept of Group Convolution, the paper enhance the backbone network. This improves the network's ability to extract critical features from a multitude of similar redundant feature maps. Second, to improve detection accuracy while reducing model parameters and computational load, the paper introduce the slim-Neck module. Finally, addressing the issue where detection head parameters and computations account for over half of the model, the paper restructure the Head using the concept of shared parameters and integrate convolution blocks that enhance multi-scale information recognition. Results from multiple experiments, averaged for consistency, indicate that compared to the original YOLOv8 algorithm, the improved algorithm achieves an increase in mAP50 from 86% to 87.3% and mAP50:95 from 67% to 68.6%. The model's size is a mere 5.45 MB, and the computational parameter GFLOPs is reduced from 8.1 to 5.5, even lower than the most lightweight YOLOv5. In comparison to other large-model algorithms, this model also demonstrates strong competitiveness in detection accuracy.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

MobileNetV2-Incep-M: a hybrid lightweight model for the classification of rice plant diseases

Article 04 March 2024

A fast accurate fine-grain object detection model based on YOLOv4 deep neural network

Article 07 January 2022

Small-target weed-detection model based on YOLO-V4 with improved backbone and neck structures

Article 12 June 2023

Data availability

Data will be made available on reasonable request.

References

Orchi H, Sadik M, Khaldoun M, Sabir E (2023) Real-time detection of crop leaf diseases using enhanced YOLOv8 algorithm. In: 2023 International Wireless Communications and Mobile Computing (IWCMC). Marrakesh, Morocco 1690–1696
Google Scholar
Terven J, Cordova-Esparza D (2023) A comprehensive review of YOLO: From YOLOv1 to YOLOv8 and beyond. arXiv preprint arXiv:2304.00501
Bhosale YH, Zanwar SR, Ali SS, Vaidya NS, Auti RA, Patil DH (2023) Multi-plant and multi-crop leaf disease detection and classification using deep neural networks, machine learning, image processing with precision agriculture - A review. In: 2023 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, pp 1–7. https://doi.org/10.1109/ICCCI56745.2023.10128246
Li Y, Fan Q, Huang H, Han Z, Gu Q (2023) a modified yolov8 detection network for UAV aerial image recognition. Drones 7(5):304
Article Google Scholar
Liu J, Wang X (2020) Tomato diseases and pests detection based on improved yolo v3 convolutional neural network. Front Plant Sci 11:898
Article Google Scholar
Li H, Li C, Li G, Chen L (2021) A real-time table grape detection method based on improved yolov4-tiny network in complex back-ground. Biosys Eng 212:347–359
Article Google Scholar
Mathew MP, Mahesh TY (2022) Leaf-based disease detection in bell pepper plant using yolo v5. Signal Image Video Process 16(3):841–847. https://doi.org/10.1007/s11760-021-02024-y
Article Google Scholar
Jia M, Tang L, Chen BC, Cardie C, Belongie S, Hariharan B, Lim SN (2022) Visual prompt tuning. In: European conference on computer vision, vol 13693. LNCS, pp 709–727. https://doi.org/10.1007/978-3-031-19827-4_41
Chapter Google Scholar
Han C, Wang Q, Cui Y, Cao Z, Wang W, Qi S, Liu D (2023) E2VPT: An effective and efficient approach for visual prompt tuning. arXiv preprint arXiv:2307.13770
Yan L, Han C, Xu Z, Liu D, Wang Q (2023) Prompt learns prompt: exploring knowledge-aware generative prompt collaboration for video captioning. In: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI). International Joint Conferences on Artificial Intelligence Organization, vol 180. https://doi.org/10.24963/ijcai
Chapter Google Scholar
Wang X, Kan M, Shan S, Chen X (2019) Fully learnable group convolution for acceleration of deep neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9041–9050. https://doi.org/10.1109/CVPR.2019.00926
Chapter Google Scholar
Zhang T, Qi GJ, Xiao B, Wang J (2017) Interleaved group convolutions. In: Proceedings of the IEEE international conference on computer vision, pp 4383–4392. https://doi.org/10.1109/ICCV.2017.469
Chapter Google Scholar
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: More features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1577–1586. https://doi.org/10.1109/CVPR42600.2020.00165
Chapter Google Scholar
Liu D, Liang J, Geng T, Loui A, Zhou T (2023) Tripartite feature enhanced pyramid network for dense prediction. IEEE Trans Image Processing 32:2678–2692. https://doi.org/10.1109/TIP.2023.3272826
Article Google Scholar
Liu D, Cui Y, Yan L, Mousas C, Yang B, Chen Y (2021) Densernet: Weakly supervised visual localization using multi-scale feature aggregation. In Proceedings of the AAAI Conference on Artificial Intelligence 35(7):6101–6109
Google Scholar
Liu D, Cui Y, Tan W, Chen Y (2021) Sg-net: Spatial granularity network for one-stage video instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 9811–9820. https://doi.org/10.1109/CVPR46437.2021.00969
Chapter Google Scholar
Li H, Li J, Wei H, Liu Z, Zhan Z, Ren Q (2022) Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv preprint arXiv:2206.02424
Cohen T, Welling M (2016) Group equivariant convolutional networks. In International conference on machine learning ICML 6:4375–4386
Google Scholar
Dai X, Chen Y, Xiao B, Chen D, Liu M, Yuan L, Zhang L (2021) Dynamic head: Unifying object detection heads with attentions. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7369–7378. https://doi.org/10.1109/CVPR46437.2021.00729
Chapter Google Scholar
Moehrs S, Del Guerra A, Herbert DJ, Mandelkern MA (2006) A detector head design for small-animal PET with silicon photomultipliers (SiPM). Phys Med Biol 51(5):1113
Article Google Scholar
Albert PS, Follmann DA (2008) Shared-parameter models. In: Longitudinal data analysis. Chapman and Hall/CR, pp 447–466. https://doi.org/10.1201/9781420011579.CH19
Chapter Google Scholar
Aboah A, Wang B, Bagci U, Adu-Gyamfi Y (2023) Real-time multi-class helmet violation detection using few-shot data sampling technique and yolov8. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 5350–5358. https://doi.org/10.1109/CVPRW59228.2023.00564
Chapter Google Scholar
Talaat FM, ZainEldin H (2023) An improved fire detection approach based on YOLO-v8 for smart cities. Neural Comput Applications 35(28):20939–20954. https://doi.org/10.1007/s00521-023-08809-1
Article Google Scholar
Lou H, Duan X, Guo J, Liu H, Gu J, Bi L, Chen H (2023) DC-YOLOv8: Small-size object detection algorithm based on camera sensor. Electronics 12(10):2323
Article Google Scholar
Hussain M (2023) YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection. Machines 11(7):677
Article Google Scholar
Wang W, Cheng H, Zhou T et al (2023) Visual recognition with deep nearest centroids. arXiv:2209.07383 [cs.CV]

Download references

Author information

Authors and Affiliations

Computer Science and Information Sciences, Chongqing Normal University, Shapingba, Chongqing, 401331, China
Guihao Wen, Ming Li, Yonghang Luo, Chaoshan Shi & Yunfei Tan

Authors

Guihao Wen
View author publications
You can also search for this author in PubMed Google Scholar
Ming Li
View author publications
You can also search for this author in PubMed Google Scholar
Yonghang Luo
View author publications
You can also search for this author in PubMed Google Scholar
Chaoshan Shi
View author publications
You can also search for this author in PubMed Google Scholar
Yunfei Tan
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Ming Li.

Ethics declarations

Conflict of interest

The authors declare that they do not have any conflicts of interest that influence the work reported in this paper.

Ethical approval

No animals were involved in this study. All applicable international, national, and/or institutional guidelines for the care and use of animals were followed.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Cite this article

Wen, G., Li, M., Luo, Y. et al. The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-023-17957-4

Download citation

Received: 20 September 2023
Revised: 06 December 2023
Accepted: 18 December 2023
Published: 02 January 2024
DOI: https://doi.org/10.1007/s11042-023-17957-4

Keywords

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

Abstract

Access this article

Similar content being viewed by others

MobileNetV2-Incep-M: a hybrid lightweight model for the classification of rice plant diseases

A fast accurate fine-grain object detection model based on YOLOv4 deep neural network

Small-target weed-detection model based on YOLO-V4 with improved backbone and neck structures

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Keywords

Navigation

The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules

Abstract

Access this article

Similar content being viewed by others

MobileNetV2-Incep-M: a hybrid lightweight model for the classification of rice plant diseases

A fast accurate fine-grain object detection model based on YOLOv4 deep neural network

Small-target weed-detection model based on YOLO-V4 with improved backbone and neck structures

Data availability

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation