Abstract
Addressing the challenges of high model complexity, low generalization capability, and suboptimal detection performance in most algorithms for crop leaf disease detection, the paper propose a lightweight enhanced YOLOv8 algorithm. First, by incorporating the advantages of GhostNet's feature redundancy reduction and MobileNet's ability to fuse diverse channel features using the concept of Group Convolution, the paper enhance the backbone network. This improves the network's ability to extract critical features from a multitude of similar redundant feature maps. Second, to improve detection accuracy while reducing model parameters and computational load, the paper introduce the slim-Neck module. Finally, addressing the issue where detection head parameters and computations account for over half of the model, the paper restructure the Head using the concept of shared parameters and integrate convolution blocks that enhance multi-scale information recognition. Results from multiple experiments, averaged for consistency, indicate that compared to the original YOLOv8 algorithm, the improved algorithm achieves an increase in mAP50 from 86% to 87.3% and mAP50:95 from 67% to 68.6%. The model's size is a mere 5.45 MB, and the computational parameter GFLOPs is reduced from 8.1 to 5.5, even lower than the most lightweight YOLOv5. In comparison to other large-model algorithms, this model also demonstrates strong competitiveness in detection accuracy.
Similar content being viewed by others
Data availability
Data will be made available on reasonable request.
References
Orchi H, Sadik M, Khaldoun M, Sabir E (2023) Real-time detection of crop leaf diseases using enhanced YOLOv8 algorithm. In: 2023 International Wireless Communications and Mobile Computing (IWCMC). Marrakesh, Morocco 1690–1696
Terven J, Cordova-Esparza D (2023) A comprehensive review of YOLO: From YOLOv1 to YOLOv8 and beyond. arXiv preprint arXiv:2304.00501
Bhosale YH, Zanwar SR, Ali SS, Vaidya NS, Auti RA, Patil DH (2023) Multi-plant and multi-crop leaf disease detection and classification using deep neural networks, machine learning, image processing with precision agriculture - A review. In: 2023 International Conference on Computer Communication and Informatics (ICCCI), Coimbatore, pp 1–7. https://doi.org/10.1109/ICCCI56745.2023.10128246
Li Y, Fan Q, Huang H, Han Z, Gu Q (2023) a modified yolov8 detection network for UAV aerial image recognition. Drones 7(5):304
Liu J, Wang X (2020) Tomato diseases and pests detection based on improved yolo v3 convolutional neural network. Front Plant Sci 11:898
Li H, Li C, Li G, Chen L (2021) A real-time table grape detection method based on improved yolov4-tiny network in complex back-ground. Biosys Eng 212:347–359
Mathew MP, Mahesh TY (2022) Leaf-based disease detection in bell pepper plant using yolo v5. Signal Image Video Process 16(3):841–847. https://doi.org/10.1007/s11760-021-02024-y
Jia M, Tang L, Chen BC, Cardie C, Belongie S, Hariharan B, Lim SN (2022) Visual prompt tuning. In: European conference on computer vision, vol 13693. LNCS, pp 709–727. https://doi.org/10.1007/978-3-031-19827-4_41
Han C, Wang Q, Cui Y, Cao Z, Wang W, Qi S, Liu D (2023) E2VPT: An effective and efficient approach for visual prompt tuning. arXiv preprint arXiv:2307.13770
Yan L, Han C, Xu Z, Liu D, Wang Q (2023) Prompt learns prompt: exploring knowledge-aware generative prompt collaboration for video captioning. In: Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence (IJCAI). International Joint Conferences on Artificial Intelligence Organization, vol 180. https://doi.org/10.24963/ijcai
Wang X, Kan M, Shan S, Chen X (2019) Fully learnable group convolution for acceleration of deep neural networks. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 9041–9050. https://doi.org/10.1109/CVPR.2019.00926
Zhang T, Qi GJ, Xiao B, Wang J (2017) Interleaved group convolutions. In: Proceedings of the IEEE international conference on computer vision, pp 4383–4392. https://doi.org/10.1109/ICCV.2017.469
Han K, Wang Y, Tian Q, Guo J, Xu C, Xu C (2020) Ghostnet: More features from cheap operations. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 1577–1586. https://doi.org/10.1109/CVPR42600.2020.00165
Liu D, Liang J, Geng T, Loui A, Zhou T (2023) Tripartite feature enhanced pyramid network for dense prediction. IEEE Trans Image Processing 32:2678–2692. https://doi.org/10.1109/TIP.2023.3272826
Liu D, Cui Y, Yan L, Mousas C, Yang B, Chen Y (2021) Densernet: Weakly supervised visual localization using multi-scale feature aggregation. In Proceedings of the AAAI Conference on Artificial Intelligence 35(7):6101–6109
Liu D, Cui Y, Tan W, Chen Y (2021) Sg-net: Spatial granularity network for one-stage video instance segmentation. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp 9811–9820. https://doi.org/10.1109/CVPR46437.2021.00969
Li H, Li J, Wei H, Liu Z, Zhan Z, Ren Q (2022) Slim-neck by GSConv: A better design paradigm of detector architectures for autonomous vehicles. arXiv preprint arXiv:2206.02424
Cohen T, Welling M (2016) Group equivariant convolutional networks. In International conference on machine learning ICML 6:4375–4386
Dai X, Chen Y, Xiao B, Chen D, Liu M, Yuan L, Zhang L (2021) Dynamic head: Unifying object detection heads with attentions. In: Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pp 7369–7378. https://doi.org/10.1109/CVPR46437.2021.00729
Moehrs S, Del Guerra A, Herbert DJ, Mandelkern MA (2006) A detector head design for small-animal PET with silicon photomultipliers (SiPM). Phys Med Biol 51(5):1113
Albert PS, Follmann DA (2008) Shared-parameter models. In: Longitudinal data analysis. Chapman and Hall/CR, pp 447–466. https://doi.org/10.1201/9781420011579.CH19
Aboah A, Wang B, Bagci U, Adu-Gyamfi Y (2023) Real-time multi-class helmet violation detection using few-shot data sampling technique and yolov8. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops, pp 5350–5358. https://doi.org/10.1109/CVPRW59228.2023.00564
Talaat FM, ZainEldin H (2023) An improved fire detection approach based on YOLO-v8 for smart cities. Neural Comput Applications 35(28):20939–20954. https://doi.org/10.1007/s00521-023-08809-1
Lou H, Duan X, Guo J, Liu H, Gu J, Bi L, Chen H (2023) DC-YOLOv8: Small-size object detection algorithm based on camera sensor. Electronics 12(10):2323
Hussain M (2023) YOLO-v1 to YOLO-v8, the rise of YOLO and its complementary nature toward digital manufacturing and industrial defect detection. Machines 11(7):677
Wang W, Cheng H, Zhou T et al (2023) Visual recognition with deep nearest centroids. arXiv:2209.07383 [cs.CV]
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they do not have any conflicts of interest that influence the work reported in this paper.
Ethical approval
No animals were involved in this study. All applicable international, national, and/or institutional guidelines for the care and use of animals were followed.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Wen, G., Li, M., Luo, Y. et al. The improved YOLOv8 algorithm based on EMSPConv and SPE-head modules. Multimed Tools Appl (2024). https://doi.org/10.1007/s11042-023-17957-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s11042-023-17957-4