Skip to main content
Log in

SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution

  • Regular Paper
  • Published:
Multimedia Systems Aims and scope Submit manuscript

Abstract

Plant diseases are the leading cause of crop yield reduction. Rapid diagnosis using deep learning-based methods can effectively control the deterioration and spread of diseases. Convolutional Neural Network (CNN)-based methods are the current mainstream disease classification solution. However, most methods based on CNN are aimed at different diseases of a single crop, and they are difficult to distinguish similar diseases, which does not perform well in large-scale and fine-grained disease diagnosis tasks. In this paper, an image classification model for large-scale and fine-grained diseases named Squeeze-and-Excitation Vision Transformer (SEViT) is proposed to solve the above problems. SEViT uses ResNet embedded with channel attention module as the preprocessing network, ViT as the feature classification network. It aims to improve the model’s classification accuracy in the case of many types of diseases and high similarity of disease features. Experimental results show that the classification accuracy of SEViT in the test set achieves 88.34%, higher than comparison models. Compared with the baseline model, the classification accuracy of SEViT is improved by 5.15%.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8

Similar content being viewed by others

Availability of data and materials

Data are available from the authors upon request.

References

  1. Strange, R.N., Scott, P.R.: Plant disease: a threat to global food security. Annu. Rev. Phytopathol. 43, 83–116 (2005)

    Article  Google Scholar 

  2. Arivazhagan, S., Shebiah, R.N., Ananthi, S., Varthini, S.V.: Detection of unhealthy region of plant leaves and classification of plant leaf diseases using texture features. Agric. Eng. Int. CIGR J. 15(1), 211–217 (2013)

    Google Scholar 

  3. Guo, P., Liu, T., Li, N.: Design of automatic recognition of cucumber disease image. Inf. Technol. J. 13(13), 2129 (2014)

    Article  Google Scholar 

  4. Zhang, S., Wu, X., You, Z., Zhang, L.: Leaf image based cucumber disease recognition using sparse representation classification. Comput. Electron. Agric. 134, 135–141 (2017)

    Article  Google Scholar 

  5. Panchal, P., Raman, V.C., Mantri, S.: Plant diseases detection and classification using machine learning models. In: 2019 4th International Conference on Computational Systems and Information Technology for Sustainable Solution (CSITSS), 4, 1–6 (2019). IEEE

  6. Zhang, S., Zhang, S., Zhang, C., Wang, X., Shi, Y.: Cucumber leaf disease identification with global pooling dilated convolutional neural network. Comput. Electron. Agric. 162, 422–430 (2019)

    Article  Google Scholar 

  7. Too, E.C., Yujian, L., Njuki, S., Yingchun, L.: A comparative study of fine-tuning deep learning models for plant disease identification. Comput. Electron. Agric. 161, 272–279 (2019)

    Article  Google Scholar 

  8. Karthik, R., Hariharan, M., Anand, S., Mathikshara, P., Johnson, A., Menaka, R.: Attention embedded residual CNN for disease detection in tomato leaves. Appl. Soft Comput. 86, 105933 (2020)

    Article  Google Scholar 

  9. Zhao, Y., Chen, J., Xu, X., Lei, J., Zhou, W.: Sev-net: residual network embedded with attention mechanism for plant disease severity detection. Concurr. Comput.: Pract. Exper. 33(10), 6161 (2021)

    Article  Google Scholar 

  10. Vaswani, A., Shazeer, N., Parmar, N., Uszkoreit, J., Jones, L., Gomez, A.N., Kaiser, Ł., Polosukhin, I.: Attention is all you need. Adv. Neural Inf. Process. Syst. 30 (2017)

  11. Dosovitskiy, A., Beyer, L., Kolesnikov, A., Weissenborn, D., Zhai, X., Unterthiner, T., Dehghani, M., Minderer, M., Heigold, G., Gelly, S., et al.: An image is worth 16x16 words: Transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

  12. Deng, J.: A large-scale hierarchical image database. In: Proceeding of IEEE Computer Vision and Pattern Recognition, 2009 (2009)

  13. Hirani, E., Magotra, V., Jain, J., Bide, P.: Plant disease detection using deep learning. In: 2021 6th International Conference for Convergence in Technology (I2CT), pp. 1–4 (2021). IEEE

  14. Hughes, D., Salathé, M., et al.: An open access repository of images on plant health to enable the development of mobile disease diagnostics. arXiv preprint arXiv:1511.08060 (2015)

  15. Wu, S., Sun, Y., Huang, H.: Multi-granularity feature extraction based on vision transformer for tomato leaf disease recognition. In: 2021 3rd International Academic Exchange Conference on Science and Technology Innovation (IAECST), pp. 387–390 (2021). IEEE

  16. Zhuang, L.: Deep-learning-based diagnosis of cassava leaf diseases using vision transformer. In: 2021 4th Artificial Intelligence and Cloud Computing Conference, pp. 74–79 (2021)

  17. Zhang, Z., Gong, Z., Hong, Q., Jiang, L.: Swin-transformer based classification for rice diseases recognition. In: 2021 International Conference on Computer Information Science and Artificial Intelligence (CISAI), pp. 153–156 (2021). IEEE

  18. Hu, J., Shen, L., Sun, G.: Squeeze-and-excitation networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 7132–7141 (2018)

  19. He, K., Zhang, X., Ren, S., Sun, J.: Deep residual learning for image recognition. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 770–778 (2016)

  20. Howard, A., Sandler, M., Chu, G., Chen, L.-C., Chen, B., Tan, M., Wang, W., Zhu, Y., Pang, R., Vasudevan, V., et al.: Searching for mobilenetv3. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1314–1324 (2019)

  21. Tan, M., Le, Q.V.: Efficientnet: rethinking model scaling for convolutional neural networks. ArXiv arXiv:1905.11946 (2019)

  22. Simonyan, K., Zisserman, A.: Very deep convolutional networks for large-scale image recognition. arXiv preprint arXiv:1409.1556 (2014)

  23. Huang, G., Liu, Z., Van Der Maaten, L., Weinberger, K.Q.: Densely connected convolutional networks. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 4700–4708 (2017)

  24. Szegedy, C., Ioffe, S., Vanhoucke, V., Alemi, A.A.: Inception-v4, inception-resnet and the impact of residual connections on learning. In: Thirty-first AAAI Conference on Artificial Intelligence (2017)

  25. Chollet, F.: Xception: Deep learning with depthwise separable convolutions. In: Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pp. 1251–1258 (2017)

  26. Liu, Z., Mao, H., Wu, C.-Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. arXiv preprint arXiv:2201.03545 (2022)

Download references

Funding

This work was supported in part by NSFC (U1931207 and 61702306), Sci. Tech. Development Fund of Shandong Province of China (ZR2022MF288, ZR2017MF02 and ZR2022MF319), and the Taishan Scholar Program of Shandong Province.

Author information

Authors and Affiliations

Authors

Corresponding authors

Correspondence to Shansong Wang or Weijian Ni.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

This paper contains no cases of studies with human participants performed by any of the authors.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Zeng, Q., Niu, L., Wang, S. et al. SEViT: a large-scale and fine-grained plant disease classification model based on transformer and attention convolution. Multimedia Systems 29, 1001–1010 (2023). https://doi.org/10.1007/s00530-022-01034-1

Download citation

  • Received:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1007/s00530-022-01034-1

Keywords

Navigation