Skip to main content

Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment

  • Conference paper
  • First Online:
Mitosis Domain Generalization and Diabetic Retinopathy Analysis (MIDOG 2022, DRAC 2022)

Abstract

People with diabetes are more likely to develop diabetic retinopathy (DR) than healthy people. However, DR is the leading cause of blindness. At present, the diagnosis of diabetic retinopathy mainly relies on the experienced clinician to recognize the fine features in color fundus images. This is a time-consuming task. Therefore, in this paper, to promote the development of UW-OCTA DR automatic detection, we propose a novel semi-supervised semantic segmentation method for UW-OCTA DR image grade assessment. This method, first, uses the MAE algorithm to perform semi-supervised pre-training on the UW-OCTA DR grade assessment dataset to mine the supervised information in the UW-OCTA images, thereby alleviating the need for labeled data. Secondly, to more fully mine the lesion features of each region in the UW-OCTA image, this paper constructs a cross-algorithm ensemble DR tissue segmentation algorithm by deploying three algorithms with different visual feature processing strategies. The algorithm contains three sub-algorithms, namely pre-trained MAE, ConvNeXt, and SegFormer. Based on the initials of these three sub-algorithms, the algorithm can be named MCS-DRNet. Finally, we use the MCS-DRNet algorithm as an inspector to check and revise the results of the preliminary evaluation of the DR grade evaluation algorithm. The experimental results show that the mean dice similarity coefficient of MCS-DRNet v1 and v2 are 0.5161 and 0.5544, respectively. The quadratic weighted kappa of the DR grading evaluation is 0.7559. Our code is available at https://github.com/SupCodeTech/DRAC2022.

Supported by Ministry of Higher Education, Malaysia.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Chapter
USD 29.95
Price excludes VAT (USA)
  • Available as PDF
  • Read on any device
  • Instant download
  • Own it forever
eBook
USD 54.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book
USD 69.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

  1. Bao, H., Dong, L., Wei, F.: Beit: bert pre-training of image transformers. arXiv preprint arXiv:2106.08254 (2021)

  2. Caron, M., Bojanowski, P., Mairal, J., Joulin, A.: Unsupervised pre-training of image features on non-curated data. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2959–2968 (2019)

    Google Scholar 

  3. Caron, M., Touvron, H., Misra, I., Jégou, H., Mairal, J., Bojanowski, P., Joulin, A.: Emerging properties in self-supervised vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9650–9660 (2021)

    Google Scholar 

  4. Chen, X., He, K.: Exploring simple siamese representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15750–15758 (2021)

    Google Scholar 

  5. Contributors, M.: MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark. https://github.com/open-mmlab/mmsegmentation (2020)

  6. Contributors, M.: MMSelfSup: Openmmlab self-supervised learning toolbox and benchmark. https://github.com/open-mmlab/mmselfsup (2021)

  7. Dai, L., et al.: A deep learning system for detecting diabetic retinopathy across the disease spectrum. Nat. Commun. 12(1), 1–11 (2021)

    Article  Google Scholar 

  8. Doersch, C., Gupta, A.K., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1422–1430 (2015)

    Google Scholar 

  9. DosoViTskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)

  10. He, K., Chen, X., Xie, S., Li, Y., Dollár, P., Girshick, R.: Masked autoencoders are scalable vision learners. arXiv preprint arXiv:2111.06377 (2021)

  11. Khan, A., AlBarri, S., Manzoor, M.A.: Contrastive self-supervised learning: a survey on different architectures. In: 2022 2nd International Conference on Artificial Intelligence (ICAI), pp. 1–6. IEEE (2022)

    Google Scholar 

  12. Liu, R., et al.: Deepdrid: diabetic retinopathy-grading and image quality estimation challenge. Patterns 3(6), 100512 (2022)

    Article  Google Scholar 

  13. Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)

    Google Scholar 

  14. Madzin, H., Zainuddin, R.: Feature extraction and image matching of 3d lung cancer cell image. In: 2009 International Conference of Soft Computing and Pattern Recognition, pp. 511–515. IEEE (2009)

    Google Scholar 

  15. Madzin, H., Zainuddin, R., Mohamed, N.S.: Analysis of visual features in local descriptor for multi-modality medical image. Int. Arab J. Inf. Technol. (IAJIT) 11(5) (2014)

    Google Scholar 

  16. Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28

    Chapter  Google Scholar 

  17. Sheng, B., et al.: An overview of artificial intelligence in diabetic retinopathy and other ocular diseases. Front. Public Health 10, 971943 (2022)

    Article  Google Scholar 

  18. Sheng, B., et al.: Diabetic retinopathy analysis challenge 2022, March 2022. https://doi.org/10.5281/zenodo.6362349

  19. Srivastava, N.: Unsupervised learning of visual representations using videos (2015)

    Google Scholar 

  20. Tan, M., Le, Q.: Efficientnetv2: smaller models and faster training. In: International Conference on Machine Learning, pp. 10096–10106. PMLR (2021)

    Google Scholar 

  21. Tan, Z., Hu, Y., Luo, D., Hu, M., Liu, K.: The clothing image classification algorithm based on the improved xception model. Int. J. Comput. Sci. Eng. 23(3), 214–223 (2020). https://doi.org/10.1504/ijcse.2020.111426

    Article  Google Scholar 

  22. Wang, X., He, K., Gupta, A.K.: Transitive invariance for self-supervised visual representation learning. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1338–1347 (2017)

    Google Scholar 

  23. Wu, Z., Xiong, Y., Yu, S.X., Lin, D.: Unsupervised feature learning via non-parametric instance discrimination. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3733–3742 (2018)

    Google Scholar 

  24. Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: Segformer: simple and efficient design for semantic segmentation with transformers. arXiv preprint arXiv:2105.15203 (2021)

  25. Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: Cutmix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)

    Google Scholar 

  26. Zhai, X., Oliver, A., Kolesnikov, A., Beyer, L.: S4l: self-supervised semi-supervised learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1476–1485 (2019)

    Google Scholar 

  27. Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)

Download references

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Hizmawati Madzin .

Editor information

Editors and Affiliations

Rights and permissions

Reprints and permissions

Copyright information

© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG

About this paper

Check for updates. Verify currency and authenticity via CrossMark

Cite this paper

Tan, Z., Madzin, H., Ding, Z. (2023). Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment. In: Sheng, B., Aubreville, M. (eds) Mitosis Domain Generalization and Diabetic Retinopathy Analysis. MIDOG DRAC 2022 2022. Lecture Notes in Computer Science, vol 13597. Springer, Cham. https://doi.org/10.1007/978-3-031-33658-4_10

Download citation

  • DOI: https://doi.org/10.1007/978-3-031-33658-4_10

  • Published:

  • Publisher Name: Springer, Cham

  • Print ISBN: 978-3-031-33657-7

  • Online ISBN: 978-3-031-33658-4

  • eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics