Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment

Tan, Zhuoyi; Madzin, Hizmawati; Ding, Zeyu

doi:10.1007/978-3-031-33658-4_10

Part of the book series: Lecture Notes in Computer Science ((LNCS,volume 13597))

Included in the following conference series:

242 Accesses

Abstract

People with diabetes are more likely to develop diabetic retinopathy (DR) than healthy people. However, DR is the leading cause of blindness. At present, the diagnosis of diabetic retinopathy mainly relies on the experienced clinician to recognize the fine features in color fundus images. This is a time-consuming task. Therefore, in this paper, to promote the development of UW-OCTA DR automatic detection, we propose a novel semi-supervised semantic segmentation method for UW-OCTA DR image grade assessment. This method, first, uses the MAE algorithm to perform semi-supervised pre-training on the UW-OCTA DR grade assessment dataset to mine the supervised information in the UW-OCTA images, thereby alleviating the need for labeled data. Secondly, to more fully mine the lesion features of each region in the UW-OCTA image, this paper constructs a cross-algorithm ensemble DR tissue segmentation algorithm by deploying three algorithms with different visual feature processing strategies. The algorithm contains three sub-algorithms, namely pre-trained MAE, ConvNeXt, and SegFormer. Based on the initials of these three sub-algorithms, the algorithm can be named MCS-DRNet. Finally, we use the MCS-DRNet algorithm as an inspector to check and revise the results of the preliminary evaluation of the DR grade evaluation algorithm. The experimental results show that the mean dice similarity coefficient of MCS-DRNet v1 and v2 are 0.5161 and 0.5544, respectively. The quadratic weighted kappa of the DR grading evaluation is 0.7559. Our code is available at https://github.com/SupCodeTech/DRAC2022.

Supported by Ministry of Higher Education, Malaysia.

This is a preview of subscription content, log in via an institution to check access.

Access this chapter

Log in via an institution

Chapter: USD 29.95; Price excludes VAT (USA)

eBook: USD 54.99; Price excludes VAT (USA)

Softcover Book: USD 69.99; Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Purchases are for personal use only

Institutional subscriptions

References

Bao, H., Dong, L., Wei, F.: Beit: bert pre-training of image transformers. arXiv preprint arXiv:2106.08254 (2021)
Caron, M., Bojanowski, P., Mairal, J., Joulin, A.: Unsupervised pre-training of image features on non-curated data. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2959–2968 (2019)
Google Scholar
Caron, M., Touvron, H., Misra, I., Jégou, H., Mairal, J., Bojanowski, P., Joulin, A.: Emerging properties in self-supervised vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9650–9660 (2021)
Google Scholar
Chen, X., He, K.: Exploring simple siamese representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15750–15758 (2021)
Google Scholar
Contributors, M.: MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark. https://github.com/open-mmlab/mmsegmentation (2020)
Contributors, M.: MMSelfSup: Openmmlab self-supervised learning toolbox and benchmark. https://github.com/open-mmlab/mmselfsup (2021)
Dai, L., et al.: A deep learning system for detecting diabetic retinopathy across the disease spectrum. Nat. Commun. 12(1), 1–11 (2021)
Article Google Scholar
Doersch, C., Gupta, A.K., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1422–1430 (2015)
Google Scholar
DosoViTskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
He, K., Chen, X., Xie, S., Li, Y., Dollár, P., Girshick, R.: Masked autoencoders are scalable vision learners. arXiv preprint arXiv:2111.06377 (2021)
Khan, A., AlBarri, S., Manzoor, M.A.: Contrastive self-supervised learning: a survey on different architectures. In: 2022 2nd International Conference on Artificial Intelligence (ICAI), pp. 1–6. IEEE (2022)
Google Scholar
Liu, R., et al.: Deepdrid: diabetic retinopathy-grading and image quality estimation challenge. Patterns 3(6), 100512 (2022)
Article Google Scholar
Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)
Google Scholar
Madzin, H., Zainuddin, R.: Feature extraction and image matching of 3d lung cancer cell image. In: 2009 International Conference of Soft Computing and Pattern Recognition, pp. 511–515. IEEE (2009)
Google Scholar
Madzin, H., Zainuddin, R., Mohamed, N.S.: Analysis of visual features in local descriptor for multi-modality medical image. Int. Arab J. Inf. Technol. (IAJIT) 11(5) (2014)
Google Scholar
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Chapter Google Scholar
Sheng, B., et al.: An overview of artificial intelligence in diabetic retinopathy and other ocular diseases. Front. Public Health 10, 971943 (2022)
Article Google Scholar
Sheng, B., et al.: Diabetic retinopathy analysis challenge 2022, March 2022. https://doi.org/10.5281/zenodo.6362349
Srivastava, N.: Unsupervised learning of visual representations using videos (2015)
Google Scholar
Tan, M., Le, Q.: Efficientnetv2: smaller models and faster training. In: International Conference on Machine Learning, pp. 10096–10106. PMLR (2021)
Google Scholar
Tan, Z., Hu, Y., Luo, D., Hu, M., Liu, K.: The clothing image classification algorithm based on the improved xception model. Int. J. Comput. Sci. Eng. 23(3), 214–223 (2020). https://doi.org/10.1504/ijcse.2020.111426
Article Google Scholar
Wang, X., He, K., Gupta, A.K.: Transitive invariance for self-supervised visual representation learning. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1338–1347 (2017)
Google Scholar
Wu, Z., Xiong, Y., Yu, S.X., Lin, D.: Unsupervised feature learning via non-parametric instance discrimination. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3733–3742 (2018)
Google Scholar
Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: Segformer: simple and efficient design for semantic segmentation with transformers. arXiv preprint arXiv:2105.15203 (2021)
Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: Cutmix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)
Google Scholar
Zhai, X., Oliver, A., Kolesnikov, A., Beyer, L.: S4l: self-supervised semi-supervised learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1476–1485 (2019)
Google Scholar
Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)

Download references

Author information

Authors and Affiliations

Faculty of Computer Science and Information Technology, Universiti Putra Malaysia, Serdang, 43400, Malaysia
Zhuoyi Tan, Hizmawati Madzin & Zeyu Ding

Authors

Zhuoyi Tan
View author publications
You can also search for this author in PubMed Google Scholar
Hizmawati Madzin
View author publications
You can also search for this author in PubMed Google Scholar
Zeyu Ding
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Hizmawati Madzin .

Editor information

Editors and Affiliations

Shanghai Jiao Tong University, Shanghai, China
Bin Sheng
Technische Hochschule Ingolstadt, Ingolstadt, Germany
Marc Aubreville

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Tan, Z., Madzin, H., Ding, Z. (2023). Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment. In: Sheng, B., Aubreville, M. (eds) Mitosis Domain Generalization and Diabetic Retinopathy Analysis. MIDOG DRAC 2022 2022. Lecture Notes in Computer Science, vol 13597. Springer, Cham. https://doi.org/10.1007/978-3-031-33658-4_10

Download citation

DOI: https://doi.org/10.1007/978-3-031-33658-4_10
Published: 30 May 2023
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33657-7
Online ISBN: 978-3-031-33658-4
eBook Packages: Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Societies and partnerships

The Medical Image Computing and Computer Assisted Intervention Society (opens in a new tab)

Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment