Abstract
People with diabetes are more likely to develop diabetic retinopathy (DR) than healthy people. However, DR is the leading cause of blindness. At present, the diagnosis of diabetic retinopathy mainly relies on the experienced clinician to recognize the fine features in color fundus images. This is a time-consuming task. Therefore, in this paper, to promote the development of UW-OCTA DR automatic detection, we propose a novel semi-supervised semantic segmentation method for UW-OCTA DR image grade assessment. This method, first, uses the MAE algorithm to perform semi-supervised pre-training on the UW-OCTA DR grade assessment dataset to mine the supervised information in the UW-OCTA images, thereby alleviating the need for labeled data. Secondly, to more fully mine the lesion features of each region in the UW-OCTA image, this paper constructs a cross-algorithm ensemble DR tissue segmentation algorithm by deploying three algorithms with different visual feature processing strategies. The algorithm contains three sub-algorithms, namely pre-trained MAE, ConvNeXt, and SegFormer. Based on the initials of these three sub-algorithms, the algorithm can be named MCS-DRNet. Finally, we use the MCS-DRNet algorithm as an inspector to check and revise the results of the preliminary evaluation of the DR grade evaluation algorithm. The experimental results show that the mean dice similarity coefficient of MCS-DRNet v1 and v2 are 0.5161 and 0.5544, respectively. The quadratic weighted kappa of the DR grading evaluation is 0.7559. Our code is available at https://github.com/SupCodeTech/DRAC2022.
Supported by Ministry of Higher Education, Malaysia.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
References
Bao, H., Dong, L., Wei, F.: Beit: bert pre-training of image transformers. arXiv preprint arXiv:2106.08254 (2021)
Caron, M., Bojanowski, P., Mairal, J., Joulin, A.: Unsupervised pre-training of image features on non-curated data. In: 2019 IEEE/CVF International Conference on Computer Vision (ICCV), pp. 2959–2968 (2019)
Caron, M., Touvron, H., Misra, I., Jégou, H., Mairal, J., Bojanowski, P., Joulin, A.: Emerging properties in self-supervised vision transformers. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 9650–9660 (2021)
Chen, X., He, K.: Exploring simple siamese representation learning. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 15750–15758 (2021)
Contributors, M.: MMSegmentation: Openmmlab semantic segmentation toolbox and benchmark. https://github.com/open-mmlab/mmsegmentation (2020)
Contributors, M.: MMSelfSup: Openmmlab self-supervised learning toolbox and benchmark. https://github.com/open-mmlab/mmselfsup (2021)
Dai, L., et al.: A deep learning system for detecting diabetic retinopathy across the disease spectrum. Nat. Commun. 12(1), 1–11 (2021)
Doersch, C., Gupta, A.K., Efros, A.A.: Unsupervised visual representation learning by context prediction. In: 2015 IEEE International Conference on Computer Vision (ICCV), pp. 1422–1430 (2015)
DosoViTskiy, A., et al.: An image is worth 16x16 words: transformers for image recognition at scale. arXiv preprint arXiv:2010.11929 (2020)
He, K., Chen, X., Xie, S., Li, Y., Dollár, P., Girshick, R.: Masked autoencoders are scalable vision learners. arXiv preprint arXiv:2111.06377 (2021)
Khan, A., AlBarri, S., Manzoor, M.A.: Contrastive self-supervised learning: a survey on different architectures. In: 2022 2nd International Conference on Artificial Intelligence (ICAI), pp. 1–6. IEEE (2022)
Liu, R., et al.: Deepdrid: diabetic retinopathy-grading and image quality estimation challenge. Patterns 3(6), 100512 (2022)
Liu, Z., Mao, H., Wu, C.Y., Feichtenhofer, C., Darrell, T., Xie, S.: A convnet for the 2020s. In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) (2022)
Madzin, H., Zainuddin, R.: Feature extraction and image matching of 3d lung cancer cell image. In: 2009 International Conference of Soft Computing and Pattern Recognition, pp. 511–515. IEEE (2009)
Madzin, H., Zainuddin, R., Mohamed, N.S.: Analysis of visual features in local descriptor for multi-modality medical image. Int. Arab J. Inf. Technol. (IAJIT) 11(5) (2014)
Ronneberger, O., Fischer, P., Brox, T.: U-net: convolutional networks for biomedical image segmentation. In: Navab, N., Hornegger, J., Wells, W.M., Frangi, A.F. (eds.) MICCAI 2015. LNCS, vol. 9351, pp. 234–241. Springer, Cham (2015). https://doi.org/10.1007/978-3-319-24574-4_28
Sheng, B., et al.: An overview of artificial intelligence in diabetic retinopathy and other ocular diseases. Front. Public Health 10, 971943 (2022)
Sheng, B., et al.: Diabetic retinopathy analysis challenge 2022, March 2022. https://doi.org/10.5281/zenodo.6362349
Srivastava, N.: Unsupervised learning of visual representations using videos (2015)
Tan, M., Le, Q.: Efficientnetv2: smaller models and faster training. In: International Conference on Machine Learning, pp. 10096–10106. PMLR (2021)
Tan, Z., Hu, Y., Luo, D., Hu, M., Liu, K.: The clothing image classification algorithm based on the improved xception model. Int. J. Comput. Sci. Eng. 23(3), 214–223 (2020). https://doi.org/10.1504/ijcse.2020.111426
Wang, X., He, K., Gupta, A.K.: Transitive invariance for self-supervised visual representation learning. In: 2017 IEEE International Conference on Computer Vision (ICCV), pp. 1338–1347 (2017)
Wu, Z., Xiong, Y., Yu, S.X., Lin, D.: Unsupervised feature learning via non-parametric instance discrimination. In: 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 3733–3742 (2018)
Xie, E., Wang, W., Yu, Z., Anandkumar, A., Alvarez, J.M., Luo, P.: Segformer: simple and efficient design for semantic segmentation with transformers. arXiv preprint arXiv:2105.15203 (2021)
Yun, S., Han, D., Oh, S.J., Chun, S., Choe, J., Yoo, Y.: Cutmix: regularization strategy to train strong classifiers with localizable features. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 6023–6032 (2019)
Zhai, X., Oliver, A., Kolesnikov, A., Beyer, L.: S4l: self-supervised semi-supervised learning. In: Proceedings of the IEEE/CVF International Conference on Computer Vision, pp. 1476–1485 (2019)
Zhang, H., Cisse, M., Dauphin, Y.N., Lopez-Paz, D.: mixup: beyond empirical risk minimization. arXiv preprint arXiv:1710.09412 (2017)
Author information
Authors and Affiliations
Corresponding author
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2023 The Author(s), under exclusive license to Springer Nature Switzerland AG
About this paper
Cite this paper
Tan, Z., Madzin, H., Ding, Z. (2023). Semi-supervised Semantic Segmentation Methods for UW-OCTA Diabetic Retinopathy Grade Assessment. In: Sheng, B., Aubreville, M. (eds) Mitosis Domain Generalization and Diabetic Retinopathy Analysis. MIDOG DRAC 2022 2022. Lecture Notes in Computer Science, vol 13597. Springer, Cham. https://doi.org/10.1007/978-3-031-33658-4_10
Download citation
DOI: https://doi.org/10.1007/978-3-031-33658-4_10
Published:
Publisher Name: Springer, Cham
Print ISBN: 978-3-031-33657-7
Online ISBN: 978-3-031-33658-4
eBook Packages: Computer ScienceComputer Science (R0)