Skip to main content
Log in

Unraveling the complexity: deep learning for imbalanced retinal lesion detection and multi-disease identification

  • Original Article
  • Published:
Network Modeling Analysis in Health Informatics and Bioinformatics Aims and scope Submit manuscript

Abstract

Deep learning (DL) has been widely used to detect abnormalities in retinal image. Typically, this task has been focused on a specific domain, such as diseases related to glaucoma or diabetic retinopathy, for example. In this study, we propose to identify lesions associated with both diseases using a single base model, Cascade R-CNN, avoiding the use of multiple DL models. The task is complicated by the need for annotations in datasets related to damages in another domain for which it was created. In addition, the size and shape of objects and bias toward predominant classes are evident. Several techniques characterize this work, including soft labeling for mask predictions, normalized Wasserstein distance for handling small objects, and experiments in image sampling during training with cross-entropy loss combined with Online Hard Negative Mining or asymmetric loss. For result refinement, cluster-weighted with Distance IoU improved final predictions. Based on mean average precision (mAP), a standard metric in object detection models, the reported results were 0.46, and all experiments were conducted on the public DDR dataset. A detailed error analysis by category was provided. In conclusion, the feasibility of using a single model was demonstrated, while the techniques employed helped to increase mAP-related metrics. Our research provides novel insights into the use of retinal photographs for the prediction of systemic biomarkers associated with multiple diseases.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Fig. 7
Fig. 8
Fig. 9
Fig. 10
Fig. 11

Similar content being viewed by others

Data availability

Not applicable.

References

Download references

Acknowledgements

The authors would like to thank the National Council of Research of Mexico (CONACyT), the Faculty of Engineering at the Autonomous University of Querétaro, the Instituto Mexicano de Oftalmología IAP, and University of Saskatchewan, Department of Electrical and Computer Engineering for their advice and support in this investigation.

Funding

This research received no external funding.

Author information

Authors and Affiliations

Authors

Contributions

GA-F: Conceptualization, methodology, software, validation, formal analysis, data curation, writing—original draft, and visualization; ST-A: Conceptualization, writing—original draft, supervision, and project administration; JCP-O: Writing—review and editing, and supervision; MT-A: Methodology and funding; MAA-F: Writing—reviewing and editing; JR-R: Writing—reviewing and editing; MB-F: Review and editing, and supervision; S-BK: Writing—reviewing and editing. All authors have read and agreed to the published version of the manuscript.

Corresponding author

Correspondence to Saul Tovar-Arriaga.

Ethics declarations

Conflict of interest

The authors declare no conflict of interest.

Ethical approval

Not applicable.

Consent to participate

Not applicable.

Consent for publication

Not applicable.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Alfonso-Francia, G., Pedraza-Ortega, J.C., Toledano-Ayala, M. et al. Unraveling the complexity: deep learning for imbalanced retinal lesion detection and multi-disease identification. Netw Model Anal Health Inform Bioinforma 13, 3 (2024). https://doi.org/10.1007/s13721-023-00438-x

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s13721-023-00438-x

Keywords

Navigation