An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases

Inoda, Satoru; Takahashi, Hidenori; Arai, Yusuke; Tampo, Hironobu; Matsui, Yoshitsugu; Kawashima, Hidetoshi; Yanagi, Yasuo

doi:10.1007/s00417-023-06054-9

An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases

Retinal Disorders
Open access
Published: 11 May 2023

Volume 261, pages 2775–2785, (2023)
Cite this article

Download PDF

You have full access to this open access article

Graefe's Archive for Clinical and Experimental Ophthalmology Aims and scope Submit manuscript

An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases

Download PDF

Satoru Inoda¹,
Hidenori Takahashi ORCID: orcid.org/0000-0001-5331-4730¹,
Yusuke Arai¹,
Hironobu Tampo¹,
Yoshitsugu Matsui²,
Hidetoshi Kawashima¹ &
…
Yasuo Yanagi^3,4

1681 Accesses
2 Altmetric
Explore all metrics

Abstract

Purpose

To develop an artificial intelligence (AI) model for estimating best-corrected visual acuity (BCVA) using horizontal and vertical optical coherence tomography (OCT) scans of various retinal diseases and examine factors associated with its accuracy.

Methods

OCT images and associated BCVA measurements from 2,700 OCT images (accrued from 2004 to 2018 with an Atlantis, Triton; Topcon, Tokyo, Japan) of 756 eyes of 469 patients and their BCVA were retrospectively analysed. For each eye, one horizontal and one vertical OCT scan in cross-line mode were used. The GoogLeNet architecture was implemented. The coefficient of determination (R²), root mean square error (RMSE) and mean absolute error (MAE) were computed to evaluate the performance of the trained network.

Results

R², RMSE, and MAE were 0.512, 0.350, and 0.321, respectively. R² was higher in phakic eyes than in pseudophakic eyes. Multivariable regression analysis showed that a higher R² was significantly associated with better BCVA (p < 0.001) and a higher standard deviation of BCVA (p < 0.001). However, the performance was worse in an external validation, with R² of 0.19. R² values for retinal vein occlusion and age-related macular degeneration were 0.961 and 0.373 in the internal validation but 0.20 and 0.22 in the external validation.

Conclusion

Although underspecification appears to be a fundamental problem to be addressed in AI models for predicting visual acuity, the present results suggest that AI models might have potential for estimating BCVA from OCT in AMD and RVO. Further research is needed to improve the utility of BCVA estimation for these diseases.

Deep learning visual field global index prediction with optical coherence tomography parameters in glaucoma patients

Article Open access 25 October 2023

Automated deep learning-based AMD detection and staging in real-world OCT datasets (PINNACLE study report 5)

Article Open access 09 November 2023

Estimation of best corrected visual acuity based on deep neural network

Article Open access 24 October 2022

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Multimodal imaging technologies support the management of ophthalmic diseases, especially in the diagnosis, treatment, and prognostication of retinal diseases. One of the most widely available noninvasive imaging modalities for visualising retinal microstructures is optical coherence tomography (OCT). Many studies of OCT image features have explored the relationship among diseases, prognosis, and clinical significance, including drusen, [1,2,3] pigment epithelial detachment, [4] hyperreflective foci, [5,6,7] and macular oedema. [8, 9] Some of these features are associated with visual function. In particular, macular microstructures guide clinicians to roughly estimate patients’ visual acuity.

Numerous recent studies have reported the possible application of artificial intelligence (AI) to retinal imaging modalities. AI models can detect and delineate the retinal fluid in conditions such as retinal vein occlusion (RVO), age-related macular degeneration (AMD), and diabetic macular oedema (DMO). [10, 11] Moreover, AI can not only detect exudative AMD, [12] but also predict AMD progression. [13, 14] Importantly, an AI cloud was reported to estimate BCVA from OCT imaging in a population of patients with AMD. [15] Some other reports have also estimated and predicted BCVA from OCT images, with two using patients who were enrolled in the HARBOR trial, [16, 17] two using patients who had exudative AMD, and other ones each using patients who had DMO and geographic atrophy. [15, 18,19,20] When baseline OCT was used to predict visual acuity, the accuracy was R² = 0.21. [16] When the 3-month data were also used, the accuracy was R² = 0.70. In another study, the regression model to predict BCVA obtained R² = 0.24 in eyes with AMD. [17] In the other study, [18] the authors used OCT images of treatment-naïve, first-treated eyes of patients with exudative AMD and predicted future visual outcomes. Visual acuity at a distant point was predicted with R² values of 0.80 and 0.70 after injection at 3 and 12 months after baseline, respectively.

In the field of AI research on visual acuity prediction, several gaps needed to be addressed. First, previous reports focused mainly on a single disease entity (such as diabetic retinopathy [DR] and AMD) or specific features (such as drusen and macular oedema). Lack of training with various diseases or disease features limits the applicability of the trained model to other diseases or features. Second, factors associated with the accuracy of the visual acuity prediction were not clarified previously. For example, although changes in lens status may have few discernible effects on OCT images, such changes may significantly affect patients’ visual acuity. Therefore, we trained/developed an AI model with vertical and horizontal OCT images of a variety of retinal diseases to estimate BCVA and investigated the factors associated with its accuracy.

Methods

Design

This retrospective study was approved by the institutional review board of Jichi Medical University (Jichi-CU19-094) and adhered to the tenets of the Declaration of Helsinki. The study procedures followed institutional guidelines, and informed consent was obtained in the form of opt-out on the website of the Department of Ophthalmology of Jichi Medical University. Individuals who declined to join the study were excluded. Where necessary, all patients provided informed consent to the procedures performed as part of their clinical management.

Procedure

This study included 2,700 OCT scans from 756 consecutive eyes of 469 patients. The images were taken between 2014 and 2018 at Jichi Medical University using swept source (SS)-OCT (Atlantis, Triton; Topcon, Tokyo, Japan) as part of a clinical examination for retinal diseases. As a routine examination, a raster scan protocol comprising five lines centred on the fovea was used. Horizontal and vertical SS-OCT grey images centred on the fovea were obtained from each patient (S1 Fig). The assessing ophthalmologist determined the horizontal and vertical B-scan image closest to the centre of the fovea. All OCT images were assessed by a single ophthalmologist (SI) overseen by retinal specialists (H.T. and Y.I.). BCVA was measured by an experienced optometrist as decimal visual acuity on the same day the OCT image was taken. BCVA was converted to the logarithm of the minimum angle of resolution [logMAR] for statistical analysis.

Patients’ medical records were used to collect the following clinical data: disease condition, with eight conditions delineated (AMD, DR, macular hole or epithelial retinal membrane [MH/ERM], RVO, central serous chorioretinopathy [CSC], myopic choroidal neovasculopathy [mCNV], other, and normal OCT); and lens status, characterised as either phakia or pseudophakia.

Deep learning

Ten-fold cross-validation was used to train and test this model. We randomly split the data into 10 folds: 9 folds for training and 1 for testing. No images from the same patient were included in the same fold. Although the validation set doubled as the testing set, their use was combined only in the first training. Moreover, because validation loss did not increase again and there was no overfitting, the number of iterations was not set before overfitting but after the validation loss reached the plateau. We set the training epochs to a sufficient number and the other nine training sets were trained using the same number of epochs without validation.

The original images had dimensions of 992 × 992 pixels, and the data were augmented with a horizontal flip and a random 892 × 892-pixel crop. In our preliminary work, AlexNet, GoogLeNet, and ResNet were tested as baseline CNN models, which demonstrated that GoogLeNet performed best (data not shown). As such we decided to use GoogleNet in this study, and tuned the hyperparameters of the CNN model (Fig. 1). After the grid search, the base learning rate was 1.0 × 10⁻⁷ and the training option ‘SDG’ was set to use stochastic gradient descent with a momentum optimiser. The neural network comprised two units corresponding to the horizontal and vertical OCT images, each comprising 22 layers of a convolutional neural network with an inception module. [21] The size and number of convolution filters were same as those in the original GoogLeNet. The two outputs were combined to generate a single output through a fully connected layer. To regularise the model and prevent overfitting of very deep networks, GoogLeNet utilises auxiliary classifiers, which are used during training to perform classification based on the inputs within the network's mid-section and then add the loss to the total loss of the network. In this study, as in the original GoogLeNet architecture, subnetworks branched from the same two locations of the GoogLeNet network for each of the horizontal and vertical OCT images. The auxiliary loss was calculated by combining the losses from the subnetwork of the horizontal image and that of the vertical image. The auxiliary loss was multiplied by 0.3, as in the original GoogLeNet. A regression layer was included at the end of the network to predict continuous data.

After a per-pixel mean subtraction, the fully randomly initialised neural network was trained. The epoch number was 300 and the learning time was 24 h.

Disease groups

The OCT images were examined and grouped into disease groups, with the images classified according to the patients’ macular structure rather than their diagnosis. Specifically, eyes with no DR were not classified into DR groups, even if that patient had been diagnosed with diabetic mellitus. AMD groups included patients over the age of 50 years whose eyes had a choroidal neovascular membrane (CNV) confirmed by indocyanine green angiography (ICGA), avascular serous pigmented epithelial detachment (PED), soft drusen, or drusenoid PED. DR groups included eyes with DMO, subretinal haemorrhage involving the macular area, and a proliferative membrane with/without traction. MH/ERM groups included eyes with MH, ERM, a vitreomacular traction membrane, pseudo MH, and retinal detachment induced by MH. RVO groups included branch or central RVO. The CSC group included eyes with subretinal fluid involving the macula and diffuse and/or focal leakage on fluorescein angiography ICGA. Eyes with CNV, other maculopathy, active intraocular inflammation, or infection were not classified into CSC groups. The mCNV group included eyes with CNV and pathologic myopia, namely, a refractive error > 6 dioptres, axial length > 26 mm, or staphyloma on OCT. Eyes without CNV and pathologic myopia were classified as ‘other’.

The eyes categorised into the other group included retinitis induced as uveitis (number of eyes; n = 18), reattached retina after surgery (n = 11), retinitis pigmentosa (n = 11), and others (S1 Table).

Performance statistical index

Three indices—the coefficient of determination (R²), root mean square error (RMSE), and mean absolute error (MAE)—were selected to evaluate errors. The calculation formulae were as follows.

$${R}^{2}=1-\frac{\sum_{i}({Ypred, i-Yobs, i)}^{2}}{\sum_{i}{(Yobs, i-\overline{Yobs })}^{2}}$$

$$RMSE=\sqrt{\frac{\sum_{i}{(Yobs,i-Ypred, i)}^{2}}{n}}$$

$$MAE=\frac{\sum_{i}\left|Yobs,i-Ypred,i\right|}{n}$$

Here, Ypred is the estimated BCVA and Yobs is the actual BCVA. Thus, R² can take a negative value, such as when the estimated value is very different from the actual value.

Statistical analyses

Statistical analyses were performed using JMP Pro ver. 15.0.0 (SAS Institute, Cary, NC). A paired t-test was used to evaluate the association between the estimated and actual BCVAs for each disease group and phakia or pseudophakia. Three validity indices—R², RMSE, and MAE—were measured to evaluate the prediction model in terms of each disease group and phakia or pseudophakia.

Univariate regression was used to evaluate the associations among the three validity indices and the number of images, mean BCVA, standard deviation (SD) of BCVA, mean age, and SD of age for each disease group. A multivariable logistic regression model was used to evaluate the association of the three validity indices with variables selected by stepwise variable selection. The model was also used to evaluate the association between the square of the difference between the estimated and actual BCVAs and age, sex, phakia/pseudophakia, and actual BCVA. Statistical significance was defined as p < 0.05.

Results

Patients’ demographic characteristics

The characteristics of the patients in this study are summarised in Table 1. Mean age was 69.6 (SD, 11.9) years, and 249 (53%) were men. The most and least frequent diseases were AMD (n = 167 patients) and mCNV (n = 13 patients). S2 and S3 Tables summarise the characteristics of the images. The most and least frequent images were AMD (n = 1017 images) and DR (n = 73 images). Patients with phakia were younger than those with pseudophakia. There were about twice as many phakic eye images as pseudophakic eye images. The estimated BCVA of each disease group was significantly associated with the actual BCVA (all p < 0.001).

Table 1 Patient characteristics

Full size table

Accuracy

Estimated BCVA was significantly associated with actual BCVA (p < 0.001). Figure 2 shows the distribution of the actual and estimated BCVAs (estimated – actual). R², RMSE, and MAE were 0.512, 0.350, and 0.321, respectively (Table 2). The distribution of the absolute error of the estimation |ε| is illustrated in Fig. 3. Error distributions were non-symmetrical with a long tail. Overall, 89% of all estimates were within an absolute error of 0.5.

Table 2 Difference between phakia and pseudophakia

Full size table

The phakia group had smaller errors compared with the pseudophakia group (Fig. 4), and the errors in the normal OCT and CSC groups were smaller than the average median absolute error |ε| (S3 Fig).

R² of the phakic group was higher compared with the pseudophakic group in all images (Table 2). Accuracy by disease group is shown in S4 Table; R², RMSE, and MAE ranged from − 1.233 to 0.961, from 0.003 to 0.669, and from < 0.001 to 0.446, respectively. R² values for RVO, AMD, and mCNV were 0.961, 0.373, and 0.355, respectively. However, lens status had somewhat different effects on accuracy among the different disease groups (S4 Table). In the AMD group, R² was higher in phakic eyes than in pseudophakic eyes, but was higher in pseudophakic eyes than in phakic eyes in the RVO group. Figure 5 shows representative OCT images together with the estimated BCVA and accuracy to illustrate the characteristics of images with good and poor accuracy.

Factors associated with accuracy

Univariate linear regression analysis revealed that a higher RMSE and MAE were significantly associated with better mean BCVA and higher SD of BCVA (Table 3). Meanwhile, multivariable regression analysis determined that R² was significantly higher with a smaller mean BCVA and higher SD of BCVA (Table 4). Finally, factors significantly associated with the square of the difference between the estimated and actual BCVA were actual BCVA (p < 0.001), phakia/pseudophakia (p < 0.001), age (p = 0.005), and sex (p = 0.021) in the multivariable analysis (Table 5). Actual BCVA (p < 0.001) and phakia/pseudophakia (p < 0.001) were selected via stepwise variable selection. Because the estimation depended on actual BCVA, we divided the full BCVA range into five, using the first values of the 16th, 8th, 4th, and 2nd quantiles of the value as the boundaries, and calculated the median |ε| within each segment. Figure 6 shows that the metrics increased with the values of the index.

Table 3 Association between validity and factors in univariate linear regression

Full size table

Table 4 Association between validity and factors in multivariable linear regression

Full size table

Table 5 Association between factors and the square of the difference between the estimated and actual BCVA in multivariable analysis

Full size table

Validation study

We aimed to validate our model using data from another institution, either with or without standardisation of the OCT image size. The validation study was performed using data from Saitama Medical Center (Saitama, Japan), which included 678 images of AMD, DR, RVO, macular telangiectasia, and uveitis, with a resolution of 1143 × 622 pixels due to the use of a different scanning mode. The scans were cropped to 992 × 992 pixels and the margin was filled with black for standardisation. R², RMSE, and MAE were 0.19, 0.30, and 0.223, respectively. For each disease group, the R² of AMD, RVO, DR, and others was 0.22, 0.20, 0.036, and − 0.59, respectively.

Discussion

The AI model in this study estimated BCVA with an R² of 0.512, RMSE of 0.350, and MAE of 0.321, using OCT images of various retinal diseases. R² was higher in phakic eyes than in pseudophakic eyes in all images. Multivariable regression analysis also revealed that R² was significantly higher with better BCVA and a higher SD of BCVA. Finally, a smaller square of the difference between the estimated and actual BCVA was significantly associated with better actual BCVA and phakic eyes. However, the performance was worse in an external test set, with an R² of 0.19.

Surprisingly, R² was higher in phakic eyes than in pseudophakic eyes in all images. We hypothesised that one of the reasons for the moderate accuracy (R², 0.512) might be the data used, namely, only OCT data. Although some studies reported that SS-OCT could evaluate cataract density [22,23,24], SS-OCT images include little information regarding other ocular conditions, including the condition of the cornea, lens, vitreous humour, and ophthalmic nerve. Strong corneal opacity or opacitas corporis vitreous can blur the resolution of SS-OCT images. There are no data in OCT images on ocular aberrations, including astigmatism and higher-order aberrations. In addition, with slight corneal opacity or opacitas corporis vitreous, images should be relatively clear, even when the ocular condition is sufficient to decrease the actual BCVA. Thus, we hypothesised that a lack of anterior or intermediate segment information might worsen the validity index. This assumption did not hold, possibly because the opacity of intraocular lens or after cataracts would affect BCVA or OCT images in the same way as lens opacity.

Here, R² was significantly higher with better BCVA and a higher SD of BCVA. We can suggest some reasons why a worse BCVA was associated with poor validity. First, we measured BCVA as decimal value and converted it to logMAR units for statistical analysis. This conversion approach characteristically overestimates visual acuity, especially at lower levels [25]. Images of eyes with better BCVA might be measured more accurately, whereas those of eyes with worse BCVA might be overestimated. Second, eyes with worse BCVA may have some factors affecting BCVA, such as the condition of the ophthalmic nerve or a subtle lens opacity, that cannot be assessed using OCT. A recent study evaluated regression-based BCVA estimation using classified training data, showing that one of their AI models achieved a lowest RMSE of 0.028 and a highest R² of 0.654 [26]. They also showed their AI model focused on the optic disc in addition to the area near the macula and blood vessels. Using fundus images and/or optic disc OCT scans as training data in addition to macular OCT images is a possible strategy for improving our AI model in the future.

In the internal test set, the R² for prediction was 0.512, which was relatively high compared with a previous study of the HARBOR trial (R², 0.21). [16] However, the generalisability of an AI algorithm to predict visual acuity has never been explored. Therefore, we validated our model with OCT images from another institution. The results were disappointing. For example, R² was 0.19, which was almost the same as that obtained from our pseudophakia data. Although actual BCVA may be associated with R², one of our present results—the mean actual BCVA of the validation study data was 0.326 ± 0.33—was similar to that of our previous data. There are several possible reasons for this. First, this result might be due to overtraining; however, we selected a hyperparameter with no overfitting. Second, the OCT images used for the external test set were obtained with an OCT machine that was different from that used for training and validation (DRI OCT Triton Plus, Topcon). Although the optics of that system are the same as that of the machine used for development (Atlantis, Topcon), as well as being from the same company, the acquisition protocols and image size are different, which might have caused the relatively poor R². As for the R² values for each disease group, the trend was similar in the validation study; that is, the R² of AMD and RVO were higher than that of DR. This is possibly because eyes with chronic diseases might have acquired paracentral fixation, unlike eyes with acute disease. Therefore, the BCVA measurements will be more dependent on the examiners’ skills and patients’ cooperation in acute diseases and thus highly variable, even for patients with similar fundus findings. Even for the same patient, the repeatability of BCVA measurement is not high, especially for patients with acute severe vision loss. As mentioned earlier, R² was significantly higher with better BCVA. Additionally, our R² values were better in chronic diseases than in acute diseases (R² values from RVO [0.96], AMD [0.37], and mCNV [0.36] were better than those from MH/ERM [0.09], DR [− 0.12], and CSC [− 0.22]), further supporting this idea.

The present study has three main strengths. The first is the high versatility of our BCVA prediction model, which was trained with OCT images of various retinal diseases. The AI predicted BCVA from OCT images even in the presence of various other retinal diseases. Second, we developed the model based on horizontal and vertical OCT images from each patient. Such images are usually taken in the clinical setting, and horizontal and vertical surrounding macular images of the eye provide more information for AI model training. Some recent studies have trained AI models for estimation of BCVA by using colour fundus photograph (CFP) [26, 27]. Since CFP has a wider field of view compared to macular OCT scans, it may provide more information, such as diseases of the optic disc; however, on the contrary, it is intuitively obvious that OCT contains more information about the retina. Finally, a novel aspect of this study is that a linear function was estimated, which might replace image analysis, which contrast to the recent studies which estimated visual acuity as a level classification [26, 27]. Although the validity is thus far poor, it should be improved with further work.

There are also some limitations. One limitation is common to this type of prospective analysis of institutional data. All of the study participants were Japanese. The number of participants was relatively small, particularly in some groups. It would be interesting to see how much the results improve with vertical scans; however, this is beyond the scope of the present study.

In conclusion, this study confirmed that AI (neural network) could estimate BCVA only with OCT images. Further studies are nonetheless warranted to evaluate the broader applicability of this approach.

Data availability

The datasets used and/or analysed in the present study are available from the corresponding author on reasonable request.

Code availability

The code used and/or analysed in the present study are available from the corresponding author on reasonable request.

References

Khanifar AA, Koreishi AF, Izatt JA, Toth CA (2008) Drusen ultrastructure imaging with spectral domain optical coherence tomography in age-related macular degeneration. Ophthalmology 115:1883–1890. https://doi.org/10.1016/j.ophtha.2008.04.041
Article PubMed Google Scholar
Leuschen JN, Schuman SG, Winter KP et al (2013) Spectral-domain optical coherence tomography characteristics of intermediate age-related macular degeneration. Ophthalmology 120:140–150. https://doi.org/10.1016/j.ophtha.2012.07.004
Article PubMed Google Scholar
Spaide RF, Curcio CA (2010) Drusen characterization with multimodal imaging. Retina 30:1441–1454. https://doi.org/10.1097/IAE.0b013e3181ee5ce8
Article PubMed PubMed Central Google Scholar
Querques G, Srour M, Massamba N, Georges A, Ben Moussa N, Rafaeli O, Souied EH (2013) Functional characterization and multimodal imaging of treatment-naive “quiescent” choroidal neovascularization. Invest Ophthalmol Vis Sci 54:6886–6892. https://doi.org/10.1167/iovs.13-11665
Article PubMed Google Scholar
Christenbury JG, Folgar FA, O’Connell RV, Chiu SJ, Farsiu S, Toth CA, Age-related Eye Disease Study 2 Ancillary Spectral Domain Optical Coherence Tomography Study G (2013) Progression of intermediate age-related macular degeneration with proliferation and inner retinal migration of hyperreflective foci. Ophthalmology 120:1038–1045. https://doi.org/10.1016/j.ophtha.2012.10.018
Article PubMed Google Scholar
Folgar FA, Yuan EL, Sevilla MB et al (2016) Drusen Volume and Retinal Pigment Epithelium Abnormal Thinning Volume Predict 2-Year Progression of Age-Related Macular Degeneration. Ophthalmology 123:39-50 e31. https://doi.org/10.1016/j.ophtha.2015.09.016
Article PubMed Google Scholar
Wu Z, Luu CD, Ayton LN et al (2014) Optical coherence tomography-defined changes preceding the development of drusen-associated atrophy in age-related macular degeneration. Ophthalmology 121:2415–2422. https://doi.org/10.1016/j.ophtha.2014.06.034
Article PubMed Google Scholar
Markomichelakis NN, Halkiadakis I, Pantelia E, Peponis V, Patelis A, Theodossiadis P, Theodossiadis G (2004) Patterns of macular edema in patients with uveitis: qualitative and quantitative assessment using optical coherence tomography. Ophthalmology 111:946–953. https://doi.org/10.1016/j.ophtha.2003.08.037
Article PubMed Google Scholar
Yeung L, Lima VC, Garcia P, Landa G, Rosen RB (2009) Correlation between spectral domain optical coherence tomography findings and fluorescein angiography patterns in diabetic macular edema. Ophthalmology 116:1158–1167. https://doi.org/10.1016/j.ophtha.2008.12.063
Article PubMed Google Scholar
Schlegl T, Waldstein SM, Bogunovic H et al (2018) Fully Automated Detection and Quantification of Macular Fluid in OCT Using Deep Learning. Ophthalmology 125:549–558. https://doi.org/10.1016/j.ophtha.2017.10.031
Article PubMed Google Scholar
Seebock P, Orlando JI, Schlegl T et al (2020) Exploiting Epistemic Uncertainty of Anatomy Segmentation for Anomaly Detection in Retinal OCT. IEEE Trans Med Imaging 39:87–98. https://doi.org/10.1109/tmi.2019.2919951
Article PubMed Google Scholar
Treder M, Lauermann JL, Eter N (2018) Automated detection of exudative age-related macular degeneration in spectral domain optical coherence tomography using deep learning. Graefes Arch Clin Exp Ophthalmol 256:259–265. https://doi.org/10.1007/s00417-017-3850-3
Article CAS PubMed Google Scholar
Prahs P, Radeck V, Mayer C, Cvetkov Y, Cvetkova N, Helbig H, Marker D (2018) OCT-based deep learning algorithm for the evaluation of treatment indication with anti-vascular endothelial growth factor medications. Graefes Arch Clin Exp Ophthalmol 256:91–98. https://doi.org/10.1007/s00417-017-3839-y
Article PubMed Google Scholar
Russakoff DB, Lamin A, Oakley JD, Dubis AM, Sivaprasad S (2019) Deep Learning for Prediction of AMD Progression: A Pilot Study. Invest Ophthalmol Vis Sci 60:712–722. https://doi.org/10.1167/iovs.18-25325
Article PubMed Google Scholar
Aslam TM, Zaki HR, Mahmood S, Ali ZC, Ahmad NA, Thorell MR, Balaskas K (2018) Use of a Neural Net to Model the Impact of Optical Coherence Tomography Abnormalities on Vision in Age-related Macular Degeneration. Am J Ophthalmol 185:94–100. https://doi.org/10.1016/j.ajo.2017.10.015
Article PubMed Google Scholar
Schmidt-Erfurth U, Bogunovic H, Sadeghipour A et al (2018) Machine Learning to Analyze the Prognostic Value of Current Imaging Biomarkers in Neovascular Age-Related Macular Degeneration. Ophthalmol Retina 2:24–30. https://doi.org/10.1016/j.oret.2017.03.015
Article PubMed Google Scholar
Kawczynski MG, Bengtsson T, Dai J, Hopkins JJ, Gao SS, Willis JR (2020) Development of Deep Learning Models to Predict Best-Corrected Visual Acuity from Optical Coherence Tomography. Transl Vis Sci Technol 9:51. https://doi.org/10.1167/tvst.9.2.51
Article PubMed PubMed Central Google Scholar
Fu DJ, Faes L, Wagner SK et al (2021) Predicting Incremental and Future Visual Change in Neovascular Age-Related Macular Degeneration Using Deep Learning. Ophthalmol Retina 5:1074–1084. https://doi.org/10.1016/j.oret.2021.01.009
Article PubMed Google Scholar
Lin TY, Chen HR, Huang HY et al (2022) Deep learning to infer visual acuity from optical coherence tomography in diabetic macular edema. Front Med (Lausanne) 9:1008950. https://doi.org/10.3389/fmed.2022.1008950
Article PubMed Google Scholar
Balaskas K, Glinton S, Keenan TDL et al (2022) Prediction of visual function from automatically quantified optical coherence tomography biomarkers in patients with geographic atrophy using machine learning. Sci Rep 12:15565. https://doi.org/10.1038/s41598-022-19413-z
Article CAS PubMed PubMed Central Google Scholar
Szegedy C, Wei L, Yangqing J et al (2015) Going deeper with convolutions. In: 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). IEEE, Boston, MA, pp 1–9. https://doi.org/10.48550/arXiv.1409.4842
Panthier C, Burgos J, Rouger H, Saad A, Gatinel D (2017) New objective lens density quantification method using swept-source optical coherence tomography technology: Comparison with existing methods. J Cataract Refract Surg 43:1575–1581. https://doi.org/10.1016/j.jcrs.2017.09.028
Article PubMed Google Scholar
Brás JEG, Sickenberger W, Hirnschall N, Findl O (2018) Cataract quantification using swept-source optical coherence tomography. J Cataract Refract Surg 44:1478–1481. https://doi.org/10.1016/j.jcrs.2018.08.009
Article PubMed Google Scholar
Chen D, Li Z, Huang J, Yu L, Liu S, Zhao YE (2019) Lens nuclear opacity quantitation with long-range swept-source optical coherence tomography: correlation to LOCS III and a Scheimpflug imaging-based grading system. Br J Ophthalmol 103:1048–1053. https://doi.org/10.1136/bjophthalmol-2018-312661
Article PubMed Google Scholar
Mataftsi A, Koutsimpogeorgos D, Brazitikos P, Ziakas N, Haidich AB (2019) Is conversion of decimal visual acuity measurements to logMAR values reliable? Graefes Arch Clin Exp Ophthalmol 257:1513–1517. https://doi.org/10.1007/s00417-019-04344-9
Article PubMed Google Scholar
Lee W, Kim JH, Lee S, Kim K, Kang TS, Han YS (2022) Estimation of best corrected visual acuity based on deep neural network. Sci Rep 12:17808. https://doi.org/10.1038/s41598-022-22586-2
Article CAS PubMed PubMed Central Google Scholar
Kim JH, Jo E, Ryu S et al (2022) A Deep Learning Ensemble Method to Visual Acuity Measurement Using Fundus Images. Appl Sci 12. https://doi.org/10.3390/app12063190

Download references

Acknowledgements

We thank Shinichi Sakamoto, Ryota Takahashi, Kosuke Nagaoka, and Mayumi Chiba for their participation in this study.

Funding

No funds, grants, or other support was received.

Author information

Authors and Affiliations

Department of Ophthalmology, Jichi Medical University, 3311-1 Yakushiji, Shimotsuke-shi, Tochigi, 329-0431, Japan
Satoru Inoda, Hidenori Takahashi, Yusuke Arai, Hironobu Tampo & Hidetoshi Kawashima
Department of Ophthalmology, Mie University Graduate School of Medicine, Tsu, Japan
Yoshitsugu Matsui
Department of Ophthalmology and Micro-Technology, Yokohama City University, Yokohama, Japan
Yasuo Yanagi
Retina Research Group, Duke-NUS Medical School, Singapore Eye Research Institute, Singapore Eye-ACP, Singapore, Singapore
Yasuo Yanagi

Authors

Satoru Inoda
View author publications
You can also search for this author in PubMed Google Scholar
Hidenori Takahashi
View author publications
You can also search for this author in PubMed Google Scholar
Yusuke Arai
View author publications
You can also search for this author in PubMed Google Scholar
Hironobu Tampo
View author publications
You can also search for this author in PubMed Google Scholar
Yoshitsugu Matsui
View author publications
You can also search for this author in PubMed Google Scholar
Hidetoshi Kawashima
View author publications
You can also search for this author in PubMed Google Scholar
Yasuo Yanagi
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

All authors contributed to the study conception and design. Material preparation, data collection, and analysis were performed by SI, HT, and YY. The first draft of the manuscript was written by SI. All authors provided input on previous versions of the manuscript and have read and approved the final manuscript.

Corresponding author

Correspondence to Hidenori Takahashi.

Ethics declarations

Ethics approval

This retrospective study was approved by the institutional review board of Jichi Medical University (Jichi-CU19-094) and adhered to the tenets of the Declaration of Helsinki. The study procedures followed institutional guidelines.

Consent to participate

Informed consent was obtained in the form of opt-out on the website of the Department of Ophthalmology of Jichi Medical University.

Financial interests

SI, H Tampo, and YA have no relevant financial or non-financial interests to disclose.

H Takahashi is a consultant to Kowa Pharmaceutical, Novartis Pharmaceutical, Bayer Pharmaceutical, Senju Pharmaceutical, Pfizer Pharmaceutica, and Santen Pharmaceutical, and receives salary from DeepEyeVision Inc.

YM is a consultant to Alcon Japan, AMO Japan, Novartis Pharmaceuticals, Santen Pharmaceutical, Bayer Pharmaceutical and Senju Pharmaceutical.

HK is a consultant to Kowa Pharmaceutical, Novartis Pharmaceuticals, Santen Pharmaceutical, Mitsubishi-Tanabe Pharmaceutical, Senju Pharmaceutical, and the Tochigi Prefectural Ophthalmologists Association.

YY is a consultant to Novartis Pharmaceuticals, Bayer Yakuhin, Santen Pharmaceuticals and Senju Pharmaceutical.

Non-financial interests

None.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Below is the link to the electronic supplementary material.

Supplementary file1 (XLSX 10 KB)

Supplementary file2 (XLSX 11 KB)

Supplementary file3 (XLSX 10 KB)

Supplementary file4 (XLSX 11 KB)

417_2023_6054_MOESM5_ESM.pdf

Supplementary file5 S1 Fig. SS-OCT image. Horizontal and vertical B-scan SS-OCT grey images centred on the fovea were obtained. S2 Fig. Errors of each group. * p<0.001. Median difference between the actual BCVA and estimated BCVA (ε). The errors in the normal OCT and CSC groups were smaller than the median absolute error |ε|. AMD, age-related macular degeneration; BCVA, best-corrected visual acuity; CSC, central serous chorioretinopathy; DR, diabetic retinopathy; mCNV, myopic choroidal neovascularisation; MH/ERM, macular hole or epithelial retinal membrane; OCT, optical coherence tomography; RVO, retinal vein occlusion. (PDF 1850 KB)

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Inoda, S., Takahashi, H., Arai, Y. et al. An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases. Graefes Arch Clin Exp Ophthalmol 261, 2775–2785 (2023). https://doi.org/10.1007/s00417-023-06054-9

Download citation

Received: 08 September 2022
Revised: 26 February 2023
Accepted: 30 March 2023
Published: 11 May 2023
Issue Date: October 2023
DOI: https://doi.org/10.1007/s00417-023-06054-9

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

An AI model to estimate visual acuity based solely on cross-sectional OCT imaging of various diseases

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Introduction

Methods

Design

Procedure

Deep learning

Disease groups

Performance statistical index

Statistical analyses

Results

Patients’ demographic characteristics

Accuracy

Factors associated with accuracy

Validation study

Discussion

Data availability

Code availability

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethics approval

Consent to participate

Financial interests

Non-financial interests

Additional information

Publisher's note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation