Abstract
This retrospective study examined the effect of the size of training data on the accuracy of machine learning-assisted SRK/T power calculation. Clinical records of 4800 eyes of 4800 Japanese patients with intraocular lenses (IOLs) were reviewed. A support vector regressor (SVR) was used for refining the SRK/T formula, and dataset sizes for training and evaluation were reduced from full to 1/64. The prediction errors from the postoperative refractions were calculated, and the proportion within ± 0.25 D, ± 0.50 D, and ± 1.00 D of errors were compared with those using full data. The influence of the difference in A-constant was also evaluated. Prediction errors within ± 0.50 D in the use of full data were obtained with the dataset of ≥ 150 eyes (P = 0.016), whereas the datasets of ≥ 300 eyes were required for the error within ± 0.25 D (P < 0.030). The prediction errors did not alter with the A-constant values among IOLs with open-loop haptics, except for IOLs with plated haptics. In conclusion, the accuracy of SVR-assisted SRK/T could be achieved with the training dataset of ≥ 150 eyes for the Japanese population, and the calculation was versatile for any open-looped IOLs.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.Introduction
In the use of premium intraocular lenses (IOLs) for astigmatism and presbyopia corrections, accurate IOL power calculation for postoperative emmetropia is necessary for IOL functions. Although postoperative refractive errors within ± 1.0 D could be obtained in 93% of eyes using third- to fourth-generation calculation formulas such as the SRK/T and Haigis formula1, accuracy of > 90% within the absolute errors of 0.5 D is desired for patients undergoing premium IOL implantation. Thus, advanced calculation methods, such as the Barrett Universal II (BUII)2, Hill-radial basis function (Hill-RBF)3, and Kane formula4, have been used, and several publications have reported their superiority5,6,7. New-generation formulas enable higher accuracy by adding more biometric measurements such as lens thickness and corneal diameter, utilizing a complex model of ocular geometry, and utilizing machine learning with a large dataset.
As most of the advanced calculations are based on the biometry of Caucasian eyes, performances could be inherently altered by patients’ ethnicity, race, and region. The alternations for patient groups of a site have been adjusted with the constants of third- to fourth-generation formulas, such as the A-constant. However, such optimization is not available for advanced calculations8. Recently, we demonstrated that the use of machine learning with the SRK/T formula effectively improved the power calculation accuracy for a patient group9. Predicted refractions derived from the SRK/T formula were adjusted with support vector regression (SVR) machine learning. The SVR nonlinearly provided a regression equation in which the total errors of the training data outside a certain margin from the equation were minimized10 and were suitable for IOL power calculation11. With training data of 1211 eyes, the prediction errors were less than that with BUII for patients in the Kyushu Island of Japan9. Adaptation was achieved using a small size of training data by incorporating SRK/T; however, how much training data are required for a specific accuracy is not certain. Thus, this retrospective study aimed to assess the effect of training data size on the accuracy of IOL power calculation and evaluate the influence of the difference in A-constants.
Methods
Participates
This retrospective study was approved by the Institutional Ethics Committee of Tsukazaki Hospital (Approval No. 181011) and adhered to the tenets of the Declaration of Helsinki. For all participants, the use of clinical records related to cataract surgery was approved as stated in the informed consent obtained before surgery. Clinical records of consecutive patients who underwent cataract surgery with IOL implantation between September 2017 and April 2021 were reviewed. The inclusion criteria were as follows: no history of refractive surgery, postoperative corrected distance visual acuity (CDVA) of 16/20 in Snellen or better, and optimized constants of implanted IOLs. For bilateral implantation, an eye with regular and mild astigmatism was selected for the analysis.
Preoperative axial length (AXL), corneal radius (CR), anterior chamber depth (ACD), lens thickness (LT), and white-to-white distance (WTW) were measured using a swept-source biometer IOLMaster 700 (Carl Zeiss, Oberkochen, Germany). IOL power was determined using the SRK/T formula, and all IOLs were implanted in the capsule without complications. Three months after surgery, the manifest refraction spherical equivalent (MRSE) was measured during the examination for CDVA.
Machine learning-assisted power calculation
SVR was used to enhance the accuracy of the SRK/T formula9. Initially, predicted postoperative refractions were obtained using the SRK/T with biometry measurements of AXL and CR and an optimized A-constant. The predicted postoperative refractions were refined for the patient group with additional inputs of AXL, CR, ACD, LT, and WTW. The SVR with an RBF kernel was trained by using the “scikit-learn” library (https://scikit-learn.org/stable/modules/svm.html#svm-regression) in Python 3. Hyperparameters such as the C-constant and shape parameter γ of the kernel function were tuned using a grid search for avoiding overfitting11.
To evaluate the effect of training data sizes on calculation accuracy, a dataset of the participants was randomly divided into five groups. Initially, four groups were used for SVR training to refine the accuracy of the predicted postoperative refractions, and the remaining group was used to evaluate the trained calculator. As shown in Fig. 1, the groups used for training were rearranged four times to obtain evaluation results for all data. Then, size of the dataset was reduced by half and divided into five groups, and training and evaluation were conducted similarly. The dataset had been divided by two until the size was 1/64 of the original size. When the original data size was 4800 eyes, training and evaluation were conducted with datasets of 4800, 2400, 1200, 600, 300, 150, and 75 eyes.
Analysis
To assess the accuracy for each training data size from the SVR, prediction errors of the predicted postoperative refractions from MRSE were obtained, and its means and standard deviations (SDs) were calculated. The median of the absolute prediction error (MedAE) was also obtained. Changes in the mean prediction errors with the dataset sizes were examined using the analysis of variance (ANOVA), followed by Holm’s multiple comparisons in the presence of a significant change. The proportion of eyes within prediction errors ± 0.25 D, ± 0.50 D, and ± 1.00 D was calculated, and differences from the use of full data were examined using the Chi-squared test.
The influence of eyes with long AXL (> 26.0 mm) was also compared with those of eyes with normal AXL (between 22.0 and 26.0 mm). Owing to the limited sample size (178 eyes), eyes with short AXL (< 22.0 mm) were not analyzed. The prediction errors were compared using a t-test, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors were compared using the Chi-squared test.
To investigate whether the calculator would accommodate various IOL models, the influence of A-constants on prediction errors was also evaluated. Prediction errors were compared between four groups according to the ranges of A-constants, such as ≤ 119.0, 119.0–119.2, 119.2–119.4, and 119.4–119.6, using ANOVA following the Tukey multiple comparison. P < 0.05 was considered a statistically significant difference.
Ethics approval and consent to participate
This retrospective study was approved by the Institutional Ethics Committee of Tsukazaki Hospital (Approval No. 181011) and adhered to the tenets of the Declaration of Helsinki. For all participants, the use of clinical records related to cataract surgery was approved as stated in the informed consent obtained before surgery.
Consent to publish
Name and other personally identifiable information were removed from all text/figures/tables/images.
Results
Clinical records of 4800 eyes from 4800 eligible patients were available. The mean age of the patients was 71.5 (SD 8.4) years, and there were 2195 men and 2605 women. The preoperative mean AXL, CR, and ACD were 24.0 (SD 1.5; range 20.5–30.5) mm, 7.63 (SD 0.26; range 6.72–8.54) mm, and 3.11 (SD 0.40; range 1.75–4.62) mm, respectively. The LT and WTW were 4.53 (SD 0.46) mm and 11.7 (SD 0.4) mm, respectively. The implanted IOLs and A-constants used are listed in Table 1. The power of the implanted IOLs ranged from 5.0 to 30.0 D, and the mean power was 19.4 (SD 4.0) D for targeting refractions between − 7.42 D and 1.13 D (mean − 0.20 D). The mean postoperative MRSE was − 0.18 (SD 0.90) D, and the CDVA was − 0.11 (SD 0.08) logMAR.
Table 2 shows the mean prediction errors, MedAE, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors in the use of SVR-assisted calculation for seven dataset sizes. The mean prediction errors did not change with the data size (P = 1.00, ANOVA), whereas the SD values increase compared with the overall data of 4800 when the data size were ≤ 300 (P < 0.027, F-test). Figure 2 shows the change in proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors with the dataset size. Compared with the results using overall data, the proportions within ± 0.50 D error for the dataset of 75 eyes were significantly low (P = 0.016, Chi-squared test). For errors within ± 0.25 D, the use of datasets of 75 and 150 eyes resulted in a lower proportion (P = 0.014 and 0.030, respectively). In comparison with the results of SRK/T only (N = 4800 eyes), the proportion within ± 0.50 D error was higher when the dataset size was ≥ 150, whereas it was lower for the size of 75 (P < 0.001).
The influence of long eyes (AXL > 26.0 mm) was evaluated in comparison with normal eyes (AXL of 22.0–26.0 mm). Table 3 lists refractive errors for long and normal eyes. In the mean prediction errors, no differences were found for all dataset sizes (P > 0.19, t-test). Within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors, the proportions in long eyes were significantly less for a dataset size of 75 eyes (P < 0.003).
Changes in the prediction error with the A-constant used were examined. In this study, 603, 1109, 1442, and 1646 eyes had A-constants of implanted IOLs of ≤ 119.0, 119.0–119.2, 119.2–119.4, and 119.4–119.6, respectively. As seen in Table 1, the A-constants of ≤ 119.0 consisted of only a single type of IOLs (LS-313 MF15 and LS-313 MF15T) of hydrophilic acrylic material with plated haptics, whereas other IOLs had open-loop haptics with various materials. Table 4 shows the mean prediction errors, MedAE, and proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors. The mean prediction error for A-constants of ≤ 119.0 significantly shifted to hyperopia compared with A-constants of 119.2–119.4 (P = 0.0041, Tukey multiple comparisons), whereas no change was observed among IOLs of A-constants of 119.0–119.6.
Discussion
The use of SVR with the SRK/T formula improved the accuracy of IOL power calculations, and the accuracy did not degrade when the dataset size for SVR training was ≥ 150 within ± 0.50 D errors. The calculator was versatile for any IOLs with an open loop. In the analysis by Aristodemou et al. using data from 8180 eyes and conventional statistical techniques, data from 243 eyes would be required to optimize each A-constant, and the accuracy increases with the sample size12. In the current results, the accuracy remained until the dataset size of 300. This superior performance with a small sample size would result from the use of nonlinear SVR. In addition, the refining of SRK/T outputs accommodated the IOL with A-constants of 119.0–119.6. Previous assessments of machine-learning power calculations used multiple types of IOLs for trainings11,13; however, the difference in IOLs were not examined. Our results indicated that the calculator accommodated most of the one-piece hydrophobic acrylic IOLs with open haptics.
While the mean prediction errors and MedAE did not change with the dataset size, the variance increases when the size was ≤ 150. As a result, the accuracy within ± 0.25 D errors was lower when the dataset size was ≤ 150. For attaining high accuracy, data of ≥ 300 eyes would be preferred. Thanks to the accommodation of multiple IOL types for training, such dataset size would be acceptable for optimization for a patient group at each site or surgeon.
In the comparison between long and normal eyes, significant differences were found in the use of 75-eye dataset. Similarly, the use of small datasets results in lower performance in the proportion within ± 0.5 D and ± 0.25 D errors. One of the factors would be limited coverage of datasets; thus, accommodating eyes with long or short AXL and minor IOL design would be difficult. In the current analysis, ≥ 150 eyes were the least recommended for Japanese patients in the territory of the site. To provide favorable postoperative outcomes, collecting data from patients within each territory would be better.
In the comparison of A-constants, only a particular IOL type showed lower outcomes. This IOL was extended depth-of-focus, made of hydrophilic acrylic material, and equipped with plated haptics. Compared with other IOL types of one-piece and open-loop haptics, the mean prediction errors were significantly and slightly shifted to hyperopic. As the shifting of the IOLs posteriorly resulted in hyperopic errors14, bending of plated haptics due to capsule contraction would induce this prediction error. Further investigation is required. Except for a particular IOL model, the current machine learning-assisted power calculation improved the accuracy for the A-constants in the range of 119.0–119.6, whereas the training dataset insisted on data with multiple IOL models. This finding would be attributed to the use of SRK/T outputs and optimized A-constants; thus, optimized IOL power calculation for our patient group with a limited training dataset would be beneficial. Further investigation is necessary to verify this speculation.
This study has some limitations. First, owing to the retrospective design, the topographic data of the cornea was not measured. Refractive powers of the cornea were obtained from the powers of the anterior (keratometric) and posterior surfaces and corneal thickness. Thus, the influence of the posterior cornea could not be evaluated. Further evaluation with the use of a rotational Scheimpflug camera or optical coherence tomography15 is necessary for more accurate power calculation. In addition, the influence of the asphericity of the cornea16 should be examined. Moreover, multiple IOLs are available for training and evaluation. As per the guideline presented8, an IOL power calculation was evaluated for a single IOL model. In the previous assessment of the same calculation with 1611 eyes with SN60WF alone, the mean prediction error was 0.01 (SD, 0.38) D, and the proportions within ± 0.25 D, ± 0.50 D, and ± 1.00 D errors were 54.4%, 83.5%, and 98.5%, respectively9, which were slightly better than the current results. As expected, a higher accuracy would be obtained by selecting the IOL type routinely used. In other cases, the range of the dataset was determined by the biometry of limited patients within the territory. Hence, there would be patients who would be out of the range of the dataset used for the training. Ideally, a dataset includes a heterogeneous cohort of patients as much as possible; however, this is not practical. However, indicating the minimum requirement for a clinical situation would be important. Finally, implementing the proposed calculation in clinical practice is not easy, since the calculator works in Python 3. To examine the effectiveness of the proposed calculator in other sites, an environment in which a user-friendly calculator can be used through the web is warranted.
Conclusions
This study using data from 4800 eyes revealed that the accuracy of SVR-assisted SRK/T power calculation could be achieved with the training dataset of ≥ 150 within ± 0.50 D errors for the Japanese population. The calculation was versatile for any open-looped IOL models.
Data availability
The datasets used and/or analyzed during the current study are available from the corresponding author upon reasonable request.
References
Behndig, A. et al. Aiming for emmetropia after cataract surgery: Swedish National Cataract Register study. J. Cataract Refract. Surg. 38, 1181–1186 (2012).
Barrett, G. D. An improved universal theoretical formula for intraocular lens power prediction. J. Cataract Refract. Surg. 19, 713–720 (1993).
Hill, W. E. Hill-RBF (Radial Basis Function) calculator version 3.0. https://rbfcalculator.com (2016).
Connell, B. J. & Kane, J. X. Comparison of the Kane formula with existing formulas for intraocular lens power selection. BMJ Open Ophthalmol. 4, e000251 (2019).
Roberts, T. V., Hodge, C., Sutton, G. & Lawless, M. Contributors to the Vision Eye Institute IOL outcomes registry. Comparison of Hill-radial basis function, Barrett Universal and current third generation formulas for the calculation of intraocular lens power during cataract surgery. Clin. Exp. Ophthalmol. 46, 240–246 (2018).
Melles, R. B., Holladay, J. T. & Chang, W. J. Accuracy of intraocular lens calculation formulas. Ophthalmology 125, 169–178 (2018).
Darcy, K., Gunn, D., Tavassoli, S., Sparrow, J. & Kane, J. X. Assessment of the accuracy of new and updated intraocular lens power calculation formulas in 10930 eyes from the UK National Health Service. J. Cataract Refract. Surg. 46, 2–7 (2020).
Hoffer, K. J. & Savini, G. Update on intraocular lens power calculation study protocols: The better way to design and report clinical trials. Ophthalmology 128, e115–e120 (2021).
Mori, Y. et al. Machine learning adaptation of intraocular lens power calculation for a patient group. Eye Vis. (Lond.) 8, 42. https://doi.org/10.1186/s40662-021-00265-z (2021).
Drucker, H., Burges, C., Kaufman, L., Smola, A. & Vapnik, V. Support vector regression machines. Adv. Neural Inf. Process. Syst. 9, 155–161 (1996).
Yamauchi, T., Tabuchi, H., Takase, K. & Masumoto, H. Use of a machine learning method in predicting refraction after cataract surgery. J. Clin. Med. 10, 1103. https://doi.org/10.3390/jcm10051103 (2021).
Aristodemou, P., Knox Cartwright, N. E., Sparrow, J. M. & Johnston, R. L. Intraocular lens formula constant optimization and partial coherence interferometry biometry: Refractive outcomes in 8108 eyes after cataract surgery. J. Cataract Refract. Surg. 37, 50–62 (2011).
Carmona González, D. & Palomino Bautista, C. Accuracy of a new intraocular lens power calculation method based on artificial intelligence. Eye (London) 35, 517–522 (2021).
Miyata, K., Kataoka, Y., Matsunaga, J., Honbo, M. & Minami, K. Prospective comparison of one-piece and three-piece tecnis aspheric intraocular lenses: 1-year stability and its effect on visual function. Curr. Eye Res. 40, 930–935 (2015).
Swartz, T., Marten, L. & Wang, M. Measuring the cornea: The latest developments in corneal topography. Curr. Opin. Ophthalmol. 18, 325–333 (2007).
Savini, G., Hoffer, K. J., Barboni, P., Schiano Lomoriello, D. & Ducoli, P. Corneal asphericity and IOL power calculation in eyes with aspherical IOLs. J. Refract. Surg. 33, 476–481 (2017).
Acknowledgements
Dr. Keiichiro Minami, Ph.D. (KK. Evidence Slyme) helps in medical writing and preparation of this article.
Author information
Authors and Affiliations
Contributions
H.T. and T.Y. wrote the main manuscript text. H.T and T.Y. designed the study. H.T. created the database. T.Y. and M.T. programmed the model. T.S. and K.T. collected data.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Tabuchi, H., Yamauchi, T., Shojo, T. et al. Training data size and predication errors in the use of machine-learning assisted intraocular lens power calculation. Sci Rep 13, 11348 (2023). https://doi.org/10.1038/s41598-023-38616-6
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-023-38616-6
- Springer Nature Limited