Introduction

Due to the increasing number of refractive errors, more and more myopia patients are now facing cataract surgery with aging. Eyes with the axial length (AL) longer than 24.5 mm usually come with axial myopia, which are also defined as moderately long or long eyes [1,2,3]. It is challenging for the surgeon to select appropriate intraocular lens (IOL) power calculation formulae for axial myopic eyes, especially for the highly axial myopic eyes [4, 5]. Three main sources of errors in IOL power calculation of axial myopic eyes are the AL measurement error, anterior chamber depth (ACD) measurement error and effective lens position (ELP) error [6]. With the introduction of optical biometry, the accuracy of AL and ACD measurement has been significantly improved. Now, the main source of predictive refraction error of IOL power calculation in axial myopic eyes is ELP error which is determined by IOL formulae [6].

IOL formulae have been proposed more than 40 years to calculate IOL power after cataract surgery [5]. Over the past few decades, the third- or fourth-generation formulae (traditional formulae) still were the most common method used in IOL power calculation [7]. The third-generation formulae, including SRK/T, Hoffer Q, Holladay I, predict the postoperative ELP through the AL and corneal curvature. Earlier studies have shown that, in axial myopic eyes, SRK/T might be the most accurate formula for IOL power calculation [8]. However, some studies also suggested that the third-generation formula has no significant difference in calculating IOL power in axial myopic eyes [9, 10]. After that, the fourth-generation formulae, represented by Haigis and Holladay II, introduce other parameters, such as ACD, lens thickness and white-to-white distance, to estimate postoperative ELP and calculate IOL power. Several studies showed Haigis had a better outcome in calculating IOL power for axial myopic eyes than SRK/T, Holladay I, Holladay II and Hoffer Q formulae [11, 12]. However, consistent hyperopic errors have previously been reported in axial myopic eyes using traditional IOL power calculation formulae with ALs measured by A-scan, B-scan or optical biometry [13].

Recently, the fifth-generation formula has become available for commercial use and its performance showed promise in previous study [14]. The fifth-generation formula is also called the new-generation formula, including the Barrett Universal II (BUII), Olsen and the other new formulae [15]. Both BUII and Olsen are based on ray-tracing method and also have been shown to be a good consistent and have a great popularity in calculating IOL power for axial myopic eyes [16]. In addition, the other formulae based on artificial intelligence (AI) were also introduced to improve the accuracy of IOL power calculation, such as Kane, Hill Radial Basis Function Calculator (Hill-RBF), Ladas Super Formula (LSF) and Pearl DGS [15]. Moreover, Emmetropia Verifying Optical (EVO), T2, VRF and Naeser formula also showed acceptable accuracy in IOL power calculation [15]. With the increasing development of the new-generation IOL formulae, the predictive refraction error after cataract surgery has steadily decreased [15, 17]. Yet there is still considerably doubt about which formula provides the most accurate predictive refraction in axial myopic eyes. The purpose of this meta-analysis is to compare the accuracy of the new-generation formulae with traditional formulae and provide valuable clinical guidelines for choosing appropriate IOL power calculation formulae in axial myopic eyes.

Patients and methods

Study inclusion and exclusion criteria

Inclusion criteria for studies were: (1) eyes with AL longer than 24.5 mm; (2) uneventful phacoemulsification with in-bag IOL implantation; (3) at least two types of the following IOL power calculation formulae have been compared: SRK/T, Hoffer Q, Holladay I, Holladay II, Haigis, BUII, Olsen, T2, VRF, EVO, Kane, Hill-RBF, LSF; (4) biometry measurements were measured by optical biometers; (5) IOL constants were optimized; (6) postoperative refraction was performed at least two weeks for small incision surgery and 1-piece IOL implantation, more periods are needed for the others. Exclusion criteria for studies were: (1) eyes with AL shorter than 24.5 mm; (2) eyes had surgical complications or refractive surgery; (3) the percentages of eyes with refractive prediction error (PE) in ± 0.5D and ± 1.0D were unavailable. The other new formulae, Naeser 2, Panacea and Pearl-DGS, were eliminated due to the limited number of study.

Literature search

Two independent authors (H.Y.L and Y.L) searched the following databases, including PubMed, Web of Science, EMBASE and Cochrane Library. The following search terms were used in PubMed: ("lenses, intraocular"[MeSH Terms] OR "intraocular lenses"[Title/Abstract] OR "lens intraocular"[Title/Abstract] OR "intraocular lens"[Title/Abstract] OR "IOL"[Title/Abstract] OR "IOLs"[Title/Abstract]) AND ("calculat*"[Title/Abstract] OR "formula*"[Title/Abstract]) AND ("myopi*"[Title/Abstract] OR "long eye"[Title/Abstract] OR "long axial length"[Title/Abstract] OR "long eyes"[Title/Abstract] OR "long AL"[Title/Abstract] OR "long ALs"[Title/Abstract]) AND ("cohort studies"[MeSH Terms] OR "case–control studies"[MeSH Terms] OR "comparative study"[Publication Type] OR "risk factors"[MeSH Terms] OR "cohort"[Text Word] OR "compared"[Text Word] OR "groups"[Text Word] OR "case control"[Text Word] OR "multivariate"[Text Word]) AND ("last 10 years"[PDat]). After excluding the duplicate, all possible studies were reviewed ignoring the main outcomes or languages. The two authors separately evaluated the titles and abstracts and performed a manual search by searching the reference list of all the eligible studies.

Data extraction and quality assessment

The new-generation IOL power calculation formulae, including BUII, Olsen, T2, VRF, EVO, Kane, Hill-RBF and LSF, were compared with the traditional formulae including SRK/T, Hoffer Q, Holladay I, Holladay II, Haigis. PE was defined as the difference between the predictive spherical equivalent (SE) and the actually postoperative SE. The primary outcomes were the percentages of eyes with PE in ± 0.5D and ± 1.0D. The higher of the percentages, the better of the IOL power formulae. The two authors (H.Y.L and Y.L) independently extracted the data and compared the results. Discrepancies were resolved by another author (Z.Y). A modified checklist adapted from the QUADAS-2 tool was used to assess the quality of the evidence [18, 19]. Study characteristics extracted from the eligible studies were the first author, publication year, sample size, axial length, following-up period, postoperative refraction, IOL power formulae and the percentages of eyes with PE in ± 0.5D and ± 1.0D.

Statistical analysis

Pooled estimates of the odds ratio (OR) were calculated with fixed-effects model when comparing the percentage of eyes with a PE in ± 0.5D and ± 1.0D of each formula. And the results were described in forest plots, with lines representing the estimated values of different studies and their confidence intervals, and the boxes graphically representing the weight assigned to each study in calculating the combined estimator of a given outcome. Substantial heterogeneity, caused by the differences across studies rather than sampling error, was defined as I2 value greater than 50%, and the P value for heterogeneity was less than 0.10. The random-effect model was used when heterogeneity was found. Sensitivity analysis and subgroup analysis were used to assess the change in overall effect when the I2 value was greater than 50%. Funnel plots were used to evaluate publication bias and small-study effect. Review Manager was used to data pooling (version 5.3, Cochrane Collaboration, Oxford, UK). A possibility less than 0.05 was considered to be statistically significant.

Results

Initially, 1097 articles were identified through literature search (Fig. 1). After deleting the duplicates, 580 records remained, of which 535 records were removed because of irrelevance, such as not a clinical study, not patients without eye surgery, not comparing the accuracy of IOL formulae. Finally, 45 articles were selected for full-text assessment. Among these, five studies did not include the target formula or constant optimization, and the primary outcomes were not available in seventeen studies, and four studies evaluated eyes with axial length less than 24.5 mm, and four studies were excluded due to the other reasons, such as reviews, conference abstracts or not being an English literature. After excluding of these studies, 15 articles were used for qualitative analysis [3,4,5, 20,21,22,23,24,25,26,27,28,29,30,31].

Fig. 1
figure 1

Flow diagram of articles selection

Study characteristics and quality assessment

A total of 2273 eyes were enrolled in 15 studies (Table 1). All eligible studies included eyes after phacoemulsification with a mono-focal IOL implanting in the capsular bag. The modified QUADAS-2 was used to quality assessment (Fig. 2). Detailed information on the comprehensive assessment is described in Appendix 1. For patient selection, only one study had inappropriate exclusions, resulting in a high risk of bias [4]. Seven studies did not clarify patient enrollment methods, resulting in an unclear risk of bias. For reference standard and flow and timing assessment, six studies performed subjective refraction and three studies did not describe the refraction method. For the index test, only two studies did not declare the definition of outcomes.

Table 1 Characteristics of study participants
Fig. 2
figure 2

Quality assessment of the eligible studies according to the modified QUADAS-2

Outcomes

Among the 2273 eyes included, 115 eyes were calculated with Kane, 115 with EVO, 115 with LSF, 504 with Olsen, 1986 with BUII, 879 with Hill-RBF, 633 with T2, 1823 with Haigis, 121 with VRF, 1807 with SRK/T, 741 with Holladay II, 2131 with Holladay I and 1431 with Hoffer Q. The overall percentages of eyes with PE in ± 0.5D (± 1.0D) of the above formulae are 86.96% (99.13%), 86.09% (97.39%), 83.48% (97.39%), 77.18% (96.03%), 76.38% (96.27%), 74.74% (95.34%), 69.51% (93.21%), 67.80% (91.72%), 65.29% (91.74%), 65.08% (91.20%), 63.97% (90.28%), 59.50% (87.19%) and 54.51% (83.02%), respectively (Fig. 3).

Fig. 3
figure 3

The overall percentage of refractive prediction error within ± 0.5 D and ± 1.0 D of the included formulae (HQ: Hoffer Q; H1: Holladay I; H2: Holladay II; BUII: Barrett Universal II; D: diopter)

Percentage of eyes with a PE in ± 0.5D

Figure 4 shows the difference in the percentage of eyes with a PE in ± 0.5D comparing BUII with the other formulae. BUII was more accurate than SRK/T (P < 0.001), Hoffer Q (P < 0.001), Holladay I (P < 0.001), Holladay II (P = 0.01) and Haigis (P < 0.001). Appendices 2–7 show the results of pairwise comparisons of other formulae in terms of the percentages of eyes with PE in ± 0.5D. SRK/T had a significantly higher percentage of eyes with PE in ± 0.5D than Hoffer Q (P = 0.009) and lower of that than Haigis, Olsen and Hill-RBF (Appendix 2, P = 0.02, < 0.001, = 0.02, respectively). And Hoffer Q also had the lowest percentage when comparing with Haigis, Olsen, EVO, Kane, Hill-RBF and LSF (Appendix 3, P = 0.02, 0.01, 0.007, 0.004, 0.002, 0.03, respectively). Additionally, significant differences were found between Holladay I and Holladay II, EVO and LSF (Appendix 4, P = 0.02, 0.03 and 0.009, respectively) and between Olsen and Haigis (Appendix 6, P = 0.003).

Fig. 4
figure 4figure 4

Forest plots of the percentage of eyes with refractive prediction error in ± 0.5 D when comparing Barrett Universal II with SRK/T (A), Hoffer Q (B), Holladay I (C), Holladay II (D), Haigis (E), Olsen (F), T2 (G), EVO (H), Kane (I), H-RBF (J), LSF (K)

Percentage of eyes with a PE in ± 1.0D

The comparison between BUII with the others in terms of percentage of eyes with a PE in ± 1.0D is shown in Fig. 5. The percentage of eyes with a PE in ± 1.0D calculated by the BUII formula was significantly higher than the SRK/T (P < 0.001), Hoffer Q (P < 0.001), Holladay I (P < 0.001), Holladay II (P < 0.001), Haigis (P < 0.001) and T2 (P = 0.02). Appendices 8–13 show the results of pairwise comparisons of other formulae in terms of the percentages of eyes with PE in ± 1.0D. SRK/T produced a higher percentage than Hoffer Q (P = 0.006) and Haigis (P = 0.03), but a lower of that than Olsen and Hill-RBF (Appendix 8, P = 0.003 and 0.009, respectively). Additionally, Hoffer Q was less accurate than Holladay I, Haigis and Olsen (Appendix 9, P = 0.007, 0.03, 0.02, respectively). And a higher percentage of eyes with PE in ± 1.0D was also found in Olsen when comparing with Holladay I (Appendix 10, P < 0.001) and Haigis (Appendix 12, P = 0.002).

Fig. 5
figure 5figure 5

Forest plots of the percentage of eyes with refractive prediction error in ± 1.0 D when comparing Barrett Universal II with SRK/T (A), Hoffer Q (B), Holladay I (C), Holladay II (D), Haigis (E), Olsen (F), T2 (G), EVO (H), Kane (I), H-RBF (J), LSF (K)

Heterogeneity and subgroup analysis

Forest plots showed the I2 values, mean difference and 95% confidence interval (CI). Substantial heterogeneity was detected in 11 pairs, and then the random-effect model was used (Appendix 14–15). Sensitivity analysis indicated that I2 value decreased to 0% in the comparison of the percentage of eyes with PE in ± 0.5D between BUII and Holladay II by omitting Ji 2021 (Appendix 14C, P < 0.001). In addition, substantial heterogeneity was decreased to insignificant when the enrolled eyes were sub-grouped by the biometry measurement difference (IOL Master and Lenstar). The results showed that there were significant differences in comparison of the percentage of eyes with PE in ± 0.5D (1.0D) between BUII and Hoffer Q and Holladay I in both subgroups (Appendix 14 A-B and 15 A-B, all P < 0.01). Similar results were also found when comparing SRK/T with Hoffer Q and Holladay I in ± 0.5D group and Haigis with SRK/T and Hoffer Q in ± 1.0D group (all P < 0.05). Funnel plot is shown in Appendix 16.

Discussion

The widespread application of phacoemulsification combined with in-bag IOL implantation has led to the improvement of surgical techniques and the reduction of surgical complications. Therefore, the postoperative refractive status is less affected by surgical factors than it was in the past. Yet the utmost importance that affects the refractive status after cataract surgery is the accuracy of IOL power calculation which is closely related to the choice of IOL formula. A previous meta-analysis by Wang et al. compared the accuracy of different IOL formulae in axial myopic eyes and demonstrated the superiority of BUII over SRK/T, Hoffer Q, Holladay I and Holladay II [32]. However, they did not compare the newest formulae based on ray tracing or AI methods.

Previous studies indicated that several measures could be used to evaluate the accuracy of IOL formulae, such as MAE, median absolute prediction error (MedAE), as well as the percentage of eyes with PE in ± 0.5D and ± 1.0D and > 2.00D [33]. It is better to compare MedAE and MAE when comparing the accuracy of different IOL power calculation formulae. However, both MedAE and MAE are abnormal Gaussian distribution and it is also difficult to obtain standard deviation values, resulting in the impossibility to compare the MedAE and MAE in this meta-analysis. Therefore, we choose to compare the percentage of eyes with PE in ± 0.5D and ± 1.0D. In addition, as the central cornea alter to the flattest area after myopic ablation, the ratio between anterior and posterior corneal curvature would decrease [34]. This meta-analysis excluded myopic eyes with previous refractive surgery because of the corneal power measurement error and ELP error after laser ablation [35, 36]. Moreover, in the present study, we included studies that used the optical biometer (IOL Master or Lenstar) for biometry measurements, which was recommended as the best precision of measurements [33]. Postoperative refraction should be measured at least 2 weeks, because it could be considered to be stable at 1 week after small incision cataract surgery with a 1-piece IOL implantation [37, 38]. In order to control heterogeneity and biasness, we also excluded the studies which did not perform the constant optimization procedure. Additionally, the new-generation IOL formulae (BUII, Olsen, T2, VRF, EVO, Kane, Hill-RBF and LSF) were compared with traditional ones (SRK/T, Hoffer Q, Holladay I, Holladay II, Haigis). To our knowledge, this is the first meta-analysis to assess the accuracy of the new-generation IOL power calculation formulae in axial myopic eyes by comparing the percentage of eyes with PE in ± 0.5D and ± 1.0D.

Traditionally, IOL power calculation formulae were classified by generation. For example, SRK II is a second-generation formula; SRK/T, Hoffer Q and Holladay I are third-generation formulae; Holladay II and Haigis are fourth generation; Barrett Universal II, Olsen and another new formula are fifth generation, which also called the new generation. Over the past four decades, cataract surgeons usually applied the third- or fourth-generation formulae to calculate the IOL power in virgin eyes. Previous studies showed that, for axial myopic eyes, SRK/T was slightly better than Hoffer Q and Holladay I [39,40,41]. Similar results were found in this meta-analysis. However, all third-generation formulae showed a lower percentage of eyes with PE in ± 0.5D and ± 1.0D than Haigis. The disadvantage of third-generation formulae is that only two variables (AL and corneal power) are used to determine the postoperative IOL position, making it inaccurate to evaluate the actual size of the anterior segment of the eyes. The Haigis formula adds ACD and the Holladay II adds ACD, lens thick, corneal diameter, pre-refraction and gender to calculate the postoperative IOL position [42]. Although the fourth-generation formulae introduce more parameters to estimate the ELP, most traditional formulae could make a hyperopic drift postoperatively, and even the longer the axial length is, the higher the PE is made. It might be associated with a slight but significant reduction in the AL measurement after cataract surgery. Maddalena et al. pointed out two hypotheses to expound the differences in AL measurement, including the decrease in the volume of the eye postoperatively and the wrong refractive index of the lens due to the cataract grade preoperatively [43].

In order to improve the accuracy of IOL power calculation, it has recently come up with several new formulae, collectively called the fifth-generation formulae, such as BUII, Olsen and formulae based on AI [15]. Each of these formulae uses at least five variables to calculate IOL position. The BUII is the evolution of the Barrett Universal I, published as a thick-lens paraxial formula [44]. Kane et al. have proved that BUII is the most accurate formula in comparison with the third- and fourth- generation formulae for IOL power calculation in medium, medium-long and long eyes [3]. It showed a higher percentage of eyes with PE in ± 0.5D and a lower MAE as well as MedAE. In our meta-analysis, BUII also showed a higher percentage of eyes with PE in ± 0.5D and ± 1.0D than SRK/T, Hoffer Q, Holladay I, Holladay II and Haigis. In addition, our study included Olsen formula which is based on ray-tracing method and uses a special C-constant to estimate ELP [45]. C-constant is based on ACD, lens thickness and IOL constant, and no longer dependent on AL or corneal power. Meanwhile, it also considers corneal irregularity, pupil diameter and IOL thickness to minimize the aberration. Melles and Rong demonstrated its superiority for IOL power calculation in comparison with traditional formulae [16, 22]. Our meta-analysis also found the accuracy of Olsen could be comparable to BUII, which is significantly better than the third- and fourth-generation formulae.

Except for all above-mentioned formulae, there are a lot of the other new formulae being proposed to calculate IOL power, where T2 is a modification of the original SRK/T, and VRF or EVO is a vergence formula even it is unpublished, as well as several formulae based on AI, such as Kane, Hill-RBF and LSF [15, 46]. Although all these formulae were reported high accuracy, they have been tested by a few studies [30, 31]. Hill-RBF is the first AI formula installed on the Lenstar, which uses pattern recognition and data interpolation to calculate the IOL power; Kane is a new formula based on theoretical optics and contains some element of AI, but its structure is largely unknown; LSF uses the postoperative data of more than 4000 eyes to build a three-dimensional model to calculate IOL power [24, 30, 47]. It recently was reported that EVO and Hill-RBF had a more popularity and excellent outcome in IOL power calculation for axial myopic eyes [30]. In this meta-analysis, we found that Hill-RBF had a significantly higher percentage of eyes with PE in ± 0.5D and ± 1.0D than SRK/T. No statistical difference was found between the other new formulae and traditional formulae. Interestingly, our study indicated that Kane, EVO and LSF showed a higher percentage of eyes with PE in ± 0.5D and ± 1.0D than the others, even without statistically significant differences. The possible reason might be that all three formulae are created with “big data” techniques and using several basic parameters to make its predictions [48, 49]. Due to the limit number of the enrolled eyes with VRF, EVO, LSF and Kane, it is still difficult to determine which new formulae are the most accurate one for IOL power calculation in axial myopic eyes.

This meta-analysis has several limitations. Firstly, a few of the enrolled studies were retrospective settings with a small sample size, which could cause a selective bias that is relative to the variability of patient characteristics, multiple IOL types and a single-center study. However, it is considered acceptably to compare the accuracy of IOL power calculation formulae [50]. Second, although the AL of all included eyes was longer than 24.5 mm, the scope of AL was different between different studies. This would have a small effect on the eventual results. Next, one of the enrolled study might include both eyes of the same patient, which could lead smaller P-values [23]. But there were only 136 eyes of 92 patients in this study. It might not significantly affect the final results of 2273 eyes. Finally, only one study included the EVO, Kane and LSF. Therefore, the results in comparison with these three formulae have a limited application. Future studies need to investigate the accuracy of these results for IOL power calculation in axial myopic eyes.

Conclusions

This meta-analysis reveals promising results for the new-generation formula, such as Kane, EVO, LSF, BUII and Hill-RBF, in IOL power calculation for axial myopic eyes. These new formulae have a higher percentage of eyes with a PE both in ± 0.5 D and ± 1.0 D than traditional formulae, such as SRK/T, Hoffer Q, Holladay I/II and Haigis.