Keywords

In Memory of Wolfgang Haigis

The late Wolfgang Haigis proposed a concept of quality metrics of measuring the performance IOL power calculation formulas. The final index is known as the IOL formula performance index, PI. This is a quantitative analysis. For a good and fair comparison, the constants should be optimized before analyzing their performances. This eliminates the bias of the lens constant that was chosen for the analysis. After optimizing the constants, the formulas are compared on their standard deviation, SDME, of prediction (numerical) error; the median absolute error, MedAE; the dependency of prediction error on axial length, m, and; finally, the reciprocal of the percentage of predicted refraction within ±1.00 D, n10.

A good formula comparison is when, ME = 0:

  1. 1.

    SDME → 0

  2. 2.

    MedAE → 0

  3. 3.

    \( \left|m\right|=\frac{\varDelta \mathrm{PE}}{\varDelta \mathrm{AL}}\to 0 \)

  4. 4.

    \( \frac{1}{n_{10}}\to 0 \)

where ME is the Mean (numerical) prediction error of the formula and should be zero when the constant is optimized. PE is the prediction error. SDME is the standard deviation of prediction (numerical) error; MedAE is the median absolute error; |m| is the absolute gradient of the relationship of prediction error with axial length; and finally, n10 is the percentage of eyes within ±1.00 D of predicted spherical equivalent refraction target.

Thereafter, f = SDME + MedAE + 10 ∗ |m| + 10 ∗ (n10)−1

Finally, the IOL formula performance index, PI

$$ PI=\frac{1}{f}=\frac{1}{{\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MedAE}+10\ast \left|m\right|+10\ast {\left({n}_{10}\right)}^{-1}} $$

The metrics |m| and n10−1 were amplified by a factor of 10 because of their small values. Absolute values are used to prevent false reduction of outcomes. In any case, a good formula should be independent of axial length. Whether positive or negative gradient would denote dependency of formula on axial length.

Modification

Wolfgang Haigis first presented the above metric in an ESCRS Meeting and it is available to view on the ESCRS website. It was updated and published in JCRS in 20 [1] which is the only publication of it to date. Today [1], the newer formulas have become more accurate and therefore some updates to his original concept are due to allow for better resolution. There is an increasing emphasis on the importance of MAE, and rightly so, since this should be included as a metric. Besides n10, n5 is added also is. n5 which is defined as the reciprocal of the percentage of correctly predicted refractions within ±0.50 D. This should provide a better resolution. n10 is kept as a safety metric. n5 and n10 are normalized by multiply by 20.

Besides having a dependency on AL, some formulas also exhibit bias against K. For more detailed analysis, the relationship between prediction outcomes and K is also included as a metric in the modified Haigis index.

With the additional metrics to the equation, the PI becomes:

$$ f={\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+10\ast \left|m\right|+3\ast \left|k\right|+20\ast {\left({n}_5\right)}^{-1}+20\ast {\left({n}_{10}\right)}^{-1} $$
$$ PI=\frac{1}{f}=\frac{1}{{\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+10\ast \left|m\right|+3\ast \left|k\right|+20\ast {\left({n}_5\right)}^{-1}+20\ast {\left({n}_{10}\right)}^{-1}} $$

where |k| is the gradient, \( \left(\frac{\varDelta PE}{\varDelta k}\right) \) of prediction error against keratometry. MAE is the mean absolute error.

Application

It must be noted at the outset of evaluating these formulas, that the author of the Hoffer Q formula [2, 3] recommended it primarily for short eyes (<22.0 mm) and never for eyes with an AL greater than 24.5 mm and definitely not for very long eyes (>26.0 mm), yet most of these studies evaluated the Hoffer Q over the full range of ALs, thus insuring it’s rating would be rather low.

In 2017, Fam presented a paper at the annual conference of the Asia-Pacific Association of Cataract and Refractive Surgeons (APACRS) [4]. The paper detailed the outcomes of a single IOL, ZCB00. A total of 291 eyes from 291 patients with preoperative biometry measured with partial coherent interferometry (PCI) (IOLMaster 500) and postoperative refractions carried out between 4 and 6 weeks. All the third-generation formulas are calculated using constants from a previous pool of patients. Barrett Universal II (BUII) [5, 6], EVO, and RBF are based on using an optimized A constant from the same pool of patients. BUII, EVO 1.0, and RBF 1.0 were more accurate than the third-generation theoretical formulas. The Haigis formula, both with personalized triple optimization and ULIB constants, did also very well (see Fig. 33.1, Table 33.1).

Fig. 33.1
3 graphs. A. A box plot titled prediction error with an irregular horizontal trend. B. A box plot titled Absolute Errors with an irregular horizontal trend near the X axis. C. A stacked bar graph titled percentage A E. The highest stacked bar is Barrett.

This Figure and Table 33.1 depict the outcomes of the various formulas. (a) The spread of the prediction errors of the eyes of the different formulas. The bottom and top error plots represent the lower and upper quartiles while the blue and red boxes, the second and third quartiles. The dotted line is the mean prediction errors and the dashed lines, the lower and upper SDs. (b) is a graph showing the absolute errors of the formulas. The MAE and MedAE are represented by the dotted line and blue dashed lines, respectively. Chart (c) is a stacked histogram showing the percentage of eyes within a predicted spherical equivalent (SE) (EVO is EVO 1.0; HaigisT is Haigis with personalized triple optimization; HaigisU is Haigis with ULIB constants; RBF is RBF 1.0. SRK/T-F1 [10,11,12] and SRK/T-F2 are the Fam-adjusted SRK/T formulas [13] (Fam, The Formula1 of IOL Power Calculation [7])

Table 33.1 This table shows the results of the various formulas (EVO is EVO 1.0; HaigisT is Haigis with personalized triple optimization; HaigisU is Haigis with ULIB constants; RBF is RBF 1.0) [7]. The ±0.50 D is in bold and important clinically, as are ±1.00 D

Using the modified Haigis’ quality metrics on IOL power calculation formula, as described in Table 33.1, the following f values and performance indices are generated for the above data. These values are tabulated in Table 33.2 and featured in Fig. 33.2.

Table 33.2 This table shows the values of the Haigis quality metrics based on the data from the previous table
Fig. 33.2
A stacked bar graph title performance index. Hoffer Q presents the highest peak followed by Holladay I and S R K by T. A line with a fluctuating trend overlaps the stacked bar graph.

The stacked histogram depicts the values of individual metrics, based on the previous table. The lower the individual component and overall height f of the stacked histogram the better. The scale for the stacked histogram f is on the left. The red line graph depicts the performance indices of the formulas. The performance index is the reciprocal of the total value of the stacked column. The higher, the better is the performance. The scale for the performance index is on the right. As illustrated, the best performing formula is EVO followed by the 2 Haigis, RBF 1.0, and BUII. These 4 formulas performed much better than the other formulas

Unfortunately, the bias of the prediction errors against K and AL were not available in most studies in this review and therefore have to be omitted as metrics. Ideally, only optimized constants should be used when comparing formulas. In this review, not all studies were based on optimized constants, especially in subgroup analyses. In this review, ME would be omitted in the ranking of formulas in general studies across ALs. This is to avoid a systematic error. For subgroup analyses, PE would be included as a metric to capture bias against the subgroup.

For analysis of the general group, the following metrics would be included:

  1. 1.

    Standard deviation of prediction error SDME 0

  2. 2.

    Mean absolute error MAE 0

    Mean Absolute Error MedAE 0

  3. 3.

    Percentage of error within ±0.5D \( {n_5}^{-1}=\frac{1}{n_5}\;0 \)

  4. 4.

    Percentage of error within ±1.0D \( {n_{10}}^{-1}=\frac{1}{n_{10}}\;0 \)

f is the sum of all the above metrics:

$$ f={\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+20\ast {\left({n}_5\right)}^{-1}+20\ast {n_{10}}^{-1} $$

and finally PI, the performance index:

$$ \mathbf{PI}=\frac{1}{f}=\frac{1}{{\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+20\ast {\left({n}_5\right)}^{-1}+20\ast {\left({n}_{10}\right)}^{-1}} $$

The following metrics will be used for analyzing subgroup studies:

  1. 1.

    Absolute mean numerical prediction error ⌈ME⌉ 0

  2. 2.

    Standard deviation of prediction error SDME 0

  3. 3.

    Mean absolute error MAE 0

  4. 4.

    Median absolute error MedAE 0

  5. 5.

    Percentage of error within ±0.50 D \( {n_5}^{-1}=\frac{1}{n_5}\;0 \)

  6. 6.

    Percentage of error within ±1.00 D \( {n_{10}}^{-1}=\frac{1}{n_{10}}\;0 \)

f is the sum of all the above metrics:

$$ f=\left|\mathrm{ME}\right|+{\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+20\ast {\left({n}_5\right)}^{-1}+20\ast {n_{10}}^{-1} $$

and finally PIsub, the performance index (subgroup):

$$ \mathbf{P}{\mathbf{I}}_{\mathbf{sub}}=\frac{1}{f}=\frac{1}{\left|\mathrm{ME}\right|+{\mathrm{SD}}_{\mathrm{ME}}+\mathrm{MAE}+\mathrm{MedAE}+20\ast {\left({n}_5\right)}^{-1}+20\ast {\left({n}_{10}\right)}^{-1}} $$

Not all studies detailed all of the above metrics. For this review, we will only rank formulas in studies, in both general and subgroups, that have more than 3 of the above 6 metrics.

Further Review

There have been numerous studies published comparing the outcomes of the newer formulas, as well as against the established 3rd generation theoretical formulas. We will review some of these published articles and papers presented during recent conferences. A summary of the review is tabulated in Table 33.3.

Table 33.3 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively. BUII-noACD and EVO 2.0-no ACD signify ACD values were omitted in the related formulas. Holladay 2 PreSurgRef and Holladay 2 NoRef refer to Holladay 2 formula with and without preoperative refractions, respectively. Holladay 2018 and Holladay 2019 pertains to the versions of the Holladay 2 formula. Holladay 2-ALadj is a non-linear AL adjustment available as an option in the Holladay 2 program for eyes that are longer than 24.0 mm. LSF stands for Ladas Super Formula. Olsen2P and Olsen4P are Olsen formula using 2 parameters and 4 parameters to determine ELPs, respectively. Olsen2P is preinstalled in biometers while Olsen4P is also known as Olsen standalone and is available in the program, PhacoOptics. SRK/T-F1 and SRK/T-F2 are SRK/T with Fam-adjustment to the ALs and Ks. When specified, ULIB implies using the constants from the ULIB website. _WK indicates Wang-Koch adjustment

Table 33.3 is a summary of outcomes in the literature as well as papers presented at conferences. The orders of the formula for each source are sorted in an order based on a modification of the Haigis performance index (PI) for comparing IOL power calculation formulas as explained above. The parameters used in this modified quality metrics are the SD, MAE, and MedAE, percentage of absolute error within ±0.50 D and ±1.00 D. The inverse of the percentage of absolute error are used and these are normalized by amplifying by 20 for ±0.50 D and ±1.00 D, respectively. All the parameters are added up quantitatively. All the 4 to 6 parameters are summed up. The lower the sum the better. The reciprocal of that sum is the PI. The order above was sorted in decreasing performance index. The outcome is quite similar to that employed by Cooke et al. The formulas are ranked within the same study and not between studies, as the available parameters and clinical situations may be different.

The stacked histogram (Fig. 33.3) shows how the formulas fare in 17 articles, of which sixteen are ranked. Each box indicates the frequency the formula is ranked first, second, third, and fourth based on their PI. These are denoted by blue for 1st; magenta for 2nd; turquoise for 3rd and yellow for 4th. The line graph represents the number of ranked studies the formula was being compared. BUII was the most quoted and had performed well with most studies ranking it as first. EVO and Kane had also done well, with Kane having a relatively high proportion as best performing formula while EVO 2.0 had the highest proportion of being featured as one of the top 4 ranked formulas.

Fig. 33.3
A stacked bar graph with a superimposed line graph titled all axial lengths. Both the bar and the line graph present trends. The stacked bar that represents Barret presents the highest peak.

Stacked histogram showing the performance indices of the various formulas in the literature

Subgroup Analyses

The third-generation theoretical formulas are good but are noted to have a bias against AL and K. In the past, different formulas were recommended for different ALs and Ks as first recommended and published by Hoffer in 1993. For normal, these older formulas function well. Against this backdrop, newer formulas must show improvement in longer and shorter axial lengths and extreme corneal curvatures.

The Long and Short of It

Short Eyes

A short eye is generally defined as an eye that is 22.0 mm in AL or shorter. IOL power calculation in short eyes is always a challenge. The biometric measurements have to be more precise. The IOL powers are of higher iopter and are consequently more sensitive to even small variations in ELP. Hence, the prediction errors are generally higher than in normal eyes.

The charts (Fig. 33.4) and Table 33.4 showed the accuracy of the different formulas in short eyes (≤22.0 mm). IOL constants for the third-generation formulas were from the greater pool of patients and IOLs. ULIB constants were used for Haigis as some IOLs did not have sufficient numbers for triple optimization. 8 different IOLs are used in this study. BUII, EVO, and RBF were calculated with the optimized A-constant. Fig. 33.4a shows the prediction errors of the formulas, while Fig. 33.4b, c show the absolute errors and percentage of absolute errors.

Fig. 33.4
3 graphs. A. A box plot titled Prediction errors with an irregular horizontal trend. B. A box plot titled Absolute Errors with an irregular horizontal trend near the X axis. C. A stacked bar graph titled percentage A E. Barret, E V O, Haigis U, and R B F present the highest peaks.

Chart (a) displays the prediction error of the formulas. The dual colored boxes in chart (a) represent the 2nd and 3rd quartiles of the spread of prediction errors. The error plots are the 1st and 4th quartiles. The line graphs are the upper and lower SDs. Chart (b) shows the absolute error of the formulas. The tri-colored boxes are the 1st, 2nd, and 3rd quartiles while the error plot is the last quartile. The blue and black dashed lines are the MedAEs and MAEs. Chart (c) is a stacked histogram showing the percentage of eyes within ±0.25, ±0.50, ±0.75, and ±1.00 D of the refraction target

Table 33.4 This table shows the modified Haigis performance indices of the various formulas (EVO is EVO 1.0; RBF is RBF 1.0) [32]

From Fig. 33.4 and Table 33.4, BUII, Haigis (ULIB), RBF 1.0 and EVO had better outcome metrics than the other formulas. BUII, Haigis, RBF 1.0, and EVO 1.0 had lower than 0.40 D and 0.30 D of MAE and MedAE, respectively, and more than 70% within ±0.5 D of expected refraction. All four formulas scored better than 0.60 on the performance index.

Review (Short Axial Lengths)

Table 33.5 is a summary of outcomes in the literature as well as papers presented at conferences on short eyes. As with the above table, the order of the formulas for each source are sorted in order based on a modification of Haigis “Quality metrics for comparing IOL calculation formulas.”

Table 33.5 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively. Holladay 2 PreSurgRef and Holladay 2 NoRef refer to Holladay 2 formula with and without preoperative refractions, respectively. LSF stands for Ladas Super Formula [33]. Olsen2P and Olsen4P are Olsen [34,35,36,37,38,39,40,41] using 2 parameters and 4 parameters to determine ELPs, respectively. Olsen2P is preinstalled in biometers while Olsen4P is also known as Olsen standalone and is available in the program, PhacoOptics. When specified, ULIB implies using the constants from the ULIB website

The stacked histogram (Fig. 33.5) shows how the formulas fare in 8 ranked datasets of 11 articles. Each box indicates the number of times the formula is being ranked based on its PI. Blue is for 1st ranking; magenta for 2nd; turquoise for 3rd and yellow for 4th. The line graph represents the frequency of ranked studies the formula was being compared. Most of the new formulas performed reasonably well. PEARL-DGS was ranked 1st in both studies quoted. Holladay 1 and Barrett were the two most featured formulas. Both had performed reasonably well with most studies ranking it as among the top 4. Among the older theoretical formulas, Haigis and Holladay 1 stand out.

Fig. 33.5
A stacked bar graph and superimposed line graph titled short axial length. The line graph presents a fluctuating decreasing trend. Barrett and Holladay 1 present the highest peaks.

Stacked histogram showing the performance indices of the various formulas for short axial length

Wendelstein et al. did a study to look at the accuracy of 13 different concepts in extreme short eyes [4]. 150 eyes of 150 patients were recruited for this study and 2 IOL models (SA60AT and ZCB00) were used. The constants were optimized from a separate patient cohort. Biometry was measured with either LenStar LS 900 or IOLMaster 700 (Carl Zeiss Meditec AG, Jena, Germany). Postoperative refraction was done at 4 weeks. They concluded that PEARL-DGS, Okulix [43], Kane, and Castrop showed the lowest MAE.

From the graph (Fig. 33.6), Castrop had good accuracy for both groups. PEARL-DGS was the most accurate for the >28.5 D group and was also good for the ≤28.5D group. Okulix had also performed well with the subgroup performance index of above 0.60.

Fig. 33.6
A stacked bar and a scatter plot titled Short Eyes Wendelstein 2021. The stacked bars present an increasing trend, while the superimposed scatterplots present a decreasing trend.

The stacked histograms show the quality metrics f of the formulas in extremely short eyes [4]. Each formula is divided into 2 groups (1. Emmetropic IOL power ≤28.5D and 2. Emmetropic IOL power >28.5D). The scale for the stacked histogram f is on the left. The lower the stacked histogram, the better is the formula performance. The circles and triangles represent the PI. The scale for PI is on the right. The higher the PI score, the better. BUII = Barrett, Hai  = Haigis, HoffQ = Hoffer Q, Holl1 = Holladay 1, Holl2 = Holladay 2, PEARL = PEARL-DGS, RBF =RBF 2.0.

Medium Axial Length

Medium AL is the range of a AL where most eyes are found. It is generally taken to be between 22.0 mm to 24.5 mm, with minor variations. Most formulas perform well in these eyes.

Review (Medium Axial Lengths)

Table 33.6 is a summary of outcomes in the literature as well as papers presented at conferences on medium AL eyes. As with the earlier tables, the orders of the formula for each source are sorted in order based on a modification of Haigis “Quality metrics for comparing IOL calculation formulas.”

Table 33.6 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively. LSF stands for Ladas Super Formula. Olsen2P and Olsen4P are Olsen using 2 parameters and 4 parameters to determine ELPs, respectively. Olsen2P is preinstalled in biometers, while Olsen4P is also known as Olsen standalone and is available in the program, PhacoOptics. SRK/T-F1 and SRK/T-F2 are SRK/T with Fam-adjustment to the ALs and Ks. When specified, ULIB implies using the constants from the ULIB website

The stacked histogram (Fig. 33.7) shows how the formulas fare in 6 ranked datasets in 9 papers. Each box indicates the frequency the formula is being ranked based on PI. Blue for 1st; magenta for 2nd; turquoise for 3rd; and yellow for 4th. The dotted line joins the number of ranked studies the formula was being compared to. There were far fewer studies specifically focused on this range. This chart mirrored that of all ALs, as most of the eyes fall into this group. The performances in this range of ALs were quite spread out. This is not surprising as most formulas perform well in this “normal” range. BUIIt and Holladay 1 were the most quoted and had the highest number of top 4 rankings. RBF 2.0, Kane, and Olsen were next.

Fig. 33.7
A stacked bar graph with a superimposed line graph plots medium axial lengths. The stacked bar, which presents Barrett, exhibits the highest peak. The line follows a fluctuating trend.

Stacked histogram comparing the performance indices of the various formulas for medium ALs

Very Long Axial Length (>26.0 mm)

The threshold for medium long AL is from 24.5 mm to 26.0 mm. Very long ALs are defined as >26.0 mm. 

At the 2016 APACRS annual conference in Bali, Fam presented his findings on the performances of the various formulas for eyes with very long ALs [32]) (Fig. 33.8, Table 33.7).

Fig. 33.8
6 graphical illustrations. A and B. Box plots titled prediction errors greater and less than 5 with irregular horizontal trends. C and D. 2 box plots titled absolute errors greater and less than 5 with irregular horizontal trends near the X axis. E and F. Stacked bar graphs plot percentage A E.

The charts and Table 33.7 depict the outcomes for very long eyes (≥26.0 mm). Charts on the left column were for eyes 26.0 mm and longer and implanted with IOL ≥ 5.0D. 11 different IOLs were used in this study. IOL constants for the third-generation formulas were from the greater pool of patients and IOLs. ULIB constants were used for Haigis as some IOLs did not have sufficient numbers for triple optimization. BUII, EVO, and RBF 2.0 were calculated with the optimized A-constant of SRK/T. The charts on the right column show outcomes for eyes 26.0 mm and longer, and implanted with IOL <5.0D. 7 different IOLs were included in the study; most of these were special very low or negative-diopter IOLs. Figure (a, b) display the numerical prediction errors of the formulas, while Figs. (c, d) depict the absolute errors; and (e, f) the percentage of absolute errors. Most of these eyes were out of the domain for RBF 1.0. RBF in the original presentation was updated to RBF 2.0 in these charts. The formulas in Table 33.7 are arranged in order of their subgroup PI ranking. n is for the number of eyes. ME and SD are the means and standard deviations of numerical prediction errors, respectively. MAE and MedAE are the mean and median absolute errors. ±0.50 D and ±1.00 D are the percentage of eyes within those ranges of target refractions, respectively

Table 33.7 This table shows the modified Haigis performance indices of the various formulas (EVO is EVO 1.0; RBF is updated to RBF 2.0) [32]

In long eyes, the third-generation formulas underestimated the dioptric powers and the resultant refractions were hyperopic. The newer formulas such as BUII, EVO, and RBF 2.0 were more accurate in their calculations. EVO was the most accurate in both datasets. The Fam and Wang-Koch adjustment compensated well for the otherwise hyperopic outcomes of Holladay 1. The hyperopic errors and inconsistencies were more apparent and exacerbated in the low dioptric lens powers.

Review (Long Axial Lengths)

Table 33.8 is a summary of outcomes in the literature as well as papers presented at conferences on long eyes. As with the earlier tables, the orders of the formula for each source are sorted in order based on a modification of Haigis “Quality metrics for comparing IOL calculation formulas.”

Table 33.8 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively. Barrett-noACD and EVO 2.0-no ACD signify ACD values were omitted in the related formulas. Holladay 2 PreSurgRef and Holladay 2 NoRef refer to Holladay 2 formula with and without preoperative refractions, respectively. Holladay 2018 and Holladay 2019 pertain to the versions of the Holladay 2 formula. Holladay 2-ALadj is a nonlinear AL adjustment available as an option in Holladay 2 program for eyes that are longer than 24.0 mm. LSF stands for Ladas Super Formula. Olsen2P and Olsen4P are Olsen using 2 parameters and 4 parameters to determine ELPs, respectively. Olsen2P is preinstalled in biometers, while Olsen4P is also known as Olsen standalone and is available in the program, PhacoOptics. SRK/T-F1 and SRK/T-F2 are SRK/T with Fam-adjustment to the axial lengths and corneal powers. -AL1, AL2, and nonlinear AL indicate the first and second linear versions and the non-linear version of Wang-Koch axial length adjustments, respectively. CMAL pertains to the Cook-modified AL. When specified, ULIB implies the constants from the ULIB website are being used in the calculations. _WK indicates ALs with Wang–Koch adjustments

The stacked histogram (Fig. 33.9) shows how the formulas fare in 16 articles, of which sixteen are ranked. Each box indicates the number of times the formula is being ranked according to the color: blue for 1st; magenta for the 2nd; turquoise for 3rd and yellow for 4th. The dotted line joins the number of ranked studies the formula was being compared to. BUII was the most quoted and had performed well. EVO 2,0 was quoted in 6 articles but had a proportionately higher number of first ranking. RBF 2.0 and Haigis had also done well.

Fig. 33.9
A stacked bar graph titled long axial lengths. The graph presents a line graph superimposed on it. The stacked bar, which presents Barrett, exhibits the highest peak. The line follows a fluctuating trend.

Stacked histogram comparing the performance indices of the various formulas for long axial lengths (≥26 mm)

We will look deeper into the accuracy of the formulas in long axial length but between low-diopter and even lower-diopter eyes.

The 2 charts (Fig. 33.10) illustrate the difference in formula precision as the ALs approach low diopter or negative diopter territory. Chart A is by Abulafia [44] and Chart B by Fam [32]. Abulafia used 6 D while Fam used 5 D as thresholds. The newer formulas such as EVO 2.0, BUII, and RBF 2.0 showed good precisions throughout both groups, as demonstrated by the high subgroup PIs. Wang-Koch adjustments also showed good results, especially with the Holladay 1.

Fig. 33.10
2 graphs titled long eyeball Abulafia 2015 and long axial length Fam 2016. The superimposed scatterplots in both graphs present decreasing trends. The stacked bars present an increasing trend.

Stacked histograms depicting the components of quality metrics and the line charts showing the subgroup Performance Indices, PI of the formulas for very long axial lengths. (a) is from the study by Abulafia [44] (b) is from Fam [32]. The circles are for higher diopter PIs, while the crosses are for lower diopter PIs. The scales for the stacked histograms f are on the left while the scales for PIs are on the right. BUII is Barrett. Holl and Hoff are short for Holladay and Hoffer Q respectively. SRK/T-F1 and SRK/T-F2 are Fam adjusted ALs [13]. -WK is with the Wang-Koch adjustments to the AL

Other Parameters

Flat Cornea (<42.0D) & Steep Cornea (>48.0D)

The charts (Figs. 33.11 and 33.12) and Table 33.9 depict the extremes of cornea curvatures. These were virgin eyes without any history of corneal refractive surgery. Charts on the left column were for a flat cornea (<42.0D) and on the right column for a steep cornea (>48.0D). 7 different IOLs were used for flat eyes and 8 different IOLs for steep eyes. IOL constants for the third-generation formulas were from the larger pool of patients. ULIB constants were used for Haigis as some IOLs did not have enough numbers for triple optimization. BUII, EVO 2.0, and RBF 2.0 were calculated with the optimized A-constant. Fig. 33.11a, b shows the prediction errors of the formulas while Fig. 33.11 c, d show the absolute errors. Figure 33.11e, f are the percentage of absolute errors. Furthers details on the outcomes are in the following tables. Formulas had different accuracy in flat (<42.0D) and steep (>48.0D) eyes. Using the Haigis Quality Metrics, EVO 2.0 and BUII performed the best for flat corneas while RBF 2.0 and EVO 2.0 for steep corneas. In these extremes of curvatures, Haigis and SRK/T were biased and were oppositely affected. Haigis overestimated while SRK/T underestimated for the flat cornea. The converse was true for the steep cornea. From graph G, most formulas were slightly better with a steep cornea than with a flat, except for SRK/T. However, this may not be conclusive, as the comparison was not with the same number of eyes. The above paper was presented in APACRS 2016 in Bali [32].

Fig. 33.11
6 graphs. a and b. 2 stacked box and line graphs for prediction errors with steep and flat cornea, respectively. In graph a, the maximum median is in S R K over T. 3 dashed lines overlap the boxes. e. A stacked bar graph of values in percentage versus 6 formula. The highest stacked bar is for R B F.

The charts and Table 33.7 depict the outcomes for flat (<42.0D) and steep corneas (>48.0D). Charts on the left column were for flatter corneas and the right for steeper corneas. (a, b) Display the numerical prediction errors of the formulas. The colored boxes are for the 2nd and 3rd quartiles, while the error plots are for the 1st and 4th quartiles. The 2 dashed lines are the upper and lower SD. (c, d) depict the absolute errors. The tri-colored boxes are the 1st, 2nd, and 3rd quartiles, and the black and blue dashed lines are the MAEs and MedAEs. (e, f) The percentage of absolute errors

Fig. 33.12
A dual-axis scatter plot and stacked bar graph. It plots values ranging from 0.0 to 2.1 and 0.0 to 0.7 versus 6 sets of formulas. The dataset represents performance indexes and is plotted on and above the stacks. The highest stacked bar is for S R K over T greater than 48.

The stacked histogram shows the quality metrics of the formulas with different corneal powers. <42 is for a corneal power of less than 42 D and >48 is for a corneal power of greater than 48 D. The scale for the stacked histogram is on the left. The lower the stacked histogram, the better is the formula. The circles and triangles are for the performance indices (PI). The scale for PI is on the right. The higher the PI score, the better. The formulas in Table 33.9 are arranged in order of their subgroup PI ranking. n is for the number of eyes. ME and SD are the means and standard deviations of numerical prediction errors, respectively. MAE and MedAE are the mean and median absolute errors. ±0.50 and ±1.00 are the percentage of eyes within ±0.50 D and ±1.00 D target refractions, respectively

Table 33.9 This table shows the modified Haigis performance indices of the various formulas (EVO is EVO 1.0; RBF is RBF 1.0) [32]

The bias or neutrality of formulas with AL and K was reflected with the many charts above and below. This trend was also noted by Melles et al. [20, 21].

Ametropia

At the APACRS annual conference in Hangzhou, Fam presented his finding on ametropia outcomes [7]. The study included 111 eyes with 3 different IOLs. The IOL constants were optimized for the third-generation formulas from a larger pool. The BUIIt was calculated using the optimized A-constant. The targeted refraction ranged from −1.00 D to −5.00 D with the average at −2.00 D.

The charts (Fig. 33.13) and Table 33.10 detailed the outcomes for the ametropia study. IOL constants for the third-generation formulas were optimized from the larger pool of patients. HaigisT was Haigis with triple optimization. BUII and EVO were calculated with the optimized A-constant. Figure 33.13a, b show the prediction (numerical) errors and the absolute errors of the formulas, respectively, while Fig. 33.13c is a stacked histogram depicting the percentage of eyes within a specified Diopter range of predicted spherical equivalent. Figure 33.13d is the stacked histogram of the quality metrics for each of the formulas. The circle represents the subgroup performance index, PI. The table shows the details of Haigis’ Quality Metrics. EVO was the highest-ranking followed by Haigis and Barrett. All three formulas have performance indices that were better than 0.6.

Fig. 33.13
4 graphs. a and b. 2 stacked box and line graphs for prediction and absolute errors, respectively. In graph a, the maximum median is in Hoffer Q. 3 dashed lines overlap the boxes. c. A stacked bar graph of values in percentage versus 6 formula. The highest stacked bar is for E V O.

The charts present the numerical prediction error (a), absolute error (b), the percentage of eyes within the specified prediction errors (c), and the quality metrics (d). The colored boxes in (a) are for the second and third quartiles while the error plot are for the first and fourth quartiles. The 2 dashed lines are the upper and lower standard deviations. The 3 colored boxes in (b) are the first, second, and third quartiles and the black and blue dashed lines are the mean and median absolute errors. The stacked histograms in (d) are the components of quality metrics. The lower the total column the better. The circles represent the subgroup PI. The higher the better. The details of the charts are tabulated in Table 33.10. The formulas in Table 33.10 are arranged in order of their subgroup PI ranking. n is for the number of eyes. ME and SD E are the means and standard deviations of numerical prediction errors, respectively. MAE and MedAE are the mean and median absolute errors. ± 0.50 and ± 1.00 are the percentage of eyes within 0.5 and 1.0D target refractions, respectively

Table 33.10 This table shows the modified Haigis performance indices of the various formulas [7]

Monovision is a fairly common practice to reduce spectacles dependency. Turnbull et al. looked at the accuracy of various formulas when targeting ametropia [26]. They used a single IOL (SN6ATx ) with the constants optimized for the entire dataset. 88 patients planning for monovision were recruited for the study with one eye targeting distance and the other for −1.25 D for near (Table 33.11, Fig. 33.14). Postoperative refractions were done 4 weeks postoperatively.

Table 33.11 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively
Fig. 33.14
A dual-axis scatter plots and stacked bar graphs. It plots 2 ranges of values versus a set of formulas. 2 datasets represent P I D and P I N and are plotted on and above the stacks. In graph A, the highest stacked bar is for Hoffer Q and N and the lowest for Barrett D.

The stacked histogram shows the quality metrics of the formulas with a different refractive target. -D is for distance target and -N for near (−1.00D). The scale for the stacked histogram is on the left. The lower the stacked histogram, the better is the formula. The circles and triangles are for the performance index (PI). The scale for PI is on the right. The higher the PI score, the better

The formulas perform better when targeting emmetropia than they do for ametropia. BUII and RBF 2.0 were similar in their accuracy and had the least difference between targeting emmetropia and targeting for near. BUII had 87.5% and 86.4%, while RBF had 86.4% and 81.8% within ±0.50 D for distance and near respectively. While Haigis and SRK/T had more than 80% (Haigis, 85.2% and SRK/T, 83.0%) within ±0.50 D for distance, that figure dropped down to 69.3% and 70.5% for near respectively. The differences were statistically significant. Holladay 1 and Hoffer Q had less than 70% for both distance and near eyes. The paper highlighted the decrease in accuracy when targeting ametropia as opposed to emmetropia in IOL power calculation. BUII and RBF were the least affected by this phenomenon.

In the year following his earlier study on short eyes, Gökce et al. published another paper looking into the accuracy of 8 different formulas with different ACDs in patients with normal ALs [47]. Gökce et al. stratified the ACD into 3 groups: ≤3.0 mm, >3.0 to <3.5 mm, and finally ≥3.5 mm. Only patients with AL between 22.0 and 25.0 mm were recruited in this study. For the medium ACD group, all formulas had mean prediction error values that were close to zero. In the shallow ACD and deep ACD groups, BUII, Holladay 2, Haigis, and Olsen4P had mean prediction errors that were not significantly deviated from zero. BUII had the lowest MAE in all 3 ACD groups. It had the lowest MedAE (0.18 D) in the shallow ACD group and next to the lowest (0.21 D) in the deep ACD group. BUII, Haigis, and Holladay 2 (with and without refraction) were noted to have no bias against ACD. RBF 2.0 was good for medium and large ACD groups. Olsen4P was good for shallow and deep ACD groups. The study noted that when the mean numerical PE for each formula for the dataset was optimized to zero, the MedAE for BUII, Haigis, Holladay 1 and 2, Olsen, and RBF 2.0 were found to have no differences. The paper inferred that ACD was an important variable in the accuracy of IOL power calculation and that multiple-variable formulas were more accurate than 2-variable formulas (3rd generation).

Hipólito-Fernandez also looked at the impact of ACD and LT on the accuracy of the formulas [48]. Like Gökce, they divide the ACD into 3 similar groups. They included ALs between 22.0 and 26.0 mm. This is a single IOL (SN60WF) with LenStar LS900 (Haag-Streit AG, Köniz, Switzerland) as the preoperative biometer. 695 eyes of 695 patients were recruited. Postoperative refraction was done at 4 weeks. Their conclusion was the new generation formulas, particularly Kane, PEARL-DGS and EVO 2.0 were more reliable and robust across the various ACD and LT combinations.

From the 2 stacked histograms, the newer formulas such BUII, Kane, PEARL-DGS, and EVO 2.0 were more precise and robust than the third-generation theoretical formulas (Table 33.12, Fig. 33.15). For normal ACD (3.0 to 3.5 mm) most formulas perform well. It was in the shallow and deeper ACDs that we see the new formulas perform more consistently better. Without requiring ACD as a parameter, most of the third-generation formulas were unable to take ACD variation into account.

Table 33.12 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error respectively. Holladay 2 PreSurgRef and Holladay 2 NoRef refer to Holladay 2 formula with and without preoperative refractions, respectively. Olsen2P and Olsen4P are Olsen using 2 parameters and 4 parameters to determine ELPs, respectively. Olsen2P is preinstalled in biometers, while Olsen4P is also known as Olsen standalone and is available in the program, PhacoOptics
Fig. 33.15
2 dual-axis scatter plots and stacked bar graphs. They plot 2 ranges of values versus a set of formulas. The dataset represents performance indexes and is plotted above the stacks. In graph a, the highest stacked bar is for Hoff Q is less than or equal to 3.0 millimeters.

The stacked histograms show the quality metrics of the formulas with different ACDs. Chart (a) and (b) are based on Gökce et al. [47] and Hipólito-Fernandez et al. [48] respectively. Each formula is divided into 3 ACD groups (≤3.00 mm; 3.00 to 3.50 mm; ≥3.50 mm). The scale for the stacked histogram is on the left. The lower the stacked histogram, the better is the formula perfformance. The circles and triangles represent the performance index (PI). The scale for PI is on the right. The higher the PI score, the better. BUII = Barrett Universal II, Hai = Haigis, Hoff = Hoffer Q, Holl = Holladay 1, PEARL = PEARL-DGS, RBF = RBF 2.0

From Fig. 33.16, the newer formulas such as Kane, EVO 2.0, PEARL-DGS, and BUII show remarkable robustness between the 3 subgroups of LT (≤4.19 mm; 4.20–4.76 mm; ≥4.77 mm) and show good precision overall. The third-generation formulas were sensitive to thin and thick lens thickness.

Fig. 33.16
2 scatter charts of values ranging from 0.2 to 0.55 versus 9 formula. Graph a plots 3 sets of data. A C D is less than or equal to 3.0 millimeters. 3.0 is less than A C D is less than 3.5 millimeters. A C D greater than or equal to 3.5 millimeters. In each graph, the dataset is dispersed randomly.

The line graphs show the relationship of the mean (a) and median (b) absolute errors with varying ACDs and LTs. BUII = Barrett, Hai = Haigis, Hoff = Hoffer Q, Holl = Holladay 1, PEARL = PEARL-DGS, RBF = RBF 2.0

Ray Tracing and Intraoperative Aberrometry

Hoffmann et al. looked at the benefits of raytracing IOL power calculation for 3 aspheric-correcting IOLs in 2013 [49]. The study compared the outcomes of 308 eyes of 185 patients using Okulix ray-tracing software (version 8.79) with Hoffer Q, Holladay 1, and SRK/T. All preoperative measurements were done with LenStar and the one-month postoperative refractions were used. The constants of the third-generation formulas were optimized. The ray-tracing calculation with offset correction (mean error adjust to zero) had the lowest SD/MAE/MedAE of 0.37D/0.30D/0.24D compared to the third-generation formulas. Raytracing with offset correction had the highest percentage (81.1%) of eyes within ±0.50 D of prediction error. The paper commented that raytracing reduced the number of outliers in calculating IOL powers.

Raufi et al. published a paper looking into the outcomes of intraoperative aberrometry and comparing it with BUII and RBF [50]. 949 virgin eyes of 949 patients with 4 different IOLs were included in this study. Preoperatively, all eyes were measured with Lenstar LS 900, and postoperatively, all eyes were refracted no earlier than one month. Overall, BUII had the lowest MAE/MedAE with 0.29 D and 0.23 D, respectively. BUII had the highest percentage of eyes within ±0.50 D, 84.0%. They concluded that there was no significant difference between ORA [51] and the 2 preoperative IOL formulas.

The accuracy of intraoperative aberrometry in short eyes was studied by Sudhakar et al. [52]. Using ULIB constants, the subjects in the retrospective study were implanted with 6 different IOLs. Preoperatively, measurements were done with IOLMaster 500 PCI, and refractions were done between 20 and 60 days postoperatively. Except for Haigis (+0.26 D), most of the formulas had mean prediction errors that were insignificantly different from zero. RBF and ORA had the lowest MAE with 0.49 D and 0.48 D and the highest percentage of eyes within ±0.50 D, 60.8%, and 58.8%, respectively. The conclusion was that ORA was equivalent to the best preoperative IOL formulas.

Even More Parameters

Table 33.13 is a summary of outcomes in the literature as well as papers presented at conferences on other parameters affecting IOL power calculation. As with the earlier table, the orders of the formula for each source are sorted in order based on a modification of Haigis “Quality metrics for comparing IOL calculation formulas.”

Table 33.13 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error respectively

The stacked histogram (Fig. 33.17) shows how the formulas fare in 4 articles, all of which are ranked. Each box indicates the number of times the formula is being ranked. Blue is for 1st; magenta for 2nd ranking; turquoise for 3rd, and yellow for 4th. The dotted line joins the number of ranked studies the formula was being compared tp. BUII was the most quoted and had demonstrated good precision. Ray tracing (including Okulix) and intraoperative aberrometry (ORA) had shown results as good but not better than the newer formulas.

Fig. 33.17
A stacked bar and line combination graph. It plots values ranging from 0 to 4 versus 9 different formulas. The stacks represent ranks from 0 to 4. It depicts that the maximum value is for the second rank stack with Hill R B F. The line represents the source, which overlaps the bars.

Stacked histogram comparing the performance indices intraoperative aberrometry, ray tracing methods with the more more popular formulas of determining IOL power

Elderly

The impact of the formulas on elderly patients was investigated by Reitblat et al. [30]. Her cohort of 90 eyes from 90 patients was measured with IOLMaster PCI. All patients were implanted with SN60WF and postoperative refractions were carried out at 1 to 3 months postoperatively. There were 2 arms to the study; one for the age group of 75–84 years old and the other was 85 years old or older. For both age groups, BUII, with MAE/MedAE of 0.36D/0.31D and 0.53D/0.39D and Kane, 0.37D/0.32D and 0.56D/0.42D, respectively, were found to be the most accurate. The percentage errors within ±0.5 D for Kane were 78.26% and 65.91%; and for BUII, 82.61% and 61.36% for the younger and older age group, respectively. The rest of the formulas were Haigis, Hoffer Q, Holladay 1 and SRK/T. All formulas showed lower accuracy in the more elderly group.

The graph (Fig. 33.18) and Table 33.14) shows quite clearly that all formulas performed worse in the more elderly age group. The drops in PIs were consistent throughout the formulas. BUII and Kane were the more accurate formulas in this study.

Fig. 33.18
A dual-axis scatter plot and stacked bar graph. It plots values ranging from 0.0 to 3.2 and 0.0 to 0.8 versus 6 sets of formulas. The dataset represents performance indexes and is plotted on and above the stacks. The highest stacked bar is for Hoff Q greater than or equal to 85.

The stacked histograms show the quality metrics of the formulas on different age groups (75–84 and ≥ 85) [30]. The scale for the stacked histogram is on the left. The lower the stacked histogram, the better is the formula. The circles and triangles represent the performance index (PI). The scale for PI is on the right. The higher the PI score, the better. BUII = Barrett Universal II, Hoff = Hoffer Q, Holl = Holladay 1, PEARL = PEARL-DGS, RBF=RBF 2.0.

Table 33.14 ME, SD, MAE, and MedAE refer to mean numerical prediction error, the standard deviation of prediction error, mean absolute error, and median absolute error, respectively. This table is a summary of outcomes from Reitblat et paper [30]. As with the earlier tables, the orders of the formula for each source are sorted in order based on a modification of the Haigis “Quality metrics for comparing IOL calculation formulas”

Conclusion

The third-generation theoretical formulas were popular in the past. Hoffer Q, Holladay 1 and 2, Haigis, and SRK/T were commonly used. These were good formulas. In the last decade, newer formulas began emerging. Barrett Universal II, Hoffer QST, Kane and then RBF 2.0 are the more prominent among these newer formulas. Subsequently, more and more formulas emerged and are still emerging. These formulas, unlike the third generation, are constantly being upgraded and enhanced. These are reflected by the changing version numbers.

Generally, the newer formulas are more accurate than the third-generation formulas. BUII, EVO, RBF 3.0, Hoffer QST and Kane are more frequently being quoted and have been shown to perform better, almost across all ALs, Ks, and ACDs. The other newer formulas also show promise. With these more accurate formulas, cataract surgery is becoming truly a refractive surgery. These will also allow for newer concepts of optical design to be developed.

The above reviews are by no means, exhaustive. The rankings method used here is a modification of the Haigis quality metrics. There are other ways of ranking but this, in my opinion, is an objective and quantitative way of ranking the formulas. The parameters used are limited to the data that were made available in the papers and presentations. Finally, these reviews were on virgin eyes. Post-corneal refractive surgery, keratoconus, etc. are beyond the scope of this chapter.