FormalPara Key Summary Points

Why carry out this study?

The current standard of care (SoC) options for treatment of diabetic macular oedema (DME) are anti-vascular endothelial growth factor (anti-VEGF) intravitreal injections (IVT) such as ranibizumab, aflibercept and bevacizumab (not licensed) used in flexible treatment regimens such as Pro re nata (PRN) and Treat & Extend (T&E)

We conducted a network meta-analysis (NMA) to compare the efficacy and safety of faricimab to other therapies typically in use for treatment of DME

Network meta-analyses (NMA) allow treatments that are not compared directly within randomized controlled trials (RCTs) to be compared indirectly

What was learned from the study?

Faricimab T&E is associated with superior or comparable visual outcomes in terms of best-corrected visual acuity (BCVA) and superior anatomical outcomes in terms of decreasing retinal thickness against all other treatments given in flexible dosing regimens, which are the standard of care in clinical practise, whilst offering a low treatment burden

Introduction

Diabetes mellitus (DM) is a global epidemic, and its prevalence is expected to increase from 463 million in 2019 to 700 million in 2045 [1]. Diabetic retinopathy (DR) is a common microvascular complication of diabetes that can lead to vision loss and blindness and is one of the leading causes of vision impairment in the working age population [2, 3]. Diabetic macular oedema (DME) is a serious manifestation of DR characterised by exudative fluid accumulation in the macular and is the primary cause of central vision loss among patients with DR [4, 5]. Around 7% of patients with diabetes are affected by DME, and risk of DME increases with severity of DR [6].

Intravitreal injections (IVT) of anti-vascular endothelial growth factor (anti-VEGF) agents such as ranibizumab, aflibercept and bevacizumab (not licensed for ophthalmic use) represent the preferred treatment for DME [7]. These treatments have replaced IVT corticosteroids and laser photocoagulation as the current standard of care for patients with DME [8]. However, real-world outcomes following anti-VEGF therapy for DME lag behind those noted in clinical trials [9]. The need for frequent treatment visits has been associated with non-adherence to anti-VEGF treatment and subsequently worsening visual outcomes [10].

Faricimab is a novel dual-pathway inhibitor of VEGF-A and angiopoietin-2 (Ang-2), which is thought to promote vascular stability and reduce inflammation [11]. In two pivotal phase III trials, YOSEMITE and RHINE, after the loading phase faricimab T&E was applied in a personalized treat-and-extend- based regimen (T&E). Intervals were extended by 4 weeks up to every 16 weeks (Q16W), maintained or reduced by 4 or 8 weeks (as low as Q4W) based on prespecified central subfield thickness (CST) and best-corrected visual acuity (BCVA) criteria. Faricimab T&E demonstrated non-inferiority to aflibercept given every 8 weeks (Q8W) in the primary endpoint of change in best-corrected visual acuity (BCVA) at 12 months in the treatment of DME. Faricimab T&E also demonstrated greater improvement for anatomical outcomes, i.e. reduction in CST, absence of intraretinal fluid (IRF) and absence of DME [12].

YOSEMITE and RHINE provide insight into the efficacy and safety of faricimab T&E relative to aflibercept Q8W. However, indirect comparisons of faricimab T&E with other treatments and particularly flexible dosing regimens such as T&E and Pro re nata (PRN) are needed, given that these are typically used in clinical practise. The objective of this study was to estimate the relative treatment effects of faricimab and other therapies that are currently in use for treatment of DME, based on available evidence from randomized controlled trials (RCT). Network meta-analyses (NMA) allow treatments that are not compared directly within RCTs to be compared indirectly.

Methods

Search Strategy

A systematic literature review (SLR) of RCTs investigating the efficacy and safety of treatments for DME was conducted using electronic databases including Embase, MEDLINE, DARE and the Cochrane library, as well hand-searching supplementary sources including conference proceedings between January 2017 and August 2021, HTA agency websites, clinical trial registries, key government/international bodies, reference lists of included publications and related SLRs. The electronic sources were last searched on August 19, 2021, and the following treatments were included in the search strategy: faricimab, ranibizumab, brolucizumab, aflibercept, bevacizumab, dexamethasone intravitreal implants, laser therapy and placebo/sham. There was no restriction on country of origin or language of publication. Previously published systematic reviews and meta-analyses were included for the purpose of bibliographic searches to identify relevant primary studies. The search strategy can be found in the Supplementary Material.

This SLR was conducted in compliance with published guidelines issued by the Cochrane Collaboration, the Centre for Reviews & Dissemination (CRD; York, UK), the National Institute for Health and Care Excellence (NICE), UK, and the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) statement [13,14,15,16]. This study was exempt from ethics approval as it is based on previously conducted studies and does not contain any new studies with human participants or animals performed by any of the authors.

Study Population and Selection Criteria

The selection criteria for the NMA are presented in Table 1. Adults with DME were included, and studies investigating patients with diabetic retinopathy without DME or macular oedema not associated with diabetes mellitus were excluded. RCTs of at least 48 weeks or 12 months were included, along with open-label extensions of RCTs up to 24 months. Treatments included only licensed and/or standard doses of faricimab and its comparators. Efficacy outcomes of interest included mean change in BCVA and mean change in CST. The safety outcomes of interest included overall treatment discontinuation/withdrawal and ocular adverse events (AE). The mean number of injections was also evaluated.

Table 1 PICO framework for the network meta-analysis

Study Screening

Titles and abstracts were screened by two independent reviewers, as were the full text articles. Any disputes as to eligibility for inclusion were referred to a third party.

Data Extraction and Quality Assessment

Data from included studies were extracted into a predesigned data extraction table (DET) by a single reviewer and quality checked by a second reviewer. Disputes were referred to a third party. Quality and risk of bias were assessed using the seven-criteria checklist provided in Sect. 2.5 of the NICE single technology appraisal user guide [17].

Synthesis and Statistical Analysis

Following the identification of relevant studies, a feasibility assessment was conducted to determine whether it was possible and appropriate to conduct network meta-analyses, resulting in a network of studies for each outcome of interest. NMAs enable the simultaneous comparison of multiple interventions that have not been directly compared in RCTs. All networks considered monotherapies only (IVT monotherapies for the number of injections outcome). If a study had two arms that were classed as the same treatment node, results from those two arms were pooled using standard meta-analytical methods.

Network meta-analyses were conducted and are presented here for the following outcomes: mean change in BCVA score, mean change in CST, mean number of injections, ocular adverse events and all cause discontinuations. If CST was missing for a study but one or more other anatomical outcomes were reported, the other value was used as a proxy in the following order of preference: central retinal thickness (CRT), central foveal thickness (CFT) or central macular thickness (CMT). Other outcomes analysed presented in the supplemental material were proportion of patients losing or gaining ≥ 10/15 ETDRS letters, serious ocular adverse events, systemic serious adverse events and discontinuations due to adverse events.

The NMAs were conducted under a Bayesian framework using a random effects model as base case. Changes from baseline in BCVA score, CST and number of injections were modelled as normally distributed data, using the arm level mean change from baseline (or for number of injections, the mean number of injections since baseline) as the outcome. All other endpoints were modelled using a binomial likelihood with logit link.

Vague priors were used for all parameters (see Supplementary Material). Sensitivity analyses around model assumptions include a Bayesian fixed-effect model and exclude studies where laser rescue therapy could potentially have influenced the BCVA score. Estimates were calculated using Markov Chain Monte Carlo (MCMC) sampling in R version 3.6.1 and JAGS version 4.3.0 [18, 19]. Three parallel chains were run with at least 40,000 iterations, a burn-in of at least 10,000 and a thinning parameter of at least 10. Convergence was assessed via review of trace plots, posterior density plots, effective sample size and Brooks-Gelman-Rubin statistics, and parameters were increased if necessary to improve convergence. For non-convergence due to rare events, a continuity correction was applied. The deviance information criterion (DIC) was used to compare the relative fit of the models, and models with lower DIC were preferred. Random effect models are used as base case given that the presence of between study heterogeneity is plausible. Differences in DIC < 5 points were not considered meaningful. To assess the absolute fit, the total residual deviance was calculated and compared against the total number of independent data points.

The NMA models for the primary outcome of change in BCVA score were extended by incorporating the following patient characteristics as covariates via meta-regression analyses [20]: BCVA at baseline and CST at baseline. Fixed and exchangeable interaction models were fit using aflibercept as the interaction reference treatment.

Particular subgroups of interest were patients’ naïve to anti-VEGF and experienced with anti-VEGF, but since few studies reported outcomes in the anti-VEGF experienced population, only the anti-VEGF treatment naïve population was evaluated for outcomes with sufficient data and a connected network.

To assess inconsistency of direct and indirect evidence in the Bayesian framework, inconsistency models found in NICE DSU TSD 4 were used [21]. The DIC and residual deviance were used to compare the fit of the standard (consistency) and inconsistency models, with a lower or similar DIC for the standard model indicating a better fit to the data and no evidence of substantial inconsistency.

Output from the NMAs included estimates and 95% credible intervals (CrI), the surface under the cumulative ranking curve (SUCRA) and probability that each treatment performs best, along with the probability of non-inferiority using a four-letter threshold following the definition in YOSEMITE and RHINE [12]. Treatments are referred to as statistically different when the 95% credible interval excludes zero for difference outcomes or one for ratio outcomes.

Results

Systematic Literature Review

The electronic database search identified 6923 citations in total, and 65 publications were identified through other sources. After removal of duplicates, 5319 citations remained, and after title and abstract screening, 682 full-text publications were assessed for eligibility. In total, 140 publications reporting 86 studies were deemed eligible for inclusion in the SLR. Of these, 130 were full publications, 7 were conference abstracts and 3 were trial registry records. The PRISMA diagram in Fig. 1 depicts the process of study identification and selection.

Fig. 1
figure 1

PRISMA diagram of the studies included in the systematic literature review

After applying the PICOS (population, intervention, comparators, outcomes and study design) criteria for inclusion in the NMA, 26 studies formed a connected network and were considered appropriate for the NMA at 12 months, the details of which can be found in Table 2. Of the 86 studies identified in the SLR, 36 were excluded for treatment-related reasons, 4 were not connected to the faricimab T&E network, 13 were excluded for investigating unlicensed combination regimens and 4 for other reasons. Although identified by the SLR, the RIDE, RISE and Ahmadieh (2008) studies report outcomes at 24 months only and therefore were not included in this report of outcomes at 12 months [22, 23].

Table 2 Details of the 26 trials included in the network meta-analysis

Results of the study quality assessment indicated that the studies were generally of moderate to high quality, and the majority reported clear details on patient population and prognostic factors at baseline as well as methodological details regarding treatment regimens and missing data. Included studies were of low to medium risk of bias and sensitivity analyses were conducted excluding high risk of bias study showing consistent results (more details are provided in Fig. S9 in the supplemental material).

The network of studies reporting a mean change in BCVA score from baseline to 12 months is displayed in Fig. 2. Networks for other outcomes included in the NMA are reported in the Supplementary Material. The 0.3 mg and 0.5 mg ranibizumab doses were merged because of evidence suggesting no difference between them when used monthly [44]. Different treatment schedules (Q4W, Q8W, PRN and T&E) were treated as separate nodes.

Fig. 2
figure 2

Network for mean change in best-corrected visual acuity (BCVA) score from baseline to 12 months. AFL aflibercept, BEV bevacizumab, DEX dexamethasone, FAR faricimab, IVT intravitreal, PBO placebo, PRN treatment as needed (pro re nata), Q4/8/16W every 4/8/16 weeks, RAN ranibizumab, T&E treat and extend. Colour-coded nodes illustrate the number of steps (pairwise comparisons) between faricimab and a given comparator to highlight how they are connected and what informs the estimation of the treatment effect. Blue = randomized controlled trial, red = 1 step, green = 2 steps, Yellow = 3 steps

Characteristics and Comparability of Included Studies

Covariates that previous studies identified as potential effect modifiers are baseline BCVA and baseline CST [45]. Although reporting of baseline CST was poor across studies, all studies reported similar baseline BCVA.

In terms of other baseline characteristics, there was some variation in age, gender and race between studies. For example, ages ranged from between 55 and 56.6 years in Fouda 2017 to between 68 and 69 years in the ROTATE trial, and the percentage of white patients ranged between 0% in REVEAL to 94–98% in RETAIN [29, 38,39,40].

Study design features that could potentially influence outcomes included permitting the use of rescue therapy for patients who require it and whether studies permitted randomization of one or both eyes to the intervention or comparator. Another study design feature that could potentially influence outcomes is the difference in treatment criteria within flexible dosing regimens. There is some variation between studies regarding loading doses and re-treatment criteria. Details of these are included in the Supplementary Material.

Network Meta-Analysis

The analyses for all outcomes were conducted using the standard NMA framework, given that there was no evidence indicating that the treatment effect differed by patient characteristics or that model fit was improved in the patient meta-regressions. No evidence of inconsistency was identified.

In general, convergence diagnostics were good, and model fit was good in most cases, with the exception of the fixed-effect model for the number of injections (including laser PRN) at 12 months. In this case, the random-effects model provided a much better fit. Random-effects models were chosen for all endpoints.

Efficacy

A statistically greater decrease in CST at 12 months from baseline was observed for faricimab T&E versus all other treatments given in flexible dosing regimens with credible intervals not crossing zero. This greater reduction in retinal thickness is considered clinically meaningful [46]. Amongst all treatments given in flexible dosing regimens, faricimab T&E had the highest probability of performing best.

The mean change in BCVA score from baseline of faricimab T&E was statistically greater than that of ranibizumab, bevacizumab, laser and dexamethasone therapy. Faricimab T&E was also associated with a numerical improvement of two letters compared to aflibercept PRN with credible intervals crossing zero. It had the highest probability of performing best amongst all treatments given in flexible dosing regimens. Accordingly, the probability that faricimab T&E was non-inferior versus all comparators was close to one, using a four-letter threshold following the definition in YOSEMITE and RHINE [12].

NMA efficacy results including mean difference in change from baseline of BCVA score and mean difference in change from baseline of CST are presented in Fig. 3. Additional results including median rank, probability of each treatment performing best and surface under the cumulative ranking curve (SUCRA) are contained in the Supplementary Material for all efficacy outcomes.

Fig. 3
figure 3figure 3

Forest plot of results obtained through the network meta-analysis for A mean change in BCVA at 12 months from baseline, B mean change in central subfield thickness (CST) at 12 months from baseline, C injection frequency at 12 months, D ocular adverse events at 12 months and E all cause discontinuation. AFL aflibercept, BEV bevacizumab, DEX dexamethasone, FAR faricimab, IVT intra-vitreal treatment, LP loading phase, OLE open label extension, PBO placebo, PRN pro re nata, Q6/8/12/16W every 6/8/12/16 weeks, RAN ranibizumab, CrI credible interval

Injection Frequency

NMAs for injection frequency at 12 months were constructed with and without laser PRN. Treatment effects from networks with and without laser PRN were deemed similar, but since between-study heterogeneity was greater when laser PRN was included, the network without laser PRN was preferred.

Injection frequency was statistically lower for faricimab T&E compared to other treatments given in fixed Q4W or Q8W regimens. Faricimab T&E was also associated with numerically fewer injections versus other treatments given in flexible dosing regimens. Accordingly, faricimab T&E had the highest probability of performing best (83.3%) in terms of reducing injection frequency. NMA results for other comparators are shown in Fig. 3, and additional results including median rank, probability of each treatment performing best and the SUCRA are contained in the Supplementary Material.

Safety

Given the rare occurrence of ocular adverse events and treatment discontinuations as well as the limited available evidence, there is uncertainty in the model and the results should be interpreted with caution. Considering these limitations, the occurrence of ocular adverse events of faricimab T&E was statistically lower or comparable to all regimens. The results for the numbers of all-cause discontinuations were not statistically different.

Numbers and percentages of safety outcomes per treatment in each trial can be found in the Supplementary Material. NMA results for both the ocular adverse events and all cause discontinuations are shown in Fig. 3. Median rank, probability of each treatment performing best and the SUCRA are contained in the Supplementary Material.

Sensitivity and Subgroup Analyses

There were no meaningful differences in terms of model fit or results between the fixed- and random-effects models for each outcome, with the exception of mean number of injections at 12 months as discussed in Sect. 4.4. The fixed-effects model for the mean number of injections network, particularly when including laser PRN, was found to be a poor fit to the data, with a larger residual deviance than number of data points. The random effects model was a much better fit for this outcome.

A sensitivity analysis was conducted for vision outcomes by excluding studies with laser rescue therapy (studies excluded were DA VINCI, DRCR Network Protocol T, RESOLVE, TREX-DME) and results were found to be consistent with the base case analysis [25, 26, 35, 41].

Treatment-naïve networks could be formed for mean change in BCVA score. Results for the anti-VEGF treatment-naïve population were consistent with the results of the whole population.

Discussion

To the knowledge of the authors, this SLR and NMA provides the first indirect comparison of faricimab T&E with other therapies in use for treatment of DME. The studies identified in the SLR and included in the NMA were of moderate-to-high quality. In YOSEMITE and RHINE, faricimab T&E was compared to aflibercept Q8W, meaning indirect comparisons were required to understand the relative efficacy of faricimab T&E compared to other DME treatments, particularly applied in flexible dosing regimens that are most commonly used in clinical practice. Since RCTs are considered gold standard in terms of evidence, conducting an NMA was the most appropriate method to use to attain this objective.

The NMA indicated that the safety profile of faricimab T&E was comparable to those of ranibizumab and aflibercept, which have well-established safety profiles, with similar results for the number of all-cause discontinuations.

The analysis also showed that after 1 year of treatment, faricimab T&E was associated with superior disease control, as expressed by a reduction in retinal thickness, versus all other treatments applied in flexible dosing regimens. This greater reduction in retinal thickness is considered clinically meaningful (> 50 microns) [45] and may point to the role of dual-pathway inhibition of VEGF and Ang-2 in promoting vascular stability and reducing inflammation. This finding is also consistent with the direct evidence in YOSEMITE and RHINE where faricimab T&E also demonstrated greater improvements versus aflibercept Q8W for anatomical outcomes, i.e. reduction in CST, absence of intraretinal fluid (IRF) and absence of DME [12].

Anatomical outcomes determine treatment efficacy and retreatment of patients. Accordingly, faricimab T&E was associated with numerically fewer injections versus other flexible dosing regimens indicating a higher probability for a lower treatment burden. Given the loading phase in year 1 and the gradual extension of intervals (e.g. in 4-week increments in YOSEMITE and RHINE), durability in T&E regimens is expected to further improve in year 2. This effect could also be seen in the pivotal studies as more patients were able to be maintained on longer treatment intervals [47].

The mean change in BCVA score from baseline of faricimab T&E was superior to ranibizumab and bevacizumab. Faricimab T&E was associated with an improvement of approximately five letters or one line on the BCVA chart. This result is consistent with a previous RCT by Sahni (2019), in which a statistically significant gain of 3.6 letters was observed for faricimab compared with ranibizumab after 6 months of treatment [46]. Faricimab T&E was also associated with a numerical improvement compared to aflibercept PRN and had the highest probability of performing best amongst all treatments given in flexible dosing regimens.

The NMA was conducted in accordance with the most recent methodological guidelines [20, 21, 48]. This includes the employment of a Bayesian framework as well as the inclusion of sensitivity analyses where appropriate. Study populations varied in some of their baseline characteristics, such as age and race, but were deemed similar in terms of characteristics that are established to affect outcomes, such as baseline BCVA [45].

Results were broadly consistent with those of the previous NMA of anti-VEGF treatments in DME by Muston (2018) and with a more recent NMA by Wang (2022) [49, 50]. These findings along with the above elements regarding study design demonstrate that the results from this NMA provide a robust, up-to-date comparison of faricimab T&E and other treatments for DME.

Muston suggests that previous analyses that have not adjusted for baseline BCVA are subject to ecological bias [49]. This was addressed in this study by incorporating BCVA at baseline as a covariate via meta-regression. However, there was no evidence that treatment effect differed by patient characteristics or that model fit was improved; hence, all further analyses were conducted under the standard NMA framework. This study captures variation within a treatment due to different dosing regimens, whereas Wang groups treatments regardless of regimen [50].

Time equivalence was assumed between 48 and 56 weeks, 12 months and 1-year outcomes. This is because the mean change in BCVA score was measured from baseline through weeks 48–56 in YOSEMITE and RHINE. However, several trials demonstrate that gains in visual acuity in DME are usually achieved within the first months of treatment with anti-VEGF therapy, and any further therapy beyond that point typically preserves these vision gains, with no further improvement. This suggests that there was no impact on the results because of the equivalence assumption. Aspiring to include as much relevant evidence as possible, other definitions of retinal thickness (CST, CRT, CFT, CMT, in that order) were used if CST values were not reported. These definitions are often used interchangeably, and previous NMAs have used similar approaches [51].

While the results from this NMA provide a current evaluation of the comparative efficacy, injection frequency and safety of faricimab T&E relative to other relevant therapies for treatment of diabetic macular oedema, several limitations were identified. First, while baseline CRT was listed as a potential effect modifier, reporting of baseline CRT was poor across studies. However, a published NMA found that CRT was highly correlated with baseline BCVA [49], and since baseline BCVA was found to be similar across studies, this is unlikely to limit the study. In addition, limited evidence was available for subgroup analyses, as well as for AEs, which made these networks less robust. Networks for BCVA for the anti-VEGF naïve subgroup contained only nine studies, whilst networks for ocular AEs contained only ten studies.

Conclusion

The results of this study support faricimab T&E as a treatment option for DME with the potential to offer the best disease control because of dual Ang-2/VEGF inhibition, improving outcomes and preventing vision loss, which can have a significant impact on patients’ quality of life. Applied in a T&E regimen with a low treatment burden, it can give people living with DME greater independence and meet the current capacity constraints in many health systems and the expected increase in future demand for ophthalmology services [52].