FormalPara Key Summary Points

Why carry out this study?

The comparative efficacy of targeted systemic therapies for moderate to severe atopic dermatitis (AD) has not been systematically assessed using recent phase 3 data. Network meta-analysis is a useful tool for clinicians, payers, and healthcare providers to inform decision-making about various therapies when treating patients with moderate to severe AD.

The study analyzed 11 clinical trials for IGA 0/1, EASI-75, EASI-90, and Pruritus NRS (≥ 4-point improvement) at the primary endpoint (week 12 or 16) and at earlier timepoints.

What was learned from the study?

The study found that upadacitinib 30 mg daily, upadacitinib 15 mg daily, and abrocitinib 200 mg daily may be the most efficacious targeted systemic therapies across 12–16 weeks of therapy.

This NMA suggests that some targeted systemic treatment options provide greater efficacy across key disease domains, such as skin and itch responses. These findings can help healthcare providers evaluate the overall efficacy benefit of these treatments when personalizing a patient’s treatment plan. Additionally, other factors, including safety, benefit–risk, and patient preferences, should be taken into account when personalizing a patient’s treatment plan.

Introduction

Moderate to severe atopic dermatitis (AD) is characterized by extensive eczematous lesions, along with persistent and severe itch, excoriation, skin pain, and discomfort [1,2,3,4,5,6,7,8], posing a significant burden for the affected patients, their families, and society [9, 10]. Targeted systemic therapies, including abrocitinib, baricitinib, dupilumab, tralokinumab, and upadacitinib, are new potential treatments for moderate to severe AD. While there are robust clinical trial programs for these therapies, including trials in combination with topical corticosteroids (TCS) and/or topical calcineurin inhibitors (TCI), little is known about the comparative efficacy of these agents as monotherapy, i.e., without the addition of TCS and/or TCI. Additionally, there is a paucity of head-to-head monotherapy trials for these therapies [11]. Network meta-analysis (NMA) can provide indirect comparisons where head-to-head data are not available. This makes NMA a useful tool for clinicians, payers, and healthcare providers to inform decision-making about various therapies when treating patients with moderate to severe AD [12].

The objective of this study was to assess the comparative efficacy of targeted therapies as monotherapy for moderate to severe AD on the basis of a systematic literature review and NMA.

Methods

Data Source

A systematic literature review was conducted in accordance with the Cochrane Handbook for Systematic Reviews of Interventions 2019 [13], the National Institute for Health and Care Excellence Guide to the Methods of Technology Appraisal 2013 [14], and the Centre for Reviews and Dissemination Guidance for Undertaking Reviews in Health Care [15]. Databases examined are listed in the Supplementary Material. Two reviewers independently identified studies for inclusion at each stage of study selection. Discrepancies between inclusion/exclusion decisions were resolved by a third independent reviewer. Data from unpublished upadacitinib trials meeting search criteria were supplied by AbbVie. These trials have since been published [16]. This article is based on previously conducted studies and does not contain any new studies with human participants or animals performed by any of the authors.

Selection Criteria

The systematic literature search was performed on records up to 17 May 2021 and was designed to identify all phase 3 or 4 randomized controlled trials evaluating targeted therapies in adults with moderate to severe AD who had an inadequate response to TCS or TCI treatment, or for whom topical treatments were medically inadvisable (Supplementary Material). The search identified publications for targeted therapies that are approved or could gain approval. Actual or potential licensed doses were included. Detailed in the Supplementary Material are the Preferred Reporting Items for Systematic Reviews and Meta-Analyses diagram (Fig. S1), patient/population, intervention, comparison, outcomes, and study design criteria (Table S1), and quality and potential bias assessments of included trials (Table S2).

Outcomes

Efficacy outcomes included the proportion of patients achieving trial co-primary endpoints [Investigator Global Assessment (IGA) score of 0 or 1 (clear or almost clear) and reduction of ≥ 2 points from baseline; proportion of patients achieving Eczema Area and Severity Index (EASI) improvement ≥ 75% from baseline (EASI-75)], EASI improvement ≥ 90% from baseline (EASI-90), and ≥ 4-point improvement on Pruritus Numerical Rating Scale from baseline (ΔNRS ≥ 4) [17,18,19]. Different doses were considered independent treatment options. Outcomes were evaluated at the primary endpoint timepoint of each trial (week 12 for abrocitinib, week 16 for all other therapies), as well as at week 4 and week 8. Additionally, ΔNRS ≥ 4 and EASI-75 were compared at week 2, the earliest timepoint for which most treatments reported these outcomes. Values for earlier timepoints that were only available in figures were extracted using DigitizeIt software, version 2.5 [20].

Feasibility Assessment

NMA feasibility was assessed per Cope et al. [21]. Network connectivity of included trials was checked. Relevant study and baseline patient characteristics, including potential treatment effect modifiers (age, gender, duration of disease, EASI, IGA, itch), and mean placebo outcomes were compared to identify potential sources of cross-study heterogeneity [22,23,24].

Statistical Analysis

NMAs were conducted in a generalized linear model (GLM) framework using Bayesian Markov chain Monte Carlo (MCMC) simulations with four chains of 100,000 posterior iterations each. NMAs were run in JAGS (version 4.3.0) via R statistical software (R Foundation for Statistical Computing, Vienna, Austria; version 4.0.2) using the bnma package (version 1.4.0) [25,26,27,28]. Convergence was assessed with the Brooks–Gelman–Rubin method using potential scale reduction factor (PSRF). Fixed effects, random effects, and baseline risk-adjusted models were evaluated [28, 29], and NMA consistency assumptions were checked [29] to identify the best-fitting models. Placebo-unadjusted response rates, numbers needed to treat (NNT), odds ratios, and Surface Under the Cumulative RAnking curve (SUCRA) scores were estimated [29]. Statistical significance was determined by examining calculated odds ratio 95% credible intervals.

Results

Systematic Literature Review

Of 3415 unique records identified, 500 publications were assessed for eligibility and six records representing 9 unique studies were extracted [30,31,32,33,34,35]. Additionally, two upadacitinib trials (Measure Up 1, Measure Up 2) were included [16]. Studies included the following targeted therapy study arms: abrocitinib 100 mg daily, abrocitinib 200 mg daily, baricitinib 2 mg daily, baricitinib 4 mg daily, dupilumab 300 mg once every 2 weeks, tralokinumab 300 mg once every 2 weeks, upadacitinib 15 mg daily, and upadacitinib 30 mg daily.

For the network analysis of outcomes at the primary endpoint timepoint, 11 unique trials encompassing 6254 patients in 28 arms were included (Table 1). For the network analysis of EASI-75 at weeks 2, 4, and 8, and IGA 0/1 at week 4 and week 8, six records and two upadacitinib trials representing 11 unique trials were extracted, encompassing 6254 patients in 28 arms [32, 35,36,37,38,39]. Two of the studies were pooled [39]. For the network analysis of ΔNRS ≥ 4 at weeks 2, 4, and 8, nine unique trials of 4658 patients in 24 arms were analyzed [31,32,33, 36,37,38,39]. Two of the studies were pooled at week 8 [39]. Tralokinumab was excluded as a treatment option, given that no trials reported ΔNRS ≥ 4 at those timepoints. For the network analysis of EASI-90 at week 4 and week 8, ten unique trials encompassing 5961 patients in 26 arms were included [32, 35, 37,38,39]. Two of the studies were pooled [39].

Table 1 Overview of studies for the network meta-analysis

Network Meta-Analysis

All trials were placebo controlled (Fig. 1) and generally comparable on the basis of enrollment inclusion and exclusion criteria and potential treatment effect modifiers (Table 1). Some differences were observed in placebo outcomes. Baseline risk-adjusted models were estimated to account for this heterogeneity but did not improve model fit in most cases. Model consistency checks and fit diagnostics supported fixed-effects models as the best-fitting parsimonious models for all efficacy outcomes evaluated except for ΔNRS ≥ 4 at week 2, EASI-75 at week 4, and IGA 0/1 at week 8, where the fixed-effects baseline risk-adjusted model had the best fit.

Fig. 1
figure 1

Network meta-analysis diagram. Network above is for primary endpoint analysis. The ΔNRS ≥ 4 network of the week 2 analysis is identical to the above except without ECZTRA 1 and ECZTRA 2 (tralokinumab) as these trials did not report ΔNRS ≥ 4 at week 2. The EASI-75 network of the week 2 analysis is identical except with pooled SOLO 1 and SOLO 2 data as reported in Thaçi et al. [39]. EASI Eczema Area and Severity Index, NRS Numerical Rating Scale, ΔNRS ≥ 4 Pruritus Numerical Rating Scale reduction of ≥ 4 points from baseline

Results at the primary endpoint timepoint are presented in Table 2. The IGA 0/1 response rate was highest for upadacitinib 30 mg, followed by upadacitinib 15 mg, abrocitinib 200 mg, and dupilumab (Fig. 2). For ΔNRS ≥ 4, upadacitinib 30 mg also had the highest response rate, followed by abrocitinib 200 mg, upadacitinib 15 mg, and dupilumab. This rank order was also observed for EASI-75 and EASI-90 response rates (Fig. 3). Response rates for all efficacy outcomes at the primary endpoint are also shown in Fig. S2. Response rate rankings were the same as NNT and SUCRA score rankings. NNT and SUCRA score rankings for all efficacy outcomes at the primary endpoint are also shown in Figs. S3 and S5.

Table 2 Odds ratios versus placebo, NNT, response rate, and SUCRA scores, at week 2 and primary endpoint timepoint (NMA fixed-effects results*)
Fig. 2
figure 2

IGA 0/1 versus ΔNRS ≥ 4 absolute response rate estimates for moderate to severe atopic dermatitis (primary endpoint timepoint). ΔNRS ≥ 4 Pruritus Numerical Rating Scale reduction of ≥ 4 points from baseline, IGA Investigator Global Assessment for Atopic Dermatitis

Fig. 3
figure 3

EASI-75 and EASI-90 absolute response rate estimates for moderate to severe atopic dermatitis (primary endpoint timepoint). EASI Eczema Area and Severity Index

The odds ratios of all targeted therapies were statistically more efficacious than placebo for each outcome assessed at the primary endpoint timepoint (Table 2) with statistical differences observed between some targeted therapies (Table S3). For IGA 0/1, upadacitinib 30 mg was statistically more efficacious than all other therapies. For EASI-75, EASI-90, and ΔNRS ≥ 4, upadacitinib 30 mg was statistically more efficacious than all other therapies except for abrocitinib 200 mg. For select outcomes, upadacitinib 15 mg was statistically more efficacious than abrocitinib 100 mg (IGA 0/1), baricitinib 2 mg (all outcomes), baricitinib 4 mg (IGA 0/1, EASI-75, EASI-90), dupilumab (IGA 0/1, EASI-75, EASI-90), and tralokinumab (all outcomes). Abrocitinib 200 mg was statistically more efficacious than abrocitinib 100 mg (all outcomes), baricitinib 2 mg (all outcomes), baricitinib 4 mg (EASI-75), dupilumab (EASI-75), and tralokinumab (all outcomes).

Results at week 2 are presented in Table 2. The EASI-75 response rate was highest for upadacitinib 30 mg, followed by upadacitinib 15 mg, abrocitinib 200 mg, and baricitinib 4 mg, whereas the ΔNRS ≥ 4 response rate was highest for upadacitinib 30 mg, followed by abrocitinib 200 mg, upadacitinib 15 mg, and abrocitinib 100 mg. Response rates and NNT for EASI-75 and ΔNRS ≥ 4 at week 2 are also shown in Fig. S4. The EASI-75 odds ratios indicate that all therapies except for tralokinumab were statistically more efficacious than placebo, with upadacitinib 30 mg statistically more efficacious than all other therapies except abrocitinib 200 mg (Table S4). The ΔNRS ≥ 4 odds ratios indicate that all targeted therapies analyzed were statistically more efficacious than placebo, with both upadacitinib 30 mg and abrocitinib 200 mg statistically more efficacious than all other remaining therapies (Table S4). Results at other early timepoints are available in the Supplementary Material (Fig. S7, Table S6-S7).

Discussion

NMA allows for the simultaneous comparison of interventions that were not directly compared in head-to-head randomized controlled trials [40]. It can be very useful in ranking interventions in order of their relative efficacy [41]. When performed correctly, NMA can be an essential tool for decision-makers in the healthcare field to draw conclusions from the cumulative scientific evidence [42].

Skin and itch response are very important clinical responses for patients. This NMA found that monotherapy with upadacitinib 30 mg daily had the highest efficacy at the primary endpoint evaluation, followed by abrocitinib 200 mg daily (second in EASI-75, EASI-90, ΔNRS ≥ 4, third in IGA 0/1) and upadacitinib 15 mg daily (second in IGA 0/1, third in the other outcomes). All targeted therapies were superior to placebo at the primary endpoint. At all earlier timepoints analyzed for EASI-75, EASI-90, and IGA 0/1 (weeks 2, 4, and 8 for EASI-75, weeks 4 and 8 for EASI-90 and IGA 0/1), upadacitinib 30 mg daily had the highest efficacy, followed by upadacitinib 15 mg daily and abrocitinib 200 mg daily. For ΔNRS ≥ 4, upadacitinib 30 mg daily had the highest efficacy at weeks 2, 4, and 8, followed by upadacitinib 15 mg daily (third at week 2, second at weeks 4 and 8), abrocitinib 200 mg daily (second at week 2, third at week 8), and baricitinib 4 mg daily (third at week 4).

Drucker et al. published an ongoing NMA of patients with moderate to severe AD [43, 44]. They included randomized controlled trials with potentially heterogeneous baseline patient severity and concomitant medication use but did not report the same outcomes as in our research. Siegels et al. published a meta-analysis comparing outcomes for 13 different treatments for moderate to severe AD, but did not perform an NMA, making comparison of targeted therapies more challenging [45]. Silverberg et al. published an NMA comparing systemic therapies in monotherapy and combination therapy for AD but did not include data after October 2019, and so excluded the phase 3 upadacitinib trial program [46]. This analysis focused only on targeted therapies studied as monotherapy and considered the primary endpoints used in clinical trials (IGA 0/1 and EASI-75) and itch, the hallmark symptom of AD. This analysis also provided novel examination of early treatment benefits. Our research was based on an NMA to make treatment comparisons more directly comparable and investigated models with baseline risk adjustment to account for placebo response heterogeneity.

When it comes to personalized treatment selection for AD, many factors need to be considered, including age, childbearing age, treatable traits, patient needs, concomitant therapies, and appropriate treatment targets [47, 48]. Some patients may also prefer to use monotherapy, or may prefer not to use topical treatments. Treatments that provide rapid and greater efficacy across multiple disease domains may better align with personalized treatment expectations and benefit more patients. This NMA suggests that some targeted systemic treatment options provide greater efficacy across key disease domains, such as skin and itch responses. These findings can help healthcare providers evaluate the overall efficacy benefit of these treatments when personalizing a patient’s treatment plan.

In addition to efficacy, the choice of therapy is based on safety and benefit–risk. Further analysis is needed to assess the relative safety of targeted systemic treatments for AD, their benefit–risk profiles, and how these relate to patient preferences. These areas of future research can better inform shared decision-making processes.

Limitations

If any of the assumptions of an NMA, including network connectivity, homogeneity, and transitivity or consistency, are violated, its conclusions may be invalid [49]. Additionally, NMAs are susceptible to the methodological quality of included studies, reporting biases, and choices of study eligibility criteria [50]. NMAs are not substitutes for multiple head-to-head randomized controlled trial comparisons. We tested the assumptions in this NMA, tried multiple modeling approaches to ascertain best fit, and reviewed the quality of the underlying trial data to mitigate these limitations. NMAs use aggregated statistics from studies and not individual data.

Specific limitations to this study include variability in the primary endpoint timepoint across trials (12 weeks for abrocitinib and 16 weeks for the other therapies). Additionally, we assessed efficacy at earlier timepoints and did not find evidence of differing treatment effects from those observed at the primary endpoint assessment.

There appears to be some heterogeneity in placebo response rates across trials, though baseline risk-adjusted models accounting for this heterogeneity did not provide a better fit for all outcomes assessed except for ΔNRS ≥ 4 at week 2, EASI-75 at week 4, and IGA 0/1 at week 8. There are other sources of heterogeneity in the analyzed trials, including prior corticosteroid exposure and inclusion of adolescent patients in some trials. This analysis also excluded trials of patients receiving TCS or TCIs in combination with targeted therapies for AD, which may mimic current real-world use of treatments. A substantial set of trial results assessing targeted therapies in this patient population are available. Future research will examine their relative efficacy.

There are some nuanced cross-trial differences in outcome methodologies. For example, all trials utilized a five-level IGA scale that included “clear” and “almost clear,” though the descriptions of each level were not identical across trials. Similarly, all trials evaluated patient-reported itch on an 11-point NRS, though the questionnaire text differed slightly across trials. Specific to the abrocitinib trials, EASI and IGA excluded the scalp, palms, and soles from the assessment [51]. Although the results presented here utilize efficacy rates based on nonresponder imputation, the upadacitinib trials also employed a multiple imputation process to account for missing data due to the COVID pandemic [52, 53].

This NMA is over a relatively short period of treatment, up to 16 weeks. Long-term trials for these therapies are underway. Of note, long-term trials may present methodological challenges for NMAs due to attrition and differences in design.

Finally, this NMA focused on select efficacy outcomes and did not evaluate overall symptom severity, quality of life, or safety outcomes. Safety information for the therapies should be studied carefully, with attention to the risk–benefit of each treatment option. Future research in this area is warranted to better understand the risk–benefit profile of these therapies.

Conclusion

In this NMA, looking at targeted systemic therapies for AD used as monotherapy, upadacitinib 30 mg daily appears to be the most efficacious targeted therapy, followed by abrocitinib 200 mg daily and upadacitinib 15 mg daily, after 12 or 16 weeks of therapy. Relative differences in efficacy were apparent as early as week 2 of treatment, indicating the potential for early response. While upadacitinib appears to be the most efficacious therapy, other factors, including safety, benefit–risk, and patient preferences, should be taken into account when personalizing a patient’s treatment plan.