FormalPara Key Summary Points

Why carry out this study?

Approximately 4 million people live with rheumatoid arthritis (RA) in China, with an illness cost of $3826/patient annually, resulting in a heavy financial burden for RA patients

The introduction of novel drugs into the Chinese market brings another option for RA patients, but its cost-effectiveness remains unknown

From the perspective of the Chinese healthcare system, whether adding baricitinib to the current treatment sequence is cost-effective for patients with moderate-to-severe rheumatoid arthritis in whom methotrexate therapy failed is investigated.

What was learned from the study?

Adding baricitinib as a first-line treatment resulted in an incremental cost-effectiveness ratio of $10,036/quality-adjusted life-years per patient and had a 100% likelihood of being cost-effective if the willingness-to-pay threshold was > $16,430 compared with the current treatment sequence

From a Chinese payer perspective, the introduction of baricitinib into the current treatment sequence is projected to be a cost-effective option for patients with moderate-to-severe rheumatoid arthritis.

Digital Features

This article is published with digital features, including a summary slide to facilitate understanding of the article. To view digital features for this article go to https://doi.org/10.6084/m9.figshare.14381429.

Introduction

Rheumatoid arthritis (RA), a chronic, debilitating and systemic autoimmune disease, brings substantial economic burdens to patients with RA and affects their quality of life. In China, the prevalence of RA is 0.28% [1], indicating that there are > 4 million RA patients; the male:female ratio is 1:4 [2]. As the clinical remission rate of RA patients is only 8.6%, there is a high rate of disability, at 50.3% [3]. Overall, RA has become the leading cause of disability and a major public health problem in China [4].

For patients with active RA, the most commonly prescribed agent is the conventional disease-modifying antirheumatic drug (cDMARD) methotrexate (MTX) [5]. However, because of poor tolerability and inadequate efficacy, MTX use is limited [6]. The American College of Rheumatology (ACR) and European League Against Rheumatism both recommend that biologic treatment be initiated in patients with RA who have a suboptimal response to cDMARDs [5, 7]. Consequently, biologic DMARDs (bDMADs), including the tumor necrosis factor inhibitors (TNFis) etanercept, infliximab, adalimumab, golimumab, and certolizumab, are widely employed in clinical practice [8]. Although the use of biologic agents has contributed significantly to effective disease control, studies have found that many patients have poor adherence and active disease and discontinue treatment because of severe adverse events (SAEs), intolerance, or other factors [9, 10].

The Janus kinase (JAK) signaling pathway is implicated in the pathogenesis of RA [11]. JAK inhibitors are collectively referred to as targeted DMARDs (tDMARDs) with bDMARDs [12]. Recent studies have demonstrated that compared to biologic inhibitors, JAK inhibitors result in greater clinical improvement or are at least non-inferior [13, 14]. BCT, an oral reversible and selective tyrosine-protein inhibitor of JAK1/JAK2 [15], was approved by the US Food and Drug Administration (FDA) in July 2018. Three clinical trials have shown the efficacy of BCT with or without MTX in patients naïve to cDMARDs and in those with intolerance to or an inadequate response to both cDMARDs and bDMARDs [16,17,18]. Recently, the RA-BEAM study, a phase 3, randomized, double-blind, placebo- and active-controlled clinical trial, compared baricitinib (BCT) + MTX with adalimumab (ADA) + MTX as a first-line strategy in patients with RA who had inadequate responses to MTX monotherapy [14]. The results of the study confirm the findings of other trials of BCT therapy [18], both supporting that BCT is associated with significant clinical improvement and introducing a new agent to the clinical treatment sequence, which may broaden treatment options for RA patients and rheumatologists [14].

Although the previous study of Max Schlueter et al. indicated that BCT is a cost-effective strategy compared with ADA for Spanish patients with active RA and that of Celine reached the same conclusion for patients with refractory RA in The Netherlands, the cost-effectiveness of BCT + MTX relative to ADA + MTX has not been assessed in China [19, 20]. New agents often come with a financial burden, even though they might improve the clinical outcome for patients [21, 22]. Therefore, it is necessary to assess the economic value of BCT for patients with moderate-to-severe RA, as BCT was approved by the Chinese National Medical Products Administration (CNMPA) in June 2019. Therefore, the aim of this study was to estimate from the perspective of the Chinese healthcare system the cost-effectiveness of adding BCT to the current treatment sequence for patients with moderate-to-severe RA in whom MTX therapy failed.

Methods

Model Structure Overview

To best reflect clinical practice, a microsimulation model in which patients transitioned through a predefined treatment sequence was constructed on the basis of individual patient-level simulation using Treeage Pro software (https://www.treeage.com/). The model tracked patients with different population characteristics over a lifetime horizon (50 years), with the follow-up time separated into 6-month cycles. Patients were moved through the treatment sequence and finally progressed into the palliative care (PC) phase according to the ACR criterion and discontinuation probability. Mortality risk, utility values, and hospitalization costs in every cycle were calculated based on HAQ scores. Health-related costs and quality of life were evaluated throughout the patient’s lifetime. Incremental cost, QALYs, and the incremental cost-effectiveness ratio (ICER) are presented as the main outcomes of the cost-effectiveness analysis.

Patient Baseline Characteristics

A total of 50,000 patients with moderate-to-severe RA who had an inadequate response to MTX were simulated in the analysis. All eligible patients were enrolled on the basis of the RA-BEAM study population, mainly including parameters of age, sex, and baseline HAQ [14]. The mean patient weight of the Chinese general population was used because of the lack of data in the RA-BEAM trial [23]. Details of the patients’ characteristics are presented in Table 1. This article is based on previously conducted studies and does not contain any new studies with human participants or animals performed by any of the authors.

Table 1 Key model parameters

Treatment Sequence of RA Patients

The current treatment sequence (comparator group) of the study and comparator groups we set for RA patients in China was obtained based on the 2018 RA treatment guideline in China and the expert opinion of four rheumatologists from three tertiary hospitals [24]. In the comparator group, TNFi bDMARDs (i.e., ADA) were used as first-line treatment when patients were unresponsive to cDMARDs [22]. After failure of the first-line treatment, patients were switched to another TNFi (i.e., etanercept) as second-line treatment [25]. After etanercept failure, it was assumed that patients would be treated with non-TNFi biologics (i.e., tocilizumab), which is another class of biologic agents with different mechanisms of action [24, 26]. After failure of third-line treatment, patients were eventually switched to the PC phase, mainly comprising MTX, cyclosporine, hydroxychloroquine, leflunomide, azathioprine, sulfasalazine, celecoxib, and prednisolone, until death [22] (Fig. 1).

Fig. 1
figure 1

Treatment sequence. PC palliative care, MTX methotrexate

To understand the comprehensive cost-effectiveness of adding BCT to the current treatment sequence for patients with moderate-to-severe RA in China, a base case and three scenario analyses were performed. In the base case analysis, the study group was administered BCT before the current treatment sequence as a first-line treatment, and the current treatment sequence was used as a comparator group. Three scenarios were then separately compared with the comparator group: (1) BCT after ADA as a second-line therapy; (2) BCT after second-line treatment with TNFi (i.e., etanercept) as third-line therapy; (3) BCT after non-TNFi biologic (i.e., tocilizumab) as fourth-line therapy. The model assumed that all treatments were administered in combination with MTX.

Treatment Response and Discontinuation

The core of the model is treatment response measured using the ACR criteria, classified as non-responders (not achieving ACR 20% improvement criteria), ACR 20 (achieving ACR 20% improvement criteria), ACR 50 (ACR 50% improvement criteria), or ACR 70 (ACR 70% improvement criteria). A decision to remain on current treatment (maintenance phase) or switch to the next treatment is determined by identifying whether the patients achieve ACR 20 or above ACR20 (i.e., ACR50 or ACR70) at the end of the initial cycle of every treatment line (i.e., BCT, ADA, etanercept, and tocilizumab); otherwise, the patient will move to the next treatment line [22, 27]. Therefore, ACR20 was applied to estimate the discontinuation probability of the initial cycle of each treatment line. The process was repeated until the RA patient was switched to the PC phase. Treatment responses for BCT and ADA were derived from the RA-BEAM study, and etanercept and tocilizumab were obtained from a Bayesian network meta-analysis that included 96 unique random controlled trials (RCTs) of RA treatment to estimate the treatment effects of anti-rheumatic drugs in the initial cycle [12, 14, 20] (Supplementary Material Table 1).

After the initial cycle of every treatment line, responders who at least attained ACR20 continued on the current treatment but experienced a discontinuation of treatment due to lack of efficacy or intolerance. Specifically, the discontinuation probability of other cycles was based on a survival curve obtained from the CORRONA database using a lognormal model [28, 29]. A treatment discontinuation curve was generated, and the subsequent discontinuation probability was calculated from a lognormal distribution to estimate a transition from one treatment to another for long-term progression (Supplementary Material Figure 2 and Supplementary Material Table 2).

In addition to lack of response, severe adverse events (SAEs) may also lead to treatment switching [22, 26, 27]. Based on a previous study, we only included severe infection (i.e., pneumonia) in the model as an SAE [12]. The probabilities of serious infection are presented in Table 1.

Change in HAQ

The Health Assessment Questionnaire (HAQ) is an instrument for estimating the physical function of patients, with impacts on the patient mortality rate, health utility and hospitalization rate in the model. The HAQ score ranged from 0 to 3 in multiples of 0.125, whereby a higher score indicated a worse state of disease [30]. Therefore, change in the HAQ score was the prime driver of both cost and benefit in the model [25]. The microsimulation model allows HAQ changes over time to simulate disease progression for each patient. The four response levels were associated with different changes in the HAQ score. The relationship between the change in HAQ score and ACR response rate at the initial 6 months for tDMARDs was derived from a previous cost-effective study of RA [27] (Table 1 and Fig. 2). If a patient responds to treatment, then the HAQ score of the treatment will be decreased, indicating disease improvement [31]. The change in HAQ score in the initial cycle of every treatment line was subtracted from the baseline HAQ score, and the HAQ change of other cycles was subtracted from the HAQ score of the last cycle to simulate the influence of therapy on disease progression. Similar to other studies, we assumed that the change in HAQ score of the response level at the initial 6 months was the same for all tDMARDs [12, 27]. After the initial cycle of every treatment line, except for the PC phase, the change in HAQ score is assumed to be constant (no disease progression in HAQ) for the long-term period if the patient remains on the current treatment, which was tested in a sensitivity analysis [12, 27]. In the PC phase, we assumed that the HAQ score progressed at a constant linear rate (increase of 0.031 every 6 months) over time [32].

Fig. 2
figure 2

Overview of the RA lifetime model. The schematic illustrates how treatment impacts the HAQ score within the model. BCT baricitinib, ADA adalimumab, MTX methotrexate, ACR American College of Rheumatology, HAQ Health Assessment Questionnaire

Any improvement in the HAQ score for patients who quit the current treatment and move to next treatment will be lost, and they will experience a rebound in the HAQ score before starting the next treatment within the predefined sequence, at which point the treatment process commences again [25]. We assumed that the HAQ score would increase back to the beginning of the first cycle for each treatment line [22].

Mortality

Death was considered to be related to the HAQ score and could occur at any time in the model [33]. The mortality rate in the model was estimated based on age-/sex-specific mortality from the Chinese life table, which was sourced from the mortality database of the World Health Organization (WHO) [34]. The equation of mortality adjustment (Eq. 1), which has been published previously, is dependent on the HAQ score [22]:

$$ {\text{RA-specific adjusted mortality }} = {\text{mortalityrate from life table}} \times {1}.{33}^{{{\text{HAQ}}}} $$
(1)

Utility

Health-related quality of life of the RA patients for each cycle was calculated by using a previously published equation that is widely utilized by economic evaluation analysis in RA and shows great value for moderate-to-severe RA [27, 35]. Equation 2 below mapped the HAQ score into the EuroQol five-dimensional questionnaire (EQ-5D).

$$ {\text{EQ-5D }} = \, 0.{82 } - \, \left( {0.{11 } \times {\text{ HAQ}}} \right) \, {-} \, \left( {0.0{7 } \times {\text{ HAQ}}^{{2}} } \right) $$
(2)

Grade 3/4 AEs with a frequency ≥ 5% or a difference of > 2% between treatments in the RA-BEAM trial were included in our study. Therefore, we only considered pneumonia as an SAE in the model because the safety profiles of tDMADRDs are similar and would not significantly alter the results [12]. Based on a prior study, we assumed that disutility caused by severe infection, such as pneumonia, was decreased by 0.156 units of utility for the duration of the month of infection [36].

Resource Use and Costs

In this study, we included in the model direct medical costs (i.e., drug acquisition, administration, monitoring, SAE management and hospitalization) and indirect costs (i.e., productive loss) relevant to the Chinese healthcare system. Unit costs were estimated from national databases, published studies, or a Chinese expert panel. Drug acquisition costs were derived from the website of China Medical Bidding [37]. All drug agents in the model except BCT and MTX were administered subcutaneously. Monitoring costs, including chest x-rays and physician visits, were considered to be the same in both sequences. The cost of any SAE is assumed to be equal to that of different tDMARDs and equal to the cost of pneumonia ($1761.4). The cost of hospitalization was considered related to the HAQ score: a study from Carlson et al. [27] revealed the relationship between HAQ score and days in hospital annually. Cost per day in the hospital was derived from an expert panel. Costs were adjusted to 2019 US dollars (1 USD = 6.87 RMB) [38]. According to Chinese guidelines for economic evaluation, costs and utilities were discounted at 3% per year.

Validation

All sources, inputs, and assumptions were validated by an expert panel, including four Chinese rheumatologists, one pharmacist, and two health economists. An independent modeler verified the implementation of the model. Half-cycle correction was applied for all costs and utilities by taking the average of the input parameters at the beginning and end of the cycle, considering transitions (events) that occurred halfway through a cycle.

Sensitivity Analysis

One-way sensitivity analysis was performed to determine key drivers in the model. Twenty-one variables, including drug costs, discount rate, baseline HAQ, initial HAQ change in different response levels, annual HAQ score change in PC phase, and other costs, were varied individually to estimate the proportional effect on model outcomes. In addition, to test the robustness of the model and degree of uncertainty in our results estimates, we performed multiple scenarios that considered sources of data around key input parameters and alternative assumptions. For example, BCT was inserted at different positions in the treatment sequence to evaluate its impact on the cost-effective analysis. We also considered a series of scenarios regarding the heterogeneity of RA patients (Supplementary Material Table 4; Part B).

Probability analyses were conducted for 2000 sets of 2000 patients to assess overall parameter uncertainty. Key input parameters were simulated from parametric distribution to generate 2000 estimates of the costs and benefits in each strategy. All costs followed gamma distributions; age, baseline HAQ, and HAQ score changes followed normal distributions. Discontinuation probabilities followed beta distributions, weight and disutility of SAEs followed uniform distributions, and sex followed binomial distribution. In this study, willingness to pay (WTP) was set at $30,950/QALY in China by using three times the per capita gross domestic product, as recommended by WHO guidelines [39]. If the result of ICER we estimated was lower than the WTP in China, then it could be considered that the study group was cost-effective compared with the current treatment sequence.

Results

Base Case Analysis

A 50,000 Monte Carlo simulation was used to conduct base case analysis, and the results are displayed in Table 2. The base case analysis indicated that compared with the comparator group, the study group provided more accumulated QALYs, 12.26 versus 9.61 QALY (difference, 2.66 QALY), but was also more costly (incremental cost: $26,662) over the lifetime period. The ICER of the study group vs. comparator group (i.e., 10,036) was below the one time per capita GDP (i.e., $10,316/QALY), which was recommended by the WHO as a “very cost-effective” threshold [40].

Table 2 Base case analysis results: total costs, QALYs, and ICERs

Sensitivity and Scenario Analyses

The results of the model were most sensitive to age, drug cost of BCT and etanercept, discount rate, and probability of serious infection in the PC phase. The cost-effectiveness acceptability curve showed that the study group was cost-effective under the Chinese threshold. Moreover, the study group had a 90% and 100% likelihood of being cost-effective if the WTP was > $13,660 and $16,430 compared with the current treatment sequence (Fig. 3).

Fig. 3
figure 3

Tornado diagram for one-way sensitivity analysis. BCT baricitinib, ADA adalimumab, MTX methotrexate, ETA etanercept, INF infliximab, ACR20 American College of Rheumatology 20% improvement criteria, ACR50 ACR 50% improvement criteria, ACR70 ACR 70% improvement criteria, NR no response, HAQ Health Assessment Questionnaire, PC palliative care, HAQ progression annual HAQ score change in PC phase, AEs adverse events, cDMARDs conventional disease-modifying antirheumatic drug, tDMARDs targeted disease-modifying antirheumatic drug

Scenario analyses of sequences, including BCT added to the comparator group as second-, third-, and fourth-line treatment, also revealed improved costs and QALY gains compared with the comparator group: ICERs of $10,619; $11,789, and $10,820 per QALY estimated for the administration of BCT as a second-, third-, and fourth-line treatment, respectively. According to all scenario analyses of sequences, adding BCT to the comparator group as a second-, third-, and fourth-line treatment was < $30,950/QALY and likely to be cost-effective in China compared with the comparator group (Supplementary Material Table 3; Part A). Furthermore, we used statistical methods to explore whether there was statistical difference between each scenario. The results demonstrated that although administrating BCT as second-line therapy had the lowest ICER compared with those in all scenario analyses of sequences, there was no statistical difference related to second-line therapy compared to other scenarios (Supplementary Material Table 4; Part A).

Part B illustrated that the cost-effectiveness of the study group was < $30,950/QALY for all scenarios, though we set the baseline HAQ of the study group at the upper limit (HAQ = 2). For example, even if the age of patients was set at the upper limit (age = 80 years), the ICER for the BCT + MTX strategy was still < $30,950/QALY.

Discussion

To our knowledge, this is the first study to evaluate the cost-effectiveness in China of introducing BCT to adalimumab-etanercept-tocilizumab-PC (current treatment sequence) for patients with moderate-to-severe RA in whom MTX is considered inappropriate. To fill this evidence gap, we developed a cost-effectiveness model by adopting a microsimulation approach that models individual patients to account for heterogeneity instead of a patient cohort. The topic of this study has relevance for patients, clinicians, and policy makers. RCTs have suggested that BCT results in a significant clinical improvement compared with ADA for patients with moderate-to-severe RA, despite previously receiving treatment with MTX. Our study compared a series of treatment sequences that introduced BCT into current clinical practice for RA, providing a comprehensive RA treatment guideline for clinical use. Specifically, our study found that adding BCT to the current treatment sequence was cost-effective compared with the comparator group alone.

Due to the different model settings and input parameters (i.e., drug acquisition cost, treatment sequence, and time horizon), the results of our study were inconsistent with those of evaluation analyses of BCT compared with biologics in other countries. For example, a study in Spain conducted by Schlueter et al. [20] revealed that a BCT first-line strategy was associated with 0.09 QALYs gained over a lifetime horizon, with an incremental cost of —€558 versus ADA first-line treatment. BCT is a cost-effective option for patients with moderate-to-severe RA in whom there is an inadequate response to cDMARDs. Another study from The Netherlands developed by Celine estimated the ICER to be —€238,418, which suggested that the use of BCT instead of ADA in the treat-to-target approach is cost-effective for refractory RA patients [19]. The negative ICER in the cases above confirms the superiority of the BCT strategy: it accumulates higher QALYs with lower cost over the model time horizon.

This study had several limitations, primarily because of the inherent shortcomings of the RCT in terms of the follow-up period. Most data we obtained from the RCT and NMA in the model were for approximately 12–24 months, but the time horizon of the model was the lifetime, and short-term clinical data may introduce some biases in extrapolating long-term efficacy and costs. Indeed, inadequate information about HAQ progression for the long-term period (> 24 months on treatment) led to an assumption that the HAQ changed constantly. This is an unavoidable limitation for most RCTs because conducting long-term follow-up is resource intensive and prohibitively expensive. Thus, we built a decision analytic model and made reasonable assumptions to improve the robustness and practicality of the model. Second, the population characteristics of RA patients were taken from the RA-BEAM trial because of the lack of a head-to-head study that directly compared BCT + MTX with ADA + MTX in China. However, sensitivity analysis revealed that population characteristics only marginally impact the model results. Third, patients in the treatment sequence in this study were switched to PC for the remainder of their lifetime (model time horizon); this is justifiable and widely used by published articles on RA economics evaluation [12, 22]. Nonetheless, the clinical treatment sequence will have more permutations and combinations than the limited treatment sequence in this study. Fourth, according to previous studies, we assumed that only serious infection had a significant impact on the results in the model [41, 42]. However, in real life, patients may experience different AEs; regardless, because the safety profiles of bDMARDs are similar [27], adding other AEs to the model would not alter the results. The results of sensitivity analyses also confirmed that the impact on bDMARDs caused by AEs was insignificant. Finally, we did not include treatment with BCT monotherapy, which has significant clinical improvement and may be beneficial for patients with RA who are intolerant to cDMARDs. The economic evaluation of BCT monotherapy compared to BCT + cDMARDs needs to be assessed in future studies.

Conclusions

From the Chinese payer perspective, the introduction of BCT to the current treatment sequence for treating patients with moderate-to-severe RA for whom MTX fails is a cost-effective treatment option. Moreover, cost-effectiveness was obtained regardless of the position of BCT within the treatment sequence. Compared with its addition at subsequent positions in the treatment sequence, addition of BCT as a first-line treatment is the best option.