Introduction

Rheumatoid arthritis (RA) is a multifactorial chronic autoimmune disease that is associated with chronic systemic inflammation, leading to irreversible joint damage and multiple extra-articular morbidities [1, 2]. RA affects females two to three times more than males and can present at any age, with a peak prevalence in the seventh decade of life [3]. Early diagnosis and proper treatment are critical to control RA activity and avoiding permanent disabilities and joint destruction [1].

No curative therapy for RA exists; however, multiple therapeutic options are available to achieve the principle of “treat to target” that was established by the American College of Rheumatology (ACR) [4], which is achieved by either remission or low disease activity and sustaining that for at least 6 months. Different tools are used to evaluate the disease severity. Disease activity score 28 based on C-reactive protein (CRP) and erythrocyte sedimentation rate (ESR) (DAS28-CRP and DAS28-ESR) [5] and ACR 20, 50, and 70 criteria [6] are common indices evaluating RA activity based on symptoms, signs, and laboratory data. Other indices are being used as well, including Health Assessment Questionnaire Disability Index (HAQ-DI) [5] and Clinical Disease Activity Index (CDAI) [7]. The European Alliance of Associations for Rheumatology (EULAR) [8] and ACR [9] recommended disease-modifying anti-rheumatic drugs (DMARDs) and symptomatic treatment as soon as the accurate diagnosis is made to stop the ongoing joint erosions and relieve the patient’s symptoms.

Despite the advancement in the DMARDs, less than 50% of patients with RA are in remission, and 10–20% are refractory to the available treatment options [10]. New treatments targeting different pathways in the disease’s pathophysiology are emerging to cover this unmet gap, including IL-6 inhibitors. IL-6 is a pro-inflammatory cytokine that plays different roles in the pathogenesis of RA. It activates the T-cells, induces B-cell proliferation, induces osteoclast differentiation, enhances angiogenesis, and induces acute phase reactants [2]. The unregulated production of IL-6 in RA is associated with autoimmunity, chronic inflammation, joint edema, and joint destruction [11]. Accordingly, multiple drugs targeting IL-6 started to emerge, including tocilizumab (TCZ), sarilumab, sirukumab, clazakizuma, and olokizumab (OKZ).

Furthermore, IL-6 inhibitors are not used as a monotherapy for RA, but they are combined with conventional synthetic DMARDs (csDMARDs), such as methotrexate (MTX), for a better outcome. When TCZ and MTX were given to patients with inadequate response to TNF-α inhibitors, they achieved rapid and sustained response [12]. Also, sarilumab plus MTX showed significant suppression in the biomarkers of bone resorption and joint damage, compared to placebo plus MTX [13]. Finally, OKZ is a novel IL6 inhibitor with promising treatment outcomes.

To clarify OKZ is a new humanized monoclonal antibody targeting the IL-6 cytokine. Unlike TCZ, it does not target the IL-6 receptor. Instead, it targets specific sites on the IL-6 itself and blocks the formation of the extracellular signaling complex, consequently inhibiting the transmembrane signaling [14]. Two phase II randomized controlled trials (RCTs) [15, 16] reported significant improvement in DAS28-CRP, ACR20, and ACR50 indices in OKZ group, compared to the placebo. In another phase III clinical trial, OKZ combined with MTX was superior to MTX monotherapy and non-inferior to adalimumab combined with MTX [17]. Nasonov et al. [18] also reported that the combination of OKZ and MTX was superior to MTX plus placebo in improving the signs, symptoms, and physical function in RA patients. This was further supported by Feist et al. [19]. Therefore, we performed this systematic review and meta-analysis to synthesize evidence from RCTs on the efficacy and safety of OKZ in patients with RA and to investigate the optimal dosing regimen.

Methodology

Protocol registration

This systematic review and meta-analysis was thoroughly conducted in accordance with the Preferred Reporting Items for Systematic Reviews, Meta-analysis, and Network Meta-Analyses (PRISMA [20] and PRISMA NMA [21]) and the Cochrane Handbook of Systematic reviews and meta-analysis [22]. Moreover, this review’s protocol was prospectively published and registered in PROSPERO with ID: CRD42022358082. The thorough PRISMA checklist is in Table S1.

Data sources and search strategy

Electronic databases, including, PubMed, EMBASE, Web of Science, Scopus, and Cochrane CENTAL were systematically searched by two reviewers (B.A. and M.T.) until August 31, 2022. No filters were used. The comprehensive search terms and findings are elaborated in Table S2.

Eligibility criteria

We included RCTs with the following PICO criteria: population (P): patients with RA irrespective of their current treatment; intervention (I): OKZ irrespective of the dose (C): placebo (O): primary outcome of this study is the ACR20. Secondary endpoints include ACR50, ACR70, DAS28 CRP < 3.2, CDAI score ≤ 2.8, HAQ-DI score change, DAS28 ESR score change, and finally, safety data, including all-cause mortality, any treatment-emergent adverse events (TAEAs), any serious TAEAs, TAEAs leading to drug discontinuation, gastrointestinal disorder, and infection. All outcomes were measured at 12 and 24 weeks.

Single-arm studies, observational studies, conference abstracts, animal studies, and non-randomized trials were excluded.

Study selection

After excluding duplicates with Covidence online program, four investigators (A.B., A.F., A.M., and M.A.E.) independently assessed the titles and abstracts of the retrieved articles. Then, they checked the full texts of the relevant records for the previously mentioned eligibility criteria. Disagreements were resolved through discussion.

Data extraction

Guided by a pilot-tested extraction form, four reviewers (A.B., A.F., A.M., and M.A.E.) independently extracted the following: study characteristics (study design, NCT number, country recruitment duration, total participants, OKZ dose, and frequency of administration, main inclusion criteria, current adjuvant medication, primary outcome and follow up duration); baseline data, including (age, sex, number of patients in each group, basal metabolic index (BMI), rheumatoid factor (RF) positive, anti-CCP positive, DAS28-CRP, CDAI score, HAQ-DI score, tender joint count (TJC), swollen joint count (SJC), patient global assessment of disease activity (PtGA), visual analog scale (VAS), physician global assessment of disease (PGA), MTX dose, Duration of prior MTX use, Glucocorticoid use, Prednisone dose or its equivalent, and Prior exposure to TNF inhibitors). Finally, efficacy and safety outcomes include ACR20, ACR50, and ACR70 response, conversion to DAS28 (CRP) < 3.2 and CDAI score ≤2.8, HAQ-DI, DAS28 (ESR) change from baseline, and adverse events. Disagreements were resolved through discussion.

Risk of bias and quality assessment

We implemented the revised Cochrane collaboration’s tool for assessing the risk of bias in RCTs (ROB 2) [23], and four reviewers (A.B., A.F., A.M., and M.A.) independently evaluated the included RCTs for the risk of selection, performance, reporting, attrition, and overall biases. Disagreements were resolved by discussion. For the quality of evidence appraisal, two reviewers (M.T. and B.A.) used the Grading of Recommendations Assessment, Development, and Evaluation (GRADE) guidelines [24, 25]. Our decision was rationalized and reported for each outcome. Any disagreements were resolved via discussion.

Statistical analysis

For the pairwise meta-analysis, we used Revman version 5.4 [26] to pool dichotomous outcomes using risk ratio (RR) and continuous outcomes using mean difference (MD) presented with the corresponding 95% confidence interval (CI). We used the fixed-effect model; however, the random-effect model was utilized in case of significant heterogeneity. I2 and chi-squared test were used to evaluate the statistical heterogeneity. P value < 0.05 was considered significant for the chi-squared test, and I2 > 50% indicated substantial heterogeneity, in which case sensitivity analysis was conducted by excluding one study each time to determine the source of heterogeneity.

For network meta-analysis, we performed a network meta-analysis using a frequentist framework [21], pooling dichotomous outcomes using risk ratio (RR), and continuous data using mean difference (MD) presented with the corresponding 95% confidence interval (CI). Analysis was performed using the R-software netmeta and netrank package (R version 4.2.0), and meta-insight software [27,28,29] with statistical inconsistency in between network arms was evaluated by calculating I2. Finally, we did not investigate the publication bias by funnel plots as we included less than ten RCTs [30].

Results

Search results and study selection

We retrieved 201 records, and then, 85 duplicates were excluded using Covidence, leaving 116 records to be screened. After title and abstract screening, we excluded 91 irrelevant records and proceeded with 25 articles for full-text screening. Finally, we included five RCTs [15,16,17,18,19] (Fig. 1).

Fig. 1
figure 1

PRISMA flow chart of the screening process

Characteristics of included studies

Five RCTs [15,16,17,18,19] were included with 2277 patients of whom 1749 patients in the OKZ group and 528 patients in the placebo group. Two RCTs were phase II following up patients for 12 weeks [15, 16], while three RCTs were phase III following up patients for 24 weeks [17,18,19]. Phase III RCTs [17,18,19] investigated OKZ in the dose of 64 mg given every 2 or 4 weeks, while phase II RCTs [15, 16] were dose-ranging studies investigating the doses of 60, 120, and 240 mg given every 2 or 4 weeks. Summary RCT characteristics and baseline characteristics of the participants are outlined in Tables 1 and 2, respectively.

Table 1 Summary characteristics of the included RCTs
Table 2 Baseline characteristics of the participants

Risk of bias and quality of evidence

All of the included RCTs [15, 17,18,19] showed a low risk of overall bias, except Takeuchi et al. [16], with some concerns due to not available information about outcome assessor blinding (Fig. 2). The quality of evidence is outlined in a GRADE evidence profile (Table S3).

Fig. 2
figure 2

Quality assessment of risk of bias in the included trials. The upper panel presents a schematic representation of risks (low = red, unclear = yellow, and high = red) for specific types of biases of each of the studies in the review. The lower panel presents risks (low = red, unclear = yellow, and high = red) for the subtypes of biases of the combination of studies included in this review

Efficacy outcomes

ACR20 response

In the pairwise meta-analysis, the pooled RR favored OKZ over placebo after 12 weeks (RR: 1.97 with 95% CI [1.49, 2.58], P = 0.00001) (moderate-quality evidence) and after 24 weeks (RR: 1.75 with 95% CI [1.35, 2.27], P = 0.0001) (moderate-quality evidence) (Fig. 3A, Table S3). The pooled studies were heterogenous (P = 0.01, I2 = 70%) and (P = 0.06, I2 = 72%), respectively. Heterogeneity after 12 weeks was best resolved after excluding Nasonov et al. (2021) (P = 0.21, I2 = 34%) (Table S4).

Fig. 3
figure 3

Forest plot of the efficacy outcomes (A ACR20, B ACR50); RR, risk ratio; CI, confidence interval

In the network meta-analysis, all OKZ regimens were significantly associated with greater ACR20 response, compared to placebo either after 12 or 24 weeks, with low heterogeneity observed of I2 = 37% and I2 = 43%, respectively (Table 3, Figs. S1-6).

Table 3 Ranking table for all our network meta-analyses’ outcomes

ACR50 response

In the pairwise meta-analysis, the pooled RR favored OKZ over placebo after 12 weeks (RR: 3.83 with 95% CI [2.13, 6.87], P = 0.00001) (moderate-quality evidence) and after 24 weeks (RR: 3.53 with 95% CI [1.35, 9.23], P = 0.01) (low-quality evidence) (Fig. 3B, Table S3). The pooled studies were heterogenous (P = 0.02, I2 = 67%) and (P = 0.002, I2 = 89%), respectively. Heterogeneity after 12 weeks was best resolved after excluding Nasonov et al. (2021) (P = 0.36, I2 = 7%) (Table S4).

In the network meta-analysis, all OKZ regimens were significantly associated with greater ACR50 response, compared to placebo either after 12 or 24 weeks, with low heterogeneity observed of I2 = 22% and I2 = 34%, respectively (Table 3, Figs. S7-12).

ACR70 response

In the pairwise meta-analysis, the pooled RR favored OKZ over placebo after 12 weeks (RR: 3.83 with 95% CI [2.13, 6.87], P = 0.00001) (moderate-quality evidence) and after 24 weeks (RR: 5.09 with 95% CI [1.53, 16.91], P = 0.008) (low-quality evidence) (Fig. 4A, Table S3). The pooled studies were homogenous after 12 weeks (P = 0.18, I2 = 42%) and heterogenous after 24 weeks (P = 0.02, I2 = 82%).

Fig. 4
figure 4

Forest plot of the efficacy outcomes (A ACR70, B DAS28-CPR); RR, risk ratio; CI, confidence interval

In the network meta-analysis, OKZ 64 Q2w and Q4w regimens were significantly associated with greater ACR70 response, compared to placebo either after 12 or 24 weeks with, substantial heterogeneity observed of I2 = 52% and 73%, respectively (Table 3, Figs. S13-18).

DAS28-CRP ≤ 3.2

In the pairwise meta-analysis, the pooled RR favored OKZ over placebo after 12 weeks (RR: 3.91 with 95% CI [2.65, 5.79], P = 0.00001) (high-quality evidence) and after 24 weeks (RR: 3.54 with 95% CI [1.55, 8.10], P = 0.003) (low-quality evidence) (Fig. 4B, Table S3). The pooled studies were homogenous after 12 weeks (P = 0.17, I2 = 38%) and heterogenous after 24 weeks (P = 0.009, I2 = 86%).

In the network meta-analysis, all OKZ regimens were significantly associated with improved DAS28-CRP, compared to placebo either after 12 or 24 weeks, with low heterogeneity observed of I2 = 43% for 12 weeks, yet substantial heterogeneity of I2 = 76% for 24 weeks (Table 3, Figs. S19-24).

CDAI score of ≤ 2.8

In the pairwise meta-analysis, the pooled RR favored OKZ over placebo after 12 weeks (RR: 2.80 with 95% CI [1.43, 5.48], P = 0.003) (moderate-quality evidence) and after 24 weeks (RR: 3.67 with 95% CI [2.01, 6.72], P = 0.0001) (low-quality evidence) (Fig. S25, Table S3). The pooled studies were homogenous after 12 weeks (P = 0.62, I2 = 0%) and after 24 weeks (P = 0.13, I2 = 57%).

In the network meta-analysis, all OKZ regimens were significantly associated with improved CDAI score, compared to placebo either after 12 or 24 weeks, with low heterogeneity observed of I2 = 11% and I2 = 24%, respectively (Table 3, Figs. S26-31).

HAQ-DI score change after 12 weeks

In the pairwise meta-analysis, the pooled MD favored OKZ over placebo (MD: − 0.28 with 95% CI [− 0.38, − 0.18], P = 0.00001) (moderate-quality evidence) (Fig. S32, Table S3). The pooled studies were heterogenous (P = 0.002, I2 = 76%). A sensitivity analysis was conducted to determine the source of heterogeneity, but it was not resolved by sensitivity analysis (Table S4).

In the network meta-analysis, all treatment regimens were associated with a reduction in HAQ-DI score change after 12 weeks, except for OKZ 240 Q4W (MD: − 0.16 with 95% CI [− 0.43, 0.10]), with substantial heterogeneity observed of I2 = 54% (Table 3, Figs. S33-35).

DAS28-ESR score change after 12 weeks

We found no difference between OKZ and placebo (MD: − 3.69 with 95% CI [− 8.13, 0.75], P = 0.1) (very low-quality evidence) (Fig. S36). The pooled studies were heterogenous (P = 0.0001, I2 = 94%).

Safety outcomes

Any TEAEs

In the pairwise meta-analysis, OKZ was significantly associated with more incidence of TEAEs (RR: 1.15 with 95% CI [1.06, 1.25], P = 0.0005) (moderate-quality evidence) (Fig. 5A, Table S3). The pooled studies were homogenous (P = 0.15, I2 = 41%). In network meta-analysis, OKZ 64 Q2w and Q4w regimens were significantly associated with a higher incidence of TEAEs (RR: 1.14 with 95% CI [1.03, 1.25]) and (RR: 1.15 with 95% CI [1.0,4 1.26]), respectively, with low observed heterogeneity of I2 = 34% (Table 3, Figs. S37-39).

Fig. 5
figure 5

Forest plot of the safety outcomes (A TEAEs, B TEAEs leading to drug discontinuation, C TESAEs, and D all-cause mortality); RR, risk ratio; CI, confidence interval

Any TEAEs leading to drug discontinuation

In the pairwise meta-analysis, OKZ was significantly associated with more incidence of TEAEs leading to drug discontinuation (RR: 1.86 with 95% CI [1.05, 3.29], P = 0.03) (moderate-quality evidence) (Fig. 5B, Table S3). The pooled studies were homogenous (P = 0.34, I2 = 10%). In the network meta-analysis, OKZ 64 Q4w regimen was significantly associated with a higher incidence of TEAEs leading to drug discontinuation (RR: 2.03 with 95% CI [1.06, 3.89]), with no observed heterogeneity of I2 = 0% (Table 3, Figs. S40-42).

Any TESAEs

In the pairwise meta-analysis, we found no difference between OKZ and placebo regarding the incidence of TESAEs (RR: 1.12 with 95% CI [0.70, 1.78], P = 0.63) (moderate-quality evidence) (Fig. 4C, Table S3). The pooled studies were homogenous (P = 0.28, I2 = 21%). In the network meta-analysis, we found no difference between different OKZ regimens and placebo regarding the incidence of TESAEs, with low observed heterogeneity of I2 = 13% (Table 3, Figs. S43-45).

Any-cause mortality

In the pairwise meta-analysis, we found no difference between OKZ and placebo regarding the incidence of any-cause mortality (RR: 1.36 with 95% CI [0.23, 8.03], P = 0.74) (moderate-quality evidence) (Fig. 5D, Table S3). The pooled studies were homogenous (P = 0.94, I2 = 0%). In the network meta-analysis, we found no difference between different OKZ regimens and placebo regarding the incidence of any-cause mortality, with no observed heterogeneity of I2 = 0% (Table 3, Figs. S46-48).

Gastrointestinal disorders

In the pairwise meta-analysis, we found no difference between OKZ and placebo regarding the incidence of gastrointestinal disorders (RR: 1.20 with 95% CI [0.88, 1.64], P = 0.26) (moderate-quality evidence) (Fig. S46, Table S3). The pooled studies were homogenous (P = 0.84, I2 = 0%). In the network meta-analysis, we found no difference between different OKZ regimens and placebo regarding the incidence of gastrointestinal disorders, with no observed heterogeneity of I2 = 0% (Table 3, Figs. S50-52).

Infections

In the pairwise meta-analysis, we found no difference between OKZ and placebo regarding the incidence of infections (RR: 0.93 with 95% CI [0.79, 1.08], P = 0.34) (moderate-quality evidence) (Fig. S50, Table S3). The pooled studies were homogenous (P = 0.36, I2 = 8%). In the network meta-analysis, we found no difference between different OKZ regimens and placebo regarding the incidence of infections, with low observed heterogeneity of I2 = 12% (Table 3, Figs. S54-56).

Discussion

This meta-analysis of five RCTs [15,16,17,18,19] found that the OKZ in RA patients with inadequate response to the standard of care was effective, compared to the placebo. All the disease activity indices favored OKZ except DAS28-ESR, which showed no difference from the placebo. The safety profile of the OKZ was as expected for the IL-6 inhibitors. TEAEs were higher in the OKZ group compared to the placebo. Also, the treatment discontinuation rate due to TEAEs was higher in the OKZ group. However, the OKZ and placebo groups were similar regarding all-cause mortality, TESAEs, gastrointestinal disorders, or infection.

OKZ is a newly developed humanized monoclonal antibody targeting IL-6 cytokine itself rather than its receptor. It has been investigated for the treatment of moderate to severe RA with inadequate response to TNF-α inhibitors in the presence of MTX. Only TCZ and sarilumab, which block the IL-6 receptor, have been approved for RA treatment (31,32). Moreover, other IL-6 inhibitors that target the IL-6 cytokine directly rather than its receptor (sirukuma, clazakizuma, and OKZ) are currently under development. However, none of them has been authorized yet for the treatment of RA. OKZ, being a direct IL-6 inhibitor, can be administrated less frequently compared to IL-6 receptor blockers as less protein dose is required to achieve an effect; hence, OKZ Q4W can be favored over weekly or biweekly dosing of the approved IL-6 receptor blockers (TCZ and sarilumab) [19].

ACR20 was used as the primary endpoint in this analysis because it has been widely accepted value for assessing the drugs’ efficacy in RA over the years. It also makes the comparison between the response to OKZ and other drugs that used the ACR20 value in the past reasonable [18]. Our pooled analysis showed significant improvement in the ACR20 in the OKZ group, compared to the placebo at 12 and 24 weeks following the treatment initiation. Furthermore, Smolen et al. [17] reported that the ACR20 improvement in patients treated with OKZ + MTX was similar to patients treated with adalimumab (TNF-α inhibitor) + MTX. Genovese et al. [15] also reported similar improvement in the ACR20 in patients treated with OKZ and TCZ (IL-6 receptor inhibitor). He also reported that the improvement in the ACR20 started as early as week 1 after treatment [15]. Similarly, ACR50 and ACR70 were significantly improved in the OKZ group versus placebo at weeks 12 and 24. ACR50 improvement was noticed as early as week 4 by Genovese et al. [15], and more patients met this index compared to the ACR70. ACR70 improvement was higher at 24 weeks, compared to 12 weeks indicating that longer treatment duration might induce more improvement.

Moreover, we found that the rate of DAS28-CRP ⩽ 3.2 was higher in the OKZ group versus the placebo. On the other hand, we did not find a difference between the two groups regarding DAS28-ESR. DAS28 index was first presented in 1995 using the count of 28 tender and swollen joints combined with a measure of the general health and acute phase reactant ESR. In 2004, CRP was suggested as an alternative component for the DAS28 instead of the ESR for different reasons. First, ESR is under the influence of different factors, including age, gender, and plasma proteins. Second, CRP is more susceptible to short-term alterations in inflammation, which in turn will reflect the rapid response to the treatment [15]. Since the development of the DAS28-CRP index, it has been investigated for validation and comparison to the original DAS28-ESR. Arguably, indices using CRP to evaluate drugs blocking IL-6 cannot be very accurate. This comes back to the fact that blocking IL-6 cytokine directly or blocking its receptor interferes with the production of CRP [17]. Therefore, due to possibly suspicious results, future studies should focus on clinical and radiological assessments. With debates on the superiority of one over the other, both indices have been validated by the EULAR and the ACR to monitor the disease severity and achieve treatment to the target concept [5]. To clarify, the DAS28 score ranges from 0 to 9.4, with values < 2.6 representing remission, while values ⩽ 3.2 represent low disease activity [5, 31].

Regarding CDAI, pooled data favored the OKZ over the placebo at weeks 12 and 24 of treatment. CDAI is another index to evaluate the disease severity in RA patients that does not use acute phase reactants for measurement. Instead, it uses the summation of the number of swollen/tender 28 joints plus patient and physician global assessment on the visual analog scale (VAS) [4]. The simplicity of this score measurement made it more feasible to be used in clinical settings. It has a score ranging from 0 to 76 where values ⩽ 2.8 represent remission, while values > 2.8 to 10 represent low disease activity [32]. CDAI has been validated compared to DAS28 and HAQ-DI indices [33] and recommended by the ACR to monitor the disease activity and achieve treatment to target [8, 34].

Regarding HAQ-DI, we found that patients’ disability improvement at week 12 based on HAQ-DI favored the OKZ group, compared to the placebo. Assessment of disability that results from either joint damage or pain in RA patients is important [35]. HAQ-DI is one of the most used patient-oriented tools to assess functional disability in RA patients. It has a range from 0 to 3, with higher numbers representing more disabilities [17]. The current consensus is that the lowest clinically significant difference in the HAQ-DI is a change of 0.22 to 0.25 [35]. However, Ward et al. suggested that a change of 0.375 is needed to show a clinically significant difference [35].

Moreover, OKZ safety profile was expected of IL-6 inhibitors and similar to the FDA-approved anti-IL-6 drugs, tocilizumab, and sarilumab [36, 37]. From the pooled data, we found that the OKZ group experienced more TEAEs and TEAEs leading to drug discontinuation compared to the placebo. However, we could not find a significant difference between the incidences of treatment-emergent any-cause mortality, TESAEs, gastrointestinal disorders, and infections between OKZ and placebo.

Genovese et al. [15] reported mild to moderate TEAEs in the OKZ group, which included gastrointestinal disorders, infections, and nervous system disorders. They found that TEAEs were similar in quality and frequency to those observed in the TCZ and placebo groups [15]. They also reported an increase in the patient’s liver enzymes in both the OKZ and TCZ groups without severe liver injury [15]. A mild increase in the patients’ lipids in both OKZ and TCZ was also noticed, besides a decrease in their neutrophilic counts [15]. Takeuchi et al. [16], Nasonov et al. [18], and Feist et al. [19] reported similar TEAE profiles, with most TEAEs being mild to moderate and more common in the OKZ groups. Furthermore, Smolen et al. [17] reported that most of the TEAEs were mild to moderate as well and were similar in all treatment groups, including the adalimumab group.

Higher doses or higher frequency of OKZ administration are likely to cause higher TEAEs or TESAEs. This was supported by Feist et al. [19] reporting a frequency-dependent increase in the rate of TESAEs in most of the events in the OKZ 64 mg once every two weeks (Q2W) group. Genovese et al. [15] also reported similar findings as they noticed higher injection site reactions in the OKZ 240 mg Q2W group. However, after we analyzed the pooled data, we found that OKZ 64 mg Q2W and Q4W were both associated with a higher rate TEAEs, compared to the placebo. Also, our network meta-analysis found that drug discontinuation due to TEAEs was higher in the OKZ 64 mg Q2W group. Based on our observations, the concept of a dose-dependent increase in TEAEs or TESAEs with OKZ might need further research.

With most of the reported TEAEs being mild to moderate in all the included studies, few TESAEs were reported. Two TESAEs were reported by Genovese et al. [15], one pneumonia and another abscess in each of the OKZ and TCZ groups. Takeuchi et al. [16] also reported four TESAEs, including two RAs in the placebo group and two infections in the OKZ group. Nasonov et al. [18] reported few TESAEs in the OKZ groups, including pulmonary tuberculosis. The largest number of the TESAEs were reported by Smolen et al. [17], including major cardiovascular events (which were similar in the OKZ and adalimumab groups), pneumonia, sepsis, and pulmonary tuberculosis. The reported TESAEs by Feist et al. [19] included sepsis, hepatobiliary disorders, and cellulitis.

Furthermore, no deaths were reported by Genovese et al. [15], Takeuchi et al. [16], or Feist et al. [19]. However, Nasonov et al. [18] reported one death in the OKZ 64 mg Q2W group, and Smolen et al. [17] also reported deaths due to TESEAs in three patients in the OKZ 64 mg Q2W group, two patients in the OKZ 64 mg Q4W group, one patient in the adalimumab group, and one patient receiving placebo.

Strengths

This is the first meta-analysis, to the best of our knowledge, that evaluated the efficacy and safety of OKZ for RA, investigating the optimal regimen through a thorough network meta-analysis. Our review was strictly conducted according to PRISMA guidelines [20] with prospectively published protocol and GRADE evidence assessment.

Limitations

Our review is limited by the following: first, some of our network meta-analysis comparisons included only one to two arms, and our 24-week pairwise analysis included only two RCTs, which can limit the external validity of its findings. Second, the number of patients in the OKZ group is relatively higher than the placebo group, which is an inherited limitation of the included trials due to ethical reasons. Third, none of the included trials conducted a radiological examination of the joints to confirm the observed effects. Finally, some of our meta-analysis outcomes showed significant heterogeneity.

Implications for future research

By combining the data from three phase III and two phase II RCTs, we gained more power to provide the highest level of evidence to estimate the effectiveness and safety of OKZ as a new treatment for RA. Our study confirmed the added treatment benefits of OKZ when combined with MTX. The lack of differences in TESAEs and any-cause mortality between different doses of OKZ and placebo confirmed the safety profile of this new medication. Our study justifies the need for designing more RCTs investigating, the still-lacking, long-term effects and side effects of OKZ beyond 24 weeks. It also points to the need to investigate the most effective dose of OKZ, given the dose–response increase in TEAEs incidence. With most of the included studies comparing the OKZ to placebo, the need to look into the comparative efficacy of OKZ versus the established TNF-α inhibitors and IL-6 inhibitors became important. Eventually, the results of this study might provide supportive evidence for getting the OKZ approved as a treatment for RA. A step equips physicians with a new tool for fighting this disease and gives hope to many RA patients.

Conclusion

OKZ treatment with MTX was effective in improving RA indices with an improvement of the RA’s symptoms and signs along with the expected safety profile; however, more large-scale RCTs are still required to investigate the optimal dosing, long-term effects, and comparative efficacy versus established biological DMARDs.