FormalPara Key Points
Table 1

1 Introduction

Non-steroidal anti-inflammatory drugs (NSAIDs) inhibit the enzyme cyclooxygenase (COX), which mediates the conversion of arachidonic acid to inflammatory prostaglandins (PGs). COX-1 is constitutively expressed in the gastric epithelium, therefore providing protection to the gastric mucosa as it regulates the production of acid and mucus. COX-2 is involved in inflammation and is a source of the inflammatory mediators prostaglandin E2 (PGE2) and prostacyclin (PGI2) via the arachidonic acid pathway [1]. COX-2 inhibitors are a subclass of NSAIDs that were designed to selectively interrupt the production of inflammatory mediators without compromising gastric epithelial function [2]. COX-2 inhibitors are indicated for anti-inflammatory, anti-pyretic and analgesic effects in disorders such as osteoarthritis (OA), rheumatoid arthritis (RA) and acute pain.

There are few COX-2 inhibitors in current clinical use: celecoxib (200 mg/day, oral), etoricoxib [60 mg/day, oral; although not approved by the US Food and Drug Administration (FDA)] and parecoxib (80 mg/day, intravenous or intramuscular injection for postoperative pain; not approved by the FDA). Further, the FDA and European Medicines Agency (EMA) have withdrawn approval for other COX-2 inhibitors due to an increased risk of cardiovascular adverse events (AEs); rofecoxib was withdrawn in 2004 due to thrombotic cardiovascular events observed in a randomized controlled trial (RCT) [3]; valdecoxib was withdrawn in 2005 due to skin reactions and cardiovascular events post-coronary artery bypass graft surgery in two RCTs [4, 5]; and lumiracoxib was withdrawn by the EMA in 2007 due to liver toxicity and skin reactions [6], and did not gain full FDA approval.

The proposed mechanism for an increase in occurrence of cardiovascular events is an imbalance between prostacyclin and thromboxane A2 (TXA2), with prostacyclin production decreased but TXA2 continuing, leading to platelet activation, vascular proliferation, vascular contraction and cell adhesion, which may occur within the coronary arteries. COX-2 inhibition leads to reduced PGI2 from the vascular endothelium; thus, the protective effect of PGI2 is lost, which predisposes to injury, cell adhesion and vessel contraction. TXA2 production by platelets is not inhibited by COX-2 inhibitors, which leads to platelet activation and a prothrombotic state (Fig. 1) [7].

Fig. 1
figure 1

Effect of cyclooxygenase (COX) inhibition on platelets and endothelium. NSAID non-steroidal anti-inflammatory drug

There are meta-analyses comparing the relative safety of COX-2 inhibitors with non-selective NSAIDs [8,9,10,11,12,13,14]. However, the objective of this study was to assess the safety of oral COX-2 inhibitors in the management of OA in a systematic review and meta-analysis of randomized, placebo-controlled trials.

2 Methods

The protocol of this systematic review and meta-analysis was previously registered in the PROSPERO database (registration number: CRD42017068278). The systematic review was performed in accordance with the recommendations in the Cochrane Handbook for Systematic Reviews of Interventions [15]. The findings were reported according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines [16]. All the review process (study selection and risk of bias assessment) was undertaken using Covidence, the Cochrane platform for systematic reviews, and was performed by EC, NF, SS and LS.

2.1 Eligibility Criteria

Randomized, double-blind, placebo-controlled, parallel-group trials that have assessed the AEs associated with COX-2 inhibitors (celecoxib, rofecoxib, etoricoxib, valdecoxib but not lumiracoxib as it never gained full FDA approval) in patients with OA were eligible for inclusion in this meta-analysis.

Studies that allowed concomitant anti-osteoarthritis treatments during the trial (other than rescue medication as acetaminophen or aspirin) were also excluded, as were animal trials.

2.2 Data Sources and Search Strategies

A comprehensive literature search was undertaken in the following databases: MEDLINE (via Ovid), Cochrane Central Register of Controlled Trials (Ovid CENTRAL) and Scopus. Each database was searched from inception up to 30 June 2017. We searched for randomized placebo-controlled trials of COX-2 inhibitors in OA, using a combination of study design-, treatment- and disease-specific key words and Medical Subject Heading (MeSH) terms.

While adverse effects were the outcomes of interest for this study, we decided to avoid the outcome-specific key words in the search strategies, because of the possibility that a study on the efficacy of a drug may have not mentioned terms related to adverse events in its title, abstract or in the keyword section. The search was limited to English and French publications and to human subjects. Detailed search strategies for MEDLINE/CENTRAL and Scopus databases are reported as Electronic Supplementary Material (ESM1).

Two clinical trial registries, ClinicalTrials.gov (clinicaltrials.gov/) and the World Health Organization’s International Clinical Trials Registry Platform Search portal (apps.who.int/trialsearch/) were also checked for trial results that were unpublished. Finally, recent meta-analyses were also screened for any additional relevant studies.

2.3 Study Selection

Two members of the review team independently evaluated each title and abstract to exclude only obvious irrelevant studies, according to the predefined eligibility criteria. At this step, the criteria related to adverse effects was not considered for selection, as studies focusing on the efficacy of a treatment may not report data about adverse effects in the abstract; this means that all trials mentioning only the efficacy information were retrieved at this step. After this first step, the two investigators independently reviewed the full text of each of the articles not excluded during the initial screening stage to determine whether the studies met all selection criteria. At this stage, studies were excluded due to previously unidentified duplication, conference abstracts alone being available, a non-placebo comparator being used alone against COX-2 medication in the trial, an indication other than OA, safety not being included as an outcome of the trial, a non-COX-2 intervention or incorrect study design. All differences of opinion regarding the selection of articles were resolved through discussion and consensus between the two investigators; any persistent disagreement was solved with the intervention of a third person (another member of the review team).

2.4 Data Extraction

The full texts of the selected studies were screened by independent reviewers for extraction of relevant data, using a standard data extraction form. Outcome results data were independently extracted by two investigators from the review team. For each study, the following data were extracted: characteristics of the manuscript, characteristics of the trial, objective and design of the study, characteristics of the patients, characteristics of the disease, characteristics of the treatments, AEs (outcomes) reported during the trial and the main conclusion of the study. In the case of multiple dosage arms for COX-2 inhibitors being included in a trial, the maximum dose was used to categorize the study. If multiple follow-up times were included, the longest follow-up time was used to categorize the study. The raw data (number of events in each group) were extracted for each outcome. The number of patients who experienced at least once any body-system–related AE (e.g. nervous system, gastrointestinal system), as well as AEs within each body system (e.g. headache, abdominal pain) were extracted. As much as possible, data from the intention-to-treat (ITT) analysis were considered.

2.5 Outcomes of Interest

The main System Organ Classes (SOCs) that are likely to be affected by the use of COX-2 inhibitors in the treatment of OA were explored in this meta-analysis. The primary outcomes of interest were gastrointestinal disorders, cardiac disorders, vascular disorders, nervous system disorders, skin and subcutaneous tissue disorders, hepatobiliary disorders, renal and urinary disorders, as well as overall severe and serious AEs, drug-related AEs and mortality. Secondary outcomes were withdrawals because of AEs (i.e. the number of participants who stopped the treatment because of an AE) and total number of AEs (i.e. the number of patients who experienced any AE at least once).

2.6 Assessment of Risk of Bias in Included Studies

Two authors of the review team independently assessed the risk of bias in each study using the Cochrane Collaboration’s tool for risk of bias assessment [15]. The following characteristics were evaluated:

  • Random sequence generation: we assessed whether the allocation sequence was adequately generated.

  • Allocation concealment: we assessed the method used to conceal the allocation sequence, evaluating whether the intervention allocation could have been foreseen in advance.

  • Blinding of participants and personnel: we assessed the method used to blind study participants and personnel from knowledge of which intervention a participant received and whether the intended blinding was effective.

  • Blinding of outcome assessment: we assessed the method used to blind outcome assessors from knowledge of which intervention a participant received and whether the intended blinding was effective.

  • Incomplete outcome data: we assessed whether participants’ exclusions, attrition and incomplete outcome data were adequately addressed in the paper.

  • Selective outcomes reporting: we checked whether there was evidence of selective reporting of adverse events.

Each of these items was either categorized as ‘low risk of bias’, ‘high risk of bias’, or ‘unclear risk of bias’. ‘Low risk of bias’ or ‘high risk of bias’ was attributed to an item when there was sufficient information in the manuscript to judge the risk of bias as ‘low’ or ‘high’; otherwise, ‘unclear risk of bias’ was attributed to the item. Disagreements were solved by discussion between the two reviewers during a consensus meeting and involved, when necessary, another member of the review team for the final decision.

2.7 Data Analysis

Analyses were performed using STATA 14.2 software. The units of analysis were the number of participants experiencing a specific adverse event. We described harms associated with the treatment as risk ratio with 95% confidence interval (95% CI). We computed an overall effect size for each primary or secondary outcome (AE). Anticipating substantial variability among trial results (i.e. the inter-study variability), we assumed heterogeneity in the occurrence of the AEs; thus, we planned to use random-effects models for the meta-analyses. We estimated the overall effects and heterogeneity using the DerSimonian and Laird random-effects model [17]. As this method provides a biased estimate of the between-study variance with sparse events [18, 19], we also performed the meta-analyses using the Restricted Maximum Likelihood (REML) method [20]. As rofecoxib was withdrawn by the FDA and EMA in 2004 due to thrombotic cardiovascular events, we performed a sensitivity analysis for AEs for the COX-2 inhibitor class minus rofecoxib.

We tested heterogeneity using the Cochran’s Q test. As we are performing a random-effect meta-analysis, we used the Tau-squared (τ2) estimate as the measure of the between-study variance. The I-squared (I2) statistic was used to quantify heterogeneity, measuring the percentage of total variation across studies due to heterogeneity [21]. The quality of each evidence was assessed using the GRADE approach [22] and a summary of findings table was prepared using GRADEpro online software [23].

3 Results

3.1 Study Selection

Database searches initially identified 2149 records. After exclusions, 73 articles were screened in full against the inclusion criteria. A flowchart (Fig. 2) with the number of included studies at each step was established, including the reasons for excluding studies during the full-text reading process. Forty-one of these met the eligibility criteria and included a placebo comparator, but a further five were excluded as the specific outcomes of interest to our strategy were not specified. Thirty-six papers were included, which actually comprised 40 trials as four papers presented results from two studies [24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58].

Fig. 2
figure 2

Flowchart of the study selection process

3.2 Study Characteristics

Table 1 presents the characteristics of the studies included through the systematic review process. The year of publication of the included studies ranged from 1999 to 2017 and the follow-up time ranged from 6 weeks to 24 months. The number of trials including an arm for each specific COX-2 inhibitor (or two arms if two COX-2 inhibitors were included) were as follows: celecoxib 20 (49%), etoricoxib 6 (15%), rofecoxib 10 (24%), celecoxib and etoricoxib 4 (10%), celecoxib and rofecoxib 1 (2%). The anatomic regions included per trial are as follows; knee 24 (59%), hip 2 (5%), knee or hip 13 (32%), any OA 1 (2%), not specified 1 (2%).

Table 1 Characteristics of the studies included through the systematic review process

3.3 Risk of Bias of Individual Studies

Figures 3 and 4 include a summary of the risk of bias assessed for each study included in the meta-analysis and all our findings were associated with a ‘moderate’ to ‘high’ certainty of evidence.

Fig. 3
figure 3

Risk of bias summary: review of authors’ judgements about each risk of bias item for each included study

Fig. 4
figure 4

Risk of bias graph: review of authors’ judgements about each risk of bias item presented as percentages across all included studies

3.4 Primary Outcomes

We reported only the results from the DerSimonian and Laird random-effects model, because we found no difference in the effects computed by the two methods.

3.4.1 Gastrointestinal Complications

The relative risk of upper gastrointestinal complications overall, including ulcer-related events, dyspepsia and abdominal pain, was significantly increased with COX-2 inhibitors versus placebo (RR 1.19, 95% CI 1.03–1.38; I2 = 0%) (Fig. 5). The risk of dyspepsia (RR 1.08, 95% CI 0.90–1.30; I2 = 0%) and ulcer-related AEs (RR 2.08, 95% CI 0.58–7.46; I2 = 0%) was not significantly increased in the COX-2 inhibitor group; however, the risk of abdominal pain increased significantly, by 40%, with COX-2 inhibitors (RR 1.40, 95% CI 1.08–1.80; I2 = 0%).

Fig. 5
figure 5

Relative risk of upper gastrointestinal adverse events (ulcer-related events, dyspepsia, abdominal pain) for COX-2 inhibitors versus placebo. CI confidence interval, COX-2 cyclooxygenase-2, RR relative risk

There was no increase in the relative risk of nausea and vomiting (RR 0.96, 95% CI 0.75–1.22; I2 = 0%) nor any significant difference in the risk of constipation (RR 1.00, 95% CI 0.58–1.75; I2 = 0%) between the COX-2 inhibitors and placebo group (ESM2).

3.4.2 Cardiovascular Events

The risk of hypertension increased significantly, by 45% overall across the 15 studies, including celecoxib, rofecoxib and etoricoxib (RR 1.45, 95% CI 1.01–2.10; I2 = 25%) (Fig. 6). However, when rofecoxib was removed from the analysis, the risk of hypertension in the COX-2 inhibitor group was no longer significant (RR 1.21, 95% CI 0.80–1.83; I2 = 20%) (ESM2).

Fig. 6
figure 6

Relative risk of hypertension adverse events for COX-2 inhibitors versus placebo. CI confidence interval, COX-2 cyclooxygenase-2, RR relative risk

The incidence of heart failure (HF)-related events was reported differently across the studies; thus, a grouped analysis was performed. The overall risk of HF and edema was increased with COX-2 inhibitors versus placebo (RR 1.68, 95% CI 1.22–2.31; 0%) (Fig. 7). The overall increase in risk of around 70% was not changed appreciably when rofecoxib studies were excluded (RR 1.67, 95% CI 1.21–2.29; 0% for celecoxib and etoricoxib only) (ESM2).

Fig. 7
figure 7

Relative risk of edema (peripheral and generalized) and heart failure adverse events for COX-2 inhibitors versus placebo. CI confidence interval, COX-2 cyclooxygenase-2, RR relative risk

The risk of congestive HF (CHF) was not significantly increased (RR 1.18, 95% CI 0.24–5.71; p = 0.944; I2 = 0%). The risk of peripheral edema was increased by 61% (RR 1.61, 95% CI 1.09–2.40; I2 = 0%) and the risk of generalized edema increased by 91% (RR 1.91, 95% CI 1.08–3.39; I2 = 0%).

3.4.3 Other Primary Outcomes

Central nervous system (CNS) AEs were grouped into dizziness, headache and other (incorporating insomnia, depression and psychiatric disorders). No appreciable difference in the risk of CNS AEs was observed between the placebo and COX-2 inhibitor groups (RR 0.94, 95% CI 0.84–1.04; 0%) (ESM2). Neither was there any increase in the risk of dermatological AEs (rash or pruritis) (RR 1.02, 95% CI 0.76–1.37; 0%) (ESM2). Too few studies reported on renal and hepatic AEs and mortality for these to be included in the meta-analysis.

The risk of drug-related AEs was significantly increased with COX-2 inhibitors versus placebo (RR = 1.26, 95% CI 1.09–1.46; I2 = 24%) (Fig. 8). There was no significant difference in risk of serious AEs (SAEs) with COX-2 inhibitors versus placebo, although a lower rate of SAEs was recorded (RR 0.76, 95% CI 0.48–1.19; I2 = 21%) (ESM2).

Fig. 8
figure 8

Relative risk of drug-related adverse events for COX-2 inhibitors versus placebo. AE adverse event, CI confidence interval, COX-2 cyclooxygenase-2, RR relative risk

3.5 Secondary Outcomes

Overall, there was no increase in total risk of AEs between the COX-2 inhibitor group and the placebo group, even with the highest dose of COX-2 inhibitor included in the analysis (RR 1.03, 95% CI 0.97–1.10; I2 = 66%) (ESM2). Too few studies reported on withdrawals due to AEs for this to be included in the meta-analysis.

3.6 GRADE Assessment of Findings

We assessed the certainty of evidence for each primary or secondary outcome for COX-2 inhibitors compared with placebo, using the GRADE approach [22]. Our findings were associated with ‘moderate’ to ‘high’ certainty of evidence. Table 2 summarizes the significant findings while Table 3 summarizes the non-significant findings for the safety outcomes assessed in this meta-analysis.

Table 2 Summary of significant safety findings for COX-2 inhibitors versus placebo in patients with osteoarthritis
Table 3 Summary of non-significant safety findings for COX-2 inhibitors versus placebo in patients with osteoarthritis

4 Discussion

Overall, our analysis found a significant increased risk of drug-related AEs with COX-2 inhibitors compared with placebo. While it is widely accepted that NSAIDs with COX-2 selectivity are associated with less gastrointestinal toxicity compared with non-selective NSAIDs, nonetheless, our study found a 19% increase in upper gastrointestinal AEs overall with COX-2 inhibitors compared with placebo, and a 40% increase in risk of abdominal pain. This finding is in line with another recent meta-analysis from Bhala et al., that found an increase in upper gastrointestinal complications with COX-2 inhibitors, albeit a lower risk than that with the non-selective NSAIDs ibuprofen and naproxen, but similar to the risk with diclofenac [11]. A retrospective pooled analysis of 21 RCTs of 9461 patients aged ≥ 65 years with OA, rheumatoid arthritis or ankylosing spondylitis that examined the incidence of gastrointestinal AEs (abdominal pain, constipation, diarrhea, dyspepsia, flatulence, nausea) with celecoxib versus non-selective NSAIDs found a significantly lower incidence of gastrointestinal intolerability reported with celecoxib than naproxen, ibuprofen or diclofenac (p < 0.001) [59].

As may be expected with COX-2 inhibitors, we found a significant increase in cardiovascular AEs; specifically, a 45% increased risk of hypertension that decreased to 21% when rofecoxib was excluded from the analysis. Nonselective NSAIDs and COX-2 selective inhibitors increase blood pressure in both normotensive subjects as well as in patients with hypertension. The mechanism for the increase in blood pressure is most likely due to their impact on vasoactive endothelium-derived factors, particularly via the inhibition of prostaglandin synthesis, important for the regulation of vascular tone and sodium excretion [60]. The comparative effect of rofecoxib, celecoxib and naproxen on ambulatory blood pressure has been studied in 400 patients with arterial hypertension, diabetes mellitus and OA. Rofecoxib, but not celecoxib and naproxen, significantly increased 24-hour systolic blood pressure after 6 weeks of therapy [61].

Elevated arterial hypertension is a major risk factor for stroke, ischemic heart disease and HF [62]. We found a 68% overall increase in risk of CHF and edema (peripheral and generalized) with COX-2 inhibitors, which was not reduced when rofecoxib was excluded from the analysis. A systematic review and meta-analysis assessed the risk of developing incident HF with the use of NSAIDs for any indication, finding a higher risk of developing HF with NSAIDs, which was significantly elevated with non-selective NSAIDs, but not with COX-2 inhibitors [63]. The rates of hospital admission for CHF with rofecoxib, celecoxib and NSAIDs have been compared in 150,000 individuals aged ≥ 65 years. Users of rofecoxib and NSAIDs, but not celecoxib, were associated with a higher incidence of admission than non-NSAIDs users [64].

In the Prospective Randomized Evaluation of Celecoxib Integrated Safety vs Ibuprofen Or Naproxen (PRECISION) trial, celecoxib was found to be non-inferior to naproxen or ibuprofen for the primary composite outcome of cardiovascular death (including hemorrhagic death), nonfatal myocardial infarction or nonfatal stroke [65]. In a subgroup analysis of OA patients, fewer major cardiovascular AEs were observed in patients treated with celecoxib compared with ibuprofen [66]. This confirms the findings of a Japanese prospective observational study of 10,529 patients with OA or RA prescribed celecoxib or an NSAID, in which celecoxib was not shown to be associated with increased cardiovascular risk in comparison with NSAIDs [67].

We found no significant increase in the risk of total AEs or serious AEs with the use of COX-2 inhibitors (Table 3). Neither was there an increase in the rate of constipation, nausea and vomiting, CNS AEs or dermatological AEs with COX-2 inhibitors versus placebo. Insufficient data on renal and hepatic events or death, or withdrawals due to AEs were reported to include in this meta-analysis.

4.1 Limitations

Around half of the studies identified from the literature search that met the inclusion criteria did not provide AE data suitable for inclusion in the meta-analysis. Included studies were of short duration, that is, 75% of studies were between 2 and 26 weeks’ duration. Many studies lacked detail on how AEs were collected, with many relying on patient self-reporting. The grouping of AEs could have led to lack of resolution, and possible double counting. In combining different drugs and doses into one meta-analysis, we chose the highest dose when multiple doses were presented in a trial, which could have led to an exaggeration of AEs. Sensitivity analysis of the two approaches (highest dose vs multiple dose) revealed only marginal differences in the magnitude of the outcome. Our analysis was limited to studies in OA patients only; thus, safety issues could be missed in relevant subgroups such as in patients after coronary artery bypass graft surgery, or in patients with rheumatoid arthritis.

5 Conclusions

Along with non-selective NSAIDs, the use of COX-2 inhibitors in OA is associated with a significant increased risk of drug-related AEs compared with placebo. Specifically, we found an increased risk of upper-gastrointestinal AEs with COX-2 inhibitors, especially abdominal pain. We also found an increased risk of cardiovascular AEs with COX-2 inhibitors, namely hypertension, heart failure and edema. Our results confirm that a cautious approach to the use of NSAIDs and COX-2 inhibitors for the management of pain and inflammation in OA is advisable, with selection of treatment tailored to the individual patient characteristics. In OA patients at increased risk of gastrointestinal AEs, COX-2 inhibitors with the addition of a proton pump inhibitor may be used in preference to a non-selective NSAID; while for patients at increased risk of cardiovascular AEs, the use of COX-2 inhibitors should be avoided. Low-dose celecoxib (200 mg/day) may be the preferred NSAID due to its lower propensity for gastrointestinal and cardiovascular toxicity. To maximize the risk : benefit of NSAIDs, including COX-2 inhibitors, the European Society for Clinical and Economic Aspects of Osteoporosis, Osteoarthritis, and Musculoskeletal Diseases (ESCEO) recommends intermittent or longer cycles of NSAID use rather than chronic treatment in order to minimize safety concerns [68].