FormalPara Key Points

This study suggests that there is no association between exposure to golimumab and an increased risk of prespecified outcomes.

The results of this study are consistent with golimumab’s overall safety profile and generally comparable with observations from other studies in patients with treated rheumatic disease.

1 Introduction

Golimumab (Janssen Biotech, Inc.) is a fully-human tumor necrosis factor (TNF)-alpha inhibitor indicated for the treatment of rheumatoid arthritis (RA), psoriatic arthritis (PsA), ankylosing spondylitis (AS), and ulcerative colitis in the USA, with additional indications for non-radiographic axial spondyloarthritis and polyarticular juvenile idiopathic arthritis in the European Union. Golimumab was first approved in the USA in 2009 for once-monthly subcutaneous administration in RA, PsA, and AS and in 2013 for intravenous administration in RA.

RA is characterized by chronic, progressive inflammation of the joints that may lead to deformity, disability, and in some cases, premature death [1, 2]. The overall prevalence of RA ranges between 0.3 and 1% among adults worldwide and is more common with increased age and among women [3, 4]. AS and PsA are related chronic inflammatory arthritides that are distinct from and less prevalent than RA. The goals of rheumatic disease management are to decrease pain, to prevent or control joint damage, and to prevent loss of function [4]. Current treatments include nonsteroidal anti-inflammatory drugs, corticosteroids, and disease-modifying anti-rheumatic drugs (DMARDs) [5,6,7]. Tumor necrosis factor inhibitors (TNFi) were the first biologic DMARDs (bDMARDs) approved for the treatment of these conditions and have revolutionized pharmacologic treatment by preventing disease progression [8,9,10].

Because of inherent immunological abnormalities associated with rheumatic disease and the immunosuppressive effects of therapeutic agents, patients may be at an increased risk of infection, hematologic malignancies, and other autoimmune diseases [11,12,13]. The incidence of infections requiring hospitalization or parenteral antibiotics in RA patients prior to the introduction of bDMARDs is estimated to be in the range of 0.02–0.12 per person-year, with infection-related deaths 6–9 times higher in RA patients compared to the general population [13, 14]. Risk factors of infection in RA patients include age, extra-articular manifestations, leukopenia, comorbidities, use of corticosteroids, skin breakdown, and joint surgery [14, 15].

Case reports and post-marketing data have documented the occurrence of serious and opportunistic infections in patients treated with bDMARDs [13, 16,17,18,19,20,21,22,23], which should not be used in patients with active infections [14]. However, not all studies have found an increased risk associated with their use [13]. As per current prescribing guidelines, use of bDMARDs is contraindicated in patients with severe active infections, and patients receiving these agents should be monitored for signs and symptoms of infection [14]. In addition to infections, malignancy, autoimmune conditions, hepatotoxicity, hepatitis B reactivation, and heart failure have been observed with use of TNFi [14].

Clinical trials of patients with RA, PsA, and AS have shown that golimumab is effective and generally well tolerated [24,25,26,27]. Due to the rarity of some outcomes and potential differences between patients enrolled in clinical trials and those under standard care, comprehensive assessments of risks associated with golimumab, a newer TNFi, among large populations reflecting routine practice are warranted. In this study, we aimed to provide a comprehensive overview of the safety of golimumab using claims data from a large US health insurer along with validation of outcomes using linked medical records and National Death Index (NDI) records. This study question is important as there is a lack of agent-specific safety data. We sought to estimate the risks of prespecified outcomes in a cohort of patients with RA, PsA, or AS initiating golimumab compared to a cohort of similar patients initiating non-biologic systemic (NBS) treatments, which is considered the standard of care and preferred comparator for biologic treatment [1, 2, 9].

2 Patients and Methods

2.1 Data Source

This prospective, observational, cohort study was conducted in the Optum Research Database (ORD), a proprietary research database containing claims and enrollment data dating back to 1993 for approximately 14 million members of a large US health plan with both medical and pharmacy benefit coverage and an average of 2.5 years of enrollment. The medical and pharmacy claims for these individuals form a longitudinal record of reimbursed medical services, irrespective of treatment site, along with detailed information on drug dispensing. The database is routinely updated and maintained. Access to a subset of patients’ medical charts (i.e. approximately 35% of all patients within the ORD) allows for confirmation of outcomes identified through claims. The underlying insured population is geographically diverse across the USA, representing approximately 4% of its population.

This study was approved by the New England Institutional Review Board and Privacy Board.

2.2 Patient Population

Patients who initiated golimumab or NBS treatments (methotrexate, azathioprine, cyclosporine, penicillamine, hydroxychloroquine, sulfasalazine, apremilast, tofacitinib, leflunomide, or gold compounds) were accrued from 24 April 2009 (date of first availability of golimumab in the USA) until 30 November 2014. The date of cohort entry (i.e. index date) is the date of the first claim of the cohort-defining drug with no previous claim of that specific drug in the prior 6 months (baseline period). Eligible patients included those of any age with complete medical coverage and pharmacy benefits who were continuously enrolled for at least 6 months prior (baseline period) to the index date.

The operational definition of an initiator included patients who were naïve to antirheumatic treatments, those who switched to the cohort-defining therapy from another antirheumatic treatment, and those who started the cohort-defining therapy as add-on treatment to existing therapy. For example, golimumab initiators included patients who began therapy with golimumab as the first prescription treatment for rheumatic disease observed in the claims database, and those who switched from or added to another prescription rheumatic disease treatment regimen. The cohort for NBS treatment initiators may include, for example, methotrexate initiators without any other antirheumatic treatment in baseline, and methotrexate initiators without prior methotrexate use but with prior use of another antirheumatic (including other NBS and biologic) treatment during baseline.

Initiators were required to have at least one claim for RA, PsA, or AS [International Classification of Diseases (ICD), 9th Revision diagnosis codes 714.xx, 696.0x, and 720.0x, respectively] in the 6 months before or up to 3 months following the index date.

2.3 Propensity Score Modeling and Matching

For the prediction model discriminating golimumab initiators from patients in the NBS cohort, using claims and membership data, a set of potential a priori confounders in addition to the 100 most frequently occurring diagnoses, procedures, and drug dispensing were included in the logistic regression model for the propensity score calculation. Age, sex, region, number of days enrolled before index date, calendar time, cost (total and associated with rheumatic disease), and healthcare utilization variables were forced into the model. Additional predictors were retained based on a stepwise regression technique with an entry criterion p-value of ≤ 0.10 and an exit criterion p-value of ≥ 0.30.

The golimumab cohort was matched by propensity score to NBS initiators using a standard greedy matching algorithm [28,29,30]. In order to achieve adequate power in the context of a medication with a large pool of comparators, the final cohort included up to 4 matched comparators for each golimumab patient, with the comparison cohort member selected at random from all potential comparators whose propensity scores were within the variable caliper applied by the matching algorithm that allows a maximum difference of 0.10 to that of the golimumab patients.

2.4 Outcome Identification and Follow-Up

Prespecified outcomes of interest were identified by a claim for an outpatient physician visit or inpatient hospitalization associated with the specific disease, procedure, or drug code as listed on claims for medical services. These outcomes include:

  • serious infection (i.e. requiring hospitalization or parenteral antibiotic therapy)

  • tuberculosis (TB)/non-TB mycobacterial infection

  • malignancy

  • lymphoma

  • systemic hypersensitivity

  • congestive heart failure (CHF)

  • hepatotoxicity

  • new onset hypertension (identified among those without a baseline claim for hypertension)

  • autoimmune disease

  • hematologic reaction

  • depression

  • mortality.

See Electronic Supplementary Materials (ESM) Appendix 1 for codes that were used to identify these outcomes.

Outcome identification began the day after cohort entry through the earliest of: (1) health plan disenrollment; (2) death; or (3) 30 September 2015. For systemic hypersensitivity and mortality, follow-up began on the index date to identify those outcomes that occurred immediately after exposure. Patients were censored for subsequent occurrences of the same outcome but were eligible for others. Person-time was calculated from the day after index date (or on the index date for systemic hypersensitivity and mortality) through end of follow-up.

2.5 Medical Record Review and Refinement of Outcome Definitions

Medical record review was undertaken to confirm potential outcomes identified in the claims data between 24 April 2009 and 31 May 2014. Medical records were adjudicated by a panel of 3 clinicians to determine whether each potential case met the clinical definition of the outcome. Using the results from adjudication, a positive predictive value (PPV) was calculated for each outcome as the number of confirmed cases divided by the number of charts adjudicated, excluding those cases with insufficient information to determine case status.

Outcomes with high PPVs (> 75%) were included in analyses using the original claims definition. In order to improve the PPV and increase the likelihood of identifying true cases, the claims definitions for outcomes with low-to-moderate PPVs (≤ 75%) were revised by requiring the codes (diagnosis, procedure, drug) used to identify each outcome to appear in the principal (primary) position on an inpatient or outpatient claim. PPVs were recalculated for the revised definition of each outcome as the number of confirmed cases with codes in the primary position on the healthcare claim divided by the total number of confirmed cases and non-cases with codes in the primary position and were included in analyses using the revised definition.

2.6 National Death Index

The NDI is considered a gold standard mortality assessment in studies based on health insurance claims data, and a NDI Plus search (including fact, date, and causes of death) was conducted on all golimumab initiators and matched comparators who disenrolled from the health plan during the study period with no evidence of re-enrollment.

After restricting to patients considered to have a high probability of being a true match based on an algorithm developed by the NDI, deaths occurring within 60 days of health plan disenrollment were selected as cases. Since exposure information is only available during enrolled time and deaths occurring after disenrollment may have been exposed to other medications, a 60-day window was chosen as a trade-off between completeness of mortality identification and uncertainty regarding medication exposure. Cause-specific mortality outcomes were identified from the NDI data using ICD-10 coded causes of death.

The mortality outcome used in statistical analyses included those deaths identified in the claims data through end of follow-up with an NDI search used to validate vital status for these patients.

2.7 Statistical Analyses

All analyses used SAS (Statistical Analysis Software™) version 9.2 (SAS Institute).

For non-cancer outcomes, the primary analysis was the as-treated analysis, where incidence rates (IR) per 1000 person-years and incidence rate ratios (IRR) were estimated during periods of current drug use and non-use among the matched golimumab and matched NBS cohorts. Drug exposure was characterized based on claims records during follow-up, including date of dispensing and days supplied or recommended number of days between injections/infusions. A 60-day “grace period” was added to account for medication non-adherence and the uncertainty surrounding the duration of biologic effect for these drugs.

  • Current use: dispensing/injection/infusion date + days supplied or recommended number of days between injection/infusion + 60 days. With each new dispensing/injection/infusion, a patient continues or reinitiates on-therapy current status

  • Non-use: days during which patients are not using the study drug(s) of interest, i.e. neither current nor prior use of the drug(s) in question during the follow-up. Person-time attributed to non-use of golimumab came from the matched NBS cohort before any golimumab was dispensed during follow-up; and vice versa, person-time attributed to non-use of NBS came from the matched golimumab cohort before any NBS was dispensed during follow-up.

Multivariable Poisson regression was used to estimate the IRR of non-cancer outcomes during periods of current drug use compared with non-use. Each model was stratified by matching ratio (number of NBS matches for each golimumab initiator) and adjusted for age and sex.

For cancer outcomes, the primary analysis was the as-matched analysis where IRs per 1000 person-years were calculated based on the cohort-initiating drug. In the as-matched analysis, Cox proportional hazards regression was used to compare golimumab to NBS initiators in the matched cohorts. Models for all outcomes were stratified by matching ratio and adjusted for propensity score. To avoid overfitting or lack of model convergence, the propensity score was used for adjustment since the number of covariates that could be included in the Cox proportional hazards models was limited.

To address the different potential mechanisms of cancer occurrence, the incidence of malignancy and lymphoma was estimated using a cumulative dose measure (considered a secondary analysis for cancer outcomes) in the matched cohorts. With each claim for the dispensing of a study drug during follow-up, cumulative dose was updated, and patients were categorized into mutually exclusive categories of dose (i.e. 1–5, 6–10, and ≥ 11 dispensings). If a patient had a cancer that occurred during follow-up, it was counted within the dose category it occurred, and the patient was no longer at risk for that outcome. Similar to the as-treated analysis, the Poisson regression models were stratified by matching ratio and adjusted for age and sex.

As an additional assessment of medication exposure in relation to occurrence of outcomes, a nested case–control (NCC) analysis was performed among propensity score-matched cohorts. Cases were defined as the first outcome identified during follow-up with no baseline claims for the outcome. Each risk set included all individuals from the propensity score-matched quintuplet of the larger drug cohorts who were still at risk of the event (controls). After excluding those controls with baseline claims for a specific outcome, for each case, up to 2 controls enrolled in the health plan were randomly selected among the cohort members in the risk sets defined by the case (matching preserved), and a case/control date was assigned as the outcome date of the matched case.

Exposure to study medications was assessed among cases and controls during the 6 months prior to the case/control date, and age, sex, region, and calendar time were assessed on this date. Cases and controls were compared with respect to study drug exposure (golimumab and NBS), with multivariable-adjusted odds ratios (OR) generated from conditional logistic regression models (conditioned on the matched set) and adjusted for age, sex, region, and calendar time.

2.8 Sensitivity Analyses

Three sensitivity analyses were conducted. First, as-treated and as-matched analyses were repeated using a narrowed definition of depression where initiators with a baseline claim of a diagnosis of depression or dispensing of an antidepressant were excluded from each matched cohort. During follow-up, the first occurrence of a diagnosis for depression or antidepressant claim was identified. If both a diagnosis and dispensing occurred within 14 days of one another, then the date of the later diagnosis or dispensing was assigned as the outcome date.

Second, residual confounding in the propensity score model by covariates not included in the propensity score model or directly ascertainable in claims data (e.g. disease severity, duration) was assessed. A standard epidemiological spreadsheet was used to assess the robustness of the results in the presence of an unmeasured confounder [31]. The results of the primary analyses and ranges of association between an unmeasured confounder and disease outcome were entered into the spreadsheet and used to calculate the difference from the observed result if the unmeasured confounder was accounted for. In addition to using the adjusted IRR from the primary analyses, the value of the observed lower 95% CI was also plotted to determine the range in which the 95% CI would cross the null. The prevalence of the unmeasured confounder was fixed at 40%, which provided close to maximum potential confounding in many situations while the prevalence of the exposure was calculated as the number of person-years during current use divided by the number of person-years during non-use. The resulting figures include the range of unmeasured confounder characteristics assuming varying levels of strength of the confounder-disease association and strength of association with current golimumab use that would result in a change in the point estimates. Detailed assumptions for conducting this analysis are described in ESM Appendix 2.

For the last sensitivity analysis, another standard epidemiological spreadsheet was used to adjust primary analysis point estimates by exploring different pairs of sensitivity and specificity values in order to assess different degrees of nondifferential misclassification [32]. To assess the potential effect of nondifferential exposure misclassification, sensitivity and specificity were set to 99, 95, and 90%, respectively, to assess the effect on primary as-treated analysis point estimates in the golimumab cohort matched to NBS. Similarly, to determine the potential effect of nondifferential outcome misclassification, sensitivity was assumed to be 99% while specificity was defined as a transformation of the PPV based on contingency table analysis. The original PPV was used for malignancy, lymphoma, and autoimmune disease while the revised PPV was used for the remaining outcomes. A PPV of 50% was used to estimate misclassification bias for mortality. The specificity was varied by ± 5% to assess the effect on primary analysis point estimates across a plausible range of alternate specificity values. Detailed assumptions for conducting this analysis are described in ESM Appendix 3.

3 Results

3.1 Cohort Characteristics

From 24 April 2009 through 30 November 2014, 1515 golimumab (97.0% dispensed subcutaneous formulation) and 48,975 NBS initiators were identified, and 1337 golimumab initiators (88.3% of those eligible) were matched to 4227 NBS initiators (3.2:1 matching ratio).

Prior to matching, differences were present between the golimumab and NBS initiators with respect to baseline characteristics (Table 1). Afterwards, balance improved on most of these variables (Table 2), with absolute standardized differences generally less than 0.1. A higher proportion of matched golimumab initiators than NBS initiators had ≥ 4 different treatments for rheumatic disease (33.7% vs 28.5%, standardized difference = 0.11). Both disease-specific and total healthcare utilization costs were also higher in the golimumab group (standardized differences = 0.11).

Table 1 Distribution of select baseline characteristicsa for the golimumab and non-biologic cohorts before propensity score matching, identified 24 April 2009 through 30 November 2014, Optum Research Database
Table 2 Distribution of select baseline characteristicsa for the golimumab and non-biologic cohorts after propensity score matching, identified 24 April 2009 through 30 November 2014, Optum Research Database

3.2 Outcome Chart Abstraction and Adjudication Results

Medical records were obtained and abstracted for 777 of 917 subjects with potential study outcomes (84.7%). Of these, 384 were assessed to have definite evidence supporting the diagnosis of the outcome while 364 did not contain supporting evidence. Determination of case status was not possible for 29 charts given the information available in the medical record.

The PPVs for malignancy, lymphoma, and autoimmune disease were > 75% (Table 3) so the original claims definitions for these outcomes were used for analyses. Claims definitions were refined for the remaining outcomes with moderate (30–75%) or low (< 30%) PPVs to require that the codes used to identify them occur in the primary position. Revisions resulted in increases in the PPV for outcomes other than serious infection, systemic hypersensitivity, and new onset hypertension, whose PPVs remained largely unchanged (Table 3).

Table 3 Results of chart abstraction and adjudication including calculation of PPVs, outcomes identified 24 April 2009 through 31 May 2014, Optum Research Database

3.3 NDI Results

Table 4 lists the number of NDI-identified death matches occurring within 60 days of health plan disenrollment for the matched golimumab and NBS cohort and the unmatched golimumab cohort by ICD-10 category. Person-time was calculated from the index date to the death date provided by the NDI or from the index date to the earliest of end of enrollment or data cut-off date (30 September 2015) plus 60 days, and crude IRs were calculated in order to compare the cohorts.

Table 4 NDI ICD-10 Causes of death in matched golimumab and non-biologic cohorts and in the unmatched golimumab cohort, deaths identified 24 April 2009 through 30 September 2015, Optum Research Database

Four deaths were identified in the golimumab cohort matched to NBS for a crude IR of 1.22 per 1000 person-years (4 deaths/3270 person-years), and no deaths were identified in the unmatched golimumab cohort (383 person-years).

3.4 Outcome Analyses

In as-treated analyses, most findings were consistent with no increase in risk of the outcomes in the golimumab cohort (Table 5). Compared to golimumab non-use in the matched NBS cohort, the risk for depression was increased during current golimumab use (RR 1.45, 95% CI 1.31–1.61). There was no association between cumulative dose of golimumab and elevated risk of malignancy and lymphoma (Table 6).

Table 5 Incidence rates and incidence rate ratios of outcomes, as-treated analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database
Table 6 Incidence rates and incidence rate ratios of malignancy and lymphoma outcomes, cumulative dose analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database

As-matched analyses in Table 7 showed a decreased risk for malignancy and mortality in golimumab versus NBS initiators (RR 0.66, 95% CI 0.50–0.87 and RR 0.41, 95% CI 0.17–0.98, respectively). No elevated rate ratios were seen for any of the outcomes in the as-matched analyses, including depression (RR 1.08, 95% CI 0.97–1.19).

Table 7 Incidence rates and incidence rate ratios of outcomes, as-matched analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database

In NCC analyses (Table 8), no associations were observed between golimumab exposure and most outcomes. Golimumab use was associated with a decreased risk of systemic hypersensitivity and mortality (OR 0.57, 95% CI 0.37–0.90 and OR 0.23, 95% CI 0.08–0.67, respectively). No association between golimumab exposure and depression was seen (OR 1.01, 95% CI 0.78–1.31).

Table 8 Odds ratios of outcomes comparing treatment exposures, nested case–control analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database

3.5 Sensitivity Analyses

Results using the restricted definition of incident cases (as opposed to prevalent cases) of depression had corresponding 95% CIs that suggested chance differences (Tables 9 and 10). The results of this sensitivity as-treated analysis reversed those seen in the primary as-treated analysis; the risk of incident depression included the null during current golimumab use compared to golimumab non-use (RR 0.71, 95% CI 0.30–1.69) (Table 9).

Table 9 Sensitivity analysis of incidence rates and incidence rate ratios of depression, as-treated analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database
Table 10 Sensitivity analysis of incidence rates and incidence rate ratios of depression, as-matched analysis, matched treatment cohorts, identified 24 April 2009 through 30 November 2014 with follow-up through 30 September 2015, Optum Research Database

The assessment of the effect of residual confounding (i.e. as a result of an unmeasured or unavailable confounding variable omitted from the propensity score model) indicates that strong risk factors for the outcome that are also not balanced among the matched golimumab and NBS cohorts would be required in order to change the results of the analyses substantially. Most known independent risk factors were already included in the propensity score model; therefore, any unmeasured confounder of sufficient strength to alter findings would also have to be independent of the confounders that were included in the propensity score. See ESM Appendix 2 for outcome-specific results and figures.

In the assessment of nondifferential exposure or outcome misclassification, estimates in the golimumab cohort were only slightly altered, suggesting no change in inferences related to the point estimates. For depression, the observed as-treated result of an increased risk in current golimumab use versus non-use changed substantially when corrected for potential misclassification, suggesting that this result was sensitive to misclassification. See ESM Appendix 3 for outcome-specific results.

4 Discussion

This post-approval observational study assessed the risk of several safety outcomes in a population of golimumab and NBS users over a 6-year period. The as-treated analysis showed no elevation in risk for most of the outcomes associated with use of golimumab compared to NBS, with the exception of depression during current golimumab use. In the cumulative drug exposure analysis for malignancy and lymphoma, no evidence of a dose–response effect for golimumab was observed. In the as-matched analysis that does not account for changes in exposure during follow-up, there were no elevated rates in the golimumab cohort versus the NBS cohort; similarly, no elevated rates were seen in the NCC analysis of golimumab.

A study strength is the use of automated medical claims that reflect routine care within a large, US insurance database, providing generalizability to patients aged < 65 with commercial health insurance. The comprehensive nature of the database means that any billable medical service would be recorded, including rare events. However, there are limitations inherent to using claims databases. Patients were identified based on claims for medication dispensings, but it is not known whether the medication was taken as prescribed. Actual use of the drug is inferred from the dispensings, which may result in exposure misclassification. Sensitivity analyses were conducted to examine the impact of a range of assumed misclassification of exposure on the effect estimates and indicated that nondifferential misclassification of exposure likely biased the effect estimates toward the null or did not change the inference of the point estimates. Also, diagnosis codes used for outcome identification do not necessarily indicate true disease. Accordingly, some degree of outcome misclassification may be present [33]. Medical record confirmation was used to revise the outcome claims definitions applied to the full population, which improved the specificity of outcome definitions.

The PPVs associated with each study outcome reflect the accuracy of the claims data and allow an interpretation of the RR with the knowledge that cases of outcomes with high to moderate PPVs are likely to be real cases of disease. For such outcomes, the results presented would be close to those that might be observed from a study that included confirmation of all study outcomes [34,35,36,37]. Among those outcomes with lower PPVs, the observed RRs in this study might diverge from that of RRs based on confirmed cases, and this divergence would in most cases tend to bias the effect estimate towards the null, potentially obscuring either a positive or negative treatment effect. The original and revised PPVs for depression in this study (42.5 and 71.4%, respectively) were consistent with a range of PPVs identified in a review that used various database algorithms (31.5–98.8%) [38], and with a study that focused on incident depression (48.6%) [33]. The results of the sensitivity analyses indicate that outcome misclassification of the extent indicated by the PPVs would not change effect estimates substantially, with the exception of depression. Additional limitations include limited follow-up time and lack of information on disease duration and disease activity.

Propensity score matching created cohorts that were similar with respect to available baseline data on underlying risk factors related to outcomes, claims proxies for disease activity and severity, healthcare utilization, and other factors that may be considered by clinicians to inform treatment selection. The labelled indications for golimumab do not specify a treatment sequence (such as requiring prior failure of NBS medications), although preferential prescribing (channeling) of sicker patients to biologic agents is plausible and reflected by the inability to find eligible matches for some golimumab patients (n = 178). However, this study accounted for an extensive list of important confounders, and comparison with NBS provides useful evidence for the benefit-risk balance in real-world practice not only for golimumab but also for other biologics when the majority of patients are treated by NBS. A limitation to this approach is the potential for confounding by unmeasured factors which may not be balanced by matching on the variables included in the propensity score model. Of note, a study drawing on medical chart data demonstrated that propensity score matching can lead to balance in potential confounders not directly measurable in claims (e.g. smoking status, BMI) but correlated with covariates included in the propensity score model [30]. A sensitivity analysis was conducted to address residual confounding associated with propensity score modeling and matching, but there are limitations associated with this analysis: (1) it was constrained to one binary confounder, which may not be informative if several confounders were unmeasured and the joint effect was unknown, and (2) it did not provide an assessment of the magnitude of existing residual confounding.

Given the purpose of the study to assess the safety profile of golimumab in the real-world and the value obtained by having a larger patient population and more potential outcomes, combining the different rheumatologic indications was warranted. Specifically, regarding the inclusion of ICD-9 diagnosis codes for juvenile idiopathic arthritis (714.3x), before matching, 5 patients in the golimumab cohort (5/1515 = 0.3%) and 557 patients in the NBS cohort (557/48,975 = 1.1%) had a claim for 714.3x. Among these 562 pre-matched initiators, zero golimumab and 3 NBS patients also had a claim for rheumatoid arthritis (ICD-9 714.0) during the baseline period, indicating little overlap in coding of these 2 conditions. After matching, the numbers with a claim for ICD-9 714.3 × dropped to 4 golimumab (4/1337 = 0.3%) and 16 non-biologic initiators (16/4227 = 0.4%), with zero golimumab and 2 non-biologics also identified with a claim for ICD-9 714.0 during baseline. The small numbers and good balance obtained on this variable means that these patients are unlikely to have any influence on study results.

Several types of analyses were undertaken in order to understand the association between golimumab use and the prespecified study outcomes. Based on the exposure definition used in this study, determining the temporal relationship between golimumab exposure and these outcomes, particularly chronic outcomes (i.e. malignancy and lymphoma) is difficult. Protopathic bias, a form of reverse causation in which treatment is initiated in response to symptoms of undiagnosed disease, cannot be ruled out. Protopathic symptoms of study outcomes may be mistaken for rheumatic disease activity leading to initiation of golimumab or NBS. The resulting association between the study drug and outcome could, in fact, be due to reverse causation, in which the underlying, undiagnosed disease leads to use of the study drug [39].

The current study design did not specifically account for the possibility of golimumab to be used in a manner that might produce protopathic bias. In order to address potential protopathic bias in the analysis of malignancy and lymphoma, a sensitivity analysis was conducted where person-time in the as-matched analysis that is particularly susceptible to protopathic bias (the first 6 months of exposure time) was removed, and the effect estimates recalculated [40]. Results from this sensitivity analysis were consistent with the primary findings, suggesting no strong protopathic bias effect (data not shown).

An elevated rate of depression during current golimumab use was observed in the as-treated analysis, but not in the as-matched or NCC analyses. Sensitivity analyses indicated depression is sensitive to misclassification, so this result should be interpreted cautiously. Furthermore, as patients with a baseline medical history of depression were excluded from the NCC and sensitivity analyses, the increased risk seen in the as-treated analysis may be related to an imbalance in baseline history of depression. Due to the design of the study, which focused on baseline balance of patient characteristics, the effect of changes in severity or treatment for depression after initiation of golimumab were not analyzed.

Assessment of depression in insurance claims databases presents challenges (e.g. patients underreporting symptoms due to stigma of mental illness, failure of physicians to recognize or treat depression). The definition of depression used here was comprehensive (i.e. any inpatient or outpatient claim with a diagnosis code in the primary position or a dispensing for an antidepressant) and captured more patients at the expense of generating false positive diagnoses. When a more stringent definition of depression that required a hospitalized diagnosis in the primary position was used, 3 cases were identified in the matched golimumab cohort among 1969 person-years (crude IR = 1.52 per 1000 person-years) and 19 in the NBS cohort among 6254 person-years (crude IR = 3.04 per 1000 person-years).

In chronic inflammatory diseases, comorbid depression is common. Among RA patients, prevalence of depression ranges from 14–48% [41,42,43] and is associated with higher levels of pain and disability, lower health-related quality of life, and increased mortality [44]. Evidence suggests a role of pro-inflammatory cytokines including interleukin (IL)-6, TNF-alpha, and IL-1 in promoting depression, and indeed, patients with severe RA have a higher risk of depression [45,46,47,48,49]. Specifically, TNF-alpha has been shown to be associated with depression and reducing its levels may actually reverse symptoms. In a randomized clinical trial investigating the effect of etanercept on fatigue and depression in patients with plaque psoriasis, improvements in depression were noted [50]. Smaller studies in patients with RA and inflammatory bowel disease confirm that therapy with TNFi could have a beneficial role in the treatment of depressive symptoms in patients with inflammatory conditions [51,52,53].

5 Conclusion

In summary, this study involving active surveillance of prespecified outcomes identified no specific safety concerns in patients with rheumatic disease treated with golimumab versus a NBS medication. The results are consistent with golimumab’s overall safety profile and generally comparable with observations from other studies in patients with treated rheumatic disease.