FormalPara Key Points

Previous studies have reported an increased risk of stroke in patients taking antipsychotics. However, most of these studies have been conducted in the elderly population.

We estimated and compared the risk of stroke among new users of first-generation antipsychotics and haloperidol with new users of second-generation antipsychotics in non-elderly patients (aged ≤ 64 years) without a recent dementia diagnosis and, separately, regardless of a recent dementia diagnosis. We observed that stroke risk was elevated among non-elderly patients without a recent dementia diagnosis taking haloperidol compared with patients using second-generation antipsychotics.

1 Introduction

Antipsychotics, first generation and second generation, are approved for the treatment of schizophrenia, schizoaffective disorder, bipolar disorder, hyperactivity, severe behavioral problems, agitation, psychotic disorders, Tourette syndrome, generalized nonpsychotic anxiety, major depressive disorder (adjunctive treatment), and autism-associated irritability [1]. In the USA, the number of prescriptions for psychiatric conditions has increased substantially among adults [3,4,5], with a considerable proportion of these prescriptions being for off-label indications [6]. There is substantial evidence that patients who use antipsychotics are at increased risk of stroke [7,8,9,10], as well as other life-threatening or fatal medical events such as pneumonia, hip fracture, thromboembolism, sudden cardiac death, and myocardial infarction [11]. However, it is difficult to adequately address the potentially relevant confounders and compare the stroke risks associated with different antipsychotics or classes of antipsychotics because users of various antipsychotics may differ in health habits, economic status, comorbid conditions, and concurrent medications. Indeed, some studies suggest a greater risk of stroke among users of first-generation antipsychotics as compared with second-generation antipsychotics [10, 12]. Shin et al. reported that among elderly patients, haloperidol, a first-generation antipsychotic, was associated with a greater risk of stroke as compared with risperidone, a second-generation antipsychotic [13]. However, other studies have suggested a similar risk between first-generation and second-generation antipsychotics [7, 12, 14] or a greater risk with second-generation antipsychotics [15, 16].

Though strokes are more common among elderly individuals than among the young individuals [17], more than half of antipsychotic users are < 65 years old [18, 19] and though many studies evaluating antipsychotics and the risk of stroke have been conducted in the elderly population [13, 20,21,22], relatively few have been conducted in younger populations. Mundet-Tuduri et al. [8] reported a higher prevalence of stroke in adult patients using antipsychotics compared with non-users (odds ratio 2.33). Another recent retrospective study conducted using the Sentinel database in the non-elderly population without dementia showed a hazard ratio (HR) for stroke of 1.75, 95% confidence interval (CI 1.17–2.63) among users of first-generation antipsychotics relative to second-generation antipsychotics and 1.80 (0.93–3.48) among users of haloperidol relative to second-generation antipsychotics, before propensity score (PS) matching. After PS matching, the HRs were 0.87 (0.54–1.41) and 1.31 (0.53–3.21), respectively [12]. We have described the risk of stroke among new users of first-generation antipsychotics and haloperidol vs new users of second-generation antipsychotics among elderly patients, regardless of dementia diagnosis, in a separate report [23].

The present study was conducted to: (1) replicate the findings of the Sentinel study [12] using a different database and PS matching mimicking that of the Sentinel study and (2) to extend those findings by matching on a more comprehensive adapted (large-scale) PS. We estimated and compared the risk of stroke among new users of first-generation antipsychotics and haloperidol vs new users of second-generation antipsychotics in non-elderly patients without a recent dementia diagnosis and, separately, regardless of a recent dementia diagnosis. We stratified on dementia to mimic the approach adopted by the Sentinel study, and because the label of haloperidol, which differentiates between elderly and non-elderly populations, has a black-box warning for dementia [24].

2 Methods

2.1 Data Sources

Data for this study were obtained from the IBM MarketScan® Commercial Database (CCAE), a large, nationally representative, US health claims database of patients < 65 years of age that includes data for active employees, early retirees, Consolidated Omnibus Budget Reconciliation Act continuers, and their dependents who are insured by employer-sponsored plans. The database includes information on demographic characteristics; outpatients’ prescription drugs, diagnoses, and procedures; and hospitalizations including discharge diagnoses for 142 million beneficiaries between January 2000 and August 2018. The use of CCAE was reviewed by the New England Institutional Review Board and determined to be exempt from broad institutional review board approval, as this research project did not involve human patient research.

2.2 Study Design and Study Population

This retrospective comparative cohort study included data available from 1 January, 2001 through 31 December, 2017 with the earliest cohort entry starting 1 July, 2001. Prior data from 2001 contributed information on the conditions required for cohort inclusion. Thus, the start and end dates were similar to those of the Sentinel study (1 January, 2001–30 September, 2015).

The original CCAE database has been converted to the observational medical outcomes partnership Common Data Model [25, 26] to allow for consistent analyses using standardized analytic and statistical software [27,28,29]. This model standardizes the structure and content of data by mapping coding systems such as the International Classification of Diseases, Ninth Revision, Clinical Modification (ICD-9-CM) and the International Classification of Diseases, Tenth Revision, Clinical Modification (ICD-10-CM) to Systematized Nomenclature of Medicine - Clinical Terms (SNOMED-CT) concepts by leveraging relationships that have been curated by the Unified Medical Language System [30]. The SNOMED-CT defines cohorts by walking up and down the comprehensive hierarchy. The relationship between SNOMED-CT and ICD-9-CM (and ICD-10-CM) is used to create the definition included in each code. In the present study, we were limited to the ICD-9-CM codes used in the Sentinel study by Taylor et al. [12] to define stroke. Furthermore, we used a similar approach and cross-walked from ICD-9-CM to SNOMED-CT. The relationships between SNOMED-CT and ICD-10-CM were examined. As we did not have a reference from Taylor et al. [12] for the ICD-10-CM codes to be used, we relied on the mappings provided by the Unified Medical Language System. These ICD-10-CM codes were used for the incidence comparison (Electronic Supplementary Material [ESM]). As these figures indicated a change in incidence estimates after 30 September, 2015, we conducted a post-hoc analysis that was limited to the data through 30 September, 2015.

New users of first-generation antipsychotics, haloperidol, or second-generation antipsychotics aged 18–64 years were included in the study cohort when they first met all the following conditions: received a dispensing of a first-generation or second-generation antipsychotic, had at least 183 days of continuous enrollment before that dispensing, had no diagnosis of stroke or cancer during the 183 days up to and including the dispensing date, and during the 183 days before that dispensing had received no dispensing of a first-generation or second-generation antipsychotic. The dispensing date was the patient’s index date. Patients who were exposed to both first-generation and second-generation antipsychotics on their index date were excluded.

We focused on four new user target cohorts (FGAw/oD: first-generation antipsychotics without a recent dementia diagnosis, i.e., without a dementia diagnosis in the 183 days prior to entering the cohort, HALw/oD: haloperidol without a recent dementia diagnosis, FGA: first-generation antipsychotics with or without a recent dementia diagnosis, and HAL: haloperidol with or without a recent dementia diagnosis) and two new user comparator cohorts (SGAw/oD: second-generation antipsychotics without a recent dementia diagnosis and SGA: second-generation antipsychotics with or without a recent dementia diagnosis). We compared the stroke risk in FGAw/oD and HALw/oD cohorts to the stroke risk in the SGAw/oD cohort and the stroke risk in FGA and HAL cohorts to the stroke risk in the SGA cohort. For each pairwise comparison, patients contributed “as-treated” time at risk in the cohort for which they first qualified from the day after they first entered that cohort until the first occurrence among: receiving a medication associated with the other cohort, having the study outcome (see below), having a gap of > 30 days in the supply of the cohort-defining drug (> 30 days from the end of the days’ supply to the next dispensing), reaching the end of insurance enrollment, or reaching the end of the study period.

As in the Sentinel study [12], and using the same code list for stroke as did that study, stroke as the study outcome was defined by a diagnosis code for stroke as the primary condition in the insurance claim associated with an inpatient stay, and stroke as a covariate was defined as an inpatient or outpatient diagnosis of stroke using the same code list as above. The code lists used for identifying first-generation antipsychotics, haloperidol, second-generation antipsychotics, and stroke are available in the protocol, which is pre-registered at https://clinicaltrials.gov/ct2/show/NCT04002700 and is publicly available in greater detail at https://github.com/ohdsi-studies/StrokeRiskInApUsers/tree/master/documents.

Reimbursement coding requirements in the USA changed from ICD-9-CM to ICD-10-CM on 1 October, 2015 (ESM). We observed a decrease in the yearly incidence of stroke after 2015 associated with the change in coding from ICD-9-CM to ICD-10-CM and indicating that the ICD-9-CM codes used for stroke correspond to ICD-10-CM codes that identify a clinically different patient population. Because this change appeared likely to affect the incidence of stroke in the study population, we conducted a post-hoc analysis where the study end date was redefined as 30 September, 2015. That end date for the post-hoc analysis was selected to reflect the end date of the Sentinel study [12], whose end date coincided with the switch of many US healthcare databases from ICD-9-CM to ICD-10-CM.

2.3 Statistical Analysis

The crude incidence rate (IR) and incidence proportion (IP) of stroke were estimated within each exposure cohort and pre-defined sex subgroup. Incidence rate was calculated as the number of patients with the outcome during the time-at-risk window divided by the total time-at-risk in years and was reported as IR/1000 person-years. Incidence proportion was calculated as the number of patients with the outcome during the time-at-risk window divided by the total number of patients with time-at-risk and was reported as IP/1000 patients. Population-level effect estimation analyses of new users were performed using a comparative cohort design including four pairwise comparisons: FGAw/oD vs SGAw/oD, HALw/oD vs SGAw/oD, FGA vs SGA, and HAL vs SGA.

Propensity score matching was used to reduce potential confounding because of an imbalance in baseline covariates between the target and comparator cohorts [31]. The PS adjustments are explained in detail in the ESM. A covariate balance was summarized by computing before and after PS matching, the mean value, and the associated standardized mean difference for each baseline covariate.

The HR for each outcome during the time at risk was estimated by applying a Cox proportional hazards model conditioned on the PS-matched sets. For each outcome model, the uncalibrated and empirically calibrated HR, 95% CI, and p-value [34] were reported. The method used for empirical calibration is reported in the ESM. For both cohorts being compared, patients were required to have at least 1 day of continuous observation after the time-at-risk start. The time-to-event of the outcome was determined by calculating the number of days from start of the time-at-risk window to the date of the outcome event. Although many hypotheses were tested, no correction for multiple testing was made because doing so may reduce the sensitivity for detecting potential adverse events [35].

3 Results

Results of the present study are publicly available through the interactive online tool at https://data.ohdsi.org/StrokeRiskInNonElderlyApUsers/.

3.1 Baseline Characteristics

Patients initially identified for inclusion in each cohort were: FGAw/oD cohort: 50,864, HALw/oD cohort: 12,472, FGA cohort: 51,473; HAL cohort: 12,898, SGAw/oD cohort: 926,152, and SGA cohort: 932,845. The slightly lower counts after limiting the cohorts to patients with at least 1 day at risk after exposure are shown in Table 1. Differences between the cohorts exposed to first-generation antipsychotics and the cohorts exposed to second-generation antipsychotics before and after matching are shown in Table 1, ESM, and Figs. 1, 2, 3 and 4. These figures demonstrate the greatly improved covariate balance after matching and indicate that the adapted PS matching strategy addressed the covariate imbalance to a greater extent than the Sentinel PS matching strategy.

Table 1 Distribution of selected baseline characteristics after adapted propensity score matching across the study cohorts
Fig. 1
figure 1

Covariate balance before and after propensity score (PS) matching for new users of first-generation antipsychotics without recent dementia diagnosis vs new users of second-generation antipsychotics without recent dementia diagnosis cohorts. A 1:1 PS matching of selected covariates (pre-planned analyses); B 1:10 PS matching of full covariates (pre-planned analyses); C 1:1 PS matching of selected covariates (post-hoc analyses); D 1:10 PS matching of full covariates (post-hoc analyses). Each dot represents the standardized difference of means for a single covariate before and after PS adjustment. In panels (A, C), the blue data points represent the before and after matching standardized difference in means for variables that were included in the Sentinel PS model and the red data points represent covariates unadjusted for. In panels (B, D), the blue dots represent the before and after matching standardized difference in means for all observed covariates

Fig. 2
figure 2

Covariate balance before and after propensity score (PS) matching for new users of haloperidol without recent dementia diagnosis vs new users of second-generation antipsychotics without recent dementia diagnosis cohorts. A 1:1 PS matching of selected covariates (pre-planned analyses); B 1:10 PS matching of full covariates (pre-planned analyses); C 1:1 PS matching of selected covariates (post-hoc analyses); D 1:10 PS matching of full covariates (post-hoc analyses). Each dot represents the standardized difference of means for a single covariate before and after PS adjustment. In panels (A, C), the blue data points represent the before and after matching standardized difference in means for variables that were included in the Sentinel PS model and the red data points represent covariates unadjusted for. In panels (B, D), the blue dots represent the before and after matching standardized difference in means for all observed covariates

Fig. 3
figure 3

Covariate balance before and after propensity score (PS) matching for new users of first-generation antipsychotics regardless of dementia diagnosis vs new users of second-generation antipsychotics regardless of dementia diagnosis cohorts. A 1:1 PS matching of selected covariates (pre-planned analyses); B 1:10 PS matching of full covariates (pre-planned analyses); C 1:1 PS matching of selected covariates (post-hoc analyses); D 1:10 PS matching of full covariates (post-hoc analyses). Each dot represents the standardized difference of means for a single covariate before and after PS adjustment. In panels (A, C), the blue data points represent the before and after matching standardized difference in means for variables that were included in the Sentinel PS model and the red data points represent covariates unadjusted for. In panels (B, D), the blue dots represent the before and after matching standardized difference in means for all observed covariates

Fig. 4
figure 4

Covariate balance before and after propensity score matching for new users of haloperidol regardless of dementia diagnosis vs new users of second-generation antipsychotics regardless of dementia diagnosis cohorts. A 1:1 propensity score (PS) matching of selected covariates (pre-planned analyses); B 1:10 PS matching of full covariates (pre-planned analyses); C 1:1 PS matching of selected covariates (post-hoc analyses); D 1:10 PS matching of full covariates (post-hoc analyses). Each dot represents the standardized difference of means for a single covariate before and after PS adjustment. In panels (A, C), the blue data points represent the before and after matching standardized difference in means for variables that were included in the Sentinel PS model and the red data points represent covariates unadjusted for. In panels (B, D), the blue dots represent the before and after matching standardized difference in means for all observed covariates

3.2 Crude First Post-Index Stroke Incidence

The crude IR for stroke per 1000 person-years was 3.10 for patients of the FGAw/oD cohort, 5.99 for the HALw/oD cohort, 3.14 for FGA cohort, 6.11 for HAL cohort, 0.85 for the SGAw/oD cohort, and 0.90 for the SGA cohort during the pre-planned analyses. These IRs were similar in the post-hoc analyses (Table 2).

Table 2 Crude first post-index event IR and IP within the exposure cohorts

3.3 PS-Adjusted HR for First Post-Index Stroke

Table 3 reports HRs for the study comparisons and Figs. 1, 2, 3 and 4 report covariate balance diagnostics that correspond with each HR. A HR that corresponds with a covariate balance diagnostic where the after-matching standardized mean differences are low is less likely to be affected by observed confounding than where after-matching standardized mean differences are high. The HR for stroke for patients of the FGAw/oD vs SGAw/oD cohort in the pre-planned analysis was 2.75 (95% CI 1.27–6.59) with the Sentinel PS matching strategy (Fig. 1A) and the cHR was 2.05 (95% CI 1.13–3.89) with the adapted PS matching strategy (Fig. 1B). The HR of stroke in patients of the HALw/oD vs SGAw/oD cohort was 1.80 (0.62–5.86) with Sentinel PS matching (Fig. 2A) and the cHR was 2.47 (1.14–5.48) with the adapted PS matching strategy (Fig. 2B). The HR for stroke in patients of the FGA vs SGA cohort was 2.18 (1.09–4.64) with the Sentinel PS matching strategy (Fig. 3A) and the cHR was 1.64 (0.94–2.97) with the adapted PS matching strategy (Fig. 3B). The HR for stroke in patients of the HAL vs SGA cohort was 2.20 (0.81–6.98) with the Sentinel PS matching strategy (Fig. 4A) and 1.98 (0.99–4.00) with the adapted PS matching strategy (Fig. 4B). Results of the post-hoc analysis are also presented in Table 3.

Table 3 PS matched risk of stroke in non-elderly patients

4 Discussion

In the present study, among patients without a recent dementia diagnosis (the population analyzed in the Sentinel study), the crude (unadjusted) HR for stroke for first-generation vs second-generation antipsychotics and for haloperidol vs second-generation antipsychotics had point estimates >1 and CIs that excluded 1 in both the pre-planned and post-hoc analyses. The corresponding estimates from the Sentinel study were 1.75 (1.17–2.63) and 1.80 (0.93–3.48), respectively [12]. The PS-matched HR estimates for stroke with first-generation antipsychotics and with haloperidol relative to second-generation antipsychotics in the Sentinel study were 0.87 (0.54–1.41) and 1.31 (0.54–3.21), respectively [12]. For first-generation antipsychotics vs second-generation antipsychotics, the HR in the present study (matched on the PS that mimicked the PS used in the Sentinel study) overlapped the upper end of in the CI from the Sentinel study in the pre-planned analysis but not in the post-hoc analysis. For haloperidol vs second-generation antipsychotics, the lower ends of the CIs for the HR (matched on the PS that mimicked the PS used in the Sentinel study) overlapped the upper end of the CI from the Sentinel study estimate in both the pre-planned and post-hoc analyses. Thus, the findings of the present study partially replicated the findings of the Sentinel study.

Replication of a previously conducted study provides an opportunity to increase confidence in established hypotheses, theories, or claims [36]. Replication is often subject to several challenges including, but not limited to, variation in study populations (different database, inadequate reporting of patient selection criteria, or different geographical location); presence of other confounding factors; and the difference in calendar time between the original study and the replication study can directly influence study outcomes [37, 38]. As noted above, the adapted PS strategy included more variables than the Sentinel PS strategy that mimicked the PS used in the Sentinel study and provided better matching on baseline covariates. Therefore, in summarizing the results of the present study, we focus on the HRs that were adjusted using the adapted PS strategy, and the associated calibrated CIs (cCIs).

For patients without a recent dementia diagnosis, after matching on the adapted PS strategy, the estimated HR point estimate for stroke risk among new users of first-generation antipsychotics compared to new users of second-generation antipsychotics was >1 and the 95% cCIs excluded 1 in the pre-planned analysis, and the point estimate was >1 but the cCI included 1 in the post-hoc analysis. Among new users of haloperidol vs second-generation antipsychotics, the point estimates were >1 and the 95% cCIs excluded 1 in both the pre-planned and post-hoc analyses. Thus, in this population, the present study found that the stroke risk associated with haloperidol exposure is higher than the stroke risk associated with second-generation antipsychotic exposure; however, the findings for first-generation vs second-generation antipsychotics are less conclusive.

Among all patients with or without a recent dementia diagnosis, after matching on the adapted PS matching strategy in both the pre-planned and post-hoc analyses, the HRs for stroke with first-generation vs second-generation antipsychotics and for haloperidol vs second-generation antipsychotics had point estimates >1 but all the cCIs included 1 except the cCI for haloperidol vs second-generation antipsychotics in the post-hoc analyses. Thus, in this broader population, the present study does not find an increased stroke risk for either first-generation vs second-generation antipsychotics or haloperidol vs second-generation antipsychotics.

Our study had several strengths including a large sample size, inclusion of patients from many regions of the USA, and the use of two different PS matching approaches. Among its limitations were a lack of information on socioeconomic status, health-related behaviors (including smoking and alcohol use and abuse), and body mass index. Because the database describes US residents insured through employment and their dependents, the study’s finding may not generalize to other populations. In addition, because complementary drug sample usage is not captured in claims databases, for certain patients, exposure to the drug might have been classified as non-exposure or an ongoing antipsychotic medication use might have been classified as new drug use.

5 Conclusions

The present study found that among adults aged ≤64 years, who did not have a recent diagnosis of dementia, the risk of stroke is higher among those exposed to haloperidol than those exposed to second-generation antipsychotics.