FormalPara Key Points

Using real-world data, the incidence of severe uterine bleeding was low but varied by antithrombotic therapy and occurred more often in patients with venous thromboembolism vs those with atrial fibrillation.

Compared with warfarin, apixaban, and dabigatran, rivaroxaban was associated with a moderately increased risk of severe uterine bleeding in patients with atrial fibrillation that was sensitive to the time-at-risk period analyzed and a more consistently increased risk in patients with venous thromboembolism regardless of the time-at-risk period analyzed.

Careful evaluation of the benefits and risks of antithrombotic therapy in women requires consideration of factors including age, comorbidities, treatment duration, and indication for use.

1 Introduction

Direct oral anticoagulants (DOACs), including rivaroxaban, apixaban, and dabigatran, are approved for use in patients with non-valvular atrial fibrillation (AF) to prevent stroke and systemic embolism, in patients with venous thromboembolism (VTE) for the treatment and prevention of recurrent events, and in patients undergoing total knee or hip replacement surgery as prophylaxis against VTE [1,2,3]. Direct oral anticoagulants do not require routine laboratory monitoring and have fewer drug–drug and drug–food interactions, making them reliable and convenient alternatives to vitamin K antagonists, such as warfarin. Because of their biologic mechanism of action, all anticoagulants are associated with an increased risk of bleeding. In the ROCKET-AF trial of patients with non-valvular AF, major bleeding rates were similar between rivaroxaban and warfarin, but an increased risk of major gastrointestinal bleeding was observed in rivaroxaban-treated patients compared with warfarin-treated patients [2.0 vs 1.2 events/100 patient-years (PY), respectively; hazard ratio (HR) 1.66; 95% confidence interval (CI) 1.34–2.05] [4, 5].

Women of reproductive age have a higher risk of VTE compared with men of the same age, and this difference is attributed, in part, to the use of hormonal contraceptives and pregnancy [6]. Abnormal uterine bleeding events have been reported in clinical trials of DOACs for the treatment or prevention of VTE, and post hoc analyses of these trials have provided inconsistent results on the risk of abnormal uterine or vaginal bleeding among DOACs [7,8,9,10]. Two of these trials evaluated uterine bleeding with DOACs using the International Federation of Gynecology and Obstetrics criteria [8, 10], which defined abnormal uterine bleeding as prolonged menstrual bleeding; intermenstrual bleeding; heavy menstrual bleeding; or menstrual blood loss causing anemia or requiring an unscheduled contact with a physician, a medical or surgical intervention, or adaptation of anticoagulant therapy [11]. In contrast, abnormal uterine bleeding events have not been commonly reported in other populations, such as those with AF. One possible reason for the scarcity of information on uterine bleeding in clinical trials is that clinically relevant bleeding based on the International Society on Thrombosis and Haemostasis definition does not include the uterus as a critical organ [12]. Thus, the post hoc analyses of clinical trials are limited by the highly selected populations included in the trials and by differences in analysis design. Evidence from routine clinical practice across larger, more diverse populations, including different anticoagulant indications is lacking.

Therefore, the present, voluntary, sponsor-initiated study aimed to evaluate the risk of severe uterine bleeding, defined as vaginal bleeding requiring same-day transfusion or surgical management within 60 days, in relation to rivaroxaban, apixaban, dabigatran, and warfarin therapies. We conducted a set of retrospective comparative cohort analyses to assess whether individual DOACs (rivaroxaban, apixaban, and dabigatran) were associated with a risk of severe uterine bleeding compared with warfarin and with each other among women with prior diagnoses of the aforementioned clinical indications (AF and VTE, including deep vein thrombosis and pulmonary embolism) separately.

2 Methods

2.1 Study Design

This retrospective, observational, comparative, new-user cohort study evaluated data from adult women who were newly exposed to DOACs or warfarin in a population of insured patients in the USA (Fig. 1). The study was pre-registered at ClinicalTrials.gov (NCT04394234), and the complete specification of all analyses is available in the protocol at https://github.com/ohdsi-studies/DoacsWarfarinSub/tree/master/Protocol.

Fig. 1
figure 1

Comparative cohort study design. AF atrial fibrillation, CCI Charlson Comorbidity Index, CHA2DS2-VASc stroke risk index, PH proportional hazards

The overall study period was from 19 October 2010 (coinciding with initial approval of dabigatran) to 31 December 2018. In any pairwise comparison, the study period was restricted to calendar time when both exposures (drugs) were observed in each data source.

2.2 Data Sources

This distributed database network study was executed using patient-level data from four US de-identified, administrative claims databases. All four databases have been standardized into the Observational Medical Outcomes Partnership Common Data Model (https://ohdsi.github.io/CommonDataModel/background.html) [13]. The databases included: (1) IBM MarketScan® Commercial Database (CCAE), a medical and drug insurance claims database of active employees, early retirees, Consolidated Omnibus Budget Reconciliation Act continuers, and their dependents insured by employer-sponsored plans; (2) IBM MarketScan® Multi-state Medicaid (MDCD), an administrative claims database containing the pooled healthcare experience of Medicaid enrollees from multiple states; (3) IBM MarketScan® Medicare Supplemental Beneficiaries (MDCR), an administrative health claims database for Medicare-eligible active and retired employees and their Medicare-eligible dependents from employer-sponsored supplemental plans; and (4) Optum’s Clinformatics® Data Mart Database (Optum), an administrative health claims database for members who are fully insured in commercial plans or in administrative services only and commercial Medicare. Analyses of de-identified publicly available data do not constitute human subjects research and, as such, do not require institutional review board review/approval.

All analyses were performed independently within each of these four databases and no patient-level data were pooled across the databases for any analysis. Instead, meta-analysis estimates were computed based on per-database aggregate statistics as described below.

2.3 Study Population

Cohorts of mutually exclusive, adult (aged ≥ 18 years) female new users of rivaroxaban, apixaban, dabigatran, and warfarin with ≥ 183 days of prior continuous database observation were identified. The date of new anticoagulant dispensing was the index date. Patients were excluded for edoxaban exposure; exposure to other drugs of interest (e.g., DOAC users were excluded for the warfarin exposure cohort); hysterectomy; vaginal bleed; and medical, surgical, or transfusion management for vaginal bleeding any time prior to or on the index date. From these base exposure populations, additional inclusion criteria were added to construct new user cohorts with the following prior indications with consideration of the number of days prior to and including the index date: (1) AF population: AF diagnosis any time prior and no VTE in the past 183 days and no total knee or hip replacement surgery in the past 35 days; and (2) VTE population: VTE in the past 183 days, no AF diagnosis any time prior, and no total knee or hip replacement surgery in the past 35 days. All inclusion criteria are based on information referenced to the index date, such that “any time prior” means all observed time before and including the index date. In total, 12 new user cohorts were constructed for the three indications across four treatments. New user cohort definition details are available in the protocol Section 7.3 including code sets in Annex A and an example in Annex B (https://github.com/ohdsi-studies/DoacsWarfarinSub/tree/master/Protocol/Tables_Annex).

2.4 Outcome Assessment

Patients with severe uterine bleeding were identified by having vaginal bleeding with a blood transfusion on the same day or surgical management on or within the subsequent 60 days, based on diagnostic and procedure codes recorded. Outcome ascertainment details are available in the protocol Section 7.4 and codes for the outcome definitions are available in Annex A (https://github.com/ohdsi-studies/DoacsWarfarinSub/tree/master/Protocol/Tables_Annex). Cases were identified based on patients’ records provided by the source data and no additional case adjudication was performed.

Time-at-risk is the interval relative to the index date during which the outcome can be ascertained. We defined time-at-risk three ways in the study. The primary on-treatment time-at-risk period was defined as the time from 1 day after the anticoagulant index date to the end of inferred persistent exposure, the last day of observation, or the end of the study period, whichever came first. For DOACs, the final exposure record allowed for no more than a 3-day gap between successive exposure intervals (inferred by days’ supply) plus 5 days appended to the last exposure date. This definition was intended to reflect the short half-life (approximately 12 h) of DOACs. For warfarin, the final exposure record allowed for no more than a 7-day gap between successive exposure intervals (inferred by days’ supply) plus 5 days appended to the last exposure date. This definition was intended to reflect the longer half-life (20–60 h) of warfarin.

Two sensitivity analyses were conducted using variations of time-at-risk periods to provide an increased time to observe severe uterine bleeding events. In the first sensitivity analysis, the on-treatment time-at-risk period was similarly defined as in the primary analysis, but the final exposure record allowed for no more than a 30-day gap between successive exposure intervals (inferred by days’ supply) with no days appended to the last exposure date for DOACs and warfarin. This definition was intended to reflect the dispensing behavior in routine clinical practice. The second sensitivity analysis was an intent-to-treat (ITT) analysis, defined as the time from 1 day after the anticoagulant index date until the last day of observation or the end of the study period, whichever came first, regardless of the subsequent drug dispensing records. All time-at-risk period definitions required that patients have at least 1 day of time-at-risk after the index date.

2.5 Statistical Analysis

The unadjusted (crude) incidence rates (IRs) of severe uterine bleeding within each exposure cohort were reported and calculated as the number of persons with the outcome during each time-at-risk period, divided by the total time-at-risk (IR/1000 PY). Incidence proportion, defined as the number of persons with an outcome during the time-at-risk period divided by the total number of persons at risk, was also calculated. The number of severe uterine bleeding events observed during each time-at-risk period, total time-at-risk in years, and the IR and incidence proportion in the exposure cohorts for DOACs and warfarin were reported for each indication population in each database. The crude IRs do not reflect any statistical adjustment and are not intended for comparative purposes.

For population-level effect estimation, all pairwise comparisons of DOACs (rivaroxaban, apixaban, dabigatran) and warfarin were assessed nested within each of the two indication groups (AF or VTE). Propensity scores (PSs) were used to reduce potential confounding due to an imbalance of observed patient characteristics at baseline and were calculated for each patient using the predicted probability from a regularized logistic regression model, fit with a Laplace prior (LASSO) and the regularization hyperparameter selected by optimizing the likelihood in a tenfold cross validation [14,15,16]. The primary PS strategy matched the target to comparator patients in a 1:1 ratio and a sensitivity PS strategy variably matched the target to comparator patients in a maximum of a 1:100 ratio. Both approaches used a greedy matching algorithm with a caliper of 0.2 of the standard deviation on the logit scale of the PS distribution. Covariates used as input to the PS model included all available baseline covariates on demographics; prior conditions (including possible confounders such as cardiovascular comorbidities, polyp of cervix, endometriosis, other cancers, and pregnancy); medication use (including hormonal therapy drugs); procedures; and the risk scores, including the Charlson Comorbidity Index and the stroke risk index (CHA2DS2-VASc), during the 183 days prior to or on the index date. Propensity score matching was evaluated based on a covariate balance before and after PS matching and considered successful when all standardized differences in means were < 0.1 [17]. Study diagnostics (PS distribution, covariate balance, and empirical null distribution) for the primary analysis of each pairwise comparison for each database in AF and VTE populations are reported in the Electronic Supplementary Material (ESM). Because of limited dabigatran exposures in VTE, no formal comparisons were performed.

Cox proportional hazards regression models were used to model the time to the first outcome occurrence for the target relative to the comparator cohort in pairwise comparisons while accounting for the baseline covariate imbalance using the two PS matching strategies (1:1, unconditional; and 1:100, conditional) [18, 19]. Empirical calibration using 127 negative control outcomes (ESM) was performed to account for residual error unaddressed by PS matching [20, 21]. In addition to negative control outcomes, we also generated and included synthetic positive control outcomes. Positive control outcomes were developed on the basis of the negative controls, but where the effect size was artificially increased to a predefined effect size by injection of additional simulated outcomes [21]. For each indication, estimates of severe uterine bleeding risk were reported as the empirically calibrated hazard ratio (cHR) with associated 95% calibrated CIs and p values. For each indication-target-comparator-analysis combination, heterogeneity of the HRs across the four databases was estimated, using I2 as a metric [22]. When there was sufficient homogeneity across sources (I2 < 40%) [23], database-specific estimates were pooled through a random effect meta-analysis using the Hartung–Knapp–Sidik–Jonkman inverse-variance method [24]. Pooled results including p values corrected for multiple testing using Hochberg’s step-up procedure are provided. We conducted this study using open-source software developed by the OHDSI community, CohortMethod (https://ohdsi.github.io/CohortMethod/) and Cyclops (https://ohdsi.github.io/Cyclops/) [25].

3 Results

The full result set with associated study diagnostics is publicly available in an interactive web application at https://data.ohdsi.org/DoacsWarfarinSub/. Detailed results can be accessed by clicking on individual rows listed for each database under the “Explore Results” tab. This section summarizes the key findings.

A total of 363,919 women newly exposed to a DOAC or warfarin with a prior diagnosis of AF (60.8%) or VTE (39.2%) were included in the study, with the largest proportion from Optum (43.0%) and followed by MDCR (25.3%), CCAE (24.6%), and MDCD (7.2%) [ESM]. Treatment duration varied across indications and databases. For patients in the MDCD database, treatment duration was generally shorter than that for patients in other databases and sample size was also limited. For patients in other databases and with AF, the median treatment duration for the primary analysis was 1.9–2.6 months for DOACs and 2.1–2.9 months for warfarin. Corresponding treatment durations for patients with VTE were 1.4–2.4 months and 2.2–2.4 months, respectively. As expected, the average (and median) treatment durations based on the on-treatment and ITT sensitivity analyses were longer than that for the primary on-treatment approach.

3.1 AF

3.1.1 Study Participants

Among patients with AF, median age varied slightly by exposure cohort, with new users of warfarin and apixaban being older (aged 60–80 years) than new users of dabigatran and rivaroxaban (aged 60–78 years) [ESM]. Table 1 illustrates the baseline characteristics of the AF study population by treatment exposure group for the pairwise comparison of rivaroxaban vs warfarin in the Optum database; data for the other five treatment comparisons are included in the ESM. Most patients in the four treatment cohorts had underlying heart disease, hypertension, and hyperlipidemia, and diabetes mellitus and osteoarthritis were also common. Consistent with these conditions, common baseline medications included beta-blockers, renin-angiotensin inhibitors, lipid-modifying agents, and diuretics. The online application is searchable for the baseline prevalence of any covariate by navigating to the “Population Characteristics” tab and selecting “Raw” view and searching for a covariate of interest.

Table 1 Selected baseline characteristics. For the full set of baseline characteristics, visit the online application: https://data.ohdsi.org/DoacsWarfarinSub/. The online application is searchable for the baseline prevalence of any covariate by navigating to the “Population Characteristics” tab and selecting the “Raw” view and searching for a covariate of interest before and after 1:1 propensity score matching for the comparison of rivaroxaban vs warfarin in the atrial fibrillation population from the Optum database

3.1.2 Incidence Rates of Severe Uterine Bleeding

In the AF population, the unadjusted IRs during the primary on-treatment time-at-risk period ranged from 1.9 events/1000 PYs among warfarin new users in Optum to 10.0 events/1000 PYs among rivaroxaban new users in CCAE (Table 2). During sensitivity time-at-risk periods, IRs ranged from 1.6 to 12.1 events/1000 PYs when on-treatment was defined with 30-day persistence gaps and from 1.8 to 10.0 events/1000 PYs over the ITT follow-up. Across all AF cohorts, the unadjusted IRs were generally higher in the CCAE database and among rivaroxaban new users.

Table 2 Unadjusted incidence rates of severe uterine bleeding in the atrial fibrillation population by treatment exposure group during the primary and sensitivity observation periods

3.1.3 Comparative Results

For the AF population after PS matching, patient counts, event counts, calibrated HRs, and associated 95% CIs for database-specific and meta-analytic estimates are reported for each pairwise comparison for the primary and sensitivity analyses in Fig. 2. Compared to warfarin, rivaroxaban was associated with cHRs (95% CI) from 0.83 (0.27–2.48) to 2.84 (1.32–6.23) in the primary analysis; a pooled estimate was not reported for this comparison because of observed heterogeneity across the databases. During longer time-at-risk periods, pooled cHRs (95% CI) were 1.74 (1.20–2.50) and 1.55 (1.16–2.07) during the sensitivity on-treatment and ITT periods, respectively (Fig. 2a). Among new users of apixaban compared to warfarin, a suggestive increased risk of severe uterine bleeding was observed with a cHR (95% CI) of 1.69 (0.91–3.09) during the primary analysis. However, in the sensitivity 30-day on-treatment [1.11 (0.88–1.40)] and ITT [1.07 (0.75–1.52)] analyses, the estimates were attenuated towards the null (Fig. 2b). For new users of dabigatran compared to warfarin, no difference in the risk of severe uterine bleeding was observed, and uncertainty decreased with increased time-at-risk (Fig. 2c). A non-significant increased risk of severe uterine bleeding among rivaroxaban new users compared to apixaban new users was observed in the primary analysis with a pooled cHR (95% CI) of 1.45 (0.91–2.28). Increased time-at-risk decreased the variance of the estimate to reach statistical significance, with a pooled cHR (95% CI) of 1.47 (1.20–1.80) during the sensitivity on-treatment period and 1.22 (0.99–1.55) during the ITT period (Fig. 2d). New use of rivaroxaban compared with dabigatran was associated with a cHR (95% CI) of 2.12 (1.01–4.43) in the primary analysis, 1.62 (0.94–2.72) during the sensitivity on-treatment period, and 1.40 (1.07–1.79) during the ITT period (Fig. 2e). The risk of severe uterine bleeding in apixaban compared to dabigatran new users was highly uncertain during primary and sensitivity on-treatment periods and suggestive of no risk difference during the ITT period (Fig. 2f).

Fig. 2
figure 2figure 2

Calibrated hazard ratios and calibrated 95% confidence intervals (CIs) for each pairwise comparison during primary and sensitivity observation periods in the atrial fibrillation (AF) population after 1:1 propensity score matching: (a) rivaroxaban vs warfarin; (b) apixaban vs warfarin; (c) dabigatran vs warfarin; (d) rivaroxaban vs apixaban; (e) rivaroxaban vs dabigatran; and (f) apixaban vs dabigatran. Estimates are reported for pairwise comparisons in databases where study diagnostic passed. Summary meta-analytic estimates are reported where I2 < 40%. CCAE IBM MarketScan® Commercial Database, cHR calibrated hazard ratio, ITT intent-to-treat, MDCD IBM MarketScan® Multi-state Medicaid, MDCR IBM MarketScan® Medicare Supplemental Beneficiaries

3.2 VTE

3.2.1 Study Population

Among patients with VTE, median age varied slightly by cohort, with apixaban (51–78 years) and dabigatran new users the oldest (50–78 years) compared with new users of warfarin (50–77 years) and rivaroxaban (48–76 years) [ESM]. Table 3 illustrates the baseline characteristics of the VTE study population by treatment exposure group for the pairwise comparison of rivaroxaban vs warfarin in the Optum database; data for the other five treatment comparisons are included in the ESM. Most patients had hypertension and approximately half had underlying heart disease and hyperlipidemia. Other common baseline comorbidities included osteoarthritis, acute respiratory disease, gastrointestinal reflux disease, diabetes mellitus, and renal impairment. The most prescribed baseline medications included systemic antibiotics, renin-angiotensin inhibitors, lipid-modifying agents, diuretics, opioids, and drugs for acid-related disorders. As noted previously, other covariates can be viewed in the online application.

Table 3 Selected baseline characteristics. For the full set of baseline characteristics, visit the online application: https://data.ohdsi.org/DoacsWarfarinSub/. The online application is searchable for the baseline prevalence of any covariate by navigating to the “Population Characteristics” tab and selecting the “Raw” view and searching for a covariate of interest before and after 1:1 propensity score matching for the comparison of rivaroxaban vs warfarin in the venous thromboembolism population from the Optum database

3.2.2 Incidence Rates of Severe Uterine Bleeding

In the VTE population during the primary on-treatment time-at-risk period, unadjusted IRs ranged from 2.8 events/1000 PYs among apixaban new users in MDCR to 33.7 events/1000 PYs among rivaroxaban new users in CCAE (Table 4). During sensitivity time-at-risk periods, IRs ranged from 2.3 to 25.6 events/1000 PYs when on-treatment was defined with 30-day persistence gaps and from 2.0 to 14.1 events/1000 PYs over the ITT follow-up. Small sample sizes and zero-event counts precluded generating meaningful IRs for dabigatran-exposed women with VTE.

Table 4 Unadjusted incidence rates (IRs) of severe uterine bleeding in the venous thromboembolism population by treatment exposure group during the primary and sensitivity observation periods

3.2.3 Comparative Results

For the VTE population after PS matching, patient counts, event counts, calibrated HRs, and associated 95% CIs for database-specific and meta-analytic estimates are reported for each pairwise comparison for the primary and sensitivity analyses in Fig. 3. In the primary analysis, a twofold increased risk of severe uterine bleeding was observed among rivaroxaban new users compared to warfarin new users, with a pooled cHR (95% CI) of 2.03 (1.19–3.27). The risk was consistently elevated across time-at-risk definitions with slight attenuation as time-at-risk increased. The pooled cHR (95% CI) was 1.84 (1.38–2.37) during the sensitivity on-treatment period and 1.32 (1.06–1.65) during the ITT period (Fig. 3a). New use of apixaban compared with warfarin was associated with a cHR (95% CI) of 1.13 (0.55–2.28) in the primary analysis, 1.14 (0.73–1.76) during the sensitivity on-treatment period and 1.00 (0.81–1.22) during the ITT period (Fig. 3b). Compared to apixaban, rivaroxaban was associated with cHR (95% CI) of 2.25 (1.45–3.41) for severe uterine bleeding, which was consistently observed during the sensitivity 30-day on-treatment period, with a cHR (95% CI) of 2.32 (1.41–3.59). An increased risk was also observed during the ITT period, albeit attenuated to 1.63 (1.11–2.45) (Fig. 3c).

Fig. 3
figure 3

Calibrated hazard ratios and calibrated 95% confidence intervals (CIs) for each pairwise comparison during primary and sensitivity observation periods in the venous thromboembolism (VTE) population after 1:1 propensity score matching: (a) rivaroxaban vs warfarin; (b) apixaban vs warfarin; and (c) rivaroxaban vs apixaban. Estimates are reported for pairwise comparisons in databases where study diagnostic passed. Note that limited exposures precluded producing estimates for comparisons including dabigatran in the VTE population. Summary meta-analytic estimates are reported where I2 < 40%. CCAE IBM MarketScan® Commercial Database, cHR calibrated hazard ratio, ITT intent-to-treat, MDCD IBM MarketScan® Multi-state Medicaid, MDCR IBM MarketScan® Medicare Supplemental Beneficiaries

3.3 Other Analyses

Results from the sensitivity PS matching strategy (1:100) were qualitatively similar to the primary PS approach in all comparisons (see ESM for AF and VTE populations).

4 Discussion

Using real-world data from routine clinical practice, we evaluated and compared the occurrence of severe uterine bleeding in relation to the use of rivaroxaban, apixaban, dabigatran, and warfarin for patients with AF and VTE. The overall incidence of severe uterine bleeding was low in the populations exposed to DOACs, although relatively higher in the younger VTE population than in the AF population (unadjusted IRs: 2.9–34.2 vs 1.9–10.1 events/1000 PYs). Among patients with AF, the risk of severe uterine bleeding was statistically comparable among all treatment groups in the main analysis. However, a trend toward an increased risk was observed among new users of rivaroxaban as compared to all other drugs. These results were sensitive to the time-at-risk periods. The sensitivity analyses of the on-treatment period allowing a 30-day gap between successive exposure intervals and the ITT period resulted in a statistically significant increased risk of severe uterine bleeding among new users of rivaroxaban vs warfarin with a calibrated pooled HR of 1.74 and 1.55, respectively. In the sensitivity on-treatment analysis, new users of rivaroxaban were also at a higher risk of severe uterine bleeding compared to apixaban, with a calibrated pooled HR of 1.47. In general, the statistically significant pooled HR was driven by results obtained from Optum and CCAE data sources. A potential explanation for the different observations may be that the on-treatment analysis is more restrictive, whereas the ITT analysis may include outcome events that are not associated with the treatment, particularly if they occurred long after the treatment discontinuation. In addition, while we observed differences in the statistical significance of the estimates across the different time-at-risk specifications, the estimates were directionally consistent.

Clinical trials of anticoagulant therapies in AF populations report bleeding adverse events, often referred to as major bleeding and clinically relevant non-major bleeding based on the definitions from the International Society on Thrombosis and Haemostasis [12]. Events may include gastrointestinal bleeding, intracranial bleeding, and bleeding from other sites, such as uterine bleeding. Results of these analyses have demonstrated comparable or improved overall major bleeding rates for DOACs vs warfarin [4, 26, 27]. A systematic review and meta-analysis of safety data from controlled trials of patients with atrial fibrillation determined no difference in the major bleeding risk between rivaroxaban or dabigatran compared with vitamin K antagonists but a 31% risk reduction with apixaban vs vitamin K antagonists based on the ARISTOLE trial [28].

Several observational studies have compared bleeding rates, but not specifically uterine bleeding, among DOACs and vitamin K antagonists in the AF population with varying results. No significant differences in short-term bleeding were identified between new users of dabigatran and a vitamin K antagonist (HR 0.88; 95% CI 0.64–1.21) and between new users of rivaroxaban and a vitamin K antagonist (HR 0.98; 95% CI 0.64–1.51) [24] or among patients with AF who switched from vitamin K antagonists to dabigatran or rivaroxaban [99 (1%)] vs those who did not switch therapy [193 (2%); p = 0.54] [29]. A significant difference in major bleeding risk among matched cohorts of patients with AF was identified between rivaroxaban and apixaban (4.2% vs 2.4%; HR 1.82; 95% CI 1.36–2.43) but not between new users of rivaroxaban and warfarin (4.9% vs 5.1%; HR 0.98; 95% CI 0.83–1.17) or between new users of rivaroxaban and dabigatran (3.3% vs 3.1%; HR 1.05; 95% CI 0.74–1.49) [30]. Optum data reported an increased risk of major bleeding among users of rivaroxaban compared to both apixaban (HR 2.56; 95% CI 1.85–3.57; p < 0.001 [data were inverted to provide the comparison of rivaroxaban vs apixaban]) and dabigatran (HR 1.30; 95% CI 1.10–1.53; p < 0.01) [31]. A similar PS 1:1 matching analysis using Medicare Advantage coverage found that apixaban was associated with a significantly lower risk of major bleeding than rivaroxaban and warfarin (HR 0.49; 95% CI 0.41–0.59; p < 0.001 and HR 0.53; 95% CI 0.45–0.63; p < 0.001, respectively) [32]. These data suggest possible differences in bleeding risk associated with anticoagulants in patients with AF.

Our results show that the risk of severe uterine bleeding was higher among patients with VTE than in patients with AF across all treatment groups. Further, rivaroxaban was associated with a statistically significant increased risk of severe uterine bleeding compared to both warfarin and apixaban in patients with VTE. These results were consistent across sensitivity analyses both over longer time-at-risk periods and using different PS matching approaches. A trend toward an increased risk of severe uterine bleeding was observed for new users of apixaban compared to warfarin during the primary observation period, but estimates attenuated with a longer time-at-risk duration. The higher risk of severe uterine bleeding in general, and the more evident relative risk difference for rivaroxaban users in the VTE population, is likely because of the age of the population. Female patients with VTE are more likely to be of reproductive age compared to female patients with AF, putting them at higher risk for severe uterine bleeding. Our findings are consistent with the results of prior studies. In an analysis of the EINSTEIN-DVT and EINSTEIN-PE clinical trials, women < 60 years of age had more frequent abnormal uterine bleeding when treated with rivaroxaban compared with enoxaparin/vitamin K antagonist (HR 2.13; 95% CI 1.57–2.89) [7]. The AMPLIFY trial reported that clinically relevant non-major vaginal bleeding occurred in similar proportions of women receiving apixaban and those receiving enoxaparin/warfarin (2.5% vs 2.1%, respectively; odds ratio 1.2, 95% CI 0.7–2.0) [8]. Moreover, in a single-center retrospective study, the risk of abnormal uterine bleeding was assessed among 104 female patients with VTE of reproductive age who were treated with either rivaroxaban or a vitamin K antagonist [33]. Patients receiving rivaroxaban were more likely to report an unscheduled contact with a physician (41% vs 25%; p = 0.096) and an increased need for menorrhagia-related medical or surgical intervention (25% vs 7.7%; p = 0.032) compared with women receiving a vitamin K antagonist [34].

This study provides real-world evidence on the comparative risk of severe uterine bleeding among anticoagulation treatments and across different patient populations. Past studies have suggested that the risk of bleeding in general and, specifically uterine bleeding, may vary for different indications of anticoagulation. However, existing comparative safety studies tended to focus on evaluating an outcome of general severe bleeding in the AF population and an outcome of abnormal uterine bleeding for the VTE population. Our results provide important evidence on considering patient characteristics and the indication of anticoagulation when balancing the risk and benefits of alternative anticoagulation treatments. Patients with AF are mostly aged older than 60 years and therefore beyond the reproductive age, but many female patients receiving DOACs for VTE are younger and can potentially be affected by this complication. Women requiring anticoagulation should be counseled and informed to assist in making appropriate treatment choices. The personalized management strategy for patients who might be at a higher risk of bleeding, such as those with heavy menstrual bleeding, requires careful evaluation of the risk vs the benefit of continued anticoagulation considering the severity of bleeding as well as individual thrombotic risk.

This study evaluated outcomes from large observational datasets that include a wide variety of clinical care settings of insured US adults and provide complementary information about anticoagulation treatment utilization and effects in routine clinical practice. The outcome under study, severe uterine bleed, is very rare and it is unlikely that randomized and/or prospective studies would have enough events and statistical power for a meaningful risk evaluation. In addition, the International Society on Thrombosis and Haemostasis definition of clinically relevant non-major bleeding does not include heavy menstrual bleeding, a less severe but more common bleeding outcome, which also contributes to the lack of uterine bleeding data and related research. Finally, our study allowed for direct estimation of incidence rates following the exposure to four investigated drugs, and the new-user cohort design captured early events following treatment exposures while avoiding confounding from previous treatment effects. This design also allowed for a clear exposure index date designation.

The following limitations of this study need to be considered. First, indication and outcome misclassification are a concern in studies using administrative databases because diagnosis codes intended for reimbursement cannot verify patient clinical condition with certainty. In addition, to our knowledge, the diagnostic and procedure codes used to define the outcome were not previously validated, which could be subject to measurement and classification errors. Random misclassification can attenuate the results toward the null. Second, as with any observational study, residual confounding may still be present. Some confounders may be unmeasured or inadequately represented in claims data, including weight, smoking status, many clinical measurements, and lifestyle behaviors, such as diet and exercise. Adjustment by PS cannot completely remove bias because of unmeasured or incorrectly specified confounders. It is worth mentioning that upon investigating the baseline characteristics of the treatment group (ESM Online Resources 3, 5, and 7), new users of apixaban tended to have higher comorbidity compared to the other three treatment groups. However, our diagnostics for all the presented results show that we achieved acceptable balance after PS matching (https://data.ohdsi.org/DoacsWarfarinSub/; “Covariate Balance” tab). In addition, our results were empirically calibrated on the basis of negative and positive controls and indicated sufficient control over residual error and unmeasured confounding inherent to the study design and data. It was noted that clinical equipoise [35] for some of the treatment comparisons was far below 50% based on the PS distribution, for example, rivaroxaban vs warfarin in the VTE population. When clinical equipoise is above 50%, the estimate from a treatment comparison may represent evidence that is reasonably generalizable to most patients treated in clinical practice. In contrast, when clinical equipoise is below 50%, the estimates may be less generalizable to the population treated in clinical practice because many patients in the comparison are incomparable and discarded from analysis. However, reduced generalizability does not undermine the strong internal validity of a finding where a subset of patients is well matched on the PS. Clinical equipoise varied considerably in this study because we made multiple exposure comparisons across multiple databases; therefore, interpretation of the results requires careful assessment of both generalizability and internal validity. Finally, some treatment comparisons exhibited bimodal and multimodal PS distributions, for example, rivaroxaban vs apixaban in the AF population. In these instances, the modal peaks represent a strong predictive contribution of one or a few covariates in the PS model.

5 Conclusions

The incidence of severe uterine bleeding was low in the AF and VTE populations studied. Among patients with AF receiving anticoagulants, the risk of severe uterine bleeding was moderately increased for rivaroxaban new users compared with new users of warfarin, apixaban, and dabigatran. This risk was sensitive to the time-at-risk period. In the younger VTE population, a consistent increased risk of severe uterine bleeding was observed for rivaroxaban in comparison to apixaban and warfarin. There was no strong evidence suggesting differential severe uterine bleeding risk between apixaban, dabigatran, and warfarin in either population. Personalized management strategies with careful evaluation of benefits and risks are required for women who need antithrombotic therapy.