Introduction

Atrial fibrillation (AFib) is the most common type of cardiac arrhythmia [1]. In the United States (US), AFib affects 2.7–6.1 million Americans and is associated with more than 454,000 hospitalizations and 158,000 deaths each year [2,3,4].

Oral anticoagulants (OACs) including warfarin and direct oral anticoagulants (DOACs) are approved for the management of AFib patients [5]. In the general AFib population, DOACs are associated with lower risk of ischemic stroke and bleeding compared with warfarin [6]. However, in the context of malignancy, AFib is associated with higher burden of mortality and mortality since both cancer and AFib are independent risk factors of ischemic stroke and venous thromboembolism (VTE) [7, 8]. Specifically, compared to patients with AFib, those with concomitant AFib and cancer have a fourfold to sevenfold higher risk of VTE and a twofold higher risk of bleeding [7, 8]. However, the effectiveness and safety profiles of DOACs and warfarin have not been well established among patients with AFib and cancer. Current guidelines from American Heart Association (AHA), European Society of Cardiology (ESC), American Society of Clinical Oncology (ASCO), and the International Society on Thrombosis and Haemostasis (ISTH) do not recommend any OAC over another [9,10,11,12]. As a result, less than half of AFib or atrial flutter patients with cancer initiated OACs [13,14,15].

Early randomized controlled trials (RCTs) comparing DOACs and warfarin in AFib patients excluded cancer patients due to potential drug-drug interactions with chemotherapy and low life expectancy (e.g., ENGAGE, ROCKET-AF, and ARISTOLE) [16,17,18]. A meta-analysis of these RCTs was conducted in subgroups of patients who developed cancer after randomization revealed that DOACs were associated with non-inferior rates of thromboembolic and bleeding events and reduced risk of VTE [19]. However, interpretation of the meta-analysis may be limited by heterogeneity in study design, cancer type, and treatment among included studies [19]. Recent observational studies comparing DOACs and warfarin among patients AFib and cancer have shown consistent findings compared with RCTs in favor of DOACs [20,21,22,23,24,25,26,27,28]. However, these observational studies were prone to limitations such as unmeasured confounding (cancer characteristics such as cancer stage, tumor grade, and cancer treatment). In addition, safety and effectiveness were rarely reported across cancer characteristics although they are important for tailoring the treatment selection in patients with cancer [20, 21, 26, 29].

In this study, we implemented a target trial framework to compare the effectiveness and safety profiles of DOACs and warfarin among newly diagnosed AFib patients with cancer using the Surveillance, Epidemiology, and End Results (SEER) registry linked to Medicare claims data to further capture cancer characteristics. The target trial emulation framework articulates the causal questions similar to an RCT protocol and explicitly emulating the components of that protocol using the observational data, which improves the quality of observational studies [30, 31].

Materials and Methods

Study Design and Data Source

We followed the Strengthening The Reporting of Observational studies in Epidemiology (STROBE) to report the components of our study [32]. We conducted a retrospective, population-based cohort study on the target trial framework [33, 34] using SEER registry linked to Medicare database from 2011 to 2019. SEER covers 48% of US population, collects, and releases data on cancer patients such as demographics, cancer characteristics, of treatment, and follow-up [35]. The linkage to Medicare data adds to SEER health care services utilization of beneficiaries [36]. The protocol for a target trial and emulation procedure is described in Table 1. The study design and timeline are illustrated in Figure S1.

Table 1 Protocol for a target trial and emulation procedure using the SEER-Medicare data base to estimate the risks of stroke and bleeding between warfarin users and DOAC users 

Study Sample and Eligibility Criteria

Study Sample

The study sample included individuals aged ≥ 66 with new onset non-valvular atrial fibrillation (NVAF) between January 1, 2012, and December 31, 2019, defined as any International Classification of Disease-9th Revision-Clinical Modification (ICD-9-CM) codes 427.31 or 427.32 or any International Classification of Disease-10th Revision-Clinical Modification (ICD-10-CM) codes I48.xx in any position on one Medicare inpatient claim or on two outpatient claims at least 7 days but < 1 year apart (Table S1, Supplemental materials) [37]. We retained new NVAF patients with any record of breast (ICD-O-3 codes C50.0-C50.9), lung (ICD-O-3 codes C34.0, C34.1, C34.2, C34.3, C34.8, C34.9, C33.9), and prostate cancer (ICD-O-3 codes C61.9) — the most commonly concomitant cancer types in AFib [21, 38, 39] in SEER files at any time before the initial AFib diagnosis. We required patients to continuously enroll in Medicare parts A, B, D, and without Medicare Advantage or Health Maintenance Organization (HMO) for 12 months before AFib diagnosis. We furthered restrict the study sample to those with a moderate to high risk of stroke, defined as CHA2DS2-VASc score ≥ 2 [40, 41].

Exclusion Criteria

We adapted exclusion criteria based on RE-LY and ROCKET-AF trials [40, 41]. In brief, we removed participants with indications for warfarin or DOACs other than NVAF or absolute contra-indications of warfarin or DOACs such as (1) any OAC use during the 12 months baseline period; (2) presence of mitral valve disease, heart valve repair or replacement, deep vein thrombosis, pulmonary embolism, or joint replacement during the 12 months baseline period; (3) any stroke within 14 days before first NVAF diagnosis; (4) major surgery (i.e., hip fracture, cardiac surgery), intracranial, intraocular, spinal, retroperitoneal or atraumatic intra‐articular bleeding, and gastrointestinal hemorrhage within 30 days before AFib diagnosis; (5) renal impairment stage 5 or end-stage renal diseases, during the 12 months baseline period. Individuals with any outcome of interest occurred before OAC initiation were also excluded from the analysis. All ICD codes to identify these conditions can be found in Table S1,Supplemental material.

Treatment Strategies and Assignments

In the hypothetical target trial, eligible individuals were randomly assigned to either (1) warfarin or (2) DOACs and continued the assigned treatment during follow-up. In the emulation of target trial, we assumed randomization was attained given baseline covariates [42]. OAC prescriptions (including warfarin and dabigatran, apixaban, rivaroxaban, edoxaban) were identified from Medicare Part D Prescription Drug Event (PDE) files using NDC, regardless of OAC dosage [43]. We used a grace period of 3 months to determine the treatment group of eligible individuals since a 3-month grace period has been specified in previous RCTs and the risk of stroke in general AFib patients is highest in the first 3 months after new AFib diagnosis [44, 45]. OAC discontinuation was defined as a gap in OAC prescription for ≥ 30 days from the last day of days’ supply in PDE file.

Follow-up

The follow-up started when the patients received the treatment within a grace period and ended at the occurrence of a specific study outcome, administrative censoring (12 months after baseline), death (all-cause deaths from SEER and Medicare files via the variables of “Date of Death Flag”), loss to follow-up (the earliest of 30 days after the end of continuous Medicare part A, B, or D enrollment or enrollment in an HMO), or December 31, 2019, whichever came first. For per-protocol analyses, follow-up also ends when the observed treatment deviated from initial treatment. Specifically, eligible individuals who discontinued the assigned treatment or switched their assigned treatment during follow-up (i.e. switching from warfarin to DOACs or vice versa) were censored.

Outcomes

The primary effectiveness and safety outcomes were ischemic stroke and major bleeding, respectively. We defined major bleeding based on the bleeding site, according to the algorithm by previously developed algorithms (i.e., intraarticular, intracranial, intramuscular, intraocular, intraspinal, pericardial, and retroperitoneal), identified by ICD-9-CM and ICD-10-CM codes in the primary diagnosis from Medicare medical claims files [21, 46, 47]. Secondary outcomes such as VTE, intracranial bleeding, gastrointestinal (GI) bleeding, and other non-critical site bleeding were defined by ICD-9-CM and ICD-10-CM codes from Medicare inpatient claims using validated algorithms [46,47,48].

Covariates

We carefully selected covariates based on published RCTs and observational studies [21, 38, 40]. Baseline covariates were extracted within 12-month period prior to first AFib diagnosis, including demographics (index age, sex, race/ethnicity, calendar year, geographical region, urbanicity), socioeconomic factors (household median income, percentage of household below poverty level, education level, Medicaid eligibility), risk score (CHA2DS2-VASc, HAS-BLED, and Comorbidity Scores SEER-Medicare version 2021 (NCI) [49]), individual comorbidities (asthma/chronic obstructive pulmonary disease, hematological disorders, dementia, depression, thrombocytopenia, acute kidney disease (AKD), peptic ulcer disease (PUD)), cancer characteristics (time from cancer diagnosis to the onset of AFib, cancer stage, tumor grade, cancer type, active cancer status [21, 38]), cancer treatment (radiation, and cancer-directed surgery, and potentially interacting antineoplastic agents), and medication history (angiotensin-converting enzyme inhibitors (ACEIs)/angiotensin II receptor blockers (ARBs), calcium channel blockers (CCB), beta blockers, antiarrhythmic medications, diuretics, statin, pump proton inhibitors, and serotonin reuptake inhibitors). Socioeconomic factors such as household income and education level are available at the aggregate area level. If patients had more than one type of cancer, we retained the most recent cancer diagnosis. Cancer treatment was obtained from diagnosis codes or procedures codes within 30 days before NVAF diagnosis. Other cancer characteristics such as number of regional nodes examined, tumor size, TNM classification, and other cancer-type specific characteristics such as hormone receptor status (HR), and human epidermal growth factor receptor 2 (HER2) for breast cancer or histologic type for lung cancer were used for descriptive purpose but not adjusted in the analysis due to high proportion of missing values [50].

For per-protocol analysis, the following time-varying covariates were extracted at a monthly basis after OAC initiations, including CHA2DS2-VASc score, HAS-BLED score, thrombocytopenia, AKD, radiation, cancer-directed surgery, and use of interacting treatment. All diagnosis codes and procedure codes for covariate ascertainment are described in Table S1, Supplementary materials. We used multiple imputation algorithm (fully conditional specification with logistic regression for categorical variables and predictive mean matching for continuous variables) to impute missing values [51].

Causal Contrast

We computed the observational analog of both intention-to-treat (ITT) and per-protocol (PP) effects. ITT effect refers to effect of being assigned to warfarin or DOACs, regardless of whether individuals adhere to initial strategies during follow-up [52, 53]. For PP effect, those who were assigned to warfarin group were censored if they discontinued warfarin or switched from warfarin to DOACs and vice versa.

Statistical Analysis

Descriptive statistics with mean and standard deviation (SD) for continuous variables, frequency count and percentage for categorical variables were used to describe the study sample. In the main analysis, we obtained the ITT effect (the effect of initiating DOACs compared with warfarin) and PP effect (the effect of sustaining DOACs compared with warfarin). First, we adjusted for potential confounders between treatment groups at each month using weighting approach. The total weights were a product of stabilized inverse probability treatment weights (IPTWs) and stabilized inverse probability censoring weights (IPCWs) due to loss to follow-up. Next, we fitted a weighted pooled logistic regression estimated by generalized estimating equations (GEEs) with robust variance estimators, adjusted for baseline and time-varying confounders. Incidence rate (case per person-year), absolute risk difference (RD, 95% CIs), and summary hazard ratios with 95% confidence intervals (HR, 95% CIs), and weighted survival curves were obtained. All weights were truncated at 99th percentile. The technical details of the adjustment are described in Technical Appendix. Statistical analysis was conducted using SAS (version 9.4, SAS Institute Inc., Cary, NC, USA).

Subgroup Analyses and Sensitivity Analyses

Treatment effects were estimated under the following subgroups: cancer type (breast, lung, prostate), cancer status at baseline (active, history), cancer stage (local, regional, and distant), and tumor grade (I, II, and III). A series of sensitivity analyses were conducted to confirm the robustness of main findings. First, we extended a grace period to 6 months from AFib diagnosis to warfarin/DOAC initiation to reflect the uncertainty in clinical practice when patients are allowed time to complete clinical tests before treatment initiation or get access to initial treatment. Second, we included individuals with all levels of baseline CHA2DS2-VASc score because patients with cancer are at higher risk of stroke and may be eligible to initiate OACs. Third, we estimated the long-term treatment effects by extending follow-up time to 36 months. Fourth, we removed individuals with metastatic cancer at baseline, who may have a low life expectancy based on previous RCTs [40, 41]. Fifth, we excluded individuals with thrombocytopenia at baseline, since these patients are at elevated risk of bleeding [29, 54]. Six, we further truncated stabilized weights at 95th percentile to test the robustness of the treatment effects to the presence of extreme weights.

Results

Study Sample and Characteristics

Among 70,035 patients with newly diagnosis of AFib and concomitant cancer in SEER-Medicare 2012–2019, 5371 DOAC initiators (3264 apixaban, 314 dabigatran, 1786 rivaroxaban, seven edoxaban) and 1788 warfarin initiators were included in the final sample (Fig. 1).

Fig. 1
figure 1

Flowchart diagram for study sample selection

Study sample characteristics are fully described in Table S2. Patients who initiated DOACs had a higher socio-economic status (higher household median income and education level) and lower comorbidity burden (CHA2DS2-VASc, HAS-BLED, and NCI scores) compared to those initiated warfarin. Regarding cancer characteristics, patients with breast or prostate cancer were more likely to receive DOACs while more patients with lung cancer were on warfarin. Warfarin was more commonly prescribed in patients with active cancer and patients in the advanced stage of cancer (regional and distant) (Table S2).

Main Analysis

The incidence rates of stroke and major bleeding between DOAC and warfarin initiators were 9.97 vs. 9.91 and 7.74 vs 9.24 cases per 1000 person-years, respectively. In adjusted ITT analysis, DOAC initiators had no statistically significant difference in risk of ischemic stroke (HR = 1.14, 95% CI 0.77–1.68) and major bleeding (HR = 0.87, 95% CI 0.52–1.44) as warfarin initiators. DOAC initiators also had no statistically significant difference in risk of VTE (HR = 0.73, 95% CI 0.47–1.15), and intracranial bleeding (HR = 0.78, 95% CI 0.45–1.35), but lower risk of GI bleeding (HR = 0.77, 95% CI 0.59–0.99) and non-critical site bleeding (HR = 0.63, 95% CI 0.50–0.77) compared to warfarin initiators (Fig. 2; Table 2). The findings were mostly consistent in adjusted PP analysis (ischemic stroke (HR = 1.81, 95% CI 0.75–4.36), VTE (HR = 0.50, 95% CI 0.20–1.09), intracranial bleeding (HR = 0.38, 95% CI 0.13–1.12), GI bleeding (HR = 0.91, 95% CI 0.61–1.35), and non-critical site bleeding (HR = 0.69, 95% CI 0.48–0.98)) (Fig. 3; Table 3). However, DOAC initiators had lower risk of major bleeding (HR = 0.35, 95% CI 0.12–0.99) than warfarin initiators in adjusted PP analysis, although 95% CI was wide. The distributions of total weights and truncated weights are described in Table S2.

Fig. 2
figure 2

Weighted survival curves for intention-to-treat analysis comparing safety and effectiveness between direct oral anticoagulants and warfarin in patients with atrial fibrillation and cancer

Table 2 Intention-to-treat analysis for comparative effectiveness and safety between DOACs and warfarin in patients with AFib and cancer
Fig. 3
figure 3

Weighted survival curves for per-protocol analysis comparing safety and effectiveness between direct oral anticoagulants and warfarin in patients with atrial fibrillation and cancer

Table 3 Per-protocol analysis for comparative effectiveness and safety between DOACs and warfarin in patients with AFib and cancer

Subgroup and Sensitivity Analyses

The main findings were consistent with minor heterogeneity across subgroups of cancer type, active/inactive cancer status, cancer stage, and tumor grade. Due to small sample size and few events, subgroup analyses were not conducted for some outcomes in prostate cancer, regional or metastatic cancer, or tumor grade I. Statistically significant benefits of DOACs over warfarin were detected for VTE in breast cancer; GI bleeding and non-critical site bleeding in prostate cancer and local cancer stage; VTE and non-critical site bleeding in inactive cancer and tumor grade II; major bleeding and intracranial bleeding in patients with regional cancer stage and tumor grade III (Table S3). Finding remained robust under different assumption of grace period, additional inclusion and inclusion criteria, extended follow-up, and extreme weights (Table S4).

Discussion

By explicitly emulating the target trial comparing effectiveness and safety between DOACs and warfarin, we found that DOAC initiators had no significant difference in risks of ischemic stroke and major bleeding compared with warfarin. However, DOAC initiators had lower risk of major bleeding than warfarin initiators in adjusted PP analysis. In addition, DOACs initiators also had lower risk of secondary outcomes including GI bleeding and non-critical site bleeding than warfarin initiators. Findings remain consistent across subgroups and robust in sensitivity analyses.

The management of patients with AFib and cancer are more complicated than general AFib patients because the presence of cancer increases the risk of stroke and bleeding in these patients [7, 18]. Although clinical guidelines are inconclusive, DOACs were increasingly preferred for the management of AFib in the presence of cancer [21, 28, 39]. The growing body of evidence in the literature showed inconsistent findings in comparative effectiveness and safety between DOACs and warfarin in patients with AFib and cancer. In RCTs such as ENGAGE, ROCKET-AF, and ARISTOLE, patients with existing cancer were generally excluded due to their low life expectancy [16,17,18]. A meta-analysis of these RCTs found that DOACs were associated with non-inferior rates of thromboembolic and bleeding events compared with warfarin [19]. A potential limitation of the meta-analysis is clinical heterogeneity (i.e., patient characteristics, treatment, follow-up) and methodological heterogeneity (i.e., study design and analysis) among included studies [19]. Using administrative claims data, Shah (2018) and Deitelzweig (2021) found similar risk of stroke and bleeding between DOACs and warfarin, except for apixaban (lower risk of stroke and bleeding) [21, 38]. Shah also found a lower risk of stroke among DOAC user compared with warfarin. An important limitation of these studies is unmeasured confounding, such as cancer stage and tumor grade (not available in claims data), which may result in a differential risk of stroke and bleeding between DOACs and warfarin. In addition, although DOACs are associated with reduced risk of intracranial bleeding in non-cancer patients [55, 56], the benefit over warfarin was not found in our study. Indeed, both ITT and PP analyses showed a trend of reduced intracranial bleeding for DOACs initiators than warfarin initiators (HR = 0.78 (0.45, 1.35) and HR = 0.38 (0.13, 1.12)), but the treatment effects were not statistically different. This may be due to the small number of events in both groups and the small sample sizes of this study. In addition, pharmacokinetic and pharmacokinetics of OACs in cancer patients may not be stable as in non-cancer patients due to hemodynamic changes or drug-drug interactions with antineoplastic agents, which may prevent OACs from achieving their therapeutic effects [17, 18, 57,58,59]. Recently, Mehta (2022) used the SEER-Medicare data and conducted a new-user cohort study comparing risk of stroke and bleeding between DOACs and warfarin to further capture cancer characteristics from SEER registry [28]. The authors found an increased risk of stroke (HR = 1.41, 95% CI 0.92–2.14) but decreased risk of bleeding (HR = 0.90, 95% CI 0.70–1.17) among DOACs users compared to warfarin users. Since Mehta included patients with existing AFib and prior stroke or recent bleeding before OAC initiation, some patients may take DOACs or warfarin for secondary prevention of stroke. As a result, the observed incidence rates of ischemic stroke and bleeding are higher than our study because prior stroke or bleeding are strong risk factors for subsequent events [60,61,62,63]. In the current study, we emulated a target trial to explicitly answer a causal question: among patients with existing cancer who were newly diagnosed with AFib, what is the effect of initiating and/or sustaining DOACs compared with warfarin? We required patients to be outcome-free shortly before AFib diagnosis and during the grace period before they actually received OACs. This approach ascertains unbiased estimates for effectiveness and safety of OACs for primary prevention purposes and better interpretability in our study. Other strengths of our study included the adjustment for selection bias due to loss to follow-up [64], and adjustment for time-varying covariates [65], and estimand of interest specification (ITT or PP) [66]. Our findings contribute to the growing evidence and are expected to help clinicians optimize anticoagulation therapy in patients with AFib and cancer.

Our study may not perfectly emulate components of hypothetical target trials and referent RCTs such as inclusion/exclusion criteria, outcomes, and follow-up [40, 41]. For instance, RCTs excluded patients platelet count < 90,000/μL, systolic blood pressure ≥ 180 mmHg or diastolic blood pressure ≥ 100 mmHg, or creatinine clearance less than 30 mL/min at the baseline [40, 41]. However, we failed to obtain these lab values in SEER-Medicare. As a solution, we defined these conditions by the presence of thrombocytopenia or severe renal impairments or ESRD. In addition, several conditions were evaluated by clinicians’ assessment in RCTs rather than ICD codes in medical records, such as AFib definition by an abnormal electrocardiogram (ECG) [40, 41]. Also, we could not use international normalized ratio (INR) and liver-function tests to monitor treatment responses as in RCTs [40, 41]. It is also necessary to highlight that misspecification of time zero is the major source bias in observational studies, but not lack of randomization [33, 34]. Successful emulation of hypothetical and real RCTs using observational data has been conducted recently [67,68,69]. In this study, we assured exchangeability between DOAC users and warfarin users by adjusting for baseline and time-varying confounding [42]. In addition, we aligned the time when all inclusion and exclusion criteria met, start of treatment strategies, and start of follow-up. Correct time zero specification in observational studies removed immortal time bias and prevalent user bias [33, 34].

This study has several limitations. First, unmeasured confounding such as patient frailty, body mass index, or physician’s preference were not adjusted in the analysis. Frailty was found to be a predictor of OAC selection and adverse outcomes in previous studies [70]. However, these variables are not available in SEER-Medicare data and we did not quantify the magnitudes of these unmeasured confounding in our analysis. Second, although we used validated algorithms to define exposure, outcomes, and covariates, measurement bias may still persist in claims data. For example, we measured patients’ behavioral risk factors (i.e., alcohol use disorders in HAS-BLED score) using ICD codes [71]. In recent studies, CHA2DS2-VASc score did not performed well in predicting risk of stroke in cancer patients, and novel assessment tool should be developed and validated for this population [72, 73]. Thus, our study could not fully capture the potential candidates for OAC initiation at study entry because a threshold of CHA2DS2-VASc score ≥ 2 may not be applicable for patients with AFib and cancer [74]. However, we found no difference when patients with all levels of CHA2DS2-VASc scores were included in the sensitivity analysis. In addition, 12-month baseline period before AFib diagnosis was insufficient to capture patients’ baseline characteristics. It is also noticed that socioeconomic factors were available on an aggregate Census tract level. Third, the presence of extreme weights may increase the variability of the treatment effects, although we found consistent findings after truncating weights to 95th percentile [75]. Fourth, some analyses, especially subgroup analyses, may be underpowered or could not be performed due to small sample size, leading to unstable treatment effects and wide 95% CIs. Likewise, we could not conduct the analysis stratified by individual DOACs due to limited sample size. Prior studies suggested apixaban stood out among DOACs in reducing risk of stroke and bleeding [21, 38]. Fifth, our findings may not be generalizable to non-Medicare populations or those who developed cancer after AFib diagnosis.

Conclusions

In this target trial emulation using linked cancer registry and administrative claims data, we found that DOACs are safe and effective alternatives to warfarin in the management of patients with AFib and cancer.