Introduction

Atrial fibrillation (AF) is the most common arrhythmia, with an estimated prevalence of about 1–2% in the overall population and up to 15% in the elderly aged over 80 years of age [1]. Multiple risk factors are associated with an increased likelihood of AF development, such as hypertension, valvular heart disease, heart failure, ischemic heart disease and sleep apnea [2]. Hence, AF patients are commonly multi-morbid, with frailty, polypharmacy and clinical complexity, with implications for treatment and prognosis [3,4,5]. The current holistic or integrated care approach to AF management [6], which is associated with improved clinical outcomes [7], is therefore recommended in guidelines [8, 9].

Reducing the risk of risk of stroke and thromboembolism using appropriate oral anticoagulation (OAC) therapy is one the fundamental pillars of the management of AF. Several studies have shown that direct-acting oral anticoagulants (DOACs) avoid most of the drawbacks of vitamin K antagonists (VKAs), but there is still scarce evidence about their efficacy and safety in certain subpopulations. Indeed, the pivotal DOAC randomized clinical trials (RCTs) were characterized by strict inclusion and exclusion criteria [10,11,12,13].

Although the strongest scientific evidence is derived from RCTs, these studies are not free of limitations, such as a careful patient selection and exclusion criteria. The reality is that in daily clinical practice, patients are much more heterogeneous than in RCTs and present with comorbidities that would be considered as an exclusion criterion for RCT enrollment. Therefore, there is a need to validate the event rates seen in RCTs in observational studies to investigate how these clinical characteristics influence adverse events and verify if RCT results could be extrapolated to daily clinical practice. Herein, we aimed to investigate whether the clinical characteristics of AF patients who meet RCT exclusion criteria are different from those of patients who do not meet exclusion criteria, and second, to compare the risk of adverse events between these two groups.

Methods

This was a prospective observational cohort study including 1,050 outpatients diagnosed recently with any type AF (either paroxysmal, persistent, long-term persistent or permanent) and naïve for OAC, starting VKA therapy at our anticoagulation clinic of a tertiary hospital (Murcia, Spain) from July 1, 2016, to June 30, 2018. Only adult patients were included. Patients with prosthetic heart valves, rheumatic mitral valves or other severe valvular disease were excluded, with no other exclusion criteria. A more detailed description of the methods and the Murcia AF Project II (MAFP-II) cohort has been previously published [14]. The present is post hoc analysis of an all-comers design study. Hence, no sample size calculations were performed.

At baseline, a complete medical history was obtained by collecting socio-demographic and anthropometric data, comorbidities, and concomitant therapies. In addition, stroke (CHA2DS2-VASc) and bleeding (HAS-BLED) risk scores were calculated.

The study protocol was approved by the Ethics Committee from the University Hospital Morales Meseguer (reference: EST: 20/16) and carried out in accordance with the ethical standards established in the 1964 Declaration of Helsinki its subsequent amendments. Informed consent was required to participation in this study.

Definition of exclusion criteria for RCT

Patients were classified according to the presence, at study baseline, of exclusion criteria commonly established in the RE-LY, ARISTOTLE, ROCKET-AF, and ENGAGE AF-TIMI 48 trials [10,11,12,13]. Patients were classified as “meeting exclusion criteria” if they fulfilled at least one of the following:

  1. i.

    Moderate–severe anemia (defined as a hemoglobin < 100 g/L).

  2. ii.

    Severe renal failure (estimated creatinine clearance ≤ 30 mL/min) or serum creatinine > 2.5 mg/dL.

  3. iii.

    Uncontrolled hypertension (systolic blood pressure > 180 mm Hg or diastolic blood pressure > 100 mm Hg).

  4. iv.

    Concurrent treatment with aspirin and a thienopyridine (e.g., clopidogrel, ticlopidine).

  5. v.

    Platelet count < 90,000/μL.

  6. vi.

    Frailty: severe comorbid condition with life expectancy ≤ 1 year.

  7. vii.

    Active alcohol abuse.

  8. viii.

    Recent severe stroke (< 6 months).

Follow-up and clinical outcomes

Follow-up was for 2 years. During this period, ischemic stroke/transient ischemic attack (TIA), major bleeding (according to the 2005 International Society on Thrombosis and Haemostasis [ISTH] criteria) [15], major adverse cardiovascular events (MACE, defined as the composite of ischemic stroke/TIA, myocardial infarction or cardiovascular death), and all-cause mortality were the primary endpoints. The investigators identified, confirmed, and recorded all clinical outcomes. Of note, no patient was lost to follow-up.

Statistical analysis

Quantitative variables were expressed as mean ± standard deviation (SD) or median and interquartile range (IQR) as appropriate, while categorical variables were expressed as absolute frequencies and percentages. Pearson’s chi-square test was used to compare proportions, and differences between quantitative and categorical variables were assessed using the Mann–Whitney U test or Student t test as appropriate.

Incidence rates with their Poisson 95% confidence intervals (CI) for primary outcomes during the follow-up were calculated in patients with and without exclusion criteria. Incidence rates were then compared and reported as incidence rate ratios (IRRs), with “no exclusion criteria” as the reference category.

Adjusted Cox proportional hazard regression models were performed to determine the independent risk of the primary outcomes based on the sum of exclusion criteria, on the presence of at least one exclusion criteria, and on the presence of ≥ 2 exclusion criteria. Results were expressed as adjusted hazard ratio (aHR) with the corresponding 95% CI. Survival analyses were performed using Kaplan–Meier curves, testing the difference by the log-rank test.

A p value < 0.05 was accepted as statistically significant. Statistical analyses were performed with SPSS v. 25.0 (SPSS, Inc., Chicago, IL, USA) and MedCalc v. 16.4.3 (MedCalc Software bvba, Ostend, Belgium) for Windows.

Results

A total of 1050 AF patients (51.5% female, median age 77, IQR 69–83 years) were included, with a median CHA2DS2-VASc of 4 (IQR 3–5) and a median HAS-BLED of 2 (IQR 2–3). Of these, 682 (65%) had no RCT exclusion criteria, and 368 (35%) patients met at least one RCT exclusion criterion. The distribution of patients according to the number of exclusion criteria is detailed in Fig. 1 and the proportion of patients with different exclusion criteria is presented in the Supplementary Table 1.

Fig. 1
figure 1

Distribution of patients based on the number of exclusion criteria

Patients meeting at least one exclusion criteria were older and characterized by a greater presence of comorbidities as observed in Table 1. As a result, patients with at least one exclusion criteria had significantly higher median CHA2DS2-VASc (4 [IQR 3–6] vs. 4 [IQR 3–5]; p < 0.001) and HAS-BLED (3 [IQR 2–4] vs. 2 [IQR 2–3]; p < 0.001) scores. Additionally, the time in therapeutic range (TTR) was significantly lower in the group with at least one exclusion criteria (60.3% [IQR 47–74] vs. (65.7% [IQR 51–80]; p < 0.001).

Table 1 Baseline clinical characteristics

Primary adverse events and relationship to RCT exclusion criteria

During a median follow-up of 2 years, 67 (6.4%) patients suffered an ischemic stroke/TIA; 64 (6.1%) had a major bleeding event; 65 (15.7%) suffered from any of the MACE components; and 173 (16.5%) died. The IRRs for major bleeding, MACE, and all-cause mortality were significantly higher among patients with any exclusion criteria (all p < 0.001), but the IRR of ischemic stroke/TIA was non-significantly different. Detailed comparisons are shown in Table 2.

Table 2 Incidence rate and incidence rate ratio of the different outcomes among periods

Notwithstanding the low numbers and wide 95% CIs, there was no statistical difference for ischemic stroke/TIA in relation to RCT exclusion criteria either for the number of accumulated exclusion criterion met (HR 1.01, 95% CI 0.70–1.45, p = 0.940), or according to meeting at least one criterion (HR 1.24, 95% CI 0.75–2.03, p = 0.401) or ≥ 2 criteria (HR 0.503, 95% CI 0.12–2.06, p = 0.339).

Cox regression showed that patients had a significantly higher risk of major bleeding for each exclusion criterion, even after adjusting for HAS-BLED score (aHR 1.90, 95% CI 1.44–2.50, p < 0.001). The risk of major bleeding was significantly higher in patients meeting at least one exclusion criteria compared to those without exclusion criteria (aHR 1.48; 95% CI 1.22–1.81; p < 0.001), and in those fulfilling at least two exclusion criteria (aHR 2.94; 95% CI 1.52–5.70; p = 0.001).

For MACE, there was an increased risk per additional exclusion criterion in the model adjusted by CHA2DS2-VASc and HAS-BLED scores (aHR 1.29, 95% CI 1.05–1.58, p = 0.015). In turn, patients with at least one exclusion criterion had 50% increased risk of MACE (aHR 1.51, 95% CI 1.10–2.09, p = 0.012), whereas patients with ≥ 2 criteria exclusion criteria had a non-significantly increase in risk (aHR 1.35, 95% CI 0.81–2.25, p = 0.255).

The risk of death was significantly higher as the number of RCT exclusion criteria increased on Cox regression analysis adjusted by CHA2DS2-VASc and HAS-BLED (HR 1.86, 95% CI 1.58–2.19, p < 0.001). Patients with at least one exclusion criterion had a threefold higher risk of death (aHR 3.22, 95% CI 2.32–4.48, p < 0.001), and those with at least two exclusion criteria were also at increased risk (aHR 2.58, 95% CI 1.72–3.87, p < 0.001).

Kaplan–Meier analyses showed that patients with exclusion criteria had lower event-free survival from major bleeding (log-rank p < 0.001), MACE (log-rank p < 0.001), and all-cause mortality (log-rank p < 0.001), but not from ischemic stroke/TIA (log-rank p = 0.400) (Fig. 2).

Fig. 2
figure 2

Kaplan–Meier survival curves for the primary outcomes. Solid black line = patients with no exclusion criteria. Dashed black line = patients with at least one exclusion criteria

As there were significant differences in median TTR between both groups of patients, a sub-analysis adjusting also for TTR in the Cox regression analyses was performed (Supplementary Table 2). Overall, the results were similar to the main analysis and the risk of major bleeding, MACE and all-cause death was significantly increased in patients with at least one exclusion criterion compared to those without exclusion criteria, even accounting for the quality of VKA treatment (aHRs of 2.37, 1.43 and 2.50, respectively).

Discussion

In this prospective cohort study, our principal findings are as follows: (i) one-third of AF patients met some common RCT exclusion criteria, and thus, these patients would be systematically excluded from RCT participation; (ii) patients who met exclusion criteria in RCTs had a higher risk profile, and greater comorbidities; and (iii) the presence of exclusion criteria was associated with an increased risk of major bleeding, MACE, and mortality, which was even higher per with each additional exclusion criteria.

Stroke prevention with OAC is an essential part of the management of AF [8, 9]. However, there remains a significant proportion of patients in whom the management of OAC is particularly challenging. This may be because many of the studies and clinical trials on OAC established strict inclusion and exclusion criteria, resulting in under-representation of such patients who are usually managed in everyday clinical practice with limited evidence-based support, but leading to uncertainties and lack of information when trying to extrapolate the results of RCTs to everyday clinical practice.

Therefore, although RCTs provide the highest level of scientific evidence, observational studies would supplement the evidence gap in this field. For this reason, there should be an increase in the number of pragmatic clinical trials where participants resemble the less selected populations that would benefit from the treatments and interventions under investigation. In pragmatic trials, inclusion and exclusion criteria should be reduced as much as possible, and both the number of reviews and the completeness of patient follow-up should be increased, given the trends toward more personalized medicine. This approach allows the results of pragmatic trials to closely resemble daily clinical practice [16].

Undoubtedly, the presence of cardiovascular risk factors adds to the clinical complexity and challenges of AF management. Conditions such as thrombocytopenia may indicate the presence of comorbidities associated with poor survival in AF [17], and acts as an independent risk factor for bleeding [18]. There are even additional exclusion criteria beyond those considered in this study. For instance, patients with liver dysfunction were excluded from all RCTs comparing DOACs to warfarin due to their higher risk of both thrombosis and bleeding compared to AF patients without liver disease, thereby creating a significant knowledge gap [19]. Indeed, liver disease and fibrosis have shown to increase the risk of worse clinical outcomes, particularly in VKA users [20,21,22]. Despite the high risk of adverse outcomes in these complex patients, anticoagulation is under-prescribed and discontinuations are common [3]. DOACs offer a better safety and effectiveness profile, but the scientific evidence from RCTs is limited as these patients are less likely to have been included in a clinical trial.

The insights gleaned from post-authorization studies provide several significant benefits for patients undergoing treatment with DOACs. These studies serve as valuable complements to earlier-phase clinical trials, providing additional data on the safety and efficacy of these medications in less restricted scenarios. For example, the ETNA-AF and XANTUS studies confirmed the safety and efficacy of edoxaban and rivaroxaban over warfarin in clinical practice settings. Notably, their findings are even more reassuring than those from their respective pivotal trials. The broader patient profiles observed in these studies suggest a wider applicability of DOACs compared to the more stringent inclusion criteria of phase III trials [23]. Moreover, the comorbidity profiles of patients in these studies closely resemble those in our own research.

We previously demonstrated that everyday clinical practice patients have inherent factors that make them different from patients included in an RCT, and are at a higher risk of adverse events when compared to RCT patients, even if differences are appropriately adjusted using statistical methods [24]. Similarly, a study showed that risk profiles are higher in AF patients included in observational studies compared to patients included in RCTs, and the higher rates of bleeding and mortality not attributable to thromboembolism in these patients suggest that there are additional differences in their characteristics that are important contributors to this difference in event rates [25]. Another recent study showed that in elderly patients with venous thromboembolism, those with ≥ 2 exclusion criteria had a twofold (sHR 2.16, 95% CI 1.38–3.39) increased risk of major bleeding compared to eligible patients, and the bleeding risk increases significantly with the number of exclusion criteria present [26]. On the contrary, Steinberg et al. found that patients treated with OACs and contraindication to anticoagulation had a lower risk of stroke, hospitalization, and death although there was an increased risk of intracranial hemorrhage. In addition, they showed that even in cases of blood dyscrasias and intracranial bleeding, the net clinical benefit was still positive, hence demonstrating that a high risk of bleeding should not be an absolute contraindication to OAC as many of the risk factors for bleeding are also risk factors for stroke [27].

Certainly, many comorbidities, including anemia, thrombocytopenia, renal failure, or uncontrolled hypertension do not represent an absolute contraindication to OAC. Therefore, future RCTs should reduce the number of exclusion criteria so that they can provide a more objective and less restrictive scenario, thus providing a more realistic picture and broadening the generalizability of study results. In parallel, it would be desirable to investigate which specific exclusion criteria are associated with a higher risk of events, in order to design specific studies in such populations.

Limitations

There are some limitations in relation to this study that should be acknowledged. The main one lies in its observational nature, with a Caucasian-based population and single-center design. Another potential limitation is patient selection because only those patients who initiated VKA therapy were included as this was the only OAC authorized for OAC-naïve patients in Spain. Thus, we need to recognize that our results could not apply to a DOAC-treated population, where the risk of several outcomes could be attenuated by the greater effectiveness and safety of DOACs.

In addition, we acknowledge that the definitions of stroke exclusion criteria are not exactly the same between the four DOAC RCTs (RE-LY: strokes within the previous 14 days were excluded [6 months for severe stroke]; ARISTOTLE: strokes within the previous 7 days were excluded; ROCKET AF: strokes within the previous 14 days were excluded [3 months for severe stroke]; ENGAGE TIMI AF: strokes within the previous 30 days were excluded). Hence, we decided to use the 6 months timing for severe strokes.

Finally, although we included severe frailty condition as a criterion, we did not consider other subjective exclusion criteria such as “the investigator’s opinion that a subject would have an unacceptable risk from study participation, a suboptimal compliance, or any condition precluding successful study completion”. It is therefore likely that the real proportion of non-eligible patients for a RCT would be even higher.

Conclusions

In this prospective cohort of AF patients taking VKA therapy, more than one-third of patients had at least one RCT exclusion criteria, which translates into an increased risk of major bleeding, MACE, and death. Our findings suggest that may be variations in the results of RCTs when applied to patients from everyday clinical practice and the risk profile could be difference. These issues should be considered when translating RCTs results to AF patients for a proper and a more patient-centered management.