Background

Atrial fibrillation (AF) is the most common arrhythmia in clinical practice affecting 44 million people worldwide, and its incidence is expected to increase further in the following years [1].

AF presence has been associated with adverse long-term outcomes, namely twofold increase in total mortality, heart failure prevalence and hospitalizations, as compared to non-AF patients. What is more, AF course is frequently complicated by systemic embolic events, vascular dementia and impaired quality of life [2].

Anticoagulation, heart rate control and management of comorbidities are the three major therapeutic pillars in the treatment of patients with AF and have been shown to reduce overall mortality, hospital admissions and thromboembolic events [3]. Rhythm control is currently recommended for symptom control and quality of life improvement in symptomatic AF patients, despite optimally tolerated heart rate and comorbidities control. In particular, AF catheter ablation (AFCA) is indicated for rhythm control after one failed or intolerant antiarrhythmic drug has been tested [2].

AF catheter ablation (AFCA) has revolutionized AF management and two recently published trials suggested that cryoablation is more effective first line approach than antiarrhythmic drugs [4, 5]. On the contrary, AFCA has been hindered by a ceiling of long term success rate ranging between 65 and 78% [6]. Importantly, current arrhythmia recurrence prediction scores post-AFCA feature moderate discriminatory ability [7].

2020 European Society of Cardiology guidelines for the diagnosis and management of atrial fibrillation endorsed (class IIa recommendation) a structured characterization of AF (the so-called 4S-AF scheme). Stroke risk, severity of AF burden, symptom status and AF substrate constitute the 4S-AF scheme. Hence, the role of AF substrate is increasingly recognized and non-invasive multimodality imaging is capable of characterizing left atrial (LA) morphology and function, which have been proven so far predictive of stoke development [8] as well as AF recurrence post-AFCA [9].

Since left atrial appendage (LAA) is a critical morphofunctional component of LA, we hypothesized that AF related atrial cardiomyopathy also affects LAA anatomic and functional indices. Previous studies have shown that LAA emptying velocity and ejection fraction are improved three months after catheter ablation in patients with paroxysmal AF [10]. In this systematic review we attempted to answer the question of whether pre-ablation LAA indices could be predictive of AFCA long term success.

Methods

Data sources and search strategy

Studies including patients undergoing first catheter ablation for AF in whom late arrhythmia recurrence was assessed were evaluated for inclusion in this meta-analysis. Search strategy, study selection, data extraction, and data analysis were performed according to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses 2009 guidelines [11]. Two reviewers (KP and DV) independently identified the relevant studies by an electronic search of the MEDLINE database, ClinicalTrials.gov, medRxiv and Cochrane Library from inception to 15th of August 2022. The following search query was used: ‘’atrial appendage’’ and ‘’atrial fibrillation ablation recurrence” (see Additional file 1: Table S1). Articles and book chapters cited in the reference lists of initially identified articles by this query were reviewed in order to identify any supplemental studies (“snowball procedure”). The final list of eligible articles was filtered manually to exclude duplicates. No language restriction was utilized, and all relevant studies were screened irrespective of study design (randomized and non-randomized studies of retrospective or prospective design).

Inclusion and exclusion criteria

In order for a study to be eligible, it had to fulfill the following inclusion criteria: (1) evaluated late arrhythmia recurrence rates in patients undergoing AFCA (2) employed a clearly stated definition of arrhythmia recurrence (3) reported data on pre-ablation LAA indices and described the employed imaging modality and technique. Studies were excluded if they were: (1) not reporting data on arrhythmia recurrence and/or LAA indices (2) case reports (3) evaluating arrhythmia recurrence and/or LAA indices in other ablation modalities for AF (surgical, epicardial or hybrid ablation).

Data extraction

Data were independently extracted and reviewed from each study by two reviewers (KP, DV). Any discrepancy between data extractions was resolved by discussion or a third reviewer (SG). The following data were extracted: first author, year of publication, country, study design (prospective/retrospective), number of patients, patient demographics, matching criteria and descriptive statistics of recurrence and no recurrence groups, LAA indices and imaging technique, and catheter ablation modality and technique.

Quality assessment

Quality of the included studies was conducted via the Newcastle–Ottawa Scale (NOS) [12], in which a study was judged on three categories: selection, comparability and exposure/outcome. A nine-point scale of the NOS (range, 0–9 points) was eventually used for the evaluation. Two authors (KP, SG) discussed the implementation of this quality assessment tool and independently assessed the studies. Studies were defined as high quality if they had more than seven points, as medium quality if they had between four and six points, and as poor quality if they had fewer than four points.

Outcomes of interest

The pre-specified primary endpoint was pre-ablation difference in LAA anatomic or functional indices between patients suffering arrhythmia recurrence and arrhythmia free counterparts post-AFCA. LAA indices were assessed and measured according to the definitions reported in the original study protocols (see Additional file 1: Tables S2 and S3).

Statistical analysis

The descriptive statistics were described as mean ± standard deviation (SD). For continuous variables the standardized mean difference (SMD) with 95% confidence intervals (CI) was used as the summary statistic and trial-specific data were pooled with the inverse-variance random-effects method. When mean and standard deviation were not available, they were derived from sample size, median, and range based on a method previously described by Wan et al. [13]. For categorical variables statistical pooling was performed according to a random-effect model with generic inverse-variance weighting of odds ratio, computing risk estimates with 95% CI. The presence of heterogeneity among studies was evaluated with the Cochran Q chi-square test with p ≤ 0.1 considered to be of statistical significance, estimating the between-studies variance tau-square, and using the I2 test to evaluate inconsistency. I2 values of 25%, 50% and 75% were assigned adjectives of low, moderate and high heterogeneity. A leave-one-out sensitivity analysis was performed by iteratively removing one study at a time to confirm that our results were not driven by any single. In addition, a sensitivity analysis by calculating SMD using the inverse-variance fixed-effects method was performed for all outcomes of interest. Publication biases were assessed by visual inspection with funnel plots. All analyses were performed with Review Manager, version 5.3 (Copenhagen: The Nordic Cochrane Centre, The Cochrane Collaboration, 2014). The guidelines summarized in the MOOSE statements were followed [14].

Results

Search results

Figure 1 displays the PRISMA study search and selection process. The electronic database search identified 470 studies. After screening, a total of 34 studies [15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,44,45,46,47,48] met the inclusion criteria. In particular, 5, 11, 6, 5, and 20 studies were included in the LAA ejection fraction (LAAEF), LAA volume (LAAV), LAA morphology, LAA orifice area (LAAOA) and LAA emptying velocity (LAAeV) analyses, respectively. Individual study characteristics are presented in Table1.

Fig. 1
figure 1

PRISMA flowchart

Table 1 Studies main characteristics

Nineteen studies included a mixed population of paroxysmal and persistent AF patients, seven studies included only paroxysmal AF patients, and the remaining eight studies included patients suffering from persistent AF. Regarding AF ablation modality, four studies reported cryoablation [24, 30, 36, 45], one study reported a mixed population of cryoablation and radiofrequency ablation [33], and the remaining studies reported only radiofrequency ablation. Of note all studies defined early recurrence as a recurrence of AF within three months post-ablation and late recurrence as the recurrence of AF episodes or other atrial tachyarrhythmias lasting more than 30 s after the three months blanking period. Furthermore, all studies report a follow-up period of at least 6 months post-AFCA and 24-h Holter monitoring at least at 3, 6 and 12 months post-AFCA. As far as antiarrhythmic drug (AAD) post-AFCA is concerned, twelve studies do not report any standard protocol, while most of the remaining studies permitted AAD administration post-AFCA, if patients were already taking them or if they presented with persistent AF subtype. All LAA indices were evaluated before the procedure. LAAV, LAAEF, LAAOA and LAA morphology were assessed via cardiac computed tomography, while LAAeV was evaluated by transesophageal echocardiography. All five LAA indices were calculated after averaging 3–5 cardiac cycles if patients were on sinus rhythm or after averaging 5–10 cardiac cycles if patients were on AF. Additional file 1: Tables S2 and S3 provides more details in ablation methodology, LAA imaging techniques, follow-up and antiarrhythmic medication protocols.

The study by Chang et al. [15] was included in LAAOA analysis, and the study by Tsao et al. [16] was included in LAAEF analysis, so as to avoid duplications. Similarly, the studies by Kim et al. [37] (2021; Int J Cardiovasc Imaging) and Kim et al. [39] (2021; JACC Clin Electrophysiol) were included in LAAV and LAAeV analyses, respectively, in order to avoid duplications. Finally, the study by Szegedi et al. [42] was included in LAA morphology analysis and data from the study by Simon et al. [41] were employed in LAAV, LAAeV and LAAOA analyses only.

As far as LAAeV analysis is concerned, Yoshida et al. [43] and Ma et al. [48] report data on PAF and Per-AF separately, while Kim et al. [39] reported data on early recurrence and early recurrence-free subgroups separately. Since these three studies did not report LAAeV for the whole study populations, we opted to make double entries for them.

Clinical data

Left atrial appendage ejection fraction

A total of five studies involving 537 patients undergoing AFCA compared pre-ablation LAAEF between arrhythmia recurrence and arrhythmia free counterparts. LAAEF was significantly lower in patients with AF recurrence post-ablation [SMD = − 0.66; 95% CI (− 1.01, − 0.32); I2 = 68%; p = 0.0002] as compared to arrhythmia free controls (Fig. 2).

Fig. 2
figure 2

Mean difference in LAA ejection fraction between arrhythmia recurrence and arrhythmia free groups

Left atrial appendage volume

A total of eleven studies involving 2835 patients undergoing AFCA compared pre-ablation LAAV between arrhythmia recurrence and arrhythmia free counterparts. LAAV was significantly higher in patients with AF recurrence post-ablation (SMD = 0.51; 95% CI 0.35–0.67; I2 = 64%; p < 0.00001) as compared to arrhythmia free controls (Fig. 3). We have also conducted a LAA volume subgroup analysis for pulmonary vein isolation (PVI) only and PVI plus additional lines subgroups and our findings remained robust (see Additional file 1: Fig. S1).

Fig. 3
figure 3

Mean difference in LAA volume between arrhythmia recurrence and arrhythmia free groups

Left atrial appendage morphology

A total of six studies involving 1480 patients undergoing AFCA compared pre-ablation chicken wing versus non-chicken wing LAA morphology between arrhythmia recurrence and arrhythmia free groups. LAA morphology was not predictive of AF recurrence post-ablation (chicken wing morphology; OR = 1.27; 95% CI 0.79–2.02; I2 = 59%; p = 0.32) (Fig. 4). A subgroup analysis for pulmonary PVI only and PVI plus additional lines subgroups did not reveal any arrhythmia recurrence difference between patients featuring chicken wing versus non-chicken wing LAA morphology (see Additional file 1: Fig. S2).

Fig. 4
figure 4

Chicken wing versus non-chicken wing LAA morphology between arrhythmia recurrence and arrhythmia free groups

Left atrial appendage orifice area

A total of five studies involving 1578 patients undergoing AFCA compared pre-ablation LAAOA between arrhythmia recurrence and arrhythmia free counterparts. LAAOA was significantly higher in patients with AF recurrence post-ablation (SMD = 0.35; 95% CI 0.20–0.49; I2 = 36%; p < 0.00001) as compared to arrhythmia free controls (Fig. 5).

Fig. 5
figure 5

Mean difference in LAA orifice area between arrhythmia recurrence and arrhythmia free groups

Left atrial appendage emptying velocity

A total of twenty studies involving 5995 patients undergoing AFCA compared pre-ablation LAAeV between arrhythmia recurrence and arrhythmia free counterparts. LAAeV was significantly reduced in patients with AF recurrence post-ablation [SMD = − 0.56; 95% CI − 0.73:− 0.40; I2 = 82%; p < 0.00001] as compared to arrhythmia free controls (Fig. 6). Two subgroup analyses were conducted and found that these observations remain robust for different ablation techniques [PVI only and PVI plus additional lines subgroups (see Additional file 1: Fig. S3)] and AF subtypes [paroxysmal and persistent AF subgroups (see Additional file 1: Fig. S4)].

Fig. 6
figure 6

Mean difference in LAA emptying velocity between arrhythmia recurrence and arrhythmia free groups

Sensitivity analyses

Compared to the main analysis, results remained unchanged after pooling the data using a fixed-effects model. Leave-one out sensitivity analysis by iteratively removing one study at a time did not drastically changed results concerning LAAeV, LAAV and LAAEF. As far as LAA morphology analysis is concerned, the study by Gong et al. [31] was found to be a significant source of statistical heterogeneity since removal of this study reduced heterogeneity to 0%, though LAA chicken wing morphology did not reach statistical significance (OR 1.04; 95% CI 0.78–1.39; I2 = 0%; p = 0.77). Lastly, when the study of Kocyigit et al. [24] was removed, heterogeneity in LAAOA analysis was also significantly reduced (SMD = 0.39; 95% CI 0.36–0.52; I2 = 9%; p < 0.00001) (Additional file 1: Table S4).

Risk of bias assessment

The quality assessment scores of the NOS are shown in Additional file 1: Table S5. Seventeen studies included in the featured analyses were of high quality, while the remaining seventeen were of moderate quality.

Assessment of publication bias

Funnel plot distributions of the featured analyses indicated absence of publication bias and small study effect for LAAV and LAAeV outcomes (see Additional file 1: Figs. S5 and S6). For outcomes with less than ten studies funnel plots were not assessed.

Discussion

In our meta-analysis, we evaluated 34 published studies involving over 8000 patients and found that increased LAA volume and LAA orifice area, as well as decreased LAA ejection fraction and LAA emptying velocity are associated with arrhythmia recurrence in a mixed population of AF subtypes (paroxysmal and persistent) and AFCA modalities (radiofrequency ablation and cryoablation). LAA morphology (chicken wing versus non-chicken wing) was not associated with AF recurrence.

From a clinical perspective, the magnitude of the absolute difference in above-mentioned indices cannot be gauged by standardized mean difference employed in our analyses [49], yet we believe that our findings support the anatomic-mechanistic involvement of LAA in the myopathic AF substrate. In a recently published study, Vaishnav et al. found that unfavorable LA anatomy (including pulmonary vein ostial diameter and antral circumference) had a sensitivity and specificity above 80% to predict long-term arrhythmia recurrence post-cryoablation [50]. Furthermore, in a prospective sub-study of CryoLAEF we have shown that LAA function is improved after pulmonary vein isolation (radiofrequency ablation or cryoablation) in patients with paroxysmal AF [10].

As previously stated, the majority of the included studies employed a mixed population of paroxysmal and persistent AF subjects undergoing wide antral circumferential ablation with further ablation lines and substrate modification in most of the cases (see Table 1 and Additional file 1: Table  S4). This heterogeneous population and the lack of separately reported data prevented us from conducting some sub-analysis concerning AF subtype and ablation strategy. Yet, we found that both LAAeV and LAAV associations with arrhythmia recurrence remain robust for different ablation techniques (PVI only and PVI plus additional lines subgroups) and AF subtypes (paroxysmal and persistent AF subgroups). Of note, some of the available studies in the field report prediction statistics and support the validity of our findings (see Table 2).

Table 2 Arrhythmia recurrence prediction statistics in each individual study

Specifically, Kim et al. [37] utilized LAA volume index data and found that LAA volume index ≥ 7 ml/m2 is associated with 66% increased risk of AF recurrence. Importantly, this association remained significant after adjustment for clinically relevant predictors of arrhythmia recurrence such age, LA volume index, diabetes mellitus, heart failure, body mass index and AF subtype [9, 51, 52]. Tian et al. have also found that increased LAA volume increases AF recurrence risk by 16%, and this finding is independent of LA volume and CHA2DS2-VASc score [26]. Similarly, Du et al. suggested that LAA volume increases arrhythmia recurrence risk by 34% after adjustment for LA volume, AF subtype and additional ablation lines [25].

As far as LAA ejection fraction is concerned, the studies by He et al. and Tian et al. underscore that increased LAA ejection fraction is associated with a reduction in AF recurrence ranging from 10 to 20% after adjustment for LA volume [23, 26]. Regarding LAA orifice area, both Du et al. and Simon et al. have adjusted their finding for LA volume and found that LAA orifice area is not an independent predictor of arrhythmia recurrence [25, 41].

Lastly, many investigators have examined LAA emptying velocity as an independent predictor of arrhythmia recurrence post-AFCA (see Table 2) and the vast majority of them found a significant association. Both Fukushima et al. and He et al. have adjusted for LA volume index and showed that reduced LAA emptying velocity is an independent predictor of AF recurrence [23, 46]. In addition, Fukushima et al. suggested that a LAA emptying velocity below 48.5 cm/sec is associated with a nearly threefold increase in AF recurrence risk [46]. On the contrary, Simon et al. have adjusted for LA volume index, among other variables, and found that LAA emptying velocity is not predictive of AF recurrence post-ablation in both paroxysmal and persistent AF subtypes [41].

In view of our findings it is evident that LAA is affected by the underlying AF related atrial cardiomyopathy, yet in a varying degree. While it is highly unlikely that future studies would prove that LAA indices can single-out ideal AFCA candidates, it should be kept in mind that LAA morphofunctional features could be an incremental and valuable addition in artificial intelligence algorithms and AF recurrence prediction scores.

Of note, machine learning models are proven to outperform existing arrhythmia recurrence scores such as APPLE and CHA2DS2-VASc and when combining clinical, electrocardiographic and electrophysiology data they achieve an area under the receiver operating characteristics curve (AUC) of 0.86 [53]. Interestingly, a recently published study by Zhou et al. suggested that a deep learning algorithm utilizing four variables (LA volume, LAA volume, N-terminal pro-BNP levels and AF subtype) has superior performance than conventional statistical analysis in AF recurrence prediction and an average C-index of 0.76 [54]. While methodologically different from our analysis, another novel artificial intelligence approach found that a deep learning algorithm analyzing post-ablation (within 24 h) LA indices (diameter, emptying fraction and strain rate) outperforms conventional logistic regression models in predicting late AF recurrence [55].

Since cardiac computed tomography (CCT) and transesophageal echocardiography are common and readily accessible pre-operative imaging modalities in AFCA candidates, automated LAA anatomic and functional indices estimations could be rapidly, easily and timely integrated in clinical decision making to facilitate better long term outcomes. For the time being, deep learning techniques can accurately estimate LA volume and geometry via CCT data and assist in arrhythmia recurrence prediction post-AFCA [56, 57]. Lastly, cardiac magnetic resonance imaging datasets have been utilized in a proof-of-concept study showing that deep learning can assist in patient specific ablation strategy selection [58].

Whether implementation of LAA datasets in artificial intelligence approaches could facilitate arrhythmia recurrence prediction, ablation strategy selection or even duration of antiarrhythmic drugs post-ablation remains to be proven. Importantly, artificial intelligence models for clinical decision making should be further tested since ethical, legal and social considerations remain both relevant and hotly debated [59,60,61,62].

Limitations

The present meta-analysis should be interpreted taking into account its limitations. First, this is a study-level meta-analysis providing average treatment effects. The lack of patient-level data prevented us from assessing the impact of baseline clinical and procedural characteristics on treatment effects. Yet, some crucial subgroup analyses (see Additional file 1: Figs. S1–S4) were conducted and showed that our findings remain robust for paroxysmal and persistent AF subgroups as well as different ablation approaches (PVI with or without additional ablation lines). Second, results of this study were grounded on a relatively small number of case–control studies of moderate quality. In particular, comparability between groups may be inadequate, since adjustment for confounding factors was not part of the initial design in the majority of the studies. Finally, moderate statistical heterogeneity was observed, which can imply methodology issues, such as different cardiac imaging and catheter ablation modalities and techniques, diverse population characteristics and post-ablation AAD protocols (see Additional file 1: Table S2 and S3). As previously stated, overall LAA indices calculation, follow-up and AAD administration were largely comparable in the included studies. In addition, considering that no strong evidence exist on post-AFCA AAD administration, follow-up protocol and additional ablation lesions beyond pulmonary vein isolation in AF subjects [2, 63], any heterogeneity of the included studies is indicative of the real-word practice.

Conclusions

LAA morphofunctional indices differ between patients suffering from arrhythmia recurrence post-ablation and arrhythmia free counterparts. Their clinical significance and their role in better risk stratification of catheter ablation AF candidates is an open research question that definitely merits further investigation.