1 Introduction

Cryoballoon ablation is one of the most common single-shot procedures for pulmonary vein (PV) isolation [1]. Although this technique is highly effective with a high rate of PV isolation after a single freeze [2], early reconnection during the procedure may occur. Previous research showed that the incidence of early reconnection increases in the first 30–60 min after acute PV isolation [3]. In this respect, we previously showed that incorporating a 30-min waiting period followed by infusion of adenosine to unmask dormant PV conduction (adenosine testing) increased the incidence of early reconnection after cryoballoon ablation and addressing this reconnection by additional ablation decreased atrial fibrillation (AF) recurrences [4]. Additionally, in a subsequent study, we found that the 1-year AF-free survival was similar in early reconnection/dormant conduction (ERC) patients receiving additional cryoballoon applications compared to non-ERC patients [5]. These findings signify the importance of recognizing and treating ERC.

A 30-min waiting period with subsequent adenosine administration is not without downsides. First, this adds a considerable amount of time to a routine procedure. Additionally, adenosine causes discomfort for the patient, such as chest discomfort, dyspnea, and flushing [6]. As such, predicting the absence of ERC immediately after ablation using a model based on procedural parameters instead of waiting to observe it or trying to unmask it with adenosine would be a considerable improvement.

To that end, we previously developed a prediction model for ERC which contained three independent variables [5]. From this model, a formula was constructed with a single cutoff that allowed us to predict the absence of ERC, thereby obviating the necessity for a 30-min waiting period with adenosine testing in many patients. The aim of the current study was to validate this previously developed prediction model in a new group of patients. Secondarily, we aimed to compare the 1-year AF-free survival between patients with and without ERC.

2 Methods

2.1 Study population

Consecutive patients that underwent their first cryoballoon ablation at the Leiden University Medical Center in 2018 and 2019 were retrospectively included. Baseline and procedural characteristics and follow-up data of these patients were collected using EPD-vision, the departmental cardiology information system. Biophysical parameters were obtained from cryo-applications by extracting ablation data from the ablation console using an R script (R version 4.0.2/Rstudio version 1.3.1093). All data were de-identified from personal details and anonymously stored in a Castor EDC database (https://www.castoredc.com/). Data were collected on a patient or PV level as appropriate. All patients signed an informed consent before the ablation procedure for anonymous retrospective data analysis. Due to the retrospective nature of the analysis, the ethical committee approved the study for exemption from review according to the Dutch Medical Research Involving Human Subjects Act.

2.2 Ablation procedure

For details regarding the ablation procedure, we refer to our previous study on the prediction of ERC [5]. In short, the second- and fourth-generation cryoballoons (Medtronic Arctic Front Advance and Advance Pro, Min, USA) were used for the ablation procedure. If no isolation was achieved within 90 s, the balloon was repositioned to improve occlusion. If necessary, this procedure was repeated until isolation was achieved. Ablation time was set at our standard approach of time to isolation (TTI) + 150 s, with a maximum of 180 s in the right superior PV (RSPV), as we previously showed this to be the optimal ablation strategy [2]. The TTI was measured as the time (in seconds) between the start of an application and the moment the PV potentials disappeared or became dissociated. Following isolation, the vein was re-examined after 30 min and, if still isolated, tested with adenosine. An increasing dose of adenosine starting from 18 mg up to a maximum of 30 mg was administered intravenously until more than 1 sinus beat with blocked atrioventricular conduction was observed. ERC was defined as PV reconnection 30 min after isolation, with or without adenosine. In the case of ERC, a maximum of 2–3 additional applications were performed to abolish the ERC.

2.3 Biophysical data collection

The freeze area under the curve (AUC) for each cryo-application was determined, where the x-axis was time (s) and the y-axis temperature (°C), for the section of the curve where the temperature was below 0 °C. The freeze magnitude for each cryo-application was determined by dividing the freeze AUC by the time (s). Finally, for the variable “warming time to 20 °C,” an actual temperature of 20 °C was often not reached before the application data ended in the console due to automatic balloon deflation. For those applications, the collected warming time was the warming time to balloon deflation.

2.4 Follow-up

Patients were followed up on AF-free survival for at least 12 months, usually at approximately 3, 6, and 12 months after the ablation procedure. At these moments, a 24-h Holter and a 12-lead ECG were performed. Immediately after the ablation procedure, previously used antiarrhythmic drugs (AAD) were restarted. Subsequently, if no further atrial tachycardia or AF recurrence were registered, AADs were generally stopped at the first follow-up visit, unless decided otherwise by the treating physician or preferred by the patient. In addition, all patients were advised to visit the outpatient clinic when experiencing palpitations and to document the arrhythmia using 12-lead ECGs or a 24-Holter recording. Ablation success was defined as the absence of AF/atrial tachycardia on a 12-lead ECG or the absence of an episode lasting more than 30 s on a Holter/device recording, after a blanking period of 90 days. For the survival analysis, the date of the first AF/atrial tachycardia recording or the date of first typical symptoms, later confirmed by AF/atrial tachycardia recording, was selected as the recurrence date, whichever occurred first.

2.5 Statistical analysis

All statistical tests were performed in IBM SPSS Statistics 25/29. Test results were considered statistically significant at P < 0.05.

2.5.1 Descriptive variables

Baseline characteristics, procedural characteristics, and biophysical parameters were compared between the ERC group and the non-ERC group using the unpaired sample T-test or Mann–Whitney U test for the continuous variables, depending on the normality of the data. Categorical variables were compared using the chi-squared test or Fisher’s exact test when applicable. Baseline characteristics were analyzed on the patient level, procedural characteristics on the patient or vein level as appropriate, and biophysical parameters on a vein level.

2.5.2 Validation of the previously developed prediction model

The previously established prediction model [5], which operates on the vein level, was evaluated by applying the previously reported formula (0.02 * TTI + 0.5 * number of unsuccessful applications + 0.2 * nadir balloon temperature) on the newly collected data. Subsequently, a receiver operating characteristics (ROC) curve was created, and the area under the ROC curve was calculated as a measure of the accuracy of the formula. Next, the sensitivity, specificity, likelihood ratios, number of veins that would be predicted to have ERC, number of ERC veins that would be missed, and predictive values of the previously established optimal cutoff value were determined. The number of veins that would be correctly predicted to have ERC was determined as follows: sensitivity * number of ERC veins. Likewise, the number of veins that would incorrectly be predicted to have ERC was determined as follows: (1-specificity) * number of non-ERC veins. The number of ERC veins that would be missed was determined by multiplying “1-sensitivity” with the number of ERC veins.

Next, a new cutoff value was determined to obtain similar performance in data from both the previous and present studies. This value was determined by visually comparing the coordinates of the ROC curve from the previous and present studies, searching for a cutoff that would result in a good performance in both datasets (Fig. 1). With this new cutoff, the sensitivity, specificity, likelihood ratios, number of veins that would be predicted to have ERC, number of ERC veins that would be missed, and predictive values were again determined for the data from both the previous and the present studies. Subsequently, the model with the new cutoff was applied to individual patients in the present study population, illustrating how many patients would not have required testing if the model had been available during the ablation procedures.

Fig. 1
figure 1

Receiver operating characteristics curves. Validation and previously constructed ROC curve. Shown here is the validation ROC curve overlayed on the previously constructed ROC curve. The validation ROC curve was also constructed using the previously developed prediction model (0.02 * TTI + 0.5 * number of unsuccessful applications + 0.2 * nadir balloon temperature). The area under the curve of the validation ROC curve is 0.773 (95% CI [0.690, 0.856]), which was 0.752 for the previous curve. An upright shift of the validation curve compared to the previous curve is visible

Moreover, an easy-to-use table was created to aid in deciding whether or not waiting and testing for ERC with adenosine is necessary during an ablation procedure. This table was created based on the new cutoff and by using different values for the parameters of the prediction model.

2.5.3 Survival analysis

Finally, 1-year Kaplan–Meier curves were constructed for the AF-free, AAD-free survival of ERC and non-ERC patients. After a blanking period of 90 days, patients entered the curve. AF-free days were not counted for the survival time if a patient was still (or again) on an AAD. A recurrence event while using an AAD resulted in censoring at the last follow-up date without AAD (events on AAD could not be counted as events without AAD but were still events, therefore leading to censoring). Patients who only had AF-free days with an AAD were not included in the Kaplan–Meier curves. The 1-year AF-free survival of ERC and non-ERC patients was compared using the log-rank test.

3 Results

3.1 Descriptives

3.1.1 Baseline characteristics

In 2018–2019, 223 patients underwent a cryoballoon AF ablation. Of these patients, 21 were excluded as cryoconsole files were unavailable and one patient because of unclear cryoconsole data, leading to a total of 201 patients for analysis. Median age was 61 years (IQR 53–69), 139 were male (69%), and 172 (86%) had paroxysmal AF. None of the collected baseline characteristics was significantly different between the ERC group and the non-ERC group (Table 1). Although no statistics were performed on the patients who were excluded from survival analysis due to continued AAD usage, the proportion of females seemed to be higher in this group.

Table 1 Baseline characteristics 

3.1.2 Procedural characteristics

From the 201 patients, a total of 774 veins were analyzed. Four veins were excluded because of unclear data, and four veins were excluded because they did not achieve isolation using the cryoballoon. ERC occurred in 35 of the 201 patients (17.41%) and in 41 of the 774 veins (5.30%) (Table 2). Although one successful application (in case of no ERC) per PV was the aim, bonus applications were often performed at the discretion of the operator. The rate of bonus applications did not differ between the groups.

Table 2 Procedural characteristics

On a patient level, the procedure time was significantly longer in the ERC group than in the non-ERC group (180 IQR 168209 min vs. 160 IQR 142–185 min, P < 0.001). This was also the case for total ablation time (22 IQR 18–25 min vs. 15 IQR 13–18 min, P < 0.001), as well as for the total number of applications (7 IQR 5–8 vs. 5 IQR 4–6, P < 0.001). In contrast, significantly more patients from the non-ERC group had an electrical cardioversion during the ablation procedure than patients from the ERC group (33%, 54/166 vs. 11%, 4/35; P = 0.012).

On a vein level, TTI was significantly longer in the ERC group (50 IQR 38–75 s vs. 40 IQR 27–60 s, P = 0.003). This was also the case for the application time (200 IQR 180–240 s vs. 190 IQR 180–220 s, P = 0.024), as well as for the number of unsuccessful applications per vein (0 IQR 0–1 vs. 0 IQR 0–0, P = 0.023). The same was the case for the total number of applications per vein (2 IQR 2–4 vs. 1 IQR 1–2, P < 0.001). Finally, significantly more veins had a TTI of more than 90 s in the ERC group (16%, 6/37 vs. 5%, 31/613; P = 0.014).

3.1.3 Biophysical parameters

Freeze temperature at 30 and 60 s was significantly lower in the non-ERC group (respectively − 31 IQR (− 34)–(− 28) °C vs. − 28 IQR (− 32)–(− 26) °C and − 39 IQR (− 42)–(− 36) °C vs. − 34.5 IQR (− 38.0)–(− 32.0) °C; both P < 0.001) (Table 3). Nadir balloon temperature was also significantly lower in the non-ERC group (− 47 IQR (− 51)–(− 43) °C vs. − 41 IQR (− 45)–(− 38) °C, P < 0.001). Consistently, the freeze AUC and freeze magnitude were significantly larger in the non-ERC group (P < 0.001). Finally, the warming times to 0 °C, 15 °C, and 20 °C were significantly longer in the non-ERC group (P < 0.001 for all).

Table 3 Biophysical parameters

3.1.4 Early reconnection/dormant conduction

From the 201 patients, 18 showed reconnection after 30 min waiting but before adenosine (8.96%), and 19 had dormant conduction with adenosine in one or more veins (9%). From the 774 veins, 19 had reconnection before adenosine (2%), and 23 had dormant conduction with adenosine (3%). In one patient, a LSPV received an additional application for early reconnection, after which testing with adenosine also resulted in dormant conduction.

3.2 Validation of the previously developed prediction model

Using the previously established prediction model [5], an ROC curve was created (Fig. 1). A total of 650 veins had data for all three variables in the model and were thus included in the ROC curve, 37 of which had ERC. The area under the ROC curve was 0.773 (95% CI [0.690, 0.856]), compared to 0.75 in the derivation cohort, signifying that the model performed equally well in the validation cohort compared to the derivation cohort. However, the previously established cutoff value of − 6.7 (meaning that an outcome equal to, or above that value indicates ERC), resulted in a sensitivity of 37.84% and a specificity of 89.07% to predict ERC. The positive likelihood ratio of the model was 3.462, and the negative likelihood ratio was 0.698. The number of veins that would be correctly predicted to have ERC was 14 of the 37 ERC veins with data for all three model variables. The number of veins that would incorrectly be predicted to have ERC was 67 out of the 613 non-ERC veins with data for all three model variables. Furthermore, the number of ERC veins (with data for all three model variables) that would be missed was 23 out of the 37 ERC veins. Finally, with an ERC prevalence of 5.30% in the PVs, the positive predictive value (PPV) was 16.22%, and the negative predictive value (NPV) was 96.24%.

Because of the low sensitivity of the − 6.7 cutoff for the validation cohort presented in the current study, a new cutoff value was determined to obtain similar performances in both the derivation [5] and validation cohorts. The maneuver to change an ROC cutoff to an optimal value for both derivation and validation cohorts has been described before [7]. At a cutoff value of − 7.2 (where equal to or above − 7.2 indicates ERC), the model provided a sensitivity of 72.50% and a specificity of 78.22% in the ROC curve from the previous study (ROC curve included 421 veins, 40 with ERC and 381 without ERC). This resulted in a positive likelihood ratio of 3.328 and a negative likelihood ratio of 0.352. The number of veins that would be correctly predicted to have ERC was 29 of the 40 ERC veins with data for all three model variables. The number of veins that would be incorrectly predicted to have ERC was 83 of the 381 non-ERC veins with data for all three model variables. Furthermore, the number of ERC veins (with data for all three model variables) that would be missed was 11 out of the 40 ERC veins. Lastly, the PPV was 25.26% and the NPV 96.55%. In data from the present study, a cutoff value of − 7.2 resulted in a sensitivity of 72.97% and a specificity of 78.63%. This resulted in a positive likelihood ratio of 3.415 and a negative likelihood ratio of 0.344. The number of veins that would be correctly predicted to have ERC was 27 of the 37 ERC veins with data for all three model variables. The number of veins that would be incorrectly predicted to have ERC was 131 of the 613 ERC veins with data for all three model variables. Furthermore, the number of ERC veins (with data for all three model variables) that would be missed was 10 out of the 37 ERC veins. Lastly, the PPV was 16.04% and the NPV was 98.11%. Therefore, at this cutoff of − 7.2, the model performed similarly in data from both the previous and present study.

Subsequently, the model was applied to individual patients of the present study population. A total of 101 patients had all the necessary data available for all four veins. Out of these 101 patients, 43 received a score below − 7.2 for all four veins, thus not requiring a 30-min waiting period and adenosine testing. Out of these 43 patients, 2 patients with ERC would have been missed.

Finally, Table 4 was created based on the new cutoff to aid in deciding whether the 30-min waiting period with adenosine testing will be necessary.

Table 4 Decision-making tool based on new cutoff to aid in deciding whether or not ERC testing is necessary

3.3 Survival analysis

Kaplan–Meier curves were constructed to visualize the AF-free survival without AADs in ERC and non-ERC patients (Fig. 2). From the 201 patients, 158 patients entered the curve. From the other 43 patients, 13 either had no/unclear follow-up data, 7 had no follow-up data without AAD (were using AAD after a blanking period and had no event/were censored), and 23 had a recurrence on AAD outside the blanking period before stopping the AAD. From the 158 patients that entered the curve, 35 (22.15%) had a recurrence event and 31 (19.62%) were censored before reaching 365 days. No significant difference was found in the AF-free survival between the two groups (P = 0.629).

Fig. 2
figure 2

Kaplan–Meier curves of the one-year AF-free survival (without AADs) in the ERC and non-ERC group. 24 ERC patients and 134 non-ERC patients entered the curve after a 90-day blanking period. No significant difference in AF-free survival was found between the ERC and non-ERC group (P = 0.629). AF, atrial fibrillation; ERC, early reconnection/dormant conduction; AAD, anti-arrhythmic drug

4 Discussion

4.1 Main findings

The main aim of the present study was to validate the previously established ERC prediction model [5], by applying the model to data from a new group of patients. Even though the model showed a similar specificity, the sensitivity decreased substantially. However, shifting the cutoff value to a suitable value for both datasets led to a significant improvement, making the model applicable to both datasets. Secondly, no difference was found in the 1-year AF-free survival between ERC and non-ERC patients, which may signify the importance of treating early reconnection/dormant conduction with additional ablation.

4.2 Variables in the prediction model

The previously developed prediction model contains three predictors [5]. One of these predictors is the TTI, which has more often been found to predict durable isolation [2, 8,9,10]. A longer TTI may be caused by incomplete occlusion of the pulmonary vein, a relation previously demonstrated by Aryana et al. [9]. An incomplete occlusion has been demonstrated to lead to more ERC and late reconnections [10,11,12].

The relationship between incomplete occlusion and ERC may also explain why the number of unsuccessful applications is a predictor, as we previously pointed out that unsuccessful applications may be caused by insufficient occlusion [5]. Edema occurrence after the initial application could also be an explanation, making subsequent applications less effective [13].

The final predictor is the nadir balloon temperature [5]. At different time points during the freeze, balloon temperature has often been shown to hold predictive value for ERC and late reconnection [2, 8,9,10,11,12, 14]. As we and others previously explained [5, 15, 16], a possible mechanism is that a better occlusion reduces warm blood leakage from the vein around the balloon, allowing lower temperatures to be reached.

4.3 Validation of the previously developed prediction model

The area under the ROC curve (0.773) was similar to the previously found AUC of 0.752 [5]. Although the specificity found in the present study (89%) was similar, or even indicated a minor improvement over the previous specificity (86%), the sensitivity reduced substantially to 38%, compared to the 70% sensitivity from the previous study. These results indicate a similar ability of the model to identify the absence of ERC, but a decreased ability to identify the presence of ERC, implying a large reduction in the usability of the model.

With a shift in the optimal cutoff value from − 6.7 to − 7.2, the model performed equally well in both populations with reasonable sensitivity and specificity [5]. Moreover, the low ERC prevalence in PVs in data from both the previous and present study (9.22% and 5.30% respectively) resulted in very high NPVs (96.55% and 98.11% respectively). The difference in the ERC prevalence between the previous and present study could itself also be a reason for the difference in model performance. Note that the predictive values are dependent on ERC prevalence [17]. Such low occurrences of ERC in PVs have also been found in previous studies [14, 18]. The resulting high NPVs we found in our previous and present data are essential, as this indicates that a very high percentage of negative predictions will be correct, allowing 30 min to be saved in many procedures without too much risk of missing ERC. Taking the model into practice in our present study population demonstrated how well the model performs. Out of 101 patients that had all necessary data, 43 would not have required testing. Out of these 43, only 2 would have been false negative.

4.4 Clinical implications

Confirming the absence of ERC is an important part of the current ablation procedure, as in the presence of ERC additional cryoballoon applications can reduce AF recurrences and improve AF-free survival [4, 5, 10, 19, 20]. In the present study, no difference was found between the 1-year AF-free survival of ERC and non-ERC patients, once again demonstrating the effectiveness of the additional cryoballoon applications that were performed on the ERC patients. Our prediction model allows us to predict the absence of ERC, thereby obviating a 30-min waiting period with subsequent adenosine testing in approximately 40% of patients. Therefore, applying the model during the ablation procedures would have saved 21.5 h of lab time for the whole team and avoided the discomfort of four adenosine administrations in 43 patients in this study.

Although we employ adenosine testing as the standard of care during a PVI procedure, there is conflicting evidence on the utility of this maneuver. Macle et al. [20] found that patients with dormant conduction randomized to subsequent additional ablation had significantly less recurrences with an absolute risk reduction of 27% compared to patients with dormant conduction not receiving additional applications. In contrast, Kobori et al. [21] did not find a reduction of recurrences during 1-year follow-up in patients randomly assigned to either adenosine-guided radiofrequency catheter ablation with additional ablation or radiofrequency catheter ablation without subsequent adenosine testing. Taken all data together, McLellan et al. [22] found in their meta-analysis on the role of adenosine usage after AF ablation that the induction of reconnection by adenosine reduced freedom from AF, particularly when no additional ablation was performed. They found no difference when comparing outcomes in studies of routine adenosine challenge vs no adenosine. However, they found a non-significant trend to an increase in freedom from AF in patients receiving adenosine challenge in cryoballoon ablation. Most likely, therefore, adenosine provoked reconnection decreases AF freedom, but it is less clear whether additional ablation subsequently increases this freedom. We speculate that this may be due to edema or low catheter stability in the reconnection locations. With cryoballoon, catheter stability is less of an issue than in RF ablation, and lesions are generally deeper [23], possibly overcoming edema. Therefore, we believe that adenosine testing may be beneficial in cryoballoon ablation, unless proven unnecessary by the current model.

Utilizing the previously established and now validated prediction model at a common cutoff of − 7.2 will save time and do so without missing many false negative cases. As such, its use is recommended in centers which routinely use adenosine testing following PVI.

4.5 Limitations

The present study was a single-center study. The small size of the study populations (especially the ERC groups) of the previous and present ERC prediction study was likely partially responsible for the difference in sensitivity. Moreover, the statistics used to build the prediction model did not take into account that PVs from a single patient are not completely independent from each other. Additionally, data from the present study included 23 patients and 92 veins that were not tested for dormant conduction. Excluding these 92 veins (73 of which were used for the validation ROC curve) did not cause a relevant difference in sensitivity and specificity in the validation ROC curve at a cutoff of − 7.2 (72.22% and 79.48% instead of 72.97% and 78.63%). To make the ERC prediction model work equally well both in the derivation and the validation cohort, we had to shift the ROC cutoff to a new value for both cohorts. This is not a unique maneuver however [7], and areas-under-the-curve (AUC) were comparable between the derivation and validation cohorts, confirming the validation of the prediction model. Bonus applications may have influenced the absence or presence of ERC. The rate of bonus applications was not different between the ERC and no-ERC groups, however. Follow-up was performed with 24-h Holter recordings and 12-lead ECGs in case of symptoms. Asymptomatic episodes are therefore unlikely to have been recorded. We therefore cannot exclude that asymptomatic episodes differed between the groups. However, as 1-year rhythm outcome was not the primary outcome in this study, we believe that our primary analysis is still valid with these shortcomings. Demographic information on the patient population is not routinely collected in our departmental cardiology information system and was therefore not available for this study. Most of our patients are of Caucasian European descent. Therefore, the results of this study should be interpreted with caution for patients with other backgrounds.

4.6 Conclusion

In the present study, the previously established ERC prediction model was validated. At a cutoff of − 7.2, the previously established ERC prediction model performed well in the study populations from both the previous and the present study. Utilizing this model will allow shorter procedure times in selected patients in centers which routinely use adenosine testing following PVI.