Introduction

Exercise stress echocardiography (ESE) is a well-validated diagnostic method for detecting coronary artery disease (CAD) [1]. However, substantial expertise and experience are required to achieve optimal diagnostic accuracy [2]. Furthermore, a significant proportion of false-positive ESEs may occur, resulting in unnecessary normal coronary angiograms [3, 4]. The development of practical quantitative analysis techniques to increase the diagnostic yield of ESE has been a primary research focus [1]. Two-dimensional speckle-tracking echocardiography (STE), a quantitative measure of myocardial deformation, has been a recently developed method under investigation for the detection of CAD [5].

STE has not reliably demonstrated an increased accuracy compared to visual assessment for the detection of CAD with dobutamine stress echocardiography (DSE) [1, 6,7,8]. While researchers have explored the combination of STE with exercise stress echocardiography (ESE) with bicycle protocols, few have explored the combination with a treadmill protocol [9,10,11]. Furthermore, several studies that reported promising results of STE as a potential diagnostic technique have used unmatched controls [12,13,14,15]. As age, sex, diabetes, hypertension, smoking, and metabolic syndrome influence STE values, these studies may have been confounded by a higher burden of comorbidities in patients with CAD [16,17,18,19,20]. In addition, numerous studies have excluded patients with known CAD and baseline wall motion abnormalities (WMA), reducing the findings' potential generalizability [15, 21].

The majority of STE studies in DSE have used the global longitudinal strain (GLS) as a potential marker of ischemia [1]. However, the apical segments may be of greater interest because they typically display ischemic changes regardless of the location of the coronary stenosis [1]. Furthermore, a previous study suggested larger rest and stress variability in STE values in apical segments than mid-ventricular and basal segments, possibly increasing discrimination for significant coronary stenoses [22].

There is a need for studies that validate STE use in treadmill ESE. We aimed to conduct a pilot study to foremost evaluate the feasibility and interobserver reliability for STE in treadmill ESE. Secondly, as a hypothesis-generating analysis, we explored the incremental value of STE of the apical segments combined with the visual analysis of treadmill ESE compared to treadmill ESE alone to reclassify false-positive treadmill ESE.

Material and methods

Population and study design

We conducted a case–control study between March 2018 and March 2020. We included all consecutive patients with an abnormal ESE between March 2018 and March 2020 who subsequently underwent a coronary angiogram within a 6-month interval for the investigation of obstructive CAD at the Sherbrooke University Hospital Center (CHUS) in Quebec, Canada.

We included patients who had reached at least 85% of their maximum predicted heart rate with a normal ESE for the control group. Normal ESE controls were only selected from March 2018 to August 2018 to allow for a one-year ascertainment period. Therefore, patients with normal ESE from August 2018 to March 2020 were excluded. We ascertained new significant coronary artery stenoses by non-invasive testing (stress EKG, exercise stress echocardiogram, dobutamine stress echocardiogram or nuclear myocardial perfusion imaging) and coronary angiogram after the normal ESE using the hospital's electronic health records. Deaths were recorded as a competing risk. Controls who were diagnosed with a new coronary stenosis within one year despite a normal ESE were considered false negatives.

We only used the first ESE per patient for both groups and excluded all patients who underwent DSE, those who required contrast agent for ESE, and those with prior coronary artery bypass grafting (CABG). Of note, all patients with previously diagnosed CAD required a diagnosis by coronary angiogram and required subsequent complete percutaneous revascularization (all residual stenoses ≤ 50%) to allow for ascertainment of new significant coronary stenoses during follow-up. The study was approved by the Sherbrooke University Hospital Research Center (CRCHUS) ethics committee, and individual patient consent was not required.

Treadmill exercise stress echocardiography

We performed ESE as per the recommendations of the American Heart Association (AHA) and the American Society of Echocardiography (ASE) guidelines [1, 23]. Images were obtained by five experienced sonographers using the GE Vivid E95 and M5Sc transducer (GE Vingmed Ultrasound AS, Horten, Norway) at baseline and within 90 s post-exercise with patients in the left lateral decubitus position. We recorded standard 2D grayscale images for the assessment of regional wall thickening in the apical four-chamber, apical two-chamber, apical long-axis, parasternal long- and short-axis views. We stored cine images of three representative cardiac cycles of the three apical views at baseline and post-exercise on an external hard drive for future analysis. The mean frame rate for all ESE was 48.7 ± 9 frames/sec.

Wall motion was evaluated at the time of examination by one of the six experienced echocardiographers in our laboratory. As the wall motion analysis was performed before angiography and STE, the echocardiographers were blinded to all other imaging results. Visual analysis was done at baseline and stress using all views and the 16-segment model as per guideline recommendations.1 We considered any stress-induced WMA as an abnormal result. In patients with baseline WMA, akinesis that became dyskinesis was not considered as an abnormal result as this generally corresponds to the mechanical response of an infarcted segment rather than ischemia [1]. Furthermore, patients with baseline WMA were considered as having a normal ESE if no additional WMA were observed during ESE.

Two-dimensional speckle-tracking strain analysis

Two operators (MAD, GF), blinded to all other imaging results, performed 2D speckle-tracking on all eligible ESE using the automated function imaging (AFI) stress software version 201 (GE Vingmed Ultrasound AS, Horten, Norway) directly on the GE Vivid E95. The first operator (MAD) was a first-year general cardiology fellow who underwent formal echocardiography training and completed a ten-patient STE training set under the supervision of a senior echocardiographer (BE). The second operator (GF) was a final-year general cardiology fellow with level II certification. We manually traced the endocardial borders at the end-systolic frame in the three apical views at baseline and peak stress while optimizing the region of interest to better capture the myocardium. We defined the timing of aortic valve closure using the apical long-axis view. AFI software was then used to track the myocardial deformation and reject the poorly captured segments. We readjusted the endocardial tracing in the presence of poor tracking. We excluded all patients whose ESE had more than two rejected segments overall or any rejected apical segments at rest or stress. We also excluded all patients whose largest heart rate difference between two of the three apical images exceeded 30%, as this precluded the software from calculating the bull's-eye. Notably, the AFI software was unable to calculate the STE in the presence of a contrast agent.

The software automatically calculated a 17-segment bull's-eye map and GLS. We defined segments 1, 2, 7, 8, and 13–17 as corresponding to the left anterior descending (LAD) artery, and segments 3, 4, 5, 6, and 9–12 as corresponding to the non-LAD territory using the standard template of coronary anatomy (Fig. 1) [1, 5]. Using the apical segments exclusively, we also defined segments 13–17 for the overall examination; segment 17 as corresponding to the LAD; and segments 15–16 as corresponding to the non-LAD territory. As we included patients with baseline WMA, we also calculated the delta strain values for standardization by subtracting the baseline values from the stress values. Stenosis of the left main coronary artery was considered to involve both the LAD and the circumflex coronaries.

Fig. 1
figure 1

Schematic diagram of the 17 left ventricular segment model used in our study. LAD: Left anterior descending artery; CX: Circumflex artery; RCA: Right coronary artery

Coronary angiogram

An experienced interventional cardiologist interpreted the images from the diagnostic coronary angiogram. We defined significant CAD as ≥ 70% luminal diameter stenosis in at least one of the coronary arteries or as ≥ 50% in the left main coronary artery measured in the worst-view angiographic projection [24].

Statistical analysis

To control for imbalances in baseline patient characteristics, we calculated propensity scores as the predicted probability of having an abnormal ESE using a multivariable logistic regression model. The covariates were chosen using a priori subject matter knowledge and included age, sex, hypercholesterolemia, hypertension, diabetes mellitus, current smoking status, and prior diagnosis of CAD. We matched 1:1 all patients with abnormal ESE to a normal ESE control using the optimal matching algorithm [25]. We assessed the covariate balance between the abnormal and normal ESE groups before and after propensity score matching using standardized differences [26].

We compared the clinical and ESE characteristics. Continuous data with normal distribution were presented as means with standard deviations and analyzed using the Student's t-test. Non-normal data were presented as medians with interquartile ranges and compared using the Wilcoxon rank-sum test. We performed chi-square tests for dichotomous variables unless the expected number of frequencies was less than 5, in which case we used a Fisher's exact test. We used a two-sided significance level of 0.05.

Interobserver variability in GLS and apical strain was determined by calculating the Pearson correlation coefficient, bias, limit of agreement, mean relative difference, and mean intra-class correlation coefficient. We calculated the mean intra-class correlation (ICC) using a two-way mixed-effect model measuring absolute agreement based on the mean rating [27].

We created receiver operating characteristic (ROC) curves to compare the diagnostic performance of STE and visual assessment. We used the mean between both operators for all strain measurements. Optimal cut-offs for sensitivity and specificity were determined using Youden's J statistic. For patients with abnormal ESE, the coronary angiogram was used as the gold standard. Patients with normal ESE were considered positive if they had a newly diagnosed significant coronary stenosis in the one-year ascertainment period, and negative otherwise.

To determine the incremental value of STE to reclassify the false-positive ESE, we considered a visually normal ESE to be negative. As a second step, we then used STE to further classify the abnormal ESEs as positive or not for significant CAD using the coronary angiograms as the gold standard. When combined with visual assessment, the optimal cut-offs of STE were only calculated for abnormal ESEs. We performed statistical testing to compare the areas under the curve (AUCs) based on the DeLong method and computed 95% confidence intervals [28]. We used a two-sided significance level of 0.05 and did not adjust for multiplicity. We conducted statistical analyses using SPSS version 26 (IBM Corporation, Armonk, NY, USA) and R version 4.0.2 (R Core Team, Vienna, Austria).

Results

We excluded 61 patients from the STE analysis from 201 (30.3%) eligible ESE due to technical issues; 47 (23.4%) because of suboptimal image quality and 14 (7.0%) because the heart rate was too variable between apical views precluding the software from calculating the bull's-eye (Fig. 2). Of note, STE analysis was possible for all patients at rest. Exclusions for suboptimal image quality was based on imaging at peak stress. After propensity score matching, there were 51 patients left in each group. Small differences (standardized differences ≥ 0.1) remained in terms of age, hypercholesterolemia, hypertension, and diabetes, but none were statistically significant (Table S1). The mean age of patients with abnormal ESE and normal ESE were 66.4 (39.2% female) and 65.1 years (35.3% female), respectively (Table 1).

Fig. 2
figure 2

Study population flowchart. CABG: coronary artery bypass graft; DSE: dobutamine stress echocardiography; ESE: exercise stress echocardiography

Table 1 Clinical characteristics and rest echocardiographic parameters in the propensity score matched patients

While both groups had similar baseline left ventricular ejection fractions and achieved comparable systolic blood pressures and target heart rates, the patients with positive ESE had more WMA at rest, attained lower metabolic equivalent (METS) levels and double product levels (Table 2). In the abnormal ESE group, 33 patients (64.7%) had significant CAD on angiography (Table 3). In the 1-year follow-up of the normal ESE group, 2 patients (3.9%) required revascularization. One patient had persisting unequivocal symptoms and underwent a positive nuclear myocardial perfusion imaging study with subsequent percutaneous revascularization, while the other had typical angina and underwent percutaneous revascularization directly. There were no deaths in the propensity score matched normal ESE group.

Table 2 Treadmill ESE parameters at peak stress
Table 3 Coronary angiogram characteristics

The interobserver reliability of STE is presented in Table 4. Overall, we demonstrated good-to-excellent interobserver reliability for the GLS as well as for the STE for the apical segments as demonstrated by the intra-class correlation coefficient. Bland–Altman plots are shown in the Supplementary material (Figures S1-S4).

Table 4 Interobserver variability for strain imaging

For the overall examination, the utilization of GLS alone at stress (AUC = 0.69 vs. 0.84, p = 0.006) and the delta GLS (AUC = 0.69 vs. 0.84, p = 0.01) resulted in inferior discrimination compared to visual assessment alone (Table 5). When considering normal ESE as negative and using STE to reclassify abnormal ESE, this combination was superior to visual assessment alone for the overall examination (AUC = 0.89 vs. 0.84, p = 0.025), for the non-LAD territory (AUC = 0.83 vs. 0.70, p = 0.006), but not for the LAD territory (AUC = 0.79 vs. 0.73, p = 0.11) (Figs. 3, 4 and 5). The sensitivity, specificity, negative predictive, and positive predictive values are shown in Table 5. Finally, we compared the strain characteristics of patients with significant CAD to those without significant CAD to demonstrate the increased variability and tendency toward larger differences between stress and rest values at the apex (Table S2).

Table 5 Diagnostic accuracy by visual and strain imaging for detecting significant coronary stenoses
Fig. 3
figure 3

Receiver operator characteristic curve for the detection of significant coronary artery disease for the overall examination. *Denotes p < 0.05 compared to visual assessment

Fig. 4
figure 4

Receiver operator characteristic curve for the detection of significant coronary artery disease in the left anterior descending artery territory. *Denotes p < 0.05 compared to visual assessment

Fig. 5
figure 5

Receiver operator characteristic for the detection of significant coronary artery disease in the non-left anterior descending artery territory. *Denotes p < 0.05 compared to visual assessment

Discussion

Our findings demonstrate that STE post-treadmill ESE is relatively feasible in patients who do not require a contrast agent, has good-to-excellent interobserver reliability between non-expert operators, and may provide reclassification value for patients with false-positive ESE. Furthermore, we showed that utilizing the STE difference between the apical segments at peak stress and baseline, in addition to visual assessment, maximized the diagnostic yield.

The sensitivity, specificity, and accuracy values of 94.3%, 73.1%, and 80.4% for the overall examination for visual assessment alone in our study are comparable to published estimates [1]. The high negative predictive value (96.1%) supports the excellent prognostic value for symptomatic obstructive CAD of a negative ESE [29]. However, the visual assessment sensitivity must be interpreted with caution because of the test verification bias and because the study sample was created by propensity score matching.

The combination of visual assessment and STE did lead to a slightly increased discrimination capacity. Compared to visual assessment alone, the combination of visual assessment and STE seemed to improve the accuracy of the detection of non-LAD stenoses in a clinically meaningful manner (77.5% vs. 83.3%). Of note, 30.3% of eligible patients who did not require a contrast agent were excluded, a higher proportion than typically displayed in DSE studies [1]. This significant exclusion of patients displayed the difficulty in obtaining high-quality images with little heart rate variability suitable for strain analysis in the ninety-second post-exercise window. However, our non-expert interobserver reliability was comparable to previous data of expert operators reflecting the intuitiveness of STE [30].

Our study is concordant with previous DSE studies, demonstrating that various strain measures may provide an incremental diagnostic yield to visual assessment [7, 31]. However, this has not been shown consistently by other authors [30]. Previous studies with promising results have utilized GLS [1, 32]. In our study, the use of peak stress GLS or the use of the difference between peak stress GLS and baseline GLS resulted in inferior discrimination than visual assessment for the overall examination. We hypothesize that our adjustment for confounding by selecting non-healthy controls may have attenuated the value of GLS for detecting significant CAD. Furthermore, a previous study that combined treadmill ESE and strain imaging did not utilize peak stress images and did not demonstrate conclusive results [10].

Our findings are in agreement with prior research that showed the utility of the apex (segment 17) for diagnosing LAD CAD disease, as well as the applicability of calculating the difference between peak and stress values in patients with WMA [6]. As ESE can provide important prognostic information by reproducing the physiological response with exercise, it is generally considered the test of choice over DSE [1]. Further studies are needed to validate the additional diagnostic value of strain imaging in treadmill ESE.

Study limitations

The main limitation of our study was the absence of systematic coronary angiograms for all eligible patients. This was not feasible due to the study's retrospective nature. While we could not evaluate CAD in all patients with negative ESE, the one-year follow-up allowed us to examine for clinically driven diagnoses of coronary stenoses. For this reason, our diagnostic parameters must be interpreted with caution and viewed as hypothesis-generating. Our study may provide investigators with pilot data to conduct a prospective study with systematic angiograms to provide true estimates of speckle-tracking sensitivity in the post treadmill exercice context. Second, we could not complete STE in patients who required contrast and did not have optimal images, thus limiting our study's generalizability. Third, the visual assessment of the ESEs was only done by one single echocardiographer. Fourth, STE evaluations were performed by two non-expert operators. Finally, we did not achieve perfect correction of characteristics imbalance between the two groups of patients with standardized differences above 0.1 in some covariates. However, achieving standardized differences below 0.25 may be reasonable given our modest sample size [26].

Conclusions

Two-dimensional speckle-tracking of the apical segments combined with treadmill ESE is relatively feasible in patients who do not require contrast, has good interobserver reliability, and may provide incremental diagnostic value for reclassifying false-positive ESE. Future prospective studies of ESE with two-dimensional speckle-tracking with larger sample sizes and coronary angiograms for all study participants are needed to confirm our findings.