1 Introduction

Cardiopulmonary exercise testing (CPET), where patients exercise to peak exertion provides an objective indicator of an individual’s aerobic fitness and is commonly advocated for preoperative risk stratification [1, 2]. Whilst CPET is considered the gold standard tool for assessing an individual’s fitness; it is not widely available, costly to conduct and the maximal nature may place a potentially unnecessary physiological burden on individuals with significant co-morbidities [3]. Submaximal exercise testing (SET) offers a cheaper, less physically demanding and easier to conduct alternative to CPET; during SETs individuals exercise to the point of subjective limitation [3]. Unlike CPET however, there is no widely accepted protocol for conducting and assessing the results of SETs [3]. Heart rate recovery (HRR) is thought to be governed by vagal reactivation and sympathetic withdrawal [4,5,6,7]. Imai et al. showed that there are significant differences in HRR between athletes and heart failure patients suggesting a relationship between HRR and exercise capacity [5]. Further studies have confirmed this finding, suggesting that HRR is a surrogate marker of an individual’s fitness [4, 7, 8]. Assessment of HRR following SETs could therefore provide an easily obtained indicator of an individual’s fitness [4,5,6,7] therefore allowing SETs to become widely adopted tool for pre-operative risk assessment, monitoring responses to pre-habilitation and treatment, assessing fitness to undergo treatment and monitoring disease progression.

HRR1, calculated as the difference between heart rate (HR) at cessation of exercise and HR at 1 min of rest (HRR1), is a commonly used method for quantification of HRR. This was first validated by Cole et al. [9] in a population of individuals at risk of coronary artery disease referred for exercise testing. They found that a HRR1 ≤ 12 beats per minute (bpm) after maximal exercise was a strong predictor of mortality in the 6-year follow-up period. HRR1 (following maximal exertion) has since been shown to be a useful predictive and prognostic tool in a variety of clinical populations including as a predictor of poorer postoperative outcomes in patients undergoing non-cardiac surgery [10]. In a multivariable analysis impaired HRR1 response following a 6-min walk test was independently associated with a significantly increased risk of postoperative cardiopulmonary complications in patients undergoing lung resection surgery for lung cancer [11]. A secondary analysis of the measurement of exercise tolerance before surgery study demonstrated an impaired HRR1, in multivariable analysis, was independently associated with a significantly increased risk of myocardial injury in the perioperative period [10]. A recent study comparing HRR following an orthostatic challenge (a markedly ‘submaximal’ haemodynamic challenge) found that patients who suffered from perioperative myocardial injury had significantly slower HRR as compared with patients who remained free of myocardial injury [12]. Furthermore, in a recent secondary analysis of prospectively collected pre-operative HRR data following six-minute walk test (a submaximal exercise test) in patients undergoing lung cancer resection our group has demonstrated that area under the HRR vs. time curve (HRRAUC), a novel method for quantifying HRR devised by our research team, was significantly elevated in patients who suffered from cardiopulmonary complications (55.2 bpm*min) compared with those without (38.9 bpm*min) (95% confidence interval (CI) 0.13–32.5, p = 0.048) [13].

However, there has been significant debate surrounding the reproducibility of HRR following SET [14,15,16]. Studies have shown that at fixed workloads HRR1 and HRR2 are reproducible measures [14, 16], however when the workloads between tests are varied the reproducibility of HRR decreases [15]. Furthermore studies have also found HRR1 to be significantly different across different modalities of maximal exercise [17, 18].

Since by their nature, the workload for SETs cannot be predetermined and given the submaximal nature of SETs the workload may vary depending on exercise modality. Before testing of the validity of HRR following SETs as a marker of fitness in clinical populations we sought to assess the influence of both exercise intensity and modality on the reproducibility of HRR. We sought to determine the reproducibility of HRR1, HRR2 and a novel area under the HRR vs. time curve (HRRAUC) method for quantifying HRR, across different workloads and different modalities of submaximal exercise testing in healthy volunteers. We hypothesized that by including all heart rate data for a full six minutes of recovery, and as a result being influenced less by peak heart rate during exercise, our novel HRRAUC method would be more reproducible than HRR1/2.

2 Methods

We used the STROBE cohort checklist when writing our report [19]. SEARCH I (exercise intensity study) was externally peer reviewed and funded by Medical Research Scotland. SEARCH II (exercise modality study) was externally peer reviewed and funded by the British Journal of Anesthetics and Royal College of Anesthetists. Ethical approval for both studies was granted by the Golden Jubilee Research Institute (GJRI). GJRI reference numbers: 18/ANAES/03 (SEARCH‐I) and 19/ANAES/03 (SEARCH‐II). Since both studies recruited healthy volunteers, they were exempt from NHS Research Ethics procedures. Written informed consent was obtained from all volunteers in both studies. The studies were conducted in accordance with standards set by the Declaration of Helsinki and in accordance with local statutory requirements [20]. Sample size calculation was not performed in both studies, however the sample size is consistent with recommendations for pilot studies [21].

2.1 Exercise intensity study

Thirty-four healthy volunteers were recruited by convenience sampling method to undergo two SETs by cycle ergometry at two different workloads at the Golden Jubilee Research Institute, from June 2018 to August 2018. Inclusion criteria were > 18 years old and World Health Organization performance status ≤ 1. Exclusion criteria were; taking β blockers, pregnant or where exercise testing was contraindicated per the American Thoracic Society’s contraindications for six-minute walk test [22]. Further details regarding the inclusion and exclusion criteria are shown in Supplementary Table 1.

2.2 Cycle ergometry

The cycle ergometry test followed a modified version of Astrand-Rhyming cycle ergometer test [23]. Volunteers took part in 2 SETs of differing workloads in a randomized order with a minimum of one hour of rest between each test. The workload was varied between each test; 40% and 60% of the volunteer’s predicted maximal workload in watts (Wmax). The Wmax was calculated using the following formula [24]:

$$\mathrm{Wmax}=\left(\frac{20.4\;\left(\mathrm{Height}\;\mathrm{in}\;\mathrm{cm}\right)-8.74\;\left(\mathrm{Age}\right)-288\;\left(\mathrm{Sex}\right)-1,909\mathrm k\mathrm p\mathrm m/\min}{6.116}\right)\mathrm \times1.08$$

Volunteers were requested not to consume any caffeine three hours prior to SETs [25]. Prior to the SET the seat position and handlebar position on the cycle ergometer were adjusted such that they were comfortable for the volunteers. Before the SET, the volunteers read a standard written instruction, stating that the protocol and all volunteers underwent a practice test at 20% volunteers Wmax to familiarize them with the controls of the cycle ergometer and protocol. After sitting on the bike at rest to record their baseline HR, volunteers were then instructed to cycle until they achieved and maintained a cadence of 50–60 revolutions per minute. During the first minute of the test volunteers cycled unloaded to warm up. The resistance was then increased to the predetermined target workload over the next two minutes. Volunteers continued to cycle at this effort level for the next three minutes. Volunteers were instructed to cease pedaling at 6-min and to stay stationary for 6 min of rest. Volunteers were asked not to speak during the exercise test or recovery as previous studies have shown this significantly impacts HR variability [4]. If during the test volunteers reached a HR > 85% age predicted maximal HR (HRmax) (calculated using the formula from Tanaka et al. [26]) exercise test was terminated.

Volunteers were permitted to terminate the test at any stage. HR data was continuously measured and recorded at 30 s intervals during and after exercise was measured using an electrocardiography (ECG) GE DASH 4000 device and GE CRIKTON DINAMAP device and stored on paper-based case report form.

2.3 Exercise modality study

A second cohort of thirty-one healthy volunteers were recruited by convenience sampling method to undergo three SETs of differing exercise modalities, using the same inclusion and exclusion criteria (Supplementary Table 1) at the Golden Jubilee Research Institute, from January 2020 to March 2020. All volunteers underwent three SETs of differing modalities in a randomized order, with a minimum of one hour of rest between each modality; cycle ergometry, step test and shuttle walk test. The exercise protocol for the Shuttle Walk test was developed by modifying and combining elements from the 6-min Walk Test and Shuttle Walk test [22, 27]. The Step Test followed a modified version of Step Oximetry test [28]. Both protocols were modified to match the modified version of Astrand-Rhyming cycle ergometer test used in the exercise intensity study [22, 23, 28]. Pilot testing and data from exercise intensity study demonstrated that three minutes of exercise was sufficient to achieve target HR of > 50% HRmax across all three modalities of exercise. During the first two minutes of exercise the volunteers were encouraged to exercise until they achieved a target HR of > 50% of their age predicted maximal HR (HRmax) calculated using the formula from Tanaka et al. [26]. The target HR of > 50% HRmax was selected as a mid-way point between the workloads in exercise intensity study (40% and 60%). Volunteers were then maintained at this HR for one further minute. At the end of the minute the volunteers were asked to stop exercising immediately and stay seated at rest for six minutes. If the volunteers failed to achieve HR of > 50% HRmax within the first three minutes, they were encouraged to exercise further until they achieved and maintained a HR of > 50% HRmax for one minute. The exercise test was terminated at six minutes, regardless of whether volunteers achieved target HR. If during the test volunteers reached a HR > 85% age predicted maximal HR (HRmax) exercise test was terminated. All other factors were kept the same as exercise intensity study. HR was recorded at 30 s intervals throughout the exercise tests and recovery using ECG GE DASH 4000 device and wrist-worn HR monitor (Nonin WristOx2® Model 3150 OEM).

2.4 Cycle ergometry

The Cycle ergometry test was carried out as per the protocol described above at 50% of the volunteer’s predicted maximal workload in watts (Wmax) and volunteers were exercised until they achieved and maintain target HR of > 50% HRmax for one minute.

2.5 Step test

The Step test followed a modified version of the Step Oximetry test using a 15 cm step [29]. Volunteers were asked to climb up and down on the step at a self-selected pace but were encouraged to progressively increase their pace until target HR of > 50% HRmax was achieved. Volunteers were then asked to maintain this pace to maintain HR of > 50% HRmax for one minute.

2.6 Shuttle walk test

The Shuttle Walk test followed a protocol modified version of the the 6-min Walk Test and Shuttle Walk Test [22, 27]. Volunteers were asked to walk back and forth in a 10-m corridor until volunteers achieved and maintained target HR of > 50% HRmax for minute. At the end of the test participants were asked to walk back to the start and sit down for six minutes of rest. During exercise HR was monitoring using a wrist worn HR monitor and with ECG during the recovery phase.

2.7 Statistics

HRR was quantified using the following methods, shown in Fig. 1:

  1. A-

    HRR1—Absolute difference between HR at exercise cessation and HR at 1 min of recovery

  2. B-

    HRR2—Absolute difference between HR at exercise cessation and HR at 2 min of recovery

  3. C-

    Area under the curve (HRRAUC)—Area under the HRR vs. Time curve over 360 s of recovery computed using trapezoid method

Fig. 1
figure 1

Methods for quantifying Heart rate recovery (HRR). Example HRR vs time curve; time 0 reflects cessation of exercise. A HRR1—Absolute difference between HR at exercise cessation and HR at 1 min of recovery. B HRR2—Absolute difference between heart rate (HR) at exercise cessation and HR at 2 min of recovery. C Area under the curve (HRRAUC)—Area under the HRR vs. Time curve over 360 s of recovery computed using trapezoid method

Normality of distribution of data was assessed by Shapiro‐Wilk test. Categorical data is presented as counts and percentages and continuous data as mean (± standard deviation) or median [interquartile range] as appropriate to distribution. To ascertain the reproducibility of HRR indices Pearson’s correlation coefficient (r) and Intra-class correlation coefficient (ICC) (single rater, absolute agreement, 2-way mixed-effects model) were calculated [30]. To assess agreement between HRR indices following exercise at differing intensity and modality, Bland‐Altman analysis was conducted. Data analysis was carried out using Rstudio (© 2009 – 2019 Rstudio, Inc.).

3 Results

Thirty-four volunteers were recruited to the exercise intensity study, the median age of the sample was 34 years (22–54) and 15 volunteers (44%) were female. Thirty-two volunteers were recruited to exercise modality study, the median age of the sample was 26 years (21–44) and 16 volunteers (52%) were female. Baseline characteristics from both samples are shown in are shown in Table 1. The HR value at peak exercise on the Bike test could not be recorded due to a HR monitor fault in one volunteer and two volunteers did not achieve target HR of > 50% HRmax during the Walk test despite encouragement; these subjects were retained in the main analysis, but a sensitivity analysis was conducted by excluding these participants.

Table 1 Demographic information on volunteers in the SEARCH I (exercise intensity) and SEARCH II (exercise modality) studies

HRR1 and HRR2 were poorly reproducible when workloads were varied between tests (ICC < 0.40 for all comparisons). HRR1 had very weak positive correlation across different workloads (r = 0.22, p = 0.22), whilst for HRR2, the association was stronger (r = 0.62, p = 0.001). The HRRAUC method proved to be moderately reproducible across different workloads (ICC 0.58, 95% CI 0.14–0.80) and demonstrated a moderate positive correlation across different workloads (r = 0.68, p < 0.001), Table 2.

Table 2 Intra-class Correlation Coefficient and Pearson’s Correlation Coefficient values for HRR models across different workloads (SEARCH-I study (exercise intensity))

Bland‐Altman plots and scatterplots for HRR1, HRR2 and HRRAUC from the exercise intensity study are shown in Fig. 2. Visual inspection of the Bland‐Altman plots did not reveal any evidence of heteroscedastic scatter. Bland‐Altman analysis of the exercise intensity study revealed a systematic bias such that exercise conducted at 60% Wmax on average estimated a higher value for HRRAUC, HRR1 and HRR2 than at 40% Wmax, though this bias was less for the HRRAUC method than HRR1 and HRR2 (mean difference 6.87% versus 18.33% and 29.21% respectively), Fig. 2.

Fig. 2
figure 2

Bland‐Altman plots and Scatterplots of SEARCH I (exercise intensity). Bland‐Altman plots and Scatterplots reflecting agreement between HRR values across two different workloads (60% and 40% of calculated maximal workload (Wmax)) for HRR1, HRR2 and HRRAUC values. Bland‐Altman plots; A HRR1 60% Wmax vs. 40% Wmax, B HRR2 60% Wmax vs. 40% Wmax and C HRRAUC 60% Wmax vs. 40% Wmax. Scatterplots; D HRR1 60% Wmax vs. 40% Wmax, E HRR2 60% Wmax vs. 40% Wmax and F HRRAUC 60% Wmax vs. 40% Wmax. The reference line represents the line of equality: y = x, bpm = beats per minute

HRR1 and HRR2 were poorly reproducible between different modalities of exercise (ICC < 0.50). HRR1 for Cycle Ergometry vs. Step Test showed a moderate positive association (r = 0.53, p = 0.003) whilst for the comparisons of Cycle Ergometry vs. Shuttle Walk Test and Step Test vs. Shuttle Walk Test, HRR1 showed a low positive correlation (r < 0.4). Sensitivity analysis conducted by excluding individual 2 and 8 showed an improvement in these associations (r = 0.42, p = 0.03) and (r = 0.53, p = 0.003) respectively). The HRRAUC method proved to be moderately reproducible between different modalities of exercise, Cycle ergometry vs. Step Test (ICC = 0.63 (0.14–0.84)), Cycle ergometry vs. Shuttle Walk Test (ICC = 0.54 (−0.05–0.81)) and Step Test vs. Shuttle Walk Test (ICC = 0.80 (0.61–0.90)). The HRRAUC method demonstrated very strong positive correlation across all comparisons (r ≥ 0.75), in Table 3.

Table 3 Intra-class Correlation Coefficient and Pearson’s Correlation Coefficient values for HRR models across exercise modalities (SEARCH- II study (exercise modality))

Bland‐Altman plots and scatter plots for HRR1, HRR2 and HRRAUC from the exercise modality study are shown in, Figs. 3, 4 and 5 respectively. Visual inspection of the Bland‐Altman plots did not reveal any evidence of heteroscedastic scatter. Bland‐Altman analysis of HRR1 from the exercise modality study revealed a systemic bias such that the Step Test, on average estimated a higher value for HRR1 across both comparisons involving the Step Test (mean difference of 32.07% for Step Test vs. Cycle Ergometry and 24.28% for Step Test vs. Shuttle Walk Test), Fig. 3. Bland–Altman analysis of HRRAUC also revealed a systemic bias such that the Cycle ergometry test on average estimated a higher value for HRRAUC for the same person across both comparisons involving the Cycle Ergometry though this bias was less for the HRRAUC method than HRR1 (mean difference of 7.29% for Cycle Ergometry vs. Step Test and 2.47% for Cycle Ergometry vs. Shuttle Walk Test), Fig. 3. The Bland‐Altman and scatterplots for HRR2 are shown in Fig. 5 and demonstrate a systemic bias such that the Step Test, on average estimated a higher value for HRR2 across both comparisons involving the Step Test (mean difference of 25.05% for Step Test vs. Cycle Ergometry and 30.32% for Step Test vs. Shuttle Walk Test), Fig. 5.

Fig. 3
figure 3

Bland‐Altman plots SEARCH‐II (exercise modality) for HRR1and HRRAUC. Bland‐Altman plots reflecting percentage difference between HRR values across three different exercise modalities (Cycle ergometry, Step test and Shuttle walk test) HRR1 and HRRAUC. Bland‐Altman plots; A HRR1 Cycle ergometry vs. Step Test, B HRRAUC Cycle ergometry vs. Step Test, C HRR1 Cycle ergometry vs. Shuttle walk test, D HRRAUC Cycle ergometry vs. Shuttle walk test, E) HRR1 Step Test vs. Shuttle walk test and F) HRRAUC Step Test vs. Shuttle walk test

Fig. 4
figure 4

Scatterplots SEARCH‐II (exercise modality) for HRR1 and HRRAUC. Scatter plots reflecting agreement between HRR values across three different exercise modalities (Cycle ergometry, Step test and Shuttle walk test) HRR1 and HRRAUC Scatter plots: A HRR1 Cycle ergometry vs. Step Test, B HRR1 Cycle ergometry vs. Shuttle walk test, C HRR1 Step Test vs. Shuttle walk test, D HRRAUC Cycle ergometry vs. Step Test, E HRRAUC Cycle ergometry vs. Shuttle walk test, and F) HRRAUC Step Test vs. Shuttle walk test. The reference line represents the line of equality: y = x, bpm = beats per minute

Fig. 5
figure 5

Bland‐Altman plots and Scatterplots of SEARCH II (exercise modality) for HRR2. Bland‐Altman plots and Scatterplots reflecting agreement between HRR2 values across three different exercise modalities (Cycle ergometry, Step test and Shuttle walk test). Bland‐Altman Plots: A HRR2 Cycle ergometry vs. Step Test, B HRR2 Cycle ergometry vs. Shuttle walk test, C HRR2 Step Test vs. Shuttle walk test. Scatterplots: D HRR2 Cycle ergometry vs. Step Test, E HRR2 Cycle ergometry vs. Shuttle walk test, and F HRR2 Step Test vs. Shuttle walk test. The reference line represents the line of equality: y = x, bpm = beats per minute

4 Discussion

The main findings of these studies are that the novel HRRAUC method for quantifying HRR following SETs demonstrates moderate reproducibility across different workloads and different modalities of exercise where HRR1 and HRR2 are poorly reproducible. This suggests that the novel HRRAUC may be a more objective method for quantifying HRR following SET.

Previous studies have shown that at fixed workloads HRR1 and HRR2 are reproducible measures. Arduini et al. [14] recruited 21 healthy individuals to undergo a set of cycle ergometer submaximal exercise tests at 65% and 80% age predicted maximum heart rate followed by passive recovery; this was then repeated 2 weeks later. These authors demonstrated that at fixed workloads HRR1 and HRR2 were reproducible measures between individuals at each workload (mean ICC between trial 1 and trial 2: 0.827 and 0.747 respectively), but no comparison was made of HRR between the different workloads. Similarly, Orini et al. [15] repeated exercise stress tests at 50% of the absolute predicted maximum workload in 1187 individuals 34.2 ± 2.8 months apart and found that at fixed workloads (when the difference in workload between tests was ≤ 10 watts) HRR indices were reproducible. However, they found that in participants where the difference in workload between tests was > 10W, this negatively impacted the reproducibility of HRR1 and HR response during exercise, suggesting that both HR response to exercise and HRR may be effort dependent. The results of the current study are in keeping with these findings and demonstrate that when workloads are varied (by 20W between cycle ergometry tests), HRR1 and HRR2 are poorly reproducible. The novel HRRAUC method however appears to cope better with this difference in workload and demonstrates moderate reproducibility across different workloads (ICC 0.58, 95% CI (0.14–0.8)). This is further emphasized by the Bland‐Altman analysis which shows that there is poor agreement with very wide limits of agreement between HRR1 and HRR2 values when the workload is varied. HRRAUC however demonstrates better agreement and narrower limits of agreement with changing workload.

Whilst it has been widely accepted that in healthy individuals, the time constant of heart rate decay in the first 30 s of recovery (early phase of recovery) is almost exclusively a function of vagal reactivation [5], there is evidence to challenge this philosophy which suggests that early phase of recovery is instead due to a combination of vagal reactivation and sympathetic withdrawal [6, 7]. Pierpont et al. [6], modelled the relative contributions of the sympathetic and parasympathetic nervous system on HRR following maximal and submaximal exercise. This demonstrated that the relative contributions of each arm of the autonomic nervous system were dependent on the intensity of exercise. Higher intensities of exercise (which lead to increased sympathetic activation during exercise) were associated with increased contribution of sympathetic withdrawal to HRR in the early phase of recovery, whilst at lesser exercise intensities (which led to lesser sympathetic activation during exercise) were associated with lesser contribution of sympathetic withdrawal to HRR in the early phase of recovery. Gourine et al. [4], suggested a similar physiology for HR during exercise; increasing workload during exercise is associated with increased sympathetic contribution to HR control. They also suggested that there is a threshold of HR beyond which rises in HR during exercise are governed exclusively by sympathetic activation. This suggests, as is perhaps intuitive, the level of sympathetic activation during exercise is dependent on the intensity of exercise. Subsequently the degree of sympathetic withdrawal during recovery depends on the degree of sympathetic activation during exercise. From these findings, it is reasonable to hypothesize that since different individuals would exercise to different intensities during SETs, they would have different levels of sympathetic activation during exercise and in turn will have different levels of sympathetic withdrawal during recovery. The HRR response following SETs would therefore naturally be dependent on exercise intensity (i.e., effort dependent). We hypothesized that since HRR1/2 ignore the HRR data beyond the first 2 min of recovery and may miss key information contained within the HRR curve regarding an individual’s HRR dynamics and autonomic innervation beyond two minutes. Visual inspection of the HRR vs. time plots revealed that in many volunteers’ HR had not plateaued until 4 or more minutes into recovery. Further, by quantifying the dynamics of HRR for the entire 6 min of recovery, the HRRAUC is likely to be less influenced by peak HR which in turn is influenced by peak workload during exercise.

Previous studies have documented that HRR1 differs significantly between different modes of maximal exercise. Maeder et al. [17] found that HRR1 values obtained following maximal cycling were significantly higher than following maximal treadmill exercise, both in healthy subjects and in heart failure patients. In their study, peak oxygen consumption (VO2) obtained during the treadmill test was significantly higher than that obtained during cycle ergometry despite similar peak HR, suggesting increased work associated with the treadmill test. The authors hypothesized that exercises involving greater muscle mass (reflected by a higher VO2), require a higher heart rate during the early phase of recovery to maintain cardiac output and allow clearance of metabolites that have accumulated in these muscles; thus, resulting in a slower rate of HRR. Randive et al. [18] found similar results; HRR1 was significantly lower during leg ergometry as compared with arm ergometry (where peak VO2 was lower) despite similar peak HR between both tests. The current study also found HRR1 and HRR2 to be poorly reproducible across different exercise modalities despite similar peak HR; this may reflect the differences in muscle use between the differing exercise tests, in keeping with the hypothesis of Maeder et al. Moreover, our study also found HRRAUC to be highly reproducible between Shuttle walk vs. Step test comparison, given that these two exercises involve greater muscle mass than cycle ergometry this adds further evidence to the hypothesis that muscle mass involvement may also influence the early phase of heart rate recovery. Despite this however the novel HRRAUC method demonstrates moderate reproducibility across all three comparisons. We hypothesize, by including HRR data beyond the early phase the HRRAUC method proves to be less influenced by factors impacting early phase of recovery such as greater muscle mass and therefore demonstrates moderate reproducibility despite differing exercise modality and therefore muscle mass involvement. Lastly, the Bland‐Altman analysis also revealed poor agreement and very wide limits of agreement for HRR1 and HRR2 across all comparisons. In comparison the HRRAUC method demonstrates better agreement between different exercise modalities with mean percentage difference < 10% across all comparisons. Furthermore, the limits of agreement are much narrower for the HRRAUC across all three comparisons which is emphasized by visual inspection of the Bland–Altman plots in Fig. 3.

Two volunteers (volunteer 2 and 8) did not achieve target HR > 50% of HRmax on the Shuttle Walk test; both of whom were young and comparatively fit suggesting that the Shuttle Walk test may not be sufficient work for younger fitter individuals and hence an inappropriate test in this demographic. Sensitivity analysis conducted by excluding these individuals showed an increase in reproducibility for both HRR1 and HRR2, but interestingly did not affect the reproducibility of HRRAUC. This suggests that the HRRAUC may require a much lower minimum peak work to yield reproducible results. Further work is however required to establish the minimum peak effort required to yield reproducible HRR responses and consolidate this finding.

Whilst these studies identify important novel findings for HRR following SETs, they are not without their limitations. These studies included mostly young healthy individuals which limits the extrapolation of results to an older age group; this simply reflects the ease of recruiting students as healthy volunteers to take part in an exercise study. The study excluded volunteers taking beta blockers as this could potentially affect heart rate during test however a large proportion of participants were consuming alcohol, and a small proportion of the participants were on calcium channel blockers which may have affected the heart rate, though this likely reflects the clinical population in which HRR may be used. Standardization of workloads between exercise modalities may have increased reproducibility, however this would be technically challenging without expensive exercise equipment. We did however aim to maintain volunteers at HRmax at peak exercise between 50%–85% of the volunteers age predicted maximum HR. Furthermore, to be widely adopted as an easily obtained indicator of fitness, SETs need to be able to be conducted without expensive exercise equipment which is not as widely available. The duration of exercise between modalities of exercise also varied which may have been another potential confounder but all volunteers achieved and maintained target HR (HR > 50% of age predicted HRmax) before cessation of exercise. Lastly, whilst the results of this study show that HRRAUC method has better reproducibility across differing workload and exercise modalities as compared with HRR1 and HRR2, there is currently limited evidence supporting the utility of HRRAUC method as for assessing pre-operative fitness in individuals highlighting the need for further studies. Further research is also required to establish the effect of exercise time on the reproducibility of HRR and the minimum peak effort required to produce a HRR response.

5 Conclusion

The current study demonstrates that HRRAUC is a moderately reproducible measure across different workloads and modalities of exercise and therefore, may be a superior method to traditional methods for quantifying HRR following SETs. There is currently limited evidence supporting the utility of HRRAUC method as for assessing pre-operative fitness in individuals highlighting the need for further studies. Further research is also required to establish the effect of exercise time on the reproducibility of HRR and the minimum peak effort required to produce a HRR response. This may allow HRR to be more widely utilized across various clinical populations for pre-operative risk assessment, monitoring responses to pre-habilitation and to treatment, assessing fitness to undergo treatment and monitoring disease progression.