Submaximal Fitness Test in Team Sports: A Systematic Review and Meta-Analysis of Exercise Heart Rate Measurement Properties

Shushan, Tzlil; Lovell, Ric; Buchheit, Martin; Scott, Tannath J.; Barrett, Steve; Norris, Dean; McLaren, Shaun J.

doi:10.1186/s40798-023-00564-w

Submaximal Fitness Test in Team Sports: A Systematic Review and Meta-Analysis of Exercise Heart Rate Measurement Properties

Systematic Review
Open access
Published: 24 March 2023

Volume 9, article number 21, (2023)
Cite this article

Download PDF

You have full access to this open access article

Sports Medicine - Open Submit manuscript

Submaximal Fitness Test in Team Sports: A Systematic Review and Meta-Analysis of Exercise Heart Rate Measurement Properties

Download PDF

6045 Accesses
17 Altmetric
Explore all metrics

Abstract

Background

Submaximal fitness tests (SMFT) are a pragmatic approach for evaluating athlete’s physiological state, due to their time-efficient nature, low physiological burden and relative ease of administration in team sports settings. While a variety of outcome measures can be collected during SMFT, exercise heart rate (HRex) is the most popular. Understanding the measurement properties of HRex can support the interpretation of data and assist in decision making regarding athlete’s current physiological state and training effects.

Objectives

The aims of our systematic review and meta-analysis were to: (1) establish meta-analytic estimates of SMFT HRex reliability and convergent validity and (2) examine the moderating influence of athlete and protocol characteristics on the magnitude of these measurement properties.

Methods

We conducted a systematic literature search with MEDLINE, Scopus and Web of Science databases for studies published up until January 2022 since records began. Studies were considered for inclusion when they included team sports athletes and the reliability and/or convergent validity of SMFT HRex was investigated. Reliability statistics included the group mean difference (MD), typical error of measurement (TE) and intraclass correlation coefficient (ICC) derived from test–retest(s) designs. Pearson’s correlation coefficient (r) describing the relationship between SMFT HRex and a criterion measure of endurance performance was used as the statistic for convergent validity. Qualitative assessment was conducted using risk of bias assessment tool for non-randomised studies. Mixed-effects, multilevel hierarchical models combined with robust variance estimate tests were performed to obtain pooled measurement property estimates, effect heterogeneity, and meta-regression of modifying effects.

Results

The electronic search yielded 21 reliability (29 samples) and 20 convergent validity (29 samples) studies that met the inclusion criteria. Reliability meta-analysis indicated good absolute (MD = 0.5 [95% CI 0.1 to 0.9] and TE = 1.6 [95% CI 1.4 to 1.9] % points), and high relative (ICC = 0.88 [95% CI 0.84 to 0.91]) reliability. Convergent validity meta-analysis indicated an inverse, large relationship (r = − 0.58 [95% CI − 0.62 to − 0.54]) between SMFT HRex and endurance tests performance. Meta-regression analyses suggested no meaningful influence of SMFT protocol or athlete characteristics on reliability or convergent validity estimates.

Conclusions

Submaximal fitness test HRex is a reliable and valid proxy indicator of endurance performance in team sport athletes. Athlete and SMFT protocol characteristics do not appear to have a meaningful effect on these measurement properties. Practitioners may implement SMFT HRex for monitoring athlete’s physiological state by using our applied implications to guide the interpretation of data in practice. Future research should examine the utility of SMFT HRex to track within-athlete changes in aerobic capacity, as well as any further possible effects of SMFT protocols design elements or HRex analytical methods on measurement properties.

Registration Protocol registration can be found in Open Science Framework and available through https://doi.org/10.17605/OSF.IO/9C2JV.

The Importance of Muscular Strength: Training Considerations

Article 25 January 2018

No Time to Lift? Designing Time-Efficient Training Programs for Strength and Hypertrophy: A Narrative Review

Article Open access 14 June 2021

Minimal-Dose Resistance Training for Improving Muscle Mass, Strength, and Function: A Narrative Review of Current Evidence and Practical Considerations

Article 25 November 2021

Key Points

Exercise heart rate (HRex) during submaximal fitness tests is a reliable and valid proxy measure of endurance performance in team sports.
Athlete and test protocol-related characteristics do not appear to meaningfully affect HRex measurement properties.
Our findings provide implications for test protocol selection, as well as conceptual and statistical data interpretations when using HRex for evaluating athlete’s physiological state in team sports.

Introduction

Quantifying athlete’s responses to training programmes is an integral part of the training process of team sports [1]. Assessing aerobic-oriented training effects is preferably made via maximal exhaustive tests administered in field-based or laboratory conditions, with higher physiological or performance outcomes indicative of improved aerobic capacity or adaptation to a training programme [2, 3]. However, given the limited viability of repeated maximal endurance performance tests in team sports, there is a growing interest in collecting proxy (surrogate) outcome measures of physiological capacities during alternative, time-efficient and non-exhaustive exercise assessments [4, 5].

Submaximal Fitness Tests (SMFT) provide a pragmatic approach for evaluating physiological state by assessing an athlete’s internal load or responses to a standardised physical stimulus [6, 7]. We recently undertook a review of the literature on SMFT in team sports [8] and identified many variations in protocols, outcome measures and monitoring purposes. Exercise heart rate (HRex) is the most utilised SMFT outcome measure, perhaps due to the strong relationship to oxygen uptake during a continuous, (intended steady-state) exercise [4], supporting its use as a surrogate marker of aerobic capacity of cardiovascular fitness [4, 9]. Although various resting, exercise and recovery HR-derived measures may provide an insight into the cardiac autonomic nervous system state [10], HRex has also been proposed as a marker of shorter-term physiological stress given its sensitivity to factors such as an exposure to extreme environments (e.g., heat, altitude) [11, 12] or a training-induced overreaching state [10, 13]. Therefore, SMFT HRex might provide practitioners with a valuable insight into athlete monitoring in team sports [4, 9].

Understanding the measurement properties of an outcome measure is a fundamental aspect in sports performance testing and monitoring [14]. Reliability refers to the consistency and reproducibility of an outcome measure [15, 16]. This is typically assessed using test–retest measurements [17] and quantifies the expected ‘noise’ (biological or measurement error) of the outcome measure [15, 17], which can subsequently assist in establishing meaningful changes for decision making [4]. Validity refers to the extent to which a measure represents the variable or construct it is intended to and includes several dimensions. For example, construct validity is the extent to which variables are reflective of theoretical constructs described by constitutive definitions and operationalizations. A sub-domain of construct validity is convergent validity—the degree to which two measures of a theoretical construct that should be related, are in fact related [18]. Within the SMFT framework, HRex should, theoretically at least, be associated with both positive [6] and negative [19] training effects.

To better understand the utility of SMFT HRex as a monitoring tool, a thorough knowledge of its measurement properties is required. There have been a considerable number of studies examining the reliability and convergent validity of SMFT HRex in team sport athletes; however, the overall (pooled) estimates of these measurement properties, the associated between- and within-study variance, and the influence of modifying effects have yet to be established. Therefore, the aims of our current paper are: (1) to provide the first meta-analysis of SMFT HRex reliability (absolute and relative) and convergent validity; and (2) to examine the influence of athlete’s and SMFT characteristics on these measurement properties.

Methods

Registration and Search Strategy

Our systematic review and meta-analysis was conducted in accordance with PRISMA (Preferred Reporting Items for Systematic reviews and Meta-Analyses) 2020 guidelines [20] (PRISMA checklist items are available in the Additional file 1), and was prospectively registered in Open Science Framework (available through https://doi.org/10.17605/OSF.IO/9C2JV). Details on the registration process and changes made between the original to final protocol are available in Additional file 2. The research question for this review was developed as part of an iterative process originated in our previous review [8]. For this, we updated the systematic searches with an extension of the inclusion and exclusion criteria (Table 1). The electronic databases MEDLINE, Scopus and Web of Science were searched on multiple occasions, commencing in January 2020 and finalised in January 2022, denoting the earliest search date of the original review and the most updated searches for the meta-analysis, respectively. Detailed descriptions presenting the search strategy and results are provided in the Additional file 2.

Table 1 Meta-analysis inclusion and exclusion criteria

Full size table

Screening and Study Selection

The updated aspects in the inclusion–exclusion criteria (Table 1) for this review were related to study design and SMFT outcome measure, and the searching and screening processes are presented in Fig. 1. We sought to include studies examining SMFT HRex reliability using test–retest(s) designs and reporting absolute (group mean difference [MD], typical error of measurement [TE]) and/or relative (intraclass correlation coefficient [ICC]) effect estimates. Convergent validity studies included correlational study designs examining the relationship (correlation coefficient [r]) between SMFT HRex and an established team sport endurance performance test administered in a cross-sectional manner. In team sports, assessing endurance performance has been undertaken using via a variety of maximal effort and exhaustive tests administered in laboratory (typically, maximal oxygen uptake) or field-based (e.g., intermittent endurance capacity) conditions. While laboratory tests measuring maximal oxygen uptake are considered as a criterion measure of aerobic capacity, field-based tests are more pronounced in team sports (perhaps due to their practicality) and have been shown to deliver a satisfactory degree of reliability and validity [2, 21, 22]. These tests generally involve external intensity measures (either total distance covered or final running speed achieved) that are used as a proxy indicator of aerobic capacity, albeit we acknowledge other, non-aerobic qualities (anaerobic, neuromuscular) may contribute towards performance result [2, 21].

Data Extraction and Coding

We extracted and coded the following study characteristics (Tables 2 and 3):

1.
Author and year of publication.
2.
Cohort mean age (years) and age category (youth [< 18 years] or senior [≥ 18 years]).
3.
Sex.
4.
Sport.
5.
Competition level (elite, sub-elite and mixed). Based on the information provided by the authors, elite players compete in the highest level of their age group in the country. Sub-elite players do not compete in the highest level of their age group, whereas mixed represents a group of players from both levels.
6.
SMFT protocol category. We earlier proposed an operational taxonomy for unique SMFT protocols based on two levels of classification, including the exercise regimen (continuous or intermittent) and rate of changes in exercise intensity (fixed, incremental or variable). Figure 2 presents the characteristics of protocols and their utilisation in the included studies.
7.
SMFT duration (minutes). Based upon the details provided in the text, the total duration was rounded to the nearest 30 s.
8.
SMFT internal exercise intensity, expressed as mean HRex. In our analyses, we considered HRex values analysed as percentage points of heart rate maximum (%HR_max). Accordingly, when a study reported statistical results for both beats·min⁻¹ and %HR_max, we selected the results presented in %HRmax only. In addition, if a study reported findings solely in beats·min⁻¹, we used additional information to convert the results into %HR_max (see “Handling Missing Data” section for details).
9.
SMFT HRex collection methodology (fixed time point, mean range, or mean overall). HRex was recorded at a specific fixed time point(s) throughout the test, calculated as the mean HRex during the last during 10–60 s of the test, or calculated as the mean HRex during the overall test.
10.
Reliability statistics: methodology used to calculate TE and ICC type.
11.
Convergent validity (r): endurance performance indicator and season phase (pre-season or in-season period).

Table 2 Sample and SMFT characteristics of the studies included in the reliability meta-analysis

Full size table

Table 3 Sample and SMFT characteristics of the studies included in the convergent validity meta-analysis

Full size table

Handling Missing Data

We used direct contact details of the corresponding author(s), along with social networks (e.g., ResearchGate, Twitter) to attain missing information including: (1) sample characteristics (e.g., mean age, sample size); (2) SMFT characteristics (duration, exercise or drill/game configuration, etc.); (3) outcome measures, including HRex values recorded during the SMFT, or sample HR_max; and (4) other relevant reliability and/or convergent validity statistical calculation methods or results. Overall, we contacted 12 authors (15 studies) [3, 6, 23,24,25,26,27,28,29,30,31,32,33,34,35] for additional information, and 9 (12 studies) of them replied [3, 6, 23,24,25,26,27, 29, 31,32,33,34], with most providing explanations [23,24,25, 29, 31, 32, 34] or the entire raw data [3, 6, 27, 36]. The remaining information was either excluded from pooled effect estimates analyses, obtained from alternative details provided within the text, or omitted when a meta-regression analysis was performed. All data estimations employed were considered in the qualitative assessment of individual studies (see “Study Qualitative Assessment” section.

In some cases, the original sample size reported in the study was modified when reliability or convergent validity experiments were administered. When the new characteristics (as an example, mean age) relating to the modified sample were not reported, the original information provided was used. With regards to the reliability dataset, two studies [7, 37] that collected SMFT HRex in beats·min⁻¹ also reported HR maximum derived from the maximal version of the same test, meaning the conversion into relative values was straight-forward. One study [26] analysed SMFT HRex similarly (beats·min⁻¹) and did not report HR_max. In addition, ages were reported as U9–U11, U12–U14 and U15–U18. In this case, HR_max was calculated using an estimated equation (HR_max = 208 – 0.7 × mean age [38]), and mean ages were assumed as 9, 12 and 16 years, respectively. Similar HR_max estimation was employed in two previous studies [29, 33]. Graph digitizer software (WebPlotDigitizer, https://apps.automeris.io/wpd/) was used to extract the first two SMFT HRex group means and standard deviations of a test–retest study that administered repeated measures [34].

With respect to the convergent validity dataset, in two studies [7, 30] using SMFT HRex as beats·min⁻¹, one [7] reported HR_max, and for the other [30] we estimated HR_max as described above. One study [35] reported age as youth U19, the estimated mean age was 17, and the same graph digitizer software was used to extract the mean SMFT exercise intensity (HRex). Further, three studies [34, 39, 40] combined sub-groups when convergent validity was analysed. In that case, the weighted mean SMFT HRex and age of the sub-groups were calculated for exercise intensity and mean age, respectively.

Effect Estimates

Mean Difference

The Mean Difference (MD) is an absolute reliability estimate which indicates the degree of variability relating to systematic, group-level bias (i.e., total sample) [15]. As discussed in previous sections, we considered MD effect estimates presented raw units (%HR_max), which were calculated as the change in group means across test–retest [41].

Typical Error of Measurement

Another absolute reliability estimate is Typical Error of Measurement (TE) [15, 16]. The TE is an indicator of random, within-subject variability and is sometimes presented in the form of coefficient of variation (CV%) [15]. This approach is appealing for normalization and interpretation of variability, especially across different outcome measures. However, since the primary outcome measure in our meta-analysis is already analysed in relative values (%HR_max), it is not appropriate (or interpretable) to present TE as a CV (i.e., percentage of percentage). Therefore, when a study presented the TE as a CV value, we estimated the former using the following equation:

$$\left( {\frac{{{\text{CV}}}}{100}} \right) \times {\text{grand}}\;{\text{mean}}\;{\text{HRex}}$$

(1)

where grand mean is the mean HRex across test 1 and test 2.

Two common methods for calculating TE are cited in the sport science literature [15, 16], and were identified in our review. The first approach proposed by Atkinson and Nevill [16] uses the ICC in combination with the pooled between-athlete variance (Eq. 2). The second method proposed by Hopkins [42] uses the mean variance of within-athlete change scores (Eq. 3). Since the former approach requires the sample standard deviation and ICC, it is usually easier to extract from an article. However, after performing different simulations with our previous (unpublished) work and the available data in our included studies, and generating fictitious data with similar values, we found that the difference in TE calculated using both methods negligible to almost non-existent (available in the Additional file 3), suggesting that it is appropriate to meta-analyse estimates calculated from either method without any transformation. If the exact method was not specified nor provided after personal communication to the author, or TE was not disclosed in the analysis, we used the Atkinson and Nevill [16] method (i.e., Eq. 2), where possible [35].

$${\text{TE}} = {\text{SD}} \times \sqrt {1 {-} {\text{ICC}}}$$

(2)

$${\text{TE}} = \frac{{{\text{SD}}_{\Delta } }}{{\sqrt {2} }}$$

(3)

where SD_Δ is the standard deviation of individual change scores (test 2–test 1% point) and SD is the pooled between-athlete standard deviation observed in test 1 and test 2.

Intraclass Correlation Coefficient

The ICC represents reproducibility in the rank order of athletes over repeated measures (i.e., relative reliability) [43]. In most of our eligible studies, two types of ICC were identified: those describing absolute agreement in a single measure from a two-way random effects model (ICC_2,1), and those describing consistency in a single measure from a two-way mixed-effects model (ICC_3,1) [43], whereas in other studies ICC type was not clearly specified or attainable (Table 2). Based on the comparisons used in the TE data synthesis, the differences between ICC types were negligible (Additional file 3), and we thereby treated all data the same. We acknowledge that there are conceptual differences between ICC types [43]. However, it was not possible to examine this effect due to the low number of estimates in some levels (e.g., ICC_2,1, n = 3 studies).

Pearson’s Correlation Coefficient (r)

We elected to extract Pearson’s product-moment correlation coefficient (r) as the principal statistic estimate to describe the magnitude of association between SMFT HRex and maximal endurance performance [44]. As demonstrated in Table 3, most reference tests included field-based, intermittent incremental protocols with change of direction (COD) such as the Yo-Yo Intermittent Recovery Test Level 1 (Yo-YoIR1) [45] or 2 (Yo-YoIR2) [46], while other individual studies involved continuous incremental protocols, including the Vam-Eval test [34] and maximal oxygen uptake in laboratory conditions [47].

Statistical Methods and Data Analysis

Synthesis of Effect Estimates Prior and Post Analysis

An overview of calculations for effect estimates and their sampling variance is presented in Table 4. The MD estimate and its sampling variance were calculated from the test–retest statistics of sample size (n), group means and standard deviations while assuming equal sampling variances within groups [41]. The ICC and r estimates and their sampling variances were converted to Fisher’s z values to approximate normally distributed data [41, 48], and subsequently back-transformed for analyses interpretations of the pooled estimates (intercept-only model). To our knowledge, there is no documented approach for a meta-analysis of TE. Therefore, we used the method proposed by Nakagawa et al. [49] for meta-analysis of variation. This method uses a log-transformed standard deviation (adjusted for sample size), along with its sampling variance for each individual group [49]. For obtaining the pooled estimate, the results were back-converted to their original values, including the bias correction for sample size. It is important to note that considering that the transformation of meta-regression coefficients (full-model) back into their original values is not straight-forward and may result in erroneous interpretations of the modifying effects, coefficients are presented in Fisher’s z-scale (ICC and r) and log-transformed (TE) values. To facilitate models' interpretations, we constructed meta-regression bubble plots, presenting the predicted values and their corresponding confidence and prediction interval bounds.

Table 4 Effect estimates and their sampling variances

Full size table

Overall Meta-Analysis

All data analyses were conducted using the metafor [50] and clubSandwich [51] packages in the R studio environment (version 1.4.1106) [52]. The datasets and analysis codes are openly available in the Additional files (https://osf.io/mqnt9/). In most of the included studies, we were able to extract more than a single effect size. Multiple effect sizes were derived from different sub-groups within studies, but more frequently from a variety of SMFT characteristics (e.g., SMFT protocols, duration or intensities) within sub-groups. In addition, some of the reliability studies [6, 24, 46, 53, 54] included more than two similar consecutive SMFT and provided the results from each matched test–retest. Two studies examining convergent validity [7, 45] implemented within-group repeated measures of the same tests at different time-points across the season (Table 2 and 3 provide the sources of multiple effect estimates within studies).

Given the hierarchical structure in our datasets (multiple effect estimates are nested within groups), as well as the likelihood of statistical dependency, we employed a more recently developed approach using multilevel mixed-effects meta-analysis and robust variance estimation with adjustments for small-samples [55]. This approach allows exploration of the variance present across multiple levels and, hence, within- and between-group variance [56], and provides a robust method for meta-analysis while accounting for the dependency among effect estimates derived from common samples [57]. In such cases, replacing sampling variance with the entire ‘V matrix’, indicating the variance–covariance matrix of the estimates, can further account for the correlation between effect estimates [55, 58]. As it was not possible to attain the correlation between effect estimates drawn from the same participants in most of the included studies, our analyses were conducted using an assumed constant correlation of ρ = 0.6 [55]. In Additional file 4: Table S3 we report sensitivity analyses, whereby a range of correlation values were used to evaluate the influence of the changes in the within-group covariance on the pooled estimates and its variance components. Collectively, these analyses showed identical pooled estimates and nearly similar variance components.

Heterogeneity and Modifying Effects

To describe the extent of heterogeneity, we calculated restricted maximum likelihood estimates of the within- and between-group variances (SD; tau [τ]) [59], as well as the I² statistic in each level [60] which implies the percentages of variance which are due to study heterogeneity rather than sampling error [60]. Log-likelihood-ratio tests were also computed to determine whether the within- and between-group variances were significant [61]. We then conducted regression analyses to examine possible source of heterogeneity, including parameters related to SMFT (e.g., exercise intensity) or athlete’s (e.g., mean age) characteristics. Modifying effects with at least 6 independent samples and overall 10 effect estimates in each level [62] were added separately to the models. These were analysed independently, rather than in combination, due to a wide range of sample sizes for different modifying effects that resulted in a relatively low number of estimates for the requirement of meta-regression [62].

Categorical effects were evaluated as the differences between levels and consisted of athlete’s age category (youth, senior), HRex collection method (reliability: fixed, mean range, mean overall; convergent validity: fixed, mean range) and competition level (elite, sub-elite; for convergent validity only). Continuous effects included athlete’s mean age (years), SMFT exercise intensity (expressed as HRex) and SMFT duration (minutes), and were explored as the changes associated with pre-defined clinically relevant values: (1) for mean age, we explored the effect of 3-year change in age; (2) SMFT exercise intensity effect was evaluated as the changes associated with 5-point % change in HRex; and (3) the effect of SMFT duration was investigated in reference to 2-min change in duration. Finally, model strength was assessed as the percentage of variance explained by the modifying effect with pseudo-R² statistic (i.e., intercept-only versus full-model) [41]. When variances in both levels were identified, we scrutinised R² in the within (R²₂) and between (R²₃) group variance separately [56].

Inferences

Uncertainty in meta-analysis and regression estimates was expressed using 95% compatibility (confidence) intervals (CI), representing ranges of values compatible with our models and assumptions [63]. We also derived 95% prediction intervals (PI), which convey the likely range of the true measurement properties in similar future studies. Similar to others [64], we sought to avoid dichotomizing the presence or absence of an effect using traditional null hypothesis testing. Instead, we considered the entire range of 95% CI, relating mostly to the point estimate and uncertainty of predicted values. Effects were then discussed in terms of their compatibility (or coverage) with practically significant or practically equivalent values. Qualitative inferences were made for standardised effects only (i.e., ICC and r) using thresholds of > 0.99, extremely high; 0.90–0.99, very high; 0.75–0.90, high; 0.50–0.75, moderate; 0.20–0.50, low; < 0.20, very low for ICC [65] and 0.10 small; 0.30 moderate; 0.50 large; 0.70 very large; and 0.90 extremely large for r [66]. Thresholds for the smallest meaningful correlation (i.e., 0.1 and 0.2 for r and ICC, respectively) were used to declare modifying effects as substantial [44]. We elected to not make formal decisive interpretation on the substantiality of MD and TE because these are presented in units of the outcome measure (i.e., HRex), and we are as yet unaware of the true, minimum practically important difference for team sport athletes during SMFT.

Study Qualitative Assessment

Qualitative assessment of individual studies was conducted using the Risk of Bias Assessment Tool for Non-randomised Studies (RoBANS) [67, 68]. The RoBANS comprises six-dimension criteria which were adapted to answer questions that may influence the reliability or convergent validity results reported in the included studies. Answer categories of ‘low’, ‘unclear’ and ‘high’ risk of bias were assigned in each domain. To promote consistency in how domains were evaluated, three reviewers (TS, RL, SJM) created a decision rule for each criterion. Assessments were then performed by one reviewer (TS), with two other authors (RL, SJM) checking for accuracy. A summary detailing the quality assessment criteria in each domain is provided as Additional file 5: Table S1.

Small-Study Effects and Influence Analysis

Small-study effects and asymmetry were visually inspected using funnel plots [69]. To confirm our visual impression, the Begg’s rank correlation [70] and Egger’s regression (by fitting the square root of the sampling variance as a moderator) [69] tests were employed. Further, to assess the influence of each effect size on the summary effect and heterogeneity, Cook’s distance analysis [71] and Baujat plots [72] were obtained. A standard rule of thumb is that Cook distance values greater than three times the mean were used [73], and potential outliers were excluded from the dataset to examine if the change in the summary estimates turned substantial.

Results

Selected Reliability Studies and Characteristics

Table 2 summarises the characteristics of included reliability studies. The overall dataset included 29 samples derived from 21 studies. The MD dataset consisted of 69 effect sizes nested within 25 samples (mean = 2.8 [effect sizes per sample], median = 2, range of 1–24), derived from 18 studies. The TE dataset included 52 effect sizes nested within 25 samples (mean = 2.1, median = 1, range of 1–12), from 18 studies. The ICC dataset included 54 effect sizes nested within 27 samples (mean = 2.0, median = 1, range of 1–12), from 20 studies. The total number of team sport athlete inclusions was 406 in MD, 411 in TE and 480 in ICC datasets. The studies involved male players only, and the majority originated from soccer (76%), followed by rugby codes (10%), Australian football (7%), ice hockey and handball (one study). Elite players were included in 76% of the studies, with a further 16% being mixed and 8% being sub-elite. The proportion of youth and senior athletes was 55% and 45%, respectively. SMFT categories were distributed as follows: intermittent incremental (59%), continuous fixed (24%) and intermittent variable (17%).

Selected Convergent Validity Studies and Characteristics

Table 3 summarises the characteristics of our included convergent validity studies. The overall dataset included 73 effect sizes nested within 29 samples (mean = 2.5, median = 2, range of 1–10), derived from 20 studies, with a total number of 1055 team sport athlete inclusions. Most studies (90%) involved male players, and the majority originated from soccer (83%), ice hockey and rugby codes (6.5% each) and Australian football (one study). Elite players were included in 61% of the studies, with a further 21% being sub-elite and 18% being mixed. Unlike reliability, 62% and 38% of studies were conducted on senior and youth athletes, respectively. Most (73%) studies were conducted during the in-season phase, and the distribution of SMFT categories was similar: intermittent incremental modality (76%), continuous fixed (17%) and intermittent variable (7%).

Overall Meta-Analyses

Table 5 provides a summary of the reliability and convergent validity meta-analyses, with forest plots of the weighted points estimates, 95% CI and PI presented in Additional file 6: Fig. S1–4. The pooled estimates derived from the reliability analyses indicated overall good absolute and high relative reliability. The pooled estimate derived from r dataset indicated an inverse, large relationship between SMFT HRex and endurance performance measures.

Table 5 Meta-analysis of measurement properties

Full size table

Heterogeneity

We found no evidence for heterogeneity in the MD meta-analysis (τ₂ = 0.09 [95% CI 0.00 to 0.49], τ₃ = 0.00 [95% CI 0.00 to 0.72], I²₂ = 0.5% [95% CI 0.0% to 14.2%] and I²₃ = 0% [95% CI 0.0% to 26.4%]) and therefore did not employ meta-regression analysis. In contrast, notable heterogeneity was observed in all other reliability estimates: TE (τ₂ = 0.23 [95% CI 0.15 to 0.32], τ₃ = 0.18 [95% CI 0.00 to 0.34], I²₂ = 43.4% [95% CI 42.8% to 76.8%], I²₃ = 28.4% [95% CI 0.0% to 78.3%]), and ICC (τ₂ = 0.26 [95% CI 0.18 to 0.36], τ₃ = 0.20 [95% CI 0.00 to 0.40], I²₂ = 39.4% [95% CI 35.0% to 68.3%], I²₃ = 24.4% [95% CI 0.0% to 72.5%]). A greater source of within- versus between-group variance was evident in both TE and ICC. Heterogeneity in r was present in the within-group level only (τ₂ = 0.16 [95% CI 0.11 to 0.21], τ₃ = 0.00 [95% CI 0.00 to 0.14], I²₂ = 48.3% [95% CI 33.0% to 62.9%], I²₃ = 0% [95% CI 0.0% to 42.6%]).

Meta-Regression Analysis

Figures 3, 4 and 5 illustrate meta-regression bubble plots of the modifying effects, with different colours indicative of SMFT category. The full meta-regression results are presented in Table S2 in Additional file 4. The TE regression coefficients ranged between ln_diff = − 0.27 to 0.06, which were approximately equal to differences of 0.01 to 0.45 when comparing between TE predicted values (Fig. 3, panel A–E). The ICC regression coefficients ranged between z_diff = − 0.10 to 0.16 and resulted in approximate differences of 0.00 to 0.03 between ICC predicted values (Fig. 4, panel A–E). The r regression coefficients ranged between z_diff = − 0.08 to 0.10, indicating approximate correlation differences of − 0.07 to − 0.02 between r predicted values (Fig. 5, panel A–F). Collectively, none of these modifying effects appeared to have a meaningful influence on the reliability or convergent validity estimates. There was, however, a considerable reduction in the overall heterogeneity owing to the inclusion of some of the variables.

Study Qualitative Assessment

Risk of bias assessment is demonstrated in Tables 2 and 3, including the sum of each risk category (low, unclear or high) in individual articles. A graphical overview of the overall risk in each domain from all studies is illustrated in Fig. 6.

Small-Study Effects

No indications for small-study effects were observed in the funnel plots (Fig. 7). Begg’s rank correlation and Egger’s regression tests revealed no significant asymmetries for all estimates. Overall, the exclusion of potential outliers in all datasets did not have a practically meaningful influence on the results obtained in the original models (Additional file 7).

Discussion

In this review, we sought to meta-analyse the measurement properties of SMFT HRex, while additionally examining the modifying effects of athlete and protocol characteristics. Reliability and convergent validity data were obtained from 1218 and 3032 individual observations, drawn from 522 and 1055 team sport athlete inclusions and nested within 29 and 29 independent samples, respectively. We found that the overall TE of SMFT HRex was 1.6% points, with 95% CI ranging from 1.4 to 1.9% points. The width of the 95% PI suggested that future studies may expect to observe TE ranges between 0.9 to 3.0% points. The ICC overall estimate and 95% CI (0.88 [95% CI 0.84 to 0.91]) indicated high to very high relative reliability, and 95% PI ranged from 0.59 to 0.97, implying moderate to very high magnitudes in future studies. We also observed a large inverse relationship (r = − 0.58) between SMFT HRex and endurance test results, suggesting its validity as a marker of maximal endurance performance when comparing between athletes. Correlation magnitudes remained consistent across the 95% CI (− 0.54 to − 0.62, large), whereas the interpretation of 95% PI suggested that future studies are likely to observe moderate to very large magnitudes (− 0.31 to − 0.77). Meta-regression analyses suggested that variables related to athlete or protocol characteristics do not have a meaningful effect on these measurement properties.

Our results support previous conclusions [24,25,26, 34, 74] suggesting no effect of age on the magnitude of SMFT HRex reliability and convergent validity. Although a trend for slightly improved ICC, TE and r was observed in older players (Fig. 3–5, panel A and D), the differences were not substantial (Table S2 in Additional file 4). Overall, these results imply that the high-degree of SMFT HRex reliability and convergent validity can be confidently applied among a wide range of age groups, at least bounded by the ages we examined. In this respect, when comparisons were made directly between age extremities of the data (i.e., 10 vs 30 years), there did appear to be meaningful improvements in measurement properties. However, this comparison (e.g., 20 years difference) is likely not relevant in practice, as it spans across a team sport career.

We attempted to enhance the understanding of athlete’s performance status by categorising the studies in reference to their competition level (elite versus sub-elite), although this was only possible for the convergent validity dataset. While the relationship between SMFT HRex and established endurance tests appeared slightly larger in athletes who compete in the highest level of their sport compared to athletes who compete in sub-divisions (Fig. 5, panel F), the difference was negligible (sub-elite to elite: r = − 0.56 versus − 0.62, z_diff = − 0.08 [95% CI − 0.26 to 0.09]). Additionally, while this effect could not be explored in the reliability datasets, the results attained in the individual reliability studies did not indicate any potential trend [25, 28]. These observations would further suggest that athlete characteristics do not appear to meaningfully affect SMFT HRex measurement properties.

We examined the modifying effects of a variety of SMFT characteristics, including the SMFT duration, intensity, and HRex collection method. Collectively, all these modifying effects were trivial or had no clearly meaningful effect. Nonetheless, these findings are practically useful to guide SMFT programming factors in research and practice. Besides being non-exhaustive, another advantage of SMFT is their short duration. The datasets in our meta-analysis included SMFT ranging between 2 and 12 min. Several previous reviews dealing with monitoring HRex [4, 9], together with individual studies investigating the effect of SMFT duration in our meta-analysis [25, 37, 46, 54], have suggested a duration of 3–4 min considered the minimum for HR to attain steady-state during submaximal exercise [4, 75]. In accordance with this physiological rationale, we found that a 4-min SMFT is adequate for obtaining comparable reliability and convergent validity results, with no further meaningful improvements beyond.

Although part of our included studies [25, 46], taken together with previous research in endurance athletes [76, 77], have highlighted an improved absolute reliability (i.e., reduced TE) at higher SMFT intensities, one of the key findings of the current meta-analysis was that SMFT exercise intensity (expressed as HRex) had no effect upon absolute and relative reliability estimates (Figs. 3 and 4, panel B). Indeed, we found somewhat improved estimates of absolute reliability (TE) in lower versus higher intensities, but these differences were clearly trivial (ln_diff = 0.06 [95% CI –0.02 to 0.15] for every 5-point % increase in exercise intensity). For example, computing the predicted TE estimates at 80% and 85% HR_max indicated 1.48 (95% CI 1.24 to 1.76) and 1.57 (95% CI 1.37 to 1.80) % points, respectively. Likewise, the predicted ICC values for the same intensities were 0.91 (95% CI 0.88 to 0.94) and 0.89 (95% CI 0.86 to 0.92), respectively. Therefore, while marginal differences are evident, we deem them to have no practical substantiality and might therefore be considered trivial.

While higher SMFT HRex values are hypothetically expected to strengthen the relationships with fitness test performance outcomes, because they are collected at intensities closer to exercise cessation, studies examining this effect have reported disparate findings. For example, some of the included studies reported progressive improvements in the relationships with higher SMFT HRex values [35, 74, 78], while others showed the opposite [46, 79] or no differences (comparable correlation magnitudes) [6]. Our meta-regression analysis demonstrated that this effect was mostly compatible with no meaningful change in the magnitude of r (z_diff = − 0.05 [95% CI − 0.16 to 0.05] for every 5-point % increase in HRex; Fig. 5, panel B). To illustrate, predicted estimates for SMFT intensities at 80% and 85% HR_max are − 0.52 (95% CI − 0.64 to − 0.38) and − 0.56 (95% CI − 0.62 to − 0.49), respectively, with only the lower CI bounds indicating a possibly small effect. These findings have practical importance and suggest that practitioners do not necessarily require SMFT that elicit higher intensities for adequate HRex reliability and convergent validity. This may be of particular relevance considering SMFT are often administered as part of the warm-up of a training [3, 6, 34, 53], in which an exposure to high intensities maybe considered impractical.

The studies analysed herein used three main collection methods: fixed time point(s) during the test, the mean HR over particular time frames before test cessation (e.g., last 30, 60 s), or mean HR during the overall test (particularly during intermittent variable protocols such as passing drills or small-sided games (SSG); refer to Tables 2 and 3). While using the mean HR during the overall test in intermittent variable (e.g., SSG) SMFT seems a reasonable approach, the rationale for using a particular method and/or time span in other SMFT protocols was not provided in the included studies. While we observed trivial differences between methods (Figs. 3, 4, 5, panel E), mean HR over a particular time-frame before the SMFT cessation maybe deemed preferable [4, 9] since it minimises a potential measurement noise such as HR spikes owing to signal error. In view of the above, it should be noted that in some of the studies [28, 45, 78, 79] which were categorised as ‘fixed’ collection method, HRex was recorded in 2–5 s intervals by default. Moreover, while the collection method used could, in theory, alter HRex values when different intermittent SMFT are administered (see discussion on SMFT category in the following paragraphs), future research may be necessary to elucidate whether different collection approaches should be selected based upon the SMFT category and subsequently the protocol used.

While the present review could not examine the effect of SMFT category, given the small number of independent studies adopting protocols other than those classified as an intermittent incremental, a theoretical discussion is warranted. A fundamental assumption inherent to the use of HRex as a key monitoring measure in SMFT is based on its ability to provide a valid marker of within-athlete relative exercise intensity due to its linear relationship with oxygen uptake at a wide spectrum of submaximal intensities [4, 9]. From a physiological perspective, an essential component of this assumption is that the exercise regimen should be continuous (intends to elicit steady-state) [4, 80]. Interestingly, most of our included studies administered SMFT protocols characterised as intermittent (incremental), including shorter versions of the most commonly used intermittent shuttle fitness tests such as Yo-YoIR1 and 30-15_IFT. We assume that these particular SMFT were chosen since most of the studies investigated both reliability and convergent validity and thereby match the characteristics of the submaximal and maximal tests or administer them simultaneously. Furthermore, due to the absence of a criterion-standard of evidence-based SMFT protocol(s) [8], it is perhaps more logical to administer shorter versions of established field tests. However, considering the nature of such assessments, which include intermittent running bouts separated by rest periods, the final HRex result is not entirely derived from steady-state exercise and may be influenced by factors related to the reactivation of the cardiac parasympathetic system such as heart rate recovery following each exercise bout. In addition, the multiple CODs required during the test may add a neuromuscular component which may influence the cardiovascular response [75].

To illustrate this concept, one of the most investigated SMFT is the 6-min Yo-YoIR1 [8]. This test requires repeated 2 × 20 m incremental running bouts (shuttles with 180° CODs), interceded by 10 s rest periods [2]. At 6-min of the test, the players are required to run at a mean velocity of 14.5 km h⁻¹ (~ 10 s for each running bout and work/rest ratio of 1:1). Assuming that HRex is collected in the last 30 s of the test, the final value includes the mean HRex during 20 s of activity and 10 s of recovery. Another example is the 30-15_IFT, which includes a fixed work/rest ratio of 2:1, and a longer absolute recovery period between running bouts (15 s) [21]. Assuming a similar volume (6 min) and collection method (mean HRex during the last 30 s), the final HRex value would be derived from 15 s of activity and recovery, respectively. Albeit speculative, this can lead to a less precise and reproducible HRex values considering that heart rate recovery is associated with an inferior degree of reliability [8], and indeed, studies using continuous protocols generally observed better reliability outcomes. Conversely, it could also be argued that intermittent incremental SMFT are preferable given their greater ecological validity to team sports [81] which subsequently can enhance SMFT HRex convergent validity. A primary purpose of future research should therefore include a more explicit examination of continuous versus intermittent SMFT protocols and their link to SMFT HRex measurement properties.

Another aspect of SMFT categories is the use of intermittent variable protocols (SSG or passing drill formats). Despite the relatively large number of the total reliability estimates, only five independent studies met inclusion criteria (four SSG [27, 33, 82, 83] and one passing drill [27]). Collectively, these studies suggested further exploration of such SMFT as they showed comparable absolute and relative reliability values with the values observed in other SMFT generic categories. Nonetheless, considering the range of contextual factors (e.g., technical and tactical elements, exercise structure) that influence locomotor variability [27, 81, 82] and the increased neuromuscular demands [75], caution is necessary in their interpretation. With regards to convergent validity, only two independent studies administering SSG met the inclusion criteria. Interestingly, while one study [82] among senior professional soccer players using 5v5 SSG observed a significant large relationship (r = − 0.56) between HRex and the distance covered in Yo-YoIR1, a different study [32] using 3v3 SSG in a similar cohort observed no association (r = − 0.22 to − 0.25) with the final velocity achieved in the 30-15_IFT. Therefore, before using SSG as a standardised SMFT protocol, more research addressing its validity is warranted. Further, to appropriately quantify the influence of locomotor variables upon HRex in the longer-term, we recommend the use of linear regression techniques and avoiding ratios (i.e., SMFT HRex divided by an external intensity parameter) [84].

There are several limitations to our meta-analysis that warrant acknowledgment. First, although we adopted strict inclusion criteria, all of the included studies were observational and conducted with less rigorous methodological design and reporting standards (e.g., STROBE). Second, while it is well known that the main outcome measure (HRex) is influenced by a large number of confounding variables [8] (e.g., environmental conditions, training loads, circadian effect, menstrual cycle stage), only some of the included studies (8 of 30 overall studies; 27%) provided an adequate amount of information regarding how all these confounders were controlled prior and during the test(s) (refer to Fig. 6 ‘confounding variables’ and in Additional file 5: Table S1 for a detailed description). This information is important as such factors may have influenced outcomes of the studies included in our analyses, in particular the reliability findings. For this reason, future studies should report these data descriptively and if needed, account for the factors in the analysis (for example, adjusted HRex based on ‘heat index’ [85]). Third, during the screening process, we identified more potential studies for inclusion. However, since their effect sizes could not be appropriately estimated or were not provided by personal contact, we were not able to include them. With this in mind, we decided to employ a considerably careful evaluation of missing data and selective reporting in the included studies (Fig. 6 and Table S1 in Additional file 5, dimension 5 and 6). As discussed earlier, we also employed different methods to estimate data related to SMFT and/or athlete’s characteristics, all of which could have introduced some errors. Finally, our unbalanced datasets with regard to sex and sport limit the generalization of the findings to female athletes and athletes competing in a particular team sport.

Practical Implications

We propose several practical implications for researchers and partitioners wishing to use SMFT HRex with a view to determining athlete’s physiological state.

Protocol Selection

Exercise Regimen

While it is as yet unclear whether SMFT protocol category influences different measurement properties of HRex, we recommend the use of continuous protocols until new evidence is available to suggest otherwise. Continuous fixed protocols are likely the easiest to implement into team sport settings, with pitch markings/poles used to demarcate distance and audio cues to direct the running speed (e.g., dictated by a whistle, metronome, pacing tool). When shuttle courses are used, it is suggested to implement longer distances (where applicable) to reduce the number of total CODs and consequently neuromuscular loading.

For example, on a FIFA standard football (soccer) pitch (105 m in length and 68 m in width), performing a 4 min continuous shuttle of pitch width in 20 s repetitions would result in a running speed of 12.68 km h⁻¹ and a total number of 12 CODs, assuming the time taken to complete a 180° COD is 0.7 s [86]. Here, it may be pragmatic to marginally adjust the running speed so that the time taken to complete the shuttle is an integer. Another option is to administer a ‘digital figure of eight’ course (e.g., 15 s for 50 m line portions at a mean velocity of 12 km h⁻¹), with smaller groups starting at course landmarks. In this approach athletes are exposed to lower neuromuscular loading due to lower COD angles (90° left and right).

Exercise Intensity

The current results demonstrated that exercise intensity does not appear to meaningfully affect HRex validity or reliability. Therefore, we recommend the prescription of fixed intensities that serve the training purposes (e.g., integrated into the warm-up and at what stage, start versus end), and must be easily repeated over time (i.e., individual constraints and resources). Running speeds between 10 and 14 km h⁻¹ depending on age, level and sport appear reasonable (likely to elicit HRex between ~ 75 and 85% HR_max in all team members).

Exercise Volume

Protocol durations of 3–4 min have the strongest theoretical and empirical rationale (refer to earlier section discusses SMFT duration in “Discussion” section).

HRex Collection Method

During continuous protocols, practitioners are advised to use the mean HR over the last 30–60 s of the test, as athletes have most likely reached a physiological (primarily HR) steady-state and this time window potentially minimises measurement noise related to the wearable device. When using intermittent variable protocols, the mean HRex throughout the overall test is more logical to retain, since exercise intensity is sporadically variable by definition. Regardless of the method used, it is important to visually inspect the HR trace for outliers prior to analysis.

Conceptual Interpretation of the Data

SMFT HRex can provide a valid marker of endurance performance when comparing between athletes in the same testing bout. That is, an athlete with a lower SMFT HRex might typically be expected to perform better on a standard maximal test on that day versus an athlete with a higher SMFT HRex (and vice versa). This does not necessarily extrapolate to within-athlete comparisons, however, which are usually the focus of training monitoring. Future studies should therefore examine the validity of SMFT HRex to track intra-athlete changes in aerobic capacity.

Statistical Interpretation of the Data

Under standardised training setting and environmental conditions, SMFT HRex TE can be considered between 1–2% points. The TE can be used to establish thresholds for changes that are beyond the measurement error, which may be useful when interpreting the data to inform decision making regarding an athlete’s physiological state. For example, using our TE pooled estimate of 1.6% points and a z-distribution, confidence limits at 80%, 90%, 95% and 99% for an individual change in SMFT HRex are ± 2.9, ± 3.7, ± 4.4 and ± 5.8% points, respectively. These limits are synonymous with the minimum detectable change (MDC), which is the smallest change that can be detected beyond measurement error (which includes normal biological variation in SMFT HRex) [87]. Note that this is not the minimum practically important difference (MPID) or smallest worthwhile change and should not be used as such. The MPID requires separate discussion and can be combined with the MDC to establish both ‘true’ and meaningful changes in SMFT HRex [88].

Conclusion

Our meta-analysis is the first to provide quantitative syntheses of SMFT HRex reliability and convergent validity. Results demonstrate good absolute and high relative reliability, as well as a large convergent validity, which are not affected by athlete or protocol-related characteristics. Practitioners can use these findings to inform SMFT protocol selection (“Practical Implications” section and Fig. 8), interpret data and identify true physiological effects in individual athletes. Future research should focus on SMFT HRex sensitivity to physiological state (within-athlete changes in aerobic capacity), as well as the methodological elements related to SMFT characteristics, HRex collection and analysis approaches that may influence key measurement properties.

Availability of data and materials

Supplementary materials for this paper are also available on the Open Science Framework at https://osf.io/mqnt9/.

Abbreviations

SMFT:: Submaximal fitness tests
HRex:: Exercise heart rate
TE:: Typical error of measurement
ICC:: Intraclass correlation coefficient
MD:: Mean difference
r :: Pearson’s correlation coefficient
Yo-YoIR:: Yo-Yo intermittent recovery test
30-15_IFT :: 30-15 Intermittent fitness test
PRISMA:: Preferred Reporting Items for Systematic reviews and Meta-Analyses
HR_max :: Heart rate maximum
CV:: Coefficient of variation
SD:: Standard deviation
CI:: Confidence intervals
PI:: Prediction intervals
RoBANS:: Risk of bias assessment tool for non-randomised studies
COD:: Change of direction
MDC:: Minimum detectable change
MPID:: Minimum practically important difference

References

Akenhead R, Nassis GP. Training load and player monitoring in high-level football: current practice and perceptions. Int J Sports Physiol Perform. 2016;11:587–93. https://doi.org/10.1123/ijspp.2015-0331.
Article PubMed Google Scholar
Bangsbo J, Iaia FM, Krustrup P. The Yo-Yo intermittent recovery test: a useful tool for evaluation of physical performance in intermittent sports. Sports Med. 2008;38:37–51. https://doi.org/10.2165/00007256-200838010-00004.
Article PubMed Google Scholar
Scott TJ, McLaren SJ, Caia J, et al. The reliability and usefulness of an individualised submaximal shuttle run test in elite rugby league players. Sci Med Football. 2018;2:184–90. https://doi.org/10.1080/24733938.2018.1448937.
Article Google Scholar
Buchheit M. Monitoring training status with HR measures: do all roads lead to Rome? Front Physiol. 2014. https://doi.org/10.3389/fphys.2014.00073.
Article PubMed PubMed Central Google Scholar
Jeffries AC, Marcora SM, Coutts AJ, et al. Development of a revised conceptual framework of physical training for use in research and practice. Sports Med. 2021. https://doi.org/10.1007/s40279-021-01551-5.
Article PubMed PubMed Central Google Scholar
Hulin BT, Gabbett TJ, Johnston RD, et al. Sub-maximal heart rate is associated with changes in high-intensity intermittent running ability in professional rugby league players. Sci Med Football. 2019;3:50–6. https://doi.org/10.1080/24733938.2018.1475748.
Article Google Scholar
Fanchini M, Castagna C, Coutts AJ, et al. Are the Yo-Yo intermittent recovery test levels 1 and 2 both useful? Reliability, responsiveness and interchangeability in young soccer players. J Sports Sci. 2014;32:1950–7. https://doi.org/10.1080/02640414.2014.969295.
Article PubMed Google Scholar
Shushan T, McLaren SJ, Buchheit M, Scott TJ, Barrett S, Lovell R. Submaximal fitness tests in team sports: a theoretical framework for evaluating physiological state. Sports Med. 2022;52:2605–26. https://doi.org/10.1007/s40279-022-01712-0.
Article PubMed PubMed Central Google Scholar
Schneider C, Hanakam F, Wiewelhove T, et al. Heart rate monitoring in team sports—a conceptual framework for contextualizing heart rate measures for training and recovery prescription. Front Physiol. 2018;9:639. https://doi.org/10.3389/fphys.2018.00639.
Article PubMed PubMed Central Google Scholar
Bosquet L, Merkari S, Arvisais D, et al. Is heart rate a convenient tool to monitor over-reaching? A systematic review of the literature. Br J Sports Med. 2008;42:709–14. https://doi.org/10.1136/bjsm.2007.042200.
Article CAS PubMed Google Scholar
Buchheit M, Voss SC, Nybo L, et al. Physiological and performance adaptations to an in-season soccer camp in the heat: associations with heart rate and heart rate variability: monitoring responses to training in the heat. Scand J Med Sci Sports. 2011;21:e477–85. https://doi.org/10.1111/j.1600-0838.2011.01378.x.
Article CAS PubMed Google Scholar
Buchheit M, Simpson BM, Garvican-Lewis LA, et al. Wellness, fatigue and physical performance acclimatisation to a 2-week soccer camp at 3600 m (ISA3600). Br J Sports Med. 2013;47:i100–6. https://doi.org/10.1136/bjsports-2013-092749.
Article PubMed Google Scholar
Brink MS, Visscher C, Schmikli SL, et al. Is an elevated submaximal heart rate associated with psychomotor slowness in young elite soccer players? Eur J Sport Sci. 2013;13:207–14. https://doi.org/10.1080/17461391.2011.630101.
Article Google Scholar
Robertson S, Kremer P, Aisbett B, et al. Consensus on measurement properties and feasibility of performance tests for the exercise and sport sciences: a Delphi study. Sports Med Open. 2017;3:2.
Article PubMed PubMed Central Google Scholar
Hopkins WG. Measures of reliability in sports medicine and science. Sports Med. 2000;30:1–15. https://doi.org/10.2165/00007256-200030010-00001.
Article CAS PubMed Google Scholar
Atkinson G, Nevill AM. Statistical methods for assessing measurement error (reliability) in variables relevant to sports medicine. Sports Med. 1998;26:217–38. https://doi.org/10.2165/00007256-199826040-00002.
Article CAS PubMed Google Scholar
Matheson GJ. We need to talk about reliability: making better use of test–retest studies for study design and interpretation. PeerJ. 2019;7:e6918.
Article PubMed PubMed Central Google Scholar
Krabbe PFM. The measurement of health and health status: concepts, methods and applications from a multidisciplinary perspective. 1st ed. Amsterdam: Elsevier; 2017.
Google Scholar
Thorpe RT, Strudwick AJ, Buchheit M, et al. Tracking morning fatigue status across in-season training weeks in elite soccer players. Int J Sports Physiol Perform. 2016;11:947–52. https://doi.org/10.1123/ijspp.2015-0490.
Article PubMed Google Scholar
Page MJ, Moher D, Bossuyt PM, et al. PRISMA 2020 explanation and elaboration: updated guidance and exemplars for reporting systematic reviews. BMJ. 2021;372:160. https://doi.org/10.1136/bmj.n160.
Article Google Scholar
Buchheit M. The 30–15 intermittent fitness test: 10 year review. Myorobie J. 2010;1:278.
Google Scholar
Lemmink KAPM, Visscher C, Lambert MI, et al. The interval shuttle run test for intermittent sport players: evaluation of reliability. J Strength Cond Res. 2004;18:821–7. https://doi.org/10.1519/13993.1.
Article PubMed Google Scholar
Buchheit M, Millet GP, Parisy A, et al. Supramaximal training and postexercise parasympathetic reactivation in adolescents. Med Sci Sports Exerc. 2008;40:362–71. https://doi.org/10.1249/mss.0b013e31815aa2ee.
Article PubMed Google Scholar
Deprez D, Fransen J, Lenoir M, et al. The Yo-Yo intermittent recovery test level 1 is reliable in young high-level soccer players. Biol Sport. 2015;32:65–70. https://doi.org/10.5604/20831862.1127284.
Article CAS PubMed Google Scholar
Deprez D, Coutts AJ, Lenoir M, et al. Reliability and validity of the Yo-Yo intermittent recovery test level 1 in young soccer players. J Sports Sci. 2014;32:903–10. https://doi.org/10.1080/02640414.2013.876088.
Article PubMed Google Scholar
Hulse M, Morris J, Hawkins R, et al. A field-test battery for elite, young soccer players. Int J Sports Med. 2012;34:302–11. https://doi.org/10.1055/s-0032-1312603.
Article PubMed Google Scholar
Iacono AD, Beato M, Unnithan V. Comparative effects of game profile-based training and small-sided games on physical performance of elite young soccer players. J Strength Cond Res. 2019. https://doi.org/10.1519/jsc.0000000000003225.
Article Google Scholar
Ingebrigtsen J, Brochmann M, Castagna C, et al. Relationships between field performance tests in high-level soccer players. J Strength Cond Res. 2014;28:942–9. https://doi.org/10.1519/jsc.0b013e3182a1f861.
Article PubMed Google Scholar
Owen C, Jones P, Comfort P. The reliability of the submaximal version of the Yo-Yo intermittent recovery test in elite youth soccer. J Trainol. 2017;6:31–4.
Article Google Scholar
Pereira LA, Abad CCC, Leiva DF, et al. Relationship between resting heart rate variability and intermittent endurance performance in novice soccer players. Res Q Exerc Sport. 2019;90:355–61. https://doi.org/10.1080/02701367.2019.1601666.
Article PubMed Google Scholar
Ryan S, Pacecca E, Tebble J, et al. Measurement characteristics of athlete monitoring tools in professional Australian football. Int J Sports Physiol Perform. 2019. https://doi.org/10.1123/ijspp.2019-0060.
Article PubMed Google Scholar
Younesi S, Rabbani A, Clemente FM, et al. Relationships between aerobic performance, hemoglobin levels, and training load during small-sided games: a study in professional soccer players. Front Physiol. 2021;12:649870. https://doi.org/10.3389/fphys.2021.649870.
Article PubMed PubMed Central Google Scholar
Younesi S, Rabbani A, Manuel Clemente F, et al. Session-to-session variations of internal load during different small-sided games: a study in professional soccer players. Res Sports Med. 2021. https://doi.org/10.1080/15438627.2021.1888103.
Article PubMed Google Scholar
Buchheit M, Mendez-Villanueva A, Quod MJ, et al. Determinants of the variability of heart rate measures during a competitive period in young soccer players. Eur J Appl Physiol. 2010;109:869–78. https://doi.org/10.1007/s00421-010-1422-x.
Article PubMed Google Scholar
Bradley PS, Mohr M, Bendiksen M, et al. Sub-maximal and maximal Yo-Yo intermittent endurance test level 2: heart rate response, reproducibility and application to elite soccer. Eur J Appl Physiol. 2011;111:969–78. https://doi.org/10.1007/s00421-010-1721-2.
Article PubMed Google Scholar
Scott TJ, McLaren SJ, Lovell R, et al. The reliability, validity and sensitivity of an individualised sub-maximal fitness test in elite rugby league athletes. J Sports Sci. 2022;9:1–13. https://doi.org/10.1080/02640414.2021.2021047.
Article Google Scholar
Buchheit M, Lefebvre B, Laursen PB, et al. Reliability, usefulness, and validity of the 30–15 intermittent ice test in young elite ice hockey players. J Strength Cond Res. 2011;25:1457–64. https://doi.org/10.1519/jsc.0b013e3181d686b7.
Article PubMed Google Scholar
Tanaka H, Monahan KD, Seals DR. Age-predicted maximal heart rate revisited. J Am Coll Cardiol. 2001;37:153–6. https://doi.org/10.1016/s0735-1097(00)01054-8.
Article CAS PubMed Google Scholar
Vigh-Larsen JF, Beck JH, Daasbjerg A, et al. Fitness characteristics of elite and subelite male ice hockey players: a cross-sectional study. J Strength Cond Res. 2019;33:2352–60. https://doi.org/10.1519/jsc.0000000000003285.
Article PubMed Google Scholar
Francini L, Rampinini E, Bosio A, et al. Association between match activity, endurance levels and maturity in youth football players. Int J Sports Med. 2019;40:576–84. https://doi.org/10.1055/a-0938-5431.
Article PubMed Google Scholar
Borenstein M, editor. Introduction to meta-analysis. Chichester: Wiley; 2009.
Google Scholar
Hopkins WG. Spreadsheets for analysis of validity and reliability. Sportscience. 2015;19:36–45.
Google Scholar
Koo TK, Li MY. A guideline of selecting and reporting intraclass correlation coefficients for reliability research. J Chiropr Med. 2016;15:155–63. https://doi.org/10.1016/j.jcm.2016.02.012.
Article PubMed PubMed Central Google Scholar
Cohen J. Statistical power analysis for the behavioral sciences. 2nd ed. Erlbaum Associates: Hillsdale; 1988.
Google Scholar
Mohr M, Krustrup P. Yo-Yo intermittent recovery test performances within an entire football league during a full season. J Sports Sci. 2014;32:315–27. https://doi.org/10.1080/02640414.2013.824598.
Article PubMed Google Scholar
Veugelers KR, Naughton GA, Duncan CS, et al. Validity and Reliability of a submaximal intermittent running test in elite Australian football players. J Strength Cond Res. 2016;30:3347–53. https://doi.org/10.1519/jsc.0000000000001441.
Article PubMed Google Scholar
Lignell E, Fransson D, Krustrup P, et al. Analysis of high-intensity skating in top-class ice hockey match-play in relation to training status and muscle damage. J Strength Cond Res. 2018;32:1303–10. https://doi.org/10.1519/jsc.0000000000001999.
Article PubMed Google Scholar
Fisher R. Statistical methods for research workers. 12th ed. Edinburgh: Oliver and Boyd; 1954.
Google Scholar
Nakagawa S, Poulin R, Mengersen K, et al. Meta-analysis of variation: ecological and evolutionary applications and beyond. Methods Ecol Evol. 2015;6:143–52. https://doi.org/10.1111/2041-210X.12309.
Article Google Scholar
Viechtbauer W. Conducting meta-analyses in R with the metafor package. J Stat Soft. 2010. https://doi.org/10.18637/jss.v036.i03.
Article Google Scholar
Pustejovsky J. clubSandwich: cluster‐robust (Sandwich) variance estimators with small‐sample corrections. R package version 0.2.3. 2017. Available from: https://github.com/jepusto/clubSandwich.
RStudio: Integrated development for R. RStudio, Inc., Boston, MA. Available from: https://www.rstudio.com.
Rabbani A, Kargarfard M, Twist C. Reliability and validity of a submaximal warm-up test for monitoring training status in professional soccer players. J Strength Cond Res. 2018;32:326–33. https://doi.org/10.1519/jsc.0000000000002335.
Article PubMed Google Scholar
Doncaster G, Scott M, Iga J, et al. Reliability of heart rate responses both during and following a 6 min Yo-Yo IR1 test in highly trained youth soccer players. Sci Med Football. 2019;3:14–20. https://doi.org/10.1080/24733938.2018.1476775.
Article Google Scholar
Pustejovsky JE, Tipton E. Meta-analysis with robust variance estimation: expanding the range of working models. Prev Sci. 2021;7:1–14. https://doi.org/10.1007/s11121-021-01246-3.
Article Google Scholar
Cheung MWL. Modeling dependent effect sizes with three-level meta-analyses: a structural equation modeling approach. Psychol Methods. 2014;19:211–29. https://doi.org/10.1037/a0032968.
Article PubMed Google Scholar
Hedges LV, Tipton E, Johnson MC. Robust variance estimation in meta-regression with dependent effect size estimates. Res Synth Method. 2010;1:39–65. https://doi.org/10.1002/jrsm.5.
Article Google Scholar
Cheung MWL. A guide to conducting a meta-analysis with non-independent effect sizes. Neuropsychol Rev. 2019;29:387–96. https://doi.org/10.1007/s11065-019-09415-6.
Article PubMed PubMed Central Google Scholar
Higgins JPT. Commentary: heterogeneity in meta-analysis should be expected and appropriately quantified. Int J Epidemiol. 2008;37:1158–60. https://doi.org/10.1093/ije/dyn204.
Article PubMed Google Scholar
Borenstein M, Higgins JPT, Hedges LV, et al. Basics of meta-analysis: I2 is not an absolute measure of heterogeneity. Res Synth Methods. 2017;8:5–18. https://doi.org/10.1002/jrsm.1230.
Article PubMed Google Scholar
Assink M, Wibbelink CJM. Fitting three-level meta-analytic models in R: a step-by-step tutorial. TQMP. 2016;12:154–74. https://doi.org/10.20982/tqmp.12.3.p154.
Article Google Scholar
Higgins JPT, Thomas J, Chandler J, Cumpston M, Li T, Page MJ, Welch VA (editors). Cochrane handbook for systematic reviews of interventions version 6.1 (updated September 2020). Cochrane, 2020. Available from: www.training.cochrane.org/handbook.
Greenland S. Valid P-values behave exactly as they should: some misleading criticisms of P-values and their resolution with S-values. Am Stat. 2019;73:106–14. https://doi.org/10.1080/00031305.2018.1529625.
Article Google Scholar
Halperin I, Malleron T, Har-Nir I, et al. Accuracy in predicting repetitions to task failure in resistance exercise: a scoping review and exploratory meta-analysis. Sports Med. 2022;52:377–90. https://doi.org/10.1007/s40279-021-01559-x.
Article PubMed Google Scholar
Malcata RM, Vandenbogaerde TJ, Hopkins WG. Using athletes’ world rankings to assess countries’ performance. Int J Sports Physiol Perform. 2014;9:133–8. https://doi.org/10.1123/ijspp.2013-0014.
Article PubMed Google Scholar
Hopkins WG, Marshall SW, Batterham AM, et al. Progressive statistics for studies in sports medicine and exercise science. Med Sci Sports Exerc. 2009;41:3–13. https://doi.org/10.1249/mss.0b013e31818cb278.
Article PubMed Google Scholar
Kim SY, Park JE, Lee YJ, et al. Testing a tool for assessing the risk of bias for nonrandomized studies showed moderate reliability and promising validity. J Clin Epidemiol. 2013;66:408–14. https://doi.org/10.1016/j.jclinepi.2012.09.016.
Article PubMed Google Scholar
Park J, Lee Y, Seo H, Jang B, Son H, Kim S, Shin S, Hahn S. Risk of bias assessment tool for non-randomized studies (RoBANS): development and validation of a new instrument. In: Abstracts of the 19th cochrane colloquium; 2011 19–22 Oct; Madrid, Spain. Wiley; 2011.
Egger M, Davey Smith G, Schneider M, et al. Bias in meta-analysis detected by a simple, graphical test. BMJ. 1997;315:629–34. https://doi.org/10.1136/bmj.315.7109.629.
Article CAS PubMed PubMed Central Google Scholar
Begg CB, Mazumdar M. Operating characteristics of a rank correlation test for publication bias. Biometrics. 1994;50:1088–101.
Article CAS PubMed Google Scholar
Viechtbauer W, Cheung MWL. Outlier and influence diagnostics for meta-analysis. Res Synth Method. 2010;1:112–25. https://doi.org/10.1002/jrsm.11.
Article Google Scholar
Baujat B, Mahé C, Pignon J-P, et al. A graphical method for exploring heterogeneity in meta-analyses: application to a meta-analysis of 65 trials. Stat Med. 2002;21:2641–52. https://doi.org/10.1002/sim.1221.
Article PubMed Google Scholar
Algur SP, Biradar JG. Cooks distance and mahanabolis distance outlier detection methods to identify review spam. Ijecs. 2017;6:21638–49.
Google Scholar
Bradley PS, Bendiksen M, Dellal A, et al. The application of the Yo-Yo intermittent endurance level 2 test to elite female soccer populations: Yo-Yo IE2 testing in female soccer players. Scand J Med Sci Sports. 2014;24:43–54. https://doi.org/10.1111/j.1600-0838.2012.01483.x.
Article CAS PubMed Google Scholar
Laursen P, Buchheit M, editors. Science and application of high-intensity interval training. Champaign: Human Kinetics; 2019.
Google Scholar
Lamberts RP, Lemmink KAPM, Durandt JJ, et al. Variation in heart rate during submaximal exercise: implications for monitoring training. J Strength Cond Res. 2004;18:641–5.
PubMed Google Scholar
Lamberts RP, Lambert MI. Day-to-day variation in heart rate at different levels of submaximal exertion: implications for monitoring training. J Strength Cond Res. 2009;23:1005–10. https://doi.org/10.1519/jsc.0b013e3181a2dcdc.
Article PubMed Google Scholar
Araújo M, Baumgart C, Freiwald J, et al. Contrasts in intermittent endurance performance and heart rate response between female and male soccer players of different playing levels. Biol Sport. 2019;36:323–31. https://doi.org/10.5114/biolsport.2019.88755.
Article PubMed PubMed Central Google Scholar
Ingebrigtsen J, Bendiksen M, Randers MB, et al. Yo-Yo IR2 testing of elite and sub-elite soccer players: performance, heart rate response and correlations to other interval tests. J Sports Sci. 2012;30:1337–45. https://doi.org/10.1080/02640414.2012.711484.
Article PubMed Google Scholar
Bock AV, Vancaulaert C, Dill DB, et al. Studies in muscular activity: IV. The ‘steady state’ and the respiratory quotient during work. J Physiol. 1928;66:162–74. https://doi.org/10.1113/jphysiol.1928.sp002515.
Article CAS PubMed PubMed Central Google Scholar
Lacome M, Simpson B, Broad N, et al. Monitoring players’ readiness using predicted heart-rate responses to soccer drills. Int J Sports Physiol Perform. 2018;13:1273–80. https://doi.org/10.1123/ijspp.2018-0026.
Article PubMed Google Scholar
Owen AL, Newton M, Shovlin A, et al. The use of small-sided games as an aerobic fitness assessment supplement within elite level professional soccer. J Hum Kinet. 2020;71:243–53. https://doi.org/10.2478/hukin-2019-0086.
Article PubMed PubMed Central Google Scholar
Hulka K, Weisser R, Belka J, et al. Stability of internal response and external load during 4-a-side football game in an indoor environment. Acta Gymnica. 2015;45:21–5. https://doi.org/10.5507/ag.2015.003.
Article Google Scholar
Wiseman RM. On the use and misuse of ratios in strategic management research. Research methodology in strategy and management, vol. 5. Emerald Group Publishing Limited; 2009. p. 75–110.
Google Scholar
Lacome M, Simpson BM, Buchheit M. Monitoring training status with player-tracking technology: still on the road to Rome. Aspetar Sports Med J. 2018;7:54–63.
Google Scholar
Buchheit M. The 30–15 intermittent fitness test: accuracy for individualizing interval training of young intermittent sport players. J Strength Cond Res. 2008;22:365–74. https://doi.org/10.1519/jsc.0b013e3181635b2e.
Article PubMed Google Scholar
Buchheit M, Simpson BM, Lacome M. Monitoring cardiorespiratory fitness in professional soccer players: is it worth the prick? Int J Sports Physiol Perform. 2020;15:1437–41. https://doi.org/10.1123/ijspp.2019-0911.
Article PubMed Google Scholar
de Vet HCW, Terwee CB. The minimal detectable change should not replace the minimal important difference. J Clin Epidemiol. 2010;63:804–5. https://doi.org/10.1016/j.jclinepi.2009.12.015.
Article PubMed Google Scholar

Download references

Acknowledgements

The authors would like to thank the authors who provided their data for the studies included in our systematic review and meta-analysis. We also thank the reviewers for their constructive feedback and contribution.

Funding

No sources of funding were used to assist in the preparation of this article.

Author information

Authors and Affiliations

School of Health Sciences, Western Sydney University, Sydney, NSW, Australia
Tzlil Shushan, Ric Lovell & Dean Norris
Faculty of Science, Medicine and Health, University of Wollongong, Wollongong, NSW, Australia
Ric Lovell
HIIT Science, Revelstoke, BC, Canada
Martin Buchheit
Laboratory of Sport, Expertise and Performance, French National Institute of Sport (INSEP), 7370, Paris, France
Martin Buchheit
Kitman Labs, Performance Research Intelligence Initiative, Dublin, Ireland
Martin Buchheit
Institute for Health and Sport, Victoria University, Melbourne, VIC, Australia
Martin Buchheit
Netball Australia, Victoria, Australia
Tannath J. Scott
Carnegie Applied Rugby Research (CARR) Centre, Institute for Sport, Physical Activity and Leisure, Leeds Beckett University, Leeds, UK
Tannath J. Scott
Department of Sport Science Innovation, Playermaker, London, UK
Steve Barrett
Newcastle Falcons Rugby Club, Newcastle Upon Tyne, UK
Shaun J. McLaren
Institute of Sport, Manchester Metropolitan University, Manchester, UK
Shaun J. McLaren

Authors

Tzlil Shushan
View author publications
You can also search for this author in PubMed Google Scholar
Ric Lovell
View author publications
You can also search for this author in PubMed Google Scholar
Martin Buchheit
View author publications
You can also search for this author in PubMed Google Scholar
Tannath J. Scott
View author publications
You can also search for this author in PubMed Google Scholar
Steve Barrett
View author publications
You can also search for this author in PubMed Google Scholar
Dean Norris
View author publications
You can also search for this author in PubMed Google Scholar
Shaun J. McLaren
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

TS, SJM and RL conceptualised the idea and developed the project. TS and RL developed the searching strategy. TS SJM and RL constructed the inclusion–exclusion criteria. TS firstly extracted the data, then checked and approved by SJM. Discrepancies in the extracted data were compared and resolved through a discussion (TS, SJM and RL). TS and SJM performed the data analysis. TS wrote the first manuscript and all other versions. SJM, RL, MB, TJS, SB and DN contributed substantially to the conceptual direction, content and provided feedback to the manuscript. All authors read and approved the final manuscript. TS coordinated the submission and revision process. All authors read and approved the final manuscript.

Corresponding author

Correspondence to Tzlil Shushan.

Ethics declarations

Ethical approval and consent to participate

Not applicable.

Consent for publication

Not applicable.

Competing interest

Tzlil Shushan, Ric Lovell, Martin Buchheit, Tannath J. Scott, Steve Barret, Dean Norris and Shaun J. McLaren declare that they have no competing interests.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Additional file 1

. Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) Checklist.

Additional file 2

. Methodological overview of the searching strategy, screening process and protocol registration.

Additional file 3

. Typical error simulation summary and results.

Additional file 4

. Heterogeneity and meta-regression results from metaanalysis of measurement properties.

Additional file 5

. A summary detailing the quality assessment criteria of the included studies.

Additional file 6

. Meta-analysis forest plots of the weighted points estimates.

Additional file 7

. Sensitivity and influence analyses from metaanalysis of measurement properties.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Shushan, T., Lovell, R., Buchheit, M. et al. Submaximal Fitness Test in Team Sports: A Systematic Review and Meta-Analysis of Exercise Heart Rate Measurement Properties. Sports Med - Open 9, 21 (2023). https://doi.org/10.1186/s40798-023-00564-w

Download citation

Received: 16 May 2022
Accepted: 08 March 2023
Published: 24 March 2023
DOI: https://doi.org/10.1186/s40798-023-00564-w

Submaximal Fitness Test in Team Sports: A Systematic Review and Meta-Analysis of Exercise Heart Rate Measurement Properties

Abstract

Background

Objectives

Methods

Results

Conclusions

Similar content being viewed by others

Key Points

Introduction

Methods

Registration and Search Strategy

Screening and Study Selection

Data Extraction and Coding

Handling Missing Data

Effect Estimates

Mean Difference

Typical Error of Measurement

Intraclass Correlation Coefficient

Pearson’s Correlation Coefficient (r)

Statistical Methods and Data Analysis

Synthesis of Effect Estimates Prior and Post Analysis

Overall Meta-Analysis

Heterogeneity and Modifying Effects

Inferences

Study Qualitative Assessment

Small-Study Effects and Influence Analysis

Results

Selected Reliability Studies and Characteristics

Selected Convergent Validity Studies and Characteristics

Overall Meta-Analyses

Heterogeneity

Meta-Regression Analysis

Study Qualitative Assessment

Small-Study Effects

Discussion

Practical Implications

Protocol Selection

Exercise Regimen

Exercise Intensity

Exercise Volume

HRex Collection Method

Conceptual Interpretation of the Data

Statistical Interpretation of the Data

Conclusion

Availability of data and materials

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Ethical approval and consent to participate

Consent for publication

Competing interest

Additional information

Publisher's Note

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation