FormalPara Key Points

Strength training with high loads (≥ 80% of one repetition maximum) can improve running economy and might be particularly effective in athletes running at high speeds (e.g., > 12.00 km/h) and/or possessing a well-developed VO2max.

Plyometric training could improve running economy at speeds less than 12.00 km/h.

The combination of two or more strength training methods (e.g., high load training, plyometric training) may induce greater running economy improvement, compared to isolated training methods.

These results are based on 31 studies with moderate to low certainty of evidence for the main outcomes, involving a total of 652 middle- and long-distance runners.

1 Introduction

Concurrent training involves the execution of both endurance and strength training within the same training cycle [1] and is effective in enhancing the running performance [2, 3] and running economy (RE) [2, 4, 5] of middle- and long-distance runners. RE is defined as the energy demanded at submaximal running speeds [6] and is one of the key determinants of overall running performance along with maximal oxygen uptake (VO2max), anaerobic threshold and anaerobic capacity [7, 8]. However, a variety of other strength training methods, or combinations thereof, may generate different types of adaptations [2] that could affect changes in RE and, whilst the different protocols used to measure RE may also affect the results of the effect of strength training [2, 9].

RE has been shown to be associated with performance in middle- and long- distance running [10,11,12]. It has been found that trained runners have better RE than active untrained subjects [13]. Indeed, in athletes with a similar VO2max, those with better RE have demonstrated better running performance [11] because they can run at a higher relative intensity or maintain a constant intensity for a relatively longer period of time [14]. One of the strategies for improving RE has traditionally been strength training which can generate various neuromuscular adaptations such as improved intra- and inter-muscular coordination, improved muscle–tendon stiffness, and increased recruitment and firing rate of motor units, in addition to morphological changes [15]. However, strength training programmes can be designed in multiple different ways by varying some of the training parameters (e.g., load, number of sets and repetitions, exercise sequence) [16], which could induce different neuromuscular adaptations in middle- and long-distance runners [15]. For example, strength training with high loads [HL, i.e., ≥ 80% 1 repetition maximum (1 RM)], submaximal loads (SL, i.e., 40–79% 1 RM), isometric training (ISO) and plyometric training (PL) can enhance maximal strength development, strength at submaximal loads, isometric force production and stretch-shortening cycle activity, respectively [15, 17]. In addition, the inclusion of different strength training methods (e.g., HL with PL) have been used as a strategy to improve in different areas of the force–velocity relationship (i.e., different loads and speeds of movements) [18].

In addition to the above, the improvement of RE through strength training has exhibited varying effects depending on the running speed that is being assessed. For example, Piacentini et al. [19] found that strength training with HL only significantly improved RE at marathon pace (p < 0.05), but not at 1.00 km/h faster or slower than marathon pace. Another study [20] found that HL, PL, and complex training (i.e., HL with PL executed within the same session) improved RE at a speed of 12.00 km/h, whereas HL and complex training improved it at 14.00 km/h, while complex training only elicited improvements at 16.00 km/h. These differences in results could possibly have occurred due to differences in the measurement of RE (e.g., as energy cost or oxygen cost) and the chosen running speed, which could be classified as absolute speed or relative speed (e.g., speed relative to anaerobic threshold or race pace) [21, 22]. Moreover, a U-shaped relationship between energy cost and running speed has been found because at the lowest and highest speeds, there appears to be varying utilisation of the stretch-shortening cycle and activation of skeletal muscle in the muscle–tendon unit [23]. In addition, it has been suggested that RE should be measured at a speed relative to the anaerobic threshold [22] as it ensures the same relative intensity for each participant. Accordingly, several methodological aspects related to running speed should be considered when assessing changes in RE. Although there are several systematic reviews and meta-analyses that have examined the effect of strength training on RE [2, 4, 5, 24], none of these have assessed the effect of different strength training methods on RE and the influence of running speed used in the assessment of RE.

Therefore, this systematic review with meta-analysis aimed to compare the effect of different strength training methods on the RE in middle- and long-distance runners, over different running speeds.

2 Methods

This systematic review and meta-analysis was conducted according to the guidelines established by the PRISMA statement [25]. The protocol was registered prior to analysis of the data at the Open Science Framework (https://osf.io/gyeku).

2.1 Information Sources and Search Strategy

The search for articles was carried out in PubMed, Web of Science (all databases), Scopus and SPORTDiscus databases. All articles indexed up to January 2022 were included for the selection. Different terms and Boolean operators were used for the search [Table S1 in the Online Supplementary Material (OSM)]. No limits of study design, date, language, age, or sex were imposed on the search. The search was updated in November 2022, through notifications of new studies found in the search strategy in the different databases. In addition, the reference lists of includable articles, and from reviews, systematic reviews and meta-analyses retrieved from our search were scanned for additional articles of interest.

2.2 Selection Process

All titles and abstracts obtained from the database searches were screened independently by two authors (LL and SV) and those potentially meeting the inclusion criteria (Table 1) were included for full text analysis. In the case of a disagreement between the two authors, a third author (RRC) was consulted.

Table 1 Inclusion and exclusion criteria for meta-analysis

2.3 Eligibility Criteria

Studies were eligible for inclusion according to the participants, intervention, comparator, outcome, and study design (PICOS) criteria (Table 1).

2.4 Data Collection Process

From included studies, an independent reviewer (LL) extracted the data regarding participants characteristics, intervention characteristics, and the main outcomes (means and standard deviations). In those articles where only figure data were available, the validated (r = 0.99, p < 0.001) [26] software WebPlotDigitizer (version 4.5, Pacifica, California, USA) was used to extract the data. Once recoded, the reviewers (LL, SV and RRC) discussed on disagreements and controversial data.

2.4.1 Participants

Only subjects over 16 years of age were considered as pubertal stage can affect physical fitness due to hormonal changes during this period [27]. Subjects were considered to have strength training experience based on the information from each study. According to VO2max mean values, the performance level was categorized as moderately trained (male ≤ 55 ml/kg/min, female ≤ 45 ml/kg/min), well-trained (male 55–65 ml/kg/min, female 45–55 ml/kg/min) or highly trained (≥ 65 ml/kg/min, ≥ 55 ml/kg/min) [28]. When both sexes were measured without distinction, the ranges established were the average of males and females for the respective performance levels. In the case where VO2max was not recorded in a study, level performance was based on participants’ level of competition (moderately trained = recreational or local club level; well-trained = collegiate or provincial level; highly trained = national or international level) [9].

2.4.2 Strength Training Intervention

Strength training methods (i.e., HL, SL, PL, ISO, or a combination of strength training methods), duration of intervention (i.e., weeks; frequency; total sessions), programming model (i.e., lineal; constant; undulating) and exercise sequence (i.e., traditional training; complex training) were recorded. The strength training programme was considered when it was added to, or partially substituted (i.e., matched the training load) for, endurance running training, with an intervention duration of three weeks or more. This corresponds to the time in which both neural and hypertrophic factors can affect strength gains [29], with at least one session undertaken per week. The strength training methods were classified focusing on load (% 1 RM) and/or training target: HL was defined as a program in which exercise was performed with heavy loads (≥ 80% 1 RM or ≤ 7 RM) that were intended to improve maximal force development (e.g., barbell squat, deadlift, etc.); SL with moderate to low loads (peak power load, 40–79% 1 RM or 8–20 RM) with the aim of improving strength development at SL; PL using light-load exercises (< 40% 1 RM) with the aim of improving the stretch-shortening cycle and muscle–tendon stiffness (e.g., drop jumps) and; ISO using isometric contraction exercises (e.g., isometric mid-thigh pull). Those groups that performed strength training with low loads (< 40% 1 RM or > 20 RM) were considered as a control group. The duration of the intervention was counted as total number of weeks, sessions per week and total sessions. The exercise sequence within the session was classified into two, comprising of traditional sequences where exercises are executed with light loads followed by heavy loads and complex sequences where exercises are executed with heavy loads followed by light loads [18]. In the case that the different strength training methods were performed in separate sessions or in different periods, these were considered as traditional sequences.

2.4.3 Outcome Measurements

RE was recorded as the energy demand at submaximal running speed. Of note, energy among studies was reported in different units of measurement (i.e., calorie; oxygen). When both units were reported in a study, the unit “calorie” was selected because it considers differences in substrate use and is more sensitive to changes in speed [21]. In addition, when the respiratory exchange ratio (RER) is less than one, oxidative metabolism is the main metabolic pathway; thus, when the RER is greater than one, and blood lactate values were presented, the energy value was corrected with the energy values of blood lactate [30]. RE values were collected for all speeds assessed in the studies and for methodological purposes, different variables were generated from this data. In RE at absolute speed (km/h), the absolute speed was collected and, when data were presented in relation to a physiological value [e.g., anaerobic threshold or velocity at VO2max (vVO2max)] or a race pace (e.g., 3000 m race pace), we calculated the speed from the average baseline values of both groups (i.e., experimental and control groups). Categorical speeds (≤ 12.00 km/h or > 12.00 km/h) were generated from the mean value of all speeds reported in the included studies. On the other hand, when RE is assessed as absolute speed, the difference in substrate energy relative to anaerobic threshold and VO2max is not considered [21]. Therefore, two categories were generated relative to anaerobic threshold (i.e., second lactate threshold, onset of blood lactate accumulation, maximal lactate steady state, lactate turn point, second ventilatory threshold or critical speed) and VO2max. For RE relative to the anaerobic threshold, speed values were classified: speeds > anaerobic threshold and speeds ≤ anaerobic threshold. If speed values were given as an absolute value and anaerobic threshold values were reported, then the average baseline values of anaerobic threshold of both groups was used to determine the corresponding category. For RE relative to VO2max, the speed was presented as percentage of vVO2max or VO2max (%VO2max), but when the speed values were presented as an absolute value and vVO2max was provided, we then calculated the mean baseline vVO2max of both groups and calculated the percentage corresponding to vVO2max. In addition, we created a category of the U-shaped RE–speed relationship (< 11.50 km/h, 11.50–14.50 km/h or > 14.50 km/h) [23]. Where the study reported multiple time points (i.e., more than two data points), the first record and the last record immediately after the intervention were recorded.

2.5 Risk of Bias, Publication Bias and Certainty Assessment

The PEDro (Physiotherapy Evidence Database) scale was used to assess the risk of bias of the studies [31, 32]. Of note, in the context of this systematic review, items five to seven from the PEDro scale were removed from the scale because subjects, assessors and researchers are rarely blinded in supervised exercise interventions [31, 33]. Following previous criteria [33], the studies were categorized as follows: ≥ 6 points = “low risk”, 4–5 points = “moderate risk”, and ≤ 3 points = “high risk”. A funnel plot was performed to assess the publication bias of the studies examining each method of strength training. It was considered to have publication bias if an asymmetry was observed in the funnel plot. The funnel plots were built following the R code provided by Fernández-Castilla et al. [34].

The GRADE (Grading of Recommendations Assessment, Development and Evaluation) approach was conducted for rate the certainty of evidence of this systematic review [35,36,37]. For each of the analyses, we started with a high level of certainty of evidence, which decreases according to the following criteria: risk of bias, downgraded by one level if the median PEDro scale score was moderate risk (< 6 points), or by two levels if it was high risk (< 4 points); inconsistency, downgraded by one level if the Q-test for heterogeneity was significant (i.e., p < 0.05); indirectness, it was considered low risk because the PICO criteria were assured by default; imprecision, downgraded by one level if the number of participants in the control group with the strength training group was < 800 or if the confidence interval was crossed by a small effect size (i.e., ES = − 0.15 to 0.15); publication bias, downgraded by one level if an asymmetry was observed in the funnel plot.

2.6 Effect Measures

The between-group (i.e., control-experimental) standardized mean difference was calculated as previously recommended [38] and expressed as Hedges’ g effect size (ES) [39], which helps to cope with small sample sizes [40], common to sport science literature [41]. When only the mean and standard error (SE) were presented, the standard deviation (SD) was calculated from the SE as follows:

$${\text{SD}}={\text{SE}}\times \sqrt{N}$$

where N is the sample size. Thresholds for the magnitude of ESs were set as 0.15, 0.45 and 0.80 for a small, moderate, and large effect, respectively [42].

2.7 Statistical Analyses

A meta-analysis was performed for each strength training method (i.e., HL, SL, PL, ISO, or combined methods) and its effect on RE when at least three studies provided an outcome measure [43]. If a study had two or more comparison groups in the same analysis, the sample size of the control group was divided by the number of intervention groups [44]. In several studies, different speeds were selected to assess RE. In these cases, the procedure is usually to select a representative ES, to synthesise separately the ES for each outcome across all studies, or to average the sizes of the dependent effects within the study [45]. However, in the first two approaches the sample size is reduced decreasing the statistical power [45], while in the third approach it may negatively impact the validity of the results due to overestimation of standard errors [46]. In addition, these procedures cannot assess the potential differences between ES within studies [47, 48]. Therefore, we used a three-level meta-analysis model [47, 48], which is an extension of the random effect meta-analysis model [40], that considers sampling (level 1), within study (level 2) and between study (level 3) variation.

Due to multiple sources of variation between studies (e.g., training and participant characteristics), a randomized effect model with restricted maximum likelihood estimation method was conducted for estimating the parameters model (\({\tau }^{2}\)) recommended over the traditional DerSimonian and Laird method for continuous data [49]. We based the test statistic and CI in t-distribution with Knapp and Husting adjustment [50].

For each of the strength training methods, the heterogeneity of all ES in the data set was analysed using the test for heterogeneity (Q-test) [51]. Additionally, the one-side log-likelihood-ratio test (LRT) was performed to determine whether the within-study variance (LRTlevel2) and between-study variance (LRTlevel3) are significant [52]. Outliers were defined as ES in which the upper limit of the 95% confidence interval (95% CI) was lower than the lower limit of the pooled effect confidence interval or the lower limit of the 95% CI higher than the upper limit of the pooled effect confidence interval [53]. A sensitivity analysis was then performed with and without the outlier ES to assess their impact on the analysis [53] (i.e., p value from Q-test).

The omnibus test (QM-test) was used to perform a meta-regression (for continuous data), or subgroup analysis (for categorical data), if at least eight studies were pooled. For all analysis alpha was set as 0.05. The three-level meta-analysis were conducted in R (version 4.2.1) with the metafor package [51], following the syntax by Assink and Wibbelink [52]. The forest plot for the three-level meta-analysis was performed using the R code provided by Fernández-Castilla et al. [34], and meta-regression and subgroup analysis plots were built with GraphPad Prism 9 (version 9.2.0).

3 Results

3.1 Study Selection

A total of 1749 records were identified through the search strategy (Fig. 1). Once duplicate records, records not retrieved and articles excluded after review of titles and/or abstracts were excluded, 73 studies were assessed for eligibility. After reading the full text of each document, 42 studies were excluded due to: participants being under 16 years old [54,55,56,57,58,59] being injured before intervention [60,61,62]; studies having no comparator outcomes [63,64,65,66,67,68,69,70,71,72]; the strength training method was considered ineligible for inclusion (e.g., core, isokinetic eccentric training and body weight training) [73,74,75,76,77,78]; RE was not measured [79,80,81,82,83,84,85,86,87,88,89]; outcome results were repeated [90,91,92,93] or study was cross-sectional by design [94, 95]. Therefore, 31 studies were included in the final meta-analyses.

Fig. 1
figure 1

Flow diagram of the study selection process. *Studies found from notifications of new studies found in the search strategy in the different databases. **Studies found in the reference lists of articles, reviews, systematic reviews, and meta-analyses retrieved from our search strategy

3.2 Study Characteristics

A summary of participants’ characteristics and analysis of the studies included in the meta-analysis are presented in Tables 2 and 3, respectively. Thirty-one studies were included in at least one analysis: 11 studies implemented HL, providing 19 ESs [19, 20, 96,97,98,99,100,101,102,103,104]. Of note, the greater number of ES values (i.e., 19) compared to the number of studies (i.e., 11) indicates that some studies assessed RE at two or more running speeds. Eleven studies implemented PL interventions, providing 28 ESs in total [12, 105,106,107,108,109,110,111,112,113,114]. Three studies implemented SL interventions, providing seven ESs [19, 114, 115]. Three studies implemented ISO interventions, providing 7 ESs [112, 116, 117], and nine studies implemented combined methods, providing 20 ESs [8, 20, 102, 115, 118, 119]. The studies included 652 participants [472 males (204 control and 268 treatments) and 180 females (79 control and 101 treatments)], aged between 17 and 45 years old, mean body mass and height of 68.5 kg and 174.3 cm, respectively. Participants were moderately trained (n = 195), well trained (n = 272), and highly trained (n = 185). The strength training programmes were between 6 and 24 weeks’ duration, with one to four sessions per week.

Table 2 Participants and strength training intervention characteristics of included studies
Table 3 Analysis of the studies included in the meta-analysis of running economy

3.3 Risk of Bias, Publication Bias and Certainty Assessment

The median of risk of bias was 6 of 7 [ranging from 5 to 7; moderate to low risk of bias; Table S2 (OSM)]. No publication bias was found in any of the analyses [Fig. S1 (OSM)]. We found a moderate and low level of certainty of evidence, due to an analysis with moderate risk of bias in the combined methods group, and low number of participants included in the analyses and/or the confidence interval crossed the small effect size in all main analyses (Table 4).

Table 4 GRADE assessment for the certainty of evidence

3.4 Main Effects

Thirty-one studies (involving 37 experimental-control comparisons) measured RE at speeds between 7.00 and 18.00 km/h, providing a total of 80 ES for analysis.

Compared to the control group, the HL exerted a small significant effect on RE at speeds ranging from 8.64 to 17.85 km/h (ES [95% CI] = − 0.266 [− 0.516 to − 0.015], p = 0.039; Q(18) = 16.816, p = 0.536, LRTlevel2 = 0, p = 1, LRTlevel3 = 0.089, p = 0.765, Fig. 2). Combined methods group induced a moderate significant effect, although with significant heterogeneity (ES [95% CI] = − 0.647 [− 1.140 to − 0.154], p = 0.013; Q(19) = 40.696, p = 0.003, LRTlevel2 = 0.033, p = 0.855, LRTlevel3 = 2.569, p = 0.109). After removing outlier groups (i.e., Paavolainen et al. [8] and the 16.00 km/h group in Li et al. [20]), the main effect remained moderate significant on RE (i.e., from 10.00 to 14.45 km/h), although without significant heterogeneity (ES [95% CI] = − 0.426 [− 0.768 to − 0.083], p = 0.018, Q(17) = 19.3, p = 0.312, LRTlevel2 = 0, p = 1, LRTlevel3 = 1.671, p = 0.196, Fig. 3).

Fig. 2
figure 2

Forest plots of the included studies for high load training and its effect on running economy. The black squares represent the mean observed effect size of the study, the size of square represent the weight of the study and the black lines represent the 95% confidence interval. The grey line represents a 95% confidence interval based on the sampling variance of individual observed effect sizes in a study, and its thickness is proportional to the number of effect sizes reported within studies. J number of effect sizes within studies

Fig. 3
figure 3

Forest plots of the included studies for combined methods training and its effect on running economy. The black squares represent the mean observed effect size of the study, the size of square represent the weight of the study and the black lines represent the 95% confidence interval. The grey line represents a 95% confidence interval based on the sampling variance of individual observed effect sizes in a study, and its thickness is proportional to the number of effect sizes reported within studies. J number of effect sizes within studies

Compared to the control condition, no significant effect was found for SL at speeds between 9.75 to 16.00 km/h (ES [95% CI] = − 0.365 [− 0.875 to 0.146], p = 0.131, Q(6) = 1.607, p = 0.952, LRTlevel2 = 0, p = 1, LRTlevel3 = 0, p = 1, Fig. 4), for PL at speeds ranging from 7.00 to 18.00 km/h (ES [95% CI] = − 0.122 [− 0.299 to 0.054], p = 0.167, Q(27) = 16.855, p = 0.935, LRTlevel2 = 0, p = 1, LRTlevel3 < 0.001, p = 0.999, Fig. 5) and for ISO at speed between 10.00 to 15.28 km/h. ES [95% CI] = − 0.269 [− 0.79 to 0.252], p = 0.253, Q (6) = 3.276, p = 0.774, LRTlevel2 = 0, p = 1, LRTlevel3 = 0.211, p = 0.646, Fig. 6).

Fig. 4
figure 4

Forest plots of the included studies for submaximal load training and its effect on running economy. The black squares represent the mean observed effect size of the study, the size of square represent the weight of the study and the black lines represent the 95% confidence interval. The grey line represents a 95% confidence interval based on the sampling variance of individual observed effect sizes in a study, and its thickness is proportional to the number of effect sizes reported within studies. J number of effect sizes within studies

Fig. 5
figure 5

Forest plots of the included studies for plyometric training and its effect on running economy. The black squares represent the mean observed effect size of the study, the size of square represent the weight of the study and the black lines represent the 95% confidence interval. The grey line represents a 95% confidence interval based on the sampling variance of individual observed effect sizes in a study, and its thickness is proportional to the number of effect sizes reported within studies. J number of effect sizes within studies

Fig. 6
figure 6

Forest plots of the included studies for isometric training and its effect on running economy. The black squares represent the mean observed effect size of the study, the size of square represent the weight of the study and the black lines represent the 95% confidence interval. The grey line represents a 95% confidence interval based on the sampling variance of individual observed effect sizes in a study, and its thickness is proportional to the number of effect sizes reported within studies. J number of effect sizes within studies

3.5 Sub-group and Meta-regression Analysis

A beneficial moderation for the effect of HL on RE was noted for continuous absolute speed (β1 = − 0.177, p = 0.027, Table 5 and Fig. 7), categorical speed (β1 = − 0.653, p = 0.021, Table 5) and initial (before intervention) VO2max (β1 = − 0.040, p = 0.020, Table 5 and Fig. 8). In PL, moderator analysis showed significant detrimental moderation in categorical speed (β1 = 0.470, p = 0.017, Table 6 and Fig. 9). In combined methods group, significant beneficial moderation was observed for continuous absolute speed (β1 = − 0.263, p = 0.003), categorical speed (β1 = − 0.679, p = 0.020) and U-shaped RE-speed relationship (β1 = − 0.385, β2 = − 2.008, p = 0.001). However, when outliers were removed, no moderating effects were observed [all p > 0.059, Table S3 (OSM)]. An analysis of possible moderators for SL and ISO was not performed because the minimum number of studies (i.e., eight studies) to perform the analysis was not reached.

Table 5 Results of meta-regression and subgroup analyses in search of possible moderators of high load training on running economy
Fig. 7
figure 7

Meta-regression analysis for the effect of absolute speed (continuous) on running economy effect size in high load training. ES effect size, J number of effect sizes within studies

Fig. 8
figure 8

Meta-regression analysis for the effect of initial VO2max on running economy effect size in high load training. ES effect size, J number of effect sizes within studies

Table 6 Results of meta-regression and subgroup analyses in search of possible moderators of plyometric training on running economy
Fig. 9
figure 9

Sub-group analysis for the effect of absolute speed (categorical) on running economy effect size in plyometric training. ES effect size, J number of effect sizes within studies

4 Discussion

The purpose of this systematic review with meta-analysis was to evaluate the effect of different formats of strength training methods (i.e., HL, SL, PL, ISO, and combined methods) on RE in middle- and long-distance runners and to examine the effect of strength training on RE as a function of assessed running speed. The main findings indicate that HL and combined methods formats induced a small improvement in RE, whereas no significant main effect was found after SL or ISO. Moderator analyses revealed that when HL was adjusted for absolute categorical speed (i.e., ≤ 12.00 km/h and > 12.00 km/h) or absolute continuous speed (i.e., 8.64 km/h to 17.85 km/h), or VO2max, the higher the speed or VO2max, the greater the beneficial effect on RE. In contrast, when PL was adjusted for absolute categorical speed, it induced an improvement in RE at speeds ≤ 12.00 km/h. These results suggest that HL, PL, and combined methods can improve RE, although the beneficial effect can be moderated by factors such as RE speed selected for assessment and athletes’ fitness level (i.e., VO2max levels). These results and their implications are discussed in the sections below.

4.1 Main Analysis

4.1.1 High Load Training

HL is characterised by low-speed exercises and high force requirements (≥ 80% 1 RM or ≤ 7 RM) which aim to improve the development of maximal strength [15]. The main analysis revealed that HL induced a small improvement on RE (ES = − 0.266, p = 0.039), a result that is in line with other meta-analyses on this topic [4, 5, 24, 120]. These improvements were observed in interventions of between 6 and 14 weeks’ duration with a training frequency of 2–4 days per week. There appeared to be no moderating effect of duration in weeks, sessions per week or total sessions on RE (all p > 0.111). These results are contrary to those of a recent meta-analysis [120] which found that the implementation of HL over a period of ten weeks or more had a greater effect on RE compared to shorter programmes. This is possibly because the authors of that study included ISO in the analysis, adding two further studies [116, 121] that incorporated 14-week programmes. In contrast, in the current analysis, just one study [103] included a training programme of 14 weeks’ duration. However, despite finding no significant moderating effect of training duration, the slope of the curve in the conducted meta-regression was negative (i.e., the longer the duration of the study, the better the RE; β1 = − 0.09, p = 0.111). It is therefore possible that studies of longer intervention duration may induce an effect of HL on RE.

The improvement of RE following HL may be due to different mechanisms. It is known that HL may induce neuromuscular changes, such as altered recruitment and firing frequency of motor units and changes in fibre type, resulting in increased rate of force development (RFD) [99, 122]. The early-phase RFD (e.g., isometric mid-thigh pull; 90° squat) has been correlated with RE at 10.00 km/h [123], 12.00 km/h [124], and speed corresponding at 70% VO2max [99]. For example, in a study by Støren et al. [99] it was found that eight weeks of HL improved RFD by 26% in the 90° squat and that this correlated with improvements in RE pre- and post-intervention. Furthermore, this increase occurred independent of changes in body weight, and thus the reported increase in RFD may have been due to neuromuscular adaptations [122]. A greater RFD after HL interventions would allow athletes to generate higher levels of force in short periods of time, allowing a rapid transition from the braking phase to the propulsion phase of the gait cycle, promoting favourable muscular conditions [123] that maximise the force–velocity relationship [125] and, thus, RE. On the other hand, HL may improve RE due to changes in lower limb stiffness [123, 126, 127], which would result in more efficient energy storage and release from the lower limbs, thus reducing the energy cost of running [128]. For example, a study by Millet et al. [103] reported significant increases in RE after 14 weeks of HL and this was accompanied by an increase in leg stiffness.

4.1.2 Submaximal Load Training

Only three studies included SL (Fig. 4). The main analysis found no significant effect of this strength training method on RE at speeds between 9.75 to 16.00 km/h (p = 0.131), which could be attributed to several reasons. It has been found that SL training is not as intense a stimulus as HL for the generation of adaptations in muscle–tendon stiffness [129]. For instance, in a study by Piacentini et al. [19], a significant improvement in RE was found after HL whereas no significant improvement was observed after SL. On the other hand, it seems that SL is not as effective a stimulus for improving stretch-shortening cycle function as PL [130]. For example, Berryman et al. [114] found that both PL and SL improved RE but the percentage improvement was greater after PL (7% vs 4%). In another study [115], PL combined with SL and isolated SL improved RE at 12.00 km/h, whilst improvement in RE at 16.00 km/h was only found in PL combined with SL and not SL as a singular training method.

However, these results should be interpreted with caution, as only three studies were included in the analysis meaning that the conclusion could be undermined by low statistical power. Moreover, an analysis of possible moderators was not possible in this case.

4.1.3 Plyometric Training

The main analysis found no significant effect of PL on RE (p = 0.167). This result does not align with recent systematic reviews with meta-analysis related to the effect of strength training on RE in endurance runners [5, 43, 120]. For example, it has been suggested that PL may have a greater effect in athletes with higher performance levels [43, 120] or in training programmes of longer duration in endurance athletes [120] and healthy adults (i.e., longer than 7 weeks) [131]. However, we found no moderating effect of participant characteristics or intervention duration (all p > 0.120, Table 6). From the studies included in the analysis, improvements in RE were found in moderately trained athletes [12] and in 6-week intervention programmes [12, 106, 107]. The lack of any significant effect of PL in the main analysis is possibly due to methodological differences with other meta-analyses that have been carried out on this topic. For example, two meta-analyses [5, 43] included PL with resistance training within the same analysis while only one meta-analysis [120] included isolated PL interventions, as in the current study. This may be relevant given that PL combined with other strength training methods may have a greater effect on RE (see Sect. 4.1.5). Furthermore, the difference in the speeds used to evaluate RE could also have given rise to the discrepancies observed between various different studies [9]. Contrary to the results of other meta-analyses [5, 43, 120], we included all effect sizes that were documented within each study (i.e., different speeds at which RE was assessed). Interestingly, after performing sub-group analysis, we found that PL had a beneficial effect when the speeds are less or equal to 12.00 km/h compared to when the speed is higher than 12.00 km/h (β1 = 0.47, p = 0.017, Fig. 9). Therefore, from these results it is possible that the improvement in RE is primarily influenced by running speed, rather than performance level and/or duration of training programmes.

4.1.4 Isometric Training

ISO is characterised by exercises that require muscle contraction without external movement. ISO can improve RFD [17] and tendon stiffness (i.e., Achilles tendon) [116, 117] without changes in joint stiffness [132, 133], adaptations that could be related to improved RE [117, 123]. However, the main analysis found no significant effect of ISO on RE (p = 0.253). A possible explanation for this result may be due to the difference in muscle action times that were evaluated in the various different studies. For example, of the three studies [112, 116, 117] included in this analysis, just one [117] showed no improvement in RE. Although all three studies performed the same exercise (i.e., isometric ankle plantarflexion) at intensities equal to or greater than 80% of the maximal voluntary contraction, they differed in muscle action times. While two studies [112, 116] used muscle action times of 3 s, in the study by Fletcher et al. [117] the action time was 20 s. This could be relevant because it is known that improvement in RFD is determined by neuromuscular adaptations, muscle size and tendon stiffness [134] and isometric efforts of 1–5 s have been suggested to generate such neuromuscular adaptations [17]. Given that improvements in tendon stiffness were found in both short duration (i.e., 3 s) [116] and long duration (i.e., 20 s) [117] isometric efforts, it is possible that the failure to improve RE was due to an inadequate stimulus to the neuromuscular system in the way that short duration efforts at maximal speed do. However, these interpretations need to be made with caution given the small number of studies in the analysis and the small sample sizes of those studies. In addition, in the three included studies [112, 116, 117], participants performed the same single-joint exercise (i.e., ankle plantarflexion), while the other strength training methods included multi-joint exercises (e.g., squat or jump squat). Therefore, additional investigations are needed to elucidate the effects of ISO on RE including multi-joint exercises (e.g., specific hip or knee run exercises).

4.1.5 Combined Methods Training

Various different studies implemented more than one strength training method, such as SL with PL [8, 102, 115, 118, 119], HL with SL [135, 136] and HL with PL [20]. From the main analysis, we found that combined methods group had a significant small effect on RE (ES = − 0.426, p = 0.018), which was superior to that found in HL and PL after adjusting for categorical speed. One possible explanation for this is that all studies included in the analysis included PL and/or HL. Therefore, it could be hypothesised that the different adaptations induced by these strength training methods could complement each other when included in the same programme, generating a greater effect on RE [20]. However, it is important to mention that most studies included SL [102, 115, 118, 119, 135,136,137] and given that in the individual analysis of this strength training method we found no significant effect on RE (see Sect. 4.1.2), it is possible that this type training, executed in isolation, may not be enough of a stimulus to generate changes in RE. However, combined with other types of strength training it may be [20, 114, 115].

On the other hand, these strength training methods used were either combined within the same training session [20, 102, 115, 118, 119, 137] or performed in a different part of the programme [135, 136]. In the first case, we found different types of exercise sequences within training sessions, such as traditional and complex training (Table 2). A recent meta-analysis found that different exercise sequences can improve the force- and velocity-producing capabilities of an athlete [18]. For example, a complex sequence can be used combining heavy exercises (e.g., HL and/or SL exercises) followed by light exercises (e.g., PL and/or SL exercises), thus inducing post-activation performance enhancement by improving the speed at which PL exercises are executed [18]. This concept refers to the phenomenon in which maximal strength, power and speed are increased after a conditioned contraction [138]. Traditional training employs light exercises followed by heavy exercises which can enhance strength development [18]. The moderator analysis of exercise sequence in this meta-analysis showed no significant moderating effect on this variable (p = 0.956). This may be because the strength training method is more influential than the sequence of exercises within a prescribed training session. Also, it is important to mention that complex training also has different sequences within it (e.g., ascending, descending, French contrast and contrast) that can generate different adaptations [139]. Therefore, future research could investigate the effect of different strength training methods and with different complex training strategies on RE.

4.2 Strength Training and Running Economy Speed

4.2.1 Absolute Speed (Continuous and Categorical)

Absolute and categorical speed acted as beneficial moderators on the effect of HL on RE (β1 = − 0.177, p = 0.027, Fig. 7; β1 = − 0.653, p = 0.021, respectively). Since the increase in energy cost as speed increases could be the result of an increase in muscle energy cost to generate higher levels of force in short periods of time [125], an increase in RFD may be reflected particularly at higher running speeds. Additionally, we found a significant moderator effect of VO2max on RE in HL (β1 = − 0.05, p = 0.02, Fig. 8). Indeed, it has been observed that the correlation between leg stiffness and RE increases with VO2max [127]. Given that more highly trained athletes make more efficient use of elastic energy (i.e., the Achilles tendon) to minimise muscle energy cost [140], coupled with a possible increase in tendon stiffness generated by HL [133], it is possible that athletes with higher levels of performance (i.e., higher initial VO2max) may be better able to transfer these adaptations to running at a lower energy cost. However, it is possible that the speeds chosen to assess RE were in line with the performance level of the runners, with lower speeds for lower-level runners and higher speeds for higher level runners.

Aside from HL, we found that in PL categorical speed has a positive moderating (i.e., detrimental) effect on RE (β1 = 0.47, p = 0.017, Fig. 9). It has been observed that PL can improve joint stiffness in runners [107] and healthy individuals [141], which may be due to an increase in tendon elongation (i.e., Achilles tendon) and a decrease in fascicle length (i.e., medial gastrocnemius) [141]. A more compliant tendon could store and release more elastic energy, decreasing muscle energy cost, in situations where substantial pre-stretching occurs [125], as at low running speeds. Conversely, it has been found that in plantar flexors, as speed increases, tendon energy storage and release become prioritised over muscle work [142], and thus a more compliant tendon could be detrimental. In fact, in the study by Pellegrino et al. [12] it was found that after 6 weeks of PL, RE improved at speeds ranging from 7.74 to 10.62 km/h, while no improvement or detriment was observed at speeds between 12.10 and 16.42 km/h. Taken together, it appears that HL and PL may improve RE but with varied effects depending on running speed. However, future research is required to elucidate the possible mechanisms of RE improvement.

4.2.2 U-Shaped RE–Speed Relationship

Several studies have found a U-shaped relationship between the energy cost of running and speed (from 8.00 to 18.00 km/h) [23, 143], with elastic energy being independent of running speed [23]. This higher energy cost at low speeds (i.e., < 11.50 km/h) may be due to greater muscle activation for greater neuromotor control [144], whereas at high speeds (i.e., > 14.50 km/h) it may be due to the muscle being in a less favourable contractile condition as a priority for storage and release of elastic energy from the tendon [142]. When we performed the moderator analysis with the variable U-shaped RE–speed relationship, we did not find a significant moderating effect for HL, PL or combined methods. However, we did find a near-significant moderating effect for HL (p = 0.055). Indeed, the almost significant regression coefficient showed a beneficial effect at higher speed (β2 = − 0.557, p = 0.053) compared to the regression coefficient at moderate speed (β1 = − 0.195, p = 0.478). This could suggest a role of HL in improving RE at high speeds, covering the higher muscle energy cost at high speeds. A possible explanation for finding a moderating effect on absolute speed and not on U-shaped RE–speed relationship may be due to the wide range of athletes that were included in this analysis. It was reported that recreational athletes had a curvilinear energy cost relationship at a range of speeds lower than those observed in highly trained athletes [143]. Therefore, future research could analyse the impact of HL on RE at higher speeds where muscle energy cost is higher, as well as consider the difference between athletes of different performance levels.

4.2.3 Speed Relative to Anaerobic Threshold

The assessment of RE at speeds relative to the anaerobic threshold has been suggested [22] as this allows consideration of the differences in energy substrates and anaerobic threshold [21] or running at a race pace (e.g., at marathon pace), with the intention of equalising the metabolic conditions of the runners [22]. For example, Piacentini et al. [19] found that HL improved RE only at marathon running pace speed, whereas at 1.00 km/h below or above marathon pace no improvement was found. On the other hand, when the speed is above the anaerobic threshold, anaerobic metabolism starts to become relevant, and it is recommended to correct the values by adding blood lactate energy values [30, 145]. When anaerobic metabolism is added to the energy cost, it has a linear relationship with running speed [30, 145]. However, among the studies that included groups with values above the anaerobic threshold, only one study [113] reported blood lactate concentration values, and thus only one study would allow the application of correction procedures to adjust for anaerobic metabolism contribution. Surprisingly, this study [113] found that PL improved RE at 18.00 km/h (measured in LO2/min); however, a detrimental effect was noted after adjusting for blood lactate values. In the moderator analysis performed with the categorical speed relative to anaerobic threshold we did not find a moderating effect for this variable (all p > 0.115), possibly because the number of groups with speeds less than or equal to the anaerobic threshold was considerably higher than the number of groups with speeds greater than the anaerobic threshold. It is therefore recommended that future research should include the contribution of anaerobic metabolism when assessing RE at speeds above the anaerobic threshold, allowing the effect of strength training on RE at higher speeds to be assessed.

4.2.4 Speed Relative to VO2max

It has been found that athletes are more economical at the speeds at which they compete (i.e., at middle- or long-distance speeds) and that differences in RE between men and women are not significant when assessed at relative running intensity (i.e., as a percentage of VO2max) [146]. Therefore, we created a new variable whereby speeds were estimated as speed relative to VO2max. However, we found no moderating effect of this variable in HL, PL, or combined methods (all p > 0.419). This may be because only two studies [99, 103] assessed running economy at speeds relative to VO2max, while the other values were estimated. On the other hand, it is possible that speed relative to VO2max may not consider differences in energy substrates as speed relative to anaerobic threshold would.

4.3 Strengths and Limitations

Some limitations of this meta-analysis should be mentioned. Firstly, we performed analyses separately for each strength training method due to their different compositions and this limited the number of studies (i.e., < 8) for which a moderator analysis for the effects of SL and ISO could be performed. Secondly, in terms of speeds assessed in RE, all but six studies [99, 100, 103, 111, 117, 118] used absolute speeds; however this does not consider the difference in energy substrates and anaerobic threshold [21], so it is recommended to use speeds relative to anaerobic threshold or relative to race pace [22]. The strengths of this meta-analysis should also be acknowledged. To our knowledge, this is the first meta-analysis to investigate the moderation of assessed speed on the effect of strength training on RE by including all assessed speeds from each study, allowing the effect of different strength training methods on RE at different running speeds to be elucidated.

5 Conclusions

Based on these results, HL, PL, and combined methods can improve RE. Furthermore, PL improves RE at speeds of ≤ 12.00 km/h, combined methods group at 10.00 to 14.45 km/h and, HL at 8.64 to 17.85 km/h (particularly at higher speeds), and as a function of athletes VO2max level. No RE improvement was noted after SL or ISO. Therefore, athletes and coaches might consider including different strength training methods (HL, PL and/or combined methods) in traditional endurance training to improve running economy at different speed ranges in middle- and long-distance runners. Future experimental research is needed to understand the potential effects, and underlying mechanisms, of different strength training methods on RE assessed at different speeds in middle- and long-distance runners, particularly among under-researched populations (e.g., females; highly trained athletes).