Introduction

In traditional classic-style cross-country skiing races over hilly terrain, diagonal stride (DS) and double poling (DP) are the two most frequently employed sub-techniques (Sandbakk et al. 2016b). Typically, DS is employed on uphill sections at speeds that range from ~ 1.5 to ~ 4.5 m·s−1, whereas DP is employed on flatter course sections, and slight downhill sections, at speeds that range from ~ 4.0 to ~ 9.0 m·s−1 (Losnegard 2019). In DS, both arms and legs are involved in the active propulsion with diagonal arm and leg movements (similar to running and walking), whereas DP solely involves active propulsion via the arm-poling action without any active propulsion from the legs. Today, long-distance cross-country ski races that are included in the long-distance ski championship are performed on flatter courses than the traditional world-cup races and are mainly won by skiers exclusively using DP. To prevent the traditional sub-techniques from becoming nonexistent, the International Ski Federation (FIS) recently introduced technical zones on certain uphill sections of competition tracks where DP is forbidden (Stöggl et al. 2019), which in combination with hilly course profiles will mean that DS continues to be an important sub-technique in the traditional races of cross-country skiing.

Irrespective of exercise modality, it has been demonstrated that whole-body peak oxygen uptake (\({\dot{\text{V}}}\)O2peak), or maximal oxygen uptake (\({\dot{\text{V}}}\)O2max), increases when the arms contribute 10–30% of the total power output, while \({\dot{\text{V}}}\)O2peak decreases when the arms contribute > 30% of the total power output (Bergh et al. 1976). When comparing the DP and DS sub-techniques of cross-country skiing, \({\dot{\text{V}}}\)O2peak in DP, which involves a high level of arm activity, is ~ 12% lower than in DS (Losnegard 2019). The lower \({\dot{\text{V}}}\)O2peak in DP than in DS has from a physiological perspective been related to a lower oxygen extraction as well as a lower cardiac output mainly driven by a lower peak heart rate (Andersson et al. 2021; Björklund et al. 2015; Calbet et al. 2005; Stöggl et al. 2013). An additional important performance factor in cross-country skiing is the anaerobic energy supply (Gløersen et al. 2020; Losnegard et al. 2012). Anaerobic capacity has been shown to account for a large portion of the variation in performance, both between (Losnegard et al. 2012) and within athletes, during repeated roller-skiing sprint time trials (Andersson et al. 2016) and has also shown to be important for uphill-section performance during distance races (Gløersen et al. 2020). In addition, the ability to recover the anaerobic energy system is highly important during distance races over undulating terrain (Gløersen et al. 2020).

In an endurance sport such as cross-country skiing, external power output is from a solely physiological perspective dependent on the sum of aerobic and anaerobic metabolic rates (in W) multiplied by gross efficiency (GE) (Andersson et al. 2017). Due to the differing speeds and muscular contributions between DP and DS, it is unsurprising that physiological variables, such as \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE, differ between the two sub-techniques, which is a unique aspect of cross-country skiing (Andersson et al. 2017). Data indicate that both \({\dot{\text{V}}}\)O2peak and anaerobic capacity are lower for “flat” DP (1–2° incline) than uphill DS (7° incline) cross-country skiing, which together with the considerably lower GE (~ 3–4 percentage points) would result in a substantial difference in external power output between the two sub-techniques (Andersson et al. 2017; Sandbakk et al. 2016a). Moreover, recent research has shown a higher between-athlete variation in GE for DP than DS (Andersson et al. 2017). In addition, the relationship between GE and speed (or power output) has been observed to be speed dependent in DP, but not in DS (Andersson et al. 2017), probably due to differences in cycle characteristics and force application patterns as well as the different muscle recruitment and muscle contraction properties between the two sub-techniques (Losnegard 2019).

From a physiological perspective, high-level endurance athletes seem to be relatively heterogeneous in their respective physiological strengths and weaknesses as indicated by the relatively low strength of separate pair-wise correlations between physiological performance factors and performance (Andersson et al. 2017; Laaksonen et al. 2020). Since the finish time in a traditional classic cross-country skiing race on a standard FIS course is related to the sum of all sub-technique-specific performances, the best race performances are characterized by a high DP and DS-specific performance (Sandbakk et al. 2016b). However, the locomotion of DP and DS differ substantially (Pellegrini et al. 2013) and upper body-specific physiological characteristics are likely to be more crucial for DP than DS performance (Stöggl et al. 2019). Due to this, a skier that performs well in DS may not necessarily perform well in DP, or vice versa, and physiological characteristics in one sub-technique, such as \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE may not directly transfer to the other sub-technique.

Although a traditional incremental \({\dot{\text{V}}}\)O2max test may be an adequate test for the assessment of \({\dot{\text{V}}}\)O2max, it is not a reliable and/or race-specific type of test (Jeukendrup et al. 1996; Noakes 2008). In comparison to a traditional incremental \({\dot{\text{V}}}\)O2max test, a short time-trial test (∼4 min) may be a preferable alternative as a laboratory-based performance test as it is more reliable and can be used for the assessment of \({\dot{\text{V}}}\)O2max and anaerobic capacity (McGawley 2017; Watkins et al. 2017). In addition, GE can be determined during the submaximal warm-up exercise (Andersson and McGawley 2018).

Recent research reveals a relatively large between-athlete variation in sub-technique-specific performance differences between DP and DS (Andersson et al. 2017, 2021; Sagelv et al. 2018; Stöggl et al. 2019), which requires further investigation. Research shows that it seems difficult to increase the \({\dot{\text{V}}}\)O2max that is reached during whole-body exercise (usually uphill DS) in already highly trained senior-elite cross-country skiers (Losnegard et al. 2013). Therefore, reducing sub-technique-specific differences in physiological capabilities between DP and DS and/or focusing on improving GE in “weaker” sub-techniques seems like an appropriate training strategy for further performance enhancement in already highly trained athletes. To date, there is a lack of information regarding differences in performance and physiological responses between DP and DS in highly trained senior male cross-country skiers based on a duration-specific time-trial performance test. Therefore, the primary aim of the current study was to compare physiological and perceptual responses between a self-paced 4-min DP time-trial (TTDP) performance versus a 4-min DS time-trial (TTDS) performance. A secondary aim was to examine the relative importance of \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE for the projection of 4-min TTDP and TTDS performances.

Materials and methods

Participants

Sixteen highly trained male cross-country skiers (26 ± 5 years, 182 ± 6 cm, 77.3 ± 6.7 kg), competing at a national level and/or an international level (Tier 3, n = 10; Tier 4, n = 6 [according to McKay et al. (2022)], were recruited for the study that was performed ~ 2–3 weeks after their race season. The participants' distance and sprint FIS points were 70 ± 23 and 135 ± 43, respectively (for details about FIS points, see Jones et al. (2021)). Of the 16 participants, three did not compete in sprint races and had due to this no FIS points. Participants were instructed to engage only in low-intensity exercise (1-h maximum) the day prior to testing and consume carbohydrate-rich meals. The study was preapproved by the Regional Ethical Review Board of Umeå University, Umeå, Sweden (#2018–154-31 M). Participants received both written and verbal information about the experimental protocol and possible risks involved, before providing written informed consent.

Study overview

On separate test days, participants completed in each sub-technique (DP and DS) a continuous incremental submaximal protocol consisting of eight 4-min bouts at intensities between ~ 47–78% of \({\dot{\text{V}}}\)O2peak that was followed by a short break (10 min) and thereafter a self-paced 4-min roller-skiing TT performance test (i.e., TTDP or TTDS) at maximal effort all performed on a treadmill with automated speed control. The two test days were completed within 2 weeks, separated by at least 2 days, and the order of sub-technique was randomized. The inclination of the treadmill was set to 1.5° and 6.5° for the DP and DS roller-skiing tests, respectively, as these are the typical gradients (on average) where the DP and DS sub-techniques are used (for details, see Losnegard (2019)). All participants were familiarized with the specific tests and the testing procedure.

Equipment and measurements

All tests were performed on a treadmill specifically designed for roller-skiing (Rodby Innovation AB, Vänge, Sweden) that allows the athlete to freely adjust the speed and distance completed during the TT was automatically logged (2.46 Hz) and linearly interpolated to second-by-second data. Participants completed all testing using the same pair of classical roller skis (Pro-Ski C2, Sterners, Dala-Järna, Sweden) with the coefficient of rolling resistance (μR) being 0.0215 and determined according to Ainegren et al. (2008). To avoid changes in rolling resistance during test sessions, roller-skis were pre-warmed in a heat box for a minimum of 60 min prior to testing. Participants used their own poles, which were fitted with custom-made rubber tips designed for treadmill skiing, and the same pair of poles was used for both the DP and DS tests. Respiratory measurements were performed using an AMIS 2001, model C (Innovision AS, Odense Denmark). The gas analyzers were calibrated with a known reference gas mixture (16.0% O2 and 4.5% CO2, Air Liquide, Kungsängen, Sweden) and ambient air. The flowmeter was calibrated with a 3-L syringe at low, medium, and high flow rates (Hans Rudolph, Kansas City, Missouri, USA). Calibration was performed before the start of each test. The ambient temperature was 19.5 ± 0.5 °C at a relative humidity of 21 ± 6% which was monitored with a Vaisala PTU200 (Vaisala Oy, Helsinki, Finland). A Biosen S_Line (EKF diagnostics, Magdeburg, Germany) equipment was used to determine the blood lactate concentration, which was calibrated with a known standard solution of 12 mmol·L−1.

All equipment used for the roller-ski assessments was validated prior to the test period. Treadmill speed and incline were validated using an electronic tachometer (Lutron Electronic Enterprise CO, Taipei, Taiwan) and a digital inclinometer (DNM 60 L Pro, Bosch GmbH, Germany), respectively. The ergospirometry AMIS system was validated against a mechanical lung simulator (Metabolic Simulator No 17056, Vacumed, Ventura, CA, USA) and custom-made Douglas bags. Relative concentrations and volumes of expired gas were analyzed using a MOXUS Metabolic Cart (AEI technologies, Bastrop, TX, USA) and a custom-built spirometer (Fabri AB, Spånga, Sweden). The AMIS system was also validated across a wide range of submaximal workloads corresponding to oxygen uptakes between 0.7 and 5.0 L·min−1 and a respiratory exchange ratio < 1.00. The typical error in \({\dot{\text{V}}}\)O2 values prior to testing was < 0.1 L·min−1.

Testing procedures

Upon arrival at the laboratory body mass of the participants, with and without equipment was measured using an electronic scale (Seca 764, Hamburg, Germany) followed by a 5-min supine rest. The DP protocol was performed at an incline of 1.5° and the DS protocol was performed at an incline of 6.5°. The starting speed was either 6 or 6.5 km·h−1 for the DS protocol and either 12.6 or 13.8 km·h−1 for the DP protocol based on previous race results and/or the familiarization. The speed was increased by 0.5 km·h−1 up to a final speed of either 9.5 or 10 km·h−1 for DS, whereas the speed was increased by 1.2 km·h−1 up to a final speed of either 21 or 22.2 km·h−1 for DP. Both protocols consisted of 8 × 4-min submaximal stages (except for the first stage which lasted 8 min), followed by a 10-min passive rest and a 4-min self-paced TT at a maximal effort. The participants were instructed to cover as much distance as possible during the self-paced TT. Participants received feedback on elapsed time every 30 s but received no feedback regarding their speed during the TT. Participants completed a familiarization session on the treadmill before their first test day to minimize the effect of learning on time-trial (TT) performance. This involved submaximal skiing with DP (3 × 5 min) and DS (3 × 5 min) at fixed speeds of 13, 17, and 21 km·h−1 for DP and 6.0, 7.5, and 9.0 km·h−1 for DS and 10 min of varied intensity skiing (5 min each for DP and DS) using the automated speed control system. This was followed by a short break and a race-paced 4 min TTDP/TTDS in a randomized order, with approximately 20 min of recovery between the TTs.

A capillary blood sample (20 μL) was taken from the fingertip for the assessment of blood lactate concentration 2 min after the TT. The skiers rated their perceived exertion (RPE) after the last submaximal stage as well as immediately after the TT using the 10-point scale of Foster et al. (2001) and retrospectively at minutes 1, 2, and 3 of the TT. During the submaximal protocol and TT, both respiratory variables and heart rate were collected continuously. The highest 30-s moving average during the TT was used to calculate \({\dot{\text{V}}}\)O2peak and peak ventilation rate. Peak respiratory exchange ratio (RER) was taken over the same period as \({\dot{\text{V}}}\)O2peak. Peak oxygen pulse was calculated as \({\dot{\text{V}}}\)O2peak divided by heart rate (30-s average) at \({\dot{\text{V}}}\)O2peak. Participants were secured with a safety harness suspended from the ceiling and connected to an emergency brake during all testing that stopped the treadmill in case of a fall.

Calculations

Power output, gross efficiency, and metabolic responses

The power output for submaximal roller-skiing at constant speed was calculated as the sum of the power exerted to overcome the rolling resistance and to elevate system mass (SM) (i.e., body mass and skiing equipment) against gravity

$$\mathrm{Power \,output} \left[W\right]= vSM\left(g \,\mathrm{sin}\left(\alpha \right)+{\upmu }_{R}g\, \mathrm{cos}\left(\alpha \right)\right),$$
(1)

where g is gravitational acceleration, v is the treadmill speed [m·s−1], µR is the rolling resistance coefficient, and α is the treadmill incline. Gross energy expenditure was calculated from oxygen uptake (\({\dot{\text{V}}}\)O2 [L·min−1]) and RER (\({\dot{\text{V}}}\)CO2· \({\dot{\text{V}}}\)O2−1) according to the equation introduced by Weir (1949) and then converted into a metabolic rate

$$\mathrm{Metabolic\, rate} \left[W\right]= \frac{4184{(\dot{\mathrm{V}}\mathrm{O}}_{2}\left(1.1\mathrm{RER}+3.9\right))}{60}.$$
(2)

The GE was calculated using the following equation:

$$\mathrm{GE}= \frac{\mathrm{Power\, output} (W)}{\mathrm{Metabolic\, rate} (W)},$$
(3)

where metabolic rate was based on the average \({\dot{\text{V}}}\)O2 and RER values (≤ 1.00) during the final minute of each submaximal bout.

For determining the most appropriate method used for calculating the anaerobic metabolic rate and anaerobic capacity in DP and DS, a previous methodological study (Andersson et al. 2020) has been published on parts of the data that are included in the current study. Based on the results presented by Andersson et al. (2020), a second-degree polynomial regression model would be more appropriate for DP roller-skiing, while a linear model would be more appropriate for DS roller-skiing; both without using a baseline metabolic rate as a Y-intercept.

For DS, a linear relationship between treadmill power output and metabolic rate during the final minute of each of the 8 × 4-min submaximal stages was derived and used to estimate the instantaneous required metabolic rate during the 4-min TT (MRTT_req) at each 1-s time-point. The same procedure was used for DP, with the exception that a second-degree polynomial regression was used. The power output during the TT (POTT) was calculated according to Eq. 1.

The instantaneous anaerobic metabolic rate (MRAN) at each 1-s time-point (t) of the TT was expressed as

$${\mathrm{MR}}_{AN,t} \left[{J\cdot s}^{-1}\right]= {\mathrm{MR}}_{TT\_req,t}- {\mathrm{MR}}_{\mathrm{AE},t},$$
(4)

where MRAE is the aerobic metabolic rate calculated as described in Eq. 2.

The total anaerobic energy production (in joules) was calculated by integrating MRAN over the 4-min TT.

To be able to compare the average estimated supramaximal GE during the TT based on the regression equation, the following calculations were performed. First, the estimated instantaneous GE at each 1-s time-point (t) of the 4-min TT was calculated as the ratio between POTT (calculated similarly as in Eq. 1) and the MRTT_req derived from the polynomial regression equation in DP and the linear regression equation in DS. Second, the estimated instantaneous GE was expressed as an average value in DP and DS, respectively. The instantaneous GE was also used to calculate the time-course of aerobically attributable power output (i.e., the aerobic contribution to power output) and was calculated as the instantaneous MRAE multiplied by instantaneous GE.

Cycle characteristics and poling power

For all kinematical analyses, all tests were filmed from the side with a Go-Pro camera (GoPro Hero 1, GoPro Inc., San Mateo, CA, USA). A skiing cycle was defined as the moment from the start of the pole plant (i.e., first pole-belt contact) until the same pole made contact again with the treadmill belt. The number of complete cycles within the last minute of each submaximal stage was counted and the exact times were noticed. For the calculation of cycle rate (in Hz, i.e., cycles·s−1), the total number of cycles was divided by the exact time (in s) taken to complete those cycles, whereas cycle length (in m) was calculated by dividing speed (in m·s−1) by cycle rate. Cycle rate and cycle length were determined for both DP and DS. For DP, the times of active propulsion (i.e., the poling time) and no propulsion (i.e., the swing time) were determined based on the last five completed cycles within the final minute of each submaximal stage. Poling time was then determined as the pole-belt contact time and the swing time as the time of no pole-belt contact. Poling and swing times were presented as the average value of those five cycles. To allow for the computation of relative poling and swing times in DP (presented as percentages of cycle time), the average cycle time was calculated for the same five cycles. The average power output during the propulsive poling phase (i.e., poling power) in DP was determined as the power output (Eq. 1) divided by the poling to cycle time ratio (i.e., the relative poling time).

Average values for cycle rate and cycle length were calculated for each 1-min period of the TTDP and TTDS. For the calculation of cycle rate, the number of full cycles that were completed within each respective minute was counted and cycle rate was calculated as the number of completed cycles divided by the exact time. Cycle length was calculated as the average speed for the specific period divided by cycle rate.

Statistics

Data were checked for normality by visual inspection of Q–Q plots and histograms together with the Shapiro–Wilk analysis and are presented as mean ± standard deviation (SD), except in the case of RPE, where data are presented as median and interquartile range (IQR). One-way repeated-measures ANOVA tests were used to compare the eight submaximal stages within each sub-technique. The physiological responses for TTDP and TTDS were compared with a paired sample t test, except in the case of RPE where a Wilcoxon signed-rank test was used. A two-way repeated-measure ANOVA (sub-technique × time-point [i.e., minutes 1–4 of the respective TT]) was used for the comparison of power output, GE, total metabolic rate, aerobic metabolic rate, anaerobic metabolic rate, and cycle characteristics. An alternative method was used for RPE, comparing sub-technique-specific grand median values with a Wilcoxon signed-rank test and using a Fridman test to analyze the effect of time-point based on grand median value for DP and DS per time-point. The assumption of sphericity was tested using Mauchly’s test, and for violated sphericity, the degrees of freedom were corrected using the Greenhouse–Geisser correction (i.e., epsilon ≤ 0.75). Partial eta-squared effect size (ηp2) was also reported for the ANOVA tests. Bonferroni α corrections were applied to all ANOVA tests. Relationships between variables were assessed using linear regression analyses. For the paired t tests, the standardized mean difference (Hedges’ Hgav, effect size [ES]) was reported (calculated according to the equation provided by Lakens (2013)). For RPE, the r effect size was calculated for the Wilcoxon signed-rank tests, which was calculated as the z-value divided by the square root of N, and the Kendall’s W effect size was calculated for the Fridman test as the χ2-value divided by N(K-1) with K being the number of measurements per subject.

Multivariate data analysis methods were used to examine whether TTDP or TTDS sub-technique-specific performance (W·kg[SM]−1) (Y variable) could be predicted by sub-technique-specific \({\dot{\text{V}}}\)O2peak (ml·kg[SM]−1·min−1), anaerobic capacity (kJ·kg[SM]−1), and GE (%) (X variables). Prediction of TTDP and TTDS performance was achieved using principal component analysis and orthogonal projections to latent structures. Detailed information on these methods has been published previously (Eriksson et al. 2013) and specific application of multivariate data analysis in the prediction of performance in winter sports has also been documented (Jones et al. 2021; Nilsson et al. 2018). To evaluate the importance of specific lab test variables, for predicting TTDP and TTDS performance, variable influences on projection (VIP) analyses were conducted. In orthogonal projections to a latent structures model, VIP summarizes the importance of the X variables, both for the X and Y models. Within valid orthogonal projections to the latent structure's model, VIP is normalized and the average squared VIP value is 1; thus, a VIP > 1 indicates that the variable is very likely to be important for the projection, whereas values < 0.5 indicate that the variable is less likely to be important for the projection. The Statistical Package for the Social Sciences (SPSS 21, IBM Corp., Armonk, NY, USA) and SIMCA Multivariate Data Analysis Software (SIMCA 16.0, MKS AB, Umeå, Sweden) were used to carry out statistical analyses and the level of significance was set at α ≤ 0.05.

Results

Submaximal data

Statistical differences between the submaximal stages are denoted in Table 1. Significant within sub-technique main effects of speed were observed across the submaximal stages for oxygen pulse, GE, cycle rate, cycle length, absolute poling time, relative poling time, relative swing time, and poling power while DP at a 1.5° incline. Within DS at a 6.5° incline, main effects of speed across the submaximal stages were observed for oxygen pulse, cycle rate, and cycle length.

Table 1 Mean ± SD of speed, power output, metabolic rate, oxygen (O2) pulse, gross efficiency, and cycle characteristics associated with the eight submaximal stages (SUB1-8) of double-poling and diagonal-stride roller-skiing

4-min TT data

Mean ± SD power output and physiological responses during the “flat” TTDP (1.5°) and uphill TTDS (6.5°) are shown in Fig. 1. Average power output, average total metabolic rate, average aerobic metabolic rate, average anaerobic metabolic rate, anaerobic capacity, anaerobic work capacity, average GE, \({\dot{\text{V}}}\)O2peak, and heart rate at \({\dot{\text{V}}}\)O2peak were all significantly lower during TTDP than TTDS (Fig. 1A–H, J). The average ventilation rate was similar for TTDP and TTDS (165 ± 20 and 165 ± 18 L·min−1, 95% CI of the difference: –7.31 to 7.21, P = 0.988, ES = 0.00). The average breathing frequency was significantly higher for TTDP than TTDS (62 ± 5 and 54 ± 7 breaths·min−1, 95% CI of the difference: 3.43 to 13.20, P = 0.002, ES = 1.33). Both average cycle rate and cycle length were significantly higher for TTDP than TTDS (cycle rate: 1.04 ± 0.08 and 0.86 ± 0.04 Hz, 95% CI of the difference: 0.12 to 0.34, P < 0.001, ES = 1.35; cycle length: 6.94 ± 0.57 and 4.40 ± 0.16 m, 95% CI of the difference: 2.14 to 3.50, P < 0.001, ES = 3.03). The blood lactate concentration measured 2 min after the TT was not significantly different between TTDP and TTDS (11.7 ± 2.1 and 12.6 ± 2.5 mmol·L−1, 95% CI of the difference: –2.23 to 0.55, P = 0.216, ES = -–0.35). The RPE was significantly lower for TTDP than TTDS (9.0 [IQR: 9.0–10.0] and 10.0 [IQR: 9.8–10.0], P = 0.014, ES (r) = -–0.62). Between-sub-technique specific linear relationships for power output, \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE as based on the separate TT performances are shown in Fig. 2A–D.

Fig. 1
figure 1

Mean ± SD power output (PO) and physiological responses to the 4-min treadmill roller-skiing time-trial tests using the double-poling (DP) and diagonal-stride (DS) sub-techniques at uphill gradients of 1.5° and 6.5°, respectively. Abbreviations: MR, metabolic rate; AnC, anaerobic capacity; AnWC, anaerobic work capacity; GE, gross efficiency; \(\dot{\mathrm{V}}\)O2peak, peak oxygen uptake; O2, oxygen; HR, heart rate; RER, respiratory exchange ratio; BM, body mass; SM, system mass; 95% CI, 95% confidence interval of the mean difference; ES, effect size (Hedges’ gav effect size). *Significant difference between conditions (P < 0.05)

Fig. 2
figure 2

Linear relationships between performance A and physiological responses BD for the 4-min diagonal-stride (DS) time-trial (TT) at 6.5° (x-axis) and for the 4-min double-poling (DP) TT at 1.5° (y-axis). Abbreviations: POTT, average time-trial power output (i.e., performance); \(\dot{\mathrm{V}}\)O2peak, peak oxygen uptake; AnC, anaerobic capacity; GE, gross efficiency; SM, system mass; BM, body mass. The gray dashed line represents the identity line (i.e., y = x). *Significant R2 (P < 0.05)

Pacing

Mean ± SD instantaneous speed, mean instantaneous power output, mean instantaneous aerobic contribution to power output, mean instantaneous total metabolic rate, and mean instantaneous aerobic contribution to metabolic rate for TTDP and TTDS are shown in Fig. 3A–F. Statistical comparisons between each of the four 1-min segments for the TTDP versus TTDS are presented in Table 2. For the sub-technique comparison, power output, GE, total metabolic rate, aerobic metabolic rate, anaerobic metabolic rate, and RPE were all lower during TTDP than TTDS, whereas cycle rate and cycle length were higher during TTDP (Table 2). Significant main effects of time-point were observed for power output, GE, total metabolic rate, aerobic metabolic rate, anaerobic metabolic rate, cycle rate, and RPE (Table 2). Significant sub-technique × time-point (minutes 1–4 of the TT) interactions were observed for power output, GE, and cycle length, whereas no interactions were observed for total metabolic rate, aerobic metabolic rate, and anaerobic metabolic rate.

Fig. 3
figure 3

Mean ± SD speed, mean power output (PO), mean aerobic contribution to PO (AE PO contr.), mean total metabolic rate (MR), mean aerobic contribution to MR (AE MR contr.) for the 4-min double-poling (DP) time-trial (TT), in panels A–C, and for the 4-min diagonal stride (DS) TT, in panels D-F. The shaded area in panel B and E, respectively, represents the anaerobically attributable work (ANW) (i.e., the anaerobic work capacity), and the shaded area in panel C and F, respectively, represent the anaerobic energy expenditure (ANEE) (i.e., the anaerobic capacity). The white area in panels B and E, respectively, represents the aerobically attributable work (AEW), and the white area in panels C and F, respectively, represents the aerobic energy expenditure (AEEE). PO and MR are expressed relative to system mass (SM), i.e., the sum of body mass and equipment mass

Table 2 Mean ± SD for mean power output, gross efficiency, total metabolic rate, aerobic metabolic rate, anaerobic metabolic rate, cycle rate, cycle length, and rating of perceived exertion for the double-poling (DP) and diagonal-stride (DS) sub-techniques during each 1-min interval of the 4-min time-trial

Performance determinants and time-trial performance

For each respective TT, linear relationships between \({\dot{\text{V}}}\)O2peak versus average power output, anaerobic capacity versus average power output, and GE versus average power output are presented in Fig. 4A–F. Valid predictive models were identified for TTDP and TTDS performance. The combination of variables including \({\dot{\text{V}}}\)O2peak (mL·kg[SM]−1·min−1), anaerobic capacity (kJ·kg[SM]−1), and GE (%), was able to predict TTDP (R2/Q2 adjusted = 0.97/0.96) and TTDS (R2/Q2 adjusted = 0.85/0.70) performance. The regression coefficients of the underlying models for predicting new observations of TTDP and TTDS performance are presented in Fig. 5A and C. The importance of \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE in predicting TTDP and TTDS performance are presented in Fig. 5B and D. Although all three physiological variables were of importance in the prediction of TTDP and TTDS performances, the most important was \({\dot{\text{V}}}\)O2peak, which was followed by anaerobic capacity as the second most important variable, and GE as the least important variable, with the same order of importance in both TTDP and TTDS.

Fig. 4
figure 4

The 4-min time-trial power output (POTT) in relationship with peak oxygen uptake (\(\dot{\mathrm{V}}\)O2peak), anaerobic capacity, and gross efficiency for the 4-min double-poling (DP) time-trial, in panels AC, and for the 4-min diagonal-stride (DS) time-trial, in panels DF. The data points represent the individual skiers together with the linear regression line. POTT, \(\dot{\mathrm{V}}\)O2peak, and anaerobic capacity are expressed relative to system mass (SM), i.e., the sum of body mass and equipment mass. *Significant R2 (P < 0.05)

Fig. 5
figure 5

Regression coefficients of the underlying models for predicting new observations of double-poling (DP) (Panel A) and diagonal-stride (DS) (Panel C) time-trial power output (POTT) including lines of best fit. The importance of peak oxygen uptake (\(\dot{\mathrm{V}}\)O2peak [mL·kg[SM]−1·min−1]), anaerobic capacity (AnC [kJ·kg[SM]−1]), and gross efficiency (GE [%]), i.e., the X variables, for predicting DP (Panel B) and DS (Panel D) time-trial performances (i.e., the Y variables). Characteristics with variable influence on projection (VIP) values (in arbitrary units) where higher values indicate a higher relevance for explaining Y. The plot is displayed with 95% jackknife uncertainty bars

Discussion

This study compared the physiological responses and pacing strategies between a “flat” TTDP (1.5° incline) and an uphill 4-min TTDS (6.5° incline) with the main findings as follows: (a) compared to TTDS, TTDP generated 32% lower power output, 10% lower total metabolic rate, 5% lower aerobic metabolic rate, and 32% lower anaerobic metabolic rate; (b) TTDP resulted in 4.7 percentage points lower GE, 30% lower anaerobic capacity, and 4% lower \({\dot{\text{V}}}\)O2peak than TTDS; (c) as based on average power output (in W·kg[SM]−1), the TTDP and TTDS performances were not significantly correlated; and (d) multivariate data analysis methods were able to predict performance using \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE (TTDP, R2 = 0.974; TTDS, R2 = 0.848) with \({\dot{\text{V}}}\)O2peak being the most important variable, anaerobic capacity the second most important variable, and GE the least important variable for each respective TT projection.

The importance of the DP sub-technique has increased during the last decade and has resulted in physiological adaptations (Stöggl and Holmberg 2011; Stöggl et al. 2019, 2020). For example, over an approximately 60-year period, the \({\dot{\text{V}}}\)O2peak in DP relative to \({\dot{\text{V}}}\)O2peak in DS has increased from 70% (in the year of 1961) up to 95% (in the year of 2018) (Stöggl et al. 2019) and may be related to a combination of several factors such as changes in training characteristics, skiing technique, skiing equipment, and track preparation. In the current study, the DP-to-DS \({\dot{\text{V}}}\)O2peak ratio was 96%, which is similar to some recent findings (Andersson et al. 2017; Stöggl et al. 2019) but higher than the consensus finding of 88% (Losnegard 2019). The 96% of DP-to-DS \({\dot{\text{V}}}\)O2peak that was observed in the current study indicates that some senior-elite male cross-country skiers may have an even smaller gap in \({\dot{\text{V}}}\)O2peak between the two sub-techniques, possibly due to more specific upper body training. A training regime that emphasizes more specific DP training for reducing the sub-technique-specific gap in \({\dot{\text{V}}}\)O2peak may be advantageous, since highly trained cross-country skiers may have difficulties in increasing their \({\dot{\text{V}}}\)O2max (or \({\dot{\text{V}}}\)O2peak) in DS.

The two main differences between DP and DS that were observed in the current study were the considerably lower anaerobic capacity and GE in DP, which were the main variables to explain the 32% lower power output during the 4-min TTDP than TTDS. The lower anaerobic capacity for almost flat DP versus uphill DS roller-skiing may be due to the lower total muscle mass involved (Björklund et al. 2015) and higher muscle contraction velocities during DP (Hill 1922; Lindinger and Holmberg 2011; Lindinger et al. 2009). As an example, in running, Sloniger et al. (1997) reported lower values of anaerobic capacity for horizontal versus uphill running that in part was explained by the larger active muscle volume during uphill running. During high-speed DP (> 25 km h−1), the short poling times (< 300 ms) (Losnegard 2019) may limit the ability for force impulse generation and may result in a lower active muscle mass than during uphill DS. This could also explain the slightly lower RPE values for DP (post-RPE = 9) than DS (post-RPE = 10) that were observed in the current study. Moreover, Losnegard and Hallén (2014) proposed that the total active muscle mass is highly related to the magnitude of the oxygen deficit (i.e., anaerobic capacity) and the lower active muscle mass in DP than in DS, as has been observed by Björklund et al. (2015), may, at least in part, explain the sub-technique specific difference in anaerobic capacity that was observed in the current study.

Due to the substantially lower GE in DP than DS, the difference in anaerobic work capacity between DP and DS was considerably larger (43% lower in DP) than the noticed difference in anaerobic capacity (30% lower in DP). The lower GE for almost flat DP than uphill DS may be related to several factors such as the physiological characteristics and contraction velocities of the involved muscle groups (Calbet et al. 2005; Hill 1922). One main difference between almost flat DP and uphill DS is the relative propulsive phase that is considerably shorter during DP than DS (Pellegrini et al. 2011) and is a factor that may explain some of the differences in GE between the two sub-techniques. In addition, GE in DP was found to be speed dependent (see Table 1), whereas GE in DS was independent of speed. As demonstrated in Table 1, the speed-GE dependency in DP is likely to be related to the gradually shorter poling phase (both absolutely and relatively), the higher required poling power, and the more rapid increase in cycle rate for DP at higher speeds (speeds > 5.6 m·s−1). The GE in DP peaked during submaximal speeds ranging between approximately 4.3–5.3 m·s−1, with poling times, cycle rates, and poling powers likely to be the most “optimal” for GE in that speed range.

Although the DP and DS \({\dot{\text{V}}}\)O2peak values were linearly associated (R2 = 0.547), the TTDP and TTDS performances were not significantly associated (R2 = 0.044) (Fig. 2A–B). In addition, anaerobic capacity in DP was not significantly associated with anaerobic capacity in DS, and a similar finding was observed for GE (Fig. 2C–D). These results suggest that several physiological and anthropometrical factors that may favor DP performance do not necessarily favor DS performance. The locomotion of the DP and DS sub-techniques is also very different (Pellegrini et al. 2013), and for an effective DP technique, well-developed upper body strength and endurance are likely to be more crucial than in DS (Stöggl et al. 2019). Such factors may also explain why a high anaerobic capacity in DS does not directly transfer to a high anaerobic capacity in DP.

Pacing can from an internal metabolic perspective be described as the distribution of total metabolic rate (i.e., the sum of both aerobic and anaerobic metabolic rates) during a maximal effort (Andersson et al. 2016; Foster et al. 2003). In the current study, 4-min self-paced maximal efforts were performed with DP and DS, respectively. As shown in Fig. 3, the time-course profiles of speed, power output, and metabolic rates were relatively similar between DP and DS. The absence of significant sub-technique × time-point interactions for total metabolic rate, aerobic metabolic rate, and anaerobic metabolic rate confirms that similar parabolic pacing strategies were used from an internal metabolic perspective. Self-paced roller-skiing TT performance in a laboratory is related to total metabolic rate (i.e., the sum of aerobic and anaerobic metabolic rates) multiplied by GE which determines the magnitude of the total metabolic rate that is being converted to external power output (Andersson et al. 2017). As demonstrated in Fig. 4, the pair-wise linear relationships between \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE versus performance revealed DS \({\dot{\text{V}}}\)O2peak to be significantly associated with the TTDS performance, whereas none of the other pair-wise linear relationships showed significance.

The multivariate data analysis method revealed that \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE predicted both TTDP and TTDS performance to a large extent (R2 = 0.974 in DP and R2 = 0.848 in DS). The VIP (i.e., variable influence on projection) values were highest for \({\dot{\text{V}}}\)O2peak in both TTDP and TTDS, followed by anaerobic capacity, and GE (for details, see Fig. 5A–D). Based on these findings, all three variables had a relatively large influence on predicting performance in this group of highly trained cross-country skiers. The somewhat lower projective ability of anaerobic capacity than \({\dot{\text{V}}}\)O2peak for TT performance observed here may, at least in part, be related to the fact that most of the energy turnover was aerobic (on average 86% and 81% during TTDP and TTDS, respectively). In connection with classic-style mass-start races and sprint knock-out heats where athletes race head-to-head, the relative importance of anaerobic factors (power and capacity) to race performance is likely to be even higher, because such races usually are finished with a brief high-speed end spurt involving DP. In addition to the anaerobic factors, the high variability in submaximal GE during DP is likely to play a crucial role in races. The between-athlete variability observed was approximately twice as high for DP than DS with the respective coefficient of variations of 7.1 and 3.5% at the highest submaximal speed. At high supramaximal DP speeds, the between-athlete variability in GE is likely to be even higher due to the short time for force generation (Losnegard 2019; Stöggl and Holmberg 2011). Due to this, GE may be an important performance determinant for maximal DP speed that is likely to be linked to a skier’s muscle strength, muscle power, and technical characteristics (Stöggl and Holmberg 2011).

The lack of significant pair-wise linear relationships between \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE versus TT performance indicate that the physiological characteristics of cross-country skiers versus their performance ability in DP and DS are very heterogeneous. This together with the fact that all the three main performance factors (i.e., \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE) were important in the projection of TT performance (see Fig. 5) which suggest that all three factors should be evaluated, sub-technique-specifically, during a cross-country skier’s training year. For this purpose, a roller-skiing TT is likely to be preferable compared to an incremental \({\dot{\text{V}}}\)O2max time-to-exhaustion test as it provides a more ecologically valid and reliable measure of performance (McGawley 2017) and can be used to determine aerobic and anaerobic metabolic responses as well as pacing strategies that all are important to performance (Andersson et al. 2016; Losnegard et al. 2012, 2013). For example, Losnegard et al. (2013) showed in the V2 ski-skating sub-technique (also referred to as G3) that movement economy and anaerobic capacity changed significantly during a 1-year training/racing period, whereas \({\dot{\text{V}}}\)O2peak remained constant across the year despite changes in performance. Moreover, Sandbakk et al. (2016b) showed that sub-technique-specific performance based on a 3-min roller-skiing TT in a laboratory-predicted section-specific performance during a traditional 10-km classic-style cross-country skiing race. Altogether, the results of the current study and previous findings (Losnegard et al. 2013; Sandbakk et al. 2016b) indicate that sub-technique-specific performance factors in DP and DS should be tested during the training season, so that an athlete’s strengths and weaknesses can be identified, which could allow for appropriate individual evaluation and/or adjustments in an athlete’s training.

In conclusion, the main physiological differences between TTDP and TTDS were related to the somewhat lower aerobic metabolic rate (5%) and substantially lower anaerobic metabolic rate (30%). The conversion of total metabolic rate to external power output was considerably lower during TTDP than TTDS with a considerably lower GE for TTDP than TTDS, which resulted in a 32% lower external power output during TTDP than TTDS. The current study reveals that highly trained male cross-country skiers can attain a \({\dot{\text{V}}}\)O2peak during “flat” DP which is 96% of the \({\dot{\text{V}}}\)O2peak attained during uphill DS but can only generate an anaerobic capacity that is 70% of the DS value. The time-course distribution of total metabolic rate during the 4-min TT did not differ between the two sub-techniques, which based on an internal metabolic perspective confirms that similar parabolic pacing strategies were used. The \({\dot{\text{V}}}\)O2peak, anaerobic capacity, and GE predicted both TTDP and TTDS performances with \({\dot{\text{V}}}\)O2peak and anaerobic capacity having the greatest projective ability.

Perspectives

The results of the current study demonstrate that a skier’s “metabolic profile” is sub-technique specific and that highly trained male cross-country skiers can reach a very high fraction of their “whole-body” exercise \({\dot{\text{V}}}\)O2peak (i.e., \({\dot{\text{V}}}\)O2peak in DS) during DP. The results also show that skiers that perform well in DP do not necessarily perform well in DS. The findings also show that the anaerobic capacity (or GE) in one sub-technique is not directly related to the anaerobic capacity (or GE) in the other sub-technique, which indicates that both physiological characteristics and cross-country skiing performance are highly related to the type of sub-technique being used. Even though the physiological characteristics were different for DP and DS exercise, similar parabolic pacing strategies were used during TTDP and TTDS. Since the overall race performance in a traditional cross-country skiing race is the sum of all the sub-technique-specific performances (Sandbakk et al. 2016b), a testing procedure using a 4-min roller-skiing TT to identify key-performance-related physiological variables in each of the main sub-techniques of cross-country skiing may be a prerequisite for optimal training evaluation. It is therefore important to assess \({\dot{\text{V}}}\)O2peak (or \({\dot{\text{V}}}\)O2max), anaerobic capacity, and GE, on a sub-technique-specific level, as all three variables were shown to be important in the projection of 4-min “sprint-skiing” performance.