Introduction

Numerous studies have been conducted to investigate what is the optimal upcoming daily training plan in terms of balancing exercise loads (e.g. exercise intensity and volume) and proper recovery [e.g. 14]. Some evidence even suggests that daily directed training decisions based on certain physiologic metrics can lead to longer term performance improvements [5,6,7]. This monitoring-based decision-making requires an individual approach [8].

In this context, the term "readiness to train" is often used to describe this process, but there is still a lack of both a universal understanding of this concept, its measurability, and validity [9]. Descriptions of this concept include, the absence of signs of impaired physical performance, mental fatigue or excessive psychological distress and “the athlete’s capacity to complete training activities and perform during competition” [8]. Systemic readiness is most commonly assessed through whole-body neuromuscular function as a marker of fatigue as well as by longitudinal monitoring of chronic fitness adaptations [10]. It is expected that neuromuscular performance (e.g. Counter Movement Jump (CMJ), repeated rebound jumps, sprint testing) worsens with increases in fatigue which is the response to workload alterations [10]. Since it is claimed that “readiness” has also physical, physiological and psychosocial underpinnings [10], monitoring options may also include measures about metabolic (e.g. blood lactate), psychological (e.g. psychometric scales, rating of perceived exertion, RPE) and autonomic status with heart rate (HR) variability (HRV) being attributed to the last group [4]. There are promising results for HRV indices (e.g. high frequency (HF) power, root mean square of successive differences (RMSSD)) with regard to functional overreaching [11], autonomic recovery status [1, 12] or training status and “readiness to perform” [13] when applied during rest or post exercise conditions [1, 11, 12] allowing one to select an appropriate training effort based on a proxy of the regulation balance associated with the autonomic nervous system (ANS). However, studies recommend the parallel use of specific physiological markers with daily training logs and/or psychometric short scales to take full advantage of such an approach [1, 14, 15]. Although seemingly simple at the surface, studies report large day-to-day variations in isolated resting HRV measures (most studied time-domain metric: RMSSD) due to multiple influencing factors like environmental conditions [16], exercise induced changes in blood plasma volume [17] or the presence of residual (acute) fatigue from previous training sessions [18]. From the logistical standpoint, resting HRV requires a regular day-to-day monitoring routine including standardization (e.g., time of day, body position; [19]). Since an application of conventional time-domain parameters during endurance exercise is even less informative due to a loss of dynamics past the aerobic threshold [20], alternative approaches for HRV analysis are needed.

A recent HRV metric that has been associated with endurance exercise related fatigue [21] is the non-linear index alpha 1 of Detrended Fluctuation Analysis (DFA a1) representing fractal correlation properties of the beat to beat pattern. Its dynamic range varies between ~ 1.5 mirroring a more correlated or periodic behavior and ~ 0.5 indicating a loss of fractal dynamics and complexity toward a more random behavior (disorganized randomness) [22]. Studies have shown its low dependency on HR [23, 24], as well as its suitability to describe the complex cardiac autonomic regulation during various exercise intensities, modalities, and environmental conditions [25]. While higher values were attributed to a reintegration and synchronization of subsystems, lower values were supposed to indicate a disintegrating and centralization process [22, 25]. In addition to its piloting usage for detecting endurance exercise fatigue it has also been explored as a delimiter for physiological thresholds during incremental exercise protocols [26,27,28,29]. In one fatigue-related study DFA a1 was analyzed before and after an ultramarathon in an exercise group and respectively pre and post daily activity in a control group [21]. DFA a1 exhibited suppressed behavior only in the ultramarathon group during constant low intensity exercise at around the aerobic threshold, post ultramarathon, associated with reduced CMJ scores indicating neuromuscular fatigue. These results are consistent with the assumption that DFA a1 is associated with total organismic demands including comprehensive organizational approaches such as the Network Physiology of Exercise (NPE) [30]. This programmatic approach aids to understand exercise-related phenomena (e.g. sports performance, fatigue, overtraining) by compiling multiple neuromuscular, biochemical, peripheral and central nervous system inputs to an overall concept of “organismic demand” [30]. Since this analysis was the first to evaluate DFA a1 as an opportunity for potential real-time tracking of physiological status, further investigation is needed to explore its’ applicability to evaluate the athlete's status in everyday training routines. One example would be an assessment during the warm-up phase of dynamic submaximal exercise to help determine whether autonomic balance is disrupted and thereby leading to adjustments in exercise load. Submaximal fitness tests have been shown to provide a feasible approach to evaluate an athlete’s physiological state (e.g. time-efficiency, low physiological impact, feasibility for implementation) [31]. Such submaximal tests provide a pragmatic approach observing internal load responses in relation to a standardized physical stimulus. Shushan et al. [31] define submaximal tests as “[…] short exercise bouts, undertaken at a standardized intensity that is intended to be non-exhausting, and performed with the purpose of inferring an athlete’s physiological state through the monitoring of relevant outcome measures”. With a prior exercise session (or accumulated workload over multiple sessions) potentially causing some level of fatigue, it would be of interest to see if DFA a1 would show inappropriate suppression, while maintained or elevated values would mirror a positively altered physiological status.

The following report is an initial exploration of the ability of DFA a1 to assess this state during 10-min bouts of cycling at 90% power of the first ventilatory threshold (VT1, representing a warm-up bout) before and after (1) a light and (2) a heavy running exercise session separated by one week.

Materials and methods

Participants

Sixteen triathletes were recruited from a local triathlon club. Inclusion criteria comprised healthy men and women in an age range of 18 to 60 with active participation in triathlon and the willingness to attend the 5 planned laboratory visits. Participants were excluded in the case of previous medical history, current medications, and recent illness or with an exceeding artifact number in their data sets. After dropout or rejection of five athletes due to time constraints, injuries or data issues (artifact occurrence rate: > 5%), eleven (ten males, one female) could be included for analysis. They aged 37 ± 10 (range: 21.0–55.0) years and comprised a mean body weight of 72.8 ± 10.4 kg and mean height of 182 ± 8 cm. Triathlon related exercise totaled 9.0 ± 4.4 (range: 3.0–18.0) hours per week. The study was performed in accordance with the Declaration of Helsinki and all participants received the detailed description of the experiment before providing their written informed consent. Ethical approval was obtained from the local ethics committee (University of Hamburg, Department of Psychology and Movement Science, Germany, reference no.: 2021_407).

Baseline assessment

Each participant was invited to an initial appointment where diagnostics and familiarization of testing routine were performed. An incremental exercise test on a mechanically braked cycle (Ergoselect 4 SN, Ergoline GmbH, Germany) was used to determine the exercise intensity (defined as cycling power in watts (W)) for the further tests and for performance level assessment. The ramp protocol comprised a 3 min ride at 50 W with an increase in power by 1 W every 3.6 s (equivalent to 16.7 W per min). RPE and lactate samples from the capillary blood of the earlobe were taken before start, every 3 min and immediately after stopping the exercise. HR, HRV measures and gas exchange kinetics were recorded continuously with a single channel ECG chest belt device (Movesense Medical sensor, firmware version 2.0.99, Movesense, Finland; sampling rate: 512 Hz; software application: Movesense Showcase app version 1.0.9 for iOS; [32]), as well as with a metabolic analyzer (Quark CPET, Omnia software, version 1.6.5, module A-67-100-02, Cosmed, Italy). The protocol was terminated when the participants could not either hold the predetermined cycling cadence (60–80 rpm) or due to voluntary exhaustion, discomfort or pain. Exhaustion was assumed when the following criteria were fulfilled: (A) heart rate > 90% of the maximum predicted heart rate (prediction model according to Tanaka et al. [33]: 208 − (0.7 × age) and (B) respiratory quotient > 1.15 [34]. Maximum oxygen uptake (VO2max) and maximum HR (HRmax) were defined as the average VO2 and HR over the last 30 s of the test. For maximum power (Pmax) the highest observed value was considered.

Determination of the VT1 was made based on the approach of [26]. Thus, oxygen uptake (VO2), carbon dioxide (VCO2), end-tidal oxygen concentration (PetO2), end-tidal carbon dioxide concentration (PetCO2) and minute ventilation (VE) were plotted to apply modified V-slope method, ventilatory equivalencies, excess CO2 production and PetO2 nadir. The first three procedures were based on the recommendations of Gaskill et al. [35] and the last one was suggested by Binder et al. [36]. Power at VT1 was defined as the instantaneous (non-averaged) cycling power reached at that time with 90% thereof prescribing the intensity for follow up DFA a1 testing during the 10-min cycling protocols intended to represent a traditional warm-up. Data resulting from these baseline assessments can be found in Table 1. Familiarization was performed for the upcoming follow-up sessions including the explanation of the further course and the applied methods. The athletes had the opportunity to ask questions and practice on the ground reaction force platform.

Table 1 Demographic data and data from the baseline assessment of all included participants (n = 11)

Follow-up study design

Triathletes were tested on four other occasions: immediately before (PRE) and within 36 h after (POST) (1) a light running session and, (2) a heavy running session separated by one week. The laboratory visits included a designed monitoring test battery of approximately 30 min’ duration consisting of psychometrics, DFA a1 recording during a 10-min low intensity warm-up session at 90% power of VT1 and neuromuscular testing. The training content was designed by the triathletes’ coach in mutual agreement with our research group (Table 2). While the heavy exercise session contained 24 min in a nearly maximal intensity range (95% HRmax), there were just two intensity peaks of 80% HRmax in the light exercise session. Internal workload of these sessions was also assessed by means of a category-ratio “rating of perceived exertion” (RPE) scale from 0 to 10 [37] queried after completed training, so that it could be assessed whether the exercise sessions had fulfilled their purpose of light or heavy impact. Since both sessions comprised a duration of 60 min, multiplication with session duration was neglected. An overview of the study design is shown in Fig. 1.

Table 2 Training schedule of the performed sessions controlled by percent of maximum heart rate (%HRmax)
Fig. 1
figure 1

An overview of the study design consisting of the baseline assessment and the follow-up testing

Upon arrival, the participant was asked to complete a psychometric short questionnaire named Short Recovery and Stress Scale (SRSS) endorsed by studies due to its economic, multidimensional and sensitive properties [38,39,40]. Eight subscales should be rated with a seven-point Likert-type scale ranging from 0 (“does not apply at all”) to 6 (“fully applies”). These include: physical performance capability (PPC), mental performance capability (MPC), emotional balance (EB), general overall recovery (OR), muscular stress (MS), lack of activation (LA), emotional imbalance (EI) and overall stress (OS). For data analysis, mean score for recovery (recovery score (rec); PPC, MPC, EB, OR) and stress (stress (stress) score; MS, LA, EI, OS) were implemented as well as the respective subscales.

Directly after completing the short scale, participants performed a bicycle protocol (LC6 Novo, Monark Exercise AB, Sweden) at the predefined intensity (by means of power) determined in the baseline assessment (90% VT1, see Table 1) and for a duration of ten minutes.

Finally, assessment of effects of the performed exercise session was obtained by the CMJ and Foot Tapping (FT) test on a ground reaction force platform (Leonardo Mechanograph GRFP, Novotec Medical GmbH, Pforzheim, Germany) with associated software (Leonardo Mechanography Software Version 4.3). Three CMJs were carried out according to the specifications of [41] without arm swing. Mean jump height (CMJh) and mean vertical peak force normalized to the participants’ body weight (CMJf) was used for further processing. FT was performed two times in a standing position for 15 s as described by Krauss [42]. Out of these trials, the best one was chosen. Furthermore, number of foot contacts (FTc) and frequency (FTf) were considered for statistical analysis.

RR measurements and calculation of DFA a1

Single channel ECG.csv files (obtained by the Movesense Medical sensor and exported from the Movesense Showcase app; see Baseline Assessment) were imported into Kubios HRV Premium software Version 3.5.0 (Biosignal Analysis and Medical Imaging Group, Department of Physics, University of Kuopio, Kuopio, Finland, [43]). Preprocessing settings were set to the default values including the RR detrending method which was kept at “smoothness priors” (Lambda = 500). For DFA a1 calculation, the root mean square fluctuation of the integrated and detrended data is measured in observation windows of different sizes. This is done using a logarithmic plot of the data against the size of the window. The resulting slope of the line relating the (log) fluctuation to the (log) window size represents the scaling exponent [44]. DFA a1 window width was set to 4 ≤ n ≤ 16 beats [23]. Artifacts in the RR series were corrected by the Kubios “automatic method” [45] and subject data excluded from further analyses when the overall percent artifact exceeded 5% [46]. To minimize influencing factors at the beginning of the warm-up, DFA a1 and average HR were calculated from the RR data series of the 2-min time window consisting of the start of minute 7 to the end of minute 8 of the cycling exercise on all four occasions. The 2-min window width was chosen based on the minimal beats required for valid DFA a1 calculation [47,48,49].

Statistics

Statistical analysis of means and standard deviations (SD) were performed in Microsoft Excel 365, further calculations were done using SPSS 23.0 (IBM Statistics, United States) for Windows (Microsoft, USA). The Shapiro–Wilk test was applied to verify the Gaussian distribution of the data. PRE versus POST differences were respectively analyzed for both conditions (light and heavy exercise session), with a t-test for paired samples in parametric data, whereas nonparametric data was analyzed with a Wilcoxon test for paired samples. Mean differences were specified with the respective 90% confidence interval (90% CI). Statistical significance was accepted at p < 0.05. Cohen’s effect sizes were calculated and interpreted as trivial: < 0.2, small: ≤ 0.2 to < 0.5, moderate: ≥ 0.5 to < 0.8, or large: ≥ 0.8 [50]. In addition, magnitude-based inferences were used to report the percentage changes quantitatively (descriptor: negative/trivial/positive) and in relation to the smallest worthwhile change (SWC) as well as qualitatively with indications about clinically relevance [51]. SWC was determined according to the recommendation of Buchheit et al. [14] with the coefficient of variation (CV = SD/Mean) calculated for the whole group at PRE of the light session and multiplied by 0.3 and respective means of performance measures as well as by 0.5 and means of physiological markers. Qualitative interpretations were based on this preset scale: < 0.5% “most unlikely”, 0.5–5% “very unlikely”, 5–25% “unlikely”, 25–75% “possibly”, 75–95% “likely”, 95–99.5 “very likely”, > 99.5% “most likely”. Analyses of the difference between means were performed using the “xcl_Bayesian.xls” spreadsheet obtained from http://sportsci.org/2007/wghinf.htm [52]. Furthermore, individual changes in DFA a1 were assessed based on the typical error of measurement (TE) and SWC using the “xprecision.xls” spreadsheet available on http://sportsci.org/2017/wghtrend.htm [53]. TE was determined according to the recommendations of [54] by calculating the SD of the difference scores from PRE light session and PRE heavy session values and dividing the value by the root of (2). Precision change was specified using 50% CI also based on the reasoning of Swinton et al. [54].

Results

Mean and SD for the evaluated data can be seen in Tables 3 and 4. RPE was significantly higher for the heavy as opposed to the light running exercise session with large effect size (p < 0.001, d = 0.89). With regard to the PRE versus POST comparisons in group means (see Table 5), there were significant reduced DFA a1 values with large effect size after the heavy exercise session (p = 0.001, d = − 1.44) and a 99% (“very likely”) chance that this negative change was clinically relevant. Moreover, significant changes could only be found in CMJ vertical peak force (p = 0.020, d = 0.84) and FT frequency (p = 0.021. d = 0.82) showing higher results following the light exercise session. Although when considering magnitude-based inferences the chances for the true changes were rated “possibly" positive (00/30/70 and 00/65/35, respectively). In addition, inter-individual differences could be observed and the magnitude of change of all participants of DFA a1 due to the respective exercise sessions is depicted in Fig. 2 and is calculated by making use of the determined SWC (0.12) and TE (0.18) listed in Table 5.

Table 3 PRE and POST light running exercise session data (n = 11)
Table 4 PRE and POST heavy running exercise session data (n = 11)
Table 5 Statistical comparison of the measures for PRE and POST conditions respectively
Fig. 2
figure 2

Individual magnitude of change in DFA a1 due to the A light and B heavy running exercise session for all triathletes. Legend: The absolute differences are depicted by means of confidence intervals (50% CI). The vertical yellow lines mark the smallest worthwhile change representing the area of trivial changes. Percentages and descriptors inform about the individuals’ magnitude of changes in a quantitative and qualitative manner

Discussion

This study examined whether DFA a1 behavior during a low intensity warm-up cycling session could be used to assess an altered physiological status as a surrogate for “readiness to train” and thus guide athletes in their training routine as an easy-to-apply monitoring procedure. In accordance with the findings of Rogers et al. [21], this study indicates a relative suppression of DFA a1 during low intensity exercise, even up to 36 h post exercise. Magnitude-based inferences indicated that suppression of DFA a1 in POST heavy exercise session (99% negative chance) was “very likely” clinically relevant, while alterations in HR and psychometrics were interpreted as “trivial” (74–100% chance). Performance measures demonstrated similar qualitative descriptors as HR and psychometrics in response to the heavy exercise session with no significant change, although showing possibly positive alterations which could only be affirmed in FTf. Interestingly, psychometrics and both HR and DFA a1 showed no sensitivity in response to the light exercise session (64–100% “trivial” chance), while changes in CMJh, CMJf and FTf illustrated a higher likelihood of being clinically positive (35–70% chance). However, significance was only attained in CMJf and FTf.

With respect to the explanations of the “readiness” construct from the introduction, this state was maintained after both training sessions pointing to an absence of fatigue due to no deterioration in neuromuscular function (CMJ, FT) nor SRSS (by definition). However, since practicability of neuromuscular function assessments is challenged by contextual and individual elements (e.g. motivation, familarization, physical qualities, season stage) that may undermine inferences derived from the data [31] as well as varying sensitivity of the metrics depending on the time course of the effect [10], it becomes clear that decision-making based solely on (neuromuscular) performance metrics may not always be correct. Taking a deeper look on the analysis of the SRSS subscales (not included in the tables for reasons of clarity), OR did show a significant decrease when comparing PRE and POST values of the heavy exercise session with moderate effect size (p = 0.031, d = − 0.62). The participants felt less refreshed, rested, muscularly loose and physically relaxed after the heavy running exercise session performed. This is probably also reflected in DFA a1 since this metric is assumed to be a proxy of autonomic balance and organismic demands [25].

Standardized submaximal exercise assessments are very often used to monitor physiological status with HR-derived indices being the most studied outcome measure [31]. However, they seem to detect especially positive chronic endurance-oriented training effects rather than negative transient effects associated with variations in autonomic nervous system function [31]. Although it is a relatively novel concept to use DFA a1 as an outcome measure during a standardized submaximal exercise session, the findings of the existing studies to date [21, 55] are already used by a web based application (AI endurance: https://aiendurance.com/) for the purpose of training monitoring. This training tool assumes that a suppressed DFA a1 from a baseline at a given power or pace indicates that an individual is not performing well and not ready to train or race with high exercise intensity and/or volume [21]. In support of this concept, although there was a drop in DFA a1 for all triathletes after the heavy exercise session indicating a loss of correlation properties of HR time series (and a disturbance of the ANS regulation), some showed greater declines than others, pointing to highly individualized responses to the exercise intensity performed. It is therefore crucial to observe the extent of changes in an individual athlete (for example by means of magnitude-based inferences shown for all participants in Fig. 2), rather than to derive general tendencies or statistical significance of the change. This can be illustrated by examining two example cases: Both participants 4 and 7 showed “likely” and “possibly” PRE versus POST increases in DFA a1 with light exercise session (82 and 74% positive chance respectively). However, the comparison regarding heavy exercise session revealed deviating trends: While the change of participant 7 remained mostly within the determined SWC and values of DFA a1 were still between the range of 1.0 to 1.5, there was a possible decrease (64% negative chance) observed to a value near the anti-correlated range (< 0.5) in participant 4 pointing to a physiologic fatigue status and potential metabolic destabilization [21, 56]. Therefore, it may be reasonable to recommend a decreased exercise load for the upcoming session as a result of the abnormal autonomic response of athlete 4 during the warm-up. For participant 7 training could be continued in the intended way (proceed as planned). With the implementation of this procedure on a regular basis, it may be possible to capture not only acute status, but performance improvements or load tolerance when observing DFA a1 kinetics in regard to different exercise and training interventions. However, this should be investigated in further studies with larger and more homogeneous populations.

In context of the mentioned NPE approach, a reduction of the human organism to any single component is incomplete as the synchronization and interaction of components generates novel information which determines the function of subelements and of the system itself [30]. Since DFA a1 is suspected to be capable to quantify this dynamic network interactions in regards of ANS regulation, it would seem obvious to designate this metric as a biomarker to assess the physiological status as a surrogate metric for “readiness to train”. Nevertheless, the current conceptualization of the term “readiness to train” is highly related to neuromuscular function metrics which has not changed in comparison to DFA a1 in the present data. We therefore hypothesize DFA a1 to be better suitable to reflect the overall physiological status rather than to just depict one measure of a physiological subsystem.

Limitations and future directions

This study focused on the assessment of the physiological status as a surrogate metric for “readiness to train” following a light and heavy running exercise session. Although an acute estimation could be given, it would have been interesting to know the broader training context of the participants which has been not assessed and analyzed. It was not known if there were some triathletes experiencing an overreached, non-overreached or even overtrained state since this could further elucidate potential individual deviations. Additionally, internal load of exercise sessions performed was only quantified by RPE. It would have been useful to capture the internal load in more detail to assure that the sessions were executed according to the prescription and to minimize potential bias in the triathletes’ ratings due to the knowledge of their coaches’ training conception. Future work should account for optimized methodological considerations with regard to the implemented tests, evaluation methods or familiarization periods, so that a fatigued state can be assumed with high certainty. A familiarization and baseline period for some of the metrics could serve for establishing statistical reference data (TE, CV, SWC) for the test measures used a prori. This study calculated CV and SWC from the group values of PRE light exercise session and TE using PRE light and PRE heavy exercise session data from all participants where true scores are not expected to change [54]. Due to the small sample size and low number of repeated tests, the obtained values for TE, CV and SWC are expected to be more accurate with a range of intra-individual data, potentially affecting the results obtained from the magnitude-based inferences. With regard to DFA a1, an individual could obtain better insights by examining their personal SWC, CV and TE during the same exercise protocol [54]. In general, this statistical approach can also help practitioners in decision-making for individual monitoring, irrespective of the parameter used.

Currently, there are real-time approaches available for smartphone application (e.g. Fatmaxxer: https://github.com/IanPeake/FatMaxxer) providing continuous monitoring with a DFA a1 recalculation every 5–30 s (user defined). This provides the opportunity to apply the described monitoring approach in the warm-up period to get a quick snapshot of the physiological status. It would certainly be worthwhile if there were software options to save historical data, thus creating a baseline that corresponds to the SWC corridor (as already used in a similar manner in the already mentioned AI endurance application). With regular data collection, more accurate assessments could be made, which would potentially allow the user to better manage their exercise and training intensity distribution.

Conclusion

Observation of the DFA a1 to power/pace relationship during simulated warm-up bouts (performed PRE- and within 36 h POST-exercise at both low and high demand) seems to represent a valuable measure to depict physiological status in this heterogenous group of triathletes. In the same manner as monitoring resting HRV physiology, it may be possible to utilize inappropriate suppression of DFA a1 at low exercise intensity as a submaximal test and as a means of daily directed training. A regular assessment including individual data history and statistical references for identification of response (change in true score exceeds the smallest worthwhile change, and a normal intra-individual range based on historical data) is recommended. Further data are necessary to confirm the results in a larger and more homogeneous population.