Key Points

  • Predictive equations to establish muscle maximal voluntary isometric endurance present a good reproducibility for all muscle groups.

  • These equations are useful in the field of prevention and/or rehabilitation (risk of falling, low back pain, etc.) or sport (muscle injury risks).

Introduction

The assessment of muscle performance is a requirement in many fields such as prevention [1], sport [2], or rehabilitation [3], as the results of this assessment are used to prevent muscle injuries, guide and individualize training programs. Thus, the knowledge of reference values is a necessity to achieve the above goals. Muscle performance can be measured by different dynamometers, such as the isokinetic dynamometers [4], or the manual dynamometers [5]. Although the use of the isokinetic dynamometers is considered as the gold standard instrument to evaluate muscle strength [6], these devices remain very expensive, bulky, and require trained personnel to handle them and interpret the results. On the other hand, to measure muscle strength, the manual dynamometers present a good alternative, with a good reproducibility for the assessment of the upper and lower limbs [7, 8]. However, this reproducibility depends on the model of dynamometers used during the assessment [9]. In addition, although many authors propose reference values according to anthropometric data [5, 7, 10], these reference values depend on the model of the manual dynamometer and the calibration of the instrument [9]. So, the use of manual dynamometers remains limited as the comparison of the results between models is complicated.

Another way to simply evaluate muscle performance is by using field tests, which allows measuring maximal voluntary isometric endurance. In the literature, many field tests evaluating maximal voluntary isometric endurance have been validated, but, although some normative values exist for some muscle endurances tests [11, 12], these norms do not consider the weight, height, gender, and age of individuals. However, all these anthropometric data are well-known factors that positively or negatively influence muscle performance [13]. In addition, the postural position of some field tests often induces discomfort or pain [14] leading to an underestimation of muscle capacity because the test may be stopped prematurely. Therefore, it seems important to modify the postural positions of the subjects during field tests in order to decrease the limiting factors and to consider the anthropometric data of each subject while respecting the validity of the tests. Finally, the evaluation encountered in the literature on maximal voluntary isometric muscle endurance using isometric contractions is generally limited to four muscle groups: quadriceps, abdominals, lumbar, and quadratus lumborum muscles [12, 15]. Therefore, this assessment provides a somewhat limited overview of the muscle performance.

Another variable which must be taken into account in the evaluation of muscle performance is muscle imbalance or the agonist/antagonist muscle ratios. Indeed, in the case of muscle imbalance, a joint disorder and/or pain can result because the stabilization of the joint is no longer ensured [16]. With the knowledge of the muscle deficits, it is possible to propose an individualized training program that can restore a normal ratio [16]. Generally, to obtain the values of the muscle ratios, it is necessary to measure the strength of the agonist and antagonist muscles and then calculate their ratios. The most widely used method is the isokinetic method, but with the limitations of its use explained previously. McGill et al. (1999) have proposed an easier methodology than the isokinetic method, to determine the ratios in a healthy young adult population, using the endurance time of field tests [17]. However, in their study, muscle ratios were limited to three muscle groups, namely, the abdominals, lumbar, and quadratus lumborum muscles. In addition, all muscle ratios were determined from the performance of the lumbar muscles. Therefore, it is not possible to detect the specific imbalance in relationship with its joint and this form of assessment remains marginal. In addition, as highlighted by the authors, their ratios were established from a young population, which might not be applicable to an older population.

Therefore, we first hypothesize that it would be possible to validate field tests of maximal voluntary isometric muscle endurance with adaptive postural positions to best measure muscle performance; secondly, we hypothesize that the anthropometric data of the subjects could influence the maximal voluntary isometric muscle performance.

The objectives of this study were (1) to validate eight field tests of maximal voluntary isometric muscle endurance with adaptive postural positions; (2) to establish predictive equations of the entire muscle performance in a healthy population based on its anthropometric characteristics, which could allow the prevention of muscle injuries and/or to optimize the individualization of training programs in a quick and cheap manner.

Materials and Methods

Study Design

This study was conducted in two phases. The first phase was to check the inter- and intra-assessor reproducibility for isometric muscle field tests, and the second part was to establish muscle maximal voluntary isometric endurance norms according to these tests. The isometric tests allow assessment of the trunk flexor, trunk extensor, quadratus lumborum, quadriceps, hamstring, and back and chest muscles (Fig. 1).

Fig. 1
figure 1

Figures illustrating the measurement muscle isometric endurance. (A) Trunk flexors, (B) trunk extensors, (C) quadratus lumborum muscles, (D) quadriceps muscles, (E) hamstrings muscles, (F) back muscles, and (G) chest muscles

To be included in the study, the selection criteria for the two phases were as follows: no history of chronic or acute disease; a score < 9 on the Baecke questionnaire, which does not correspond to a sporting way of life [18]; no psychiatric or psychological disorders; and no contraindication for exercise.

Before being included in the study, the design, rules, and protocol were explained to each subject as required by the Declaration of Helsinki. Then each subject signed a written consent form. The study protocol was approved by the Behavioral Science Ethics Committee of University of Lille under No. 2019-380-S77.

Part 1: Reproducibility of Muscle Maximal Voluntary Isometric Endurance Field Tests

The aim of this first part of the study was to examine intra- and inter-assessor reproducibility in a test–retest during isometric field tests in order to assess the maximal voluntary isometric endurance of the quadriceps, hamstring, abdominal, trunk extensor, right and left quadratus lumborum, and back and chest muscles (Fig. 1).

Subjects

Twenty subjects (9 males, 11 females), with a mean age of 40.9 ± 11.6 years (mean ± SD), were included (Table 1). Subjects completed the Baecke questionnaire [18] to determine their physical activity level and were asked about any history of medical conditions in order to verify the entry/exclusion criteria.

Table 1 Characteristics of the study population

Protocol

Two assessors, A and B, each performed two evaluation sessions (A1 and A2 for assessor A and B1 and B2 for assessor B). In order to avoid habituation effect, data from A- and B-rater were collected randomly. Prior to data collection, the assessors practised using the test protocols to ensure that standardized procedures were employed.

Each evaluation session lasted for 2 weeks with a break of at least 24 h between each one. This 24-h interval was chosen in order to avoid the impact of fatigue resulting from two sessions being too close together [19, 20]. Only the data of the last three evaluation sessions (A2, B1, and B2) were analyzed, with the first session (A1) serving as a familiarization and therefore the results of this session were not considered in the analysis of the data.

No attempt was made to standardize the order in which, or the time of day when, the evaluations were completed. Measurements from the two assessors were recorded on separate data collection forms to ensure that they were blinded to each other’s results and their own previous results. Both assessors carried out two assessment sessions for each subject and applied each field test in random order. The randomization was performed with the R software by assigning a number to each test. During the assessment, the antagonist muscle was tested after the agonist muscle. Prior to the assessment of each test, the assessors explained and demonstrated the test procedure to the subject.

Each assessment session started with 10 min of cardiorespiratory warm-up on a cycloergometer at 65% of the target heart rate, as determined by the Karvonen formula [21], and subjects benefited from at least 5-min rest between each test.

Guidelines for Postures During the Tests

For this study, the objective was to reduce pain generated by the discomfort of the position tests during the evaluation. To do this, the postural position of some tests was modified (Fig. 1). These modifications and all instructions for all tests, including pretest cueing, starting position, subject instructions, and termination criteria, are presented in the Additional file 1: Appendix. Next, there followed a brief presentation of the used tests with their alterations or not: (1) trunk flexor muscles were evaluated with the test of Ito et al. [14]. Some modifications were carried out, including the position of the legs and the termination criteria; (2) trunk extension was evaluated using the test of Ito et al. [14] without modification; (3) right and left quadratus lumborum muscles were evaluated using a modification of the positions in McGill et al.’s tests [17]. The modification was the position of the legs; (4) the isometric test of quadriceps using the original Killy test described by Bernard et al. [22] was used; (5) for the hamstring muscles test, a derivative of the bridge exercise to the neutral spine alignment position described by Youdas et al. [23] was used. These modifications included the pelvis position and the feet position; (6) the back muscles test was an adaptation of the behind-the-neck lat pull-down described by Sperandei et al. [24]. This test required the same movement as the behind-the-neck lat pull-down but the subject sat on the floor, with their back, shoulders, elbows, hands, and head against the wall. In addition, this test was carried out without equipment and in an isometric way; (7) for the chest muscles test, a derivative of the push-up on the knees exercise described by Vossen et al. [25] was used. In our study, this test was performed in an isometric way, so the termination criteria were not the same.

Participants were individually instructed and supervised by experienced therapists specifically trained in the testing methodology. In each of these tests, the weight represented the load.

Part 2: Creation of Muscle Norms

Subjects

Four hundred healthy Caucasian participants segmented by age (50 males and 50 females in each age bracket, 20–29 years old; 30–39 years old; 40–49 years old; and 50–59 years old; a total of 200 males and 200 females) were included. The female group was aged 39.7 ± 11.5 years and the male group was aged 39.4 ± 11.4 years (Table 1).

Protocol

Prior to isometrics testing, participants performed 10 min of warm-up, including all articulations of the body. After that, all subjects performed the eight maximal voluntary isometric muscle endurance tests. Each test was carried out once per session. Tests were conducted in random order with at least 5-min rest between two tests. As for the first part, randomization was performed using the R software by assigning a number to each test. The antagonist muscles were systematically tested after the agonist muscles.

Prior to the performance of each test, the assessor explained and demonstrated the test to subjects using standardized instructions. For this part, the same researcher conducted all the tests.

Instructions for End of Tests Parts 1 and 2

For all endurance tests, in both parts 1 and 2, subjects were encouraged to hold the test position until exhaustion, and were given feedback if they deviated from the position. Tests were terminated when the subject could not maintain the position or if there were any obvious signs of fatigue (not maintaining the position in spite of verbal feedback, for example) or a significant emergence of pain or other symptoms. The maximum holding time was recorded in seconds using a stopwatch. The stopwatch was triggered when the subject was in the right position.

Statistical Analysis

Intra- and Inter-assessor Reproducibility

Intra- and inter-assessor reproducibility was estimated using intraclass correlation coefficients (ICCs). The intra-assessor reproducibility was quantified by calculating the ICC between the measurement conducted by the same assessor “B1” and “B2.” The inter-assessor reproducibility was measured by calculating the ICC between the measurement of assessor A (“A2”) and assessor B (“B1” and “B2”). For each ICC, error range and repeatability were calculated with standard error of the measurement (SEM), 95% confidence intervals (CIs), and 95% limits of agreement (LOAs). The standard error of the measurement (SEM) was calculated according to the formula SEM = SD √(1-ICC) [26], to provide an estimate of the precision of measurement, expressed in the units of the measure. The SEM was divided by the mean of the two measurements and multiplied by 100 to give a percentage value (SEM%) [27]. A percentage of 95% of LOA demonstrates the range of measurement error within the sample [28]. The interpretation of the ICCs was obtained according to the study by Shrout: reproducibility was considered strong if the ICC was between 1 and 0.81, moderate between 0.80 and 0.61, fair between 0.60 and 0.41, low between 0.40 and 0.11, and non-reproducible if less than 0.10 [29].

In order to determine the absolute reproducibility, the coefficient of variation (CV) was calculated. The CV is expressed as a percentage (CV%) and is calculated by dividing the standard deviation by the mean, multiplied by 100, for each test. The CV values of 10% [30] and 15% [31] have been used to consider the level of absolute reproducibility of the measurement (CV < 15% = good reproducibility; CV < 10% = excellent reproducibility).

The Breusch-Pagan test was used to verify the heteroscedasticity in the regression model. The heteroscedasticity of the model was confirmed when p value < 0.05 [32]. The performances in all isometric tests for each session of evaluation were expressed as mean ± standard deviation (mean ± SD). Statistical analysis was performed using the SPSS version 20.0 software and the Breusch-Pagan test was performed using the R software version 4.0.4.

Muscle Norms

A sample size of 34 participants per group was needed to detect a 10% difference (power of 0.9 and p < 0.05) according to the results of the studies by Claxton et al. (2009) [11] and Evans et al. (2007) [12]. However, the present data were part of a larger study in which more variables were assessed, which required a larger sample size. So, 400 subjects were included in this study.

The sensitivity of all tests was determined by a receiver operating characteristic (ROC) analysis, as a function of the age brackets 20–29 years and 50–59 years for male and female groups. The area under the ROC curve (AUC) is widely used to estimate the predictive accuracy of distributional models derived from presence/absence data [33]. The relationship between AUC and the sensitivity was considered excellent between 1 and 0.9, very good between 0.9 and 0.8, good between 0.8 and 0.7, average between 0.7 and 0.6, and poor between 0.6 and 0.5 [34].

The gender influence was evaluated by one-way ANOVA. To establish the norms for the eight muscle field tests, multiple regression analyses were conducted to determine the influence of the anthropometric data (age, weight, and height) on the performance of isometric tests. All values were expressed as mean ± standard deviation (mean ± SD). For the gender parameter, data normality was tested using the Shapiro–Wilk test. The significance level was set at the 0.05 level. The statistical analyses were performed using the R software version 3.5.0.

Results

Anthropometric characteristics of all subjects included in this study and the overall means of the last three assessments “A2,” “B1,” and “B2” for all the isometric tests are presented in Table 1.

Part 1: Reproducibility of Muscle Endurance Tests

Table 2 presents the results of the intra-assessor (“B1” and “B2”) reproducibility tests for all muscle measurements. The ICC values of the muscle endurance measurements vary between 0.946 and 0.989, thereby indicating excellent reproducibility. The bias values range from −3.9 to 1.0, staying close to zero and indicating a slight systematic improvement between measurements.

Table 2 Intra-assessor intraclass correlation coefficients

Tables 3 and 4 present the results of the inter-assessor (between assessments of “B1” and “A2” and between assessments of “B2” and “A2”) reproducibility tests for all muscle measurements. The ICC values indicate excellent inter-assessor reproducibility, with measurements varying between 0.915 and 0.996 for the measures of “B1” and “A2” assessments and between 0.955 and 0.996 for the measures of “B2” and “A2” assessments. For inter-assessor comparison, the bias values are negative (ranging from −5.6 to −0.8 for the measurement between “B1” and “A2” and from −4.9 to −0.6 for the measurement between “B2” and “A2”), indicating a slight systematic improvement between measurements.

Table 3 Inter-assessor intraclass correlation coefficients between assessments “B1” and “A2”
Table 4 Inter-assessor intraclass correlation coefficients between assessments “B2” and “A2”

Regarding the absolute reproducibility, the CV values vary between 3.6 and 11.8% (Tables 2, 3, 4). The intra-assessor (Table 2) absolute reproducibility is between good and excellent reproducibility for all tests (minimum measurement, 6.6% for the hamstring test; maximum measurement, 11.3% for the right quadratus lumborum test). The absolute reproducibility of inter-assessors (Tables 3 and 4) is also between good and excellent for all tests (minimum measurement, 3.6% for the hamstring test between “B2” and “A2;” maximum measurement, 11.8% for the chest muscle test between “B1” and “A2”).

With regard to absolute reproducibility, the values of SEM% demonstrate a good reproducibility of the accuracy and precision of the measured values. Indeed, the SEM values of intra- or inter-assessor measurements are less than 10% of the average measured value, so the measurement errors are small, and therefore the measurement is reliable [35].

The Breusch-Pagan p values for all tests are greater than 0.05 (the lowest value, BP = 3.6, df = 1, p value = 0.07 for the chest muscle tests between “B1” and “A2”); thus, the null hypotheses is accepted, i.e., the data are homoscedastic.

The intra-assessor and inter-assessor coefficient of variation (CV) was also used to measure the absolute reproducibility of agonist/antagonist ratios (Table 5). Muscle ratios were determined by dividing the holding time of the anterior chain muscles by the holding times of the posterior chain muscles for all muscle belts. For each muscle belt, the CV values vary between 5.8 and 12.9%, corresponding to between good to excellent reproducibility for all agonist/antagonist ratios.

Table 5 Intra-assessor and inter-assessor coefficient of variation for the muscle agonist/antagonist ratios

Part 2: Muscle Norms

The ROC curves are presented in Fig. 2. The AUC demonstrates a good sensibility according to age brackets with values greater than 0.7 for each test (Tables 6 and 7).

Fig. 2
figure 2

ROC curves for all tests as a function of the age brackets. (A) Male group; (B) female group

Table 6 Receiver operating characteristic (ROC) analysis for all tests as a function of the age brackets for male group
Table 7 Receiver operating characteristic (ROC) analysis for all tests as a function of the age brackets for female group

For the male and female groups, the average hold time for each test is presented in Table 1. Each test carried out was stopped following muscle fatigue by the participant themself or by the assessor when the participant showed obvious signs of fatigue (maintaining of the initial position was impossible). No test was stopped following an emergence of pain or other symptoms.

Prediction equations to estimate muscle performance according to anthropometric criteria are presented in Tables 8 and 9. The muscle ratios of each muscle belt present no significant difference between men and women and no difference according to age bracket (Table 10).

Table 8 Male performance prediction equations for all isometric tests
Table 9 Female performance prediction equations for all isometric tests
Table 10 Variation of agonist/antagonist ratios according to age brackets

Discussion

Our results have demonstrated the reproducibility of eight field tests to measure the maximal voluntary isometric muscle endurance with adapted postural positions and they have enabled the creation of predictive equations to calculate maximal theoretical voluntary isometric muscle endurance in a healthy population based on its anthropometric characteristics (age, weight, height, and gender).

The objective of the first phase of this study was to evaluate the relative and absolute reproducibility as well as the sensibility of muscle tests with modified postures for a healthy adult population, insofar as Ito et al. demonstrated that the original postures could induce pain or hyperlordosis [14]. Our results showed that all maximal voluntary isometric endurance tests present good levels of reproducibility. Indeed, for each test, the ICC was greater than 0.9, the CV less than 12% and the SEM less than 6%. The SEM of our tests was similar or lower to those found in the literature on other field tests [12, 36]. Our ICC values were higher than those obtained in other studies [12, 14, 17], but this is classic in studies using field tests to obtain an ICC > 0.81 [36, 12, 14]. The between-subject variability makes it difficult to compare the ICC between studies [26]. However, the higher ICC values in our study could be due to the fact that the population used for the reproducibility was heterogeneous, including both males and females of different ages. Nevertheless, the good to excellent absolute intra- and inter-assessor reproducibility measured by the coefficient of variation confirms our good level of reproducibility.

The sensitivity was measured by established and by analyzing for each test, whether the muscle performance is dependent on the age of the subjects. Our results showed that the observed values of the area under the ROC curves were greater than 0.7, demonstrating an influence of age on the muscle performance, which indicates young adults must have higher voluntary maximal isometric endurance than older adults. It is generally recognized that muscle endurance and strength decrease with aging [37, 38] and our study results on age-specific muscle maximal voluntary isometric endurance support the validity of our model.

The objective of the second phase of this study was to determine predictive equations of the muscle performance, taking into account the anthropometric criteria of the subjects. In the literature, the maximal voluntary isometric muscle endurance performance of field tests is often analyzed between gender and age, while height and weight are little discussed. Among all the anthropometric criteria, only the influence of age has been well established [11, 39] because of the well-known decline in muscle mass and muscle quality [40]. It is also accepted that an increase in weight has a negative influence on the performance of maximal voluntary isometric muscle endurance, but this variable is not discussed in the studies by Strand et al. and Latikka et al., which used isometric fields tests because the authors argued a weak relationship between weight and holding time during the isometric test [41, 39]. The influence of the other anthropometric criteria—gender and height—is still debated. Let us take the factor gender: one study demonstrated an absence of influence of gender on the muscle performance certain isometric tests [12], whereas other studies highlighted the effect of gender in favor of males [17, 41], or, conversely, in favor of females [42]. One hypothesis which could explain the sex-specific difference could be a difference in the muscle fiber-type distribution, muscle fiber cross-sectional area, capillary supply, oxidative and glycolytic capacity, citrate synthase, 3-hydroxyacyl-CoA-dehydrogenase, and lactate-dehydrogenase enzyme activities between men and women, but an absence of difference of these criteria between non-active adult men and women has been demonstrated [43]. With current knowledge, it is still impossible to generalize regarding the influence of gender on maximal voluntary isometric muscle endurance performance. With regard to the height factor, the literature demonstrates its influence on the holding time [39, 41], but the results of the studies are contradictory: either the holding time is better [39] or it is reduced [41] in maximal voluntary isometric muscle endurance measured with field tests. These differences could be explained by a different methodology, including athletic and non-athletic subjects [41], a group composed only of males with different age [39], or even a small number of subjects [12, 17]. In our study, the number of subjects used to analyze the influence of anthropometric parameters on the muscle performance was higher and more level-headed than the literature studies [12, 14, 17], and the profile used was the same for each group, namely, non-athletic Caucasian healthy adult subjects. Thus, our findings support that height, weight, and age are determining factors in predicting performance of maximal voluntary isometric endurance tests, unlike gender, which does not influence the performance.

Two limitations would be considered for the interpretation of these data. Our study was carried out in a French Caucasian population; thus, as explained by Hogrel et al. (2007) [44], our results could present a few variations depending on the race of the subjects (African, Asian) because of morphologic, anatomic, and cultural differences. In addition, the age bracket of the participants included in this study was from 20 to 59 years old. Thus, our predictive equations and the muscle agonist/antagonist ratios may be not be adapted for subjects who are younger or older than our participants.

Conclusion

Our study has permitted the validation of the reproducibility of field tests and the creation of predictive equations to calculate maximal voluntary isometric muscle endurance in a healthy population according to its anthropometric characteristics, namely, age, weight, height, and gender. The creation of these normative values responds to the need to be able to assess muscle function to identify weakness or muscle imbalance in order to establish personalized training programs in the fields of prevention, sport, or rehabilitation [45].

Practical Applications

These normative equations permit the determination of the weakness of a muscle, a muscle group, or a muscle imbalance according to the anthropometric data of the subject. The assessor simply has to complete the prediction equations with the anthropometric data of the subject (Tables 8 and 9) to obtain the theoretical value which would need to be reached at the target test. If the result is below the maximum theoretical value or if a muscle imbalance exists, physical activity management is required in the field of prevention and/or rehabilitation (risk of falling, difficulty climbing stairs, low back pain, etc.) or sport (muscle injuries risks). In addition, the predictive equations can be integrated into software or a mobile application to allow subjects to self-evaluate and benefit from an individualized program from a distance.