Participants
After gaining university ethical approval, healthy Caucasian males were recruited by word of mouth and poster advertisements. Eighteen participants gave informed consent to participate in the study (age 29.0 ± 5.6 years, height 1.81 ± 0.07 m, and body mass 79.8 ± 10.9 kg). The mean height and body mass of the model-development and cross-validation groups for each muscle group are reported in Tables 2, 3, 4, and 5. All participants were healthy, free from injury for a minimum of 3 months prior to the study and were participating in recreational exercise for a minimum of 30 min per week three times per week. All assessments were conducted on the preferred limb which was defined as that used to balance on one leg or stop oneself from falling when pushed from behind. Fourteen participants preferred their right leg and four preferred their left.
Anthropometric measurements
For all participants, anthropometric measurements were taken before MRI and ultrasound imaging. After measuring height (metres, m) and body mass (kilograms, kg), participants lay supine on a treatment plinth and length measurements (m) of the thigh (from the greater trochanter to the lateral epicondyle of the femur) and shank (from the lateral epicondyle of the femur to the lateral malleolus) were acquired using anthropometric callipers (seca 207, Seca GmbH, Germany). The distance from the greater trochanter to 25%, 50%, and 75% of thigh length, and from the lateral epicondyle of the femur to 25% of shank length was marked on the skin as a horizontal line with permanent marker pen on the anterior, posterior, medial, and lateral aspects of the limb to assist both anthropometric and ultrasound measurement acquisition. Limb circumferences (cm) were measured at each of these four sites with a tape measure and an additional circumference of the hips was measured at the level of the greater trochanter and multiplied by 0.5 to represent half of the hip circumference.
MRI data acquisition
Unilateral axial spin-echo T1-weighted MRI images were acquired from the twelfth thoracic vertebrae (or the level corresponding to the origin of psoas major, verified by an experienced radiographer) to the base of the foot of the preferred leg using 3-Tesla MRI (Discovery MR750w, GE Healthcare, General Electric, Boston, MA, USA). Images were acquired in four or five scanning blocks of 34–86 slices depending on participant height. Block overlap was identified using fish oil capsule references which were visible on the images used in the analysis. A single fish oil capsule reference was also placed at the 25%, 50%, and 75% of thigh length and 25% of shank length sites to enable these to be identified in the analysis. Slice thickness was 5 mm for all participants; inter-slice distance was 0 mm for all participants apart from one where the inter-slice distance was 5 mm. The in-plane resolution for all images was 0.47 mm × 0.47 mm. Echo time (TE = 7.546–16.940 ms) and repetition time (TR = 533–845 ms) varied between scanning blocks for the first four participants and remained constant between scanning blocks for the remaining fourteen participants. The first, most proximal, image block was an exception to this and the optimal TE and TR were chosen by the radiographer to minimise movement artefact in the abdominal region due to breathing. Field of view (144 mm × 144 mm to 450 mm × 450 mm) and flip angle (90º–111º) were varied by the radiographer to obtain the best quality image for each participant in each scanning block. Total scanning time was approximately 50 min per participant including breaks between scanning blocks.
Ultrasound imaging data acquisition
Panoramic B-mode ultrasound images were acquired by a single investigator using a Logiq E9 ultrasound scanner (GE Healthcare, General Electric, Boston, MA, USA) with a 44 mm, 2–8 MHz 9L linear-array transducer, coated in water-soluble transmission gel which enabled acoustic contact without depression of the skin and superficial adipose tissue. Participants lay on a treatment plinth at rest, while images were acquired at the mid-hip, and at 25%, 50%, and 75% of thigh length, and 25% of shank length.
The mid-hip site was identified in side-lying, on the contralateral side to that being imaged, with the hip and knee of the bottom leg flexed, so that a standardised, comfortable, and relaxed position could be maintained. The vertical distance between the iliac crest and the greater trochanter was measured with a measuring tape and the mid-way point was marked with pen. A perpendicular line was drawn, along which a panoramic image was acquired, starting at the anterior superior iliac spine and moving posteriorly towards the sacrum.
Images of the anterior aspect of the thigh were acquired with participants sat in a recumbent position on the treatment plinth with a foam roller positioned under their knees to ensure consistency between measurements, in ~ 10º knee flexion. The transducer was positioned on the lateral side of the leg, perpendicular to the skin in the transverse plane, and moved medially as far as possible without losing contact with the skin or changing the angle of the probe against the skin. This process was repeated for all the anterior sites and in participants with large thighs two images were necessary to ensure that all muscles were acquired. After acquiring images from all the anterior sites, participants lay prone with their toes hanging over the edge of the plinth and images were acquired from the posterior aspect of the thigh and shank sites. Images were reviewed on the ultrasound machine visual display after collection and repeated when required to ensure that muscle CSA could be clearly identified in a single image.
Data processing—image analysis
All MRI and ultrasound images were analysed in OsiriX Lite (v.8.0.1, Pixmeo, Geneva, Switzerland) open source software by a single experienced investigator. The brightness and zoom tools were used to improve tissue contrast and enable muscle boundaries to be better identified. The boundaries of the individual hip extensor (gluteus maximus and medius, biceps femoris long head, semimembranosus, semitendinosus, and adductor magnus) knee extensor (rectus femoris, vastus intermedialis, lateralis, and medialis) and flexor (semimembranosus, semitendinosus, biceps femoris long and short heads, medial and lateral gastrocnemius, popliteus, sartorius, and gracilis), and ankle plantarflexor muscles (medial and lateral gastrocnemius, soleus, peroneals, tibialis posterior, flexor digitorum longus, and flexor hallucis longus) were manually outlined at intervals of 15 mm on the MRI images using a graphics tablet (XP-Pen, XPPEN Technology CO, Fullerton, CA, USA). For the data set with 5 mm spacing, larger muscles such as the quadriceps, hamstrings, and gluteus maximus were analysed with an inter-slice distance of 20 mm and smaller muscles were analysed with an inter-slice distance of 10 mm, as a preliminary analysis in three participants suggested this maintained measurement accuracy and reduced the analysis time compared to using an inter-slice distance of 5 mm. The resulting CSA measurements (in centimetres squared, cm2) were used in the calculation of MV.
For ultrasound images, CSA of the individual quadriceps (rectus femoris, vastus medialis, lateralis, and intermedialis), hamstrings (biceps femoris long and short head, semitendinosus, and semimembranosus), and gastrocnemii (lateral and medial gastrocnemius, and soleus) were outlined on the respective images. Ultrasound images acquired at the hip were not used to measure CSA. Superficial muscles were chosen as they provide the best potential for generalisation of the final model, as they are more likely to be consistently identifiable in the wider population. The sum of the quadriceps and hamstrings CSA at 50% and 75% of thigh length was calculated for inclusion in the regression analyses.
Muscle thickness (MT) measurements were taken by drawing two parallel lines perpendicular to a straight line on the superficial aponeurosis of each muscle, extending to the deep aponeurosis (Fig. 1). The mean value of the length of the two perpendicular lines was taken as MT (in centimetres, cm). As considerable training and experience are necessary to acquire valid ultrasound images of the hip extensors, limiting the potential for practical application, the only ultrasound measurement acquired at the hip was a single MT measurement (including gluteus medius and minimus) to facilitate the potential for clinical application of the findings (Fig. 1a). Single measurements of MT at 25% and 50% of the anterior (including rectus femoris and vastus intermedialis) and lateral thigh (including vastus lateralis and vastus intermedialis), and at 25% of the medial (including medial gastrocnemius and soleus) and lateral (including lateral gastrocnemius and soleus) triceps surae were also acquired for inclusion in the regression analyses to potentially assist in reducing image acquisition time. A single MT measurement to represent the hamstrings was not acquired as this is a superficial muscle group and the underlying adductor magnus does not contribute to knee flexion. To evaluate the application of an MT measurement representing all the individual quadriceps and hamstrings muscles in the respective knee extensor and flexor groups, the sum of the individual MT measurements at 50% and 75% of thigh length was also calculated. The reliability of ultrasound and MRI CSA and MT measurements was evaluated between sessions in two randomly selected participants with a minimum of 7 days between sessions.
The criterion MV was calculated using MRI CSA of the individual hip extensor, knee extensor and flexor, and ankle plantarflexor muscles. The volume of individual muscles (in centimetres cubed, cm3) was calculated using the formula:
$$\mathop \sum \limits_{i = 1 \ldots n - 1}^{n} \left( {\frac{{{\text{CSA}}_{i} \; + \; {\text{CSA}}_{i + 1} }}{2}} \right) \; \times \; h,$$
where CSAi is CSA at slice i, CSAi+1 is CSA at slice i +1, h = distance between slices, and n = total number of slices in the muscle. Individual MV was summed to calculate group MV (i.e., hip extensors, knee extensors and flexors, and ankle plantarflexors).
Statistical analysis
All computational analyses were completed in Microsoft Excel 2010 (Microsoft Inc., Redmond, WA, USA). Measurement reliability of ultrasound CSA and MT measurements was calculated as the typical error of measurement (TEM) between sessions:
$${\text{TEM}}\; = \;\frac{{{\text{SD}}_{\text{differences}} }}{\sqrt 2 },$$
where SDdifferences is the muscle-specific standard deviation of the differences between sessions. TEM are presented as absolute values and as a percentage of the mean value of the first measurement.
All statistical analyses were conducted in IBM SPSS Statistics 23 (IBM Corporation, Armonk, NY, USA) and the alpha level for statistical significance was set to p < 0.05. Kolmogorov–Smirnov tests of normality showed a normal distribution for MV of the ankle plantarflexors. The knee extensors (p = 0.009) and knee flexors (p = 0.041) were not normally distributed due to two outlying participants who were not included in the regression analyses for these muscle groups, to ensure integrity in the final models developed. The hip extensors (p = 0.011) were not normally distributed due to one outlying participant who was not included in the regression analyses for this muscle group. All anthropometric variables were normally distributed. All ultrasound CSA measurements were normally distributed, and all MT measurements were normally distributed with the exception of semitendinosus at 75% of thigh length (p = 0.040) and as this was a small measurement, and semitendinosus was also represented at 25% and 50% of thigh length, it was excluded from the regression analysis.
Regression analyses were conducted for estimating the volume of each muscle group on a level-wise (using all measurements at 25%, 50%, or 75% of limb length) and muscle-wise (using all measurements for individual muscles) basis using all ultrasound and anthropometric measurements. Regression analyses were conducted separately for CSA and MT measurements to prevent collinearity of independent variables. To identify the most appropriate independent variables to include in the regression analysis, semi-partial correlations between all potential independent variables and the dependent variable (the respective MV) were conducted using the data of all participants, excluding outliers. The independent variable with the strongest semi-partial correlation with MV was chosen for the regression analysis to allow an approximate ratio of one independent variable for every 15 cases, as recommended by Field (2013). If two independent variables had a similar strength semi-partial correlation, the Pearson’s product moment correlation between these independent variables was calculated to evaluate collinearity and if this was present, the independent variable with the weakest semi-partial correlation was excluded from further analyses. If collinearity was not present, both independent variables were included in the analysis.
For the regression analysis, the sample was randomly divided into sub-samples of 20% (n = 3 for hip extensors and knee extensors and flexors; n = 4 for ankle plantarflexors) and 80% (n = 14 for hip extensors and ankle plantarflexors; n = 13 for knee extensors and flexors) as recommended by Field (2013). A k-fold leave-one-out cross-validation was conducted using forced entry regression to identify appropriate independent variables for the final models. The suitability of the models developed was evaluated based on the following diagnostic criteria: (1) to evaluate the generalisation of the model; the 95% confidence intervals for the beta coefficients did not cross zero, no more than one participant had a standardised residual greater than ± 2.00, and the R2 and adjusted R2 values were similar; (2) to ensure that all independent variables significantly contributed to the model; the p values of all beta coefficients were less than 0.05; (3) to ensure the absence of collinearity, the independent variables were not strongly correlated (i.e., > 0.80); (4) to evaluate the assumption of additivity; the semi-partial correlations between all independent variables and the dependent variable were similar; (5) to evaluate the assumption of homoscedasticity; there was no positive or negative trend in the plot of standardised residuals against standardised predicted values; (6) to test the assumptions of independent and normally distributed errors, the Durbin–Watson statistic was above the upper limits identified by Durbin and Watson (1951) and the plot of residuals was normally distributed; and (7) the ease and speed of acquisition and measurement reliability were considered when selecting the final model in the context of clinical application.
Independent variables were identified as appropriate when they were present in all k-fold analyses for each respective muscle group. The identified independent variables were included in a forced entry regression analysis to identify the final model using the data of all participants in the 80% sub-sample. The equation was cross-validated in the 20% sub-sample and the final model was evaluated using the same diagnostic criteria described above. When the final model included more than one variable, its performance in simple and multiple regression analyses was evaluated, and if both showed good potential for generalisation, the bivariate correlation between the residuals of each model was calculated to identify the statistical difference between models and assist in selecting the final model. When a strong correlation was observed between residuals of multiple and simple regression models, the final model selected was that with the highest R2 value. For the final models, the systematic difference was examined by plotting the residuals (MRI-derived MV minus model-derived MV) against the mean value of the estimated and actual MV for all participants (Bland and Altman 1986).