Background

Assessment and monitoring of body composition in children and adolescents have great significance when there is the need to: a) study the prevalence of pediatric obesity, b) improve gender screening of body composition, c) track body composition from healthy childhood to adulthood, d) to assess FM changes over time in a given population [1], e) evaluate sport potential of young people, f) monitor training process, and g) prior knowledge of their physical characteristics [2].

Anthropometric based equations continue to be adequate alternatives to determine the body composition of pediatric populations. However, body composition assessment in children is not easy to measure, since the relationship between body components during growth is not constant as in adults [2]. The progress in the study of the quantitative human two-compartment model (2-C) comprising FM and fat-free mass (FFM) to 3-C template (water, fat and residual mass) and 4-C with the estimation of other components in addition to FM, total-body water (TBW), minerals and protein [3], has provided new ways for approaching the traditional body composition, especially when it involves peripubertal people.

Several pediatric anthropometric equations were developed using a model of 2-C from the hydrostatic weighing [4, 5] and other densitometric techniques. However, this approach is based on assumptions of stable relationship for FFM density (1.1 g/cm3) and FFM hydration (73.2%). These values are stable in adults, but vary substantially during growth [6, 7]. In fact, from birth to adolescence bone mineral and protein increase whereas TBW decreases thus raising FFM density until reaching the adult value when the chemical maturity profile is reached [6].

Therefore, body composition models have a multicompartmental approach, as the reference method, which accounts for the variability of the main FFM components due to age and maturational changes, resulting in more valid equations [1]. Even using 4-C models, several anthropometric models have been developed to estimate one body component, usually FM for pediatric populations. However, it could be possible to estimate other components such as water, protein, and mineral. In addition to using DXA as the reference method to develop anthropometric models, lean-soft tissue (LST), FM, and bone mineral content (BMC) could also be determined by using a multivariate regression model. When conceived in a multivariate pattern, considering appropriately all important factors, the likelihood to create robust models is attainable, increasing the predictive capacity and reducing errors of estimation [8,9,10].

Currently, DXA is recognized to have a great potential to determine body composition in pediatric studies [7], due to its ability to provide more than one component (3-C approach). However, the exposition to ionizing radiation, the equipment cost along with the lack of feasibility for large-scale use limits its applicability in field settings (home, school environments, and sport clubs). Consequently, simple solutions to estimate body composition in children from anthropometric techniques including skin folds, bone breadth and circumferences have been widely used and are preferred in different contexts [1]. These alternatives are more convenient due to their low costs, they require low level of personnel training, they are minimally invasive and have good scientific credibility [11].

Although conventional anthropometric methods are scientifically accepted, they do not distinguish bone from fat or muscles, a desirable fractionation of the body composition, in youth sports setting or clinical health. With the exception of the anthropometric models developed in our laboratory to determine LST, FM and BMC of boys [10,11,12], no proposal for girls was found able to estimate these 3 components, by somatic body measurements.

Methods

The aim of this study was to develop and validate a multicomponent anthropometric model to simultaneously estimate LST, BMC and FM in girls, from densitometry as a criterion method. This descriptive study design was conducted with an intentional sample from elementary school and swimming sporting centers, a social project for local children and adolescents.

Study population

The study followed a cross-sectional design and a sample of 84 girls between seven and 17 years recruited from a sports center (n = 69) and elementary schools (n = 15). The girls were healthy with no medical condition, without body parts amputated, no use of drugs or not under medical treatment that could affect metabolism, appetite or growth. All participants presented maturation status below the 5 Tanner stage, and body mass index (BMI) below pediatric obesity [13], except 7 cases slightly above each age limit.

Study protocol

All evaluations were performed at the Imaging Center of the University Hospital, Ribeirao Preto Medical School/University of São Paulo, after an overnight fasting, in a single session; the same technicians performed all exams. Before the measurements, the subjects were asked to empty their bladders. A total-body DXA scan was performed according to manufacturer's guidelines. The anthropometric measures were taken according to the literature guidelines (15, 16), whose procedures are summarized below.

The dependent variables

Whole and regional body composition was estimated with a DXA Scanner Hologic® (Discovery CI/WI, software version 11.2, Bedford, MA). The Lean Soft Tissue (LST, kg), Bone Mineral Content (BMC, kg) and Fat Mass (FM, kg) were determined (3-C) and considered as dependent variables.

The independent variables

The subject’s body mass and height were measured with a digital scale (Welmy®, W 300 A, Santa Barbara d’Oeste, SP) and a hall fixed stadiometer (Physical Therapy Stadiometer - Terrazul®, Sao Paulo, SP), respectively. The skinfolds (n = 10), circumference (n = 10) and widths (n = 9), were measured by conventional procedures in the literature [14, 15] using Prime® equipment. Chronological age groups in decimal value were adjusted to the nearest integer [16]. To ensure the reliability of the results used in the models, intra evaluator technical error of measurement, absolute (TEM) and relative (TEM%), were determined in subsequent days. Thirteen subjects were evaluated in duplicates for all variables used in this study, when the results were always within the expected tolerance limits [14] as shown below in Table 1.

Table 1 Anthropometrics and DXA body composition measures in girls (n = 84), including absolute and relative TEM.

Their maturity offset was determined by gender-specific regression to predict the years for Peak Height Velocity (PHV) according to the following equation [17] for girls:

$$ {\displaystyle \begin{array}{c}\mathrm{PHV}=\hbox{-} 9.376+0.0001882\ \left(\mathrm{Lh}\;\mathrm{x}\;\mathrm{Sh}\right)+0.0022\ \left(\mathrm{A}\;\mathrm{x}\;\mathrm{Lh}\right)+0.005841\ \left(\mathrm{A}\;\mathrm{x}\;\mathrm{Sh}\right)\hbox{-} 0.002658\ \left(\mathrm{A}\;\mathrm{x}\;\mathrm{Wt}\right)+0.07693\\ {}\left(\left(\mathrm{Wt}/\mathrm{Ht}\right)\ \mathrm{x}\ 100\right)\end{array}} $$

Where: Lh stands for legs height (cm), Sh stands for seating height (cm), A stands for age (years), Wt stands for body mass (kg), and Ht stands for height (cm).

Statistical analysis

Interaction terms for ethnicity (Brazilians are highly intermingled) by anthropometric variables and sports activities (initial swimming practitioners) by anthropometric variables were tested and a nonsignificant differences was found (p = 0.238), allowing the use of the entire sample for subsequent analysis.

A sampling design was planned to set the desired maximum error (ε) with some degree of confidence (Z y ) from the prior knowledge of the population variability (σ 2) [18]. In this case, a multiethnic classic study values [19] describing girls body composition of different populations was used as reference values. The highest variability was observed in FM estimated by DXA, for all that ages (6-18 years), similarly to the sample ages of this study (7-17 years). From the estimation of the predetermined error (between 2.25 and 2.0%) and confidence interval (0.90 to 0.95), the ideal n for our study was defined by the equation [18]:

$$ n={\left[\frac{Z_yS}{\varepsilon}\right]}^2 $$

The minimum sample values (n = 74), setting for ε = 2.25 and γ = 0.95 and the estimated value of FM to σ = 9.84 Kg was overcome by our sample (n = 84), from the standard deviation (s) obtained of that (Hispanic group) reference sample [19].

Multicompartmental anthropometric equation development

As earlier procedures [11] multivariate regression model ( n Y m = n X (r + 1) (r + 1) β m + n ε m ) by diagonal mutual analysis, parameter estimation and the least squares errors method was used [8] by R-Free Software. Previously, a criteria for selection and reduction of independent variables was conducted from the following steps: a) principal component analysis and model adequacy (Kaiser-Meyer-Olkin) and Sphericity test (Bartlett); b) univariate regression (stepwise) to determine all common independent variables for each dependent variable (LST, BMC and FM), with significance less than 5%; c) multivariate analysis to estimate the parameters and Pillai approximation method for displaying possible variables exclusions; e) testing of remaining model (enter - univariate method), with estimated values of VIF (<10.0) and multicollinearity (L <1000) maximum permitted; f) adjustments by Pillai approach to test the F values; g) set the number of independent variables, with high coefficients of determination (r ≈ 0.90). Then β parameters (multivariate) were determined, with the proposition of equations and residual distribution for each dependent variable. A similar procedure was earlier conducted in boys [10].

The PRESS statistic was used to measure the efficiency of a predictive equation, when applied to independent samples [20], that is, when used to estimate within the same sample. The process may be understood as design efficiency in estimating the actual parameters by a virtual simulation. In the case of this study, the error was determined by the result of observed Y - Y ' (estimated). For validation: a) the correlation coefficients were estimated between predicted and measured values, and b) cross-validation by PRESS method and coefficients of determination (Q2 PRESS) and error (SPRESS) was conducted for each dependent variable, according to the Johnson and Wichern [8] recommendations.

Other statistical procedures included descriptive analysis with calculation of mean and standard deviation and confidence interval (95%) and coefficient of variation, used to express milesimal age, DXA and anthropometric dimensions of girl’s body components.

Results

Table 1 provides anthropometric and body composition measures, including range (minimum – maximum), the confidence interval (95% CI), and absolute and relative TEM of the measures expressing the main characteristics of the study sample.

The Kaiser-Meyer-Olkin test (0.882) was used as a measure of adequacy of the sample (X 2 = 2961.11) with p < 0.001, indicating suitability of the method in the treatment of the data. The commonalities test ranged from 0.689 to 0.927 for the 22 variables entered into the analysis. The matrix indicated coefficients of correlation from weak (r = 0.20) to very high (r = 0.95) between the 21 independent variables (size measures, skinfolds, circumferences, bone breadths) and the dependent variables (data not shown). To reduce variables of the model, univariate regression was performed (stepwise) in order to select the common dependent variable to the three components (LST, BMC and FM) since common selection occurred at least twice, maintaining the highest possible coefficients.

The high correlation between the independent variables shows the existence of multicollinearity, which brings undesirable consequences for the inferential analysis (24). From the univariate regression (stepwise), the number of remaining variables to LST (n = 7), BMC (n = 5) and FM (n = 4) showed high r2 adj (0.90 to 0.94), while multicollinearity varied from low (L = 55.0) to moderate (L = 275.0) for the independent common variables for the three dependents variables (Table 2), a multivariable analysis.

Table 2 Univariate regression and multicollinearity for selecting common independent variables at least twice (bold)

From the multivariate regression model, statistical significance (Pillai) of the F values (for < 2.2e-16 implies the notation < 2.2 x 10-16) was found for four predictor anthropometric variables.

The precision of the model was initially tested in univariate analysis (Enter method) while maintaining low multicollinearity and high correlation coefficients between measured and predicted values (Table 3). Multicollinearity was low, ranging from tolerance for eigenvalues (L<100), VIF (<10.0), and the determination of simple coefficients (r2) and adjusted (r2 adj).

Table 3 Parameters, precision and validity of multicomponent anthropometric model, to predict body composition in peripubertal girls

The graphic expression of dispersion and the correlation between predicted and measured values is shown in Fig. 1.

Fig. 1
figure 1

Dispersion between measured and predicted values of lean soft tissue (LST), bone mineral content (BMC) and fat mass (FM) of girls

The results of the residuals sum of squares (PRESS), the coefficient of determination PRESS (Q2 PRESS) and the standard deviation of PRESS (SPRESS) obtained for each dependent variable, were also expressed (Table 3). These results suggested that the multivariate prediction model is valid for LST, BMC and FM since it indicates consistency with the conditions defined in the method for Q2 PRESS be close to "1" and SPRESS close to "0". Thus, the final model for each dependent variable including Residual Tissue (RT) as the difference for the BW, can be stated as:

$$ {\displaystyle \begin{array}{c}\mathrm{LST}=0.6662657\ \mathrm{BW}\hbox{-} 0.2157279\ \mathrm{SiSk}\hbox{-} 0.2069373\ \mathrm{HaSk}+0.3411678\ \mathrm{CaCi}\hbox{-} 1.8504187\\ {}\mathrm{BMC}=0.0222185\ \mathrm{BW}\hbox{-} 0.1001097\ \mathrm{SiSk}\hbox{-} 0.0064539\ \mathrm{HaSk}\hbox{-} 0.0084785\ \mathrm{CaCi}+0.3733974\\ {}\mathrm{FM}=0.3645630\ \mathrm{BW}+0.1000325\ \mathrm{SiSk}\hbox{-} 0.2888978\ \mathrm{HaSk}\hbox{-} 0.4752146\ \mathrm{CaCi}+2.8461916\\ {}\mathrm{RT}=\mathrm{BW}\hbox{-} \left(\mathrm{LST}+\mathrm{BMC}+\mathrm{FM}\right)\end{array}} $$

Where: LST = lean soft tissue; BMC = bone mineral content; FM = fat mass; RT = residual tissue; BW = body weight (kg); SiSk = supra-iliac skinfold (mm); HaSk = horizontal abdominal skinfold (mm); CaCi = contracted arm circumference (cm).

Discussion

As novel finding in this study, anthropometric models were developed and validated to determine FM, LST and BMC in a female pediatric sample, using DXA as the reference method. Adult models are not expected to describe or estimate body composition in children and adolescents [21]. Limitations in current available techniques to evaluate body composition hampers, in several circumstances, the measurement of the main body compartments in the molecular level [7]. In our study, DXA returned absolute measures of LST, BMC and FM in a 3-C approach, which represents estimation advantages by minimizing error in a single scan. However, DXA is almost restricted to the medical environment. Consequently, anthropometry is an alternative method since the anthropometric model was valid to estimate body composition using DXA as reference. Greater predictive accuracy was obtained through four anthropometric variables to simultaneously predict LST, BMC and FM.

DXA has great potential in pediatric studies, low level of ionizing radiation exposure, convenience and speed in the estimation of multicomponent body, which allows better interpretation of BC during growth and appropriate association with anthropometric estimates. Since the 80s, predictive equations of FM and FFM by anthropometry and BIA, using DXA (or other 3-C models) and 4-C as criterion methods [22], have been used, although none try to predict more than one component, otherwise only FM or FFM.

The known pediatric equations of Slaughter and colleagues used skinfolds models derived from a 4-C model [23]. However, lower coefficients of determination (r2 = 0.88) and greater standard errors of estimation (SEE = 3.80%) were found for FM determination, when compared to our results, (r2 > .94; SEE < 2.25%; SPRESS = 0.01 to 0.30) and no cross-validation was provided.

In ages younger than 10 years, the dispersion of body tissues tends to be lower than in subsequent ages [19]. This implies that the pre pubertal growth and maturation are quite compatible. At this age, the change in hormonal concentration that occurs during sexual maturation has not yet started [12]. The density of FFM is influenced in large part by bone mineral because the density of bone is markedly higher than that of other components of the FFM. During this special period of life there is an unstable relationship between BMC and muscle mass in comparison to adulthood [24]. In addition, the increase of other body components during growth [25] is markedly higher in FM until about 13 years, registering increases from childhood to adulthood of 17% in girls and 15% in boys. When presenting exponential reduction, no significant changes are found for the three components after age 20 in girls and age 21-22 in boys [26].

Besides extending to girls with high body mass, this new methodology overcome traditional limitations of bi-compartmental models by offering an alternative multivariate analysis, a new way for a comprehensive approach [7, 21, 27, 28]. The observed high coefficients values and low errors estimation (Table 3) reflected a usual good validity when using our laboratory proposed equations [2]. Several factors can affect the accuracy of a predictive equation as: the dependent variable validity; accuracy of predictive variables; statistical intra and inter-biological relationship between independent and dependent variables; appropriated use of statistical methods to formulate the equation; and adequate sample size [29]. The necessary conditions for the proposition of a predictive model must meet statistical requirements for a safe and reliable model/equation.

Assumptions for good predictive models of body composition recommend that models should be proposed for a wider clinical application [30], make valid assumptions in relations between the components [28], i.e., if relations between FFM, FM and TBW are constant for all ages. Some equations proposed are more well-founded descriptions and specific to predict only one component [31], or restricted age range [32]. When it involves wider range age (5-19 years), adopt sophisticated laboratory techniques and expensive procedures [33]. Furthermore, the inclusion of independent variables with strong statistical and biological relationship with the variable of interest, certainly increases its predictive power [28]. The internal validation method PRESS [34], as in our equations, confirms the effectiveness of the model in predicting the body components with high internal validity, high coefficients of determination (Q2 PRESS = 0.81 a 0.93) and low prediction errors (SPRESS = 0.01 a 0.30).

The anthropometric measurements are obtained easily and at a low cost while preserving the accuracy and low error to be considered as an alternative to DXA [35]. We recognize as limitation of this study that DXA is not a gold standard for pediatric body composition models. But conversely, the four compartments (4-C models) is the current state of the art method since it considers the variability of the main FFM components [36]. However, it is also subject to propagation of measurement errors, is time consuming, and requires sophisticated and highly specialized technical equipment. Moreover, it is restricted to research facilities and has high costs, making it difficult to use in large samples [37]. Nevertheless, DXA has been used by various researchers to develop pediatric equations [19, 38,39,40,41], therefore considered as a valid and reliable method [42]. Other limitation consider that the accurate measurement of LST, especially in athletes, requires strict observation of hydration, which may affect DXA assessment [43]; as well the menstrual cycle of girls [44], not controlled in this study. However, the girls in the study had a low level of maturation (PHV = -3.16) and were not athletes, minimizing possible errors of analysis.

Additionally, nutritional data, social or economic status, hereditary or level of sports training were not controlled in this study and may interfere with the female body composition. The small sample size may compromise the accuracy of the models since the sample may not have a well balance representation of age and adiposity levels. In addition, ethnic differences in this sample, although comprising a highly mixed-race, may also limit the generalization of these equations for other populations as the number of females in each ethnic group was not well balanced [2]. Although racial and sports practice showed no interactions terms with anthropometric variables in explaining the reference variables, research is required to test the accuracy of the proposed model for other ethnic groups. Finally, for longitudinal body composition assessment, the accuracy of these models requires further validation, or estimates for age groups.

Conclusions

The goal of this study to develop and validate an anthropometric predictive model to simultaneously estimate LST, BMC and FM was achieved. These models provide a practical, low cost, and reliable tool in assessing body composition of a specific female pediatric population. These models can be used to determine and monitor body growth in peripubertal girls, either to assess health indicators (i.e., overweight control, obesity and the associated risks), or improve sports performance (i.e., body adequacy and physical preparation for sports). However it should be tested and validated for other ethnic groups.

The internal validation method PRESS [34] confirms the effectiveness of the model in predicting the body components with high internal validity, with high coefficients of determination and low prediction errors. These models may be a valid alternative to estimate body composition in girls and allow application in clinical environment or field settings.