Introduction

In the last four decades, an increase in metabolic syndrome (MetS) rate was reported.1,2,3 In 2007, the International Diabetes Federation (IDF) developed criteria for defining MetS in children and adolescents.4 The criteria were inspired by the IDF guidelines for adults and other studies on the prevalence of MetS in children and adolescents.5,6,7,8,9,10 The IDF criteria require the presence of abdominal obesity plus the presence of at least two other components, (elevated triglycerides, low high-density lipoprotein (HDL) cholesterol, high blood pressure, and elevated plasma glucose).4 MetS in childhood is associated with an increased risk of developing cardiovascular disease in adulthood, along with other chronic diseases and even early death. MetS in childhood have odds ratios of 6.2 for tracking into adulthood, 2.3 to 11.5 for type 2 diabetes mellitus, 2.0 for elevated carotid artery thickness, and 14.6 for cardiovascular disease in adulthood.11,12

MetS is associated with increased healthcare burdens and costs.13,14,15 Therefore, early screening and primary prevention are essential. Usually, MetS is diagnosed at a community clinic visit by a professional staff. However, less than 50% of pediatricians and only 22% of all primary care physicians in the community measure anthropometric variables in children and youth.16,17 Moreover, the overall rate of lipid screening in community clinics among children and adolescents is low.18,19 and lipids are only measured in 27% of children with obesity in community clinics.20 Recently, a new easy-to-use MetS prediction model for self-evaluation among an apparently MetS-free adult population was introduced.21

Even though the importance of identifying MetS in youth to prevent adulthood chronic diseases is well published, to date, no easy-to-use self-evaluation screening tool is available. Therefore, to enable an easy-to-use Mets screening tool, this study aimed to develop and validate a simple model based on age, sex, race, and weight-for-age or BMI-for-age to predict the presence of MetS in apparently MetS-free individuals, aged 12–18 years.

Methods

Study design

The study is a cross-sectional study based on the U.S. National Survey of Health and Nutrition (NHANES) database. The NHANES is a continuous survey of the National Center for Health Statistics, which is part of the Centers for Disease Control and Prevention (CDC) with data released every two years.22 The NHANES assesses the health and nutrition status of children, adolescents, and adults in the U.S. population. The database is available online and used for a wide range of epidemiological studies.7,10,21,23,24 The NHANES samples are selected by a complex multidisciplinary probabilistic sample, representing the general U.S. population of all ages.

All participants aged 12–18 years from the 2005–2018 NHANES were included in the study. Participants who reported having major morbidities associated with MetS (i.e., diabetes, hypertension, lipid disorders) were excluded since the prediction model aims to screen for MetS in an apparently MetS-free population. In order to prevent a classification bias, participants with one or more missing criteria from the MetS definition were also excluded. Selection bias was evaluated by comparing those who were excluded according to these criteria with those who were included. All participants were randomly divided into training (70%) and validation (30%) groups. The training group was used to build the models and the validation group was used to verify the final model. The study was approved by the local Institutional Review Board (Approval number 5269-18-SMC).

Variables

Several variables, as defined in the NHANES, were extracted to evaluate the association with metabolic syndrome. These variables can be reported by the individual or a family member without visiting a clinic and most of them have been studied previously7,23,24: age, sex, race (Caucasian/Hispanic/Black/multiracial), height, and weight. BMI was calculated as the individual’s weight in kilograms divided by the square of height in meters. Weight- and BMI-for-age standard deviation scores (SDS) as well as percentiles were calculated using the CDC growth charts.25

In addition, waist circumference, blood pressure measurements, and fasting blood tests (triglycerides, high-density lipoprotein (HDL) cholesterol, glucose), were extracted to evaluate the presence of MetS. Waist circumference (WC) Z-scores were calculated according to NHANES III.26 At the NHANES, three blood pressure measurements were taken for each participant. The mean of the three measurements was calculated for each participant and used to determine the criterion. The IDF criteria for MetS were used in the current study.4 According to these criteria, MetS diagnosis requires the presence of abdominal obesity plus the presence of at least two other components (elevated triglycerides, low HDL cholesterol, high blood pressure, and elevated plasma glucose). Abdominal obesity was defined as WC≥90th percentile or adult cut-off if lower for ages <16 years or WC ≥ 94 cm for boys and ≥80 cm for girls for age 16+ years. Elevated triglycerides level was defined as triglycerides above 150 mg/dL (1.7 mmol/L). Low HDL cholesterol was defined as below 40 mg/dL (1.03 mmol/L) for ages <16 years and below 40 mg/dL (1.03 mmol/L) in boys and below 50 mg/dL (1.29 mmol/L) in girls for age 16+ years. High blood pressure was defined as systolic blood pressure equal to or greater than 130 mmHg or diastolic blood pressure equal to or greater than 85 mmHg. Elevated fasting plasma glucose was defined as greater than 100 mg/dL (5.6 mmol/L).

Statistical analysis

The minimal sample size was calculated using a power of 80% and a significance level of 5%. Previous data showed that approximately 1 out of 20 individuals in the USA have MetS.12 Therefore, 1414 individuals were needed to identify variables with a low-medium association with MetS (effect size d = 0.35).

Categorical variables were summarized as frequency and percentage. The distribution of continuous variables was evaluated using a histogram and Q-Q plot and reported as mean and standard deviation (SD) or as median and interquartile range (IQR). The absolute standardized difference was used to evaluate the difference between the training and validation groups and between those who were included and excluded from the study. The standardized difference is a measure of the magnitude of the difference between two groups in standardized terms. It helps to compare the distribution of baseline covariates between two groups in observational studies. For continuous variables, the standardized difference is the difference in the mean of a variable between two groups divided by the standard deviation of the variable. It can also be used for comparing the prevalence of categorical variables between two groups. An absolute standardized difference of up to 10% (or 0.1) is considered a negligible difference between groups.27 The Chi-square test was used to compare categorical variables between the training and validation groups and the independent samples t-test and Mann-Whitney test were applied to compare continuous variables.

The NHANES has a complex probability sample and as such the statistical analysis requires incorporating sample design elements. Therefore, proper weighting was applied for both the univariate and multivariable analyzes.28 Univariate and multivariable logistic regression was used to evaluate the association between metabolic syndrome and each studied risk factor (age, sex, race, weight-for-age, and BMI-for-age). The regression models were used to estimate the probability of MetS in each participant in the training and validation groups and the area under the receiver operating characteristic (ROC) curve (AUC) was used to assess the discrimination ability of the model. The DeLong test was applied to compare the area under the receiver operating characteristic curves.29 To identify the optimal cut-off value, Youden’s J statistic was calculated for each potential cut-off value. The cut-off was set according to the maximal Youden’s J statistic. Sensitivities and specificities for cut-off values of 85th, 90th, and 95th percentiles as well as 1, 1.5, and 2 SDS were also reported. The positive likelihood ratio and negative likelihood ratio were presented for each cut-off value. Positive and negative predictive values were calculated based on previous reports of MetS prevalence.12 All statistical tests were two-sided. Statistical significance was defined as p < 0.05. SPSS software (IBM SPSS Statistics for Windows, version 28, Armonk, NY, 2021) and R: A Language and Environments for Statistical Computing (R Core Team, R Foundation for Statistical Computing, Vienna, Austria, 2023) were used for all statistical analyzes. The childsds package (Vogel M, childsds: Data and Methods Around Reference Values in Pediatrics, 2022) was used to calculate the WC-, weight- and BMI-for-age percentiles and SDS.

Results

Study population

During the study period, 70,190 individuals participated in the NHANES. Of them, 7686 were aged 12–18 years and 7231 were eligible for inclusion. A total of 4409 were excluded due to missing data of at least one MetS criteria. No demographic, BMI or weight differences were observed between those who met the inclusion criteria and those who did not (absolute standardized difference < 0.1, Appendix A). After exclusion, 2822 apparently MetS-free individuals aged 12–18 years were included in the study, 1947 in the training group, and 848 in the validation group (Appendix B). Participants’ characteristics are summarized in Table 1. MetS prevalence was similar in both groups (5.8% in the training group, and 5.1% in the validation group) as well as the other studied variables (absolute standardized difference < 0.1).

Table 1 Characteristics of individuals who participated in the NHANES and included in the study.

Univariate and multivariable analysis of predictors of MetS

In the training group, male sex (OR = 2.44, 95% CI 1.42–4.20), Hispanic versus Caucasian ethnicity (OR = 2.47, 95% CI 1.36–4.48), Hispanic versus Black ethnicity (OR = 2.53, 95% CI 1.49–4.31), weight-for-age (SDS: OR = 6.91, 95% CI 5.14–9.29) and BMI-for-age (SDS: OR = 11.81, 95% CI 7.53–18.53) were associated with a higher probability of having MetS. Two multivariable models were performed. Both models included age, sex, and race, and either weight-for-age (SDS) or BMI-for-age (SDS). The univariate and multivariable analyses are summarized in Table 2.

Table 2 Weighted univariate and multivariable logistic regression models of potential predictors for the presence of metabolic syndrome.

Discrimination ability

To evaluate the discrimination ability, the area under the ROC curve for weight-for-age, BMI-for-age, and the two multivariable models was calculated. The ROC curves and the areas under curves are summarized in Fig. 1 and Table 3. All models similarly showed a good discrimination ability (AUC ≈ 0.9) with no significant difference between them (p ≥ 0.150). Therefore, weight-for-age was chosen as the sole, easy-to-measure, preferred predictor for MetS.

Fig. 1
figure 1

Receiver operating characteristic (ROC) curves demonstrating the discrimination ability of the models to predict MetS in the training group.

Table 3 Discrimination ability of various weighted multivariable logistic regression models for the prediction of metabolic syndrome.

The optimal cut-off value of weight-for-age for predicting MetS was identified as 1.5 SDS which equals to 93.3 percentile. This cut-off value is illustrated for both, the training and validation groups in Fig. 2. In the training group, the optimal cut-off value presented a sensitivity of 87.8%, specificity of 76.9%, positive likelihood ratio of 3.82, and negative likelihood ratio of 0.16. The specificity and sensitivity values for additional cut-off values are presented in Table 4. The sensitivity and specificity while weighting were similar for each race (sensitivity 87–93%, specificity 70–80%, Appendix C). Using the optimal cut-off value and previously reported MetS prevalence in the US (4.5% to 8.4%),12 the positive predictive values and negative predictive values range between 15.2% and 25.9% and between 98.6% and 99.3%, respectively. The AUC for weight-for-age was similar in training (0.897, 95%CI 0.875–0.918) and validation groups (0.868, 95%CI 0.826–0.909, p = 0.225).

Fig. 2
figure 2

Box and whisker plot demonstrating the weight-for-age (SDS) among individuals who had MetS and who didn’t have across the study groups.

Table 4 Suggested weight-for-age cut-off values for the prediction metabolic syndrome.

Discussion

MetS in childhood is associated with an increased risk of chronic diseases in adulthood.11,12 Studies employing the IDF adolescent criteria for MetS have reported prevalence rates ranging from 0.3% to 9.5%.30 In addition, high prevalence (40%) of undiagnosed MetS among adolescents with obesity and up to 14.3% undiagnosed prediabetes were reported.31,32 Therefore, early screening is essential for both secondary prevention of MetS and primary prevention of chronic diseases in adulthood.33 Screening for MetS is usually performed by primary care clinicians and involves anthropometric measurements, blood pressure assessment, and blood tests.1,34,35,36 Due to the low adherence to treatment during adolescence,37 routine screening for MetS is probably low. Recently, a new score for Mets screening in adolescents based on artificial intelligence (AI) methods was introduced. While the AI model demonstrated good discrimination ability, it still relied on anthropometric measurements and blood tests, which also require visiting the clinic.38 To the best of our knowledge, no easy-to-use self-evaluation screening tool is currently available. Therefore, this study aimed to develop and validate a model based on variables that can be easily reported by the individual or by a family member to identify MetS in apparently MetS-free adolescents.

This cross-sectional study included 1947 and 848 apparently MetS-free adolescents aged 12–18 years who participated in NHANES and served as the training group and validation group, respectively. The study showed that weight-for-age, as a sole predictor, has a good discrimination ability (AUC = 0.9), similar to that of BMI-for-age, and to that of models that also included age, sex, and race. Hence, by simply measuring body weight followed by a percentile calculation using an electronic application, the individual, family member or primary care physician can observe an increased risk for the presence of MetS and advocate further assessment.

Our findings may help to fill a gap highlighted by the technical report that served as a basis for the recent American Academy of Pediatrics clinical practice guidelines on obesity, stating that more information is needed on the specific amount of body fat or BMI levels at which metabolic aberrations occur and that there are currently too few data to determine whether youth with overweight or at the low end of obesity should be screened.39 By utilizing data from adolescents with a continuum of weight and BMIs, we were able to answer that question and identify a suitable cut-off (i.e., the 93rd weight-for-age percentile).

A previous study tried to predict Mets using anthropometric indices among Chinese adolescents.40 The study has reported that the BMI percentile can predict MetS with an AUC of 0.93, similar to that evaluated in the current study (AUC = 0.9). Another study that evaluated the performance of anthropometric indicators as predictors of MetS among Brazilian adolescents showed a lower discrimination ability (AUC of up to 0.73).41 In both studies, other anthropometric indicators also showed similar AUCs.40,41 BMI evaluation also requires height measurements, since during this life stage the height changes. Considering the low cooperation at this age, measuring height can make cooperation even more difficult. A cross-sectional survey of family physicians and pediatricians practicing in 4 US states (Alabama, Colorado, Massachusetts, and West Virginia) showed that only 44% of the physicians had strong intentions to measure BMI in pediatric and adolescent patients.42 Therefore, using weight measurement alone is expected to improve adherence to medical screening. While previous studies focus on BMI and other anthropometric measurements such as waist circumference and waist-to-height ratio, this study showed that weight can be an alternative measurement for BMI in MetS screening. Moreover, applying the optimal cut-off value that was identified (1.5 SDS), demonstrated only a small difference in the projected number of adolescents with MetS between BMI and weight in the screening for MetS (BMI: 271 out of 1000 individuals, weight: 268 out of 1000 individuals, of them 235 were identified by both metrics).

This study showed findings similar to those reported in previous studies that utilized the NHANES database in earlier periods (1988–1994, 1999–2008, 2001–2006).7,23,24 Age, as in the previous studies, was not associated with MetS, while male sex was associated with increased risk. MetS was also more prevalent in Hispanic adolescents than in Caucasian individuals and black adolescents had the lowest probability of having MetS. Consistent with the previous studies, higher BMI-for-age was found to be associated with a higher probability of having MetS.

The study suggests the use of weight-for-age as a single predictor for screening of MetS among adolescents aged 12–18 years. The discrimination ability of the weight-for-age was found to be high, both in the training group (AUC = 0.90) and the validation group (AUC = 0.87). A cut-off value of 1.5 SDS which equals to 93.3 percentile showed a sensitivity of 87.8%, specificity of 76.9%, positive likelihood ratio of 3.82, and negative likelihood ratio of 0.16. Therefore, weight-for-age has a good ability to discriminate between adolescents who do have MetS and those who do not and may be considered for primary self-evaluation screening. Moreover, these values were close in each race. To be practical and feasible for use in clinical practice, the 90th percentile can also be applied as a cut-off value with slightly higher sensitivity (92.2%) and lower specificity (71.4%). The 90th weight-for-age percentile was previously suggested to identify children and adolescents with obesity.43

Utilizing weight-for-age as the sole predictor for screening carries several implications for clinical practice. It facilitates straightforward screening in apparently MetS-free adolescents using an easily measurable parameter, which can be assessed by the individual or a family member without requiring visiting a clinic. Healthcare providers such as health maintenance organizations (HMO), can implement this approach to enhance early detection and precede a visit to the primary physician, by leveraging a mobile application or an online survey.

Since monitoring height in pediatrics is critical for various reasons, it is important to note that height measurement is critical during pediatric and adolescent clinic visits. A continuous effort is needed to increase the rate of height measurements in pediatric clinics.44

The study has several limitations. First, the use of self-reported or missing values for exclusion raises the possibility of a selection bias. To assess a selection bias, we compared excluded individuals to those included. Additionally, we used the NHANES database, a well-supervised survey database that randomly samples the US population and serves as a widely accepted data source for similar studies. In addition, we adhered strictly to the survey’s definitions. Second, the study’s external validity may be limited due to its focus on a US-only population. Cultural, socioeconomic, and healthcare system differences can influence the prevalence and risk factors of MetS in other countries, making it challenging to extrapolate the study’s conclusions to a global context. Therefore, further validation using data from other countries is warranted. However, the use of a large, nationally representative dataset like NHANES enhances the generalizability of the findings to a broader population. The sample weighting employed in this study helps ensure that the sample accurately represents the larger population from which it was drawn. Third, as a cross-sectional study, it was designed for secondary prevention of MetS and cannot identify factors for primary prevention. Fourth, pubertal status could also be included in the study as a potential predictor. A recent study evaluated data from NHANES III (1988–1994) showed that adjusting for pubertal status reduced the prevalence of overweight/obesity in the USA.45 However, pubertal status is not routinely incorporated in the NHANES dataset.

This study presents several notable strengths. First, the weight-for-age can be readily implemented for MetS screening across various platforms. Healthcare providers can utilize this approach by distributing a concise survey to adolescents and their parents to identify individuals with an increased risk of MetS. Second, while previous studies have relied on BMI-for-age, to our knowledge, this is the first model, that utilizes weight-for-age as a single predictor. Weight is a straightforward measurement, making it more likely that a larger proportion of adolescents will respond to the screening efforts. Third, the use of weight-for-age as a sole predictor for MetS screening was validated in a new group of adolescents, demonstrating similar predictive ability.

In conclusion, this study proposes a novel approach for MetS screening in adolescents. Utilizing weight-for-age as the sole predictor for MetS screening may enhance screening participation and, consequently, improve response rates. Beyond secondary prevention of MetS, this approach may help for primary prevention of chronic diseases in adulthood.