Background

Childhood obesity is associated with various cardiometabolic risk factors (CMRs), such as elevated fasting blood glucose (FBG), elevated glycated haemoglobin levels, dyslipidaemia, and elevated blood pressure (BP) [1, 2]. The presence of these CMRs may be tracked to adulthood and is associated with an increased risk of cardiovascular and metabolic diseases [3]. Although several behavioural interventions have been developed to curb the obesity epidemic, the long-term success of these interventions remains unsatisfactory [4]. Therefore, identifying the high-risk groups with CMRs from an early age people with obesity could benefit in improving health awareness and behaviour of children and further promote their long-term adherence [5]. This may be of great potential to prevent the development of advanced stages to cardiometabolic diseases.

Overweight or obesity, FBG, lipid profiles, and BP are recommended as important risk factors of cardiovascular diseases in later life [6], indicating current cardiometabolic risk status with clear diagnostic criteria. The abnormal status of which has been included as the component of metabolic syndrome (MetS), as the widely-used definition proposed by the International Diabetes Federation [7]. A blood test is recommended to identify dyslipidaemia among at-risk children [6]. However, it is not feasible to use blood tests to identify CMRs in apparently healthy children based on routine physical examinations, especially in early age children. As the prevention and early detection of CMRs before the onset of obesity-related medical problems are critically important for children, obesity-related anthropometric indexes with easy measurement and classifications have been considered to detect paediatric CMRs rapidly and economically [8, 9].

Waist-to-height ratio (WHtR) was proposed as an index of abdominal obesity that may predict multiple coronary heart disease risk factors in the 1990s [10]. Compared with BMI and WC classifications in defining childhood obesity, WHtR that was the ratio of WC to height shows much less variable with age. This feature makes the classification of obesity or discrimination of CMRs much easier, with a single cutoff across an age range. Previous work has demonstrated that WHtR is an accurate and simple tool for quick and mass screening of CMR in children compared with body mass index (BMI) and waist circumference (WC) [8, 11, 12]. To date, only two meta-analyses have been reported using WHtR for identifying CMRs in the children population, though original studies of this topic are abundant. However, these studies presented limitations with respect to methodology; further, there were only a small number of eligible original studies [13, 14]. The most recent diagnostic meta-analysis in 2016 has demonstrated comparable performances among WHtR, BMI and WC in screening CMRs in children [13]. Of note, the practical application of WC and BMI is inconvenient in the paediatric population because of the requirement for age- and sex-specific references and studies showed that WHtR is age-independent compared with other obesity-related indexes [15,16,17,18]. Therefore, WHtR can still be recognised as the most promising one [13, 19] due to good screening accuracy, easy calculation and interpretation.

Moreover, the excellent performance of WHtR with the summarised area under the receiver operating characteristic (ROC) curve (AUC) in discriminating MetS exceeded 0.8 [13]. Since that, a range of new original studies have been published, presenting good but diverse discriminating performances of WHtR in screening various CMRs and MetS based on different population and methodologies [8, 20,21,22,23,24].

A boundary value of WHtR of 0.5 has been proposed for the adult population and widely applied in children studies [25]. The optimal WHtR cutoff for the children population is still inconsistent to achieve satisfactory discrimination of cardiometabolic risk across the studies [21, 26]. Our meta-analysis included different populations with various optimal cutoffs to verify the generalizability and robustness of WHtR for CMRs screening. Through evaluating the performance and robustness of WHtR based on up-to-date studies, our meta-analysis extended previous findings to evaluate the practical values of WHtR as a screening tool in discriminating individual and clusters of CMRs. Furthermore, we reported a range of evaluation indexes for the first time to provide more convincing evidence of the feasibility and applicability of using WHtR in routine public health practices.

Methods

Search strategy

Our meta-analysis followed the PRISMA guidelines on a transparent and reproducible process. Comprehensive searches were performed performed using the OVID platform in the following databases: Pubmed, Embase, Medline, Cochrane Library and the Chinese databases of Wan fang, CNKI, VIP, and Sinomed from inception to July 03, 2020. Search terms included "waist-to-height ratio", "children", and the synonym combinations of them. The search strategy used in PubMed is presented in Additional file 1. The same screening terms were used in the rest of the databases. We also conducted forwards and backwards citation tracking of included studies to identify potentially eligible studies.

Inclusion criteria

We included studies that met the following criteria: (1) observational in design, (2) had a target population of children and adolescents, or studies of the general population which provided data on a subgroup of children and adolescents, (3) provided data on WHtR measurement, and had (4) at least one of the following CMRs as the primary outcome: elevated FBG, elevated BP, dyslipidaemia, elevated total cholesterol (TC), elevated triglyceride (TG), low high-density leptin cholesterol (HDL-C), elevated low-density leptin cholesterol (LDL-C), central obesity and clustered of CMRs, and (5) diagnostic studies.

Exclusion criteria

The studies that met the following criteria were excluded: (1) non-original articles, e.g., reviews (2) unable to extract 2 × 2 contingency tables, (3) not in English or Chinese language, (4) populations comprising primarily clinical patients, or (5) duplicate articles.

Data extraction

To determine study eligibility, two reviewers (YJ & Yl D) independently screened titles, abstracts and identified full-texts. Discrepancies between reviewers were settled through discussions with the reviewer team (Wl Y, Y Z, Xt C, Y W).

These two reviewers extracted the following characteristics of the included studies: author, published year, study year, country, study design, sample size, sex, age, WC measurement technique, WHtR cutoff point, and 2 × 2 contingency table. For those articles without an original contingency table, we extracted the prevalence of CMRs, and the reported diagnostic accuracy parameters (sensitivity, specificity or ROCs) to calculate the value of true-positive (tp), false-positive (fp), true-negative (tn), and false-negative (fn). WHtR was computed as the WC in centimetres divided by the height in centimetres in each original articles.

Quality assessment

The quality of the included studies was independently assessed by two reviewers (YJ & Yl D) using the quality assessment of diagnostic accuracy studies-2 (QUADAS-2) tool. QUADAS-2 is a revised tool for systematic reviews in evaluating the quality of original diagnostic accuracy studies in the realm of bias and applicability, which involve four domains (patient selection, index test, reference standard, and flow and timing). The reviewer team was in charge of mediating discrepancies regarding quality assessment [27].

Statistical analysis

We performed the meta-analysis based on five groups of outcomes as follows, referring to the commonly used paediatric MetS definitions of the International Diabetes Federation and the widely studied CMR components in children:

  1. (1)

    Cluster of CMRs

  • CMR3: presenting with least three of CMRs (elevated FBG, elevated BP, dyslipidaemia, central obesity);

  • CMR2: presenting with least two of CMRs;

  • CMR1: presenting with at least one of CMRs.

  1. (2)

    Elevated FBG

  2. (3)

    Elevated BP

  3. (4)

    Dyslipidaemia

    Dyslipidaemia was defined as having at least one of the following abnormalities: elevated TC, or elevated TG, or low HDL-C, or elevated LDL-C. To reduce the heterogeneity of the summarised result, we further pooled dyslipidaemia components separately.

  4. (5)

    Central obesity

The sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), positive likelihood ratio (PLR), negative likelihood ratio (NLR), and prevalence of outcomes were calculated based on tp, fp, tn, and fn extracted from original studies. When results in the original studies were presented by age and sex stratifications, we combined these results into an overall estimation. Summary Area Under Receiver Operating Characteristic (AUSROC), sensitivity, specificity, PLR, NLR, and diagnostic odds ratios (DORs) with 95% confidence intervals (CIs) were estimated using the bivariate mixed-effects model based on the random-effects assumption [28]. The advantages of reporting AUSROC lies in its capability of reflecting the interaction between sensitivity and specificity when the threshold effect exists. We assessed between-study heterogeneity using Cochran's Q, and also quantified heterogeneity using the I2 statistic. Heterogeneities were substantial when I2 over 50% or P-value of Q statistic < 0.05. The threshold effects were evaluated by pairwise correlations between sensitivity and 1-specificity with a P-value < 0.05.

We further carried out subgroup analyses based on sex and an East Asian ethnicity only due to the limited number of studies in other ethnicities. In our sensitivity analysis, we explored the heterogeneity of the results, for the primary outcome CMR3, by only summarising studies that used identical CMR components, e.g. FBG, HDL-C, TG, BP, and central obesity.

Deek's funnel asymmetry tests were used to test publication bias; a non-zero slope coefficient suggested significant bias (P-value < 0.05). We sought to figure out possible sources of heterogeneity from diagnostic standards of outcomes, e.g., FBG, BP, dyslipidaemia components, and central obesity, and WHtR cutoff values using meta-regression based on joint models, when we had enough statistical power to do so [29]. The joint model was a comprehensive index in consideration of heterogeneity for sensitivity and specificity. Specifically, the diagnostic criteria for each outcome were classified into a dichotomous variable as follows: elevated FBG ≥ 5.6 mmol/L(100 ng/dL) vs other cutoffs as a group; divided elevated BP into the fixed value of SBP ≥ 130 mmHg or DBP ≥ 85 mmHg or the percentile based on the reference standard; split elevated TG by ≥ 150 mg/dL/ (1.6935 mmol/L) vs other cutoffs as a group; divided low HDL-C into between < 40 mg/dL (0.998 mmol/L) or not; grouped central obesity according to the WC ≥ the 90th percentile of the age- and sex-specific reference standard. The effect of each covariate on sensitivity and specificity separately was not depicted in our meta-regression, as we emphasised the comprehensive index in consideration of heterogeneity from both of them. All analyses were performed using Stata version 15.0 (Stata, Version 15.0 [computer program], Tex Stata Corp., Coll. Station, 2015).

Role of the funding source

The funding sources had no influence on study design, data collection, analysis, interpretation, paper writing. All authors confirmed that they had full access to all the data in the study and accepted the responsibility to submit for publication.

Results

Figure 1 depicts the search results of this meta-analysis. Of the 2,204 hits, 578 duplicates were removed, and the 1,626 titles and abstracts were screened. Of these, 1,380 were excluded because they were not considered to meet the study inclusion criteria based on the title and abstract. A total of 246 abstracts appeared to be potentially relevant and were collected as full-text articles to be assessed for eligibility for the meta-analysis. A total of 193 articles were subsequently excluded for the reasons listed in Fig. 1, which left 53 studies included in the meta-analysis.

Fig. 1
figure 1

Selection process of primary studies in the meta-analysis. Of the 2204 citations yielded through database searching, 578 citations were removed due to duplicate. In the rest of 1626 citations, 1380 were excluded by screening titles and abstracts. Full-texts were assessed in 246, and an additional of 193 studies further were precluded. Finally, a total of 53 studies were included in the meta-analysis

Characteristics of the included studies are presented in Table 1. These studies were performed between 1993 and 2018 and conducted in 24 countries; of these, nine in Europe, seven in Asia, four in South America, three in Africa, and one in North America. Besides, the majority of studies were in the cross-sectional study design, with only one cohort study and one case–control study in our meta-analysis. Study subjects ranged in age from six to 20. The sample size among these studies ranged from 178 to 105,245. WHtR cutoffs extracted contingency tables, and the calculated diagnostic indexes from original studies are listed in Additional file 2: Table S1.

Table 1 Characteristics of included studies in the meta-analysis

The quality evaluations of the included studies through QUADAS-2 are shown in Additional file 3: Table S2 and Fig. 2. Among 53 included studies, two studies were evaluated in high risk of bias in the domain of flow and timing, two studies with a high probability of bias in the domain of reference standard, zero in the domain of index text(s), and three studies in the domain of patient selection. Moreover, the applicability of original studies was in good performances on the whole, with only one study assessed in the high risk in the domain of reference standard and patient selection, respectively.

Fig. 2
figure 2

The quality evaluations of included studies through QUADAS-2. The risk of bias was low in most of the studies. Among them, the number of studies having a high risk of bias was two, two, zero and three in the domains of flow and timing, reference standard, index test(s), and patent selection, respectively. The majority of studies were of good applicability. Only one studies presented great concerns in the reference standard domain and one in the patient selection domain

Table 2 shows the summarised results of the diagnostic accuracy of WHtR for identifying CMRs. The value of pooled AUSROC increased with the number of CMRs, with 0.91 (95% CI: 0.88, 0.93), 0.85(95% CI: 0.81, 0.88), and 0.75(95% CI: 0.71, 0.79) for CMR3, CMR2, and CMR1, respectively. For CMR3, pooled sensitivity reached to 0.84 (95% CI: 0.76, 0.90), specificity of 0.84 (95% CI: 0.78, 0.88), and DOR reached 28 (95% CI: 14, 54). For single CMR components outcomes, the values of AUSROC were relatively lower in general, with 0.59 (95% CI: 0.55, 0.63) for screening elevated FBG, 0.69 (95% CI: 0.65, 0.73) for elevated blood pressure, and 0.66 (95% CI: 0.62, 0.70) for dyslipidaemia. Notably, the AUSROC reached 0.96 (95% CI: 0.94, 0.97) in discriminating central obesity. Threshold effects were founded in pooled elevated FBG, elevated BP, and dyslipidaemia (P values < 0.001), with a correlation coefficient of 0.949, 0.801, and 0.859, respectively. Substantial heterogeneities were found among studies in each CMR outcome, which showed that all indexes of I2 were 100%, and Q statistics were significant (P < 0.001; Additional file 4: Table S3). No publication bias was founded excepted for elevated blood pressure (P-value = 0.032 based on Deek's funnel plot asymmetry test). The forest plots of pooled sensitivities, specificities, and odd diagnostic ratios in discriminating CMRs are listed (Additional file 5: Figure S1–S14).

Table 2 Summarised performance of WHtR screening for CMRs in children and adolescents

Subgroup analyses by sex showed similar results with the entire population, with AUSROC of CMR3 being 0.89 (95%CI: 0.78, 0.95) for boys and 0.84 (0.76, 0.90) for girls, respectively (Additional file 6: Table S4). The sex difference was not significant among each outcome. Performances of WHtR in summarised studies of the East Asian population were consistent with that of the entire population (Additional file 7: Table S5). The AUSROC reached 0.92 (95% CI: 0.89, 0.94), 0.88 (95% CI: 0.85, 0.90), and 0.79 (95% CI: 0.75, 0.82) for CMR3, CMR2, and CMR1, respectively. Notably, the heterogeneity remained high in each outcome of these two subgroups, with I2 ranging from 96 to 100.

We further performed the sensitivity analysis for CMR3 in 13 studies with five identical components: elevated FBG, low HDL-C, elevated TG, elevated BP, and central obesity. We found that the AUSROC reached 0.90 (95%CI: 0.80, 0.95) with the sensitivity of 0.89 (95%CI: 0.82, 0.93) and specificity of 0.89 (95%CI: 0.82, 0.93). Although the threshold effect was not significant (p = 0.905), the heterogeneity remained large, with I2 of 100% (Additional file 8: Table S6). We did not analyse the other outcomes, e.g., CMR2 or CMR1, because the numbers of studies with identical components were less than three.

In our meta-regression analyses (Table 3), we found that various WHtR cutoffs used in the original studies may account for 69% to 99% of the heterogeneity of the meta-analyses, while the differences in using different diagnostic criteria for defining components of CMRs did not make a significant contribution.

Table 3 Possible sources of heterogeneity: meta-regressions

Discussion

In this study, the excellent performances of WHtR significantly prove its advantage in screening children with CMRs in routine practice. We found that the satisfactory performance of WHtR was notable for discriminating clustered CMRs. Especially for identifying CMR3 (affected with at least three CMR abnormalities), the AUSROC would reach 0.91 with both the sensitivity and specificity achieved 0.84, and the DOR of 28 indicated that the odds of being affecting CMR3 were 28 times higher among children with WHtR over cutoff values than that of non-CMR3. Besides, the AUSROC attained 0.85 for screening CMR2 and 0.75 for CMR1. As a comprehensive concept with public health significance, the clustered CMRs can be detected by WHtR accurately, and the more severe the CMR, the easier it will be identified. Moreover, the summarised results posed that WHtR remained robust in East Asian populations and each sex. These findings further strengthen the evidence of the application values of WHtR and promote this quick and convenient measurement as a routine screening tool in the practice of prevention and control of CMR in children.

After the meta-analysis previously published in 2016 [13], many recent original studies involving more diverse ethnicities and methodologies were summarised in our meta-analysis through the more optimised statistical models, which further provided superior and more informative findings. Our meta-analysis of 23 studies presented a slightly better performance (AUSROC of 0.91) for CMR3 screening compared with the previous one (AUSROC = 0.81, 95% CI: 0.77, 0.86) that identified MetS [13]. Furthermore, the consistent performance of WHtR in screening cardiometabolic diseases has been demonstrated in adult populations based on a meta-analysis [17], with the pooled AUC of WHtR for incident diabetes, MetS, and total incident cardiovascular disease were all over 0.7. These findings suggest that WHtR could be a useful tool in identifying both CMR in children and cardiometabolic diseases in adults.

Central obesity is a critical component and precondition of MetS in children and adolescents [30], which can be defined by the sex and age-specific WC percentile for children [31]. As WHtR had a substantial accuracy in identifying central obesity with the excellent AUSROC of 0.96, the criterion for clustered CMRs including the central obesity in our study may make the accuracy of identification of clustered results slightly higher (the AUSROC reached 0.91for CMR3), which has been indicated in our previous study (AUC for CMR3 including central obesity v.s. that excluding central obesity, 0.89 v.s 0.76) [18]. Early evidence also supported the good performance of WHtR for screening CMR components excluded central obesity. Previous meta-analyses of the paediatric population illustrated that the pooled AUC of WHtR screening MetS factors without central obesity reached 0.71 (95% CI: 0.66, 0.75) [12], and for the cardiometabolic comorbidities of at least three items was 0.69 (95% CI: 0.57, 0.80), which were acceptably accurate [13]. CMRs such as obesity, hypertension, and dyslipidaemia tend to cluster in youth [32]. Cohort studies show strong evidence that single or cluster CMRs present during adolescence likely track forward to adulthood and are related to markedly increased risk for cardiovascular or metabolic disease [33,34,35,36,37]. These all underline the importance of identifying children at increased risk for cardiometabolic comorbidity as early in life as possible.

We found that our meta-analyses suffer from high heterogeneity. Although WHtR over 0.5 has been proposed as a healthy cutoff for avoiding cardiovascular disease and diabetes for adults and been widely used [11], various WHtR cutoffs have been adopted in the original studies to fit their study populations with different prevalence levels of CMR. An Algerian study showed that the optimal cutoff value of 0.55 achieved a specificity of 0.89 in identifying girls with MetS, while this number was only 0.75 using the well-reported cutoff of 0.5 [38]. Our previous study also suggested that the critical value of WHtR of 0.467 is more accurate than 0.50 in Chinese children [20]. We used the AUSROC to minimise the influence of the threshold effects [17, 30]. The findings from our meta-regression demonstrated that heterogeneity of our findings might likely be explained by different threshold cutoffs used to classify WHtR and less likely due to different standards of diagnosing outcome variables, e.g., slightly different diagnosing cutoffs. In addition, subgroup analyses for sex and ethnic background, i.e. Eastern Asian populations, as well as our sensitivity analyses summarising studies with homogenous CMR components also supported the robustness of the main results. Moreover, the biases were limited in included original studies among all four domains by QUADAS-2. Although most of the studies did not report whether blinding was used, we still believe that biases from unblinding do not exist in our study, because both WHtR and the standard of CMRs were defined based on objectively measured numerical values. It is not likely to bias the quality evaluation results.

Our meta-analysis had several strengths. First, our meta-analysis based on the most updated, relevant studies over the world provided the most substantial evidence to date about WHtR as a promising screening tool for CMRs in the children population. Second, our study presented multiple summary diagnostic accuracy indexes, including AUSROC, sensitivity, specificity, DOR, and likelihood ratio, which were more informative for public-health practice decisions [28]. Third, we presented WHtR performance for a series of a single component of CMRs, providing decision-makers with thorough information when using it for populations with different disease status. We further discriminated the summary results in three levels of CMR clusters representing the comprehensive cardiometabolic risk in children. Fourth, this was the first meta-analysis discriminating the performance of WHtR for CMRs in the East Asia population who were less tolerable of obesity and developed earlier to CMRs at a lower level of BMI [39].

Our analyses must be interpreted in the context of the limitations of the available data. First, we found significant heterogeneity of included studies and using diverse WHtR cutoffs for screening CMRs as the main determinant. Overall, the results of analyses for meta-regressions, subgroup analyses, and sensitivity analyses into considerations highlighted the robustness and the accuracy of WHtR in screening CMRs. Additionally, our synthesised results were more evidence-based than a single original study. The variation in measuring and defining WHtR and CMRs existed in the original studies, and our evidence-based synthesis to some extent helped ensure the generalizability of the main findings for translation into clinical practice. Secondly, we found fewer relevant studies from other races, such as Caucasians and Africans; therefore, the performance of WHtR in these populations needed more evidence so that we may be able to accurately generalise to these populations. Third, obesity-related CMRs vary with age; given the lack of this information from the original studies included in our review, we could not consider this factor in our analyses. Fourth, the different WC measurement techniques may cause bias and variability among original studies [40] and lead to the heterogeneity of our meta-analysis. However, a systematic review of 120 studies suggested that different WC measurement techniques are not likely to bias the association between the WC or WHtR and cardiometabolic diseases [41]. Future studies and efforts are expected to translate this evidence into practise of preventing and controlling CMRs in children and adolescents.

Conclusion

In summary, our meta-analysis presents a good performance of WHtR in identifying children and adolescents with CMRs, and the findings appear robust to other factors, including ethnicity. This evidence strongly supports WHtR as a promising and practical tool in routine primary health care practice of controlling obesity-related CMRs in children and adolescents.