Introduction

Being overweight is often associated in society with various negative attributes, such as being undisciplined, unmotivated, lazy, incompetent, ugly or unattractive. Prevailing stereotypes and prejudices are common and also lead to weight discrimination: those categorized as too heavy are likely to face devaluation and unfair treatment [1, 2]. In youth, weight stigma often manifests as weight-related bullying and teasing. Even though this stigma is more prevalent among individuals with overweight, it is a widespread phenomenon that affects children and adolescents across all weight categories [1,2,3]; up to 50% of youth have reported weight-related bullying [3]. Negative health consequences of weight stigma are well-documented and include psychosocial problems (e.g., depressive or anxious symptoms, decreased self-esteem and body image, maladaptive eating behaviors, reduced physical activity, substance abuse), as well as physical impairments (e.g., increased risk for cardiovascular disease) [1,2,3].

Given the fact that cross-sectional studies show that weight stigma is associated with a higher risk for overweight or obesity among youth [4], theoretical frameworks postulate a vicious cycle between weight stigma and weight [5, 6]. Indeed, there is a rationale for the assumption that stigma reinforces weight gain, as well as the reverse [4]. Frequent exposure to weight stigma might increase weight or maintain obesity as a result of various emotional, cognitive, behavioral or physical mechanisms (for details, see [1, 6,7,8,9]). Similarly, higher weight status might result in increased or persisting weight stigma, because, for example, the stigmatized feature becomes more apparent [10] or stigmatization is seen as a legitimate motivator for weight loss [7]. A recent meta-analysis concluded that initial prospective studies (n = 5) provide evidence for both pathways among children and adolescents [4]. So far, only one study [11] simultaneously considered the bidirectional relationship between weight stigma and body mass index (BMI), but only among boys. The majority of the studies examined monodirectional effects only. This might result in spurious effects and therefore an overestimation of the relationship between weight stigma and weight [12]. There is a lack of evidence for the proposed reciprocal reinforcement between weight and weight stigma in the literature. Therefore, the major aim of our study was to analyze the prospective bidirectional relationship between weight and weight stigma. We hypothesized that weight-related stigma would predict future weight and vice versa.

In the context of weight stigma, potential gender differences are discussed. Whereas some studies observed higher perceived weight-related stigma among female children compared to male ones or vice versa, others found no differences [3, 13]. A recent meta-analysis reported similar cross-sectional correlations between BMI and weight stigma across genders [4]. With regard to prospective studies, three studies did not observe gender differences with regard to the effect of weight-related stigma on weight [11, 14] or vice versa [15]. Contrary, other studies reported that previous experience of weight stigma came along with a higher risk for obesity in girls, but not in boys [16], and that the predictive effect of weight stigma for BMI was higher among girls compared to boys [17]. Overall, prospective studies concerning gender differences are still sparse and do not allow meta-analytic pooling [4]. At this point, there is still no study that directly compares longitudinal reciprocal effects between weight and weight-related stigma across genders. Therefore, the second aim of this study was to compare the prospective bidirectional relationship between weight and weight stigma across genders in an exploratory manner.

Methods

Procedure

Data were derived from the PIER study, a large population-based prospective study on intrapersonal developmental risk factors in childhood and adolescence. Participants were recruited from various areas in the Federal State of Brandenburg, Germany, and written and informed consent and assent were obtained. Data were assessed at three points in time. The interval between the first (T1, 2011–2012) and second (T2, 2013–2014) assessments was approximately 20 months (M = 19.97, SD = 3.99); the third assessment (T3, 2016) followed an additional 30 months later (M = 29.97, SD = 3.25).

T1 and T2 were carried out by trained research assistants. The individual sessions included tests and self-report questionnaires (completed on a computer screen or as a paper–pencil version), took around 90–120 min each and were conducted at school, the family’s home or in the university's facilities. For logistical reasons, at T3 questionnaires were completed at home (online or paper–pencil version; approximate duration: 60 min); participation in the additional face-to-face tests was optional. For each wave, vouchers for cinemas or online shops (10€) were provided to reward participation. The study was approved by the local ethics committee.

Sample characteristics

The final sample consisted of 1,381 children and adolescents between 9 and 19 years of age (M = 13.35, SD = 2). Participants who failed to provide self-reported anthropometric data (n = 106) or self-reported weight teasing (n = 2) at least once during measurement procedures were excluded from analyses. The participation rate across genders was balanced (49.2/50.4/55.3% females at T1/T2/T3). Educational background was inquired via parental education level: 46.7% stated a higher education (e.g., BA, MA, Diploma, PhD etc.); 11% reached higher education entrance qualifications (‘Abitur,’ equivalent of a high school degree/A-level) and 24.5% reported secondary school graduation or below; 17.8% did not provide information about their education level. Further descriptive data can be found in Table 1.

Table 1 Descriptive statistics at times 1 (T1), 2 (T2) and 3 (T3)

Data across all points of measurement were available for 45% (n = 624); 33% (n = 454) of participants took part in two of the three assessments, 22% (n = 303) in one of the three. Attrition rate between T1 and T2 was 21.9% and 42.1% from T2 to T3. Changes in procedure and a lower availability of older participants (e.g., due to changes of school or leaving the parental home) might have led to the relatively high attrition rate in the last wave. Systematic comparisons revealed that non-participants at T2 and T3 were older (T2: t(1379) = 8.35, p < 0.001, d = 0.54; T3: t(1379) = 8.03, p < 0.001, d = 0.43). Moreover, non-participants at T3 were heavier (t(1345) = 3.97, p < 0.001, d = 0.22), were more often males (χ2 = 17.07, p < 0.001, \(\phi\) = 0.11) and reported more frequent weight-related teasing (t(1328) = 3.32, p = 0.001, d = 0.17) at baseline.

Materials and measures

Body weight

Based on national reference data, we calculated body mass index standard deviation scores (BMI-SDS) from self-reported weight and height [18, 19]. Additionally to self-reports, weight and height were also measured objectively by research staff. However, at T3 objective weight measurements could only be collected for a few cases (29.7%) due to the voluntary nature of the additional on-site measurements. Accordingly, our analyses refer to subjective data to keep the method consistent across the measurement points. Correlation with objective BMI-SDS was high across all measurement points (rT1 = 0.82 [n = 1267]; rT2 = 0.84 [n = 983]; rT3 = 0.93 [n = 404]).

Weight-related teasing

Weight-related teasing was assessed with five items from the perception of teasing scale (POTS [20]), which was adapted for children. Children and adolescents rated its frequency (e.g., “How often do people laugh at you because you are heavy?”) on a 5-point scale (1 = never to 5 = very often). In the current sample, internal consistencies were high (αT1 = 0.86, αT2 = 0.83, αT3 = 0.83).

Data analyses

Statistical analyses were performed with IBM SPSS Statistics 27 (descriptive data, drop-out-analyses) and MPLUS 7.0 (latent path modelling). We used structural equation modelling to test the theoretical model (displayed in Fig. 1). Our preceding analyses ensured that the POTS showed adequate measurement invariance across time and genders. Therefore, teasing was considered as a latent variable. BMI-SDS was included as a manifest variable. We included autoregressive paths to control for stability and allowed cross-sectional correlations between the variables. All variables were controlled for age and gender. To test for differences across gender, we conducted multigroup comparisons. Wald tests were applied to compare path coefficients. We evaluated the model data fit as follows: a good fit was indicated by a non-significant χ2, χ2/df ratio ≤ 3, root mean square error of approximation (RMSEA) ≤ 0.05, comparative fit index (CFI) ≥ 0.95 and root mean square residual (SRMR) ≤ 0.1 [21]. For all analyses, the alpha level was determined at p < 0.05. We employed full information maximum likelihood (FIML) estimation to cope with missing data on single variables and drop-outs. This model-based procedure is recommended, as it is superior to other techniques [22] and reduces the risk of biased results caused by the list-wise depletion of drop-outs. Due to non-normality of the POTS items, we applied robust maximum likelihood estimators (MLR) [23].

Fig. 1
figure 1

Prospective relationships between weight (BMI-SDS) and weight-related teasing for children and adolescents. Standardized results are reported. Variables are controlled for age and gender at the respective point of measurement. T1/T2/T3 point of measurement 1/2/3, BMI-SDS body mass index standard deviation score. Grey dotted lines indicate non-significant results (p > 0.05). ***p < 0.001, **p < 0.01

Results

Descriptive data on weight-related teasing and BMI-SDS are displayed in Table 1.

Prospective relationships between BMI-SDS and weight-related teasing

The proposed model showed a good fit to the data (χ2(191) = 300.98, p < 0.001; χ2/df = 1.58; RMSEA = 0.02; CFI = 0.98; SRMR = 0.06). Results are displayed in Fig. 1. As expected, significant cross-sectional associations were observed between weight-related teasing and BMI-SDS at each measurement point, with the highest correlation at T1 (β = 0.28, p < 0.001) and slightly lower correlations at T2 and T3 (β = 0.21, p < 0.001). With regard to the autoregressive paths, BMI-SDS (βT1–T2 = 0.65, βT2–T3 = 0.73; p < 0.001) and weight-related teasing (βT1–T2 = 0.46, βT2–T3 = 0.53; p < 0.001) showed a high stability. As expected, weight-related teasing and BMI-SDS at T1 predicted each other at T2. Weight-related teasing at T2 in turn predicted BMI-SDS at T3, whereas the reciprocal association was not observed. Variance in all endogenous variables was explained significantly (perception of weight-related teasing: 29.1% at T2, 28.6% at T3; BMI-SDS: 50.9% at T2, 58.2% at T3; p < 0.05).

Additionally, analyses were rerun with objectively measured weight data. Due to the low rate of objective measurements at T3 (< 30%), we refrained from imputing these data. These exploratory analyses delivered quite comparable results, although cross-lagged paths between T2 and T3 were only marginally significant.

Multigroup comparison across genders

The model of the multigroup comparison showed an acceptable fit to the data (χ2(191) = 609.22, p < 0.001; χ2/df = 1.58; RMSEA = 0.03; CFI = 0.96; SRMR = 0.14). On the whole, results were comparable to those of the total sample (see Fig. 2). The Wald test revealed no significant differences between male and female children and adolescents, except that females reported a higher stability of weight-related teasing (WT1–T2 = 3.51, p = 0.061, WT2–T3 = 5.58, p = 0.018).

Fig. 2
figure 2

Prospective relationships between weight (BMI-SDS) and weight-related teasing for males and females. Standardized path coefficients for male/female children and adolescents. Variables are controlled for age at the respective point of measurement. T1/T2/T3 point of measurement 1/2/3, BMI-SDS body mass index standard deviation score. Grey dotted lines indicate non-significant results (p > 0.05). ***p < 0.001, **p < .01, *p < 0.05

Discussion

To sum up, the cross-lagged panel analyses revealed a reciprocal relationship between weight-related teasing and weight in children and adolescents across all weight groups. More specifically, higher weight-related teasing was consistently associated with higher future BMI-SDS, and BMI-SDS also predicted weight-teasing; however, this was only observed between the first two measurements. We did not find evidence for differences across genders.

Corresponding with previous findings [4], weight-related teasing and weight were not only correlated cross-sectionally but also predicted each other longitudinally. Accordingly, the results indicate a cyclical relationship between weight stigma and weight, although this was not consistent across all points of measurement. Two previous studies reported a longitudinal influence of weight on weight stigma in children and adolescents [11, 15]. We can only speculate as to why BMI-SDS at T2 did not predict weight-related teasing at T3 in the current sample. For one thing, the temporal distance between T1 and T2 (approximately 20 months) was shorter than between T2 and T3 (approximately 30 months). This complicates the clear interpretation of bidirectional effects but might suggest that weight could be predictive of weight stigma only over relatively shorter time periods. However, the design allows no conclusion about underlying mechanisms or individual weight fluctuations between the measurements. These should be examined more in-depth by future studies (e.g., on the basis of shorter measurement intervals).

Besides this, participants were getting older during the course of the survey. Increasing age is associated with a higher acceptance of various body sizes deviating from the thin ideal [24, 25]. This could mean that weight is no longer predictive for weight stigma among older participants. Further, the missing link may result from our operationalization focusing on weight-related teasing defined as laughing at individuals or joking about heavy weight. Even so, the nature of weight stigma might differ by age so that, for older individuals at the last point of measurement, other forms of weight stigma (e.g., social exclusion) might have been more relevant [26]. Ultimately, however, due to overlapping age spans across waves, no conclusion as to the age effect was possible in our study. Accordingly, a comparison of age groups could be of further interest [27].

With respect to gender differences, we observed a lower stability of teasing in male (compared to female) children and adolescents between the last two measurements. This is consistent with a study showing that weight teasing overall might be relatively stable and track into young adulthood, but decreases in males (but not females) from early adolescence to young adulthood [28]. Beyond that, it should be mentioned that we did not observe gender-related differences with respect to the relationship between weight-related stigma and BMI-SDS—neither cross-sectional nor longitudinal. This is in line with previous studies that also reported no gender differences [14, 15], but runs contrary to others indicating that there might be gender differences [16, 17]. A closer examination reveals that moderators might explain inconsistent results regarding gender differences. For one thing, consideration of the perpetrators of teasing might be an essential variable: among females, weight increased when being teased by either families or peers, whereas among males only teasing by peers had a significant contribution to the further weight course [17]. Additionally, one study reported that weight-related teasing predicted future overweight only for girls, but an increase in teasing over time was predictive of future overweight among both genders [16]. Future studies examining and comparing the bidirectional associations among males and females are warranted. This would help explain the complex association between body weight and weight stigma in more detail.

Taken together, our study underscores that weight and weight-related teasing reinforce each other over the long term. As a consequence, experiencing weight stigma or being heavier might lead to a vicious cycle—with the result that both will worsen. Weight-related stigma as well as being overweight are associated with various detrimental negative health outcomes [1,2,3, 29, 30]. These in turn might also be intensified, and negative consequences could persist into adulthood [31]. This highlights the importance of interrupting such vicious cycles to prevent adverse trajectories. Preventive and interventional strategies might be especially effective if they simultaneously address both levels: reduction of weight stigma as well as promoting healthy weight development to prevent future weight gain. Therefore, preventive public health approaches (e.g., health campaigns at schools), as well as health-promoting interventions, should focus on non-stigmatizing content and the promotion of weight-neutral healthy lifestyles instead of focusing on weight per se [7].

Presently, there are several assumptions about the underlying mechanisms that might explain the bidirectional relationship between weight and weight-related teasing [5,6,7], such as weight bias internalization, eating behavior, body image and coping. Knowledge derived from these studies might improve preventive approaches and interventions to stop the vicious cycle between weight and weight stigma.

Strengths, limitations and future implications

This is the first study to analyze the cross-lagged longitudinal relationship between weight and weight-related stigma among youth and compare bidirectional relationships across gender. Nevertheless, there are some limitations that should be mentioned. First, there was a high and also systematic drop-out over time. We applied the FIML approach to take drop-outs into account, but that does not completely eliminate the risk of bias. As those who dropped out showed a higher BMI-SDS and experienced more frequent weight teasing at baseline, the role of weight teasing might be underestimated. Second, current weight-related teasing was assessed by the POTS as laughing or joking about weight, but other types of stigma which might vary across age spans [26], were disregarded. This allowed to keep the method consistent across the measurement points but might lead to an underestimation of other types of weight stigma that might become more relevant in older participants. Furthermore, the study design depicts the current individual evaluation of weight teasing but cannot provide information about fluctuations or underlying mechanisms within the interval between the measurement points. In addition, the widespread of ages at baseline impedes conclusions about the role of developmental stages in the relationship between weight and weight stigma and the timing of interventions to interrupt the cycle. As the current data were not suited to analyze appropriate age bands, future research could take care of this problem, e.g., by means of multi-cohort-sequence designs. Fourth, analyses were based on self-reported weight data. We exploratively reran the analyses with objective weight data and yielded similar results, but these were limited by a very small data base at T3. Although self-reported and objective anthropometric data were highly correlated [32], in general self-reported data underestimated the objective weight status (data not shown). This was acceptable, as we focused on the presence of relationships over time instead of interpreting absolute mean scores. As differences between self-reported and objective data were not associated with weight teasing (data not shown), we presume that a potential underestimation of effects is constant across time points and allows us to draw sound conclusions about the pattern of temporal relationships—even though the magnitude of these associations might be underestimated. Nevertheless, future studies on the basis of objective weight data are warranted. Last but not least, our sample size was not sufficient to compare the model between weight groups. Future studies should therefore examine whether the tested model differs across weight groups. We included individuals with normal and underweight, as there is evidence that weight stigma is prevalent across all weight groups [17]. Nonetheless, one might assume that the association between weight stigma and weight is especially relevant among those who are already overweight and should pay special attention to this group.

The present study also has several strengths. First, the prospective design with three points of measurement allowed an investigation of reciprocal effects over the course of six years. Second, the large sample with a balanced gender ratio allowed us to test complex models. Third, we implemented a latent modeling of weight-related teasing. Overall, this valuable methodological approach enabled us to control for random effects of measuring errors, to verify measurement invariance across time and genders, and to consider the stability of the included variables.

Conclusions

In conclusion, weight and weight-related stigma were shown to be interrelated over the long term, leading to a vicious cycle for both females and males across weight groups. Effective approaches are needed to interrupt detrimental trajectories and comorbid negative health outcomes. Therefore, it might be promising to simultaneously address weight stigma reduction and healthy lifestyle interventions to either promote weight loss or hinder future weight gain. In addition, it might be crucial to identify factors that mediate or moderate the relationship between weight stigma and weight.