Introduction

Obesity rates in Western countries over the past 30 years have increased rapidly [1], and Sweden is not an exception: In Sweden, the share of overweight and obese among men aged 16–84 years has increased from 30 % to more than 50 %; for women, the share has increased from 25 to 35 % (Statistics Sweden, Survey of Living Conditions). The general picture emerging from research on excess weight and labor market outcomes states that heavy individuals, particularly women, are less likely to participate in employment and tend to earn less [222]. However, the results are not conclusive. For example, Norton and Han [23] do not find any negative weight effect on labor market outcomes for American men and women. Similarly, Behrman and Rosenzweig [24] do not observe any weight penalty in wages for US women. Another exception is presented by Brunello and D’Hombres [25], who observe that the negative effect on wages is stronger for men than for women, using data from the European Community Household Panel. Cawley [9] argues that the mixed results are partly a consequence of different identification strategies. The type of weight measure is another factor that could influence results [7].

The literature discusses two main channels through which excess weight may influence labor market outcomes: lower productivity due to bad health and discrimination. Obesity and overweight are associated with comorbidities such as type II diabetes, various types of cancer and cardiovascular diseases [26, 27], conditions that may contribute to reducing individual ability to work [16, 2834]. Furthermore, an association between lower productivity and excess weight on average may pose difficulties for the individual of excess weight to get hired or get a pay raise. Apart from such statistical discrimination, excess weight individuals are also the targets of discriminatory attitudes that ascribe negative characteristics, e.g., laziness and lack of self-discipline, to them [35, 36]. Such inferences, based on physical attractiveness, appear to carry over to the labor market [18, 3739].

Although the channels are often difficult to separate, it is important for studies investigating discrimination to account for potential productivity losses due to bad health.Footnote 1 Studies that investigate the impact of health when analyzing the relationship between excess weight and labor market outcomes observe different effects. Baum and Ford [10], using US data, Morris [4], using UK data, and Greve [3], analyzing Danish data, conclude that health measures have limited influence on the excess weight penalties found in their analyses. By contrast, Lundborg et al. [40] observe a very strong effect of health on the income penalty for excess weight among Swedish men. The mixed findings motivate further research on the influence of health on excess weight penalties in the labor market.

This article investigates the excess-weight penalty in income for men and women in the Swedish labor market, using longitudinal data. Previous studies using Swedish data are rare, and, to our knowledge, we provide the first analysis of weight and income for Swedish women. We regress income on lagged weight categories, applying different identification strategies, OLS and individual fixed effects. The former strategy considers the impact of weight, e.g., the impact of being obese, while the latter considers the impact of changes in weight, e.g., the impact of becoming obese. We distinguish between the productivity and discrimination channels by controlling for individual health, using several measures. Any remaining excess-weight penalty is considered an indication of potential discrimination. In addition, our approach allows us to explore the “health effect” across identification strategies.

The article is outlined as follows. The next section discusses the data and variables and presents descriptive statistics of the sample. The “Methods” section contains a discussion of methodological issues, while the “Results” section presents the estimates from our main analysis as well as sensitivity analyses. The “Discussion” section discusses the results, and the “Conclusion” section concludes the article.

Data and descriptive statistics

Our empirical analysis uses data from the Swedish Survey of Living Conditions (the ULF survey). The ULF survey is an annual systematic survey of living conditions conducted by Statistics Sweden since 1975. The data are collected during 1-h personal interviews with randomly selected individuals aged 16–84 years and complemented with information from various registers. On average 7,500 individuals are interviewed yearly. The database is primarily cross-sectional, but it also contains a longitudinal panel. The panel is complemented with immigrants and young individuals who have become old enough to be included in the population [41]. The questions are divided into four main themes: Health, Social relations, Physical environment and Work. The survey always contains some central questions from all themes. However, every 8 years each theme receives particular attention. This study uses unbalanced panel data from four 2-year waves, 1980–1981, 1988–1989, 1996–1997 and 2004–2005, covering a 25-year period and focusing on health-related issues. The last two survey waves had 75 % response rates [41].

At the outset, the sample consists of n = 22,855 observations. The sample is restricted to working-age individuals, i.e., those aged 20–64 years (n = 16,816) who have not retired (n = 15,779). We are only interested in individuals who appear at least twice (n = 10,048). The lag length in the final sample varies between 8 years and 16 years (<5 % of the sample observations). In addition, we require information on BMI and that BMI is lower than 45, thereby including individuals who are morbidly obese (12 observations) but excluding individuals who are super obese (3 observations).Footnote 2 Those who are or have been underweight are also excluded (n = 9,591), making normal weight the reference group for the two excess-weight categories, overweight and obese.Footnote 3 Furthermore, we eliminate missing observations regarding education (n = 9,570) and health measures (n = 9,567). The final requirement states that individuals must be employed and have a relatively strong connection to the labor market (thereby avoiding the analysis of individuals who work very little during a year, e.g., those who only have a summer job). We code this requirement as annual income from employment exceeding at least 100,000 SEK (approximately $15,750). Our final sample consists of n = 8,214 observations belonging to 2,415 men and 2,184 women (N = 4,599).

Dependent variables

This article examines the association between excess weight and income, measured as the logarithm of annual income from employment, above a threshold of 100,000 SEK. Income from employment is based on tax records and includes salaries and benefits such as sickness, unemployment and parental leave benefits. Benefit payments are conditioned on labor market activity, and the amount is related to the individual income level. If benefit payments mask differences in behavior related to weight, our analysis will generate biased results. This concern is particularly related to women, who tend to allocate more time to the care of children and of the home, and also tend to suffer from worse health than men, factors that may all affect labor supply negatively (see, e.g., [4446]). Unfortunately, there is no measure of income from employment that excludes benefit payments available to us.Footnote 4 The consequence for our analysis is most likely an overestimation of the obesity penalty for women. However, in lieu of an income measure excluding benefits, we specify several health variables that should pick up conditions and circumstances that could influence both weight and income. We also include a control for having small children when analyzing excess-weight penalties for women, thereby taking account of any differences in the family situation that could influence the income level of excess-weight women.Footnote 5 Another factor that may influence the results for women is the income threshold itself. If women have a weaker connection to the labor market, it is possible that the excess weight penalty for women is not observable above the income threshold. We investigate this possibility in the “Sensitivity analyses” section where we perform various sensitivity analyses.

Our income measure is the product of the wage rate and the number of hours worked during a year. In consequence, any indication of income penalties due to excess weight may be associated with either fewer work hours or a lower wage rate or both. However, Antelius and Björklund [47], studying the returns on education in Sweden, observe that the analysis when excluding annual income below 100,000 SEK generates results that are similar to those obtained in an analysis of hourly income. To the degree that this relationship holds in other contexts, our analysis will contribute to elucidating the association between excess weight and wage rates for Swedish employees (see also Lundborg et al. [40], who apply the same income threshold when analyzing obesity and income for Swedish men). In addition, we have run regressions controlling for hours of work per week without observing any marked differences in our main results.

Independent variables

Excess-weight measures

We measure normal weight, overweight, and obesity using BMI, based on self-reported weight and height. This article relies on the WHO classification of weight categories: normal weight is a BMI of 18.5–25, overweight 25–30 and obesity ≥30.

Additional background variables

We control for individual age, age squared, and whether or not the individual is married or cohabiting, respectively. We also control for first generation immigrant status or second-generation immigrant status [born in Sweden by parents, one of which is or both are non-Swedish citizen(s)]. Pregnancy tends to increase weight and decrease income (due to work reduction during pregnancy and after birth). These pregnancy-related effects could bias the estimates for women, implying an amplification of the excess-weight penalties. Unfortunately, we cannot exclude pregnant women from the analysis because the ULF survey does not collect information about pregnancy at the time of the interview. However, the survey collects information about how many children the respondent has in different age ranges (0–6 years, 7–18 years, 0–12 years, etc.). Thus, in lieu of information about pregnant respondents, we use a dummy variable describing whether or not the individual has small children, aged 6 years or younger. (We also try using lagged values of the children dummy in the analysis without observing any material changes in the weight estimates.) The analysis also considers four levels of educational attainment, in the form of dummy variables: primary school, 2 years of secondary school, more than 2 years of secondary school and higher (post-secondary) education. In addition, we control for panel waves and region of residence; living in northern or southern Sweden, or in a large city (Stockholm, Gothenburg or Malmoe).

We use a set of variables to control for health: (1) self-assessed health, (2) pain or discomfort due to disease(s), (3) anxiety, nervousness and uneasiness, and (4) mobility. Self-reported health functions as the general measure of health, while the other measures reflect different dimensions of health; the impact of suffering from a disease, of mental health status and of physical ability. In the first two waves, the measure of self-assessed health uses a three-point scale (“good,” “between good and bad” and “bad”). In the last two waves, the measure uses a five-point scale (“very good,” “good,” “between good and bad,” “bad” and “very bad”). We construct a measure of self-assessed health using the three-point scale, merging assessments of “very good” and “very bad” health into the categories of “good” and “bad” health, respectively. Bad health receives the lowest score (1) and good health the highest score (3).Footnote 6 In the ULF survey, respondents are asked to specify up to six diagnoses from which they suffer and to assess the pain or discomfort experienced because of each diagnosis. Based on the reported frequency and intensity of the pain or discomfort, we construct a measure that ranks the pain along a three-point scale, where high levels of pain receive the highest score (3) and low levels of pain receive the lowest score (1). The variable measuring anxiety, nervousness and uneasiness is also constructed in the same way: a three-point scale indicating severe problems by the highest score (3) and no problems by the lowest score (1). The mobility variable indicates whether the respondent can run a short distance when necessary (e.g., when trying to catch a bus). Table 1 provides the descriptive statistics of our sample.

Table 1 Descriptive statistics

Attrition bias is a potential problem because individuals with certain characteristics may drop out of the panel between the survey waves. We investigate the extent of the attrition bias by comparing the variable means in the panel sample, separated into three groups. Group 1 contains observations belonging to individuals appearing once in the sample and group 2 contains observations belonging to individuals appearing twice in the sample. Because of our use of lagged weight variables, single and double appearances imply that the individuals have responded twice and three times respectively in the survey. In these two groups there are individuals who have responded on all possible occasions, individuals who have not responded on one or two occasions as well as individuals whose responses are excluded from our sample because of the age restrictions we set up. Group 3 contains the observations of individuals appearing three times in the sample, i.e., responding in all four survey waves. Table 4 in Appendix 1 shows the variable means per group (the first three columns) and presents the p values of the t tests when we compare the groups pairwise: group 1 to group 2, group 2 to group 3 and group 1 to group 3 (the last three columns). Generally, when we compare the p values of the pairwise t tests, we observe that the characteristics of group 1 differ significantly at 5 % from the other two groups (column 4 and 6) more often than the characteristics of group 2 compared to group 3 (column 5). Focusing on the comparison of group 1 and group 3 (last column), we note that among other things group 1 tends to earn less, have invested less in higher education (significant at 10 %), have worse health, be less overweight, be younger and have more immigrant representation. Notably, there is no significant difference in average obesity. In fact, across all three groups we observe increasing average income, as well as age, but no significant differences in obesity. However, group 3 is significantly more overweight on average. Partly these observations may indicate a positive relationship among age, income and weight. Indeed, when studying the means for income and weight variables of group 3, while decreasing the maximum age limit, we find that the means become more like the ones of group 1. Overall, we find little indication that attrition bias is a major problem for our analysis.Footnote 7

Methods

In similarity with Cawley [9] and other studies [8, 1113, 17, 23, 40], we use lagged BMI (classified as normal weight, overweight and obese) as a means to control for reversed causality.

The use of two identification strategies allows us to consider different aspects of the relationship between weight and income. While the OLS approach analyzes the impact on current income of lagged weight, the fixed effects approach analyzes the impact on current income of changes in lagged weight.

We estimate the weight impact on income by applying OLS on a pooled data model. Equation 1 shows the baseline model:

$$\ln \left( {y_{it} } \right) \, = \, \beta W_{it - 1} + \, \gamma X_{it} + \, \delta T_{t} + \, \varepsilon_{it}$$
(1)

where y is annual income, W is a vector of dummy variables indicating overweight and obesity, and X is a vector of explanatory variables, including marital status, education attainment, etc., for individual i at time t. The vector T contains panel wave dummies. We assume that the error term ε is random and uncorrelated with the explanatory variables, zero mean and constant variance. To control for health-related productivity reduction, we analyze a second model:

$$\ln \left( {y_{it} } \right) \, = \, \beta W_{it - 1} + \, \gamma X_{it} + \theta H_{it} + \, \delta T_{t} + \, \varepsilon_{it}$$
(2)

where H is a vector of health variables.Footnote 8 If the analysis produces significant excess weight estimates, we attribute the remaining excess weight penalty to discrimination (cf., [3, 10, 40]). However, to the extent that other unobservable characteristics, e.g., self-confidence or time preferences [10, 15], influence the relationship between weight and income, the OLS approach generates biased results. In other words, there is a risk that we overestimate the impact of discrimination as a channel through which excess weight is penalized in the labor market.

Taking advantage of the panel, we can control for individual time-invariant unobserved heterogeneity, which may otherwise bias the OLS estimates.Footnote 9 Among studies investigating the excess weight penalty in the labor market by using (individual or sibling/family) fixed effects [3, 912, 24], Lundborg et al. [40] represents another example where current earnings are regressed on lagged BMI. The baseline model, using individual fixed effects, is:

$$\ln \left( {y_{it} } \right) \, = \, \beta W_{it - 1} + \, \gamma X_{it} + \, \mu_{i} + \, \lambda_{t} + \, \varepsilon_{it}$$
(3)

where y, X and ε are defined as in Eq. 1, μ is the individual-level fixed effect, and λ the time fixed effect, which we estimate using panel wave dummies. Assuming that explanatory variables in vector X and μ are correlated, the fixed effects approach uses the difference within individual observations over time to eliminate μ. We add health-related variables in the second fixed effects model, as shown by Eq. 4:

$$\ln \left( {y_{it} } \right) \, = \, \beta W_{it - 1} + \, \gamma X_{it} + \, \theta H_{it} + \, \mu_{i} + \, \lambda_{t} + \, \varepsilon_{it.}$$
(4)

Our analysis is based on self-reported BMI, which may be subject to measurement error, as (excess weight) respondents tend to under-report weight and over-report height. Under-reporting of BMI also depends on gender and age; women and younger individuals are found to underreport BMI more than men and older individuals [4851]. Socioeconomic status is another factor that may influence misreporting [52, 53]. Some studies correct for misreporting by using fitted values based on anthropometric data [9, 5456]. Another method to deal with reporting error involves lowering the threshold for obesity [49, 57, 58]. Having no access to anthropometric data, we adopt the latter method and investigate the effect of changing the cutoff point by gender in “Sensitivity analyses” section.

Excess weight penalties may work through sorting in different dimensions and stages of life, e.g., education, occupation and on the marriage market [15, 40, 59, 60]. Among the explanatory variables we include educational choices and family status but exclude variables relating to work. In other words, our aim is to analyze the impact of excess weight in the labor market specifically, not the total effect of excess weight.

In the fixed effects approach, we cannot control for (linear) age and time simultaneously, because age is a function of time, βage it  = βage i0 + βt [61]. The first RHS term is time-invariant and will disappear when we apply individual fixed effects. The second RHS term is identical for all individuals at time t and will be picked up by the panel wave dummies. We drop linear age from the model specifications but keep age squared in the fixed effects framework. Thus, the panel wave dummies reflect the cohort effects, while age squared captures the income effect associated with increasing age. We do not observe any considerable changes in the estimates when excluding age squared and keeping the time dummies.

Results

Income

Table 2 summarizes the results from the income regressions for both genders. Appendix 2 contains tables (Tables 5, 6) showing the estimates for the full model specifications. We use two identification strategies, OLS and individual fixed effects, and present the estimation results in that order. For each strategy there are two model specifications, a baseline model (column 1 for OLS and column 3 for fixed effects, FE) that contains individual background variables including educational attainment and a second model (column 2 for OLS and column 4 for FE) that adds health variables. Starting with the OLS results for men (columns 1 and 2 in Table 2), we observe in the baseline a 6 % obesity penalty (p < 0.10). When we take differences in health into account, the penalty decreases to <4 % and loses statistical significance. By contrast, overweight men do not appear to experience lower annual income compared to their normal-weight peers. The penalty is very small and statistically insignificant in the baseline and disappears in the second model containing health variables. For obese women, the baseline OLS estimate reveals a statistically insignificant penalty amounting to 1.3 %, a penalty that is erased in the second model. We find no indication of income differences due to overweight for women.

Table 2 Income and excess weight

Columns 3 and 4 in Table 2 show the fixed effects estimates. The results for obese men are quite strong. In the baseline model, we observe an obesity penalty of 9.6 % (p < 0.01). The penalty proves to be quite robust to controls for health; it decreases to 9.2 % and remains strongly significant. By contrast, the overweight penalty for men is roughly 1 % and insignificant in both models. We do not observe any significant excess weight penalties for women. The obesity penalty amounts to 2.4 % in baseline and falls below 2 % in the second model. The overweight estimates for women are positive insignificant in both models. Overall, the influence of health on the weight penalty appears to be smaller in the fixed effects framework than in the OLS framework.Footnote 10

Sensitivity analyses

Adjustment of the obesity threshold

We investigate whether our results are affected by survey respondents misreporting their height and weight. We apply lower, gender-specific obesity thresholds: BMI ≥29.5 for men and BMI ≥29 for women. The adjustments are based on the findings of Boström and Diderichsen [62], who analyze the misclassification of BMI using questionnaire data on individuals living in Stockholm county, Sweden. They observe an underestimation of BMI that differs by gender: −0.85 for women and −0.4 for men.

Table 7 in Appendix 3 presents the excess weight estimates from the income regressions with the new obesity thresholds. For men, the lower obesity threshold changes the magnitude of the weight estimates in different directions depending on the identification strategy but does not materially alter the results from our main analysis. Thus, we find no clear support for BMI measurement error for obese men. For women, the threshold adjustment implies small changes regarding size in the OLS results, which remain insignificant. When inspecting the fixed effects estimates, we observe a large increase in magnitude but no change in statistical significance. The indications of a BMI misclassification are inconclusive also in this case.

Changes in the income threshold

We investigate the effect of removing the requirement of a minimum annual income of 100,000 SEK, thereby shifting focus from income in terms of hourly wages to income in terms of hours worked (see [47]). Thus, we are able to say something about the excess-weight penalty among individuals with a weak connection to the labor market. Table 3 summarizes the regression results. The results indicate a stronger influence of excess weight on income, in particular for obese men. In the OLS baseline, the obesity penalty for men amounts to 15.5 % (p < 0.05). When we control for health, the penalty decreases to roughly 13 % (p < 0.10). There is (still) no association between overweight and lower income among men in the OLS framework. Continuing with the fixed effects results for men, we observe a baseline obesity penalty of 16.6 % (p < 0.05). The second model produces a penalty of almost the same magnitude, 16.5 %, and of the same statistical significance. In addition, we observe an overweight penalty of almost 2 %, but the difference in relation to their normal-weight peers is insignificant in both models. When analyzing weight and income for women, we find that the OLS estimates are larger compared to those in our main analysis (see Table 2) but remain insignificant. The baseline estimates imply that excess weight decreases income by more than 4 % on average. Furthermore, the overweight penalty is larger than the obesity penalty, a relationship that holds in the second model. Concerning the fixed effects approach, the inclusion of low-income earners in the analysis alters neither estimate size nor significance level; we find no indication of excess-weight penalties for women.Footnote 11

Table 3 Income and excess weight, no income threshold

Including annual income below 100,000 SEK in the analysis, we observe a strong obesity penalty for men. The result implies that obese men with low income work less than their normal-weight peers. However, we cannot rule out that the penalty is (partly) mediated through the wage rate (cf., [40]). The analysis also shows that the obesity penalty is relatively insensitive to controls for health using both identification strategies, a relationship that indicates that health-related productivity reductions do not drive the results (although the statistical significance is weak in the OLS approach; see Table 3, column 2).

Discussion

When analyzing the impact of excess weight on income from employment for Swedish men and women, we observe that men appear to experience an obesity penalty while women do not. When we regress income (of at least 100,000 SEK annually) on weight using OLS, the obesity penalty for men reaches almost 6 %. Using individual fixed effects, we find that the penalty amounts to more than 9 %. Considering the larger penalty found when applying the second identification strategy, individual fixed effects, we conclude that (factors inducing) changes in weight (categories) have a particularly strong influence on income for men, while permanent excess weight (considered in the OLS analysis) does not. However, for men with a weak connection to the labor market differences in “weight profiles” have little influence on the obesity estimates. Analyzing income and weight without the income threshold, we find that obese men experience on average a 14–16 % penalty in annual income irrespective of identification strategy. Our results contrast with previous studies that apply both OLS and individual fixed effects; they find that taking individual fixed effects produce smaller weight estimates compared to OLS [9, 10]. However, these studies do not use lagged weight when applying fixed effects, implying that there may be a problem of reversed causality diluting the negative effect of weight on income. Indeed, when we regress current income on current weight applying individual fixed effects, we observe large, positive, but insignificant obesity estimates (results not shown).

Compared to most other studies finding significant obesity penalties for men, our estimates are large. Baum and Ford [10] observe a significant obesity penalty of <1 % for men in the US, using individual fixed effects and current weight as independent variable. Our results also contrast with the IV estimates in Brunello and D’Hombres [25], who find that obese men earn significantly (3.3 %) less on average, using data from nine European countries (Sweden not included). However, our findings are fairly in line with another analysis of Swedish data. Lundborg et al. [40], who investigate the relationship between current income and excess weight at the age of 18 for Swedish men aged 28–38 years, find considerable and significant effects of excess weight, approximately 9 % in the baseline, using sibling fixed effects. Together these studies indicate that the obesity penalty for men may be relatively large in the Swedish labor market. However, further research on the Swedish labor market is needed to corroborate our results.

We also find that health measures appear to impact differently on the obesity penalty depending on the identification strategy. The OLS results indicate that obese men have worse health and therefore earn less. When we account for individual fixed effects, health is less influential. This result implies that lower productivity is not the main suspect when searching for the source of the obesity penalty. Instead, discrimination may be an important underlying determinant. The difference in impact may again reflect the two identification strategies picking up different properties of the sample population; (bad) health may be an influential factor in explaining the labor market outcome for an individual who already is obese, but when it comes to income and changing weight categories, health may not be a main driver. We find that the relationship between “health effect” and identification strategy is less pronounced when including low-income earners in the analysis (see Table 3), a finding that may indicate that discrimination is a bigger problem for individuals with a weak connection to the labor market, irrespective of constant or changing weight. In similarity with most studies investigating the impact of health on excess-weight penalties, we find a limited influence of health. Lundborg et al. [40] come to a different conclusion. However, they rely on anthropometric data instead of self-reported data when constructing their health measures (e.g., cardiovascular fitness), which may explain the different results.

Contrary to many previous studies, we find that excess weight is not a problem for women concerning income. However, our dependent variable, income from employment, includes income-related benefits, e.g., sickness benefits, which may conceal ways in which weight may influence income. In lack of another income measure, we try to mitigate the problem of measurement error by including controls for several health measures and having small children (as well as labor market status, see footnote 5). As recent research indicates, it is possible that the barriers for heavy women rather exist at the employment stage in the Swedish labor market [38]. Moreover, small and insignificant estimates for women may be an effect of measurement error due to survey respondents underreporting their BMI. We try to correct for that possibility by adjusting the obesity threshold downwards. However, in similarity with other studies adopting that method, our results remain, on the whole, the same [49, 57, 58]. Mitigating the potential problem of BMI measurement error, recent research implies that misreporting may be a decreasing problem because of changes in social norms related to weight [63]. Self-reported or not, BMI may be a flawed measure of obesity, since it does not distinguish between fat mass and fat-free mass. Recent studies on obesity and labor market outcomes use alternative indicators based on body composition, e.g., fat mass and waist circumference [7, 20, 64, 65]. It is possible that our weight estimates for women would alter if we had the opportunity to use such measures. For example, Johansson et al. [7] observe a negative association for Finnish women between income and waist circumference, but not between income and BMI.

Conclusions

This study investigates the relationship between excess weight and income for men and women in the Swedish labor market. Our analysis shows that there is a significant obesity penalty for men, but not for women; a reverse gender pattern relative to the one found in the majority of studies analyzing excess weight and labor market outcomes. In addition, we find that the obesity penalty for men is considerable in magnitude and relates to discrimination rather than lower productivity due to bad health. Our findings fit with previous research of the Swedish labor market, but considering that there are only a few studies to date, the picture of how weight influences labor market outcomes in Sweden is incomplete. Further research on weight, labor market outcomes and gender in the Swedish labor market is clearly warranted.