Introduction

Mental health has been the target of global policy actions since the 2000s. Indeed, on top of the direct moral and social benefits, tackling mental ill health has considerable economic consequences. The costs of mental ill health include both direct costs for e.g. medication, visits to a clinic, and hospitalization, and large indirect costs for society through e.g. absenteeism, presenteeism (a situation in which individuals work at reduced productivity due to poor mental health), turnover, early retirement, and mortality. Recently, the spectrum of mental ill conditions considered by policy makers has widened to include milder forms (both diagnosed and undiagnosed), which are responsible for unpleasant emotional states and interfere with daily functioning, but do not affect insight or cognition. Indeed, the impact of these milder forms of mental ill health has become particularly relevant for labour market outcomes with the growth of the service industry and the transition towards knowledge- and emotion-intensive jobs, since they may lead to problems of concentration, loss of memory, decreased motivation, and distraction, thus reducing the emotional and cognitive resources needed to perform daily work tasks.

Thanks to a series of targeted strategies,Footnote 1 the United Kingdom is among the most innovative OECD countries both in terms of awareness about the impact of mental health on employment outcomes and the level of integration of people with a mental illness into the labour force; nevertheless, it is also one of the countries bearing the highest direct and indirect costs of poor mental health [52]. For example, the Adult Psychiatric Morbidity Survey suggests that 17% of the UK adult population was experiencing at least a common mental disorder in 2014, with females more likely to be affected than males. “Thriving at work", an independent review of mental health and employers published in 2017, estimates the annual costs of poor mental health to the UK to be between \(\pounds 79\) and \(\pounds 99\) billion, including \(\pounds 3\) billion of costs to the NHS, between \(\pounds 37\) and \(\pounds 52\) billion of lost output, and costs to employers and self-employers between \(\pounds 33\) and \(\pounds 44\) billion, including staff turnover, sickness absences, and presenteeism [65].

These reports provide some insights on the potential size of the indirect costs associated with mental ill health, but economic researches are needed to support these estimates by looking empirically for their significance and magnitude. Since Bartel and Taubman [5], a rich economic literature has estimated a negative correlation between diagnosed mental disorders and labour market outcomes; however, the economic literature focusing on milder forms of mental ill health is still in its early stage. Whereas major disorders have large effects on work capacity at the individual level, the greater prevalence of milder forms of mental ill health and psychological distress probably makes them the biggest overall burden for society; as a consequence, understanding this relationship seems necessary to fully outline the indirect public cost of mental ill health and to assess the outcomes of social policies carried out to improve people’s mental health. In this paper, we thus investigate the relationship between a self-assessed measure of psychological distress and average hourly wages (as a proxy for productivity). We exploit data drawn from the British Household Panel Survey, and consider males and females separately to account for the potential differences observed in the psychological and economic literature.

However, studying this relationship presents several challenges. First, causality can go in either direction: while psychological distress is likely to affect labour productivity and thus wages, one could also expect a reverse effect from employment to mental health. Second, unobserved individual factors, such as childhood circumstances and cognitive ability, could be correlated with both mental health status and labour market outcomes. Third, a problem of measurement error may arise, since we rely on a self-reported measure of mental health. Finally, non-random sample selection may result in inconsistent estimation of the effect of psychological well-being on income if one fails to take into account that labour market participation is itself influenced by psychological distress. To overcome these numerous challenges, we apply a series of different estimators to our longitudinal dataset; moreover, we account for correlated individual effects, endogeneity of the mental health variable, and sample selection in an unified framework, using the estimator proposed by Semykina and Wooldridge [62] and instrumenting mental health with the perceived social support network available to the individual.

Results show the signs of endogeneity for the mental health variable and that correcting for sample selection in the labour force is necessary at least for the female sample. Furthermore, we find that psychological distress significantly affects labour market participation across genders: individuals that may be suffering from psychological distress are between 6 and 20 percentage points less likely to be employed. Conversely, psychological distress has a significantly negative effect on wages only for the female sample: the wage gap between a non-distressed and a distressed female may be as high as \(\pounds 1\) per hour. Even if more research is needed, we argue that these results are consistent with different reactions to psychological distress across gender: the reduction in the offered wage may reflect presenteeism for the female sample, whereas this may remain hidden behind increased absenteeism in the male sample.

This paper proceeds as follows. “Literature review” reviews the existing literature. “Econometric method and issues ” presents our estimation procedures, while “Data” describes the dataset. “Results” shows the results. The conclusion is drawn in “Conclusions”.

Literature review

Health status has been included as a determinant of earnings at least since Luft [39]. However, the pioneering study of the impact of mental health on labour market outcomes is due to Bartel and Taubman [5], who, using data on white male twins born between 1917 and 1927, found negative long-term effects of psychoses and neuroses on wages, employment, and hours worked. Since then, a rich body of literature has studied the relationship between diagnosed mental disorders and labour market participation, but only few papers have also considered the impact of mental health on wages, like we do.

Among these, Ettner et al. [21], Marcotte and Wilxox-Gok [41], and Chatterji et al. [13] all use data from the National Comorbidity Survey, a large-scale field survey of mental health in the United States. Ettner et al. [21] found that psychiatric disorders significantly reduce employment for both males and females, but the detrimental effect on income is found to be significant only for females when accounting for the likely endogeneity of mental health. Similarly, Marcotte and Wilxox-Gok [41] showed that diagnosed mental disorders negatively impacts on only women’s earnings. Chatterji et al. [13] found no effects of recent psychiatric disorder on earnings among employed individuals; however, the effect on labour force participation and employment is negative for both females and males.

More recently, Mitra and Jones [46] used data from the US National Survey of Alcohol, Drug and Mental Health Problems and found a positive correlation between developing a mental health condition and the probability of transitioning to non-employment, but no consistent effects on earnings. Hakulinen et al. [31, 32], using data for the whole Finnish population between 1988 and 2015, concluded that severe mental disorders are associated with significantly lower levels of employment, education, and annual income. Finally, the OECD [51, 53] has also been investigating the relationship between mental ill health and labour market outcomes, and generally found mental illnesses to be responsible for decreased labour supply, high rates of unemployment, and reduced productivity at work.

Whereas the studies above focused on diagnosed mental disorders, as far as we know only Contoyannis and Rice [16] and Peng et al. [56] studied undiagnosed mental illnesses in this context, like we do. The former found that worsening psychological health, measured using a reduced version of the self-assessed General Health Questionnaire, is associated with a decrease in hourly wages, but only for males; however, they did not study the impact of mental health status on labour market participation and employment. The latter captured the mental health status of the respondent with an index calculated on the frequency with which, over the 2 weeks before the interview, the respondent has been bothered by “having little interest or pleasure in doing things" and “feeling down, depressed, and hopeless", and found evidences that exhibiting these mild depressive symptoms reduces the likelihood of employment for both males and females, but that it does not significantly affect hourly wage. Furthermore, they examined the relationship between depressive symptoms and absenteeism, and observed that even a mild form of depression increases the number of sick days by 2 days per year for men, while they do not have any statistically significant effect on sickness absences for women.

Whereas both Contoyannis and Rice [16] and Peng et al. [56] tried to deal with the likely endogeneity of the mental health predictor and the unobserved heterogeneity, they did not consider the potential bias associated with self-selection. To correct simultaneously for all these econometric issues, in this paper, we use the estimator proposed by Semykina and Wooldridge [62], allowing us to study the effect of psychological well-being on both labour market participation and income. To our knowledge, this estimator has been only used in a similar context by Jäckle and Himmler [36] who, using data from the German Socio-Economic Panel, found a statistically significant effect of physical health on wages for males and a significant effect on labour market participation for females; however, they did not investigate the effect of mental health status.

Econometric method and issues

The aim of this paper is to estimate the relationship between psychological well-being and wages for the entire sample, and not only for those individuals who are employed. Therefore, we formally specify our model as follows:

$$\begin{aligned} w^*_{it} =\,&\beta _0 + \varvec{x}_{it} \varvec{\beta }_1 +\varvec{y}^*_{it} \varvec{\beta }_2 +\alpha _i + \eta _{it} \end{aligned}$$
(1a)
$$\begin{aligned} w^*_{it} =\,&w_{it}, \quad \varvec{y}^*_{it} = \varvec{y}_{it} \quad \text {if } S_{it}=1 \quad \text {and unobserved otherwise} \end{aligned}$$
(1b)
$$\begin{aligned} S^*_{it} =\,&\gamma _0 + \varvec{z}_{it} \varvec{\gamma }_1 + k_i + e_{it} \quad \text {with } S_{it}=1 \text { if } S^*_{it}>0, \end{aligned}$$
(1c)

where \(w^*_{it}\) is a measure of hourly wage of individual \(i=1,...,N\) at time \(t=1,...,T_i\), \(\varvec{\beta }_1\) is a vector of parameters associated with \(\varvec{x}_{it}\) vectors of independent variables (including a measure of psychological well-being) that can be observed for all the individuals in the samples, and \(\varvec{\beta }_2\) is a vector of parameters associated with \(\varvec{y}^*_{it}\) vectors of variables that can be observed only if the individual works; \(\alpha _i\) is a vector of unobservable time-invariant individual characteristics, whereas \(\eta _{it}\) is a mean zero unobserved error term.

Equation (1a) is a Mincer’s [45] wage equation, modified to account for the impact of psychological well-being (among other variables) and for the panel structure of the dataset. As stated in (1b), we observe \(w^*_{it}\) only if the individual participates to the workforce, \(S_{it}=1\), where \(S_{it}\) denotes market participation. Equation (1c) indicates that we observe participation only if the latent variable \(S^*_{it}\), which represents the unobservable individual propensity to work, is positive. This depends on \(\varvec{z}_{it}\), a superset of \(\varvec{x}_{it}\).

As anticipated, the estimation of (1) involves a number of issues. First, due to the lack in the BHPS of an objective measure that can easily capture psychological well-being in quantitative terms, we need to resort to a self-assessed measure, which is likely to contain inaccuracies. Second, psychological distress can be correlated with unobserved characteristics that can also affect productivity and, hence, wages (e.g., genetic endowment). The consequence is a non-zero correlation between the health regressor and the error component. Third, the direction of the relationship between health and wages can be reversed: on one hand, if investment in health increases with salary, health should rise with wages [29]; on the other hand, there are many risk factors for mental health that may be present in the working environment, especially considering the pressures deriving from a dynamic and innovation-oriented economy [68]. This leads to a correlation between health and the period and individual specific error component of wages, \(\eta _{it}\).Footnote 2

Another potential issue is selection bias. Selection is not an issue if, for example, the decision to participate in the labour market is randomly determined. However, this is unlikely to be the case, as we expect some of the factors that determine participation to also influence psychological distress and income, such as low self-esteem, states of anxiety, and loss of focus. We will correct for sample selection below, but, for the time being, we start our analysis by disregarding the selection problem, and thus by estimating the wage equation in (1a) in isolation. Below is our estimation strategy.

We first estimate the wage equation in (1a) by Pooled OLS using cluster robust standard errors to account for cluster heteroskedasticity. If the regressors are not correlated with the errors, the estimator is unbiased, consistent, and efficient. Second, we implement a within (FE) estimator with cluster robust standard errors, thus assuming that the regressors are uncorrelated with the idiosyncratic error but without making any assumptions on the unobserved heterogeneity. The FE estimator, in fact, will be unbiased and consistent as N and/or T tend to infinity even if the regressors are correlated with \(\alpha _i\).Footnote 3

Third, we estimate the wage equation in (1a) through a two-stage least-squared (2SLS) and a within-2SLS (FE-2SLS) regression with cluster robust standard errors. We run these two instrumental variable approaches (IV) to deal with the endogeneity problem. IV requires the use of an instrument, i.e., a regressor that is predictive of the potentially endogenous mental status, but that is otherwise independent of the dependent variable of interest (here, the labour market outcomes). Following previous literature, we instrument psychological distress using a proxy for the perceived social support network of the individual (see “Instrument”). We test the validity of the instrument using Hansen–Sargan’s test of overidentifying restrictions.

At this point, it remains to deal with sample selection. Thus, in the last step, we run a regression based on Semykina and Wooldridge’s [62] estimator.Footnote 4 To do so, we first follow Mundlak [50], Chamberlain [12], and Wooldridge [70], and explicitly model the correlation between the regressors and the unobserved heterogeneity. In particular, we write \(k_i\) as a linear combination of a constant term, the group means of the time-varying regressors in \(\varvec{z}_{it}\) (the so-called Mundlak effects), and a normally distributed error term \(a_i\). Therefore, the participation equation in (1c) is updated to:

$$\begin{aligned} S^*_{it} = \gamma _0 + \varvec{z}_{it} \varvec{\gamma }_1 + \varvec{{\bar{z}}}_i \varvec{\theta } + \nu _{it}, \end{aligned}$$
(2)

where \(\nu _{it} = e_{it} + a_i\) is an error term, independent from \(\varvec{z}_{it}\) on which, as for \(\eta _{it}\), no restrictions are imposed in terms of heteroskedasticity and serial correlation.

Two assumptions are required to implement the estimator proposed by Semykina and Wooldridge [62]. The first one is that \(\eta _{it}\) is a linear function of \(\nu _{it}\) and mean independent of \(\bar{\varvec{z}}_{i}\) conditional on \(\nu _{it}\). The second is that \(\alpha _i\) is modelled as in Mundlak [50] and Chamberlain [12] as a sum of a constant, the group means, and an error \(b_i\); since we think that our variables of interest, psychological well-being, may be correlated with \(\eta _{it}\), as it stands the first assumption is not satisfied. For this reason, we remove the mental health variable from \(\varvec{z}_{it}\), where it is substituted with the instrument, which is assumed to be uncorrelated with \(\eta _{it}\). This new vector is called \(\varvec{q}_{it}\); its time averages are indicated with \(\bar{\varvec{q}}_{i}\).

Having done all these steps, the following equation is obtained:

$$\begin{aligned} w^*_{it} = \varphi _0 + \varvec{x}_{it} \varvec{\beta }_1 + \bar{\varvec{q}}_{i} \varvec{\varphi }_1 +\varvec{y}_{it}\varvec{\beta }_2 + \bar{\varvec{y}}_{i}\varvec{\varphi }_2 +\xi _t\lambda _{it} + r_{it}. \end{aligned}$$
(3)

In Eq. (3), we substituted \(\bar{\varvec{x}}_{i}\) with \(\bar{\varvec{q}}_{i}\) to satisfy Semykina and Wooldridge’s [62] requirement that \(\eta _{it}\) be uncorrelated with \(\bar{\varvec{z}}_{i}\). Moreover, \(r_{it}\) is the sum of \(b_i\) and \(l_i\) where the latter is the remaining part of \(\eta _{it}\) after including the Inverse Mills Ratios (IMRs), \(\lambda _{it}\). These are obtained estimating T probit models of equation (2). As there is nothing preventing \(\lambda _{it}\) to be correlated with \(r_{is}\) for \(s\ne t\), Eq. (3) can then be estimated using a Pooled 2SLS regression where \(\bar{\varvec{q}}_{i}\), \(\varvec{y}_{it}\), \(\bar{\varvec{y}}_{it}\), and \(\lambda _{it}\) are used as instruments. As suggested by Semykina and Wooldridge [62], we build standard errors robust to heteroskedasticity and serial correlation.

Data

We construct an unbalanced panel using the available 238,996 observations from the British Household Panel Survey (BHPS).Footnote 5 Unfortunately, questions regarding the respondent’s social support network (one of our instruments) are asked only once every 2 years: we thus limit our analysis to nine biennial waves covering the period between 1991 and 2007. We drop individuals who, in a given year, are self-employed, retired, still in education, in maternity leave, or attending a government training scheme. To allow for heterogeneity in coefficients across gender, we split the sample between men and women. In line with the State Pension age in the years covered, we only consider males aged 16–65 and females aged 16–60 at the time of the interview.

After excluding individuals who did not give valid answers for the variables used in the estimations, we obtain a sample of 62,686 observations (of which 31,413 males and 31,273 females), for a total of 7991 individuals who, on average, completed 5.04 waves. For those estimators which do not account for self-selection, we consider only those individuals in employment at the date of the interview. This restricted sample consists of 54,496 employees (26,670 males and 27,826 females).Footnote 6

Dependent variables

Since the BHPS does not provide a measure of hourly wage, we construct it as a weighted average of the gross usual monthly wage from first and second job. First, we obtain the hourly wage in the main job by dividing the usual gross monthly pay by the number of hours worked per month for the main work, including paid overtime. Analogously, we calculate the hourly wage for the individuals who have a secondary job. We then construct an overall average wage by taking a weighted average of the hourly wage in the main and the secondary jobs; the weights correspond to the proportions of total working time spent in each type of job. We find the average hourly wage to be higher for males (\(\pounds 9.62\)) than for females (\(\pounds 7.63\)).

Finally, we calculate the inverse hyperbolic sine (IHS) transformation of the wage. The IHS transformation precisely approximates the logarithmic one and has the advantage of being defined for zero and negative values. In this way, the regression estimates are improved: the outliers influence is damped down and, thus, heteroskedasticity is ameliorated (see Pence [55] and Georgarakos et al., [25] for more details).

Participation in the labour market is determined by the current employment status of the respondent.Footnote 7

Mental health variable

Our main independent variable is a measure of psychological distress, which we suppose endogenous. It consists of a reduced version of the General Health Questionnaire (GHQ), a self-administered psychometric screening tool originally used to screen for minor psychiatric disorders and now considered a robust indicator of subjective psychological distress [8, 15, 27, 28]. The version in the BHPS is the widely used GHQ-12, adopted also by the World Health Organization [26], which consists of 12 questions covering feelings of incompetence, anxiety, depression, difficulty in coping, and sleep disturbance, among others (see Online Appendix A.1). We consider this general measure of psychological distress preferable to specific conditions (like panic attack or depression), since it covers a wider range of aspects that are likely to influence labour market outcomes; moreover, it can provide a measure of severity which is often absent when relying on specific conditions.Footnote 8

Indeed, by counting the number of questions to which the individual responds in the worse two categories (out of four), one obtains a 12-point “Caseness" score, which is widely used in medical, psychological, and sociological research and increasingly used in economics studies.Footnote 9 We invert the Caseness score to obtain a measure ranging from 0 (the most distressed) to 12 (the least distressed). Similarly to Jäckle and Himmler [36], we apply the IHS transformation to this scale, \(h_{it}\), so that our independent variable of interest is obtained as follows,Footnote 10

$$\begin{aligned} \mathrm {Log Mental Health}_{it} = \ln \left( h_{it} + \sqrt{h_{it}^2+1} \right) . \end{aligned}$$

In the rest of the paper, we refer to this variable as psychological well-being. Its observed means are 3.00 for working males and 2.91 for working females. These values decrease to 2.67 and 2.45 for unemployed. Standard t tests confirm that the non-working groups have statistically lower means of this variable than the working groups, and thus are more distressed. Moreover, females report significantly worse psychological well-being than males.

To facilitate the comparison between results obtained from linear and non-liner models, we later report labour market outcomes using a relatively conservative cut-off of 4 in the Caseness score to divide the respondents into “normals" and “possible cases": indeed, the clinical literature using the GHQ-12 argues that the latter group has a significant higher probability of suffering from a psychological disturbance [e.g. 30, 63, 37, 47] and the NHS has recently used this same threshold to monitor the prevalence of probable mental ill health in England [48]. The percentages of “cases" are 10.5% and 16.5% for working males and females, respectively; these increase to 30.8% and 41.3% for unemployed.

Instrument

To address the likely endogeneity of psychological well-being, we resort to the use of an instrument; in particular, we instrument psychological well-being with a measure of the perceived social capital of the individual.Footnote 11 Indeed, since McKenzie et al. [44], a number of epidemiological studies have demonstrated a positive association between social capital and mental health [see 18, 67, forliteraturereviews]; moreover, social support variables have been used as instruments in the previous economics studies, where they were found to be significantly associated with better mental health status [1, 33, 54].Footnote 12

Our measure of social capital is the perceived social support network of the individual, which we construct using the answers to the following questions: “Is there anyone who you:" (1) “can really count on to listen to you when you need to talk?", (2) “can really count on to help you out in a crisis?", (3) “can totally be yourself with?", (4) “feel really appreciates you as a person?", and (5) “can really count on to comfort you when you are very upset?". For each question, we assign 1 if the answer is positive, and 0 otherwise. Our instrument is obtained by summing up these responses.Footnote 13

The resulting variable has an observed mean of 4.68 for working males and 4.83 for working females. These values decrease to 4.38 and 4.54 for unemployed respondents. Standard t tests confirm that females have a significantly stronger support network than males (also controlling for employment status), and that unemployed individuals have a significantly weaker support network than employed (also controlling for sex).

Other regressors

The analysis includes also a full set of socio-demographic characteristics. We control for the marital status (the baseline category is married, having a civil partner, or living as couple) and for the number of dependent children in the household. Since children of different ages could have different effects on labour market outcomes, we consider separately the numbers of infants (aged 0–2 years), children (aged 3–11 years), and adolescents (aged 12–18 years) in the household. As an indicator of educational attainment, we include a dummy variable which indicates if the individual has a degree or a higher education attainment. We also include a dummy variable indicating if the individual is white. Finally, the model is completed with a vector of time dummies to control for inflation and aggregate productivity effects and a dummy to account for living in the capital.

An additional set of explanatory variables is used in the estimation of the wage equation. To capture the concavity of the earnings function, we include third degree polynomials of age and experience. We also include a vector of dummy variables indicating the occupational status of the individual, if employed in the private sector, and if she has undertaken any job-related training in the previous year. We account for the presence of an union at the individual’s workplace and for being a member of this union. Finally, we include the number of employees at the employee’s workplace.

The exclusion restrictions, i.e., those variable that drive participation but can be reasonably omitted from the wage equation, are: the IHS transformation of non-labour income, a dummy variable for having a partner, partner’s monthly gross pay, partner’s education, and third degree polynomials of the partner’s age and experience.

Variable definitions are in the Online Appendix Table A2. Summary statistics are presented in the Online Appendix Tables A3 and A4.

Results

Participation equation

The Online Appendix Tables A5 and A6 present the results for the participation equation, separately for men and women.Footnote 14 The mental health predictor is statistically significant across all specifications: an improvement in the psychological well-being of the respondent is associated with a significant higher participation in the labour market. The coefficients of the IV regressionsFootnote 15 are larger than the corresponding ones from OLS and FE, indicating the presence of measurement errors in the mental health variable that bias the coefficients towards zero. Conversely, the correlation between the mental health variable and latent individual heterogeneity seems to be associated with an upward bias, as accounting for correlated individual effects (i.e., using fixed effects) reduces the magnitude of the mental health coefficient. The coefficient of interest remains strongly significant across genders even in the pooled probit models: the magnitude of the effect of mental health on labour participation decreases only when using the Semykina and Wooldridge’s [62] estimator that accounts simultaneously for endogeneity and correlated individual effects.Footnote 16

To facilitate the comparisons of the results from linear and non-linear models, Table 1 presents participation probabilities of two average (male and female) individuals, who differs only with respect to their Caseness score. The male probit estimates show the probability difference between “normals" and “cases" to vary between 21.3 percentage point when the mental health is considered exogenous and 10.8 percentage points when mental health is considered endogenous. In the female samples, the probability difference varies between 6.6 percentage points when controlling for endogeneity and 14.9 when we do not. These findings are consistent with the previous literature: indeed, in terms of magnitude of the estimated coefficients, they fit in between the results found by researchers considering only more severe forms of mental ill health (like Ettner et al. [21], and Chatterji et al. [13] who estimated probability differences of approximately 14% for males and 13% for females), those considering diagnosed depression and anxiety disorders (Mitra and Jones [46], who estimated a significant point reduction in the probability of being employed between 6.5 and 7.3 percentage points, but do not split the sample by gender), and those by Peng et al. [56], who only considered symptoms associated with mild depression (and found probability differences of 4% for males and 2.3% for females).

Table 1 Participation probabilities (percent), males and females

There are several potential mechanisms that could explain lower participation probabilities for people suffering from psychological distress. First, distressed people might find it more difficult to commit to job hunting if they are demotivated, feeling down, or overwhelmed, or they might find the process more stressful; they may even be less aware of job opportunities if they are not part of a strong social network. Second, if psychological distress is associated with low self-esteem and confidence, they might refrain to apply for certain work positions; if it associated with anxiety and fear, they might not applying if they worry that a job may be too stressful or may lead to unfamiliar situations. Third, they might be inadvertently discriminated by potential employers during the application process: for example, interviewers often inquire about one’s ability to perform under demanding deadlines or in high-pressure situations, which might prove difficult when suffering from distress. Moreover, the fear of being stigmatised may lead to a refusal to volunteer personal information, which may leave a negative impression during an interview. Finally, distressed employees might be more likely to quit: this could be due, for example, to an unsupportive or openly hostile environment at the workplace, or with the difficulties associated with disclosing a state of psychological distress. Unfortunately, the information contained in the BHPS does not allow to investigate if one of these effects is prevalent or in which measure they coexist.

The effects of the other predictors are as expected. For both genders, age is significant, with the probability of entering the workforce increasing with age until it reaches a peak and begins to slowdown. The existence of another source of income significantly reduces the individuals’ labour market attachment for both sexes. The presence of children has an impact only on females participation, and it is found to be positive for children older than 2 years. While one could expect a negative effect on the participation of females of the presence of very young child, the positive effect here may be explained by the need to return to the workforce to provide additional economic support to the household. Having a first or higher degree increases the probability of participation, especially for females. Being white is associated with a significantly higher probability of being employed. The partner characteristics are generally not significant.

Wage equation

In Tables 5 and 6, we present the results of our econometric estimations for the wage equation. The first two columns show results from the pooled OLS and FE estimations: the coefficient obtained using pooled OLS is non-smaller than the FE coefficient, suggesting a positive correlation between psychological well-being and latent individual heterogeneity. In the last three columns, we correct for the assumed endogeneity of the health variables using 2SLS, FE-2SLS, and the Semykina and Wooldridge’s [62] estimator, respectively. As explained above, we use the perceived social support network of the respondent as an instrument for the mental health variable; following Wooldridge [70], we also include as instruments the exclusion restrictions of the participation equation.

Before moving to the interpretation of the estimated coefficients, we briefly discuss the results of the tests on the validity of the instruments and the presence of selection bias. Table 2 summarizes the test statistics. We test the rank conditions using an F test on the joint significance of the instruments in the first stage of the 2SLS and FE-2SLS regressions; the null hypothesis of weak identification is rejected at any sensible level. Moreover, we strongly reject the hypothesis that the models are underidentified in both samples. Finally, we test for overidentifying restrictions using Hansen’s J statistics. For both samples, we strongly reject the null hypothesis that our instruments are valid in the 2SLS; in the FE-2SLS, conversely, the set of instruments appears appropriate, especially for the male sample.

Table 2 Tests statistics on the instruments, males and females

Semykina [61] argues that a rejection of the null hypothesis of valid instruments could be a signal of the presence of selection bias. As a further test of the presence of selection bias, Wooldridge [70] suggests to include the IMRs deriving from the T probit estimations of the participation equation as explanatory variables in the within regression and in the FE-2SLS, and to perform a Wald test on the joint significance of the IMRs coefficients. If the resulting statistic is greater than the critical value, one should reject the null hypothesis of no sample selection. As one can see in Table 3, we reject the null hypothesis in both 2SLS and FE-2SLS only for the female sample. We interpret this as suggesting that, while in the male sample, selection bias does not seem to be a problem, the female sample is not randomly selected, thus requiring the use of the procedure proposed by Semykina and Wooldridge [62].

Table 3 Tests on selection effects, males and females

Now looking at the estimated coefficients, we find a significant positive effect of psychological well-being on wages in most specifications for the female sample (with an elasticity between psychological well-being and hourly wage equal to 0.16 when correcting for all the econometric issues) but no significant effects in the male sample. To facilitate the comparison of the results from different models, Table 4 presents predicted hourly wages in GBP of two average (male and female) individuals, who differs only with respect to their “Caseness" score. A male in “normal" mental health is predicted to have an hourly wage that is between \(\pounds 0.11\) and \(\pounds 0.95\) higher than a “case". Accounting for correlated individual effects reduces the wage gap, whereas accounting for endogeneity increases it. For females, the wage gap is null when health is considered exogenous, but grows to \(\pounds 1.50\) under the 2SLS estimator. Accounting for correlated individual effects shrinks the gap to \(\pounds 0.37\), but when non random selection into the workforce is additionally considered the wage gap between a “normal" and a “case" female jumps to \(\pounds 1.08\) (or, equivalently, 16%). These results are consistent with Ettner et al. [21] and Marcotte and Wilxox-Gok [41], who also found no significant effects for males but large and significant effects for females; our results are smaller in terms of magnitude, perhaps because they considered only more severe disorders (their estimated wage gaps for the female samples are approximately 29% and 49%, respectively, when accounting for endogeneity). Chatterji et al. [13] and Peng et al. [56] found no significant effects on wages.

Table 4 Hourly wage predictions (GBP), males and females

There are various possible interpretations of these results. One possible explanation is that employers discriminate females more than males with regard to psychological distress. Another potential explanation is that the losses in productivity are more apparent for women than for men, perhaps due to selection effects in different tasks or positions. Finally, a further interpretation may be connected to the concepts of presenteeism and absenteeism. Indeed, Ettner et al. [21] showed that mental disorders are negatively correlated with the number of hours worked only for males, and Peng et al. [56] found a positive and significant correlation between the number of work loss days and symptoms of mild depression only for the male sample. These evidences might suggest that women may tend to show up to work even when they are feeling mentally unwell whereas men may prefer to take a sick leave. If this was the case, the significant decrease in the offered wage in the female sample may reflect presenteeism, i.e., a situation of reduced functioning due, for example, to loss of focus or motivation, associated with lower productivity at work. Conversely, men’s reduced functioning due to increased psychological distress would lead to absenteeism, and thus only influence their actual number of working hours. Unfortunately, the BHPS does not contain exact information on working hours or on sick days, as it only reports the number of hours usually worked in a week.

We now turn our attention to the other independent variables (Tables 5, 6). For both samples, the coefficients on the polynomial of age and experience exhibit the expected signs, and are significant at 1% in all specifications. Education is significant and positive for females, indicating a rate of return on having a degree that ranges from 6% to 23%. For males, the coefficient is usually not significant. This might indicate that women tend to self-select in qualification types to show their abilities.

Table 5 Wage equation, males
Table 6 Wage equation, females

For both genders, there is a significant gradient in wages which reflects occupational status. Being a professional is associated with a slightly higher average hourly wage with respect to being a manager, but this could be an indicator that managers can also be compensated with perks which are not accounted for in the hourly wage. For women, being a manual skilled worker has a negligible effect on wage compared with being an unskilled worker, while the difference is more marked for men; this may be explained by the fact that blue collar work is done mostly by men in our samples. For men, working in the private sector increases hourly wage by up to 5% compared to those who work in the public sector. On the contrary, women who work in the private sector earn as much as 6.5% less than in the public sector. This may be due to weakest gender discrimination in the public sector if compared with the private one.

Men’s wages appear to be much more negatively affected by the choice of switching to part-time than women (but only 4% of the male sample is in part-time occupation, while the proportion increases to 34% for females). An increase in the number of employees at the individual’s work place is associated with a strongly significantly but negligibly rise in wages. Training is also associated with a significant increase in wages. As expected, unions have a positive impact on wages, and predictably, the effect is greater (almost double) for members than non-members still covered by union bargaining and renegotiation. In general, union membership appears to have a larger impact on wages for women than for men, which might be caused by positive selection of women in unions.

Having very young children significantly increases hourly wages for females. At least for the females, however, this result may be contaminated by the presence of simultaneity and endogeneity bias for which we do not correct for. The presence of children between 3 and 11 years has a positive and significant effect on the wage for men, while it impacts negatively the wage for females, as does the presence of teenagers in household. This may be due to lost opportunities of career advancements during maternal leave, when compared with women with no children, which do not seem to influence males. The marital status does not appear to be significant across genders.

Conclusions

In this paper, we investigated the relationship between self-assessed psychological well-being and labour market outcomes. Unlike previous papers, we did not only focus on diagnosed mental disorders, but we considered the entire spectrum of unpleasant mental states that may cause psychological distress and result in reduced emotional and cognitive functioning. Furthermore, we addressed the numerous econometric issues that characterize this relationship (i.e., endogeneity, sample selection, and unobserved heterogeneity) in a comprehensive framework.

We provided evidences that correcting for non-random selection into the workforce is necessary at least for the female sub-sample. We showed that mental distress significantly decreases the probability of participating to the labour market, both for males and females. When investigating the direct effect of psychological well-being on hourly wages, we found it to have a significant and positive effect only for the female sample. As a consequence, our findings suggest that psychological distress is important both on the intensive and extensive margins for females, while it only influences males in their participation decision. Whereas this supports the idea of a negative correlation between psychological distress and wages at least for the female sample considered in this study, we caution the readers that claims about causal effect among these two variables may not be made; moreover, since our preferred specification involves the use of instrumental variables, it may not be appropriate to extrapolate our results to the wider population of workers.

It should be noted that the accuracy of the measure of psychological distress that we used in the paper is dependent on individuals providing reliable and accurate responses: indeed, self-assessed measures of health are prone to a potential positive bias, whereby respondents rationalise negative labour market outcomes by self-reporting low health. However, since the questions asked by the GHQ do not explicitly mention work ability, they should be less prone to this bias [38]. Conversely, if respondents have a perceived incentive to under-report mental illness, because of the fear of being stigmatised, socially sanctioned, or disgraced [7, 10], our results are likely to be a lower-bound of the true effect of psychological distress on labour market outcomes.

In conclusion, our results support the idea that even milder forms of mental ill health may have a significant impact on labour market outcomes, but this may differ across genders. Policymakers can try to reduce the associated indirect costs by funding assistance programs for those seeking job opportunities, designing health policies aimed at making psychological support easily accessible, and trying to raise employers awareness on the importance of creating a mentally healthy workplaces and promoting initiatives to help the staff managing stress. However, to inform employment policies, further research is needed on the mechanisms that induce presenteeism and on the productivity trade-off between impaired workplace performances and incentivised absenteeism, especially with a focus on the differences across gender.