Introduction

Depression is one of the most common and consequential mental illnesses (Murray et al. 2013) and is estimated to affect 350 million people worldwide (Marcus et al. 2012). In addition to the large prevalence of depression, it is responsible for the largest global disease burden, accounting for 7% of disability adjusted life years (DALYs, i.e. years of life lost through premature mortality and years lost due to disability for persons suffering from (the consequences of) a health condition) (Ferrari et al. 2013). It is predicted, that by the year 2030, depression will be one of the three leading causes of the global disease burden (Mathers and Loncar 2006). Depression is diagnosed according to the criteria of the ICD-10 (International Statistical Classification of Diseases and Related Health Problems) classification system. The main symptoms of depression include depressed mood, loss of interest or pleasure, and decreased drive and increased fatigability. Additional symptoms of depression include concentration problems, decreased self-esteem, feelings of guilt or worthlessness, negative outlook on the future, suicidal thoughts or actions, sleep disturbances, and loss of appetite. According to the number, type and severity of symptoms and the degree of impairment in social and occupational functioning, the ICD-10 distinguishes between mild, moderate and severe episodes. For a diagnosis of depression, symptoms must have lasted for at least 2 weeks (DIMDI 2019). While the individual impact of depression is tremendous, the substantial economic consequences must be considered as well.

Depression and anxiety disorders cost the global economy an estimated US$1 trillion per year (WHO 2019). Costs of depression include direct costs through healthcare utilization and indirect costs attributable to decreased worker productivity. The prevalence of depression in persons contributing to the workforce is considerable and is associated with decreased work productivity including absenteeism (Henderson and Madan 2014; Musarezaie et al. 2014). In Germany, employers are legally obliged to continuous payment of salary for a maximum period of six consecutive weeks in case of sickness-related absenteeism resulting in excess labour costs without employee output during episodes of sick leave (Federal Government Commissioner for Migration, Refugees and Integration 2020).

However, little is known about the association of depressive symptom severity and duration of absenteeism. Chow et al. (2019) suggested that increasing symptom severity is adversely associated with work productivity. Yet, these findings were based on data originating from the United States of America (USA). To our knowledge, no comparable studies exist for any of the European countries.

While it seems plausible to assume that results will be comparable between the USA and European countries, it must be considered that country-specific features such as economic wealth, social security, human development or geographical region have a relevant influence on work outcomes (van der Zee-Neuen et al. 2017; Chow et al. 2019). Therefore, results should be validated on a national level. The current study aims to shed light on the association of depressive symptom severity and absenteeism as well as on labour costs associated with the duration of sick leave in a representative sample of the general German population. It is hypothesized that (1) absenteeism is increasing with increasing depressive symptom severity and (2) employer labour costs for employees on sick leave raise substantially with increasing symptom severity.

Methods

Secondary data analyses were carried out using data from the German Health Update (GEDA 14/15) (Robert Koch Institute 2015). The GEDA is a cross-sectional survey among the German-speaking adult population aged 18 and older in Germany that is conducted regularly by the Robert Koch Institute as part of the nationwide health monitoring. The GEDA 14/15 was the fourth wave of the GEDA study and was conducted from November 2014 to July 2015 in a sequential mixed-mode design. To represent the target population, a two-stage stratified random sample was drawn. Study participants could take part in the survey via standardized online questionnaire or via standardized written-postal questionnaire. The exact study design and sampling method is described elsewhere (Lange et al. 2017).

All analyses for the current study were performed using STATA 12.0.

Outcome variable

Absenteeism/sick leave was included as outcome variable containing information on the number of weeks absent from work due to sickness during 12 months prior to undertaking the GEDA survey. The variable was calculated based on absenteeism days reported in response to the question “During the past 12 months, how many days in total have you been absent from work due to illness?” Measuring absenteeism through self-report with a 12-month recall period can be considered a valid method. In accordance with previous studies, Ferrie et al. (2005) found that the agreement between self-reported sickness absence with absences recorded in employer’s registers was acceptable and that the association between annual self-reported sickness absence days and self-rated health was comparable to those for recorded absence days (Rees and Cooper 1993; Burdorf et al. 1996; Ferrie et al. 2005).

Covariates and potential confounders

Main independent variable

The eight-item depression patient health questionnaire (PHQ-8) is a self-administered screening tool for the detection of depressive symptoms. It is based on eight of nine criteria of the Diagnostic and Statistical Manual of Mental Disorders—Fourth Edition (DSM‐IV) (Kroenke and Spitzer 2002). The ninth criterion was omitted in the data collection due to its sensitive nature (i.e. the item asks about suicidal thoughts). Omission of the ninth criterion has no impact on the validity of the questionnaire (Kroenke et al. 2009; Shin et al. 2019).

The eight items of the PHQ-8 are scored on a four-point scale, indicating how often one has been bothered by problems during a 2-week period prior to undertaking the survey (0 = not at all, 1 = several days, 2 = half the days, 4 = nearly every day). Items focus on (1) interest and pleasure in doing things, (2) feeling down depressed or hopeless, (3) trouble falling or staying asleep or sleeping too much, (4) feeling tired or having little energy, (5) poor appetite or overeating, (6) feeling bad about oneself, (7) trouble concentrating and (8) restlessness or unusual tranquillity. The PHQ-8 was used as the main independent variable (categorical: five categories, 1 = no/minimal depressive symptoms, sum-score PHQ-8: 0–4, 2 = mild depressive symptoms, sum-score PHQ8: 5–9, 3 = moderate depressive symptoms, sum-score PHQ-8: 10–14, 4 = moderately severe depressive symptoms, sum-score PHQ-8: 15–19, 5 = severe depressive symptoms, sum-score PHQ-8: 20–24).

Of the characteristics included in the GEDA dataset, age, sex, body mass index, smoking status, marital status, socioeconomic status, number of comorbidities, social support, physical activity, alcohol consumption and support/care of others are considered relevant factors for the development or worsening of depression throughout the literature and were therefore included in the analyses as potential confounders or covariates (Blazer 2003; Jacobi et al. 2004; Sareen et al. 2006; Marengoni et al. 2011; Kleppang et al. 2018). Below, an overview of these variables and their metric properties is provided.

Demographics

  • Age (categorical: 1 = 18–39 years, 2 = 40–59 years, 3 ≥ 60 years).

  • Sex (categorical: 1 = male/2 = female).

  • Marital status (categorical: 1 = single, 2 = married/living with partner, 3 = widowed, 4 = divorced/separated).

  • Socioeconomic status (SES, categorical: 1 = low, 2 = average, 3 = high). The SES-Index was part of the data provided to the authors for further analyses. It reflects a multidimensional sum score based on information on education, profession and income. For the calculation of the variable that was provided to and used by the authors, the three dimensions were first converted to metric scales. Next, they were entered in the SES-Index using equal weights (Lampert et al. 2013).

Health behaviour

  • Smoking status (categorical: 1 = daily, 2 = sometimes; 3 = no, cessation, 4 = never smoked).

  • Alcohol consumption (categorical: 1 = daily, 2 = weekly, 3 = monthly, 4 = less than monthly, 5 = quit consumption, 6 = never).

  • Physical activity (categorical: 1 = inactive, 2 = active during leisure time, 3 = active during work, 4 = active during work and leisure time).

Health status

  • Body mass index (BMI, categorical: underweight BMI < 18.5, normal BMI ≥ 18.5 and  ≤ 24.99, overweight ≥ 25 and  < 30, obese ≥ 30).

  • Number of comorbidities (continuous: disease count in the past 12 months prior to undertaking the GEDA-survey: excluding depression, including: asthma, chronic bronchitis, myocardial infarction, coronary heart disease, heart insufficiency, osteoporosis, rheumatoid arthritis, osteoarthritis, stroke, diabetes, chronic kidney disease, cirrhosis of the liver, other liver disease, ulcerative colitis, stomach ulcer).

Social/relational

  • Support of/care for other persons (categorical: 1 = yes, 2 = no).

  • Social support (Oslo-3 Social Support Scale sum-score, 1 = little support, 2 = moderate support, 3 = strong support). The Oslo-3 Scale generates information on (a) the number of persons the respondent can rely on in case of severe, personal problems, (b) the amount of interest and sympathy others have for the respondent and (c) how easy it is to receive practical support from a neighbour. The indicator used in the current study is based on the calculation of three categories reflecting the value ranges of the total point value of the three items asked (Meltzer 2003; Kilpeläinen et al. 2008).

Statistical analyses

For the purpose of the current cross-sectional analyses. only subjects who indicated to work part- or full-time and without missing values in any of the considered variables were included. First, descriptive characteristics were used to characterize the sample as a total (i.e. including all subjects with part- or full-time work) and by depressive symptom severity. Next, due to overdispersion of zeros in the outcome variable (i.e. many persons did report zero sick leave), zero-Inflated Poisson regression analyses were performed.

Zero-inflated Poisson regression (ZIP) is used for the handling of count data (e.g. weeks of absenteeism) and is particularly suitable for situations in which the number of zero counts is high (e.g. many subjects with no absenteeism). ZIP models assume that excess zeros occur due to a separate process than the actual counts and should be modelled independently.

A ZIP model therefore consists of two parts, a Poisson count part and a logit part. The first part predicts the factor increase in the outcome variable (i.e. sick leave duration in terms of weeks), and the latter part predicts the log odds of excess zeros (Lambert 1992; Cameron and Trivedi 2013; Date 2020). To verify that the response variable was captured better by ZIP than by a regular Poisson regression, Vuong’s test was used.

To determine the final model, manual forward selection was employed for the count part of the model after univariable analyses. Variables which were found to have a significant association with the outcome in univariable analyses remained in the model if they confounded the association of the independent variable of interest with the outcome (i.e. ≥ 10% change in coefficient for PHQ-8) or contributed to the overall model fit estimated by log-likelihood ratio tests. For the latter, significance was assumed at p ≤ 0.05 and Chi2 changes ≥ 20 were considered relevant. A variance inflation factor analysis was performed on a simplified model (i.e. a linear regression model including the variable of the ZIP count part) to rule out multicollinearity.

Variables in the logit (i.e. inflate) part of the model were selected based on their potential role in increasing the probability of zero sick leave and were retained in the model if they contributed significantly to this probability. Considered variables included:

- Depressive symptom severity, assuming that the absence of depressive symptoms is likely to result in zero sick leave and that higher symptom severity results in smaller log odds of having no sick leave.

- Age, as younger individuals have lower risk of sick leave.

- SES since lower SES is associated with lower risk of sick leave.

- Number of comorbidities since the absence of any (co-)morbidity is likely to result in a higher probability of zero sick leave.

A two-way interaction term including sex and depressive symptom severity was computed and added to the final multivariable ZIP model to explore potential effect modification through sex. Finally, sick leave weeks for each symptom severity stage and sex were predicted.

Based on the final, fully adjusted models, weeks of sick leave for each symptom severity stage were predicted using the Stata postestimation command “margins”.

Calculation of labour costs

Average weekly labour costs for men and women in full- or part-time employment were retrieved from the statistical yearbook of the Federal German Statistical Office (Statistisches Bundesamt DESTATIS 2019).

Appropriate values (i.e. € 853.47 for women working full-time and € 491.83 for women working part-time/€ 1041.84 for men working full-time and € 539.71 for men working part-time) were assigned to the study subjects. Sick leave weeks predicted by the ZIP model were then multiplied with the average labour costs for men and women, respectively.

Results

The full sample including persons with part- or full-time work consisted of 12,405 subjects of whom 8910 (71.83%) worked full-time, 6605 (53.42%) were women, 7070 (56.99%) were between 40 and 59 years of age, 6081 (49.02%) had a normal BMI and 7119 (57.39%) were married or in a relationship and living together. The average sum of sick leave in 1 year was 1.89 (SD 4.26) weeks. A more extensive description of the characteristics of the sample as a total and by depressive symptom severity is provided in Table 1.

Table 1 Characteristics of the study population as a total and by depressive symptom severity

Association of depressive symptom severity with sick leave

Vuong’s test confirmed that the ZIP model was superior to a regular Poisson regression model. Results of the log-likelihood ratio tests during the model building process and estimated association of symptom severity with sick leave for each step in the model building process are presented in Supplementary Table S1.

The final model included the following covariates in the count part of the model: age, sex, BMI, smoking status, marital status, SES and number of comorbidities. The logit part of the model entailed depressive symptom severity, age, SES and number of comorbidities. Alcohol consumption was removed from the model due to multicollinearity.

The interaction term of sex with depressive symptom severity was significantly associated with absenteeism. Therefore, stratified analyses based on sex were performed.

Compared to men and women with no or minimal depressive symptoms, men and women with mild, moderate, moderately severe and severe depressive symptoms had a significantly higher number of sick leave weeks. In men, the number of sick leave weeks increased by a factor of 1.33 [1.27;1.40] for mild depressive symptoms, 1.61 [1.50;1.73] for moderate depressive symptoms, 1.91 [1.71;2.13] for moderately severe depressive symptoms and 3.84 [3.29;4.50] for severe depressive symptoms.

In women, the number of sick leave weeks increased by factor 1.37 [1.31;1.43] for mild, 2.04 [1.93;2.15] for moderate, 2.54 [2.35;2.75] for moderately severe and 2.96 [2.63;3.33] for severe depressive symptoms. Table 2 shows the univariable and multivariable results of the ZIP analyses.

Table 2 Adjusted association of depressive symptom severity with weeks of sick leave during 12 months in men and women with full- and part-time employment (n = 12,405)

Based on the fully adjusted ZIP models, the predicted weeks of sick leave were 1.46 [95% CI 1.41;1.51] in men with no/minimal depressive symptoms, 2.25 [95% CI 2.13;2.37] in men with mild depressive symptoms, 3.02 [95% CI 2.74;3.29] in men with moderate depressive symptoms, 3.43 [95% CI 2.87;3.99] in men with moderately severe depressive symptoms and 7.15 [95% CI 5.00;9.30] in men with severe depressive symptoms.

In women with no/minimal, mild, moderate, moderately severe and severe depressive symptoms, the predicted weeks of sick leave were 1.53 [95% CI 1.48;1.58], 2.35 [95% CI 2.25;2.50], 3.77 [95% CI 3.53;4.01], 4.76 [95% CI 4.23;5.29] and 6.30 [95% CI 5.35;7.26], respectively (Fig. 1).

Fig. 1
figure 1

Predicted weeks of sick leave and associated per employee labour costs in men and women with increasing depressive symptom severity

Labour costs

The average weekly labour costs in the study sample were €1764.94 (SD 1277.62) for men and €1,370.99 (SD 914.54) for women. Costs of sick leave according to depressive symptom severity in men were €1486.22 for no/minimal symptoms, €2262.67 for mild symptoms, €3037.00 for moderate symptoms. €3449.31 for moderately severe symptoms and €7190.25 for severe symptoms. In women, these costs were €1045.82; €1606.32; €2576.95; €3253.65 and €4306.30, respectively (Fig. 1).

Discussion

The present study showed that sick leave and costs for employers of persons with depressive symptoms rise substantially with increasing, self-reported symptom severity. To our knowledge, this is the first study focussing on the association of depressive symptom severity with absenteeism and related costs in the European region. However, the present findings are in line with other international studies confirming that absenteeism in employees with depressive symptoms is larger than in those without and that the severity of depression adversely affects their workability (Lerner et al. 2010; Beck et al. 2011; Johnston et al. 2019).

In agreement with the current study findings, Mcternan et al. (2013) reported that sick leave costs in employees with mild depressive symptoms are notably higher than in employees with no or minimal depressive symptoms. Moreover, we found that an increasing severity of symptoms was associated with an increasing risk of sick leave duration.

These results apply to both men and women.

Interestingly, the rather continuous increase in predicted sick leave duration and associated costs in men was substantially steeper when comparing the moderately severe symptom category to the severe symptom category.

A potential explanation might be that men are more likely to continue work until their health fully prevents work continuation. On the other hand, considering the cross-sectional nature of our analyses, the phenomenon might indicate that absence from work is related to an increased symptom severity. An underlying mechanism could be a decreased sense of self-esteem related to labour market inactivity, which is related to more severe symptoms of depression in men but not in women (Alvaro et al. 2019).

The large study sample, the lengthy assessment period of work absenteeism, comprehensiveness and quality of available data as well as their analyses pronounce the strength of the present study. However, it must also be considered that the cross-sectional data collection resulted in the recording of all variables at the same time with the same methodology. Therefore, there is a risk for the occurrence of same-source bias (i.e. people with pronounced depressive symptoms might have assessed other variables, like absenteeism more negatively). This might have resulted in an overestimation of the adverse association of depressive symptom severity and absenteeism.

As in all cross-sectional studies, no conclusions on the causality of observed associations can be drawn. While it seems plausible to assume that higher levels of depressive symptoms lead to an increased degree of absenteeism, being absent from work could also lead to social isolation and, in extension, to higher levels of depression. However, both directions would indicate a need for measures aiming at the prevention of depression at or initiated by the work place. Here, the subjectively perceived health risk through work, that appeared to be most pronounced in individuals with severe depression symptoms in our study, should be taken into account. Since individuals’ health and work beliefs are known to influence individual sickness absence decisions Sallis and Birkin 2014), there is a need for specific rehabilitation strategies with person-centred approach (UWV 2020).

The external validity of the current study results is strong for the German context, which is mainly attributable to the adequate sampling strategy used for the collection of data. In Germany, salary payment in case of sickness absence continues for a period of up to six subsequent weeks for each cause of sickness. Additional payment continuation for the same cause of sick leave may be required if a period of ≥ 6 months lies between the two periods of absenteeism.

However, payment continuation in case of sick leave is organized differently in other countries indicating that our cost analysis is not directly transferable to other countries. Yet, the overarching message of the necessity of prevention strategies is very well transferable.

In countries in which employer payment continuation exceeds the duration of that in Germany, the matter may be even more pressing. For example, employers in the Netherlands have to maintain salary payment of sick employees for a duration of two years (i.e. 100% of salary during the first and 70% during the second year) (UWV 2020).

In the current study, we chose to explore the association of depressive symptom severity based on a self-report questionnaire with sick leave and associated costs. However, it may be assumed that similar or higher absenteeism and costs apply for persons with clinically confirmed depression.

Due to a limited identifiability of mental health conditions in the working environment, initiation of professional and adequate treatment may take a prolonged period of time. In this context, early prevention and promotion of (mental) health represent fields of action for all employers. The present study results emphasize the need for implementation, realignment or extension of professional work-site health promotion programs aiming (1) at the improvement and maintenance of employee health and (2) at the reduction of labour costs associated with depression related sick leave. Investment in such programs will, in addition to health benefits for employees, likely result in direct monetary benefit for employers, health insurances and the health security system in general.