Introduction

Employees’ health and well-being is important not only for individuals themselves, but also for companies and public organizations, as it leads to higher productivity levels, less absenteeism, less employee turnover and lower health care costs (Wright and Bonett 2007; Ford et al. 2011; Faragher et al. 2013; Kramer and Son 2016; Beehr 2019). Creating and developing a healthy workplace is therefore an important challenge for companies, organizations and political stakeholders (World Health Organization 2005).

Supervisors function as connecting link between individual health and organizational health promotion and are promoters of occupational health and safety in worksite settings (Kelloway and Barling 2010; Rump et al. 2016; Straub et al. 2017; Rudolph et al. 2020). However, health promotion as a leadership task is often not recognized as such or is not included in leadership training. Therefore, such leadership interventions have grown in popularity (Struhs-Wehr 2017; Spitzenverband Bund der Krankenkassen 2018; Rudolph et al. 2020). Consequently, questions about the effectiveness of these interventions have arisen. To that end, a recent systematic review (Kuehnl et al. 2019) addressed the effectiveness of leadership interventions on outcomes at the employee level. Kuehnl et al. (2019) examined the effects of human resource management training of supervisors for improving the health and well-being of employees. The review included interventions aiming to improve supervisor-employee interaction and the supervisors’ capability of designing the work environment. Interventions addressing health-oriented behaviors were excluded. Twenty-five studies were included in that review. Overall, inconsistent evidence was found for a positive impact of human resource management training programs on employees’ stress, well-being or absenteeism when compared to no intervention, and no evidence of a considerable effect was found when compared to other interventions. These results are surprising, since the influence of leadership behavior on employee health has repeatedly been postulated (Kuoppala et al. 2008; Nyberg et al. 2009; Skakon et al. 2010; Gregersen et al. 2011; Schyns and Schilling 2013; Montano et al. 2017). This may be partly attributable to the inclusion and exclusion criteria for studies proposed by Kuehnl et al. (2019). According to the concept of health-oriented leadership described by Franke et al. (2014), leaders’ management of their own health (described as i.e., leaders’ self-care), composed of the three dimensions’ value, awareness and behavior, serves as a relevant precondition for health-oriented leadership behavior (Franke et al. 2015). In line with this concept, leaders must perceive their own health as important (dimension value), must be aware of their health status (dimension awareness) and take care of their health (dimension behavior) in order to be able to lead in a health-oriented way. Franke et al. (2015) suggested that leadership training should be built on these dimensions and as such should (i) raise awareness for the importance of health issues (dimension value), (ii) teach mindfulness practices for conscious awareness (dimensions awareness) and (iii) reduce stress and promote resources at the level of individual behavior (dimension behavior). However, these aspects of leadership training were not taken into account by Kuehnl et al. (2019). Therefore, the aim of this systematic review was to identify and summarize the evidence for the effects of health-oriented training programs targeted at supervisors—which explicitly address leaders’ management of their own health based on the concept of health-oriented leadership by Franke et al. (2014)—on the psychomental stress, absenteeism or well-being of employees.

Methods

For this systematic review, the Preferred Reporting Items for Systematic Reviews and Meta-Analyses (PRISMA) guidelines (Moher et al. 2009; checklist contained in Appendix 1) were adhered to, and it was registered in the International Prospective Register of Systematic Reviews (PROSPERO) (CRD42020205502).

Data sources

A literature search was performed on September 3, 2020, using four databases: Cochrane Library, MEDLINE, PsycINFO and Web of Science. No search restrictions were made regarding language or year of publication. Searches were performed using database-specific index terms (e.g. Medical Subject Headings) and relevant free text words in titles and abstracts. The specific search strategies, including applied search terms for each database, are outlined in Appendix 2. To identify unpublished and ongoing trials, the following trial registers were searched: ClinicalTrials.gov, WHO International Clinical Trials Registry Platform, and Trials Register of Promoting Health Interventions. A manual search was performed in Google Scholar and the Federal Institute for Occupational Safety and Health (BAuA). Additionally, systematic reviews and reference lists of included studies were checked for eligible references.

Inclusion and exclusion criteria

The population-intervention-comparison-outcome-study design schema (Higgins et al. 2020) was utilized to determine the eligibility criteria:

  1. 1.

    Population (P): Studies that enrolled supervisors as the target group, independently of gender, management level and number of employees.

  2. 2.

    Intervention (I): Studies that conducted interventions to (i) raise awareness of the importance of health issues (dimension value), (ii) teach mindfulness practices for conscious awareness (dimensions awareness) and (iii) reduce stress and promote resources at the level of individual behavior (dimension value). Interventions that focused on improving supervisor–employee interaction (e.g. providing social support, communication skills) and were also open to employees as participants were excluded.

  3. 3.

    Comparison (C): Studies comparing health-oriented interventions targeted at supervisors with a passive control group or with an active control group receiving an alternative intervention.

  4. 4.

    Outcome (O): Studies with measurements of psychomental stress, such as the Maslach Burnout Inventory (Maslach et al. 1996), any estimate of absenteeism and measures of well-being, such as the Utrecht Work Engagement Scale (Schaufeli et al. 2002). Outcome measurements must have been performed in employees, not supervisors.

  5. 5.

    Study design (S): Randomized controlled trials (RCTs), cluster-randomized controlled trials (cRCTs) and controlled before–after studies (CBAs) measuring outcomes both before and after the intervention.

Following the Cochrane Handbook of Systematic Reviews of Interventions (Higgins et al. 2020) suggestions, two reviewers (I.D. and H.L.-W.) independently searched the different databases for potential studies. In a first step, titles and abstracts were screened and irrelevant studies were excluded. In a second step, the full text of the remaining studies was retrieved and screened according to the inclusion and exclusion criteria. Any disagreements were resolved by discussion or, if necessary, by third-party consultation (A.K.). Reasons for study exclusion are outlined in Appendix 3.

Data extraction

Two researchers (I.D. and V.G.) independently extracted data from each study into a predefined data extraction form. Information was collected on study framework, intervention, sample size and relevant outcomes including description of measurement, statistical analyses and description of main results. Any disagreement was resolved by discussion or, if necessary, by third-party consultation (H.L.-W.).

Risk of bias assessment

RCTs and cRCTs were assessed using the Revised Cochrane Tool for randomized trials, described in the Cochrane Handbook for Systematic Reviews of Interventions (Higgins et al. 2020). The following potential sources of bias were added for cluster-randomized trails: recruitment bias, baseline imbalance, loss of cluster, incorrect statistical analysis, comparability with randomized trials. CBAs were evaluated following the Cochrane Effective Practice and Organisation of Care (EPOC) criteria (EPOC 2017). Based on the overall risk of bias judgment, each domain of bias of included studies was rated as high, low or unclear. At study level, risk of bias was considered to be high when one or more key domains were judged to be at high risk of bias (Higgins et al. 2020). In RCTs and cRCTs, key domains were random sequence generation, allocation concealment, incomplete outcome data and selective outcome reporting. In CBAs, key domains were similarity of baseline outcome measurements, similarity of baseline participant characteristics, adequately addressing incomplete outcome data, adequately preventing knowledge of the allocated interventions during the study, adequately protecting against contamination, and freedom from selective outcome reporting. Rating criteria for the risk of bias assessment that was independently performed by two reviewers (I.D. and V.G.), are presented in Appendix 4 and Appendix 5. Any disagreement was resolved by consensus or, where needed, with consultation of a third reviewer (H.L.-W.).

Results

Study selection

In total, 6126 results were generated from the initial database search. Sixteen articles were identified through manual search. After removal of 861 duplicates, title-abstract screening of 5281 publications was performed. Of these, 31 full-text papers were selected for detailed eligibility screening. Finally, ten studies remained for qualitative analyses. Figure 1 illustrates the selection process. Reasons for exclusion of the full-text screened studies are described in Appendix 3.

Fig. 1
figure 1

Flow diagram for the study selection regarding health-oriented leadership interventions for improving health- and well-being of employees

Study characteristics

Among the ten identified studies, six were cRCTs and four were controlled before–after studies. Seven studies were performed between 2013 and 2019 (Angelo and Chambel 2013; Elo et al. 2014; Stansfeld et al. 2015; Milligan-Saville et al. 2017; Barrech et al. 2018; Lange and Rowold 2019; Veloso-Besio et al. 2019), the others in 2006 and 2005 (Kawakami et al. 2005; Kawakami et al. 2006; Takao et al. 2006). Five studies were conducted in Europe (Angelo and Chambel 2013; Elo et al. 2014; Stansfeld et al. 2015; Barrech et al. 2018; Lange and Rowold 2019), three in Japan (Kawakami et al. 2005; Kawakami et al. 2006; Takao et al. 2006), one in Chile (Veloso-Besio et al. 2019) and one in Australia (Milligan-Saville et al. 2017). Interventions were heterogeneous, ranging from a single 4-hour training session (Milligan-Saville et al. 2017) to comprehensive training programs lasting several days, including short lectures on well-being, role playing and group discussions on leadership (Elo et al. 2014). In addition to face-to-face interventions, three studies (Kawakami et al. 2005; Kawakami et al. 2006; Stansfeld et al. 2015) applied a web-based training program for supervisors. Interventions were carried out in different organizational settings, including fire departments (Angelo and Chambel 2013; Milligan-Saville et al. 2017), a computer software engineering company (Kawakami et al. 2006) and a public hospital (Veloso-Besio et al. 2019). The number of employees included in the statistical analyses ranged from 70 (40 in intervention group and 30 in control group) (Veloso-Besio et al. 2019) to 1966 (1233 employees in intervention group, 733 employees in control group) (Milligan-Saville et al. 2017). All studies were screened for the assessment of stress, absenteeism and well-being of employees. Eight studies measured outcomes of stress using seven different questionnaires (Kawakami et al. 2005; Kawakami et al. 2006; Takao et al. 2006; Angelo and Chambel 2013; Elo et al. 2014; Stansfeld et al. 2015; Barrech et al. 2018; Lange and Rowold 2019), two studies measured outcomes of absenteeism differentiating between self-reported, work-related and standard sick leave (Stansfeld et al. 2015; Milligan-Saville et al. 2017) and three studies measured well-being outcomes with three different scales (Angelo and Chambel 2013; Stansfeld et al. 2015; Veloso-Besio et al. 2019). Only one study addressed all three outcomes simultaneously (Stansfeld et al. 2015). Table 1 presents a summary of the characteristics of the ten studies.

Table 1 Study characteristics of the ten studies included

Risk of bias

A summary of the risk of bias assessment is presented in Fig. 2. Overall, seven studies were judged to be at an overall high risk of bias, and three studies to be at an overall unclear risk of bias. For detailed justifications for risk of bias judgment, see Appendix 6. Due to the low methodological quality of most studies and the insufficient number of outcomes within each comparison, meta-analysis and a quality of evidence assessment were not performed.

Fig. 2
figure 2

Summary of risk of bias assessment of the ten included studies

Effects of interventions

Table 2 presents the reported results of the ten included intervention studies regarding the investigated outcomes of stress, absenteeism and well-being. Results are presented separately according to control type (intervention versus no intervention; intervention versus other intervention). Studies were grouped according to study design, follow-up times and outcome. Follow-up times were categorized into short-term (less than 3 months), mid-term (3 months to 1 year) and long-term (more than 1 year).

Table 2 Reported results of the ten included intervention studies

Type of control group

Eight studies evaluated the effectiveness of their intervention compared to no intervention (Takao et al. 2006; Angelo and Chambel 2013; Stansfeld et al. 2015; Elo et al. 2014; Milligan-Saville et al. 2017; Barrech et al. 2018; Lange and Rowold 2019; Veloso-Besio et al. 2019), and three studies compared health-oriented training programs of supervisors to another intervention (Kawakami et al. 2005; Kawakami et al. 2006; Lange and Rowold 2019).

Outcome variable Stress

Three cRCTs examined the effectiveness of health-oriented training programs on outcomes of stress compared to no intervention using a mid-term follow-up (Takao et al. 2006; Stansfeld et al. 2015; Barrech et al. 2018). Statistically significant intervention effects were only reported on exhaustion tendency as measured by the Giessen Subjective Complaints List (Brähler and Scheer 1995). Regarding mid- and long-term follow-up of CBAs in comparison to no intervention, no statistically significant intervention effect on employees’ stress was found for supervisor training (Angelo and Chambel 2013; Elo et al. 2014; Lange and Rowold 2019). In comparison to another training, no intervention effect was found on irritation, a stress outcome, at mid-term follow-up of one CBA (Lange and Rowold 2019) and on psychological distress stress in two cRCTs (Kawakami et al. 2005; Kawakami et al. 2006).

Outcome variable Absenteeism

Two cRCTs analyzed whether training of supervisors had an effect on employees’ absenteeism (compared to no intervention) using mid-term follow-up (Stansfeld et al. 2015; Milligan-Saville et al. 2017). Marginally significant intervention effects (p < 0.10) were seen on self-reported sickness absence (Stansfeld et al. 2015). Statistically significant intervention effects (p < 0.05) were reported on work-related sick leave (Milligan-Saville et al. 2017).

Outcome variable Well-Being

Well-being as outcome was examined in three studies comparing the intervention to no intervention, one cRCT using a mid-term follow-up (Stansfeld et al. 2015), one CBA using a short-term follow-up (Veloso-Besio et al. 2019) and one CBA using a mid-term follow-up (Angelo and Chambel 2013). Intervention effects of supervisor training on well-being outcomes were only reported in two CBAs. Marginally significant intervention effects (p < 0.10) were seen on vigor at mid-term follow-up (Angelo and Chambel 2013), and statistically significant changes (p < 0.05) were seen on job satisfaction at short-term follow-up (Veloso-Besio et al. 2019).

Discussion

Summary and evaluation of available evidence is a fundamental prerequisite for developing effective evidence-based health promotion interventions and preventive activities (Knorpp and Kroke 2012). Therefore, this systematic review identified and summarized the evidence for the effectiveness of health-oriented training programs targeted at supervisors that addressed leaders’ management of their own health, based on the concept of health-oriented leadership by Franke et al. (2014), on psychomental stress, absenteeism or well-being of employees. Overall, ten intervention studies could be identified assessing at least one employee-related outcome. Significant positive effects of leadership training were reported on exhaustion tendency (Barrech et al. 2018), self-reported sickness absence (Stansfeld et al. 2015), work-related sickness absence (Milligan-Saville et al. 2017) and job satisfaction (Veloso-Besio et al. 2019) in studies comparing health-oriented training programs to no intervention. However, due to the high overall risk of bias assessed for three studies and the overall unclear risk of bias judgment for one study, these effects should be interpreted with caution. Nonsignificant effects, on the other hand, were reported from studies comparing health-oriented leadership training to other training. Also, these results should be interpreted with caution due to the overall low study quality. In addition to the aforementioned high risk of bias in most studies, small sample sizes, nonrandomized study designs and nonrandom allocation, as well as incomplete and selective reporting, were identified. Based on these findings, the evidence for the effectiveness of the evaluated health-oriented trainings programs targeted at supervisors on employees´ psychomental stress, absenteeism or wellbeing should be judged as low. This finding is similar to that of Kuehnl et al. (2019) and Stuber et al. (2020), who also found no clear evidence for the effectiveness of the evaluated training programs targeted at supervisors. Hence, similar to their conclusion, a strong need for further well-designed studies is to be stated.

Beyond study quality, other explanations for the findings should also be explored, given the strong notion of a relevant role of leaders in promoting employee’s health (Gregersen et al. 2011; Montano et al. 2017; Kaluza et al. 2020). Firstly, (the majority) of leadership training programs may not be effective; i.e. the underlying theories, applied methods and training content focusing on improving leaders’ behavior and capabilities might be not sufficient to yield comprehensive improvements in employees’ health and well-being.

Second, studies may be unable to measure the effectiveness of health-oriented leadership interventions on outcomes at the employee level, as these are complex interventions with a comprehensive, multistep pathway from leadership training to improved employee health and well-being: training programs must be perceived as helpful by supervisors, must induce changes in attitudes and must result in successful acquisition of knowledge and skills. Eventually, these new attitudes, knowledge and skills need to be transferred into practice. The modified practice then has to effectively exert its influence on employees in terms of improved health or well-being. Finally, these changes must be assessed. Given this complex multistep pathway, future studies might better focus on single steps.

A third explanation relates to the workplace settings in which the studies are conducted. Carrying out intervention studies in this dynamic and complex setting has inherent complications, which may influence the effectiveness of the interventions. These are mostly “sideline” activities which are not directly relevant to core task completion (Kristensen 2005). The resulting incomplete study compliance and rapidly changing organizational contexts might hamper long-term follow-up measurements (Nielsen et al. 2010). In addition, competing high job demands or available resources also shape leadership behavior (Zimber et al. 2015; Knudsen et al. 2009; Mohr and Wolfram 2010; Arnold and Rigotti 2020). Therefore, mutual commitment between researchers and organizations/managers needs to be built up first in order to improve the evidence base in occupational health research (Kristensen 2005; Schelvis et al. 2016).

Limitations of the included studies

The included studies are subject to various limitations. First, due to the natural settings, randomization and blinding was difficult to realize, thereby increasing the likelihood of bias. Second, outcomes were mainly assessed using measurement tools relying on self-reports, which might have distorted study results. Third, a wide variety of different measurement tools were applied to assess study outcomes. Fourth, high dropout rates increased the likelihood of an underestimated intervention effect. Fifth, it remained unclear which moderating or meditating factors influenced the observed intervention effects. Sixth, the time between intervention and follow-up might have been too short to produce intervention effects on employees’ health or well-being. Finally, the heterogeneity in study design (e.g. training content, timing and mode of delivery of interventions) reduced study comparability.

Limitations of the systematic review

Although this systematic review was conducted according to the standards of the PRISMA statement (Moher et al. 2009), several limitations have to be considered. It is possible that not all relevant intervention studies were identified, although an expanded search in various databases was conducted. Other or additional search terms might have led to more potentially relevant publications. The inclusion and exclusion criteria were based on a selected health-oriented leadership concept, that of Franke et al. (2014). However, previous authors have criticized the construct proliferation (accumulation of seemingly different healthy leadership concepts but potentially identical constructs) of different health-oriented leadership approaches (Rudolph et al. 2020). Therefore, the consideration of different leadership concepts considering different behaviors, e.g. relationship-oriented, task-oriented, change-oriented (Wegge et al. 2014; Inceoglu et al. 2018) or health-beneficial leadership styles (such as transformational leadership (Bass and Riggio 2006), may add further insights.

Conclusion

In summary, evidence for the effectiveness of the evaluated health-oriented leadership interventions on employees’ stress, absenteeism or well-being is judged to be low. Instead, the results of this systematic review call for more and higher-quality research. Future results might then allow recommendations to be made for the conception of effective leadership interventions, that is, to exert significant positive effects on employees’ health. Thus, research on health-oriented leadership remains a central task in the field of occupational health research (Rudolph et al. 2020).