External validation of a prediction model and decision tree for sickness absence due to mental disorders

van Hoffen, Marieke F. A.; Norder, Giny; Twisk, Jos W. R.; Roelen, Corné A. M.

doi:10.1007/s00420-020-01548-z

External validation of a prediction model and decision tree for sickness absence due to mental disorders

Original Article
Open access
Published: 11 May 2020

Volume 93, pages 1007–1012, (2020)
Cite this article

Download PDF

You have full access to this open access article

International Archives of Occupational and Environmental Health Aims and scope Submit manuscript

External validation of a prediction model and decision tree for sickness absence due to mental disorders

Download PDF

Marieke F. A. van Hoffen ORCID: orcid.org/0000-0003-1563-5327^1,2,4,
Giny Norder¹,
Jos W. R. Twisk² &
…
Corné A. M. Roelen^2,3

1410 Accesses
2 Citations
Explore all metrics

Abstract

Purpose

A previously developed prediction model and decision tree were externally validated for their ability to identify occupational health survey participants at increased risk of long-term sickness absence (LTSA) due to mental disorders.

Methods

The study population consisted of N = 3415 employees in mobility services who were invited in 2016 for an occupational health survey, consisting of an online questionnaire measuring the health status and working conditions, followed by a preventive consultation with an occupational health provider (OHP). The survey variables of the previously developed prediction model and decision tree were used for predicting mental LTSA (no = 0, yes = 1) at 1-year follow-up. Discrimination between survey participants with and without mental LTSA was investigated with the area under the receiver operating characteristic curve (AUC).

Results

A total of n = 1736 (51%) non-sick-listed employees participated in the survey and 51 (3%) of them had mental LTSA during follow-up. The prediction model discriminated (AUC = 0.700; 95% CI 0.628–0.773) between participants with and without mental LTSA during follow-up. Discrimination by the decision tree (AUC = 0.671; 95% CI 0.589–0.753) did not differ significantly (p = 0.62) from discrimination by the prediction model.

Conclusion

At external validation, the prediction model and the decision tree both poorly identified occupational health survey participants at increased risk of mental LTSA. OHPs could use the decision tree to determine if mental LTSA risk factors should be explored in the preventive consultation which follows after completing the survey questionnaire.

Development of Prediction Models for Sickness Absence Due to Mental Disorders in the General Working Population

Article 16 August 2019

External Validation and Update of a Prediction Rule for the Duration of Sickness Absence Due to Common Mental Disorders

Article Open access 03 June 2016

Predicting long-term sickness absence among employees with frequent sickness absence

Article Open access 24 November 2018

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Mental disorders are the major cause of long-term sickness absence (LTSA) in member countries of the Organization of Economic Co-operation and Development (OECD 2015). Mental LTSA disconnects employees from the workplace and marginalizes them from the labor market, leading to unemployment, social isolation, and poorer mental health (Henderson et al. 2011). Therefore, it is important to identify employees at risk of mental LTSA before they report sick.

Previous studies have shown that distress symptoms identify non-sick-listed employees with an increased risk of future mental LTSA (Roelen et al. 2013; van Hoffen et al. 2015, 2016). Recently, Van Hoffen et al. (2019) included distress in a multivariable prediction model for mental LTSA with gender, marital status, economic sector, years employed at the company, role clarity, cognitive demands, learning opportunities, co-worker support, social support from family/friends, and work satisfaction as additional predictor variables. The authors reported that this 11-predictor model correctly assigned the highest risk to those who had mental LTSA during 1-year follow-up in 71.3% of the cases. Furthermore, they found that a 3-knot decision tree based on distress, gender, work satisfaction and work pace correctly identified employees with mental LTSA in 70.9% of the cases. Decision trees are easier to interpret than regression formulas and are therefore more user-friendly for healthcare practice (Loh 2014). However, decision trees more than regression models depend on the data in which they are developed, and may therefore be less valid in new populations of employees (Stiglic et al. 2012).

The aim of the present study was to validate the previously developed prediction model and decision tree in a new sample of occupational health survey participants. If externally valid, the prediction model and/or decision tree can be implemented in occupational healthcare practice to identify employees at risk of mental LTSA.

Methods

Study population and design

In The Netherlands, employers have to enable their employees to participate in occupational health surveys once every 4 years. Participation in occupational health surveys is voluntary for employees. Occupational health surveys are conducted by an occupational health service (OHS) and consist of an online survey questionnaire addressing health status and working conditions, followed by a preventive consultation with an occupational health provider (OHP, i.e., occupational physician or nurse). Preventive consultations as part of the occupational health survey are restricted to one session per participant. Those who need more support can be referred to other OHPs such as lifestyle coaches, physiotherapists, social workers, or psychologists.

In 2016, 3415 employees in mobility services were invited to participate in an occupational health survey. A total of 1736 (51%) non-sick-listed employees agreed to participate in the occupational health survey. The study was designed as a prospective cohort study with the occupational health survey as baseline and OHS recorded sickness absence as an outcome at 1-year follow-up. Results are presented in line with the transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (Moons et al. 2015).

Outcome: long-term sickness absence (LTSA)

Sickness absence was defined as a temporary paid leave from work due to any (i.e., work-related as well as non-work-related) injury or illness, and was recorded from the first to the last sickness absence day in the OHS register. In The Netherlands, sickness absence is medically certified by an occupational physician (OP) within 42 days of reporting sick. Therefore, LTSA was defined as sickness absence lasting 42 days or longer.

Based on a consultation with a sick-listed employee, the OP records a diagnostic code derived from the 10th International Classification of Diseases (ICD-10) in the OHS register. Mental LTSA was defined as LTSA with diagnostic codes of the ICD-10 chapter V (Mental and Behavioral Disorders). Mental LTSA in the 12 months prior to the occupational health survey was used for the predictor variable ‘prior mental LTSA’. Mental LTSA during 1-year follow-up was used as the outcome variable.

Predictors

The predictor variables were measured with the same items and scales as in the development study (van Hoffen et al. 2019). Gender and the number of years employed at the company were retrieved from the occupational health survey questionnaire.

Work pace (five items, Cronbach’s α = 0.85), role clarity (five items, α = 0.84), cognitive demands (five items, α = 0.82), learning opportunities (four items, α = 0.86), and co-worker support (three items, α = 0.87) were measured with the Questionnaire on the Experience and Evaluation of Work (van Veldhoven et al. 2002). Survey participants responded on a five-point frequency scale ranging from ‘never’ (= 1) to ‘always’ (= 5) and item scores were summed to a total subscale score, which was then divided by the number of items in the scale. Consequently, all psychosocial work characteristics had a score range between 1 (= low) and 5 (= high).

Social support from family and friends was assessed with three items (Can you count on the support of partner/family/friends when you have some difficulty at work? Is work at home taken out of your hands if you are busier at work? Do you feel appreciated by your partner/family/friends?; α = 0.78). Survey participants responded on a five-point frequency scale ranging from ‘never’ (= 1) to ‘always’ (= 5) and item scores were summed and averaged so that social support from family/friends ranged between 1 (= low) and 5 (= high).

Work satisfaction was measured with six items (α = 0.91) about pleasure in work (e.g., I am pleased to start my day’s work; I find my work stimulating; I enjoy my work). Responses were given on 5-point frequency scales ranging from ‘never’ (= 1) to ‘always’ (= 5). Items scores were summed and averaged so that work satisfaction ranged between 1 (= low) and 5 (= high).

Distress was measured with the Four-Dimensional Symptom Questionnaire (4DSQ), which was included in the occupational health survey questionnaire. The distress scale consisted of 16 items addressing symptoms elicited by stressors or the efforts to maintain psychosocial functioning, such as worry, irritability, tension, listlessness, poor concentration, sleeping problems and demoralization (Terluin et al. 2004). Survey participants were asked if they experienced these symptoms in the past week, ‘no’ (= 0), ‘sometimes’ (= 1), ‘regularly’ (= 2), ‘often’ (= 2), or ‘very often/constantly’ (= 2). Item scores were summed (score range 0–32; Cronbach’s α = 0.94) so that higher scores reflected higher levels of distress. Terluin et al. (2008) defined scores ≤ 10 as low, 11–20 as moderate, and > 20 as high distress.

Statistical analysis

Statistical analyses were done in IBM SPSS Statistics for Windows, version 24 (released 2016; IBM Corp. Armonk, NY).

Missing data

Of the 1736 occupational health survey participants, 116 (7%) had missing data on role clarity and social support from family/friends. The missing data were imputed in SPSS by using series means. Marital status was not available from the occupational health survey questionnaire and was therefore excluded from the prediction model.

External validation of the regression model.

The regression coefficients of gender, years employed at the company, role clarity, cognitive demands, learning opportunities, co-worker support, social support from family/friends, work satisfaction, and distress from the development setting were combined with the predictor values of the validation setting. As all employees worked in mobility services, the economic sector was constant.

External validation of the decision tree

The decision tree was based on the development study, using distress categories (low, moderate, high), gender, work satisfaction and work pace as predictor variables. In accordance with the development study, work satisfaction was dichotomized into low (≤ 3.3) and high (> 3.3). Likewise, work pace was dichotomized into low (≤ 3.8) and high (> 3.8).

Discrimination by regression model and decision tree

Discrimination between participants with and without mental LTSA during follow-up was evaluated by receiver operating characteristic curve (ROC) analysis, using the probabilities estimated by the prediction model and the decision tree. The area under the ROC-curve (AUC) represents discrimination between employees with and without mental LTSA during follow-up. AUC < 0.60 represents failing, 0.60–0.69 poor, 0.70–0.79 fair, 0.80–0.89 good, and 0.90–1.00 perfect discrimination. The AUCs were compared by using the non-parametric Wilcoxon statistic according to Hanley and McNeil (1982).

For the decision tree, risk groups were defined according to the development study. For the regression model, cut-off points set at 0.5, 1.0, 1.5, and 2.0 times the population mental LTSA risk were examined in more detail.

Results

The 1736 occupational health survey participants had a mean age of 46.1 (standard deviation [SD] = 10.1) years; they worked on average 36.4 (SD = 7.3) hours per week as technicians (50%), office workers (43%), or shop assistants (7%). Table 1 shows the scores on the predictor variables of employees with and without mental LTSA during 1-year follow-up. Employees with mental LTSA had lower scores on learning opportunities, support from co-workers, support from family/friends and work satisfaction.

Table 1 Population characteristics of occupational health survey participants (n = 1736)

Full size table

At 1-year follow-up, 51 (3%) participants had mental LTSA: n = 34 stress-related disorders, n = 8 depressive disorders, n = 4 anxiety disorders and n = 5 other psychiatric disorders.

Validation of the prediction model

The regression model discriminated (AUC = 0.700; 95% CI 0.628–0.773) between participants with and without mental LTSA during follow-up. This implicates that for each random pair of participants, the prediction model correctly assigned the highest risk to the participant with mental LTSA during follow-up in 70.0% of the cases.

At a cut-off risk 0.5 times population risk, most participants (n = 47) with mental LTSA would be identified, at the cost of inviting 76% of all survey participants to preventive consultations (Table 2). At a cut-off risk 1.5 times the population risk, 17% of all survey participants would be invited for preventive consultations, but only 20 of 51 participants with mental LTSA would be identified.

Table 2 Cut-off points for risk of mental LTSA

Full size table

Validation of the decision tree

The decision tree correctly assigned the highest risk of mental LTSA in 67.1% of the cases (AUC = 0.671; 95% CI 0.589–0.753). Although lower, the AUC did not differ significantly (p = 0.62) from that of the regression model. Survey participants with low, moderate and high distress scores had a 1.7%, 5.3% and 7.5% mental LTSA risk, respectively (Fig. 1). Among survey participants reporting low distress, there was no substantial gender difference in mental LTSA risk. Among women experiencing moderate distress, those reporting low work pace had a higher mental LTSA risk than those reporting high work pace.

Discussion

The present study externally validated the ability of a previously developed prediction model and decision tree to identify occupational health survey participants at increased risk of mental LTSA in the year following the survey. Both the prediction model and the decision tree poorly discriminated between survey participants with and without mental LTSA at 1-year follow-up. Discrimination by the prediction model including nine predictor variables was not better than discrimination by a simpler decision tree based on distress, gender and work satisfaction.

It is easier for OHPs to use the decision tree because it readily shows the risk groups. Despite the ‘pruning’ in the development study, the decision tree was not stable. For example, women reporting moderate distress and low work pace had the highest risk of mental LTSA, whereas in the development study those with high work pace had the highest mental LTSA risk (van Hoffen et al. 2019). It should be noted that in the present study the number of women reporting high work pace was limited (n = 22). If the decision tree was re-estimated work pace would not partition the current study population. Only distress and work satisfaction would split the present population into risk groups [data not shown]. Although the decision tree requires further validation, we assume it will apply to new samples of occupational health survey participants because distress, gender and work satisfaction were significant splitting variables in both the development and the validation study.

Strengths and weaknesses of the study

External validation studies are necessary to evaluate the generalization of risk predictions by prediction models and decision trees. The use of a new sample of occupational health survey participants is an asset of the present study since it provides insight whether predictions hold true in subjects working in a different setting and in a different time frame (Steyerberg and Harrell 2016). The prospective design and the use of recorded OP-certified mental LTSA are further strengths of the study.

Marital status was not available from the occupational health survey questionnaire. Consequently, it was not possible to externally validate the original regression model. Marital status was the least strong predictor in the development study (van Hoffen et al. 2019) and therefore it is unlikely that excluding marital status as predictor variable has substantially weakened the quality of the prediction model. The AUCs of the validated prediction model (AUC = 0.700) and decision tree (AUC = 0.671) were lower but of the same magnitude as those in the development study (AUC = 0.713 and AUC = 0.709, respectively). Although the prediction model and decision tree were internally validated at development, the poorer discrimination might still be indicative of over-optimistic predictions in the development study. Poorer discrimination may also have been caused by the fact that all participants in the present study worked in the same economic sector.

Practical implications

Discrimination by the prediction model was better than discrimination by chance, but an AUC of 0.70 is not sufficient for using the prediction model in occupational healthcare practice, all the more because there is no optimal cut-off point to decide who is at increased risk of mental LTSA. For high-risk cut-off points, sensitivities are dangerously low, whereas for low-risk cut-off points the number of false-positives is approximately 20-fold the number of true positives. As a result, many survey participants may be incorrectly stigmatized as future mental health patients. Thus, the prediction model should not be used to identify survey participants for actions or measures aimed at preventing mental LTSA.

Nevertheless, mental LTSA risk estimates can be used to determine whether or not to address mental LTSA risk factors in the preventive OHP consultations after completing the occupational health survey questionnaire. For that purpose, the decision tree is more user-friendly than the prediction model, because it is easier to interpret and readily shows the mental LTSA risk groups. When consulting a survey participant who experiences low distress, the OHP may not be inclined to address mental LTSA risk factors in the preventive consultation. However, the decision tree showed that women reporting low distress levels have a two-fold risk of mental LTSA as compared to the total population when they experience low work satisfaction. In that case, it would important to explore mental LTSA risk factors, even if distress levels are low.

It is obvious to pay extra attention to mental LTSA risk factors in preventive OHP consultations when survey participants report high distress levels. Apart from addressing psychosocial stressors and coping strategies, it may be important to explore work satisfaction as the decision tree showed that survey participants with high distress and low work satisfaction have a three-fold higher mental LTSA risk than the population, as compared to a two-fold higher risk among those with high distress who are highly satisfied with their work.

At the moment, the decision tree is introduced in the OHS as a tool to target preventive consultations to mental LTSA risk factors if appropriate. It would be interesting to investigate if OHPs use the decision tree and whether or not this results in more actions (e.g., coaching by social workers or preventive treatment by psychologists) to prevent mental LTSA.

References

Hanley JA, McNeil BJ (1982) The meaning and use of the area under a Reicever Operating Characteristic (ROC) curve. Radiology 143:29–36
Article CAS Google Scholar
Henderson M, Harvey SB, Øverland S, Mykletun A, Hotopf M (2011) Work and common psychiatric disorders. J R Soc Med 104:198–207
Article CAS Google Scholar
Loh WY (2014) Fifty years of classification and regression trees. Int Stat Rev 82:329–348
Article Google Scholar
Moons KG, Altman DG, Reitsma JB, Collins GS (2015) Transparent reporting of a multivariable prediction model for individual prognosis or diagnosis (TRIPOD): explanation and elaboration. Ann Intern Med 162:W1–73
Article Google Scholar
OECD (2015) Fit mind, fit job. From evidence to practice in mental health and work. OECD Publishing, Paris
Book Google Scholar
Roelen CA, Hoedeman R, van Rhenen W, Groothoff JW, van der Klink JJ, Bültmann U (2013) Mental health symptoms as prognostic risk markers of all-cause and psychiatric sickness absence in office workers. Eur J Public Health 24:101–105
Article Google Scholar
Steyerberg EW, Harrell FE (2016) Prediction models need appropriate internal, internal-external and external validation. J Clin Epidemiol 69:245–247
Article Google Scholar
Stiglic G, Kocbek S, Pernek I, Kokol P (2012) Comprehensive decision tree models in bioinformatics. PLoS ONE 7:e33812
Article CAS Google Scholar
Terluin B, van Rhenen W, Schaufeli WB, de Haan M (2004) The Four-Dimensional Symptom Questionnaire (4 DSQ): measuring distress and other mental health problems in a working population. Work Stress 18:187–207
Article Google Scholar
Terluin B, Terluin M, Prince K, van Marwijk H (2008) The Four-Dimensional Symptom Questionnaire (4 DSQ) detects psychological problems. Huisarts en Wet 51:251–255
Article Google Scholar
Van Hoffen MFA, Joling CI, Heymans MW, Twisk JW, Roelen CA (2015) Mental health symptoms identify workers at risk of long-term sickness absence due to mental disorders: prospective cohort study with 2-year follow-up. BMC Public Health 15:1235
Article Google Scholar
Van Hoffen MFA, Twisk JWR, Heymans MW, De Bruin J, Joling CI, Roelen CAM (2016) Psychological distress screener for risk of future mental sickness absence in non-sicklisted employees. Eur J Public Health 26:510–512
Article Google Scholar
Van Hoffen MFA, Norder G, Twisk JWR, Roelen CAM (2019) Development of prediction models for sickness absence due to mental disorders in the general working population. J Occup Rehabil. https://doi.org/10.1007/s10926-019-09852-3
Article Google Scholar
Van Veldhoven MV, Jonge JD, Broersen S, Kompier M, Meijman T (2002) Specific relationships between psychosocial job conditions and job-related stress: a three-level analytic approach. Work Stress 16:207–228
Article Google Scholar

Download references

Author information

Authors and Affiliations

Department of Research and Development, Human Total Care, Utrecht, The Netherlands
Marieke F. A. van Hoffen & Giny Norder
Department of Epidemiology and Biostatistics, Amsterdam Public Health Research Institute, Amsterdam UMC, Location VU University Medical Center, Amsterdam, The Netherlands
Marieke F. A. van Hoffen, Jos W. R. Twisk & Corné A. M. Roelen
Department of Health Sciences, University of Groningen, University Medical Center Groningen, Groningen, The Netherlands
Corné A. M. Roelen
HumanCapitalCare, Laan van Nieuw Oost-Indië 133-G, 2593 BM, Den Haag, The Netherlands
Marieke F. A. van Hoffen

Authors

Marieke F. A. van Hoffen
View author publications
You can also search for this author in PubMed Google Scholar
Giny Norder
View author publications
You can also search for this author in PubMed Google Scholar
Jos W. R. Twisk
View author publications
You can also search for this author in PubMed Google Scholar
Corné A. M. Roelen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Marieke F. A. van Hoffen.

Ethics declarations

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical approval

All procedures performed in studies involving human participants were in accordance with the 1964 Helsinki declaration and its later amendments and the ethical standards of the Medical Ethics Review Committee of the VU University Medical Center (reference 2019.473).

Informed consent

Informed consent was obtained from all individual participants included in the study.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

van Hoffen, M.F.A., Norder, G., Twisk, J.W.R. et al. External validation of a prediction model and decision tree for sickness absence due to mental disorders. Int Arch Occup Environ Health 93, 1007–1012 (2020). https://doi.org/10.1007/s00420-020-01548-z

Download citation

Received: 08 October 2019
Accepted: 24 April 2020
Published: 11 May 2020
Issue Date: November 2020
DOI: https://doi.org/10.1007/s00420-020-01548-z

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

External validation of a prediction model and decision tree for sickness absence due to mental disorders

Abstract

Purpose

Methods

Results

Conclusion

Similar content being viewed by others

Development of Prediction Models for Sickness Absence Due to Mental Disorders in the General Working Population

External Validation and Update of a Prediction Rule for the Duration of Sickness Absence Due to Common Mental Disorders

Predicting long-term sickness absence among employees with frequent sickness absence

Introduction

Methods

Study population and design

Outcome: long-term sickness absence (LTSA)

Predictors

Statistical analysis

Missing data

External validation of the regression model.

External validation of the decision tree

Discrimination by regression model and decision tree

Results

Validation of the prediction model

Validation of the decision tree

Discussion

Strengths and weaknesses of the study

Practical implications

References

Author information

Authors and Affiliations

Corresponding author

Ethics declarations

Conflict of interest

Ethical approval

Informed consent

Additional information

Publisher's Note

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation