Introduction

With an aging workforce (Douwes et al. 2014; Duin and Stoeldraijer 2013; Sas and Suarez 2014), the need increases to preserve health and wellbeing as employees work until retirement age. Ageing leads to an increased risk of developing chronic diseases (Sas and Suarez 2014). Chronic diseases in ageing workers are associated with decreased workability (Leijten et al. 2014) and health issues are the most common reason for leaving the workforce before the statutory retirement age (Perosh 2014). Workers with long sickness absence are more likely to exit the labor force through disability benefits and unemployment (Reeuwijk et al. 2015).

In the Netherlands, the main reasons for calling in sick longer than 3–4 weeks are psychological (33%) and musculoskeletal (28.9%) problems. In the group of ‘other diagnoses’ (38.1%), the main reasons for long term sick-leave are coronary heart disease (4.9%), intestinal (4.8%) and neurological (4.7%) problems (Volksgezondheidenzorg.info 2017). Between 2007 and 2013, the number of absent days due to concentration problems and fatigue increased. When duration of sick-leave is accounted for, musculoskeletal problems, psychological problems, including burnout and being overworked account for 40% of the total sick-leave volume. Early identification of these potentially modifiable health problems and prevention initiatives could decrease sick-leave and improve sustained employability (Leijten et al. 2014).

Workers’ Health Assessments (WHA) are used to identify specific work-related risks in order to prevent injuries or disease to promote sustained employability (WHO 2002; van Holland et al. 2015; Weel et al. 2007). A periodic medical examination can be part of a WHA (Aldana et al. 2005). Health status and work-related problems are often assessed with biometrical measurements and questionnaires. Biometrical measurements can be used to assess risk for metabolic syndrome, diabetes and coronary heart disease. Self-report measures often include the Work Ability Index (WAI), which aims specifically at determining workability (Ilmarinen 2009) and includes questions on neurological, intestinal and coronary heart diseases. A large number of other questionnaires may be used to assess the physical and mental health status and associated risks, as well as perceived work strain. All parameters together provide information about the health and sustained employability status of the employee. However, the construct of sustained employability is difficult to measure and more objective measures may increase the chances for finding workers with health-related threats to sustained employability.

One biometrical concept that may be related to sustained employability is heart rate variability (HRV). HRV is associated with physical and mental health issues (Beauchaine and Thayer 2015; Geisler et al. 2013; McCraty et al. 1998; Wulsin et al. 2015) and, in turn, potentially to sustained employability. HRV is a measure of the consecutive differences in time between heartbeats. The oscillation in the beat-to-beat interval of heart rate is the result of hormonal, neural and mechanical interactions of local and central systems (heart rate variability: standards of measurement, physiological interpretation and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology 1996). HRV is a marker for functioning of the autonomic nervous system (Shaffer et al. 2014). The time variations between consecutive heartbeats in a healthy heart are large and reflect the ability of the brain, autonomic nervous system and heart to adapt to changing circumstances. HRV declines with age (Umetani et al. 1998), but lower HRV values are also an independent predictor of decreased health (Dekker et al. 2000). Low HRV is indicated as a predictor for metabolic syndrome and cardiovascular diseases (Dekker et al. 2000; Liao et al. 1998; Licht et al. 2013; Thayer and Lane 2007; Tsuji et al. 1996; Wulsin et al. 2016). Low HRV has been observed in patients with diabetes, hypertensive cardiac hypertrophy and atherosclerosis. Low HRV is also associated with lower self-regulation and resilience (Geisler et al. 2013), with lower mental and social flexibility (Geisler et al. 2013; Williams et al. 2015), was found to be a marker for fatigue (Volker et al. 2016), and is more present in people with clinical burnout compared to healthy controls (Lennartsson et al. 2016). Summarized, a low HRV for one’s age is related to several health issues that may compromise a worker’s sustained employability. There is, however, insufficient knowledge about the additional value of measuring HRV in a working population with the focus on sustained employability.

For HRV, 24-h measurements are considered the gold standard but are expensive and time consuming. However, new technologies now enable HRV measurement in the occupational setting as a potential health screen. With photo plethysmography (PPT) technology, rapid and non-invasive measurements became available. Promising results have been reported for a drastically shortened 1-minute paced deep-breathing protocol (Russoniello et al. 2013). The next step to validate HRV as a potential screening instrument for future SE will be to explore associations of HRV with health related issues. Therefore, the objectives of this study are to: (1) compare the WHA outcomes of workers with a very low, low and normal HRV; and (2) examine associations between a 1-minute HRV measurement and health related outcomes derived from a WHA.

Methods

Study Design and Setting

A cross-sectional design was carried out. In the Netherlands, employers of moderate or large companies are required to offer their workers a voluntary health assessment, performed by an independent occupational healthcare supplier (OHS). Data from WHAs that were part of a regular health screening service were collected between November 2015 and June 2016 and explored in relation to HRV. Three OHS’s performed the measurements examined in the current study. The Ethics Board at the University Medical Center Groningen in The Netherlands decided that formal approval of the study was not necessary because all workers were subjected to care as usual only.

Participants

Workers from divisions of 14 companies in the Netherlands were included. All workers were invited to participate. The company was responsible for the invitation process. Data of workers between 18 and 65 years were used. Pregnant women and workers with severe health problems who were currently absent from work and workers with pacemakers were excluded from this study.

Data Collection

The WHA data were collected using two procedures. The first was an online questionnaire completed at home (30–45 min) prior to assessment. The second procedure was a physical assessment that included biometric measurements collected by a physician assistant or occupational physical therapist (OPT) at the company (25 min). Immediately after the biometric measurements, there was a counseling session with an OPT (25 min). The time between answering the questionnaire and the biometry measurements varied between 2 weeks and 1 h. For HRV, raw data were collected during the test in the emWave® Pro Plus software (Heartmath Institute, Boulder Creek). The participants were asked not to eat or drink for 45 min prior to the appointment (which was 60 min prior to the HRV measurements), and to avoid heavy physical work during the last hour.

Measurements

Questionnaires

Personal (age, sex, level of education) and work characteristics [working hours per week, number of years employed, workload [categories of the Dictionary of Occupational Titles (DOT)] were registered. In the DOT, occupations are classified into five categories of physical workload, based on intensity and duration of lifting or carrying needed for the job: sedentary, light, medium, heavy/very heavy. Because very heavy work hardly exists in the Netherlands, the last two categories were combined (Soer et al. 2009).

The following questionnaires were administered for the study:

  1. 1.

    Work Ability was measured with the short version of the Work Ability Index (WAI). It consists of seven items (Tuomi et al. 1998). The WAI has been shown to be valid for determination of sustained employability (Alavinia et al. 2008; de Zwart et al. 2002; Radkiewitz and Widerszal-Bazyl 2005) and is internally consistent [Cronbach’s alpha 0.72–0.80 (Tuomi et al. 2004)]. The scale ranges between 7 and 49, with higher scores indicating better workability. WAI can be used to identify workers at high risk for prolonged sickness absence (Reeuwijk et al. 2015).

  2. 2.

    Work engagement was measured with the Utrecht Work Engagement Scale-9 (UWES-9) (Schaufeli et al. 2006; Schaufeli and Bakker 2003). The UWES-9 consists of nine questions and assesses ‘work engagement’. It consists of three subscales, measuring dedication (three questions), vigor (three questions), and absorption (three questions), with a 0–6 point scale per question. The total scale ranges between 0 and 54, with a higher score indicating more engagement. Psychometric properties of the questionnaire are good (Schaufeli et al. 2006; Schaufeli and Bakker 2003).

  3. 3.

    Psychological problems were measured with the General Health Questionnaire-12 (GHQ-12) (Koeter and Ormel 1991; Sluiter and Hulshof 2013), consisting of 12 questions with 4 possible answers; 2 are valued positive (score = 0) and 2 negative (score = 1). The scale ranges between 0 and 12, with a lower score indicating less functional psychological problems.

  4. 4.

    Perceived Workload was measured with subscales of the Questionnaire on the Experience and Evaluation of work (QEEW, in Dutch: VBBA) (Veldhoven et al. 2002): need for recovery (NfR, 11 items)—Workpace (Wp, 11 items)—perceived mental strain (MS, 9 items). Subscale scores range between 0 and 100, and a lower score indicates more favorable situations. Reliability is satisfactory (Veldhoven et al. 2002).

  5. 5.

    A lifestyle questionnaire asking about physical activity, smoking, alcohol, nutrition and recreation. The questionnaire is descriptive in nature, based on guidelines (RIVM 2016), and has not been validated as an outcome measure. The results are scored with an ordinal scale 1–3, with a lower score indicating healthier behavior.

Detailed information about the full protocol and interpretation of results is based on guidelines as presented in Appendix 1.

Heart Rate Variability

HRV was measured with the ear lobe pulse sensor of the emWave® Pro Plus. The emWave® Pro Plus uses photo pletysmography (PPG) technology, which is based on the ability of hemoglobin to absorb light and can measure the blood pulse wave through the skin. PPG technology is a reliable and valid method of capturing and quantifying HRV from a deep breathing test (Russoniello et al. 2013). The HRV measurement was recorded individually in a quiet room, in a slightly reclined (~ 10°) seated position. The sensor was placed on the earlobe and the participant was introduced to the emWave® Pro Plus software. Participants were instructed to remain seated and relaxed and to refrain from making any significant or rapid body movements. Participants were instructed to breathe according to the 1-minute, 6-breath protocol that was paced at a rhythm of 6 breaths per minute (0.1 Hz) while breathing as deeply as they comfortably could. This breathing method was designed to provide a physiological challenge to assess the maximum HRV range (amplitude) their system was capable of producing at that time (Shields 2009). A visual breath-pacer is used to facilitate the regularity of the breathing. The entire minute should be artefact-free. Compliance was closely monitored to ensure sufficient deep breathing and synchronization with the breath pacer. If quality was doubted, a second test was performed.

HRV is usually expressed by either time- or frequency-domain parameters (heart rate variability: standards of measurement, physiological interpretation and clinical use. Task Force of the European Society of Cardiology and the North American Society of Pacing and Electrophysiology 1996). For exploring the added value of HRV for a WHA, the mean heart rate range (MHRR) was used as outcome. The MHRR is a time domain variable indicating the magnitude of the amplitudes in acceleration and deceleration of the heart. The range of the interbeat intervals is measured and expressed in the average change in beats per minute (BPM). For each breathing cycle, the maximum in BPM is calculated and averaged over the 6 breaths to obtain the MHHR. MHRR has been suggested as a good indicator of autonomic nervous system health and biological age (Russoniello et al. 2013; Shields 2009) with the practical advantage that it can be explained easily to participants. Pearson correlation between MHRR and another regularly used measure, Root Mean Square of Successive Differences (RMSSD), was calculated to confirm the correlation between MHRR and RMSSD in this study compared to other studies.

Biometric Measures

Besides HRV, other biometric measures were: weight (kg), body length (m), waist circumference (cm), systolic and diastolic blood pressure (mmHg), glucose (mmol/l), and total cholesterol (mmol/l). BMI was also calculated (weight/length2) (see Appendix 1 for full protocol and cut off values).

Data Handling

The EmWave Pro Plus® software calculated MHRR. All data from the questionnaires, biometry, and HRV were entered and processed in a software program from the OHS. All WHA data were compiled into an Excel file and de-identified by the OHS. The data were then sent to the researchers. Only data with a complete set for WHA and HRV were used for statistical analyses. For descriptives, the data were divided into subgroups because literature indicates low HRV is related to health problems (Dekker et al. 2000; Geisler et al. 2013; Lennartsson et al. 2016; Liao et al. 1998; Licht et al. 2013; Shaffer et al. 2014; Thayer and Lane 2007; Tsuji et al. 1996; Volker et al. 2016; Williams et al. 2015; Wulsin et al. 2016). In absence of validated meaningful cut-off values, statistical criteria were used to determine cut-off values. Cut-off values are provided on the left side of the normal distribution of MHRR of a healthy reference group (proprietary, Emwave Pro Plus® software): very low HRV when MHRR > 2 SD below mean, low HRV when MHRR is 1–2 SD below mean, normal-high HRV when the deviation is higher than 1 SD below mean. Categorization of HRV was age corrected, because most variance of HRV is explained by age and we are interested in the contribution of other measures.

Statistical Analyses

First, to obtain insight into which levels of the WHA measures were observed in the workers with very low, low or normal-high HRV, descriptive results are presented for each group. Age was normally distributed and described with mean and standard deviation. The other personal and work characteristics and WHA data are provided with medians and minimum–maximum for ordinal and for skewed continuous data, frequencies and percentages for nominal data and the lifestyle-questionnaires data (because they have only three categories). Secondly, differences between groups were tested. An Anova was used for age. The other continuous variables were not normally distributed and therefore the non-parametric Kruskall–Wallis H test was used. Then all single associations between raw MHRR and raw WHA measures were calculated with a Spearman correlation. The 95% confidence interval (CI) around r was calculated with a bootstrap accelerated procedure (1000 ×). If the 95% CI did not cross zero, the association between MHRR and the WHA measure was considered significant. Finally a multivariable analysis was performed to explore the shared variance of combined measures (independent variables) to MHRR (dependent variable). The WHA measures with a significant single correlation with MHRR were included as independent measures for backward regression analyses. Continuous skewed data were 10log-transformed to meet the assumptions for linear statistics and dummy variables were made for categorical data. A best fit was found with the independent measures that contributed significantly to the model (p < 0.05). Adjusted R2 was calculated for cross-validation of the model. All results were statistically analyzed with IBM SPSS Statistics 24.0 software.

Results

Participants

Participation rate was 70–100% in the 14 included companies. 1420 Workers participated in the WHA; 155 were excluded from the study because they did not meet the inclusion criteria and 388 had incomplete WHA data, resulting in 877 participants included in the study. The majority of participants were male (77%), mean age was 43.4 years (SD = 10.2, range = 19–65). Most participants (44.0%) performed work with moderate physical demands (DOT3) and had a high education level (49.3%). Age (F(2,874) = 12.39, p = 0.00) and years of employment (H(2) = 8.87, p = 0.01) differed significantly between HRV groups; the post hoc Bonferroni and Mann–Whitney U test revealed that the group with normal-high MHRR was significantly younger than the other two groups and employed shorter than the low MHRR group. Characteristics of participants and work are presented in Table 1, divided by HRV (MHRR) category.

Table 1 Participants and work characteristics, divided by heart rate variability (MHRR) outcome

WHA Results

The WHA measures are presented in Table 2, divided by HRV category.

Table 2 Results of the Workers Health Assessment (WHA) measures, divided by heart rate variability (MHRR) outcome

Mean Heart Rate Range

In the studied population, when controlled for age 4.0% of the workers had a very low HRV, 5.5% a low HRV, and 90.5% had a normal-high HRV (Table 2).

Questionnaires

Median workability was good (> 41) in all groups. All groups reported being engaged, had few problems regarding need for recovery, psychological problems, or work pace. Mental strain was reported high in all groups. Differences between groups are small and insignificant with large variations within groups. Observation of the lifestyle results show that in the group with normal-high HRV, 53.4% were not physically active enough, while in the group with very low HRV this was 45.7%. Alcohol consumption and smoking were reported as a normal risk by most of the workers (more than 80% in all groups) and only 2.2% were classified in the high risk alcohol category. For the statistical comparison the moderate and high health risk of lifestyle factors were collapsed, to ensure cells were filled with more than five participants. Although some small differences in lifestyle were observed, these were statistically insignificant (H(2) = 3.37, p = 0.20 (recreation) to H(2) = 0.53, p = 0.79 (smoking)), including physical activity (H(2) = 1.51, p = 0.48).

Biometry

Differences in biometry between groups were small. BMI was significantly different (H(2) = 7.22, p = 0.03) and waist circumference was borderline significantly different (H(2) = 5.96, p = 0.05) between groups. A post hoc Mann–Whitney U test revealed that only the very low HRV group (median BMI = 26.3) differed significantly from the normal-high HRV (median BMI = 25.4) group for BMI (U = 11140.5, z = − 1.99, p = 0.047). All groups on average were overweight.

Correlations

The continuous parameters were skewed and a non-parametric Spearman test was used. Correlation between RMSSD and MHRR was r = 0.66 (p < 0.01). Because the residuals did not meet the assumptions of homoscedasticity, the 95% CI around r was created with a robust Bias-Corrected and accelerated bootstrap procedure of 1000 samples (BCa 95%). The Spearman correlation was small but significant for most of the biometry measures (diastolic and systolic blood-pressure, cholesterol, BMI, waist circumference), the workability index, physical activity, and alcohol consumption, but not for the other measures (Table 3). MHRR decreases when more physical activity is reported, the association with alcohol was negative. In line with previous literature (Almeida-Santos et al. 2016; Koenig et al. 2015), age and MHRR correlated r = 0.40 (p < 0.01) in our study, which confirms the necessity to control for age in the regression analyses (Table 4).

Table 3 The correlation of raw MHRR with parameters of the WHA
Table 4 Results of the linear regression analyses with a backward procedure

Regression Analyses

BMI, waist circumference, blood-pressure (systolic and diastolic), cholesterol, WAI, physical activity and alcohol consumption were entered in the linear regression model with the BCa 95% procedure, controlled for age. MHRR and biometry were 10log-transformed. The WAI was treated as a continuous scale because dummy variables would be very uninterpretable with the wide scoring range (7–49) of the WAI. Reversed WAI data were 10log-transformed.

In the first model the nine variables are not significantly associated with HRV when entered together. BMI and waist-circumference both showed VIF values greater than 3 and systolic and diastolic blood-pressure showed VIF values greater than 2, indicating collinearity. With the backward procedure, in each step the least significant variable is excluded from the model until a model is found with only significant contributing variables and acceptable collinearity. Appendix 2 shows all steps of the backward procedure.

In the final model (adjusted R2 = 20.8%) with only significant contributing variables, 21.1% of MHRR was explained by age, BMI and Physical activity. Physical activity was coded as zero for healthy physical activity behavior and one for moderate and unhealthy physical activity behavior. BMI appeared to be a better predictor for MHRR than waist circumference.

In the final model, MHRR was significantly lower in aging workers with a higher BMI and healthy physical behavior.

Discussion

The aim of this study was to explore associations between a novel compressed measure of HRV and other health related parameters, as a first step to studying the added value of HRV in a WHA. In this cross-sectional study, the very low HRV group was older and longer employed than the high HRV group. Because the classification was age corrected, a difference in age was remarkable. Although a forced breathing protocol was found to be a reliable and sensitive measure previously (Russoniello et al. 2013; Shields 2009), more studies in the occupational setting could contribute to stronger evidence for HRV reference values for age. Analyzing the raw MHRR data revealed significant univariate correlations between HRV and age, workability, blood pressure, cholesterol, BMI, waist circumference, physical activity and alcohol consumption. A high HRV was correlated to preferable health outcomes, except for physical activity. These results appear quite consistent with others (Almeida-Santos et al. 2016; Koenig et al. 2015) who also reported similar relations between HRV, age, and being overweight, although the strength of the associations may vary and the most dominant factors may differ [e.g. BMI in our study and waist circumference in others (Koenig et al. 2015)]. The result that HRV was lower when workers reported to be more physically active, controlled for age, was not expected. Others have reported that physical training had no or a positive effect on HRV on the long term (Amano et al. 2001; Jurca et al. 2004; Loimaala et al. 2000). In previous intervention studies (Amano et al. 2001; Jurca et al. 2004; Loimaala et al. 2000), exercise level was strictly controlled and, therefore, internal validity of these measurements is likely higher than internal validity of the self-report questionnaire in our study; Participants at risk may have provided preferable responses to our questionnaire regarding physical activity, biasing results. However, the observed association with physical activity was borderline significant and small and clinical relevance is questionable.

No significant association was observed between mental and psychological problems and MHRR. This result is not supported by others (Beauchaine and Thayer 2015; Geisler et al. 2013; Lennartsson et al. 2016; Thayer et al. 2009; Williams et al. 2015; Zahn et al. 2016), and might be attributed to floor and ceiling effects. In the current practical setting where predominantly healthy workers were assessed and prevalence of dysfunction in all the measured constructs was low, except for mental strain. This questionnaire however was not discriminative, since most workers scored very high on experienced mental strain (ceiling effect). With low prevalence and little variation, a significant association is less likely to be detected (Altman 1991).

We used MHRR as a measure for HRV because it is easier to explain to workers than the abstract time domain parameter RMSSD. However, RMSSD has been studied in most previous research examining correlations with psychological and mental problems (Beaumont et al. 2012; Geisler et al. 2013; Thayer et al. 2009; Williams et al. 2015; Zahn et al. 2016). In the current study, the correlation between RMSSD and MHRR was high (r = 0.66) and similar to other studies. The choice of MHRR as the HRV measure, therefore, is not expected to account for our lack of correlations observed. A post hoc study with RMSSD as dependent variable confirmed this assumption. To our knowledge this is only the second study (Russoniello et al. 2013) to examine MHRR assessed by a 1-minute deep paced-breathing protocol, and the first study in the context of WHA. More research would strengthen any conclusions.

The positioning of the lifestyle measures was different from the other measures. An unhealthy lifestyle is thought to precede health problems, while the other measures are already signs of a health risk. When a person has an unhealthy lifestyle, it is possible that the influence on the body and the autonomic nervous system is not yet measurable. Although we used the best available lifestyle questionnaires, these are not validated which limits our ability to draw strong conclusions. For example, reported alcohol consumption seemed to be very low in our high risk group (2.2%), compared to previous studies of the general population [7.3% (female)–10.4% (male) heavy drinkers in the Netherlands in 2016 (RIVM 2017)]. This again could indicate preferable answering by our participants.

Although the cut-off values were in line with the finding in previous literature that a low HRV level is related to health risks (Dekker et al. 2000; Geisler et al. 2013; Lennartsson et al. 2016; Liao et al. 1998; Licht et al. 2013; Shaffer et al. 2014; Thayer and Lane 2007; Tsuji et al. 1996; Volker et al. 2016; Williams et al. 2015; Wulsin et al. 2016), the estimated cut-off values were not based on meaningful diagnostic outcomes. The protocol used is relatively new and the necessary data are not available for sustained employability yet. We used the categorization to describe group characteristics, which might have been different with other cut-off values based on ROC curves. The categorization was not used in correlation and multivariable analyses, and had, therefore, no influence on the main study results.

A general point of discussion is that we compared subjective self-report measures with the objective measure, HRV. Although others (Jarczok et al. 2015) reported a relationship between HRV and self-reported health, this was not confirmed in the current study. Previous research on other health and functioning aspects (Bieleman et al. 2009; Reneman et al. 2002) has shown that self-report and physical measures do not correlate well, even when they aim to measure the same construct. This would support the idea that HRV (MHRR) could measure a different aspect of risk for psychological and mental health than the questionnaires. A longitudinal design should be applied to study the value of HRV as a screener for health problems and sustained employability.

This is the first study to explore a 1-minute paced deep-breathing HRV protocol as an objective screening measure for multiple future health issues in a working population. A large sample was included, which enhances external validity. The WHA protocol content was based on expert knowledge but limited by practical restrictions such as time constraints within the OHS encounters. Although from a theoretical point of view this might not be ideal, it can be argued that this is the way it works in actual practice and our design contributes to external validity of results. On the other hand, selection bias cannot be ruled out. The employers invited the participants to the WHA. Participation rate was supplied by the OHS and non-response numbers were not available. In this study, the focus was limited to finding cross sectional relationships and therefore the predictive value of HRV for future health had to be studied in relation to other predictors, and not to an outcome of future health. Since we studied an apparently healthy population, low prevalence of actual health problems might account for the low correlations observed. With a longitudinal study and a health behavior outcome measure (sick-leave or sustained employability), the predictive value of HRV in a WHA can be studied and cut-off values could be determined for this particular goal.

In conclusion, this study with apparently healthy workers showed that the age controlled, very low HRV group was significantly older, employed longer, less educated, had a higher BMI and larger waist circumference than the group with normal-high HRV. A lower HRV was significantly associated with aging, higher measures of obesity (BMI), and with higher levels of reported physical activity, which provides a first glance of the utility of a 1-minute paced deep-breathing HRV protocol as part of a comprehensive preventive Workers’ Health Assessment. Predictive validity of MHRR should be evaluated longitudinally as part of a screen for actual health outcomes. Based on this cross-sectional study of the value of HRV assessed by a 1-minute paced deep-breathing protocol, caution has to be taken for individual decision making when using only MHRR as a screen for future health and sustained employability.