1 Introduction

This paper aims to analyse the association between workers’ well-being with psychosocial and physical risk factors in the workplace, identifying if—and how much—well-being is differently impacted by each of the two set of risk factors. The main purpose of our study is to build a synthetic measure suitable to generate insights on well-being at work and on individual risk factors.

Using the conceptual framework by the European Union Agency for Occupational Safety and Health (EU-OSHA 2013), we analyse the sixth European Working Conditions Survey (EWCS) data to detect how and to what extent selected determinants may influence subjective well-being, measured as a proxy by the self-assessed health (SAH).

Studies on such topics frequently employ the ratings given by respondents to each item in a Generalized Linear Models (GLM) framework (Agresti 2010). In survey questionnaires as that of the EWCS the items are often numerous, thereby making the identification of relevant determinants challenging. In our study, a selection (based on existing research) of reliable information related to physical and psychosocial risks in the workplace was carried out prior to the analysis, also considering that some of the variables affecting workers’ well-being can overlap, presumably because they measure very similar concepts. Their impacts may therefore become difficult to detect, especially considering that the GLM involve a large number of variables, which are potentially correlated to each other. This circumstance affect the interpretation of the results and may prevent general readership and decision makers to appreciate the role of specific determinants, thus making it difficult to acknowledge any risk assessment and management practices in the workplace.

To deal with this issue, in this paper we employed a Principal Component Analysis (PCA), a strategy to deal with a considerable number of variables and that may sometimes lead to overcome the composite indicators (see, among others, Nardo et al. 2005 and OECD and JRC 2008). The PCA is here carried out to calculate two distinct synthetic indicators, for the physical (PC1.Phy) and the psychosocial (PC1.PS) determinants, respectively. These indicators are then used in a simplified GLM model to analyse their impact on workers’ well-being. In such a way, the contribution of each risk determinant can be more easily evaluated by means of its factor loading in the construction of the PCA.

Our findings display that both categories of risk factors do have a significant impact on workers’ well-being measured by the SAH, although psychosocial risk factors seem to exert a greater effect. Attempts of measuring the interaction between physical and psychosocial risk factors at work and their impact on workers’ well-being, especially using synthetic indicators, appear to be limited in existing research. Thus, our study aims to contribute to fill this research gap.

The paper is organized as follows. Section 2 presents a brief overview of the literature on well-being and working conditions. Section 3 depicts the data employed, the identification of risks’ determinants in the workplace, and the distributions of the individual responses. Section 4 details the proposed empirical strategy to model responses and to build the two synthetic indicators of risk at work. Section 5 describes the implemented models, the proposed synthetic indicators, and discusses the empirical findings. In Sect. 6 some concluding remarks are provided also with respect to policy implications.

2 Well-being at work as a multifaceted concept

Subjective well-being is a multidimensional concept encompassing several related phenomena, including emotional responses, feelings, and overall judgements of life satisfaction and its multifaceted domains, such as family life, job, health, and so on (WHO 2012; Bryson et al. 2014). In line with the initiatives, which existed at the global level since the 1990s, aimed at assessing social and economic progress through measures that go beyond the classical macroeconomic indicators (World Bank 1990), a sizeable literature, also inspired by the seminal work of Stiglitz et al. (2009), has been published in the wake of the “Beyond GDP movement” (see, among others, OECD 2013; Durand 2015; Bacchini et al. 2021). A branch of the literature has mainly investigated the determinants of life satisfaction, as a proxy of the overall well-being, at the individual-level whereas others aim at examining country-specific determinants of life satisfaction (Bjørnskov et al. 2008; Pittau et al. 2010). Furthermore, three main aspects can be classically distinguished within the concept of subjective well-being, as pointed out by Steptoe et al. (2012): evaluative well-being, affective or hedonic well-being, and eudemonic well-being.

More in general, the guidelines on the measurement of well-being developed by the OECD (2013) comprise “all of the various evaluations, positive and negative, that people make of their lives, and the affective reactions of people to their experiences” (OECD 2013, p. 29). Examples of life circumstances include health, education, work, social relationships, built and natural environments, security, civic engagement and governance, housing, and work-life balance (WHO 2012).

When it comes to health issues in particular, it can be essential to build an articulated indicator system. Health is indeed considered one of the main aspects which matters to well-being and evidence displays that a good health is correlated with higher life satisfaction (British Office for National Statistics 2011). As widely recognised in the literature over the last two decades, the relationship between health and well-being is indisputably complex and it is not just one-way: health influences well-being and well-being itself influences health (Howell et al. 2007; Pressman and Cohen 2005; Stoll et al. 2012; Steptoe et al. 2012; WHO 2012). There is also evidence about correlations between well-being and physical health outcomes such as cardiovascular health, improved immune system response, higher pain tolerance, reproductive health, lower pain, greater pain tolerance, and increased longevity (United Kingdom Department of Health 2014). On the other hand, health is reported to influence well-being through both its physical and mental health components, considering their strong interconnections. Being associated with numerous benefits related to individuals’ health, family and economic circumstances, well-being is an outcome meaningful to the public and relevant in the area of public health (Diener et al. 2009).

In Psychology, Howell et al. (2007) used the term well-being for positive psychological constructs that are measured (e.g., positive affect, optimism) or manipulated (in a positive emotion induction), contrasted by the term ill-being for negative psychological constructs (e.g. negative moods, stress, depression).

In light of the intricate network of interactions underlying social measurement processes in complex societies, a proposal by Arcagni et al. (2021) has raised new critical considerations in order to increase awareness of the inherent limitations of current practices used to measure social issues and in particular to effectively evaluate policies directed towards well-being and social sustainability. As an instance, other scholars (Bacchini et al. 2020, among others) have highlighted several limits in the use of the composite indicators for comparison along the time.

2.1 Well-being at work

Work has long been recognised as having important (both positive and negative) impact on health and well-being, as effectively summarized by Litchfield et al. (2016). In modern workplaces -alongside to physical, chemical and biological risks, depending mostly on the type of industry- hazards are frequently related more to work organization and the nature of work itself (EU-OSHA 2013), rather than to specific agents, harm is therefore more psychological than physical. In fact, the work factors that can affect psychological (but also physical) health usually refer to job content, work organisation and management, their environmental and organisational conditions, as well as to worker’s competencies and needs. This interaction can prove to be hazardous to employee’s health through their perceptions and experience (EU-OSHA 2013).

The literature provides a comprehensive account of the job characteristics whose absence or poor-quality result in psychosocial risk factors that can affect workers’ health and well-being (Bryson et al. 2014; Brill 2021). These aspects include job content (i.e.: variety of tasks, correct matching of skills and tasks), job demands (i.e.: workload, work intensity, deadlines), control (i.e.: control over workload, participation in decision-making), work schedule, work-life balance, role in organisation, relationships at work (with colleagues and managers), physical environment and equipment (including work equipment availability, suitability and maintenance, environmental conditions such as light, noise, etc.), career development and prospects, and organisational culture and function (i.e.: communication, problem solving, definition of organisational objectives). It can be observed that when such characteristics result in psychosocial risks, they more frequently impact on workers’ health (and well-being) through a stress-mediated pathway. Within the field of occupational health and safety, the well-established “hazard-harm pathway model” to interpreting the association between exposure to occupational hazards and employee’s safety and health, was adapted to include and account for psychosocial risks (EU-OSHA 2013, p. 3).

Well-being at work has been extensively analysed within research focusing on job quality and job characteristics (among others: Hackman and Oldham 1976). A substantial part of the literature originates from the pivotal contribution of Karasek (1979) who conceptualised the “Job Demand-Control” (JD-C) model, according to which jobs involving high demands (resulting from intensification of work, increased workload) and low control (limited freedom to make decisions about how to organise and carry out own work) are related to higher levels of occupational stress. The “Job Design-Resource” (JD-R) model, conceptualised by Demerouti et al. (2001), considers that organisational job factors necessarily interact with job design. Demands are defined as those elements that entail physical or psychological effort; whereas resources, e.g., job characteristics, enable the worker to perform the required tasks. As underlined by Bakker and Demerouti (2014, p. 38): “Whereas job design theories have often ignored the role of job stressors or demands, job stress models have largely ignored the motivating potential of job resources. JD-R theory combines the two research traditions, and explains how job demands and resources have unique and multiplicative effects on job stress and motivation”. For a review of early and contemporary research on job design theory in organizations, see Oldham and Fried (2016).

Other scholars highlight that there is a considerable amount of evidence indicating that a positive correlation between well-being and an employee’s job performance does exist (Askitas and Zimmermann 2015; Bryson et al. 2014; Guzi and de Pedraza 2015, among others): results even display that higher level of worker’s well-being can lead to higher levels of job performance. The debate on these topics is part of a much broader body of research on job quality (Osterman 2013, Cazes et al. 2015; Padrosa 2021) and its relationship with health, both physical and psychological (see, for example, Henseke 2018).

It is worth mentioning that subjective characteristics (such as gender, age, personality, genes, education, and ability) also exert an impact on well-being in a number of ways. On the one hand, depending on their specific characteristics, the individuals can have a different perception or awareness of psychosocial risks and be resilient, with different outcomes in terms of well-being and perceived health conditions. On the other hand, individual attributes are inextricably linked to job and workplace factors and therefore are related to different levels of well-being (EU-OSHA 2017): for example, educated workers get access to better jobs; jobs held by older employees may offer more autonomy and higher income; immigrants work frequently in bad jobs (Bryson et al. 2014; Stoll et al. 2012).

Because it is subjective, well-being is usually measured with self-assessments. Self-reported well-being can be assessed in different ways, depending on whether well-being needs to be measured as a clinical outcome, a population health outcome, for cost-effectiveness studies, or for other purposes (Centers for Disease Control and Prevention 2018; WHO 2012). Some studies support the use of single items (e.g., global life satisfaction) to measure well-being parsimoniously. More specifically, and with all the caveats due to possible bias associated with the self-assessment, also in terms of comparability of results over space and time, well-being in the workplace is often assessed by proxies, as the level of overall job satisfaction or for some specific job features (Bryson et al. 2014; Maxwell 2015).

3 Data

Data employed for this study stem from sixth wave of the European Working Conditions Survey (EWCS) carried out in 2015 by the European Foundation for the Improvement of Living and Working Condition.Footnote 1 A sample of 43,850 employees and self-employed workers in 35 European countries, representative of the employed population, were interviewed: the European Union (EU) Member States (28 countries in 2015) plus Norway, Switzerland, Albania, the Former Yugoslav Republic of Macedonia, Montenegro, Serbia, and Turkey. The survey questionnaire is extremely rich and covers a wide number of topics referred to worker characteristics, working conditions, work environment factors, impacts on worker’s health (Eurofound 2017), thus representing a prominent data sourceFootnote 2 for comparison across countries, occupations, sectors, and age groups. The employed data are the most up-to-date statistically representative information. The 2020 round of the EWCS was abruptly interrupted at the beginning of the fieldworkFootnote 3 due to the outbreak of COVID-19. To control for extreme heterogeneity within the data, our study considers only the 2015 EU Member States.

As previously mentioned, several studies relate higher subjective worker’s well-being with higher self-assessed health (SAH), and studies based on longitudinal data also show a strong effect of health on subjective well-being (Dolan et al. 2006). Therefore, given its relationship with well-being and considering the limitations of other indicators such as those measuring job satisfaction, in this paper we adopt SAH as a parsimonious measure of workers’ well-being, using question Q75—“How is your health in general? Would you say it is: 1 Very good; 2 Good; 3 Fair; 4 Bad; 5 Very bad”, measured on a 5-point scale.

Common individual characteristics here considered are gender (question Q2a, expressed by a dummy variable where female = 2); age (in years, Q2b); education level (from the original Q106) is defined as a dummy where university degree = 1 (tertiary). Given the high number of missing values with reference to income, to investigate the effect of the individual’s economic status, information is derived from Q100: “Thinking of your household’s total monthly income, is your household able to make ends meet” (make-ends-meet) rated on a six-point wording scale from “Very easily” (1) to “With great difficulty” (6). Job features described by answers to Q2d and Q11, are represented by two dummies: full-time vs. part-time job (fulltime, where full-time = 1) and permanent vs. non-permanent job (permjob, where permanent job = 1). Moreover, we reckon the number of working days per week (Q26, d4w) and of hours weekly spent at work (Q24, whours).

We consider two sets of drivers for SAH: physical risk factors and psychosocial risk factors at work. Table 1 lists the questions of interest from which the variables under study originate. The question corresponding codes, as in the original dataset, and the variable labels as they are used in the figures, are also provided in Table 1. For each question, we also report the scales as used in the elaborations oriented from the lowest to the highest level of agreement.

Table 1 List of selected questions and related variables (physical and psychosocial risk factors)

Notice that, for most of the considered variables, we needed to conveniently reverse the original scales to make the wording of the questions consistent with the adopted scale for SAH.

Variable dEU12 refers to the original 12 European Union Member States, those supposed to share similar political and socio-economic features.

Missing values and “Don’t know” responses have not been considered in the analyses; therefore, our sub-sample eventually consists of 19,996 individuals.

Women are 51% of the sample. 27.2% of respondents hold a university degree. About 81.2% of the sample have a permanent job; 78.6% have a full-time job and 67.4% work in the private sector. The average age is 42 years (std error: 11.9). Respondents work on average 37 h per week (std. error: 10.5) on 5 working days.

The proportion of those claiming a bad or very bad SAH is 2.4%, while more than 77% report a positive or very positive evaluation (good: 52.6%; very good: 26%). As shown in Fig. 1, there are not prominent differences in the frequency distributions with regards to gender and sector. The descriptive statistics referring to all remaining variables are included in the Appendix (Tables 5, 6, 7).

Fig. 1
figure 1

Source: Authors’ elaborations on EWCS 2015 data

SAH frequency distribution by gender and sector.

4 Empirical analysis

In surveys on topics referred to working population, such as well-being and self-reported working conditions are usually measured by means of scales administered to respondents, who are asked to select a response category out of a list. This is the case of the variable SAH chosen for our analysis. Generally, in a modelling approach, the ordinal responses are used as they are when considered as explanatory variables. Consolidated statistical procedures are those derived by GLM (Agresti 2010). Nevertheless, such classic model-based analysis, carried out employing this kind of explanatory variables, may be difficult to interpret by general readership, therefore composite and synthetic measures are largely preferred (OECD and JRC 2008). Several statistical approaches have been exploited in the field of well-being measurement (among others, Maggino 2016), even in a model-based perspective (see, e.g.: Capecchi and Simone 2019).

We propose to simplify the model-based analysis introducing synthetic measures obtained by Principal Component Analysis (PCA, e.g.: Jolliffe 2011).

The methodology implemented in this study involve several steps. First, given the nature of SAH, preliminary Ordered Probit analyses are run, based on explanatory drivers describing 15 physical and 26 psychosocial risk factors; 9 individual characteristics are also considered. Selected profiles are illustrated. A PCA is then carried out to obtain two distinct synthetic indicators summarising the selected sets of risk drivers. The resulting synthetic indicators – the first principal components (PCs) for each set of risks—are subsequently used as explanatory variables in further, simplified, Ordered Probit models. This strategy allows to make the interpretation of the results more straightforward, since 42 different drivers for physical and psychosocial risk at work are replaced by two continuous synthetic indicators: PC1.Phy and PC1.PS, respectively.

5 Models and findings

In Orderd Probit models an underlying score is estimated as a linear function of the explanatory variables and a set of cutpoints. Even though the explanatory variables are ordinal, consistent support has been found for treating them as approximately continuous (Norman 2010; Johnson and Creech 1983), and as proxies of non-observable latent variables (Agresti 2010). In this framework, the probability of observing the outcome j corresponds to the probability that the estimated linear function, plus random error, is within the range of the cutpoints estimated for the outcome. The model has the following expression:

$${\text{Pr}}\left( {{\text{Y}}_{{\text{i}}} = {\text{ j}}} \right) = {\text{Pr}}({\text{k}}_{{{\text{j}} - {1}}} < \upbeta_{{1}} {\text{x}}_{{{\text{1i}}}} + \upbeta_{{2}} {\text{x}}_{{{\text{2i}}}} + \, . \, . \, . + \, \upbeta_{{\text{h}}} {\text{x}}_{{{\text{hi}}}} + {\text{ u}}_{{\text{i}}} \le {\text{ k}}_{{\text{j}}} )$$
(1)

where ui ∼ N (0, σ2), β1... βh (the coefficients associated to h drivers) and cutpoints k1... kJ−1 are the parameters to be estimated, J is the number of possible outcomes, and i = 1...n refer to the individuals; finally, k0 is taken as − ∞ and kJ is taken as + ∞.

Estimates are obtained using STATA (version 14) where the dummy variables are treated as factors, σ2 = 1 and the constant term set to zero are the normalization constrains (see Veerbek 2004, Sect. 7.2.2). We estimate three models, all of them including the individual and job-related covariates. Model 1 considers as explanatory variables the physical risk factors only, Model 2 considers the psychosocial risks only, and Model 3 is a comprehensive one, allowing to verify how the two sets of risks interact with each other and the individual variables. Results are presented in Table 2.

Table 2 Ordered Probit estimates

In line with the existing literature, the individual characteristics and the main job characteristics are in general significant. Some of the risk variables lose their statistical significance when passing from the models for single risk set to a comprehensive Model 3, whereas a few ones do acquire significance. These results may originate from the circumstance that some questions can be considered “equivalent” by respondents and therefore lead to disperse information and artificially alter the variability in the sample. The dummy variable (dEU12), which identify the EU12 countries, is always significant, indicating that sharing similar socio-economic features does exert an impact on response pattern.

Overall, in the transition from the models for a specific risk set of risks to the comprehensive one, the psychosocial determinants of perceived health seem to have a more robust impact, as they undergo fewer variations in terms of statistical significance. This evidence may suggest that, in this dataset, the psychosocial risks do seem to exert a greater influence on workers’ perception of health as compared to the physical ones.

It is well known that for the intermediate response categories the interpretation of the coefficients in the Ordered Probit model is complex, since the coefficients’ sign and magnitude do not offer a clear indication of the partial effect extent for a given explanatory variable. Therefore, the impact of the covariates for such categories may result fundamentally ambiguous when observing the estimated coefficients (Daykin and Moffatt 2002; Greene 2008, Sect. 23.10). On the other hand, with respect to the extreme response categories on the scale (e.g. very good/very bad) the sign of the estimates can be meaningfully interpreted.

However, as the variables under consideration are numerous it may be more convenient to observe specific profiles of respondents to capture their effect. In fact, the effect on estimated probabilities of a drivers’ change depends on all model parameters, data, and dependent variable category of interest. The evaluation of such probabilities is based on specific values of the individual variables, hence defining some respondents’ profiles of interest.

Table 3 reports in the first two lines the observed frequencies for each SAH category in the whole sample, and the corresponding estimated probabilities obtained by using the coefficients of Model 3. Moreover, for each SAH category, the estimated probabilities for two types of respondents are reported, distinguishing by gender, type of contract (permanent, non-permanent), and age. In particular, age is considered at three different moments of working life: at 20, 40 and 65 years old.

Table 3 Estimated probabilities for selected respondents' profiles (Ordered Probit Model 3)

The first profile refers to a worker with tertiary education, working in the private sector, with a full-time permanent job (distinguishing by gender and age). The second one considers a male worker, with tertiary education and a full-time job in the private sector (distinguishing by contract type and age). All the remaining determinants related to the risk factors are set to their mean value.

With respect to the Profile 1, it can be observed that the estimated probabilities for each SAH category (from “very bad” to “very good”) are quite similar for male and female respondents. The age of respondents is positively correlated to the SAH, that is young respondents are more likely to report good or very good health. The same can be said for Profile 2, where it can be observed that the contract type produces limited differences in the estimated probabilities, for each age group. Actually, temporary jobs are held by around 18% of the sample and are concentrated in the age range between 30 and 50 years old. At a disaggregated level, as the age varies, SAH decreases as expected, and the difference between workers in non-permanent jobs and workers in permanent jobs is still very limited and likely dependent on the small number of non-permanent workers over 50 in the sample (less than 1,000 in total in the considered sample).

5.1 Derivation of the PCs as composite indicators

A PCA has been performed separately on the two sets of risk factors listed in Table 1. As the variables are ordinal, polychoric correlation has been considered (for details, see Olsson 1979). The main assumption is that ordered categorical variables arise polychotomising underlying continuous variables, then polychoric correlation can be regarded as an estimate of classical Pearson correlation coefficient and are read in the same way.

Panel A in Fig. 2 displays the polychoric correlations among the physical risk factors, using a graph where each node represents a risk item, while lines connecting the nodes indicate the correlations among the items. The thicker the line the higher the correlation, while the colour denotes whether the correlation is positive (green) or negative (red). Stronger positive correlations can be noticed mainly between variables referring to vibrations (vibra), breathing in vapours (vaps), smoke and fumes (smoke), presence of chemical (chem) or infectious materials (infect), which represent physical risk factors present in the workplace (see Table 1). It can be noticed that the only negative correlation is that between two variables related to work-life balance (fitwell and timeIwant).

Fig. 2
figure 2

Source: Authors’ elaborations on EWCS 2015 data

Polychoric correlations among risk factors (threshold 0.35 in absolute value).

The eigenvalue associated with the first PC, say “PC1.Phy”, is 6.18, while the second largest is much smaller (1.59), thus PC1.Phy accounts alone for 41% (6.18/15, where 15 is the number of risk drivers) of the total variance suggesting that one dimension is able to adequately synthetize the data, summarizing the relevant information content, as also confirmed by the high significance of the test of the hypothesis that one component is sufficient.

An insight into the meaning of PC1.Phy is provided by the factor loadings reported in Fig. 3. Loadings are the coefficients of the linear combination of the original variables from which the PC is constructed. They are basically correlation coefficients between observed variables and the principal component; therefore, the loading magnitude measures the relevance of the risk driver. As usual rule of thumb, a variable should have a factor loading of at least 0.4 in absolute value. Items whose factor loadings are below 0.3 (or even below 0.4) provide little contribution and could be ignored.

Fig. 3
figure 3

Source: Authors’ elaborations on EWCS 2015 data

PC1.Phy factor loadings for the observed covariates.

Figure 3 shows that all covariates (but, consistently, comput and sitting) present a positive loading, thus, the higher the value of the covariates (which, given the direction of the scale employed for coding, indicates worse physical working conditions), the higher the value of the PC1.Phy indicator (the other way around for comput and sitting). Along PC1.Phy it is possible to identify, at one end, workers with a high exposure to physical risk agents (with positive high scores on PC1.Phy) and, at the opposite end, those in desk-based jobs (low negative score on the PC1.Phy). From the values in Fig. 3 we see that the variables more affecting the PC1.Phy are: vibra, noise, htemp, smoke, vapours, chem and loads. The only loadings below the 0.4 threshold are those of variables repet and liftp: nevertheless, in the subsequent analysis we keep them for sake of homogeneity with respect to the estimated Probit models above discussed.

Since PC1.Phy ranges from better to worse physical conditions, it tends to be negatively correlated to SAH, as clearly displayed in Panel A of Fig. 4, which reports the boxplots of the distributions of the indicator against the levels of SAH. Despite the presence of several outliers, we can see that increasing levels of SAH (from “fair” to “very good”) are associated with lower levels of the indicator and, correspondingly, of the variables in the linear combination, specifically of those with the highest loading.

Fig. 4
figure 4

Source: Authors’ elaborations on EWCS 2015 data

Distributions of the synthetic indicators versus the levels of SAH.

Similarly, an indicator has been derived for the set of variables describing the psychosocial risk factors. The corresponding polychoric correlations are plotted in Panel B of Fig. 2. Stronger positive correlations can be mainly observed between the factors referring to the relationships with colleagues and manager, the emotional sphere, such as feeling useful and fairly treated, being supported, trusted, involved in work organization, and being appreciated.

In this case, the eigenvalue associated to the first PC (let “PC1.PS”) is 7.34, while the second largest is 2.85. As expected, PC1.PS explains a lower percentage of the total variance (28%) as the number of items in this case is much larger. Nevertheless, one dimension (PC1.PS) captures most of the information in the data and, also in this case, the hypothesis test that one component is sufficient is highly significant. Looking at the factor loadings reported in Fig. 5, it can be noted that most of the variables are positively correlated to PC1.PS and show a loading which is well greater than the 0.4 threshold.

Fig. 5
figure 5

Source: Authors’ elaborations on EWCS 2015 data

PC1.PS factor loadings for the observed covariates.

Particularly, in this case, PC1.PS is mainly explained by the variables fair, apprec, coop, truste, trustm, motiv, welld, supportM, geton and useful. The higher the value of these determinants, which denote a positive and motivating work environment, the higher the value of PC1.PS. In this case, higher values of the indicator denote healthier working conditions, thus PC1.PS is positively correlated to SAH, as shown in Panel B of Fig. 4 where the distribution of the PC1.PS is plotted against the levels of SAH. Despite the presence of outliers, clearly, increasing levels of SAH correspond to higher levels of the synthetic indicator and of the above-mentioned variables which provide the highest contributions to the variance of the corresponding linear combination.

5.2 Ordered probit models including synthetic indicators

Based on the findings of the PCA, three further Ordinal Probit models have been estimated considering the respondents’ characteristics and using the synthetic indicators of risks. As reported in Table 4, Model 4 and Model 5 consider the impact on SAH of PC1.Phy and PC1.PS, respectively, whereas Model 6 simultaneously considers both the indicators. In all cases, the two indicators, either together or alone, turn out to be significant, confirming that they may provide an effective synthesis of the underlying variables which do exert some impact on SAH.

Table 4 Ordered probit with synthetic indicators for physical and psychosocial risk factors

In particular, the negative effect of the PC1.Phy indicator is due to the fact that the underpinning items represent detrimental physical working conditions. Hence, they are negatively correlated to SAH: an increasing exposure to such risk factors (higher values of the employed scale and, correspondingly, of the PC1.Phy indicator) is associated with worse SAH levels. This is especially true for those items having a strong effect on the PC1.Phy indicator (see Fig. 3). Conversely, the positive effect of the PC1.PS indicator on SAH is due to the positive correlations to SAH of most of the items whose corresponding loadings are mainly positive (see Fig. 5). In fact, many of the items summarized by the PC1.PS denote a constructive and motivating working environment.

In these simplified Ordered Probit models, the dummy variable dEU12 turns out to be always significant. Also, respondents’ characteristics remain significant in the same way, in all the models.

Concerning the physical risk factors, when comparing results from Table 2, it can be noted that although the variables vaps (breathing vapours) and infect (referred to the exposure to biological hazards) are not significant in Model 1, they are included in the synthetic indicator PC1.Phy with a loading close to vibra and noise (0.79 respectively, see Fig. 4). On the contrary, the variable liftp, which is significant in Model 1, presents small loadings (0.26) to the variance of PC1.Phy. However, there is quite a broad agreement between the significance of drivers in Model 1 and the content of the synthesis provided by PC1.Phy. When, on the other hand, we focus on psychosocial risk factors in Model 2, results display that a number of variables which are related to the workplace social environment and to the relationships with colleagues and manager (supportc, supportm, useful, expected, fair, truste, trustm, apprec, coop) are either not significant or weakly significant. Nevertheless, these variables present high loadings for PC1.PS, ranging from 0.54 to 0.71. Thus, for the psychosocial risk covariates, we may conclude that the first PC provides a synthesis of the information that is somewhat complementary with respect to the standard model with the original covariates, as it disentangles a specific cluster of psychosocial risk factors affecting SAH.

6 Conclusions

The objective of this paper was to explore the association of workers’ well-being -measured by self-assessed health (SAH)- with psychosocial and physical risk factors in the workplace, introducing synthetic measures suitable to identify if -and how much- the two sets of risk factors exert an impact on worker’s well-being.

For our purposes we used data from the sixth EWCS and carried out Ordered Probit models to measure the effect of specific items, operationalising the physical and psychosocial risks. The estimated standard Ordered Probit models confirmed that both types of risk factors have a significant effect on SAH.

Some of the psychosocial risk factors are not significant in any of the models (or are weakly significant), while other variables display a strong correlation with SAH. This is the case of variables related to work-life balance (fitwell and timeIwant) or to a positive and motivating work environment, in which workers have a sense of fulfilment with work (welld), are motivated (motiv), are consulted and participate in decisions (hasay, decisions), are supported by management and trust managers (supportm, trustm) and have good relationships with colleagues (geton). Our findings are in line with existing literature, although they seem to point to the fact that, among the aspects most frequently indicated in the literature as related to well-being, only few of them– and in particular those related to the social environment at work and the work-life balance– are those contributing the most to worker’s well-being. Therefore, those aspects not intrinsically related to work seems to be the most impacting on workers well-being.

Among the physical risk factors, those related to vibrations, high or low temperatures, and tobacco smoke significantly impact on SAH in the model including only this type of risk factors, but cease to be significant in the comprehensive model. On the other hand, all the physical risk factors related to positions or movements during work (painp, liftp, loads, sitting, repet, comput) display a significant correlation with SAH. Such results are in line with literature that indicates that the worker’s physical environment and the equipment (including work equipment availability, suitability and maintenance, environmental conditions such as light, noise, etc.) also have an impact on well-being, although in this case risk factors directly related to physical efforts appear to be the most relevant, considering the strong association with the well-being as expressed by the SAH variable.

When considering the two types of risk factors together, it seems nevertheless that the psychosocial risks do exert a greater influence on workers’ perception of health as compared to the physical ones.

The introduction of two synthetic indicators, implemented using a Principal Component Analysis (PCA) for each of the two sets of risk factors, makes the interpretation more straightforward. Such indicators are the first Principal Component computed for each set (PC1.Phy for physical risk factors, and PC1.PS combining all the psychosocial drivers), accounting for most of the variance providing an effective synthesis of the information. These measures have subsequently been included in further Ordered Probit models for SAH, and they turned out to be statistically significant. Such evidence may suggest that the provided synthesis is to some extent complementary to the standard models.

The added value of building synthetic measures relies on that they allow either for simplifying a model-based analysis or for disentangling specific drivers of work-related well-being, as long as they are actually carriers of information, with the additional advantage of removing redundancies and obtaining more “robust” models.

In conclusion, the results presented allow to respond to our research questions in terms of strength and type of risk factors which seem to impact more on worker’s health and well-being in European workplaces.

It is important to also stress the limitations of this exercise, which stem directly from the data used and the survey tool itself. First and foremost, the physical risk factors are not extensively surveyed in the case of the EWCS and therefore refer only to a subset of respondents that is too limited to obtain reliable estimates. This is due to several reasons, such as: the specificities of this type of risk factors which cannot always be measured as self-reported; the circumstance that they exist more frequently in specific working contexts (manufacturing or construction) which employ only a reduced share of European workforce; the fact that European and national legislations have targeted this type of risk factors for several decades, now resulting in their steadily decrease. Thus, ad hoc surveys investigating physical risk factors where they mostly exist would be the right solution to provide more reliable analysis.

Furthermore, empirical evidence suggests that similar or highly correlated questions may be perceived as repetitive as confirmed by the circumstance that only the first principal component in the two groups of selected variables is significant. In fact, there is no clear division between the loadings that allow to characterize the main components, no great contrasts are captured, and there is a major noise. All in all, a questionnaire including fewer and more targeted questions would allow for grasping better-quality information and would be a more cost-effective solution.

Finally, the availability of statistically representative data can impact future research developments, which would inevitably focus on the different sets of occupational risks, as a consequence of the COVID-19 pandemic, but also as a result of wider transformations of the way we work due to digitalization of the workplaces.