1 Introduction

Though the global health crisis has ended, its economic impacts have only started to ripple over the global labor market. In the United States, about four million workers voluntarily quit their jobs in April 2021 (Reuters, 2019). This so-called Great Resignation is observed in other advanced economies as well. In the Netherlands, it is reported that nearly one out of five people have switched their jobs in 2022 (Algemeen Dagblad, 2023). A macroeconomic explanation maintains that labor shortages in some sectors lead workers to believe that they may find a better offer to compensate the drops in real wage under a tight labor market, which may explain the high turnovers (Duval et al., 2022). While this explanation has some truth in it, research on organizational behaviors may argue that the phenomenon can be driven by low job satisfaction. Unhappy workers are more likely to quit their jobs (Green, 2010), especially during and after the pandemic period. For instance, Martin et al. (2022) found that remote working has increased job stress and reduced job satisfaction in Luxembourg. Demirkaya et al. (2022) reported that the feeling of entrapment is significantly correlated with job quits in Turkey.

Departing from the job satisfaction perspective, this article evaluates an array of individual-level factors that influence job satisfaction of Dutch workers using newly available Dutch household survey data. This study does not seek to explain the occurrence of Great Resignation. It highlights that job satisfaction has escaped from the discussion. Based on the multifaceted approach of job attitudes (Judge & Kammeyer-Mueller, 2012), an economic explanation would suggest that satisfaction of pay and of work conditions are major drivers of the behaviors of workers. In this study we examine to what extent economic concerns predict overall job satisfaction during 2022 using a novel machine-learning approach. Findings from this study should help us to assess which facets of job satisfaction are associated with the ‘Great Resignation’ in the Dutch labor market.

2 Job Satisfaction and the Utility Approach

Job satisfaction is commonly referred to as employees' affect and attitude toward their job. Emphasized on the affective dimension, Locke (1976) defined the concept as “a pleasurable or positive emotional state resulting from the appraisal of one's job or job experiences” (p. 1300). Focusing on the cognitive aspect of job satisfaction, Weiss (2002) defined the concept as “a positive (or negative) evaluative judgment one makes about one's job or job situation” (p. 6). The affective view sees affect at work as an indicator of job satisfaction, while the cognitive, evaluative approach considers affective experiences on the job as a source of job satisfaction.

In economics, job satisfaction is frequently treated as a unidimensional variable and a function of wage (Borjas, 1979; Hamermesh, 1977).Footnote 1 An implication of this formulation is that, when members of certain social groups (e.g., women) are discriminated in the labor markets, they would experience a lower level of job satisfaction because of a lower wage or other non-wage benefits (Bartel, 1981). Interestingly, while a gender wage gap is found to exist (Blau and Kahn, 2017), studies have shown that women usually report a higher level of job satisfaction than men, a stylized fact that contradicts what the theory predicts (Blanchflower & Oswald, 1999; Clark, 1997; Sloane & Ward, 2001). However, when factors such as age and job expectations are controlled for, the gap in job satisfaction between the two genders diminishes, suggesting that job satisfaction can depend on some often-overlooked demographic factors.

The utility function approach also implies that satisfaction is, at least partly, driven by extrinsic incentives. This early conceptualization is based on the utility theory that labor supply is a rational choice after a careful deliberation to the trade-off between leisure and consumption backed by the paycheck. Although studies have found that salary is an important determinant of job satisfaction, many studies have documented that workers do value a wide range of nonpecuniary characteristics of a job, which include job security, autonomy, shorter and more flexible work hour (Berger et al., 2019; Clark, 2001; Lepinteur, 2019; Origo & Pagani, 2009). A study by Lange (2012) further showed that these job characteristics could be even more influential than individual-specific factors like personality traits and values.

If people are happy with their jobs, they should be less likely to switch jobs. Some studies use quit data to analyze impacts of pecuniary and non-pecuniary factors on job satisfaction. Clark (2001) not only showed that job satisfaction is a powerful predictor of job change, it also found that job security and pay are the most important determinants of quit. Although not been emphasized in the literature, analyses of quit data may suggest that factors contributing to satisfaction and dissatisfaction may not be identical and their impacts could also be asymmetric. These asymmetric effects, however, have not been highlighted and emphasized in the existing literature.

One major conclusion from the job satisfaction literature is that happy workers are more productive (Oswald et al., 2015). But what beneath is the belief that a well-designed reward system should improve workers productivity. The starting point of personnel economics is the principal-agent conceptualization (Laffont & Martimort, 2002) that workers may shirk (i.e., moral hazard) and productive workers are costly to recruit (i.e., adverse selection). Clever economic mechanisms are required to identify these workers ex ante and to induce their efforts ex post. From this perspective, many of the human resource management practices such as performance pay, promotion, and job autonomy can be seen as performance optimizing mechanisms. Although financial and non-pecuniary rewards are parts of the job satisfaction equation (Cassar & Meier, 2018; Cornelissen et al., 2011; Ellingsen & Johannesson, 2007; Gosnell et al., 2020; Jones et al., 2009), from this cynical view, job satisfaction is nothing more than a happy by-product of human resources policies or a means to productivity and performance.

3 Individual and Social Dimensions of Job Satisfaction

An underlying assumption behind the utility approach is that income induces satisfaction and/or happiness. Although studies have found a significant but weak relationship between the two variables, behavioral studies have challenged this fundamental assumption (Clark et al., 2005; Easterlin, 1995). While some studies suggest that the relationship may be causal (Powdthavee, 2010), three decades of economics of happiness research have contested this finding (Clark et al., 2008). One major conclusion from the literature is that subjective well-being does not always increase with income. When examined the dynamic relationship between income and subjective well-being, Easterlin (2001) argued that income increases happiness initially. But aspirations grow as one climbs the income ladder. Over a life cycle, people's level of happiness remains stable and does not increase along with salary. Increase in income has only a short-term effect on happiness. The same mechanism may explain why job satisfaction may not catch up with income.

Another explanation to the weak statistical correlation between money and satisfaction over time is related to social comparisons. Taking a geographical approach, Luttmer (2005) found that people who earn an income lower than the local average feel worse off. A large amount of economics research in job satisfaction has been testing this hypothesis. Using data on British workers, Clark and Oswald (1996) found a similar relationship between job satisfaction and expected income. In an experimental study, Card et al. (2012) showed that the knowledge about their earnings below the median income of their peers significantly reduces job satisfaction and increases job search intentions. People form expectations as well as inspirations based on their peers, work conditions, and wage history (Diriwaechter & Shvartsman, 2018; McBride, 2001; Poggi, 2010). When the expectation-aspiration spiral is kick-started, an increase in aspirations can negatively affect people's satisfaction levels (Mcbride, 2010). What remains unclear is whether job dissatisfaction is due to pure social comparisons or fairness concerns (Card et al., 2012; d’Ambrosio et al., 2018; Smith, 2015). Interestingly, in a study using matched employer–employee panel data in Denmark, Clark et al. (2009) found that job satisfaction is positively correlated with their co-workers', a finding that counters many of the existing research in this area. They interpret that this relationship can be related to people's expectations about their future earnings. A higher average salary level leads the thinking that their potential wage may increase soon. All in all, while studies have shown that expectations and aspiration matter, it remains unclear how they are formed based on social comparisons and wage profiles.

One major assumption behind the neoclassical formulation of the utility function is that people gain utility mainly from consumption but not from the job per se. While the personnel economics literature covers aspects such as reward systems from a well-grounded, humanized (i.e., incentive-based) perspective, it is not unrealistic to think that under certain conditions, work can lead to a sense of achievement, which in turns shapes aspirations and hence job satisfaction (Genicot & Ray, 2020). Another stylized fact in the literature is that entrepreneurs and the self-employed are found to have a higher level of job satisfaction (Lange, 2012; Millán et al., 2013). The risk-adjusted returns and job security of entrepreneurship is known to be low. The existence of intrinsic motivations, however, offers a plausible explanation to the surprising, stylized fact (Carree & Verheul, 2012). In a laboratory setting, Ariely et al. (2008) and Chandler and Kapelner (2013) manipulated the perceived meaningfulness of a task and found that meaningfulness influences effort and labor supply behaviors. This lends support to the idea that the meaning of work could be part of the utility function in its own right (Cassar & Meier, 2018). Drakopoulos and Theodossiou (1997) considered a hierarchical utility function, in which increase in earnings, up to a certain point, ceases to induce utility, and the marginal utility of other work-related variables becomes much higher thereafter. Although the proposed modification does not directly speak to the fulfillment mechanism, the modification formally speaks to human's intrinsic motivations and the feeling of satisfaction.

If intrinsic motivations involve meaning making, a job that is connected to a person's educational background, skill sets, and competency should be more fulfilling. In fact, Nikolova and Cnossen (2020) showed that intrinsic motivations, measured by perceived job autonomy and competence, matter even more than extrinsic rewards. Feeling competent is pleasant and induces satisfaction (Loewenstein, 1999). Nevertheless, some studies have shown that the competence concern may backfire. García-Mainar and Montuenga-Gómez (2020) found that, in terms of education, overqualified workers tend to dissatisfy with their jobs. The same applies to horizontal educational mismatch—when graduates are employed in an occupation unrelated to their fields of study (Levels et al., 2014)—and skill mismatches (Vieira, 2005). However, there is only little evidence on the effects of skill obsolescence and gaps on job satisfaction (McGuinness et al., 2018).

One fundamental criticism on the study of job satisfaction is that many findings from these empirical studies may not be easily fed into neoclassical economic theories. To some economists, if job satisfaction is nothing more than yet another term in a utility function, it can only be assumed but not explained. Accordingly, this leads to a proposal that, because job satisfaction is unobserved and is related to some volatile, external factors like economic fluctuations and labor market policies (Pilipiec et al., 2020, 2021; Ravid et al., 2017), and is unstable even within-individual (Bryson & MacKerron, 2017), instead of fixating on job satisfaction and treating it as a dependent variable, a more fruitful approach is to treat it as an explanatory variable to study worker behaviors such as quits and labor market functioning like employee turnover (Hamermesh, 2004).

Although the general working environment and coworker relationship has been an important dimension of job satisfaction in organizational psychology (Jolly et al., 2021; Kinicki et al., 2002; Smith et al., 1969), the variable has received relatively little attention in economics literature, probably due to its difficultly in incorporating into the existing theoretical framework. The same also seems to hold in organizational studies (Judge & Kammeyer-Mueller, 2012). Cassar and Meier (2018) discussed feelings of relatedness in the context of meaning of work and productivity, but they did not articulate how the concept can be related to job satisfaction. They also mentioned social comparison and fairness, which was discussed above, but they clearly did not relate it to coworker relationship. Intuitively it is easy to understand why collegial relationship may influence job satisfaction. Karlsson et al. (2004) explained that social extensions such as a family and work ameliorate feelings of inconsequentiality, and people could find meaning in one's life. But it is not obvious how it can be related to the meaning of a job (Nikolova & Cnossen, 2020). One possibility is about the pursuit of common (organizational) goals in a team setting. On a material level, cooperation makes success more likely and helps to achieve higher output or goals which cannot be completed alone. On the social level, contributing to a common goal can create a warm-glow effect when individuals consider working with or “helping” their colleagues as altruistic acts and gain utility from that (Andreoni, 1990). From this perspective, a friendly work environment can be considered as a public good. Even for pure egoists, they can gain utility from contributing to the building of a constructive work environment and creating positive spillovers simultaneously. In this regard, an affable work environment can be considered as a by-product rather than a source of job satisfaction.

In our study, the relative importance of different facets of job satisfaction will be tested using a predictive, machine learning approach. Many psychometric tests have been developed to assist in clinical diagnoses. Thus, a predictive approach is well-established in psychology. In fact, predictive validity is a core property of psychometric measures (Mulder et al., 2014). Linearity is commonly assumed in traditional statistical measures and tools such as correlation coefficients and structural equation modeling when linear algebra is used. However, higher dimension interactions and linearity are plagued in many relations. The machine learning approach employed in this study is able to capture nonlinearity which is not easily modeled using traditional regression methods (James et al., 2013). Additionally, as argued above, there are two major limitations in traditional studies using regression analysis: (1) a variable that influences job satisfaction necessarily affects dissatisfaction, and (2) the effects of a variable on job satisfaction are symmetric. An analytical advantage of the machine learning approach is its ability to reveal potential asymmetry between variables in a relationship. The notion of nonlinearity would become clearer when it is discussed in the result section.

4 Data

Data in our analysis were drawn from the Work and Schooling module of the Dutch Longitudinal Internet studies for the Social Sciences (LISS; Streefkerk & Centerdata, 2022). The LISS panel is administered by Centerdata research institute based in Tilburg University. Based on the population register of the national statistical office of the Netherlands, a random, nationally representative sample was drawn. In this study the fifteenth wave of the LISS survey was used, which is the most recent one after the pandemic. The online survey was conducted between the 4th of April 2022 and the 31st of May 2022. The cross-sectional dataset that was used consists of 420 variables with 5775 responses.

One advantage of the machine learning approach is its ability to include in an analysis a large number of variables, an approach which is seldom adopted in a typical regression analysis due to the concern of multicollinearity. Instead of pre-selecting variables, which can involve personal bias, the machine learning method takes a data-driven approach and includes as many of context relevant variables as possible. The method, however, involves a trade-off: the inclusion of additional variables usually reduces the sample size because of the missing value problem. Therefore, following Celbiş et al. (2023), an algorithmic process is implemented in order to optimize the number of observations. In each iteration, a simple regression tree analysis is conducted, and the root mean squared error (RMSE) is recorded. In the next step, the algorithm searches for the variable that, if excluded, would cause the greatest reduction in the number of observations. Then the variable is dropped from the dataset and the process is repeated. The data matrix that keeps most of the observations and has smaller impact on accuracy is used for the final analysis. Figure 6 (in the Appendix), visualizes the observation-feature trade-off of the procedure. In each iteration, represented by the x-axis, either one or more variables are dropped. Identifying the variable(s) to be dropped in each iteration is done through generating an UpSet plot developed by Lex et al. (2014).Footnote 2 In Fig. 7 (in the Appendix) we present a sample UpSet plot built in the 4th iteration as an example. According to this UpSet plot, the intersection of the features cw22o582, cw22o583, cw22o584, cw22o585, and cw22o586 account for the largest loss in observation as shown by the first vertical bar, suggesting that these features may represent connected or follow-up questions in the survey which are usually entered as missing in conjunction. Within the iteration, a regression tree is fitted into the subset of the training data which omits the above specified variables and its RMSE is noted. In the next iteration, the highest horizontal bar will correspond to the variable cw22o510, as the five features with higher bars below it will have already been dropped. In this new iteration, cw22o510 would account for the largest decrease in observations by itself unlike the earlier dropped group of variables. Therefore, the iteration will only drop cw22o510.Footnote 3 A new regression tree then is fit into this new subset of the training data and a new RMSE value will be computed. The recursive steps continue until the features of the dataset are exhausted or until the dataset has no observations left with missing values. In our case, this corresponds to iteration 54 (as shown in Fig. 6) where the y-axis represents the percentage of observations (persons) or variables that is left in this iteration. As variables responsible for high missing values are dropped, the percentage of observations retained increases. The purpose of this procedure is to identify the optimum combination of the number of observations and variables that is likely to yield the highest prediction performance in our subsequent main empirical implementations of machine learning models. However, the case at hand suggests that RMSE (not represented by any axis) is not sensitive to this trade-off as suggested by the nearly flat RMSE curve in Fig. 6. As a result, the choice of the desired observation-variable balance becomes somewhat subjective. We selected the combination which retains a balance in this trade-off such that the difference between the percentage of persons retained and the percentage of variables retained is at a minimum. This corresponds to iteration 30. Prior to implementing the above outlined steps, variables that are completely consisted of missing information, variables with no variation (i.e., same value reported for all persons), administrative variables coded into the questionnaire (e.g., start date of the interview, duration of the interview) were dropped. After extensive data cleaning and validation procedure, which involved the above algorithmic trimming of the dataset, the final dataset consists of 1878 individuals and 89 variables. 30% of this data is randomly selected and set aside as the test dataset. All models are applied on the remaining training data. The results are assessed by evaluating the root mean squared error of the models using the test data. The definitions of the top ten features selected by the model in addition to the dependent variable (job satisfaction) are presented in Table 1.

Table 1 Feature definitions

5 Empirical Models

The empirical analysis takes on two steps: prediction and interpretation. The prediction step is based on a variation of the Gradient Boosting Machine (GBM) technique by Friedman (2001, 2002). GBM is applied using the Extreme Gradient Boosting (XGBoost) algorithm by Chen and Guestrin (2016). XGBoost extends the usability of GBM by allowing regularization and adding further randomization options. The prediction also relies, to a lesser extent, on the Random Forest (Breiman, 2001) technique for clustering. Both XGBoost and Random Forest are collections of weak learners based on the binary recursive partitioning algorithm by Breiman et al. (1984). Hence, randomized (Random Forest) and sequential and randomized (XGBoost) tree-based ensemble machine learning models are used in this study. The XGBoost algorithm allows for cross validation for regularization and determining the optimum model parameters including the learning rate. We partitioned the training sample into 10 subsamples (i.e., internal validation sets) to decide parameters pertaining to tree complexity (i.e., the maximum tree depth and minimum number of observations in terminal nodes) and the learning rate through cross validation. While regression trees are normally pruned through n-fold cross validation, the random forest model produces unpruned trees. However, while cross validation is absent from the random forest proximity clustering, unbiasedness is achieved through the use of the out-of-bag (OOB) observations. The resulting Shapley Additive Explanations (SHAP) values are derived from the above mentioned cross validated gradient boosting model.

XGBoost and Random Forest present several advantages thanks to their ability to consider all possible interactions and nonlinearities as the algorithms are based on binary recursive partitioning (James et al., 2013; Varian, 2014). The aggregation of many classification trees with high variance but low bias (due to their unpruned structures), built by taking repeated samples from the training data, can significantly improve prediction accuracy while reducing the variance on the prediction function (Breiman, 1996, 2001; Friedman, 2001; Friedman et al., 2001; James et al., 2013). However, as trees built using the same training dataset are expected to be highly correlated, the benefits of using an ensemble would be limited (Aldrich & Auret, 2013; Breiman, 2001; Friedman et al., 2001). The random forest algorithm aims to cope with this correlation by introducing randomized restrictions on the feature space at each iteration (i.e., it randomly selects input features in each tree) (Breiman, 2001; Breiman & Cutler, 2020; Friedman et al., 2001; James et al., 2013). Therefore, in addition to the reduction in variance through aggregation, further reduction is made possible compared to bootstrap aggregation which is an ensemble model with correlated trees (James et al., 2013). However, gradient boosting does not decorrelate trees. Instead, each tree is a modified version of the previous one (Friedman, 2001, 2002). Nevertheless, GBM embodies randomization like Random Forest but also introduces regularization which is not present in Random Forest. As a result, a group of weak learners with low variance are chained sequentially and modified with learning steps in between leading to the bias in prediction being lowered gradually in each iteration (Friedman, 2001, 2002; Friedman et al., 2001).

A random forest with 500 unregularized regression trees is generated for predicting the individual job satisfaction level for the \(N\) persons in the training data. At each iteration, some individuals are left out of the computation, because the bootstrap aggregation algorithm on which a random forest is based draws random subsamples of 2/3 of the size of the training dataset (Breiman, 2001; James et al., 2013). Further randomization is applied by selecting a split feature from a random subset of size 1/3 of the feature set at each split (Breiman, 2001). A random forest proximity matrix (\(N\times N\)) is produced where the proximity score of two persons is increased by 1 each time when they are predicted to fall into the same terminal unpruned regression tree node in a random forest iteration in which they were out-of-bag (i.e., randomly left out). The matrix is divided by 500 (the number of trees) and the additive inverse is computed (Aldrich & Auret, 2013; Breiman & Cutler, 2020; Friedman, 2001).

The exploration of clusters is performed based on the random forest results. Random Forest uses the proximity scores among the observations in the training dataset in order to detect cluster structures (Aldrich & Auret, 2013; Cutler et al., 2009; Friedman, 2001). The distance measures used in conventional clustering techniques such as hierarchical and k-means clustering are prone to be dominated by uninformative features that may cloud the effects of the important model features (Cutler et al., 2012). In this regard, the main advantage of the random forest proximity matrix is due to its randomization procedure which aims the aforementioned decorrelation process. In addition, unlike classic clustering approaches, feature selection in random forest proximity plots is based on the underlying model which employs algorithmic selection (Xu & Tian, 2015). Furthermore, the random forest proximity plot used for clustering in the present study is generated using OOB observations pair-wise frequencies of sharing a terminal node, which is an internal validation procedure, leading to improvements in out-of-sample performance (Breiman & Cutler, 2020; Friedman, 2001).

The random forest proximity plots tend to detect and represent one class in one arm of a star shaped visual where pure class regions of out-of-bag observations in the training data are grouped towards the extremities of an arm due to the tree-based structure of the underlying algorithm (Hastie et al., 2009; Aldrich & Auret, 2013; Cutler et al., 2009). In this regard Friedman et al., (2001, p. 595) state that “The idea is that even though the data may be high-dimensional, involving mixed variables, etc., the proximity plot gives an indication of which observations are effectively close together in the eyes of the random forest classifier”.

The interpretation of the machine learning findings is an essential part of any study in social sciences, as predictions alone—regardless of their success—cannot provide clear information and policy implications. The main interpretable machine learning tool employed in this paper is the computation and assessment of Shapley Additive Explanations (SHAP) values (Lundberg & Lee, 2017) based on the cooperative game theoretical approach by Shapley (1953). SHAP values have been introduced recently to the machine learning fields. As opposed to older approaches such as calculating variable importance scores (Lundberg & Lee, 2017; Molnar, 2019), the SHAP values approach can measure both the sizes and the directions of the relationships. The computation of SHAP values is preformed using the “SHAPforxgboost” module by Liu and Just (2020). A remarkable advantage of the SHAP value approach is that the calculations of the effect sizes are done by considering many different values and (theoretically all) combinations of model features (Celbiş, 2022; Lundberg & Lee, 2017; Molnar, 2019). Consequently, when computing the effect of a given feature for a given data instance (i.e., individual) all other variables are not held constant as is usually done in traditional econometric approaches. The departure from the ceteris paribus restriction leads to more realistic assessments of effect sizes, as in the real-world other factors can never be held constant in the context of a social science research. Finally, because considering all possible feature combinations and values is computationally not feasible, an approximation formulated by Štrumbelj and Kononenko (2013) is used in this study.

6 Findings

Among the included variables, both satisfaction with coworker relations and the pay are important features in predicting job satisfaction. The random forest proximity plot shown in Fig. 1 visualizes the clusters based on proximities in prediction between individuals.Footnote 4 The plot suggests the existence of about three clusters based on the roles of model features in explaining job satisfaction.

  • highly satisfied by colleagues atmosphere (yellow).

  • moderately satisfied by colleagues atmosphere (green).

  • poorly satisfied by colleagues atmosphere (dark green).

Fig. 1
figure 1

Clusters detected by random forest predictions

In the plot, larger-sized circles indicate individuals with lower wage satisfaction. These individuals are grouped towards the intersection of the “arms”, suggesting that the model has a harder time in distinguishing them (i.e., in iterations in which they were out of sample, people with low wage satisfaction often fell into same terminal nodes when run down the tree). The clear formation of the arms as separate clusters suggests a successful differentiation of the individuals through their inherent similarities. We also observe that people who are poorly satisfied with their jobs are not all part of the same cluster; suggesting that job dissatisfaction may arise from a diverse set of factors.

Regarding the SHAP analysis, a grid search procedure yielded the following optimal parameters for the XGBoost model except for subsample and number of trees, where the former is decided based on the finding by Friedman (2002) and the latter on computational restrictions:

  • Learning rate: 0.01

  • Maximum tree depth: 10

  • Minimum number of observations in terminal nodes: 1

  • Subsample ratio in each iteration: 0.5

  • Feature subsample ratio in each iteration: 0.5

  • Number of Trees: 10,000

The model is run on the training data, which consists of 70% of the observations randomly sampled from the full dataset. The job satisfaction levels of the individuals in the remaining test data are predicted with a RMSE of 1.14. The SHAP values that are computed based on the XGBoost predictions are visualized in Fig. 2, where each dot represents an individual. The values show the contribution of each feature value—where features are on the y-axis and higher values are represented with darker colors—on the deviation of a specific individual's predicted value from the mean prediction (the point 0 on the x-axis). The top ten features with the highest SHAP importance values, listed next to the variable name, are presented in the figure. The features that affect job satisfaction the strongest are the first three variables, as the SHAP importance values of the remaining features are all less than 0.1.

Fig. 2
figure 2

SHAP values

The SHAP analysis summarized in Fig. 2 suggests that a high satisfaction of the atmosphere among colleagues has a positive effect on job satisfaction. Furthermore, this variable has the highest importance in the prediction of job satisfaction. The relationship is shown in more detail in the SHAP dependence plot in Fig. 3 (a slight amount of jitter is used for better representation). A value of 10 for this variable alone can account to up to more than 1 point (in Likert scale) positive deviation from the mean job satisfaction value in the training data.

Fig. 3
figure 3

SHAP feature dependence plots

Focusing on wage satisfaction, we observe that high wage satisfaction has a positive effect on job satisfaction. While this is not surprising, it should be highlighted that the importance of wage satisfaction is less than half of that of satisfaction in colleagues. The interaction and dependence between these two top variables are further visualized in the form of a two-way partial dependence plot (PDP) in panel A of Fig. 4. Unlike the SHAP dependence plots, the PDPs represent joint predictions by holding constant all other features, except one or two features of interest (Friedman, 2001). The predictions are recalculated for each value of the features(s) of interest and averaged over the individuals in the training data (Friedman, 2001). However, some predictions suggested by the PDPs may be implausible for features that have high correlation with variables that are held constant (Friedman, 2001; Molnar, 2019). The PDP for Colleagues and Wage Satisfaction suggests that high wage satisfaction, while making a difference, does not predict truly high job satisfaction levels in the absence of high satisfaction in the atmosphere with colleagues. A similar outcome can be deducted from panel B of Fig. 4 which shows that high wage satisfaction without feeling of appreciation does not predict high job satisfaction. We also observe in panel C of Fig. 4 that higher wage satisfaction is associated with high job satisfaction, but the effect is stronger for older individuals. Finally, panel D suggests that high wage satisfaction without work freedom does not predict high job satisfaction.

Fig. 4
figure 4

Two-way patrial dependence plots

The feature Appreciate is ranked third in Fig. 2. This result suggests that the perception that people get the appreciation they deserve for their work has a positive effect on job satisfaction. Feeling unappreciated has a slightly stronger negative effect than the positive effect of feeling appreciated.

The SHAP values of the variable Birth Year are quite symmetrically spread about zero and indicate that younger individuals in the dataset tend to be less satisfied with their jobs. The effect of this variable and the remaining ones are relatively small compared to those of Colleagues, Wage Satisfaction, and Appreciate.

The remaining features in the set of top ten variables with the highest SHAP importance values do not determine job satisfaction to a high extent individually, but collectively they affect the prediction. We briefly summarize the suggestions of their SHAP values. Lack of freedom in organizing one's work has a negative effect on job satisfaction. However, the effect is not symmetric: the positive effect of high freedom is less than the negative effect of lower freedom. Furthermore, the perception of getting enough support in difficult situations has a positive effect. Similar to the freedom variable, the effect of perceived support is also not symmetric.

The income variable is among the top used variables by the algorithm in predicting satisfaction. However, its effect is small and not clear. In elaborating personal economic gain, we instead focus on the wage satisfaction feature.

Preferring to work less hours than current is negatively associated with job satisfaction, but the effect is not very pronounced. Another feature in relation to work hours, Work Home Hours suggest that more hours worked from home has a negative effect on job satisfaction. Again, the effect is not symmetric. Finally, for travel time the effect is weak, but it is evident that long travel duration has a negative impact on job satisfaction.

Further information is given by the SHAP dependence plots. In panels A and B of Fig. 3 we observe that the positive effect of Colleagues and Wage Satisfaction begin after about a score of 7, and that there exist individuals for whom their high satisfaction with their colleagues make a big impact on their job satisfaction despite low wage satisfaction. On the other hand, panel C shows that feeling appreciated has a positive effect that becomes stronger if the person agrees with this opinion stronger. Finally, in panel D we observe that younger persons (born after around 1985) tend to have lower job satisfaction even if they may have high wage satisfaction. The effect is more negative the younger the person is.

Whereas the above discussed partial dependence computations visualize the mean predictions of individual job satisfaction level, individual conditional expectation (ICE) plots (Goldstein et al., 2015) visualize the predicted change in everyone’s job satisfaction level by plotting individual curves. Centered ICE plots provide a more explicit presentation by anchoring each individual curve at a given y-intercept value (Goldstein et al., 2015; Molnar, 2019). The ICE plots shown in Fig. 5 visualize the predicted paths for each individual in the training data where each line represents an individual. The plots for Colleagues, Wage Satisfaction, Appreciate, and Birth Year respectively suggest that the direction of the effects is mostly similar for individuals, although a small amount of heterogeneity in expectations exists.

Fig. 5
figure 5

Individual conditional expectation plots

7 Discussion and Conclusion

Welfare maximization is generally, in standard economic textbooks, regarded as a respectable economic objective or driver in any society. But the empirical measurement of welfare (including life satisfaction) is still fraught with many hurdles and uncertainties. In practice, GDP per capita—or, in a labor market context, wage rates—are often regarded as signposts for economic performance. However, this measuring rod has many serious shortcomings, such as the neglect of distributional and equity aspects, the exclusive focus on income to the detriment of essential consumption categories (such as human health, food, safety, education, green environment, quality of life), the bias caused by the presence of negative externalities or social costs (e.g., climate change, social stress, environmental decay), or the omission of the welfare implications of the worker’s balance between leisure time and working time. The welfare of a society depends among other aspects on how satisfied laborers in that society are with their work and work environments. The present study has aimed to shed further light on the individual dimension of social welfare.

The present study used the most recent wave available of the Work and Schooling module of the LISS survey on individuals in the Netherlands. The empirical analysis was mostly founded on tree-based sequential ensemble prediction algorithms. The predictions were elaborated in detail using interpretable machine learning techniques to quantify the strengths and directions of the relationships between the survey features and the level of job satisfaction of an individual.

The main result is that wage satisfaction alone is not sufficient to ensure job satisfaction for the analyzed sample of individuals from the Netherlands. Being satisfied with the atmosphere among one's colleagues and feeling appreciated are also essential for job satisfaction. While low wage satisfaction can have a strong negative effect on job satisfaction, high satisfaction with colleagues has a stronger potential positive effect on job satisfaction compared to the effect of wage satisfaction. Among other results, we also observe that younger people are less satisfied with their jobs.

It is believed that facet-based measures such as the Job Descriptive Index (JDI) predict overall job satisfaction well (Judge & Klinger, 2008). These measures cover similar dimensions of job satisfaction (Dunham et al., 1977; Kinicki et al., 2002). For instance, the JDI examines job satisfaction in five dimensions: work, supervision, coworkers, pay, and promotion. The Minnesota Satisfaction Questionnaire (MSQ; Weiss et al., 1967) measures job satisfaction in terms of compensation, advancement, coworkers, and supervisor human relations. The Index of Organizational Reactions (IOR; Smith, 1976) looks at supervision, the kind and amount of work, finance, coworkers, physical conditions, career prospects, and company identification. Most if not all related variables are featured in our machine learning analysis. Results from our machine learning approach generally support the construct of these popular job satisfaction measurements. However, based on the SHAP importance values of features, using the value of 0.1 as a cut-off point, it is found that three features predict job satisfaction particularly well: coworker atmosphere, pay satisfaction, and recognition. And among the top three factors, the coworker dimension performs the best. Based on results from our study, simple measures like the JDI and MSQ perform reasonably well. Nevertheless, the lower predictive power of some features suggests that looking only at few features may not be sufficient capturing the overall picture. More is not necessarily better given the lower predictive performance of some other indicators.

Relatedly, features that capture job characteristics do not perform particularly well. This challenges the job characteristics model (Hackman & Oldham, 1976), the dominant approach in job satisfaction research (Judge et al., 2017). Although the social dimension of job satisfaction is well recognized and is featured in all major measures, coworker relations is understudied when compared with other dimensions such as work conditions and pay satisfaction (Judge & Kammeyer-Mueller, 2012). Our findings also suggest that if an objective of the many traditional human resources management policies is to improve job satisfaction, some of the focus on, for example, skills mismatch and training, could be less important than the cultivation of a supportive and collegial working environment. Future research should focus more on the social aspect. There are several prominent theories related to the social environment in workplace (Jolly et al., 2021). The conservation of resources theory (Halbesleben et al., 2014; Hobfoll, 1989) maintains that, as a resource, (perceived) social supports from colleague and job supervisors helps workers to regulate resources in times of difficulty to prevent (mental) resource loss such as burnout. Focusing on job performance and engagement, the job demands–resources theory (Bakker & Demerouti, 2007; Gerich & Weber, 2020) suggests that social resources in workplace could help workers to improve their performance. Given the general interpretation of the survey question, it remains unclear in which way coworker atmosphere in our analysis entails. Future research is still required to disentangle the mechanism behind the finding. Furthermore, since support has been picked up by a separate top-ranked variable, and work pressure, while included, was not a highly relevant variable in predicting job satisfaction. One interpretation is that work pressure has been captured by other variables, which have lower importance, one may favor the job demands–resources theory which emphasizes the role of support on performance. Another interpretation is that work pressure has a lower importance because part of it has been captured by social support. If this interpretation is correct, the conservation of resources theory might be more important. Which explanation of the finding is correct remains a topic of future research.

What do our findings suggest about the role of job satisfaction in the Great Resignation in the Netherlands? The importance of pay satisfaction is partly consistent with the explanation about the tightness of the labor market in the pandemic era. Under high inflation, workers are predictably unsatisfied with their wages. A tight labor market would favor workers to shift jobs and to ask for a high bid. However, the predictive power of collegial atmosphere out-weights that of wage satisfaction by a lot. It is likely that job satisfaction, mainly through its impact on coworker relations, plausibly related to remote working and home office, has a greater impact on job shifts when compared with the wage factor. As the Netherlands is a developed economy, the hierarchy of needs for individuals are likely to be different from middle and low income countries. The importance of coworker relationship may have relatively less importance compared to wages and other features pertaining to living standards in countries when working individuals are considerably more concerned about their socioeconomic well-being. While our evidence could shed light on the research question at hand with respect to high-income countries, empirical research on data from lower income countries may point towards different results.