1 Introduction

The long-term success of organisations relies on their ability to attract and retain high-quality labour. Voluntary turnover (VT), or “a worker’s conscious and deliberate willfulness to leave the organisation” (Tett & Meyer, 1993, p. 262), disrupts this capacity, erodes valuable social capital (Chung et al., 2022) and can adversely impact those left behind (Laulié & Morgeson, 2021).Footnote 1 While VT is not necessarily negative, ‘dysfunctional turnover’, or the failure to retain highly-valued workers (Zivkovic et al., 2020), is incompatible with long-term profit maximisation (Heavey et al., 2013). The cost of replacing a UK worker on the average salary of £27,000 is estimated at £12,000, with senior employees costing considerably more (Wright-Whyte, 2019).

VT is nonetheless pervasive, with the UK workforce 1-year-retention rate averaging just 81% (ONS, 2019). In the US, 27% of workers voluntarily quit in 2018, at a cost of $617 billion (Work Institute, 2019). 77% of these quits were deemed 'avoidable’, or within the employer’s control (Bakar et al., 2021). COVID-19 has exacerbated an already problematic situation, triggering a ‘Great Resignation’ (Klotz, 2021) of employees in search of  a better work-life balance. US monthly resignation rates peaked at 4.3 million (2.9% of the workforce) in August 2021 (BLS, 2021). A 2022 PWC survey of 52,000 workers in 44 countries (Ellerbeck, 2022), revealed that 20% of surveyed workers were planning to quit their jobs. While pay was the main factor cited by those who were considering switching jobs (71%), job fulfilment (69%) and the ability to be oneself at work (66%) were ranked 2nd and 3rd respectively.

Given the prevalence and high costs of dysfunctional VT, the question of why workers quit continues to attract considerable research attention (e.g., Hom et al., 2017; Rubenstein et al., 2018). While quit rate data are useful for identifying a dysfunctional VT problem, they are limited to the extent that they are obtained ‘after-the-fact’, in that they describe an event which has already occurred. Quitting can be construed as the end-point of a gradual withdrawal process (Hulin, 1991). If the aim is to prevent quits, then it is crucial to identify potential quitters at an earlier stage of this process. The majority of VT process models (e.g., Hom and Griffeth’s expanded model, Hom & Griffeth, 1995) are informed by the Theory of Planned Behaviour (TPB; Ajzen, 1985, 1991), which posits that behavioural intentions are the best predictor of actual behaviour. TPB-grounded VT models position quit intentions (QI) (motivation) as the most proximal and accurate antecedent of VT, through which subjective norms about quitting, perceived behavioural control (ability to quit) and all other antecedents (e.g. attitudes towards the job; personal characteristics) are mediated (Peltokorpi et al., 2023; Van Breukelen et al., 2004).

Focusing on QI is also advantageous for the reasons outlined in Nichelson et al. (1977). Firstly, QI are a useful indicator of work-related discontent which can serve as a valuable early-warning signal that organisations may need to put in place measures to prevent a potential quit problem. Secondly, investigating QI focuses attention on the attitudinal components of VT. QI represent an individual’s motivation to stay or quit (Cotton & Tuttle, 1986), information which is critical for understanding the types of workers who quit, and why. This data is arguably more important than overall quit rate data for organisations who are seeking to take remedial or preventative action. Thirdly, using QI as the dependent variable avoids many of the ‘accidental’ reasons for VT which lie outside of an organisation’s control (e.g., moving house). QI are also a good measure of psychological or emotional withdrawal when actually quitting may not be an option due to high unemployment etc. (Lee et al., 2008). Finally, while QI merit study in their own right for the reasons outlined above, they also have strong theoretical (Mobley et al, 1979; Hom & Kinicki, 2001) and empirical (Cho & Lewis, 2012; Cohen et al., 2016; Harrison et al., 2006; Nguyen et al., 2022; Steele & Ovalle, 1984; Van Breukelen et al., 2004) links with VT.Footnote 2

The primary purpose of this study is to identify factors that may predict QI formation. While early research in economics focused on wages, hours and personal/work-related characteristics, extant research shows that the effect sizes associated with these factors are typically low to moderate (see Griffeth et al., 2000). Focus has thus switched to the role of non-monetary aspects of the job that influence how workers evaluate and experience working life (Akerlof et al., 1988; Lazear & Shaw, 2007). Collectively these attitudinal factors comprise ‘work utility’ and are typically captured through the latent umbrella construct of worker well-being (WWB). Labour economists typically rely on just one WWB dimension, job satisfaction (JS), to proxy for WWB as a whole. Within the VT literature, JS has also traditionally been used to proxy for ‘job quality’, defined by Clark (2015) as the conditions encountered at work, and the perceived and actual impact of work in terms of enhancing or diminishing WWB.

Organisations rely heavily on employee surveys to gauge how their workers feel about their jobs (Wiles, 2018). The fact that JS measures are both widely-used and demonstrably reliable predictors of QI, raises questions as to the research merit of investigating whether organisations should consider replacing or augmenting JS with other WWB measures in surveys aimed at identifying potential quitters. WWB is, however, incontrovertibly a complex, multidimensional construct that extends far beyond JS (Bryson et al., 2014). Relying solely on JS thus constitutes a missed opportunity for organisations to obtain valuable insights into the role that other aspects of the work experience, e.g., work meaningfulness, may play in shaping QI. To date, there has been little empirical examination of the extent to which the proxying role of JS is justified. This work is necessary to establish whether WWB constructs which are conceptually distinct are also empirically distinct (correlation of < 1; Harter & Schmidt, 2008) with regard to predicting workers’ QI. The current study addresses this gap.

I build on the earlier work of Green (2010) by providing a simple validation test of WWB indices which are used in organisational psychology in order to examine whether they might be suitable for labour market VT research. However, I extend the scope of Green’s investigation by comparing the predictive power of JS not just to that of affect, but also to that of engagement and basic psychological needs satisfaction indicators. In doing so, I address several open questions around the best way to conceptualise WWB in QI functions. If the decision to quit is, as Green (2010) suggests, primarily an evaluative one, then the current reliance on JS may be justified. If, however, QI formation is strongly predicted by affective or psychosocial measures, then it may behove organisations to expand their conceptualisation of WWB and employ additional measures. My primary contribution is to assess the extent to which doing so could potentially afford organisations a ‘head start’ in identifying potential quitters. I use novel survey data to compare, for the first time, the ‘head-to-head’ performance of hedonic well-being (HWB) (JS and affect) and eudemonic well-being (EWB) (engagement and basic psychological needs satisfaction) indicators in relation to explaining QI variation.

While there is some evidence that JS may have greater predictive power than affect, when global, or summative (e.g., ‘over the past month’), WWB questions are used to predict VT (Green, 2010), it remains untested whether this is still the case when experiential, or in viva, measures of affect are employed. As Affleck et al. (1999) point out, relationships between constructs measured at different levels of analysis my represent different psychological processes. Kahneman and Riis (2005 p. 285) characterise humans as having “two selves”—“the remembering, evaluating self” and “the experiencing self”. While the two are correlated, in that an individual’s current affective state will affect her subjective evaluations and vice versa, they are conceptually and empirically distinct.Footnote 3 They argue that comprehensive SWB studies should therefore endeavour to capture both aspects separately using global (evaluative) and experiential (experiencing) measures, particularly if the two SWB conceptualisations have different consequences for decision making in a particular context. As far as QI formation is concerned, this remains an open question. This study contributes uniquely to the SWB measurement literature by including, for the first time, global and experiential measures of JS, affect and basic psychological needs satisfaction within one survey, thus enabling their relative predictive power in relation to QI to be directly compared.

My final contribution is to investigate heterogeneity in QI amongst categories of workers. As Pratt et al. (2022) point out, not all VT is dysfunctional. They argue that extant research highlights the need for organisations to focus on VT composition as much as on raw VT rates, given the demonstrated impact of ‘talent density’ (the percentage of high performers in the workforce) on organisational performance (Hastings & Meyer, 2020). Identifying categories of potential quitters also has important HR policy implications.Footnote 4 The current study contributes to this effort by identifying systematic differences in the personal/work-related and WWB profiles of potential quitters and stayers, and by testing whether particular WWB indicators might be better suited to identifying QI amongst ‘target’ groups of workers that organisations might be particularly keen to retain e.g., high performers, new recruits etc.

I address four research questions in this study, all of which have important implications for organisations seeking to identify and prevent dysfunctional VT. Firstly, does JS outperform other WWB indicators in terms of predicting QI? Secondly, is explanatory power improved by using multiple WWB measures? Thirdly, do global WWB measures perform better than experiential measures in terms of predicting QI? Fourthly, is there any evidence of heterogeneity in the profiles of potential quitters and stayers and / or in the ability of WWB indicators to predict the QI of women, graduates, high-performers, recent hires or senior workers?

The remainder of the paper is organised as follows. Section 2 provides an overview of the conceptual frameworks which inform this paper and reviews the relevant extant WWB and QI literatures. Section 3 describes the data. Section 4 sets out the empirical framework. Section 5 describes the results. Section 6 outlines the robustness checks. Section 7 discusses the results and concludes.

2 Conceptual Framework and Literature Review

2.1 Quit Intentions (QI) and Voluntary Turnover (VT)

QI/VT research has traditionally been dominated by two perspectives, the ‘economic/labour market school’ and the ‘psychological school’ (Morrell et al., 2001). The labour market school has traditionally emphasised the role of externally determined macro-level variables in shaping QI, focusing on topics such as job search, job and wage mobility and person-job matching (e.g., Fehr & Schneider, 2004). The psychological school, on the other hand, has typically focused on identifying and predicting micro-level drivers of the quit decision such as organisational commitment, job involvement, organisational climate, affect and JS. In recent years however, this division has been eroded through a growing focus on the part of labour market school researchers on the role of expected utility in predicting QI/VT. QI are characterised as a function of decision utility, which in turn, is inferred from, and is used to explain, observed choices (Kahneman & Riis, 2005) such as the decision to quit. Decision utility acts as a ‘utility signal’ at the decision-making point (Berridge & O’Doherty, 2014, p. 336). It is determined through an evaluative process, usually proxied by JS scores, in which workers retrospectively rate their overall work experiences over an extended period (‘remembered utility’) and assess their expected future experiences (‘predicted utility’) (Kahneman et al., 1997; Lévy-Garboua et al., 2007).

Early research into the drivers of decision utility focused on monetary ‘pecuniary factors’ (salary, bonus etc.) and personal characteristics. Effect sizes are however generally small,Footnote 5 resulting in increased interest in the role of ‘non-pecuniary factors’ such as work relationships; supervision; type of work; and whether workers enjoy and feel positively challenged by their jobs (Akerlof et al., 1988; Lazear & Shaw, 2007). These factors are typically subsumed under the broad heading of WWB, a latent, and highly subjective construct, which captures the extent to which individuals believe that they have a ‘good working life’, as defined by them. WWB is a ‘sub-utility function’ that captures workers’ subjective evaluations of their jobs and the utility received from all aspects of those jobs (Clark, 1997, p. 191; Clark & Oswald, 1994). It reflects workers’ “post-decisional preference” for their current job (Lévy-Garboua et al., 2007, p. 252), with workers sorting out of jobs that are perceived to drain utility (Blanchflower & Oswald, 1999). The issue remains, however, of how to capture (unobserved) WWB within the utility function.

With a few exceptions (e.g., Green, 2010; Nikolova & Cnossen, 2020), labour economists tend to rely exclusively on JS. This is usually measured by asking workers to rate their level of overall JS using a 0–10 scale (Diener et al, 1985). Dissatisfied workers are assumed to assign a lower present value to their current jobs, resulting in a higher propensity to form QI. QI thus represent a mental representation of, and an action plan to realise, a desired future (Shuck et al., 2015). This approach of using JS to proxy for WWB as a whole has also dominated VT research for the past 30 years in the form of process models (e.g., Hom & Kinicki, 2001; Judge et al., 2017; Lee & Mitchell, 1994; Steers & Mowday, 1981). VT is characterised as a process of gradual physical and psychological withdrawal (Lewis, 2019), which may be preceded by other indicators such as absenteeism, diminished performance etc. (Hulin, 1991; Sablynski et al., 2002). Process models draw heavily on the TPB (Ajzen, 1985, 1991) which posits that behavioural intentions are the best predictors of behaviour. QI (or their variant, ‘withdrawal cognitions’, Hom & Griffeth, 1991) are the most proximal and strongest VT antecedent (Hom et al., 2017). The majority of VT models, including the most dominant, Hom and Griffeth’s Expanded Model (1995; Hom & Kinicki, 2001), seek to determine the ‘intermediate linkages’ between job attitudes (JS) and voluntary quits (Davis et al., 2015; Hom et al., 2017), with JS viewed to be a necessary condition for QI formation.

JS measures have been shown to be relatively stable and to correlate highly with enduring job circumstances (Helliwell & Putnam, 2004), rendering them effective at capturing longer-term evaluations of working life. JS has also been revealed to have high predictive power in relation to QI (Lambert & Paoline., 2010; Ozkan et al., 2020; Rubenstein et al, 2018; Shields & Price, 2002; Shields & Ward, 2001; Sousa-Poza & Sousa-Poza, 2007).Footnote 6 Relying exclusively on JS, however, raises two issues. Firstly, doing so discounts a vast literature (summarised in DeSimone, 2014) which confirms the multidimensionality of WWB. Secondly, while JS coheres well with decision utility, it fails to accommodate ‘experienced utility’ (Kahneman, Wakker & Sarin), which characterises WWB, not as a comparative evaluative judgement between competing job opportunities, but rather as the accumulation of ‘lived’ emotional experiences.

2.2 Subjective Well-Being (SWB)/Worker Well-Being (WWB)

2.2.1 Tri-Partite Model of Hedonic Well-Being (HWB)

To date, the SWB/WWB literature has been dominated by the standard tripartite model of well-being. The tripartite model characterises WWB as comprising two independent constructs, a relatively stable evaluative component and a transient (positive and negative) affective component (Eid & Diener, 2004). Together, these components form HWB, a state in which work-related desires are satisfied and positive emotions are experienced more frequently than negative emotions at work (Disabato et al., 2016). The evaluative dimension is typically captured using single-item JS. With regard to the affective dimension, most WWB researchers adhere to Watson and Tellegen’s (1985) well-supported (e.g., Ekkekakis, 2013) ‘two domain theory’ of affect, which characterises positive affect (PA) and negative affect (NA) as two separate constructs, with divergent and non-overlapping determinants. Individuals may experience high or low levels of PA and NA simultaneously (Larsen et al., 2001) e.g., they may feel simultaneously inspired and stressed when assigned a new project. Affect is often further decomposed using orthogonal axes of pleasure and psychological arousal, which characterise PA and NA in terms of ‘active’ (e.g., excited) and ‘passive’ (e.g., calm) emotions. Active feelings, by virtue of their “energised nature” are hypothesised to be more likely to trigger action e.g., quitting (Patterson et al., 2004, p. 197).Workers who are ‘happy’ at work, will experience PA relatively frequently and NA relatively infrequently.

2.2.2 Eudemonic Well-Being

While HWB, regardless of how it is measured, forms the core component of WWB, there is growing support for the notion that achieving ‘the good working life’ requires more than a pleasurable existence (e.g., Sousa-Poza & Henneberger, 2004).Footnote 7 Kopperud and Vittersø (2008) argue that relying purely on evaluative and affective WWB measures may distort the true work experience by over-simplifying it. Whereas studies which use pure measures of affect typically find working is associated with low levels of experienced pleasure and high levels of experienced displeasure (Bryson & MacKerron, 2017; Kahneman et al, 2004; Knabe et al., 2010), studies which adopt a eudemonic approach typically find work to be a source of growth, personal challenge and meaning (Csikszentmihalyi et al., 2005; Salanova et al., 2006).Footnote 8 Bryson & McKerron speculate that affective measures may mask the complexity of the work experience and the potential for eudemonic components such as purpose, personal growth and autonomy to compensate for low levels of pleasure. Cassar and Meier (2018) emphasise the need to acknowledge the additional, non-pecuniary (psychological) rewards provided by working, in particular the sense of meaning which it affords. This view is echoed by Chater and Loewenstein (2016) who argue that utility maximisation depends not just on valence (the ability of individuals to construe their lives in a positive fashion) but also on sense-making (the ability of individuals to make sense of their lives). Similarly, Kaplan and Schulhofer-Wohl (2018) emphasise the importance of distinguishing between jobs based on the psychological rewards of autonomy and meaning that they provide.

EWB indicators capture workers’ subjective evaluation of their capacity to maximise their potential and flourish (Bartels et al., 2019) at work. It is not sufficient for workers to feel happy and satisfied (Martela & Sheldon, 2019). They also need to feel that they have some input and control (Ryff, 1989), that they are performing meaningful work for, and with, people they like and trust (Baumeister & Leary, 1995), and that they are on a path to personal growth (Deci & Ryan, 2008; Graham & Nikolova, 2015). EWB captures the Aristotelian notion of a well-lived life rich in meaning, purpose and personal fulfilment. Unfortunately, an all-encompassing measure of EWB does not currently exist.Footnote 9 I therefore draw on Martela and Sheldon’s (2019) meta-analysis, which highlights engagement and the satisfaction of basic psychological needs (BPN) as the two most widely used EWB measures, to inform the conceptualisation of EWB that I employ in this paper. In doing so I subscribe to Wijngaards et al.’s (2021, p. 798) categorisation of a WWB construct as eudemonic if “intrinsic motivation, activation, purpose and meaningfulness are at its core (Ryan and Deci 2001)”.

The first component of EWB as conceptualised in this paper, engagement, is a motivational construct which emerged from the field of organizational psychology. It extends the positive psychology concept of ‘flow’ (Csikszentmihalyi 1990), or total immersion in an activity, and applies it specifically to the workplace. It also draws heavily on the Job Demands-Resources (JD-R) model (Bakker & Demerouti, 2007, 2017) which hypothesises that the experience of working is characterised by two opposing forces, demands and resources. Job demands are work characteristics that demand sustained effort from employees, potentially depleting their energy / vigour and thus triggering an associated physiological and psychological cost, whereas job resources are work characteristics that motivate employees by supporting their personal growth and assisting them to achieve their professional goals. While engagement has been conceptualised in different ways, for the purposes of this paper it is defined as a positive and fulfilling affective-cognitive state of mind which is characterized by high levels of dedication, vigour, and absorption (Bakker and Demerouti, 2008; Schaufeli et al., 2002). Engaged employees exhibit high levels of intrinsic motivation, commitment and “focused energy” (Carter et al., 2018, p. 2484). They view their work as a positive challenge. The opposite of engagement, disengagement, represents a state of detachment and disconnectedness, in which workers may question the value, purpose or meaning of their jobs. Disengagement, combined with chronic exhaustion, may produce work dis-utility in the form of burnout, a state of psychological distress.

A burgeoning body of research summarised in Straume and Vittersø, (2015) supports the idea that HWB and EWB are corelated but distinct dimensions of WWB and play different roles in the development and maintenance of ‘the good working life’. Kopperud and Vittersø, (2008) summarised a large body of evidence, including distinctive physiological correlates, which supports the independence of engagement from its evaluative (satisfaction) and affective (happiness) counterparts. For example, Thorsteinsen and Vittersø, (2020) showed that, whereas pleasure acts as a reward which increases the likelihood of a behaviour being repeated, engagement serves to keep individuals committed to difficult goals when faced with (unpleasant) setbacks. Similarly, Straume and Vittersø (2012) reported that while challenging tasks reduce pleasure, they simultaneously increase engagement and inspiration.

The second component of EWB as conceptualised here, the satisfaction of BPN, captures functional and psychological WWB. Basic Psychological Needs Satisfaction Theory (BPNT) draws heavily on Carol Ryff’s (1989) developmental well-being model which focuses on ‘self-realisation’, a concept which has, in turn, been linked to performing meaningful work (Martela et al., 2021). It is also strongly influenced by Self-Determination Theory (SDT; Deci & Ryan, 2000; Ryan & Deci, 2000, 2017; Deci et al., 2017) which aims to identify socio-contextual factors in the workplace that boost workers’ levels of ‘autonomous’ (intrinsic) motivation and eudaimonia in the form of optimal psychological and behavioural functioning (Ryan & Deci, 2017 p. 3).Footnote 10 BPNT posits that while humans naturally strive to achieve personal growth, they can be supported to do so through the provision of environmental ‘nutriments’ or resources / supports which enable their shared evolved, and distinct, ‘basic’ needs for autonomy, relatedness and competence (BPN) to be met within the work environment. Providing conditions that favour BPN satisfaction should in turn foster more autonomous motivation (Fernet et al., 2020; Vansteenkiste et al., 2020).

The need for autonomy refers to workers’ need to be able to ‘own’ and endorse their actions (Sheldon & Niemiec, 2006). The need for competence (White, 1959) is met when workers experiencea sense of mastery over their environment and are able to perform effectively. Relatedness (Baumeister and Leary, 1995) refers to the need to feel understood, cared for and appreciated at work. The extent to which BPN are satisfied or thwarted depends on workplace conditions and individual differences in disposition, locus of control etc. BPN satisfaction results in significantly higher levels of self-determined motivation amongst individuals (Van den Broek et al., 2016). Failure to meet any one of these basic needs, however, will directly result in reduced self-motivation and well-being, causing individuals to languish instead of flourish (Nikolova & Cnossen, 2020).Footnote 11 The Basic Psychological Needs Satisfaction at Work scale (BPNSW; Deci et al., 2001) measures the extent to which BPN are successfully met at work.

2.2.3 Global and Experiential WWB Measures

WWB measures can be further decomposed by their underlying temporal structure. Global measures capture workers’ beliefs about the typical patterns of WWB that they experience at work. They operate on a remembered basis (Bakker & Oerlemans, 2011), requiring respondents to retrieve and evaluate memories of previous work experiences. For example, individuals may be asked to report the extent to which they experienced particular emotions ‘over the past month’. Global measures are criticised for being influenced by peer comparisons and recall biases such as peak-end bias (Fredrickson & Kahneman, 1993) and duration neglect (Redelmeier & Kahneman, 1996). As the length of time between the actual experience and the self-report increases, the details become harder to remember, causing respondents to rely more on semantic knowledge and situation-specific beliefs than episodic memory (Robinson & Clore, 2002). In addition, global measures can be affected by socially pervasive, but frequently erroneous, normative beliefs of what ‘should’ constitute a good working life (e.g., earning a high salary) (Dolan, 2014).

Experiential measures capture short-term, in-the-moment, context-dependent shifts in WWB. They measure discrete moments of experienced or ‘instant utility’, the integral of which, weighted by duration, comprises ‘total utility’ over an extended period of time (Kahneman & Riis, 2005). By capturing within-person changes in affective states, experiential measures provide a more “textured tool” for understanding the ups and downs of daily working life (Helliwell & Barrington-Leigh, 2010, p. 734). Respondents’ very recent personal experience serves as the baseline when rating their current happiness (Reis et al., 2000), thus avoiding recall biases (Robinson & Clore, 2002). While the Experience Sampling Method (ESM) (Larson & Csikszentmihalyi, 2014) is the gold standard in experiential measurement,Footnote 12 it is not always feasible due to its potentially prohibitive time and cost burden (e.g., Eisele et al., 2020). The Day Reconstruction Method (DRM) (Kahneman et al., 2004) offers a less invasive alternative. While it stops short of capturing live feelings, it comes close by using a structured questionnaire/diary technique which induces participants to reconstruct emotions experienced during one or more work ‘episodes’ the previous day. It also incorporates procedures (e.g., obtaining separate estimates of episode durations) that reduce the impact of known biases. DRM self-ratings have been shown to substantively replicate those obtained with ESM (e.g., Dockray et al., 2010; Tweten et al., 2016).Footnote 13

Global and experiential measures have been shown to be separable (Krueger & Schkade, 2008) and differentially determined (Hudson et al., 2016). However, despite growing evidence of within-person variations in WWB (e.g., Fisher, 2000; Ilies & Judge, 2002; Ilies et al., 2009; Shi et al., 2021), global measures, which tap the remembering, evaluative and stable self, continue to dominate the WWB literature, a finding which may reflect a general over-reliance on the standard single-item measure of JS. While most measures of EWB rely on global measures, it is also possible, though far less usual, to measure moment to moment fluctuations.Footnote 14 The fact that WWB is mainly investigated as a static phenomenon may mask important within-person fluctuations over the course of the working day.

2.3 Literature Review—QI and WWB

As noted in 2.1 and 2.2, extant QI research has historically relied on JS to proxy for work-related utility / WWB and job quality. A large body of evidence documents a negative link between JS and QI/VT, supporting the central claim of VT process models that JS is a reliable predictor of QI (e.g. Feng et al., 2022a; Griffeth et al., 2000; Warden et al., 2021; Yang et al., 2022). For example, Madigan and Kim’s (2021) recent meta-analysis reported a significant negative relationship between JS and the QI of teachers (ρ = − 0.40). However, there is some evidence that the relationship between JS and QI may be moderated by external factors such as the unemployment rate (Trevor, 2001) and that dynamic measures of JS may offer more predictive power.Footnote 15

As noted in 2.2, however, JS is just one sub-component of WWB. Several theories, for example Affective Events Theory (AET; Weiss & Cropanzano, 1996)Footnote 16 and the System Feedback Model (Baumeister et al., 2007), posit that evaluation and affect work in tandem. Emotional experiences trigger the cognitive processing of those experiences, which in turn forges new learned associations or ‘affective traces’ which are ‘fed back’ to inform specific future action tendencies e.g. approach or avoidance. In the context of QI, a worker may receive negative feedback from her boss for a project on which she has worked hard. She feels angry and frustrated (affective trace). This prompts her to reflect on how to avoid a similar outcome in the future (evaluation) by creating a series of if–then rules for future behaviours, one of which may involve quitting in the event of a reoccurrence (specific action tendency).

Despite a strong theoretical argument for investigating potential links between affect and QI however, this issue was, until recently, largely bypassed in the QI literature, with some notable exceptions e.g. Green (2010). Green used a pre-existing dataset to compare the relative predictive power of JS and affect in relation to VT. He found that while PA and NA both predicted quits, they were outperformed by JS. He attributed this finding to the fundamentally evaluative nature of JS which renders it more suitable for capturing the complex cognitive processes engaged when workers consider quitting. A growing number of studies have, however, since documented the important role played by affect in shaping QI (e.g., Feng et al., 2022b; Gordon et al., 2018). For example, Hong et al., (2021) found that PA positively predicted engagement, which in turn predicted QI amongst Chinese workers in the IT sector. The authors attributed their results to Trait Activation Theory (TAT; Tett et al., 2013) which conceptualises work engagement as a state of high activation PA combined with increased job motivation. Green’s (2010) early work was also limited to the extent that it relied exclusively on global affective measures. Shi et al. (2021) recently addressed this limitation by drawing on AET to design an experiential ESM study which revealed that day-level within-person PA and NA predicted within-person variance in QI.

Despite a growing body of research summarised in Sect. 2.2 which indicates that EWB measures are distinct from evaluative and affective measures (Tov & Lee, 2016), and may be particularly relevant for the work context, there has been relatively little by way of empirical validation of a direct link between EWB and QI/VT to date. As regards engagement, engaged employees are posited to be more committed to their organisations and should thus report lower QI (Schaufeli & Salanova, 2008). Halbesleben’s (2010) meta-analysis revealed a significant negative correlation between overall engagement and QI of ρ = − 0.22 (N = 1893), a result which was supported by Schaufeli and Bakker’s (2004) finding that highly engaged Dutch employees experienced fewer QI than their less engaged colleagues. More recent studies (Agarwal et al., 2012; Albrecht et al., 2015; Bailey et al., 2017; Bakar et al., 2021; Juhdi et al., 2013; Karatepe, 2013; McCarthy et al., 2020; Memon et al., 2020; Park & Gursoy, 2012; Sandhya & Sulphey, 2020; Shuck et al., 2014; Zhang et al., 2018) have also confirmed a significant negative relationship between engagement and QI /VT.

At the organisational level, high aggregate levels of engagement are also directly, or indirectly, negatively associated with QI (Du Plooy & Roodt, 2010; Harter et al., 2002; Han et al., 2022). For example, Harter, Schmidt & Hayes found a significant correlation of ρ = − 0.30 between engagement and VT. There is also evidence that engagement may mediate the relationship between QI and other factors. Sheehan et al. (2019) found that the significant link between psychological contract fulfilment and  QI of Australian nurses was mediated by engagement, whereas Shin and Jeung (2019) found that engagement mediated the relationship between proactive personality and QI amongst Korean workers. In China, Fu et al. (2022) found that engagement significantly mediated he relationship between SWB and QI amongst teachers. Finally, significant links have been reported between engagement and other withdrawal behaviours such as absenteeism (e.g., Karatepe & Olugbade, 2016; Truss et al., 2013), and between work meaningfulness and QI (Dechawatanapaisal, 2022; Koh & Joseph, 2016; Leunissen et al., 2018; Sun et al., 2019; Uriesi, 2016).

In relation to BPNT, SDT proposes that for workers to voluntarily remain in their jobs, they must perceive that they are able to behave intentionally and of their own volition (Austin et al., 2020). Several studies have shown that workers who exhibit high levels of autonomous (intrinsic) motivation are more committed to their organisations and are less likely to report QI (e.g., Austin et al., 2020; Fernet et al., 2021; Gagné et al., 2015; Li & Yao, 2022; Miligi et al., 2020; Pham et al., 2021; Volmer et al., 2012). A hypothesised link between relatedness and VT is also supported by both Socialization Theory (e.g. Bauer et al., 2007) and Job Embeddedness Theory (Mitchell et al., 2001), which posit that the more ‘adjusted’ workers are to their work, and the more ‘connected’ they feel to their colleagues and organisations, the more ‘stuck’, they will feel in their jobs, and the less likely they will be to quit (Allen, 2006; Burrows et al., 2022; Mitchell et al., 2001; Peltokorpi et al., 2022; Self & Gordon, 2019; Tanova & Holtom, 2008).

Until recently, however, very few studies have directly examined the links between satisfaction of BPN and QI. Initial indications support a negative relationship. For example, Van den Broeck et al. (2016) reported a significant negative correlation between QI and the satisfaction of the needs for relatedness (ρ = − 0.21), competence (ρ = − 0.05) and autonomy (ρ = − 0.31). De Clerck et al., (2022) found a negative relationship between BPN satisfaction and QI (ρ = − 0.39) and a positive relationship between BPN frustration and QI (ρ = 0.51), whereas Heyns et al. (2022) reported a negative relationship between BPN satisfaction and QI of South African pharmaceutical employees (ρ = − 0.65). Trépanier et al., (2015) found that satisfaction of BPN increased engagement and hindered QI over time. Finally, Vansteenkiste et al., (2007) used mediation analysis to demonstrate that BPN satisfaction explained the negative relationship between extrinsic motivation and QI, a finding which is consistent with SDT’s contention that a strong focus on wages detracts from BPN satisfaction, reducing WWB.

It is also plausible that BPN satisfaction may influence QI indirectly. The Motivational Model of Work Turnover (Richer et al., 2002; Vallerand, 1997) proposes that perceptions of BPN satisfaction, influence self-determined intrinsic and extrinsic motivation, which in turn influence JS and, by extension, QI. This claim is supported by evidence that greater autonomy at work is associated with higher JS (Bartling et al., 2013; Benz & Frey, 2008). Competence, the feeling which occurs when workers are able to apply their skills to achieve a desired goal, has also been linked with JS (Cassar, 2010; Hundley, 2001). Finally, relatedness, or feeling more connected to work colleagues and having more positive social relations at work, has also been shown to increase JS (Morgeson & Humphrey, 2006). There is also evidence that BPN satisfaction may act as a mediator between QI and other factors such as role ambiguity (e.g., Boudrias et al., 2020).

Based on the theory and literature review outlined above, I propose and test the following four research hypotheses:

  • H1: Positive WWB indicators (JS, PA, engagement, BPN satisfaction) will be significantly negatively associated with QI

  • H2: Negative WWB indicators (NA) will be significantly positively associated with QI.

  • H3: While a high degree of overlap between WWB indicators is expected, the conceptual distinctiveness of the WWB indicators employed in this study should ensure that incorporating additional WWB measures over and beyond JS in the model will significantly increase explanatory power in relation to predicting QI.

  • H4: Due to their more evaluative nature, global measures will significantly outperform experiential measures in terms of QI predictive power

3 Data and Methodology

3.1 Survey Design and Sample Characteristics

I employ a novel survey which was specifically designed to capture the multidimensionality of WWB. The survey was piloted using a convenience sample (n = 30) to inform the development of the protocol. The final survey was issued online to 994 participants sourced by Prolific Academic, an online survey-panel provider.Footnote 17 The survey was completed online between 25/11/2019 and 19/2/2020.Footnote 18 Due to the study’s focus on WWB, the sample comprises full-time workers based in the UK.Footnote 19Footnote 20 Standard Prolific pre-screening criteria were used to recruit respondents between 18 and 65 years, who were engaged in full-time paid employment for more than 2 months, in organisations with 5 or more workers, for at least 21 hours per week.

Table 1 depicts the main characteristics of the sample. It also compares the key demographic variables to those of a worker sub-sample from the nationally representative UK Understanding Society dataset used in Wheatley (2021). Compared to Wheatley, the current sample contains a higher proportion of women, university graduates and workers in the 25–39 age bracket.

Table 1 Personal and work-related descriptives

3.2 Measures

The outcome variable is QI. Respondents are asked “Are you actually planning to leave your job within the next six months?”. Responses are “Yes” (17.5%), “No” (59.9%) and “Not sure” (22.6%). The independent (WWB) variables are described in Table 2. To exclude potential confounding effects, I also control for a wide range of personal and work characteristics which are supported by an Imai et al. (2010) causal mediation analysis.Footnote 21 Demographic covariates comprise age, gender, education and parental status. Personality is assessed using the validated (Lovik et al., 2017) 10-item Big-5 Inventory-10 (Rammstedt & John, 2007) which assesses 5 dimensions: neuroticism, openness to experience, agreeableness, conscientiousness and extraversion. Work-related covariates comprise net monthly salary (GBP’000); self-reported hours worked the previous monthFootnote 22; self-assessed seniority (0–5 self-rating scale, where 5 = “most senior”) and tenure (years working at the organisation to date).Footnote 23 See Table 1 for more details. 7 observations with missing values are excluded from the baseline analysis (65 observations when controls are included). No discernible pattern is detectable.

Table 2 Description of measures

4 Empirical Framework

I adopt the standard approach used in the economics literature (e.g., Clark & Oswald, 1996; Shields & Ward, 2001) and specify QI as a function of personal and work characteristics and of work utility.Work utility is in turn characterised as a function of the total non-pecuniary benefits derived from work. It is proxied by WWB and is assumed to guide the quit decision (Green, 2010). Equation 1 is estimated to isolate the impact of WWB on QI formation:

$$Q_{i} = \, \beta_{0} + \, \beta_{1} WWB_{i} + \, \beta_{2} X_{i} +_{{}} \beta_{3} \lambda_{i} + \, \varepsilon_{i}$$
(1)

where Qi is the probability of worker i intending to quit within the next 6 months; β0 is the intercept; WWBi is the self-reported WWB of worker i; Xi is a vector of personal characteristics; λi is a vector of work characteristics which includes wages and hours and ɛi is an error term. The parameter β1 captures the change in the probability of worker i intending to quit which is associated with a one standard-deviation increase in WWBi. In line with the criterion validity test of a well-being indicator, β1 is hypothesised to be negative for ‘positive’ WWB indicators (JS, PA, engagement, relatedness, competence and autonomy) (H1) and positive for ‘negative’ indicators (NA) (H2). Rather than use a single latent factor of WWB, I examine 4 different proxies for WWB so as to identify those WWB measures which are likely to be most relevant for organisations concerned with identifying and preventing dysfunctional turnover.

In line with most studies which use QI as the dependent variable, I employ a cross-sectional design. As such, I cannot rule out the existence of unobserved individual level factors which predict both WWB and QI (e.g., risk preferences; locus of control) and I make no claims to a causal interpretation of the results. While the analysis would benefit from the use of a panel dataset which controls for time invariant unobserved heterogeneity and which includes a similarly wide range of WWB measures, unfortunately no such dataset currently exists. However, I estimate the results controlling, and not controlling, for a wide range of personal and work-related covariates, including personality.

For ease of interpretability, in my main analysis I use a binary linear probability model (LPM) to estimate the baseline specification (Eq. 1 excluding Xi and λi) which isolates the effect of WWB on QI, holding all other variables constant at their means. Following Green (2010), I merge “Yes” and “Not Sure” responses (coded 1) to the QI question.Footnote 24No” responses are coded 0. The alternative would be to exclude the “Not sure” responses, or to analyse them as a third outcome. The impact of adopting either of these approaches is depicted in S1 and S2 respectively. I use the Huber-Sandwich-White correction to ensure standard errors are robust to heteroskedasticity. The Benjamini and Hochberg (1995) multiple inference method is used to control the false discovery rate (the proportion of significant results that represent false positives).Footnote 25 For all models, I investigate which WWB indicator best fits the data by comparing explanatory power (R2) and goodness of fit (Bayesian Information Criterion, BIC), with and without controls (Xi and λi).

Next, I examine whether a composite multi-dimensional WWB model outperforms the standard unidimensional JS model. I employ a hierarchical (stepwise) regression model comparison framework, in which a composite regression model is built by gradually adding WWB indicators to the previous model at each step, starting with JS. The hypothesis is that additional specifications should significantly increase the explanatory power (R2) and goodness of fit by capturing a larger proportion of variance in the outcome variable than a model which relies solely on the base measure, JS. I use the Stata hireg command (Bern, 2005) to formally test the null hypothesis that there will be no difference in the explanatory power offered by a composite model (JS + additional WWB indicators) versus the base model (JS). If the R2 of the later model is significantly higher than the R2 of the earlier model, then the later model is assumed to offer a better fit. I gradually introduce additional WWB indicators, changing the order of inclusion each time to take account of potential sequence sensitivity (Gelbach, 2016). I also investigate the impact of changing the base measure from JS to engagement. Finally, I re-run the main analyses replacing the global WWB indicators used in the main analysis with their experiential equivalents.

5 Results

5.1 Descriptive Statistics

17.5% of workers report positive QI, 59.9% report negative QI and 22.6% are not sure. Bonferonni-adjusted bivariate correlations are provided in S5.Footnote 26 All correlations are signed in line with standard WWB theory. Global JS is highly (ρ > 0.5) or moderately (0.2 ≤  ρ ≤ 0.5) positively correlated with all WWB indicators, except for experiential NA ( ρ = − 0.17).

Following Green (2010), I investigate systematic differences in the profiles of ‘Quitters’ (workers who respond “Yes” or “Not Sure” to the QI question) and ‘Stayers’ (workers who respond “No”). The results are outlined in Table 3. Columns 2–4 depict the mean scores/responses associated with the personal/work-related and WWB variables for all workers, Quitters, and Stayers respectively. Column 5 reports the p-value from a t-test or chi-test of identical means. I find that, in line with extant research, Quitters are on average younger, less  likely to be parents and more likely to earn a low monthly salarythan Stayers. They also on average more likely to be  less senior and to report   lower tenure and self-rated performance. In terms of personality, Quitters score on average higher on openness and neuroticism and lower on extraversion, conscientiousness and agreeableness than Stayers. They are also more likely to have a mental health condition and to rate their mental health lower than Stayers.

Table 3 Quitters v Stayers

I also find evidence of systematic differences in the WWB profiles of Quitters and Stayers. Quitters report significantly lower scores for all positive WWB indicators, and higher scores for all negative WWB indicators, than Stayers. As WWB indicators are ordinal rather than cardinal, an engagement score of 2 cannot be interpreted as being twice as high as a score of 1. I thus show the percentage of Quitters and Stayers who score ‘highly’ in respect to each of the outcomes (≥ 75th percentile) in the bottom-half of Table 3. I find that a higher percentage of Quitters fall into the top (bottom) quartile of all negative (positive) WWB scores. I find the largest raw differences between Quitters and Stayers in relation to JS (24.7% v 65.6%) and engagement (50.7% v 15.8%). WWB differences between the two groups are significant at the 1% level across all indicators.

I refine this analysis in columns by decomposing Quitters into ‘Definite Quitters’ (“Yes” responders) and ‘Possible Quitters’ (“Not Sure” responders) and testing for differences between the two groups. The results are set out in columns 6–8 of Table 3. On average, Definite Quitters are significantly younger, more likely to have been working for their organisation for less time and to report lower self-rated performance scores for the past month than Possible Quitters. Definite Quitters also report higher (lower) scores on average across  all positive (negative) WWB measures than Possible Quitters, suggesting an element of progression. In particular, Definite Quitters report significantly lower levels of JS, PA, engagement, competence, relatedness and autonomy and significantly higher levels of NA, than Possible Quitters.

5.2 Regression Analysis

5.2.1 Main Regression Analysis—Individual Indicators

Next, I formally test the relationship between WWB indicators and QI using multivariate regression analysis. I compare the extent to which 4 (global) WWB indicators: (1) JS (2) Affect (PA + NA); (3) engagement and (4) BPN satisfaction (relatedness + competence + autonomy) explain variations in QI, with and without controls. A positive coefficient implies that a one standard deviation increase in the WWB measure increases the probability of QI. The results (with controls) are set out in Table 4. With the exception of relatedness, all WWB measures are significantly associated with QI at the 1% level adjusted for multiple inference. All WWB coefficients are signed in line with expectations and the extant VT literature, with higher levels of positive WWB indicators (JS, PA, engagement, BPN) associated with reduced QI, and higher levels of negative WWB indicators (NA) associated with increased QI. This meets the criterion validity requirements of a WWB measure and supports the assumption outlined in the Empirical Framework section that workers are using perceived levels of WWB to guide QI formation. It also allows me to accept H1 and H2, namely that positive (negative) WWB indicators are significantly negatively (positively) associated with QI.

Table 4 Main regression analysis: head-head comparison of links between Global WWB indicators and the probability of QI (binary outcome; standardised scores)

Individual effect sizes (without controls) range from 0.017 (relatedness) to 0.232 (engagement). Following Mehmetoglu and Jakobsen (2016),Footnote 27 I find large effects for JS and engagement, a medium effect for NA and small effects for PA, relatedness, competence and autonomy. The engagement measure produces the largest individual effect. A one standard deviation (2.2 units) increase is associated with a 22.2 percentage point decrease in QI, even after incorporating a wide range of controls, including wages and hours.Footnote 28 The effect size associated with the standard JS measures is, however, only marginally smaller, at 0.215. All WWB indicators offer greater predictive power than wages and hours combined, which explain just 2.7% of variation in QI. Consistent with previous research, earning more, working longer hours, being a parent, not being university educated and scoring lower on the Big-5 openness trait, are significantly negatively associated with QI. In terms of overall explanatory power (R2) and fit, I find that engagement outperforms JS, explaining 25.2% of variation in QI (22.% without controls) versus 23.8% for JS (21.1% without controls), and producing a lower BIC and RMSE. The WWB indicators are ranked as follows in terms of explanatory power: (1) engagement (2) JS (3) affect and (4) BPN.

To glean further insights into the relationship between WWB and QI, I analyse the affect, engagement and BPN sub-scale items. In relation to PA, three emotions appear to primarily drive the negative relationship with QI, namely enthusiastic (− 0.091; p < 0.001), inspired (− 0.028; p = 0.092) and at ease (− 0.036; p = 0.025). Similarly, for NA the positive relationship with QI appears to be driven by three emotions, namely depressed (0.063; p < 0.001), dejected (0.058; p = 0.001) and despondent (0.039; p = 0.018). Interestingly, while high activation positive emotions appear, on balance, to have greater explanatory power in relation to QI, a finding which is in line with AET, the opposite is the case for NA. This may reflect reverse causality if people who have already formed QI are forced to stay for reasons outside of their control, resulting in negative emotions.

In relation to engagement, the extent to which workers talk about their work in a negative way (− 0.173, p = 0.020), find their work to be a positive challenge (0.064; p = 0.016), can’t imagine doing any other type of work (0.055; p = 0.002) or feel engaged in their work (0.096; p < 0.001) appears to be particularly relevant in terms of shaping QI formation.Footnote 29 Finally, in relation to BPN, workers who report feeling a sense of accomplishment (− 0.045; p < 0.001), who get a chance to show how capable they are (− 0.032; p = 0.002) and who believe that their feelings are taken into consideration work (− 0.068; p < 0.001) report lower QI.Footnote 30 Somewhat surprisingly, workers who are told they are good at their job (0.024; p = 0.038) and who feel free to express their opinions (0.030, p = 0.038) are more likely to form positive QI. This may reflect higher levels of self-confidence and self-efficacy, which could in turn be independently positively associated with QI if those same workers feel that there are a wider range of job options available to them.

5.2.2 Hierarchical Step-Wise Regression—WWB Model Comparison

I next examine whether using a composite model increases explanatory power in relation to QI. Table 5 depicts the results. Starting with the standard WWB measure, single-item JS (column 2), I gradually introduce affect, engagement and BPN, changing the order each time. I find little evidence of sequence sensitivity. However, combining affect with any other measure reverses the sign of PA and renders it non-significant. To identify the ‘best’ model, I use the Stata hireg command (Bern, 2005) to test for significant differences in R2 across the 4 models. Based on the criteria of highest R2 and lowest RMSE and BIC, Model 6 (column 7 in Table 5) (JS + affect + engagement + controls) is optimal, explaining 29.4% of variation in QI, versus 23.8% for JS + controls. Using a composite model significantly increases explanatory power by 5.6 percentage points.Footnote 31 This result allows me to accept H3, namely that incorporating additional WWB indicators over and beyond JS will significantly increase explanatory power in relation to predicting QI.

Table 5 Hierarchical Regression Model (binary QI; standardised variables)

5.2.3 Heterogeneity Analysis

Organisations may be concerned about retaining particular sub-groups of workers. I thus re-run the main regression analysis, incorporating interaction terms to investigate heterogeneity in the predictive power of alternative WWB indicators for university graduates, women, high self-rated performers, recent hires and senior workers. Table 6 outlines the results. I find limited evidence of heterogeneity. The JS, engagement, PA and competence measures all appear to offer greater predictive power in relation to longer-tenured employees. Interestingly however, I also find a larger competence effect for less senior workers. This may reflect the fact that more senior workers are more confident in their own ability to perform their jobs effectively, thus rendering competence a potentially less salient issue for them in relation to QI formation, whereas less senior workers may intend qutting due a lack of promotion opportunities, something which they may, in turn, attribute to a lack of perceived competence on their part. Similarly, autonomy has more predictive power for low performers, possibly due to the fact that they are performing poorly as a result of working in jobs/roles which do not cohere with their own sense of self and personal goals.

Table 6 LPM—Heterogeneity analysis

5.2.4 Global v Experiential

Finally, I compare the predictive power of global and experiential measures. Table 7 depicts the results. While all experiential coefficients are significant and are signed identically to their global equivalents, they are generally smaller and are outperformed by global measures in terms of the standard criteria (higher R2, lower BIC, lower RMSE). Furthermore, a composite model containing experiential measures + disengagement explains just 13.3% of variation in QI versus 29.4% for the global equivalent. The experiential affective measures appear to be more stable than their global counterparts when combined with other WWB measures, with experiential PA remaining significant and negatively signed throughout.

Table 7 Main regression analysis—global v experiential measures (without controls). Head-head comparison of links between global and experiential worker well-being models and QI (binary outcome; standardised scores)

As a final piece of analysis, I therefore investigate whether using experiential instead of global affect in the composite model improves explanatory power. I find no material differences between the two specifications. These findings allow me to accept H4, namely that global measures will significantly outperform experiential measures in relation to predicting QI.

6 Robustness Checks

I perform a number of robustness checks as part of my analysis. First, I address the issue of sample non-representativeness by re-running the main regression analysis using weights for gender, age and education taken from Wheatley’s (2021) nationally representative sub-sample of non-self-employed workers in the UK (see Table 1). The results are set out in S6. Using weights does not materially affect the sign, magnitude or significance level of any of the individual WWB coefficients, with the exception of BPN. Using a weighted sample, produces a larger and significant negative effect for relatedness on QI. In terms of overall explanatory power, using a weighted sample increases R2 across the board by a range of approximately + 1% (engagement) to approximately + 4% (BPN).Footnote 32

I then test the appropriateness of merging Yes and Not Sure responses by examining the impact of excluding Not Sure responses. I estimate a binary LPM (No responses are coded as zero and Yes responses are coded as 1). The results are outlined in S1. While the coefficients are identically signed, with similar p-values, excluding the Not Sures increases the effect size of all WWB measures substantially, almost doubling the magnitude of the engagement and JS coefficients, to − 0.43 and − 0.42 respectively. It also increases the R2 of all indicators, in particular JS and engagement, both of which increase by approximately 7%, to 30.4% and 31.8% respectively. Next, I relax the assumption of linearity, assume that the error terms of Eq. 1 have a logistic distribution, and estimate a logistic regression. The results of the logistic model (marginal effects) are reported in S4. I find no differences in the signs of the coefficients in the LPM and logistic regression modes, although effect sizes are generally smaller, and explanatory power lower, in the logistic model than in the LPM model.

I then drop the binary model assumption by unbundling the Yes and Not Sure responses and using all 3 category responses (Long & Freese, 2006). I first run a standard OLS model (see S2). I then run an ordered logistic model, which assumes that the responses are ordered, increasing from No to Yes, with Not Sure positioned in-between (see S3 for the ordered logistic marginal effects), and finally a multinomial logistic regression model. The binary logistic model fits the data better than either the multinomial or ordered logistic models in that it produces higher R2 values and lower log likelihood figures for all measures. Furthermore, (as shown in S3) the coefficients for the Not Sure responses are signed identically (but with smaller magnitudes) to the Yes coefficients across all measures, providing empirical support for the merging of these two categories in the main analysis.

Finally, I draw on Boes and Winkelmann (2006) by relaxing the parallel regression assumption and running a generalised ordered logistic regression model. Unlike the standard ordered logit model, the generalised ordered logistic model does not assume that the relative magnitudes of the effects of each of the explanatory variables are constant across the distribution of single-scale responses. I find no material differences in the sign or magnitude of the marginal effects in the generalised ordered logistic model as compared to the ordered logistic model. Yes and Not Sure responses are signed the same (opposite to No responses). In general, the generalised ordered logistic model has more predictive power for Yes and No than Not Sure responses and fits the data marginally better than both the multinomial and ordered logistic models. However, it significantly underperforms both the logistic and LPM models.

7 Discussion

My results build on earlier work in organisational psychology and labour economics by clearly demonstrating the crucial role played by WWB in shaping QI. I find that WWB indicators explain between 11.6% and 22.5% of variation in QI (without controls), compared to just 2.7% for wages and hours combined. Furthermore, I find systematic differences in the WWB profiles of Quitters and Stayers across all indicators examined, with Quitters reporting significantly lower WWB than Stayers. Furthermore, there is evidence of progression, with Definite Quitters reporting lower WWB than Possible Quitters across all WWB measures. This is encouraging for organisations as it supports the process model characterisation of VT as a gradual withdrawal process, which can potentially be halted, or even reversed, through the judicious use of targeted WWB interventions, particularly when increasing salaries may not be feasible.

The primary aim of this paper is to examine whether the historic use of JS as a proxy for WWB in the VT and labour economics literatures is empirically justifiable. My results favour a (qualified) affirmative response. While the engagement measure outperforms JS in terms of effect size, explanatory power and fit, the difference between the two measures is marginal. However, while the standard single-item JS measure produces just a single figure by way of output, analysing the engagement sub-scale items reveals potentially actionable insights into aspects of the work experience which employees are struggling with. Interestingly, despite the fact that JS and engagement are highly positively correlated, combining the two measures explains a significantly higher proportion of variation in QI than either measure on its own, suggesting that, while the two measures overlap, there is still value to be added in using both. This may reflect the fact that the engagement indicator causes workers to focus on specific aspects of their day-to-day experiences at work, whereas the single-item JS measure asks workers to reflect on and evaluate their experiences of working overall.

With regards to my second research question, namely does using additional WWB indicators improve explanatory power, my results suggest that WWB measures may act as complements. Combining JS with any additional WWB measure increases explanatory power significantly. Organisations may, however, need to trade-off increased predictive accuracy with pragmatism. Multi-faceted measures are more costly and burdensome, increasing the risk of non- and / or incomplete responses. If the organisation’s aim is to merely identify a future potential VT problem, then single-item JS may suffice. However, where organisations do not face cost or time constraints and / or are keen to obtain more nuanced insights which can be fed back into the design of interventions which are targeted at potential quitters, then incorporating additional multifaceted measures, in particular engagement, may be justified.

My results suggest that, given the wide variation in explanatory power associated with different WWB indicators, the choice of measure matters when predicting QI. In relation to my third research question, namely do global measures outperform experiential measures in relation to QI prediction, I find that, as hypothesised, global WWB measures outperform experiential measures across the board. This finding, when combined with the low effect sizes found for all affective measures, lends support to Green (2010)’s conclusion that global measures may be inherently better suited to predicting QI than experiential measures due to their more evaluative nature. My results suggest that what workers think about their jobs matters more than how workers actually experience their jobs moment-to-moment when it comes to QI formation. It is, however, possible that this result may reflect the temporal framing of the QI question used in this study (6 months), and that experiential measures could prove more effective were a more immediate, or no, timeframe to be employed. It is also plausible that this finding only holds up to a certain ‘threshold’, and that if the lived daily reality of working life were to become sufficiently miserable and / or lacking in purpose, autonomy, sense of accomplishment or social support, it could eventually become intolerable, forcing workers to act with their feet and quit.

VT is only problematic if organisations are losing employees who they wish / need to retain, for example high performers. Information as to the relative superiority of particular WWB indicators in predicting QI amongst target sub-groups of employees could therefore be extremely valuable for organisations. My final research question thus examines whether there is any evidence that particular WWB indicators might be better suited to predicting the QI of women, graduates, high-performers, recent hires or more senior workers. I find limited evidence of heterogeneity using self-reported data. Future research which incorporated third-party data may, however, yield fresh insights in this regard.

The paper has a number of limitations which could be addressed by future research. The sample is, by design, non-representative. While the evidence that ‘professional’ survey participants differ from other survey participants is mixed (Hillygus et al., 2014; Huff & Tingley, 2015), the participants may differ systematically from the ‘average’ UK worker which would detract from wider generalisability. The fact that my main results still hold using a re-weighted sample provides some comfort in this regard however, at least in relation to the UK labour market. An obvious direction for future research would be to target a more diverse online sample and/or to extend the study to a field setting/other labour markets. Secondly, the data comprises cross-sectional, self-rated scales, which may raise concerns about self-report and recall bias and which limits the conclusions to conditional correlations. While subjective data is essential to uncovering subjective perceptions of how well one is doing at work, using multiple data points and/or additional longituidinal quit data would strengthen internal and external validity. Finally, it is possible that using alternative WWB indicators to the ones employed in this study, for example a dynamic or multi-faceted JS measure, would produce a different ranking of WWB indicators. It is also possible that the regression analyses used in this paper may mask potential indirect links between the WWB indicators and QI. Further studies, in particular mediation analyses, would yield valuable insights in this regard.

That said, my findings have important implications for organisations seeking to proactively identify and prevent dysfunctional VT. They highlight the potential for organisations to vary those non-pecuniary factors which are under their control (e.g., promotion policies, annual leave, flexible work practices, job design etc.,) with a view to improving WWB, thus reducing the risk of QI formation and by extension, actual quits. My results also highlight the potential value-add to organisations of using more nuanced WWB indicators than single-item JS which may afford them a valuable head-start in terms of assessing and reducing quit risk.