Background

Bipolar disorder is one of the leading causes of disability worldwide, which incurs higher costs on the health care system and society than most other illnesses (Murray and Lopez 1997). The bulk of societal costs of bipolar disorders are due to indirect costs such as sick-leave, unemployment, and early retirement (Ekman et al. 2013). This is because as many as 30 to 60% of patients with bipolar disorder do not regain full occupational or social functioning after illness onset (MacQueen et al. 2001). Functional recovery also lags behind recovery from clinical symptoms and may be incomplete despite the absence of mood symptoms (Goldberg et al. 1995).

Employment is an important area of functioning. Factors previously associated with poor occupational or functional outcome in bipolar disorder include the number of affective episodes (MacQueen et al. 2000; Zimmerman et al. 2010), lingering subsyndromal depression (Dickerson et al. 2004; Bonnín et al. 2010; Burdick et al. 2010), history of psychiatric hospitalizations (Dickerson et al. 2004; Burdick et al. 2010; Gutiérrez-Rojas et al. 2011), total symptom severity (Goetza et al. 2007), age (Depp et al. 2012), educational attainment (Reed et al. 2010), age at onset (Perlis et al. 2009), history of psychotic symptoms (Tohen et al. 1990, 2000), and comorbid psychiatric conditions such as panic disorder and substance abuse (Hajek et al. 2005; Zimmerman et al. 2010).

Cognitive performance is another domain that has been linked to poor occupational functioning (Mur et al. 2009; Gilbert and Marwaha 2013; Ryan et al. 2013; Bonnín et al. 2014). Executive functions seem to be particularly important for psychosocial functioning and practical tasks of daily life (Lezak 2012). Bipolar disorder patients feature cognitive impairments (mainly deficits in attention, verbal memory, and executive functioning) on group level that persist during euthymia (Robinson et al. 2006; Mur et al. 2007; Palsson et al. 2013; Sparding et al. 2015a, b; Salarvan et al. 2019).

It is important to establish factors influencing occupational functioning in order to tailor and optimize care management in bipolar disorder. Previous research has linked clinical, demographic, and cognitive factors to occupational functioning, but their relative influence remains unclear. The aim of the current study was to investigate the relative importance of demographic, clinical and neuropsychological factors on occupational dysfunction in bipolar disorder.

Methods

Subjects

The present study is part of the St. Göran Bipolar Project, which is an interdisciplinary, prospective, longitudinal study of bipolar disorder conducted in Sweden. Patients were recruited and examined at the Northern Stockholm Mental Health Service and followed over time. The methods of this study have been outlined in detail previously (Rydén et al. 2009a; b; Ekman et al. 2010). Briefly, patients were examined at a time when the patient was considered stable with respect to mood symptoms by the treating physician. This means that patients were not in an acute depressive or hypomanic episode, but not always completely asymptomatic. Patients were diagnosed using the Affective Disorder Evaluation (ADE) and the Mini International Neuropsychiatric Interview (M.I.N.I.) (Sheehan et al. 1998). The ADE is a semi-structured interview that was employed in the Systematic Treatment Enhancement Program of Bipolar Disorder (STEP-BD) project (Sachs et al. 2003). It includes an adapted version of the affective module of the Structural Clinical Interview for DSM-IV (SCID). The M.I.N.I. was used to screen for other psychiatric disorders. A final best estimate diagnosis was determined by a consensus panel of experienced psychiatrists. Bipolar disorder severity was rated by an experienced clinician using the Clinical Global Impression (CGI) rating scale (Guy 1976). To screen for alcohol and substance abuse, patients completed two self-report questionnaires: the Alcohol Use Disorders Identification Test (AUDIT) (Saunders et al. 1993) and the Drug Use Disorders Identification Test (DUDIT) (Berman et al. 2005). Data on medication were collected from either the baseline diagnostic assessment or the somatic examination, depending on which appointment occurred closest in time to the neuropsychological testing.

The retirement age for the guaranteed pension scheme in Sweden is 65 years. The current study therefore included participants aged 18 to 64 years who met DSM-IV criteria for bipolar I or II disorder (n = 120). Only data for individuals with ≤ 14 on the Montgomery-Åsberg Depression Rating Scale (MADRS) (Asberg and Schalling 1979) and the Young Ziegler Mania Rating Scale (YMRS) (Young et al. 1978) were included in the present study. The reason for choosing 14 as the cut off on MADRS and YMRS was to strike a balance between excluding patients with manifest mood symptoms while keeping patients with subsyndromal lingering symptoms, which is prevalent in bipolar disorder (Judd et al. 2002).

There were no cases of intellectual disability, nor any somatic disorders of clinical significance that might affect cognitive performance (such as stroke or other neurological disorders) in this cohort. Patients with current alcohol or substance dependency were excluded, but patients with a history of alcohol or substance abuse were retained in order to study the natural course of the disorder and to obtain a representative clinical sample.

The project was approved by Stockholm Regional Ethical Review Board, and all participants provided written and oral informed consent.

Neuropsychological assessments

In the overarching St. Göran bipolar study, the participants completed a broad neuropsychological test battery (Sparding et al. 2015a, b) to assess cognitive functioning. For the purpose of this study, we preselected the Wechsler Adult Intelligence Scale III as a measure of general cognitive ability (Wechsler 1997), and the DelisKaplan Executive Function System (Delis et al. 2001) as measures of executive functions. No other cognitive domains were examined.

The Wechsler Adult Intelligence Scale (WAIS-III) is the most widely used intelligence quotient (IQ) test and is composed of 10 verbal and non-verbal performance tests: Information, Similarities, Vocabulary, Arithmetic, Digit Span, Block Design, Matrix Reasoning, Picture Completion, Digit Symbol-Coding, and Symbol Search. The subtests generate a measure of general intelligence (mean value = 100; standard deviation = 15). The Vocabulary subtest is frequently used as proxy for premorbid IQ (Martinez-Aran et al. 2007; Ryan et al. 2013; Bonnín et al. 2014).

The Delis–Kaplan Executive Function System (D-KEFS) is a normed and standardized set of nine subtests used to evaluate key components of executive function, including mental flexibility, concept formation, problem solving, and inhibition. Most subtests include one baseline task (condition 1) followed by more difficult tasks/conditions. The latter tasks are considered to be the primary measures of executive functioning. We used five individual tests from the D-KEFS:

  1. i.

    The Color-Word Interference Test, which corresponds to the Stroop Color Word Test and measures the ability to inhibit automatic verbal responses, in this case by reading names of colors (e.g., red, blue, green) printed in incongruent font colors. The test conditions are: (1): Color Naming, (2): Word Reading, (3): Inhibition, and (4): Inhibition/Switching.

  2. ii.

    The Verbal Fluency Test measures the ability to produce as many words as possible from a given category in 60 s. The categories are either phonemic (i.e., words beginning with a specified letter) or semantic (i.e., objects such as animals or fruit). Test results provide information about language skills and verbal processing ability, as well as problem solving and inhibition. The test conditions are: (1): Letter Fluency, (2): Category Fluency, and (3): Category Switching.

  3. iii.

    The Design Fluency Test evaluates the subjects’ ability to draw as many different designs as possible in 60 s by connecting a set of printed dots. The executive functions required for the test include the capacity of initiating problem-solving behavior, cognitive flexibility, and inhibition of automatic responses. The test conditions are: (1): Filled Dots, (2): Empty Dots Only, (3): Switching.

  4. iv.

    The Trail Making Test (TMT) contains measures of cognitive flexibility, visual attention, and motor speed. The test requires the subject to connect a sequence of 25 consecutive numbers and/or letters as fast as possible. In TMT 2, the subject connects a number sequence only. In TMT 4, which is similar to TMT B, the subject alternates between numbers and letters (e.g., 1-A, 2-B, etc.). The test conditions are: (1): Visual scanning, (2): Number Sequencing, (3): Letter Sequencing, (4): Number-Letter Switching, and (5): Motor Speed.

  5. v.

    The Tower Test assesses planning and spatial problem-solving abilities such as the ability to inhibit perseverative and impulsive responses. Visual attention and visual-spatial ability are fundamental skills required for this task.

Assessment of occupational status

We had access to categorized information on occupational functioning. The alternatives were (i) working or studying more than 50% time in a competitive employment (or educational) setting; (ii) working or studying in a competitive employment (or educational) setting 50% time or less (which in Sweden usually means receiving early retirement benefit, or temporary or permanent sick-leave benefits at ≥ 50%), (iii) unemployed, (iv) supported employment, (v) idle. The sample was divided into two groups according to their current occupational status: active versus inactive. Active patients (n = 86) included the highest functioning group including subjects who held a competitively-obtained occupation, or were university students, at > 50% (i.e., working/studying more than 20 h per week). Inactive patients (n = 34) included all the other options, i.e., subjects who were unemployed, received early retirement benefit, or were on temporary or permanent sick-leave at ≥ 50% (i.e., working/studying 20 h or less per week). This group also included subjects who held supported employments (i.e., outside the open labor market), or were enrolled in vocational rehabilitation programs.

Assessment of psychosocial functioning

We used scores from both the symptom and function domains of the Global Assessment of Functioning (GAF) Scale (Luborsky 1975) to assess overall functioning. The Sheehan Disability Scale (Sheehan 1983) was administered to obtain information about patients’ subjective assessment of psychosocial dysfunction following their illness. This scale is a composite of three self-rated items designed to assess the extent to which three major areas in life are compromised by psychiatric symptomatology. The patient rates the extent to which his or her (1) work or school, (2) social life or leisure activities, and (3) home life or family responsibilities are impaired by his or her symptoms on a 10-point visual analogue scale (0 = not at all, 1–3 = mildly, 4–6 = moderately, 7–9 = markedly, 10 = extremely).

Statistical procedures

Data analyses were carried out with the statistical package IBM SPSS 23.0 for Windows (IBM Corp. Armonk, NY). We first examined which variables were different between the groups of active and inactive patients. Between-group differences in demographic and clinical characteristics were evaluated with univariate analyses of variance (ANOVAs) by examining a single factor of the group (active vs. inactive). Pearson’s Chi-square tests were run for dichotomous and categorical variables.

For cognitive measures, Analyses of Covariance (ANCOVAs) were performed for each test to investigate differences between the active and inactive group. Effect sizes were calculated with the formula for partial eta squared (η2). Current medication with benzodiazepines was used as a covariate because a significantly higher proportion of patients in the inactive group were prescribed such drugs. Scaled scores were used as outcome measures because they account for group differences in age. Two-tailed tests were performed for all analyses, and statistical significance was defined as p < 0.05.

A composite measure of executive function was created using principal component analysis (PCA) using the SIMCA software (SIMCA-P 13.0 software, Umetrics AB, Umeå, Sweden). A PCA reduces data dimensions and summarizes systematic variation in a model by aggregating variables in ‘components’ or ‘latent variables’ that describe the correlations structure (Eriksson et al. 2013). By creating a PCA-model, it is possible to (i) examine measures that are highly correlated, (ii) look at all the executive tests together without losing any test specific information, (iii) avoid problems with mass significance. A total of 11 subtest scores were selected from five D-KEFS tests: the Tower Test, the Verbal Fluency Test, the Color Word Test, the Design Fluency Test, and the Trail Making Test. All test conditions were included, except the baseline tasks and non-executive measures (i.e., TMT Motor Speed, which did not differ between the groups [p = 0.112]). 18 subjects (12 active, 6 inactive) were automatically excluded from the analysis as they had completed less than 50% of the tests. Two parameters are important to consider when evaluating a principal component: goodness of fit (R2), which expresses the explained variation, and goodness of prediction (Q2), which expresses the power to anticipate data, either internally or externally (Eriksson et al. 2013).

The first principal component (PC1) was thereafter used as an executive functioning score as a predictor in the binary logistic regression models to predict occupational status. Each test’s loading (P1) reveals how much each test contributes to the latent variable, executive functioning. We conducted three successive logistic regressions to estimate the relative influence of executive functions, IQ, and clinical variables to predict occupational status (active vs. inactive). In the first model, only executive functioning (PC1) was used as predictor variable. In the second model, IQ was added. The active and the inactive group differed in age, but we did not add age and sex into the logistic regression model because the cognitive variables (IQ and the PC of executive functioning) were in scaled scores already adjusted for age and sex. In the third model, we added clinical variables that differed significantly between groups in the univariate analyses: history of psychotic symptoms, history of involuntary psychiatric care (i.e., being sectioned under the Mental Health Act), and current benzodiazepine use. All clinical variables were categorical and coded 1 or 0 (“yes” = 1, “no” = 0). Variance of inflation tests performed to control for multicollinearity did not reveal conflicting relations between the covariates.

Results

The characteristics of the 86 active and 34 inactive patients are displayed in Table 1. Subjects in the inactive group were slightly older than the active group. A significantly larger proportion of the inactive patients had a history of psychotic symptoms, and had at least one involuntary hospitalization (i.e., had been sectioned under the Mental Health Act). The use of benzodiazepines was more prevalent in the inactive group than the active group (44% vs. 20%, p < 0.05). No other group differences were observed with respect to comorbid conditions, bipolar subtype, age at illness onset, subsyndromal symptomatology (as measured with YMRS and MADRS), number of previous depressive or manic episodes, or bipolar disorder severity as rated with CGI. The inactive patients showed worse functioning as measured by the GAF symptom and function scales, as well as all three dimensions of the Sheehan Disability Scale.

Table 1 Demographic, clinical, functional, and pharmacological variables

The ANCOVAs showed differences between the active and inactive group in the majority of cognitive tests, where active patients performed better overall. Active patients also showed a significantly higher IQ than inactive patients (109 vs. 101, p = 0.014).

The PCA model created with 11 D-KEFS executive functioning subtests was significant (R2X = 0.611; Q2X = 0.415). The executive functioning subtests loaded robustly on the first principal component (R2 = 0.464; Q2 = 0.329; Table 2).

Table 2 Neurocognitive assessment for all participants

Three logistic regressions were run to evaluate the relative influence of executive functioning in predicting occupational status (active vs. inactive). In the first regression model (Table 3), we only used our composite measure of executive functioning as predictor. It demonstrated that the odds of a person to be in the inactive group decreased by 36% as the principal component score increased by 1.

Table 3 Logistic regression predicting likelihood of an inactive occupational status (executive functioning as only predictor)

The second logistic regression (Table 4) was used to evaluate the additional impact of IQ. IQ did not make a unique statistically significant contribution to the model (p = 0.183) despite a considerably higher rate of correctly classified inactive patients in this model compared to the previous one (42.3% vs. 25.0%).

Table 4 Logistic regression predicting likelihood of an inactive occupational status (executive functioning and IQ as predictors)

The third logistic regression was used to evaluate the additional impact of both IQ and clinical factors. No clinical variables made a statistically significant contribution to the model (p values > 0.05). Only executive function significantly predicted occupational status (p = 0.001). The odds of a person to be in the inactive group decreased by 45% as each principal component score increased by 1. None of the other variables made a unique, significant contribution to the model (Table 5).

Table 5 Logistic regression predicting likelihood of an inactive occupational status (cognitive and clinical factors as predictors)

Discussion

The high rate of occupational dysfunction is a major issue in bipolar disorder. Here we studied the relative influence of clinical, demographic, and cognitive factors on occupational status by comparing bipolar disorder patients that worked or studied (active group) with those who did not (inactive group). We found that executive functioning was a stronger determinant of occupational functioning than general cognitive functioning (IQ) and other important clinical factors, including illness severity. In fact, executive functioning was the only factor that accounted for a significant amount of the variance in occupational status when clinical factors and IQ was taken into account. Taken together, our results suggest that poor executive functioning is the main factor associated with occupational dysfunction in bipolar disorder patients.

When individual predictors were considered separately, our findings align with previous cross-sectional and longitudinal studies that assessed the relationship with occupational status in bipolar disorder. First, psychotic symptoms during affective episodes and hospitalizations have been linked to a worse occupational and functional outcome in previous studies (Dickerson et al. 2004; Burdick et al. 2010; Gutiérrez-Rojas et al. 2011). We find similar differences regarding both lifetime history of psychotic symptoms and previous hospitalizations between the active and the inactive group. Second, general cognitive ability has been strongly associated with occupational status in both clinical and control samples (Schmidt and Hunter 2004). In line with this, we found in univariate analyses that the active group had significantly higher IQ than inactive patients. The groups did not, however, differ regarding premorbid IQ. Third, previous studies have found an association between executive functioning test scores and occupational status in bipolar disorder (Mur et al. 2008; Baune and Malhi 2015) similar to our findings. Finally, we found a higher prescription of benzodiazepines in the inactive patients.

Importantly, however, we demonstrate that none of the above variables significantly predict occupational status above and beyond the influence of executive functions. In terms of cognitive functioning, it was particularly notable that IQ was not a significant predictor of occupational status when compared to the model with executive function alone, further demonstrating the strong relationship between executive functioning and occupational status in bipolar disorder.

Instead, cognitive flexibility and inhibitory control as measured by condition 3 of the Color Word Test, the Verbal Fluency Test, and the Trail Making Test captured the largest differences between active and inactive patients (η2 = 0.08–0.14). Cognitive flexibility is needed to adjust thinking and behavior in response to changing demands, and inhibitory control function facilitates inhibition of a dominant response in favor of one that is consistent with a long-term goal (Delis et al. 2001). Tests measuring these executive processes also stood out in previous studies predicting the occupational status of bipolar disorder patients (Ryan et al. 2013; Bonnín et al. 2014; O’Donnell et al. 2017).

The construct of executive functioning generally describes the integrated cognitive processes responsible for planning, initiating, sequencing, and monitoring complex goal-directed behavior (Royall et al. 2002). Different measures of executive function generally correlate strongly with each other but less with IQ (Friedman et al. 2006). Executive impairments are considered to be more disabling than other cognitive deficits as they affect many aspects of behavior (Lezak 2012). Despite this, the ecological validity of executive functioning performance tests has been questioned by ADHD researchers, who have found that self-report ratings were better predictors of occupational impairment (Barkley and Murphy 2010). The fact that our study clearly captured executive deficits in the inactive group suggests that executive function test scores do reflect real-life executive impairments and can be useful in clinical assessment.

The finding that occupationally inactive bipolar disorder patients manifest executive cognitive deficits has clinical implications. Early identification these impairments could give valuable information about the individuals’ ability to return to his/her work, and guide which possible adaptions could be made in the workplace. Some cognitive remediation programs have been developed and shown promising results on occupational and social outcomes (Sanchez-Moreno et al. 2017).

Strengths and limitations

Strengths of this study include that we defined occupational functioning by means of actual time working (or studying), which provides a hard outcome of functioning. Our study also involved a meticulous clinical characterization, including well-established neuropsychological tests. The participants were representative of bipolar disorder patients receiving psychiatric care, as nearly all patients with this diagnosis in the catchment area were referred to the Bipolar Affective Disorder unit for treatment by the time of enrollment. Finally, our study complements the literature with data from a Scandinavian cohort in a field where most studies have been conducted by research groups in the US and Spain.

We also acknowledge a number of limitations. First, it should be noted that 72% of the subjects were occupationally active, which is a rather high number compared with previous reports (MacQueen et al. 2001). A possible explanation is that the catchment area of the recruiting clinic includes districts in Stockholm where the unemployment rates are lower than in the rest of the country, and the mean incomes above the Swedish average. Second, our analyses did not investigate the relative prediction of executive functioning compared to other cognitive functions previously found to predict occupational status in bipolar disorder patients, such as verbal memory (Dickerson et al. 2004), visual memory (Mur et al. 2009), and processing speed (Burdick et al. 2010). Future studies should further explore the relative contributions of these factors. Third, a potential source of bias is that 12 active and 6 inactive participants were excluded from the PCA because they had completed less than 50% of the tests. However, the proportion of excluded participants did not differ between the groups (Fisher exact test statistic value = 0.3585, p > 0.05). Fourth, the difference between measures of premorbid IQ and Full Scale IQ may indicate that the inactive group has declined in cognitive functioning. Our study did not investigate whether such decline predicted occupational status. The last and most important limitation is the cross-sectional design. We cannot determine if executive dysfunction causes occupational disability or if unemployment worsens executive functioning.

Conclusions

Bipolar disorder patients with worse executive functioning are more likely to be occupationally inactive. These deficits can be identified with neuropsychological tests and should be targeted in treatment and rehabilitation.