Introduction

Negative symptoms are core features of schizophrenia spectrum disorders and recognized as markers of an unfavorable illness course and outcome [1]. The etiology and pathogenesis of negative symptoms are mostly unknown, and current available treatments are not sufficient [2,3,4]. Negative symptoms are traditionally seen as stable; however, more recent follow-up studies indicate both symptom persistence and significant fluctuations [5,6,7,8]. Some studies indicate that the most noticeable changes occur during the first year of follow-up [9, 10], supporting the notion of a “critical period” of symptom evolvement [11]. However, the current evidence for a critical period for negative symptom evolvement is inconclusive [12].

Recent research indicates that negative symptoms comprise five sub-symptoms, clustering into two domains with different associations to the outcome: The expressive domain (i.e. blunted affect and alogia) and the experiential domain (i.e. anhedonia, avolition-apathy and asociality) [13,14,15]. These domains appear to have a continuous distribution that includes the general population [16]. There is evidence that avolition-apathy (“apathy” for short) is more strongly associated with a poor functional outcome than the other sub-symptoms [1, 17]. Apathy is usually defined as a reduction in goal-directed behavior due to a lack of motivation [18]. The prevalence of apathy in the early stages of a psychotic disorder can exceed 50% [19, 20], and higher levels are associated with male gender, reduced premorbid functioning, a long duration of untreated psychosis (DUP) and a diagnosis of schizophrenia [15, 19, 21, 22].

Despite the high prevalence of apathy in early psychosis, most studies have included participants with chronic illness [23,24,25], applied cross-sectional or short-term follow-up designs [21, 26] and/or used psychometric tools not primarily made to assess apathy [23, 27, 28]. The long-term development of apathy from the first treatment and its predictors thus remain mostly unexplored [29]. The only study so far investigating longer-term apathy development in first-episode psychosis (FEP) is the TIPS study [30]. At 10-year follow-up (10YFU), the study used a specialized psychometric tool to assess apathy, the Apathy Evaluation Scale-self-report version (AES-S) [31] and found that 30% of participants had high apathy levels, as defined by the AES-S. Using items from the Positive and Negative Syndrome Scale (PANSS) [32] as a proxy for AES measures, the trajectories of apathy over the follow-up period were then investigated retrospectively, with findings of a reduction in apathy levels during the first 1-to-2 years of treatment and stable levels from that point onward. No baseline variables predicted apathy levels at 10 years, but the use of different measures at different time-points limits interpretation. The study also lacked a healthy control group to examine the development of apathy over time.

The main aim of the current study was thus to investigate the development of apathy prospectively over 10 years in a FEP sample, using the AES at all time-points and additionally including a healthy control group (HC). Our research questions were:

  1. 1.

    How does apathy develop over 10 years in FEP compared to HC?

  2. 2.

    Do early clinical or demographic characteristics predict the development of apathy in FEP?

  3. 3.

    How prevalent is clinically significant apathy at 10YFU?

  4. 4.

    What are the functional consequences of high apathy levels at 10YFU?

We hypothesized that apathy would be higher in the FEP population than in HC, be predominantly stable over the follow-up period with changes primarily taking place early on. We also hypothesized that premorbid functioning and DUP would predict baseline levels of apathy and that premorbid function, DUP and baseline levels of apathy would predict the development of apathy over time. Finally, we hypothesized that the level of apathy would be a significant contributor to reduced functioning at 10YFU.

Methods

Participants

Two-hundred and fourteen participants with FEP aged 18 to 65 years were consecutively recruited from outpatient or inpatient units of hospitals in the regions of Oslo and Innlandet, as part of the Thematically Organized Psychosis (TOP) study in Norway. Inclusions into the study took place between March 2004 and December 2007 in Oslo, and between December 2007 and October 2009 at Innlandet. Participants were reassessed after 7 years (7YFU) at Innlandet, and after 10 years in Oslo (10YFU). A subset of the Oslo participants also had an intermediate assessment (6 and/or 12 months).

All FEP participants met the diagnostic criteria of a non-affective psychotic disorder, i.e. schizophrenia, schizophreniform disorder, schizoaffective disorder (“Schizophrenia spectrum disorders”) or delusional disorder, brief psychotic disorder or psychosis not otherwise specified. A psychotic episode was defined as having a score of ≥ 4 on items p1 (delusions), p2 (conceptual disorganisation), p3 (hallucinatory behavior), p5 (grandiosity), p6 (suspiciousness/persecution) or g9 (unusual thought content) for ≥ 1 week on the PANSS. Participants were not defined as FEP if they had previously received adequate treatment for psychosis (i.e. hospitalization or antipsychotic medication in adequate dosage for ≥ 12 weeks or until remission). Since some patients were not able to give informed consent during the acute phase, FEP participants were eligible for inclusion within 52 weeks of the start of first adequate treatment.

Exclusion criteria were: Not speaking a Scandinavian language, IQ < 70, current neurological or medical condition which could cause negative symptoms or psychosis, psychosis due to substance use, moderate/severe head injury prior to inclusion or during the follow-up period.

Based on these criteria, 16 participants initially deemed eligible were excluded, leaving 198 for analyses at baseline (BL) (Fig. 1). Of these, 98 (49%) had an intermediate assessment at 6MFU and/or at 1YFU. A total of 77 (41%) completed assessments at the long-term follow-up. One participant was excluded due to a severe head injury between 1 and 10YFU, leaving 76. Of the 121 lost to follow up, nine had died (all from Oslo), nine had moved abroad, and 43 were untraceable despite multiple attempts to contact them, and 60 said no to further participation.

The HC were 18–65 years old and were randomly selected from the national population registry of Norway [33], and invited to participate by letter. All HC were interviewed with the Primary Screening for Mental Disorders [34] at BL and follow-ups to ensure that they, or any first-degree relative, did not have a current or previous severe mental illness. The same exclusion criteria used for participants with FEP were applied, and 199 HC were included. One HC developed a severe mental illness during the follow-up and was excluded from analyses at both BL and follow-ups, leaving 198 HC at BL, 82 (41%) with intermediate measures (1YFU) and 59 (30%) at 10YFU (Fig. 1).

Fig. 1
figure 1

Participation in a 10-year follow-up of people with first-episode psychosis and in healthy controls. aPatients with first-episode psychosis were consecutively referred to the study from their clinical units. Since Norwegian law does not allow researchers to access medical charts of patients before they give an informed consent or to keep data on those who do not consent, we have no report of the number of eligible patients that were not referred or said no to study referral. bNine participants had died (all at the Oslo Site), nine had moved abroad, 43 were untraceable, and 60 refused further participation. cOne participant was excluded due to a newly acquired severe head injury between 1 and 10 years. dAt Innlandet, mean follow-up time was 7.1 years

Clinical assessment

At each follow-up point, participants were interviewed by psychologists or medical doctors, applying a comprehensive clinical assessment protocol. The Structured Clinical Interview for Mental Disorders (SCID-I) was used to diagnose participants, according to the DSM IV [35]. All interviewers completed a SCID-assessment training program based in the University of California, Los Angeles [36]. Diagnostic consensus meetings led by experienced clinical researchers were held regularly, and inter-rater reliability was found satisfactory [37]. Medical charts from in- and outpatient treatments during the follow-up were inspected to supplement information given by the participants.

Premorbid functioning was measured with the Premorbid Adjustment Scale (PAS) [38]. Since the baseline assessments were done in the mid-2000s, the structured interview for PAS published in 2009 was not used [39]. Scores were divided into age intervals (childhood ≤ 11 years, early adolescence 12–15 years, late adolescence 16–18 years, adult > 18 years), and further into social and academic functioning within each interval. To reduce chances of prodromal symptoms influencing adjustment, we only used childhood scores in the analyses, and PAS scores for patients with age at onset less than 12 years of age were treated as missing. The age at onset (AAO) refers to the individual’s age when the first psychotic episode started. Duration of untreated psychosis (DUP) was defined as the time in weeks from the first psychotic episode until first adequate treatment [40].

Psychotic and other symptoms were measured with the Positive and Negative Syndrome Scale (PANSS), divided into five factors (positive, negative, disorganized, depressed and excited) [41]. The Apathy Evaluation Scale self-report version (AES-S) was used to assess apathy [31]. A shortened 12-item version with superior psychometric properties in FEP was applied [42]. The AES-S has shown a high concordance with the clinician-rated version, AES-C, in a partly overlapping sample [43], and reliably distinguishes patients from HC [19, 25]. The AES-S maps one’s interests and engagement during the last month. A higher score indicates higher levels of apathy. Following previous studies [19, 30], we used a sum-score cut-off of ≥ 27 points (two standard deviations above mean for HC) to indicate clinically significant apathy. To better distinguish between apathy as a part of negative symptoms and symptoms of depression [44], depression was measured with the Calgary Depression Scale for Schizophrenia (CDSS) [45]. Higher CDSS scores indicate higher levels of depressive symptoms.

Global functioning was measured with the Global Assessment of Functioning Scale-split version, functioning subscale (GAF-F) [46]. Scores range from 0 (extremely impaired) to 100 (perfect function). Alcohol and drug use the last year were measured with the Alcohol Use Disorder Identification Test (AUDIT) [47] and the Drug Use Disorder Identification Test (DUDIT) [48], respectively.

The current load of antipsychotic medication (AP), were represented by dividing the actual daily dosage of used antipsychotics with its Defined Daily Dosage (DDD) (dosage recommended by the WHO Collaborating Centre for Drug Statistics Method [49]). If a participant used two or three different AP, one ratio was computed for each AP, and the ratios subsequently summarized to ‘Sum AP’.

Statistical analyses

Analyses were carried out in the SPSS version 25. Variables were inspected for outliers, normality, collinearity and heteroscedasticity. Tests were two-tailed, and significance levels pre-set to 0.05.

Site characteristics and follow-up intervals

Mean long-term follow-up time was 7.1 years at Innlandet and 10.8 years in Oslo. We expected higher stability of symptoms and functioning this late in the course of illness and thus assumed that the difference in follow-up time would not have a significant influence on the results of the analyses. The ‘10YFU’ variables thus included measures from both 7 (Innlandet) and 10 years (Oslo). There could, however, be other systematic or random site differences. In our sample, patients from the rural communities at Innlandet had a longer DUP than Oslo (median DUPInnlandet = 104 weeks; median DUPOslo = 52 weeks, t = − 4.4, p < 0.001) and a significantly higher proportion meeting a schizophrenia spectrum diagnosis at baseline (χ2 = 4.0, p = 0.045). ‘Inclusion site’ was thus adjusted for in the multivariate analyses in the case of a significant bivariate association between “Inclusion site” and other covariates and/or outcome variables in initial analyses.

Missing data

We evaluated differences in BL characteristics between those who completed and those who did not complete the long-term follow-up using χ2 test for categorical and t-tests or Mann–Whitney U-tests for continuous data (Table 5). Participants with a lower PANSS general symptoms score, male gender or non-European ethnicity were significantly less likely to complete the long-term follow-up assessments. No other significant differences were found.

The AES-S had no missing data in those who completed reassessment at each follow-up point. The GAF-F score was missing in one participant at 10YFU. For the CDSS, AUDIT and DUDIT, between one and five participants had missing scores for two or fewer items at one or more follow-up points. These missing items were replaced with item scores imputed as the mean value of the non-missing items for the scale in question for that participant at that specific follow-up. If more than two items were missing, which was the case in less than five participants, no imputations were done and the variable was treated as missing. Missing data did not exceed 4% for any BL data, except for the Sum AP, which had 7% missing.

Analyses

FEP and HC samples were analyzed separately for the first research question. We used a scatter-dot with a fitted regression line to explore the longitudinal development of apathy. To account for missing data and dependencies caused by repeated measurements, we then applied linear mixed models analyses [50]. In FEP, AES-S scores at four follow-up points were used as the dependent, continuous apathy variable. Longitudinal apathy development was described by employing a growth model, and maximum likelihood used to select the best-fitted model. Time was first introduced as fixed factor. We then explored whether a curvilinear function (time*time) improved model fit. Subsequently, random intercept and random slope were introduced, and an autoregressive heterogeneous (AR1H) covariance structure between them was inspected. The same procedure was then applied for HC separately, using the available three assessment points for the dependent, continuous apathy variable.

For the second research question, relevant early predictors and covariates of apathy development in FEP were chosen based on previous research and theory. We used Pearson’s bivariate correlation analyses to investigate associations between predictors, covariates and the AES-S scores at BL and 10YFU. Variables with significant (p ≤ 0.1) bivariate associations to apathy development were introduced into the linear mixed models analyses in order of lifetime appearance. Interaction effects with time were explored only for BL predictors with a significant association to apathy development. Such interaction effects describe whether the predictor’s effect on apathy development increases or decreases with time. Predictors and covariates with non-significant estimates (p > 0.05) were removed from the final equation. The following equation describes the basic model:

$$Y_{{{\text{ij}}}} = \, \left( {\beta_{0} + {\text{ b}}_{{0{\text{i}}}} } \right) \, + \, \left( {\beta_{{{\text{1ij}}}} + {\text{ b}}_{{{\text{1ij}}}} } \right)*{\text{time }} + \, \beta_{{{\text{2ij}}}} *{\text{time}}*{\text{time }} + \, \beta_{{{\text{3ij}}}} *{\text{predictor }} + \beta_{{{\text{4ij}}}} *{\text{predictor}}*{\text{time }} + \, \varepsilon_{{{\text{ij}}}}$$

Yij is apathy in an individual i = 1…, 198 at year j = 1…, 10. β0β4ij are the estimates of the population’s means (i.e. fixed effects). The b0i and b1ij represent the specific random variation between individuals in BL apathy levels and in the slope of apathy development, respectively.

For the third research question, we employed Pearson’s bivariate correlation analyses to evaluate the association between GAF-F at 10YFU and concurrent symptoms, diagnosis and demographic variables in FEP. Multiple hierarchical linear regression analyses were used to investigate associations to GAF-F further. Independent variables with significant (p ≤ 0.1) bivariate associations to GAF-F were introduced in a block-wise manner, with the AES-S score in the final block.

Results

Table 1 displays the characteristics of HC and participants with FEP. A total of 198 FEP patients and 198 HC were included at BL. At 1YFU, 89 patients and 82 HC were reassessed, while 76 of the included patients and 52 of the HC were reassessed at 10YFU. At BL, 36% of the FEP patients and 48% of the HC were female. The mean age in FEP and HC was 27 years and 33 years, respectively. Among patients, 67% had schizophrenia, schizophreniform or schizoaffective disorder diagnosis (i.e. a schizophrenia spectrum disorder).

Table 1 Characteristics of first-episode psychosis participants and healthy controls during follow-up

Development of apathy in HC

Development of apathy in HC is presented in Fig. 2. Mean apathy levels appeared stable over the follow-up period, as indicated by a non-significant fixed effect of time in the apathy growth model (p = 0.215). However, apathy levels varied significantly between individuals at BL and between individuals over time, as shown by a significant effect of a random intercept (p < 0.001) and random slope (p = 0.019), respectively. The individual level of apathy at BL was not associated with the individual development of time, as indicated by a non-significant covariance between the random intercept and slope (p = 0.106). Gender and age did not contribute significantly to the model.

Fig. 2
figure 2

Development of apathy (AES-S scores) in first-episode psychosis (FEP) patients and in healthy controls during the 10-year follow-up

Development of apathy in participants with FEP

Apathy development in FEP participants is displayed in Fig. 2. The scatter-dot regression line indicated that apathy levels declined during the first year, levelling off thereafter. In the growth model, apathy levels decreased over the long-term follow-up, i.e. there was a significant, fixed effect of time (− 2 log likelihood = 2836.8; BIC = 2854.9, p = 0.002). When quadratic time (time*time) was added to the equation, the model fit was improved (− 2 log likelihood = 2818.8, BIC = 2842.9). The linear effect of time was negative, while the quadratic effect was positive (both: p < 0.001). Apathy levels significantly varied between individuals at BL, as indicated by a significant random intercept (p < 0.001). The random slope and the covariance between the random intercept and slope did not significantly improve model fit, which suggested that the development of apathy did not significantly differ between individuals over time, with an enduring effect of baseline apathy levels.

Early clinical and demographic predictors of apathy development in FEP

Bivariate correlations are presented in Table 2, followed by the linear mixed models analyses in Table 3. The AES-S level at BL was significantly associated with the PAS social and academic scores, DUP, concurrent CDSS, and the AES-S and CDSS at 10YFU. The AES-S level at 10YFU was significantly associated with gender, DUP, concurrent CDSS and PANSS disorganized symptoms.

Table 2 Pearson’s bivariate correlation analyses between patient characteristics at baseline and 10 years, AES-S at baseline and 10 years and GAF-F at 10 years
Table 3 Linear mixed model analysis. Early predictors of apathy (AES-S) development in first-episode psychosis during 10-year follow-up

In the linear mixed models analysis, DUP had a significant, positive association with the development of apathy. There was an enduring effect of DUP, as shown by a non-significant interaction effect of DUP*time. Baseline CDSS levels showed a significant, positive association with the development of apathy. The interaction term CDSS*time was negative and statistically significant, indicating that the effect of BL depression decreased with time. Gender, AAO, PAS, BL disorganized symptoms, AUDIT, DUDIT, Sum AP, or having a schizophrenia spectrum diagnosis did not contribute significantly to the model. The inclusion site was, however, significantly associated with apathy development, with higher apathy scores at the Innlandet site, also after correcting for other statistically significant variables in the equation.

Prevalence of clinically significant apathy at 10 years and the associations between apathy and global functioning

The prevalence of clinically significant apathy at 10YFU was 5% in HC and 37% in FEP participants (Table 1). Results from the multiple hierarchical linear regression analysis at 10YFU are shown in Table 4. Concurrent positive and disorganized symptoms, and having a schizophrenia spectrum diagnosis, had statistically significant, negative associations with GAF-F. After adjusting for these variables and concurrent depression, apathy added 5% to the explained variance in GAF-F. Age, gender, AUDIT, DUDIT and Sum AP did not contribute significantly to the model.

Table 4 Multiple hierarchical regression analyses at 10-year follow-up in first-episode psychosis, GAF-Fa is the dependent variable

Discussion

Main findings

We found a significant decrease in mean apathy scores during the first year of treatment in FEP, followed by long-term stability over the next 6 to 9 years. A high BL apathy score increased the likelihood of apathy scores above the group mean throughout the follow-up. Also, a long DUP and high BL depression score predicted higher apathy scores over the follow-up period. However, while the effect of BL depression levels decreased over time, the effect of DUP persisted. The mean apathy scores in the HC group were lower and stable over time, but with inter-individual variation both in BL levels and in later trajectories. Accordingly, the BL apathy score was not equally predictive of the later development of apathy in HC.

In FEP, a schizophrenia spectrum diagnosis together with concurrent positive- and disorganized symptoms together were significantly associated with poorer global functioning at the long-term follow-up. The level of apathy had an independent and statistically significant influence on global functioning also after adjusting for other clinical characteristics in the multivariate analyses.

Development of apathy in participants with FEP

The finding of an overall decrease in apathy levels in the long-term is in line with results from two previous follow-up studies from FEP [30] and first-admission schizophrenia participants [51]. In the FEP TIPS study, a group characterized by enduring high apathy levels was discernible in the second year of treatment. Another group with lower and decreasing apathy levels over time explained most of the overall reduction in apathy levels in the total sample [30]. The primary reduction in apathy levels both in the TIPS study and the current study took place within the first years of treatment. This finding supports that the notion of a critical period for symptom development in FEP, i.e. a time interval where symptoms may be more amenable to interventions, also comprises the development of apathy [11].

We also found that the individual variations in apathy levels already at BL were carried forward through the follow-up period, corresponding to the “persistently high apathy” group in the TIPS study [30]. Since the TIPS was an early intervention study, it recruited FEP with a short DUP during their first week of treatment [52], which may explain why symptom trajectories were less stable over the first years of treatment. Taken together this indicates that factors influencing apathy trajectories are in place well before the first adequate treatment of the psychotic illness. This notion is supported by findings of stable negative symptoms in ultra-high-risk populations [53].

Early clinical or demographic predictors of apathy development in FEP

In line with our hypothesis and evidence from more broadly defined negative symptoms [54], we found that a long DUP in FEP predicted higher levels of apathy throughout the follow-up period. This finding expand on previous research from our group that identified statistically significant associations between a long DUP and high apathy scores at 1-year follow-up in a sample partly overlapping with the current [21]. The TIPS study found statistically significant associations between a long DUP and high negative symptoms in the short term. DUP did, however, not predict the level of apathy at 10 years [30], possibly because the short median DUP in the TIPS study reduced statistical power. We do not know by which mechanisms, DUP contributes to a poor outcome [55,56,57]. However, findings from the TIPS study indicate that shortening DUP will lead to lower levels of negative symptoms and improved functioning from treatment start through long-term follow-ups [58,59,60,61]. In our sample, patients from Innlandet had a longer DUP than Oslo. This may partly explain why Innlandet also had higher apathy scores at BL (mean AES-S Innlandet = 31.6 (7.5), mean AES-S Oslo = 27.7 (7.5), t = − 3.2, p = 0.002) and at 10YFU (mean AES-S Innlandet = 27.5 (7.2); mean AES-S Oslo = 22.9 (6.4), t = − 3.0, p = 0.004) [54].

The associations between BL depression and the development of apathy is intriguing. Depression is common also in non-affective psychotic disorders, especially in FEP [62, 63]. Although the phenomenology of depressive symptoms resembles those of negative symptoms [64], the different symptoms do not cluster together in factor analyses and show modest or inconsistent overlap in both cross-sectional- and longitudinal studies [65]. Research suggests that low mood and suicidal ideation are more linked to depressive symptoms and alogia/blunted affect more linked to negative symptoms, while reduced motivation (i.e. apathy) and anhedonia are common to both [66]. The association of apathy-anhedonia to both depression and negative symptoms indicates similarities in underlying CNS functions [67].

We found that the effect of BL depression on apathy trajectories decreased over time, while the cross-sectional association between concurrent depressive symptoms and apathy was stable. The results are in line with findings from a 13-year follow-up study of early psychosis, describing three trajectories for negative symptoms, where the high-and-increasing trajectory was predicted by BL depression, cognitive dysfunction and reduced premorbid functioning [68]. Another study of the longitudinal development of anhedonia/apathy and depressive symptoms in FEP found that the symptom domains levelled off after 2-to-5 years, while the associations between concurrent levels of apathy and depression increased in strength in the female participants over time [51]. Due to sample size and participant attrition, our findings should be interpreted with caution. They nevertheless serve as an argument for careful assessment- and active treatment of depression in FEP [69,70,71].

Finally, we expected that participants with poor premorbid adjustment as measured by the PAS and/or an earlier AAO had higher levels of apathy as a correlate of a more severe, neurodevelopmentally based illness. We did, however, not find any significant associations between PAS, AAO and apathy development in the multivariate analyses. We also hypothesized that a high BL Sum AP was associated with higher levels of apathy, since AP side effects may mimic negative symptoms [67]. Again, there were no significant associations between BL Sum AP and levels of apathy.

Prevalence of clinically significant apathy at 10 years and associations to global functioning

In line with previous long-term studies in FEP [30], the prevalence of clinically significant apathy was substantial. Apathy also had an independent negative association with global functioning at 10 years. While the cross-sectional design for this particular research question precludes causal inference, our findings corroborate previous cross-sectional findings at BL and 1YFU in overlapping samples to the current sample [19, 21] and the TIPS study [30], and thus add to the suggested burden of apathy in psychosis [17].

Strengths and limitations

The main strengths of this study include a richly phenotyped FEP sample and a prospective study design with a long follow-up period and a healthy control group. We also used validated psychometric tools, including a specialized tool for the assessment of apathy that was applied at BL and all follow-up assessments. Study participants were recruited through the Norwegian mental health care system, which is available to all citizens independent of socioeconomic status and thus increases the representativity of the study sample. Finally, we used robust statistical methods to handle dependencies in the data set.

There are also some limitations: First, we were not able to fully match FEP-participants with HC due to sample size. Second, our study may be subject to bias if reduced insight into illness impairs the ability to self-report apathy or the CDSS and AES-S do not adequately distinguish depression from apathy. However, recent research suggests that people with schizophrenia are aware of and report negative symptoms in a similar manner to external observers [22, 72], in contrast to findings from older studies [73, 74]. The AES-S shows a high concordance with the clinician-rated AES-C in FEP [43], and the PANSS insight item was not significantly associated with AES-S at BL or 10YFU. Additionally, the CDSS was designed to reduce confounding from negative symptoms [45].

Third, due to state-effects, depressed participants may evaluate themselves as more apathetic than others perceive them. Fourth, apathy was not assessed between years one and ten, and further variability in the trajectory may thus go unobserved.

Fifth, our sample size was limited, with subsequent attrition of participants. Our long-term attrition rate (59%) is at the same level as naturalistic FEP studies [75,76,77] but higher than in the TIPS (38%) [30] and OPUS cohorts (39%) [6], where retention can be boosted by the intervention designs or more frequent follow-ups. Attrition analyses revealed that being male, having non-European ethnicity or a lower BL PANSS general symptom score was associated with an increased likelihood of study drop-out at 10YFU. We did, however, not find any differences in other variables of interest, including DUP, BL AES-S, BL CDSS or BL GAF-F scores in follow-up analyses (Table 5). Follow-up analyses of BL symptoms, demographics and BL functioning across genders and ethnicity (data not shown), found no statistically significant differences in most variables of interest, including DUP, AES-S, the five PANSS factors and the GAF-F. Men were more likely to be single, have lower premorbid academic functioning, lower BL levels of depression and use more drugs. Europeans were more likely to use alcohol and have poorer premorbid social functioning, but higher IQ scores. We were not able to find a systematic trend of attrition that could affect our results and linear mixed models analysis is a recommended and robust statistical method when data are missing. Baseline predictors of attrition are regularly used to evaluate the likelihood of selection bias in longitudinal studies. The estimates of associations between variables are, however, not necessarily affected by attrition in long-term longitudinal studies, even in the presence of differences in the mean scores of BL variables between completers and non-completers [78]. We thus assume that the follow-up sample at 10YFU was likely to be representative for the general distribution of symptoms and functioning in our full FEP sample.

Table 5 Comparisons of baseline characteristics between completers and non-completers at 10-year follow-up

Sixth, to ensure that also initially acutely psychotic participants were able to give informed, written consent, FEP patients were eligible to enter the study up to 52 weeks after the start of the first adequate treatment, which could introduce more heterogeneity in BL symptom scores. Both positive- and depressive symptoms are causes of secondary negative symptoms and are higher at the start of the first adequate treatment. The observed decline from BL to 1YFU in levels of apathy may thus have been higher if the whole sample had entered the study at the start of the first adequate treatment.

Conclusion and clinical implications

The current study supports the notion that the early treated- and untreated phases of the first psychotic episode is a critical period for the development of apathy. Based on the long-term effects of DUP, we can hypothesize that detecting and treating psychosis adequately at an early stage could reduce long-term apathy levels. The effect of BL depression on early apathy levels supports the idea of more active treatment of depression in FEP [69,70,71]. Considering the lack of evidence-based treatments for negative symptoms, efforts to reduce DUP and to treat co-occurring depressive symptoms could help to prevent high levels of apathy in the long term and thus improve functional outcome.