Introduction

Meta-analyses have consistently identified an association between longer duration of untreated psychosis (DUP), the duration between the onset of positive symptoms and treatment, and worse clinical outcome1. Longer DUP is associated with worse negative symptoms at first treatment contact, and with poorer symptomatic and functional recovery from the first psychotic episode1,2,3,4. The relationship between DUP and outcome is found across various lengths of follow-up periods, suggesting DUP influences the long-term course of the illness4. In addition, there is evidence that the association of DUP with outcome is more pronounced for longer DUP (>12 months) compared to shorter DUP (ref. 4). A deleterious effect of psychosis has been hypothesized; later it was proposed that hypofunction of N-methyl-D-aspartate receptors on gamma-aminobutyric acid (GABA) inhibitory neurons results in a hyperactivation of glutamate neurons, leading to excess glutamate release, and, if the disruption is sustained, neuronal insult5,6.

The hippocampus plays a key role in memory processes, especially in long-term episodic memory7, which has been shown to be impaired in schizophrenia (SZ)8. At the cytoarchitectonical level, hippocampal subdivisions include the subiculum complex, cornu ammonis (CA) 1–4, and the dentate gyrus (DG)9. At the neuronal level, the pyramidal layers of the hippocampus are tightly packed with glutamatergic neurons that have a low firing threshold, ensuring a high level of neuroplasticity in the region10. The glutamatergic pyramidal neurons account for ~90% of hippocampal neurons, a much higher percentage than in other parts of the cortex11. The remaining 10% of hippocampal neurons consist of inhibitory GABAergic interneurons tasked with regulation of the easily excited glutamatergic neurons10,12. Postmortem studies in SZ have indicated normal hippocampal pyramidal neuronal density and number13,14, but reduced number of GABAergic interneurons15. In addition, the GABA-synthesizing enzymes glutamic acid decarboxylase (GAD)65 and GAD67 are decreased in the hippocampus in SZ (ref. 16).

Converging lines of evidence from memory assessments8, molecular studies17,18 to multimodal brain imaging19,20,21 points to hippocampal alterations in SZ. In addition, hippocampal dysfunction is at the root of some prominent neurobiological models of SZ (refs. 10,22,23). Hippocampal alterations have been identified in family members24, and those at risk to develop psychosis25. More generally, hippocampal dysfunction has also been described in psychotic disorders2,21,25,26,27,28,29,30, suggesting this dysfunction plays a key role in etiology of the psychotic disorders.

Consistent with a deleterious effect of psychosis on brain structure, a recent study reported that, in FEP, longer DUP was associated with accelerated hippocampal atrophy over the initial 8 weeks of antipsychotic treatment, suggesting a persistent effect of DUP on brain structure31. In addition, we previously reported elevated hippocampal glutamate + glutamine (Glx) levels in a group of unmedicated patients with SZ (refs. 32,33,34), and an association between decreased hippocampal volume and increased Glx (ref. 32). Furthermore, clinical outcomes in individuals at clinical high risk for psychosis may be associated with an increase in baseline hippocampus Glx levels35. Taken together, these studies suggest that hippocampal dysfunction plays a critical role in the onset of psychosis; however, the biologic basis for this dysfunction and its relationship to the DUP have not been identified36.

In this study, we obtained measurements of hippocampal Glx and hippocampal volume segmentation in a group of antipsychotic-naive FEP and matched healthy controls (HC). We systematically assessed the DUP based on information provided by the patient and their caregivers during screening and, at any time during the 32-week follow-up. Based on the existing literature, we hypothesized that we would observe higher Glx levels in FEP compared to controls. We further hypothesized that, in FEP, longer DUP would be associated with elevated Glx levels, and with smaller whole hippocampal and subfield volumes. Finally, we hypothesized that in FEP higher Glx levels would be associated with smaller hippocampal volumes, and mediate the association between DUP and smaller whole hippocampus and subfield volumes. Finally, we also conducted exploratory analyses to investigate associations with clinical and behavioral memory measures.

Material and methods

Participants

Sixty-six antipsychotic-naive FEP subjects were recruited from the emergency department, inpatient units, and outpatient psychiatry clinics at the University of Alabama at Birmingham (UAB). Seven patients dropped out for the following reasons: they withdrew consent before scan (n = 2), they did not tolerate the scan environment (n = 3) or for unknown reason (n = 2).

Diagnoses were established according to DSM-V criteria by review of medical records and consensus of two board-certified psychiatrists (A.C.L. and N.V.K.). Forty-one HC, matched on age, gender, and parental socioeconomic status were recruited by advertisements. Exclusion criteria were major neurological or medical conditions, history of significant head trauma, substance use disorders (excluding nicotine and cannabis) within 1 month of imaging, >5 days of lifetime antipsychotic exposure, pregnancy or breastfeeding, and MRI contraindications. HC with a personal or family history in a first-degree relative of a psychiatric disorder were excluded. The UAB Institutional Review Board gave approval for this study, and written informed consent was obtained prior to enrollment and after subjects were deemed to have capacity to provide consent37.

Duration of untreated psychosis

DUP was defined as the duration between the first onset of discernable positive symptoms to the time of initial treatment contact (the time at which the first antipsychotic prescription was written, which also coincided with enrollment in the study) and is reported in months. Two experienced psychiatrists systematically assessed the DUP based on information provided by the patient and their caregivers during screening, and at any time during the follow-up period. Moreover, we dichotomized DUP into short (<12 months) and long (>12 months) based on a number of studies that used this cutoff to define long and short DUP (refs. 4,38,39,40,41), including those predicting treatment outcomes in early psychosis42 and poorer outcomes after the first hospitalization43.

Clinical and behavioral scales

The brief psychiatric rating scale (BPRS) was used for assessments of symptom severity44. The repeatable battery for the assessment of neuropsychological status (RBANS) was used to characterize memory cognitive function of our subject, via their total, immediate, and delayed memory subscales45.

Image acquisition of the proton magnetic resonance spectroscopy

Data were collected from a voxel in the left hippocampus, such that the amount of gray matter was maximized while avoiding major vessels (27 × 15 × 10 mm3; Fig. 1). Following automatic and manual shimming to optimize field homogeneity across the voxel, chemical shift selective pulses were used to suppress the water signal. Then, spectra were obtained using a point-resolved spectroscopy sequence (TR/TE = 2000/80 ms, flip angle = 90°, vector size 1024, 192 averages; the echo time was chosen in order to resolve and separate the C4 resonance of Glx from other j-coupling metabolites46,47). Moreover, eight averages of unsuppressed water scans with the same acquisition parameters were acquired for quantify metabolite according to the water peak.

Fig. 1: Proton magnetic resonance spectroscopy of the left hippocampus.
figure 1

a Boxplot of glutamate level in antipsychotic-naive first episode psychosis patients (FEP) compared with healthy controls (HC; P = 0.39). Each point corresponds to one participant (mean ± standard deviation). HC: n = 41, FEP: n = 54. b Single voxel location in the left hippocampus. c Spectrum from left hippocampus volume of interest; the black line is a collected spectrum and the green line is a model fit obtained by the AMARES algorithm in jMRUI. Cr indicates creatine, Cho choline, Glx glutamate and glutamine, and NAA N-acetyl aspartate.

Proton magnetic resonance spectroscopy data processing

All spectra were analyzed in jMRUI version 6.0 using the AMARES algorithm48. Prior knowledge derived from in vitro and in vivo spectra was included in the model. A phantom solution of 20 mM glutamate in buffer (30 mM sodium hydrogen carbonate and 30 mM sodium carbonate; pH, 7.1) was imaged using the same acquisition parameters from the in vivo study. The model consisted of peaks for N-acetyl aspartate, choline, creatine (Cr + Cr2), and Glx was modeled as a triplet (large peak with two small outer wings), as previously described32. After removing the residual water peak using the Hankel-Lanczos singular values decomposition filter, the amplitude of the center Glx peak was estimated, and Glx levels were calculated relative to the unsuppressed voxel water and expressed in institutional units49. Metabolite levels were corrected for partial volume effects according to Gasparovic and colleagues50,51; the fraction of cerebrospinal fluid, gray, and white matter were calculated by segmentation of the T1-weighted images in SPM 8 (see in Supplementary Data).

Exclusion criteria for Glx were failure of the fitting algorithm, signal to noise ratio < 3, full-width at half-maximum > 0.1 ppm (ref. 52), and Cramer–Rao lower bounds > 20%. Five FEP subjects were excluded from the analyses based on those criteria.

Hippocampus subregions segmentation

Each participant’s high-resolution T1- and T2-weighted images were preprocessed using the FreeSurfer 6.0 (ref. 53). All structural images underwent skull stripping via FSL (ref. 54) before submission to the recon-all preprocessing pipeline53. Following this step, we used Freesurfers’ hippocampus subfield segmentation module to calculate each participants left and right subregions volumes55. Freesurfer 6.0 has a significantly improved segmentation algorithm that uses Bayesian inference combined with a manually delineated hippocampal atlas55. The estimated total intracranial volume (ICV) was also obtained via FreeSurfer 6.0 (ref. 56; Table 1).

Table 1 Demographic and clinical information of the sample.

We assessed the overall hippocampal volume, and considered eight subregions across allocortical regions of the hippocampal formation29: the hippocampal tail (comprised of portions of CA and DG), CA1, CA3 (CA2 and CA3 are combined in the Freesurfer atlas), CA4, molecular layer (ML) of the subiculum sub volumes and the DG, the granule cell layer of DG (GC-ML-DG: a combination of the granular cell layer, the ML, and the DG), and finally subiculum subfields (subiculum and presubiculum). These regions have been found to be altered in SZ (refs. 21,27,30). Smaller area, as the fimbria, the hippocampal fissure and the parasubiculum were not included in this study due to reliability concerns in their segmentations21,57.

Data quality was assessed using the Qoala-T-tool58 that uses a supervised-learning model to assess accuracy of manual quality control of automated segmentation. Each participant’s subfield segmentation was assessed by visual inspection (E.N. and O.M.), no hippocampal segmentation failures were reported.

Statistical analysis

Statistical analyses were performed in R (http://cran.r-project.org).

To compare Glx and hippocampus volumes between groups, we used a two-sample t-test (FEP vs HC) and a one-way ANCOVA to compared HC, long and short DUP, controlling for age, sex, and smoking status (pack-day). To assess the relationship between Glx, hippocampus volume measures, and clinical variables, we used partial correlations with age, sex, and smoking status included as covariates. For any analyses, including hippocampus volume, we included ICV as an additional covariate.

To complement analyses, we also calculated Cohen’s d effect sizes. All analyses were controlled for multiple comparisons using the false detection rate (FDR) correction method by Benjamini and Hochberg59.

Results

FEP and HC were well matched in terms of age, gender, and socioeconomic status. However, they differ in smoking status (Table 1, P < 0.01).

Glx levels in HC, FEP, and relationship with DUP

There was no significant group difference in Glx levels in the left hippocampus (Fig. 1c, Cohen’s d = −0.17; t(94) = −0.86; P = 0.39).

Although Glx levels were not significantly correlated with DUP in FEP (r = 0.22, P = 0.11), when patients were dichotomized into those with long and short DUP based on a cutoff point of 12 months, the ANOVA revealed a significant group effect (F(2,89) = 4.92; P < 0.01). Those with long DUP (n = 17) had significantly higher Glx levels than those with shorter DUP (n = 37; Fig. 2, Cohen’s d = −0.82; t(89) = 3.14; P < 0.01) and than HC (Cohen’s d = −0.74; t(89) = 2.67; P = 0.03).

Fig. 2: Left hippocampus glutamate (Glx) level in the divided group of antipsychotic-naive FEP patient compared to HC.
figure 2

Those with a longer duration of untreated psychosis (DUP; >12 months) showed an increase of Glx concentration compared to those with a short DUP (P < 0.02) and to HC (P = 0.03). Each point corresponds to one participant (mean ± standard deviation). Long DUP: n = 17, short DUP: n = 37, and HC: n = 41.

Hippocampus volume in HC, FEP, and relationship with DUP

Left whole hippocampus volume was significantly reduced in FEP compared to HC (Cohen’s d = 0.59; t(85.7) = 2.87; P < 0.01). We also found lower volumes in the following hippocampus subfields: subiculum (Cohen’s d = 0.53; t(85.7) = 2.53; P = 0.03), presubiculum (Cohen’s d = 0.71; t(85.7) = 3.41; P < 0.01), CA1 (Cohen’s d = 0.62; t(85.7) = 3.04; P = 0.01), GC-ML-DG (Cohen’s d = 0.48; t(85.7) = 2.35; P = 0.04), and CA4 (Cohen’s d = 0.45; t(85.7) = 2.21; P = 0.04; P < 0.05 FDR corrected; Table 2). Similar group differences were obtained for the right hippocampus (Supplementary Table 1). Although left and right whole hippocampus volumes, and all the subregions are reduced in long DUP compared to short DUP, these did not survive multiple comparisons (Table 2, Supplementary Table 1).

Table 2 Statistical comparisons of left hippocampal subfield volumes (mm3) in FEP and HC.

In FEP subjects, longer DUP was significantly associated with smaller whole hippocampus volume (Fig. 3, Table 3, r = −0.37, P < 0.01). This relationship was also seen in the following subfields: hippocampus tail (r = −0.32, P = 0.04), CA1 (r = −0.31, P = 0.04), ML (r = −0.30, P = 0.04), subiculum (r = −0.41, P = 0.02), and presubiculum (r = −0.33, P = 0.04) volumes.

Fig. 3: Association of left hippocampus volume and DUP in antipsychotic-naive FEP patients.
figure 3

Left: relationship between the whole left hippocampus volume and the DUP, controlling for age, gender, and total ICV (r = −0.37, P < 0.01). Right: association between the left hippocampus subregions volumes and DUP, controlling for age, gender, and total ICV. Medial view of the left hippocampus volumes (left: anterior to posterior, right: posterior to anterior). *P < 0.05. Each point corresponds to one participant. HIP hippocampus, CA cornu ammonis, GC-ML-DG granular cell of the dentate gyrus, ML molecular layer. FEP: n = 54.

Table 3 Correlations between left hippocampus subfield volumes and DUP, Glx level.

Relationship between Glx and hippocampal volumes

In both FEP and HC, there were no significant relationship between Glx levels and hippocampal volumes (Table 3).

Clinical and behavioral relationships

Memory function: relationship with DUP, volume, and Glx

There were no significant relationship between the memory subscales of the RBANS and DUP, hippocampal volumes or Glx levels for the HC, the FEP, or the short and long DUP subgroups (Supplementary Tables 2 and 3).

Symptoms: relationship with DUP, volume, and Glx

Across the entire FEP sample, after multiple comparison correction, lower severity of positive symptoms were associated with longer DUP (r = −0.34, P = 0.03), but not severity of negative (r = −0.09, P = 0.52) or general symptoms (r = −0.22, P = 0.16). Across the DUP subgroups, no significant correlations were found (Supplementary Tables 3 and 4).

There were no significant associations between positive or negative symptoms, and volume (Supplementary Table 4) or Glx levels for the entire FEP sample or for DUP groups (Supplementary Table 3).

Discussion

To our knowledge, were are the first to simultaneously evaluate hippocampal glutamate levels and volumes, including subfields, in antipsychotic-naive FEP in order to shed light into the neurobiological underpinnings of the relationship between longer DUP and poorer clinical outcomes. In medication-naive FEP, we found that hippocampal Glx levels were significantly higher in those who underwent a long DUP (>12 months) compared to those with a short DUP, and compared to HC. In addition, we found that smaller hippocampal volumes, including CA1, ML, tail, subiculum, and presubiculum, were significantly associated with longer DUP. However, we found no significant association between hippocampal Glx levels, and hippocampal volume or subfields. Our results strongly suggest that one or several pathophysiological processes underlie the DUP.

Glutamate levels in left hippocampus

Contrary to our hypothesis, we found no significant differences in levels of Glx between medication-naive FEP and HC in left hippocampus. This is in agreement with another study in a small sample of medication-naive FEP (n = 15) where no group difference in Glx levels was identified in the medical temporal cortex60. This is in contrast to our prior study, where elevated hippocampus Glx levels was found in a group of unmedicated patients with SZ (refs. 32,33,34). Illness chronicity or prior exposure antipsychotic medication may affect hippocampus Glx. In FEP, Glx levels showed greater variance compared to controls, suggesting heterogeneity in glutamatergic metabolism in FEP. It is possible that only a subset of patients have developed a hyperglutamatergic state61,62. Our findings underscore the importance of considering heterogeneity when characterizing psychosis spectrum patients in the early illness stages.

Association between left hippocampus glutamate level and DUP

Because there is evidence that the association of DUP with outcome is more pronounced for longer DUP (>12 months) compared to shorter DUP (ref. 4), we dichotomized our FEP patients based on that cutoff value. We found increased hippocampus Glx levels in those with a DUP > 12 months, compared to those with a DUP < 12 months, and compared to HC. The two DUP subgroups were not significantly different based on demographic characteristics or symptom burden, indicating difference in Glx levels was not related to these variables (Table 1). Our results are in line with the report that higher hippocampus glutamate levels in clinical high-risk individuals for psychosis were associated with a poor functional outcome35.

Left hippocampus volume and subregions in FEP compared to HC

Here, we report significantly lower whole left hippocampus volumes, as well as CA1, CA4, GC-ML-DG, subiculum, and presubiculum in antipsychotic-naive FEP patients compared to HC. While volume deficits of the hippocampus in SZ have been consistently reported [21, 26, 28, 29, 56, 57], only two prior studies measured hippocampal volumes in antipsychotic-naive FEP (refs. 31,63). Consistent with our findings, one of them31 observed decreased hippocampal volumetric integrity in a cohort of 71 FEP, but they did not measure subfield volumes. The other63, reported no significant change in total hippocampal volumes and significantly greater subfield volumes in the ML, granular layer, and CA4 in a cohort of 41 FEP. Interestingly, Ho and colleagues30 measured hippocampal volumes and subfields in a large group of patients with SZ in the early stages of illness (mean duration of illness of 7 years), and found deficits in hippocampal whole volume and CA1, but not in other subfields. In addition, they measured hippocampal subfields in an older cohort (mean duration of illness of 18 years) of patients and report deficits in all subfields. It is not clear why this study detected deficits limited to the CA1 region in the early stage psychosis cohort, while we observed more extensive deficits in medication-naive FEP. This cohort was collected in Singapore, while ours was collected from the southeast of the US. It is possible that differences in DUP, stress, or other environmental factors could explain those differences. Another possibility is their hippocampal segmentation was obtained using T1-weighted image, and not T1- and T2-weighted images like we did, which has been shown to improve segmentation55. More consistent with our results are the report of deficits in CA1, CA2/3, and CA4/DG in ultra-high-risk subjects compared to HC, deficits that were not as extensive as in a group of SZ patients21. However, because ~2/3 of the high-risk population will not convert to psychosis, the interpretation of high-risk studies is complicated. In high-risk individuals who were followed until conversion to psychosis, elevated cerebral blood volume in CA1 region, thought to represent increased metabolism secondary to elevated neuronal activity, predicted subsequent conversion64. The progression of atrophy from CA1 to other hippocampal subfields could originate from the particularly vulnerability of CA1 to dysregulation of glutamatergic neurotransmission and excitotoxic injury30,65,66.

In summary, our results support that hippocampal abnormalities are already present at psychosis onset but indicate a more extensive reduction in hippocampal subfield volumes in early psychosis than previously thought.

Relationship between left hippocampus subregions and DUP

We found significant associations between longer DUP and smaller whole hippocampus volumes, as well as CA1, ML, subiculum, presubiculum, and tail. Our findings are consistent with those of Goff et al.31 who reported that, in FEP, longer DUP was associated with accelerated hippocampal atrophy over just 8 weeks of antipsychotic treatment, suggesting a persistent effect of DUP on brain structure. These results are consistent with evidence of a damaging effect of DUP on other brain structures31,67. Although we did not find an association between hippocampal volumes and memory performances, another study did find associations between hippocampus subregions and cognitive function in high risk and SZ (ref. 21). Taken together these data could potentially explain how longer DUP acting through deficits in hippocampus subfields is linked to worse outcome in FEP patients1,4.

Glutamate as a candidate to explain the hippocampus atrophy

We did not report association between whole hippocampus or subfield volumes and Glx levels. Our prior study in unmedicated SZ suggested that alterations in hippocampus glutamatergic neurotransmission may play a role in hippocampus deficits32. The lack of association may be the consequence of a nonoptimal overlap between the MRS voxel and the hippocampus volumes. It is also possible that there is a threshold value for a toxic effect of psychosis, rather than a linear relationship between DUP and a neurotoxic effect36. Other mechanism, such as increased dopaminergic activity68, or persistent catecholaminergic and hypothalamic pituitary adrenal axis activity69 have been proposed as causal factors for the association of DUP with poor outcome.

Limitations

Our findings need to be placed in the context of several strengths and limitations. One of the major strengths here is that all FEP were enrolled at the time of first treatment contact that allowed us to mitigate possible confounds of antipsychotic medication exposure. We also recruited a control group that was carefully matched on key demographic variables, and we controlled for several possible confounding variables (e.g., total ICV) during statistical analyses. We determined DUP with a clinical interview. A quantitative review comparing methods of assessing DUP found that clinical interviews are no less reliable than standardized assessment tools70. While our MRS acquisition sequence does have some drawbacks such as J-modulation and T2 relaxation effects on the spectrum, as well as a water signal that is highly T2-weighted and sensitive to cerebrospinal fluid contamination52, a significant advantage of these acquisition parameters is that it allows us put findings in context of a number of our previous studies for which we used the same acquisition parameters32,71,72. Regarding the hippocampus segmentation, the previous version of computerized segmentation of the hippocampus by Freesurfer was controversial73, but the new version, by using high-resolution T2-weighted MR imaging, addresses these criticisms55.

Summary

Here, we demonstrate evidence of lower hippocampal subfield volumes and altered glutamatergic metabolism in those with longer DUP, suggesting possible biological mechanisms underlying the clinical observation that patients with longer DUP suffer from worse overall outcomes. Our data highlight the critical need for reducing the DUP and for early pharmacological intervention, with the hope to prevent structural deficits and improve clinical outcomes.