Introduction

The perinatal period spanning across pregnancy and postpartum is a vulnerable time for increased mental health issues, which presents several challenges for women and families1. Postpartum depression (PPD) is a common mood disorder that occurs in 10–15% of pregnant women2. As per the Diagnostic and Statistical Manual (DSM-5, fifth edition), PPD is defined as a major depressive episode that occurs within 4 weeks post-delivery3, however many studies investigate depressive symptoms up to 1 year postpartum4,5.

Children who are born to mothers with PPD have poor health outcomes compared to the general population including shorter gestation periods, impaired cardiovascular functioning, increased gastrointestinal infections, reduced weight gain and lower respiratory tract infections6,7,8. PPD is also associated with a range of cascading long-term negative health outcomes for the mother and infant including an increased risk of psychiatric disorders and neurodevelopmental deficits such as behavioural problems and learning difficulties in the offspring9,10,11 and perturbed immune system in mothers and maternal suicide12. In addition, partners of women with PPD are also at risk of having higher rates of depression than expected13.

Despite the high prevalence and negative consequences of PPD, very little is currently known about its underlying biology. Several studies suggest that screening early on in pregnancy can reduce anxiety or depressive symptoms; however, there is very little research and intervention for primary prevention14,15. Potential causes for PPD include psychosocial factors such as a history of depression or a psychiatric disorder during pregnancy, inadequate social support, emotional isolation, financial strain, stressful life events and biological factors such as genetic risk and sensitivity to hormonal changes have been proposed16,17,18. Studies have reported the association between single nucleotide polymorphisms in genes including the serotonin transporter-linked promoter region polymorphism (5-HTTLPR)19, catechol-O-methyl transferase (COMT), monoamine oxidase (MAO)20 and brain-derived neurotrophic factor (BDNF)21. However, the pathogenesis of PPD still remains largely obscure.

One potential mechanism for biological changes associated with PPD are changing in the gene expression and/or DNA methylation of certain genes across pregnancy and postpartum. We and others have previously identified a panel of predictive gene expression markers whose transcriptome profiles in the third trimester of pregnancy could predict, with 88% accuracy (within sample), which women were at a higher risk of PPD; the panel was also replicated in an independent sample22. In the same study, we further investigated the functional relevance of genes associated with PPD and identified a role for the oestrogen signalling pathway in PPD. In a separate study, we confirmed that the predictive gene expression panel for PPD was also associated with transcriptomic and epigenetic sensitivity to sex-steroid manipulation and mapped onto depressive symptoms and serotonin transporter binding in the brain23. Additionally, we have also previously demonstrated that there is an altered peripheral blood transcriptomic landscape in a pharmacological model of sex-steroid induced depression24.

To the best of our knowledge, there is no large study that has investigated genome-wide gene expression profiles in PPD using the latest sequencing technologies and there is still a gap in our knowledge of the understanding of the biological mechanisms underlying PPD.

The aim of this study was to identify genes and pathways associated with PPD by investigating global changes in peripheral blood gene expression in association with depressive symptoms in a large cohort of pregnant women. We also aimed to evaluate and cross-validate earlier observations in independent clinical cohorts and test if patterns translate to (estradiol) sex hormone manipulation model.

Methods

Participants

Women who were pregnant and had a previous history of a clinical diagnosis of depression were recruited to this study as part of the Mood, Mother and Infant (MMI) study (R01HD073220, mmi.web.unc.edu), an ongoing longitudinal cohort study of mother–infant dyads who were extensively phenotyped during the first postpartum year25. The total cohort comprised of 222 mother–infant dyads enrolled in the MMI study. Of these, 164 women drawn from a longitudinal cohort of mother–infant dyads were followed from the third trimester through 12 months postpartum. From these, we selected women who agreed to participate in the genetic component of the study and provided blood sample (n = 159) and those with good RNA quality with RNA integrity numbers of over 8, amounting to a total of 137 women. These were the women included in this study. Participants underwent a baseline visit with a structured clinical interview in the third trimester of pregnancy and also postpartum via trained clinicians. The study was approved by the Human Research Ethics Committees of the University of North Carolina. All subjects gave written informed consent to participation. To reduce technical variation, all the patients were recruited from one site, and all had blood collected with the same protocol using the same tubes.

Clinical measurements

Maternal depressive symptoms during pregnancy and postnatal depressive symptoms were the main outcomes, measured using the Edinburgh Postnatal Depression Scale (EPDS), which has been validated in the antenatal and postnatal periods26,27. The EPDS has been recommended as a screening tool for perinatal depression among women worldwide.

Data on postpartum-depressive symptoms were collected 2 months after birth. The total number of postpartum-depressive symptoms was also tallied to obtain a total score (out of 30) and was then coded as a categorical variable (score <10 or score ≥10) to indicate scores suggestive of postpartum-depressive symptoms. The EPDS cut-points used to indicate probable depression were selected based on previously published studies28,29. For the study, the EPDS was used as the primary assessment measure and the PPD case definition defined as EPDS score of ≥10 postpartum.

Gene expression experiments

The total RNA was extracted from whole blood in PAXgene tubes using the PAXgene Blood miRNA Kit from Qiagen. RNA was quantified on the Nanodrop1000 and Qubit™ fluorimeter using the RNA HS Assay Kit followed by a quality check using the Agilent Bioanalyzer RNA 6000 Nano kit. mRNA libraries were prepared using the TruSeq® Stranded mRNA Library Prep Kit for NeoPrep™ following the manufacturer’s instructions. The Qubit Fluorometric quantitation was used to determine the 100 ng input for each sample. The libraries were sequenced as pools of up to 24 libraries. Post library construction each library was quantified using the Qubit™ dsDNA HS Assay Kit, followed by size determination using the Agilent High Sensitivity DNA Kit. Up to 24 unique indexed libraries were equimolar pooled and run on the NextSeq500/550 using NextSeq500/550 High Output v2 kit (75 cycles) following the manufacturer’s instructions at a final concentration of 1.5 pM. Sequencing runs were monitored in real time through Illumina BaseSpace.

RNASeq read filtering, alignment and normalisation were carried out using the RNA Aligner application hosted on BaseSpace. The average percentage of reads passing filter including abundant reads = 21,049,665, with >98% aligned to the human reference genome.

Statistical analysis

The QC steps were initially performed in R. Raw count data were available from 22,459 genes. The gene-level counts were imported into edgeR and pre-processing involved a filter threshold of >0.7 counts per million (CPM) to remove low expressed genes in at least 50% of the samples followed by trimmed mean of M-values (TMM) normalisation to scale for library size, allowing a total of 12,121 genes for further analysis.

The edgeR package30 was used to test for differential expression by fitting a model to the negative binomial distribution with the model including the CPM data against the phenotype of interest (PPD) and adjusting for maternal age, gestational age, current and past use of medication and psychological treatment, BMI and ethnicity. Multiple testing corrections were performed using the false discovery rate (FDR) at 5% threshold.

In silico functional annotation of the differentially expressed genes was performed using the Webgestalt interface31. Overrepresentation analysis of the biological processes was done as per the Gene Ontology Functional Database using an FDR for multiple testing correction.

Comparison of differentially expressed genes to previous studies was performed on a gene-level and merging was done using R. First, genes overlapping in the current dataset were merged with those assessed in the previously published study. Next, using R, the two results files from this study and previous study were merged to confirm whether genes differentially expressed in the current study were also significant in that study based on the P values of significance. Next, to test whether the overlap of significant genes was more than expected by chance, enrichment testing was performed using 1000 permutations (using random sets) and applying a two-sided Binominal test in R to give a P value of enrichment for the comparisons.

Results

Demographics

The study comprised 137 women from the MMI study cohort at the University of North Carolina, USA, including 15 women with PPD and 122 with no PPD. Details of the women included in the study are provided in Table 1. All gene expression profiles were measured during recruitment during the third trimester of pregnancy.

Table 1 Demographics and clinical characteristics of the samples included in the study.

Gene expression associated with postpartum-depressive symptoms

First, we tested if gene expression profiles of women during the third trimester of pregnancy were significantly associated with depressive symptoms at 2 months postpartum. Through negative binomial regression models using the edgeR package in R30, the gene expression profiles were regressed against the quantitative EPDS scores, with maternal age, ethnicity, gestational age in weeks, current and past medications and psychological treatments and BMI included as covariates. In the MMI cohort, a total of 71 genes were significantly associated with depression scores at 2 months postpartum, after correction for multiple testing at 5% FDR (Fig. 1). Among the top genes were TNFRSR17, GYPA, JCHAIN, SMIM1, TRPM8, SPATA17, HEMGN, PRRT4 and ADAMS15 (Table 2).

Fig. 1: Differentially expressed genes in PPD.
figure 1

MA plot showing up and downregulated genes associated with depressive symptoms at 2 months postpartum.

Table 2 List of differentially expressed genes at the third trimester of pregnancy associated with depressive symptoms at 2 months postpartum at 5% FDR.

Using the Webgestalt interface, we found that the 71 genes were enriched for several immune-related categories as per the Gene Ontology Functional Database (Fig. 2) and the genes were overrepresented within the neutrophil degranulation (P = 1.1E-9, n = 13 genes), innate immunity (P = 1.8E-6, n = 14 genes) and fibrin clot formation (P = 4.5E-5, n = 3 genes) pathways using the Reactome Pathway database.

Fig. 2: Functional annotation.
figure 2

Functional annotation of the differentially expressed genes associated with depressive symptoms at 2 months postpartum.

Gene expression associated with a change in postpartum-depressive symptoms

The gene expression profiles were regressed against the changes in EPDS scores, with maternal age, ethnicity, gestational age in weeks, current and past medications and psychological treatments and BMI included as covariates. When assessing gene expression profiles significantly associated with changes in depressive symptoms from recruitment to 2 months postpartum (2 months postpartum EPDS–EPDS at recruitment), a total of 66 genes were significant at an FDR of 5% in the MMI cohort.

Of these genes, 33 genes were also significantly associated (P < 0.05) with depressive symptoms at 2 months postpartum and of these 14 remained significant at 5% FDR. The genes included OLFM4, RASAL2, BHZ, DEFA1B, DEFA4, C4BPA, CPA5, GPRC5C, LTF, ADAMTS15, CLEC12A and ELANE (Table 3).

Table 3 List of differentially expressed genes at the third trimester of pregnancy associated with changes in depressive symptoms (from the third trimester to 2 months postpartum) at 5% FDR.

Gene expression associated with postpartum depression onset

Next, we assessed the onset of PPD at 2 months postpartum. First, the women were split into three groups

  1. i.

    PPD-onset group—these were women with EPDS < 10 at recruitment and EPDS ≥ 10 at postpartum (anytime between 1 and 12 months postpartum) (N = 25)

  2. ii.

    Depressed group—women with ≥ 10 EPDS at recruitment and ≥10 EPDS at postpartum (anytime between 1 and 12 months postpartum) (N = 44)

  3. iii.

    Control/euthymic group—women with <10 EPDS at recruitment and postpartum (anytime between 1 and 12 months postpartum) (N = 68).

Separate regressions were performed to compare the PPD and depressed groups to the control groups, to identify PPD-onset specific genes. The gene expression profiles were regressed against the group status, with maternal age, ethnicity, gestational age in weeks, current and past medications and psychological treatments and BMI included as covariates. A total of 38 genes were significantly differentially expressed between the PPD and the control groups (Table 4), and of these, 8 genes overlapped with those differentially expressed between the depressed and control groups, indicating that 30 genes showed PPD-onset specific gene expression. Of the 30 genes, 16 genes were also significant in the difference in depressive symptoms (P < 0.05). The full analysis results are shown in Supplementary Table 1.

Table 4 List of differentially expressed genes at the third trimester of pregnancy associated with changes in depressive symptoms (from the third trimester to 2 months postpartum) at 5% FDR.

Overlap with previous postpartum depression onset candidate genes

We compared our results to our previously reported candidate genes whose third trimester of pregnancy gene expression profiles were significantly associated with PPD-onset22. Of the 116 genes associated with PPD onset in our previous study, 84 genes were present in this study. From these, three genes (TMEM189, GALNT10 and FBXL20) were also associated with PPD onset in this study and not with depression per se (enrichment P = 0.08, indicating no significant overlap than expected by chance).

Next, we compared genes associated with PPD in this study with our previous study investigating gene expression and DNA methylation changes in a pharmacological model of depression where women were treated with the gonadotrophin-releasing hormone agonist (GnRHa) to induce depressive symptoms24. In the GnRHa study, we have previously shown that gene expression changes post-GnRHa induction directly relate oestrogen-induced biological changes with depressive symptoms and associated markers of serotonin-signalling in the brain, suggesting that individual variations in molecular sensitivity to oestrogen are associated with susceptibility to hormone-induced mood changes23.

A total of 7552 evaluated genes in this study overlapped with the GnRHa study. Of the genes associated with depressive symptoms (Table 2), change in depressive symptoms at 2 months postpartum (Table 3) or PPD onset (Table 4) at FDR P < 0.05, 10 (of 34, 29% overlap, P value = 4.2e-06), 17 (of 32, 53% overlap, P value = 2.1e-14) and 6 (of 17, 35% overlap, P = 1.2e-4) genes respectively were significantly associated with estradiol changes among the GnRHa-treated women.

Since the GnRhHa model in particular provoked biphasic changes in estradiol, these findings provide further evidence of the role of oestrogen signalling in PPD.

Overlap with the previously reported list of core maternal genes

We investigated a list of 700 core maternal genes identified by Gammie and colleagues32 that showed significant gene expression changes across multiple brain regions. This list was synthesised via bioinformatics analysis using findings from four microarray studies of different maternal brain regions. Of the 700 maternal genes from Gammie et al.32, we had 455 in the current gene expression study that was detected above background and after quality control. Of these, 70/455 (15.4%) were significantly differentially expressed in one of the above analysis (Table 5) of association with depressive symptoms at 2 months postpartum or changes in depressive symptoms from recruitment to 2 months postpartum or differentially expressed between the PPD and the control groups. This is a significant enrichment than expected by chance (P = 7.9e-05).

Table 5 Comparison of differentially expressed genes at the third trimester of pregnancy associated with PPD with previously reported maternal genes by Gammie et al. 2016.

Among genes that showed significant maternal brain gene expression changes32 were interesting candidate genes including the ZFP57 gene, a maternal-zygotic effect gene involved in the stable maintenance of methylation imprints during development33 and the NR1D1 gene which has been highlighted as a key PPD gene and is linked to the steroid receptor.

Discussion

PPD is a common disorder that is frequently underdiagnosed and can lead to severe long-term negative health outcomes of the mother, infant and the family hence early identification of women at high risk for PPD is essential to allow timely preventive intervention and treatment before the symptoms of PPD begin to manifest.

The aim of this study was to evaluate genome-wide gene expression profiles of women from their third trimester of pregnancy to identify transcriptomic changes associated with PPD. To the best of our knowledge, this is the largest PPD gene expression study to date (n = 137), assessing early gene expression profiles during late pregnancy and their association with PPD.

A total of 71 genes were significantly associated with depression scores at 2 months postpartum, after correction for multiple testing. The PPD related genes were enriched for immune-related gene ontology biological processes including humoral immune response and neutrophil-mediated immunity and were overrepresented in the neutrophil degranulation and innate immune system pathways, suggesting a role of immune-pathway genes in PPD. These findings suggest that an altered immunological profile during pregnancy might play a role in PPD vulnerability. There are several theories linking depression with immune responses including the Pathogen Host Defense (PATHOS-D) hypothesis that suggests that depressive symptoms are integral components of immune-mediated host defence against pathogens, having an underlying evolutionary significance34. It remains to be established whether and at what extent the immune abnormalities observed are pre-existing even before pregnancy and how these change across pregnancy, possibly contributing to the development of PPD; larger longitudinal studies will uncover the exact mechanisms.

The top genes associated with 2 months postpartum-depressive scores included TNFRSF17, GYPA, JCHAIN, SMIM1, TRPM8, SPATA17, HEMGN, PRRT4 and ADAMS15. The TNFRSF17 gene is a member of the TNF-receptor superfamily, specifically leading to NF-kappaB and MAPK8/JNK activation, with a likely role in cell survival and proliferation. In another study35, during the third trimester of pregnancy, the levels of TNFRSF17 gene expression significantly upregulated in women with PPD as compared to euthymic pregnant women and depressed patients (depressed during pregnancy and postpartum). TNFRSF17 was also one of the immune response genes upregulated in patients with bipolar disorder36.

Next, we tested whether gene expression in the third trimester of pregnancy was associated with a difference in the third trimester of pregnancy to 2 months postpartum-depressive symptoms, as this was more likely to be of clinical relevance. We identified 66 genes whose gene expression profiles were associated with changes in depressive symptoms at a 5% FDR, these were substantially overlapped (50% at P < 0.05 and 21% at FDR 5%) with genes associated with only 2 months postpartum-depressive symptoms. The genes included OLFM4, RASAL2, BHZ, DEFA1B, DEFA4, C4BPA, CPA5, GPRC5C, LTF, ADAMTS15, CLEC12A and ELANE. The C4BPA has been reported in studies of individuals with psychotic experiences37 and schizophrenia38. The ELANE gene encodes for a protein that is found in neutrophils and plays a role in inflammation and in fighting infection.

When comparing the postpartum-onset group to euthymic controls (and using the depression group as a negative control), the study revealed 30 PPD-onset specific gene expression. Of the 30 genes, 16 genes were also significant in the difference in depressive symptoms (P < 0.05), 15 were also associated with depressive symptoms at 2 months postpartum and 8 of the genes were associated with both. Genes included MMP8, a matrix metalloproteinase–collagen degradation gene, previously reported to be associated with MDD in a genome-wide association study39. Other genes included SLC35F1, which was one of the significant genes for extraversion and neuroticism, and symptoms of anxiety, depression and psychological distress in a genome-wide association uncovering shared genetic effects among personality traits and mood states40.

Our previous research has investigated the transcriptomic landscape among women with PPD and identified that oestrogen signalling plays a key role in vulnerability to PPD. For instance, we and others had identified a panel of 116 genes significantly differentially expressed between women with PPD and euthymic controls22. These genes could predict with 88% accuracy which women went on to develop PPD within the sample and were enriched for those belonging to the oestrogen signalling pathway. In this study, we found three of these predictive PPD genes (TMEM189, GALNT10 and FBXL20) were also associated with PPD onset in the current study and not with depression per se. This overlap is consistent with chance alone (P = 0.80). The lack of enrichment in gene associations across the two studies could be the result of different samples and different technologies (microarray versus next-generation sequencing methods). Nevertheless, when comparing the gene expression profiles in PPD identified in this study with our previous pharmacological model of depression via GnRHa24, we found evidence that 29–53% of the genes were also associated with estradiol changes among the GnRHa-treated women (P values range = 1.2e-4–2.1e-14). Therefore, these results again emphasise the role of oestrogen signalling in PPD.

To further extrapolate and understand the function of the differentially expressed genes associated with PPD, the comparison was performed with the maternal genes identified by Gammie et al.32. This included a list of 700 core maternal genes that showed significant gene expression changes across multiple brain regions. Of the 455 genes present in the current gene expression study, 15.4% were significantly associated with PPD (enrichment P = 7.9e-05). Among genes that showed significant maternal brain gene expression changes32 was the ZFP57 gene, a maternal-zygotic effect gene that contributes towards the stable maintenance of methylation imprints during development33. This gene was significantly associated with depressive symptoms at 2 months postpartum in both discovery and replication cohorts, changes in depressive symptoms from late pregnancy to 2 months postpartum and was differentially expressed between the PPD and the control groups. Another interesting gene from the Gammie et al. study was the NR1D1 gene which was highlighted by the authors as a key PPD gene and is linked to the steroid receptor. NR1D1 gene expression is affected by estradiol, glucocorticoids and progesterone treatment, making this a key maternal brain gene, whose expression can be moderated by placenta-produced hormones and changes in these hormones. In this study, the NR1D1 gene was associated with depressive symptoms at 2 months postpartum.

This study has several limitations. Since this was a cohort-based study, some women (18%) were on medications and this likely influenced gene expression profiles, however, we did correct for medication status in the analysis. Given that only 11% of the women had PPD, to overcome the consequential issue of low power, we analysed both quantitative depression symptoms as well as PPD onset to better understand the influence of depressive symptoms on the transcriptome. Strengths of the study include that this is the largest gene expression study in PPD to date and we validate our results in other independent samples and show enrichment of the identified PPD genes in relevant studies.

In conclusion, this study interrogated the transcriptomic landscape during pregnancy and tested its association with PPD revealing novel genes and pathways associated with PPD and validating previous PPD candidates. Consistent with previous studies including our own research, this study strengthened the findings of altered immune genes in PPD and the role of oestrogen in the pathogenesis of PPD, suggesting avenues for treatment of PPD. These findings add to the existing knowledge of the molecular mechanisms and pathways associated with PPD, providing avenues for improved diagnosis and treatment.