Transcriptome-wide association study identifies new susceptibility genes and pathways for depression

Li, Xiaoyan; Su, Xi; Liu, Jiewei; Li, Huijuan; Li, Ming; Li, Wenqiang; Luo, Xiong-Jian

doi:10.1038/s41398-021-01411-w

Transcriptome-wide association study identifies new susceptibility genes and pathways for depression

Article
Open access
Published: 21 May 2021

Volume 11, article number 306, (2021)
Cite this article

Download PDF

You have full access to this open access article

Translational Psychiatry

Transcriptome-wide association study identifies new susceptibility genes and pathways for depression

Download PDF

Xiaoyan Li^1,2^na1,
Xi Su^3,4^na1,
Jiewei Liu¹,
Huijuan Li¹,
Ming Li ORCID: orcid.org/0000-0002-8197-6552^1,5,
the 23andMe Research Team,
Wenqiang Li^3,4 &
…
Xiong-Jian Luo ORCID: orcid.org/0000-0003-2543-8845^1,5,6

5570 Accesses
26 Citations
3 Altmetric
Explore all metrics

Abstract

Depression is the most prevalent mental disorder with substantial morbidity and mortality. Although genome-wide association studies (GWASs) have identified multiple risk variants for depression, due to the complicated gene regulatory mechanisms and complexity of linkage disequilibrium (LD), the biological mechanisms by which the risk variants exert their effects on depression remain largely unknown. Here, we perform a transcriptome-wide association study (TWAS) of depression by integrating GWAS summary statistics from 807,553 individuals (246,363 depression cases and 561,190 controls) and summary-level gene-expression data (from the dorsolateral prefrontal cortex (DLPFC) of 1003 individuals). We identified 53 transcriptome-wide significant (TWS) risk genes for depression, of which 23 genes were not implicated in risk loci of the original GWAS. Seven out of 53 risk genes (B3GALTL, FADS1, TCTEX1D1, XPNPEP3, ZMAT2, ZNF501 and ZNF502) showed TWS associations with depression in two independent brain expression quantitative loci (eQTL) datasets, suggesting that these genes may represent promising candidates. We further conducted conditional analyses and identified the potential risk genes that driven the TWAS association signal in each locus. Finally, pathway enrichment analysis revealed biologically pathways relevant to depression. Our study identified new depression risk genes whose expression dysregulation may play a role in depression. More importantly, we translated the GWAS associations into risk genes and relevant pathways. Further mechanistic study and functional characterization of the TWS depression risk genes will facilitate the diagnostics and therapeutics for depression.

Further confirmation of netrin 1 receptor (DCC) as a depression risk gene via integrations of multi-omics data

Article Open access 17 March 2020

Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions

Article 04 February 2019

Mendelian randomization integrating GWAS and eQTL data revealed genes pleiotropically associated with major depressive disorder

Article Open access 17 April 2021

Introduction

Depression is a complex and heterogeneous mental disorder characterized by depressed mood, loss of interests, appetite and sleep disturbances, cognitive impairments, feelings of worthlessness and hopelessness^1,2. Depression has a high global prevalence (~4.7%) and over 298 million of the global population were affected by depression^3,4,5. Females were more likely (about twice) to develop depressive symptoms than males⁶. As depression has a high prevalence and is accompanied with substantial morbidity and mortality, it becomes a leading cause of disability worldwide and a major contributor to global disease burden (e.g. the economic burden of depression was estimated about $210.5 billion in US in 2010^4,7). To date, the etiology of depression remains largely unknown. However, accumulating evidence indicate that depression was caused by a combination of genetic and environmental factors. Twin study has estimated the heritability of depression to be ~37%⁸, indicating the important role of genetic component in the development of depression.

In the past decade, we have witnessed the rapid progress of genetic study of depression. Since CONVERGE consortium identified two genome-wide significant risk loci for depression in 2014⁹, multiple exciting findings have been reported by genome-wide association studies (GWASs)^10,11,12,13. Recently, Howard et al. reported the largest GWAS of depression and identified over 100 risk loci that reached genome-wide significance level¹⁴. Despite the fact that GWASs of depression have made great progress in recent years and over 100 depression risk loci have been identified by GWASs, mechanistic investigations and biological interpretations lag far behind. For most of the risk loci, the implicated genes and corresponding mechanisms remain largely unknown. In traditional GWASs, the gene (or genes) that located the nearest to the most significant risk variant was usually assigned as the potential candidate gene. However, due to the complexity of LD (each risk locus usually contains several genes that were in high linkage disequilibrium (LD)) and gene regulation (genetic variants may regulate distal genes rather than the nearest gene through affecting chromosomal conformation change), the nearest gene may not necessarily represent the causal gene by which the identified GWAS risk variants exert their effects on depression¹⁵. As the vast majority of depression risk variants identified by GWASs were located in non-coding regions, it is possible that these risk variants confer depression risk through modulating gene expression rather than altering protein sequences or structures¹⁶.

Transcriptome-wide association study (TWAS) is a powerful approach aimed at identifying risk genes whose expression perturbations may confer disease susceptibility. Through integrating expression quantitative loci (eQTL) results with GWAS associations, TWAS could identify genes whose genetically regulated expression may be associated with diseases risk¹⁷. This method leverages a relatively small set of reference individuals with expression and genotype measured data to impute the expression-trait association statistics from GWAS summary data¹⁸. In addition to prioritizing putative target genes at genome-wide significant loci, TWAS provides an opportunity to detect genes with small effect sizes and located in regions that do not contain genome-wide significant variants¹⁹. Furthermore, the susceptibility genes identified by TWAS can more accurately inform follow-up experimental validation and potential treatment strategies.

In this study, we carried out a TWAS of depression by integrating brain eQTL data (from the dorsolateral prefrontal cortex (DLPFC)) of 1003 subjects (including the CommonMind Consortium (CMC)²⁰ and the second phase of the BrainSeq Consortium (BrainSeq2)²¹) and the largest depression GWAS (including 246,363 cases and 561,190 controls)¹⁴. We identified 53 transcriptome-wide significant (TWS) depression risk genes that reached Bonferroni-corrected significance level. We subsequently performed conditional analysis of all significant TWAS associations to identify independent associations (i.e. the driven genes at each risk locus). Pathway and gene ontology analyses of the TWS genes identified relevant pathways, including formation of fibrin clot, female pregnancy and peripheral axonal degeneration pathways. Finally, we compared the expression level of the TWS depression risk genes in brains of depression cases and controls using RNA-sequencing (RNA-seq) expression data. Overall, our findings highlight the power of TWAS in identifying depression risk genes with small effect size and provide testable targets for further functional validation of depression. Future mechanistic investigations and functional experiments of the TWS depression risk genes will provide pivotal information for the diagnostics and therapeutics of depression.

Methods and materials

GWAS meta-analysis

The summary statistics from the largest GWAS of depression¹⁴ were used in this study. To identify the common risk variants for depression, Howard et al. performed a genome-wide meta-analysis through combining three large-scale depression cohorts (including 23andme, PGC2 and UK Biobank)^10,11,22 (a total of 246,363 cases and 561,190 controls, after excluding overlapping samples) and identified 102 independent genetic variants associated with depression¹⁴. Detailed information about the participants ascertainment, genotyping, quality control and statistical analysis can be found in the original study¹⁴. The summary statistics for the meta-analysis of depression in UK Biobank and PGC_139k cohorts were downloaded from the Edinburgh data share center (https://doi.org/10.7488/ds/2458)¹⁴. We then conducted a meta-analysis by combining summary statistics from above two cohorts and 23andMe¹⁰ using PLINK²³ software and more detailed procedure of fixed-effect meta-analysis has been previously described in study of Li et al.¹². We obtained the depression GWAS summary statistics of 23andme under a data transfer agreement¹².

Transcriptome-wide association analysis

We performed a TWAS through integrating the depression GWAS summary statistics and two sets of eQTL data (i.e. CMC²⁰ (N = 452) and BrainSeq2²¹ (N = 551), respectively). The CMC SNP-expression weights (The SNP-expression weights represent the correlations between SNPs and gene expression in the reference panel while accounting for LD and were computed from different linear models (including BLUP, BSLMM, LASSO, Elastic Net and top SNPs)) were downloaded directly from the FUSION/TWAS website (http://gusevlab.org/projects/fusion/). The BrainSeq2 SNP-expression weights were obtained from the Lieber Institute for Brain Development (LIBD) browser (http://eqtl.brainseq.org/phase2/). Please refer to the original papers for further details on the sample collection, RNA extraction and sequencing, genotyping and statistical analysis^20,21. The SNP-expression weights (i.e. expression weights) were derived using the default processing pipelines described in FUSION (http://gusevlab.org/projects/fusion)¹⁷.

The TWAS was performed using the FUSION software with default settings¹⁷ and a strict Bonferroni-corrected study-wise P threshold (i.e. P = 3.95 × 10⁻⁶ (0.05/12,647) (total number of genes across panels) was used in this study. TWAS analysis utilizes several regularized linear models (including BLUP, LASSO and elastic net) and an additional Bayesian linear mixed model (BSLMM) to evaluate expression imputation. Furthermore, FUSION performs a fivefold cross-validation for each of the desired models to determine which model is the best. For a given gene, SNP-expression weights in the cis-locus were computed using the best prediction model and FUSION typically restricts the cis-locus to 500 kb boundary on either side of the gene. The TWAS calculated Z-score results were used to assess the association between gene and depression and the absolute value of the Z-score reflects the association strength between implicated genes and disease. More detailed information about the principle of FUSION, statistical model and Z-score calculation can be found in the original paper¹⁷.

Transcriptome-wide association analysis using SNP-expression weights from PsychENCODE

We further performed a TWAS using the SNP-expression weights data from PsychENCODE²⁴. Briefly, TWAS was performed using the FUSION package¹⁷ (https://github.com/gusevlab/fusion_twas) as above described, with the use of SNP-expression weights from PsychENCODE (1321 unique individuals). Detailed information about PsychENCODE was provided in PsychENCODE website (http://resource.psychencode.org/) and related publication¹⁷.

Transcriptome-wide association study in the Asian dataset

To further explore the ethnic difference for the TWAS results in depression across populations, we carried out a depression TWAS in the Asian population by integrating eQTL data from lymphoblastoid cell lines of 162 samples (including 80 Han Chinese from Beijing and 82 Japanese in Tokyo, Japan populations)²⁵ and the Chinese GWAS summary statistics (including 5303 women depression and 5337 controls) from the CONVERGE consortium⁹. More detailed information about eQTL and GWAS datasets can be found in the original studies^9,25.

Defining of genes implicated in depression GWAS

We first extracted the genomic coordinates of all TWS genes (based on hg19). We then compared the genomic locations of the TWS genes with genes identified by Howard et al.¹⁴ (i.e. gene located in the 102 depression risk loci identified by Howard et al.). Genes that overlap with any risk locus defined by Howard et al. were considered as genes implicated in depression GWAS.

Conditional and joint analysis

Conditional and joint analysis (based on genes rather than SNPs) were performed for genome-wide significant (Bonferroni-corrected) TWAS signals using FUSION¹⁷. The joint and conditional tests aim to evaluate the GWAS association signal after removing expression association from TWAS (i.e., to investigate if the GWAS signals are still significant after removing the expression association from TWAS). To evaluate the joint/conditional gene model, marginal association statistics (i.e., the main TWAS results) and a correlation/LD matrix are required. Each depression GWAS SNP association was conditioned on the joint gene model (one SNP at a time). The permutation test was used in conditional and joint analysis, with a maximum of 100,000 permutations and an initiate permutation P-value threshold of 0.05 for each feature. “FUSION.post_process.R” script was used for post-processing and generating multiple conditional output plots along with summary statistics.

Pathway analysis

Considering the correlation (i.e., co-expression) of the TWAS implicated genes may affect the independence of the Bonferroni-correction assumption, we tend to relax the Bonferroni-corrected P threshold instead of stringent Bonferroni-corrected standard. Therefore, a relaxed Bonferroni significance threshold (estimated as P = 7.91 × 10⁻⁶ (0.10/12,647)) was used for pathway analyses in our analysis (which allows for more genes to include in the pathway enrichment analysis). GeneNetwork v2.0 (https://genenetwork.nl), which uses gene co-regulation to predict pathway membership and HPO term associations²⁶ was used for pathway analysis. By integrating 31,499 public RNA-seq samples from a wide range of tissues and cell types, GeneNetwork generated multiple pathways for pathway analysis²⁶. As genes are known to cause a particular disease or disease symptom tend to have similar molecular functions or are involved in the same pathway or biological processes^27,28, it’s possible for GeneNetwork to accurately predict gene functions and to prioritize candidate disease genes with high accuracy. Agnostic analyses of pathways in databases (including Gene Ontology (GO) and Reactome) were done to identify special pathways relevant to depression. More detailed information about the principle of GeneNetwork such as principal component analysis (PCA), co-regulation scores calculation, P-value calculation and statistical analyses can be found in the original study²⁶.

Expression analysis of TWAS significant genes in depression cases and controls

TWAS identified a total of 53 genes that showed significant association with depression (after correcting for Bonferroni test). To explore whether the expression levels of depression-associated risk genes identified by TWAS were dysregulated in depression cases compared with controls, we obtained expression data (based on RNA-seq) of brain tissues (only the expression data from the DLPFC were used for further analysis) from three datasets, including GSE102556 (26 depression cases and 22 controls)²⁹, GSE101521 (30 depression cases and 29 controls)³⁰ and GSE80655 (23 depression cases and 24 controls)³¹. The same processing procedure was conducted for quality control, alignment and gene-expression quantification in three RNA-seq datasets. Briefly, Trimmomatic³² was utilized to examine the sequencing quality and trim reads. The clean paired-end reads were then aligned to the human reference genome (GRCh38) by using Hisat2³³ and gene-level reads counts were quantified as transcripts per million (TPM) with featureCounts³⁴. Protein-coding genes with average TPM ≥ 1.0 were extracted and differentially expressed genes (DEGs) were identified (based on read counts using likelihood ratio test (LRT)) for each of three datasets using DESeq2³⁵ R package. More detailed information about sample collection, sample diagnose and RNA sequencing can be found in the original studies³⁶.

Spatio-temporal expression pattern analysis

To explore the spatio-temporal expression pattern of the depression candidate genes identified by TWAS analysis in human brain, we downloaded expression data (based on RNA-seq) from the Allen Institute for Brain Science (http://www.brainspan.org/)³⁷. For a specific brain developmental stage, the mean expression level of all the genes in a geneset was represented as the expression level of the geneset at this stage. The gene-expression level was measured by RPKM (read per kilobase per million mapped reads) and two genesets implicated by TWAS analysis were used in this study. Background genes were extracted from a previous study³⁸.

Cell-type-specific expression analysis of TWS depression genes

The cell-type-specific expression pattern data were obtained from a previous study³⁹. Briefly, Skene et al. carried out a comprehensive single-cell analysis of mouse brain tissues (including the hippocampus, neocortex, striatum and etc). A total of 9970 cells were sequenced and 24 cell types were identified by Skene et al. Skene et al. then calculated the specificity score of each gene in each cell type (a higher specificity score indicates a higher specificity of gene expression in a specific cell type). We firstly converted candidate human genes into mouse orthologs. We then extracted the specificity score for each gene in each cell type. We set a cutoff value of the specificity score to 0.1, and we calculated the number of genes with a specificity score greater than 0.1 in each cell type. All the analyses were performed with R software and more details about the single-cell data could be found in the original paper³⁹.

Spatio-temporal expression pattern analysis of TWS genes in the human brain

We used the expression data from the Allen Institute for Brain Science (http://www.brainspan.org/)³⁷ for spatio-temporal expression analysis. Gene-expression values of the TWS risk genes in the prefrontal cortex (PFC) (N = 42) were downloaded and transformed as previously described⁴⁰.

Tissue-type-specific expression analysis of TWS depression genes

To explore the expression pattern of TWS genes in human tissues, we investigated their expression level in diverse human tissues using the Genotype-Tissue Expression (GTEx) project (http://gtexportal.org/)⁴¹, which includes expression data in 54 human tissues. Detailed information about the GTEx (e.g. sample source or size, gene-expression normalization) can be found in original publication⁴¹ and the GTEx website.

Results

TWAS identifies 53 susceptibility genes for depression

To identify genes associated with depression, we performed a TWAS by integrating two gene-expression reference panels (i.e. CMC and BrainSeq2) and summary-level association data from the largest depression GWAS meta-analysis so far (a total of 807,553 individuals). We performed TWAS using the FUSION software (Methods)¹⁷. Briefly, this approach uses GWAS summary statistics and gene-expression panels with reference LD to estimate the association between the cis-genetic components of gene expression and depression risk. In total, we identified 53 TWS depression-associated genes (summed across two expression reference panels) after Bonferroni correction, including 33 significant genes detected using the CMC dataset (Supplementary Table 1 and Fig. 1a) and 27 genes detected using the BrainSeq2-DLPFC dataset (Supplementary Table 2 and Fig. 1b). Among the TWS associations (53 genes), 30 of the genes were implicated in the original depression GWAS and the remaining 23 genes did not fall within previous GWAS loci (definition of genes implicated in depression GWAS can be found in methods). Notably, we found that 7 out of 53 genes reached transcriptome-wide significance level in both two expression panels (CMC and BrainSeq2), including B3GALTL, FADS1, TCTEX1D1, XPNPEP3, ZMAT2, ZNF501 and ZNF502 (Table 1 and Fig. 1). Among the seven TWS genes, upregulation of XPNPEP3 may be associated with depression risk as it has a positive Z-score (Z > 0 suggests that the gene is predicted to be upregulated in cases compared with controls). However, downregulation of other genes may increase risk of depression (Z < 0). These data suggested that the TWAS significant genes may have a role in depression risk. In addition, the overlapping genes identified in both expression panels represent plausible candidate genes for depression as these genes showed TWS association with depression in two independent eQTL datasets.

**Fig. 1: Manhattan plot of the TWAS results for depression (246,363 cases and 561,190 controls).**

Table 1 Significant TWAS genes both in CMC and LIBD datasets for depression.

Full size table

Expression signals driven the depression TWAS hits

TWAS usually identifies several TWS genes for each of the risk locus. To detect if the identified transcriptome-wide association signals were conditionally independent and to explore whether the GWAS signals remain significant after removing the expression weights from TWAS, we conducted conditional and joint analyses. Our conditional analyses identified several independent TWS genes from both of the brain eQTL datasets (Fig. 2). We found that TCTEX1D1 explains most of the signal at its locus in both CMC dataset (rs10789214 lead SNP P_GWAS = 7.53 × 10⁻⁸, conditioned on TCTEX1D1 lead SNP P_GWAS = 7.83 × 10⁻³) (Fig. 2a) and BrainSeq2 dataset (rs10789214 lead SNP P_GWAS = 7.53 × 10⁻⁸, conditioned on TCTEX1D1 lead SNP P_GWAS = 2.12 × 10⁻²) (Fig. 2b). Similarly, conditional analyses showed that ZNF445 explained most of the signal at its locus in CMC dataset (Fig. 2c) and ZNF502 explained most of the signal at its locus in BrainSeq2 dataset (Fig. 2d). In addition, we also found that FADS1 (Fig. 2e, f), B3GALTL (Fig. 2g, h) and XPNPEP3 (Fig. 2i, j) genes explained most of the variance at their loci in both CMC and BrainSeq2 datasets. Collectively, our conditional analyses identified independent genes that driven the transcriptome-wide association signals.

**Fig. 2: Regional association of transcriptome-wide significant genes.**

Pathway enrichment analysis of the identified TWS genes revealed related biological processes

To detect whether the TWS genes identified by TWAS were enriched in specific pathways, we carried out pathway and gene ontology analysis. All of genes that reached a relaxed Bonferroni-corrected significance were grouped into three different clusters based on co-expression of 31,499 public samples (expression level was quantified with RNA-seq) (Fig. 3). Several biological pathways were significantly enriched among the TWS depression genes, including female pregnancy (Mann–Whitney U-Test, P = 8.53 × 10⁻⁶), formation of Fibrin Clot (Mann–Whitney U-Test, P = 7.92 × 10⁻⁵), cAMP binding (Mann–Whitney U-Test, P = 7.45 × 10⁻⁴) and ephrin signaling (Mann–Whitney U-Test, P = 7.37 × 10⁻⁴) (Table 2). In addition, Human Phenotype Ontology (HPO) analyses suggested that the identified TWAS significant genes were enriched in peripheral axonal degeneration (Mann–Whitney U-Test, P = 5.49 × 10⁻⁴) and neurodevelopmental delay (Mann–Whitney U-Test, P = 7.08 × 10⁻⁴) (Table 2). Interestingly, we noticed that several marginally significant enriched pathways were consistent with previous findings⁴², including G-protein coupled receptor activity (Mann–Whitney U-Test, P = 8.54 × 10⁻²).

**Fig. 3: Gene clustering for the identified TWAS genes based on co-expression.**

Table 2 Significant pathways of TWAS genes identified through gene-network analysis.

Full size table

TWS genes showed higher expression level than background genes in the human brain

In addition to pathway enrichment analysis, another important approach to explore the biological function of geneset is the spatio-temporal gene-expression profiling. Based on the expression data from the BrainSpan (http://www.brainspan.org/), we carried out spatio-temporal expression pattern analysis for two TWS genesets (geneset 1: the TWS depression genes identified both in CMC and BrainSeq2 expression panels, N = 7; geneset 2: all TWS depression genes across two expression panels, N = 53). More detailed information about spatio-temporal expression pattern analysis can be found in the study of Gilman et al.⁴³. The average expression level of all TWS depression genes (Supplementary Table 1 and 2) was significantly higher than the expression level of the background genes across all developmental stages (Wilcoxon rank-sum test, P < 8.66 × 10⁻⁵) (Supplementary Fig. 1). Moreover, we found that the expression level of the TWS depression genes was higher in prenatal stage than the postnatal stages in geneset 2 (P = 1.30 × 10⁻², Wilcoxon rank-sum test) but not in geneset 1 (P = 6.60 × 10⁻², Wilcoxon rank-sum test). These data suggest that the identified TWS depression genes may have pivotal roles in brain development and function.

Cell-type-specific expression analysis of target genes

To explore the expression pattern of the TWS depression genes in different brain cell types, we conducted cell-type-specific expression analyses³⁹. TWS depression genes identified in CMC and BrainSeq2 datasets (a total of 53 genes) (Supplementary Table 1 and 2) were analyzed to investigate if TWS depression genes were specifically expressed in specific cell populations. Notably, we found that the TWS depression genes were primarily expressed in two pyramidal cell types (including hippocampal CA1 pyramidal cells and somatosensory pyramidal cells) (i.e. these two pyramidal cell types contained the most numbers of the TWS depression genes that above the specificity score (i.e. specificity score > 0.1)) (Supplementary Fig. 2). This result is consistent with previous findings and provides further support for the involvement of pyramidal cells in depression^44,45. In addition, this result also suggests that the TWS genes may confer depression risk by affecting the functions of pyramidal cells. To explore the expression pattern of the seven overlapping TWS genes in different human tissues, we performed tissue-type-specific expression analysis using expression data from GTEx. Our results indicated that expression level of FADS1 and TCTEX1D1 is low in most human tissues. However, other five genes (B3GALTL, XPNPEP3, ZMAT2, ZNF501 and ZNF502) are all expressed in human brains (Supplementary Figs. 3–9).

Expression analysis of TWS genes in brains of depression cases and controls

TWAS identified depression-associated genes (Supplementary Tables 1 and 2) under the assumption that genetic variants influence depression risk by modulating gene expression. To further explore if the significant genes identified by TWAS analysis were dysregulated in depression cases compared with controls, we compared the expression level of the TWS genes in brains of depression cases and healthy controls using the expression data from three RNA-seq studies^29,30,31. As previous studies have indicated that the DLPFC may play a pivotal role in depression^46,47, we only selected the expression data from the DLPFC to perform differential expression analysis. These three datasets were GSE102556 (26 depression cases and 22 controls)²⁹, GSE101521 (30 depression cases and 29 controls)³⁰ and GSE80655 (23 depression cases and 24 controls)³¹. The P value was corrected by the Bonferroni correction (for multiple testing) approach, which resulted in a significance threshold of P = 9.43 × 10⁻⁴ (=0.05/53 TWS genes were retained for differential expression analysis analysis). We found that PCDHA8 (P = 1.31 × 10⁻³) was significantly downregulated in depression cases compared with controls in GSE101521 dataset. By contrast, FANCL (P = 4.88 × 10⁻²) showed a significant upregulation in depression cases compared with controls in GSE101521 dataset (Supplementary Table 3). In GSE80655 dataset, PCDHA7 showed a significant upregulation (P = 4.34 × 10⁻³) (Supplementary Table 3). Interestingly, five TWAS significant genes (TMEM161B-AS1, GMPPB, STAU1, NDUFA2 and GPX1) were significantly upregulated in brains of depression cases compared with controls in GSE102556 dataset (Supplementary Table 3). Taken together, these results further support that the identified TWAS significant genes may have a role in depression. In addition, these results also suggest that the identified risk variants may confer depression risk through regulating gene expression.

Discussion

Depression is a common psychiatric disorder that is caused by a combination of multiple risk factors, including genetic, psychological, biological and social factors. Although recent GWASs have successfully identified multiple depression risk loci, the biological interpretations and functional understanding of these associations remain poorly understood (partly due to the inability to fine-map depression-relevant genes). TWAS¹⁷ is a powerful approach to identify associated genes by combining the GWAS results and expression data. In this study, we performed a depression TWAS and identified candidate risk genes for depression. Considering that sample size and tissue matching have pivotal roles in TWAS, we used expression data from the DLPFC (i.e. CMC²⁰ and BrainSeq2²¹) to conduct TWAS analysis. We integrated the summary statistics from the largest depression GWAS¹⁴ so far (N = 807,553 individuals) and gene-expression measurements from two large-scale expression studies to identify TWS genes for depression. Overall, we identified 53 significant genes whose expression were associated with depression risk, of which 23 genes did not overlap with a genome-wide significant locus in the depression GWAS.

Our study also highlighted seven overlapping TWS genes (including B3GALTL, FADS1, TCTEX1D1, XPNPEP3, ZMAT2, ZNF501 and ZNF502) in both CMC and BrainSeq2 datasets. These seven TWS genes may represent high-confidence risk genes for depression as they reached transcriptome-wide significance level in two independent expression datasets. B3GLCT encodes Beta 3-Glucosyltransferase, which is involved in metabolism of proteins and O-glycosylation of TSR domain-containing proteins⁴⁸. No previous study showed association between this gene and depression. TCTEX1D1 (Tctex1 Domain Containing 1) has a role in organelle biogenesis and maintenance, and intraflagellar transport⁴⁹. The protein encoded by XPNPEP3 belongs to the family of X-pro-aminopeptidases, which remove the N-terminal amino acid from peptides with a proline residue in the penultimate position⁵⁰. ZMAT2 (Zinc Finger Matrin-Type 2) is related to nucleic acid binding. ZNF501 (Zinc Finger Protein 501) encodes a DNA-binding transcription factor. An important paralog of ZNF501 is ZNF502. There were no previous studies reported associations between ZNF501/ZNF502 and depression. It is possible that ZNF501/ZNF502 confer risk of depression by regulating gene expression. FADS1 encodes fatty acid desaturase 1, a protein that is mainly involved in metabolism of alpha-linolenic (omega3), linoleic (omega6) acid and metabolism⁵¹. Interestingly, a previous study has showed that mRNA expression of FADS1 was significantly lower in depression patients compared with controls, suggested that FADS1 may have pivotal roles in depression⁵². More work is needed to characterize the potential role of these genes in depression.

We further performed a depression TWAS analysis using the expression weights data from PsychENCODE²⁴. Among the seven overlapping TWS genes, three genes (i.e. B3GALTL, FADS1 and ZMAT2) showed significant associations in PsychENCODE dataset (Supplementary Table 4). Spatio-temporal expression pattern analysis showed distinct expression patterns of these seven overlapping TWS genes in the developing and adult human brain (Supplementary Fig. 10). Five genes (FADS1, XPNPEP3, ZMAT2, ZNF501 and ZNF502) showed higher expression in the prenatal developing human brain than adult brain. However, other two genes (B3GALTL and TCTEX1D1) showed reverse spatio-temporal expression patterns.

Of note, we noticed that previous studies^11,53 also used CMC dataset as SNP-expression weights for depression TWAS. To identify genes whose genetically regulated expression are associated with complex diseases and traits, Mancuso et al. performed a comprehensive TWAS by integrating multiple expression references and GWAS summary statistics⁵³. They identified over 1000 genes whose expression are associated with diseases or traits. Based on this comprehensive TWAS, they developed TWAS hub (http://twas-hub.org/), which includes multiple TWAS results and provides a convenient online resource to explore if a specific gene is associated with disease or trait. However, the sample size (i.e. depression cases and controls) included in this study is relatively small (N = 135,458 depression cases and 344,901 controls). We used the summary statistics from a larger depression GWAS (N = 246,363 depression cases and 561,190 controls) in this study. In addition, we also utilized another independent expression weights dataset (BrainSeq2) in this study.

We also explored the ethnic difference for the TWAS or GWAS results for major depressive disorder across populations (i.e. Caucasian v.s. Asian). We compared the depression GWAS performed in Asian (CONVERGE⁹) and European populations¹⁴. The CONVERGE consortium showed that rs12415800 (P = 1.92 × 10⁻⁸) and rs35936514 (P = 1.27 × 10⁻⁸) were significantly associated with depression in Chinese population⁹. However, recent large-scale GWAS did not found significant associations between these risk variants and depression in European populations¹⁴. The frequencies of the risk alleles of the identified risk variants (rs12415800 and rs35936514) show dramatic differences in Chinese and European populations, suggesting the potential ethnic difference (i.e. population heterogeneity) of depression risk variants. We further performed a TWAS using expression weights data²⁵ and GWAS⁹ of the Asians. We did not identify any TWS genes after Bonferroni correction (P < 7.32 × 10⁻⁵) (Supplementary Table 5). However, it should be noted that the sample size included in eQTL and GWAS were relatively small. In addition, as there is no public available brain eQTL of Asians, we used eQTL data from lymphoblastoid cell lines. More work is needed to explore the ethnic difference for the TWAS or GWAS results for major depressive disorder across populations (i.e. Caucasian v.s. Asian).

To explore if the three TWS genes (i.e. ZNF501, ZNF502 and B3GALTL) for depression were also associated with schizophrenia and bipolar disorder, we first compared the genetic results of these three TWS genes in GWAS of depression¹⁴, bipolar disorder⁵⁴ and schizophrenia⁵⁵. These TWS genes did not show significant associations with bipolar disorder and schizophrenia (Supplementary Fig. 11). To explore if expression of ZNF501, ZNF502, B3GALTL were associated with schizophrenia and bipolar disorder, we further examined TWAS results of these three genes (i.e. ZNF501, ZNF502 and B3GALTL) in PsychENCODE²⁴ (Supplementary Table 6). ZNF501, ZNF502 and B3GALTL did not show TWS associations with schizophrenia and bipolar disorder, suggesting that the association is depression specific. More work is needed to investigate the role of these genes in depression.

In addition to the seven overlapping TWS genes, other genes may also have a role in depression. For example, although eight genes (PCDHA8, FANCL, TMEM161B-AS1, GMPPB, STAU1, NDUFA2, GPX1 and PCDHA7) were not included in the overlapping TWS gene list, we noticed that these genes were significantly dysregulated in the DLPFC of depression cases compared with controls (Supplementary Table 3), suggesting the potential role of these genes in depression. Further work is needed to investigate the potential role of these genes in depression.

The majority of TWS depression genes identified in our study are located around known GWAS loci. Interestingly, conditional and joint analyses (conditioning on the top TWAS gene) demonstrated that several of the genome-wide significant signals from the depression GWAS were driven by the TWAS expression signals. There was little residual association signal from the genetic variant in the GWAS locus after conditioning on the predicted expression signals. Notably, our TWAS analysis suggested that the FADS1 gene may represent a promising candidate for depression (as the length of FADS1 was small compared with nearby genes, these smaller genes are often overlooked in GWAS because there are many larger protein-coding genes nearby). Our TWAS results showed that the expression of FADS1 largely explains the GWAS signal of depression, suggesting the power of TWAS to detect promising target genes. Previous study has showed that the mRNA expression of FADS1 was significantly downregulated in the postmortem prefrontal cortex of major depression patients compared with controls⁵². Another microarray study also found that FADS1 expression was downregulated in the prefrontal cortex of suicidal male major depression patients⁵⁶. These results suggest that FADS1 implicated in TWAS may represent a novel risk gene for depression.

In addition to identifying the candidate genes for depression, pathway enrichment analysis was also performed to better understand the biological implications of these TWAS significant genes in the context of the biological processes. Our pathways analysis identified several interesting pathways for depression, including female pregnancy, formation of Fibrin Clot, cAMP binding and ephrin signaling. Besides, our pathway analysis also confirmed previously identified pathways for depression. Among these enrichment pathways, synaptic transmission (Mann–Whitney U-Test, P = 1.94 × 10⁻²), dopaminergic (Mann–Whitney U-Test, P = 1.94 × 10⁻²) and G-protein coupled receptor signaling pathway (Mann–Whitney U-Test, P = 5.01 × 10⁻²) have been reported to be implicated in the pathogenesis of depression^57,58,59. Finally, we further performed the differential expression analysis for the identified TWS genes across three depression datasets (GSE102556²⁹, GSE101521³⁰ and GSE80655³¹). Several TWAS significant genes were also dysregulated in brains of depression cases compared with controls (including PCDHA8, FANCL, TMEM161B-AS1, GMPPB, STAU1, NDUFA2, GPX1 and PCDHA7), implying that genetic variants may contribute to depression risk by regulating gene expression.

There were several limitations in this study. First, TWAS typically restricts to impute the cis genetic component of expression on traits, thus, variants influencing depression but are independent of cis expression will not be identified. Second, the number of identified TWAS genes is limited by the size of the training cohort (i.e. reference individuals with expression and genotype measured data) and the quality of the training data. Therefore, more work is needed to increase the sample size and quality of expression data for future TWAS analysis. Finally, the summary-based TWAS could not pinpoint the causal variants and genes for depression. More works are needed to pinpoint the causal variants and to elucidate their roles in depression pathogenesis.

In summary, we performed a depression TWAS based on the integration of depression GWAS summary statistics and gene-expression data from the DLPFC. We identified promising candidate susceptibility genes for depression. Further functional characterization of the identified TWS genes will provide pivotal information for understanding the etiology of depression, facilitating biological interpretations of depression GWAS results and prioritizing potential targets for drug development.

Data availability

All data relevant to the study are included in the article or uploaded as online supplementary information. The data generated in this study will be available from the corresponding author on reasonable request.

References

Fava, M. & Kendler, K. S. Major depressive disorder. Neuron 28, 335–341 (2000).
Article CAS PubMed Google Scholar
Otte, C. et al. Major depressive disorder. Nat. Rev. Dis. Prim. 2, 16065 (2016).
Article PubMed Google Scholar
Ferrari, A. J. et al. The epidemiological modelling of major depressive disorder: application for the Global Burden of Disease Study 2010. PLoS ONE 8, e69637 (2013).
Article CAS PubMed PubMed Central Google Scholar
Greenberg, P. E., Fournier, A. A., Sisitsky, T., Pike, C. T. & Kessler, R. C. The economic burden of adults with major depressive disorder in the United States (2005 and 2010). J. Clin. Psychiatry 76, 155–162 (2015).
Article PubMed Google Scholar
Bromet, E. et al. Cross-national epidemiology of DSM-IV major depressive episode. BMC Med. 9, 90 (2011).
Article PubMed PubMed Central Google Scholar
Romans, S. E., Tyas, J., Cohen, M. M. & Silverstone, T. Gender differences in the symptoms of major depressive disorder. J. Nerv. Ment. Dis. 195, 905–911 (2007).
Article PubMed Google Scholar
GBD 2015. Disease and Injury Incidence and Prevalence Collaborators Global, regional, and national incidence, prevalence, and years lived with disability for 310 diseases and injuries, 1990–2015: a systematic analysis for the Global Burden of Disease Study 2015. Lancet 388, 1545–1602 (2016).
Article Google Scholar
Sullivan, P. F., Neale, M. C. & Kendler, K. S. Genetic epidemiology of major depression: review and meta-analysis. Am. J. Psychiatry 157, 1552–1562 (2000).
Article CAS PubMed Google Scholar
consortium, C. Sparse whole-genome sequencing identifies two loci for major depressive disorder. Nature 523, 588–591 (2015).
Article CAS Google Scholar
Hyde, C. L. et al. Identification of 15 genetic loci associated with risk of major depression in individuals of European descent. Nat. Genet. 48, 1031–1036 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wray, N. R. et al. Genome-wide association analyses identify 44 risk variants and refine the genetic architecture of major depression. Nat. Genet. 50, 668–681 (2018).
Article CAS PubMed PubMed Central Google Scholar
Li, X. et al. Common variants on 6q16.2, 12q24.31 and 16p13.3 are associated with major depressive disorder. Neuropsychopharmacology 43, 2146–2153 (2018).
Article CAS PubMed PubMed Central Google Scholar
Huo, Y. X. et al. Identification of SLC25A37 as a major depressive disorder risk gene. J. Psychiatr. Res. 83, 168–175 (2016).
Article PubMed Google Scholar
Howard, D. M. et al. Genome-wide meta-analysis of depression identifies 102 independent variants and highlights the importance of the prefrontal brain regions. Nat. Neurosci. 22, 343–352 (2019).
Article CAS PubMed PubMed Central Google Scholar
Smemo, S. et al. Obesity-associated variants within FTO form long-range functional connections with IRX3. Nature 507, 371–375 (2014).
Article CAS PubMed PubMed Central Google Scholar
Zhong, J. et al. Integration of GWAS and brain eQTL identifies FLOT1 as a risk gene for major depressive disorder. Neuropsychopharmacology 44, 1542–1551 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gusev, A. et al. Integrative approaches for large-scale transcriptome-wide association studies. Nat. Genet. 48, 245–252 (2016).
Article CAS PubMed PubMed Central Google Scholar
Wainberg, M. et al. Opportunities and challenges for transcriptome-wide association studies. Nat. Genet. 51, 592–599 (2019).
Article CAS PubMed PubMed Central Google Scholar
Gusev, A. et al. Transcriptome-wide association study of schizophrenia and chromatin activity yields mechanistic disease insights. Nat. Genet. 50, 538–548 (2018).
Article CAS PubMed PubMed Central Google Scholar
Fromer, M. et al. Gene expression elucidates functional impact of polygenic risk for schizophrenia. Nat. Neurosci. 19, 1442–1453 (2016).
Article CAS PubMed PubMed Central Google Scholar
Collado-Torres, L. et al. Regional heterogeneity in gene expression, regulation, and coherence in the frontal cortex and hippocampus across development and schizophrenia. Neuron 103, 203–216 e208 (2019).
Article CAS PubMed PubMed Central Google Scholar
Howard, D. M. et al. Genome-wide association study of depression phenotypes in UK Biobank identifies variants in excitatory synaptic pathways. Nat. Commun. 9, 1470 (2018).
Article PubMed PubMed Central CAS Google Scholar
Purcell, S. et al. PLINK: a tool set for whole-genome association and population-based linkage analyses. Am. J. Hum. Genet. 81, 559–575 (2007).
Article CAS PubMed PubMed Central Google Scholar
Gandal, M. J. et al. Transcriptome-wide isoform-level dysregulation in ASD, schizophrenia, and bipolar disorder. Science https://doi.org/10.1126/science.aat8127 (2018).
Stranger, B. E. et al. Patterns of cis regulatory variation in diverse human populations. PLoS Genet. 8, e1002639 (2012).
Article CAS PubMed PubMed Central Google Scholar
Deelen, P. et al. Improving the diagnostic yield of exome- sequencing by predicting gene-phenotype associations using large-scale gene expression analysis. Nat. Commun. 10, 2837 (2019).
Article PubMed PubMed Central CAS Google Scholar
Luo, X. et al. Protein-protein interaction and pathway analyses of top schizophrenia genes reveal schizophrenia susceptibility genes converge on common molecular networks and enrichment of nucleosome (chromatin) assembly genes in schizophrenia susceptibility loci. Schizophr. Bull. 40, 39–49 (2014).
Article PubMed Google Scholar
Liu, J., Li, M., Luo, X. J. & Su, B. Systems-level analysis of risk genes reveals the modular nature of schizophrenia. Schizophr. Res. 201, 261–269 (2018).
Article PubMed Google Scholar
Labonte, B. et al. Sex-specific transcriptional signatures in human depression. Nat. Med. 23, 1102–1111 (2017).
Article CAS PubMed PubMed Central Google Scholar
Pantazatos, S. P. et al. Whole-transcriptome brain expression and exon-usage profiling in major depression and suicide: evidence for altered glial, endothelial and ATPase activity. Mol. Psychiatry 22, 760–773 (2017).
Article CAS PubMed Google Scholar
Ramaker, R. C. et al. Post-mortem molecular profiling of three psychiatric disorders. Genome Med. 9, 72 (2017).
Article PubMed PubMed Central CAS Google Scholar
Bolger, A. M., Lohse, M. & Usadel, B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics 30, 2114–2120 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kim, D., Langmead, B. & Salzberg, S. L. HISAT: a fast spliced aligner with low memory requirements. Nat. Methods 12, 357–360 (2015).
Article CAS PubMed PubMed Central Google Scholar
Liao, Y., Smyth, G. K. & Shi, W. featureCounts: an efficient general purpose program for assigning sequence reads to genomic features. Bioinformatics 30, 923–930 (2014).
Article CAS PubMed Google Scholar
Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 550 (2014).
Article PubMed PubMed Central CAS Google Scholar
Li, H. J. et al. Transcriptomic analyses of humans and mice provide insights into depression. Zool. Res. 41, 632–643 (2020).
Article PubMed PubMed Central Google Scholar
Kang, H. J. et al. Spatio-temporal transcriptome of the human brain. Nature 478, 483–489 (2011).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Q. et al. Systems-level analysis of human aging genes shed new light on mechanisms of aging. Hum. Mol. Genet. 25, 2934–2947 (2016).
CAS PubMed PubMed Central Google Scholar
Skene, N. G. et al. Genetic identification of brain cell types underlying schizophrenia. Nat. Genet. 50, 825–833 (2018).
Article CAS PubMed PubMed Central Google Scholar
Yang, C. P. et al. Comprehensive integrative analyses identify GLT8D1 and CSNK2B as schizophrenia risk genes. Nat. Commun. 9, 838 (2018).
Article PubMed PubMed Central CAS Google Scholar
GTEx Consortium. The Genotype-Tissue Expression (GTEx) project. Nat. Genet. 45, 580–585 (2013).
Article CAS Google Scholar
Seney, M. L. et al. Opposite molecular signatures of depression in men and women. Biol. Psychiatry 84, 18–27 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gilman, S. R. et al. Diverse types of genetic variation converge on functional gene networks involved in schizophrenia. Nat. Neurosci. 15, 1723–1728 (2012).
Article CAS PubMed PubMed Central Google Scholar
Hercher, C., Canetti, L., Turecki, G. & Mechawar, N. Anterior cingulate pyramidal neurons display altered dendritic branching in depressed suicides. J. Psychiatr. Res. 44, 286–293 (2010).
Article PubMed Google Scholar
Brager, D. H. & Johnston, D. Plasticity of intrinsic excitability during long-term depression is mediated through mGluR-dependent changes in I(h) in hippocampal CA1 pyramidal neurons. J. Neurosci. 27, 13926–13937 (2007).
Article CAS PubMed PubMed Central Google Scholar
Koenigs, M. & Grafman, J. The functional neuroanatomy of depression: distinct roles for ventromedial and dorsolateral prefrontal cortex. Behav. Brain Res. 201, 239–243 (2009).
Article PubMed PubMed Central Google Scholar
Cooney, R. E., Joormann, J., Eugene, F., Dennis, E. L. & Gotlib, I. H. Neural correlates of rumination in depression. Cogn. Affect Behav. Neurosci. 10, 470–478 (2010).
Article PubMed PubMed Central Google Scholar
Weh, E., Takeuchi, H., Muheisen, S., Haltiwanger, R. S. & Semina, E. V. Functional characterization of zebrafish orthologs of the human Beta 3-Glucosyltransferase B3GLCT gene mutated in Peters Plus Syndrome. PLoS ONE 12, e0184903 (2017).
Article PubMed PubMed Central CAS Google Scholar
Spitali, P. et al. TCTEX1D1 is a genetic modifier of disease progression in Duchenne muscular dystrophy. Eur. J. Hum. Genet. 28, 815–825 (2020).
Article CAS PubMed PubMed Central Google Scholar
Singh, R. et al. Structure of the human aminopeptidase XPNPEP3 and comparison of its in vitro activity with Icp55 orthologs: Insights into diverse cellular processes. J. Biol. Chem. 292, 10035–10047 (2017).
Article CAS PubMed PubMed Central Google Scholar
Koletzko, B. et al. FADS1 and FADS2 polymorphisms modulate fatty acid metabolism and dietary impact on health. Annu. Rev. Nutr. 39, 21–44 (2019).
Article CAS PubMed Google Scholar
McNamara, R. K. & Liu, Y. Reduced expression of fatty acid biosynthesis genes in the prefrontal cortex of patients with major depressive disorder. J. Affect Disord. 129, 359–363 (2011).
Article CAS PubMed Google Scholar
Mancuso, N. et al. Integrating gene expression with summary association statistics to identify genes associated with 30 complex traits. Am. J. Hum. Genet. 100, 473–487 (2017).
Article CAS PubMed PubMed Central Google Scholar
Stahl, E. A. et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat. Genet. 51, 793–803 (2019).
Article CAS PubMed PubMed Central Google Scholar
Pardinas, A. F. et al. Common schizophrenia alleles are enriched in mutation-intolerant genes and in regions under strong background selection. Nat. Genet. 50, 381–389 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lalovic, A., Klempan, T., Sequeira, A., Luheshi, G. & Turecki, G. Altered expression of lipid metabolism and immune response genes in the frontal cortex of suicide completers. J. Affect Disord. 120, 24–31 (2010).
Article CAS PubMed Google Scholar
Lee, P. H. et al. Multi-locus genome-wide association analysis supports the role of glutamatergic synaptic transmission in the etiology of major depressive disorder. Transl. Psychiatry 2, e184 (2012).
Article CAS PubMed PubMed Central Google Scholar
Willner, P., Hale, A. S. & Argyropoulos, S. Dopaminergic mechanism of antidepressant action in depressed patients. J. Affect Disord. 86, 37–45 (2005).
Article CAS PubMed Google Scholar
Tomita, H. et al. G protein-linked signaling pathways in bipolar and major depressive disorders. Front. Genet. 4, 297 (2013).
Article PubMed PubMed Central CAS Google Scholar

Download references

Acknowledgements

We would like to thank the research participants and employees of 23andMe for making this work possible. We thank the following members of the 23andMe Research Team: Michelle Agee, Babak Alipanahi, Adam Auton, Robert K. Bell, Katarzyna Bryc, Sarah L. Elson, Pierre Fontanillas, Nicholas A. Furlotte, David A. Hinds, Karen E. Huber, Aaron Kleinman, Nadia K. Litterman, Jennifer C. McCreight, Matthew H. McIntyre, Joanna L. Mountain, Elizabeth S. Noblin, Carrie A.M. Northover, Steven J. Pitts, J. Fah Sathirapongsasuti, Olga V. Sazonova, Janie F. Shelton, Suyash Shringarpure, Chao Tian, Joyce Y. Tung, Vladimir Vacic and Catherine H. Wilson, who generated and made the summary statistics available for us, which made this work possible. One of the brain eQTL datasets used in this study was generated as part of the CommonMind Consortium supported by funding from Takeda Pharmaceuticals Company Limited, F. Hoffman-La Roche Ltd. and NIH grants R01MH085542, R01MH093725, P50MH066392, P50MH080405, R01MH097276, RO1-MH-075916, P50M096891, P50MH084053S1, R37MH057881 and R37MH057881S1, HHSN271201300031C, AG02219, AG05138 and MH06692. Brain tissue for the study was obtained from the following brain bank collections: the Mount Sinai NIH Brain and Tissue Repository, the University of Pennsylvania Alzheimer’s Disease Core Center, the University of Pittsburgh NeuroBioBank and Brain and Tissue Repositories and the NIMH Human Brain Collection Core. CMC Leadership: Pamela Sklar, Joseph Buxbaum (Icahn School of Medicine at Mount Sinai), Bernie Devlin, David Lewis (University of Pittsburgh), Raquel Gur, Chang-Gyu Hahn (University of Pennsylvania), Keisuke Hirai, Hiroyoshi Toyoshiba (Takeda Pharmaceuticals Company Limited), Enrico Domenici, Laurent Essioux (F. Hoffman-La Roche Ltd), Lara Mangravite, Mette Peters (Sage Bionetworks), Thomas Lehner, and Barbara Lipska (NIMH). The Genotype-Tissue Expression (GTEx) Project was supported by the Common Fund of the Office of the Director of the National Institutes of Health and by NCI, NHGRI, NHLBI, NIDA, NIMH and NINDS. This work was equally supported by the Innovative Research Team of Science and Technology Department of Yunnan Province (2019HC004) and the Distinguished Young Scientists grant of the Yunnan Province (202001AV070006) to X.-J.L. Also was supported by grants from the National Natural Science Foundation of China (U1904130 to W.L.) and the National Key Research and Development Program of China (Stem Cell and Translational Research) (2016YFA0100900 to X.-J.L.).

Author information

These authors contributed equally: Xiaoyan Li, Xi Su

Authors and Affiliations

Key Laboratory of Animal Models and Human Disease Mechanisms of the Chinese Academy of Sciences & Yunnan Province, Kunming Institute of Zoology, Chinese Academy of Sciences, 650204, Kunming, Yunnan, China
Xiaoyan Li, Jiewei Liu, Huijuan Li, Ming Li & Xiong-Jian Luo
Key Laboratory of Intelligent Computing and Signal Processing of Ministry of Education, Institutes of Physical Science and Information Technology, Anhui University, 230601, Hefei, Anhui, China
Xiaoyan Li
Henan Mental Hospital, The Second Affiliated Hospital of Xinxiang Medical University, Xinxiang, Henan, China
Xi Su & Wenqiang Li
Henan Key Lab of Biological Psychiatry, International Joint Research Laboratory for Psychiatry and Neuroscience of Henan, Xinxiang Medical University, Xinxiang, Henan, China
Xi Su & Wenqiang Li
KIZ-CUHK Joint Laboratory of Bioresources and Molecular Research in Common Diseases, Kunming Institute of Zoology, Chinese Academy of Sciences, 650204, Kunming, Yunnan, China
Ming Li & Xiong-Jian Luo
Center for Excellence in Animal Evolution and Genetics, Chinese Academy of Sciences, 650204, Kunming, China
Xiong-Jian Luo

Authors

Xiaoyan Li
View author publications
You can also search for this author in PubMed Google Scholar
Xi Su
View author publications
You can also search for this author in PubMed Google Scholar
Jiewei Liu
View author publications
You can also search for this author in PubMed Google Scholar
Huijuan Li
View author publications
You can also search for this author in PubMed Google Scholar
Ming Li
View author publications
You can also search for this author in PubMed Google Scholar
Wenqiang Li
View author publications
You can also search for this author in PubMed Google Scholar
Xiong-Jian Luo
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

the 23andMe Research Team

Contributions

X.-J.L. conceived, designed and supervised the whole study. X.Y.L. performed most of the bioinformatic analyses, including the GWAS meta-analysis, the TWAS analysis, conditional and joint analyses and pathway analyses of TWAS significant genes. J.W.L. conducted spatio-temporal expression pattern analysis and cell-type-specific expression analysis of TWAS significant genes. H.J.L. carried out differential expression analysis in depression cases and controls. All 23andMe Research Team members contributed to generating and making the summary statistics. X.Y.L., X.S., J.W.L., H.J.L., M.L., 23andMe, W.Q.L and X.-J.L. contributed to this work in data generation, analysis and the interpretation of the results. X.Y.L. drafted the first version in writing the manuscript. X.-J.L. supervised the project and was in charge of overall direction. All authors provided critical comments and approved the final manuscript.

Corresponding authors

Correspondence to Wenqiang Li or Xiong-Jian Luo.

Ethics declarations

Conflict of interest

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Li, X., Su, X., Liu, J. et al. Transcriptome-wide association study identifies new susceptibility genes and pathways for depression. Transl Psychiatry 11, 306 (2021). https://doi.org/10.1038/s41398-021-01411-w

Download citation

Received: 02 January 2021
Revised: 22 April 2021
Accepted: 30 April 2021
Published: 21 May 2021
DOI: https://doi.org/10.1038/s41398-021-01411-w
Springer Nature Limited

This article is cited by

Investigating neonatal health risk variables through cell-type specific methylome-wide association studies
- Thomas L. Campbell
- Lin Y. Xie
- Karolina A. Aberg
Clinical Epigenetics (2024)
Genome-wide meta-analysis, functional genomics and integrative analyses implicate new risk genes and therapeutic targets for anxiety disorders
- Wenqiang Li
- Rui Chen
- Xiong-Jian Luo
Nature Human Behaviour (2023)
Transcriptome-wide association analyses identify an association between ARL14EP and polycystic ovary syndrome
- Sarah M. Lyle
- Samah Ahmed
- Britt I. Drögemöller
Journal of Human Genetics (2023)
TWAS revealed significant causal loci for milk production and its composition in Murrah buffaloes
- Supriya Chhotaray
- Vikas Vohra
- Gopal Gowane
Scientific Reports (2023)
Variable number tandem repeats (VNTRs) as modifiers of breast cancer risk in carriers of BRCA1 185delAG
- Yuan Chun Ding
- Aaron W. Adamson
- Susan L. Neuhausen
European Journal of Human Genetics (2023)

Transcriptome-wide association study identifies new susceptibility genes and pathways for depression

Abstract

Similar content being viewed by others

Introduction

Methods and materials

GWAS meta-analysis

Transcriptome-wide association analysis

Transcriptome-wide association analysis using SNP-expression weights from PsychENCODE

Transcriptome-wide association study in the Asian dataset

Defining of genes implicated in depression GWAS

Conditional and joint analysis

Pathway analysis

Expression analysis of TWAS significant genes in depression cases and controls

Spatio-temporal expression pattern analysis

Cell-type-specific expression analysis of TWS depression genes

Spatio-temporal expression pattern analysis of TWS genes in the human brain

Tissue-type-specific expression analysis of TWS depression genes

Results

TWAS identifies 53 susceptibility genes for depression

Expression signals driven the depression TWAS hits

Pathway enrichment analysis of the identified TWS genes revealed related biological processes

TWS genes showed higher expression level than background genes in the human brain

Cell-type-specific expression analysis of target genes

Expression analysis of TWS genes in brains of depression cases and controls

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

the 23andMe Research Team

Contributions

Corresponding authors

Ethics declarations

Conflict of interest

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation