RNA-Seq analysis implicates dysregulation of the immune system in schizophrenia

Xu, Junzhe; Sun, Jingchun; Chen, Jingchun; Wang, Lily; Li, Anna; Helm, Matthew; Dubovsky, Steven L; Bacanu, Silviu-Alin; Zhao, Zhongming; Chen, Xiangning

doi:10.1186/1471-2164-13-S8-S2

RNA-Seq analysis implicates dysregulation of the immune system in schizophrenia

Research
Open access
Published: 17 December 2012

Volume 13, article number S2, (2012)
Cite this article

Download PDF

You have full access to this open access article

BMC Genomics Aims and scope Submit manuscript

RNA-Seq analysis implicates dysregulation of the immune system in schizophrenia

Download PDF

Junzhe Xu^1,2,3,
Jingchun Sun^4,5,
Jingchun Chen⁶,
Lily Wang⁷,
Anna Li²,
Matthew Helm²,
Steven L Dubovsky¹,
Silviu-Alin Bacanu⁶,
Zhongming Zhao^4,5,8 &
…
Xiangning Chen^6,9

8148 Accesses
53 Citations
1 Altmetric
Explore all metrics

Abstract

Background

While genome-wide association studies identified some promising candidates for schizophrenia, the majority of risk genes remained unknown. We were interested in testing whether integration gene expression and other functional information could facilitate the identification of susceptibility genes and related biological pathways.

Results

We conducted high throughput sequencing analyses to evaluate mRNA expression in blood samples isolated from 3 schizophrenia patients and 3 healthy controls. We also conducted pooled sequencing of 10 schizophrenic patients and matched controls. Differentially expressed genes were identified by t-test. In the individually sequenced dataset, we identified 198 genes differentially expressed between cases and controls, of them 19 had been verified by the pooled sequencing dataset and 21 reached nominal significance in gene-based association analyses of a genome wide association dataset. Pathway analysis of these differentially expressed genes revealed that they were highly enriched in the immune related pathways. Two genes, S100A8 and TYROBP, had consistent changes in expression in both individual and pooled sequencing datasets and were nominally significant in gene-based association analysis.

Conclusions

Integration of gene expression and pathway analyses with genome-wide association may be an efficient approach to identify risk genes for schizophrenia.

Transcriptome sequencing study implicates immune-related genes differentially expressed in schizophrenia: new data and a meta-analysis

Article Open access 18 April 2017

Altered expression of mRNA profiles in blood of early-onset schizophrenia

Article Open access 06 January 2016

What genes are differentially expressed in individuals with schizophrenia? A systematic review

Article Open access 28 January 2022

Background

Schizophrenia is characterized by delusions, hallucinations, and deficits in cognitive function. Over the years, epidemiologic studies have accumulated significant evidence that many genetic factors play important roles in both symptomatology and etiology. Recent genetic studies, including genome-wide association (GWA) studies, have identified several promising candidate genes and loci. One of the most consistent findings in GWA studies is the major histocompatibility (MHC) region in 6p [1–3]. This finding strongly implicates the immune system as being involved in the development of schizophrenia. Other findings include candidate genes functioning in cell adhesion [4–6], migration [4, 7] and apoptosis [8, 9]. Except the findings from rare copy number variations [10–13], the effects of individual genes overall are modest or weak. These results suggest that many genes with moderate or small effects may be involved in schizophrenia [14]. It is difficult to identify these genes without increased sample sizes and better analysis of phenotype. Integration of other functional studies or utilization of high throughput technologies may increase the power to detect other schizophrenia candidate genes.

Whole genome mRNA sequencing (RNA-Seq, or transcriptome sequencing) allows for the comprehensive survey of all the mRNAs in a sample. This platform is the fruit of recently developed high-throughput DNA sequencing technology [15, 16], and has produced exciting results in the study of various diseases [17–19]. In this experiment, we applied the RNA-Seq technology to study schizophrenia. Specifically, we aimed to understand the dysregulation in schizophrenia at higher levels of biological structure and to integrate gene expression data to facilitate the identification of promising candidate genes. To accomplish these goals, we sequenced the blood mRNAs isolated from 3 schizophrenic patients and 3 matched healthy controls. We verified the differentially expressed genes (DEGs) in 2 independent pooled samples. We further examined the association of the discovered DEGs using GWA data from the molecular genetics of schizophrenia (MGS) study. Two differentially expressed genes, S100A8 and TYROBP, reached nominal significance in gene-based association analysis.

Materials and methods

Subjects, sample preparation and sequencing

Schizophrenia patients were recruited from the pool of diagnosed schizophrenia patients of the inpatient unit of the VA Medical Center of Western New York (WNY) and from the Buffalo Psychiatric Center inpatient unit. All enrolled patients met the DSM-IV criteria for schizophrenia based on the examination of psychiatric case records and clinical interviews by at least two experienced psychiatrists. The healthy individuals were recruited though VA employees and advertisements in the local media. Individuals with a family history of schizophrenia or other major psychiatric disorders were excluded. A total of 26 subjects were included in this study. All subjects gave informed consent to participate in the study. The protocol and consent form were approved by the institutional review boards at the VA WNY Health Care System and New York State Office Mental Health.

Blood samples were collected with a 10 ml Vacutainer, Acid Citrate Dextrose (ACD) tube. Lymphocytes were prepared using isolymph (a diagnostic reagent used to separate lymphocytes from blood). Each 100 ml of isolymph contains 5.7 grams of Ficoll 400 and 9.0 grams of Diatrizoate Sodium) in a density gradient [20]. Ten ml of whole blood was diluted with equal volume RPMI and mixed by pipetting. Three milliliter of isolymph was transferred to a 15 ml Falcon conical bottom tube and 4 ml of diluted blood was carefully layered on top of the isolymph. The samples were centrifuged at 1640 rpm for 30 minutes. Lymphocytes were c ollected at the interface between the two layers by a sterile 1 ml pipette. The lymphocytes were mixed with 3 volumes of RPMI and centrifuged at 1400 rpm for 10 minutes. The supernatant was then removed. The lymphocytes were re-suspended in Cold Freeze Medium (60% RPMI 1640, 10% DMSO, 30% heat inactivated FCS) at a lymphocytes density of 6-12 × 10⁶ cells/ml. The lymphocytes were distributed to NUNC plastic cryopreservation vials for long term storage. The vials were first placed in the cryo freezing container and stored at -80°C for at least 4 hours and then transferred to liquid nitrogen for final storage.

RNA was prepared from the frozen lymphocytes. Total RNA was extracted using the Mico-to-Midi Total RNA Purification System (Invitrogen, Carlsbad, CA) according to the manufacturer's instructions. The total RNA concentration and purity were determined spectrophotometrically at 260 nm and 280 nm in the Functional Genomics Shared Resource (FGSR) in Vanderbilt University.

MRNA capture, cDNA conversion, sizing, and library construction were performed using kits from the Illumina Company and by following the manufacture's recommended procedures. For RNA-Seq application, individual libraries were constructed for 3 schizophrenia patients, 3 controls, and 2 pools (10 subjects/pool) of schizophrenia patients and controls. Each library was loaded into a single lane of the Illumina Genome Analyzer II flow cell. For 3 cases and 3 controls, we performed paired-end sequencing while for pooled samples, we performed single-end sequencing. Image analysis and base-calling were performed by the Genome Analyzer Pipeline version 2.0 with default parameters [21]. Library construction and RNA sequencing was performed in the Genome Technology Core (GTC) in Vanderbilt University.

Data process and analysis

After obtaining the short reads, we performed a series of quality checks, including quality score evaluation using program HTSeq [22] and marking duplicate reads by using software SAMTools [23]. All reads were independently aligned to a single reference file consisting of all human transcripts and the human genome in the UCSC genome assembly hg18 (NCBI build 36.1) by using TopHat (Version 1.0.10) [24]. The aligned sequences were evaluated with SAMTools for capture efficiency in order to ensure no artificial fragment representation (as assessed by fragment position distribution). We ran TopHat in the 'paired-end mode' with the minimum distance between paired-end reads of 120 bp, a maximum distance of 500,000 bp, and default settings of other parameters (e.g., no more than two mismatches between read and reference were allowed in the first 28 bp (5' end) of the read).

To obtain an accurate measure of transcript abundance, we only used the reads that were uniquely mapped to the human genome. Since the sequence reads were paired-end, we quantified expression levels of all transcripts in each subject according to the fragments per kilobase of exon per million fragments mapped (FPKM), which was calculated by the software Cufflinks [25]. FPKM is a similar measurement to RPKM, which measures gene expression in Reads Per exon Kilobase per Million mapped reads (RPKM). RPKM has been used to normalize measurement of exon read density and allows transcript levels to be compared both within and between samples [25, 26].

Considering that RNA-Seq mainly estimates exon expression and that most genes have multiple transcripts, it is necessary to determine how to estimate gene expression level based on transcript expression data. In our study, we employed a simple strategy: we first identified the differentially expressed transcripts (DETs) and then considered the unique genes of these transcripts as differentially expressed genes (DEGs) for further functional analysis.

To improve the reliability and comparability of differential expression analysis, we only examined the expression difference of those transcripts with FPKM value > 5 in all individually sequenced patients and controls [25]. Using these transcripts, we performed Fisher's exact test to identify transcripts with significantly differential expression between patients and controls [27–29]. For each transcript, we constructed a 2 × 2 contingency table, which included four FPKM values: n, N-n, r, R-r where n is the sum of FPKM values of a given transcript in 3 cases, N is the sum of FPKM values of all transcripts in cases, r is the sum of FPKM values of the given transcript in 3 controls, and R is the sum of the FPKM values of all transcripts in controls. To determine the expression change direction, we used "greater" or "less" parameters in the one-tailed Fisher's exact test to find the up-regulated transcripts or down-regulated transcripts respectively. Next, we controlled the type 1 errors by Bonferroni correction for the number of tests performed. A transcript was considered differentially expressed if the Bonferroni adjusted P-value was less than 0.05.

For data generated from pooled samples, we performed the same data processing and analysis as the individually sequenced samples except for using single-end mode to perform mapping of the reference sequence.

Functional analysis

To assess the function of the DEGs that we identified, we conducted pathway enrichment tests for the DEGs using the online tool WebGestalt (version 2) [30]. We used all pathways in the Kyoto Encyclopedia of Genes and Genomes (KEGG) database. We selected those pathways having adjusted P-values of less than 0.01 calculated by the hypergeometric test followed by the Benjamini-Hochberg method [31], which was implemented in WebGestalt. To make the analysis biologically meaningful, we considered only those KEGG pathways containing 5 or more DEG genes.

To further systematically determine canonical signaling pathways and molecular networks that the DEGs might involve, we performed the pathway/network enrichment analysis using the Ingenuity Pathway Analysis (IPA) tool from the Ingenuity Systems [32]. For canonical signaling pathway analysis, given a list of genes, a right-tailed Fisher's exact test was performed for the enrichment of these genes in its hand-curated canonical pathway database. Here, the P-value calculated for a pathway measures the probability of being randomly selected from all of the curated pathways. To control the error rate in the analysis, IPA also provided a corrected P-values to identify the most significant results in IPA's canonical pathways based on the Benjamini-Hochberg method [31]. This tool allowed us to identify the signaling pathways in which the DEGs were enriched. In our study, we used a cut-off of the corrected P-value less than 0.05 (or score > 1.30, here score = -log P) to define the significant pathways. For network enrichment, the DEGs were overlaid onto a global molecular network (GMN) developed based on the Ingenuity Pathways Knowledge Base, in which functional relationships such as activation, chemical-protein interaction, expression, inhibition and regulation of binding were manually curated. Subnetworks of genes were then extracted from the GMN based on their connectivity using the algorithm developed by IPA [33]. For each subnetwork, a likelihood score, which measures the probability of the DEGs being found in the same subnetwork by chance, was transformed from the P- values calculated by Fisher's exact test. Additionally, the IPA assigned the top 3 biological functions for each network it identified.

Gene-based genome-wide association analysis

The RNA-Seq application produced a list of differentially expressed genes in schizophrenia. We examined whether genetic variants in these DEGs harbored association signals. We conducted a gene-based GWA analysis using the MGS dataset for schizophrenia. We obtained this dataset from dbGaP under the protocol of "Genetic study of schizophrenia, nicotine dependence and other comorbid psychiatric disorders" by X.C. For each gene, its association P-value with schizophrenia was estimated using the VEGAS (Versatile Gene-based Association Study) software package [34].

Results

An overview of RNA-Seq data

In this study, we conducted genome-wide RNA sequencing for 6 individual samples and 2 pooled samples. The data from individually sequenced subjects was used for the initial identification of DEGs, while the pooled data was used to validate these DEGs.

For the 6 individually sequenced samples, after filtering by quality score, we generated an average of 8.7 million pairs of 43-bp paired-end reads per sample. The quality scores of the reads were satisfactory (Figure S1), of which 90.3% of the called bases had a Phred score ≥ 30. Table 1 shows the mapping statistics of the fragments. For each subject, an average of approximately 85.1% of the reads could be mapped to the human reference genome. Among the mapped sequences, ~48.6% of the read pairs were uniquely mapped to the human genome as properly aligned fragments. This is similar to the output of other RNA-Seq sequencing studies [35, 36].

Table 1 Statistics of the number of fragments sequenced, aligned and mapped using TopHat

Full size table

These reads were used to estimate transcript expression of all 6 samples. Table 2 shows the transcripts detected by RNA-Seq in subjects and mapped genes with FPKM values and coverage. Of the 33,599 transcripts and 32,797 genes annotated in the UCSC hg18, we detected 18,226 (54.2%) transcripts (FPKM > 0), which mapped to 14,929 (45.5%) unique genes. Among these transcripts, on average, 7223 (41.7%) had their FPKM values higher than 5.

Table 2 Statistics of the number of transcripts and genes detected

Full size table

For the pooled samples, we generated a total of 26.2 million reads for the cases, of which 92.05% were mapped to the human reference genome, and 28.4 million reads for the controls, of which 91.29% were mapped. The resulting expression on transcription level and gene level were summarized in Table 2.

Identification of differentially expressed genes

To identify the DEGs between the cases and controls, we used only those transcripts with FPKM values > 5.0 in all the 6 subjects. With this criterion, 4715 transcripts were included for differential expression analysis. Using Fisher's exact test, a total of 206 transcripts reached significance after Bonferroni correction (Table S1). Among these transcripts, 123 (mapped to 118 unique genes) were significantly down-regulated and 83 (mapped to 80 unique genes) were significantly up-regulated. In addition to transcripts expressed in both cases and controls, there were transcripts detected only in the cases or controls. Based on the FPKM distribution (Figure S2), transcripts exclusively expressed in either cases or controls with FPKM > 2.0 were included in the pathway and functional analyses. There were 12 transcripts exclusively expressed in cases (mapped to 12 unique genes) and 8 transcripts exclusively expressed in controls (mapped to 8 unique genes) (Table S2). Thus, we obtained a total of 218 genes differentially expressed among the 6 sequenced subjects.

Validation and functional enrichment analysis of the DEGs

To validate the DEGs discovered from the individually sequenced dataset, we conducted similar differential expression analyses for the pooled dataset (see Materials and methods). There were 155 transcripts reaching nominal significance, of which 84 were up-regulated (mapped to 78 genes) and 72 were down-regulated (mapped to 68 unique genes). Of the 218 DEGs identified from the individually sequenced dataset, 9 were up-regulated (GNAS, GNLY, HBA1, HBB, NCRNA00188, NEAT1, NFKB2, S100A8, and SNHG5) and 10 were down-regulated genes (CD74, CXCR4, LGALS2, LYZ, PF4, PIK3IP1, RBM38, RPL30, SCO2, and TYROBP) that were found differentially expressed between cases and controls in the pooled dataset (with the same direction of gene expression change).

We conducted KEGG pathway profiling of these 218 DEGs. The results are shown in Table 3. Of the pathways enriched in these 218 genes, the most noticeable ones are involved in the immune and inflammation systems (antigen processing, cell adhesion molecules, hematopoietic cell lineage, systemic lupus erythematosus, chemokine signaling pathway, intestinal immune network for IgA production, toll-like receptor signaling pathway, T cell receptor signaling pathway, B cell receptor signaling pathway, Cytokine-cytokine receptor interaction, etc). Interestingly, the cell adhesion molecules pathway (CAMs, KEGG pathway ID hsa04514, adjusted P value = 5.78 × 10^-9, Table 3) was the only pathway that was found to be significantly associated with both schizophrenia and bipolar disorder in a recent pathway analysis of schizophrenia and bipolar disorder GWAS datasets [37]. It was also highlighted in our recent pathway analysis using a generalized additive model for correction of gene length biases and other two methods (ALIGATOR and hypergeometric test) (unpublished data). Upon our further examination, we found 10 DEG genes in the CAMs pathway. Among the 10 genes, 8 were down-regulated (CD4, HLA-DPA1, HLA-DRA, HLA-DRB1, ITGB2, PECAM1, SELL and VCAN) and two were up-regulated (CD8A and ITGB7). Among the 8 down-regulated genes, three (HLA-DPA1, HLA-DRA and HLA-DRB1) were in the MHC region (chr6:20,000,000-40,000,000) (Table S3).

Table 3 KEGG pathways significantly enriched in the 218 differentially expressed genes

Full size table

For the 19 genes differentially expressed in both datasets, four are involved in immune systems (CXCR4, NFKB2, PF4, and TYROBP). In our KEGG pathway enrichment analysis, we found several pathways overrepresented in these genes including "Chemokine signaling pathway" and "Cytokine-cytokine receptor interaction," both of which were found significantly overrepresented in the 218 genes (Table 3). We further examined the pathways that were significantly overrepresented in these genes using IPA tools. The most significant pathways were "MIF-mediated glucocorticoid regulation," "MIF regulation of innate immunity," "TREM1 signaling," and "Induction of apoptosis by HIV1". MIF (macrophage migration inhibitory factor) is a unique counter-regulator of immunosuppressive and anti-inflammatory activities of glucocorticoids. Consistent with the results of 218 genes, all of these pathways are related to immune and inflammation systems. Furthermore, we conducted network analysis using IPA. Figure 1 shows the top network overrepresented in these genes, in which the top three functions are "Molecular transport," "Cellular movement," and "Hematological system development and function." Note that several genes in Figure 1 are potentially interesting like CD74, S100A8, Akt, IL12, TYROBP, HBB and HBA1. Among them, CD74 encodes a Type II transmembrane protein, which is a binding protein for MIF and an essential protein for MIF-induced activation of extracellular signal-regulated kinase-1/2MAP kinase cascade, cell proliferation and apoptosis [38].

Gene-based association analyses of DEGs

One of the objectives of our RNA-Seq experiments was to test whether differentially expressed genes were enriched for association signals in GWA studies. Towards this goal, we conducted gene-based GWA analysis using the MGS dataset and the VEGAS method [34].This analysis produced a P value for each gene by considering gene's linkage disequilibrium information from the HapMap populations. We matched the 218 DEGs with those genes from the association analysis. Of the 218 genes, 21 had their P values less than 0.05 (Table 4). This was 2-fold enrichment than the expected (P = 0.0025). Five genes (SELL[39], HLA-DRB1[40], CEBPD[41], HSPA5[42], and NRGN[3]) from the matched list had been previously studied for schizophrenia with positive association signals, the rest of the genes were involved in immune responses or other neuronal diseases. Two genes, S100A8 and TYROBP, were differentially expressed in both the individual and pooled sequencing datasets.

Table 4 Association of differentially expressed genes with schizophrenia

Full size table

Discussion

Recent studies have shown that most genetic factors predisposing to schizophrenia have only a modest effect. GWA studies alone seem insufficient to identify the majority of these genetic factors. Expression level is an index of function of genes and may be useful for identifying risk genes for schizophrenia at the transciptomic level. In this study, we took advantage of recently available next generation sequencing technologies (i.e., RNA-Seq) to sequence poly-A tailed mRNAs from blood samples of 6 individuals and 2 pools of schizophrenia patients and controls. In the 6 individually sequenced samples, we found 218 genes showing differentially expression between cases and controls. Among these genes, 19 were nominally significant at the expression level in the 2 pooled samples. In our IPA analysis, we found that MIF regulation of innate immunity and TREM1 signaling were highly enriched in these 19 genes. Furthermore, of the 218 DEGs, 21 reached nominal significance in gene-based association analysis of the MGSGWAS dataset. Nineteen of these 21 genes are directly involved in immune response/diseases, or have been studied for candidates for schizophrenia and other neuronal diseases. Two genes, S100A8 and TYROBP, showed the same direction of expression changes in the individual and pooled sequencing datasets, and they also reached nominal significance in gene-based association analysis.

S100A8, also called MRP-8, encodes a calcium binding protein involved in inflammatory responses. It has been implicated in rheumatoid arthritis [43], systemic lupus erythematosus [44] and cancers [45, 46]. Intriguingly, rheumatoid arthritis may be correlated with schizophrenia [47, 48]. TYROBP, also known as DAP12, encodes an immunoreceptor adaptor protein that plays a key role in osteoclast differentiation and maturation [49, 50]. Mutations in this gene lead to the Nasu-Hakola disease [51–54], a rare autosomal recessive disorder characterized by bone cyst and presenile dementia. In addition to their functions in the immune system, both genes are expressed in human brain. S100A8 shows elevated expression in cerebral ischemia [55] and posttraumatic brain injuries [56]. In a mouse model study, S100A8 expression increases significantly after chronic treatment with the antipsychotic drug olanzapine, which is used primarily to treat schizophrenia and bipolar disorder patients. TYROBP is implicated in the developmental neuronal death in hippocampus [57], impaired glutamatergic synaptic functions [58] and brain myelination [59]. All of these factors have been suspected to be involved in schizophrenia. TYROBP knockout mouse studies reveal deficits in cognitive functions and prepulse inhibition [49], symptoms that have been manifested in many schizophrenia patients. However, neither gene has been studied directly for schizophrenia. They may be novel candidates for the disease.

Glatt et al [60] applied microarray techniques to compare gene expression of peripheral blood cells (PBCs) and the dorsolateral prefrontal cortex (DLPFC) of the brain to identify risk factors for schizophrenia. They detected 123 differentially expressed genes in the blood samples. Among our 218 DEGs, 13 genes had the same direction of expression changes as reported by Glatt et al. Specifically, eight genes were down-regulated (CD74, FCN1, FGR, HLA-DPA1, HLA-DRB1, IL10RA, PSAP, and ZFP36L2) and five were up-regulated (GOS2, HBA1, HBA2, HBB, and IL8). The overlap of 13 genes with same direction of expression change is unlikely by chance considering they were selected from a genome-wide gene pool (P-value = 5.36 × 10^-6). Interestingly, among the eight down-regulated genes, CD74 was consistently found down- regulated in three gene expression data sets (our individual sample, our pooled sample, and PBC sample in the Glatt et al study). Gene CD74 encodes a protein in MHC and is located in a region implicated by genome-wide linkage meta-analyses [61, 62]. Additionally, the MHC locus on chromosome 6p was the most consistent finding from GWA studies [1–3]. Another gene, HLA-DRB1, also located in the MHC locus, was found to be differentially expressed in three data sets (the individual RNA-Seq dataset in this study, the PBC and DLPFC datasets in the Glatt et al study). HLA-DRB1 has been reported for positive association with schizophrenia [63, 64]. We also found that SNPs influencing the expression of HLA-DRB1 (expression quantitative trait loci - eQTLs) were significantly associated with schizophrenia in the CATIE and MGS datasets (unpublished data). This result provides empirical evidence that a combination of GWA data and eQTL analysis may be effective to identify risk genes.

Conclusion

This exploratory study aims at evaluating how RNA-Seq can be used to facilitate the identification of risk genes for complex diseases such as schizophrenia. Limitations include 1) the small number of subjects sequenced in this study and 2) only one pair of pooled samples was available to confirm the DEGs discovered in the individually sequenced dataset. Note that many RNA-Seq studies published in the past three years were based on a small number of samples (n ≤ 3) [65]. Due to these limitations, many genes potentially involved in schizophrenia could not be detected in the individually sequenced dataset and none of the genes in the pooled sample dataset reached significance after Bonferroni or false discovery rate correction. For these reasons, we selected to use those genes that reached nominal significance (one tailed test P < 0.1 for genes showing the same direction of expression changes) to verify the DEGs from the individually sequenced dataset. This may lead to the inclusion of some false positives in the 19 genes. At this time, we are unable to distinguish the true positives from the false ones. Since we observed 19 overlapping genes for the 218 DEGs, exceeding the expected number by chance, collectively, a majority of these 19 genes are unlikely to be false positives. The pathways and processes identified based on these 19 genes are likely reliable, and should provide important insights on the genes whose expression might be involved in the development of schizophrenia. Based on the same rationale, the list of genes identified by gene-based analysis may have false positives, but most of the genes could be considered promising candidates for schizophrenia. These promising candidates warrant further validation.

In summary, by combining high throughput RNA sequencing and GWA data, we have identified a list of candidate genes for schizophrenia despite our small sample size. These genes are enriched in the pathways and processes of the immune system. Our study demonstrates that integration of GWAS and gene expression can provide valuable information to prioritize candidates for future studies.

References

Purcell SM, Wray NR, Stone JL, Visscher PM, O'Donovan MC, Sullivan PF, et al: Common polygenic variation contributes to risk of schizophrenia and bipolar disorder. Nature. 2009, 460: 748-752.
CAS PubMed Google Scholar
Shi J, Levinson DF, Duan J, Sanders AR, Zheng Y, Pe'er I, et al: Common variants on chromosome 6p22.1 are associated with schizophrenia. Nature. 2009, 460: 753-757.
PubMed Central CAS PubMed Google Scholar
Stefansson H, Ophoff RA, Steinberg S, Andreassen OA, Cichon S, Rujescu D, et al: Common variants conferring risk of schizophrenia. Nature. 2009, 460: 744-747.
PubMed Central CAS PubMed Google Scholar
Kahler AK, Djurovic S, Kulle B, Jonsson EG, Agartz I, Hall H, et al: Association analysis of schizophrenia on 18 genes involved in neuronal migration: MDGA1 as a new susceptibility gene. Am J Med Genet B Neuropsychiatr Genet. 2008, 147B: 1089-1100. 10.1002/ajmg.b.30726.
Article PubMed Google Scholar
Sullivan PF, Keefe RS, Lange LA, Lange EM, Stroup TS, Lieberman J, et al: NCAM1 and neurocognition in schizophrenia. Biol Psychiatry. 2007, 61: 902-910. 10.1016/j.biopsych.2006.07.036.
Article CAS PubMed Google Scholar
Chen X, Wang X, Hossain S, O'Neill FA, Walsh D, Pless L, et al: Haplotypes spanning SPEC2, PDZ-G EF2 and ACSL6 genes are associated with schizophrenia. Hum Mol Genet. 2006, 15: 3329-3342. 10.1093/hmg/ddl409.
Article CAS PubMed Google Scholar
Drerup CM, Wiora HM, Topczewski J, Morris JA: Disc1 regulates foxd3 and sox10 expression, affecting neural crest migration and differentiation. Development. 2009, 136: 2623-2632. 10.1242/dev.030577.
Article PubMed Central CAS PubMed Google Scholar
Chen X, Sun C, Chen Q, O'Neill FA, Walsh D, Fanous AH, et al: Apoptotic engulfment pathway and schizophrenia. PLoS ONE. 2009, 4: e6875-10.1371/journal.pone.0006875.
Article PubMed Central PubMed Google Scholar
Jia P, Wang L, Meltzer HY, Zhao Z: Common variants conferring risk of schizophrenia: a pathway analysis of GWAS data. Schizophr Res. 2010, 122: 38-42. 10.1016/j.schres.2010.07.001.
Article PubMed Central PubMed Google Scholar
Mulle JG, Dodd AF, McGrath JA, Wolyniec PS, Mitchell AA, Shetty AC, et al: Microdeletions of 3q29 confer high risk for schizophrenia. Am J Hum Genet. 2010, 87: 229-236. 10.1016/j.ajhg.2010.07.013.
Article PubMed Central CAS PubMed Google Scholar
Need AC, Attix DK, McEvoy JM, Cirulli ET, Linney KL, Hunt P, et al: A genome-wide study of common SNPs and CNVs in cognitive performance in the CANTAB. Hum Mol Genet. 2009, 18: 4650-4661. 10.1093/hmg/ddp413.
Article PubMed Central CAS PubMed Google Scholar
Tam GW, Redon R, Carter NP, Grant SG: The role of DNA copy number variation in schizophrenia. Biol Psychiatry. 2009, 66: 1005-1012. 10.1016/j.biopsych.2009.07.027.
Article CAS PubMed Google Scholar
Grozeva D, Kirov G, Ivanov D, Jones IR, Jones L, Green EK, et al: Rare copy number variants: a point of rarity in genetic risk for bipolar disorder and schizophrenia. Arch Gen Psychiatry. 2010, 67: 318-327. 10.1001/archgenpsychiatry.2010.25.
Article PubMed Central PubMed Google Scholar
Sun J, Jia P, Fanous AH, van den OE, Chen X, Riley BP, et al: Schizophrenia gene networks and pathways and their applications for novel candidate gene selection. PLoS ONE. 2010, 5: e11351-10.1371/journal.pone.0011351.
Article PubMed Central PubMed Google Scholar
Bentley DR: Whole-genome re-sequencing. Curr Opin Genet Dev. 2006, 16: 545-552. 10.1016/j.gde.2006.10.009.
Article CAS PubMed Google Scholar
Heap GA, Yang JH, Downes K, Healy BC, Hunt KA, Bockett N, et al: Genome-wide analysis of allelic expression imbalance in human primary cells by high-throughput transcriptome resequencing. Hum Mol Genet. 2010, 19: 122-134. 10.1093/hmg/ddp473.
Article PubMed Central CAS PubMed Google Scholar
Ceol CJ, Houvras Y, Jane-Valbuena J, Bilodeau S, Orlando DA, Battisti V, et al: The histone methyltransferase SETDB1 is recurrently amplified in melanoma and accelerates its onset. Nature. 2011, 471: 513-517. 10.1038/nature09806.
Article PubMed Central CAS PubMed Google Scholar
Steidl C, Shah SP, Woolcock BW, Rui L, Kawahara M, Farinha P, et al: MHC class II transactivator CIITA is a recurrent gene fusion partner in lymphoid cancers. Nature. 2011, 471: 377-381. 10.1038/nature09754.
Article PubMed Central CAS PubMed Google Scholar
Briggs TA, Rice GI, Daly S, Urquhart J, Gornall H, Bader-Meunier B, et al: Tartrate-resistant acid phosphatase deficiency causes a bone dysplasia with autoimmunity and a type I interferon expression signature. Nat Genet. 2011, 43: 127-131. 10.1038/ng.748.
Article CAS PubMed Google Scholar
Nixon-Fulton JL, Bergstresser PR, Tigelaar RE: Thy-1+ epidermal cells proliferate in response to concanavalin A and interleukin 2. J Immunol. 1986, 136: 2776-2786.
CAS PubMed Google Scholar
Bentley DR, Balasubramanian S, Swerdlow HP, Smith GP, Milton J, Brown CG, et al: Accurate whole human genome sequencing using reversible terminator chemistry. Nature. 2008, 456: 53-59. 10.1038/nature07517.
Article PubMed Central CAS PubMed Google Scholar
Simon Anders. HTseq Program. 2012, [http://www-huber.embl.de/users/anders/HTSeq/]
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al: The sequence alignment/Map format and SAMtools. Bioinformatics. 2009, 25: 2078-2079. 10.1093/bioinformatics/btp352.
Article PubMed Central PubMed Google Scholar
Trapnell C, Pachter L, Salzberg SL: TopHat: discovering splice junctions with RNA-Seq. Bioinformatics. 2009, 25: 1105-1111. 10.1093/bioinformatics/btp120.
Article PubMed Central CAS PubMed Google Scholar
Trapnell C, Williams BA, Pertea G, Mortazavi A, Kwan G, van Baren MJ, et al: Transcript assembly and quantification by RNA-Seq reveals unannotated transcripts and isoform switching during cell differentiation. Nat Biotechnol. 2010, 28: 511-515. 10.1038/nbt.1621.
Article PubMed Central CAS PubMed Google Scholar
Mortazavi A, Williams BA, McCue K, Schaeffer L, Wold B: Mapping and quantifying mammalian transcriptomes by RNA-Seq. Nat Methods. 2008, 5: 621-628. 10.1038/nmeth.1226.
Article CAS PubMed Google Scholar
Bullard JH, Purdom E, Hansen KD, Dudoit S: Evaluation of statistical methods for normalization and differential expression in mRNA-Seq experiments. BMC Bioinformatics. 2010, 11: 94-10.1186/1471-2105-11-94.
Article PubMed Central PubMed Google Scholar
Auer PL, Doerge RW: Statistical design and analysis of RNA sequencing data. Genetics. 2010, 185: 405-416. 10.1534/genetics.110.114983.
Article PubMed Central CAS PubMed Google Scholar
Brooks AN, Yang L, Duff MO, Hansen KD, Park JW, Dudoit S, et al: Conservation of an RNA regulatory map between Drosophila and mammals. Genome Res. 2011, 21: 193-202. 10.1101/gr.108662.110.
Article PubMed Central CAS PubMed Google Scholar
Zhang B, Kirov S, Snoddy J: WebGestalt: an integrated system for exploring gene sets in various biological contexts. Nucleic Acids Res. 2005, 33: W741-W748. 10.1093/nar/gki475.
Article PubMed Central CAS PubMed Google Scholar
Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B. Edited by: Benjamini Y, Hochberg Y. 1995, 57: 289-300.
Ingenuity systems. Ingenuity pathway analysis. 2012, [http://www.ingenuity.com/]
Calvano SE, Xiao W, Richards DR, Felciano RM, Baker HV, Cho RJ, et al: A network-based analysis of systemic inflammation in humans. Nature. 2005, 437: 1032-1037. 10.1038/nature03985.
Article CAS PubMed Google Scholar
Liu JZ, McRae AF, Nyholt DR, Medland SE, Wray NR, Brown KM, et al: A versatile gene-based test for genome-wide association studies. Am J Hum Genet. 2010, 87: 139-145. 10.1016/j.ajhg.2010.06.009.
Article PubMed Central CAS PubMed Google Scholar
Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y: RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008, 18: 1509-1517. 10.1101/gr.079558.108.
Article PubMed Central CAS PubMed Google Scholar
Wang ET, Sandberg R, Luo S, Khrebtukova I, Zhang L, Mayr C, et al: Alternative isoform regulation in human tissue transcriptomes. Nature. 2008, 456: 470-476. 10.1038/nature07509.
Article PubMed Central CAS PubMed Google Scholar
O'Dushlaine C, Kenny E, Heron E, Donohoe G, Gill M, Morris D, et al: Molecular pathways involved in neuronal cell adhesion and membrane scaffolding contribute to schizophrenia and bipolar disorder susceptibility. Mol Psychiatry. 2010
Google Scholar
Leng L, Metz CN, Fang Y, Xu J, Donnelly S, Baugh J, et al: MIF signal transduction initiated by binding to CD74. J Exp Med. 2003, 197: 1467-1476. 10.1084/jem.20030286.
Article PubMed Central CAS PubMed Google Scholar
Iwata Y, Suzuki K, Nakamura K, Matsuzaki H, Sekine Y, Tsuchiya KJ, et al: Increased levels of serum soluble L-selectin in unmedicated patients with schizophrenia. Schizophr Res. 2007, 89: 154-160. 10.1016/j.schres.2006.08.026.
Article PubMed Google Scholar
Palmer CG, Hsieh HJ, Reed EF, Lonnqvist J, Peltonen L, Woodward JA, et al: HLA-B maternal-fetal genotype matching increases risk of schizophrenia. Am J Hum Genet. 2006, 79: 710-715. 10.1086/507829.
Article PubMed Central CAS PubMed Google Scholar
Lee CH, Liu CM, Wen CC, Chang SM, Hwu HG: Genetic copy number variants in sib pairs both affected with schizophrenia. J Biomed Sci. 2010, 17: 2-10.1186/1423-0127-17-2.
Article PubMed Central PubMed Google Scholar
Sun J, Wan C, Jia P, Fanous AH, Kendler KS, Riley BP, et al: Application of systems biology approach identifies and validates GRB2 as a risk gene for schizophrenia in the Irish Case Control Study of Schizophrenia (ICCSS) sample. Schizophr Res. 2011, 125: 201-208. 10.1016/j.schres.2010.12.002.
Article PubMed Central PubMed Google Scholar
Baillet A, Trocme C, Berthier S, Arlotto M, Grange L, Chenau J, et al: Synovial fluid proteomic fingerprint: S100A8, S100A9 and S100A12 proteins discriminate rheumatoid arthritis from other inflammatory joint diseases. Rheumatology (Oxford). 2010, 49: 671-682. 10.1093/rheumatology/kep452.
Article CAS Google Scholar
Soyfoo MS, Roth J, Vogl T, Pochet R, Decaux G: Phagocyte-specific S100A8/A9 protein levels during disease exacerbations and infections in systemic lupus erythematosus. J Rheumatol. 2009, 36: 2190-2194. 10.3899/jrheum.081302.
Article CAS PubMed Google Scholar
Ehrchen JM, Sunderkotter C, Foell D, Vogl T, Roth J: The endogenous Toll-like receptor 4 agonist S100A8/S100A9 (calprotectin) as innate amplifier of infection, autoimmunity, and cancer. J Leukoc Biol. 2009, 86: 557-566. 10.1189/jlb.1008647.
Article CAS PubMed Google Scholar
Gebhardt C, Nemeth J, Angel P, Hess J: S100A8 and S100A9 in inflammation and cancer. Biochem Pharmacol. 2006, 72: 1622-1631. 10.1016/j.bcp.2006.05.017.
Article CAS PubMed Google Scholar
Gorwood P, Pouchot J, Vinceneux P, Puechal X, Flipo RM, De Bandt M, et al: Rheumatoid arthritis and schizophrenia: a negative association at a dimensional level. Schizophr Res. 2004, 66: 21-29. 10.1016/S0920-9964(03)00017-3.
Article CAS PubMed Google Scholar
Torrey EF, Yolken RH: The schizophrenia-rheumatoid arthritis connection: infectious, immune, or both?. Brain Behav Immun. 2001, 15: 401-410. 10.1006/brbi.2001.0649.
Article CAS PubMed Google Scholar
Kaifu T, Nakahara J, Inui M, Mishima K, Momiyama T, Kaji M, et al: Osteopetrosis and thalamic hypomyelinosis with synaptic degeneration in DAP12-deficient mice. J Clin Invest. 2003, 111: 323-332.
Article PubMed Central CAS PubMed Google Scholar
Nataf S, Anginot A, Vuaillat C, Malaval L, Fodil N, Chereul E, et al: Brain and bone damage in KARAP/DAP12 loss-of-function mice correlate with alterations in microglia and osteoclast lineages. Am J Pathol. 2005, 166: 275-286. 10.1016/S0002-9440(10)62251-1.
Article PubMed Central CAS PubMed Google Scholar
Kaneko M, Sano K, Nakayama J, Amano N: Nasu-Hakola disease: The first case reported by Nasu and review. Neuropathology. 2010
Google Scholar
Satoh JI, Tabunoki H, Ishida T, Yagishita S, Jinnai K, Futamura N, et al: Immunohistochemical characterization of microglia in Nasu-Hakola disease brains. Neuropathology. 2011, 31: 363-375. 10.1111/j.1440-1789.2010.01174.x.
Article PubMed Google Scholar
Klunemann HH, Ridha BH, Magy L, Wherrett JR, Hemelsoet DM, Keen RW, et al: The genetic causes of basal ganglia calcification, dementia, and bone cysts: DAP12 and TREM2. Neurology. 2005, 64: 1502-1507. 10.1212/01.WNL.0000160304.00003.CA.
Article CAS PubMed Google Scholar
Paloneva J, Kestila M, Wu J, Salminen A, Bohling T, Ruotsalainen V, et al: Loss-of-function mutations in TYROBP (DAP12) result in a presenile dementia with bone cysts. Nat Genet. 2000, 25: 357-361. 10.1038/77153.
Article CAS PubMed Google Scholar
Ziegler G, Prinz V, Albrecht MW, Harhausen D, Khojasteh U, Nacken W, et al: Mrp-8 and -14 mediate CNS injury in focal cerebral ischemia. Biochim Biophys Acta. 2009, 1792: 1198-1204. 10.1016/j.bbadis.2009.10.003.
Article CAS PubMed Google Scholar
Engel S, Schluesener H, Mittelbronn M, Seid K, Adjodah D, Wehner HD, et al: Dynamics of microglial activation after human traumatic brain injury are revealed by delayed expression of macrophage-related proteins MRP8 and MRP14. Acta Neuropathol. 2000, 100: 313-322. 10.1007/s004019900172.
Article CAS PubMed Google Scholar
Wakselman S, Bechade C, Roumier A, Bernard D, Triller A, Bessis A: Developmental neuronal death in hippocampus requires the microglial CD11b integrin and DAP12 immunoreceptor. J Neurosci. 2008, 28: 8138-8143. 10.1523/JNEUROSCI.1006-08.2008.
Article CAS PubMed Google Scholar
Roumier A, Bechade C, Poncer JC, Smalla KH, Tomasello E, Vivier E, et al: Impaired synaptic function in the microglial KARAP/DAP12-deficient mouse. J Neurosci. 2004, 24: 11421-11428. 10.1523/JNEUROSCI.2251-04.2004.
Article CAS PubMed Google Scholar
Colonna M: DAP12 signaling: from immune cells to bone modeling and brain myelination. J Clin Invest. 2003, 111: 313-314.
Article PubMed Central CAS PubMed Google Scholar
Glatt SJ, Everall IP, Kremen WS, Corbeil J, Sasik R, Khanlou N, et al: Comparative gene expression analysis of blood and brain provides concurrent validation of SELENBP1 up-regulation in schizophrenia. Proc Natl Acad Sci USA. 2005, 102: 15533-15538. 10.1073/pnas.0507666102.
Article PubMed Central CAS PubMed Google Scholar
Lewis CM, Levinson DF, Wise LH, DeLisi LE, Straub RE, Hovatta I, et al: Genome scan meta-analysis of schizophrenia and bipolar disorder, part II: Schizophrenia. Am J Hum Genet. 2003, 73: 34-48. 10.1086/376549.
Article PubMed Central CAS PubMed Google Scholar
Ng MY, Levinson DF, Faraone SV, Suarez BK, DeLisi LE, Arinami T, et al: Meta-analysis of 32 genome-wide linkage studies of schizophrenia. Mol Psychiatry. 2009, 14: 774-785. 10.1038/mp.2008.135.
Article PubMed Central CAS PubMed Google Scholar
Schwab SG, Hallmayer J, Freimann J, Lerer B, Albus M, Borrmann-Hassenbach M, et al: Investigation of linkage and association/linkage disequilibrium of HLA A-, DQA1-, DQB1-, and DRB1-alleles in 69 sib-pair- and 89 trio-families with schizophrenia. Am J Med Genet. 2002, 114: 315-320. 10.1002/ajmg.10307.
Article PubMed Google Scholar
Wright P, Donaldson PT, Underhill JA, Choudhuri K, Doherty DG, Murray RM: Genetic association of the HLA DRB1 gene locus on chromosome 6p21.3 with schizophrenia. Am J Psychiatry. 1996, 153: 1530-1533.
Article CAS PubMed Google Scholar
Hansen KD, Wu Z, Irizarry RA, Leek JT: Sequencing technology does not eliminate biological variability. Nat Biotechnol. 2011, 29: 572-573. 10.1038/nbt.1910.
Article PubMed Central CAS PubMed Google Scholar

Download references

Acknowledgements

The RNA quality check, library construction, and RNA sequencing were performed in the Functional Genomics Shared Resource and Genome Technology Core (now both were merged to Genome Sciences Resource) in Vanderbilt University. We thank the patients and volunteers involved in this study. The study is supported in part by the National Institutes of Health grant R01LM011177, a research grant from the Stanley Foundation, an Independent Investigator Award to XC, the 2009 NARSAD Maltz Investigator Award to ZZ, and the 2010 NARSAD Young Investigator Award to JS. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript. The principal investigators for the MGS were Pablo Gejman and Douglas Levinson. MGS study was supported by funding from the National Institute of Mental Health and the National Alliance for Research on Schizophrenia and Depression. Genotyping of part of the sample was supported by GAIN and the Paul Michael Donovan Charitable Foundation. Genotyping was carried out by the Center for Genotyping and Analysis at the Broad Institute of Harvard and MIT with support from the National Center for Research Resources.

This article has been published as part of BMC Genomics Volume 13 Supplement 8, 2012: Proceedings of The International Conference on Intelligent Biology and Medicine (ICIBM): Genomics. The full contents of the supplement are available online at http://www.biomedcentral.com/bmcgenomics/supplements/13/S8.

Author information

Authors and Affiliations

Department of psychiatry, School of Medicine, University at Buffalo, SUNY, Buffalo, NY, 14260, USA
Junzhe Xu & Steven L Dubovsky
VA Western New York HealthCare System, Buffalo, NY, 14215, USA
Junzhe Xu, Anna Li & Matthew Helm
Buffalo Psychiatric Center, Buffalo, NY, 14213, USA
Junzhe Xu
Department of Biomedical Informatics, Vanderbilt University School of Medicine, Nashville, TN, 37232, USA
Jingchun Sun & Zhongming Zhao
Department of Psychiatry, Vanderbilt University School of Medicine, Nashville, TN, 37212, USA
Jingchun Sun & Zhongming Zhao
Virginia Institute for Psychiatric and Behavioral Genetics, Virginia Commonwealth University, Richmond, VA, 23298, USA
Jingchun Chen, Silviu-Alin Bacanu & Xiangning Chen
Department of Biostatistics, Vanderbilt University School of Medicine, Nashville, TN, 37232, USA
Lily Wang
Department of Cancer Biology, Vanderbilt University Medical Center, Nashville, TN, 37232, USA
Zhongming Zhao
Department of Human and Molecular Genetics, Virginia Commonwealth University, Richmond, VA, 23298, USA
Xiangning Chen

Authors

Junzhe Xu
View author publications
You can also search for this author in PubMed Google Scholar
Jingchun Sun
View author publications
You can also search for this author in PubMed Google Scholar
Jingchun Chen
View author publications
You can also search for this author in PubMed Google Scholar
Lily Wang
View author publications
You can also search for this author in PubMed Google Scholar
Anna Li
View author publications
You can also search for this author in PubMed Google Scholar
Matthew Helm
View author publications
You can also search for this author in PubMed Google Scholar
Steven L Dubovsky
View author publications
You can also search for this author in PubMed Google Scholar
Silviu-Alin Bacanu
View author publications
You can also search for this author in PubMed Google Scholar
Zhongming Zhao
View author publications
You can also search for this author in PubMed Google Scholar
Xiangning Chen
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Zhongming Zhao or Xiangning Chen.

Additional information

Competing interests

The authors declare that they have no competing interests.

Authors' contributions

JX, AL, MH SLD collected samples for the study. JS, JC, LW and SAB conducted data analysis. XC and ZZ conceived and designed the study and managed RNA-Seq experiments. JX, JS, ZZ and XC wrote the manuscript.

Junzhe Xu, Jingchun Sun contributed equally to this work.

Electronic supplementary material

12864_2012_4443_MOESM1_ESM.pdf

Additional file 1: This file includes the following figures and tables. Figure S1 - Median Phred score vs. base position (cycle). Figure S2 - Distribution of the average FPKM values in the controls and cases. FPKM represents for fragments per kilobase of exon per million fragments mapped. Table S1 - Differentially expressed genes between schizophrenia patients and controls. Table S2 - Genes exclusively expressed in cases or controls. Table S3 - CAMs genes located in the MHC regions (chr6:20,000,000-40,000,000). (PDF 238 KB)

Rights and permissions

Open Access This article is published under license to BioMed Central Ltd. This is an Open Access article is distributed under the terms of the Creative Commons Attribution License ( https://creativecommons.org/licenses/by/2.0 ), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Xu, J., Sun, J., Chen, J. et al. RNA-Seq analysis implicates dysregulation of the immune system in schizophrenia. BMC Genomics 13 (Suppl 8), S2 (2012). https://doi.org/10.1186/1471-2164-13-S8-S2

Download citation

Published: 17 December 2012
DOI: https://doi.org/10.1186/1471-2164-13-S8-S2

RNA-Seq analysis implicates dysregulation of the immune system in schizophrenia