Background

The genetic contribution to the etiology of schizophrenia is significant, but the full molecular basis of the genetic factors remains incompletely defined. Since at least 1992 [1], it has been known that deletion of chromosome 22q11.2 is associated with schizophrenia with a very substantial relative risk. One study of 78 adults with deletion 22q11.2 found that 22.6% had schizophrenia [2]. Recurrent seizures were found in 39.7%, which may be explained in part by the hypocalcemia associated with this phenotype. In 2008, two studies [3, 4] found that chromosomal deletions of 1q21.1, 15q13.3, and 22q11.2 were associated with markedly increased risk of schizophrenia; one report but not the other also suggested a role for deletion 15q11.2. These three most frequent deletions were found in 0.21%, 0.20%, and 0.26% respectively of the schizophrenic populations studied, and they are also associated with intellectual disability and autism [5, 6]. It is quite remarkable that each of these CNVs can be associated with a wide range of phenotypes including intellectual disability, autism, schizophrenia, and epilepsy [7]. Deletion 15q13.3 is also reported to be present in as many as 1% of individuals with idiopathic generalized epilepsy [8]. For the 15q13.3 deletion, the CHRNA7 gene encoding the α7 subunit of the neuronal nicotinic receptor is a strong candidate gene mediating the phenotypic effects, as further supported by smaller deletions causing similar phenotypes [9]. The other two deletions (1q21.1 and 22q11.2) encompass many genes with no one gene yet strongly implicated as mediating the phenotypic effects. Other chromosomal mutations have also been found in schizophrenia such as deletions of 3q29 and NRXN1 and duplications of 15q11-q13 (maternal), 16p11.2, and 16p13.3 [1014]. Deletions of chromosome 16p11.2 are particularly common in autism [15, 16], and there is evidence that reciprocal duplications are associated with schizophrenia [11]. There is also evidence that loss-of-function mutations in individual genes can be associated with autism and/or schizophrenia. These genes include or may include DISC1, NRXN1, PDE4B, NPAS3, CNTNAP2, and APBA2; see Sebat et al. [7] for bibliography.

We reasoned that the frequency of detectable CNVs associated with schizophrenia might be higher in subjects with both schizophrenia and epilepsy compared to either phenotype alone. There is a report that patients with schizophrenia have an 11-fold increase in the prevalence of comorbid epilepsy [17]. We analyzed DNA from subjects with both schizophrenia and idiopathic epilepsy or with bipolar disorder and idiopathic epilepsy and from controls using array comparative genomic hybridization (CGH).

Methods

Subjects

DNA samples from 235 subjects with both schizophrenia and idiopathic epilepsy, 80 subjects with both bipolar disorder and idiopathic epilepsy, and 191 controls derived from cultured lymphoblasts, were obtained from the National Institute of Mental Health Human Genetics Initiative (NIMH-HGI). The NIMH website for this collection https://www.nimhgenetics.org/nimh_human_genetics_initiative/ reports that the establishment of a diagnosis of schizophrenia or bipolar disorder was based on DSM-III-R and DSM-IV criteria following a systematic and comprehensive examination of multiple sources of available information obtained from relatives, medical records, and direct assessment using the Diagnostic Interview for Genetic Studies for each patient. A self-reported questionnaire completed by the subjects was used to identify individuals who also had epilepsy in addition to a diagnosis of schizophrenia or bipolar disorder. Subjects reporting seizures secondary to causes such as trauma, medication, and/or polydypsia were excluded. Some patients with self-reported seizures but no further detail about etiology of their seizures were included. Ethnicity for the schizophrenia plus epilepsy samples was 135 Caucasian, 62 black, 22 Asian, 8 Hispanic, and 6 other. Ethnicity for the bipolar plus epilepsy was 77 Caucasian, 1 other, and 2 unknown. Ethnicity for the controls was 100% Caucasian. We screened Caucasian control individuals for psychiatric symptoms based on their self reported questionnaire. We chose to exclude controls with even modest or questionable behavioral findings in an attempt to exclude individuals who might have mild manifestations of a deleterious CNV. The rationale was to obtain a control group with under-representation of any behavioral abnormalities. Control individuals were excluded if they saw a doctor and took medication or used drugs and alcohol to deal with their depression or anxiety, if they reported any behaviors suggestive of obsessive compulsive disorder, or if they had a history of substance and/or alcohol abuse/dependence. We excluded 1284 samples out of 1920 leaving 636 control samples for study. Then 191 of the 636 were selected at random for study. This high exclusion rate was surprising, but it was already known that lifetime prevalences for depressive, anxiety, and substance use diagnoses were higher than in some other sample collections [18]. Additional information regarding screening criteria can be found in Additional file 1. Another study using these controls excluded far fewer samples based on rescreening [19].

Array CGH

Blood collected from schizophrenia, bipolar disorder, and control samples was used to establish lymphoblastoid cell lines and DNA was extracted by Rutgers University Cell and DNA Repository, Piscataway, NJ. Blood DNA was not available for most samples, so it was not possible to test whether results were only in lymphoblast DNA and absent in blood DNA. All cases and controls were hybridized with the same Caucasian male reference DNA isolated from fresh blood. Although it is well known that using DNA from lymphoblast cell-lines can give rise to artifactual copy number changes [20, 21], these usually do not produce CNVs identical to those known to cause neurobehavioral phenotypes and more often involve aneuploidy, very large genomic changes, or deletions at immunoglobulin genes.

Array CGH was performed on cases and controls with once-used, stripped clinical v8.0 arrays from the Medical Genetics Laboratory (MGL) at Baylor College of Medicine (BCM). Specifically, the v8.0 array is a 180 K Agilent oligonucleotide array with 30 kb backbone coverage and exon by exon coverage for 1714 genes reported to be associated with or cause disease or considered to be candidates for neurological disease association [22] http://www.bcm.edu/geneticlabs/test_detail.cfm?testcode=8655. The array was designed by Pawel Stankiewicz, March 2006 and manufactured by Agilent Technology (Santa Clara, CA). Previously used slides from both abnormal and normal cases were stripped by boiling in a 5 mM potassium phosphate buffer solution for 2 minutes. The reuse of clinical arrays provided a major cost reduction for these studies and allowed for comparison to a large body of clinical data using the same array design. We have experience that the used arrays detect known variants reliably if the Agilent DLR score is less than 0.30. These data indicate that false negative results would be rare. There is no significant concern regarding false positive results, because all CNVs were validated using new arrays with coverage suitable for the putative CNV.

The procedures for DNA digestion, labeling, and hybridization for the oligonucleotide arrays were performed according to the manufacturers' instructions, with minor modifications [23]. Slides were scanned into image files using the Agilent G2565 Microarray Scanner. Scanned images were quantified using Agilent Feature Extraction software (v10.7.3.), then analyzed for copy-number change using our in-house analysis package, as described previously [2426]. All calls made with the in-house software were visually inspected to assess their validity. Additional information on analysis can be found in Additional file 1.

Common CNVs, CNVs located in introns or in non-genic regions, and calls less than one kb were noted in the analysis but are not reported here. Common calls, as defined by being seen more than 20 times in 20,000 cases analyzed by the Medical Genetics Laboratories at Baylor College of Medicine, were also not included. Array data has been deposited in the GEO database under accession number GSE23703. All coordinates are based on the Feb. 2009 UCSC Human Genome Browser assembly (hg19).

All calls reported were validated on a variety of fresh, higher resolution arrays, depending on the coverage of the region in question and included: a custom Agilent 1 M array designed to cover the great majority of exons in the genome (Celestion-Soper submitted manuscript), Agilent SurePrint G3 Human CGH 4 × 180 catalog array (design ID: 022060), and a gene targeted custom arrays (all Agilent Technologies, Santa Clara, California, USA). All arrays used in this study were designed and analyzed based on UCSC hg18 (NCBI Build 36), March 2006. For CNVs that did not have adequate coverage on the 4 × 180 Agilent catalog, custom Agilent 4 × 180 k or 8 × 60 k arrays with focused coverage for the regions of interest were designed using Agilent's E-array database. The array designs can be viewed on Agilent's e-array database https://earray.chem.agilent.com/earray/(SSC Tiling v2.0: 027305, SSC Focused v3.0: 028249, Focused v4.0: 028812).

CNVs found in our study were compared to those in the DGV database http://projects.tcag.ca/variation/. Entries in the DGV database that were smaller than or overlapped for less than 10% of our calls were not counted. IDENTICAL indicates one or more CNVs were found in the database to be identical to our CNV +or- 10% on either side. The publication by Levinson et al. [19] analyzed most of the same NIMH samples reported here. A tabulation of all CNV findings in individual samples reported here is available from the corresponding author.

Results and Discussion

DNA from 235 subjects with both schizophrenia and idiopathic epilepsy, 80 subjects with bipolar disorder and idiopathic epilepsy, and 191 controls was analyzed using array CGH. Out of 235 subjects with both schizophrenia and idiopathic epilepsy, we identified 10 CNVs well established to be associated with schizophrenia (see references in Background above), These included three deletions of 1q21.1, one deletion of 2p16.3 (NRXN1), one deletion of 15q13.3, two maternal duplications of 15q11-q13, and three deletions of 22q11.2 (Figure 1 and Table 1). We also found two other CNVs that have a well established role in other neuropsychiatric phenotypes or epilepsy. These included a deletion of 16p13.1 which is associated with intellectual disability and more recently with schizophrenia [27] and a duplication of the Williams-Bueren syndrome region (7q11.23) which is well documented to be associated with neuropsychiatric phenotypes [28]. In total we found 12 out of 235 cases of schizophrenia and epilepsy (5.1%) harbored CNVs highly likely to have a significant association with the disease phenotype. This is the highest detection rate for pathological CNVs in any schizophrenia population studied to date.

Figure 1
figure 1

Most frequent pathogenic microdeletions observed in NIMH cases with both schizophrenia and epilepsy. Three cases of deletion 22q11.2 and three cases of deletion 1q21.1 are shown. Unique identifiers from the NIMH collection are provided. Low copy repeat (LCR) regions and gene symbols are provided. Coordinates are based on hg19.

Table 1 CNV abnormalities of likely significance in cases with both schizophrenia and epilepsy.

Among cases with schizophrenia plus epilepsy, we identified many deletions of uncertain significance but with some published link to neuropsychiatric disease. Single cases of deletions were observed for 20p13, CHRNB3, DLG2, GRIP1, and SLC1A1 (Figure 2 and Table 2). A genome wide linkage analysis for autosomal dominant schizophrenia within a single Israeli Arab pedigree demonstrated a significant linkage association with 20p13 [29] and a linkage analysis of 270 Irish high-density families with varying psychotic illnesses showed a potential linkage of the behavioral phenotypes to a region on 20p that includes 20p13 [30]. There are multiple candidate genes in the linkage interval. CHRNB3, a nicotinic receptor, is reported to be associated with smoking behavior by GWAS [31] and to show linkage to epilepsy in one study [32]. DLG2 encodes a family member of the membrane-associated guanylate kinase (MAGUK) proteins which are part of the postsynaptic density and interact with receptors, ion channels, and other signaling proteins. Other members of the postsynaptic density such as PSD95 and SAP97 are reported to have altered expression in schizophrenia and epilepsy [3335]. GRIP1 is a member of the glutamate receptor interacting proteins and has been reported to show altered expression in schizophrenia brain [36, 37]. A study of genotyped autistic patients and matched controls showed an association of a SNP within a genomic region of GRIP1 with autism [38]. Linkage to the region containing, SLC1A1, a glutamate transporter, has been reported for obsessive compulsive disorder [39, 40], and a SNP near SLC1A1 was reported to show association with autism spectrum disorder [41].

Figure 2
figure 2

Microdeletions of uncertain significance with some published link to neuropsychiatric disease observed in NIMH cases with both schizophrenia and epilepsy. Deletions on specific Agilent custom arrays for validation are shown involving an exon of DLG2, an exon of SLC1A1, an exon of GRIP1, and multiple genes at the 20p13 locus as labeled. The regions of deletion are demarcated by red rectangles. Each dot represents an oligonucleotide, green for deleted and black for normal copy number.

Table 2 CNVs abnormalities of possible significance in cases with both schizophrenia and epilepsy.

Similarly, duplications of uncertain significance but with some published link to neuropsychiatric disease were found in five schizophrenia plus epilepsy cases. These included duplications of 6q15, 16p13.2, 19p13.3, CHRNA7, and SLC6A4 (Table 3). The 6q15 duplication contains HTR1E which encodes a serotonin receptor and has been suggested to be of interest in suicide [42], and attention-deficit/hyperactivity disorder [43]. The16p13.2 region contains ABAT which encodes a γ-aminobutyrate transaminase that is responsible for the conversion of GABA to succinic semialdehyde (OMIM 137150). This locus has been reported to show altered expression in epilepsy [44] and genetic association in autism [45]. There is some evidence for linkage to autism at 19p13.3 in a Finnish pedigree [46]. CHRNA7 small duplications are common in the population and may have some pathogenic significance [47], and at least one was reported in schizophrenia [3]. We identified two CHRNA7 duplications in schizophrenia patients and one in a bipolar disorder patient; however, we also found one in our controls. This duplication is quite common, but its phenotypic significance is not clear [47]. SLC6A4 encodes a protein involved in the transport of serotonin and there is a report of genetic association with schizophrenia [48]. Additional findings of unknown significance in schizophrenia plus epilepsy samples are listed in Table 3.

Table 3 CNVs of unknown significance in schizophrenia and epilepsy samples

For the 80 subjects with both bipolar disorder and epilepsy, only 13 CNVs were identified but the number of cases studied was relatively small. Bipolar disorder was previously reported to have a lower CNV detection rate as compared to schizophrenia [49]. The findings in patients with both bipolar disorder and epilepsy are listed and discussed in Additional file 1 Table S1.

The findings in the cases were compared to findings in the NIMH controls screened to reduce the frequency of psychiatric symptoms as discussed above. One control was found to have a duplication of 22q11.2. Duplications of 22q11.2 show incomplete penetrance and cause a highly variable phenotype of neurocognitive dysfunction [50, 51], although there is at least one report suggesting an association with schizophrenia [10]. Although we screened out a large fraction of controls based on psychiatric symptoms, this individual passed that screen. Yet this male control with duplication 22q11.2 is noteworthy for self-reporting two episodes of depression lasting for four weeks accompanied by use of drugs or alcohol more than once for these problems. He also reported that drinking of alcohol interfered with school, job or home life once or twice in the past. He reported the use amphetamines, marijuana, and cocaine, and that this use interfered with school, job or home life 3-5 times. (We excluded controls reporting such drug use 6 or more times.) We also found a 15q11.2 deletion in one control sample, and though this CNV is reported to be associated with schizophrenia by some authors [3], the penetrance is quite low [12].

Other findings in controls involving loci of some neurobehavioral relevance included deletions of CNTN4 and GABRR1 and duplications of 15q11.2. Table 4 for complete list of CNVs found in controls.

Table 4 CNVs in psychiatric screened controls

We wished to assess whether the frequency of disease-related CNVs was significantly more frequent in patients with schizophrenia and epilepsy compared to controls from the same repository and tested on the same array platform. For this assessment, we accepted the following CNVs as being known to be associated with schizophrenia based on published data [12, 19]: deletions of 1q21.1, 3q29, 15q13.3, 22q11.2, and NRXN1 and duplications of 15q11-q13 (maternal), 16p11, and 16p13.3. We included maternal duplications of 15q11-q13 in this group because of the extensive literature linking it to autism and to epilepsy and based on recent reports of its occurrence in schizophrenia [14, 52]. We detected 10 schizophrenia plus epilepsy cases in 235 (4.3%) with the above mentioned CNVs compared to 0 in 191 controls (Fisher's exact test, p = 0.003). Other likely pathological findings in schizophrenia plus epilepsy cases included 1 deletion 16p13 and 1 duplication 7q11.23 for a total of 12/235 (5.1%) while a possibly pathogenic duplication of 22q11.2 was found in one control for a total of 1 in 191 (0.5%) controls (Fisher's exact test, p = 0.008). We also wished to assess whether our data supported the hypothesis that disease-associated CNVs would be more common in cases of schizophrenia and epilepsy compared to unselected schizophrenia. The rate of abnormality in the schizophrenia plus epilepsy of 10/235 for the more definite CNVs compares to a rate of 75/7336 based on supplemental data from Levinson et al. [19] for these same CNVs in a series of unselected schizophrenia cases (Fisher's exact test, p = 0.0004).

There are weaknesses in the current study including the use of cell line DNA rather than blood derived DNA, self-reporting of epilepsy findings, less than optimal phenotypic information, lack of availability of parental DNA, and lack of complete matching of ethnicity of cases and controls. Despite these difficulties, the observed differences are so substantial that they indicate that the frequency of disease-associated CNVs is significantly higher than any control group and significantly higher than in any series of unselected schizophrenia cases. Based on these results, clinicians can reasonably expect a pathological CNV detection rate of about 5% if they study patients with both schizophrenia and idiopathic epilepsy. In addition, all of these samples are readily available to diagnostic laboratories for validation of testing. These data, along with progress in whole exome and whole genome sequencing, suggest that the time is approaching when molecular genetic analysis of schizophrenia patients will be a clinically useful activity.

Conclusions

The data presented here strongly suggest that disease-associated CNVs were present at a significantly higher frequency in cases with both schizophrenia and epilepsy compared to control samples from the same repository. In addition, disease-associated CNVs were found at a significantly higher frequency in schizophrenia plus epilepsy cases than in larger series of unselected schizophrenia. Although many geneticists would argue that chromosomal microarray analysis (CMA) is clinically useful today in patients with either schizophrenia or idiopathic epilepsy alone, the case for CMA testing is more compelling for individuals with both schizophrenia and epilepsy. The detection rate of clinically significant abnormalities is increasing with exon by exon coverage for hundreds of genes [22]. Although the frequency of pathological finding may be regarded by some as relatively low, definitive abnormalities can clarify diagnosis, inform genetic counseling, and potentially lead to CNV-specific management [53].