Whole genome sequencing study of identical twins discordant for psychosis

Ormond, Cathal; Ryan, Niamh M.; Hedman, Anna M.; Cannon, Tyrone D.; Sullivan, Patrick F.; Gill, Michael; Hultman, Christina; Heron, Elizabeth A.; Johansson, Viktoria; Corvin, Aiden

doi:10.1038/s41398-024-02982-0

Whole genome sequencing study of identical twins discordant for psychosis

Article
Open access
Published: 30 July 2024

Volume 14, article number 313, (2024)
Cite this article

Download PDF

You have full access to this open access article

Translational Psychiatry

Whole genome sequencing study of identical twins discordant for psychosis

Download PDF

654 Accesses
1 Altmetric
Explore all metrics

Abstract

Monozygotic (MZ) twins are often thought to have identical genomes, but recent work has shown that early post-zygotic events can result in a spectrum of DNA variants that are different between MZ twins. Such variants may explain phenotypic discordance and contribute to disease etiology. Here we performed whole genome sequencing in 17 pairs of MZ twins discordant for a psychotic disorder (schizophrenia, schizoaffective disorder or bipolar disorder). We examined various classes of rare variants that are discordant within a twin pair. We identified four genes harboring rare, predicted deleterious missense variants that were private to an affected individual in the cohort. Variants in FOXN1 and FLOT2 would have been categorized as damaging from recent schizophrenia and bipolar exome sequencing studies. Additionally, we identified four rare genic copy number variants (CNVs) private to an affected sample, two of which overlapped genes that have shown evidence for association with schizophrenia or bipolar disorder. One such CNV was a 3q29 duplication previously implicated in autism and developmental delay. We have performed the largest MZ twin study for discordant psychotic phenotypes to date. These findings warrant further investigation using other analytical approaches.

Deep sequencing reveals variations in somatic cell mosaic mutations between monozygotic twins with discordant psychiatric disease

Article Open access 27 July 2017

Identification of somatic mutations in monozygotic twins discordant for psychiatric disorders

Article Open access 13 April 2018

Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia

Article Open access 28 November 2017

Introduction

Schizophrenia is a substantially heritable brain disorder, with heritability estimated between 60–80% [1]. The core symptoms include psychosis, where perception of reality is impaired, and people develop delusions or hallucinations [2]. The genetic etiology is complex, and known to overlap significantly with other psychiatric disorders, particularly those that present with psychosis, e.g., schizoaffective disorder, bipolar disorder [3, 4]. Despite significant progress, much of the genetic variation contributing to risk of schizophrenia and related psychotic disorders remains to be determined [5]. The search has moved from array-based studies, which focus on common variants in genome-wide association studies (GWAS), to whole exome (WES) or genome (WGS) sequencing approaches which provide more complete analysis of the full spectrum of genetic variation [6, 7]. This is important, as rare mutations that disrupt gene function, identifiable by sequencing, may be particularly informative in understanding the molecular etiology involved, and provide targets for novel therapies. Because deleterious alleles with large effects tend to be removed over time by natural selection, they tend to be rare in the population and hence very large sample sizes are required to identify them using case-control studies [8, 9].

While the rare variant case-control approach has yielded success in identifying robust risk genes for psychotic disorders [10], there are other potential analytical strategies available. Historically, twin research has played an important role in revealing the genetic epidemiology of psychiatric disorders. Heritability estimates from twin studies for schizophrenia [11] and bipolar disorder [12] are substantially higher than the heritability explained from GWAS [13, 14], so it is likely that other genetic factors (e.g. rare variants) are contributing to the heritability. Concordance rates between monozygotic (MZ) twins for schizophrenia is estimated to be 50% [15], which is notably higher than the approximately 1% incidence rate in the general population [16]. The case co-twin study design allows for the control of shared genetic and environmental effects and for the examination of non-shared genetic variation. Postnatal environmental effects such as childhood trauma or substance abuse are known to increase risk for a psychotic disorder [17]. While MZ twins are the same age and often have similar childhood experiences, it is possible that they may not share such environmental effects.

One hypothesis explaining the discordance in diagnosis between MZ twins is that both individuals share a common genetic and environmental risk which is insufficient alone to be causal for the phenotype, but rare, post-zygotic genetic variation present in the affected twin increases their disease-risk. A recent study estimated that in general, almost 10% of de novo variants occur post-fertilization and prior to progenitor germ cell specification and were thus likely to be present in both germ and blood cells [18]. Another study examined transmission of post-zygotic variants to offspring of monozygotic twins and estimated that 2.1% of de novo variants occurred after the twinning event, but prior to progenitor germ cell specification [19]. These discordant de novo variants would form part of what is estimated from twin or family studies as the environmental or even non-additive genetic effects. Given their rarity, examining variation private to one twin will drastically reduce the search space of candidate causal variants compared to unrelated case-control cohorts. WGS and WES analyses have identified de novo post-zygotic variation in MZ twins discordant for a range of disorders [20,21,22] and further investigation in psychotic disorders is warranted. Such knowledge is useful from a clinical perspective, as it highlights another important factor that may be responsible for the discordance in clinical diagnoses between MZ twins.

Here we performed WGS on peripheral blood samples from 17 pairs of MZ twins discordant for schizophrenia, schizoaffective disorder, or bipolar disorder, which is the largest sample size for such a study to date. Using a strict filtering and annotation approach, we have identified discordant single nucleotide variants (SNVs) and copy number variants (CNVs) that may provide novel insights into the genetic basis of these conditions.

Methods and materials

Sample procurement

Ethics

Written informed consent was obtained from all participants in this study. Ethical permissions were obtained from Stockholm County in Sweden (Dnr: 2004/448/4, 2007/779-31/3 and 2008/292-32).

Recruitment and diagnostic assessment procedures

The schizophrenia and bipolar twin study in Sweden (STAR) is a study on MZ and DZ twin pairs with schizophrenia or bipolar disorder and in total 462 twins have participated, see Johansson et al. for further description of the cohort [23]. The participants in this study were originally identified through the Swedish Twin register (STR) [24] and the National Patient register (NPR), which is administered by the Social board of health and welfare. Potential participants were invited to the STAR study if only one twin had a registered treatment episode of schizophrenia or bipolar disorder (Diagnoses according to International Statistical Classification of Diseases: ICD-8: 295 or 296, ICD-9: 295 or 296 or ICD-10: F20, F30 or F31). If the twin pair decided to participate in STAR, an extensive assessment procedure was initiated. Diagnostic status was confirmed by a clinical psychiatrist through the Structured Clinical Interview for DSM-IV (SCID-I) [25]. The final diagnosis was determined by an evaluation team, with access to register data from previous hospitalizations and hospital records. The diagnoses were categorized as schizophrenia (ICD-10: F20), schizoaffective disorder (ICD-10: F25), bipolar disorder (ICD-10: F31), major depressive disorder (ICD-10 F32-F33) or not affected by any of those diagnoses.

From the STAR cohort we selected all available disease discordant MZ twin pairs (n = 19). We included pairs in which one twin was affected from schizophrenia, schizoaffective disorder, or bipolar disorder, and the co-twin was not affected from any of those diagnoses. These diagnoses were selected based on the shared genomics from rare variants [26] and given that all three can exhibit psychotic symptoms. In addition, schizoaffective disorder is often included in the case definition for both schizophrenia and bipolar disorder analyses [10, 26]. Rare, protein altering variants have a more modest effect on major depressive disorder (MDD) without psychosis or bipolar disorder [27] compared to schizophrenia [10] or bipolar disorder [26]. In addition, the shared common variant heritability between MDD and schizophrenia (r_g = 0.37) or between MDD and bipolar disorder (r_g = 0.34) are weaker than the shared heritability between schizophrenia and bipolar disorder (r_g = 0.68) [28]. We therefore included pairs where the co-twin had a diagnosis of MDD without occurrence of psychotic symptoms and considered that individual as unaffected.

Whole genome sequencing and quality control

DNA collection and extraction

The same day as the clinical assessment, blood samples for DNA extraction were collected in the morning and were sent to Karolinska Institutet (KI) Biobank for processing and storage. DNA was extracted from EDTA blood based on a salting out method from Puregene extraction kit using a Gentra robot. DNA concentrations were quantified by Qubit and the quality of DNA was determined by agarose gel electrophoresis. Both samples from the pair T19 failed quality control metrics for sequencing and were excluded.

Sequencing and quality control

WGS was performed by Edinburgh Genomics (Clinical Genomics) on a HiSeqX to an average depth of coverage of 30x per sample. WGS allowed us to examine non-coding regions which would be largely inaccessible from whole exome sequencing data. Additionally, WGS can give us better breakpoint resolution for calling CNVs. All FASTQ files were examined using FastQC and samtools [29] to identify DNA contamination or degradation. Reads were aligned to the GRCh38 reference genome using BWA-MEM [30], following the GATK Best Practices [31]. Briefly, this involved marking PCR duplicates, base quality score recalibration, local realignment of reads around indels, and variant calling with HaplotypeCaller (GATK version 3.8-0-ge9d806836). Genotype calling was performed jointly across all samples, and variant quality score recalibration (VQSR) was performed on the SNVs and Indels separately (see Supplementary Methods).

The software peddy [32] was applied to all samples jointly to check for: (i) relatedness discordance; (ii) sex discordance; (iii) low median coverage; and (iv) ancestry clustering by a principal component analysis (PCA) based on 1000 Genomes Project data [33]. Sample T04_U was flagged as having a relatedness error with all other samples at this stage, and so twin pair T04 was excluded. SNP array data was available for 35 samples, and the genotype concordance rates between the sequence data and SNP array data was determined using the GenotypeConcordance module from GATK. To confirm zygosity within each twin pair, the genotype concordance rates for the sequence data and for the SNP array data were determined using picard and GATK respectively (see Supplementary Methods).

SNV and indel prioritization

To remove low-quality variants, any variant with QUAL < 100.0 was removed across all samples [22]. In addition, if any sample had GQ < 20.0 or DP < 10.0, the genotype for that sample was set to missing [34]. The variants of interest are putative de novo events present in a twin pair, (i.e., where the affected sample had exactly one more copy of the allele of interest than their co-twin). A reverse-pairwise analysis was performed within each twin pair to identify such variants, regardless of affectation status. The Variant Effects Predictor (VEP) [35] was used to annotate each entry for: functional impact (sequence ontology) [36]; predicted deleteriousness (SIFT [37], PolyPhen-2 [38] and CADD [39]), and allele frequency (1000 Genomes Project [40] and gnomAD [41]).

To identify rare, putatively pathogenic variants, the following filters were applied: (i) variants were present in the coding sequence of a protein coding gene as determined by RefSeq [42]; (ii) VEP impact was MODERATE or HIGH; (iii) SIFT was “deleterious” or PolyPhen was “damaging”; (iv) the allele frequency was <1% or absent in the appropriate population groups in the 1000 Genomes Project and gnomAD databases; and (v) variants were not observed in any other twin pair within the cohort. Only SNVs were considered at this step, as SIFT and PolyPhen do not provide scores for indels. Multi-allelic sites were split to bi-allelic sites to further identify the pathogenic allele.

CNV calling and prioritization

Germline CNVs were called using a family-based consensus using four separate calling tools (Supplementary Methods). All the calls within a twin pair were combined (Supplementary Figs. 1, 2) to create Regions of Interest (ROIs). Any ROI that was found in one sample of the pair and identified by only one calling algorithm was removed. Once a list of high-confidence ROIs has been generated, variants were removed if they had at least a 50% reciprocal overlap with any common CNVs (i.e. frequency at least 1% in the appropriate population group) in the following public databases: gnomAD [41], the Deciphering Developmental Disorders (DDD) study [43], and the Database of Genomic Variants (DGV) [44]. As the DDD and gnomAD databases were curated relative to the hg19 genome build, the ROI files were converted to this build using the UCSC liftOver tool [45].

Any ROI that had a 50% reciprocal overlap with a variant labelled as “Pathogenic” in the NIH Clinical Genomics (ClinGen) CNV database (UCSC “iscaPathogenic” table) [46] was retained, regardless of population frequency. Since ClinGen collates CNV calls from a wide collection of sources, each of which may use different reference material for CNV calling, it is not possible to know if the type of pathogenic CNV matches that of the CNV call in our data. Hence, CNV calls were not matched for type at this stage. Pathogenic CNVs were retained if the associated phenotype was psychiatric or neurodevelopmental in nature.

Results

WGS data

WGS was performed on 19 MZ twin pairs discordant for a psychotic disorder, to a minimum of 30x coverage (Methods). After quality control, 17 pairs of twins were carried forward for analysis (Table 1, Supplementary Table 1). Principal components analysis identified that all samples were of European ancestry with the exception of twin pair T05 who were from East Asia (Supplementary Figs. 3–5). The within-pair genotype concordance rates confirmed the expected zygosity (Supplementary Methods; Supplementary Table 2). A high concordance between the WGS data and previously generated genotype array data for each sample confirmed that there was no sample mix-up. After removing lower quality variants, each sample had an average of 44 306 (standard deviation: 1640.5; median: 44,264; range: 41,134–48,050) discordant SNVs and indels across the genome (Supplementary Table 1). Given that short read WGS detects approximately 4,000,000 variants per genome [47], this implies approximately 1% of the variants detected in each sample are discordant. As the estimated error rate is 0.1% for short-read WGS on Illumina HiSeq technologies [48], this number is unlikely to be attributed solely to sequencing errors.

Table 1 Phenotypic data for the 17 pairs of MZ twins.

Full size table

Discordant SNVs and indels

To identify rare, putatively pathogenic variants that may have a direct effect on disease burden, a rigorous filtering pipeline was applied to discordant SNVs present in protein-coding regions (Methods). After applying filters, six rare, predicted deleterious discordant SNVs were discovered across four unique genes, (Table 2). For FOXN1, three discordant variants present in close proximity ( < 15 bp) in the same individual appeared on the same re-constructed reads, likely due to re-alignment of reads around indels during variant calling (Supplementary Fig. 6). We used CADD scores [39] and the RegulomeDB database [49] to examine rare, discordant, putatively pathogenic variants with a predicted regulatory effect (Supplementary Methods). All variants were required to have a Phred-scaled CADD score > 20.0 (i.e., in the top 1% of predicted deleterious variants across the genome) and a RegulomeDB score of 1 or 2. One such variant was identified but was present in an unaffected individual (Supplementary Table 4).

Table 2 Discordant protein-coding variants with a predicted pathogenic effect.

Full size table

We considered eight regulatory annotation features likely to have an impact on phenotype, derived from the ENCODE project as well as brain specific annotation features (Supplementary Methods). We use a Wilcoxon signed-rank sum test to evaluate whether the median count of all discordant variants overlapping each feature was different between the affected samples and the unaffected samples within a twin pair (Supplementary Fig. 7). No significant difference in median counts were observed for any regulatory feature (Supplementary Table 5).

CNVs and repeat expansions

Germline CNVs were called using a consensus approach based on four separate calling algorithms, extended to take family structure into account (Methods, Supplementary Methods). We screened all individuals against a list of 23 rare CNVs (Supplementary Table 6) previously implicated in schizophrenia or related psychotic disorders [50, 51]. One of these, a duplication on chromosome 13q12.11, was identified in both samples of twin pair T09. This CNV had previously been reported as having a protective effect for schizophrenia.

We prioritized discordant CNVs in cases, that were either rare/absent in public databases or had a known pathogenic effect. After applying filters, four rare CNVs that overlapped gene regions were identified across the cohort (Table 3). Of note, a duplication on chromosome 3q29 was observed in the affected sample of twin pair T17. While 3q29 deletions have been associated with schizophrenia, 3q29 duplications have been implicated in autism spectrum disorders and developmental delay [52]. This prompted us to examine a more extensive list of CNVs annotated by the ClinGen CNV database as implicated in psychiatric or neurodevelopmental disorders (Supplementary Table 10). 14 CNVs with a clinical impact were identified across the samples, but only the 3q29 duplication was present solely in the affected individuals.

Table 3 A list of rare discordant CNVs, including: the positions (GRCh38); length; type (DEL deletion, DUP duplication); the carrier sample ID; their phenotype (SAD schizoaffective disorder, SCZ schizophrenia); if they are annotated as pathogenic; and the overlapping genes from the BDgene and SZDB databases (full list in Supplementary Tables 8, 9).

Full size table

Somatic CNVs were called using MoChA [53], which examines differences in depth of coverage from phased SNVs (Supplementary Methods). After removing putative false positives on visual inspection of the regions, there does not appear to be evidence for the presence of discordant somatic CNVs in these samples. Repeat expansions were called from the raw sequence data using ExpansionHunter [54] for a collection of 16 repeat expansion disorders (Supplementary Methods, Supplementary Tables 11, 12). Despite some discordances within twin pairs, none of the samples had repeat counts above the specified threshold for any disorder (Supplementary Table 13, Supplementary Fig. 8).

Discussion

Here we report a WGS study where we assessed de novo post-zygotic variation in blood sample in the affected member of 17 MZ twins discordant for a major psychotic disorder (schizophrenia, schizoaffective disorder, or bipolar disorder). A rigorous filtering strategy identified six rare, deleterious, discordant, protein coding SNVs across four genes, each present in one affected member of the cohort. None of the six missense SNVs appeared in the Schizophrenia Exome Sequencing Meta-Analysis (SCHEMA) database [10], but some variants close by in the amino acid sequence were observed (Supplementary Table 3). In the Bipolar Exome (BipEx) sequencing project [26], one of the S291R variants of FOXN1 was observed in a bipolar case only. This variant was annotated as “other missense”, a class of variants not found to be enriched in bipolar cases compared to controls. However, all FOXN1 variants in this study would have been prioritized as “damaging missense” in the SCHEMA analysis. While the FLOT2 variant was not observed in the BipEx database, it is categorized as “damaging missense”.

We implemented a consensus calling strategy for CNVs and screened for rare CNVs with a known association with psychosis, these being implicated in schizophrenia studies. We identified four rare, discordant CNVs present in affected samples only (Table 3). One such variant was a duplication in the 3q29 region in the affected sample of twin pair T17. While only deletions in this region have been shown to be associated with schizophrenia, this region has also been implicated in autism spectrum disorders and developmental delay. Two of the regions contained genes which had previously shown some relationship to schizophrenia or bipolar disorder (Table 3, Supplementary Tables 8, 9). We investigated an extended list of CNVs reported to have a pathogenic effect in psychiatric or neurodevelopmental disorders (Supplementary Table 10). The previously mentioned 3q29 duplication is the only pathogenic CNV to be found exclusively in affected samples.

A duplication on chromosome 13q12.11 was identified in both samples of twin pair T09. In a discovery association analysis, this CNV was noted to have a protective effect for schizophrenia but was only nominally significant [50]. However, it is worth noting that the affected individual in this twin pair also has the rare, deleterious, discordant missense variant in the FLOT2 gene (Table 2). FLOT2 (Flotillin-2) has been shown to be involved in neuronal differentiation [55] and flotillins are known to interact with the NR2A and NR2B subunits of N-methyl-D-aspartate receptors [56].

Given the age profile of some of the twin pairs, it is possible that some of them may have children old enough to have a reliable psychiatric diagnosis. A follow up study including offspring of the twin pairs from this study could allow for the examination of transmission and segregation of the variants we have identified in the next generation. Given the relatively low transmission rates of post-zygotic variants from an MZ twin to their offspring [19], if the variants identified here were found in affected offspring of the MZ twins, this would provide additional support for these variants as disease-causing within that pedigree.

This study has some limitations. First, while the sample size is large relative to other studies of MZ twins, the cohort is not sufficiently powered to statistically evaluate the burden of rare, discordant, protein-coding variants. Second, while we have used the case co-twin design to identify post-zygotic variation, we do not have access to the parental genomes to confirm that our variants are de novo. Parental information would also allow us to examine de novo variation shared between both twins. For example, a shared rare, de novo variant with reduced penetrance could conceivably explain the phenotypic discordance between the twins. Third, the average depth of coverage of the WGS data in this study [30x] may not be sufficient to call variants under a somatic model. Whole exome sequencing, which typically uses a depth of coverage of 90x or higher, has had some success in identifying somatic variants in MZ twins discordant for psychosis [57]. This is supported by the lack of evidence of discordant, somatic CNVs, with all candidate variants being rejected on manual review. Fourth, although we aimed to identify post-zygotic variation occurring early in development, it is possible that de novo variants specific to brain tissue may be present [58], and these may not be observable from blood tissue. Finally, while the samples in this study have undergone a comprehensive diagnostic procedure to ascertain the validity of the phenotypic discordance within each twin pair, the unaffected twin may go on to receive a diagnosis later in life. Future follow up with these individuals from register data may be possible, although many of the twin pairs were originally interviewed after typical age of onset of psychotic disorders.

We have performed the largest MZ twin study for discordant psychotic phenotypes to date. While post-zygotic genomic variations are known to contribute to the discordance in phenotype between MZ twins, other factors such as environmental effects and epigenomic variation can also be driving this discordance. Therefore, many studies of MZ twins also look at methylation differences between MZ twins in addition to genomic variation [59, 60]. In this study, we focused on a broad range of genomic variation, from SNVs (both protein-altering and regulatory), to CNVs and repeat expansions, making use of the full potential of WGS data. This study is important as it contributes novel findings to the current body of literature for variants implicated in psychotic disorders and provides a framework for future studies.

Data availability

The WGS data from the individuals in this study are available from the NIMH Data Archive, collection ID 4231.

References

Owen MJ, Sawa A, Mortensen PB. Schizophrenia. Lancet. 2016;388:86–97.
Article PubMed PubMed Central Google Scholar
Association AP. Diagnostic and Statistical Manual of Mental Disorders (DSM-5¯). American Psychiatric Publishing. 2013.
Cardno AG, Owen MJ. Genetic relationships between schizophrenia, bipolar disorder, and schizoaffective disorder. Schizophr Bull. 2014;40:504–15.
Article PubMed PubMed Central Google Scholar
Lichtenstein P, Yip BH, Bjork C, Pawitan Y, Cannon TD, Sullivan PF, et al. Common genetic determinants of schizophrenia and bipolar disorder in Swedish families: a population-based study. Lancet. 2009;373:234–9.
Article CAS PubMed Google Scholar
Owen MJ, Williams NM. Explaining the missing heritability of psychiatric disorders. World Psychiatry: Off J World Psychiatr Assoc (WPA). 2021;20:294–5.
Article Google Scholar
Schreiber M, Dorschner M, Tsuang D. Next-generation sequencing in schizophrenia and other neuropsychiatric disorders. Am J Med Genet Part B, Neuropsychiatr Genet: Off Publ Int Soc Psychiatr Genet. 2013;162b:671–8.
Article Google Scholar
Kato T. Whole genome/exome sequencing in mood and psychotic disorders. Psychiatry Clin Neurosci. 2015;69:65–76.
Article CAS PubMed Google Scholar
Zuk O, Schaffner SF, Samocha K, Do R, Hechter E, Kathiresan S, et al. Searching for missing heritability: designing rare variant association studies. Proc Natl Acad Sci USA. 2014;111:E455–464.
Article CAS PubMed PubMed Central Google Scholar
Sullivan PF, Geschwind DH. Defining the genetic, genomic, cellular, and diagnostic architectures of psychiatric disorders. Cell. 2019;177:162–83.
Article CAS PubMed PubMed Central Google Scholar
Singh T, Poterba T, Curtis D, Akil H, Al Eissa M, Barchas JD, et al. Rare coding variants in ten genes confer substantial risk for schizophrenia. Nature. 2022;604:509–16.
Article CAS PubMed PubMed Central Google Scholar
Hilker R, Helenius D, Fagerlund B, Skytthe A, Christensen K, Werge TM, et al. Heritability of schizophrenia and schizophrenia spectrum based on the nationwide Danish twin register. Biol Psychiatry. 2018;83:492–8.
Article PubMed Google Scholar
Smoller JW, Finn CT. Family, twin, and adoption studies of bipolar disorder. Am J Med Genet C Semin Med Genet. 2003;123c:48–58.
Article PubMed Google Scholar
Trubetskoy V, Pardiñas AF, Qi T, Panagiotaropoulou G, Awasthi S, Bigdeli TB, et al. Mapping genomic loci implicates genes and synaptic biology in schizophrenia. Nature. 2022;604:502–8.
Article CAS PubMed PubMed Central Google Scholar
Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet. 2021;53:817–29.
Article CAS PubMed PubMed Central Google Scholar
Cardno AG, Gottesman II. Twin studies of schizophrenia: from bow-and-arrow concordances to star wars Mx and functional genomics. Am J Med Genet. 2000;97:12–17.
Article CAS PubMed Google Scholar
McGrath J, Saha S, Chant D, Welham J. Schizophrenia: a concise overview of incidence, prevalence, and mortality. Epidemiol Rev. 2008;30:67–76.
Article PubMed Google Scholar
Robinson N, Bergen SE. Environmental risk factors for schizophrenia and bipolar disorder and their relationship to genetic risk: current knowledge and future directions. Front Genet. 2021;12:686666.
Article PubMed PubMed Central Google Scholar
Sasani TA, Pedersen BS, Gao Z, Baird L, Przeworski M, Jorde LB, et al. Large three-generation human families revea l post-zygotic mosaicism and variability in germline mutation accumulation. eLife. 2019;8:e46922. https://doi.org/10.7554/eLife.46922.
Jonsson H, Magnusdottir E, Eggertsson HP, Stefansson OA, Arnadottir GA, Eiriksson O, et al. Differences between germline genomes of monozygotic twins. Nat Genet. 2021;53:27–34.
Article CAS PubMed Google Scholar
Huang Y, Zhao Y, Ren Y, Yi Y, Li X, Gao Z, et al. Identifying genomic variations in monozygotic twins discordant for autism spectrum disorder using whole-genome sequencing. Mol Ther Nucleic Acids. 2019;14:204–11.
Article CAS PubMed Google Scholar
Tang J, Fan Y, Li H, Xiang Q, Zhang DF, Li Z, et al. Whole-genome sequencing of monozygotic twins discordant for schizophrenia indicates multiple genetic risk factors for schizophrenia. J Genet Genom. 2017;44:295–306.
Article CAS Google Scholar
Castellani CA, Melka MG, Gui JL, Gallo AJ, O’Reilly RL, Singh SM. Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia. Clin Transl Med. 2017;6:43.
Article CAS PubMed PubMed Central Google Scholar
Johansson V, Hultman CM, Kizling I, Martinsson L, Borg J, Hedman A, et al. The schizophrenia and bipolar twin study in Sweden (STAR). Schizophr Res. 2019;204:183–92.
Article PubMed Google Scholar
Lichtenstein P, Björk C, Hultman CM, Scolnick E, Sklar P, Sullivan PF. Recurrence risks for schizophrenia in a Swedish national cohort. Psychol Med. 2006;36:1417–25.
Article PubMed Google Scholar
Spitzer RL, Williams JB, Gibbon M, First MB. The structured clinical interview for DSM-III-R (SCID). I: History, rationale, and description. Arch Gen Psychiatry. 1992;49:624–9.
Article CAS PubMed Google Scholar
Palmer DS, Howrigan DP, Chapman SB, Adolfsson R, Bass N, Blackwood D, et al. Exome sequencing in bipolar disorder identifies AKAP11 as a risk gene shared with schizophrenia. Nat Genet. 2022;54:541–7.
Article CAS PubMed PubMed Central Google Scholar
Tian R, Ge T, Hyeokmoon, Kweon H, Rocha DB, Lam M, et al. (2024) Whole-exome sequencing in UK Biobank reveals rare genetic architecture for depression Abstract. Nat Commun. 2024;15. https://doi.org/10.1038/s41467-024-45774-2.
Grotzinger AD, Mallard TT, Akingbuwa WA, Ip HF, Adams MJ, Lewis CM, et al. Genetic architecture of 11 major psychiatric disorders at biobehavioral, functional genomic and molecular genetic levels of analysis. Nat Genet. 2022;54:548–59.
Article CAS PubMed PubMed Central Google Scholar
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The sequence alignment/map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Article PubMed PubMed Central Google Scholar
Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM. 2013.
Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the genome analysis toolkit best practices pipeline. Curr Protoc Bioinforma. 2013;43:1–33.
Google Scholar
Pedersen BS, Quinlan AR. Who’s who? Detecting and resolving sample anomalies in human DNA sequencing studies with peddy. Am J Hum Genet. 2017;100:406–13.
Article CAS PubMed PubMed Central Google Scholar
Auton A, Brooks LD, Durbin RM, Garrison EP, Kang HM, Korbel JO, et al. A global reference for human genetic variation. Nature. 2015;526:68–74.
Article PubMed Google Scholar
Vadgama N, Pittman A, Simpson M, Nirmalananthan N, Murray R, Yoshikawa T, et al. De novo single-nucleotide and copy number variation in discordant monozygotic twins reveals disease-related genes. Eur J Hum Genet. 2019;27:1121–1133. https://doi.org/10.1038/s41431-019-0376-7.
McLaren W, Gil L, Hunt SE, Riat HS, Ritchie GR, Thormann A, et al. The ensembl variant effect predictor. Genome Biol. 2016;17:122.
Article PubMed PubMed Central Google Scholar
Eilbeck K, Lewis SE, Mungall CJ, Yandell M, Stein L, Durbin R, et al. The sequence ontology: a tool for the unification of genome annotations. Genome Biol. 2005;6:R44.
Article PubMed PubMed Central Google Scholar
Ng PC, Henikoff S. SIFT: predicting amino acid changes that affect protein function. Nucleic Acids Res. 2003;31:3812–4.
Article CAS PubMed PubMed Central Google Scholar
Adzhubei I, Jordan DM, Sunyaev SR. Predicting Functional Effect of Human Missense Mutations Using PolyPhen‐2 Abstract. Curr Protoc Hum Genet. 2013;76. https://doi.org/10.1002/0471142905.hg0720s76.
Rentzsch P, Witten D, Cooper GM, Shendure J, Kircher M. CADD: predicting the deleteriousness of variants throughout the human genome. Nucleic Acids Res. 2019;47:D886–d894.
Article CAS PubMed Google Scholar
Sudmant PH, Rausch T, Gardner EJ, Handsaker RE, Abyzov A, Huddleston J, et al. An integrated map of structural variation in 2504 human genomes. Nature. 2015;526:75–81.
Article CAS PubMed PubMed Central Google Scholar
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alföldi J, Wang Q, et al. (2020) The mutational constraint spectrum quantified from variation in 141456 humans Abstract. Nature. 2020;581:434–443. https://doi.org/10.1038/s41586-020-2308-7.
O’Leary NA, Wright MW, Brister JR, Ciufo S, Haddad D, McVeigh R, et al. Reference sequence (RefSeq) database at NCBI: current status, taxonomic expansion, and functional annotation. Nucleic Acids Res. 2016;44:D733–745.
Article PubMed Google Scholar
Firth HV, Wright CF. The deciphering developmental disorders (DDD) study. Dev Med Child Neurol. 2011;53:702–3.
Article PubMed Google Scholar
MacDonald JR, Ziman R, Yuen RK, Feuk L, Scherer SW. The database of genomic variants: a curated collection of structural variation in the human genome. Nucleic Acids Res. 2014;42:D986–992.
Article CAS PubMed Google Scholar
Kuhn RM, Haussler D, Kent WJ. The UCSC genome browser and associated tools. Brief Bioinform. 2013;14:144–61.
Article CAS PubMed Google Scholar
Rehm HL, Berg JS, Brooks LD, Bustamante CD, Evans JP, Landrum MJ, et al. ClinGen–the clinical genome resource. N. Engl J Med. 2015;372:2235–42.
Article CAS PubMed PubMed Central Google Scholar
Lappalainen T, Scott AJ, Brandt M, Hall IM. Genomic analysis in the age of human genome sequencing. Cell. 2019;177:70–84.
Article CAS PubMed PubMed Central Google Scholar
Fox EJ, Reid-Bayliss KS, Emond MJ, Loeb LA. Accuracy of next generation sequencing platforms. Next Gener Seq Appl. 2014:1.
Boyle AP, Hong EL, Hariharan M, Cheng Y, Schaub MA, Kasowski M, et al. Annotation of functional variation in personal genomes using RegulomeDB. Genome Res. 2012;22:1790–7.
Article CAS PubMed PubMed Central Google Scholar
Marshall CR, Howrigan DP, Merico D, Thiruvahindrapuram B, Wu W, Greer DS, et al. Contribution of copy number variants to schizophrenia from a genome-wide study of 41,321 subjects. Nat Genet. 2017;49:27–35.
Article CAS PubMed Google Scholar
Rees E, Walters JT, Georgieva L, Isles AR, Chambert KD, Richards AL, et al. Analysis of copy number variations at 15 schizophrenia-associated loci. Br J Psychiatry: J Ment Sci. 2014;204:108–14.
Article Google Scholar
Pollak RM, Zinsmeister MC, Murphy MM, Zwick ME, Mulle JG. New phenotypes associated with 3q29 duplication syndrome: results from the 3q29 registry. Am J Med Genet A. 2020;182:1152–66.
Article CAS PubMed PubMed Central Google Scholar
Loh PR, Genovese G, Handsaker RE, Finucane HK, Reshef YA, Palamara PF, et al. Insights into clonal haematopoiesis from 8342 mosaic chromosomal alterations. Nature. 2018;559:350–5.
Article CAS PubMed PubMed Central Google Scholar
Dolzhenko E, van Vugt JJFA, Shaw RJ, Bekritsky MA, van Blitterswijk M, Narzisi G, et al. Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res. 2017;27:1895–903.
Article CAS PubMed PubMed Central Google Scholar
Hanafusa K, Hayashi N. The Flot2 component of the lipid raft changes localization during neural differentiation of P19C6 cells. BMC Mol Cell Biol. 2019;20:38.
Article PubMed PubMed Central Google Scholar
Swanwick CC, Shapiro ME, Yi Z, Chang K, Wenthold RJ. NMDA receptors interact with flotillin-1 and -2, lipid raft-associated proteins. FEBS Lett. 2009;583:1226–30.
Article CAS PubMed Google Scholar
Nishioka M, Bundo M, Ueda J, Yoshikawa A, Nishimura F, Sasaki T, et al. Identification of somatic mutations in monozygotic twins discordant for psychiatric disorders. NPJ Schizophr. 2018;4:7.
Article PubMed PubMed Central Google Scholar
Fullard JF, Charney AW, Voloudakis G, Uzilov AV, Haroutunian V, Roussos P. Assessment of somatic single-nucleotide variation in brain tissue of cases with schizophrenia. Transl Psychiatry. 2019;9:21.
Article PubMed PubMed Central Google Scholar
Castellani CA, Melka MG, Gui JL, O’Reilly RL, Singh SM. Integration of DNA sequence and DNA methylation changes in monozygotic twin pairs discordant for schizophrenia. Schizophr Res. 2015;169:433–40.
Article CAS PubMed Google Scholar
Li Q, Wang Z, Zong L, Ye L, Ye J, Ou H, et al. Allele-specific DNA methylation maps in monozygotic twins discordant for psychiatric disorders reveal that disease-associated switching at the EIPR1 regulatory loci modulates neural function. Mol Psychiatry. 2021;26:6630–6642. https://doi.org/10.1038/s41380-021-01126-w.

Download references

Acknowledgements

The authors would like to thank all the study participants, research nurse Karin Dellenwall and the clinicians who performed the psychiatric assessments. The authors acknowledge the support of the Trinity Centre for High Performance Computing (ResearchIT). This work as part of the Psychiatric Genomics Consortium was supported by the National Institutes of Health [5U01MH 109499-04 to A.C., and P.F.S; R01 MH052857 to T.D.C], Science Foundation Ireland [16/SPP/3324 to A.C], the Swedish Research Council (Vetenskapsrådet, award D0886501 to P.F.S.), and by ALF, (a regional agreement on medical training and clinical research between Stockholm County Council and the Karolinska Institutet) [20100305, 20090183 to C.H].

Author information

Authors and Affiliations

Neuropsychiatric Genetics Research Group, Department of Psychiatry, Trinity College Dublin, Dublin, Ireland
Cathal Ormond, Niamh M. Ryan, Michael Gill, Elizabeth A. Heron & Aiden Corvin
Department of Epidemiology and Biostatistics, Karolinska Institutet, Stockholm, Sweden
Anna M. Hedman, Patrick F. Sullivan, Christina Hultman & Viktoria Johansson
Departments of Psychology and Psychiatry, Yale University School of Medicine, New Haven, CT, USA
Tyrone D. Cannon
Departments of Genetics and Psychiatry, University of North Carolina, Chapel Hill, NC, USA
Patrick F. Sullivan
Department of Clinical Sciences, Psychiatry Unit, Umeå University, Umeå, Sweden
Viktoria Johansson

Authors

Cathal Ormond
View author publications
You can also search for this author in PubMed Google Scholar
Niamh M. Ryan
View author publications
You can also search for this author in PubMed Google Scholar
Anna M. Hedman
View author publications
You can also search for this author in PubMed Google Scholar
Tyrone D. Cannon
View author publications
You can also search for this author in PubMed Google Scholar
Patrick F. Sullivan
View author publications
You can also search for this author in PubMed Google Scholar
Michael Gill
View author publications
You can also search for this author in PubMed Google Scholar
Christina Hultman
View author publications
You can also search for this author in PubMed Google Scholar
Elizabeth A. Heron
View author publications
You can also search for this author in PubMed Google Scholar
Viktoria Johansson
View author publications
You can also search for this author in PubMed Google Scholar
Aiden Corvin
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design: PFS, CH, TDC, AC; Patient recruitment: TDC, CH; Clinical data collection and selection: PFS, VJ, AH; Bioinformatics processing and analysis: CO, NMR; Interpreted the findings: CO, NMR, EAH, AC, PFS; Manuscript writing: CO, NMR, EAH, AC; All authors reviewed and approved the final version of the manuscript. PFS reports the following potentially competing financial interests: Lundbeck (advisory committee, grant recipient), RBNC Therapeutics (advisory committee, shareholder). The other authors have nothing to disclose.

Corresponding author

Correspondence to Aiden Corvin.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Supplementary Tables

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Ormond, C., Ryan, N.M., Hedman, A.M. et al. Whole genome sequencing study of identical twins discordant for psychosis. Transl Psychiatry 14, 313 (2024). https://doi.org/10.1038/s41398-024-02982-0

Download citation

Received: 23 November 2023
Revised: 14 June 2024
Accepted: 18 June 2024
Published: 30 July 2024
DOI: https://doi.org/10.1038/s41398-024-02982-0
Springer Nature Limited

Whole genome sequencing study of identical twins discordant for psychosis

Abstract

Similar content being viewed by others

Deep sequencing reveals variations in somatic cell mosaic mutations between monozygotic twins with discordant psychiatric disease

Identification of somatic mutations in monozygotic twins discordant for psychiatric disorders

Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia

Introduction

Methods and materials

Sample procurement

Ethics

Recruitment and diagnostic assessment procedures

Whole genome sequencing and quality control

DNA collection and extraction

Sequencing and quality control

SNV and indel prioritization

CNV calling and prioritization

Results

WGS data

Discordant SNVs and indels

CNVs and repeat expansions

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Material

Supplementary Tables

Rights and permissions

About this article

Cite this article

Navigation

Whole genome sequencing study of identical twins discordant for psychosis

Abstract

Similar content being viewed by others

Deep sequencing reveals variations in somatic cell mosaic mutations between monozygotic twins with discordant psychiatric disease

Identification of somatic mutations in monozygotic twins discordant for psychiatric disorders

Post-zygotic genomic changes in glutamate and dopamine pathway genes may explain discordance of monozygotic twins for schizophrenia

Introduction

Methods and materials

Sample procurement

Ethics

Recruitment and diagnostic assessment procedures

Whole genome sequencing and quality control

DNA collection and extraction

Sequencing and quality control

SNV and indel prioritization

CNV calling and prioritization

Results

WGS data

Discordant SNVs and indels

CNVs and repeat expansions

Discussion

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Material

Supplementary Tables

Rights and permissions

About this article

Cite this article

Share this article

Search

Navigation