Background

The genetic causes of neurodevelopmental disorders include almost all types of genomic variations (mainly, chromosomal rearrangements (microscopic and submicroscopic), copy number variations (CNV) and single gene mutations) [17]. Additionally, epigenetic alterations due to genomic variations affecting genes involved in epigenomic regulation and uniparental disomy resulting from chromosomal or segmental homozygosity (HMZ) are shown to contribute to the etiology of neurodevelopmental diseases [811]. However, epigenomic variations and instability have been significantly less investigated in terms of the causative role for these diseases than deserved [10, 11]. Moreover, genome- and epigenome-based analysis of brain cells suggests that epigenetic changes are likely an underappreciated source of neuronal diversity and neurodevelopmental diseases [12]. Accordingly, one can hypothesize that overlooked epigenomic variations might be involved in the pathogenesis of neurological and psychiatric diseases.

Probably the commonest type of epigenomic variations in humans is long contiguous stretches of homozygosity (LCSH) (also known as regions/runs of homozygosity and losses of heterozygosity) defined as CNV neutral chromosomal segments featured by allelic HMZ [1315]. LCSH over 1 Mb are always observed during genome-wide analyses by single-nucleotide polymorphism (SNP) chromosomal microarrays [1619]. The presence of LCSH can be indicative for parental consanguinity, uniparental disomy, or HMZ for single gene recessive mutations [1921]. Furthermore, LCHS are helpful for uncovering the genetic basis for complex traits [22] and locus-specific deleterious genomic variation [23]. Taking into account the epigenetic contribution to brain development and plasticity as well as genetic-environmental interactions in neuropsychiatric diseases, epigenomic variability was suggested to be a mechanism for neurodevelopmental disorders [24, 25]. Recently, several studies have addressed LCSH in brain diseases. However, these yielded conflicting results [2628]. Actually, the only more-or-less confirmed association between LCSH occurrence and neurodevelopmental diseases (intellectual disability (ID) and autism) is related to excess of LCSH encompassing recessive disease genes [19, 29]. Surprisingly, imprinted gene loci were not considered as a target for studying LCSH in neurodevelopmental disorders. Since classical imprinting syndromes, i. e. Angelman syndrome (AS), Beckwith-Wiedemann syndrome (BWS), Prader-Willi syndrome (PWS) and Silver-Russell syndrome (SRS), are associated with ID, autistic behavior, developmental delay, and seizures [30, 31], we hypothesize that LCSH at these disease loci may result in a similar neurological or behavioral phenotype.

In this study, LCSH were evaluated in a cohort of children with idiopathic intellectual disability, autism, congenital malformations and/or epilepsy by SNP chromosomal microarray with a resolution of HMZ stretch detection reaching a minimum of 1 Mb in size. An original bioinformatics approach to the prioritization of genes and genome/epigenome variations was used to assess pathogenic value of CNV and LCSH.

Results

The presence of LCSH was observed in all cases studied. Causative chromosome abnormalities, CNV and intragenic (exonic) CNV detected by SNP microarray technique were excluded from further analysis. Apart from individuals, who were the descendants of close consanguinity marriages, the amount of LCSH per patient varied between 63 and 132. Eight individuals (3 %) were descendants of close consanguinity marriages. Parental consanguinity was determined according to a methodology of a previous study by Fan et al. [32] and genealogic analysis. In these patients, LCSH have not spanned the loci of imprinted genes strongly associated with recognizable syndromes. A patient, who is a descendant of consanguinity mating, exhibited LCSH at 7p12, containing an imprinted gene GRB10. Since the involvement of GRB10 in SRS is questionable [33], we have excluded this case from further analysis.

LCSH at imprinted loci previously described as those of imprinting syndromes [3436] were found in 14 cases (5.2 %). These were 2 cases of LCSH at 7q21.3 (SRS), 4 cases of LCSH at 7q31.2, 3 (SRS) (Fig. 1), LCSH at 11p15.5 (SRS/BWS) (Fig. 2), and LCSH at 15p11.2 (AS/PWS) (Fig. 3). Molecularly, LHCS spanned the loci containing imprinted genes checked by the GENEIMPRINT database (http://www.geneimprint.com/site/genes-by-species.Homo+sapiens). Apart from a case of LCSH at 7q31.33q32.3 (~4.3 Mb in size), the remaining LHCS spanned DNA sequences varied from 1 to 1.6 Mb. No correlation with the size of LCSH and disease’s phenotype was observed. The size of LCSH was indicative for excluding whole-chromosome uniparental disomy as the mechanism for phenotypic manifestations in these cases. Table 1 summarizes data on molecular, chromosomal and clinical features of LCSH at the imprinted chromosomal regions.

Fig. 1
figure 1

Schematic depiction of chromosomal and genomic regions affected by LCHS at 7q21.3 (2 patients) and 7q31.2 (4 patients) using Affymetrix Chromosome Analysis Suite software screenshots

Fig. 2
figure 2

Schematic depiction of chromosomal and genomic regions affected by LCHS at 11p15.5 (11p15.5 and 11p15.4) (3 patients) using Affymetrix Chromosome Analysis Suite software screenshots

Fig. 3
figure 3

Schematic depiction of chromosomal and genomic regions affected by LCHS at 15q11.2 (5 patients) using Affymetrix Chromosome Analysis Suite software screenshots

Table 1 Summary of LCSH, associated clinical findings, and imprinted genes

Discussion

Epigenomic variations and instability are known to be associated with human diseases [812, 21, 24, 30]. Here, a primary imprinting defect (according to previous classification of imprinting defects [37]) is described. Along with chromosomal abnormalities and CNV (germline and somatic) [17, 3840], these epigenetic mutations can be considered as a common cause of neurodevelopmental diseases. There are no known epigenetic/epigenomic alterations detectable as common as LCSH at imprinted loci that can be considered as a causative for ID, autism or epilepsy. It is noteworthy, that LCSH are detectable by SNP chromosomal microarrays only [18, 20, 21, 32, 41, 42]. Although imprinting defects similar to LCSH can be also detected by molecular genetic approaches (methyl-sensitive polymerase chain reaction, bisulfite sequencing etc.) [36, 41, 43], these techniques are poorly effective for the detection because of their targeted nature. SNP chromosomal microarrays may lead to a 5 % improvement in etiological yield by uncovering LCSH at imprinted loci.

Clinically, 14 cases of LCSH spanning shortly the imprinted loci weakly resembled SRS, BWS, AS or PWS [10, 3436, 43, 44]. Additional phenotypic features have been observed, as well (Table 1). In total, phenotypic manifestations in these cases have not allowed attributing them to a specific imprinting syndrome providing speculations about causative relationship between unexplained ID, autism and epilepsy and LCSH at imprinted loci.

Since LCSH were observed in all the individuals of the cohort, we have compared our results with previous studies of clinical and unaffected populations [1323]. Identical LCSH in unrelated individuals were found to be confined to specific regions (i.e., 3p21 and 16p11.2p11.1). These were detected in the majority of patients. Consequently, we concluded that these LCSH are the result of a technological drawback. However, specific organization of these genomic loci can manifest as LCSH during SNP chromosomal microarray analysis. In the available literature, 7q21.3, 7q31.2, 11p15.5, and 15p11.2 genomic loci were not described as consistently affected by LCSH [1320]. Although Wang et al. 2015 [21] have reported large LCSH to affect 11p and 15q, these occasional cases are likely to represent rare cases of uniparental disomy associated with corresponding imprinting disorders. Accordingly, these epigenomic mutations are unlikely to represent the same short LCSH reported in the present study.

Mechanisms and consequences of LCSH are poorly understood [1315, 45]. Accordingly, the interpretation of these epigenetic mutations can represent a challenge. To solve the problem in silico, one can apply a variety of bioinformatic approaches to gene prioritization, which are known to be effective for uncovering functional significance of genomic and epigenomic variations [40, 46, 47]. Molecular testing for AS and PWS performed previously (fluorescence in situ hybridization-based and methylation analyses) [48] was not able to uncover these epigenetic mutations. Therefore, it is important to note that alternative empirical methods giving a solution to this problem do not currently exist.

Conclusions

This molecular cytogenetic and bioinformatic study shows for the first time that LCSH of 1–1.6 Mb in size at imprinted chromosomal regions (7q21.3, 7q31.2; 11p15.5; and 15p11.2) are relatively frequent (~5 %) among the children with intellectual disability, autism, congenital malformations and/or epilepsy. Thus, these epigenetic mutations appear to be common in neurodevelopmental diseases. Hence, to increase the diagnostic yield of SNP chromosomal microarrays, an additional consideration of shorter LCSH is warranted in children with intellectual disability, autism, congenital malformations and/or epilepsy.

Methods

Patients

Cases (n = 267) included in this study are a part of the Russian cohort of children with intellectual disability, autism, epilepsy and congenital anomalies partially described previously [4951]. Written informed consent was obtained from at least one of the patients’ parents.

SNP chromosomal microarray

CNV and LCSH were analyzed by CytoScan HD Arrays (Affymetrix, Santa Clara, CA) consisting of about 2.7 million markers for CNV evaluation and about 750,000 SNPs for LCSH analysis. The laboratory procedures have been previously described in detail [17, 18, 21, 51, 52]. CNV and LCSH were visualized by the Affymetrix Chromosome Analysis Suite software (ChAS analysis files for CytoScan® HD Array version NA32.3). Genomic localization was defined using NCBI Build GRCh37/hg19 reference sequence. Imprinted genes were checked against the Geneimprint database (http://www.geneimprint.com).

Bioinformatics

Bioinformatic analyses were performed using an original approach to gene and CNV prioritization as described in our earlier papers [46, 51, 53]. This procedure was performed to exclude the phenotypic effect of CNV and to confirm clinical relevance of LCSH. Briefly, the prioritization was performed using ontology-based gene filtering/ranking and fusion of data acquired from clinical, genomic, epigenetic, proteomic, and metabolomic databases as well as interactomic software.