Background

We recently reported that single de novo copy number variants (CNVs) in patients with developmental disorders frequently affected multiple functionally similar genes [1]. Since functional clusters are present in many eukaryotic species, including humans [2], a single CNV can affect multiple genes, potentially incurring additional deleterious effects due to epistatic interactions. Furthermore, the position of genes within a biological pathway may be relevant to the phenotypic impact of their disruption [3]. In this study, we investigated the roles that epistatic interactions and the network position of disrupted genes play in developmental disorders.

Materials and methods

We obtained a set of 626 de novo CNVs identified in patients with developmental disorders from the DatabasE of Chromosomal Imbalances and Phenotype in Humans using Ensemble Resources (DECIPHER). Employing a novel integrated functional linkage network, we examined clustering and importance (centrality within the network) of the genes affected by these de novo CNVs. A fast neighbor joining clustering algorithm was used to identify gene groups within each CNV defined as those linked by the top 1 % of shortest paths within the functional linkage network. To ensure functional clusters were not simply the result of tandem duplications, we collapsed paralogous genes into a single copy within the genome. Significant enrichments in clustering and central network position were used to build a predictive model able to scan the genome and identify similar regions whose copy number change may also predispose to disease.

Results

DECIPHER de novo CNVs were significantly enriched for large functional clusters with low within-cluster similarity compared with gene-number matched randomizations. Functional clusters were even more significantly enriched when paralogous genes were collapsed. Clusters were present in 357 of 626 CNVs, with an average size of four genes. In addition all measures of network centrality were significantly high, with average maximum betweenness (bottlenecks) the most significant. Bottleneck genes tended to be haplo-insufficient and highly pleiotropic when knocked out in mice. Functional clusters frequently contained bottleneck genes and these regions were frequently affected by CNVs in more than one patient.

Conclusions

DECIPHER de novo CNVs identify putative epistatic hotspots, which are clusters of functionally related genes whose disruptions are associated with developmental disorders. In addition, these hotspots are enriched in bottleneck genes that may play a role in the diversity of phenotypes observed for these patients.