Evolution often deals in genomic trade-offs: changes in the genome that are beneficial overall persist even though they also produce disease in a subset of individuals. Here, we explore the possibility that such trade-offs have occurred as part of the evolution of the human brain. Specifically, we provide support for the possibility that the same key genes that have been major contributors to the rapid evolutionary expansion of the human brain and its exceptional cognitive capacity also, in different combinations, are significant contributors to autism and schizophrenia. Furthermore, the model proposes that one of the primary genes behind this trade-off may not technically be “a gene” or “genes” but rather are the highly duplicated sequences that encode the Olduvai protein domain family (formerly called DUF1220). This is not an entirely new idea. Others have proposed that the same genes involved in schizophrenia were also critical to the rapid expansion of the human brain, a view that has been expressed as “the same ‘genes’ that drive us mad have made us human”. What is new is that a “gene”, or more precisely a protein domain family, has been found that may satisfy these requirements.
Evolution is opportunistic but also indifferent. Changes that become incorporated in a species’ genome need not be without detriment so long as they provide an overall benefit. A consequence of this is that evolution often deals in genomic trade-offs, where harmful effects in some individuals are outweighed by a greater advantage to others. As a result, human disease is sometimes an unfortunate by-product of evolutionary adaptations that have remodeled the genome to incorporate advantageous genomic changes at the expense of introducing disease-causing changes. The classic example of such a genomic trade-off is sickle cell disease and malaria resistance (Allison 1954). A hemoglobin genetic variant, when heterozygous, increases resistance to malaria in populations where the disease is endemic, providing a clear survival advantage. However, this same variant, when homozygous, produces sickle cell disease. Due to its protective effect against malaria, the variant persists in the population, despite its clearly deleterious effects in a subset of individuals.
This concept of genomic trade-offs may similarly apply to the brain and cognitive processes. Such a trade-off might resemble the following: the evolutionary benefits that have resulted from the enhanced cognitive capacity conferred by the human brain have been produced by genomic variants that themselves also give rise to cognitive disorders. This is not a new idea. Over two decades ago, Crow proposed that schizophrenia was a by-product of the key evolutionary events that produced the human brain and language, and that a major effect gene was involved (Crow 1995a, b, 1997, 2000). More recently, Burns has suggested that schizophrenia is an unfortunate by-product of social brain evolution (Burns 2004, 2006, 2007). Both share the view that this serious mental disorder constitutes a costly price paid by our species for our cognitive uniqueness. Their rationale can be expressed as follows: schizophrenia is a highly heritable neuropsychiatric disorder that, while maladaptive, nevertheless persists at high frequency (~ 1%) across virtually all human populations. However, if the disease is genetic and reduces fecundity, why have the underlying genes not been weeded out? This question has been called the central paradox of schizophrenia (Huxley et al. 1964), and Crow and Burns suggest a possible explanation: the causal disease genes also are highly evolutionarily adaptive and, as a result, the disease-causing sequences have been retained in the genome. They further suggest that, because of the high worldwide incidence, it is likely that the key genomic changes occurred prior to the migration of human populations out of Africa and their dispersal across the world. Thus, the crucial genomic variation would be ancient and shared among essentially all human populations. From these observations, they conclude that the key evolutionary benefit of these sequences is that they were critical to the evolution of the human brain.
An excerpt from Burns that quotes an impassioned passage from a novel (Faulks 2005) set in the late 1800s, amid the early days of psychiatry, conveys this rationale:
“The argument for natural selection has long been won, and its basic tenet is that only that which confers an advantage on the species is continually selected by the environment and therefore…perpetuated in the ‘genes’. Nature never selects against the benefit of the species; it cannot. Furthermore, these poor schizophrenic people have…losses in reproduction…Yet they survive at a constant level in the population. How can this be? It breaks the first law of Darwin! It can only be that a variant of that inheritance—the same units, but differently combined, so that they do not express themselves as illness—confer huge advantages. So huge that they compensate both for the misery of the illness, against species interests, and the reproductive failure of the afflicted! And what are these advantages? They are superior brain power, language, creative ability. The same ‘genes’ that drive us mad have made us human: in different combinations, I admit, but precisely, and in my view unarguably, the same particles of inheritance.”
While the cognitive trade-off theory has been proposed previously, new genomic data have become available in recent years that are providing additional support for this possibility. Here, we incorporate some of these new insights to make two testable proposals: (1) the same genes that were responsible for the evolution of the human brain are also a significant cause of autism and schizophrenia, and (2) primary genomic sequences behind this cognitive trade-off are those encoding the Olduvai protein domain family (formerly called DUF1220/NBPF) (Sikela and van Roy 2017).
DNA dosage insights linking autism and schizophrenia
One of the most striking events in the fields of autism and schizophrenia research occurred in 2008 when two independent groups published findings which showed that, of copy number variations (CNVs) in the chromosome 1q21 region, duplications were preferentially associated with autism (and macrocephaly), while reciprocal deletions were preferentially associated with schizophrenia (and microcephaly) (Mefford et al. 2008; Brunetti-Pierri et al. 2008). While both types of CNVs are observed in both disorders, this preferential distribution bias has since been confirmed in several other studies (Crespi et al. 2010). Such findings lent independent support to a previously proposed suggestion that some of the phenotypes of autism and schizophrenia could be viewed as diametric opposites (Crespi and Badcock 2008). Together, these results have several important ramifications:
The most plausible explanation for this unusual distribution is that there is a sequence (or sequences) within the 1q21 CNVs, the dosage of which contributes to these two disorders in opposite ways: high-dosage producing autism while low-dosage producing schizophrenia. Such a shared genomic location for autism and schizophrenia has also been found for three other genomic regions where deletions are associated with one disorder, while duplications are associated with the other (Crespi et al. 2010). These results suggest that autism and schizophrenia are related disorders and may have a shared underlying genomic etiology that involves opposite changes in the dosage of the same specific genes.
That 1q21 deletions and reciprocal duplications are associated with microcephaly and macrocephaly, respectively, implies that there is a sequence (or sequences) within these 1q21 CNVs, the dosage of which influences brain size in a dose-dependent manner (i.e., more copies produce a larger brain, while fewer copies produce a smaller brain). Such a conclusion also fits well with the fact that autism has been associated with increases in brain size/growth and neuron number (Courchesne et al. 2001, 2003, 2011), while schizophrenia has been linked with reduced brain size (Haijma et al. 2013). Such a pattern also implies that the underlying disease genes have a function related to brain size/growth.
Furthermore, if the critical gene(s) contained a highly duplicated and highly copy number variable coding sequence: (a) they would have the capacity to confer a broad spectrum of functional allelic variability that, in theory, could account for the wide range of symptom severity in the disorders and (b) they would have been missed in genetic disease studies because of the difficulty in measuring such sequences. Thus, if one were looking for a primary causal genomic sequence in these disease-associated CNVs, among the most likely would be genes, or parts of genes, that are highly duplicated and copy number polymorphic that have been missed by conventional genomic disease gene searches.
In addition, if such sequences were highly adaptive, e.g., also found to be important to the evolutionary expansion of the human brain, they would help to explain the central genetic paradox of schizophrenia, discussed above. Such a paradox can similarly be applied to autism: a genetic disorder that, while maladaptive, nevertheless persists at a high (> 1%) worldwide incidence (Power et al. 2013).
Do such gene sequences exist? We believe they do. In the following sections, we attempt to show that the sequences encoding the Olduvai protein domain family fulfill these criteria and do so better than any other previously examined sequences in the genome.
The pond is not dry
It has been reported (Crow 2011) that in 1988, when the power of genetic linkage and association strategies was becoming increasingly appreciated, there was great optimism that the genes underlying neuropsychiatric diseases such as schizophrenia were going to be readily found. This enthusiasm was reflected in the remark that “the thing about these techniques is that they have to work…you drain the pond dry and there are the genes” (Crow 2011). Yet, almost 30 years later, major effect genes for schizophrenia, autism, and other complex cognitive disorders and traits (including cognitive aptitude) continue to remain elusive. Indeed, the existence of a major gene for schizophrenia is currently thought to be “a pretty ridiculous idea” (Barron 2016). In addition, while numerous minor effect loci have been found, they explain only a small fraction of the expected genetic contribution associated with these conditions (Flint 2016). This “missing heritability”, despite efforts to dispel such concerns, continues to be missing.
A key persistent assumption behind the approaches that have been used to identify genetic variants underlying neuropsychiatric and neurodevelopmental diseases [GWAS, exome sequencing, and arrayCGH studies of biallelic CNVs (duplications and deletions that gain or lose a single copy)] has been that they survey the entire genome, and therefore, they “drain the pond dry”. However, that view is incorrect. While these approaches are “genome-wide”, they do not survey the entire genome. Even whole-genome sequencing approaches are limited in their ability to accurately assemble and measure changes in complex, highly duplicated genomic regions, and there are sequences in the genome, among them dynamic, high copy number gene coding regions, that are missed by each of these methods. Contrary to the original expectations, the pond is not “dry”. Notably, Olduvai sequences, which are among the most highly duplicated and copy number polymorphic coding regions in the human genome, have been unexamined by previous genomic studies employing these methods. In only a few instances, as discussed later, have these sequences been directly measured in genetic studies of cognitive capacity and disease (Dumas et al. 2012; Davis et al. 2014a, b, 2015; Searles Quick et al. 2015).
The cognitive genomic trade-off model of Olduvai domains
In an attempt to reconcile how the same gene sequences could underlie cognitive benefit and cognitive disease, we here make the following testable proposal: which, how, where, and when Olduvai copies change determines whether the consequences are beneficial, innocuous, or harmful.
The Olduvai protein domain family is composed of approximately 300 copies in the haploid human genome, each about 1.4 kb in length (Popesco et al. 2006; O’Bleness et al. 2012), and these are divisible into six primary subtypes based on sequence similarity: CON1-3 and HLS1-3. The domains are found on 23 Neuroblastoma Breakpoint Family (NBPF) genes (Vandepoele et al. 2005) where they occur as tandemly arranged units that vary from 6 to 60 copies per gene. The majority of these genes, including the ones that encode the greatest number of Olduvai copies, are found in 1q21.1–1q21.2, a 7 Mb region of chr1. There are 13 NBPF genes in this region and they are found interspersed among approximately 40 non-NBPF genes.
Thus, this region contains ~ 250 Olduvai copies that are both tandemly arranged and interspersed among non-Olduvai sequences. Such a genomic architecture would not only be prone to non-allelic homologous recombination (NAHR) events, but there would be a myriad of ways in which the Olduvai/NBPF sequences could recombine. Olduvai domains show the largest human lineage-specific increase in copy number of any coding region (approximately 165 copies have been added to the human genome since the Homo/Pan split) (O’Bleness et al. 2012), and are also highly variable in the human population, exhibiting a broad normal distribution (Fig. 1) (Davis et al. 2014b). These features have multiple important implications with respect to both disease and evolutionary adaptation:
Tandemly arrayed sequences are known to be highly variable, with a rate of change that is estimated to be 100,000 faster than that of single-nucleotide substitutions (Ellegren 2000). Small, tandemly arranged coding regions are also known to show accelerated copy number change and have been proposed to contribute to the rapid evolution of morphological traits (Fondon and Garner 2004). Such coding region amplifications and reductions, in contrast to simple single copy deletions or duplications, would have the potential to produce a broad continuum of variation in associated phenotypes.
The presence of 13 members of the same gene family (a family composed almost entirely of highly similar tandemly arranged repeated sequences, e.g., Olduvai copies) in close genomic proximity to one another (within 7 Mb) will be prone to multiple aberrant recombination events.
Such an architecture can be expected to often produce recombination events that are deleterious, e.g., when two NBPF genes undergo NAHR the intervening genes (non-NBPF) will either lose or gain a copy as a result. Such gene dosage changes, often involving several genes, will likely produce deleterious phenotypes.
If there is a sequence in this region, the increased dosage of which is evolutionary advantageous, such a recombinogenic genomic architecture will increase the probability that a dosage increase of the sequence will occur.
If this genomic architecture does lead to an evolutionarily beneficial increase in this sequence (e.g., enhanced survival), the recombinogenic architecture that produced it will be retained in the genome of the individuals who benefitted from the increased copies. In this manner, the beneficial effects will result in retention of the disease-prone genomic architecture. As long as increases in dosage of this sequence continue to be beneficial, such a phenomenon becomes cyclical and lends itself to recurrence. The self-perpetuating nature of this process can be depicted in a model (Fig. 2) (Dumas and Sikela 2009). Indeed, as more tandemly arranged copies are added to the same location (such as occurred with Olduvai sequences in the human genome), the propensity for variation and recombination would increase even further.
The above factors form the basis of a genomic trade-off model that describes how the beneficial cognitive effects of adding more Olduvai copies can simultaneously produce recurrent genetic diseases. While this possibility was previously suggested (Dumas and Sikela 2009), additional studies have appeared that have lent further support to this model. Incorporating these new findings, the following provides a more detailed explanation for how and why such events may have occurred (Fig. 3). The current model is based on genomic and phenotypic evidence. A key component of future work will be to ascertain the molecular function of Olduvai. There is at present limited evidence pointing to specific cellular functions; what is known will be briefly discussed later in the manuscript.
Beneficial and harmful effects of Olduvai variation
The following sections outline both the lines of evidence that support Olduvai’s role in evolution and disease, as well as related arguments that should be considered when formulating a model pertaining to genomic trade-offs.
Beneficial Olduvai effects: brain evolution and cognitive aptitude
As mentioned previously, sequences encoding Olduvai protein domains show the largest human lineage-specific increase in copy number of any coding region in the genome: humans have ~ 300 copies, great apes 90–130, monkeys 30–60, and all other mammals 1–9 (O’Bleness et al. 2012). Olduvai dosage is strongly correlated with an increase in brain size, neuron number, and several other brain size-related phenotypes (but not body size) among primates, and shows a robust linear association with neocortex volume among primate species (O’Bleness et al. 2012; Dumas et al. 2012; Keeney et al. 2014a, b; Zimmer and Montgomery 2015). Such a trend is fully consistent with previous evidence that the evolutionary increase in human brain size represents the extension of a process that began in early primate evolution (Goodman 1999; McKinney 2000) and that the human brain, while remarkable, is essentially a linearly scaled-up primate brain (Herculano-Houzel 2009, 2012a). As Goodman (Goodman 1999) pointed out:
“Features that we associate with being human did not just arise de novo in the past 6 million years since the lineage to humans separated from that to chimpanzees. Rather, some of the most striking human features, such as greatly enlarged brains and prolonged childhoods in social nurturing societies, have deep roots in our evolutionary history. Forty to 30 million years ago (Ma) neocortical portions of the brain increased in the two emerging branches of anthropoid primates—the platyrrhines (or New World monkeys) and the catarrhines. Within the catarrhine branch, additional marked enlargements occurred by 18–6 Ma in the lineage to the ancestors of modern hominids, and the largest neocortical increases occurred in the past 3 million years in the lineage to modern humans.”
In addition, there is increasing evidence that primate brain evolution has taken a unique mechanistic path: as the brain expanded from monkeys to apes to humans, neuron number increased, while, unlike in non-primate mammals, neuron size was kept constant (Herculano-Houzel et al. 2007; Herculano-Houzel 2009, 2012b). This primate-specific mode of brain expansion fits well with the fact that there was a burst of Olduvai copy number exclusively in the primate order: anthropoid primates (monkeys, apes, and humans) have 30–300 copies, while non-primate mammals have only 1–9 (O’Bleness et al. 2012).
Given that brain size among non-human primate species is also thought to be a predictor of cognitive ability (Deaner et al. 2007), it follows that Olduvai copy number also parallels primate cognitive capacity. As stated by McKinney, “Many of our mental abilities are largely attributable to extension of brain development to produce a proportionately scaled-up version of the ancestral ape brain” (McKinney 2000).
Within healthy human populations, increased Olduvai copy number is positively associated with increases in several brain size-related phenotypes (Dumas et al. 2012).
Olduvai copy number (subtype CON2) shows a linear association with measures of cognitive aptitude in two independent populations (Davis et al. 2014a, b). While the sample sizes employed in this study were modest, the effect sizes were unusually large. A similar association was also found with a different Olduvai subtype (HLS1) in a separate study (Davis et al. 2015).
Other genome-wide studies of cognitive aptitude have had minimal success, finding only minor effect loci that together account for only a small fraction of the expected genetic contribution (Rietveld et al. 2013, 2014; Callaway 2014; Sniekers et al. 2017). None of these studies directly tested the involvement of Olduvai copy number.
From an evolutionary perspective, Olduvai protein coding sequences are under strong positive selection, especially among primates (Popesco et al. 2006).
Autism has been associated with high intellectual function (Crespi 2016), and it has been reported that the incidence of child prodigy is increased in families in which autism occurs (Ruthsatz and Urbach 2012). To follow up on this, a genome-wide linkage study was undertaken to identify loci that were associated with both autism and prodigy (Ruthsatz et al. 2015). The strongest signal that was obtained was to a region on chr1 where the great majority of Olduvai copies map. While highly repeated sequences are typically invisible to conventional SNP-based analyses (Brahmachary et al. 2014), it is possible that the particular families used in this study contained informative SNPs that allowed these association to be detected.
Detrimental Olduvai effects: autism and schizophrenia
Olduvai copy number (CON1 subtype) shows a significant positive linear association with increasing severity of a primary symptom of autism, i.e., social impairment, and a suggestive association with diminished communicative skill (Davis et al. 2014b). These findings were subsequently replicated in an independent population (Davis et al. 2015). Thus, as Olduvai copy number increases, the severity of two of the primary symptoms of autism becomes progressively worse. These studies point to the possibility that Olduvai sequences influence autism severity and are doing so in a dosage-related manner (Fig. 3).
There is suggestive evidence that supports the possibility that the Olduvai effect size may be large in both autism and schizophrenia. This is based in part on the following observations. The method that was used to link Olduvai copy number with autism and schizophrenia severity (i.e., ddPCR) was limited in two ways: (1) it measured only one Olduvai subtype at a time (e.g., CON1), and (2) it measured only the global (total genome) copy number of the subtype (Davis et al. 2014b, 2015; Searles Quick et al. 2015). CON1 copies are found in almost all NBPF genes and these are dispersed over approximately 20 different genomic locations (Fig. 4). As a result, one would be following 20 different loci in each CON1 ddPCR experiment. What is remarkable is that, even with such a broadly based assay, modest sample sizes, and a highly complex phenotype, the positive associations between Olduvai copy number and autism symptom severity were robust enough to be detectable (Davis et al. 2014b) and successfully replicated (Davis et al. 2015). A similar rationale could be applied to the ddPCR studies that linked Olduvai copy number with schizophrenia severity and risk (Searles Quick et al. 2015), and with cognitive aptitude (Davis et al. 2014a). Thus, the fact that such associations are found even using a low-resolution assay, that much of the predicted heritability for these conditions remains unexplained, and that Olduvai sequences have not been directly tested in conventional genomic studies, raises the possibility that they may be major contributors to all three conditions.
Studies of Olduvai dosage variation support the view that autism and schizophrenia may be related disorders that have some opposing phenotypic characteristics (Fig. 3) (Crespi et al. 2009). In 1980, Crow suggested that schizophrenia could effectively be separated into positive and negative symptoms (Crow 1980). While the former included hallucinations, delusions and disorganized speech, the latter included flattened affect, poverty of speech, and anhedonia. As mentioned previously, Olduvai studies have found that CON1 copy number is linearly associated with autism severity (Davis et al. 2014b, 2015). Interestingly, in males with schizophrenia, increase in CON1 is also linearly associated with increase in the severity of negative symptoms, which overlap with the symptoms of autism. Conversely, CON1 decreases were associated with increased severity of positive symptoms in individuals with schizophrenia (Searles Quick et al. 2015), as were decreases in another subtype of Olduvai, HLS1. These findings support the aforementioned model that autism and schizophrenia may involve phenotypic extremes that are affected in opposite ways by dosage variation in the underlying genes, and that Olduvai dosage variation may play a role in influencing the severity of both disorders.
As mentioned above, CNVs that contain Olduvai sequences have frequently been linked to autism and schizophrenia. However, since 1q21 CNVs have been found in only a small fraction of cases, why would one suggest that Olduvai domains may be a major effect locus? We believe part of the answer is that the methods that have been used to detect genetic variation, including CNVs, would have missed which, how, and where Olduvai copies have changed in the genome. While arrayCGH studies that have been reported uncovered associations with autism and schizophrenia, they could only detect changes that involved relatively large genomic segments and were simple in nature (deletions or duplications of one or two copies). They would not be able to follow smaller, multi-allelic variations such as those known to occur with the Olduvai family. As a result, these kinds of variations remain unexamined in conventional studies of autism, schizophrenia, and cognitive aptitude.
Genomic sequences can range from highly stable to highly variable. Interestingly, it has been shown that there may be a window of variability between these extremes in which sequences can be stable enough to exhibit an obvious heritability yet vary enough that they cannot be reliably followed by SNP-based approaches or conventional linkage and association methods (Brahmachary et al. 2014). Olduvai sequences, which are known to vary considerably in copy number in humans, are excellent candidates to exhibit such behavior. In addition, their instability and capacity to rapidly change would also be consistent with the non-Mendelian nature of these disorders and that the diseases are sometimes discordant in monozygotic twins (Rosenberg et al. 2009; Ahn et al. 2014; Sandin et al. 2014; Colvert et al. 2015). In this regard, it is not surprising that Olduvai sequences have been shown to demonstrate a degree of somatic mosaicism and de novo variation.
Finally, the fact that conventional genetic studies of all three conditions (autism, schizophrenia, and cognitive aptitude) have only uncovered minor effect genes/loci that fall well short of accounting for the predicted genetic contribution is consistent with the notion that there are additional causal genes that have not yet been identified. The fact that Olduvai copies have not been examined in virtually all studies of autism, schizophrenia, and cognitive aptitude, together with the other features discussed above, makes them compelling candidates to explain a portion of the missing heritability associated with each condition.
When considering the detrimental effects of Olduvai in discussions of autism and schizophrenia, an obvious question is why no clear difference has been observed in global copy number between individuals with autism and controls (Davis et al. 2014b). This may also be explained by the nature of the ddPCR assay. Because the method only provides global copy number for CON1, it does not address which specific CON1 copies are changing, nor where in the genome such changes are occurring (e.g., in which specific NBPF genes) or whether variations are due to germline or somatic copy number changes. If such factors are critical to whether the effects produce disease or not, they will not be detected using ddPCR or similar methods that measure only global Olduvai copy number. And as discussed above, conventional assay techniques used in prior studies of these disorders have not examined Olduvai copy number variation.
Detrimental Olduvai effects: microcephaly and macrocephaly
It was previously mentioned that multiple reports have linked 1q21-associated duplications and reciprocal deletions to macrocephaly and microcephaly, respectively (Mefford et al. 2008; Brunetti-Pierri et al. 2008). While these CNVs contained a substantial number of genes, it was later noted that they also contained many Olduvai copies (Dumas and Sikela 2009).
Subsequently, a comprehensive analysis determined that, of the 50 genes located in the 1q21 region, Olduvai copy number showed the strongest association with brain size in individuals with 1q21-associated microcephaly and macrocephaly (Dumas et al. 2012). This finding was the first to demonstrate a clear deleterious effect to Olduvai copy number variation.
In an apparent contradiction to this trend, an individual was recently identified with a large (8.3 Mb) 1q21.1–1q21.3 duplication and microcephaly (Milone et al. 2016). However, the individual also had a 50 kb microdeletion embedded within the duplicated region which contained only two genes, both of which encode Olduvai domains: NBPF25P and NBPF23. Thus, in this case of microcephaly, which involved a duplicated segment encompassing more than 120 genes, the only genes that were deleted encoded Olduvai copies.
In reports of 1q21 duplications and deletions, the detected CNVs encompass numerous non-NBPF genes in addition to several NBPF genes that encode Olduvai domains. Thus, while Olduvai gains and losses may underlie changes in brain size in 1q21-associated macro/microcephaly, it is likely that the dosage change of the flanking (i.e., non-NBPF) genes is responsible for the diverse pathologies that have been found among these individuals.
The recombinogenic architecture of the 1q21 region that results from the many Olduvai copies that map there would be expected to produce many deleterious recombination events. This expectation is borne out by the reports of at least 12 different disorders that have been associated with 1q21 CNVs (Dumas et al. 2012).
Given the above considerations, our primary proposal to explain how Olduvai domains could be a key factor in both beneficial (cognitive capacity) and harmful (autism and schizophrenia) effects is as follows: which, where, how, and when Olduvai copies are changing determines whether beneficial, neutral, or harmful effects will occur. Such a model is not without precedent. The red and green opsin genes are also found tandemly arranged in the human genome and the high frequency of recombination between them is responsible for many peculiarities of red–green color vision in humans including the extraordinarily high frequency of color vision defects (Neitz and Neitz 2011).
Are there other genes that fit the cognitive trade-off model?
The overwhelming trend emerging from genetic studies of autism, schizophrenia, and cognitive aptitude has been that each of these conditions involves hundreds-to-thousands of small effect genes and/or loci. However, even using very large sample sizes, the identified loci fall far short of fully accounting for the expected genetic contribution to these conditions. For example, in the largest study of cognitive aptitude, involving over 76,000 individuals, 52 loci were identified, but they account for less than 5% of the expected genetic contribution to this phenotype (Sniekers et al. 2017). Similar results have been found for autism (Yuen et al. 2017) and schizophrenia (Crow 2011; Flint 2016). However, as mentioned previously, these conventional genome-wide studies do not look at the entire genome and none have directly measured Olduvai sequences.
One of the few examples of another gene that potentially fits the cognitive genomic trade-off model is the ARHGAP11B gene (Florio et al. 2015). While human-specific sequence variation in the gene has been implicated in promoting neuron increases, the gene is also found in a genomic location, 15q13, that has long been thought to harbor sequences that contribute to schizophrenia (Stefansson et al. 2008). However, in this example, human brain enlargement is proposed to be due to a single base change (Florio et al. 2016). Such a small change does not offer the genomic variability that would be needed to explain the continued, gradual, prolonged nature of primate brain expansion and the spectrum of symptom severity associated with schizophrenia (not to mention the range of severity associated with autism and the variation associated with cognitive aptitude…data that, in contrast, fits well with the extreme copy number variability of Olduvai in humans).
As mentioned, the possibility that there may be a major effect gene involved in schizophrenia has been proposed previously (Crow 1995b), and is in part based on Karlsson’s analysis of the incidence of schizophrenia in multigenerational Icelandic populations (Karlsson 1974, 1982, 1988). He states, “The segregation observed in certain large kindreds into branches with high and low rates of psychosis is very suggestive of a modified dominant transmission. The gene frequencies that would have to be postulated for modified recessive or polygenic inheritance would not permit such a pattern. A system based on a dominant principal gene, perhaps associated with other modifying genes, thus seems to be the most likely mode of transmission.”
While there were many fruitless searches for such a major gene, the lack of success may have been not because a major gene did not exist, but rather because the early genomic surveys that were carried out used methods that were very limited in scope. What were missed, then, and continue to be unexamined in current conventional genomic searches, are highly duplicated and dynamic sequences such as those that encode the Olduvai family.
The proposal that Olduvai sequences may represent a major effect locus for these conditions is not meant to imply that other genetic loci are not also involved. It is possible that other major effect loci have been missed, due to the same limitations that previously hindered the discovery of Olduvai. In addition, just as with any complex phenotype, normal brain function requires the action of multiple molecular components, any one of which, when faulty, will result in a sub-optimal outcome. Thus, if Olduvai variation is a primary contributor to these conditions, it is quite plausible that the many minor effect genomic variations that have been found may involve loci that affect the same pathways, perhaps upstream or downstream of Olduvai action. However, such loci will be scattered across the genome and, therefore, sorted independently. As a result, their contributions may only account for a small fraction of the expected genetic contribution to these conditions. A similar argument can be made with respect to the genetics of autism and cognitive aptitude.
Finally, it is worth noting that: (1) none of the identified genes or loci implicated in schizophrenia, autism, and cognitive aptitude show as great a capacity for functionally important allelic variation as the Olduvai family; (2) none show as great a human lineage-specific increase in copy number; and (3) none show a stronger correlation with the progressive, continual increase in brain size that is found during the evolution of the primate order and that has reached its most extreme form in the human lineage.
What key issues remain?
There are two primary needs related to Olduvai domains that should be addressed:
Understand Olduvai function.
From a functional perspective, little is definitively known regarding the cellular role of Olduvai. While some functional clues for these sequences have emerged (Popesco et al. 2006; Dumas et al. 2012; Zhu et al. 2017), their precise role in human brain function remains unclear. An early human study found that Olduvai domains are expressed in several tissues including at high levels in brain, where they show neuron-specific expression (for details, see Popesco et al. 2006). Previous work examining the cellular function and expression timing of Olduvai has suggested a role in promoting neuronal stem cell proliferation (Keeney et al. 2014a), findings that are quite compatible with the strong correlative data linking Olduvai dosage with brain size. A recent study similarly found that NBPF7 upregulation promotes cellular proliferation (Zhu et al. 2017). Interestingly, the majority of identified microcephaly disease genes encode centrosomal proteins (Bond and Woods 2006), and several lines of evidence have linked Olduvai function to the centrosome (Dumas et al. 2012). Prior studies have also attempted to elucidate NBPF and Olduvai function using techniques such as co-immunoprecipitation, transgenic mouse models, and in silico sequence analyses (Vandepoele et al. 2010; Keeney et al. 2015; Zhou et al. 2013), but few results have been replicated in the current literature.
Given that the domain family in humans has 300 haploid copies that are distributed among 23 dispersed human genes, it is not surprising that the sequences have been challenging to study at the functional level. However, we strongly discourage the use of genetic strategies that involve “humanizing” non-human primates as a means of understanding Olduvai function (Coors et al. 2010). Much new knowledge can be obtained without using living animal models. Progress in answering such questions may be aided by the availability of human iPS cells and the ability to precisely modify their genome.
Develop better genomic technologies for studying and measuring Olduvai domains and other highly duplicated sequences.
There are two reasons why this would be beneficial. First, no human or primate genome sequence assembly is complete. Regions that are highly duplicated are often not accurately assembled. Second, what is being missed is important. Some of the sequences that are being missed are likely to be key contributors to human disease and human evolution. Our ability to predict whether Olduvai changes are beneficial or harmful may depend on our ability to determine the precise nature of Olduvai variation in the genome.
The costly price of the human brain
Here, we provide support for a model that the Olduvai protein domain family may underlie a cognitive genomic trade-off in our species. Specifically, we propose that Olduvai sequences can play both a beneficial role (in brain evolution and cognition) and a detrimental role (in autism and schizophrenia), and which outcome occurs depends on which, where, how, and when copies are changing. This duality of effect can potentially account for the central paradox of why autism and schizophrenia, two genetic but maladaptive disorders, persist at high frequency across human populations. In addition, increases and decreases in Olduvai (e.g., CON1 subtype) copy number mirror the partially opposing phenotypic profiles that are associated with autism and schizophrenia. Finally, the high copy number and dynamic, polymorphic nature of Olduvai coding sequences makes them a rich source of unexamined functional allelic variation. As such, they represent an excellent candidate to explain why genome-wide studies of autism, schizophrenia, and cognitive capacity have not accounted for the predicted genetic contribution of these conditions.
Should this model be proven true, and our understanding of autism and schizophrenia requires that we study genes also involved in human brain evolution and cognitive capacity, we owe it to those afflicted with these disorders to do so. They, through no choice of their own, must carry these burdens. It may well be the steep price evolution has placed on the human brain, and we should feel compelled to show a greater compassion and support for those who have paid, and continue to pay, for its existence.
Ahn K, Gotay N, Andersen TM et al (2014) High rate of disease-related copy number variations in childhood onset schizophrenia. Mol Psychiatry 19:568–572. https://doi.org/10.1038/mp.2013.59
Allison A (1954) The distribution of the sickle-cell trait in East Africa and elsewhere, and its apparent relationship to the incidence of subtertian malaria. Trans R Soc Trop Med Hyg 48:312–318. https://doi.org/10.1016/0035-9203(54)90101-7
Barron D (2016) Getting past the “Shotgun” approach to treating mental illness. In: Scientific American https://blogs.scientificamerican.com/mind-guest-blog/getting-past-the-shotgun-approach-to-treating-mental-illness/. Accessed 16 Mar 2016
Brahmachary M, Guilmatre A, Quilez J et al (2014) Digital genotyping of macrosatellites and multicopy genes reveals novel biological functions associated with copy number variation of large tandem repeats. PLoS Genet 10:e1004418. https://doi.org/10.1371/journal.pgen.1004418
Brunetti-Pierri N, Berg JS, Scaglia F et al (2008) Recurrent reciprocal 1q21.1 deletions and duplications associated with microcephaly or macrocephaly and developmental and behavioral abnormalities. Nat Genet 40:1466–1471. https://doi.org/10.1038/ng.279
Burns JK (2004) An evolutionary theory of schizophrenia: cortical connectivity, metarepresentation, and the social brain. Behav Brain Sci 27:831–855
Burns JK (2006) Psychosis: a costly by-product of social brain evolution in Homo sapiens. Prog Neuro-Psychopharmacol Biol Psychiatry 30:797–814. https://doi.org/10.1016/j.pnpbp.2006.01.006
Burns J (2007) The descent of madness: evolutionary origins of psychosis and the social brain. Routledge, New York
Callaway E (2014) “Smart genes” prove elusive. Nature. https://doi.org/10.1038/nature.2014.15858
Colvert E, Tick B, McEwen F et al (2015) Heritability of autism spectrum disorder in a UK population-based twin sample. JAMA Psychiatry 72:415–423. https://doi.org/10.1001/jamapsychiatry.2014.3028
Coors ME, Glover JJ, Juengst ET, Sikela JM (2010) The ethics of using transgenic non-human primates to study what makes us human. Nat Rev Genet 11:658–662. https://doi.org/10.1038/nrg2864
Courchesne E, Karns CM, Davis HR et al (2001) Unusual brain growth patterns in early life in patients with autistic disorder: an MRI study. Neurology 57:245–254. https://doi.org/10.1212/WNL.57.2.245
Courchesne E, Carper R, Akshoomoff N (2003) Evidence of brain overgrowth in the first year of life in autism. JAMA 290:337–344. https://doi.org/10.1001/jama.290.3.337
Courchesne E, Mouton PR, Calhoun ME et al (2011) Neuron number and size in prefrontal cortex of children with autism. JAMA 306:2001–2010. https://doi.org/10.1001/jama.2011.1638
Crespi BJ (2016) Autism as a disorder of high intelligence. Front Neurosci 10:1–17. https://doi.org/10.3389/fnins.2016.00300
Crespi B, Badcock C (2008) Psychosis and autism as diametrical disorders of the social brain. Behav Brain Sci 31:241–261. https://doi.org/10.1017/s0140525x08004214
Crespi B, Summers K, Dorus S (2009) Genomic sister-disorders of neurodevelopment: an evolutionary approach. Evol Appl. https://doi.org/10.1111/j.1752-4571.2008.00056.x
Crespi B, Stead P, Elliot M (2010) Comparative genomics of autism and schizophrenia. Proc Natl Acad Sci USA 107:1736–1741. https://doi.org/10.1073/pnas.0906080106
Crow TJ (1980) Molecular pathology of schizophrenia: more than one disease process? Br Med J 280:66–68
Crow TJ (1995a) A theory of the evolutionary origins of psychosis. Eur Neuropsychopharmacol 5(Suppl):59–63
Crow TJ (1995b) A continuum of psychosis, one human gene, and not much else—the case for homogeneity. Schizophr Res 17:135–145
Crow TJ (1997) Is schizophrenia the price that Homo sapiens pays for language? Schizophr Res 28:127
Crow TJ (2000) Schizophrenia as the price that Homo sapiens pays for language: a resolution of the central paradox in the origin of the species. Brain Res Rev 31:118–129
Crow T (2011) The missing genes: what happened to the heritability of psychiatric disorders. Mol Psychiatry 16:362–364. https://doi.org/10.1038/mp.2010.92
Davis JM, Searles VB, Anderson N et al (2014a) DUF1220 copy number is linearly associated with increased cognitive function as measured by total IQ and mathematical aptitude scores. Hum Genet 134:67–75. https://doi.org/10.1007/s00439-014-1489-2
Davis JM, Searles VB, Anderson N et al (2014b) DUF1220 dosage is linearly associated with increasing severity of the three primary symptoms of autism. PLoS Genet 10:e1004241. https://doi.org/10.1371/journal.pgen.1004241
Davis JM, Searles Quick VB, Sikela JM (2015) Replicated linear association between DUF1220 copy number and severity of social impairment in autism. Hum Genet 134:569–575. https://doi.org/10.1007/s00439-015-1537-6
Deaner RO, Isler K, Burkart J, van Schaik C (2007) Overall brain size, and not encephalization quotient, best predicts cognitive ability across non-human primates. Brain Behav Evol 70:115–124. https://doi.org/10.1159/000102973
Dumas L, Sikela JM (2009) DUF1220 domains, cognitive disease, and human brain evolution. Cold Spring Harb Symp Quant Biol 74:375–382. https://doi.org/10.1101/sqb.2009.74.025
Dumas LJ, O’Bleness MS, Davis JM et al (2012) DUF1220-domain copy number implicated in human brain-size pathology and evolution. Am J Hum Genet 91:444–454. https://doi.org/10.1016/j.ajhg.2012.07.016
Ellegren H (2000) Microsatellite mutations in the germline: implications for evolutionary inference. Trends Genet 16:551–558
Faulks S (2005) Human traces: a novel. Random House, New York
Flint J (2016) Rare genetic variants and schizophrenia. Nat Neurosci 19:525–527. https://doi.org/10.1038/nn.4271
Florio M, Albert M, Taverna E et al (2015) Human-specific gene ARHGAP11B promotes basal progenitor amplification and neocortex expansion. Science 347:1465–1470. https://doi.org/10.1126/science.aaa1975
Florio M, Namba T, Pääbo S et al (2016) A single splice site mutation in human-specific ARHGAP11B causes basal progenitor amplification. Sci Adv 2:1–7. https://doi.org/10.1126/sciadv.1601941
Fondon JW, Garner HR (2004) Molecular origins of rapid and continuous morphological evolution. Proc Natl Acad Sci USA 101:18058–18063. https://doi.org/10.1073/pnas.0408118101
Goodman M (1999) The genomic record of Humankind’s evolutionary roots. Am J Hum Genet 64:31–39. https://doi.org/10.1086/302218
Haijma SV, Van Haren N, Cahn W et al (2013) Brain volumes in schizophrenia: a meta-analysis in over 18000 subjects. Schizophr Bull 39:1129–1138. https://doi.org/10.1093/schbul/sbs118
Herculano-Houzel S (2009) The human brain in numbers: a linearly scaled-up primate brain. Front Hum Neurosci 3:31. https://doi.org/10.3389/neuro.09.031.2009
Herculano-Houzel S (2012a) Neuronal scaling rules for primate brains: the primate advantage. Prog Brain Res 195:325–340. https://doi.org/10.1016/B978-0-444-53860-4.00015-5
Herculano-Houzel S (2012b) The remarkable, yet not extraordinary, human brain as a scaled-up primate brain and its associated cost. Proc Natl Acad Sci USA 109(Suppl):10661–10668. https://doi.org/10.1073/pnas.1201895109
Herculano-Houzel S, Collins CE, Wong P, Kaas JH (2007) Cellular scaling rules for primate brains. Proc Natl Acad Sci USA 104:3562–3567. https://doi.org/10.1073/pnas.0611396104
Huxley J, Mayr E, Osmond H, Hoffer A (1964) Schizophrenia as a genetic morphism. Nature 204:220–221. https://doi.org/10.1038/204220a0
Karlsson JL (1974) Inheritance of schizophrenia. Acta Pyschiatr Scan Suppl 247:1–116
Karlsson JL (1982) Family transmission of schizophrenia: a review and synthesis. Br J Psychiatry 140:600–606
Karlsson JL (1988) Partly dominant transmission of schizophrenia in Iceland. Br J Psychiatry 152:324–329
Keeney JG, Davis JM, Siegenthaler J et al (2014a) DUF1220 protein domains drive proliferation in human neural stem cells and are associated with increased cortical volume in anthropoid primates. Brain Struct Funct. https://doi.org/10.1007/s00429-014-0814-9
Keeney JG, Dumas L, Sikela JM (2014b) The case for DUF1220 domain dosage as a primary contributor to anthropoid brain expansion. Front Hum Neurosci 8:1–11. https://doi.org/10.3389/fnhum.2014.00427
Keeney JG, O’Bleness MS, Anderson N et al (2015) 2015 Generation of mice lacking DUF1220 protein domains: effects on fecundity and hyperactivity. Mamm Genome 26(1–2):33–42. https://doi.org/10.1007/s00335-014-9545-8
McKinney M (2000) Biology, brains, and behavior: the evolution of human development. School of American Research Press, Santa Fe
Mefford HC, Sharp AJ, Baker C et al (2008) Recurrent rearrangements of chromosome 1q21.1 and variable pediatric phenotypes. N Engl J Med 359:1685–1699. https://doi.org/10.1056/NEJMoa0805384
Milone R, Valetto A, Battini R et al (2016) Focal cortical dysplasia, microcephaly and epilepsy in a boy with 1q21.1–q21.3 duplication. Eur J Med Genet 59:278–282. https://doi.org/10.1016/j.ejmg.2016.03.003
Neitz J, Neitz M (2011) The genetics of normal and defective color vision. Vis Res 51:633–651. https://doi.org/10.1016/j.visres.2010.12.002
O’Bleness MS, Dickens CM, Dumas LJ et al (2012) Evolutionary history and genome organization of DUF1220 protein domains. G3 (Bethesda) 2:977–986. https://doi.org/10.1534/g3.112.003061
Popesco MC, Maclaren EJ, Hopkins J et al (2006) Human lineage-specific amplification, selection, and neuronal expression of DUF1220 domains. Science 313:1304–1307. https://doi.org/10.1126/science.1127980
Power RA, Kyaga S, Uher R et al (2013) Fecundity of patients with schizophrenia, autism, bipolar disorder, depression, anorexia nervosa, or substance abuse vs their unaffected siblings. JAMA Psychiatry 70:22–30. https://doi.org/10.1001/jamapsychiatry.2013.268
Rietveld CA, Medland SE, Derringer J et al (2013) GWAS of 126,559 individuals identifies genetic variants associated with educational attainment. Science 340:1467–1471. https://doi.org/10.1126/science.1235488
Rietveld CA, Esko T, Davies G et al (2014) Common genetic variants associated with cognitive performance identified using the proxy-phenotype method. Proc Natl Acad Sci 111:13790–13794. https://doi.org/10.1073/pnas.1404623111
Rosenberg RE, Law JK, Yenokyan G et al (2009) Characteristics and concordance of autism spectrum disorders among 277 twin pairs. Arch Pediatr Adolesc Med 163:907. https://doi.org/10.1001/archpediatrics.2009.98
Ruthsatz J, Urbach JB (2012) Child prodigy: a novel cognitive profile places elevated general intelligence, exceptional working memory and attention to detail at the root of prodigiousness. Intelligence 40:419–426. https://doi.org/10.1016/j.intell.2012.06.002
Ruthsatz J, Petrill SA, Li N et al (2015) Molecular genetic evidence for shared etiology of autism and prodigy. Hum Hered 79:53–59. https://doi.org/10.1159/000373890
Sandin S, Lichtenstein P, Kuja-Halkola R et al (2014) THe familial risk of autism. JAMA 311:1770–1777. https://doi.org/10.1001/jama.2014.4144
Searles Quick V, Davis JM, Olincy A, Sikela J (2015) DUF1220 copy number is associated with schizophrenia risk and severity: implications for understanding autism and schizophrenia as related diseases. Transl Psychiatry 5:e697. https://doi.org/10.1038/tp.2015.192
Sikela JM, van Roy F (2017) A proposal to change the name of the NBPF/DUF1220 domain to the Olduvai domain. F1000Research 6:2185. https://doi.org/10.12688/f1000research.13586.1
Sniekers S, Stringer S, Watanabe K et al (2017) Genome-wide association meta-analysis of 78,308 individuals identifies new loci and genes influencing human intelligence. Nat Genet 49:1107–1112. https://doi.org/10.1038/ng.3869
Stefansson H, Rujescu D, Cichon S et al (2008) Large recurrent microdeletions associated with schizophrenia. Nature 455:232–236. https://doi.org/10.1038/nature07229
Vandepoele K, van Roy N, Staes K et al (2005) A novel gene family NBPF: intricate structure generated by gene duplications during primate evolution. Mol Biol Evol 22:2265–2274. https://doi.org/10.1093/molbev/msi222
Vandepoele K, Staes K, Andries V, van Roy N (2010) Chibby interacts with NBPF1 and clusterin, two candidate tumor suppressors linked to neuroblastoma. Exp Cell Res 316(7):1225–1233. https://doi.org/10.1016/j.yexcr.2010.01.019
Yuen RKC, Merico D, Bookman M et al (2017) Whole genome sequencing resource identifies 18 new candidate genes for autism spectrum disorder. Nat Neurosci 20:602–611. https://doi.org/10.1038/nn.4524
Zhou F, Xing Y, Xu X et al (2013) NBPF is a potential DNA-binding transcription factor that is directly regulated by NF-κB. Int J Biochem Cell Biol 45(11):2479–2490. https://doi.org/10.1016/j.biocel.2013.07.022
Zimmer F, Montgomery SH (2015) Phylogenetic analysis supports a link between DUF1220 domain number and primate brain expansion. Genome Biol Evol 7:2083–2088. https://doi.org/10.1093/gbe/evv122
This work was supported by NIH grant R01 MH108684 (JMS) and Pilot Award 309230 from the Simons Foundation Autism Research Initiative (JMS). We thank the following individuals for helpful discussions: Hilary Coon, Jack Davis, Ilea Heft, Mark Johnston, and Amanda Law. We also thank Nate Anderson, Jack Davis and Ilea Heft for help with figures. Finally, we dedicate this manuscript to the many individuals and their families who are affected by autism, schizophrenia and other brain-related disorders.
Conflict of interest
On behalf of all authors, the corresponding author states that there is no conflict of interest.
About this article
Cite this article
Sikela, J.M., Searles Quick, V.B. Genomic trade-offs: are autism and schizophrenia the steep price of the human brain?. Hum Genet 137, 1–13 (2018). https://doi.org/10.1007/s00439-017-1865-9