Progress and prospects toward our understanding of the evolution of dosage compensation
- 854 Downloads
In many eukaryotic organisms, gender is determined by a pair of heteromorphic sex chromosomes. Degeneration of the non-recombining Y chromosome is a general facet of sex chromosome evolution. Selective pressure to restore expression levels of X-linked genes relative to autosomes accompanies Y-chromosome degeneration, thus driving the evolution of dosage compensation mechanisms. This review focuses on evolutionary aspects of dosage compensation, in light of recent advances in comparative and functional genomics that have substantially increased our understanding of the molecular mechanisms of dosage compensation and how it evolved. We review processes involved in sex chromosome evolution, and discuss the dynamic interaction between Y degeneration and the acquisition of dosage compensation. We compare mechanisms of dosage compensation and the origin of dosage compensation genes between different taxa and comment on sex chromosomes that apparently lack compensation mechanisms. Finally, we discuss how dosage compensation systems can also influence the evolution of well-established sex chromosomes.
KeywordsDosage compensation Sex chromosomes Evolution Chromatin Epigenetics
Chromatin ImmunoPrecipitation and Sequencing
- C. elegans
- D. busckii
- D. melanogaster
- D. miranda
Dosage Compensation Complex
Dependent On X
Long Interspersed Elements
Ligand of Numb-protein X
Long Terminal Repeat
MItosis and X associated
Males absent On the First
Messenger DeoxyRibonucleic Acid
Painting Of Fourth
Polycomb Repressive Complex
Recruitment Elements on X
RNA on the X
Sex determination and Dosage Compensation defect
Structural Maintenance of Chromosomes
Xist antisense RNA (Xist spelt backwards)
X Inactivation Center
X inactive specific transcript
The evolution of heteromorphic sex chromosomes and the pressure to increase X-linked expression in males
Species with separate sexes are widespread among animals and plants. Sex determination can vary widely within clades, with mechanisms as different as environmental and genetic determination occurring in related species, which indicates frequent de novo evolution of sex-determining mechanisms (Bull 1983). Heteromorphic sex chromosomes—such as the XY pair of mammals—have arisen independently from ancestral pairs of autosomes several times and display very similar characteristics, suggesting that their evolution followed similar steps, outlined below.
Y-chromosomes can arise when a male-determining gene appears on an autosome (in the case of male heterogamety, such as the mammalian system; the same principles apply in the case of female heterogamety; Charlesworth 1996). From that moment on, this newly formed sex chromosome (the proto-Y chromosome) is always transmitted to males; mutations that favour males are therefore predicted to accumulate there, even if they have deleterious effects in females. When a recombination event occurs between the proto-Y and the proto-X chromosomes, however, some of these male-beneficial mutations are relocated to the X, where they may be selected against in females. Selection therefore favours reduced recombination on the proto-Y chromosome to protect linkage between the male-determining gene and male-beneficial genes.
This reduced or absent recombination on the Y incurs a long-term cost for males: by reshuffling loci during meiosis, recombination creates a wide range of allele combinations for natural selection to act upon, thereby increasing the efficacy of natural selection (Hill and Robertson 1966). In the absence of recombination, all the sites on the chromosome are completely linked, forcing selection to act on a limited range of allele combinations. This has two main consequences: first, whenever a beneficial mutation is fixed in the population, all deleterious mutations located on the Y chromosome that carries the advantageous allele will become fixed simultaneously (Rice 1987). Second, purifying selection against mildly or moderately deleterious mutations will become less efficient on a non-recombining chromosome, resulting in their gradual accumulation on the Y chromosome (Charlesworth 1996). These evolutionary theories predict that after recombination is repressed, deleterious mutations in protein-coding genes and regulatory regions, and repetitive DNA—such as transposable elements—will quickly accumulate on the proto-Y. In species with relatively ancient sex chromosomes, like mammals or D. melanogaster, the Y chromosome has lost most of its original genes and consists to a large extent of repetitive junk DNA (Lahn et al. 2001). Genomic data from species that have only recently acquired their sex chromosomes have shown that the Y chromosome shows higher rates of amino-acid substitution (probably reflecting an accumulation of deleterious mutations) and frame-shift mutations, an accumulation of transposable elements and disrupted patterns of gene expression (Guttman and Charlesworth 1998; Charlesworth 2004; Liu et al. 2004; Bachtrog 2006; Bachtrog et al. 2008).
Different strategies of solving the dosage problem: up- and down-regulation of X chromosomes in mammals, worms and flies
The classical view of dosage compensation in mammals and C. elegans is that females halve the expression of X-linked genes to equalize gene expression between the sexes. Evolutionary biologists, however, noted early on that selection acts on individuals, so that a pressure to equalize expression differences between the sexes cannot account for the evolution of these compensation mechanisms. Furthermore, simple down-regulation of the X in females would only exacerbate the X/autosomal expression dosage difference, since dosage imbalances would then affect females as well as males (Gupta et al. 2006). Thus, it was suggested that there is in fact up-regulation of expression of X-linked genes, but that unlike in Drosophila, this up-regulation of X-linked genes was not male-specific but occurred in both sexes. In order to restore correct gene dose between the X and autosomes in females, repression of X-linked expression in females would have to evolve secondarily (Fig. 1). Charlesworth (1978; 1996) presented a verbal sketch of this model, and it was recently modelled theoretically. Assuming that males with a single active allele have suboptimal levels of X-linked expression and females with two active copies have the optimal level of gene expression, Engelstädter and Haig (2008) explored under which conditions increased X-linked gene expression in both sexes followed by X inactivation in females can evolve. They showed that up-regulation of X-linked genes in both sexes can evolve, as long as the fitness reduction in females is small relative to the fitness increase in males. Experiments in Drosophila support the notion that the deleterious effects of gene dosage reductions are generally more pronounced than those of similarly sized gene dosage increases (Lindsley et al. 1972). The theoretical prediction that inactivation or down-regulation of the X in females has evolved in response to up-regulation of the X in both sexes has recently been confirmed through exhaustive comparisons of X-linked and autosomal expression levels in mouse, humans and C. elegans. In particular, expression from X-linked genes in all sampled tissues is, on average, approximately equal to that from autosomal genes both in females and in males (Gupta et al. 2006; Nguyen and Disteche 2006; Lin et al. 2007). The actual mechanism behind this up-regulation of X-linked genes in both sexes is unknown, and molecular studies so far have focused on understanding the pathways controlling female repression of X-linked over-expression. It is possible that X-linked genes are selected individually for promoters with increased activity; however, examination of expression profiles in mouse germ cells and early zygotes suggests that up-regulation is absent from haploid germ cells but is rapidly established upon fertilization, suggesting a more elaborate, chromosome-wide regulatory mechanism (Nguyen and Disteche 2006).
Molecular mechanisms of dosage compensation: the importance of chromatin modifying complexes
While the mechanisms of dosage compensation differ markedly between taxa, each well-studied case involves the recruitment of a chromatin regulatory complex to modulate the expression on the entire X chromosome (Straub and Becker 2007). In D. melanogaster, a male-specific RNA-protein complex—the dosage compensation complex (DCC)—binds to the male X chromosome and doubles its expression, thereby readjusting the X to autosomal expression ratio. The DCC requires at least five “male-specific lethal” (MSL) proteins (msl-1, msl-2, msl-3, mle, and mof), and two (functionally redundant) non-coding RNAs, roX1 and roX2 (Park and Kuroda 2001). The DCC (or MSL-complex) binds almost exclusively to specific sites along the X chromosome, where it interferes with chromatin folding by acetylating lysine 16 of H4, a histone required for the folding of nucleosomal arrays into 30 nm fibres, possibly making the genetic material more readily accessible to the transcription machinery (Park and Kuroda 2001).
As noted above, the mechanisms resulting in non-sex-specific up-regulation of X-linked genes in mammals and worms are unknown, and molecular studies have instead concentrated on understanding the inactivation or down-regulation of X-linked transcription in females (or hermaphrodites). Mammals achieve down-regulation of X-linked genes in females by abolishing transcription of one copy of the X chromosome in each cell (Lyon 1961), through a process known as X-inactivation (see Payer and Lee 2008 for a detailed review of the molecular mechanisms involved). X-inactivation is controlled by one X-linked locus only, the X-inactivation center (XIC), which contains the two non-coding RNAs directly involved in mediating X chromosome inactivation, Xist and Tsix. Xist and Tsix are antisense RNAs (they originate from the same locus, but are transcribed in opposite directions) and function as mutual repressors of transcription. The Tsix copy located on the inactive X chromosome is repressed, so that Xist is highly transcribed and coats the X chromosome, recruiting the proteins that cause its heterochromatic state; on the active X, Tsix is expressed, and Xist expression is repressed. Two Polycomb repressive complexes, PRC1 and PRC2, are recruited to the X chromosome that is coated with Xist and modify histones, leading to a stably inactivated X (Straub and Becker 2007).
C. elegans females (or more accurately, XX hermaphrodites) have adopted a similar strategy to mammals, but halve the transcription rate of each X-chromosome instead of completely inactivating one copy (Meyer and Casson 1986). This requires the action of a dosage compensation complex formed by at least nine proteins, several of which are also involved in sex-determination (the sex determination and dosage compensation genes, or SDC genes). All the genes that form the C. elegans DCC are supplied maternally to both XX and XO embryos, with one exception: SDC-2 is expressed exclusively in XX embryos, where it mediates both the development of a hermaphroditic phenotype and the recruitment of the DCC proteins to the DCC (Dawes et al. 1999). A subset of these proteins then form a condensin-like subdomain, which is thought to be responsible for the chromatin modifications that cause the decreased expression levels on the X (Hagstrom and Meyer 2003).
Recognition of the X chromosome: cis-elements and spreading
Recognition of the X in Drosophila also follows a two-step model (Kelley et al. 1999); however, unlike in mammals, many more chromosome entry sites that recruit the DCC to the X chromosome in a sequence-dependent manner have been identified, and spreading occurs over much smaller distances (Fig. 2). Initial suggestions that the X chromosome of Drosophila possesses only 35–40 high-affinity “entry sites” that the DCC recognizes and binds to, followed by spreading in cis along the entire chromosome, were experimentally supported by translocations of identified X chromosome entry points onto autosomes. This led to DCC binding not only to the translocated locus, but often also to the adjacent autosomal regions (Kageyama et al. 2001). However, many X chromosome segments were shown to associate normally with the DCC when translocated onto autosomes, independently of whether they contained a described DCC entry site (Fagegaltier and Baker 2004). Recent experimental work using high-resolution ChIP-seq mapping has identified many additional chromosome entry sites, and supports a two-step model of DCC recruitment to the X chromosome of Drosophila. The DCC complex is thought to first target >150 (and up to 300) chromatin entry sites containing specific DCC recognition elements on the X chromosome in males (Alekseyenko et al. 2008; Sural et al. 2008). These entry sites appear to consist of combinations of degenerate DNA motifs enriched for repetitive sequences. After this initial, sequence-specific recognition step, local spreading from entry sites in cis along the X chromosome is thought to lead to DCC binding to the majority of actively transcribed genes on the X chromosome (Alekseyenko et al. 2008). Although secondary binding signals on the X may also play a role, cis spreading to the majority of X-linked genes occurs primarily through the recognition of H3K36-methylated nucleosomes (a general features of transcribed genes) by MSL-3, and is therefore largely independent of further sequence signals (Larschan et al. 2007; Sural et al. 2008).
Targeting of the X by the C. elegans DCC follows a similar approach to Drosophila, where DCC-recruiting signals of different affinities occur along the X chromosome. However, there appears to be a lower density of these binding sites on the X of C. elegans compared to Drosophila, implying spreading of the DCC over larger physical distances along the X chromosome in C. elegans (Fig. 2). Some large sections of the C. elegans X chromosome do not recruit the DCC when translocated to autosomes despite being associated with the DCC and normally dosage-compensated when X-linked (Meyer 2005). This suggests that their association with the DCC in their native X location results from spreading from flanking DCC-binding sequences over significant physical distances. Thus, a plausible model of X recognition in C. elegans involves widely spaced cis-acting DCC entry sites and significant spreading from these sites to coat the entire chromosome. Consistent with this model, two types of DCC-binding sites have been characterized experimentally in C. elegans (Jans et al. 2009): rex sites (recruitment elements on X) and dox sites (dependent on X). rex sites are the primary binding sites of the DCC on the X and consist of clusters of at least two small consensus motifs (McDonel et al. 2006). They recruit the DCC independent of their genomic location. The more frequent dox sites only bind the DCC when they are located on the X, and are thought to be responsible for the spreading of the DCC. They do not share the consensus motifs of rex sites, but do show an enrichment for a different G-rich 18bp motif (Jans et al. 2009). Since dox sites do not have the sequence signals required to independently recruit the DCC, it is possible that, like in Drosophila, other properties of the chromatin play a role in the secondary binding of the DCC (Jans et al. 2009).
What can we learn from these three examples of mechanisms of dosage compensation? First, as predicted since the sixties but only recently documented experimentally, all dosage compensation mechanisms operate to increase the expression of X-linked genes in males (instead of equalizing expression of X-linked genes between the sexes). If this is achieved by up-regulating X-linked expression through the action of a male-specific DCC, as in Drosophila, no further modulation of expression is necessary. If the expression of X-linked genes is up-regulated in a non-gender-specific manner, a second pathway is required to secondarily down-regulate expression in females.
A second interesting point is that, although they are not homologous, use different genes, and ultimately work in opposite ways, the C. elegans and D. melanogaster dosage compensation pathways share several characteristics. In these organisms, dosage compensation is achieved by means of sex-specific DCCs that originated through co-opting of genes involved in sex-determination. This is particularly visible in C. elegans, where the two pathways overlap repeatedly. In Drosophila, the Sxl gene, an RNA splicing enzyme, is expressed specifically in XX embryos, where it initiates the cascade leading to the female phenotype. One of the targets of Sxl is the mRNA of msl-2, a protein required for DCC assembly. In the presence of Sxl, the msl-2 mRNA is spliced into an inactive form, which effectively inhibits the formation of the DCC in females (Bashaw and Baker 1995; Kelley et al. 1995; Zhou et al. 1995). The specific targeting of the X chromosome in these two species is also similar, with initial binding of the DCC to high affinity entry points on the X, followed by spreading to adjacent regions, possibly assisted by secondary, lower affinity binding sites. This contrasts sharply with mammalian dosage compensation, which has neither a sex-specific DCC nor chromosome-specific targeting; instead, all chromosomes carrying the XIC locus are targeted, and it is the number of X-chromosomes in the cell that initiates Xist expression, not the sex in which they are located. These differences between mammals and C.elegans/D.melanogaster may reflect differences in the evolutionary history of their DCC; we discuss this possibility below.
The role of non-coding RNAs should also be emphasized, as they play a key role in the targeting of the X in both mammals and Drosophila. This is in line with recent literature highlighting the importance of non-coding RNAs for the regulation of gene expression (Mattick et al. 2009), although most of the known examples concern gene silencing, whereas in Drosophila the roX genes are used for upregulation. Finally, once the mechanisms that control X chromosome up-regulation in mammals and nematodes are characterized, it is well possible that further characteristics common to all three organisms will be identified.
The evolutionary origin of dosage compensation genes: recruitment of pre-existing regulatory complexes
As discussed above, dosage compensation differs greatly between the various organisms under study, and involves different genes. With the availability of whole genome sequences of several fly, nematode and mammalian species, another important question can be addressed: how are protein complexes created and recruited to perform newly evolved functions, such as dosage compensation in response to Y degeneration? While the mechanics of dosage compensation differs markedly between animal lineages, in each well-studied case it involves the recruitment of a pre-existing chromatin regulatory system to modulate expression of an entire X chromosome. The proteins that participate in these chromatin-modifying complexes appear to have an ancestral association that predates their role in dosage compensation. Thus, the evolution of dosage compensation does not require the emergence of novel regulatory proteins but instead the evolution of a recruiting mechanism that targets pre-existing regulatory complexes to a specific chromosome.
Several proteins of the Drosophila DCC have conserved homologs in nematodes, mammals and even in yeast (Hilfiker et al. 1997; Eisen et al. 2000). A complex that contains all msl protein homologs but one (mle, which may have gone undetected since it appears to have a more peripheral association with the complex in flies) has been detected in mammals (Smith et al. 2005). The human MSL-complex homolog is also involved in acetylating histone H4 at lysine 16, but binds to all chromosomes, and in both sexes, suggesting that a housekeeping complex present ancestrally was co-opted for dosage compensation in Drosophila. MOF, the DCC protein directly responsible for the acetylation of histone H4, has recently been found to perform the same function on the promoter region of genes located on the X and autosomes of both sexes in Drosophila (Kind et al. 2008). It therefore appears possible for an epigenetic pathway to be recruited for dosage compensation while maintaining its original regulatory function. The recruitment of this epigenetic complex for male-specific up-regulation of X-linked genes in Drosophila is likely to have occurred relatively rapidly, as DCC-protein homologs are not involved in dosage compensation of another Dipteran insect, Sciara ocellaris (Ruiz et al. 2000), although the MSL complex has been shown to mediate dosage compensation in several other Drosophila species. An interesting recent discovery regarding chromosome-wide regulation in fruit flies involves an autosome-specific protein, POF (Painting of Fourth). POF is a putative RNA binding protein that specifically binds the tiny fourth chromosome of both sexes in D. melanogaster (Larsson and Meller 2006). Surprisingly, in D. busckii—a species in which the fourth chromosome has become fused to the ancestral X chromosomes—POF paints the entire X chromosome in males only, suggesting that POF is participating in dosage compensation, while no MSL binding to the male X was detected (Larsson et al. 2000). This suggests that an alternative system of dosage compensation may be used even within the genus Drosophila. Haddrill et al. (2007) found that genes located on the D. melanogaster fourth chromosome have, on average, higher levels of expression than genes on other chromosomes, which they interpret as a response to reduced gene activity caused by the accumulation of deleterious mutations on this non-recombining chromosome. However, increased expression of genes on chromosome four could also result from their association with POF, if POF-related mechanisms were ancestrally responsible for achieving dosage compensation in Drosophila.
Unlike the msl proteins, roX1 and roX2 are not present in the human MSL-complex. The non-coding RNAs have been shown to be essential for specific DCC binding to the X chromosome (Li et al. 2008), indicating that the recruitment of the roX genes to the MSL-complex may have been a crucial step in the acquisition of dosage compensation in Drosophila. The origin of roX1 and roX2, however, remains a mystery, as no homologs have been found in the genome of the Dipteran insect Anopheles gambiae (Inagaki et al. 2005). Even within the genus Drosophila, not all species appear to have both roX genes (Park et al. 2007). Since roX1 and roX2 are functionally redundant in D. melanogaster, this may not be surprising, and identifying non-coding genes based on sequence conservation poses a general difficulty. However, despite low homology between roX genes identified in various Drosophila species, male-specific expression and X chromosome-specific binding of roX genes are conserved (Park et al. 2007).
A similar recruitment of ancestral epigenetic mechanisms that regulate chromatin compaction to perform dosage compensation functions occurred in C. elegans. In this species, three proteins of the DCC (DPY-26, DPY-27 and DPY-28) show similarity to proteins of the 13S condensin (Chuang et al. 1994; Lieb et al. 1996; Hagstrom and Meyer 2003), a mitotic complex involved in chromatin compaction and nucleosome resolution that is conserved from prokaryotes to eukaryotes (Hirano 1999). Another protein of the DCC, MIX-1, is a member of both the DCC and the 13S condensin complex (Lieb et al. 1998; Hagstrom et al. 2002). In addition to being similar to a condensin protein, DPY-28 participates in crossing over regulation in the germline, further highlighting the dual function of some DCC proteins (Tsai et al. 2008). Additional DCC subunits confer sex specificity to the dosage compensation process and recruit the condensin-like DCC subunits to X chromosomes.
Understanding the origin of Xist and Tsix proved challenging, as these non-coding RNAs are found only in eutherian mammals (Duret et al. 2006), where they are conserved (Chureau et al. 2002). Recently, a region homologous to the mammalian XIC region has been identified in the chicken genome (Duret et al. 2006). Although the genes flanking the XIC were found to be syntenic in these species, no RNA gene was found to correspond to Xist; instead, a protein coding gene of unknown function, Lnx3, was identified in the expected Xist region. Further analysis showed that, although Xist is much larger than Lnx3 and contains no protein coding sequence, some of its exons present significant similarity to Lnx3, indicating that Xist originated from Lnx3 (Duret et al. 2006). This has occurred at least partly through the acquisition of additional transposable-element-derived exons (Elisaphenko et al. 2008). Similarly, other XIC genes appear to have evolved by pseudogenization of coding genes of the ancestral XIC region (Elisaphenko et al. 2008).
Several lines of evidence suggest that transposable elements (TEs) played an important role in the acquisition of dosage compensation. As mentioned above, several XIC genes evolved by accumulating TE repeats in their coding sequence. Lyon (1998) also postulated that LINE elements may play a role in the spreading and stabilizing of the Xist RNA on the X chromosome, a theory supported by the higher frequency of LINE elements on the human X than on the autosomes, particularly around the XIC (although this accumulation of LINE elements is not detected in the mouse XIC region). A direct demonstration that dosage compensation and transposable elements are closely related in Drosophila was provided by Matyunina et al. (2008), who showed that MOF, one of the DCC proteins of Drosophila, is also involved in the repression of the copia LTR retrotransposon. This close relation between transposable elements and dosage compensation may reflect the history of epigenetic mechanisms, which are thought to have originated primarily as a defence against transposable elements (Matyunina et al. 2008).
The evolution of dosage compensation: dynamic interactions between Y degeneration and dosage compensation
In addition, the temporal dynamics of Y degeneration is expected to change over time, due to—among other factors—changes in the number of functional genes on a degenerating Y chromosome. It has been shown that this reduction in functional genes can result in different processes of Y degeneration dominating different evolutionary stages in the transition of a gene-rich proto-Y chromosome into a degenerate Y (Bachtrog 2008a). In particular, purifying selection to remove mildly deleterious mutations from the population may be particularly ineffective on a young, gene-rich Y chromosome, while selective sweeps driving the fixation of more strongly deleterious mutations may become more important at later stages of Y degeneration (Bachtrog 2008a). This could favour a block-model of dosage-compensation in the early stages of sex chromosome evolution, and select for genes becoming compensated individually later on. Again, however, the exact temporal dynamics of Y degeneration depends on many poorly known evolutionary parameters, making quantitative predictions on how we would expect dosage compensation to evolve difficult.
Although both types of interference effects are likely to participate in the degeneration of the Y chromosome after recombination is repressed, Charlesworth (1996) pointed out that different populations might be affected to different extents by the two effects depending on their effective population size. Species with smaller effective population sizes, such as mammals, are likely to be strongly affected by the accumulation of moderately deleterious mutations that are not being efficiently removed from the population. Flies and nematodes, on the other hand, have much larger effective population sizes (the current effective population size of C. elegans is quite small, but evidence suggests that this is due to a recent reduction in effective population size from a large ancestral population; Cutter 2006) and may therefore have evolved to a greater extent through the hitchhiking of more strongly deleterious mutations together with beneficial alleles. As pointed out above, mammalian dosage compensation does indeed have characteristics that set it apart; interestingly, X-inactivation seems to be primarily a chromosome-wide mechanism, with a few genes escaping X-inactivation. Dosage compensation in C. elegans and D. melanogaster appears to be much more localized, with a multitude of binding sites located along the X. This is in agreement with this theory of many weakly deleterious mutations accumulating on the proto-Y chromosome in mammals at many genes, selecting for large blocks on the X to be compensated simultaneously, while more strongly deleterious mutations fixing by hitchhiking with beneficial alleles in C. elegans and Drosophila select for more localized mechanisms of dosage compensation.
It should be noted, however, that it is of course possible that the repression of X-linked expression in mammalian females initially evolved on a gene-by-gene basis, and that a chromosome-wide mechanism of X-inactivation was secondarily adopted. Furthermore, nothing is known about the molecular basis for the initial up-regulation of X expression in mammals affecting both sexes, which may also be regulated independently for each gene. However, it appears sensible for the two mechanisms (up-regulation of the X in both sexes, and inactivation of the X in females) to evolve in parallel, and chromosome-wide mechanisms for up-regulation of X genes in mammals are supported by experimental data (Nguyen and Disteche 2006).
In general, mechanisms of dosage compensation that have been well-established for millions of years only provide limited information on how they were created in the first place. An approach that has proved useful in understanding the early steps of sex chromosome evolution consists in studying sex chromosomes that have only recently evolved. These can fall into two categories: true newly evolved sex-chromosomes, and autosomes that have become fused to sex-chromosomes (neo-sex chromosomes). Several neo-X chromosomes in the genus Drosophila have evolved dosage compensation over some or all of their length, by co-opting the existing dosage compensation machinery. For example, D. miranda has a neo-sex chromosome system that was formed only about 1 million years ago, and roughly half of the genes originally present on the neo-Y are already degenerate (Bachtrog et al. 2008). This massive degeneration of gene function on the neo-Y (involving over 1000 genes) has triggered an evolutionary response in its former homolog, the neo-X, which is already partly recruiting the molecular machinery necessary for dosage compensation. Targeting of the DCC to the neo-X chromosome of D. miranda must have involved the recent (adaptive) fixation of many de novo binding sites for the dosage compensation machinery. Interestingly, rates of adaptive evolution on the neo-X chromosome of D. miranda were shown to be about 10-fold higher than background levels of adaptation in the genome (Bachtrog et al. 2009), consistent with the action of positive selection at many genomic regions to acquire binding sites for the DCC on the neo-X.
The mode of how dosage compensation evolved in response to mutation accumulation on the Y can potentially affect the process of Y degeneration itself (Fig. 3). In particular, if dosage compensation evolves in a block-by-block manner, large segments of a proto-Y chromosome may become functionally redundant. Both genes which have only very slightly impaired functions and genes that are fully functional on the Y chromosome could become dosage compensated on the X and may therefore decay neutrally on the Y. Thus, under a block-model of dosage compensation, the rate of Y degeneration will be greatly increased once a genomic region acquires dosage compensation. In fact, under this scenario it may actually become beneficial to down-regulate either mal-adapted or fully functional genes on a proto-Y chromosome which are dosage compensated on the X, to restore proper gene dose for these genes in males. Genes could become down-regulated individually by mutations in their regulatory sequences or by transposable element insertions, or large regions of evolving Y chromosomes could become transcriptional silenced simultaneously by epigenetic modifications, such as heterochromatinization. Thus, under a block-by-block model of dosage compensation, Y degeneration, at some time point, will become an adaptive process. Interestingly, the neo-Y chromosome of D. miranda has been shown to have undergone recent positive selection (Bachtrog 2004). While it is of course possible that some genes evolved male-beneficial functions, this could also reflect adaptive down-regulation of neo-Y genes that are dosage compensated on the neo-X. Further studies of sex chromosomes that are in the process of acquiring dosage compensation will help uncover the dynamic interactions between Y degeneration and the evolution of dosage compensation.
The glitch in the theory: lack of dosage compensation in ZW systems
We have focused so far on the most frequent sex-chromosome system, male heterogamety. Female heterogamety (males are ZZ, females are ZW), the system found in birds, moths and butterflies, provides an interesting complement to the previous studies, as it allows us to examine separately the consequences of always being transmitted through males (which often have, for instance, a higher mutation rate) from other predictions of sex chromosome evolution theory that are expected to affect both systems equally. In agreement with theories of sex chromosome evolution (see above), W chromosomes are very similar to Y chromosomes: they contain few functional genes and are mostly heterochromatic, have low sequence diversity and high rates of non-synonymous divergence compared with the autosomes (Berlin and Ellegren 2005; Berlin et al. 2007; Vitkova et al. 2007). Since most genes on Z-chromosomes lack a homolog on the W, these same theories therefore predict that Z-linked genes should also be dosage-compensated in females.
An early examination of chromatin condensation showed no difference between Z chromosomes and autosomes in birds (Cock 1964), which suggested absence of dosage compensation (in male heterogametic systems, dosage compensation is detectable by differences in chromatin compaction of dosage-compensated X chromosomes). Small-scale studies of gene expression at few loci in birds and butterflies reached similar conclusions (Johnson and Turner 1979; Baverstock et al. 1982), leading to the puzzling suggestion that ZW systems may not have evolved dosage compensation.
More recently, female and male expression levels were systematically compared in birds using microarray studies (chicken and zebrafinch) and a few Z-linked genes were in fact shown to be dosage compensated. However, the main conclusion of these genome-wide studies remained that in birds dosage compensation seems to be the exception rather than the rule, and most Z-linked genes are expressed at significantly higher levels in males than females (Ellegren and Parsch 2007; Arnold et al. 2008). A recent expression analysis of over 500 genes located on the Bombyx mori (silkworm) Z chromosome detected a similar pattern in this species, with a vast majority of Z-linked genes presenting male-biased expression levels (Zha et al. 2008).
Several explanations have been put forward to account for this puzzling observation. Since not all genes function in a dosage dependent manner, some organisms with heteromorphic sex chromosomes will simply not require the development of chromosome-wide mechanisms of dosage compensation, independently of which sex is heterogametic (Graves and Disteche 2007). In particular, the necessity of evolving dosage compensation may depend on both the types of genes initially present on the sex chromosomes, and the number of genes located on the degenerating Y. Experimental data suggest that hemizygosity at a few genes or small genomic regions is usually tolerated, while hemizygosity over extended segments or entire chromosomes is lethal. If Z chromosomes of birds and Lepidoptera are generally gene-poor, a lack of dosage compensation in these species could simply result from sufficient buffering of the genome against small regions of hemizygosity, accompanied by individual up-regulation of the few genes that are dosage-sensitive. X chromosomes of Drosophila, C. elegans and humans are gene-rich, harbouring roughly 2300, 3100, and 1500 genes, respectively. The Z chromosomes of both Bombyx and chicken contain slightly fewer genes, with roughly 840 and 600 protein-coding genes identified on the chicken and the silkworm Z chromosome, respectively. However, hemizygosity of several hundred genes is usually lethal in Drosophila, suggesting that the lack of dosage compensation in chicken and Bombyx does not simply result from a low gene number on the Z. Another possibility is that these ZW pairs are still in the early stages of dosage compensation evolution, after up-regulation of the Z in both sexes but before the increase in expression of X- or Z-linked genes becomes sex-specific (see above). Data obtained for birds, however, do not support this idea: first, the Z and W chromosomes of birds diverged approximately at the same time as the mammalian X and Y (over a hundred million years ago), and before the Drosophila sex chromosomes. Second, the Z/Autosomal ratio of expression in chicken and zebrafinch males is approximately 1 (Ellegren et al. 2007; Itoh et al. 2007), which is not consistent with an chromosome-wide increase of expression on the Z.
Other theories, including sexual antagonism, male-biased mutation rates and increased egg size in ZW species, view the lack of dosage compensation as a consequence of female heterogamety. (a) Sexual antagonism: It has been shown theoretically and empirically that sexual selection, as well as sex-specific selective forces, can lead to different accumulation of genes with sex-specific functions on the sex chromosomes (Rice 1984). Thus, it is possible that Z-linked genes are expressed at higher rates in males than in females not because dosage compensation is inefficient, but because these are genes with male functions. Sexual selection theory suggests that male secondary characters would more likely be established in species with male homogamety (i.e. ZZ males) than those with male heterogamety (i.e. XY males; Reeve and Pfennig 2003; Albert and Otto 2005), and comparative analysis among a list of taxa suggests that male secondary characters are more exaggerated in ZZ/ZW systems than in XX/XY systems (Reeve and Pfennig 2003). In addition, avian Z-linked genes show evidence of accelerated rates of functional evolution, while no such effect was found for X-linked genes in mammals (Ellegren 2009). Thus, the higher levels of sex-linked gene expression in males may favour Z-linkage of genes under sexual selection, and this effect may be more pronounced in ZZ/ZW genetic systems than XX/XY systems. (b) Male-biased mutation: In many organisms, the production of female and male gametes requires different numbers of cell divisions. Since most mutations occur during cell division, this can lead to differences in male and female mutation rates. A male-biased mutation rate has been observed in mammals and birds, and is associated with increased mutation rates on the Y, while female-limited W chromosomes have lower rates of mutation than other chromosomes (Miyata et al. 1987). This effect could slow down the degeneration of the W chromosome, allowing for sex-specific dosage regulation to evolve on a gene by gene basis. Although the role of mutation rates on the degeneration of Y chromosomes has been quantified (Engelstädter 2008), the extent to which this affects ZW systems remains to be determined. This argument is further complicated by the fact that in many species (including birds and mammals), recombination between the proto-sex chromosomes is not restricted simultaneously over their entire length, but instead different genomic regions along the evolving sex chromosomes—containing only a subset of the genes—stop recombining at different evolutionary time points. Reducing the number of genes within the non-recombining segments of the proto-sex chromosomes has the same effect as reducing the chromosome-wide mutation rate on an evolving Y (or W) chromosome, thereby also reducing the speed of Y degeneration (Bachtrog 2008a). Thus, the effect of reduced mutation rates in ZW systems is confounded by the number of genes present in each non-recombining strata of an evolving sex chromosome pair, of which we only have limited knowledge. (c) Megalecithal eggs: Female-heterogametic species tend to have large eggs (e.g. birds) that contain large amounts of maternal mRNAs. Chandra (1991) pointed out that this could counteract the loss of W-linked genes that are required for early development, because females can produce and store large quantities of these mRNAs in the egg and transmit them to embryos. In species with smaller eggs, such as Drosophila, maternal RNAs are also crucial for zygotic and immediate post-zygotic development but are soon replaced by locally produced mRNAs; the abundance of maternal mRNAs of ZW organisms may be sufficient for embryonic transcription to be put on hold until morphogenesis has been determined, thereby avoiding dosage problems at this crucial stage (Chandra 1991). This theory assumes that gene dose is more crucial at early stages of development, an assumption that has not been explicitly tested.
The extent to which each of these hypotheses affects the evolution of Z chromosomes remains to be determined. Furthermore, since male and female expression levels of Z-linked genes have only been analysed globally in two independently evolved ZW systems, it is unclear if the lack of dosage compensation in birds and butterflies is a coincidence or if it reflects true differences in the evolution and physiology of ZW organisms. Further studies of female heterogametic taxa—as found in many reptile species—will help clarify this issue. Additionally, little is known about dosage compensation in plants that have heteromorphic sex chromosomes. Plants may generally be more tolerant to gene dose imbalances (for example, they are often polyploid and many genes occur in gene families), suggesting that they could also lack dosage compensation mechanisms. Future studies on levels of gene expression in plants with heteromorphic sex chromosomes will be of great interest to determine the extent of dosage compensation in these systems.
Dosage compensation and the evolution of well-established sex chromosomes
Most of the work reviewed here concerns the early evolution of sex chromosomes and the acquisition of dosage compensation mechanisms to counter-balance gene dose deficiencies caused by the degeneration of the Y chromosome. It is, however, worth noting that dosage compensation can also affect the current evolution of established sex chromosomes, both at sequence divergence and gene movement levels, and this may provide an explanation for some of the peculiar patterns of evolution observed on X chromosomes (Vicoso and Charlesworth 2006).
The presence of multiple DCC-binding sites along the X chromosomes of Drosophila and Caenorhabditis is likely to constrain the evolution of non-coding sequence. If binding sites are conserved within these clades, the accumulation of mutations that disrupt them will be prevented, so that non-coding sites are expected to evolve more slowly on the X chromosome than on autosomes. The extent of this bias clearly depends on the total number and size of dosage compensation binding sites along the X, and how easily DCC-binding sites can be lost and gained in the genome (cis binding sites often show surprisingly fast rates of turn-over between species). An unexpected finding that emerged from D. melanogaster polymorphism studies is that there is widespread species-specific positive selection on msl genes in this taxa (Rodriguez et al. 2007). The DNA-binding region of the MSL complex, in particular, has accumulated several adaptive changes, and it has been suggested that this may be coupled with adaptive changes at the DCC-binding sites of the X chromosome (Rodriguez et al. 2007). Indeed, a population genetics study of three well-characterized binding sites on the X chromosome has detected positive selection at these regions in D. melanogaster, consistent with the idea of adaptive co-evolution between the DCC proteins and their binding sites (Bachtrog 2008c). The selective pressure underlying this co-evolution is unknown, but could involve some conflict scenarios such as male-killing bacteria that specifically detect components of the MSL complex or defence against transposable elements. It will be of great interest to investigate patterns of polymorphism at many more recently identified DCC binding sites on the X of D. melanogaster, to confirm the finding of adaptive co-evolution between DCC proteins and their binding sites. Furthermore, gene expression studies in hybrids may reveal whether X-linked genes are more likely to be miss-expressed relative to autosomal genes, due to incompatibilities between DCC proteins and their binding sites. Problems in dosage compensation in species hybrids could help to explain the empirical observation that in species hybrids the heterogametic sex is more likely to suffer from sterility or inviability (known as Haldane’s rule).
A further confounding effect derives from the fact that X-chromosomes in several species of Drosophila tend to be more diverged at all sites, synonymous, non-synonymous and all classes of non-coding (Begun et al. 2007). It is possible that all X-linked sites are under stronger positive selection and/or weaker negative selection than the autosomes. It seems however more likely that the mutation rate on the X chromosome is higher than on the autosomes. As discussed before, mutation rate differences on sex chromosomes versus autosomes could reflect differences in mutation rates of males versus females (i.e. a lower mutation rate in the male germline than in the female germline could result in a higher mutation rate of the X). However, estimates of the number of cell divisions are similar for the male and female germlines of D. melanogaster (Drost and Lee 1998; but note that this estimates strongly dependent on the mean mating age of males and females which are hard to estimate in the wild). In fact, mutation rates were estimated to be higher in males of a different Drosophila species, D. miranda (Bachtrog 2008b). Another explanation that has been put forward is that the hypertranscription of the X chromosome required for dosage compensation may itself be mutagenic (Begun et al. 2007). Although the relation between transcription and mutagenesis is not completely understood, several experimental studies have shown that increased transcription can lead to increased mutation rates in prokaryotes, yeast and, more recently, mammalian cells (Hendriks et al. 2008). Increased transcription rates associated with dosage compensation are therefore plausible to be influencing the mutation rate of the X chromosome. It is, however, unclear if this can account for the increased divergence detected at X-linked sites in Drosophila, since in this group expression levels are negatively correlated with divergence levels, possibly because highly expressed genes are under stronger purifying selection against the accumulation of non-synonymous and synonymous mutations (Lemos et al. 2005).
Also, X chromosomes in Drosophila show a deficiency of genes with male-biased expression, and the mechanism of dosage compensation (i.e. hyper-transcription of the X through epigenetic modifications) could contribute to this observed deficiency. In particular, the modified chromatin structure of the X may directly interfere with subsequent transcriptional modification of X-linked genes in males. Male-biased gene expression originates mainly by increasing transcription of non-biased genes in males (rather than down-regulation in females), and higher expression levels may be harder to achieve on an already hyper-transcribed X chromosome (Vicoso and Charlesworth 2006; Vicoso and Charlesworth 2009). The X chromosome in male Drosophila is encumbered by the DCC and its chromatin structure has been modified globally. This may limit subsequent transcription factor binding or chromatin remodelling, and thus inhibit further transcriptional activation, resulting in a deficiency of male-biased expression on the X chromosome. In fact, direct interference between chromatin remodelling complexes and the dosage compensation machinery has been reported in Drosophila. Limitations in rates of transcription have also been put forward as an explanation for the deficiency of male-biased genes on the Drosophila X (Vicoso and Charlesworth 2009). This is supported by the observation that the proportion of male-biased genes located on the X differs significantly between high and low expression genes, with high expression male-biased genes being located less often on the X than low expression male-biased genes. This is expected if limits in rates of transcription prevent the accumulation of male-biased genes on the X, since such limitations are less likely to affect genes that are transcribed at low levels (Vicoso and Charlesworth 2009). It will be of interest to experimentally test whether dosage compensation can indeed help to explain the deficiency of male-biased genes on the Drosophila X.
We are grateful to members of the Bachtrog lab and Brian Charlesworth for comments on the manuscript. This research is funded by NIH Grant GM076007 and a Sloan Research Fellowship and a David and Lucile Packard Fellowship to D.B.
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
- Bachtrog, D., J. Jensen, and Z. Zhang. (2009). Accelerated adaptive evolution on a newly formed X chromosome. PLoS BiolGoogle Scholar
- Bull JJ (1983) Evolution of Sex Determining Mechanisms. Benjamin Cummings, Menlo Park, CAGoogle Scholar
- Gupta, V., M. Parisi, D. Sturgill, R. Nuttall, M. Doctolero, O. K. Dudko, J. D. Malley, P. S. Eastman, and B. Oliver. (2006). Global analysis of X-chromosome dosage compensation. J. Biol. 5:3.1–3.10Google Scholar
- Kelley, R. L., V. H. Meller, P. R. Gordadze, G. Roman, D. R.L., and M. I. Kuroda (1999) Epigenetic spreading of the Drosophila dosage compensation complex from roX RNA genes into flanking chromatin. Cell 98 Google Scholar
- Kelley, R. L., I. Solovyeva, L. M. Lyman, R. Richman, V. Solovyev, and I. Kuroda (1995) Expression of Msl-2 causes assembly of dosage compensation regulators on the X chromosomes and female lethality in Drosophila Cell 81:867–877Google Scholar
- Larsson J, Chen JD, Rasheva V, Rasmuson-Lestander Å, Pirrotta V (2000) Painting of fourth, a chromosome-specific protein in Drosophila. Proc Natl Acad Sci USA 98:6273–6278Google Scholar
- Lindsley DL, Sandler L, Baker BS, Carpenter ATC, Denell RE, Hall JC, Jacobs PA, Miklos GLG, Davis BK, Gethmann RC, Hardy RW, Hessler A, Miller SM, Nozawa H, Parry DM, Gould-Somero M (1972) Segmental aneuploidy and the genetic gross structure of the Drosophila genome. Genetics 71:157–184PubMedGoogle Scholar
- Meyer, B. J. (2005) X-Chromosome dosage compensation. WormBook:1–14Google Scholar
- Ruiz MF, Esteban MR, Donoro C, Goday C, Sanchez L (2000) Evolution of dosage compensation in Diptera: the gene maleless implements dosage compensation in Drosophila (Brachycera suborder) but its homolog in Sciara (Nematocera suborder) appears to play no role in dosage compensation. Genetics 156:1853–1865PubMedGoogle Scholar
- Vicoso, B., and B. Charlesworth (2009) The Deficit of Male-Biased Genes on the D. melanogaster X Chromosome Is Expression-Dependent: A Consequence of Dosage Compensation? J Mol EvolGoogle Scholar
- Zha X, Xia Q, Duan J, Wang C, He N, Xiang Z (2008) Dosage analysis of Z chromosome genes using microarray in silkworm. Bombyx mori, Insect Biochem Mol BiolGoogle Scholar
- Zhou S, Yang Y, Scott M, Pannuti A, Fehr K, Eisen A, Koonin E, Fouts D, Wrightsman R, Manning J, Lucchesi J (1995) Male-specific lethal 2, a dosage compensation gene of Drosophila, undergoes sex-specific regulation and encodes a protein with a RING finger and a metallothionein-like cysteine cluster. EMBO J 14:2884–2895PubMedGoogle Scholar