Histone modifications rather than the novel regional centromeres of Zymoseptoria tritici distinguish core and accessory chromosomes
- First Online:
- Cite this article as:
- Schotanus, K., Soyer, J.L., Connolly, L.R. et al. Epigenetics & Chromatin (2015) 8: 41. doi:10.1186/s13072-015-0033-5
- 2.7k Downloads
Supernumerary chromosomes have been found in many organisms. In fungi, these “accessory” or “dispensable” chromosomes are present at different frequencies in populations and are usually characterized by higher repetitive DNA content and lower gene density when compared to the core chromosomes. In the reference strain of the wheat pathogen, Zymoseptoria tritici, eight discrete accessory chromosomes have been found. So far, no functional role has been assigned to these chromosomes; however, they have existed as separate entities in the karyotypes of Zymoseptoria species over evolutionary time. In this study, we addressed what—if anything—distinguishes the chromatin of accessory chromosomes from core chromosomes. We used chromatin immunoprecipitation combined with high-throughput sequencing (“ChIP-seq”) of DNA associated with the centromere-specific histone H3, CENP-A (CenH3), to identify centromeric DNA, and ChIP-seq with antibodies against dimethylated H3K4, trimethylated H3K9 and trimethylated H3K27 to determine the relative distribution and proportion of euchromatin, obligate and facultative heterochromatin, respectively.
Centromeres of the eight accessory chromosomes have the same sequence composition and structure as centromeres of the 13 core chromosomes and they are of similar length. Unlike those of most other fungi, Z. tritici centromeres are not composed entirely of repetitive DNA; some centromeres contain only unique DNA sequences, and bona fide expressed genes are located in regions enriched with CenH3. By fluorescence microscopy, we showed that centromeres of Z. tritici do not cluster into a single chromocenter during interphase. We found dramatically higher enrichment of H3K9me3 and H3K27me3 on the accessory chromosomes, consistent with the twofold higher proportion of repetitive DNA and poorly transcribed genes. In contrast, no single histone modification tested here correlated with the distribution of centromeric nucleosomes.
All centromeres are similar in length and composed of a mixture of unique and repeat DNA, and most contain actively transcribed genes. Centromeres, subtelomeric regions or telomere repeat length cannot account for the differences in transfer fidelity between core and accessory chromosomes, but accessory chromosomes are greatly enriched in nucleosomes with H3K27 trimethylation. Genes on accessory chromosomes appear to be silenced by trimethylation of H3K9 and H3K27.
KeywordsCentromere Histone methylation ChIP-seq Accessory chromosomes Zymoseptoria tritici (Mycosphaerella graminicola)
adenine and thymine rich
centromere-specific histone H3 (CENP-A of mammals)
chromatin immunoprecipitation followed by massively parallel DNA sequencing
green fluorescent protein
dimethylated histone H3 lysine 4
dimethylated histone H3 lysine 9
trimethylated histone H3 lysine 9
dimethylated histone H3 lysine 27
trimethylated histone H3 lysine 27
dimethylated histone H4 lysine 20
chromosome conformation capture followed by massively parallel sequencing
non-homologous end joining
unclassified DNA repeat
reads per kilobase of transcript per million mapped reads
yeast malt sucrose
“Accessory chromosomes” are considered not essential for the survival and reproduction of an organism . Such “B” chromosomes have been described in many different organisms representing all major groups of eukaryotes , and are called “conditionally dispensable”, “lineage-specific”, or “accessory” chromosomes in fungi [3, 4, 5, 6]. One unifying characteristic of accessory chromosomes in fungi is that they can be present or absent in different individuals in a given population and thus occur at variable frequencies . In most cases, the functional importance of accessory chromosomes under natural conditions is unknown; however, the fact that they are maintained in some populations over evolutionary times suggests that they convey functional relevance, at least occasionally . Accessory chromosomes that result in adaptive advantage have been characterized in several fungi and special attention has been drawn to the presence of pathogenicity determinants, for example in Fusarium solani MPVI (Nectria haematococca) [3, 5], Fusarium oxysporum f. sp. lycopersici  and Leptosphaeria maculans [8, 9]. In all species studied so far, accessory chromosomes are distinguished from core chromosomes by their high repeat content and low gene density as shown by chromosome staining and biochemical methods [2, 10]. Currently, no studies on the chromatin structure of accessory chromosomes in fungi are available.
In the reference strain of the wheat pathogen Zymoseptoria tritici (synonym Mycosphaerella graminicola), eight accessory chromosomes have been found, ranging from ~0.4 to 1 Mb in size . Although these chromosomes comprise as much as 12 % of the total genome and encode more than 700 genes, no functional relevance has been assigned to accessory chromosomes in this species [11, 12]. This is consistent with recent analyses of transcriptomes and proteomes that revealed a majority of these genes to be non-transcribed (“silent”) during in vitro growth on rich medium, as well as during colonization of the wheat host [13, 14]. Accessory chromosomes may provide a special type of “genome niche”, conducive for rapid adaptive evolution of virulence-related genes, perhaps due to fewer selective constraints on sequence evolution . Indeed, genes located on the Z. tritici accessory chromosomes appear to evolve considerably faster compared to genes on the core chromosomes . The accessory chromosomes of Z. tritici were hypothesized to consist of duplicated sequences from core chromosomes that degenerated over time . Our detailed analyses of chromosome content to infer the extent of paralogy on core and accessory chromosomes showed that the majority of genes on accessory chromosomes are unique [12, 14]. Thus, evolution and dynamics of these chromosomes in Z. tritici are apparently not driven by simple duplications or translocations from the core chromosomes, but rather caused by a complex interplay of frequent structural rearrangements, likely aided by the activity of repetitive elements [14, 17]. Closely related species of Zymoseptoria also contain accessory chromosomes that are at least partially homologous to those of Z. tritici . This supports ancestral origins of Zymoseptoria accessory chromosomes and suggests that they, as the core chromosomes, have been maintained in populations of Z. tritici during speciation of the pathogen on wheat.
Accessory chromosomes are less faithfully transmitted than core chromosomes [5, 6, 11, 17, 19]. Transmission, and thus stability of eukaryotic chromosomes, derives in large part from two specialized regions, centromeres and telomeres (reviewed in ). Telomeric DNA repeats of Z. tritici are identical to the most common human repeat, 5′-(TTAGGG)n-3′ and have been found at 41 termini of the 21 chromosomes in the reference isolate IPO323 . Few complete fungal centromeres have been identified , as centromeres are not defined by specific DNA sequence, even in the short, one nucleosome-long point centromeres of Saccharomyces and related species . Most centromeres, however, are functionally defined by presence of the centromere-specific histone H3, CENP-A [23, 24, 25], which is called CenH3 in fungi . Regional centromeres extend over kilo- to megabases and show large variation in sequence composition and chromatin organization [21, 22, 26]. The two known extremes in the filamentous fungi are Neurospora crassa centromeres, which are large (150–300 kb), AT-rich and enriched with trimethylated H3 lysine 9 (H3K9me3), a histone mark typically associated with gene silencing , and Candida albicans centromeres that are short (4–18 kb) and do not contain conserved motifs or AT-rich repeats .
To understand what causes the frequent meiotic or mitotic instability of Z. tritici accessory chromosomes, we analyzed centromeres, telomeres and subtelomeric regions. We used chromatin immunoprecipitation (ChIP) of CenH3 tagged with GFP in combination with high-throughput sequencing (ChIP-seq) to identify the centromeres of core and accessory chromosomes. We also asked if centromeres in Z. tritici were associated with transcriptionally active (euchromatic) or silent (heterochromatic) regions and tested for the presence of telltale histone modifications by ChIP-seq. Here, we show that centromeres and subtelomeric regions of accessory chromosomes and core chromosomes are similar, but that the overall distribution of euchromatic and heterochromatic histone modifications is significantly different between the two types of chromosomes.
CenH3 is localized at several chromocenters in interphase nuclei
Centromeres of Z. tritici core and accessory chromosomes
Centromeres of Z. tritici
Chr. length (bp)
Cen start (bp)
Cen stop (bp)
Cen length (bp)
Cen (% chr.)
Repeats in Cen (%)
Cen location (% chr.)
Centromeres are not located in the longest AT-rich regions
Centromeres of Z. tritici are extremely short for a filamentous fungus, on average only 10.3 kb (Fig. 2; Table 1), but ranging from 5.57 kb (Chr. 13) to 13.55 kb (Chr. 8). The length of centromeric DNA is independent of chromosome length, which varies from 6 to 0.41 Mb. Compared to the very AT-rich centromeres of most other eukaryotes that have been studied, the GC-content of centromeres is little lower than the genome average, 48.3 % compared to 52.3 % (Figs. 2, 3, Additional file 3: Figure S2, Additional file 4: Figure S3; Table 1). In contrast to the long centromeres of N. crassa  and several fusaria [7, 21, 30], the centromeres of Z. tritici are also not located in the longest AT-rich region on each chromosome (Figs. 2, 3, Additional file 3: Figure S2; Additional file 7: Table S3). In total, we identified 847 AT-rich regions with an AT percentage above 50 %, covering 7.38 Mb (or 18.6 %) of the whole genome (Additional file 7: Table S3). These regions contain predominantly repetitive elements, but few coding sequences. They range in length from 2 kb to 86.3 kb with an average length of 9.7 kb, and an average AT-content of 55 %. Compared to the 11.6 % of DNA contained in accessory chromosomes, they harbor 21 % of the total AT-rich DNA and 23 % (196/847) of AT-rich regions. The longest AT-rich region per chromosome varies from 14.2 kb for chromosome 17 to 86.3 kb for chromosome 4. Only centromeres 1 and 5 are almost completely located in an AT-enriched region, but 17 other centromeres have AT-rich regions inside the CenH3-binding region.
The centromeres of Z. tritici are not defined by a consensus motif
To determine if there are conserved motifs within centromeric DNA of Z. tritici, we used BLAST  and MEME  analyses to search for conserved sequences and motifs. We first masked repetitive sequences in the centromeres and included only non-repetitive DNA in the BLAST analyses. Both the blast analysis and the MEME motif search failed to identify any conserved centromere-specific motif in the Z. tritici genome. Repetitive DNA found in some centromeres belongs to various repeat families—no single category is present at all centromeres. None of the retrotransposons, DNA transposons or their relics are specific for centromeres, i.e., they occur also in chromosome arms.
Centromeres of Z. tritici are not enriched in transposable elements (TEs)
The long AT-rich regions found in many fungi are composed of retrotransposons or relics of retrotransposons, and centromeres described in other filamentous fungi are composed almost entirely of repeats [21, 27, 30, 33]. In contrast, centromeres of Z. tritici are not composed of repetitive DNA as the mean repeat content of centromeric DNA is only 17.4 %, and centromeres 3, 7, 9, 13 and 19 have a repeat content of <1 % (Additional file 8: Table S4). Cen11 is the most repeat-rich centromere with a repeat content of only 44.2 %. Centromeres 3, 7, 9, 13 and 19 completely lack transposable elements (TEs) or relics of TEs, while even centromeres 2, 6, 8, 11 and 16 have TE coverage of only ~30 % (Additional file 8: Table S4). Repeat regions are overall short and can be grouped into a total of 17 different TE-families (Additional file 8: Table S4; ). RYN1 and unclassified repetitive regions (called “NoCat”) cover the most centromeric sequence on core chromosomes (each ~6 %), while—mostly inactive—RLG8 and RLC9 retroelements and DNA transposons are found most often at centromeres of accessory chromosomes. Overall, however, our comparisons of sequence motifs, AT-content, repeat content, and length revealed no significant differences between centromeres of core and accessory chromosomes.
Centromeres of Z. tritici contain predicted and expressed genes
A total of 39 putative genes lie completely within or overlap centromeric DNA, as determined by the presence of previously predicted reading frames [11, 14]. We re-analyzed existing data  and mapped them to our new annotation . Based on this analysis, 26 centromeric genes were transcribed (at reads per kilobase per megabase [RPKM] ≥2) during axenic growth in rich medium suggesting some functional relevance of these genes for the fungus (Additional file 9: Table S5). However, the mean RPKM value of centromeric genes on the core (385.4 ± 18881.5), and especially the accessory (1.7 ± 2.8) chromosomes is much lower than values obtained for all genes on core (584.9 ± 1992.5) or accessory chromosomes (23.0 ± 90.1); the mean RPKM for all genes on all chromosomes is 550.4 ± 1935.1 (Additional file 9: Table S5). Two genes completely within the centromeric DNA and three genes overlapping centromeric DNA, all on core chromosomes, showed consistent expression comparable to weakly expressed genes on chromosome arms. Only two have similarities to known genes, one encoding an alcohol dehydrogenase, the other an aminobutyrate aminotransferase; the remaining three are predicted or hypothetical proteins (Additional file 9: Table S5).
Telomere repeat tracts of core and accessory chromosomes are similar
Because we found no significant differences between centromeres of core and accessory chromosomes, we next turned our attention to chromosome ends. Telomeres are considered refractory to standard cloning procedures, yet we found telomere repeats (TTAGGGn) in the published Z. tritici genome sequence to be 128 ± 20.1 bp long, thus slightly longer than the 120 bp previously found in M. oryzae  and N. crassa , or the 110 bp in A. nidulans  by more sophisticated methods. Telomeres on core and accessory chromosomes are comprised of similar numbers of repeats (22.3 ± 3.2 vs. 19.8 ± 3.3 TTAGGGs, respectively) and are thus of similar average lengths (134 ± 18.8 bp vs. 118 ± 19.7 bp, respectively; Additional file 10: Table S6). The total telomere tract length varied from 91 to 175 bp on core and 73 to 143 bp on accessory chromosomes. We noticed the presence of a near-standard repeat, TGAGGG, on 13 of 41 cloned ends, and in some cases there were tandem repeats of 10–14 copies present, which were slightly enriched on accessory chromosomes. We also found 51 interstitial TTAGGG repeats with a minimum length of three repeat units; 29 were on core, 22 on accessory chromosomes, while two core chromosomes (Chr. 3 and 8) had no such repeats (data not shown). Only five of these tracts had six or more repeats, and four are located within subtelomeric regions. In the absence of more detailed studies on telomere length from a larger collection of wild-type strains, we deem the small differences in average telomere tract length not significant, even though the difference in repeat length is similar to the decrease in repeat length observed between Aspergillus wild-type and ku70 mutants . We conclude that telomere repeat length is not significantly different between the two classes of chromosomes.
Subtelomeric regions of core and accessory chromosomes contain the same transposable element families
Subtelomeric but not centromeric chromatin is enriched with H3K9me3 and H3K27me3 on both core and accessory chromosomes
ChIP-seq reveals different distribution of histone marks on core and accessory chromosomes
As the instability of accessory chromosomes cannot be explained by differences in centromeric, telomeric or subtelomeric regions alone, we compared overall patterns of histone modifications between core and accessory chromosomes outside of centromeric and subtelomeric regions. On core chromosomes, H3K4me2 was present in ~3 kb long blocks. In contrast, H3K9me3 and H3K27me3 covered larger blocks, with an average size of more than 13 kb. Consistent with the expected positive correlation to transcriptional activity, H3K4me2 was found in genic regions near and within coding sequences (Kendall’s Ʈ = 0.44, p < 2.2 × 10−16), while H3K9me3 was found in gene-poor, repeat-rich regions (Ʈ = 0.84, p < 2.2 × 10−16; Additional file 3: Figure S2, Additional file 13: Table S8). We found a negative correlation between H3K9me3 and coding sequences (Ʈ = −0.57, p < 2.2 × 10−16). These findings were consistent between two replicates of IPO323ΔChr18. The genome-wide distribution of H3K4me2 and H3K9me3 showed that these two marks are mutually exclusive (Fig. 5). In contrast to H3K9me3, the H3K27me3 mark is located not just in repeat-rich regions but also at discrete loci in genic regions, covering promoters and coding sequences (Fig. 4, Additional file 3: Figure S2; Ʈ = −0.41, p < 2.2.10−16). We found a twofold enrichment of H3K9me3 on accessory chromosomes compared to core chromosomes (17 and 9 % respectively; Fig. 5, Additional file 3: Figure S2). Similarly, H3K4me2 and H3K27me3 showed very different patterns on accessory chromosomes. We found a tenfold reduction of H3K4me2 on accessory chromosomes when compared to core chromosomes (Fig. 5, Additional file 3: Figure S2; Additional file 13: Table S8), which is reflected by the much lower number of expressed genes from these chromosomes. The core chromosomes harbor 11,111 annotated genes (Additional file 14: Table S9) while all accessory chromosomes combined only have 728 genes, few of which are expressed in pure culture or even in planta [12, 14].
The distribution of H3K27me3 in Z. tritici is different from that found in other fungi [45, 46, 47]. Unlike H3K4me2 or H3K9me3, it was not clearly correlated with coding sequences or transposable elements (Additional file 13: Table S8), indicating that its distribution was broader, covering both types of sequences. H3K27me3 and H3K9me3 overlap in DNA repeats and in some gene-sized regions. In contrast, H3K27me3 and H3K4me2 are largely exclusive of each other. There is a pronounced difference in the distribution of H3K27me3 on core and accessory chromosomes: core chromosomes show blocks of H3K27me3, mostly in the subtelomeric regions, while accessory chromosomes are almost completely enriched with H3K27me3 (Fig. 5, Additional file 3: Figure S2). In summary, our genome-wide histone modification maps show that core chromosomes are largely euchromatic (enriched with H3K4me2), except in subtelomeric and other repeat-rich regions, while accessory chromosomes are largely heterochromatic (enriched with H3K9me3 and H3K27me3).
A core chromosome segment with similarities to an accessory chromosome
The distal 0.865 Mb segment of the long right arm of chromosome 7 shows significant enrichment of H3K27me3 and near absence of the H3K4me2 mark (Fig. 6), reminiscent of the pattern observed for all accessory chromosomes. The main segment of this chromosome (1.80 Mb) has similar histone modification patterns as other core chromosomes; the centromere is localized at 259–266 kb, relatively close to the left arm telomere (TEL7L). The GC-content and gene density of the long “core” segment are at 52.5 % and 3.13 genes per 10 kb, almost identical to that of the distal “accessory” segment at 52.7 % and 3.3 genes per 10 kb, respectively. There is, however, a significant difference in gene content and organization between the two segments (Additional file 14: Table S9). The “core” segment has 563 genes with a mean gene length of 1.6 kb and half of these genes have predicted functions. The right-most, “accessory” segment has 288 genes with a mean length of 1.2 kb but only 10 % have known predicted functions. Putative secreted proteins appear to be enriched in this segment when compared to the rest of chromosome 7 (20 vs. 56 predicted secreted proteins, respectively). Thus, this region shares some characteristics with accessory chromosomes, such as almost complete absence of transcription  and low recombination rates , and we propose that this segment was translocated from an accessory chromosome or represents a fusion of a complete accessory chromosome onto the original chromosome 7. This is not without precedence, as fusions seem to have occurred in F. oxysporum, where the right arms of chromosomes 1 and 2 share sequence characteristics with accessory chromosomes . The predicted fusion site of chromosome 7 may lie within a long AT-rich region with retrotransposon relics and near a degenerate (TTAGGG)3 repeat at ~1.83 Mb. Only a short distance away, at ~1.69 Mb, is the rDNA array, which in the current genome assembly only contains two repeats . Between the rDNA repeats and TEL7R (nt 1,835,770–1,835,787), we found an imperfect telomere repeat (n = 3; Fig. 6). Based on genome sequencing depth of various wild-type isolates and the number of reads observed in our ChIP-seq experiments at this location (Figs. 5, 6, and data not shown), we observed 15- to 30-fold higher enrichment of reads at the two rDNA repeats compared to background coverage. Thus, we expect the rDNA cluster of Z. tritici to be between 30 and 60 repeats long. It remains uncertain if the locus identified on chromosome 7 is the only locus for rDNA arrays, but karyotyping in combination with Southern blots suggested only one hybridizing band to an IPO323 chromosome estimated to be 3.05 Mb long . Considering that chromosome 7 is 2.67 Mb according to the current assembly  and that a single rDNA repeat with intergenic spacer comprises ~7 kb, ~50 rDNA repeats would yield the expected size for chromosome 7 determined by karyotyping.
We present the first genome-wide analysis of centromeres, telomeres, subtelomeric regions and three selected histone modifications of core and accessory chromosomes in a filamentous fungus that is also an important pathogen of wheat, and this is also the first analysis of accessory or B chromosomes by ChIP-seq. We set out to investigate potential causes for the mitotic and meiotic instability of accessory chromosomes, an enduring puzzle of general interest to chromosome biologists, by examining Z. tritici, a suitable model organism. Our working hypothesis stated that centromeric regions and/or different combinations of subtelomeric repeats on accessory chromosomes result in the previously observed instability [11, 17, 19]. DNA sequence, repeat content, and selected characteristics of chromatin structure revealed no significant differences between the two types of chromosomes at centromeres, subtelomeres and telomere repeat tracts. Thus, we rejected our initial hypothesis, and concluded that the instability of accessory chromosomes must be caused by other chromosome-specific traits or processes. We found one clear difference between core and accessory chromosomes in the organization of facultative heterochromatin, as assayed by ChIP-seq with antibodies to H3K27me3 nucleosomes.
The analysis of Z. tritici centromeres by use of GFP-tagged CenH3 revealed a novel way in which centromeres are organized. Visualization of centromere-specific fluorescence in interphase cells showed that organization of chromocenters in Z. tritici nuclei differs from that observed in other filamentous fungi (e.g., F. graminearum and N. crassa; ). In Saccharomyces cerevisiae [49, 50] and Schizosaccharomyces pombe [51, 52], as well as Drosophila melanogaster  and some plants , centromeres also congregate into a single chromocenter and chromosomes organize themselves into what is called the “Rabl orientation”. In Z. tritici, however, the GFP signal forms several discrete foci suggesting that several centromeres are located in distinct regions inside the nucleus. This is reminiscent of Cryptococcus neoformans, where discrete foci coalesce into one spot only upon entry into mitosis .
The “telomere-to-telomere” genome assembly of Z. tritici IPO323 allowed us to precisely identify and characterize the DNA of entire centromeric regions by ChIP-seq with GFP–CenH3. For many species assembling the centromeric DNA presents a challenge due to the high AT-content and accumulation of near-identical DNA repeats [21, 26]. Centromeric regions of Schizosaccharomyces species are enriched with repetitive sequences, even in the centromere cores . Repeats in the centromeres of S. pombe and S. octosporus are more similar, and centromeres of S. japonicus are enriched with transposons in the pericentric regions . In N. crassa centromeres are entirely composed of relics of transposable elements . We show that the regional centromeres of Z. tritici are very short (~10.3 kb), not located in the longest AT-rich regions of the genome, and overall poor in DNA repeats. The DNA sequence for each centromere is unique and no common motif is discernible. Thus, they most resemble the short regional centromeres of C. albicans that also have unique DNA sequence without conserved motifs [28, 56, 57].
What makes Z. tritici centromeres different from other fungi is the presence of bona fide expressed genes. In N. crassa, three predicted genes were placed into centromeric DNA contigs, but all are either pseudogenes or part of a novel DNA transposon . Here, we identified 39 genes that are completely within Z. tritici centromeric DNA or overlapping these regions. Neocentromere formation in C. albicans resulted in silencing of genes located within newly formed centromeres [59, 60]. Neocentromere formation in S. pombe occurs in regions with genes that are poorly expressed during normal growth, but induced during nitrogen starvation . After neocentromere formation gene expression remains low, even after nitrogen depletion, suggesting that CenH3 nucleosomes and perhaps silencing histone marks reduce gene expression. While H3K9me3 is not required for the maintenance of centromeres and CenH3 deposition, it is required for de novo assembly of centromeres on plasmids . In contrast to centromeres of most fungi, the much larger plant centromeres have been shown to contain genes; for example, rice centromere 8 has at least 14 predicted genes of which four are expressed . A small fragment from maize chromosome 3, generated by UV mutagenesis, resulted in formation of a relatively unstable B chromosome named “Duplication 3a” (Dp3a). It carries a functional neocentromere covering 22 genes within 350 kb of a region enriched with CenH3 detected by ChIP-seq . Subsequent studies further dissected the sequence requirements for this B chromosome’s neocentromere and its derivatives; composition of the DNA sequence was not a deciding factor in neocentromere formation . A cross between maize and oat led to neocentromere formation, and analyses of two hybrid progeny showed 12 active genes within the newly formed centromere . The functions of expressed genes in the Z. tritici centromeres are unknown. It is unclear if the putative alcohol dehydrogenase and aminobutyrate aminotransferase have the predicted activities.
There was no simple correlation of either euchromatic or heterochromatic histone marks with centromeric DNA. Presence of CenH3 nucleosomes did not appear to affect histone marks on canonical H3 that were tested. Like centromeres of N. crassa , but in contrast to those of S. pombe , Drosophila and mammals [68, 69, 70] centromeres of Z. tritici lack enrichment for H3K4me2. Absence of H3K4me2 has also been described for centromeric DNA of A and B chromosomes of maize . Most centromeric nucleosomes of Z. tritici were, however, bordered by genic regions that showed H3K4me2 enrichment. H3K4me2 also surrounds the centromeres of the accessory chromosomes 19 and 21 from which the H3K4me2 mark is largely absent. In contrast to the core centromeric regions of N. crassa [27, 72] and C. neoformans , the centromeres of Z. tritici are also not enriched with the heterochromatic mark H3K9me3. In S. pombe and other Schizosaccharomyces species, the pericentric regions and to a much lesser extent the central cores are enriched with H3K9me3 [52, 62, 67]. Thus, the role of heterochromatin in centromere function first found in S. pombe [73, 74] and N. crassa,  is not shared in Z. tritici, as there are no clear pericentric heterochromatic regions. It is, however, possible that Z. tritici centromeres are enriched with other heterochromatic histone modifications that were not studied here such as H4K20me2, H3K27me2 or H3K9me2. In summary, by all characteristics measured here centromeres of core and accessory chromosomes are not significantly different. This suggests that interactions between centromeric DNA and some interactions between nucleosomes with centromere foundation proteins, such as CenH3, are different in Z. tritici when compared to other eukaryotes.
Taking all available data together, both core and accessory centromeres share some hallmarks of neocentromeres that have been found in C. albicans, usually after some forms of selection had been applied [59, 60, 75, 76]. So far, we have compared two closely related strains of the same Zymoseptoria species, both derived from the reference strain (IPO323) [11, 48]. In one strain, the KU70 gene had been replaced (IPO323ΔKU70), the other isolate had lost chromosome 18 during culturing in the lab (IPO323Δ18). Deletion of the KU70 gene did not result in genome instability, as one may predict when NHEJ is disabled, at least in our assays; no differences in centromere placement or other centromere features were noted. It is possible that in this relatively young and strongly host-adapted species centromeres behave differently than in species studied so far. A sexually reproducing species should, however, conserve localization of and synteny around the centromere for successful meiosis; this has been observed when comparing different species of several genera, e.g., Aspergillus , Schizosaccharomyces , Neurospora and Fusarium (M. Freitag, unpublished data). Dothideomycete chromosomes are characterized by “mesosynteny”, conserved overall chromosome structure coupled to many instances of local rearrangements [11, 78]. It is possible that these frequent reshufflings include the rather short centromeric regions to yield chromatin structure mimicking that of neocentromeres in other species. Additional Zymoseptoria strains and species, and additional Dothideomycetes will need to be examined to learn more about centromere positioning in these species.
Our results suggest that the reduced transmission fidelity of accessory chromosomes is not caused by differences in the structure of centromeres or telomeres. What are other quantifiable differences between core and accessory chromosomes? Commonly accessory chromosomes of fungi harbor few genes, but many active or disabled TEs [6, 11]. In Z. tritici, almost twice as many repetitive elements are found on accessory chromosomes [12, 79] and accessory chromosomes have much lower gene densities than core chromosomes, 1.6 vs. 3.2 genes per 10 kb, respectively (Additional file 14: Table S9; [12, 14]). There is ample evidence that there is little expression from accessory chromosomes, and this is certainly true for Z. tritici where genes on accessory chromosomes have 13-fold lower overall expression levels in both pure culture and during early host infection [12, 14]. This chromatin state correlates with the high AT and repeat content of Z. tritici accessory chromosomes, a correlation that holds also true in S. pombe, N. crassa, F. fujikuroi and F. graminearum [30, 46, 67, 72]. Functions for H3K9me3-enriched heterochromatic regions in fungi are still unclear, though packaging TEs and their relics into heterochromatin may prevent their spreading across the genome [72, 80]. Regulated removal of H3K9me3 may also activate genes involved in pathogenicity in L. maculans  and production of secondary metabolites in Epichloë festucae . Outside of repeat regions, all core chromosomes have H3K27me3 enriched in subtelomeric and shorter interstitial genic regions. Accessory chromosomes, however, are almost completely covered by H3K27me3, the major difference in chromatin structure we have found that separates the two classes of chromosomes. The association of H3K27me3 with genic regions on all chromosomes suggests involvement in regulation of gene expression as observed in Neurospora , Fusarium  and Cryptococcus neoformans . Heterochromatization is thus a quality accessory chromosomes share with the often entirely heterochromatic B chromosomes of other kingdoms [2, 7].
The H3K27me3 signal was partially overlapping with the distribution of H3K9me3, something that has so far not been found in other fungi. In Caenorhabditis elegans, however, both H3K9me3 and H3K27me3 can be overlapping at certain stages of development and in certain regions of the genome , though this was not obvious in an earlier study . Overlapping H3K9me3 and H3K27me3 was generated in centromeric domains by mutating C. neoformans Ccc1 , a protein with H3K27me3-binding activity that is required to maintain H3K27me3 in specific subtelomeric regions. There is also precedence for shifts in H3K27me3 from studies in plants  and mammals , where H3K27me3 moved into regions covered by H3K9me3 when cytosine DNA methylation had been removed by mutation of DNA methyltransferase genes. The reference strain of Z. tritici is deficient in cytosine DNA methylation because the conserved homolog of N. crassadim-2, MgDnmt, underwent gene duplication to more than 20 copies followed by inactivation by Repeat Induced Point mutation (RIP) . Thus, we hypothesize that the overlapping H3K9me3 and H3K27me3 we observed is a consequence of the lack of DNA methylation, similar to the results from plants and mammals.
Many studies have shown that nuclear position of chromosome segments can affect gene expression [88, 89, 90]. Based on the analysis of centromere chromocenters (by GFP–CENH3 microscopy) and chromatin structure (distribution of H3K4, H3K9, and H3K27 methylation by ChIP-seq), our study allows us to propose the existence of at least two different regions of chromosome organization in Z. tritici nuclei. Accessory chromosomes are almost entirely heterochromatic, similar to B chromosomes in other eukaryotes, while core chromosomes are mostly euchromatic. Studies on stretched chromatin fibers of maize stained with antibodies against various histone modifications suggested depletion of H3K27me2 from both B chromosomes . While H3K27me3 was not examined in that study, H3K9me2, a well-studied indicator of heterochromatin, was enriched on B chromosomes. Core chromosomes of both Zymoseptoria and maize are mainly euchromatic, in Z. tritici with some relatively short interspersed heterochromatic regions and larger blocks of heterochromatin at the chromosome ends. In S. cerevisiae Rabl orientation is maintained in interphase as revealed by HiC-based modeling of the nucleus . Strong centromeric and pericentric interactions have also been demonstrated in S. pombe  and Arabidopsis  HiC chromatin maps. In several studies, domains enriched with H3K27me3 have been shown to form blocks of chromatin that may condense into “polycomb bodies” and appear to be located, if not anchored, near the nuclear membrane [83, 94, 95]. Based on our studies, we formulated a testable working model for chromatin architecture in Z. tritici, in which interactions between centromeres of the 13 core chromosomes generate one main chromocenter, while interactions of H3K27me3-rich chromatin of the eight accessory chromosomes form “silencing bodies” near the nuclear membrane. This implies that for accessory chromosomes interactions between H3K27-methylated chromosome arms are stronger than interactions that bring centromeric regions of different accessory chromosomes together. These H3K27me3 domains may thus “trap” and separate centromeres of accessory chromosomes in the nucleus, resulting in the additional, weakly fluorescing chromocenters. Work on maize also suggested that less CenH3 is incorporated at centromeres on plant B chromosomes, resulting in weaker CenH3 immunofluorescence . One prediction of our hypothesis was that comparisons between GFP–CenH3 strains with a full chromosome complement and those that lack chromosome 18 would reveal differences in the average number of foci. Preliminary experiments suggest, however, that the situation is not as simple as proposed above because the median number of foci observed is not different. Instead, we saw minor changes in the number of bright foci. Future experiments using cytological and chromosome conformation capture approaches will be applied to further test our hypothesis.
We hypothesized that core and accessory chromosomes have distinct centromeres and that the centromeric organization leads to meiotically unstable accessory chromosomes. Besides the relative location of centromeres (acrocentric on most core, metacentric on most accessory chromosomes), however, there are no obvious measurable differences. Centromeres of Z. tritici are not perfectly associated with either canonical heterochromatin or euchromatin and they contain genes that are overall poorly expressed. Overall they exhibit features that have been considered common for neocentromeres in other organisms. Moreover, there is no significant difference between telomeric repeat sequences, repeat length and composition of the adjacent subtelomeric TEs on core and accessory chromosomes. The right arm of core chromosome 7 is poorly transcribed and enriched with heterochromatin. Based on these criteria, otherwise only found for accessory chromosomes, we propose that an accessory chromosome was fused to a core chromosome, resulting in the extant chromosome 7. We show that accessory chromosomes of Z. tritici can be distinguished from core chromosomes based on enrichment with H3K27me3. Whether this enrichment with a mark for facultative heterochromatin is causally involved in the reduced transmission fidelity of accessory chromosomes will be the focus of future work.
Strains and growth conditions
All experiments were performed with derivatives of the Z. tritici strain used for the reference genome, IPO323 . In IPO323ΔChr18 (Zt9), chromosome 18 has been lost [14, 48], and in IPO323ΔKU70 (Zt84), the KU70 gene has been disrupted to increase the efficiency of homologous recombination . That our original IPO323 isolate had lost chromosome 18 was discovered in the course of studies described here. Streaks from glycerol stocks (kept at −80 °C) were used as initial inoculum on YMS (4 g yeast extract, 4 g malt extract, 4 g sucrose, 20 g agar per 1 L H2O) agar plates. Cultures were grown for four to six days at 18 °C. Cells were transferred to liquid YMS medium and grown for 3 days at 18 °C while shaking at 200 rpm.
Construction of GFP-tagged strains
The homolog of CenH3, previously described as centromere-specific protein in other organisms [21, 23], was identified in the Z. tritici genome (http://genome.jgi-psf.org/Mycgr3/Mycgr3.home.html; ) by BLAST analyses  with N. crassa CenH3 . To introduce the GFP epitope tag, we designed constructs for homologous gene replacement based on the binary vector D0893pNOVpGpda SDHB_H267YtTrpC, which carries hph, the gene encoding resistance to Hygromycin B . We amplified the predicted open reading frame of the CenH3 gene, as well as 1 kb of 5′ and 3′ flanking sequences of each gene from genomic DNA of strain Zt9, and the GFP tag from pZero-GFP-loxP-hph-loxP  but with a 6X glycine linker. All primers are listed in Additional file 15: Table S10. Purified PCR amplified fragments were fused by overlap-PCR  to create a construct consisting of all fragments. The new construct was inserted into the binary vector at the unique Bsp120I and AscI sites. Plasmid sequences were verified by restriction analyses and Sanger sequencing. Constructs were introduced into Zt9 and Zt84 by Agrobacterium tumefaciens-mediated transformation , but with minor modifications. Correct insertions of the replacement cassettes were determined by an initial PCR screen followed by Southern analysis on positive clones . Isogenic strains resulting from transformation with GFP–CenH3 in Zt9 are called Zt121 (isolates Zt121-57 and Zt121-85) and those in Zt84 are called Zt118 (Zt 118-5 and Zt118-7).
Western analyses were done to verify translation of the GFP-tagged CenH3 protein. Proteins were extracted from GFP–CenH3 strains Zt118 and Zt121 by the peqGOLD TriFast protocol (Peqlab, Erlangen, Germany). Prior to electrophoresis, protein levels were normalized by Bradford assay . Western blotting was done by standard methods . Detection of the GFP signal was using the ECL prime kit (GE Healthcare Europe GmbH, Freiburg, Germany) according to instructions of the manufacturer. Anti-GFP antibody (Roche; #11 814 460 001) and secondary anti-mouse IgG-HRP antibody (Cell Signaling; #7076 G) were used.
To assess localization of GFP-tagged centromere proteins, we used fluorescence microscopy on a Zeiss Axioplan 2 Microscope System equipped with a 100X oil immersion objective (1.4 N/A); standard excitation filters and emission filters for detection of GFP fluorescence were used. Fluorescence micrographs were taken with a Cool SNAP HQ camera (Photometrics) and manipulated in Gimp (Version 2.8.10). To assess the number of GFP foci, numerous still images were taken of strains Zt121 (IPO323ΔChr18) and Zt118 (IPO323ΔKU70), both producing GFP–CenH3. Bright and weak GFP foci in nuclei were counted to determine if Zt121 had on average fewer foci than Zt118.
Chromatin immunoprecipitation followed by high-throughput sequencing (ChIP-seq)
ChIP experiments were performed on the yeast-like stage after growth in liquid YMS medium for 4 days until an OD600 of 0.6–0.8 had been reached. ChIP experiments were performed as previously described [27, 101] with minor modifications . Histones with the modification of interest were immunoprecipitated by adding 2 μL of antibodies to ~300–500 μL of purified chromatin. ChIP was performed with the following antibodies: anti-GFP (#ab290; Abcam, Cambridge, MA, USA), anti-H3K4me2 (#07-030; Millipore, Billerica, MA, USA), anti-H3K9me3 (#39161; Active Motif, Carlsbad, CA, USA) and anti-H3K27me3 (#39155; Active Motif, Carlsbad, CA, USA). Libraries for ChIP-seq were prepared according to Illumina TruSeq protocols with some modifications and controls [102, 103]. Purification of DNA was performed with AMPure XP beads (Agencourt, Beckman-Coulter). To size select samples were gel-purified and PCR amplified by 13 PCR cycles with Phusion polymerase (Finnzymes) and Illumina PCR primers. Libraries were sequenced on an Illumina HiSeq 2000 genome analyzer at the OSU Center for Genome Research and Biocomputing. Sequencing data were submitted to SRA under accession number SRP059394. ChIP tracks are viewable on a dedicated gbrowse server at http://ascobase.cgrb.oregonstate.edu/cgi-bin/gb2/gbrowse/ncrassa_public/.
Short read mapping and peak calling analyses
Illumina reads were filtered and trimmed as previously described , but they were not trimmed from the 5′ end, resulting in 47-nt reads. Processed reads were mapped to the Z. tritici IPO323 reference genome  with Bowtie2 at standard settings . Mapping outputs were converted from “.sam” to “.bed” files using Samtools and Bedtools [104, 105]. For visualization of mapped reads, alignment files from Bowtie2 were either processed with the genomecov tool from the bedtools package  and the Integrative Genome Viewer (IGV; http://www.broadinstitute.org/software/igv);  or loaded on a dedicated public gbrowse server (http://ascobase.cgrb.oregonstate.edu/cgi-bin/gb2/gbrowse/ncrassa_public/). For each ChIP sample (Additional file 2: Table S1), we determined significantly enriched domains with RSEG . Two biological replicates were also evaluated separately to assess variability between ChIP-seq experiments. The coverage or relative enrichment of coding sequences, repetitive elements and histone modifications on each chromosome were calculated in non-overlapping sliding windows of 10 kb. For correlation analyses in R (http://www.R-project.org), we used Kendall’s Tau correlation coefficient .
Identification of AT-rich regions
AT-rich regions were identified in the genome assembly of Z. tritici IPO323 using a homemade python script that computed the local GC-content in 1 kb sliding windows with a shift of one bp. AT-rich regions were defined as a concatenation of consecutive windows having a GC-content value lower than 50 %.
KS designed and performed most experiments, analyzed data and wrote the manuscript; JLS performed data analyses on ChIP-seq and telomere length; LRC designed and performed ChIP-seq experiments; JG performed data analyses on TE distribution; PH designed molecular tools and performed gene cloning experiments; KMS performed ChIP-seq data analyses and submitted data to SRA; MF and EHS designed and supervised the whole study, analyzed data and wrote the manuscript. All authors read and approved the final manuscript.
We thank Mike Cusaki for providing the IPO323ΔKU70 strain and a binary cloning plasmid, and Michael Bölker for helpful discussions. This study was supported by intramural funding of the Max Planck Society to EHS and by NIH grant GM097637 to MF. KS acknowledges an EMBO short-term fellowship for studies in Oregon. JLS is funded by a Young Scientist Funding by INRA. KMS was partially supported by NIH Grant GM068087.
Compliance with ethical guidelines
Competing interests The authors declare that they have no competing interests.
Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.