Epistatic and allelic interactions control expression of ribosomal RNA gene clusters in Arabidopsis thaliana

Rabanal, Fernando A.; Mandáková, Terezie; Soto-Jiménez, Luz M.; Greenhalgh, Robert; Parrott, David L.; Lutzmayer, Stefan; Steffen, Joshua G.; Nizhynska, Viktoria; Mott, Richard; Lysak, Martin A.; Clark, Richard M.; Nordborg, Magnus

doi:10.1186/s13059-017-1209-z

Epistatic and allelic interactions control expression of ribosomal RNA gene clusters in Arabidopsis thaliana

Research
Open access
Published: 03 May 2017

Volume 18, article number 75, (2017)
Cite this article

Download PDF

You have full access to this open access article

Genome Biology Aims and scope Submit manuscript

Epistatic and allelic interactions control expression of ribosomal RNA gene clusters in Arabidopsis thaliana

Download PDF

Fernando A. Rabanal ORCID: orcid.org/0000-0003-1538-9752¹,
Terezie Mandáková²,
Luz M. Soto-Jiménez¹,
Robert Greenhalgh³,
David L. Parrott³,
Stefan Lutzmayer¹,
Joshua G. Steffen⁴,
Viktoria Nizhynska¹,
Richard Mott⁵,
Martin A. Lysak²,
Richard M. Clark^3,6 &
…
Magnus Nordborg ORCID: orcid.org/0000-0001-7178-9748¹

5474 Accesses
29 Citations
28 Altmetric
Explore all metrics

Abstract

Background

Ribosomal RNA (rRNA) accounts for the majority of the RNA in eukaryotic cells, and is encoded by hundreds to thousands of nearly identical gene copies, only a subset of which are active at any given time. In Arabidopsis thaliana, 45S rRNA genes are found in two large ribosomal DNA (rDNA) clusters and little is known about the contribution of each to the overall transcription pattern in the species.

Results

By taking advantage of genome sequencing data from the 1001 Genomes Consortium, we characterize rRNA gene sequence variation within and among accessions. Notably, variation is not restricted to the pre-rRNA sequences removed during processing, but it is also present within the highly conserved ribosomal subunits. Through linkage mapping we assign these variants to a particular rDNA cluster unambiguously and use them as reporters of rDNA cluster-specific expression. We demonstrate that rDNA cluster-usage varies greatly among accessions and that rDNA cluster-specific expression and silencing is controlled via genetic interactions between entire rDNA cluster haplotypes (alleles).

Conclusions

We show that rRNA gene cluster expression is controlled via complex epistatic and allelic interactions between rDNA haplotypes that apparently regulate the entire rRNA gene cluster. Furthermore, the sequence polymorphism we discovered implies that the pool of rRNA in a cell may be heterogeneous, which could have functional consequences.

Centromere sequence-independent but biased loading of subgenome-specific CENH3 variants in allopolyploid Arabidopsis suecica

Article Open access 14 June 2024

Genome-wide association studies meta-analysis uncovers NOJO and SGS3 novel genes involved in Arabidopsis thaliana primary root development and plasticity

Article Open access 14 June 2024

Deficiency of multiple RNA silencing-associated genes may contribute to the increased susceptibility of Nicotiana benthamiana to viruses

Article Open access 19 June 2024

Background

The central importance of ribosomal RNA (rRNA) genes for our understanding of biology cannot be overstated: they may well be evolutionarily the oldest genes [1–4]; they are the most highly expressed genes in any organism; and their expression is central to cellular growth [5]. In eukaryotes, the catalytic core of ribosomes consists of four RNA molecules: the 18S, 5.8S and 25S rRNAs, produced via a common 45S rRNA precursor; and the often separately encoded 5S rRNAs [6, 7]. Because of the requirement for large quantities of rRNA, hundreds to thousands of 45S rRNA genes are tandemly arrayed head-to-tail in large ribosomal DNA (rDNA) clusters that, when expressed, form nucleolus organizer regions (NORs) [8–11] (Fig. 1a).

Although rRNA accounts for the majority of the RNA in a eukaryotic cell, only a subset of the rRNA genes appear to be active at any given time: the others are silenced by repressive chromatin modifications [5, 12–14]. In interspecific hybrids, uniparental expression of rRNA genes due to an epigenetic phenomenon termed nucleolar dominance is often observed [13, 15–17]. The word “dominance” is used because one rDNA cluster apparently suppresses the activity of the other [18]. At the intraspecies level, many model organisms, such as humans, mice, zebrafish, wheat, and Arabidopsis thaliana have multiple rDNA clusters on different autosomes [19–23] and early cytogenetic studies have shown differential expression of such clusters in human cell types [24] and plants [25]. While attempts have been made to clone and sequence differentially expressed rRNA gene variants, it was not possible to assign them to a specific rDNA locus or cluster [26–28]. In humans, for instance, characterizing rDNA cluster-specific variation required rodent-human somatic cell hybrids, each of which contained a single human chromosome [29]. In A. thaliana, however, rDNA clusters can be effectively unlinked from each other in experimental populations.

The genome of A. thaliana contains two rDNA clusters located at the top of chromosomes 2 and 4, hereinafter referred to as rDNA-2 and rDNA-4, respectively (Fig. 1a) [30, 31]. In the reference accession Col-0, only rDNA-4-derived rRNA genes are actively transcribed, while rDNA-2 is silent [32]. However, 45S rRNA gene copy number varies massively among natural accessions of A. thaliana [33–35], with both rDNA clusters contributing [36]. Furthermore, considerable variation in the degree of DNA methylation at rDNA clusters has also been observed, suggesting variation in silencing [37–39]. Here we exploit sequence variation among natural lines [40] to investigate whether there is also variation in rDNA cluster expression among A. thaliana accessions.

Results

Sequence variation in rRNA genes within and between individuals

Monitoring expression of particular rRNA genes is difficult because all copies are extremely similar due to concerted evolution, an evolutionary process that promotes homogeneity among the many rRNA gene repeats [41–45]. Nonetheless, by taking advantage of next-generation DNA sequencing data from the 1001 Genomes Consortium [40], we identified 2264 polymorphic sites along a 7.7 kb transcribed portion of the 45S rRNA gene, spanning from the minimal promoter to the end of the 25S rRNA subunit (see “Methods”; Fig. 1b and Additional file 1). Sites can have multiple variants; thus, there are 2844 variants found in at least one individual (the alternative variant must be present in at least 5% of the individual’s total 45S rRNA genes). Note that these are rRNA gene variants within as well as between individuals: while 56% of the variants are specific to one accession, 249 (9%) of them are shared by at least ten accessions (Fig. 1b). Interestingly, sequence variation is not restricted to the external transcribed spacer (ETS) or the internal transcribed spacer (ITS) sequences as previously reported for Col-0 [32, 46], but is also present within the highly conserved ribosomal subunits (although the population frequency of such variants is clearly lower, suggesting the action of purifying selection; see Fig. 1b).

Our primary interest in these polymorphisms is to use them as markers to monitor rDNA cluster-specific expression. In order to assign them to rDNA clusters, we used the multi-parent advanced generation inter-cross (MAGIC) population: a set of recombinant inbred lines derived from intercrossing a genetically heterogeneous stock of 19 worldwide accessions [47]. In the MAGIC population, the two rDNA clusters have been effectively randomized with respect to each other. In addition, due to the lack of recombination between homologous rDNA clusters in A. thaliana [30, 31, 48], rDNA clusters and flanking regions are usually inherited as haplotype blocks, making it possible to infer rDNA cluster identity from single-nucleotide polymorphism (SNP) markers in the flanking regions [36]. To ensure that all rRNA gene variants could be mapped, we used only variants unique to or shared by less than eight of the 19 founder accessions [49]. Through standard linkage mapping and manual curation (see “Methods”) in this population, we identified rDNA cluster-specific markers in the founder accessions (Fig. 1c and d, Additional file 2: Table S1).

Accessions express either rDNA-2 or rDNA-4, or both

We monitored the expression of these rDNA cluster-specific variants in leaves of approximately two-week old seedlings from different accessions. In agreement with previous results [32], none of the four rDNA-2-specific variants were expressed in the reference accession Col-0 (6909), while all rDNA-4-specific variants were (Fig. 2a and Additional file 3). Since active rRNA genes in A. thaliana are present in nucleoli when active and excluded when silenced [11], we reasoned that rDNA clusters carrying active variants would localize in proximity to the nucleolus more frequently than silenced ones (Fig. 2b). Indeed, rDNA cluster localization by means of fluorescence in situ hybridization (FISH) revealed that Col-0 rDNA-4 preferentially associated with the nucleolus in 61% of cells, while in the other nuclei (39%) rDNA-4 and rDNA-2 were equally close to the nucleolus (Fig. 2c and Additional file 4).

However, what is true for Col-0 is not universal (Fig. 2). While five other accessions—Sf-2, Bur-0, Edi-0, Ws-0, and Wu-0—appear to behave like Col-0 in that rDNA-4 is expressed and rDNA-2 silenced, four accessions—No-0, Ct-1, Can-0, and Hi-0—show the opposite pattern, with rDNA-2 exclusively expressed, and two lines—Ler-0 and Zu-0—express both rDNA-2 and rDNA-4 (Fig. 2 and Additional file 5: Figure S1). Furthermore, analysis of the MAGIC lines allowed us to determine the expression of the parental rDNA clusters in different combinations. Remarkably, the results strongly suggest that the expression at one rDNA cluster depends on the genotype at both clusters (Fig. 3). For example, in accession No-0 only rDNA-2 is expressed while in Ler-0 both rDNA clusters are (Fig. 2 and Additional file 5: Figure S1); however, in MAGIC lines 261 and 485, rDNA-4 inherited from Ler-0 is silenced when combined with rDNA-2 from No-0 (Fig. 3a). Thus, No-0 rDNA-2 expression is dominant over that of Ler-0 rDNA-4. In contrast, when No-0 rDNA-2 and Edi-0 rDNA-4 are combined in MAGIC line 170, the former is silenced and the latter expressed (Fig. 3b), despite the fact that both are expressed in the parental lines (Fig. 2 and Additional file 5: Figure S1). Thus, Edi-0 rDNA-4 expression is dominant over No-0 rDNA-2. Moreover, based on our limited data, the pattern of silencing appears to be deterministic (rather than stochastic) as suggested by MAGIC lines that after undergoing independent pedigrees—five generations of intermating [50]—inherited the same genotypes at both rDNA loci and display similar expression status (Fig. 3a, Additional file 5: Figure S2).

These results lead to two conclusions. First, the expression pattern in Col-0—rDNA-4 expressed and rDNA-2 silenced—is clearly not a general feature. Indeed, in Cvi-0 X Ler-0 recombinant inbred lines (RILs) expression of both rDNA clusters was observed in lines in which Cvi-0 contributed rDNA-2 and Ler-0 rDNA-4 [51], suggesting that rDNA-4 is not always fully dominant over rDNA-2. Equally importantly, the varying pattern of expression suggests that dominance is not a property of the rDNA cluster per se, but rather of its “allelic” content (i.e. haplotypic—note that rDNA clusters and flanking regions are usually inherited as complete haplotypes since homologous recombination is suppressed [48]). Dominance should thus occur between different haplotypes in individuals heterozygous with respect to a particular rDNA cluster.

Dominance is a property of rDNA “alleles”

Since the MAGIC lines are inbred, they can only be used to study interactions between loci (i.e. epistasis). To investigate interactions between alleles on homologous chromosomes (i.e. classical dominance), we crossed accessions with dominant rDNA-4 and analyzed the hybrid F₁ plants. In these crosses, Sf-2 rDNA-4 and Col-0 rDNA-4 seem to be co-dominant (Fig. 4a), while Sf-2 and Col-0 rDNA-4 are both dominant over Bur-0 rDNA-4 (Fig. 4b and c)— we detected weak expression of Bur-0 rDNA-4 in some replicates when Bur-0 is used as a mother and Col-0 as a father (see Additional file 5: Figure S3). Thus, dominance can occur not only between rDNA clusters on different chromosomes, but also between rDNA “alleles” of the same rDNA cluster.

Genetic analysis of interactions between rDNA “alleles”

To gain further insight into the complex interactions among rDNA “alleles,” we carried out linkage mapping in an F₂ population derived from a cross between Algutsrum (8230), in which rDNA-4 is dominant, and TDr-9 (6195), in which rDNA-2 is dominant (Fig. 5a, Additional file 2: Table S2 and Additional file 6). Expression of each of the four rRNA gene clusters was mapped separately (Fig. 5b–e). Mapping identified the distal end regions of chromosomes 2 and 4, i.e. the location of the rDNA clusters, as the major source of variation for the expression of TDr-9 rDNA-2 and Algutsrum rDNA-4, respectively (Fig. 5b and c). These rDNA clusters, both dominant in their parental lines, are expressed as long as they are inherited: even when both rDNA clusters are present, their rRNAs co-exist (they are co-dominant) (Fig. 5f). However, expression of the rDNA clusters that were silenced in the parental lines, i.e. Algutsrum rDNA-2 and TDr-9 rDNA-4, is clearly influenced by the full two-locus genotype (Fig. 5d and e). Both rDNA clusters are recessive relative to their “allelic” rDNA counterparts: they are only expressed in the presence of each other in homozygous state (Fig. 5f).

In fact, these results are consistent with the study of Riddle and Richards [38], who detected an epistatic interaction between rDNA loci when mapping overall levels of DNA methylation at rDNAs in a cross between Can-0 and Col-0. However, in an F₂ population derived from a cross between Cvi-0 and Col-0 rDNA methylation levels segregated as a Mendelian additive trait. Since DNA methylation and repressive chromatin modifications are needed for rRNA gene silencing [14, 52–55], it is plausible that this study indirectly mapped overall rRNA expression, and that an interaction was detected in the former population because Can-0 is dominant for rDNA-2 (Additional file 5: Figure S1) and Col-0 is dominant for rDNA-4 (Fig. 1), while in the second population no interaction was detected because both parents (Col-0 and Cvi-0) are dominant for rDNA-4 (Additional file 2: Table S3, Additional file 5: Figure S4 and Additional file 7).

Discussion

We are discovering huge amounts of variation in 45S rRNA genes on every level. At the gross level of total copy number, variation in 45S rRNA gene copy number is largely responsible for an over 10% variation in genome size among A. thaliana accessions [35] and the relative size of the two rDNA clusters varies greatly among accessions [36]. At the sequence level, there is variation in the conserved catalytic subunits themselves both within and among accessions. Furthermore, these rRNA gene variants readily express and make for a heterogenous rRNA pool in the cell, the functional significance of which is completely unknown. Ribosome heterogeneity has been studied mainly in the context of the regulation of the ribosomal proteins, the diversity and activity of other ribosome-associated factors and, although not fully understood, the modifications that the rRNA subunits suffer after transcription [56, 57]. In eukaryotes, there have been few attempts to study heterogeneity at the sequence level of the rRNA subunits: in the parasite Plasmodium two structurally distinct 18S rRNAs are differentially expressed during its life cycle [58, 59]; in humans, the 28S rRNAs have been shown to be heterogeneous in both mono- and polysomal fractions [26, 60]; similarly, in both the sea urchin Paracentrotus lividus [61] and A. thaliana several transcribed 5S rRNA variants are readily incorporated in functional ribosomes [62]. Our study provides the most comprehensive catalogue of rRNA gene variants to date, and will hopefully be useful for investigating their possible adaptive role, either at the level of transcriptional regulation, rRNA stability or translational efficiency.

Irrespective of their functional significance, these variants, although rarely homogenized throughout an entire rDNA cluster, can be used as markers of the expression of a particular rDNA cluster. Our findings make it clear that the silencing phenomenon known as nucleolar dominance occurs both within [32] and among natural lines of A. thaliana [51]. Furthermore, we demonstrate that dominance is a property neither of the parental strain nor of the chromosomal position (i.e. chromosome 2 versus 4), but rather of the specific “allelic” content of each rRNA gene cluster—including, perhaps, flanking DNA. Indeed, a recent study implicates centromere-proximal sequences in this regulation [63].

However, the molecular basis of the epistatic and allelic interactions among rDNA clusters remains unknown. Interestingly, rDNAs derived from different species in the genus Brassica follow a hierarchical dominance relationship [13] similar to the one among the many alleles at the self-incompatibility locus (S-locus) in Brassicaceae [64]. In Arabidopsis halleri, dominance at the S-locus is largely controlled by a set of small non-coding RNAs produced by dominant S-alleles that target a repertoire of more recessive S-alleles resulting in their epigenetic silencing [65, 66]. Since uniparental rRNA gene silencing involves short interfering RNAs (siRNAs)-directed DNA methylation (RdDM) pathway proteins in the hybrid plant Arabidopsis suecica [67, 68] and there is evidence that non-coding RNAs can act in trans to silence other rRNAs gene repeats in mice [69–71], it is tempting to speculate that a similar mechanism to the one described for the S-locus might explain how the rDNA clusters “talk” to each other. Surprisingly, in A. thaliana Col-0 RdDM pathway mutants RNA polymerase IV (nrpd1), Dicer-like-3 (dcl2/3/4), and DNA methyltransferases DRM1 and 2 (drm1/2) have, if any, a negligible effect on disrupting silencing of rDNA-2 (Additional file 5: Figure S5; [11]). In contrast, DNA maintenance methyltransferase MET1 (the ortholog of mammalian DNMT1), which is responsible for cytosine methylation in the CG context independently of siRNAs, is needed to silence rDNA-2 [11]. However, these results do not necessarily exclude the involvement of the RdDM pathway in the silencing of rDNA clusters in A. thaliana. In A. suecica, for instance, re-establishment of nucleolar dominance was observed in T2 progeny of one of the three RNA-dependent RNA polymerase RDR2-RNAi lines [67]. As appears to be the case for transposable elements, the establishment of silencing may well be distinct from its maintenance and many semi-redundant mechanisms may be involved [72].

Conclusions

We show here that rRNA gene expression in A. thaliana varies greatly between accessions, with some expressing one rDNA cluster, some the other, and some both. We further show that this expression is regulated via complex epistatic and allelic interactions between rDNA cluster haplotypes that apparently control the entire array via an unknown mechanism. Our paper represents a major new finding in the regulation of rRNA genes and points to several exciting questions about the underlying molecular mechanisms and evolutionary rationale for their existence, especially in context of our recent discovery of massive copy-number variation between accessions.

Methods

Plant material and growth conditions

For expression analyses corresponding to the founders of the MAGIC population [49], MAGIC lines [73], and accession Cvi-0 [74, 75], we obtained publicly available messenger RNA (mRNA)-seq data (an overview of the library types and read lengths are provided in Additional file 8). Similarly, for identification of rRNA gene variants at the DNA level in natural accessions [40, 49, 76], the MAGIC population [73], and the Cvi-0 X Ler-0 RIL population [36], we used data released elsewhere (Additional file 8).

For the F₁ crosses (♀ Col-0 x ♂ Bur-0, ♀ Bur-0 x ♂ Col-0, ♀ Col-0 x ♂ Sf-2, ♀ Sf-2 x ♂ Col-0, and ♀ Bur-0 x ♂ Sf-2), we harvested leaves from one two-week-old and three three-week-old plants per cross for total RNA-sequencing (RNA-seq). Additionally, we DNA-sequenced two individuals per cross.

The F₂ population was derived from the selfed F₁ progeny of a cross between accessions ♀ Algutsrum (8230) x ♂ TDr-9 (6195). F₂ seeds were stratified for five days in 0.1% agarose at 4 °C. For RNA isolation, aerial portions of the plants were harvested at the nine true leaf stage 7.5–8.5 h into the light period. We sequenced individuals of the F₂ population with two different protocols: one designed for mRNA-seq (183 individuals) and another one for total RNA-seq (a subset of 162 individuals, see below). For DNA-sequencing, we harvested leaves from other 16 F₂ individuals.

Mutant lines dcl2/3/4 (alleles dcl2-1, dcl3-1, and dcl4-2) and nrpd1a-3 used in this study have been described previously [77–80]. We harvested leaves from three-week-old plants in three biological replicates. All plants were grown under long photoperiod conditions (16 h light and 8 h dark) at 16 °C.

RNA isolation and library preparation

For the F₁ crosses and the RdDM mutants, RNA was isolated with TRIzol® Reagent (#15596-018, Ambion®) according to manufacturer’s instructions. Contaminating genomic DNA was removed with the Turbo DNA-free™ kit (#AM1907, Ambion®), then TURBO DNase was inactivated with DNase Inactivation Reagent and centrifuged at 10,000 × g for 2 min. RNA concentration was quantified using a NanoDrop 1000 spectrophotometer (Thermo Scientific).

For the F₂ population, RNA was isolated using the methods described in Gan et al. (supplementary methods, 7.2) [49]. Briefly, individual frozen plants were ground in liquid nitrogen using a mortar and pestle and RNA was isolated using PureLink Plant RNA Reagent (#12322-012, Invitrogen, Carlsbad, CA, USA) following the manufacturer’s instructions. Precipitated RNA was resuspended in RNAsecure Resuspension Solution (#AM7010, Ambion®; Life Technologies, Carlsbad, CA, USA) following manufacturer’s guidelines to prevent RNA degradation. Contaminating genomic DNA was removed by incubating samples with Turbo DNase (#AM2238, Ambion®) for 15 min at 37 °C. Finally, RNA was precipitated using 2.5 volume of 100% ethanol, 0.1 volume 5 M Ammonium Acetate, and 1 μL glycogen (5 mg/mL) and resuspended in RNase-free water. RNA concentration was quantified using a NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA) and RNA integrity was determined using an RNA 6000 nano assay (Bioanalyzer 2100, Agilent Technologies, Palo Alto, CA, USA).

F₂ mRNA-seq library preparation and sequencing

RNA-seq libraries were constructed using the Illumina TruSeq RNA Sample Preparation Kit V2 (# 15026495 Rev. C). Two kits of each barcode set were used for a total of 183 libraries. Libraries were prepared following the manufacturer’s instructions. Briefly, poly-A mRNA was purified from 1 μg of isolated total RNA using poly-T oligo-attached magnetic beads. Purified RNA was fragmented into approximately 120–210 bp fragments (average size ~155 bp) using divalent cations at 94 °C for 8 min. After fragmentation, first strand complementary DNA (cDNA) synthesis was performed using random primers and reverse transcriptase, followed by second strand cDNA synthesis using DNA polymerase I and RNase H. The cDNA fragments then went through end-repair and overhang A addition reactions followed by the ligation of the RNA Adapters. Finally, the products were purified and enriched using 12 polymerase chain reaction (PCR) cycles.

Libraries were validated using the DNA 1000 assay (Bioanalyzer 2100, Agilent Technologies, Palo Alto, CA, USA) and library concentrations were determined for all cDNA fragments between 200 and 700 bp. Libraries were normalized to 10 nM. Sets of 24 barcoded libraries were pooled into a single lane of an Illumina HiSeq2000 instrument and run as 50 bp single-end (SE) reads. Samples were run at the Microarray Core Facility at the Huntsman Cancer Institute (University of Utah, USA).

F₁ and F₂ total RNA-seq library preparation and sequencing

We prepared libraries using the SENSE Total RNA-Seq Library Prep Kit (Lexogen) according to the manufacturer’s instructions with the following specifications: 200 ng of total RNA were used as input; the sample was incubated at 37 °C for 110 min. In the reverse transcription and ligation step, after the second strand synthesis step, 27 μL of Purification Solution were added in the first purification step to increase the fraction of inserts < 200 nt, and 16 cycles were used during the amplification step.

Libraries were validated with a Fragment Analyzer™ Automated CE System (Advanced Analytical), concentrations were determined for all fragments between 200 and 700 bp, and pooled in two sets at equimolar concentration for 96-multiplex sequencing. Libraries were sequenced on an Illumina HiSeq™ 2500 Analyzer using manufacturer’s standard cluster generation and sequencing protocols in 100 bp SE mode. Samples were run at the at the Vienna Biocenter Core Facilities (VBCF) NGS unit in Vienna, Austria (http://www.vbcf.ac.at).

Removal of adapter contamination was performed with BBDuk from the BBMap package (v35.10; B. Bushnell, http://sourceforge.net/projects/bbmap/). The first 11 and the last six nucleotides of every read were further removed with Seqtk (H. Li, https://github.com/lh3/seqtk). Finally, reads shorter than 50 bp were removed with Trimmomatic (v0.33) [81].

DNA isolation and library preparation

We extracted DNA with the NucleoMag® 96 Plant (Macherey-Nagel) kit according to the manufacturer’s instructions. We prepared libraries using the Illumina Nextera™ Kit. Standard Nextera library construction was modified to reduce volume in the protocol. Briefly, tagmentation reaction was set up to 2.5 μL final volume with 2.5 ng of input DNA. For PCR amplification and multiplexing, we used Nextera Index Kit (Illumina) indexes. Size selection and PCR clean-up were performed with Agencourt AMPure XP Beads (Beckman Coulter). After PCR enrichment, libraries were validated with Fragment Analyzer™ Automated CE System (Advanced Analytical) and pooled in equimolar concentration for 96X-multiplex. Libraries were sequenced on Illumina HiSeq™ 2500 using manufacturer’s standard cluster generation and sequencing protocols in 125 bp paired-end (PE) mode at the VBCF in Vienna, Austria.

Identification of 45S rRNA gene variants at the DNA level

1138 natural accessions

In addition to the 1135 accessions sequenced by the 1001 Genomes Consortium [40], we analyzed accessions Mt-0 (6939), Po-0 (7308), and Hi-0 (8304), which are among the 19 founders of the MAGIC population [49]. We also substituted the reference accession Col-0 (6909) from the 1001 Genomes Consortium for another Col-0 of better sequencing quality [76].

For each genome, we performed 3’ adapter removal (either TruSeq or Nextera), quality trimming (quality 15 and 10 for 5’ and 3’-ends, respectively) and N-end trimming with cutadapt (v1.9) [82]. Then, we mapped all PE reads separately to a single 45S rRNA gene reference (described in [36]) and to the A. thaliana TAIR10 reference genome with BWA-MEM (v0.7.8) [83, 84]. We used Samtools (v0.1.18) to convert file formats [85] and Sambamba (v0.6.3) to sort and index bam files [86], we removed duplicated reads with Markduplicates from Picard (v1.101) (http://broadinstitute.github.io/picard/), and we performed local realignment around indels with GATK/RealignerTargetCreator and GATK/IndelRealigner functions from GATK (v3.5) [87, 88]. Due to the heterogeneity of Illumina platforms used to sequence these genomes we conducted a base quality recalibration step. We produced base recalibration reports based on the alignment to TAIR10 reference by providing known indels and SNPs from the 1001 Genomes Consortium to the function GATK/BaseRecalibrator; recalibrated base qualities were updated in the reads aligned to the 45S rDNA reference with function GATK/PrintReads in BQSR mode.

To detect polymorphisms along the 45S rDNA we retrieved per-site information with both the function variation_strand from the python package pysamstats (v0.24.2; A. Miles, https://github.com/alimanfoo/pysamstats) and a patched version of it that filters out bases with base quality lower than 20. While we used the output of the former to count the proportion of alternative alleles at each reference position, we used the output of the latter to calculate the strand bias (SB) score at each variable site according to the formula: \( \left|\;\frac{b}{a+ b}-\frac{d}{c+ d}\right|/\left(\frac{b+ d}{a+ b+ c+ d}\right) \), where \( a \) and \( c \) represent the forward and reverse strands’ counts for the major allele, respectively, and \( b \) and \( d \) represent the forward and reverse strands’ counts for the minor allele, respectively [89]. For each accession, we excluded from further analyses (i.e. mapping in the MAGIC population and expression analysis) variable sites with a SB score higher than 0.8. Similarly, we excluded alternative variants supported by less than 5% of the reads spanning that position within a given genome. We only report variants lying along a 7.7 kb transcribed portion of the 45S rDNA, spanning from the minimal promoter to the end of the 25S rRNA subunit in our 45S reference gene (300–8009 bp) [36]. Finally, at sites for which an alternative variant is more frequent than the reference allele, we also analyzed the reference allele (Additional file 1).

MAGIC lines, F₁ individuals, F₂ population, and RILs

We mapped low-coverage DNA-seq PE-reads of 393 MAGIC individuals, 10 F₁s, 16 F₂s, and eight RILs (see below) as described above for data from the 1001 Genomes Consortium, with the exception of the base recalibration step. Similarly, to detect polymorphisms we employed the same pipeline described above and only report variants that have been validated in the original parental accessions.

Annotation of rDNA cluster-specific variants

Founders of the MAGIC population

We focused on variants shared by seven or fewer of the 19 founder accessions and used two complementary approaches. First, we simply performed linkage mapping and multiple imputation of each variant in the 393 individuals of the MAGIC population with R/happy [47, 90]. Second, we used the software “reconstruction” [73] to infer the genotypes at the top of chromosomes 2 and 4, and compared this result with the information recovered from the rRNA gene variants themselves in each MAGIC line (see above; Additional file 3). For variants shared by few accessions the results of the first analysis alone were sufficient to determine rDNA cluster-specificity (Fig. 1c and d); however, for variants shared among many accessions both analyses were required to produce unambiguous results and correct wrongly inferred genotypes. Accession Mt-0 (6939) presented several discrepancies between variants found in the founder line and those found in MAGIC lines, thus was excluded from the study. rDNA cluster-specific markers for the founder accessions are reported in Additional file 2: Table S1.

Parents of the F₂ population

We selected variants supported by at least 10% of the reads spanning that position in a parental line and absent in the other. We performed genotyping by low-coverage DNA-sequencing as described in Rabanal et al. [36] binning SNP markers in 100 kb windows. rDNA cluster-specificity was unambiguously inferred from the predicted genotypes at the top of chromosomes 2 and 4 (Additional file 6). rDNA cluster-specific markers for the parental accessions are reported in Additional file 2: Table S2.

Accession Cvi-0

We selected variants present in Cvi-0 and absent in Ler-0. rDNA cluster-specificity was unambiguously inferred from the predicted genotypes at the top of chromosomes 2 and 4 in 16 re-sequenced RILs (CVL5, CVL9, CVL10, CVL13, CVL17, CVL20, CVL34, and CVL38) derived from a cross between Cvi-0 and Ler-0 [36, 91] (Additional file 7). rDNA cluster-specific markers for accession Cvi-0 are reported in Additional file 2: Table S3.

Expression of 45S rRNA gene variants

To avoid any bias derived from aligning or calling variants, we mapped reads (whether single-end or PE) as described for low-coverage DNA-sequencing reads, with the exception of the removal of duplicated reads step. To detect polymorphisms, due to the heterogeneous nature of the multiple datasets (strand-specific and non-strand-specific), no filter for SB was applied. However, we only report variants at the RNA level that have been validated as variants at the DNA level. In addition, we applied a minimum cutoff of 25 reads covering a given variable site to report it at the RNA level.

Despite selection with poly-T oligo-attached beads during library preparation, the mRNA-seq datasets used in this study still contain at least 6.1% of reads mapping to the 45S rRNA gene, on average (see Additional file 8). This should be contrasted with libraries from total RNA, which contain more than 60% of reads mapping to the 45S rRNA gene, on average. Low expressed ETS and ITS regions in particular are differentially covered by each type of library. Nonetheless, results based on well-covered variants are consistent between the different types of libraries.

Statistical genetics in the F₂ population

Genotyping by RNA-seq

Reads from the individual F₂ samples were aligned to the TAIR10 reference genome and annotation using the Bowtie (v2.1.0.0)/TopHat (v2.0.8b) [92, 93] pipeline with the following command line parameters: ‘-a 5 -i 5 -I 32000 --b2-very-sensitive --segment-mismatches 2 -g 1’. Employing the Pysam package (v0.7.5; A. Heger, https://github.com/pysam-developers/pysam), we assessed genotypes in each of the 183 F₂ samples at 266,663 SNP sites polymorphic between the two parents in genes in the TAIR10 annotation, i.e. genotypic data were assessed as the base identities in RNA-seq reads crossing a polymorphic position; the input genotypic data for TDr-9 and Algutsrum were obtained from a previous study [35]. For assessing genotypes at SNP positions near splice sites (e.g. a read aligned across two exons, for which alignment uncertainties are higher), aligned RNA-seq segments exceeding 3 bp were enforced to make a genotypic call; further, SNPs that were called as homozygous for one parent at a rate exceeding tenfold that of the other parent across all 183 samples were also removed from downstream analysis (these may reflect the impact of polymorphisms on read alignments, or alternatively allele-specific expression resulting from structural or other genetic variation, a confounding factor for genotyping with RNA-seq as opposed to DNA-seq reads). Non-overlapping windows of 50 SNPs were used to classify the genomes of each F₂ individual into heterozygous and homozygous regions. Briefly, for an individual SNP position to be defined as homozygous, over 90% of the RNA-seq reads crossing it had to come from one parent and for the non-overlapping windows of 50 successive SNPs to be classified as homozygous, over 85% of the SNPs in the window had to be assigned to the same parent (else, a window was classified as heterozygous). In F₂ individuals, only several recombination events are expected per chromosome; thus, long tracks of homozygosity or heterozygosity are expected, and were observed as revealed in plots of the window data across each chromosome for each F₂ sample. This identified chromosome regions of the same genotype and the breakpoints between homozygous and heterozygous genomic intervals were then manually refined using the genotypic information from single SNPs and/or by inspection of read alignments for given F₂ samples in IGV [94]. The result of this analysis was that each region of the genome in each F₂ individual was classified as either homozygous for one or the other parent or alternatively heterozygous. Based on these ranges, the intervening genotypes at all variable positions between the two parents were imputed for use in genetic mapping (535,800 single nucleotide positions in total).

Linkage mapping of rDNA cluster expression in the F₂ population

We performed linkage mapping of the expression of each rDNA cluster-specific variant with the values from the RNA sequencing protocol that covered each variant for the most number of individuals. First, we reduced the original genetic map from 535,800 segregating SNPs to 1394 after dropping those with identical genotype data. Simple interval mapping (SIM) was performed with the R package R/qtl [95]. We further reduced the genetic map to 366 markers with the function pickMarkerSubset with minimum distance of 1. With the latter subset Multiple QTL mapping (MQM) was performed with a 2 centimorgan step size and 20 as window size [96]. One thousand permutations were applied to estimate genome wide significance.

Fluorescence in situ hybridization (FISH)

We germinated seeds on filter paper soaked in distilled water in a Petri dish at 21 °C. Leaves of two-week-old seedlings grown under long photoperiod conditions (16 h light at 21 °C and 8 h dark at 18 °C) were fixed in ethanol:acetic acid (3:1) fixative at 4 °C for 24 h. The preparation of nuclei spreads from the fixed leaves and FISH followed the protocols published by Mandáková and Lysak [97, 98] with some modifications. Briefly, fixed leaves were rinsed in distilled water and 1 citrate buffer (10 mM sodium citrate, pH 4.8), and digested by 0.3% pectolytic enzymes (cellulase, cytohelicase, and pectolyase) in 1× citrate buffer at 37 °C for 20 min. Digested leaves were placed on a microscopic slide by a Pasteur pipette, disintegrated by a needle in a small drop of 1 citrate buffer, and the material spread in 20 μL of 60% acetic acid on a hot plate (50 °C) for 2 min. After the material was fixed on the slide using 100 μL of the ethanol:acetic acid (3:1) fixative, the slide was tilted and dried using a hair dryer. Prior to FISH, the slide was pretreated by ribonuclease A (100 μg/mL in distilled water) at 37 °C for 1 h and by pepsin (0.1 mg/mL in 10 mM HCl) for at 37 °C for 1 min, and postfixed in 4% formaldehyde in 2× SSC (20× SSC: 3 M NaCl in 0.3 M sodium citrate, pH 7.0) at room temperature for 10 min. The slide was rinsed in 2× SSC between the steps and eventually dehydrated in an ethanol series (70%, 80%, and 96% ethanol, 3 min each). To identify the rDNA clusters, A. thaliana BAC clone T15P10 containing 45S rRNA genes was used. To identify A. thaliana chromosomes 2 and 4, 11 BAC clones from the upper arm of chromosome 2 (F2I9, T8O11, T23O15, F14H20, F5O4, T8K22, F3C11, F16J10, T3P4, T6P5, and T25N22), and 15 BACs from the upper arm of chromosome 4 (F6N15, F5I10, T18A10, F3D13, T15B16, T10M13, T14P8, T5J8, F4C21, F9H3, T27D20, T19B17, T26N6, T19J18, and T1J1) were used. The 45S rRNA gene probe was labeled with Cy3-dUTP, chromosome 2 BACs with biotin-Dutp, and chromosome 4 BAC clones with digoxigenin-dUTP by nick translation [98]. A total of 100 ng from each labeled BAC DNA were pooled together, ethanol precipitated, dissolved in 20 μL of 50% formamide in 10% dextran sulfate in 2× SSC and pipetted on the selected spreads of nuclei. The nuclei and DNA probe were denatured together at 80 °C for 2 min and the slide incubated at 37 °C overnight. Hybridized probes were visualized either as the direct fluorescence of Cy3-dUTP (yellow) or through fluorescently labeled antibodies against biotin-dUTP (red) and digoxigenin-dUTP (green). After FISH, the slide was counterstained with 4,6-diamidino-2-phenylindole (DAPI, 2 μg/mL) in Vectashield antifade. Fluorescence signals were analyzed and photographed using a Zeiss Axioimager epifluorescence microscope and a CoolCube camera, and pseudocolored using Adobe Photoshop CS5 software. Darker, less DAPI-stained areas within the photographed nuclei, corresponding to the nucleoli, were demarcated in Adobe Photoshop. rDNA clusters (visualized by the 45S rRNA gene probe) located in the immediate proximity to the nucleoli were counted and evaluated (Additional file 4).

Reverse transcription polymerase chain reaction (RT-PCR)

Semi-quantitative RT-PCR was performed using cDNA generated with SuperScript III Reverse Transcriptase (Invitrogen) using random hexamers according to the manufacturer’s instructions from 400 ng of total RNA. Two microliters of cDNA were used for PCR amplification of rRNA gene 3' variable region (VAR1-4; 30 cycles) and ACT2 (26 cycles). For VAR1-4 [28] we used primers 5'-GAG ACA GAC TTG TCC AAA ACG CCC AC-3' and 5'-CTG GTC GAG GAA TCC TGG ACG ATT-3', while for ACT2 primers 5'-AAG TCA TAA CCA TCG GAG CTG-3' and 5'-ACC AGA TAA GAC AAG ACA CAC-3' [11].

References

Woese CR, Fox GE. The concept of cellular evolution. J Mol Evol. 1977;10:1–6.
Article CAS PubMed Google Scholar
Gilbert W. Origin of life: The RNA world. Nature. 1986;319:618.
Article Google Scholar
Siefert JL, Martin KA, Abdi F, Widger WR, Fox GE. Conserved gene clusters in bacterial genomes provide further support for the primacy of RNA. J Mol Evol. 1997;45:467–72.
Article CAS PubMed Google Scholar
Forterre P. The two ages of the RNA world, and the transition to the DNA world: a story of viruses and cells. Biochimie. 2005;87:793–803.
Article CAS PubMed Google Scholar
Warner JR. The economics of ribosome biosynthesis in yeast. Trends Biochem Sci. 1999;24:437–40.
Article CAS PubMed Google Scholar
Moss T, Langlois F, Gagnon-Kugler T, Stefanovsky V. A housekeeper with power of attorney: the rRNA genes in ribosome biogenesis. Cell Mol Life Sci. 2007;64:29–49.
Article CAS PubMed Google Scholar
Layat E, Sáez-Vásquez J, Tourmente S. Regulation of Pol I-transcribed 45S rDNA and Pol III-transcribed 5S rDNA in Arabidopsis. Plant Cell Physiol. 2012;53:267–76.
Article CAS PubMed Google Scholar
Ritossa FM, Spiegelman S. Localization of DNA complementary to ribosomal RNA in the nucleolus organizer Region of Drosophila Melanogaster. Proc Natl Acad Sci U S A. 1965;53:737–45.
Article CAS PubMed PubMed Central Google Scholar
Wallace H, Birnstiel ML. Ribosomal cistrons and the nucleolar organizer. Biochim Biophys Acta. 1966;114:296–310.
Article CAS PubMed Google Scholar
Long EO, Dawid IB. Repeated genes in eukaryotes. Annu Rev Biochem. 1980;49:727–64.
Article CAS PubMed Google Scholar
Pontvianne F, Blevins T, Chandrasekhara C, Mozgová I, Hassel C, Pontes OMF, et al. Subnuclear partitioning of rRNA genes between the nucleolus and nucleoplasm reflects alternative epiallelic states. Genes Dev. 2013;27:1545–50.
Article CAS PubMed PubMed Central Google Scholar
Grummt I, Pikaard CS. Epigenetic silencing of RNA polymerase I transcription. Nat Rev Mol Cell Biol. 2003;4:641–9.
Article CAS PubMed Google Scholar
Chen ZJ, Pikaard CS. Transcriptional analysis of nucleolar dominance in polyploid plants: biased expression/silencing of progenitor rRNA genes is developmentally regulated in Brassica. Proc Natl Acad Sci U S A. 1997;94:3442–7.
Article CAS PubMed PubMed Central Google Scholar
Santoro R, Grummt I. Molecular mechanisms mediating methylation-dependent silencing of ribosomal gene transcription. Mol Cell. 2001;8:719–25.
Article CAS PubMed Google Scholar
Navashin M. Chromosome alterations caused by hybridization and their bearing upon certain general genetic problems. Cytologia. 1934;5:169–203.
Article Google Scholar
Honjo T, Reeder RH. Preferential transcription of Xenopus laevis ribosomal RNA in interspecies hybrids between Xenopus laevis and Xenopus mulleri. J Mol Biol. 1973;80:217–28.
Article CAS PubMed Google Scholar
Martini G, O’Dell M, Flavell RB. Partial inactivation of wheat nucleolus organisers by the nucleolus organiser chromosomes from Aegilops umbellulata. Chromosoma. 1982;84:687–700.
Article Google Scholar
Flavell RB, O’Dell M, Thompson WF. Regulation of cytosine methylation in ribosomal DNA and nucleolus organizer expression in wheat. J Mol Biol. 1988;204:523–34.
Article CAS PubMed Google Scholar
Crosby AR. Nucleolar activity of lagging chromosomes in wheat. Am J Bot. 1957;44:813–22.
Article Google Scholar
Sears LMS, Lee-Chen S. Cytogenetic studies in Arabidopsis thaliana. Can J Genet Cytol. 1970;12:217–23.
Article Google Scholar
Henderson AS, Warburton D, Atwood KC. Location of ribosomal DNA in the human chromosome complement. Proc Natl Acad Sci U S A. 1972;69:3394–8.
Article CAS PubMed PubMed Central Google Scholar
Henderson AS, Eicher EM, Yu MT, Atwood KC. The chromosomal location of ribosomal DNA in the mouse. Chromosoma. 1974;49:155–60.
Article CAS PubMed Google Scholar
Sola L, Gornung E. Classical and molecular cytogenetics of the zebrafish, Danio rerio (Cyprinidae, Cypriniformes): an overview. Genetica. 2001;111:397–412.
Article CAS PubMed Google Scholar
de Capoa A, Marlekaj P, Baldini A, Rocchi M, Archidiacono N. Cytologic demonstration of differential activity of rRNA gene clusters in different human tissues. Hum Genet. 1985;69:212–7.
Article PubMed Google Scholar
Flavell RB, O’Dell M, Thompson WF, Vincentz M, Sardana R, Barker RF. The differential expression of ribosomal RNA genes. Philos Trans R Soc Lond B Biol Sci. 1986;314:385–97.
Article CAS Google Scholar
Kuo BA, Gonzalez IL, Gillespie DA, Sylvester JE. Human ribosomal RNA variants from a single individual and their expression in different tissues. Nucleic Acids Res. 1996;24:4817–24.
Article CAS PubMed PubMed Central Google Scholar
Tseng H, Chou W, Wang J, Zhang X, Zhang S, Schultz RM. Mouse ribosomal RNA genes contain multiple differentially regulated variants. PLoS One. 2008;3, e1843.
Article PubMed PubMed Central Google Scholar
Pontvianne F, Abou-Ellail M, Douet J, Comella P, Matia I, Chandrasekhara C, et al. Nucleolin is required for DNA methylation state and the expression of rRNA gene variants in Arabidopsis thaliana. PLoS Genet. 2010;6, e1001225.
Article PubMed PubMed Central Google Scholar
Gonzalez IL, Sylvester JE. Human rDNA: evolutionary patterns within the genes and tandem arrays derived from multiple chromosomes. Genomics. 2001;73:255–63.
Article CAS PubMed Google Scholar
Copenhaver GP, Doelling JH, Gens S, Pikaard CS. Use of RFLPs larger than 100 kbp to map the position and internal organization of the nucleolus organizer region on chromosome 2 in Arabidopsis thaliana. Plant J. 1995;7:273–86.
Article CAS PubMed Google Scholar
Copenhaver GP, Pikaard CS. RFLP and physical mapping with an rDNA-specific endonuclease reveals that nucleolus organizer regions of Arabidopsis thaliana adjoin the telomeres on chromosomes 2 and 4. Plant J. 1996;9:259–72.
Article CAS PubMed Google Scholar
Chandrasekhara C, Mohannath G, Blevins T, Pontvianne F, Pikaard CS. Chromosome-specific NOR inactivation explains selective rRNA gene silencing and dosage control in Arabidopsis. Genes Dev. 2016;30:177–90.
CAS PubMed PubMed Central Google Scholar
Schmuths H, Meister A, Horres R, Bachmann K. Genome Size Variation among Accessions of Arabidopsis thaliana. Ann Bot. 2004;93:317–21.
Article CAS PubMed PubMed Central Google Scholar
Davison J, Tyagi A, Comai L. Large-scale polymorphism of heterochromatic repeats in the DNA of Arabidopsis thaliana. BMC Plant Biol. 2007;7:44.
Article PubMed PubMed Central Google Scholar
Long Q, Rabanal FA, Meng D, Huber CD, Farlow A, Platzer A, et al. Massive genomic variation and strong selection in Arabidopsis thaliana lines from Sweden. Nat Genet. 2013;45:884–90.
Article CAS PubMed PubMed Central Google Scholar
Rabanal FA, Nizhynska V, Mandáková T, Novikova PY, Lysak MA, Mott R, et al. Unstable inheritance of 45S rRNA genes in Arabidopsis thaliana. G3 (Bethesda). doi:10.1534/g3.117.040204.
Riddle NC, Richards EJ. The control of natural variation in cytosine methylation in Arabidopsis. Genetics. 2002;162:355–63.
CAS PubMed PubMed Central Google Scholar
Riddle NC, Richards EJ. Genetic variation in epigenetic inheritance of ribosomal RNA gene methylation in Arabidopsis. Plant J. 2005;41:524–32.
Article CAS PubMed Google Scholar
Woo HR, Richards EJ. Natural variation in DNA methylation in ribosomal RNA genes of Arabidopsis thaliana. BMC Plant Biol. 2008;8:92.
Article PubMed PubMed Central Google Scholar
1001 Genomes Consortium. 1,135 genomes reveal the global pattern of polymorphism in Arabidopsis thaliana. Cell. 2016;166:481–91.
Article Google Scholar
Brown DD, Wensink PC, Jordan E. A comparison of the ribosomal DNA’s of Xenopus laevis and Xenopus mulleri: the evolution of tandem genes. J Mol Biol. 1972;63:57–73.
Article CAS PubMed Google Scholar
Arnheim N, Krystal M, Schmickel R, Wilson G, Ryder O, Zimmer E. Molecular evidence for genetic exchanges among ribosomal genes on nonhomologous chromosomes in man and apes. Proc Natl Acad Sci U S A. 1980;77:7323–7.
Article CAS PubMed PubMed Central Google Scholar
Dover G. Molecular drive: a cohesive mode of species evolution. Nature. 1982;299:111–7.
Article CAS PubMed Google Scholar
Arnheim N. Concerted evolution of multigene families. In: Nei M, Koehn RK, editors. Evolution of Genes and Proteins. Sunderland, MA: Sinauer; 1983. p. 38–61.
Google Scholar
Nei M, Rooney AP. Concerted and birth-and-death evolution of multigene families. Annu Rev Genet. 2005;39:121–52.
Article CAS PubMed PubMed Central Google Scholar
Havlová K, Dvořáčková M, Peiro R, Abia D, Mozgová I, Vansáčová L, et al. Variation of 45S rDNA intergenic spacers in Arabidopsis thaliana. Plant Mol Biol [Internet]. 2016;92(4–5):457–71.
Article Google Scholar
Kover PX, Valdar W, Trakalo J, Scarcelli N, Ehrenreich IM, Purugganan MD, et al. A Multiparent advanced generation inter-cross to fine-map quantitative traits in Arabidopsis thaliana. PLoS Genet. 2009;5, e1000551.
Article PubMed PubMed Central Google Scholar
Copenhaver GP, Browne WE, Preuss D. Assaying genome-wide recombination and centromere functions with Arabidopsis tetrads. Proc Natl Acad Sci. 1998;95:247–52.
Article CAS PubMed PubMed Central Google Scholar
Gan X, Stegle O, Behr J, Steffen JG, Drewe P, Hildebrand KL, et al. Multiple reference genomes and transcriptomes for Arabidopsis thaliana. Nature. 2011;477:419–23.
Article CAS PubMed PubMed Central Google Scholar
Scarcelli N, Cheverud JM, Schaal BA, Kover PX. Antagonistic pleiotropic effects reduce the potential adaptive value of the FRIGIDA locus. Proc Natl Acad Sci U S A. 2007;104:16986–91.
Article CAS PubMed PubMed Central Google Scholar
Lewis MS, Cheverud JM, Pikaard CS. Evidence for nucleolus organizer regions as the units of regulation in nucleolar dominance in Arabidopsis thaliana interecotype hybrids. Genetics. 2004;167:931–9.
Article CAS PubMed PubMed Central Google Scholar
Neves N, Heslop-Harrison JS, Viegas W. rRNA gene activity and control of expression mediated by methylation and imprinting during embryo development in wheat x rye hybrids. Theor Appl Genet. 1995;91:529–33.
Article CAS PubMed Google Scholar
Chen ZJ, Pikaard CS. Epigenetic silencing of RNA polymerase I transcription: a role for DNA methylation and histone modification in nucleolar dominance. Genes Dev. 1997;11:2124–36.
Article CAS PubMed PubMed Central Google Scholar
Santoro R, Li J, Grummt I. The nucleolar remodeling complex NoRC mediates heterochromatin formation and silencing of ribosomal gene transcription. Nat Genet. 2002;32:393–6.
Article CAS PubMed Google Scholar
Lawrence RJ, Earley K, Pontes O, Silva M, Chen ZJ, Neves N, et al. A concerted DNA methylation/histone methylation switch regulates rRNA gene dosage control and nucleolar dominance. Mol Cell. 2004;13:599–609.
Article CAS PubMed Google Scholar
Xue S, Barna M. Specialized ribosomes: a new frontier in gene regulation and organismal biology. Nat Rev Mol Cell Biol. 2012;13:355–69.
Article CAS PubMed PubMed Central Google Scholar
Sauert M, Temmel H, Moll I. Heterogeneity of the translational machinery: Variations on a common theme. Biochimie. 2015;114:39–47.
Article CAS PubMed Google Scholar
Gunderson JH, Sogin ML, Wollett G, Hollingdale M, de la Cruz VF, Waters AP, et al. Structurally distinct, stage-specific ribosomes occur in Plasmodium. Science. 1987;238:933–7.
Article CAS PubMed Google Scholar
Waters AP, Syin C, McCutchan TF. Developmental regulation of stage-specific ribosome populations in Plasmodium. Nature. 1989;342:438–40.
Article CAS PubMed Google Scholar
Gonzalez IL, Sylvester JE, Schmickel RD. Human 28S ribosomal RNA sequence heterogeneity. Nucleic Acids Res. 1988;16:10213–24.
Article CAS PubMed PubMed Central Google Scholar
Dimarco E, Cascone E, Bellavia D, Caradonna F. Functional variants of 5S rRNA in the ribosomes of common sea urchin Paracentrotus lividus. Gene. 2012;508:21–5.
Article CAS PubMed Google Scholar
Cloix C, Tutois S, Yukawa Y, Mathieu O, Cuvillier C, Espagnol M-C, et al. Analysis of the 5S RNA pool in Arabidopsis thaliana: RNAs are heterogeneous and only two of the genomic 5S loci produce mature 5S RNA. Genome Res. 2002;12:132–44.
Article CAS PubMed PubMed Central Google Scholar
Mohannath G, Pontvianne F, Pikaard CS. Selective nucleolus organizer inactivation in Arabidopsis is a chromosome position-effect phenomenon. Proc Natl Acad Sci U S A. 2016;113(47):13426–31.
Article CAS PubMed PubMed Central Google Scholar
Bateman AJ. Self-incompatibility systems in angiosperms: III. Cruciferae. Heredity. 1955;9:53–68.
Article Google Scholar
Tarutani Y, Shiba H, Iwano M, Kakizaki T, Suzuki G, Watanabe M, et al. Trans-acting small RNA determines dominance relationships in Brassica self-incompatibility. Nature. 2010;466:983–6.
Article CAS PubMed Google Scholar
Durand E, Méheust R, Soucaze M, Goubet PM, Gallina S, Poux C, et al. Dominance hierarchy arising from the evolution of a complex small RNA regulatory network. Science. 2014;346:1200–5.
Article CAS PubMed Google Scholar
Preuss SB, Costa-Nunes P, Tucker S, Pontes O, Lawrence RJ, Mosher R, et al. Multimegabase silencing in nucleolar dominance involves siRNA-directed DNA methylation and specific methylcytosine-binding proteins. Mol Cell. 2008;32:673–84.
Article CAS PubMed PubMed Central Google Scholar
Tucker S, Vitins A, Pikaard CS. Nucleolar dominance and ribosomal RNA gene silencing. Curr Opin Cell Biol. 2010;22:351–6.
Article CAS PubMed PubMed Central Google Scholar
Mayer C, Schmitz K-M, Li J, Grummt I, Santoro R. Intergenic transcripts regulate the epigenetic state of rRNA genes. Mol Cell. 2006;22:351–61.
Article CAS PubMed Google Scholar
Mayer C, Neubert M, Grummt I. The structure of NoRC-associated RNA is crucial for targeting the chromatin remodelling complex NoRC to the nucleolus. EMBO Rep. 2008;9:774–80.
Article CAS PubMed PubMed Central Google Scholar
Santoro R, Schmitz K-M, Sandoval J, Grummt I. Intergenic transcripts originating from a subclass of ribosomal DNA repeats silence ribosomal RNA genes in trans. EMBO Rep. 2010;11:52–8.
Article CAS PubMed Google Scholar
Law JA, Jacobsen SE. Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet. 2010;11:204–20.
Article CAS PubMed PubMed Central Google Scholar
Imprialou M, Kahles A, Steffen JG, Osborne EJ, Gan X, Lempe J, et al. Genomic rearrangements in Arabidopsis considered as quantitative traits. Genetics. 2017;205:1425–41.
Article PubMed PubMed Central Google Scholar
Clauw P, Coppens F, De Beuf K, Dhondt S, Van Daele T, Maleux K, et al. Leaf responses to mild drought stress in natural variants of Arabidopsis. Plant Physiol. 2015;167:800–16.
Article CAS PubMed PubMed Central Google Scholar
Kawakatsu T, Huang S-SC, Jupe F, Sasaki E, Schmitz RJ, Urich MA, et al. Epigenomic diversity in a global collection of Arabidopsis thaliana accessions. Cell. 2016;166:492–505.
Article CAS PubMed Google Scholar
Watson JM, Platzer A, Kazda A, Akimcheva S, Valuchova S, Nizhynska V, et al. Germline replications and somatic mutation accumulation are independent of vegetative life span in Arabidopsis. Proc Natl Acad Sci U S A. 2016;113(43):12226–31.
Article CAS PubMed PubMed Central Google Scholar
Xie Z, Johansen LK, Gustafson AM, Kasschau KD, Lellis AD, Zilberman D, et al. Genetic and functional diversification of small RNA pathways in plants. PLoS Biol. 2004;2, E104.
Article PubMed PubMed Central Google Scholar
Xie Z, Allen E, Wilken A, Carrington JC. DICER-LIKE 4 functions in trans-acting small interfering RNA biogenesis and vegetative phase change in Arabidopsis thaliana. Proc Natl Acad Sci U S A. 2005;102:12984–9.
Article CAS PubMed PubMed Central Google Scholar
Henderson IR, Zhang X, Lu C, Johnson L, Meyers BC, Green PJ, et al. Dissecting Arabidopsis thaliana DICER function in small RNA processing, gene silencing and DNA methylation patterning. Nat Genet. 2006;38:721–5.
Article CAS PubMed Google Scholar
Onodera Y, Haag JR, Ream T, Costa Nunes P, Pontes O, Pikaard CS. Plant nuclear RNA polymerase IV mediates siRNA and DNA methylation-dependent heterochromatin formation. Cell. 2005;120:613–22.
Article CAS PubMed Google Scholar
Bolger AM, Lohse M, Usadel B. Trimmomatic: a flexible trimmer for Illumina sequence data. Bioinformatics. 2014;30:2114–20.
Article CAS PubMed PubMed Central Google Scholar
Martin M. Cutadapt removes adapter sequences from high-throughput sequencing reads. EMBnet J. 2011;17:10–2.
Article Google Scholar
Li H, Durbin R. Fast and accurate short read alignment with Burrows–Wheeler transform. Bioinformatics. 2009;25:1754–60.
Article CAS PubMed PubMed Central Google Scholar
Li H. Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM [Internet]. arXiv [q-bio.GN]. 2013. http://arxiv.org/abs/1303.3997v2.
Li H, Handsaker B, Wysoker A, Fennell T, Ruan J, Homer N, et al. The Sequence Alignment/Map format and SAMtools. Bioinformatics. 2009;25:2078–9.
Article PubMed PubMed Central Google Scholar
Tarasov A, Vilella AJ, Cuppen E, Nijman IJ, Prins P. Sambamba: fast processing of NGS alignment formats. Bioinformatics. 2015;31:2032–4.
Article CAS PubMed PubMed Central Google Scholar
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
Article CAS PubMed PubMed Central Google Scholar
Van der Auwera GA, Carneiro MO, Hartl C, Poplin R, Del Angel G, Levy-Moonshine A, et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr Protoc Bioinformatics. 2013;11:11.10.1–11.10.33.
Google Scholar
Guo Y, Li J, Li C-I, Long J, Samuels DC, Shyr Y. The effect of strand bias in Illumina short-read sequencing data. BMC Genomics. 2012;13:666.
Article CAS PubMed PubMed Central Google Scholar
Mott R, Talbot CJ, Turri MG, Collins AC, Flint J. A method for fine mapping quantitative trait loci in outbred animal stocks. Proc Natl Acad Sci U S A. 2000;97:12649–54.
Article PubMed PubMed Central Google Scholar
Alonso-Blanco C, Peeters AJ, Koornneef M, Lister C, Dean C, van den Bosch N, et al. Development of an AFLP based linkage map of Ler, Col and Cvi Arabidopsis thaliana ecotypes and construction of a Ler/Cvi recombinant inbred line population. Plant J. 1998;14:259–71.
Article CAS PubMed Google Scholar
Langmead B, Salzberg SL. Fast gapped-read alignment with Bowtie 2. Nat Methods. 2012;9:357–9.
Article CAS PubMed PubMed Central Google Scholar
Kim D, Pertea G, Trapnell C, Pimentel H, Kelley R, Salzberg SL. TopHat2: accurate alignment of transcriptomes in the presence of insertions, deletions and gene fusions. Genome Biol. 2013;14:R36.
Article PubMed PubMed Central Google Scholar
Robinson JT, Thorvaldsdóttir H, Winckler W, Guttman M, Lander ES, Getz G, et al. Integrative genomics viewer. Nat Biotechnol. 2011;29:24–6.
Article CAS PubMed PubMed Central Google Scholar
Broman KW, Wu H, Sen S, Churchill GA. R/qtl: QTL mapping in experimental crosses. Bioinformatics. 2003;19:889–90.
Article CAS PubMed Google Scholar
Arends D, Prins P, Jansen RC, Broman KW. R/qtl: high-throughput multiple QTL mapping. Bioinformatics. 2010;26:2990–2.
Article CAS PubMed PubMed Central Google Scholar
Mandáková T, Lysak MA. Chromosome preparation for cytogenetic analyses in Arabidopsis. Current protocols in plant biology. Chichester: John Wiley & Sons, Inc.; 2016.
Google Scholar
Mandáková T, Lysak MA. Painting of Arabidopsis chromosomes with chromosome-specific BAC clones. Current protocols in plant biology. Chichester: John Wiley & Sons, Inc.; 2016.
Google Scholar

Download references

Acknowledgements

We sincerely thank Ortrun Mittelsten Scheid for helpful discussions during the course of this project, Mario Arteaga-Vazquez and Ilka Reichardt-Gomez for their valuable input in the preparation of this manuscript, J. Matthew Watson for early access to unpublished Col-0 DNA-seq data, and Zsuzsanna Mérai, Ümit Seren and Edward J. Osborne for experimental/informatic assistance.

Funding

This study was funded, in part, by European Research Council (ERC) grant 288962 (MAXMAP; to MN), National Institutes of Health (NIH) award P50 HG002790 (to MN and RMC), and Czech Science Foundation grant P501/12/G090 (to MAL and TM). RG was supported by National Institutes of Health Genetics Training Grant T32GM007464.

Availability of data and materials

DNA-seq and RNA-seq data supporting the conclusions of this article are available at the NCBI (https://www.ncbi.nlm.nih.gov/bioproject) under BioProject PRJNA380541 and at the GEO (https://www.ncbi.nlm.nih.gov/geo/) under series number GSE92568. Accession numbers for previously published data used herein are listed in Additional file 8.

Authors’ contributions

FAR and MN conceived and designed the study. FAR, TM, LMS, DLP, and VN performed the experiments. FAR performed most of the analyses. RG contributed to the analyses. JGS, RMC, SL, RM, and MAL contributed unpublished essential data or reagents. FAR and MN wrote the manuscript. All authors read and approved the final manuscript.

Competing interests

The authors declare that they have no competing interests.

Ethical approval

Not applicable.

Publisher’s Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Author information

Authors and Affiliations

Gregor Mendel Institute (GMI), Austrian Academy of Sciences, Vienna Biocenter (VBC), Dr. Bohr-Gasse 3, 1030, Vienna, Austria
Fernando A. Rabanal, Luz M. Soto-Jiménez, Stefan Lutzmayer, Viktoria Nizhynska & Magnus Nordborg
Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czech Republic
Terezie Mandáková & Martin A. Lysak
Department of Biology, University of Utah, Salt Lake City, UT, USA
Robert Greenhalgh, David L. Parrott & Richard M. Clark
Department of Natural Sciences, Colby-Sawyer College, New London, NH, USA
Joshua G. Steffen
Genetics Institute, University College London (UCL), Gower Street, London, WC1E 6BT, UK
Richard Mott
Center for Cell and Genome Science, University of Utah, Salt Lake City, UT, USA
Richard M. Clark

Authors

Fernando A. Rabanal
View author publications
You can also search for this author in PubMed Google Scholar
Terezie Mandáková
View author publications
You can also search for this author in PubMed Google Scholar
Luz M. Soto-Jiménez
View author publications
You can also search for this author in PubMed Google Scholar
Robert Greenhalgh
View author publications
You can also search for this author in PubMed Google Scholar
David L. Parrott
View author publications
You can also search for this author in PubMed Google Scholar
Stefan Lutzmayer
View author publications
You can also search for this author in PubMed Google Scholar
Joshua G. Steffen
View author publications
You can also search for this author in PubMed Google Scholar
Viktoria Nizhynska
View author publications
You can also search for this author in PubMed Google Scholar
Richard Mott
View author publications
You can also search for this author in PubMed Google Scholar
Martin A. Lysak
View author publications
You can also search for this author in PubMed Google Scholar
Richard M. Clark
View author publications
You can also search for this author in PubMed Google Scholar
Magnus Nordborg
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding authors

Correspondence to Fernando A. Rabanal or Magnus Nordborg.

Additional files

Additional file 1:

45S rRNA gene variants analysis for the 1138 genomes. Variable sites along the 45S rRNA gene present in at least 5% of the copies within an individual for 1138 accessions. Nomenclature: X[position].[reference].[alternative]. (CSV 39979 kb)

Additional file 2:

Table S1. rDNA cluster-specific markers in the 19 founders of the MAGIC population. Table S2. rDNA cluster-specific markers in the parental accessions of the F₂ population: Algutsrum and TDr-9. Table S3. rDNA cluster-specific markers in accession Cvi-0. (PDF 105 kb)

Additional file 3:

45S rRNA gene variants analysis for the MAGIC population and founders. Variable sites along the 45S rRNA genes at the DNA and RNA level for the founders of the MAGIC population, the MAGIC lines, and the F₁ crosses used in this study. For the MAGIC lines, genotypes at the top of chromosomes 2 (rDNA-2) and 4 (rDNA-4) inferred by the software “reconstruction” [73] and corrected rDNA cluster genotypes according to the variants carried by the rDNA clusters themselves. Type of sequencing-library source is also indicated. Nomenclature: X[position].[reference].[alternative]. (CSV 492 kb)

Additional file 4:

Nucleolar association of rDNA-2 and rDNA-4 in five accessions. (PDF 55 kb)

Additional file 5:

Figure S1. Different accessions express either rDNA-2 or rDNA-4, or both. Figure S2. The pattern of rRNA expression is consistent across replicate lines. Figure S3. Expression of Bur-0 rDNA-4 when used as a mother in an F₁ cross. Figure S4. Cvi-0 expresses rDNA-4. Figure S5. Col-0 rDNA-2 is not reactivated in mutants of the RdDM pathway. (PDF 991 kb)

Additional file 6:

45S rRNA gene variants analysis for the the F₂ population and parental accessions. Variable sites along the 45S rRNA genes at the DNA and RNA level for the parental accessions of the F₂ population, and the F₂ lines used in this study. Predicted genotypes at rDNA-2 and rDNA-4 for each line and type of sequencing-library source are also indicated. Nomenclature: X[position].[reference].[alternative]. (CSV 109 kb)

Additional file 7:

45S rRNA gene variants analysis for accession Cvi-0. Variable sites along the 45S rRNA genes at the DNA and RNA level for the accession Cvi-0. The predicted genotypes at rDNA-2 and rDNA-4 for 8 recombinant inbred lines (RILs; Cvi-0 x Ler-0) and type of sequencing-library source are also indicated. Nomenclature: X[position].[reference].[alternative]. (CSV 4 kb)

Additional file 8:

Data availability. (PDF 69.2 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver (http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

Reprints and permissions

About this article

Cite this article

Rabanal, F.A., Mandáková, T., Soto-Jiménez, L.M. et al. Epistatic and allelic interactions control expression of ribosomal RNA gene clusters in Arabidopsis thaliana . Genome Biol 18, 75 (2017). https://doi.org/10.1186/s13059-017-1209-z

Download citation

Received: 01 December 2016
Accepted: 06 April 2017
Published: 03 May 2017
DOI: https://doi.org/10.1186/s13059-017-1209-z

Epistatic and allelic interactions control expression of ribosomal RNA gene clusters in Arabidopsis thaliana

Abstract

Background

Results

Conclusions

Similar content being viewed by others

Background

Results

Sequence variation in rRNA genes within and between individuals

Accessions express either rDNA-2 or rDNA-4, or both

Dominance is a property of rDNA “alleles”

Genetic analysis of interactions between rDNA “alleles”

Discussion

Conclusions

Methods

Plant material and growth conditions

RNA isolation and library preparation

F2 mRNA-seq library preparation and sequencing

F1 and F2 total RNA-seq library preparation and sequencing

DNA isolation and library preparation

Identification of 45S rRNA gene variants at the DNA level

1138 natural accessions

MAGIC lines, F1 individuals, F2 population, and RILs

Annotation of rDNA cluster-specific variants

Founders of the MAGIC population

Parents of the F2 population

Accession Cvi-0

Expression of 45S rRNA gene variants

Statistical genetics in the F2 population

Genotyping by RNA-seq

Linkage mapping of rDNA cluster expression in the F2 population

Fluorescence in situ hybridization (FISH)

Reverse transcription polymerase chain reaction (RT-PCR)

References

Acknowledgements

Funding

Availability of data and materials

Authors’ contributions

Competing interests

Ethical approval

Publisher’s Note

Author information

Authors and Affiliations

Corresponding authors

Additional files

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation

F₂ mRNA-seq library preparation and sequencing

F₁ and F₂ total RNA-seq library preparation and sequencing

MAGIC lines, F₁ individuals, F₂ population, and RILs

Parents of the F₂ population

Statistical genetics in the F₂ population

Linkage mapping of rDNA cluster expression in the F₂ population