Abstract
RNA Directed DNA Methylation (RdDM) is a pathway that mediates de novo DNA methylation, an evolutionary conserved chemical modification of cytosine bases, which exists in living organisms and utilizes small interfering RNA. Plants utilize DNA methylation for transposable element (TE) repression, regulation of gene expression and developmental regulation. TE activity strongly influences genome size and evolution, therefore making DNA methylation a key component in understanding divergence in genome evolution among seed plants. Multiple proteins that have extensively been studied in model plant Arabidopsis thaliana catalyze RNA dependent DNA Methylation pathway along with small interfering RNA. Several developmental functions have also been attributed to DNA methylation. This review will highlight aspects of RdDM pathway dynamics, evolution and functions in seed plants with focus on recent findings on conserved and non-conserved attributes between angiosperms and gymnosperms to potentially explain how methylation has impacted variations in evolutionary and developmental complexity among them and advance current understanding of this crucial epigenetic pathway.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
Introduction
DNA methylation refers to addition of a methyl group (CH3) to the 5′ position of a cytosine base resulting in changes to activity of a DNA segment without changing its sequence (Tammen et al. 2012). Methylation occurs predominantly in TE and repeat element regions and functions to repress their activity. DNA Methylation evolved early among land plants and differs among various lineages (Leitch 2012). It has been hypothesized to have evolved primarily for control of repetitive elements within genomes, with additional functions such as regulation of gene expression and developmental functions evolving later (Bräutigam and Cronk 2018). Genomes of nearly all organisms contain repetitive elements and transposons which are mobile elements that move via cut and paste or copy and paste mechanisms (Fedoroff 2012). Consequently, plant species whose genomes have high proportions of TEs such as Zea mays and conifer Picea abies, (84.2%), (70%) respectively are very densely methylated (Ausin et al. 2016) (Fig. 1).
Repetitive element recombination activities such as insertion and deletion strongly impact the structure and evolution of genomes, as TEs accumulate and cause mutations throughout genomes (Ito 2011; Baidouri and Panaud 2013) resulting in transposon mediated genomic changes, attributable to the C-value paradox (Lee and Kim 2014). Angiosperm and gymnosperm genomes differ in size and structure as well as in levels and contexts of methylation. For instance, angiosperm genomes vary 2400-fold between C-values of lean A. thaliana and giant fritillaria with most species tending towards smaller C-values. On the contrary, gymnosperm genomes lack wide variation with only 16-fold C-value variation but lean towards larger sizes (Leitch 2012; Pellicer et al. 2018).
Apart from epigenetic silencing via methylation, plants also utilize illegitimate and unequal homologous recombination to excise TEs and counteract their duplication and recombination activities, yielding truncated and solo LTRs (Ma et al. 2004). Unequal intrastrand recombination between homologous LTR elements results in net loss of DNA by removing a portion of individual LTR and leaving a solo LTR (Bennetzen 1997; Devos et al. 2002). On the other hand, illegitimate recombination involves intra-element recombination and gradually eliminates LTRs by accumulation of small deletions in the genome and net DNA loss. The ratio of solo to intact LTRs would reflect the rate of TE removal. Genome size is therefore a factor of efficacy of TE removal mechanisms and rate of TE accumulation which varies among species (Cossu et al. 2017). For example, fritillaria have the largest angiosperm genomes ranging 30.15–85.38 Gb, resulting from accumulation of TEs due to lack of proper removal processes (Kelly et al. 2015). Interestingly, fritillaria was also found to be lacking SAWADEE HOMEODOMAIN HOMOLOGUE 1 (SHH1) and KOW DOMAIN CONTAINING TRANSCRIPTION FACTOR 1 (KTF1) genes involved in Polymerase IV&V functioning respectively in the RdDM pathway (Ma et al. 2015) similar to gymnosperms possibly a characteristic of larger genomes. Likewise, Z. mays genome which is relatively large (2.3 Gb) compared to its relatives has undergone TE amplification over the past three million years (Schnable et al. 2014) and has a slow rate of TE excision with a solo to intact LTR ratio of 0.2:1. On the contrary, Oryza sativa genome has remained small (0.5 Gb) in spite of accumulation of TEs over a similar period to Z. mays potentially due to effective removal of TEs with a solo to intact LTR ratio of 1.5:1 (Ma et al. 2004). Notably, among gymnosperms it was previously thought that their large genomes are due to poor TE removal mechanisms similar to observations in conifer P. abies which displays a large genome (19.6 Gb) as a result of a slow accumulation of TEs over millions of years and poor removal mechanisms with a 0.16:1 solo to intact LTR ratio (Nystedt et al. 2013). This has been disputed in (Wan et al. 2018) where measurement of unequal and illegitimate recombination in Gnetum montanum (4.1 Gb) revealed a significantly high solo to intact LTR ratio; 2.35:1 indicating more efficient TE removal and no recent TE activity, a trend that is consistent in Amborella trichopoda another ancient genome. These findings dispute the generalization of similarities among large gymnosperm genomes and point to conifer potentially being unique. However, an anticorrelation is theorized to exist between recombination based elimination of TEs and heterochromatization driven by epigenetic mechanisms as observed in P. abies and fritillaria. (Cossu et al. 2017) All together, these examples point to inefficiency of TE removal mechanisms as an important contributing factor to expansion in genome size among seed plants.
Methylation occurs in different cytosine contexts symmetrical CG and CHG (H = A, T or C) and asymmetrical CHH (Law and Jacobsen 2010); established by DOMAINS REARRANGED METHYLTRANSFERASE 2 (RDM2). METHYLTRANSFERASE 1 (MET1) maintains CG methylation while CHG methylation is maintained by a plant specific CHROMOMETHYLASE 3 (CMT3) (Zhang et al. 2018). Non-symmetrical CHH methylation is maintained in part by CHROMOMETHYLASE 2 (CMT2) and via constant de-novo methylation by DRM2 (Shen et al. 2014) (Fig. 2). CG, CHG and CHH methylation vary among seed plants.
This review will compare and contrast recent findings and current understanding of aspects of evolution, occurrence, regulation, and functions of methylation in angiosperms and gymnosperms and show their contribution to their divergent genomes and structures. It must be noted that angiosperms have several well-characterized methylomes unlike gymnosperms; which due to their large genome sizes comprising largely of TEs and repeats are less studied. The only gymnosperm high resolution single base resolution methylome available is that of P. abies (Ausin et al. 2016). We propose that continued divergence of methylation patterns has resulted in current different developmental and genomic complexity existing among seed plant species. To illustrate this, the review will cover RdDM dynamics and currently known functions in varying angiosperm species in comparison with the available gymnosperm information which is mostly limited to conifer P. abies and predict how these aspects of methylation have influenced seed plant genomes.
Establishment of methylation
Canonical pathway
Using ChIP-seq, genes involved in the RdDM pathway have been identified in model plant A. thaliana (Zhang et al. 2006) as well as model crop plants such as O. sativa and Z. mays. RdDM occurs in a canonical and non-canonical format each distinguished by the proteins involved, with their occurrence similarly varying among species (Matzke et al. 2015). First siRNA are generated in a process directed by RNA Polymerase IV (Pol IV), which is recruited to its target site by interacting protein (SHH1) (Law and Jacobsen 2010) which binds to histone H3K9me and unmethylated H3K4 and transcribes single stranded RNA at the target (Matzke and Mosher 2014). RDR2 associates with Pol IV and copies the single stranded RNA into double stranded RNA. DICER 3 (DCL3) then cleaves the double stranded RNA into 24 nucleotide siRNA which are made stable by HUA ENHANCER 1 (HEN1) (Haag et al. 2012; Matzke and Mosher 2014). SUVH2 and SUVH9 are SET and RING-ASSOCIATED (SRA) domain proteins and members of the SU (VAR)3-9 histone methyltransferase family involved in recruiting Pol V via their SRA domains which bind methylated DNA (Johnson et al. 2014).
Upon recruitment, Pol V associates with chromatin facilitated by a chromatin remodeling (DDR complex) which comprises of DEFECTIVE IN RNA DIRECTED DNA METHYLATION 1(DRD1), DEFECTIVE IN MERISTEM SILENCING 3 (DMS3) and RNA DIRECTED DNA METHYLATION 1 (RDM1) (Matzke and Mosher 2014) Pol V recruits AGO4 via its largest subunit NRPE1 that contains an AGO-hook. The siRNA bound to AGO4 base pairs with Pol V transcript recruiting DOMAINS REARRANGED METHYLTRANSFERASE 2 (DRM2) to catalyze de-novo methylation (Matzke and Mosher 2014; Zhang et al. 2018). SWI2/SNF2 complex is an ATP-dependent chromatin remodeler complex which interacts with INVOLVED IN DE-NOVO METHTLATION (IDN2) and IDN2 paralogues (IDP1 & IDP2) IDN2-IDP complex and bind dsRNA to facilitate RdDM (Ausin et al. 2012). At this stage, removal of active histone marks i.e. histone de-acetylation, de-methylation and de-ubiquitination occurs followed by deposition of repressive histone modification H3K9me by SUVH4, SUVH5 and SUVH6 at some RdDM targets resulting in stable heterochromatin (Fig. 3) (Matzke and Mosher 2014; Gentry and Hennig 2014).
Previously, Pol V methylation was hypothesized to have evolved to facilitate the diploidization of polyploids and that it’s activity was stimulated by genome doubling shock so as to target duplicated genes and transposons (Craig et al. 2014). This hypothesis is likely to be supported by the occurrence of NRPE1 the largest subunit of Pol V in conifer P. abies and other gymnosperms but warrants further comment.
Non-canonical pathway
The canonical pathway has been extensively studied in plants. However, several pathways of RdDM have emerged different from the canonical RdDM pathway which was discovered in relation to silencing virus infections. These pathways utilize DCL2, RDR2, AGO4, Pol IV & Pol V and 21–22 nt siRNA (Marí-ordóñez et al. 2013). Gymnosperms P. abies, Ginkgo biloba and Welwitschia mirabilis predominantly display 21 nt siRNA (Ma et al. 2015) making this a key difference in RdDM between angiosperms and gymnosperms. Factors involved in non-canonical methylation are typical in Transcriptional and Post-transcriptional gene silencing (TGS & PTGS).
Previous studies hypothesized that the canonical RdDM pathway is unique to angiosperms (Ma et al. 2015) however, this hypothesis has been disproved (Ausin et al. 2016) due to findings that several key genes involved in both pathways are also conserved in gymnosperms. Non-canonical RdDM mechanisms have been implicated in initiation of TE silencing via virus induced gene silencing (Bond and Baulcombe 2014) since Pol IV and Pol V are recruited by preexisting heterochromatin marks. It is noteworthy that different mechanisms have varying entry points into the non-canonical pathway and play minor roles compared to the canonical RdDM. Future studies should focus on understanding how plants transition between the canonical and non-canonical pathways and whether the two have different functions (Fig. 3).
Maintenance of methylation
CG methylation is present both in heterochromatin regions as well as in genic regions, contrary to CHG and CHH methylation, which are predominantly present in heterochromatic regions. Different pathways regulate maintenance of each sequence context. CG methylation is maintained by (MET1), maintenance of CHG methylation involves crosstalk between CHROMOMETHYLASE 3 (CMT3) and H3K9 histone methyltransferase for demethylation (Law and Jacobsen 2010). RdDM has also been found to contribute to restoration of symmetrical methylation as observed in A. thaliana centromeric repeats, in a process mediated by siRNA (Teixeira et al. 2009).
On the other hand, asymmetrical CHH methylation is maintained by continuous de-novo methylation catalyzed by RDR2 and siRNA via continuous RdDM and by CMT2 to an extent (Law and Jacobsen 2010; Stroud et al. 2013). DRM2 target sites are evolutionarily young, short edges of TEs whereas CMT2 targets long TEs with high H3K9 methylation. DECREASED DNA METYLATION 1 (DDM1) is a nucleosome remodeler preferentially involved in methylation of heterochromatic sequences by counteracting the linker histone HI and CMT2 in an RdDM independent way (Zemach et al. 2013). However, no specific differences have been observed between the two in methylation context percentage.
Gene body methylation (GbM) and Non-CG methylation
Various species carry gene body methylation (GbM) in CG context and much less in non-CG context apart from gymnosperm P. abies (Ausin et al. 2016) which was found to be higher than other plant species such as A. thaliana and O. sativa in the CHG context in the gene body, potentially due to its large genome size with higher proportion of repetitive elements. GbM is located in long exons and commonly occurs in constitutively expressed genes i.e. house-keeping genes and less frequently in genes of variable expression in a conserved manner across land plants (Zilberman 2017). It is interesting to note that GbM has been conserved in eukaryotes for millions of years and varies significantly across land plant species (Takuno et al. 2016). The debate on gene body methylation functionality in shaping genomes is ongoing with contradicting studies; some hypothesizing that GbM occurs as an inconsequential result of TE methylation pathways and genes and plays no role in transcription or chromatin modification (Bewick et al. 2017). This hypothesis has however been disputed in O. sativa and A. thaliana where imprinted methylation occurs in genes of maternal sex cells involved in control of transcription factors for regulation of plant development (Rodrigues and Zilberman 2015) implicating GbM in developmental processes. Among angiosperms, there is variation in GbM among various lineages (Niederhuth et al. 2016; Takuno et al. 2016) e.g. poceae have enrichment of CHH in genic regions. Such differences may be further evidence for evolutionary variations and history observed among species.
Non-CG methylation is also complex and its functions not well characterized. It requires CHROMOMETHYLASES (CMT2&3) which are plant specific methyltransferases that evolved prior to the diversification of land plants. However, CMT2 is hypothesized to have evolved with the angiosperm specific duplication (≥ 236 MYA) (Bewick et al. 2017) consistent with studies that have found CMT2 absent in gymnosperm species (Ma et al. 2015; Ausin et al. 2016).
It is noteworthy that non-CG methylation (CHG & CHH) setting and maintenance overlap and incorporate histone modifications (Stroud et al. 2013). CMTs contain 3 domains BAH, CHROMO and C-5 domains; the CHROMO domain of CMT3 binds to H3K9me while the SRA domain of histone methyltransferase SUVH binds to methylated DNA creating a self-reinforcing loop that maintains non-CG methylation. Both CMT2 and CMT3 are involved in CHG and CHH methylation though CMT3 prefers CHG sites (Fig. 2).
DNA demethylation
DNA demethylation occurs passively after replication due to lack of a methyl donor resulting from inactivation or reduction of methylation enzymes (Li et al. 2018) or actively via demethylase enzymes that remove methylation marks (Zhang et al. 2018). Passive demethylation occurs during formation of male gametophytes and endosperm where RdDM factors were found to be reduced (Keith et al. 2009). In contrast, active demethylation utilizes DNA glycosylases DEMETER (DME) which is more specific to maternal central cell. REPRESSOR OF SILENCING 1 (ROS1) and DEMETER LIKE 2&3 (DMT2&3) act on both somatic and non-somatic tissues (Ooi and Bestor 2008) to remove methylated cytosines. Finally this is followed by a base excision repair (BER) pathway mechanism (Li et al. 2018).
Demethylases have specific target loci, ROS1 targets are established by a histone acetyltransferase INCREASED DNA DEMETHYLATION 1 (IDM1) which binds methylated DNA and acetylates histone 3 (H3) lacking H3Kme2 and H3Kme3 (Zhang et al. 2018) the mechanism is however not fully understood. ROS1 targets tend to be protein coding genes with close proximity to highly methylated TEs and is therefore thought to function in preventing spread of methylated cytosines to nearby genes (Tang et al. 2016).
Studies into regulation of demethylases have revealed a cross talk between active demethylation and RdDM. It has been hypothesized that some RdDM targets are differently regulated by ROS1 demethylase to maintain a methylation homeostasis (Lei et al. 2015). Demethylation counteracts with RdDM to prevent hyper methylation at the target loci (Gong et al. 2002; Lei et al. 2015). Further analysis in A. thaliana revealed that ROS1 expression was greatly reduced in mutants of RdDM factors such as NRPD1 (Tang et al. 2016) providing further proof that the two mechanisms are cross-linked. Further studies are needed to clarify the genome-wide counteraction of RdDM and active demethylation since DNA demethylation plays a role in development during embryogenesis in angiosperms and gymnosperm as seen in P. abies.
Role of RdDM in transposon silencing and regulation of gene expression
Transposon silencing and regulation is the main known function of DNA methylation to date. It is an ancient function that has been identified across most land plant lineages with different methylation genes presenting varying origin times and gene structures; with genes being gained and lost within seed plant lineages particularly in angiosperms. However, RdDM gene domains are highly conserved among species pointing to the importance of their functionality (Pei et al. 2019). Transposable elements exist in multiple families and are diverse among species. They are ubiquitous in nature and may spread rapidly within genomes resulting in null and deleterious mutations within genomes. Such mutations, present a threat to genome stability and integrity which may cause plant structural and developmental defects (Deniz et al. 2019). To prevent this, DNA methylation mechanisms are triggered via various mechanisms, one of which is repeat induced mutations (Freitag and Selker 2005).
TE silencing by methylation is evidenced by the enrichment of methylation along repeat regions coupled with histone modifications and heterochromatization. This hypermethylation maintains integrity of the genomes by repressing TEs (Slotkin and Martienssen 2007). Studies conducted in A. thaliana and Z. mays indicate that mutants in RdDM factors display TE de-repression and increased transposition of TE families such as Copia long terminal repeat retrotransposons (LTR-RT) compared to wild types (Gaut and Hollister 2009). In angiosperms, reactivated TEs were re-silenced via RdDM while no re-activated transposons were found in gymnosperms. It is likely that gymnosperms utilize alternative transposon silencing mechanisms (Ma et al. 2015).
We conducted a Pearson R correlation test between repeat element and methylation percentage in different contexts using seven previously studied sample seed plant species (Fig. 4), including three gymnosperms; G. biloba, P. taeda, G. montanum and four angiosperms; A. thaliana, O. sativa, Z. mays, A. trichopoda. This revealed a strong positive correlation in CG and CHG contexts (r = 0.914 and r = 0.759) respectively while CHH methylation displayed a correlation coefficient (r = 0.00003) consistent with similar findings that global and gene body mCG and mCHG positively correlate with genome size (Ausin et al. 2016; Takuno et al. 2016). These findings suggest that symmetrical methylation CG and CHG also positively correlate to repeat content. Overall, genomes with high levels of TEs such as Z. mays (84%) and P. abies (70%) are densely methylated as proportion of TEs correlates positively to levels of methylation.
Comparably, DNA methylation regulates gene expression in a genome size dependent manner. Angiosperms such as Z. mays with larger genomes than A. thaliana and higher TE content undergo spreading of silencing to genes adjacent to TEs, in particular promoter regions of genes (Craig et al. 2014). Methylation in these species therefore plays a role in regulating gene expression and mutations may result in lethal abnormalities.
Gene regulation occurs through repression of transcription by preventing the binding of transcription factors or promoting repressive histone modifications such as H3K9me (Domcke et al. 2015). Alternatively, methylation activates transcription i.e. methylation at ROS1 promoter activates transcription. This function was proposed to have likely evolved secondary to TE silencing among eukaryotes and was instrumental in shaping their functioning (Bräutigam and Cronk 2018). Transcription regulation results in differential methylation among tissues and impacts physiological processes for plant growth and development during their life cycle (Zhong et al. 2013) DNA methylation gene regulation has been hypothesized to play several important roles in different tissue and cell types during developmental processes. As a whole, DNA methylation functioning is highly dependent on TE and repeat content and observing such dynamics in more gymnosperms which have even larger genomes and repeat contents will further clarify this.
RdDM and developmental regulation
Embryogenesis and pattern formation
Flowering plants utilize double fertilization in the formation of endosperm and embryo (Raghavan 2003). During this process, endosperms in O. sativa and A. thaliana display global hypomethylation compared to embryos as a result of (DME) demethylation, occurring in the central cell of the female gamete and the vegetative cell of the male gamete (Park et al. 2016). The demethylated transposons result in production of siRNA which travel from the vegetative cell to the sex cell and reinforce RdDM to enhance transposon silencing after fertilization (Sroufe et al. 2012) (Fig. 5a) Interestingly, CHH methylation is upregulated during seed development and reduces later during germination via epigenetic reprogramming. A similar mechanism exists in conifer P. abies where methylation reprogramming was observed in culture cells (Ausin et al. 2016), suggesting that epigenetic reprogramming is a conserved phenomenon in both gymnosperms and angiosperms.
In addition, plants differ from animals as they do not define their germ cells during earlier developmental stages and therefore utilize already differentiated regions of plant growth containing stem cells referred to as meristems occurring at the roots and shoots; Root Apical Meristem and Shoot Apical Meristem. Meristems are progenitors for descendant cells and organs and are particularly vulnerable to deleterious somatic mutations that result from TE recombination activities. If inherited, such mutations would affect the integrity of progeny genomes (Gutzat et al. 2018). To protect genome stability and integrity, plants utilize RdDM to reinforce TE silencing in the stem cells.
In A. thaliana, both root and shoot meristems display similar patterns of upregulated RdDM factors and CHH hypermethylation (Fig. 5b). A comparison between root meristem and shoot apical meristem indicated that columella cells were the highest methylated cell type recorded in A. thaliana (Kawakatsu et al. 2016) while shoot apical meristem (SAM) during early vegetative growth, utilized RdDM to reinforce silencing of TEs via up-regulation of RdDM factors (Baubec et al. 2014). Besides, meristem hypermethylation has been suggested to likely operate in a similar way as companion cells i.e. vegetative cell of male and central cell of female for effective TE silencing to ensure proper tissue differentiation (Kawakatsu et al. 2016).
It is likely that meristem hypermethylation is conserved in gymnosperms as observed in Norway spruce conifer P. abies where culture cells displayed higher levels of CHH methylation compared to needle and flower buds (Ausin et al. 2016). Conservation of these mechanisms illustrates the importance of methylation in meristem functioning particularly for the crucial role of preventing activation of TEs to ensure proper inheritance of epigenetic marks to somatic tissue as well as progeny. Taken together these findings suggest that CHH methylation plays a key role in TE silencing during pattern formation in cells that undergo rapid differentiation and may also influence maintenance of stemness in stem cells. This however warrants further comment.
Development of reproductive structures
Some plants flower after extended cold periods to escape harsh winters in an epigenetically controlled phenomenon referred to as vernalization. This occurs via downregulation of protein FLC a MADS box transcriptional regulator which represses flowering (Bastow et al. 2004). In Arabidopsis H3 Histones within the FLC gene are methylated to repress its function and allow the transition from vegetative to reproductive state (Fig. 6a). This is an illustration of facultative epigenetic responses that allow plants to respond to environmental cues to determine developmental processes (Bräutigam and Cronk 2018). Furthermore, during fruit ripening in Solanum lycopersicum among other species, DNA demethylases (DML & DME) are upregulated leading to gradual demethylation at loci of ripening genes such as CNR (Zhang et al. 2018) which encodes a transcription factor CNR (colorless non-ripening). Mutants of cnr gene display hypermethylation at the promoter preventing binding of RIN (ripening inhibitor) (Gao et al. 2019) transcription factor and lowered gene expression (Fig. 6b). More work needs to be done to uncover additional RdDM functions in gymnosperms.
Response to environmental factors
Plants display variations in methylation patterns upon infection by pathogens as a result of mechanisms aimed at altering susceptibility to pathogens (Yu et al. 2013). The most common example of this occurs during defense against bacterial and viral infections where DNA demethylation occurs as a part of plant induced immune response. Several plants including Nicotiana tabacum, O. sativa A. thaliana and Cucumis sativus have displayed an increased level of ribosomal RNA (rRNA) transcription accompanied by hypomethylation in the host plant (Yu et al. 2013; Castellano et al. 2016). Demethylation then results in recruitment of trans-activator proteins onto promoters that contain pathogen responsive elements, which regulate expression of defense genes.
Similar to biotic stress, several crop plants as well as the model A. thaliana have been analyzed for potential methylation roles in conditions of high ultraviolet rays, nutrient deficiency, heat (Hashida et al. 2006), cold, drought, among other abiotic stresses and illustrated variation in global or specific locus methylation (Bucher et al. 2011). These variations occurred possibly for transcriptional regulation of abiotic stress genes (Narsai et al. 2017; Li et al. 2017). In some instances these changes in the epigenome are stably inherited to the next generation (Thiebaut et al. 2019; Martienssen and Colot 2001). DNA methylation epialleles mediated by repeat sequences that cause phenotypic alterations in crop plants relay heritable changes to consecutive generations in a process thought to result from failure in epigenetic reprogramming (Fig. 6c). Studying more gymnosperm methylomes will reveal how they utilize RdDM for biotic and abiotic stress response.
Conclusion and future outlook
DNA methylation has been shown to be a key player in maintaining genome stability and integrity among eukaryotic organisms and contributes to the diversity in genome and developmental characteristics observed among seed plant species. This review shows that methylation levels in the CG and CHG contexts positively correlate with genome size as well as repeat levels consistent with (Ausin et al. 2016). We also show that meristem functioning particularly relies on RdDM to reinforce TE silencing during differentiation in both gymnosperms and angiosperms.
Research so far has managed to illuminate majority of the key genes involved in the RdDM pathway and their roles in various stages in model plant A. thaliana as well as major crop plants such as Z. mays and O. sativa. Moreover, the mechanisms for maintenance and removal of methylation marks have been studied extensively. More recently, studies have revealed function of methylation in gene regulation and developmental processes. However, these revelations have also raised multiple unanswered questions such as the mechanisms by which polymerases (POL IV & POL V) and demethylases (ROS1) are recruited to their target loci. Likewise, mechanisms of non-CG methylation maintenance and establishment warrant further comment.
Further, the functions of gene body methylation and non-CG methylation remain largely unknown despite them displaying different levels in various tissue types and species (Hisataka et al. 2012; Kawakatsu et al. 2016). Notably, several growth and developmental, stress response (Hashida et al. 2006; Bucher et al. 2011), and molecular genome regulation roles that are regulated by DNA methylation have been discovered. Future research will no doubt uncover additional roles for the pathway.
More studies need to focus on expanding to sequencing methylomes of less studied seed plant species with larger, more complex genomes as well as plants from more clades of land plants for comparison with currently available information. Such research could reveal crucial details about variations in genome evolution dynamics and more DNA methylation functions particularly in relation to seed plants (Fig. 7).
Author contributions statement
WMR wrote the initial draft and collected the references. TW restructured the manuscript and increased the scope. QFW contributed to subsequent revisions. All authors read and approved the manuscript.
Abbreviations
- BAH domain:
-
Bromo Adjacent Homology domain found in proteins to facilitate protein–protein interactions and plays a role in transcriptional regulation and gene silencing
- CHROMO domain:
-
Chromatin Organization Modifier is a structural protein domain utilized in chromatin manipulation and remodeling. They are also involved in gene regulation
- C5 domain:
-
Cytosine methyltransferase catalyzes the transfer of a methyl group and is dependent on SAM S-adenosyl methionine as the methyl donor
- Central cell:
-
A binucleate cell that comprises the female gametophyte and is directly involved in double fertilization in flowering plants to give rise to the endosperm and embryo
- C-value paradox:
-
Phenomenon in which organisms of similar complexity differ in nuclear DNA content in a haploid genome (1c value)
- ChIP seq:
-
Chromatin Immunoprecipitation Sequencing used to study the interaction of proteins with DNA. It utilizes chromatin immunoprecipitation and DNA sequencing
- Epialleles:
-
Genes that differ in their extent of methylation
- Histone de-acetylation:
-
Removal of acetyl groups from a histone to allow histone proteins to wrap around DNA more tightly and prevent transcription and gene expression
- Histone de-ubiquitination:
-
Removal of ubiquitin from the core histone it modifies for transcriptional regulation and chromatin modification
- Illegitimate recombination:
-
A mechanism that gradually eliminates LTRs, resulting in a net loss of DNA
- KTF1:
-
KOW DOMAIN CONTAINING TRANSCRIPTION FACTOR 1. RdDM protein involved in Polymerase V transcription for association with ARGANAUTE
- LTR-RT:
-
Long Terminal Repeat Retrotransposon are identical DNA sequences that repeat multiple times at either end of retrotransposons and are formed by reverse transcription of RNA
- MADS Box:
-
A conserved sequence motif in genes that binds DNA sequences. It often is a transcription factor involved in plant development processes
- Meristem:
-
Plant tissues that contain undifferentiated cells that undergo rapid differentiation giving rise to plant organs. They are found at the growing tips of roots and shoots
- Pearson R correlation:
-
Statistical analysis to measure the linear correlation between two variables indicated by a coefficient (r) which ranges from − 1 to 1
- SHH1:
-
SAWADEE HOMEODOMAIN HOMOLOGUE 1. Involved in recruitment of Polymerase IV by binding to methylated H3K9
- Transcriptional and post transcriptional gene silencing:
-
Mechanisms to repress transcription of genes or mRNA degradation after transcription, resulting in their silencing through DNA methylation
- Trans-activators:
-
Protein molecules that trigger activation of a gene by another molecule. It commonly occurs following infection by a virus
- Unequal recombination:
-
A type or genetic recombination by which transposable elements are excised from the genome. It leads to the elimination of two long terminal repeats and the internal region of the element leaving a Solo long terminal repeat
- Vegetative cell:
-
A component of the pollen grain that initiates formation of the pollen tube which carries the sperm cells to the ovule for fertilization
- Vernalization:
-
A mechanism in which plants use prolonged cold periods to initialize flowering allowing them to adapt to seasonal changes during development
References
Ausin I et al (2012) INVOLVED IN DE NOVO 2-containing complex involved in RNA-directed DNA methylation in Arabidopsis. Proc Natl Acad Sci 109(22):2–9. https://doi.org/10.1073/pnas.1206638109
Ausin I, Feng S, Chaowei Yu, Liu W, Kuo HY, Jacobsen EL, Zhai J, Gallego-Bartolome J, Wang L, Egertsdotter U, Street NR, Jacobsen SE, Wang H (2016) DNA methylome of the 20-Gigabase Norway spruce genome. Proc Natl Acad Sci 113(50):E8106–E8113. https://doi.org/10.1073/pnas.1618019113
Bastow R, Mylne JS, Lister C, Lippman Z (2004) Vernalization requires epigenetic silencing of FLC by histone methylation. Nature 427(January):25–28. https://doi.org/10.1038/nature02269
Baubec T, Finke A, Scheid OM, Pecinka A (2014) Meristem-specific expression of epigenetic regulators safeguards transposon silencing in Arabidopsis. EMBO Rep 15(4):446–452. https://doi.org/10.1002/embr.201337915
Bennetzen JL (1997) Do plants have a one-way ticket to genomic obesity? Plant Cell 9(September):1509–1514. https://doi.org/10.1105/tpc.9.9.1509
Bewick AJ et al (2017) The evolution of CHROMOMETHYLASES and gene body DNA methylation in plants. Genome Biol 18(65):1–13. https://doi.org/10.1186/s13059-017-1195-1
Bond DM, Baulcombe DC (2014) Epigenetic transitions leading to heritable, RNA-mediated de novo silencing in Arabidopsisthaliana. Proc Natl Acad Sci 2014(13):18–23. https://doi.org/10.1073/pnas.1413053112
Bräutigam K, Quentin C (2018) DNA Methylation and the evolution of developmental complexity in plants. Front Plant Sci 9(October):1–14. https://doi.org/10.3389/fpls.2018.01447
Bucher E et al (2011) An SiRNA pathway prevents transgenerational retrotransposition in plants subjected to stress. Nature 472:5–10. https://doi.org/10.1038/nature09861
Castellano M, Martinez G, Marques MC, Moreno-romero J (2016) Changes in the DNA methylation pattern of the host male gametophyte of viroid-infected cucumber plants. J Exp Bot 67(19):5857–5868. https://doi.org/10.1093/jxb/erw353
Cossu RM et al (2017) LTR retrotransposons show low levels of unequal recombination and high rates of intraelement gene conversion in large plant genomes. Genome Biol Evol 9(12):3449–3462
Deniz Ö, Frost JM, Branco MR (2019) Regulation of transposable elements by DNA modifications. Nat Rev Genet 20:417–431. https://doi.org/10.1038/s41576-019-0106-6
Devos KM, Brown JKM, Bennetzen JL (2002) Genome size reduction through illegitimate recombination counteracts genome expansion in Arabidopsis. Genome Res 12(7):1075–1079. https://doi.org/10.1101/gr.132102
Domcke S et al (2015) Transcription factors determines binding of NRF1. Nature 528:575–579. https://doi.org/10.1038/nature16462
El Baidouri M, Panaud O (2013) Comparative genomic paleontology across plant kingdom reveals the dynamics of TE-driven genome evolution. Genome Biol Evol 5(5):954–965. https://doi.org/10.1093/gbe/evt025
Fedoroff NV (2012) Transposable elements, epigenetics, and genome evolution. Science 338(November):758–767
Freitag M, Selker EU (2005) Controlling DNA methylation: many roads to one modification. Curr Opin Genet Dev 15(2):191–199. https://doi.org/10.1016/j.gde.2005.02.003
Gao Y et al (2019) Diversity and redundancy of the ripening regulatory networks revealed by the FruitENCODE and the new CRISPR/Cas9 CNR and NOR mutants. Horticult Res 6(39):10–39. https://doi.org/10.1038/s41438-019-0122-x
Gaut BS, Hollister JD (2009) Epigenetic silencing of transposable elements: a trade-off between reduced transposition and deleterious effects on neighboring gene expression. Genome Res 19(8):1419–28. https://genome.cshlp.org/content/19/8/1419.abstract.
Gentry M, Hennig L (2014) Remodelling chromatin to shape development of plants. Exp Cell Res 321(1):40–46. https://doi.org/10.1016/j.yexcr.2013.11.010
Gong Z et al (2002) ROS1, a repressor of transcriptional gene silencing in Arabidopsis, encodes a DNA glycosylase/lyase. Cell 111(6):803–814. https://doi.org/10.1016/S0092-8674(02)01133-9
Gutzat R et al (2018) Stage-specific transcriptomes and DNA methylomes indicate an early and transient loss of transposon control in arabidopsis shoot stem cells. bioRxiv. https://doi.org/10.1101/430447
Haag JR et al (2012) Short article in vitro transcription activities of Pol IV, Pol V, and RDR2 reveal coupling of Pol IV and RDR2 for DsRNA synthesis in plant RNA silencing. Mol Cell 48(5):811–818. https://doi.org/10.1016/j.molcel.2012.09.027
Hashida S-N et al (2006) The temperature-dependent change in methylation of the antirrhinum transposon Tam3 is controlled by the activity of its transposase. Plant Cell 18(January):104–118. https://doi.org/10.1105/tpc.105.037655
Ito H (2011) Small RNAs and transposon silencing in plants. Dev Growth Differ 54(1):100–107. https://doi.org/10.1111/j.1440-169X.2011.01309.x
Johnson LM et al (2014) Polymerase V occupancy to DNA methylation. Nature 507(7490):124–128. https://doi.org/10.1038/nature12931
Kawakatsu T et al (2016) Unique cell-type-specific patterns of DNA methylation in the root meristem. Nat Plants. https://doi.org/10.1038/nplants.2016.58
Kelly LJ et al (2015) Analysis of the Giant Genomes of Fritillaria (Liliaceae) indicates that a lack of DNA removal characterizes extreme expansions in genome size. New Phytol 208(2):596–607. https://doi.org/10.1111/nph.13471
Law JA, Jacobsen SE (2010) Establishing, maintaining and modifying DNA methylation patterns in plants and animals. Nat Rev Genet 11(3):204–220. https://doi.org/10.1038/nrg2719
Lee S, Kim N (2014) Transposable elements and genome size variations in plants. Genom Informat 12(3):87–97. https://doi.org/10.5808/GI.2014.12.3.87
Lei M et al (2015) Regulatory link between dna methylation and active demethylation in Arabidopsis. Proc Natl Acad Sci 112(11):3553–3557. https://doi.org/10.1073/pnas.1502279112
Leitch AR, Leitch IJ (2012) Ecological and genetic factors linked to contrasting genome dynamics in seed plants. Int J Plant Sci 194(3):629–646. https://doi.org/10.1111/j.1469-8137.2012.04105.x
Li S et al (2017) Laser irradiation-induced DNA methylation changes are heritable and accompanied with transpositional activation of MPing in rice. Front Plant Sci 8(March):1–15. https://doi.org/10.3389/fpls.2017.00363
Li Y, Kumar S, Qian W (2018) Active DNA demethylation: mechanism and role in plant development. Plant Cell Rep 37(1):77–85. https://doi.org/10.1007/s00299-017-2215-z
Ma J, Devos KM, Bennetzen JL (2004) Analyses of LTR-retrotransposon structures reveal recent and rapid genomic DNA loss in rice. Genome Res 30(4):860–869. https://doi.org/10.1101/gr.1466204
Ma Lu et al (2015) Angiosperms are unique among land plant lineages in the occurrence of key genes in the RNA-directed DNA methylation (RdDM) pathway. Genome Biol Evol 7(9):2648–2662. https://doi.org/10.1093/gbe/evv171
Marí-ordóñez A et al (2013) Reconstructing de novo silencing of an active plant retrotransposon. Nat Genet 45(July):1029–1039. https://doi.org/10.1038/ng.2703
Martienssen RA, Colot V (2001) DNA methylation and epigenetic inheritance in plants and filamentous fungi. Science 293(August):1070–1075
Matzke MA, Mosher RA (2014) RNA-directed DNA methylation: an epigenetic pathway of increasing complexity. Nat Rev Genet 15(June):394–408. https://doi.org/10.1038/nrg3683
Matzke MA, Kanno T, Matzke AJM (2015) RNA-directed DNA methylation: the evolution of a complex epigenetic pathway in flowering plants. Annu Rev Plant Biol 66(1):243–267. https://doi.org/10.1146/annurev-arplant-043014-114633
Narsai R et al (2017) Dynamic and Rapid changes in the transcriptome and epigenome during germination and in developing rice (Oryzasativa) coleoptiles under anoxia and re-oxygenation. Plant J 89(4):805–824. https://doi.org/10.1111/tpj.13418
Niederhuth CE et al (2016) Widespread natural variation of DNA methylation within angiosperms. Genome Biol. https://doi.org/10.1186/s13059-016-1059-0
Numa H, Yamaguchi K, Shigenobu S, Habu Y (2012) Gene body CG and CHG methylation and suppression of centromeric CHH methylation are mediated by DECREASE IN DNA METHYLATION1 in rice. Cell Press 8(10):1560–1562. https://doi.org/10.1016/j.molp.2015.08.002
Nystedt B et al (2013) The Norway spruce genome sequence and conifer genome evolution. Nature 497(7451):579–584. https://doi.org/10.1038/nature12211
Ooi SKT, Bestor TH (2008) The colorful history of active DNA demethylation. Cell 133(7):1145–1148. https://doi.org/10.1016/j.cell.2008.06.009
Park K et al (2016) DNA demethylation is initiated in the central cells of Arabidopsis and rice. Proc Natl Acad Sci 113(52):15138–15143. https://doi.org/10.1073/pnas.1619047114
Pei L et al (2019) Tracing the origin and evolution history of methylation-related genes in plants. BMC Plant Biol 1(19):1–13. https://doi.org/10.1186/s12870-019-1923-7
Pellicer J et al (2018) Genome size diversity and its impact on the evolution of land plants. Genes 9(2):88. https://www.mdpi.com/2073-4425/9/2/88. Accessed 11 Sep 2018
Pikaard CS, Scheid O (2014) Epigenetic regulation in plants. Cold Spring harbor Perspect Biol. https://doi.org/10.1080/0021213X.1988.10677093
Raghavan V (2003) Some reflections on double fertilization, from its discovery to the present. New Phytol 159(3):565–583. https://doi.org/10.1046/j.1469-8137.2003.00846.x
Rodrigues JA, Zilberman D (2015) Evolution and function of genomic imprinting in plants. Genes Dev 29(24):2517–2531. https://doi.org/10.1101/gad.269902.115
Schnable PS, Ware D, Fulton RS, Stein JC, Wei F, Pasternak S, Liang C, Zhang J, Fulton L, Graves TA et al (2014) The B73 maize genome: complexity, diversity, and dynamics. Science 1112(2009):1112–1115
Shen X et al (2014) Natural CMT2 variation is associated with genome-wide methylation changes and temperature seasonality. PLoS Genet 10(12):e1004842. https://doi.org/10.1371/journal.pgen.1004842
Slotkin RK, Martienssen R (2007) Transposable elements and the epigenetic regulation of the genome. Nat Rev Genet 8(April):272–285. https://doi.org/10.1038/nrg2072
Slotkin K, Vaughn M, Borges F, Tanurdzic M, Becker JD, Feijo JA, Martien RA (2009) Epigenetic reprogramming and small RNA silencing of transposable elements in pollen. Cell 136(3):461–472. https://doi.org/10.1016/j.cell.2008.12.038
Sroufe LA et al (2012) Active DNA demethylation in plant companion cells reinforces transposon methylation in gametes. Science 337(6100):1360–1364
Stroud H et al (2013) Non-CG methylation patterns shape the epigenetic landscape in Arabidopsis. Nat Struct Mol Biol 21(December):64–72. https://doi.org/10.1038/nsmb.2735
Takuno S, Ran J-H, Gaut BS (2016) Evolutionary patterns of genic DNA methylation vary across land plants. Nat Plants 2(2):1–7. https://doi.org/10.1038/nplants.2015.222
Tammen SA, Friso S, Choi S-W (2012) Molecular aspects of medicine epigenetics: the link between nature and nurture. J Mol Aspects Med. https://doi.org/10.1016/j.mam.2012.07.018
Tang K, Lang Z, Zhang H, Zhu J-K (2016) The DNA demethylase ROS1 targets genomic regions with distinct chromatin modifications. Nat Plants 2(11):1–10. https://doi.org/10.1038/nplants.2016.169
Teixeira FK et al (2009) A role for RNAi in the selective correction of DNA methylation defects. Science 323(5921):1600 LP–1604. https://science.sciencemag.org/content/323/5921/1600.abstract
Thiebaut F, Hemerly AS, Cavalcanti P, Ferreira G (2019) A role for epigenetic regulation in the adaptation and stress responses of non-model plants. Front Plant Sci 10(March):1–7. https://doi.org/10.3389/fpls.2019.00246
Wan T et al (2018) A genome for gnetophytes and early evolution of seed plants. Nat Plants 4(2):82–89. https://doi.org/10.1038/s41477-017-0097-2
Yu A et al (2013) Dynamics and Biological Relevance of DNA demethylation in Arabidopsis antibacterial defense. Proc Natl Acad Sci 110(6):2–7. https://doi.org/10.1073/pnas.1211757110
Zemach A et al (2013) The Arabidopsis nucleosome remodeler DDM1 allows DNA methyltransferases to access H1-containing heterochromatin. Cell 153(1):193–205. https://doi.org/10.1016/j.cell.2013.02.033
Zhang X et al (2006) Resource mapping and functional analysis of DNA methylation in Arabidopsis. Cel 126:1189–1201. https://doi.org/10.1016/j.cell.2006.08.003
Zhang H, Lang Z, Zhu JK (2018) Dynamics and function of DNA methylation in plants. Nat Rev Mol Cell Biol 19(8):489–506
Zhong S et al (2013) Single-base resolution methylomes of tomato fruit development reveal epigenome modifications associated with ripening. Nat Biotechnol 31(2):154–159. https://doi.org/10.1038/nbt.2462
Zilberman D (2017) An evolutionary case for functional gene body methylation in plants and animals. Genome Biol 18(87):17–19. https://doi.org/10.1186/s13059-017-1230-2
Acknowledgements
We acknowledge Zhi-Ming Liu and Min Yang for assistance and comments on bioinformatics interpretation, data analysis and figure preparation. This work was supported by the National Science Foundation of China (31870206), the Scientific Research Project of Shenzhen Urban Administration (201519) and a Major Technical Research Project of the Innovation of Science and Technology Commission of Shenzhen (JSGG20140515164852417). Additional funding was provided in particular by the Scientific Research Program of Sino-Africa Joint Research Center (SAJL201607).
Author information
Authors and Affiliations
Corresponding authors
Ethics declarations
Conflict of interest
All authors declare no conflict of interest.
Additional information
Communicated by Neal Stewart.
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Wambui Mbichi, R., Wang, QF. & Wan, T. RNA directed DNA methylation and seed plant genome evolution. Plant Cell Rep 39, 983–996 (2020). https://doi.org/10.1007/s00299-020-02558-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00299-020-02558-4