INTRODUCTION

Currently, a bulk of data has been accumulated to demonstrate a close relationship between the key cellular processes and the regulation of the activity of transposable elements (TEs). This relationship enables survival of TEs in the genomes despite strong defense mechanisms limiting their activity. What mechanism underlies this global genomic trade-off? TEs serve as a rich source of evolutionary changes in the genome: they offer enhancers, promoters, exons, splicing sites, architectural elements and participate in the key mechanisms of development and immune response [1-4]. TEs contribute to the maintenance of essential chromosome structures such as centromeres and ribosomal RNA gene loci [5-8]. In this review, we focus on the origin and maintenance of telomeres, one of the most compelling cases of the role of retrotransposons in genome evolution.

Telomeres – the ends of linear chromosomes – have captivated attention of many scientists for more than 50 years. In 1971-1973, Aleksey M. Olovnikov published his brilliant prediction regarding the “Achilles’ heel of the double helix,” or chromosome end under-replication [910]. He also suggested existence of a specialized DNA polymerase, which compensates for the shortening of telomeric DNA. Much later this enzyme, now known as telomerase, was discovered. It turned out that this was a reverse transcriptase, which acts in a complex with the RNA template [11]. The ends of the primary linear chromosomes are thought to be protected by the attachments of retroelements. From this point of view, the telomerase ribonucleoprotein complex can be considered as a specialized retroelement that evolved to protect the chromosome ends [12]. In fact, phylogenetic analysis of the reverse transcriptase of retrotransposons and telomerase revealed that they originated from a common ancient retroelement enzyme [13-15]. Telomerase maintains telomeres in most organisms, but there are other ways to elongate chromosome ends. There are many modern species of animals whose telomeres are supported by the retrotransposon attachments. These are many species of insects and, in particular, the Drosophilidae family. It is believed that Drosophila has lost telomerase, and the specialized telomeric retrotransposons are used to maintain telomeres. The telomeres of the silkworm Bombyx mori are of mixed type, and they are maintained by the specific telomeric retroelements and low-activity telomerase. Under certain conditions, retroelements are also attached to the human telomeres and to telomeres of other species using telomerase [1617]. In these cases, the double strand break at the chromosome end is used as a convenient target for TE retrotranspositions. However, mosquitoes do not have telomerase or telomeric retroelements, and the telomeres lengthening appeared to be mediated by the recombination of short satellite-like repeats [1819].

Telomeres maintained by TE transpositions differ in the nature of telomeric repeats from telomeres maintained by telomerase, but they serve as a reminder of the retrotransposon origin of telomeres. The study of the regulation of Drosophila telomeres, which consist of retrotransposons, reveals a surprising similarity, if not identity, between the mechanisms that control telomere maintenance and TE activity. This similarity leads us to the conclusion that there is a close functional link between retrotransposons and telomeres in the genome, which, according to recent evidence, is found not only in Drosophila but also in mammals.

WHY IS TELOMERASE LOST IN MANY INSECT AND DROSOPHILA SPECIES?

Telomerase has been lost in many plants and animals in the process of evolution. Instead, telomeres are extended by other mechanisms in these cases. In Diptera the telomerase gene was lost about 270 million years ago [20]. Members of the Diptera group are one of the most numerous and prosperous species of animals, despite their lack of telomerase. The gene coding for telomerase has not been detected in the genome of Drosophila, and telomere elongation occurs due to transpositions of specialized TEs. The most thoroughly studied are the telomeric retroelements of Drosophila melanogaster. They are represented by three families of LINE retrotransposons (Long Interspersed Nuclear Elements) – HeT-A, TART, and TAHRE [2122]. At the same time, Bombyx mori has low-activity telomerase, with telomeric niches actively filled with specialized telomere retrotransposons SART and TRAS [23]. At present, the evolutionary pressures underlying the apparent reverse evolution and rejection of telomerase remain unknown.

Alexey M. Olovnikov was always interested in exceptions to the rules as he endeavored to explain various mysterious phenomena of nature. In this chapter, we would like to cite his interesting ideas concerning the loss of telomerase in Drosophila, which he expressed in personal correspondence: “It is known that the chromosomes of the salivary gland cells in D. melanogaster larva are able to undergo many of endoreplication rounds. In addition, there is a somatic synapse of homologous chromosomes. Therefore, the lateral conjugation of sister chromatids, tightly connected together along their entire length, should necessarily create a mechanical barrier for the formation of a telomerase telomere. Such telomere should have a three-dimensional telomere loop. In the formation of a three-dimensional telomeric complex, hundreds of conjugated chromatid ends, tightly united in a single polytenic bunch, would create compelling steric obstacles to each other in the formation of their 3D telomeric complex. Therefore, polytenization forced Drosophila to abandon telomerase. In contrast, the G-quadruplex formation, which protects the telomeric retrotransposon end on each chromatid, does not require the telomeric loop and is therefore easily compatible with lateral chromatid conjugation. Presumably, polytenization was the main reason for choosing an alternative method of protecting Drosophila telomeres. As it is well known, chromosome polytenization in salivary gland cells in Drosophila larva is necessary for production of large amounts of glue before pupation. It is likely that organisms that need increased gene copy number and have telomerase telomeres do not use dense chromatid packaging. For example, in ciliates that have a polyploid macronucleus and telomerase, chromosomes are fragmented. In theory, the following alternative is also acceptable: if telomeres, unlike the rest of the polytenized chromatids, are not side-conjugated (and, therefore, free from the previously mentioned steric hindrance), then this expands the possibilities of using the telomerase method of telomere protection. Therefore, there may be species in which polytene chromosomes and telomerase-like proteins are used at some developmental stages, but the ends of the chromosomes are not paired. Such termini have been cytogenetically observed, for example, in specialized polytene cells and at the ends of meiotic pachytene chromosomes of the legume plant Vigna unguiculata [2425]. A tendency to split some chromosomal ends into oligotene bundles, can be seen in the polytene chromosomes of certain species [26]” (from the letter of A. M. Olovnikov to A. I. Kalmykova, September 2017).

Indeed, recently it was reported that telomeric retrotransposons tend to form G-quadruplexes (secondary structures formed by guanine-rich DNA sequences) not only in the Drosophila species, but also in other species [27]. Such structures can protect the ends of linear chromosomes in the absence of a telomeric loop typical for the telomeres maintained by telomerase. Existence of alternative ways of telomere maintenance gives a unique opportunity to explore emergence of the functional analogues in nature. Researching Drosophila telomeres allowed us to take a fresh look at the protective mechanisms of TE control and their roles in telomere function. The most striking example is the involvement of small Piwi-interacting RNA (piRNA) and the piRNA pathway in the control of Drosophila telomere length in the germline.

piRNA PATHWAY: SOURCES AND TARGETS OF piRNAs

TEs were found in all studied species and are represented by several classes and numerous families [28]. TEs make up half of the human genome and 20% of the D. melanogaster genome [29] and are the primary source of mutations that can cause both cancer and severe developmental disorders [3031]. Various mechanisms are used to limit the activity of TEs in somatic cells and prevent inherited transpositions in the germline. Mechanisms which work at the level of transcription of TEs are key to prevent the initial stage of their reproduction. Transcriptional silencing is achieved by two main mechanisms leading to the formation of an inactive chromatin structure. The first and most conservative mechanism of chromatin compaction is associated with changes in histone modifications, in particular with methylation of lysine 9 of histone 3 (H3K9me), which next leads to the binding and spread of the heterochromatin protein 1 (HP1) [32]. Another powerful mechanism of repression of transcription is DNA methylation by cytosine-5′-methyltransferases [33]. In the genome, both DNA methylation and histone modifications target TEs to prevent their transcription. The main question is how are these general mechanisms recruited to the TEs? The KRAB-ZFP (KRAB-containing zinc finger proteins) family proteins play an important role in the recognition and methylation of DNA and histones at endogenous retroviruses in somatic cells of vertebrates [3435].

For most plants and animals, RNA interference is the most conservative and nearly universal mechanism for distinguishing between “self” and “non-self”. The pathways involving Argonaute family proteins and associated short RNAs are termed nucleic acid immunity because these mechanisms may recognize and eliminate foreign nucleic acids belonging to viruses, TEs, or transgenes based on nucleic acid sequence. siRNA (small interfering RNA) of 21 nucleotides in length and RNA interference protect somatic cells from viruses. The proteins of the Piwi subfamily of the Argonaute family and the associated piRNAs with a length of 24-30 nucleotides provide protection against TEs and viruses in the animal gonads [3637]. The distinctive feature of this system is that the Piwi–piRNA complex is able to induce TE chromatin modifications in a sequence-specific manner, i.e., to cause transcriptional silencing. Piwi nuclear proteins in complex with piRNAs induce the formation of heterochromatin, using the complementarity of piRNA and nascent mRNA of TEs. This is a multi-stage process that requires the interaction of several linker and accessory proteins, post-translational modifications that result in assembly, conformational changes, and eventually stabilization of the Piwi-piRNA-driven chromatin protein complex, and, finally, recruitment of universal heterochromatin factors to TEs [3638]. Such tight regulation is essential to turn off the TEs and prevent erroneous repression of cellular genes.

piRNA-mediated silencing is a multi-stage process. The main steps of this process are the formation of long single-strand RNA precursors in the nucleus, their processing into mature piRNAs in the cytoplasm, and piRNA-mediated silencing, which can occur both in the nucleus (transcriptional silencing) and in the cytoplasm (post-transcriptional silencing).

To perform their functions, mature piRNAs should complementarily interact with the transcripts of TE, i.e., they should be antisense to the mRNA of TEs. Indeed, a significant portion of piRNAs in the gonads corresponds to TEs, but are not generated from the mRNA of TEs. The origin of such antisense TE RNAs in the genome is not quite clear but may involve the following mechanisms. piRNAs are thought to be formed from long single-strand endogenous RNAs known as piRNAs precursors (pre-piRNAs) [3940]. Our knowledge on this issue is mainly obtained from the D. melanogaster oogenesis model. Studies of the genomic origin of piRNAs in Drosophila have led to the conclusion that the sources of piRNAs and their targets are located at different genomic loci (Fig. 1a). This assumption was based on the small RNA sequencing data that revealed the regions in pericentromeric heterochromatin with high density of the single-mapped (in other words, unique) piRNAs. Such loci were termed piRNA clusters [39]. These extended regions of the genome, up to 200 kb in size, enriched in damaged TE copies, encode unusually long read-through transcripts that are processed into mature piRNAs that target euchromatic active TEs. This is a remarkable illustration of how the genome regions that were previously thought of as “junk” DNA are really being used functionally. It should be mentioned that this scenario has not been proved to be universal.

Fig. 1.
figure 1

Where do piRNA precursors come from? a) Heterochromatic pericentromeric piRNA clusters are the sources of anti-transposon piRNAs that target the TE active copies. The mapping of small RNAs to the chromosome is schematically shown with the activity of uni- and dual-strand piRNA clusters depicted below. The Piwi–piRNA complex recognizes and silences TE active copies (black triangle). b) TE active copies form de novo piRNA clusters. piRNA signature of the TE-associated piRNA clusters is schematically shown. sRNAseq – small RNA sequencing. c) Transgenic model demonstrates how active TEs in euchromatin could become piRNA clusters. Piwi protein in complex with endogenous piRNA to I-element recognizes RNA in the complementary transgenic locus, promoting generation of transcripts from both genomic strands, which are then processed into mature piRNAs.

In Drosophila, there are two types of piRNA clusters, uni-strand and dual-strand, which are transcribed in one or two directions, respectively. Both types of piRNA clusters produce predominantly antisense piRNA relative to TEs. The reason why they are antisense is easier to understand considering the uni-strand piRNA clusters, in which all TE remnants are in an inverted position relative to the pre-piRNA transcription direction. The best-known uni-strand piRNA cluster is the Drosophila flamenco locus, which operates in ovarian follicular cells. It contains many inactive TE copies on the minus genomic strand relative to the direction of transcription, therefore the piRNAs processed from long precursor transcripts are complementary to the transposon mRNAs and guide their transcriptional silencing [3941]. flamenco-like loci generate piRNAs against endogenous retroviruses related to the gypsy family. This intricate scenario nevertheless is recurring in evolution. flamenco-like uni-strand piRNA clusters have been found in mosquitoes and other Drosophila species [42-44]. Moreover, during mouse spermatogenesis, two flamenco-like uni-strand piRNA clusters produce anti-transposon piRNAs [45]. The second type of heterochromatic piRNA clusters – dual-strand clusters – are considered as principal players in the anti-transposon defense in the Drosophila germline. These loci contain chaotically oriented TE copies disrupted by other TE insertions. Dual-strand piRNA clusters are bidirectionally transcribed resulting in the generation of long non-coding piRNA precursors from both genomic strands [46]. Long piRNA precursors are exported to the cytoplasm where they are processed into mature piRNAs. piRNA processing is a conserved well-established mechanism in different species. The outer mitochondrial membrane and perinuclear compartment serve as platform for compartmentalization of piRNA generation and maturation. piRNAs are cleaved as a result of activity of various specialized ribonucleases. Cytoplasmic Piwi subfamily proteins perform piRNA amplification using sense and antisense TE transcripts and piRNA precursors from clusters, which leads to the formation of both sense and antisense piRNAs. This mechanism, known as “ping-pong,” is found in the gonads of all the species studied, and leads not only to cleavage of TE mRNA (post-transcriptional silencing), but also to production of new piRNAs. Recent comprehensive reviews describe in details the processes of piRNA formation and amplification, as well as the piRNA processing machinery [36, 37, 47].

Dual-strand piRNA clusters are thought to represent a repository of information in the genome about prior TE invasions, as well as a kind of trap for TEs, because their insertions in the piRNA cluster would lead to the production of piRNAs and the suppression of the activity of cognate copies in the genome [48]. While dual-strand piRNA clusters were shown to be species-specific for Drosophila and a few arthropod species, these types of transposon-specific piRNA sources have not been identified in other investigated species, including mammals. The existence of a conservative mechanism for the synthesis of piRNA precursors that are antisense to TEs is still a matter of debate. For example, sense and antisense piRNAs are produced from individual copies of evolutionarily young active retrotransposons in mammals, despite the presence of a significant number of damaged TE copies in the genome [45].

In the piRNA pathway mechanism for transposon silencing, active transposons are expected to be the primary targets of piRNAs. A more thorough examination of small RNA libraries derived from D. melanogaster ovaries revealed that not only heterochromatic piRNA clusters, but also full-length euchromatic TEs generate short RNAs, both piRNAs and siRNAs. This shows that at the sites of recent TE insertions, local dual-strand piRNA clusters occur [49], which is similar to the scenario of piRNA generation from the active copies of LINE1 in mammals [45]. A distinctive feature of the TE-associated piRNA clusters is a “piRNA signature”. It is an asymmetric profile of small RNA distribution upstream and downstream of TEs resulting from the read-through transcription of piRNA precursors into the TE flanking genomic regions (Fig. 1b). This signature was successfully used for the prediction of non-annotated TE insertions. Moreover, the expansion of small RNA production outside TEs can suppress the expression of the neighboring genes [49]. Thus, active TEs serve not only as targets for the piRNA system, which promotes chromatin compaction, but also as a source of small RNAs, as they can produce de novo si- and piRNAs. The Drosophila telomeric retrotransposons, which are arrayed as tandem repeats in the telomere, are also the target and source of piRNAs, which allow feedback regulation of the telomeric retrotransposons expression, as addressed in more depth in the following section.

The mechanism underlying the formation of new TE-associated piRNA clusters, particularly the mechanism of activation of bidirectional transcription, is not totally understood. Antisense promoters are known only for a few TEs and are rather an exception. For example, the LINE1 human retrotransposon antisense promoter and the transcripts derived from it are involved in siRNA-mediated LINE1 transposition suppression [5051]. When the Piwi–piRNA complex recognizes complementary transcripts, an uncommon type of transcription, convergent transcription, is thought to be triggered at genomic loci harboring active copies of TEs. This process depends on the genomic environment of the TE insertion: endogenous convergent transcription at the TE insertion locus promotes the formation of a new piRNA cluster [52]. Transcripts generated from both genomic strands are then processed into mature pi/siRNAs. Such a scenario has an obvious biological significance, as it would result in amplification of protective small RNAs against most dangerous, transcriptionally active TEs. A transgenic model containing a DNA fragment targeted by endogenous piRNAs elucidated some details of the de novo piRNA cluster formation [53-55]. It was shown that transgenic constructs containing a fragment of I-element can become new piRNA clusters. The formation of piRNA clusters de novo is accompanied by the appearance of weak transcription from the minus strand and the generation of sense and antisense pi/siRNAs from the entire transgene and from genomic sequences at a distance of 1 to 10 kb from the transgene [5657] (Fig. 1c).

A scenario in which an active TE is a source of piRNAs should be evolutionarily beneficial and have selective advantages. It is unclear why extended heterochromatic piRNA clusters, which are maintained by a unique transcription mechanism, have survived throughout evolution in Drosophila and other arthropods. The removal of the most extended pericentromeric piRNA clusters in D. melanogaster were found to not lead to TE derepression [58]. This suggests that TE silencing in the stable laboratory line is not primarily mediated by heterochromatic piRNA clusters. This function is most likely carried out by TE-associated piRNA clusters, which are generated via the assistance of piRNAs acquired from the mother with the oocyte cytoplasm [59].

Differences in TE silencing strategies between species may be related to different habitats. Horizontal TE transfer has an impact on natural arthropod populations, including the re-invasion of previously lost TEs in the genome [6061]. The memory of earlier TE invasions, which is kept in the form of TE fragments in piRNA clusters, can protect the population from TE re-infection due to the existence of complementary piRNAs and the activation of piRNA-silencing [39, 52, 57, 59, 62].

The idea that active TE copies serve as the primary targets of the piRNA pathway is emphasized by the recently discovered mechanism of a co-transcriptional degradation of TE transcripts [63]. Despite the establishment of heterochromatin in loci with active copies of TEs, they can still be transcribed in the germline. The excess of the TE transcripts in the Drosophila germline is removed through the activity of the nuclear Ccr4–Not complex, which has deadenylase activity [64]. Ccr4–Not is recruited to the transposon transcripts co-transcriptionally by the Piwi-piRNA nuclear complex. Most likely, the polyA tail at the 3′-end of the TE mRNA is eliminated due to deadenylase activity of the Ccr4–Not complex. The nuclear RNA quality control system can identify these transcripts as aberrant and subject them to exonuclease cleavage. Noteworthy, the targets of nuclear Ccr4–Not are mainly active full-length TEs and telomeric retrotransposons [63].

To summarize, piRNAs can trigger a number of important processes aimed at suppressing TE activity (Fig. 2). In the nucleus, piRNAs induce transcriptional silencing leading to heterochromatin assembly at the TE loci [65-67]. The piRNA system also functions at the co-transcriptional level, causing nucleases to degrade TE transcripts at transcription sites [63]. In the cytoplasm, the piRNA system cleaves TE transcripts (post-transcriptional silencing), resulting in piRNA amplification and their subsequent inheritance through the oocyte cytoplasm [39, 59, 68-71]. Importantly, piRNAs can initiate de novo piRNA cluster establishment at full-length euchromatic TE insertions leading to the expression of antisense TE transcripts and a burst of piRNA production against most active TEs [49]. Furthermore, the fast evolution of proteins participating in the piRNA pathway allows this system to be highly adaptable to new targets [72-74]. Despite the high capacity of piRNAs and other anti-transposon defense systems, selfish TEs successfully bypass them, continuing their propagation, resulting in deleterious mutations, diseases, and developmental disorders. A paradoxical situation has emerged in which the suppression of TE activity requires its activity, and as a result, the piRNA system operates. Apparently, the far from perfect efficacy of these and other defense systems allows TEs to multiply within the tolerable limits, producing material for genome evolution and positive selection, while also allowing some critical systems, such as telomeres, to function. The latter phenomenon will be discussed below.

Fig. 2.
figure 2

The piRNA silencing system is a multi-level mechanism. In Drosophila, piRNAs can induce the following processes: a) transcriptional silencing and chromatin compaction; b) co-transcriptional degradation of nascent RNA in the nucleus via assistance of the deadenylase nuclear complex Ccr4–Not; c) post-transcriptional RNA degradation and amplification of piRNAs in the cytoplasm; d) de novo piRNA production at active TEs due to activation of antisense transcription; e) epigenetic transgenerational memory (transfer of maternal piRNAs in complex with Piwi proteins to offspring through the germ plasm of the oocyte).

ROLE OF piRNAs IN REGULATION OF Drosophila TELOMERES

In Drosophila, disruption of the piRNA system causes not only TE activation but also excessive telomere extension due to the increased frequency of retrotransposon attachments to chromosome ends [75]. This occurs because the germline pool of piRNAs complementary to telomeric retrotransposons is reduced. The piRNA pathway processes all TE transcripts in the germline, producing piRNA without differentiating between telomeric and parasitic retrotransposons. Disruption of the piRNA pathway leads to a decrease in piRNA abundance, which is accompanied by the accumulation of telomeric TE transcripts. These transcripts serve as intermediates for the extension of telomeres. Drosophila telomeres are maintained by the retrotranspositions of specialized telomeric TEs that are not found in other genomic loci. Telomeric retroelements are both the sources and the targets of piRNAs, making them a unique example of a self-regulated TE-associated piRNA cluster. Bidirectional promoters of Drosophila telomeric retrotransposons organized in head-to-tail arrays ensure the generation of both sense and antisense transcripts, which are then used to produce telomeric piRNAs [76-79]. Drosophila is not the only example of such regulation. Piwi proteins are also involved in the regulation of SART and TRAS telomeric retrotransposon transpositions in the silkworms [80].

The heterochromatin state is a characteristic of telomeres. Heterochromatin factors are recruited to telomeric retrotransposons via the assistance of the piRNA system in the Drosophila germline [81]. When the piRNA system is disrupted, the levels of the key heterochromatin protein HP1 and the histone modification H3K9me3 decrease at telomeres. The shift of telomeric clusters from the nuclear periphery, along with a global change in telomeric chromatin structure, is found in the germline of Drosophila piRNA mutants [81]. Furthermore, the most abundant D. melanogaster telomeric repeat, HeT-A, is co-transcriptionally regulated in the nucleus by recruiting the Ccr4–Not deadenylase complex [63]. Confocal microscopy revealed the presence of structures nearby telomeres that contained the Ccr4–Not protein complex, the Piwi protein, and RNA nuclear export factors [63]. Interestingly, non-canonical deadenylases Ccr4 and Caf1, which are localized in Cajal nuclear bodies, are involved in the biogenesis of human telomerase RNA component [82]. Thus, the piRNA system and Ccr4–Not deadenylase are implicated in both the reduction of activity of selfish TEs and in the Drosophila telomere regulation.

Telomere length in Drosophila is negatively regulated by the piRNA system; the more actively it functions, the less frequently telomere lengthening takes place. In order to maintain the frequency of attachment of telomeric retroelements to the ends of chromosomes required for normal development, a complicated feedback mechanism controls the level of telomeric transcripts, telomeric piRNAs, and the state of telomeric chromatin. The fact that the piRNA-mediated mechanism should effectively suppress TE activity complicates telomere length control even more. Indeed, piRNA system disruption causes not only an increase in the frequency of telomeric attachments, but also TE activation and sterility [75]. The opposite situation, i.e., activation of the piRNA system and excessive telomere shortening, was observed in one of the natural Drosophila strains. Differences in the extent of anti-transposon protection among natural Drosophila populations can be explained by the high variability of proteins involved in the piRNA pathway and variable efficacy of primary piRNA processing [83]. Flies, which had the most efficient piRNA processing, lacked full-length copies of the HeT-A retrotransposon, the main structural element of telomeres. This strain had reduced fertility and viability. Therefore, enhancement of piRNA-mediated protection leading to strong repression of transposons, can also cause telomere shortening. This result implies that limits exist for improvements in the efficiency of the piRNA defense system because it needs to achieve a balance between protecting the genome against transposon propagation and allowing adequate telomere maintenance (Fig. 3).

Fig. 3.
figure 3

The mechanisms regulating telomere maintenance and transposon control are tightly linked. The piRNA system controls both activity of selfish TEs and telomeric retrotransposon attachments to the chromosome ends in the Drosophila germline.

WHEN AND WHY ARE TRANSPOSABLE ELEMENTS ACTIVATED DURING DEVELOPMENT?

The stages of development during which TEs are activated could point to the mechanisms of TE control (Fig. 4, a-c). Surprisingly, short-term transposon activation is observed during early stages of gametogenesis in a variety of species [84]. There is a brief developmental window at the early stages of D. melanogaster oogenesis when Piwi levels decrease and TEs mobilize [85]. The TE transcripts produced at this stage are processed into piRNAs by the cytoplasmic piRNA amplification machinery. When Piwi expression is restored these piRNAs mediate transcriptional silencing of TEs at later stages of oogenesis [86]. When the Piwi protein is absent during the early stages of oogenesis, transcriptional activation of telomeric retroelements (HeT-A and TART) is detected, which leads to the formation of telomere elongation intermediates [66, 87, 88]. In D. melanogaster, such intermediates are spherical ribonucleoprotein (RNP) particles composed of the retrotransposon HeT-A-encoded Gag protein, packed with HeT-A RNA, and capable of being targeted to telomeres [8990]. For the first time, such HeT-A spheres were found in actively proliferating larval brain cells, and their appearance on telomeres coincided with telomere replication [89]. The main structural component of Drosophila telomeres is the non-autonomous retroelement HeT-A, lacking reverse transcriptase. Reverse transcriptase is provided by other telomeric retroelements – TART and/or TAHRE. TART reverse transcriptase was also discovered in the HeT-A spheres in neuroblasts [91]. Most likely, HeT-A spheres are necessary for the retrotransposition of telomeric elements to the chromosome ends to elongate Drosophila telomeres. Transcription of Drosophila telomeric retrotransposons HeT-A and TART is normally detected in the ovarian germ cysts, and when the piRNA system is disrupted, spherical particles containing RNA and the Gag protein encoded by HeT-A arise [87]. Telomere elongation occurs at the same stages at which TEs are activated. This is most likely due to a decrease in piRNA protection and the accumulation of telomeric element transcripts, as well as global heterochromatin decompaction and greater accessibility of chromosomal ends in mitotically active gamete progenitor cells. Indeed, disruption of various heterochromatin factors such as HP1, histone demethylase Lsd1 and its cofactor Ova, methyltransferases dSetdB1 and Su(var)3-9 lead to both TE activation and telomeric transcript accumulation [92-98]. Mutation of Su(var)2-5 gene encoding HP1, causes telomere elongation [98]. A similar scenario has been observed in other animals. In mammals, a wave of de novo DNA methylation is typical for prenatal male germline development. As a result, expression of young active subfamilies of retroelement LINE1 occurs in primordial germ cells [99]. At later stages of spermatogenesis, emergence of Piwi subfamily protein, MILI2 in mouse, triggers piRNA production, recruitment of DNA methylase and establishment of piRNA-mediated de novo DNA methylation at LINE1 sequences [100].

Fig. 4.
figure 4

Developmental stages of transposon activation. a) When Piwi protein levels fall during Drosophila oogenesis, TEs and telomeric retrotransposons are activated in the germ cysts. At this step, piRNAs are produced, which cause transcriptional silencing at the later stages of oogenesis. Telomere elongation is anticipated to occur at the same stage due to retrotranspositions of telomeric retroelements to the chromosome ends. b) TE activation is associated with global demethylation in mouse primordial germ cells during spermatogenesis. Then, piRNA-mediated de novo DNA methylation of LINE1 active copies is established. c) Transposon mobilization and telomere elongation are observed during first zygotic divisions in mammalian embryogenesis. d) The inhibition of LINE1 by piRNAs may have an indirect effect on LINE1 telomeric functions in mammalian germ cells (hypothesis).

In the mammalian development, two waves of demethylation and remethylation of DNA occur – in primordial germ cells and in the early embryogenesis [101]. Global demethylation of the genome in the primordial germ cells during prenatal development is believed to be required for epigenome reprogramming and the establishment de novo DNA methylation patterns [102, 103]. DNA demethylation is also required for telomere extension in embryonic stem cells which is associated with increased levels of transcription of many retrotransposons [104-106]. The transcription factor Zscan4 (Zinc finger and SCAN domain containing 4) plays a critical role in maintaining the DNA demethylation state and heterochromatin derepression in embryonic stem cells and 2-cell embryos [107]. Zscan4 increases the expression of genes involved in homologous recombination, which stimulates the recombination mechanism of telomere lengthening in embryonic stem cells, 2-cell embryos, and ALT (alternative lengthening of telomeres) tumor cells that use recombination for telomere lengthening [105, 106, 108]. In preimplantation embryos, the activation of LINE1 retrotransposons and endogenous retroviruses is also observed [109-112]. Interestingly, LINE1 inhibition leads to impaired expression of pluripotency factors, including Zscan4, and blocks telomere elongation in embryonic stem cells [113]. In turn, LINE1 RNA acts as a transcriptional repressor of the Dux gene, which is required for embryonic cells to exit the 2C state and continue development [114]. It was also shown that early differentiation gene promoters contain regulatory regions of endogenous retroviruses and are regulated by retroviral proteins [115]. Thus, transposon activation occurs during brief periods of development associated with genome reprogramming and telomere elongation. Furthermore, these processes are linked by a sophisticated regulatory network, the proper balance of which is required for normal development.

Natural aging and premature aging syndromes both exhibit chromatin decompaction and activation of expression of TEs and telomeric repeats, resulting in DNA damage and cell death [116117], which emphasizes the commonality of epigenetic regulation of telomeres and TEs at different developmental stages.

INVOLVEMENT OF LINE1 IN THE MAINTENANCE OF MAMMALIAN TELOMERES

Similarities between TE control and telomere regulation mechanisms are evident in Drosophila where telomeres are elongated by the retrotranspositions of retroelements. However, in the vast majority of species, telomeres are maintained by the activity of a specialized reverse transcriptase – telomerase – the components of which are encoded by cellular genes. New evidence demonstrates that the retrotransposons are involved in the functioning of telomeres in mammalian cells. In the absence of the components of the protective telomere complex shelterin, the LINE1 retrotransposon can attach to the telomere in human cells via reverse transcription and become a structural part of telomeric DNA [17]. In addition, LINE1 is also able to play a role in telomere function. LINE1 knockdown in cancer cells resulted in a reduced expression of shelterin proteins, decreased telomerase activity and telomere shortening [118]. In line with this observation, inhibition of LINE1 activity in the mouse 2-cell embryos blocked telomere elongation and genome reprogramming [113]. Furthermore, LINE1 RNPs were found directly at telomeric ends in human cancer cells and mouse 2-cell embryos, where they interacted with telomeric repeat-containing RNA (TERRA) [113119]. LINE1 appears to be involved in telomere biogenesis, however it is unknown how LINE1 is targeted there or which telomere components it recognizes. Given the role of LINE1 in telomere function, it is tempting to speculate about a potential relationship between the piRNA system that regulates LINE1 expression and telomeres in telomerase-using mammals (Fig. 4d).

All together, these findings indicate that the systems regulating telomere maintenance and transposon control in the mammalian genome are connected with respect to temporal regulation and function. Research on this relationship will contribute to a better understanding of the mechanisms of telomere regulation in development, aging, and cancer cells.

CONCLUSION

The interaction between the host genome and TEs is commonly referred to as a genomic conflict, and the processes behind this conflict are complicated and debatable [120]. When the molecular mechanisms of TE control were not yet known, analysis of TE population dynamics revealed a balance between TE spread rate and the mechanisms that limit this process. As a result, the genome has a rather stable amount of TE copies [121]. Excessive reduction of TE activity and their copy number does not appear to provide a selective benefit to the host, as recent methods of genomic data analysis of natural Drosophila populations indicate. For example, the amount of piRNA does not correlate with TE transposition activity, therefore, the piRNA system is not optimally adapted to protect the genome from TEs [122]. The continuous improvement of the adaptable piRNA system is probably limited by its participation in key regulatory functions, which creates a conflict between TEs and the genome. Incomplete suppression of TEs allows important cellular functions to be performed by the domesticated TEs and consequently ensures the survival of selfish TEs in the genome. What is the mechanism of this global genomic compromise? Part of the solution, we believe, can be found in the retrotransposon origin of telomeres and telomerase. Transposon activity is associated with genome reprogramming, erasing of epigenetic marks and telomere elongation. A dual role of the piRNA system in telomere protection and transposon silencing in Drosophila resulted from the nature of Drosophila telomeres which are made up of retrotransposons per se. However, telomerase can also be considered as a specialized retroelement that retained its functional relationship with genomic retrotransposons. Thus, we conclude that protective systems can only partially suppress the activity of selfish TEs since they must be balanced to carry out essential tasks, like those related to telomere maintenance. However, this is a blessing in disguise: an increasing body of evidence indicates that domesticated TEs make a significant contribution to the diversity of regulatory mechanisms that ultimately confer evolutionary benefits to the host genome.