Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome

García, Emilio; Cross, Ismael; Portela-Bens, Silvia; Rodríguez, María E.; García-Angulo, Aglaya; Molina, Belén; Cuadrado, Angeles; Liehr, Thomas; Rebordinos, Laureana

doi:10.1038/s41598-019-53673-6

Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome

Article
Open access
Published: 20 November 2019

Volume 9, article number 17146, (2019)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome

Download PDF

Emilio García¹^na1,
Ismael Cross¹^na1,
Silvia Portela-Bens¹,
María E. Rodríguez¹,
Aglaya García-Angulo¹,
Belén Molina¹,
Angeles Cuadrado ORCID: orcid.org/0000-0003-0089-5249²,
Thomas Liehr³ &
…
Laureana Rebordinos ORCID: orcid.org/0000-0001-6255-9449¹

1659 Accesses
12 Citations
3 Altmetric
Explore all metrics

Abstract

Repetitive sequences play an essential role in the structural and functional evolution of the genome, particularly in the sexual chromosomes. The Senegalese sole (Solea senegalensis) is a valuable flatfish in aquaculture albeit few studies have addressed the mapping and characterization of repetitive DNA families. Here we analyzed the Simple Sequence Repeats (SSRs) and Transposable elements (TEs) content from fifty-seven BAC clones (spanning 7.9 Mb) of this species, located in chromosomes by multiple fluorescence in situ hybridization (m-BAC-FISH) technique. The SSR analysis revealed an average density of 675.1 loci per Mb and a high abundance (59.69%) of dinucleotide coverage was observed, being ‘AC’ the most abundant. An SSR-FISH analysis using eleven probes was also carried out and seven of the 11 probes yielded positive signals. ‘AC’ probes were present as large clusters in almost all chromosomes, supporting the bioinformatic analysis. Regarding TEs, DNA transposons (Class II) were the most abundant. In Class I, LINE elements were the most abundant and the hAT family was the most represented in Class II. Rex/Babar subfamily, observed in two BAC clones mapping to chromosome pair 1, showed the longest match. This chromosome pair has been recently reported as a putative sexual proto-chromosome in this species, highlighting the possible role of the Rex element in the evolution of this chromosome. In the Rex1 phylogenetic tree, the Senegalese sole Rex1 retrotransposon could be associated with one of the four major ancient lineages in fish genomes, in which it is included O. latipes.

Repetitive DNAs and chromosome evolution in Megaleporinus obtusidens and M. reinhardti (Characiformes: Anostomidae)

Article 08 April 2024

Identification and chromosome mapping of repetitive elements in the Astyanax scabripinnis (Teleostei: Characidae) species complex

Article 31 December 2014

DNA transposon invasion and microsatellite accumulation guide W chromosome differentiation in a Neotropical fish genome

Article 27 August 2019

Introduction

A large part of the eukaryotic genome is composed of the so-called repetitive DNA, comprising multiple copies of DNA sequences. Repetitive DNA can be divided into two groups: tandem repeats, which includes satellite DNA, and transposable elements (TEs) that are scattered interspersed repetitions¹. The majority of these repetitive non-coding sequences are usually located in heterochromatic regions. Greater knowledge of these regions and, in general, of the genomes of numerous organisms is now available thanks to high-performance sequencing; this, in recent years, has facilitated great advances in the fields of functional² and comparative genomics³. For some time, repetitive sequences were considered to be mainly “junk” DNA⁴, however, with the advances in genomic research, these repetitive sequences are now known to play a more important role in the functional and structural evolution of the genome. Repetitive sequences are implicated in chromosomal rearrangements and are responsible for a substantial proportions of the karyotype variability observed in several groups⁵.

Microsatellites, or simple sequence repeats (SSR), constitute a unique type of genomic sequence repeated in tandem and are repetitive non-coding DNA regions consisting of small motifs of 1 to 6 tandem-repeated nucleotides; they are abundantly distributed throughout all eukaryotic and prokaryotic⁶ genomes. SSRs contribute to the structure of DNA, the organization of chromatin, the regulation of transcription and translation, as well as DNA recombination, and cell cycle dynamics. SSR are present both in the coding and non-coding DNA⁶, although it has been demonstrated that, in eukaryotic organisms, SSRs are mostly in non-coding regions.

TEs are discrete DNA fragments that have the ability to move within a host genome, often creating new copies of themselves during the process. This unique ability of TEs to make copies of themselves seems to be an effective strategy for self-preservation, and this is evident by their presence in all genomes throughout the tree of life⁷. TEs play important roles and can alter or disrupt the expression of genes, promoting population-level variation and rapid adaptation through the expansion of new TE families by generating structural genomic diversity between populations^8,9. In this sense it has been proposed that the diversity and speciation of teleosts is a reflection of the diversity in the size and structure of their genomes^10,11.

Depending on the transposition mode (with or without an intermediate RNA) the TEs are divided into Class I and II. Each class comprises different subclasses, superfamilies and families³. Among vertebrates, teleosts have the highest number of TE superfamilies¹² and their abundance seems to be determinant in the size of the genomes of this group^3,13. Among actinopterygians, teleost fish have the greatest diversity, with more than twenty-five TE superfamilies described in some species¹⁴. It should be noted that in some species of fish with a small genome size, as pufferfishes and fugu, their genomic reduction is not accompanied by a loss of superfamilies of transposable elements, as is the case in other vertebrates with a reduced genome¹³.

Class I TEs, also called retrotransposons, are characterized by moving through the genomes by means of a copy-paste mechanism, through a reverse transcription of an intermediate RNA. This class is further subdivided into LTR (Long Terminal Repeat) and non-LTR retrotransposons¹⁵. Based on both structure and phylogenetic data, some differences in reverse transcriptase LTR have been found, and they appear to indicate a close relationship between LTR retrotransposons and some types of retroviruses with more distant non-LTR retrotransposons³. The Rex transposable elements are mobile genetic elements belonging to class I^16,17. Rex elements have been found, in addition to several other TEs, in the genome of some fish species, such as the Nile Tilapia Oreochromis niloticus¹⁸. These elements are generally dispersed and may be related to sexual differentiation within the species. In this sense, an interesting case of a Rex TE type in association with Tc-1 has been detected in the differentiation of the sex chromosomes of Chionodraco hamatus, where it was suggestted that transposons in that species may have exerted some influence on the differentiation and structure of sex chromosomes¹⁹. Class II transposons, or DNA transposons, are divided into three subclasses according to their transposition mechanism: first, “cut-and-paste” transposons; second, inverted terminal repeat sequence (ITRs) transposons (e.g. hATs and Helitrons); and third, self-synthesizing DNA transposons (e.g. Mavericks)¹.

The large superfamily of hAT transposons, so called from three of its members: the hobo element of Drosophila, the Activator element of Zea mays i.e. maize (reported by McClintock) and the Tam3 element of snapdragon, is very widespread in plants and animals^20,21. The hAT transposons are also found in the genomes of mammals, and in humans they are the most abundant DNA transposons accounting for 1.55% (195 Mb) of the total genome²². The hAT elements have many common features, some of which are subterminal repeats (STRs) at both ends of the TE, inverted terminal repeats (ITRs) and a gene that encodes a 600–800 amino acid transposase which catalyzes the DNA division and integration of the target, with 8 bp in the destination site of duplications (TSD) at each end of the integration site during the transposition^21,23.

Senegalese sole (Solea senegalensis) is considered to be one of the most economically valued fish species in southern Europe. From the beginning of its culture in aquaculture, several problems have appeared, among which the control of reproduction stands out, in particular the infertility present in the individuals grown in captivity²⁴. S. senegalensis has 21 pairs of chromosomes, and lacks morphologically identificable heteromorphic sex chromosomes²⁵ that can be identified by their morphology. The species is male heterogametic having an XX/XY²⁶ system and the largest chromosome, originated by the fusion of 2 acrocentric chromosomes^27,28, has been proposed as a proto-sex chromosome²⁹.

Sexual chromosome evolution, from autosomes, is accompained by loss of genes, accumulation of sex-specific alleles and a gain of repetitive DNA sequences³⁰. Lack of recombination between heterogametic chromosomes is the mechanism that promotes differentiation between both chromosomes. Co-evolution betweeen TE and recombination has been proposed³¹.

Recently, important advances in genomic tools and resources have been documented in some of the main cultured flatfish species³². These resources include whole genome sequencing, genetics maps, QTL studies and mapped SNPs. However, with the exception of one genetic map²⁶ and some cytogenomics maps based in BAC-FISH^27,28,29,33 few studies exist dealing with structural genomics and sequencing in the Senegalese sole and only the abundance and type of microsatellites present in its transcriptome have been described³⁴.

The objective of this work is to provide new information about the distribution of microsatellite motifs (dinucleotides, trinucleotides, tetranucleotides, pentanucleotides and hexanucleotides) and main motifs of TE in the genome of S. senegalensis paying special attention to the largest metacentric chromosomes proposed as proto-sex chromosome. With this information we discuss the aspects of the organization of TE and microsatellites in the sole genome. In particular, we discuss the role and abundance of Rex transposons in the major chromosomal pair of the S. senegalensis genome as a possible proto-sex chromosome.

Methods

PCR screening from S. senegalensis genomic library and BAC clone sequencing

Thirty-two BAC clones from a library of the S. senegalensis genome previously constructed and described were screened^29,33. The BAC library is comprised of 29,184 clones distributed in 384-well plates (76 plates in total). BAC clones were identified and isolated using a 4D-PCR method. Briefly, plates were organized in 4 pools and used as template DNA. Primers for several gene and sequences were used to carry out the screening of the pools in a hierarchical way until location of the BAC in an specific coordinates of a specific plate of the library. Each BAC clone was named using the library plate number, and columns and rows coordinates. Clones were used in m-FISH experiments and in repetitive sequence analysis. BAC clones were sequenced as described in Garcia-Cegarra et al.³³. Briefly, DNA from the S. senegalensis BAC genome library was isolated and purified using the Large-Construct Kit (Qiagen, Hilden, Germany), and then digested with Hae II and Rsa I enzymes (20 U). A total of 454 sequencings were performed according to supplier’s recommendations.

BAC sequences from another twenty-five BAC clones previously described^28,29,33 were used for the repetitive sequences study and integrated mapping analysis. BACs with several chromosome locations were counted as many times as they were localized. Overall, sixty-four BAC clone sequences have been analyzed in this work, integrating information about their chromosome localization, number and distribution of SSRs and TEs (Accession Numbers AC278047-AC278120).

The experimental procedures were in accordance with the recommendation of the University of Cádiz (Spain) for the use of laboratory animals (https://bit.ly/2tPVbhY) and the Guidelines of the European Union Council (86/609/EU). The experiment was authorised by the Ethics Committee of University of Cadiz (Spain).

FISH analysis

Chromosome Preparations

Chromosome preparations were made according to Cross et al.³⁵. Briefly, 2–3 day-old S. senegalensis larvae were pretreated with 0.02% colchicine for 3 h. Then they were subjected to hypotonic shock with KCl (0.4%) and finally fixed in a freshly-prepared solution of absolute ethanol:acetic acid (3: 1). Larvae were homogenized in Carnoy, and the preparations were then dropped onto wet slides and placed on a hot plate with damp paper to create the necessary moisture for a good spread of the chromosomes²⁹.

mBAC-FISH

BAC clones labeling was carried out with a first amplification by DOP-PCR, followed by a conventional PCR for labeling, as described previously in Garcia Angulo et al.²⁸. Three different fluorochromes were used: Texas red (Thermo Fisher Scientific, USA), fluorescein-isothiocyanate (FITC) (Enzo, USA), and diethyl-aminocoumarin (DEAC) (Vysis, USA). The chromosomes were pretreated with pepsin and fixed in formaldehyde. Finally, the chromosome preparation was dehydrated with ethanol series and air-dried before hybridization. Hybridization was done according to Portela-Bens et al.²⁹.

SSRs-FISH

Chromosome preparations were made according to Cuadrado et al.³⁶. A total of 10 different mono-, di-, tri-, and tetranucleotide microsatellite motifs were physically mapped using synthetic oligonucleotides labeled with biotin at both ends (Roche Applied Science) as probes: (A)₂₀, (C)₂₀, (AC)₁₀, (AG)₁₀, (ACG)₅, (AAT)₅, (GCA)5, (AAC)₅, (GACA)₄, (GATA)₄. Chromosomes and probe denaturation and in situ hybridization were performed as described by Cuadrado et al.³⁶. In brief, the hybridization mixture was prepared by adding 50% de-ionized formamide, 10% dextran sulfate, 2× SSC, 0.1% sodium dodecyl sulfate (SDS), and 2 ppm of the microsatellite probes. For post-hybridization washing, slides were immersed in 4× SSC/0.2% Tween-20 for 10 min at room temperature (RT). Biotin was detected by incubating the slides in streptavidin-Cy3 (Sigma) in 5% (w/v) BSA for 1 h at 37 °C. Slides were rinsed for 10 min in 4× SSC/0.2% Tween- 20 at RT and then stained with DAPI (4′,6-diamidino-2-phenylindole). They were mounted in Vectashield antifading medium (Serva) and examined with a Zeiss Axiophot microscope.

Bioinformatic analysis

SSR and TE analysis. After determining the chromosome location of BAC clones, using the FISH technique, the genomic sequences obtained from those clones (taking into account several multi-loci situations) were loaded into a local pool. A configuration file was used together with the perl script MISA (Microsatellite identification tool)³⁷. DNA sequences were then searched for both perfect and compound microsatellites, with a basic motif of 2–8 bp. Only 1 to 6 motifs were considered, and the minimum repeat unit was defined as 10 for mononucleotide, 6 for dinucleotide repeats, and 5 for tri-, tetra-, penta- and hexa-nucleotides. The maximum number of bases interposed between two SSRs in a compound microsatellite was set at 100. A homology-based approach using the Repbase (release 23.07) database; RepeatMasker³⁸ was also applied. Analysis of TEs distribution in the Senegalese sole genome was made possible by using both the information of BAC clone position obtained from the FISH technique, and the coordinates of the TE elements from the RepeatMasker software. Statistical analysis to determine frequency and distribution by chromosome of both TE and SSR elements was done using SPSS software (v17.0).

Phylogenetic analysis

In order to generate the phylogenetic tree for the Rex retrotransposon, fish Rex1 sequences from Repbase (Giri repbase - https://www.girinst.org/) were downloaded. In addition, the BLASTn algorithm³⁹ was used in the Ensembl database (https://www.ensembl.org) to find homologies with sequences matching the S. senegalensis Rex1 element, and the matched sequences were also used. One hundred and twenty five fish sequences were included in the phylogenetic tree. All sequences were then aligned in MAFFT software⁴⁰ using an iterative method. To eliminate poorly-aligned positions and divergent regions of DNA, the Gblocks server was used, and different options for a less stringent selection (allowing smaller final blocks, allowing gap positions within the final blocks, and allowing less-strict flanking positions) were applied to the analysis. Then the SMS program (Smart Model Selection) was applied to determine the best-fit phylogenetic model⁴¹ and, finally, the PhyML 3.0 software⁴² was used to run the model. The resulting best-fit model predicted was GTR + G + I. The proportion of invariable sites was 0.012, the number of substitution rate categories was 4, and the Gamma shape parameter estimated was 1.389. The statistic used for model selection was the Akaike information criterion (AIC), the value of which was 235939.11 and the -LnL was -117712.55986. Branch support was tested by the fast likelihood-based method using aLRT SH-like⁴³ Tree edition was carried out using MEGA version 7⁴⁴.

Compliance with ethical standards

The experimental procedures are according to the recommendation of the University of Cádiz (Spain) for the use of laboratory animals and the Guidelines of the European Union Council (86/609/EU). The experiment was authorised by the Ethics Committee of University of Cadiz (Spain).

Results

BAC-FISH mapping in S. senegalensis

A multi BAC-FISH mapping was performed in S. senegalensis chromosomes using 32 BAC clones (Fig. 1) and integrating previously published data^27,28,29,32. Results are summarized in Fig. 2 and Supplementary Table S1.

Multiple hybridization analysis showed four clones producing a single signal and not co-localizing with other clones. Specifically, BAC9-J4 presented a signal in the small metacentric chromosome pair 3. The other three BACs appear in acrocentric chromosomal pairs: BAC15-I19 in pair 10; BAC36-M2 in chromosomal pair 18; and BAC4-M14 gives a signal in the chromosomal pair 20.

The hybridization signals of 5 new BACs were observed in the median metacentric chromosome pair number 2. In one arm appear the five BACs 4D-15, 52-G10 and 4C-5 that co-localize with those previously detected by Portela Bens et al.²⁹. (2017), 6-P22 and 19-J21, the latter also signaling in the acrocentric pair 15. In the other arm, the BACs 36-I3 and 36-K1 signal; these co-localize with the BAC 21-O23 that also signals in the acrocentric pair 14 as a single signal, as described in Portela-Bens et al.²⁹.

In the submetacentric pair 4, the hybridizations give signals for 7 new BACs. The BAC46-B2, in the short arm in telomeric position, co-locates in the same region as the previously described BACs 12-N15²⁹ and 30-J4³². Six new BACs were located in the long arm: BAC36-J2, BAC36-H3, BAC36-H2, BAC 8-A23, BAC3-C15 and BAC12-D24. In the subtelocentric chromosome pair 8, two signals co-located: BACs 31-A1 and 31-A2.

In six acrocentric chromosomal pairs, signals from new BACs clones were detected co-localizing with others already described above: in pair 12 BACs 35-D17 and 13-F2 co-localize with the BAC30-J4 located in an almost centromeric position, also detected on chromosome 4³². In chromosome pair 13 we detected the BAC29-D4 signal, also located in a more centromeric situation, which co-located with BAC8–07 that is co-localized in the long arm of the subtelocentric chromosome 7²⁹. In pair 15 we find 2 signals that correspond to the BACs 4-F-12 and 36-E3 and co-localize with the BACs 19-J21²⁹ and 16-E36; the latter co-localizes in the large metacentric chromosome 1 and in the long arm of the subtelocentric chromosome 6²⁸. In the chromosomal pair 16 we detect the signals corresponding to the BACs 52-E7 and 30-P17 that co-locate with BAC9-N8²⁷. In chromosomal pair 19 we detected the signals of BACs 31-C1 and 13-F4 that co-localize with BAC12-K6;²⁷ and finally in pair 21 we find the signal of BAC 63-A3 co-locating with the signal of BAC30-H22²⁹.

FISH mapping of SSRs in S. senegalensis

To study the distribution of SSR sequences in the S. senegalensis genome, 2 mono-, 2 di-, 4 tri- and 2 tetra-nucleotide probes were used (Table 1). Seven out of the 11 probes yielded positive signals: (A)₂₀, (C)₂₀, (AC)₁₀, (AG)₁₀, (GCA)₅, (GACA)₄, (GATA)₄ (Fig. 3). (AC)₁₀ probe presented the largest and most intense signals. This SSR was found in subtelomeric position both in larger (metacentric and submetacentric) and acrocentric chromosomes. This distribution was similar to the location of the (GACA)₄ probe, and in a smaller quantity, the GATA repeats. The (AG)₁₀ probe displayed a dispersed pattern of FISH signals on chromosomes, showing a greater concentration in subtelomeric positions. Mononucleotide probes (A)₂₀ and (C)₂₀ were found scattered among several chromosomes: the A probe showed weak signals although with a somewhat greater concentration in subtelomeric positions; the C probe presented clusters in centromeric positions of almost all chromosomes.

Table 1 Number of FISH signals and localization of BAC clones onto S. senegalensis chromosomes.

Full size table

NGS analysis of SSRs and TEs in the S. senegalensis BAC clones

To study the number, distribution and abundance of microsatellites in S. senegalensis, fifty-seven BAC clones from a genome library were analyzed with MISA software. Twenty-three out of them had been sequenced previously^27,28,29 (Table S1). The 57 clones comprise 6.9 Mb. As described in BAC-FISH results, some BACs were localized in two or more chromosomes, so these were included as many times as they appear in the S. senegalensis chromosomes. Taking this into account, the total number of BACs used in the SSR analysis was 64, and the total sequence length analyzed was 7.9 Mb. The number of SSR loci observed was 5330, comprising 1.27% of the genome analyzed, and presenting a total of 53505 repeat units. The average number of loci per Mb was calculated as the total number of identified loci (5330) in relation to the BAC sequences length analyzed (7.9 Mb) and normalized by Mb. In average, 675.1 SSR loci per Mb were found in the S. senegalensis genome. The coverage calculated as the quantity of sequences of SSR (bp) in relation to the BAC sequences length analyzed (7.9 Mb), and again normalized by Mb, was 12716.63 bp. Attending to the motif length of microsatellite DNA, the di-nucleotide motif showed the largest number of identified SSRs, with almost thirty thousand repeats (29968) in the Senegalese sole BAC clones, followed by mononucleotide repeats (16246). The mean number of repeats loci was higher for the mononucleotide motif than for the dinucleotide motif (11. 63 and 10.33 respectively). The analysis also showed a high level of dinucleotide coverage (measured as nucleotides per Mb sequenced), with an abundance of 59.69%. The mononucleotide and trinucleotide abundance presented lower values (16.18 and 16.32%) than dinucleotides (59.69%) (Table 2). When microsatellite abundance per motif length class is studied, it can be seen that the mononucleotide “A” is rather more abundant than “C” (81.3% vs 18.7%). In the dinucleotide class, “AC” was the most abundant in the genome analyzed (67.7%). The most abundant trinucleotide motifs were “AAT” and “AGC” (28.4 and 21.8% respectively) (Table 3).

Table 2 Number and length of microsatellite loci per motif size, microsatellite coverage and relative abundance per motif in S. senegalensis.

Full size table

Table 3 Microsatellite abundance (%) per motif length and sequence in the S. senegalensis genome.

Full size table

After positioning BACs on chromosomes by means of FISH, the location and genome abundance of SSRs (measured as bp of SSR per Mb) could be studied (Fig. 4). Chromosome 17 showed the highest SSR coverage, with more than 41000 bp of SSR per Mb. Chromosomes 1 and 20 showed the lowest SSR coverage (9644 and 5457 bp per Mb). When number of loci was measured, similar results were found: chromosomes 1 and 20 show the lowest values and chromosome 17 the highest (Suppl. File 2).

All BAC sequences, with information about their chromosome position, were also analyzed using Repeat Masker software. After removing simple repeats and artifacts, 4685 BAC clone positions matching with known Repbase TE elements were obtained. Results were organized by: Class I (retrotransposons); Class II (DNA transposons); and Other repeat elements (Table 4). As it can be observed in Table 4, Class I transposons showed 1549 elements in the genome sampled (BACs sequenced) which represents 33.04% of the TEs in the genome analyzed. From this Class I, 717 elements were found as LINES. Within LINES elements, we found another 14 families, the most abundant being, in numbers of elements (660 out of 717 = 92%) the following: L2 (364), RTE-BovB (112), Rex (88), L1 (63) and Penelope (33). When LINES elements were filtered by length (higher than 1 kb), only Rex and L2 families (5 loci) showed matching repeats (L2:1199–2113 bp and Rex: 2551 bp). When filtering for matches longer than 500 bp, again only these two families were found. The DNA transposons (Class II) were the most abundant with 54.9% of the TEs found, with the hAT family being the element with the greatest presence in the S. senegalensis genome: 900 elements and an abundance of 19%. Other repeated elements such as rRNA, tRNA and scRNA show an abundance of 12.06%. Taking into account the genome distribution of TE elements by chromosomes, results showed a heterogeneous distribution (Fig. 5). Chromosomes 8 and 17 showed the greatest abundance with more than 450 loci per Mb. The Class I: Class II ratio was 1.86 on average for all chromosomes, with an extreme value (ratio: 5) in chromosome 14 because of the very low Class I TE value.

Table 4 Number and relative abundance of TE elements in the BAC sequences of S. senegalensis.

Full size table

Within TE elements found in the S. senegalensis genome analyzed (BAC sequences), hAT elements were the most abundant (900 elements, Table 4). Within hAT, some elements as Charlie, Ac or TIP100 were the most frequent (818 out of 900 elements: > 90%). The Fig. 6 represents the percentage of these elements out of total hAT elements. As it can be observed, the hAT elements from Class II found in the genome showed that more than half of those elements analyzed (51.22%; 461 elements) were hAT-Ac, followed by hAT-Charlie (32.11%; 289 elements), hAT- Tip100 (7.56%; 68 elements) and other repeats of minority elements (9.11% cumulative).

After BAC clone sequence analysis, the distribution and abundance of TE elements in the S. senegalensis genome was assessed. Within LINES elements, five of them matched regions longer than 1 kb, with the Rex/Babar subfamily showing the longest one (match length 2551 bp). This Rex family was observed in two BACs localized in chromosome 1 (10-L10 and 5-K5). According to recent literature this chromosome could be a proto-sex chromosome. In this sense, we measured the coverage of Rex elements per chromosome, finding the highest value (7427) in chromosome 1, followed by chromosome 4 (3277) and chromosome 19 (1277). In addition, short sequences from BACs of different chromosomes showed similarities with Rex transposon, having a wide distribution across the genome (Fig. 7).

The Rex/Babar sequence, from BACs localized in chromosome 1, was then used against other teleost genomes, as a query in a BLAST search of the Ensemble database. The matches obtained were then extracted as FASTA files and a phylogenetic tree was made (Fig. 8). The tree showed a group with two robust branches: one containing S. senegalensis, Oryzias latipes and Tetraodon nigroviridis, and other containing several sequences from Danio rerio, Salmo salar, Gasterosteus aculeatus, Takifugu rubripes and Lepisosteus oculatus. In the former, Senegalese sole is present in an internal branch comprising different sequences from O. latipes. T. nigroviridis shares a branch (low support) with a sequence of O. latipes. In a third group, several species such as O. niloticus, Gadus morua and Poecilia formosa can be observed.

Discussion

In the present work, we studied repeated DNA in S. senegalensis using cytogenetic techniques and BAC sequencing. Although the fraction of the Senegalese sole genome studied in the present work is approximately 1.1% of the estimated total size, it can be considered a representative sample to detect the repetitive elements present in the genome of this species⁴⁵. In another species, the pea (Pisum sativum), it has been found that a low-pass sequencing of its genome is sufficient to capture the repetitive sequences present in its genome with at least 1000 copies;⁴⁶ and the potential of bioinformatic analysis of low-depth sequencing data for investigation of repeats has been further demonstrated in several other studies^47,48. Consequently, the analysis of BAC sequences, together with the knowledge of their location by BAC-FISH and supported by the results of SSR-FISH, has enabled us to quantify for the first time the number and distribution of the repetitive elements of the genome of S. senegalensis.

In previous studies an integrated genetic map was constructed in the Senegalese sole;^27,29 this map comprises the sequence and localization of more than 50 BACs. In this study, using the mFISH technique, we determined the chromosome location of 32 new BAC clones and their genome sequences. Using this approach, the main advantage is that it allows us to study the repetitive distribution elements, after BAC hybridization, in various Senegalese sole chromosomes. The results obtained in the present work indicate a frequency of SSRs similar to the data of the transcriptome published³⁴. In the genome studied here, dinucleotides were the most abundant motifs (at around 60%), followed by trinucleotides and mononucleotides (both at 16%); in the transcriptome the most abundant microsatellite were dinucleotides, followed by trinucleotides and tetranucleotides in decreasing abundance. The most common SSR motifs in the S. senegalensis transcriptome were AC and GT for dinucleotides (at 74.6%) and in the present study they are slightly lower (at 67%). The most abundant trinucleotides in the transcriptome are AGG and CCT (at 21.5%): this finding differs from that detected by BACs (AAT and CCG, at 28.3%)³⁴.

In this study, the source of SSRs are BAC clone sequences that have been located throughout the genome of the sole. A total of 5330 microsatellites were identified based on BAC sequences and comprising 1.27% of the genome analyzed, a value similar to that found in the genome of the fugu puffer fish T. rubripes (1.29%)⁴⁹ and slightly less than half that in the green puffer fish T. nigroviridis, where SSRs account for 3.21% of the genome⁵⁰. These data are consistent with the long-standing assumption that microsatellites are present in all the vertebrate and invertebrate species so far studied. The abundance of microsatellites in sole is similar to that found in humans (>1.5%)⁴⁹ and a little lower than that found in mouse (2%)⁵¹ and snake (2.8%)⁵².

Although it is widely assumed that the abundance of microsatellites rises with the genome size, many exemptions have been recorded in animals and plants^53,54. The microsatellite frequency described in this work (on average, 675 loci per Mb) is similar to that obtained in Drosophila, with a genome three times smaller (180 GB), in human with a genome five times larger (3000 GB), and in mouse⁵¹. The relative abundance of length classes of microsatellite motif exhibits a remarkable inter-species variation but dinucleotides and mononucleotides are the predominant in the majority of cases^55,56. In S. senegalensis the dinucleotide motifs are the most abundant, in a proportion of 59.69%. Next, with lower but similar values, we found the mononucleotides and the trinucleotides (16.18 and 16.32% respectively).

The most abundant dinucleotide motifs are AC and GT, a finding similar to that described for the swamp eel genome⁵⁷ and that of human⁵⁸. The AC and GT motifs have been reported as the most frequent SSRs in the intergenic and intron regions of vertebrates⁶ and are 2.3 times more frequent than (AT) n, the second most general type of dinucleotide⁶. The more notable repeats in trinucleotides are AAT, AGC and AGG, and the relative abundance of the AAT motif is the most notable; this also occurs in the swamp eel genome⁵⁷. It shows a predominance of A-rich repeats during the evolution of the genome in teleosts. The extent of the repeats is probably affected by their secondary structures and the influence on DNA replication;⁵⁸ or it could reflect a genetic adaptation to the aquatic environment during speciation of fish. Trinucleotide, tetranucleotide, penta- and hexanucleotide microsatellites are much less frequent than dinucleotides and are usually present 1 to 5 times less frequently than dinucleotides in the genomic DNA of vertebrates^6,50. The mononucleotides detected have an abundance similar to that of the trinucleotides (around 16%) and the most abundant motifs are A and T, which account for about 81%. In primates, mononucleotides are represented by A and T motifs and are the most frequent among SSRs⁶. In relation to the distribution of SSRs throughout the genome, using mapped BAC sequencing as sampling, there is some variation among the particular chromosomes, and it is noteworthy that chromosome 1 is one of those with the lowest SSR abundance values.

Using SSR probes in FISH experiments, we observed that some di-and tetra-nucleotide microsatellites produce the strongest FISH signals. The bioinformatic analysis of the BAC clones indicated that the AC motif has the highest relative abundance value. Our SSR-FISH results support this datum: AC shows up in clusters with the brightest and most intense signal in almost all chromosomes. Furthermore, the GACA, GATA and AG elements showed similar patterns after applying the FISH technique to localize them. Hence, these four microsatellites are probably present as an established combination of repetitive elements in the heterochromatin of sole. Mononucleotide probes (A) and (C) were found scattered throughout the chromosomes. Using FISH there was no clear correspondence between the frequency of the microsatellite motif and the intensity of the signal, since in our study the AAT motif with frequency of 28% gave no signal while the C motifs with 18% and AG with 14% gave more intense signals. In fish species such as D. rerio, Rineloricaria latirostris and Steindachneridion scripta, these repetitive sequences tend to be grouped in the telomeric and centromeric regions⁵⁹.

In relation to repeated elements, we have identified a total of 4686, from which, 4121 were TE elements. These TE elements showed a total length of 467144 bp (5.94% of the genome analyzed). When compared with other fish, this proportion is similar to that found in the two smallest reported genomes of teleost fish, the green spotted pufferfish, and the fugu (T. rubripes), with genome sizes of approximately 342 and 393 Mb, respectively, that contain only ~ 6% of their DNA derived from TE⁶⁰. The proportion of TEs in stickleback, cod and European eel, with values of 12–15% of their genome, is twice that observed in the S. senegalensis genome¹³. In the group of tilapia, platyfish, medaka and spotted gar the proportion of TEs is even higher, with values between 20 and 30%; and the proportion observed in the coelacanth is 25% of the genome¹³. The number and proportions of TEs differ widely among genomes of actinopterygian (ray-finned) fishes, especially teleosts. In fact, a large part of the zebrafish genome (~1.4 Gb) consists of TEs (55%)¹⁴. The abundance of TEs seems to be the main determining factor of genome size in this group^13,61. However, TEs proportion in the small genome of tetraodon (representing just 7.13% of its genome) and in other vertebrates as birds (TE content values ranging 8–10%) are also close to those found in sole^13,61.

Considering the genome distribution of TEs by chromosome, our results show a heterogeneous distribution. Class II TEs (DNA transposon) cover almost 55% of total Repetitive DNA found in the S. senegalensis BAC sequences analized and it is similar to the 60% of Class II TEs detected in cichlids and somewhat greater than the 39% of the same type of TEs in the zebrafish genome. In S. senegalensis the retrotransposons (Class I) account for 33%, with a coverage of LINES of 15%, the most abundant with 717 matches, coverage of SINES (short interspersed elements) of 10%, and of LTRs of almost 8%, whereas in cichlids and zebrafish retrotransposons represent less than 12% of each type^14,62. The DNA transposons (Class II) were the most abundant with 55% of the TEs found, these being the class with the highest presence (900 elements and abundance of 19%) in the S. senegalensis genome analyzed. In particular, two main TIR (Translocated Intimin Receptor) families (hAT and Tc-Mariner), with many subfamilies, constitute the largest fraction of DNA transposons in the sole genome. To a lesser extent, Harbinger has also been detected.

The TIR family of hAT transposons is worth mentioning, given its coverage of 1% of the genome studied. This value represents a coverage ten times higher than that detected in the coelacanth (0.11%) and Lung fish (0.1%)¹³. The hAT transposons are also found in the genomes of mammals, including humans, where they are the most abundant DNA transposons and comprise 1.55% (195 Mb) of the total genome²². In chicken, values similar to those of fish have been found (0.1%) and the value detected in salamander (0.63%) is also lower than that of sole¹³. The highest value of genome coverage in the hAT superfamily: 6.10% was detected in frog¹. Few data are available on the role played by the hAT superfamily in fish; however, it is known that none of the hAT elements in the human genome have been active during the last 50 million years²². In vertebrates, most hAT transposons are inactive, since host cells have developed the mechanism of vertical inactivation to silence and prevent the deleterious effects of active transposons on genome stability⁶³.

We have found a sequence that presents homology with the Rex retrotransposon of many species. The abundance of TEs of the Rex type detected mostly in the chromosome pair 1 of the S. senegalensis genome, raises the hypothesis that this chromosome could be a proto-sex chromosome²⁹. It is known that Rex-type transposons are very important in the evolution of the eukaryotic genome, and participate in processes of chromosomal rearrangement⁶⁴ and chromosomal sex differentiation^65,66,67, which are involved in sexual differentiation. Several authors have also associated these transposable and retro-transposable elements with chromosomal sex differentiation in groups of fish such as Cyprinodontiformes⁶⁸, Characiformes⁶⁹, and Beloniformes⁷⁰. Indeed, in the Cyprinodontiforme Semaprochilodus taeniurus, Terencio et al.⁶⁹ observed a significant increase in the size of the W chromosome due to repetitive DNA accumulation, and among these DNA sequences was Rex1.

In O. nitolicus, Rex elements are concentrated in the first pair of chromosomes¹⁸. In this species, the first pair of chromosomes seems to correspond to the sex chromosomes⁷¹, possibly originated from fusion processes⁷². The location of the Rex1 elements in the chromosome pair 1 could have had some role in chromosomal rearrangements of the S. senegalensis genome, as occurs in O. nitolicus¹⁸.

In S. senegalensis, our SSR-FISH results showed a higher concentration in subtelomeric positions of several probes that are probably present as a combination of repetitive elements in the heterochromatin of sole. This heterochromatin is present in metacentric chromosomes, such as chromosome pair 1. In addition, one of the BACs where Rex1 presented the highest length and abundance values (BAC10-L10), was found in a subtelomeric position in chromosome 1. Hence, this subtelomeric region could be comprised of heterochromatin in which (or adjacent to which) the Rex1 retrotransposon could occur.

It has been described that the preferential position of Rex1, Rex2 and Rex6 genes in heterochromatic regions of the genomes of some fish^73,74 could indicate some mechanism of regulation of these elements that impedes or prevents excessive dispersion and propagation in the genome, since the presence of heterochromatin could be regulating, through epigenetic mechanisms, the dispersion of these sequences without modifying their sequence⁷⁵. Several studies have shown a relationship between the preferential presence of repetitive sequences in sexual chromosomes and heterochromatin regions. Thus, in Harttia carvalhoi (Loricariidae) it has been discussed how the location of the retroelements Rex1, Rex3 and Rex6 in the pericentromeric region of an X chromosome could have influenced its fission, which led to the formation of chromosomes Y1 and Y2⁶⁹.

The first reference to the existence of Rex1 was published by Volff et al.⁶⁸, after finding an insert in a cosmid from the Y sex chromosome of X. maculatus that revealed a sequence encoding a product with similarities to the RT of non-LTR retrotransposons. That sequence was called Rex1-XimJ⁶⁸. After a wide analysis, the phylogeny of Rex1 sequences was explained by the presence of four major ancient lineages in fish genomes. The lineage 4 contained sequences from O. latipes and O. niloticus among others. Lineage 4 is observed in all Acanthopterygii, but not in C. carpio, D. rerio or O. mykiss, among others⁶⁸. In the Rex1 phylogenetic tree constructed, the Senegalese sole Rex1 retrotransposon could be associated with one of the four major ancient lineages in fish genomes, in which it is included O. latipes.

One of the hypotheses to explain the wide distribution of the lineage 4 of Rex1in fishes is the possibility of horizontal transfer⁶⁸. Horizontal transfer has been well documented for some DNA transposons and for LTR retrotransposons⁷⁶. The possibility of a horizontal transfer (HT) event between phylogenetically distant species (Perciformes and Batrachoidiformes orders) has been recently reported in fishes⁷⁷. It has also been demonstrated that 5 S rRNA genes and retro-transposons can interact with one another⁷⁸, and this interaction might be the cause of the pattern of evolution and the dispersed arrangement of some organisms. Therefore a putative role of the Rex1 retrotransposon, and its presence in a heterochromatic region of S. senegalensis, in the evolution of this putative sex proto-chromosome 1 should be not rejected. On the other hand, the chromosome pair 15 has also shown high abundance of Rex1 sequences in the BACs localized in this pair. In a previous work, the BAC 19-J21 also localized in this chromosome, and it carried the SOX9 gene²⁹. In the Prochilodontidae fish family, the W chromosome of Semaprochilodus taeniurus species, has significantly increased in size due to the accumulation of repetitive DNAs, like the Rex1 retro-element, with the consequent differentiation of the ZZ/ZW system of sex chromosomes⁶⁹. In that study, one of the W-specific fragments showed high similarity with the transcription factor of the SOX9 gene in T. rubripes. The SOX9 is a gene related to sex determination in many organisms and is present in the BAC 19-J21 in S. senegalensis. Hence the presence of the Rex1 gene in regions where it occurs, and the role it has played in certain events related to sex determination, must be taken into account in studies of the evolution of the Senegalese sole genome.

Conclusions

Our work represents a first approach to the study of the repetitive elements of the genome of the Senegalese sole (S. senegalensis). The analysis of the location of SSR allowed the description of large clusters of microsatellites in centromeric and subtelomeric positions, as well as the study of their composition by bioinformatic analysis. These results reflect a prevalence of A-rich repetitions during the evolution of this species as occurs in the genomes of other teleostats. The study of TEs revealed that the most abundant family in the genome of this flatfish is the hAT, as well as the discovery of a transposable Class I element, Rex, in the largest metacentric chromosome pair, recently described as a possible proto-sex chromosome. The presence of this element on this chromosome and its position in a heterochromatin region might have been relevant during the evolution of the chromosome. Our results present an important advance on the evolution of the S. senegalensis genome through the analysis of the distribution and quantification of repetitive elements and the role that Rex 1 may have played in certain events related to sex determination.

References

Jurka, J. et al. Repbase Update, a database of eukaryotic repetitive elements. Cytogenet. Genome Res. 110, 462–467 (2005).
Article CAS PubMed Google Scholar
Farre, M., Robinson, T. J. & Ruiz-Herrera, A. An Integrative Breakage Model of genome architecture, reshuffling and evolution: The Integrative Breakage Model of genome evolution, a novel multidisciplinary hypothesis for the study of genome plasticity. Bioessays 37, 479–488 (2015).
Article PubMed Google Scholar
Sotero-Caio, C. G., Platt, R. N. II, Suh, A. & Ray, D. A. Evolution and Diversity of Transposable Elements in Vertebrate Genomes. Genome Biol. Evol. 9, 161–177 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nowak, R. Mining treasures from ‘junk DNA’. Science (New York, N.Y.) 263, 608–610 (1994).
Article ADS CAS Google Scholar
Kidwell, M. G. Transposable elements and the evolution of genome size in eukaryotes. Genetica 115, 49–63 (2002).
Article CAS PubMed Google Scholar
Toth, G., Gaspari, Z. & Jurka, J. Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res. 10, 967–981 (2000).
Article CAS PubMed PubMed Central Google Scholar
Kazazian, H. H. J. Mobile elements: drivers of genome evolution. Science 303, 1626–1632 (2004).
Article ADS CAS PubMed Google Scholar
Chenais, B., Caruso, A., Hiard, S. & Casse, N. The impact of transposable elements on eukaryotic genomes: from genome size increase to genetic adaptation to stressful environments. Gene 509, 7–15 (2012).
Article CAS PubMed Google Scholar
Rebollo, R., Romanish, M. T. & Mager, D. L. Transposable elements: an abundant and natural source of regulatory sequences for host genes. Annu. Rev. Genet. 46, 21–42 (2012).
Article CAS PubMed Google Scholar
Sun, C. et al. LTR retrotransposons contribute to genomic gigantism in plethodontid salamanders. Genome Biol. Evol. 4, 168–183 (2012).
Article PubMed Google Scholar
Elliott, T. A. & Gregory, T. R. What’s in a genome? The C-value enigma and the evolution of eukaryotic genome content. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 370, 20140331 (2015).
Article PubMed PubMed Central CAS Google Scholar
Volff, J.-N. Genome evolution and biodiversity in teleost fish. Heredity (Edinb). 94, 280–294 (2005).
Article CAS PubMed Google Scholar
Chalopin, D., Naville, M., Plard, F., Galiana, D. & Volff, J.-N. Comparative analysis of transposable elements highlights mobilome diversity and evolution in vertebrates. Genome Biol. Evol. 7, 567–580 (2015).
Article CAS PubMed PubMed Central Google Scholar
Howe, K. et al. The zebrafish reference genome sequence and its relationship to the human genome. Nature 496, 498–503 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Eickbush, T. H. & Jamburuthugoda, V. K. The diversity of retrotransposons and the properties of their reverse transcriptases. Virus Res. 134, 221–234 (2008).
Article CAS PubMed PubMed Central Google Scholar
Van Sluys, M.-A., Setta, N. de, Scortecci, K. C. & Costa, A. P. P. O genoma instável, sequências genéticas móveis. In Biologia molecular e evolução PP - Ribeirão Preto (Holos, 2012).
Levin, H. L. & Moran, J. V. Dynamic interactions between transposable elements and their hosts. Nat. Rev. Genet. 12, 615–627 (2011).
Article CAS PubMed PubMed Central Google Scholar
Valente, G. T. et al. Cytogenetic mapping of the retroelements Rex1, Rex3 and Rex6 among cichlid fish: new insights on the chromosomal distribution of transposable elements. Cytogenet. Genome Res. 133, 34–42 (2011).
Article CAS PubMed Google Scholar
Ferreira, D. C., Porto-Foresti, F., Oliveira, C. & Foresti, F. Transposable elements as a potential source for understanding the fish genome. Mob. Genet. Elements 1, 112–117 (2011).
Article PubMed PubMed Central Google Scholar
Benjak, A., Forneck, A. & Casacuberta, J. M. Genome-wide analysis of the ‘cut-and-paste’ transposons of grapevine. PLoS One 3, e3107 (2008).
Article ADS PubMed PubMed Central CAS Google Scholar
Jiang, X.-Y. et al. The N-terminal zinc finger domain of Tgf2 transposase contributes to DNA binding and to transposition activity. Sci. Rep. 6, 27101 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Lander, E. S. et al. Initial sequencing and analysis of the human genome. Nature 409, 860–921 (2001).
Article ADS CAS PubMed Google Scholar
Yuan, Y.-W. & Wessler, S. R. The catalytic domain of all eukaryotic cut-and-paste transposase superfamilies. Proc. Natl. Acad. Sci. USA 108, 7884–7889 (2011).
Article ADS CAS PubMed PubMed Central Google Scholar
Duncan, E. J., Wilson, M. J., Smith, J. M. & Dearden, P. K. Evolutionary origin and genomic organisation of runt-domain containing genes in arthropods. BMC Genomics 9, 558 (2008).
Article PubMed PubMed Central CAS Google Scholar
Vega, L., Díaz, E., Cross, I. & Rebordinos, L. Cytogenetic and isozymatic characterization of the sole Solea senegalensis Kaup, 1858. Bol. - Inst. Esp. Oceanogr. 18, 245–250 (2002).
Google Scholar
Molina-Luzon, M. J. et al. Chromosomal manipulation in Senegalese sole (Solea senegalensis Kaup, 1858): induction of triploidy and gynogenesis. J. Appl. Genet. 56, 77–84 (2015).
Article PubMed Google Scholar
Merlo, M. A. et al. Analysis of the histone cluster in Senegalese sole (Solea senegalensis): evidence for a divergent evolution of two canonical histone clusters. Genome 60, 441–453 (2017).
Article CAS PubMed Google Scholar
Garcia-Angulo, A. et al. Evidence for a Robertsonian fusion in Solea senegalensis (Kaup, 1858) revealed by zoo-FISH and comparative genome analysis. BMC Genomics 19, 818 (2018).
Article CAS PubMed PubMed Central Google Scholar
Portela-Bens, S. et al. Integrated gene mapping and synteny studies give insights into the evolution of a sex proto-chromosome in Solea senegalensis. Chromosoma 126, 261–277 (2017).
Article CAS PubMed Google Scholar
Gamble, T., Geneva, A. J., Glor, R. E. & Zarkower, D. D. anolis sex chromosomes are derived from a single ancestral pair. Evolution 68, 1027–1041 (2014).
Article CAS PubMed Google Scholar
Kent, T. V, Uzunovic, J. & Wright, S. I. Coevolution between transposable elements and recombination. Philos. Trans. R. Soc. Lond. B. Biol. Sci. 372, (2017).
Robledo, D. et al. Integrating genomic resources of flatfish (Pleuronectiformes) to boost aquaculture production. Comp. Biochem. Physiol. Part D. Genomics Proteomics 21, 41–55 (2017).
Article CAS PubMed Google Scholar
García-Cegarra, A. et al. A preliminary genetic map in Solea senegalensis (Pleuronectiformes, Soleidae) using BAC-FISH and next-generation sequencing. Cytogenet. Genome Res. 141, 227–240 (2013).
Article PubMed CAS Google Scholar
Benzekri, H. et al. De novo assembly, characterization and functional annotation of Senegalese sole (Solea senegalensis) and common sole (Solea solea) transcriptomes: integration in a database and design of a microarray. BMC Genomics 15, 952 (2014).
Article PubMed PubMed Central CAS Google Scholar
Cross, I. et al. Cytogenetic characterization of the sole Solea senegalensis: Ag-NOR, (GATA)n, (TTAGGG)n and ribosomal genes by one-color and two-color FISH. Genetica 128, 253–259 (2006).
Article PubMed Google Scholar
Cuadrado, A., Schwarzacher, T. & Jouve, N. Identification of different chromatin classes in wheat using in situ hybridization with simple sequence repeat oligonucleotides. Theor Appl Genet 101, 711–717 (2000).
Article CAS Google Scholar
Thiel, T., Michalek, W., Varshney, R. K. & Graner, A. Exploiting EST databases for the development and characterization of gene-derived SSR-markers in barley (Hordeum vulgare L.). Theor. Appl. Genet. 106, 411–422 (2003).
Article CAS PubMed Google Scholar
Smit, A. F. A., Hubley, R. & Green, P. RepeatMasker Open-4.0. Available from http//www.repeatmasker.org.
Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. J. Mol. Biol. 215, 403–410 (1990).
Article CAS PubMed Google Scholar
Katoh, K. & Toh, H. Recent developments in the MAFFT multiple sequence alignment program. Brief. Bioinform. 9, 286–298 (2008).
Article CAS PubMed Google Scholar
Lefort, V., Longueville, J.-E. & Gascuel, O. SMS: Smart Model Selection in PhyML. Mol. Biol. Evol. 34, 2422–2424 (2017).
Article CAS PubMed PubMed Central Google Scholar
Guindon, S. et al. New algorithms and methods to estimate maximum-likelihood phylogenies: assessing the performance of PhyML 3.0. Syst. Biol. 59, 307–321 (2010).
Article CAS PubMed Google Scholar
Anisimova, M., Gil, M., Dufayard, J.-F., Dessimoz, C. & Gascuel, O. Survey of branch support methods demonstrates accuracy, power, and robustness of fast likelihood-based approximation schemes. Syst. Biol. 60, 685–699 (2011).
Article PubMed PubMed Central Google Scholar
Kumar, S., Stecher, G. & Tamura, K. MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets. Mol. Biol. Evol. 33, 1870–1874 (2016).
Article CAS PubMed PubMed Central Google Scholar
Novak, P., Neumann, P. & Macas, J. Graph-based clustering and characterization of repetitive sequences in next-generation sequencing data. BMC Bioinformatics 11, 378 (2010).
Article PubMed PubMed Central CAS Google Scholar
Macas, J., Neumann, P. & Navratilova, A. Repetitive DNA in the pea (Pisum sativum L.) genome: comprehensive characterization using 454 sequencing and comparison to soybean and Medicago truncatula. BMC Genomics 8, 427 (2007).
Article PubMed PubMed Central Google Scholar
Swaminathan, K., Varala, K. & Hudson, M. E. Global repeat discovery and estimation of genomic copy number in a large, complex genome using a high-throughput 454 sequence survey. BMC Genomics 8, 132 (2007).
Article PubMed PubMed Central CAS Google Scholar
Wicker, T. et al. Low-pass shotgun sequencing of the barley genome facilitates rapid identification of genes, conserved non-coding sequences and novel repeats. BMC Genomics 9, 518 (2008).
Article PubMed PubMed Central CAS Google Scholar
Edwards, Y. J., Elgar, G., Clark, M. S. & Bishop, M. J. The identification and characterization of microsatellites in the compact genome of the Japanese pufferfish, Fugu rubripes: perspectives in functional and comparative genomic analyses. J. Mol. Biol. 278, 843–854 (1998).
Article CAS PubMed Google Scholar
Chistiakov D.A. A4 - Hellemans, B. A4 - Volckaert, F.A.M., D. A. A.-C. Microsatellites and their genomic distribution, evolution, function and applications: A review with special reference to fish genetics. Aquaculture v. 255, 1-29–2006 v.255 nos. 1–4 (2006).
Article CAS Google Scholar
Sharma, P. C., Grover, A. & Kahl, G. Mining microsatellites in eukaryotic genomes. Trends Biotechnol. 25, 490–498 (2007).
Article CAS PubMed Google Scholar
Castoe, T. A. et al. Thousands of microsatellite loci from the venomous coralsnake Micrurus fulvius and variability of select loci across populations and related species. Mol. Ecol. Resour. 12, 1105–1113 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ustinova, J., Achmann, R., Cremer, S. & Mayer, F. Long repeats in a huge genome: microsatellite loci in the grasshopper Chorthippus biguttulus. J. Mol. Evol. 62, 158–167 (2006).
Article ADS CAS PubMed Google Scholar
Pannebakker, B. A., Niehuis, O., Hedley, A., Gadau, J. & Shuker, D. M. The distribution of microsatellites in the Nasonia parasitoid wasp genome. Insect Mol. Biol. 19(Suppl 1), 91–98 (2010).
Article CAS PubMed Google Scholar
Kejnovsky, E. et al. Expansion of microsatellites on evolutionary young Y chromosome. PLoS One 8, e45519 (2013).
Article ADS CAS PubMed PubMed Central Google Scholar
Schoebel, C. N. et al. Lessons learned from microsatellite development for nonmodel organisms using 454 pyrosequencing. J. Evol. Biol. 26, 600–611 (2013).
Article CAS PubMed Google Scholar
Li, Z. et al. Genome-wide mapping and characterization of microsatellites in the swamp eel genome. Sci. Rep. 7, 3157 (2017).
Article ADS PubMed PubMed Central CAS Google Scholar
Subramanian, S., Mishra, R. K. & Singh, L. Genome-wide analysis of microsatellite repeats in humans: their abundance and density in specific genomic regions. Genome Biol. 4, R13 (2003).
Article PubMed PubMed Central Google Scholar
Vanzela, A. L. L. et al. Differential Distribution of (GA)9+C Microsatellite on Chromosomes of Some Animal and Plant Species. Cytologia (Tokyo). 67, 9–13 (2002).
Article CAS Google Scholar
Volff, J.-N., Bouneau, L., Ozouf-Costaz, C. & Fischer, C. Diversity of retrotransposable elements in compact pufferfish genomes. Trends Genet. 19, 674–678 (2003).
Article CAS PubMed Google Scholar
Gao, B. et al. The contribution of transposable elements to size variations between four teleost genomes. Mob. DNA 7, 4 (2016).
Article PubMed PubMed Central Google Scholar
Brawand, D. et al. The genomic substrate for adaptive radiation in African cichlid fish. Nature 513, 375–381 (2014).
Article ADS CAS PubMed PubMed Central Google Scholar
Lohe, A. R., Moriyama, E. N., Lidholm, D. A. & Hartl, D. L. Horizontal transmission, vertical inactivation, and stochastic loss of mariner-like transposable elements. Mol. Biol. Evol. 12, 62–72 (1995).
Article CAS PubMed Google Scholar
Raskina, O., Barber, J. C., Nevo, E. & Belyayev, A. Repetitive DNA and chromosomal rearrangements: speciation-related events in plant genomes. Cytogenet. Genome Res. 120, 351–357 (2008).
Article CAS PubMed Google Scholar
Harvey, S. C. et al. Karyotype evolution in Tilapia: mitotic and meiotic chromosome analysis of Oreochromis karongae and O. niloticus x O. karongae hybrids. Genetica 115, 169–177 (2002).
Article CAS PubMed Google Scholar
Steinemann, S. & Steinemann, M. Retroelements: tools for sex chromosome evolution. Cytogenet. Genome Res. 110, 134–143 (2005).
Article CAS PubMed Google Scholar
Pokorna, M., Kratochvil, L. & Kejnovsky, E. Microsatellite distribution on sex chromosomes at different stages of heteromorphism and heterochromatinization in two lizard species (Squamata: Eublepharidae: Coleonyx elegans and lacertidae: Eremias velox). BMC Genet. 12, 90 (2011).
Article CAS PubMed PubMed Central Google Scholar
Volff, J. N., Korting, C. & Schartl, M. Multiple lineages of the non-LTR retrotransposon Rex1 with varying success in invading fish genomes. Mol. Biol. Evol. 17, 1673–1684 (2000).
Article CAS PubMed Google Scholar
Terencio, M. L. et al. Repetitive sequences associated with differentiation of W chromosome in Semaprochilodus taeniurus. Genetica 140, 505–512 (2012).
Article CAS PubMed Google Scholar
Takehana, Y. et al. Molecular cloning and characterization of the repetitive DNA sequences that comprise the constitutive heterochromatin of the W chromosomes of medaka fishes. Chromosome Res. 20, 71–81 (2012).
Article CAS PubMed Google Scholar
Carrasco, L. A. P., Penman, D. J. & Bromage, N. Evidence for the presence of sex chromosomes in the Nile tilapia (Oreochromis niloticus) from synaptonemal complex analysis of XX, XY and YY genotypes. Aquaculture 173, 207–218 (1999).
Article Google Scholar
Chew, J. S. K., Oliveira, C., Wright, J. & Dobson, M. J. Molecular and cytogenetic analysis of the telomeric (TTAGGG)n repetitive sequences in the Nile tilapia, Oreochromis niloticus (teleostei: Cichlidae). Chromosoma 111, 45–52 (2002).
Article CAS PubMed Google Scholar
Fischer, C. et al. Global heterochromatic colocalization of transposable elements with minisatellites in the compact genome of the pufferfish Tetraodon nigroviridis. Gene 336, 175–183 (2004).
Article CAS PubMed Google Scholar
Favarato, R. M., Ribeiro, L. B., Feldberg, E. & Matoso, D. A. Chromosomal Mapping of Transposable Elements of the Rex Family in the Bristlenose Catfish, Ancistrus (Siluriformes, Loricariidae), from the Amazonian Region. J. Hered. 108, 254–261 (2017).
CAS PubMed Google Scholar
Richards, C. L., Bossdorf, O. & Pigliucci, M. What role does heritable epigenetic variation play in phenotypic evolution? Bioscience 60, 232–237 (2010).
Article Google Scholar
Gonzalez, P. & Lessios, H. A. Evolution of sea urchin retroviral-like (SURL) elements: evidence from 40 echinoid species. Mol. Biol. Evol. 16, 938–952 (1999).
Article CAS PubMed Google Scholar
Merlo, M. A. et al. Evidence for 5S rDNA Horizontal Transfer in the toadfish Halobatrachus didactylus (Schneider, 1801) based on the analysis of three multigene families. BMC Evol. Biol. 12 (2012).
Article CAS PubMed PubMed Central Google Scholar
Kapitonov, V. V. & Jurka, J. A novel class of SINE elements derived from 5S rRNA. Mol. Biol. Evol. 20, 694–702 (2003).
Article CAS PubMed Google Scholar

Download references

Acknowledgements

Authors thank to Dr Manuel Manchado from the IFAPA El Toruño for keeping and sharing the BAC library and Dr. Alberto Arias and Mr.Royston Snart for a critical reading of the manuscript. This study has been supported by the Spanish Ministerio de Ciencia e Innovación MICINN-FEDER (AGL2014-51860 -C2-1-P and RTI2018-096847-B-C21). AGA received a fellowship from the UCA.

Author information

These authors contributed equally: Emilio García and Ismael Cross.

Authors and Affiliations

Área de Genética, Facultad de Ciencias del Mar y Ambientales, INMAR, Universidad de Cádiz, 11510, Cádiz, Spain
Emilio García, Ismael Cross, Silvia Portela-Bens, María E. Rodríguez, Aglaya García-Angulo, Belén Molina & Laureana Rebordinos
Department of Biomedicine and Biotechnology, University of Alcala, 28871, Alcalá de Henares (Madrid), Spain
Angeles Cuadrado
Institut für Humangenetik, Universitätsklinikum Jena, 07747, Jena, Germany
Thomas Liehr

Authors

Emilio García
View author publications
You can also search for this author in PubMed Google Scholar
Ismael Cross
View author publications
You can also search for this author in PubMed Google Scholar
Silvia Portela-Bens
View author publications
You can also search for this author in PubMed Google Scholar
María E. Rodríguez
View author publications
You can also search for this author in PubMed Google Scholar
Aglaya García-Angulo
View author publications
You can also search for this author in PubMed Google Scholar
Belén Molina
View author publications
You can also search for this author in PubMed Google Scholar
Angeles Cuadrado
View author publications
You can also search for this author in PubMed Google Scholar
Thomas Liehr
View author publications
You can also search for this author in PubMed Google Scholar
Laureana Rebordinos
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

E.G. and I.C. contributed equally to the work and carried out the bioinformatic analysis. E.G. made the chromosome preparations and I.C. made the phylogenetic tree and drafted the paper. S.P., M.E.R. and T.L. carried out the mFISH. A.G. made the screening and isolation of the BACs from the library. B.M. isolated the DNA of the BACs for sequencing and mFISH. A.C. carried out the SSRs FISH. L.R. conceived and coordinated the study, participated in its design, discussed the results and corrected the manuscript. All authors read and approved the final manuscript

Corresponding author

Correspondence to Laureana Rebordinos.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Dataset 1

Supplementary Dataset 2

Supplementary Dataset 3

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

García, E., Cross, I., Portela-Bens, S. et al. Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome. Sci Rep 9, 17146 (2019). https://doi.org/10.1038/s41598-019-53673-6

Download citation

Received: 11 March 2019
Accepted: 05 November 2019
Published: 20 November 2019
DOI: https://doi.org/10.1038/s41598-019-53673-6
Springer Nature Limited

This article is cited by

Chromosome anchoring in Senegalese sole (Solea senegalensis) reveals sex-associated markers and genome rearrangements in flatfish
- Israel Guerrero-Cózar
- Jessica Gomez-Garrido
- Manuel Manchado
Scientific Reports (2021)
Development of whole-genome multiplex assays and construction of an integrated genetic map using SSR markers in Senegalese sole
- Israel Guerrero-Cózar
- Cathaysa Perez-Garcia
- Manuel Manchado
Scientific Reports (2020)

Integrative genetic map of repetitive DNA in the sole Solea senegalensis genome shows a Rex transposon located in a proto-sex chromosome

Abstract

Similar content being viewed by others

Introduction

Methods

PCR screening from S. senegalensis genomic library and BAC clone sequencing

FISH analysis

Chromosome Preparations

mBAC-FISH

SSRs-FISH

Bioinformatic analysis

Phylogenetic analysis

Compliance with ethical standards

Results

BAC-FISH mapping in S. senegalensis

FISH mapping of SSRs in S. senegalensis

NGS analysis of SSRs and TEs in the S. senegalensis BAC clones

Discussion

Conclusions

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation