The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs

Garavís, Miguel; Méndez-Lago, María; Gabelica, Valérie; Whitehead, Siobhan L.; González, Carlos; Villasante, Alfredo

doi:10.1038/srep13307

The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs

Article
Open access
Published: 20 August 2015

Volume 5, article number 13307, (2015)
Cite this article

Download PDF

You have full access to this open access article

Scientific Reports

The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs

Download PDF

Miguel Garavís^1,2,
María Méndez-Lago¹^nAff6,
Valérie Gabelica^3,4,
Siobhan L. Whitehead⁵,
Carlos González² &
…
Alfredo Villasante¹

4059 Accesses
45 Citations
3 Altmetric
Explore all metrics

Abstract

Centromeres are the chromosomal loci at which spindle microtubules attach to mediate chromosome segregation during mitosis and meiosis. In most eukaryotes, centromeres are made up of highly repetitive DNA sequences (satellite DNA) interspersed with middle repetitive DNA sequences (transposable elements). Despite the efforts to establish complete genomic sequences of eukaryotic organisms, the so-called ‘finished’ genomes are not actually complete because the centromeres have not been assembled due to the intrinsic difficulties in constructing both physical maps and complete sequence assemblies of long stretches of tandemly repetitive DNA. Here we show the first molecular structure of an endogenous Drosophila centromere and the ability of the C-rich dodeca satellite strand to form dimeric i-motifs. The finding of i-motif structures in simple and complex centromeric satellite DNAs leads us to suggest that these centromeric sequences may have been selected not by their primary sequence but by their ability to form noncanonical secondary structures.

Interspecies conservation of organisation and function between nonhomologous regional centromeres

Article Open access 28 May 2019

Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches

Article Open access 21 September 2015

DNA Sequences in Centromere Formation and Function

Introduction

Centromere sequences evolve rapidly due to inevitable recombination processes undergone by tandemly repeated sequences but many centromere proteins are conserved¹. This paradox could be explained by the presence of a conserved sequence-independent structural motif, rather than a particular sequence motif². The maintenance of a centromere-specific motif through evolution could be driven by molecular co-evolution of centromeric DNA and centromeric proteins. Importantly, in support of this evolutionary hypothesis, it has been shown that the centromere-specific histone H3 variant CENP-A (CID in Drosophila) has evolved in concert with centromeric satellite DNAs^3,4.

Although a centromeric structural motif might be sufficient to direct the formation of centromeric chromatin on its own, the episodic occurrence of centromere activity associated with noncentromeric sequences, neocentromeres^5,6 and the frequent inactivation/reactivation of centromeres^{7,8,9,10,11,12} indicates that centromere specification involves both genomic competency and epigenetic mechanisms¹³. It is now recognized that CENP-A-containing nucleosomes provide the epigenetic mark to establish the centromere-specific chromatin^14,15,16 and that the centromeric chromatin contains blocks of CENP-A nucleosomes interspersed with blocks of canonical histone H3 nucleosomes¹⁷. However, the folding of this centromeric chromatin is still not clear and is of much controversy^18,19.

Although there is evidence to suggest that each Drosophila melanogaster endogenous centromere is made up of different simple and complex satellite DNAs^{20,21,22,23,24,25}, their molecular structure has yet to be determined. The centromeric region of chromosome 3 of D. melanogaster, as well as the centromeric region of chromosomes 2 and 3 of the sibling species Drosophila simulans and Drosophila mauritiana, contains dodeca satellite 11/12 bp tandem repeats (CCCGTACTGGT/CCCGTACTCGGT) showing asymmetric distribution of guanine and cytosine residues such that one strand is predominantly G-rich and the other C-rich^25,26,27.

In order to fully understand the structural and functional aspects of centromeres, it is important to elucidate the types of secondary DNA structures that can be formed by their constituent repeat units. Hence, we determined that the G-rich dodeca satellite strand is able to fold into very stable intramolecular hairpin structures that are stabilized by the formation of noncanonical G:A pairs²⁸ and recently we have shown that not only the type B monomer of the human centromeric alpha-satellite²⁸ but also the type A are able to form dimeric i-motif structures²⁹. The i-motif is a four-stranded intercalated structure formed by the association of two parallel duplexes combined in an antiparallel fashion by forming intercalated hemi-protonated C:C⁺ base pairs^30,31,32,33. As i-motif formation requires protonation of cytosines these structures are more stable at acidic pH, although, depending on particular C-rich sequences, they can fold at neutral pH^34,35. I-motifs can also exist at neutral pH under molecular crowded conditions³⁶ and under transcriptionally induced negative superhelicity³⁷.

Since centromere specification may rely on centromeric structural motifs under the control of epigenetic mechanisms, knowledge of the fine structure of the endogenous D. melanogaster centromeres is required to elucidate the potential formation of noncanonical centromeric DNA structures. However, due to the repetitive nature of centric heterochromatin, the endogenous centromeres remain poorly represented in the new D. melanogaster Release 6 reference genome sequence³⁸.

The heterochromatin of D. melanogaster has been subdivided into 61 distinct cytological regions³⁹ and the primary constriction of the third chromosome localizes asymmetrically within the h53 region³⁹. Previous work has shown that the dodeca satellite DNA hybridized very close to the primary constriction of the third chromosome, but extending to the right arm²⁶ and that, occasionally, two hybridization signals can be seen in prometaphase chromosomes²⁷. Moreover, the cytological analysis of the free chromosome arm F(3R)1 has shown that the amount of dodeca satellite on the right arm can be reduced without compromising chromosome segregation²⁶. Initial studies of the long-range structure of the dodeca satellite DNA were reported by²⁷. Common-cutting restriction enzymes that do not cut within dodeca satellite DNA revealed that most of the dodeca satellite was organized in two major blocks. In addition, the distribution of restriction sites in the long-range map suggested that the region between the blocks consists of complex DNA sequences, while the flanking region of one of the blocks is likely to contain another putative block of repetitive DNA yet undescribed.

Results and Discussion

Since a complete physical map across the centromere should extend from chromosome arm 3L to chromosome arm 3R, we set out to isolate bacterial artificial chromosome (BAC) clones that contain dodeca satellite and to construct a comprehensive map around the dodeca satellite blocks using 20 restriction enzymes that do not occur within the dodeca satellite. Single and double genomic digests were size-fractionated by pulsed-field gel electrophoresis (PFGE) using a “Waltzer” apparatus⁴⁰, which gives sharp resolution up to 2 Mb (representative digests are shown in Supplementary Fig. S1). In order to obtain dodeca satellite clones, three D. melanogaster BAC libraries were screened: the RPCI-98 library generated by cloning EcoRI-digested genomic DNA and the CHORI-221 and CHORI-223 libraries generated from sheared genomic DNA. BAC end sequencing and fingerprinting data of the stronger clones were used to construct contigs and five clones were chosen for complete sequencing: CH221-29J09 (containing rDNA intergenic spacer (IGS) related sequences, cIGS), CH211-27P10 (containing Akap200 related sequences) and BACR19P07, BACR16A01 and BACR12I02 that were also positive for the retrotransposon Circe. The presence of Circe sequences in the centromeric region h53 had previously been reported⁴¹. By combining data from the sequence of these BACs and the sequence of whole genome shotgun scaffolds containing dodeca satellite with the results of an accurate restriction site mapping of genomic DNA, it has been possible to determine the position and orientation of the first eight scaffolds in the Release 6 assembly of the chromosome arm 3R (Fig. 1a).

The dodeca satellite sequences at this endogenous centromere are organized as two adjacent major blocks, block I and block II, plus several minor blocks (Fig. 1a). Interestingly, a detailed analysis of the sequence of the blocks has shown that block I has more undeca than dodeca repeats (Supplementary Fig. S2) and that the sequence of both repeat units are highly conserved (Supplementary Fig. S3). Moreover, this centromeric region contains transposable elements and two segmental duplications: one results from a duplication of a fragment of Akap200 (chromosome arm 2L at 29C) and subsequent amplification and the other, located at one edge of block I, results from a duplication of IGS sequences at the nucleolus organizer region (NOR) (Fig. 1a). Nevertheless, FISH to mitotic chromosomes under low-stringency conditions with a cIGS-specific probe has not detected IGS-related sequences in the centromeric region of chromosome 2, although clear cross-hybridization signals occur at the NORs (Supplementary Fig. S4).

To elucidate whether CID interacts with dodeca satellite sequences, immunofluorescence-FISH experiments were performed. Thus, by using SuUR Su(var)3–9 double mutants to suppress the normal under-replication of Drosophila heterochromatin during the process of polytenization, we showed that CID co-localizes on polytene chromosomes with dodeca satellite sequences⁴². To corroborate this interaction, we increased the resolution performing immunofluorescence-FISH on extended chromatin fibers from cultured S2 cells and found that anti-CID antibody and dodeca satellite signals co-localize in approximately one fourth of the CID-positive fibers (n = 46), consistent with dodeca satellite being present only in the centromeric heterochromatin of chromosome 3 and not in the other four Drosophila melanogaster chromosomes (Fig. 1b). Some of the fibers that were positive for dodeca did not show co-staining for CID. This could indicate that dodeca satellite, most likely block II and minor blocks, extends beyond the centromeric chromatin, which contains CID nucleosomes. A similar scenario has been observed in the centromeres of Arabidopsis thaliana, in which part of the 178 bp satellite repeats extend beyond the centromeric chromatin into pericentromeric regions. This pericentromeric 178-bp satellite associates with H3.1-containing chromatin, while the centromeric 178-bp repeats associate with CENH3-containing chromatine^43,44. Although the assembly of Droshophila melanogaster centromere 3 presented here is not completed at a base pair level, our results strongly suggest that dodeca-satellite block I is a good candidate for the centromere of chromosome 3.

During our effort to identify the putative block of simple sequence DNA in the flanking region of the dodeca satellite block 1, we repeated the cytological mapping of the 10 bp satellite (AATAACATAG)n using a fluorescent probe, which improves sensitivity and resolution with respect to results obtained with tritiated probes²². The 10 bp satellite had been mapped by²² to region h37 on the second chromosome (contiguous to the centromeric region h38) and to region h48 on the third chromosome (far away from the centromeric region h53). Unexpectedly, FISH experiments with dodeca and 10 bp satellite probes revealed no additional sites for the 10 bp satellite, but showed a change in its location on the third chromosome from h48 to h52p, a position which is very close to dodeca satellite (Fig. 2a–e). To investigate further the possibility that the flanking satellite DNA corresponds to the 10 bp satellite, we asked whether the 1.2 Mb BssHII fragment containing both dodeca satellite and flanking satellite sequences would hybridize with the 10 bp satellite probe. To this end, genomic DNA was digested with BssHII, size-fractionated by PFGE, transferred to a nylon membrane, hybridized with the dodeca satellite probe and then stripped and re-hybridized with the 10 bp satellite probe. As can be seen in Fig. 2f, there is a 1.2 Mb fragment (labeled with an asterisk) that hybridizes with both probes. Finally, the junction between 10 bp satellite and dodeca satellite sequences was found by searching the Trace Archive database (Fig. 2g). This result indicates that the 10 bp satellite DNA is physically linked to the dodeca satellite DNA. Here, it is important to remember that PROD, a protein required for centromere condensation⁴⁵ and that specifically recognizes the 10 bp satellite⁴⁵, is located near but not in the CID-containing chromatin⁴⁶. Therefore, the physical map constructed comprises two adjacent chromatin domains with distinct functions. Although we have not completed the assembly at a base pair level, the sequence obtained from the five newly sequenced BAC clones, together with pre-existing contigs, with the identification of large blocks of several DNA satellites and with the correct re-location of the 10 bp satellite from h48 to h52p, represent the most comprehensive physical map and assembly across the centromere of chromosome 3 of Drosophila melanogaster.

To determine the structural behavior of the dodeca satellite DNA, several oligonucleotides containing the dodeca repeat and, its main variant, the undeca repeat were studied by NMR, circular dichroism (CD) and mass spectrometry (MS). The G-rich and the C-rich strands were analyzed under different experimental conditions (Supplementary Fig. S5 and S6). In all cases, the NMR spectra of the G-rich oligonucleotides indicate the formation of G:C base pairs and no formation of G-tetrads is observed even at high K⁺ concentrations (Supplementary Fig. S5). This is in agreement with the formation of intramolecular hairpins previously reported⁴⁷. However, under acidic conditions the NMR spectra of the oligonucleotides corresponding to the C-rich strand of the dodeca and undeca repeats exhibit sharp imino signals around 15–16 ppm, characteristic of i-motif formation (Fig. 3a,b and Supplementary Fig. S6,8). I-motif formation is confirmed by CD spectra, which show the characteristic strong positive band at 285 nm⁴⁸ (Fig. 3c,d). CD melting experiments show that these structures are quite stable at pH 4.0, with melting temperatures around 42 °C for dodeca and 45 °C for undeca (Supplementary Fig. S7e,f). Mass spectrometry data clearly indicate the formation of dimeric structures at acidic pH (Fig. 3e,f). The peaks corresponding to the dimeric species are not present at neutral pH and become very intense at pH 4 (Supplementary Fig. S7a–d). This pH dependence is consistent with i-motif structures. Interestingly, no tetrameric species are observed in mass spectrometry experiments, indicating that the structures formed by these oligonucleotides are the result of the self-association of two hairpins; similar dimeric structures as those observed in the A and B box of the human alpha satellite^28,29. To further explore this similarity, we carried out two-dimensional NMR experiments of the dodeca oligonucleotide. Although a full structural determination is beyond the scope of this study, some interesting information can be readily spotted from the NOESY spectra. Each of the six cytosine imino signals (14.5–16.0 ppm) exhibit NOE cross-peaks with two amino protons (Fig. 4a). The presence of only two cross-peaks with cytosine amino protons instead of four (see Fig. 4c) denotes the formation of C:C⁺ base pairs between equivalent residues in each subunit. Other NOEs characteristic of i-motifs are also observed, such as imino-imino cross-peaks between adjacent C:C⁺ base pairs (Fig. 4a) or H1’-H1’ contacts between deoxyribose protons (Fig. 4b). According to these experimental data, we suggest a plausible structural model in which two hairpins self-associate in a head-to-head orientation through the formation of six intermolecular C:C⁺ base pairs (Fig. 4d).

In order to explore the existence of these structures under conditions that better represent a physiological context, we performed CD experiments at pH 6 and pH 7 and we explored the influence of the crowding agent PEG₄₀₀₀ on the stability of the structure. The CD spectra of both sequences at pH 7 show a maximum of ellipticity around 275 nm and a minimum around 240 nm (Supplementary Fig. S9). These CD spectra are different than those observed in single stranded DNA and are considered as indicative of “i-motif-like” structures⁴⁹. At pH 6, the maximum ellipticities of both sequences appear at around 285 nm, which is characteristic of i-motif structures. Importantly, the addition of 20% w/w PEG₄₀₀₀ produces a notable increase of the intensity at 285 nm (Supplementary Fig. S9), which indicates that the crowding conditions favour the formation of the i-motif at this pH. These results are totally consistent with previous observations on the stabilizing effect of crowding agents in other i-motif structures³⁶ and suggest that the crowding environment present in the centromeric nucleosome might favour i-motif formation.

In conclusion, we have shown that the C-rich strand of the dodeca satellite (from both its 11 bp and 12 bp repeat units) is able to form dimeric i-motif structures in vitro. This experimental evidence, together with recent findings of similar i-motif structures in the human centromeric alpha-satellite²⁹ lead us to suggest that we may be observing a structural motif common to centromeric sequences. Interestingly, preliminary results on the 359 bp satellite DNA (centromeric DNA from the Drosophila X chromosome)²² point in this direction, since the 359 bp C-rich region (found at internucleosomal linkers⁵⁰) can also fold into i-motif structures (Supplementary Fig. S10).

These in vitro results stimulate the study of non-canonical DNA structures in vivo. The recent observation of G-quadruplex structures in vivo -20 years after their discovery in vitro- in telomeres and in gene promoters demonstrates that non-canonical DNA structures, different than the double helix, play significant roles in biological processes⁵¹. Much effort is still ahead to probe whether i-motifs exist in vivo or not, although recent findings on gene inhibition by small molecules that bind i-motif structures in promoter regions, suggest they might form in vivo^52,53. If the occurrence of these structures were demonstrated in a centromeric nucleosomal context, they could be a structural signature of centromeric regions. Moreover, this would provide further support for our hypothesis that centromeric sequences are selected not by their primary sequence but by their ability to form noncanonical secondary structures⁵⁴.

Materials and Methods

Drosophila strains and cell lines

Oregon R was used as wild-type strain. The isogenic red e strain was used for the construction of the physical map. Standard culture conditions and media were used. Drosophila S2 cells were grown and maintained as described⁴⁶.

DNA analysis, sequencing and probes

High molecular weight DNA from 0–12 h Drosophila embryos was prepared in agarose plugs as previously described by Ref. 25,55,56. Restriction enzyme digestions were performed following the suppliers’ recommendations. DNA was analyzed by pulsed-field gel electrophoresis using a “Waltzer” apparatus⁴⁰ and transferred to Hybond N⁺ nylon filters (Amersham) in 0.4 M NaOH.

The dodeca satellite probe was pBK6E218²⁵. The 10 bp satellite oligo probe was 5′-AATAACATAGAATAACATAGAATAACATAGAATAACATAGAATAACATAG-3′. The centromeric IGS (cIGS) probe (5.6 kb fragment) was obtained from BACR31J03 using the primers: cIGS-Fw: 5′-TGGCAGCGTTTTAAGGGATG-3′ and cIGS-Rv: 5′-TAAGACGCCTGCAGAGAACG-3′. The PCR was carried out as described by⁵⁷. The PCR product was cloned in vector pGEM-T (Promega). Plasmid probes were ³²P-labeled by random-priming and oligonucleotide probes were ³²P-labeled with T4 polynucleotide kinase. The BAC clones were sequenced at The Wellcome Trust Sanger Institute by the standard shotgun sequencing and directed finishing approach. The GenBank accession number for the sequence of BAC19P07, BAC16A01, BAC12I02, CH221-29J09 and CH221-27P10 are CU311183, CR942806, CR942807, CU463787 and CU313318, respectively.

Fluorescence in situ hybridization to mitotic chromosomes

Larval neuroblast chromosomes from Oregon R were prepared as described previously⁵⁸. Chromosomes were counterstained with 4′,6-diamino-2-phenylindole (DAPI). The dodeca satellite oligo probe 5′-CCCGTACTGGTCCCGTACTGGTCCCGTACTCGGTCCCGTACTCGGT-3′ and the 10 bp satellite oligo probe 5′-AATAACATAGAATAACATAGAATAACATAGAATAACATAGAATAACATAG-3′ were chemically synthesized and labeled at the 5′ end with Cy3 or at the 3′ end with fluorescein (New England Biolabs). DNA probes derived from clones or PCR products were labeled by nick translation with digoxygenin-11-dUTP (Roche) using the DIG-Nick Translation Mix (Roche). Digoxygenin labeled probes were detected with Anti-Digoxigenin-Rhodamine, Fab fragments (Roche) in a 1:200 dilution, following supplier recommendations. Digital images were obtained using a Zeiss Axiover 200 microscope equipped with a cooled Charge-Coupled Device camera. The fluorescent signals were recorded separately as grey-scale digital images and then pseudo-colored and merged using Adobe Photoshop software.

Immunofluorescence-FISH on extended chromatin fibers

Extended chromatin fibers were prepared from S2 cells by centrifuging 5 × 10⁴ cells onto slides at 800 rpm for 4 min in a Cytospin 4 (Thermo Shandon, Pittsburgh, PA) and then slides were dipped into salt detergent lysis buffer (25 mM Tris, pH 7.5, 500 mM NaCl and 1% Triton X-100) for 25 minutes, slowly and steadily removed using an in-house made device consisting of a modified EasyDip™ Slide Staining System connected to a peristaltic pump and subsequently fixed in 4% paraformaldehyde (PFA) for 5 minutes. Slides were incubated in 1×PBST (1×PBS ± 0.05% Tween-20) for 15 minutes. Slides were dipped again in the former lysis dilution for 15 minutes, after which they were slowly and steadily removed. Slides were blocked in 1×PBS, 0.1% Triton X-100, 4% formaldehyde for 10 minutes at room temperature and washed for 5 minutes in 1x PBS before proceeding to immunostaining. Slides were blocked in goat serum (Zymed Laboratories) for 30 minutes and incubated overnight at 4 °C with a chicken anti-CID antibody⁴⁶, diluted to 1:100 in blocking buffer. Slides were washed 3 times for 5 minutes in 1×PBST and incubated for 1 hour at 37 °C in Alexa 488 anti-chicken secondary antibody (Molecular probes). Slides were then washed 3 times in 1×PBST and 3 times in 1×PBS. After immunofluorescence with CID antibodies, slides were re-fixed in 4% formaldehyde for 15 minutes and then hybridized to the dodeca probe. For each slide, around 250 ng of dodeca probe were precipitated with 3 M Sodium Acetate and absolute ethanol, re-suspended in hybridization solution (50% formamide, 10% dextran sulfate, 2×SSC) and denatured for 10 min at 80 °C. Slides were incubated at 37 °C for 24 h.

DNA sample preparation for NMR and MS experiments

Oligonucleotides were purchased from Integrated DNA Technologies, IDT, Coralville, IA, USA. Samples for NMR experiments were dissolved in 9:1 H₂O/D₂O. Buffer conditions: 25 mM sodium phosphate, 100 mM NaCl pH 4.0 for C rich sequences and 25 mM potassium phosphate, 100 mM KCl pH 7.0 for G rich sequences. The latter were previously annealed by heating at 90 °C for 5 minutes and cooling down to room temperature overnight.

Samples for MS experiments were dissolved at 100 μM in 100 mM NH₄OAc buffer at pH 7 and pH 4. pH was adjusted by adding acetic acid and NH₃ aliquots.

NMR experiments

All NMR spectra were acquired in Bruker spectrometers operating at 600 and 800 MHz, equipped with cryoprobes and processed with the TOPSPIN software. A jump-and-return pulse sequence⁵⁹ was employed to observe the rapidly exchanging protons in 1D H₂O experiments. In most of the experiments in H₂O, water suppression was achieved by including a WATERGATE module in the pulse sequence prior to acquisition. NOESY experiments were recorded at 5 °C, in 25 mM NaPi, pH 4.0 (9:1 H₂O/D₂O) buffer and with mixing time of 100 ms.

Circular Dichroism spectroscopy

Circular dichroism spectra at different temperatures were recorded on a Jasco J-810 spectropolarimeter fitted with a thermostated cell holder. CD spectra were recorded in 25 mM sodium phosphate buffer, pH 4, with 100 mM NaCl (100 μM oligo concentration). CD melting curves were recorded at the wavelength of the larger positive band, 285 nm with a heating rate of 0.5 °C.min⁻¹.

Experiments with PEG₄₀₀₀ were performed by preparing the sample in buffer 25 mM NaPi, 100 mM NaCl containing 20% w/w PEG₄₀₀₀. A volume of the sample is weighted and an amount of 20% of the measured weight is added as PEG₄₀₀₀ (Sigma-Aldrich) to the sample. Then pH is fixed by adding HCl and NaCl aliquots.

Mass spectrometry

All ESI-MS experiments were carried out in the negative ion mode on an Exactive ESI-Orbitrap mass spectrometer (Thermo Scientific, Bremen, Germany). The ESI spray voltage and capillary voltage used were −2.75 kV and −20 V, respectively. The capillary temperature was set to 150 °C. Tube lens and skimmer voltage were fixed to 180 V and −10 V, respectively. Samples were injected at a flow rate of 4 μL min⁻¹.

Additional Information

How to cite this article: Garavís, M. et al. The structure of an endogenous ^Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs. Sci. Rep. 5, 13307; doi: 10.1038/srep13307 (2015).

References

Henikoff, S., Ahmad, K. & Malik, H. S. The centromere paradox: stable inheritance with rapidly evolving DNA. Science 293, 1098–1102 (2001).
Article CAS PubMed Google Scholar
Abad, J. P. & Villasante, A. Searching for a common centromeric structural motif: Drosophila centromeric satellite DNAs show propensity to form telomeric-like unusual DNA structures. Genetica 109, 71–75 (2000).
Article CAS PubMed Google Scholar
Malik, H. S. & Henikoff, S. Adaptive evolution of Cid, a centromere-specific histone in Drosophila. Genetics 157, 1293–1298 (2001).
Article CAS PubMed PubMed Central Google Scholar
Schueler, M. G., Swanson, W., Thomas, P. J. & Green, E. D. Adaptive evolution of foundation kinetochore proteins in primates. Mol. Biol. Evol. 27, 1585–1597 (2010).
Article CAS PubMed PubMed Central Google Scholar
Du Sart, D. et al. A functional neo-centromere formed through activation of a latent human centromere and consisting of non-alpha-satellite DNA. Nat. Genet. 16, 144–153 (1997).
Article CAS PubMed Google Scholar
Williams, B. C., Murphy, T. D., Goldberg, M. L. & Karpen, G. H. Neocentromere activity of structurally acentric mini-chromosomes in Drosophila. Nat. Genet. 18, 30–38 (1998).
Article CAS PubMed Google Scholar
Rocchi, M., Archidiacono, N., Schempp, W., Capozzi, O. & Stanyon, R. Centromere repositioning in mammals. Heredity 108, 59–67 (2011).
Article PubMed PubMed Central Google Scholar
Han, F., Gao, Z. & Birchler, J. A. Reactivation of an inactive centromere reveals epigenetic and structural components for centromere specification in maize. Plant Cell 21, 1929–1939 (2009).
Article CAS PubMed PubMed Central Google Scholar
Agudo, M. et al. A dicentric chromosome of Drosophila melanogaster showing alternate centromere inactivation. Chromosoma 109, 190–196 (2000).
Article CAS PubMed Google Scholar
Sullivan, B. A. & Willard, H. F. Stable dicentric X chromosomes with two functional centromeres. Nat. Genet. 20, 227–228 (1998).
Article CAS PubMed Google Scholar
Fisher, A. M. et al. Centromeric inactivation in a dicentric human Y; 21 translocation chromosome. Chromosoma 106, 199–206 (1997).
Article CAS PubMed Google Scholar
Steiner, N. C. & Clarke, L. A novel epigenetic effect can alter centromere function in fission yeast. Cell 79, 865–874 (1994).
Article CAS PubMed Google Scholar
Hayden, K. E. et al. Sequences associated with centromere competency in the human genome. Mol. Cell. Biol. 33, 763–772 (2013).
Article CAS PubMed PubMed Central Google Scholar
Allshire, R. C. & Karpen, G. H. Epigenetic regulation of centromeric chromatin: old dogs, new tricks? Nat. Rev. Genet. 9, 923–937 (2008).
Article CAS PubMed PubMed Central Google Scholar
Black, B. E. & Cleveland, D. W. Epigenetic centromere propagation and the nature of CENP-a nucleosomes. Cell 144, 471–479 (2011).
Article CAS PubMed PubMed Central Google Scholar
Fachinetti, D. et al. A two-step mechanism for epigenetic specification of centromere identity and function. Nat. Cell Biol. 15, 1056–1066 (2013).
Article CAS PubMed PubMed Central Google Scholar
Blower, M. D., Sullivan, B. A. & Karpen, G. H. Conserved organization of centromeric chromatin in flies and humans. Dev. Cell 2, 319–330 (2002).
Article CAS PubMed PubMed Central Google Scholar
Ribeiro, S. A. et al. A super-resolution map of the vertebrate kinetochore. Proc. Natl. Acad. Sci. USA. 107, 10484–10489 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Sullivan, B. A. & Karpen, G. H. Centromeric chromatin exhibits a histone modification pattern that is distinct from both euchromatin and heterochromatin. Nat. Struct. Mol. Biol. 11, 1076–1083 (2004).
Article CAS PubMed PubMed Central Google Scholar
Abad, J. et al. Pericentromeric regions containing 1.688 satellite DNA sequences show anti-kinetochore antibody staining in prometaphase chromosomes of Drosophila melanogaster. Mol. Gen. Genet. 264, 371–377 (2000).
Article CAS PubMed Google Scholar
Agudo, M. et al. Centromeres from telomeres? The centromeric region of the Y chromosome of Drosophila melanogaster contains a tandem array of telomeric HeT-A-and TART-related sequences. Nucleic Acids Res. 27, 3318–3324 (1999).
Article CAS PubMed PubMed Central Google Scholar
Lohe, A. R., Hilliker, A. & Roberts, P. Mapping simple repeated DNA sequences in heterochromatin of Drosophila melanogaster. Genetics 134, 1149 (1993).
Article CAS PubMed PubMed Central Google Scholar
Mendez-Lago, M. et al. Novel sequencing strategy for repetitive DNA in a Drosophila BAC clone reveals that the centromeric region of the Y chromosome evolved from a telomere. Nucleic Acids Res. 37, 2264–2273 (2009).
Article CAS PubMed PubMed Central Google Scholar
Sun, D. et al. Inhibition of human telomerase by a G-quadruplex-interactive compound. J. Med. Chem. 40, 2113–2116 (1997).
Article CAS PubMed Google Scholar
Abad, J. P. et al. Dodeca satellite: a conserved G+C-rich satellite from the centromeric heterochromatin of Drosophila melanogaster. Proc. Natl. Acad. Sci. USA. 89, 4663–4667 (1992).
Article ADS CAS PubMed PubMed Central Google Scholar
Carmena, M., Abad, J. P., Villasante, A. & Gonzalez, C. The Drosophila melanogaster dodecasatellite sequence is closely linked to the centromere and can form connections between sister chromatids during mitosis. J. Cell Sci. 105, 41–50 (1993).
Article CAS PubMed Google Scholar
Losada, A., Abad, J., Agudo, M. & Villasante, A. Long-range analysis of the centromeric region of Drosophila melanogaster chromosome 3. Chromosome Res. 8, 651–653 (2000).
Article CAS PubMed Google Scholar
Gallego, J., Chou, S. H. & Reid, B. R. Centromeric pyrimidine strands fold into an intercalated motif by forming a double hairpin with a novel T:G:G:T tetrad: solution structure of the d(TCCCGTTTCCA) dimer. J. Mol. Biol. 273, 840–856 (1997).
Article CAS PubMed Google Scholar
Garavís, M., Escaja, N., Gabelica, V., Villasante, A. & González, C. Centromeric alpha-satellite DNA adopts dimeric i-motif structures capped by AT Hoogsteen base pairs. Chem. Eur. J. 21, 9816–9824 (2015).
Article PubMed CAS Google Scholar
Benabou, S., Avino, A., Eritja, R., Gonzalez, C. & Gargallo, R. Fundamental aspects of the nucleic acid i-motif structures. RSC Adv. 4, 26956–26980 (2014).
Article CAS ADS Google Scholar
Day, H. A., Pavlou, P. & Waller, Z. A. E. i-Motif DNA: Structure, stability and targeting with ligands. Bioorg. Med. Chem. 22, 4407–4418 (2014).
Article CAS PubMed Google Scholar
Gehring, K., Leroy, J.-L. & Gueron, M. A tetrameric DNA structure with protonated cytosine-cytosine base pairs. Nature 363, 561–565 (1993).
Article ADS CAS PubMed Google Scholar
Leroy, J. L., Gehring, K., Kettani, A. & Gueron, M. Acid multimers of oligodeoxycytidine strands: stoichiometry, base-pair characterization and proton exchange properties. Biochemistry 32, 6019–6031 (1993).
Article CAS PubMed Google Scholar
Brazier, J. A., Shah, A. & Brown, G. D. I-Motif formation in gene promoters: unusually stable formation in sequences complementary to known G-quadruplexes. Chem. Commun. 48, 10739–10741 (2012).
Article CAS Google Scholar
Leroy, J. L., Guéron, M., Mergny, J. L. & Hélène, C. Intramolecular folding of a fragment of the cytosine-rich strand of telomeric DNA into an i-motif. Nucleic Acids Res. 22, 1600–1606 (1994).
Article CAS PubMed PubMed Central Google Scholar
Cui, J., Waltman, P., Le, V. H. & Lewis, E. A. The effect of molecular crowding on the stability of human c-MYC promoter sequence i-motif at neutral pH. Molecules 18, 12751–12767 (2013).
Article CAS PubMed PubMed Central Google Scholar
Sun, D. & Hurley, L. H. The importance of negative superhelicity in inducing the formation of G-quadruplex and i-motif structures in the c-Myc promoter: implications for drug targeting and control of gene expression. J. Med. Chem. 52, 2863–2874 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hoskins, R. A. et al. The Release 6 reference sequence of the Drosophila melanogaster genome. Genome Res. 25, 445–458 (2015).
Article PubMed PubMed Central Google Scholar
Gatti, M. & Pimpinelli, S. Functional elements in Drosophila melanogaster heterochromatin. Annu. Rev. Genet. 26, 239–276 (1992).
Article CAS PubMed Google Scholar
Southern, E., Anand, R., Brown, W. & Fletcher, D. A model for the separation of large DNA molecules by crossed field gel electrophoresis. Nucleic Acids Res. 15, 5925–5943 (1987).
Article CAS PubMed PubMed Central Google Scholar
Losada, A., Abad, J. P., Agudo, M. & Villasante, A. The analysis of Circe, an LTR retrotransposon of Drosophila melanogaster, suggests that an insertion of non-LTR retrotransposons into LTR elements can create chimeric retroelements. Mol. Biol. Evol. 16, 1341–1346 (1999).
Article CAS PubMed Google Scholar
Andreyeva, E. N. et al. High-resolution analysis of Drosophila heterochromatin organization using SuUR Su(var)3-9 double mutants. Proc. Natl. Acad. Sci. USA. 104, 12819–12824 (2007).
Article ADS CAS PubMed PubMed Central Google Scholar
Vaquero-Sedas, M. I. & Vega-Palas, M. A. Differential association of Arabidopsis telomeres and centromeres with Histone H3 variants. Sci. Rep. 3, (2013).
Zhang, W., Lee, H. R., Koo, D. H. & Jiang, J. Epigenetic modification of centromeric chromatin: hypomethylation of DNA sequences in the CENH3-associated chromatin in Arabidopsis thaliana and maize. Plant Cell 20, 25–34 (2008).
Article PubMed PubMed Central CAS Google Scholar
Török, T., Gorjánácz, M., Bryant, P. J. & Kiss, I. Prod is a novel DNA-binding protein that binds to the 1.686 g/cm3 10 bp satellite repeat of Drosophila melanogaster. Nucleic Acids Res. 28, 3551–3557 (2000).
Article PubMed PubMed Central Google Scholar
Blower, M. D. & Karpen, G. H. The role of Drosophila CID in kinetochore formation, cell-cycle progression and heterochromatin interactions. Nat. Cell Biol. 3, 730–739 (2001).
Article CAS PubMed PubMed Central Google Scholar
Ferrer, N., Azorin, F., Villasante, A., Gutierrez, C. & Abad, J. Centromeric dodeca-satellite DNA sequences form fold-back structures. J. Mol. Biol. 245, 8–21 (1995).
Article CAS PubMed Google Scholar
Kypr, J., Kejnovská, I., Renčiuk, D. & Vorlíčková, M. Circular dichroism and conformational polymorphism of DNA. Nucleic Acids Res. 37, 1713–1725 (2009).
Article CAS PubMed PubMed Central Google Scholar
Dettler, J. M. et al. Biophysical characterization of an ensemble of intramolecular i-motifs formed by the human c-MYC NHE III1 P1 promoter mutant sequence. Biophys. J. 99, 561–567 (2010).
Article ADS CAS PubMed PubMed Central Google Scholar
Käs, E. & Laemmli, U. K. In vivo topoisomerase II cleavage of the Drosophila histone and satellite III repeats: DNA sequence and structural characteristics. EMBO J. 11, 705 (1992).
Article PubMed PubMed Central Google Scholar
Biffi, G., Tannahill, D., McCafferty, J. & Balasubramanian, S. Quantitative visualization of DNA G-quadruplex structures in human cells. Nat. Chem. 5, 182–186 (2013).
Article CAS PubMed PubMed Central Google Scholar
Kendrick, S. et al. The dynamic character of the BCL2 promoter i-motif provides a mechanism for modulation of gene expression by compounds that bind selectively to the alternative DNA hairpin structure. J. Am. Chem. Soc. 136, 4161–4171 (2014).
Article CAS PubMed PubMed Central Google Scholar
Kang, H. J., Kendrick, S., Hecht, S. M. & Hurley, L. H. The transcriptional complex between the BCL2 i-motif and hnRNP LL is a molecular switch for control of gene expression that can be modulated by small molecules. J. Am. Chem. Soc. 136, 4172–4185 (2014).
Article CAS PubMed PubMed Central Google Scholar
Garavís, M., González, C. & Villasante, A. On the origin of the eukaryotic chromosome: the role of noncanonical DNA structures in telomere evolution. Genome Biol. Evol. 5, 1142–1150 (2013).
Article PubMed PubMed Central CAS Google Scholar
Karpen, G. H. & Spradling, A. C. Analysis of subtelomeric heterochromatin in the Drosophila minichromosome Dp1187 by single P element insertional mutagenesis. Genetics 132, 737–753 (1992).
Article CAS PubMed PubMed Central Google Scholar
Karpen, G. H. & Spradling, A. C. Reduced DNA Polytenization of a Minichromosome Region Undergoing Position-Effect Variegation in Drosophila. Cell 63, 97–107 (1990).
Article CAS PubMed PubMed Central Google Scholar
Losada, A., Abad, J. P. & Villasante, A. Organization of DNA sequences near the centromere of the Drosophila melanogaster Y chromosome. Chromosoma 106, 503–512 (1997).
Article CAS PubMed Google Scholar
Gatti, M., Bonaccorsi, S. & Pimpinelli, S. Looking at Drosophila mitotic chromosomes. Methods Cell Biol. 44, 371–391 (1994).
Article CAS PubMed Google Scholar
Plateau, P. & Gueron, M. Exchangeable proton NMR without base-line distorsion, using new strong-pulse sequences. J. Am. Chem. Soc. 104, 7310–7311 (1982).
Article CAS Google Scholar

Download references

Acknowledgements

We dedicate this work in memory of our dear colleague and friend, Alfredo Villasante, who died during the revision of this manuscript. We gratefully acknowledge D. V. Laurents for revision of the manuscript. We thank G. H. Karpen for instructing M. Mendez-Lago in the preparation of extended chromatin fibers and for critical comments on the manuscript. We thank the Structural Biophysical Chemistry platform of the IECB (CNRS UMS3033/Inserm US001) for the access to the mass spectrometry facility and Dr. F. Rosu for his kind assistance. We also thank A. Losada for the initial long-range restriction analysis. We acknowledge financial support from MICINN (BFU2014-52864-R to C.G.; BFU2011-30295-C02-01 to A.V.), the Inserm (ATIP-Avenir Grant no. R12086GS to V.G.), the Conseil Régional Aquitaine (Grant no. 20121304005 to V.G.), the EU (FP7-PEOPLE-2012-CIG-333611 to V.G.), the Wellcome Trus and the institutional grant from the Fundación Ramón Areces to the Centro de Biología Molecular “Severo Ochoa”. M.G. was supported by the FPI-fellowship BES-2009-027909.

Author information

María Méndez-Lago
Present address: Centro Nacional de Análisis Genómico, Baldiri Reixac 4, 08028, Barcelona, Spain

Authors and Affiliations

Centro de Biología Molecular “Severo Ochoa” (CSIC-UAM), Universidad Autónoma de Madrid, Nicolás Cabrera 1, Madrid, 28049, Spain
Miguel Garavís, María Méndez-Lago & Alfredo Villasante
Instituto de Química Física Rocasolano, CSIC, Serrano 119, Madrid, 28006, Spain
Miguel Garavís & Carlos González
Univ. Bordeaux, ARNA Laboratory, IECB, 2 rue Robert Escarpit, Pessac, F-33600, France
Valérie Gabelica
Inserm ARNA Laboratory, 146 rue Leo Saignat, Bordeaux, F-33000, France
Valérie Gabelica
The Wellcome Trust Sanger Institute, Cambridgeshire, Hinxton, United Kingdom
Siobhan L. Whitehead

Authors

Miguel Garavís
View author publications
You can also search for this author in PubMed Google Scholar
María Méndez-Lago
View author publications
You can also search for this author in PubMed Google Scholar
Valérie Gabelica
View author publications
You can also search for this author in PubMed Google Scholar
Siobhan L. Whitehead
View author publications
You can also search for this author in PubMed Google Scholar
Carlos González
View author publications
You can also search for this author in PubMed Google Scholar
Alfredo Villasante
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.V., C.G., S.L.W. and V.G. designed and M.G. and M.M.L. performed the experiments. M.G. and C.G. performed and analysed the NMR and CD experiments; M.G. and V.G., the mass spectrometry; and M.M.L. and A.V., the fluorescence and FISH experiments. S.L.W. performed the BAC clone sequencing. A.V. supervised the project and wrote the manuscript with contributions from all authors. C.G. revised the final version of the manuscript.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Information

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Garavís, M., Méndez-Lago, M., Gabelica, V. et al. The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs. Sci Rep 5, 13307 (2015). https://doi.org/10.1038/srep13307

Download citation

Received: 23 January 2015
Accepted: 21 July 2015
Published: 20 August 2015
DOI: https://doi.org/10.1038/srep13307
Springer Nature Limited

This article is cited by

The control of transcriptional memory by stable mitotic bookmarking
- Maëlle Bellec
- Jérémy Dufourt
- Mounia Lagha
Nature Communications (2022)
The dark side of centromeres: types, causes and consequences of structural abnormalities implicating centromeric DNA
- V. Barra
- D. Fachinetti
Nature Communications (2018)
The effect of the neutral cytidine protonated analogue pseudoisocytidine on the stability of i-motif structures
- B. Mir
- X. Solés
- N. Escaja
Scientific Reports (2017)
Inheritance of the CENP-A chromatin domain is spatially and temporally constrained at human centromeres
- Justyne E. Ross
- Kaitlin Stimpson Woodlief
- Beth A. Sullivan
Epigenetics & Chromatin (2016)

The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs

Abstract

Similar content being viewed by others

Interspecies conservation of organisation and function between nonhomologous regional centromeres

Organization and evolution of Gorilla centromeric DNA from old strategies to new approaches

DNA Sequences in Centromere Formation and Function

Introduction

Results and Discussion

Materials and Methods

Drosophila strains and cell lines

DNA analysis, sequencing and probes

Fluorescence in situ hybridization to mitotic chromosomes

Immunofluorescence-FISH on extended chromatin fibers

DNA sample preparation for NMR and MS experiments

NMR experiments

Circular Dichroism spectroscopy

Mass spectrometry

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Supplementary Information

Rights and permissions

About this article

Cite this article

This article is cited by

The control of transcriptional memory by stable mitotic bookmarking

The dark side of centromeres: types, causes and consequences of structural abnormalities implicating centromeric DNA

The effect of the neutral cytidine protonated analogue pseudoisocytidine on the stability of i-motif structures

Inheritance of the CENP-A chromatin domain is spatially and temporally constrained at human centromeres

Navigation

The structure of an endogenous Drosophila centromere reveals the prevalence of tandemly repeated sequences able to form i-motifs

Abstract

Similar content being viewed by others

Introduction

Results and Discussion

Materials and Methods

Drosophila strains and cell lines

DNA analysis, sequencing and probes

Fluorescence in situ hybridization to mitotic chromosomes

Immunofluorescence-FISH on extended chromatin fibers

DNA sample preparation for NMR and MS experiments

NMR experiments

Circular Dichroism spectroscopy

Mass spectrometry

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Navigation