Abstract
Approximately 3% of the human genome is composed of short tandem repeat (STR) DNA sequence known as microsatellites, which can be found in both coding and non-coding regions. When associated with genic regions, expansion of microsatellite repeats beyond a critical threshold causes dozens of neurological repeat expansion disorders. To better understand the molecular pathology of repeat expansion disorders, precise cloning of microsatellite repeat sequence and expansion size is highly valuable. Unfortunately, cloning repeat expansions is often challenging and presents a significant bottleneck to practical investigation. Here, we describe a clear method for seamless and systematic cloning of practically any microsatellite repeat expansion. We use cloning and expansion of GGGGCC repeats, which are the leading genetic cause of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD), as an example. We employ a recursive directional ligation (RDL) technique to build multiple GGGGCC repeat-containing vectors. We describe methods to validate repeat expansion cloning, including diagnostic restriction digestion, PCR across the repeat, and next-generation long-read MinION nanopore sequencing. Validated cloning of microsatellite repeats beyond the critical expansion threshold can facilitate step-by-step characterization of disease mechanisms at the cellular and molecular level.
Similar content being viewed by others
Availability of data and material
The datasets generated and analyzed during this study will be made available on a public database to be determined or by request to the corresponding author.
References
Al-Allaf FA, Tolmachov OE, Zambetti LP, Tchetchelnitski V, Mehmet H (2013) Remarkable stability of an instability-prone lentiviral vector plasmid in Escherichia coli Stbl3. 3 Biotech 3:61–70. https://doi.org/10.1007/s13205-012-0070-8
Ameur A, Kloosterman WP, Hestand MS (2019) Single-molecule sequencing: towards clinical applications. Trends Biotechnol 37:72–85. https://doi.org/10.1016/j.tibtech.2018.07.013
Bahlo M, Bennett MF, Degorski P, Tankard RM, Delatycki MB, Lockhart PJ (2018) Recent advances in the detection of repeat expansions with short-read next-generation sequencing. F1000Res. https://doi.org/10.12688/f1000research.13980.1
Baron U, Bujard H (2000) Tet repressor-based system for regulated gene expression in eukaryotic cells: principles and advances. Methods Enzymol 327:401–421. https://doi.org/10.1016/s0076-6879(00)27292-3
Batra R, Lee CW (2017) Mouse models of C9orf72 hexanucleotide repeat expansion in amyotrophic lateral sclerosis/frontotemporal dementia. Front Cell Neurosci 11:196. https://doi.org/10.3389/fncel.2017.00196
Berens C, Hillen W (2003) Gene regulation by tetracyclines. Constraints of resistance regulation in bacteria shape TetR for application in eukaryotes. Eur J Biochem 270:3109–3121. https://doi.org/10.1046/j.1432-1033.2003.03694.x
Brouwer JR, Willemsen R, Oostra BA (2009) Microsatellite repeat instability and neurological disease. BioEssays 31:71–83. https://doi.org/10.1002/bies.080122
Chen IC et al (2009) Spinocerebellar ataxia type 8 larger triplet expansion alters histone modification and induces RNA foci. BMC Mol Biol 10:9. https://doi.org/10.1186/1471-2199-10-9
Cheng W et al (2018) C9ORF72 GGGGCC repeat-associated non-AUG translation is upregulated by stress through eIF2alpha phosphorylation. Nat Commun 9:51. https://doi.org/10.1038/s41467-017-02495-z
Cleary JD, Ranum LP (2017) New developments in RAN translation: insights from multiple diseases. Curr Opin Genet Dev 44:125–134. https://doi.org/10.1016/j.gde.2017.03.006
Cleary JD, Pattamatta A, Ranum LPW (2018) Repeat associated non-ATG (RAN) translation. J Biol Chem. https://doi.org/10.1074/jbc.R118.003237
Corchero JL, Villaverde A (1998) Plasmid maintenance in Escherichia coli recombinant cultures is dramatically, steadily, and specifically influenced by features of the encoded proteins. Biotechnol Bioeng 58:625–632
de Haro M et al (2006) MBNL1 and CUGBP1 modify expanded CUG-induced toxicity in a Drosophila model of myotonic dystrophy type 1. Hum Mol Genet 15:2138–2145. https://doi.org/10.1093/hmg/ddl137
DeJesus-Hernandez M et al (2011) Expanded GGGGCC hexanucleotide repeat in noncoding region of C9ORF72 causes chromosome 9p-linked FTD and ALS. Neuron 72:245–256. https://doi.org/10.1016/j.neuron.2011.09.011
Ebbert MTW et al (2018) Long-read sequencing across the C9orf72 'GGGGCC' repeat expansion: implications for clinical use and genetic discovery efforts in human disease. Mol Neurodegener 13:46. https://doi.org/10.1186/s13024-018-0274-4
Ellegren H (2004) Microsatellites: simple sequences with complex evolution. Nat Rev Genet 5:435–445. https://doi.org/10.1038/nrg1348
Engler C, Kandzia R, Marillonnet S (2008) A one pot, one step, precision cloning method with high throughput capability. PLoS ONE 3:e3647. https://doi.org/10.1371/journal.pone.0003647
Engler C, Gruetzner R, Kandzia R, Marillonnet S (2009) Golden gate shuffling: a one-pot DNA shuffling method based on type IIs restriction enzymes. PLoS ONE 4:e5553. https://doi.org/10.1371/journal.pone.0005553
Evans-Galea MV, Hannan AJ, Carrodus N, Delatycki MB, Saffery R (2013) Epigenetic modifications in trinucleotide repeat diseases. Trends Mol Med 19:655–663. https://doi.org/10.1016/j.molmed.2013.07.007
George S, Pankhurst L, Hubbard A, Votintseva A, Stoesser N, Sheppard AE, Mathers A, Norris R, Navickaite I, Eaton C, Iqbal Z, Crook DW, Phan HTT (2017) Resolving plasmid structures in Enterobacteriaceae using the MinION nanopore sequencer: assessment of MinION and MinION/Illumina hybrid data assembly approaches. Microb Genom 3:e000118. https://doi.org/10.1099/mgen.0.000118
Godiska R et al (2010) Linear plasmid vector for cloning of repetitive or unstable sequences in Escherichia coli. Nucleic Acids Res 38:e88. https://doi.org/10.1093/nar/gkp1181
Gossen M, Bujard H (1992) Tight control of gene expression in mammalian cells by tetracycline-responsive promoters. Proc Natl Acad Sci USA 89:5547–5551. https://doi.org/10.1073/pnas.89.12.5547
Gossen M, Freundlieb S, Bender G, Muller G, Hillen W, Bujard H (1995) Transcriptional activation by tetracyclines in mammalian cells. Science 268:1766–1769. https://doi.org/10.1126/science.7792603
Grabczyk E, Usdin K (1999) Generation of microgram quantities of trinucleotide repeat tracts of defined length, interspersion pattern, and orientation. Anal Biochem 267:241–243. https://doi.org/10.1006/abio.1998.2962
Green KM et al (2017) RAN translation at C9orf72-associated repeat expansions is selectively enhanced by the integrated stress response. Nat Commun 8:2005. https://doi.org/10.1038/s41467-017-02200-0
Green KM, Linsalata AE, Todd PK (2016) RAN translation-what makes it run? Brain Res 1647:30–42. https://doi.org/10.1016/j.brainres.2016.04.003
Hardiman O, van den Berg LH, Kiernan MC (2011) Clinical diagnosis and management of amyotrophic lateral sclerosis. Nat Rev Neurol 7:639–649. https://doi.org/10.1038/nrneurol.2011.153
He F, Todd PK (2011) Epigenetics in nucleotide repeat expansion disorders. Semin Neurol 31:470–483. https://doi.org/10.1055/s-0031-1299786
Hodges JR, Piguet O (2018) Progress and challenges in frontotemporal dementia research: a 20-year review. J Alzheimers Dis 62:1467–1480. https://doi.org/10.3233/JAD-171087
Iacoangeli A et al (2019) C9orf72 intermediate expansions of 24–30 repeats are associated with ALS. Acta Neuropathol Commun 7:115. https://doi.org/10.1186/s40478-019-0724-4
Jeffreys AJ, Wilson V, Thein SL (1985) Individual-specific 'fingerprints' of human DNA. Nature 316:76–79. https://doi.org/10.1038/316076a0
Jiang SW, Trujillo MA, Eberhardt NL (1996) An efficient method for generation and subcloning of tandemly repeated DNA sequences with defined length, orientation and spacing. Nucleic Acids Res 24:3278–3279. https://doi.org/10.1093/nar/24.16.3278
Labbadia J, Morimoto RI (2013) Huntington's disease: underlying molecular mechanisms and emerging concepts. Trends Biochem Sci 38:378–385. https://doi.org/10.1016/j.tibs.2013.05.003
Laccone F, Maiwald R, Bingemann S (1999) A fast polymerase chain reaction-mediated strategy for introducing repeat expansions into CAG-repeat containing genes. Hum Mutat 13:497–502. https://doi.org/10.1002/(SICI)1098-1004(1999)13:6%3c497:AID-HUMU10%3e3.0.CO;2-6
Liao HH (1991) Effect of temperature on the expression of wild-type and thermostable mutants of kanamycin nucleotidyltransferase in Escherichia coli. Protein Expr Purif 2:43–50
Mamedov TG et al (2008) A fundamental study of the PCR amplification of GC-rich DNA templates. Comput Biol Chem 32:452–457. https://doi.org/10.1016/j.compbiolchem.2008.07.021
Mankodi A et al (2000) Myotonic dystrophy in transgenic mice expressing an expanded CUG repeat. Science 289:1769–1773
Matsuura T, Ashizawa T (2002) Polymerase chain reaction amplification of expanded ATTCT repeat in spinocerebellar ataxia type 10. Ann Neurol 51:271–272. https://doi.org/10.1002/ana.10049
Meyer DE, Chilkoti A (2002) Genetically encoded synthesis of protein-based polymers with precisely specified molecular weight and sequence by recursive directional ligation: examples from the elastin-like polypeptide system. Biomacromol 3:357–367
Miga KH, Eisenhart C, Kent WJ (2015) Utilizing mapping targets of sequences underrepresented in the reference assembly to reduce false positive alignments. Nucleic Acids Res 43:e133. https://doi.org/10.1093/nar/gkv671
Mirkin SM (2007) Expandable DNA repeats and human disease. Nature 447:932–940. https://doi.org/10.1038/nature05977
Mitchell JD, Borasio GD (2007) Amyotrophic lateral sclerosis. Lancet 369:2031–2041. https://doi.org/10.1016/S0140-6736(07)60944-1
Mizielinska S et al (2014) C9orf72 repeat expansions cause neurodegeneration in Drosophila through arginine-rich proteins. Science 345:1192–1194. https://doi.org/10.1126/science.1256800
Musso M, Bocciardi R, Parodi S, Ravazzolo R, Ceccherini I (2006) Betaine, dimethyl sulfoxide, and 7-deaza-dGTP, a powerful mixture for amplification of GC-rich DNA sequences. J Mol Diagn 8:544–550. https://doi.org/10.2353/jmoldx.2006.060058
Nguyen L, Cleary JD, Ranum LPW (2019) Repeat-associated non-ATG translation: molecular mechanisms and contribution to neurological disease. Annu Rev Neurosci 42:227–247. https://doi.org/10.1146/annurev-neuro-070918-050405
Ohshima K, Kang S, Larson JE, Wells RD (1996) Cloning, characterization, and properties of seven triplet repeat DNA sequences. J Biol Chem 271:16773–16783. https://doi.org/10.1074/jbc.271.28.16773
Ordway JM, Detloff PJ (1996) vitro synthesis and cloning of long CAG repeats. Biotechniques 21(609–610):612. https://doi.org/10.2144/96214bm08
Paulson H (2018) Repeat expansion diseases. Handb Clin Neurol 147:105–123. https://doi.org/10.1016/B978-0-444-63233-3.00009-9
Pingoud A, Jeltsch A (2001) Structure and function of type II restriction endonucleases. Nucleic Acids Res 29:3705–3727. https://doi.org/10.1093/nar/29.18.3705
Pingoud A, Wilson GG, Wende W (2014) Type II restriction endonucleases—a historical perspective and more. Nucleic Acids Res 42:7489–7527. https://doi.org/10.1093/nar/gku447
Renton AE et al (2011) A hexanucleotide repeat expansion in C9ORF72 is the cause of chromosome 9p21-linked ALS-FTD. Neuron 72:257–268. https://doi.org/10.1016/j.neuron.2011.09.010
Roewer L (2013) DNA fingerprinting in forensics: past, present, future. Investig Genet 4:22. https://doi.org/10.1186/2041-2223-4-22
Rohilla KJ, Gagnon KT (2017) RNA biology of disease-associated microsatellite repeat expansions. Acta Neuropathol Commun 5:63. https://doi.org/10.1186/s40478-017-0468-y
Rowland LP, Shneider NA (2001) Amyotrophic lateral sclerosis. N Engl J Med 344:1688–1700. https://doi.org/10.1056/NEJM200105313442207
Seznec H et al (2001) Mice transgenic for the human myotonic dystrophy region with expanded CTG repeats display muscular and brain abnormalities. Hum Mol Genet 10:2717–2726. https://doi.org/10.1093/hmg/10.23.2717
Shoubridge C, Gecz J (2012) Polyalanine tract disorders and neurocognitive phenotypes. Adv Exp Med Biol 769:185–203
Stueber D, Bujard H (1982) Transcription from efficient promoters can interfere with plasmid replication and diminish expression of plasmid specified genes. EMBO J 1:1399–1404
Szybalski W, Kim SC, Hasan N, Podhajska AJ (1991) Class-IIS restriction enzymes—a review. Gene 100:13–26. https://doi.org/10.1016/0378-1119(91)90345-c
Thys RG, Wang YH (2015) DNA replication dynamics of the GGGGCC repeat of the C9orf72. Gene J Biol Chem 290:28953–28962. https://doi.org/10.1074/jbc.M115.660324
Verkerk AJ et al (1991) Identification of a gene (FMR-1) containing a CGG repeat coincident with a breakpoint cluster region exhibiting length variation in fragile X syndrome. Cell 65:905–914. https://doi.org/10.1016/0092-8674(91)90397-h
Weischenfeldt J, Symmons O, Spitz F, Korbel JO (2013) Phenotypic impact of genomic structural variation: insights from and for human disease. Nat Rev Genet 14:125–138. https://doi.org/10.1038/nrg3373
Wen X et al (2014) Antisense proline-arginine RAN dipeptides linked to C9ORF72-ALS/FTD form toxic nuclear aggregates that initiate in vitro and in vivo neuronal death. Neuron 84:1213–1225. https://doi.org/10.1016/j.neuron.2014.12.010
Wick RR, Judd LM, Holt KE (2019) Performance of neural network basecalling tools for Oxford Nanopore sequencing. Genome Biol 20:129. https://doi.org/10.1186/s13059-019-1727-y
Zarei S et al (2015) A comprehensive review of amyotrophic lateral sclerosis. Surg Neurol Int 6:171. https://doi.org/10.4103/2152-7806.169561
Zhao XN, Usdin K (2015) The repeat expansion diseases: The dark side of DNA repair. DNA Repair (Amst) 32:96–105. https://doi.org/10.1016/j.dnarep.2015.04.019
Zhao T, Hong Y, Li XJ, Li SH (2016) Subcellular clearance and accumulation of huntington disease protein: a mini-review. Front Mol Neurosci 9:27. https://doi.org/10.3389/fnmol.2016.00027
Zu T et al (2011) Non-ATG-initiated translation directed by microsatellite expansions. Proc Natl Acad Sci USA 108:260–265. https://doi.org/10.1073/pnas.1013343108
Acknowledgements
We thank T.R. Murphy of the Hamilton-Brehm laboratory (SIU) for assistance in MinION data analysis.
Funding
This work was supported by an ALS Association grant to K.T.G., a Judith and Jean Pape Adams Charitable Foundation grant to K.T.G., and a Department of Defense ALSRP grant to K.T.G.
Author information
Authors and Affiliations
Contributions
KJR designed and executed RDL cloning experiments, performed data analyses, and wrote the manuscript. KNO designed and executed RDL cloning experiments, performed data analyses, and assisted in manuscript preparation. AAP performed and analyzed MinION sequencing, developed the bioinformatics workflow, and assisted in manuscript preparation. MB executed RDL cloning experiments. AJH assisted with RDL cloning and PCR experiments. KTG designed, planned and supervised all experiments, interpreted results, and wrote the manuscript.
Corresponding author
Ethics declarations
Conflict of interest
The authors declare that they have no competing interests.
Ethics approval and consent to participate
Not applicable.
Consent for publication
Not applicable.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Rohilla, K.J., Ovington, K.N., Pater, A.A. et al. Systematic microsatellite repeat expansion cloning and validation. Hum Genet 139, 1233–1246 (2020). https://doi.org/10.1007/s00439-020-02165-z
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00439-020-02165-z