Introduction

Repeat expansion disorders are a class of genetic diseases that are caused by expansions in DNA repeats. The DNA repeats come in various sizes from single nucleotides to dodecamers or longer. The threshold at which the repeat expansions become symptomatic varies with the specific disease. There are over 40 distinct diseases now known to be caused by these expansions in DNA sequence. Remarkable progress during the last three decades has defined causative mutations that drive pathogenesis in a large number of these diseases. Expansion of CAG, GCG, CTG, CGG, and CAAA repeats both in coding and non-coding sequences in distinct genes results in a diverse group of diseases with mechanisms linked to protein levels or toxicity, RNA, and/or both (Fig. 1).

Fig. 1
figure 1

The repeat expansion disorders, a class of genetic diseases that are caused by expansions DNA repeats. The location of the expansion occurs in exonic and intronic regions of the gene

As a polyglutamine (polyQ) expansion disease researcher, I was drawn to the field two decades ago when Drs. Hayden, Bredesen, and Wellington and I discovered that the majority of disease-causing polyQ proteins were substrates for caspases and that this pathway has a critical role in the mechanisms of neuronal cell death observed in patients [1,2,3,4,5,6,7,8,9]. Most importantly the community of patients who have diseases caused by expansions of DNA repeats and their families continue to inspire us to move forward with new discoveries and therapeutics treatments.

In this issue of Neurotherapeutics, an international group of experts review the molecular mechanisms and therapeutics for many of these diseases. Highlights from the reviews are summarized below and presented in order of the discovery of the causative mutation underlying these diseases (see Fig. 1).

Spinal and Bulbar Muscular Atrophy/Kennedy’s Disease

In 1991, the cause of Kennedy’s disease was shown to be a CAG expansion in the androgen receptor (AR) gene [10]. Spinal and bulbar muscular atrophy (SBMA) is a slow progressive neuromuscular disorder in which the lower motor neurons and muscles degenerate. SBMA is X-linked and therefore mainly affecting males, with some exceptions discussed in the Arnold and Merry review [11]. The symptoms include gynecomastia, testicular atrophy, and reduced fertility, all which correlate with the known function of AR as a transcription factor binding androgen hormones. AR is involved in the male reproductive and skeletal system and female fertility. Given the known function of AR, the normal lifespan of SBMA patients, and the rapid progress in the field, it is surprising that after almost thirty years of investigation, we do not have a therapeutic treatment for this disease. Merry reviews the complexity of the underlying pathogenesis of this disorder and highlights steps that may lead to therapeutics for SBMA/Kennedy’s disease [11].

Huntington’s Disease

In 1993, the cause of Huntington’s disease was found to be a CAG expansion in exon 1 of the huntingtin gene (HTT) [12]. The disease protein contains a polyglutamine expansion in the N-terminal region of the Huntingtin protein (HTT). Among the polyQ diseases, HD is the best studied. Several key questions in the field that continue to provide clues for therapeutic approaches in the field are [1] what are the cell-autonomous and non-cell autonomous mechanisms that are triggered by the mutant protein?; [2] what are the key molecular mechanisms that drive the disease and how can we appropriately model the human condition?; [3] what is the basis of preferentially vulnerability of distinct neuronal populations?; and finally, how does DNA repair mechanism impact HD and other DNA repeat expansions diseases? Maiuri et al. describe the emerging importance of DNA repair mechanisms in HD and other repeat expansion disorders [13]. Creus-Muncunill and Ehrlich review the detailed and elegant studies directed at understanding the cell-autonomous and non-autonomous pathogenic mechanisms in HD [14]. Naphade et al. describe the rapid progress using human patient-induced pluripotent stem cells to model HD and other polyQ diseases [15].

SCA1

The genetic cause of Spinocerebellar ataxia type 1 (SCA1) was reported in 1993 [16]. SCA1 is an autosomal-dominant disorder characterized by neurodegeneration of the cerebellum, spinal cord, and brainstem and has a polyQ expansion in the ataxin-1 protein. Srinivasan and Shakkottai review the advances in understanding SCA1 disease biology and the challenges in the field to developing tailored therapeutics for the distinct SCAs [17].

SCA3

Spinocerebellar ataxia type 3 (SCA3), also known as Machado-Joseph disease (MJD), is caused by a polyQ expansion in the ataxin-3 protein [18]. Ataxin-3 is a ubiquitin ligase and Da Silva et al. present a unifying molecular mechanism for disease pathogenesis focusing on the disruption of protein homeostasis [19]. This molecular mechanism is common to all polyQ diseases and key networks are likely shared between the diseases.

Friedreich Ataxia

Friedreich ataxia (FRDA) serves as an exciting example of the rapid progress made in therapeutics. In FRDA, the GAA.TTC triplet repeat sequence results in transcriptional silencing of the frataxin gene [20]. Frataxin is a mitochondrial protein and is a component of a multiprotein complex that assembles iron-sulfur clusters. Gottesfeld reviews the molecular mechanisms of FRDA and the numerous interesting approaches to treat the disease [21]. These include excision or correction of the frataxin gene by genome engineering, gene activation with small molecules, protein replacement, and small molecules identified in cellular screens. The therapeutic approaches for FRDA set the stage for many of the repeat expansion disorders.

SCA2

In 1996, the CAG expansion in ataxin-2 gene was discovered to cause Spinocerebellar ataxia type 2 (SCA2) [22,23,24]. Egorova and Bezprozvanny describe the epidemiology, the function of ataxin-2 in RNA metabolism and stress granules, the molecular changes underlying Purkinje cell loss, the cerebellar-thalamic cortical circuit dysregulation and ataxia, and molecular mechanisms that may be therapeutically targeted [25].

SCA7

Spinocerebellar ataxia type 7 (SCA7) is caused by a CAG expansion in the ataxin-7 gene and is characterized by neuronal loss in the cerebellum, brainstem, and retina. The major symptoms are cerebellar ataxia and blindness. The ataxin-7 protein is a subunit of the multiprotein SAGA complex which is involved in chromatin remodeling and there is now a detailed molecular understanding of how the polyQ expansion in ataxin-7 disrupts neuronal function. Niewiadomska-Cimicka and Trottier describe the molecular functions of ataxin-7 and detail current approaches to treat the disease [26].

SCA17

SCA17 is caused by a CAG/CAA repeat expansion in the gene encoding the TATA box-binding protein (TBP) [27]. TBP is a well-characterized transcription factor. Liu et al. discuss the normal function TBP, the transcriptional alterations caused by the polyQ expansion in TBP and thoughtfully review the current candidate drugs and reducing the expression of the disease genes by antisense oligonucleotides or CRISPR-Cas9 for the treatment of SCA17 [28].

SCA31

SCA31 is an adult onset neurological disorder with progression cerebellar ataxia caused by degenerating Purkinje cell in the Japanese population. The genetics of SCA31 was elucidated in 2009 [29]. There is 2.5–3.8 kb insertion of the penta-nucleotide repeat (TGGAA)n stretch in the introns of thymidine kinase 2 (TK) and BEAN (brain expressed, associated with Nedd4). As described by Drs. Ishikawa and Nagai, an emerging mechanism of RNA toxicity is apparent for this disease [30].

C9orf72 ALS/FTD

In 2011, a hexanucleotide repeat expansion in the first intron of the C9orf72 gene was identified as a case of amyotrophic lateral sclerosis (ALS) and frontotemporal dementia (FTD). Given its relatively recent discovery, there has been rapid progress in identifying key pathogenic events such as the gain of toxicity from bidirectional transcribed repeat-containing RNAs, nucleocytoplasmic transport defects common to a number of the expansion diseases, and how the expansion causes two different diseases. These advances have led to phase I clinical trials using antisense oligonucleotide therapies. Drs. Jiang and Ravits review the exciting developments in our understanding of the pathogenic mechanisms for C9orf72 ALS/FTD [31].

Summary and Outlook

It has been over three decades since the discovery of the first repeat expansion disease and the scientific community has provided a rich body of research that has elucidated the mechanisms that drive disease progression. All the reviews highlight how our understanding of the normal function of the disease-causing protein has evolved and the disruption of the function or gain of function provides valuable information for the development of therapies for these diseases. The discovery of new forms of repeat expansion diseases continues to fuel the field (Fig. 1). X-linked dystonia-parkinsonism (XDP) is associated with a SINE-VNTR-Alu (SVA)-type retrotransposon insertion in the TAF1 gene (TATA-box binding protein associated factor 1) which has a hexameric DNA repeat expansion, (CCCTCT)n, the length of which varies among patients and is inversely correlated to age of disease onset [32]. For many of the repeat expansion diseases, the use of antisense oligonucleotides or CRISPR-Cas9 for the treatment of these diseases offers a promising area of neurotherapeutics.