Comparative Genetics of Functional Trinucleotide Tandem Repeats in Humans and Apes
Received: 28 November 2003; Accepted: 21 January 2004
Cite this article as: Andrés, A.M., Soldevila, M., Lao, O. et al. J Mol Evol (2004) 59: 329. doi:10.1007/s00239-004-2628-5

Abstract
Several human neurodegenerative disorders are caused by the expansion of polymorphic trinucleotide repeat regions. Many of these loci are functional short tandem repeats (STRs) located in brain-expressed genes, so their study is relevant from both a medical and an evolutionary point of view. Our study has two aims: to infer the comparative pattern of variation and evolution of this set of loci, in order to identify species-specific features of these STRs and of their potential for expansion (and thereby gain insight into evolutionary medicine), and to determine whether any human-specific feature can be identified in brain-expressed genes involved in human disease. We analyzed the variability of the normal range of seven expanding CAG/CTG STR loci (SCA1, SCA2, SCA3-MJD, SCA6, SCA8, SCA12, and DRPLA) and two nonexpanding polymorphic CAG loci (KCNN3 and NCOA3) in humans, chimpanzees, gorillas, and orangutans. The study showed general conservation of the repetitive tract and of the polymorphism in the four species, and high heterogeneity among the distributions of the different loci. Humans present slightly larger alleles than the other species, but a more relevant difference appears in variability levels: humans are the species with the largest variance, although only for the expanding loci, which suggests a relationship between variability levels and expansion potential. The sequence analysis shows high levels of sequence conservation among species, a lack of correspondence between interruption patterns and variability levels, and signs of conservative selective pressure for some of the STR loci. Only two loci (SCA1 and SCA8) show a human-specific distribution, with larger alleles than in the other species. This could account, at the same time, for a human-specific trait and for a predisposition to disease through expansion.
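The comparison of allele size and variability across species described above can be illustrated with a minimal Python sketch. It is not the authors' analysis pipeline: the function names `allele_summary` and `most_variable_species` are hypothetical, and the repeat counts in `toy_locus` are invented numbers chosen only to make the arithmetic easy to follow, not data from the study.

```python
from statistics import mean, variance


def allele_summary(repeat_counts):
    """Return (mean, sample variance) of repeat counts for one species at one locus."""
    return mean(repeat_counts), variance(repeat_counts)


def most_variable_species(samples):
    """Return the species whose repeat counts have the largest sample variance.

    `samples` maps a species name to a list of repeat counts at a single locus.
    """
    return max(samples, key=lambda species: variance(samples[species]))


# Hypothetical repeat counts at one locus (invented for illustration only).
toy_locus = {
    "human": [20, 24, 28, 32, 36],
    "chimpanzee": [22, 23, 24, 25, 26],
    "gorilla": [21, 22, 23, 24, 25],
    "orangutan": [19, 20, 21, 22, 23],
}

for species, counts in toy_locus.items():
    avg, var = allele_summary(counts)
    print(f"{species}: mean repeats = {avg}, variance = {var}")

print("largest variance:", most_variable_species(toy_locus))
```

In practice such a summary would be computed separately for each locus, since the abstract reports that the excess of human variance holds only for the expanding loci.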
Keywords: Spinocerebellar ataxia; Trinucleotide repeat expansion; Short tandem repeat evolution; Primates
This article contains online supplementary material.