Summary
Reports of single base-pair mutations within gene coding regions causing human genetic disease were collated. Thirty-five per cent of mutations were found to have occurred within CpG dinucleotides. Over 90% of these mutations were C → T or G → A transitions, which thus occur within coding regions at a frequency 42-fold higher than that predicted from random mutation. These findings are consistent with methylation-induced deamination of 5-methyl cytosine and suggest that methylation of DNA within coding regions may contribute significantly to the incidence of human genetic disease.
Similar content being viewed by others
References
Antonarakis SE, Kazazian H, Orkin SH (1985a) DNA polymorphism and molecular pathology of the human globin gene clusters. Hum Genet 69:1–14
Antonarakis SE, Waber PG, Kittur SD, Patel AS, Kazazian HH, Mellis MA, Counts RB, Stamatoyannopoulos G, Bowie W, Fass DN, Pittman DD, Wozney JM, Toole JJ (1985b) Hemophila A; detection of molecular defects and of carriers by DNA analysis. N Engl J Med 313:842–848
Antonarakis SE, Youssoufian H, Kazazian HH (1987) Molecular genetics of hemophilia A in man (factor VIII deficiency). Mol Biol Med 4:81–84
Barker D, Schäfer M, White R (1984) Restriction sites containing CpG show a higher frequency of polymorphism in human DNA. Cell 36:131–138
Bentley AK, Rees DJG, Rizza C, Brownlee GG (1986) Defective propeptide processing of blood clotting factor IX caused by mutation of arginine to glutamine at position-4. Cell 45:343–348
Bird AP (1980) DNA methylation and the frequency of CpG in animal DNA. Nucleic Acids Res 8:1499–1504
Bird AP (1986) CpG-rich islands and the function of DNA methylation. Nature 321:209–213
Bird A, Taggart M, Frommer M, Miller OJ, Macleod D (1985) A fraction of the mouse genome that is derived from islands of nonmethylated, CpG-rich DNA. Cell 40:91–99
Bird AP, Taggart MH, Nichols RD, Higgs DR (1987) Non-methylated CpG-rich islands at the human α-globin locus: implications for evolution of the α-globin pseudogene. EMBO J 6:999–1004
Bonthron DT, Markham AF, Ginsburg D, Orkin SH (1985) Identification of a point mutation in the adenosine deaminase gene responsible for immunodeficiency. J Clin Invest 76:894–897
Brown WRA, Bird AP (1986) Long-range restriction site mapping of mammalian genomic DNA. Nature 322:477–481
Bullock E, Elton RA (1972) Dipeptide frequencies in proteins and the CpG deficiency in vertebrate DNA. J Mol Evol 1:315–325
Chan SJ, Seino S, Gruppuso PA, Schwartz R, Steiner DF (1987) A mutation in the β chain coding region is associated with impaired proinsulin conversion in a family with hyperproinsulinaemia. Proc Natl Acad Sci USA 84:2194–2197
Cladaras C, Hadzopoulou-Cladaras M, Felber BK, Pavlakis G, Zannis VI (1987) The molecular basis of a familial Apo E deficiency. J Biol Chem 262:2310–2315
Cohn DH, Byers PH, Steinman B, Gelinas RE (1986) Lethal osteogenesis imperfecta resulting from a single nucleotide change in one human pro α1 (I) collagen allele. Proc Natl Acad Sci USA 83:6045–6047
Cooper DN (1983) Eukaryotic DNA methylation. Hum Genet 64:315–333
Cooper DN, Gerber-Huber S (1985) DNA methylation and CpG suppression. Cell Differ 17:199–205
Cooper DN, Schmidtke J (1984) DNA restriction fragment length polymorphisms and heterozygosity in the human genome. Hum Genet 66:1–16
Cooper DN, Schmidtke J (1986) Diagnosis of genetic disease using recombinant DNA. Hum Genet 73:1–11
Cooper DN, Schmidtke J (1987) Diagnosis of genetic disease using recombinant DNA. Supplement. Hum Genet 77:66–75
Cooper DN, Taggart MH, Bird AP (1983) Unmethylated domains in vertebrate DNA. Nucleic Acids Res 11:647–658
Cooper DN, Smith BA, Cooke HJ, Niemann S, Schmidtke J (1985) An estimate of unique DNA sequence heterozygosity in the human genome. Hum Genet 69:201–205
Cooper DN, Gerber-Huber S, Nardelli D, Schubiger J-L, Wahli W (1987) The distribution of the dinucleotide CpG and cytosine methylation in the vitellogenin gene family. J Mol Evol 25:107–115
Coulondre C, Miller JH, Farabaugh PJ, Gilbert W (1978) Molecular basis of base substitution hotspots in Escherichia coli. Nature 274:775–780
Daar IO, Artymiuk PJ, Phillips DC, Maquat LE (1986) Human triose-phosphate isomerase deficiency: a single amino acid substitution results in a thermolabile enzyme. Proc Natl Acad Sci USA 83:7903–7907
Davis LM, McGraw RA, Ware JL, Roberts HR, Stafford DW (1987) Factor IX alabama: a point mutation in a clotting protein results in hemophilia B. Blood 69:140–143
De Verneuil H, Grandchamp B, Beaumont C, Picat C, Nordmann Y (1986) Uroporphyrinogen decarboxylase structural mutant (Gly281 → Glu) in a case of porphyria. Science 234:732–734
DiLella AG, Marvit J, Lidsky AS, Guttler F, Woo SLC (1986) Tight linkage between a splicing mutation and a specific DNA haplotype in phenylketonuria. Nature 322:799–803
DiLella AG, Marvit J, Brayton K, Woo SLC (1987) An amino-acid substitution involved in phenylketonuria is in linkage disequilibrium with DNA haplotype 2. Nature 327:333–336
Duchange N, Chassé J-F, Cohen GN, Zakin NM (1986) Antithrombin III Tours gene: identification of a point mutation leading to an arginine → cysteine replacement in a silent deficiency. Nucleic Acids Res 14:2408
Gardiner-Garden M, Frommer M (1987) CpG islands in vertebrate genomes. J Mol Biol 196:261–282
Gitschier J, Wood WL, Tuddenham EGD, Shuman MA, Goralka TM, Chen EY, Lawn RM (1985) Detection and sequence of mutations in the factor VIII gene of haemophiliacs. Nature 315:427–430
Gitschier J, Wood WI, Shuman MA, Lawn RM (1986) Identification of a missense mutation in the factor VIII gene of a mild hemophiliac. Science 232:1415–1416
Grippo P, Iaccarino M, Parisi E, Scarano E (1968) Methylation of DNA in developing sea urchin embryos. J Mol Biol 36:195–208
Haneda M, Chan SJ, Kwok SCM, Rubenstein AH, Steiner DF (1983) Studies on mutant human insulin genes: identification and sequence analysis of a gene encoding [Ser B24] insulin. Proc Natl Acad Sci USA 80:6366–6370
Josse J, Kaiser AD, Kornberg A (1961) Enzymatic synthesis of deoxyribonucleic acid. VIII. Frequencies of nearest neightbor base sequences in deoxyribonucleic acid. J Biol Chem 236:864–875
Keshet I, Lieman-Hurwitz J, Cedar H (1986) DNA methylation affects the formation of active chromatin. Cell 44:535–543
Kidd VJ, Wallace RB, Itakura K, Woo SLC (1983) 155-1 deficiency detection by direct analysis of the mutation in the gene. Nature 304:230–234
Law SW, Brewer HB (1985) Tangier disease: the complete mRNA sequence encoding for preproapo-A1. J Biol Chem 260:12810–12814
Lehrman MA, Goldstein JL, Brown MS, Russell DW, Schneider WJ (1985) Internalization-defective LDL receptors produced by genes with nonsense and frameshift mutations that truncate the cytoplasmic domain. Cell 41:735–743
Li Q, Powers PA, Smithies O (1985) Nucleotide sequence of 16 kilobase pairs of DNA 5′ to the human ɛ-globin gene. J Biol Chem 260:14901–14910
Lindsay S, Bird AP (1987) Use of restriction enzymes to detect potential gene sequences in mammalian DNA. Nature 327:336–338
Long I (1987) Structure and evolution of the human genes encoding protein C and coagulation factor IX. J Cell Biochem 33:185–190
Maeda S, Mita S, Araki S, Shimada K (1986) Structure and expression of the mutant prealbumin gene associated with familial amyloidotic polyneuropathy. Mol Biol Med 3:329–338
Nukiwa T, Satoh K, Brantly ML, Ogushi F, Fells GA, Courtney M, Crystal RG (1986) Identification of a second mutation in the protein-coding sequence of the Z type alpha 1-anti-trypsin gene. J Biol Chem 261:15989–15994
Nussinov R (1981) Eukaryotic dinucleotide preference rules and their implications for degenerate codon usage. J Mol Biol 149:125–131
Razin A, Szyf M (1984) DNA methylation patterns: formation and function. Biochim Biophys Acta 782:331–342
Rees DJG, Rizza CR, Brownlee GG (1985) Haemophilia B caused by a point mutation in a donor splice junction of the human factor IX gene. Nature 316:643–645
Romeo G, Hassan HJ, Staempfli S, Roncuzzi L, Cianetti L, Leonardi A, Vincente V, Mannucci PM, Bertina R, Peschle C, Cortese R (1987) Hereditary thrombophilia: identification of nonsense and missense mutations in the protein C gene. Proc Natl Acad Sci USA 84:2829–2832
Savatier P, Trabuchet G, Fauré C, Chebloure Y, Gouy M, Verdier G, Nigon VM (1985) Evolution of the primate beta-globin gene region. High rate of variation in CpG dinucleotides and in short repeated sequences between man and chimpanzee. J Mol Biol 182:21–29
Selker EV, Stevens JN (1985) DNA methylation at asymetric sites is associated with numerous transition mutations. Proc Natl Acad Sci USA 82:8114–8118
Shibasaki Y, Kawakami T, Kanazawa Y, Akanuma Y, Takaku F (1985) Post-translational cleavage of proinsulin is blocked by a point mutation in familial hyperproinsulinemia. J Clin Invest 76:378–380
Shoelson S, Haneda M, Blix P, Nanjo A, Sanke T, Inouye K, Steiner D, Rubenstein A, Tager H (1983) Three mutant insulins in man. Nature 302:540–543
Stavnezer-Nordgren J, Kekish O, Zegers BJM (1985) Molecular defects in a human immunoglobulin K chain deficiency. Science 230:458–461
Tsuji S, Choudary PV, Martin BM, Stubblefield BK, Mayor JA, Barranger JA, Ginns EI (1987) A mutation in the human glucocerebrosidase gene in neuronopathic Gaucher's disease. N Engl J Med 316:570–575
Valerio D, Dekker BMM, Duyvesteyn MGC, Voorn L van der, Berkvens TM, Ormondt H van, Eb AJ van der (1986) One adenosine deaminase allele in a patient with severe combined immunodeficiency contains a point mutation abolishing enzyme activity. EMBO J 5:113–119
Vogel F, Kopun M (1977) Higher frequencies of transitions among point mutations. J Mol Evol 9:159–180
Wallace MR, Dwulet FE, Conneally PM, Benson MD (1986) Biochemical and molecular genetic characterization of a new variant prealbumin associated with hereditary amyloidosis. J Clin Invest 78:6–12
Wang RY-H, Kuo KC, Gehrke CW, Huang L-H, Ehrich M (1982) Heat and alkali-induced deamination of 5-methylcytosine and cytosine residues in DNA. Biochim Biophys Acta 697:371–377
Williams SR, Gekeler V, Mclvor RS, Martin DW (1987) A human purine nucleoside phosphorylase deficiency caused by a single base change. J Biol Chem 262:2332–2338
Youssoufian H, Kazazian HH, Phillips DG, Aronis S, Tsiftis G, Brown VA, Antonarakis SE (1986) Recurrent mutations in haemophilia A give evidence for CpG mutation hotspots. Nature 324:380–382
Author information
Authors and Affiliations
Rights and permissions
About this article
Cite this article
Cooper, D.N., Youssoufian, H. The CpG dinucleotide and human genetic disease. Hum Genet 78, 151–155 (1988). https://doi.org/10.1007/BF00278187
Received:
Revised:
Issue Date:
DOI: https://doi.org/10.1007/BF00278187