Abstract
The pattern of spontaneous mutation can be inferred from the pattern of substitution in pseudogenes, which are known to be under very weak or no selective constraint. We modified an existing method (Gojobori T, et al., J Mol Evol 18:360, 1982) to infer the pattern of mutation in bacteria by using 569 pseudogenes from Mycobacterium leprae. In Gojobori et al.’s method, the pattern is inferred by using comparisons involving a pseudogene, a conspecific functional paralog, and an outgroup functional ortholog. Because pseudogenes in M. leprae are unitary, we replaced the missing paralogs by functional orthologs from M. tuberculosis. Functional orthologs from Streptomyces coelicolor served as outgroups. We compiled a database consisting of 69,378 inferred mutations. Transitional mutations were found to constitute more than 56% of all mutations. The transitional bias was mainly due to C→T and G→A, which were also the most frequent mutations on the leading strand and the only ones that were significantly more frequent than the random expectation. The least frequent mutations on the leading strand were A→T and T→A, each with a relative frequency of less than 3%. The mutation pattern was found to differ between the leading and the lagging strands. This asymmetry is thought to be the cause for the typical chirochoric structure of bacterial genomes. The physical distance of the pseudogene from the origin of replication (ori) was found to have almost no effect on the pattern of mutation. A surprising similarity was found between the mutation pattern in M. leprae and previously inferred patterns for such distant taxa as human and Drosophila. The mutation pattern on the leading strand of M. leprae was also found to share some common features with the pattern inferred for the heavy strand of the human mitochondrial genome. These findings indicate that taxon-specific factors may only play secondary roles in determining patterns of mutation.
Similar content being viewed by others
References
Altschul SF, Gish W, Miller W, Myers EW, Lipman DJ (1990) Basic local alignment search tool. J Mol Biol 215:403–410
Andersson JO, Andersson SGE, (1999) Genome degradation is an ongoing process in Rickettsia. Mol Biol Evol 16:1178–1191
Brown JR, Douady CJ, Italia MJ, Marshall WE, Stanhope MJ (2001) Universal trees based on large combined protein sequence data sets. Nature Genet 28:281–285
Bulmer M. (1991) Strand symmetry of mutation rates in the b-globin region. J Mol Evol 33:305–310
Clayton DA (1982) Replication of animal mitochondrial DNA. Cell 28:693–705
Cole ST, Brosch R, Parkhill J, et al. (1998) Deciphering the biology of Mycobacterium tuberculosis from the complete genome sequence. Nature 393:537–544
de Miranda AB, Alvarex-Valin F, Jabbari K, Degrave WM, Bernardi G (2000) Gene expression, amino acid conservation, and hydrophobicity are the main factors shaping codon preferences in Mycobacterium tuberculosis and Mycobacterium leprae. J Mol Evol 50:45–55
EyreWalker A (1997) Problems with parsimony in sequences of biased base composition. J Mol Evol 47:686–690
Forsdyke DR (1995) Relative roles of primary sequence and (G+C)% in determining the hierarchy of frequencies of complementary trinucleotide pairs in DNAs of different species. J Mol Evol 41:573–581
Francino MP, Chao L, Riley MA, Ochman H (1996) Asymmetries generated by transcription-coupled repair in enterobacterial genes. Science 272:107–109
Cole ST, Eiglmeier K, Parkhill J, et al. (2001) Massive gene decay in the leprosy bacillus. Nature 409:1007–1011
Frank AC, Lobry JR (2000) ORILOC: Prediction of replication boundaries in unannotated bacterial chromosomes. Bioinformatics 16:560–561
Galtier N, Lobry JR (1997) Relationships between genomic G+C content, RNA secondary structures, and optimal growth temperature in prokaryotes. J Mol Evol 44:632–636
Galtier N, Piganeau G, Mouchiroud D, Duret L, (2001) GC-content evolution in mammalian genomes: The biased gene conversion hypothesis. Genetics 159:907–911
Gojobori T, Li WH, Graur D (1982) Patterns of nucleotide substitution in pseudogenes and functional genes. J Mol Evol 18:360–369
Graur D, Li WH (2000) Fundamentals of molecular evolution. Sinauer Associates, Sunderland, MA
Hoppe FM (ed) (1993) Multiple comparisons, selections and applications in biometry. Marcel Dekker, New York
Kreutzer DA, Essigmann JM (1998) Oxidized, deaminated cytosines are a source of C transitions in vivo. Proc Natl Acad Sci USA 95:3578–3582
Lawrence JG, Hendrix RW, Casjens S (2001) Where are the pseudogenes in bacterial genomes? Trends Microbiol 9:535–540
Li WH, Wu CI, Luo CC (1984) Nonrandomness of point mutation as reflected in nucleotide substitutions in pseudogenes and its evolutionary implications. J Mol Evol 21:58–71
Lindahl T (1993) Instability and decay of the primary structure of DNA. Nature 362:709–715
Liu Y, Harrison PM, Kunin V, GersteinM (2004) Comprehensive analysis of pseudogenes in prokaryotes: Widespread gene decay and failure of putative horizontally transferred genes. Genome Biol 5:R64
Lobry JR (1996) Asymmetric substitution patterns in the two DNA strands of bacteria. Mol Biol Evol 13:660–665
Mira A, Ochman H (2002) Gene location and bacterial sequence divergence. Mol Evol 19:1350–1358
Morton BR, Clegg MT (1993) A chloroplast DNA mutational hotspot and gene conversion in a noncoding region near rbcL in the grass family (Poaceae). Curr Genet 24:357–365
Ochman H (2003) Neutral substitutions in bacterial genomes. Mol Biol Evol 20:2091–2096
Parkhill JM, Sebaihia A, Preston LD, et al. (2003) Comparative analysis of the genome sequences of Bordetella pertussis, Bordetella parapertussis and Bordetella bronchiseptica. Nature Genet 35:32–40
Petrov DA, Hartl DL (1999) Patterns of nucleotide substitution in Drosophila and mammalian genomes. Proc Natl Acad Sci USA 96:1475–1479
Razin A, Riggs AD (1980) DNA methylation and gene-function. Science 210:604–610
Rocha EPC, Danchin A (2001) Ongoing evolution of strand composition in bacterial genomes. Mol Biol Evol 18:1789–1799
Rocha EPC, Danchin A Viari A (1999) Universal replication biases in bacteria. Mol Microbiol 32:11–16
Salazar L, Fsihi H, deRossi E, Riccardi G, Rios C, Cole ST, Takiff HE (1996) Organization of the origins of replication of the chromosomes of Mycobacterium smegmatis, Mycobactenum leprae and Mycobacterium tuberculosis and isolation of a functional origin from M. smeginatis. Mol Microbiol 20:283–293
Sharp PM, Shields DC, Wolfe KH, Li WH (1989) chromosomal location and evolutionary rate variation in enterobacterial Genes. Science 246:808–810
Sueoka N (2003) Intrastrand parity rules of DNA base composition and usage biases of synonymous codons. J Mol Evol 40:318–325
Tajima F, Nei M (1982) Biases of the estimation of DNA divergence obtained by the restriction enzyme technique. Mol Biol Evol 18:115–120
Tamura F, Nei M (1993) Estimation of the number of nucleotide substitutions in the control region of mitochondrial-DNA in humans and chimpanzees. Mol Biol Evol 10:512–526
Tanaka M, Ozawa T (1994) Strand asymmetry in human mitochondrial-DNA mutations. Genomics 22:327–335
Thompson JD, Higgins DG, Gibson TJ (1994) Clustal W: Improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. Nucleic Acids Res 22:4673–4680
Zar JH (1999) Biostatisticial analysis. Prentice–Hall, Upper Saddle River, NJ
Acknowledgments
This study was supported in part by a grant from the Israel Science Foundation.
Author information
Authors and Affiliations
Corresponding author
Additional information
[Reviewing Editor:Dr. Dmitri Petrov]
Electronic Supplementary Material
Rights and permissions
About this article
Cite this article
Mitchell, A., Graur, D. Inferring the Pattern of Spontaneous Mutation from the Pattern of Substitution in Unitary Pseudogenes of Mycobacterium leprae and a Comparison of Mutation Patterns Among Distantly Related Organisms. J Mol Evol 61, 795–803 (2005). https://doi.org/10.1007/s00239-004-0235-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00239-004-0235-0