Deleterious amino acid polymorphisms in Arabidopsis thaliana and rice
- 444 Downloads
Plant genetic diversity has been mainly investigated with neutral markers, but large-scale DNA sequencing projects now enable the identification and analysis of different classes of genetic polymorphisms, such as non-synonymous single nucleotide polymorphisms (nsSNPs) in protein coding sequences. Using the SIFT and MAPP programs to predict whether nsSNPs are tolerated (i.e., effectively neutral) or deleterious for protein function, genome-wide nsSNP data from Arabidopsis thaliana and rice were analyzed. In both species, about 20% of polymorphic sites with nsSNPs were classified as deleterious; they segregate at lower allele frequencies than tolerated nsSNPs due to purifying selection. Furthermore, A. thaliana accessions from marginal populations show a higher relative proportion of deleterious nsSNPs, which likely reflects differential selection or demographic effects in subpopulations. To evaluate the sensitivity of predictions, genes from model and crop plants with known functional effects of nsSNPs were inferred with the algorithms. The programs predicted about 70% of nsSNPs correctly as tolerated or deleterious, i.e., as having a functional effect. Forward-in-time simulations of bottleneck and domestication models indicated a high power to detect demographic effects on nsSNP frequencies in sufficiently large datasets. The results indicate that nsSNPs are useful markers for analyzing genetic diversity in plant genetic resources and breeding populations to infer natural/artificial selection and genetic drift.
KeywordsMAPP Prediction Artificial Selection Demographic History Selection Coefficient Lower Allele Frequency
We are grateful to the IPK bioinformatics group for assistance with the computer cluster and to two anonymous reviewers for their comments. This work was supported by core funding from the Institute of Plant Genetics and Crop Plant Research (IPK) Gatersleben, and the Swedish University of Agricultural Sciences (SLU) Uppsala.
- Clark RM, Schweikert G, Toomajian C, Ossowski S, Zeller G, Shinn P, Warthmann N, Hu TT, Fu G, Hinds DA, Chen H, Frazer KA, Huson DH, Scholkopf B, Nordborg M, Ratsch G, Ecker JR, Weigel D (2007) Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana. Science 317(5836):338–342CrossRefPubMedGoogle Scholar
- Gepts P, Papa R (2002) Evolution during domestication. In: Encyclopedia of life sciences. Wiley, Chichester. http://www.els.net/
- Li WH (1997) Molecular evolution. Sinauer Associates, SunderlandGoogle Scholar
- Nordborg M, Hu TT, Ishino Y, Jhaveri J, Toomajian C, Zheng H, Bakker E, Calabrese P, Gladstone J, Goyal R, Jakobsson M, Kim S, Morozov Y, Padhukasahasram B, Plagnol V, Rosenberg NA, Shah C, Wall JD, Wang J, Zhao K, Kalbfleisch T, Schulz V, Kreitman M, Bergelson J (2005) The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol 3(7):e196CrossRefPubMedGoogle Scholar
- Oka H (1988) Origin of cultivated rice. Japan Scientific Societies Press, Tokyo, Elsevier, AmsterdamGoogle Scholar
- Sekine KT, Ishihara T, Hase S, Kusano T, Shah J, Takahashi H (2006) Single amino acid alterations in Arabidopsis thaliana RCY1 compromise resistance to Cucumber mosaic virus, but differentially suppress hypersensitive response-like cell death. Plant Mol Biol 62(4):669–682CrossRefPubMedGoogle Scholar
- TIGR (2007) Rice genome annotation, vol 5. http://www.tigr.org/tdb/rice