Abstract
Little is known about variation of nucleotide insertion/deletions (indels) within species. In Arabidopsis thaliana, we investigated indel polymorphism patterns between two genome sequences and among 96 accessions at 1215 loci. Our study identified patterns in the variation of indel density, size, GC content and distribution, and a correlation between indels and substitutions. We found that the GC content in indel sequences was lower than that in non-indel sequences and that indels typically occur in regions with lower GC content. Patterns of indel frequency distribution among populations were more consistent with neutral expectation than substitution patterns. We also found that the local level of substitutions is positively correlated with indel density and negatively correlated with their distance to the closed indel, suggesting that indels play an important role in nucleotide variation.
Similar content being viewed by others
References
Barker MD (1989) High-frequency homologous recombination between duplicate chromosomal immunoglobulin mu heavy-chain constant regions. Mol Cell Biol 9:5500–5507
Batley J, Barker G, O’Sullivan H, Edwards KJ, Edwards D (2003) Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence. Plant Physiol 132:84–91
Berger J, Suzuki T, Senti K, Stubbs J, Schaffner G, Dickson BJ (2001) Genetic mapping with SNP markers in Drosophila. Nat Genet 29:475–481
Bhangale TR, Rieder MJ, Livingston RJ, Nickerson DA (2005) Comprehensive identification and characterization of diallelic insertion–deletion polymorphisms in 330 human candidate genes. Hum Mol Genet 14:59–69
Britten RJ, Rowen L, Williams J, Cameron RA (2003) Majority of divergence between closely related DNA samples is due to InDels. Proc Natl Acad Sci USA 100:4661–4665
Chan SK, Hsing M, Hormozdiari F, Cherkasov A (2007) Relationship between insertion/deletion (indel) frequency of proteins and essentiality. BMC Bioinformatics 8:227
Crombach A, Hogeweg P (2007) Chromosome rearrangements and the evolution of genome structuring and adaptability. Mol Biol Evol 24(5):1130–1139
DeBerardinis RJ, Goodier JL, Ostertag EM, Kazazian HH Jr (1998) Rapid amplification of a retrotransposon subfamily is evolving the mouse genome. Nat Genet 20(3):288–290
Ding J, Araki H, Wang Q, Zhang P, Tian D et al (2007) Highly asymmetric rice genomes. BMC Genomics 8:154
Garcia-Diaz M, Kunkel TA (2006) Mechanism of a genetic glissando*: structural biology of InDel mutations. Trends Biochem Sci 31:206–214
Gregory TR (2004) Insertion–deletion biases and the evolution of genome size. Gene 324:15–34
Hammarlund M, Davis MW, Nguyen H, Dayton D, Jorgensen EM (2005) Heterozygous insertions alter crossover distribution but allow crossover interference in Caenorhabditis elegans. Genetics 171:1047–1056
Hardison RC, Roskin KM, Yang S, Diekhans M, Kolbe D et al (2003) Covariation in frequencies of substitution, deletion, transposition, and recombination during eutherian evolution. Genome Res 131:13–26
Hinds DA, Kloek AP, Jen M, Chen X, Frazer KA (2006) Common deletions and SNPs are in linkage disequilibrium in the human genome. Nat Genet 38:82–85
Jander G, Norris SR, Rounsley SD, Bush DF, Last RL et al (2002) Arabidopsis map-based cloning in the post-genome era. Plant Physiol 129:440–450
Kondrashov AS (2003) Direct estimates of human per nucleotide mutation rates at 20 loci causing Mendelian diseases. Hum Mutat 21:12–27
Kondrashov FA, Rogozin IB, Wolf YI, Koonin EV (2002) Selection in the evolution of gene duplication. Genome Biol 3:research0008.1-0008.9
Levin I (1999) Relating statistics and experimental design: an introduction. Sage Publications, Thousand Oaks
Levy S, Sutton G, Ng PC, Feuk L, Venter JC et al (2007) The diploid genome sequence of an individual human. PLoS Biol 5:e254
Mills RE, Luttig CT, Larkins CE, Beauchamp A, Devine SE et al (2006) An initial map of insertion and deletion (INDEL) variation in the human genome. Genome Res 16(9):1182–1190
Mitchell-Olds T, Schmitt J (2006) Genetic mechanisms and evolutionary significance of natural variation in Arabidopsis. Nature 441:947–952
Nordborg M, Hu TT, Ishino Y, Jhaveri J, Zheng H et al (2005) The pattern of polymorphism in Arabidopsis thaliana. PLoS Biol 3:e196
Pearson CE, Edamura KN, Cleary JD (2005) Repeat instability: mechanisms of dynamic mutations. Nat Rev Genet 6:729–742
Petrov DA, Sangster TA, Johnston JS, Hartl DL, Shaw KL (2000) Evidence for DNA loss as a determinant of genome size. Science 287:1060–1062
Rogers SO, Bendich AJ (1985) Extraction of DNA from milligram amounts of fresh, herbarium and mummified plant tissues. Plant Mol Biol 5:69–76
Schwartz S, Kent WJ, Smit A, Zhang Z, Miller W et al (2003) Human-mouse alignments with BLASTZ. Genome Res 13:103–107
Sun T, Gao Y, Tan W, Ma S, Lin D et al (2007) A six-nucleotide insertion–deletion polymorphism in the CASP8 promoter is associated with susceptibility to multiple cancers. Nat Genet 39:605–613
Tajima F (1989) Statistical method for testing the neutral mutation hypothesis by DNA polymorphism. Genetics 123:585–595
Tenaillon MI, Sawkins MC, Long AD, Gaut RL, Gaut BS et al (2001) Patterns of DNA sequence polymorphism along chromosome 1 of maize (Zea mays ssp. mays L.). Proc Natl Acad Sci USA 98:9161–9166
The Chimpanzee Sequencing and Analysis Consortium (2005) Initial sequence of the chimpanzee genome and comparison with the human genome. Nature 437:69–87
Wicks SR, Yeh RT, Gish WR, Waterston RH, Plasterk RHA (2001) Rapid gene mapping in Caenorhabditis elegans using a high density polymorphism map. Nat Genet 28:160–164
Yang S, Jiang K, Araki H, Ding J, Tian D et al (2007) A molecular isolation mechanism associated with high intra-specific diversity in rice. Gene 394:87–95
Acknowledgments
This research was supported by NSFC (30570987) to D.T.
Author information
Authors and Affiliations
Corresponding author
Additional information
Communicated by Y. Van de Peer.
W. Zhang and X. Sun have contributed equally to this work.
Electronic supplementary material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zhang, W., Sun, X., Yuan, H. et al. The pattern of insertion/deletion polymorphism in Arabidopsis thaliana . Mol Genet Genomics 280, 351–361 (2008). https://doi.org/10.1007/s00438-008-0370-1
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s00438-008-0370-1