Sources and predictors of resolvable indel polymorphism assessed using rice as a model
The principal sources of genetic variation that can be assayed with restriction enzymes are base substitutions and insertions/deletions (indels). The likelihood of detecting indels as restriction fragment length polymorphisms (RFLPs) is determined by the size and frequency of the indels, and the ability to resolve small indels as RFLPs is limited by the distribution of restriction fragment sizes. In this study, we use aligned sequences from the indica and japonica subspecies of rice ( Oryza sativa L.) to quantify and compare the ability of restriction enzymes to detect indels. We look specifically at two abundant transposable element-derived indel sources: miniature inverted repeat transposable elements (MITEs) and long terminal repeat (LTR) retroelements. From this analysis we conclude that indels rather than base substitutions are the prevailing source of the polymorphism detected in rice. We show that, although MITE derived indels are more abundant than LTR-retroelement derived indels, LTR-retroelements have a greater capacity to generate visible restriction fragment length polymorphism because of their larger size. We find that the variation in the detectability of indels among restriction enzymes can be explained by differences in the frequency and dispersion of their restriction sites in the genome. The parameters that describe the fragment size distributions obtained with the restriction enzymes are highly correlated across the sequenced genomes of rice, Arabidopsis and human, with the exception of some extreme deviations in frequency for particular recognition sequences corresponding to variations in the levels and modes of DNA methylation in the three disparate organisms. Thus, we can predict the relative ability of a restriction enzyme to detect indels derived from a specific source based on the distribution of restriction fragment sizes, even when this is estimated for a distantly related genome.
KeywordsTransposable elements Oryza sativa L. Rice Insertions/Deletions (Indels) Restriction Fragment Length Polymorphisms (RFLPs)
Supplementary Table 1 For each commercially available enzyme, the percent of the total aligned sequence in restriction fragments detecting polymorphism due to: all indels, LTR-retroelement derived indels, and MITE derived indels
Supplementary Table 2 Parameters describing the distributions of restriction fragments
- Bernaola-Galvan P, Roman-Roldan R, Oliver JL (1996) Compositional segmentation and long-range fractal correlations in DNA sequences. Physical Rev E. Statistical Physics, Plasmas, Fluids, and Related Interdisciplinary Topics 53:5181–5189Google Scholar
- McCouch SR, Kochert G, Yu ZH, Wang ZY, Khush GS (1988) Molecular mapping of rice chromosomes. Theor Appl Genet 76:815–829Google Scholar
- Second G (1982) Origin of the gene diversity of cultivated rice ( Oryza sativa L.): study of the polymorphism scored at 40 isoenzyme loci. Jpn J Genet 57:25–57Google Scholar