Abstract
With the development of high-throughput sequencing techniques, transcriptome sequencing projects which provide valuable resources for designing simple sequence repeat (SSR) primers have been carried out for many plants. However, the utility of SSRs for molecular breeding depends on genome-wide distribution and coverage, as well as moderately high allelic variability, in the available SSR library. In this study, we characterized the exonic SSRs developed from the publicly available Populus genome as a case study to determine their value for molecular breeding. As expected, our results confirmed that microsatellites occurred approximately three times less often in coding regions than in non-coding regions. Mutability test also showed that exonic SSRs contained less allelic variability than intronic SSRs. More importantly, exonic SSRs were unevenly distributed both among and within chromosomes. Large exonic SSRs deserts were observed on several chromosomes. Differential selection between paralogous chromosomes, at the gene level, appears to be responsible for these SSR deserts, though the mechanisms that cause chromosome-specific SSR deserts are not known. This work provides ample evidence that the candidate gene approach based on unigenes identified from transcribed sequences may not be the best strategy to identify highly polymorphic SSRs.
Similar content being viewed by others
References
Adamsa KL, Wendel JF (2005) Novel patterns of gene expression in polyploid plants. Trends Genet 21:539–543
Adamsa KL, Percifield R, Wendel JF (2004) Organ-specific silencing of duplicated genes in a newly synthesized cotton allotetraploid. Genetics 168:2217–2226
Blanc G, Wolfe KH (2004) Functional divergence of duplicated genes formed by polyploidy during Arabidopsis evolution. Plant Cell 16:1679–1691
Castillo A, Budak H, Varshney RK, Dorado G, Graner A, Hernandez P (2008) Transferability and polymorphism of barley EST-SSR markers used for phylogenetic analysis in Hordeum chilense. BMC Plant Biol 8:97–105
Jewell E, Robinson A, Savage D et al (2006) SSR primer and SSR taxonomy tree: biome SSR discovery. Nucl Acids Res 34:656–659
Kashi Y, King D, Soller M (1997) Simple sequence repeats as a source of quantitative genetic variation. Trends Genet 13:74–78
Keim P, Beavis W, Schupp J, Freestone R (1992) Evaluation of soybean RFLP marker diversity in adapted germplasm. Theor Appl Genet 85:205–212
Khanal S, Tang S, Beilinson V, Miguel PS et al (2008) ESTs are a rich source of polymorphic SSRs for genomics and molecular breeding applications in peanut [abstract]. In: Proceedings of the plant and animal genome XVI conference. San Diego, California, p 135
Li YC, Korol AB, Beiles A et al (2002) Microsatellites: genomic distribution, putative functions and mutational mechanisms: a review. Mol Ecol 11:2453–2465
Lijoi A, Mena RH, Prünster I (2007) A Bayesian nonparametric method for prediction in EST analysis. BMC Bioinformatics 8:339–359
Lothe RA (1997) Microsatellite instability in human solid tumors. Mol Med Today 3:61–68
Lyons E, Freeling M (2008) How to usefully compare homologous plant genes and chromosomes as DNA sequences. Plant J 53:661–673
Reddy PS, Housman DE (1997) The complex pathology of trinucleotide repeats. Curr Opin Cell Bio 9:364–372
Ren Y, Zhang Z, Liu J, Staub JE, Han Y et al (2009) An integrated genetic and cytogenetic map of the cucumber genome. PLoS ONE 4(6):e5795
Smith EC (1943) A study of cytology and speciation in the genus Populus L. J Arnold Arbor 24:275–305
Susko E, Roger RJ (2004) Estimating and comparing the rates of gene discovery and expressed sequence tag (EST) frequencies in EST surveys. Bioinformatics 20:2279–2287
Tautz D, Trick M, Dover GA (1986) Cryptic simplicity in DNA is a major source of genetic variation. Nature 322:652–656
Toth G, Gaspari Z, Jurka J (2000) Microsatellites in different eukaryotic genomes: survey and analysis. Genome Res 10:967–981
Tuskan GA, DiFazio SP, Hellsten U et al (2006) The genome of black cottonwood, Populus trichocarpa (Torr. & Gray ex Brayshaw). Science 313:1596–1604
Voorrips RE (2002) MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered 93:77–78
Wang JPZ, Lindsay BG, Cui LY et al (2005) Gene capture prediction and overlap estimation in EST sequencing from one or multiple libraries. BMC Bioinformatics 6:300–310
Yang XH, Kalluri UC, Jawdy S et al (2008) Comparative analysis of F-Box proteins in arabidopsis, poplar and rice suggests differential proteolytic pathway mediated through F-Box protein between woody perennial and herbaceous annual plants. Plant Physiol 148:1189–1200
Yin TM, DiFazio SP, Gunter LE, Riemenschneider D, Tuskan GA (2004) Large-scale heterospecific segregation distortion in Populus revealed by a dense genetic map. Theo Appl Genet 109:451–463
Yin TM, Zhang XY, Gunter L et al (2009) Microsatellite primers resource developed from the mapped sequence scaffolds of Nisqually-1 genome. New Phytol 181:498–503
Acknowledgments
We thank Han Jingling, Zhenghong Liu and Faying Yuan at Nanjing Forestry University for their help in sample collection. Special thanks go to the editor and anonymous reviewers for their help in formulating the revision. Funds for this research were provided by Natural Science Foundation of China (30971609) and Forestry Nonprofit Project (200904002). This work is also supported by the US DOE BioEnergy Science Center. The BioEnergy Science Center is a US Department of Energy Bioenergy Research Center supported by the Office of Biological and Environmental Research in the DOE Office of Science.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Below is the link to the electronic supplementary material.
11032_2010_9413_MOESM1_ESM.xls
Table 1. Distribution test of exonic SSRs in the 2-Mb sliding windows along the Populus chromosomes. The valid reading length is the length of A, T, G, C designation in each window. Distribution test is performed by the Poisson calculation at P ≤ 0.05 and P ≤ 0.01 significance levels. “**+” Significantly overabundant at P ≤ 0.01; “*+” significantly overabundant at P ≤ 0.05; “**−” significantly sparse at P ≤ 0.01; “*−” significantly sparse at P ≤ 0.05 (XLS 78 kb)
11032_2010_9413_MOESM2_ESM.xls
Table 2. Exploring the loss of exonic SSR repeat motifs on the paralogous segments between chromosome I and chromosome III. Note: The two chromosomes share two duplicated segments; lines without shadow correspond to exonic SSRs in the upper duplicated region on chromosome I, whereas lines in gray shadow correspond to the exonic SSRs in the lower duplicated region on chromosome I (XLS 201 kb)
11032_2010_9413_MOESM3_ESM.xls
Table 3. Summary of primer information and amplification results in SSR variability test. In this table, the primer name is designated as “P” followed by “number of chromosome ID”, followed by “_”, and ends with “the start base of left primer”. Columns I to AF contain the amplification results of the tested primer pairs. In these columns, “M/M” stand for the missing data; “No amp” stand for no amplification products. The corresponding gene model and their definition are listed in the last two columns (XLS 165 kb)
Rights and permissions
About this article
Cite this article
Li, S., Yin, T., Wang, M. et al. Characterization of microsatellites in the coding regions of the Populus genome. Mol Breeding 27, 59–66 (2011). https://doi.org/10.1007/s11032-010-9413-5
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11032-010-9413-5