An efficient method for developing polymorphic microsatellite markers from high-throughput transcriptome sequencing: a case study of hexaploid oil-tea camellia (Camellia oleifera)
- 181 Downloads
The bottleneck of microsatellite marker development is to determine polymorphisms of microsatellite markers. A large amount of microsatellites can be detected via high-throughput sequencing. However, most previous studies didn’t fully use the high-throughput sequencing data to predict number of alleles at microsatellite loci. Instead, laborious experiments were performed for manually screening microsatellite loci, finding out number of alleles at each microsatellite loci and selecting those with polymorphisms for marker development. In this study, we improved the method for efficient development of polymorphic microsatellite markers from high-throughput transcriptome sequencing, using hexaploid oil-tea camellia as a case study. Leaf transcriptomes were sequenced of eight wild oil-tea camellia samples at different altitudes in Jinggang and Lu Mountains, China. Microsatellites were directly identified in the sequencing reads and primers were designed. Strategies were designed to filtering duplicate and multi-locus markers. For each marker, number of alleles cross samples was predicted and length of the potentially amplifiable sequence was estimated. 153 predicted polymorphic markers were selected and empirically validated in the eight samples. Sixty five markers (42%) were polymorphic (2–12 alleles) and 31 (20%) were highly polymorphic (6–12 alleles). The empirical number of alleles was generally higher than the predicted number of alleles but they were significantly correlated. The predicted allele length was among the empirical allele length range. Compared with most previous studies, the method shows a higher efficiency for developing polymorphic markers and filtering duplicate and multi-locus markers. The polymorphic microsatellite markers developed can be used for analyzing the genetic diversity of oil-tea camellia.
KeywordsCamellia oleifera High throughput sequencing Microsatellite marker development Polymorphism Simple sequence repeat Short tandem repeat
We thank Dr. Marinus J. M. Smulders for comments on the data analysis. This work was supported by the National Natural Science Foundation of China (NSFC Grant No. 31460072) and the “Gan-Po Talent 555” Project of Jiangxi Province, China.
Compliance with ethical standards
Conflict of interest
The authors declare no conflict of interest.
- Hall TA (1999) BioEdit: a user-friendly biological sequence alignment editor and analysis program for Windows 95/98/NT. Nucleic Acids Symp Ser 41:95–98Google Scholar
- Ming TL (2000) Monograph of the genus Camellia. Yunnan Science and Technology Press, KunmingGoogle Scholar
- Rong J, Xu S, Meirmans PG, Vrieling K (2013) Dissimilarity of contemporary and historical gene flow in a wild carrot (Daucus carota) metapopulation under contrasting levels of human disturbance: implications for risk assessment and management of transgene introgression. Ann Bot 112:1361–1370CrossRefPubMedPubMedCentralGoogle Scholar
- Rozen S, Skaletsky HJ (2000) Primer3 on the WWW for general users and for biologist programmers. In: Krawetz S, Misener S (eds) Bioinformatics methods and protocols: methods in molecular biology. Human Press, Totowa, pp 365–386Google Scholar
- Sundaram RM, Naveenkumar B, Biradar SK, Balachandran SM, Mishra B, IlyasAhmed M, Viraktamath BC, Ramesha MS, Sarma NP (2008) Identification of informative SSR markers capable of distinguishing hybrid rice parental lines and their utilization in seed purity assessment. Euphytica 163:215–224CrossRefGoogle Scholar
- Zhuang RL (2008) Oil-tea camellia in China, 2nd edn. China Forestry Publishing House, BeijingGoogle Scholar