Abstract
The fingerprinting of 742 potato landraces with 51 simple sequence repeat (SSR, or microsatellite) markers resulted in improving a previously constructed potato genetic identity kit. All SSR marker loci were assayed with a collection of highly diverse landraces of all species of cultivated potato with ploidies ranging from diploid to pentaploid. Loci number, amplification reproducibility, and polymorphic information content were recorded. Out of 148 SSR markers of which 30 are new, we identified 58 new SSR marker locations on at least one of three potato genetic linkage maps. These results permitted the selection of a new potato genetic identity kit based on 24 SSR markers with two per chromosome separated by at least 10 cM, single locus, high polymorphic information content, and high quality of amplicons as determined by clarity and reproducibility. The comparison of a similarity matrix of 742 landraces obtained with the 24 SSR markers of the new kit and with the entire dataset of 51 SSR markers showed a high correlation (r = 0.94) by a Mantel test and even higher correlations (r = 0.99) regarding topological comparisons of major branches of a neighbor joining tree. This new potato genetic identity kit is able to discriminate 93.5% of the 742 landraces compared to 98.8% with 51 SSR markers. In addition, we made a marker-specific set of allele size standards that conveniently and unambiguously provide accurate sizing of all alleles of the 24 SSR markers across laboratories and platforms. The new potato genetic identity kit will be of particular utility to standardize the choice and allele sizing of microsatellites in potato and aid in collaborative projects by allowing cumulative analysis of independently generated data.
Similar content being viewed by others
Introduction
Simple sequence repeats (SSRs, or microsatellites) have been used to great advantage in potato for studies of diversity, genetic structure, and classification (Spooner et al. 2007); tracing germplasm migrations (Spooner et al. 2005a; Rios et al. 2007); fingerprinting (Moisan-Thiery et al. 2005; Provan et al. 1996; Schneider and Douches 1997); genetic linkage mapping (Ghislain et al. 2001; Feingold et al. 2005); establishment of core collections (Ghislain et al. 2006); and investigations of duplicate collections across genebanks (Del Rio et al. 2006). Although not yet used in potato they have potential applications in studies of linkage disequilibrium (Remington et al. 2001; Stich et al. 2005) and gene flow (Devaux et al. 2005; Fenart et al. 2007). They require considerable developmental costs and often have maximum utility within a narrow range of germplasm from which they were developed. Once developed, however, they have tremendous advantages over many other marker classes to include low operational costs, codominance, hypervariablity, high quality bands, highly reproducible bands, amenability to automation, ease of multiplexing, and use with low quality DNA (Spooner et al. 2005b).
Over 200 potato SSRs have been identified through enriched genomic libraries and database searches of expressed sequence tags (ESTs) (Milbourne et al. 1998; Ashkenazi et al. 2001; Ghislain et al. 2004; Feingold et al. 2005). However, many more are becoming available as ESTs are being identified. The latest SSR summary statistics from the former The Institute for Genomic Research (TIGR) document more than 5,800 sequences with potentially useful SSR (repeats of 2–6 nucleotides) markers for potato. These SSRs differ greatly, however, in quality (clarity and repeatability of bands), map location, and polymorphism. Some of them have been tested on potato landraces and advanced varieties mapped on various potato genetic maps. However, an extensive analysis on a large collection of potato SSRs was lacking. Ghislain et al. (2004) provided the first such analysis of 156 SSRs for quality and polymorphism, chose 22 of them by a combination of the above criteria in cultivated potato, showed how some of these could be multiplexed, and mapped them.
The purpose of the present study is to screen additional potato SSRs from all taxonomic groups of potato to refine a selection of microsatellites for maximum utility in a cultivated potato background. Such large data set is available from a previous study aiming at classifying cultivated potato (Spooner et al. 2007). Its wide genetic diversity makes this data set particularly valuable for our purpose.
Materials and methods
Plant materials and DNA extraction
Seven hundred and forty-two native (landrace) potatoes belonging to a composite genotyping set of potato at the International Potato Center (CIP) were used for this study. These landraces were selected to represent all four species and taxonomic groups of potato as described above, and represent the same used in the taxonomic study of Spooner et al. (2007) and are described in the supporting dataset 1. Genomic DNA was obtained using standard protocol derived from Doyle and Doyle (1990). DNA concentration was estimated by using a TBS-380 Fluorometer (Turner BioSystems, USA) with PicoGreen® reagent and 500 ng/ml salmon DNA as reference.
Microsatellite markers and PCR conditions
Eighty-eight SSR markers were obtained from four sources: (1) 22 belong to the previously identified potato genetic identity (PGI) kit (Ghislain et al. 2004), (2) 13 from ESTs developed at the Scottish Crop Research Institute (Milbourne et al. 1998), (3) 30 identified at CIP using the potato EST database of the former The Institute for Genomic Research (http://www.tigr.org/), and (4) 23 from the University of Idaho (Feingold et al. 2005).
PCR reactions were performed in a 10 μl volume containing 100 mM Tris–HCl (Sigma), 20 mM (NH4)2SO4 (Merck), 2.5 mM MgCl2 (Merck), 0.2 mM of each dNTP (Amersham), 25 pM of 700 or 800 IRDye-labeled M13 forward primer (LI-COR), 22 pM M13-tailed forward SSR primer (Invitrogen), 15 pM reverse SSR primer (Invitrogen), 1 unit of Taq polymerase, and 15 ng of genomic DNA. PCR was carried out in a PTC-100 or PTC-200 thermocycler (MJ Research Inc.) using the following cycling profiles: 4 min at 94°C; 33 cycles of 1 min at 94°C, 1 min at annealing temperature (T a) determined experimentally for each SSR primer combination and 1 min at 72°C; with a final extension step of 4 min at 72°C. Blue Stop solution (#830-05630, LI-COR, USA) in a ratio of 1:1 was added to the PCR reaction before loading. PCR products were separated by electrophoresis on a 4300 LI-COR DNA Analyzer system. We sized alleles with the IRDye 50–350 bp fragment size ladder (LI-COR, USA). SSR alleles were detected and scored using the SAGA Generation 2 software (LI-COR, USA).
Mapping new SSR markers
Previously mapped and new SSR markers were mapped on at least one of three segregating diploid populations from which genetic maps were developed: the PD population (Ghislain et al. 2001), BCT population (Bonierbale et al. 1988) or the PCC1 population (Villamón et al. 2005). A total of 148 SSR markers were used in the present map effort and are provided in the supporting dataset 2. The segregation data of 27 SSR marker alleles located on the BCT genetic map was provided to us by the research group (Feingold et al. 2005). Marker alleles segregated as 1:1 ratio whereas skewed markers were rejected using the threshold value established for each genetic map (goodness of fit χ2 test). Null alleles were not considered. Linkage analysis of marker alleles segregating from the respective source parent was performed using JoinMap 3.0 (Stam 1993) with a LOD score of 3.
Polymorphic information content and matrix comparison
SSR marker alleles were scored for presence or absence of the band for all 742 genotypes and treated as dominant marker. The polymorphic information content (PIC) was calculated as PIC = 1−∑(p 2 i ), where p i is the frequency of the ith allele detected in all individuals of the population (Nei 1973). In addition, the ability of a refined set of SSRs chosen here to discriminate a large dataset were compared to a neighbor-joining analysis of Spooner et al. (2007), who analyzed 742 accessions with 50 SSRs by neighbor joining in DARwin software 4.0 (http://darwin.cirad.fr/darwin/Home.php), to which we added one SSR (STM0019). For this analysis, similarity matrixes were calculated using Jaccard’s coefficient and the comparison of the similarity matrixes were performed using the Mantel matrix-correspondence test in the MXCOMP option of the NTSYS 2.02h software (Sokal and Rohlf 1995). Correlations were conducted of the three main branches of this tree (the “bitter potato” (S. ajanhuiri, S. curtilobum, and S. juzepczukii) cluster, the diploid cluster, and the polyploid cluster), using correlation statistics in Microsoft Excel: ρ X,Y = cov (X,Y)/σXσY, where ρ X,Y is the correlation coefficient (r), cov (X,Y) is the co-variance of X and Y, σ X is the standard deviation of X, and σ y is the standard deviation of Y.
Construction of new potato SSR fragment size ladders
We initially used a pUC18 sequencing reaction or a IRDye-50–350 size standard in our LI-COR DNA Analyzer System as a fragment size ladder. To make our new kit easily applicable to the cultivated potato germplasm base across all platforms, we constructed new size ladders for each of the 24 primer pairs. We examined allele sizes from our database and selected genotypes displaying a range of sizes based on the following three criteria: (1) good separation among the alleles (>3 bp), (2) choice of allele/genotype combinations highlighting the high-frequency alleles encountered in our screening studies, and (3) the presence of the minimum and maximum size of the range of alleles, when possible. Genomic DNA was obtained from the DNA bank of CIP. Amplification products were obtained using standard protocol for SSR markers from CIP (Ghislain et al. 2004). Optimization of PCR conditions were conducted through temperature gradient PCR experiment for optimal annealing temperature, and appropriate number of amplification cycles to obtain good gel resolution of the bands. Electrophoreses to separate amplified products were performed using denaturant 6% poly-acrylamide gels and a silver stain protocol to reveal the bands (Ghislain et al. 2004).
Results
Genetic mapping
The 30 new candidate SSR markers from the TIGR database were surveyed for polymorphism in the PD, BCT and PPC1 mapping populations (details are included in the supporting dataset 2). Two markers were monomorphic in all populations and three markers displayed a skewed segregation from the expected 1:1 ratio. The remaining 25 SSR markers could be mapped in one or two of the three populations using a LOD score of 3. The use of three segregating populations allowed us to identify 33 new map locations of 29 SSR markers not previously mapped. An integrated map was built with the three maps using a map integration function based on mean recombination frequencies and combined LOD scores of the selected sets of loci from each chromosome (Fig. 1). Four markers (STI0012, STM0019, STM0037, STM1053) of the PGI kit were monomorphic in all three segregating population tested and hence were included graphically based on published maps. Only six out of 157 map locations (STG0023, STG0027a, STM0038, STM2022, STM3009, STM51145) produced conflicting map locations on the integrated map and hence were not included. This map represents the most complete SSR potato map developed to date with 138 mapped potato SSR markers at 147 map locations (excluding the four placed graphically and the six conflicting).
Polymorphic information content and matrix comparisons
We analyzed 742 potato landraces of all four cultivated potato species: S. tuberosum Group Andigenum and Group Chilotanum, S. ajanhuiri, S. curtilobum, and S. juzepczukii with 56 SSR markers; 22 from the prior PGI kit (Ghislain et al. 2004) and the 34 most useful of the remaining 66 SSR markers based on marker quality as observed visually on gels. Out of these, five SSR markers appeared to be multi loci based on exceeding allele number considering the ploidy of the plant sample and these were not considered further.
Data obtained with the remaining 51 SSR markers on the 742 potato landraces were analyzed for polymorphic information content (PIC). Considering the cultivated potato a single gene pool (Spooner et al. 2007), markers were scored across all cultivar groups with different ploidies. PIC values per SSR marker ranged from 0.250 to 0.884 while the number of alleles per locus ranged from 2 to 21 (Table 1).
The discriminatory capacity of the SSR markers was analyzed by comparing a similarity matrix generated on 742 genotypes analyzed with 8, 16, 24, 32, 42 and 51 SSR markers ranked by their PIC value (Fig. 2). The results indicated that the 24 SSR markers with the highest PIC values provided nearly identical similarity matrices with one generated with the 32 highest PIC values (r = 0.97). In total, 93.5% of the genotypes can be discriminated using the selected 24 SSR markers (Table 1). However, comparison of placement of accessions within the three main branches of the 742 accession neighbor-joining tree of Spooner et al. (2007) were correlated at r = 0.99, suggesting that the 6.5% of the accessions not absolutely discriminated using the 24 markers are all so similar as to have little effect on major groups discovered in phenetic or phylogenetic analyses. These results lead us to propose 24 (Table 1) as an appropriate number of SSR markers for a new PGI kit.
Selection of the new PGI kit
The most informative 24 SSR markers for genotyping potato landraces were selected based on quality criteria, genome coverage, and locus-specific information content. We selected two SSR markers per chromosome with a linkage distance at least 10 cM except for chromosome VII, where markers STM0031 and STI0033 were separated by only 3 cM due to lack of alternative markers with a high PIC value (Fig. 1).
New potato SSR fragment size ladders
To construct a fragment size ladder for each SSR marker, we chose the alleles that were: (1) high-frequency, (2) covered the range of allele sizes, (3) well-separated alleles while avoiding those giving overlapping bands due to stuttering, and (4) displayed by a minimum number of landraces. These provide effective size ladders for easy extrapolation of alleles not part of the size kit. Annealing temperatures had to be verified and for few cases adjusted. Final DNA concentrations of the selected genotype range between 3 and 12 ng/μl to be mixed to produce the ladder (Table 2). We succeed in identifying a maximum of four (e.g., STM0019) genotypes to have good coverage of allele sizes. The number of alleles for each of the 24 SSR fragment size standard ranged from three (STM5121) to nine (STI0012) with an average number of 5.5 (Table 2). Overall, the 24 SSR fragment size standards produce 137 reference alleles, representing 44.7% of the total of 306 alleles found in 742 accessions by the 24 SSR markers of the new PGI kit. Allele sizes included in the size standards ranged from 83 to 322 bp providing an easy and convenient tool for identification of allele sizes (Fig. 3).
Discussion
The new PGI kit is composed of 24 SSR markers from over 200 we screened. It provides high-quality, high polymorphism alleles with two markers from each of the 12 linkage groups of potato separated by at least 10 cM, except for chromosome VII with two SSRs separated by only 3 cM. It discriminates representative germplasm samples from all potato cultivar groups with high accuracy. Nine SSRs are from the previous PGI kit (Ghislain et al. 2004), three from ESTs developed at SCRI, four from TIGR, and eight from the University of Idaho. A composite reference DNA sample can conveniently be used to provide accurate sizing of all alleles for these SSR markers across laboratories and platforms.
The PGI kit can be used for potato germplasm characterization for a variety of purposes from identity verification (fingerprinting), to studies of genetic diversity, anchoring genetic linkage maps, establishment of core collections, and gene flow. The 24 composite reference samples of DNA for allelic size determinations will stimulate and foster collaborations worldwide on the use of SSRs for these applications. For example, we used the new PGI kit to identify potential duplicate landraces between the CIP and PROINPA Bolivian potato germplasm collections (data not shown). Some landraces belong to the same morphologically selected cluster were not grouped into the same molecular cluster especially for landraces of very diverse germplasm sets such as the S. tuberosum Andigenum Group. There was a total correspondence with the less diverse S. tuberosum Chilotanum Group. In another application, the new PGI kit has been using to genotype breeding lines and advanced cultivars of potato, and most of the breeding material groups into a well-defined cluster with landraces of the Chilotanum group (data not shown). Such grouping is expected because the germplasm of the Chilotanum group has been used extensively in potato breeding worldwide.
In summary, these highly characterized new SSR markers have tremendous utility for a variety of applications and can stimulate standardization and international collaborations within the cultivated potato gene pool. The PGI kit including primers and fragment size standard are available upon request. A SSR database of the cultivated potato is available on line from the bioinformatics portal of the Generation Challenge Program web site (www.generationcp.org) and of CIP (http://research.cip.cgiar.org/confluence/display/IPD/SSR+Marker). The latter provides a full description of each SSR markers, amplification and detection conditions, and the genotyping data of all potato landraces available to date. It is expected that with increased uses the SSR database will be integrated with the germplasm database of the CGIAR centers.
Abbreviations
- CIP:
-
Centro International de la Papa (International Potato Center)
- cM:
-
Centimorgans
- EST:
-
Expressed sequence tags
- PGI kit:
-
Potato genetic identity kit
- PIC:
-
Polymorphic information content
- SSR:
-
Simple sequence repeat
- Ta :
-
Annealing temperature
- TIGR:
-
The Institute of Genomic Research
References
Ashkenazi V, Chani E, Lavi U, Levy D, Hillel J, Veilleux RE (2001) Development of microsatellites markers in potato and their use in phylogenetic and fingerprinting analysis. Genome 44:50–62. doi:10.1139/gen-44-1-50
Bonierbale MW, Plaisted RL, Tanksley SD (1988) RFLP maps based on a common set of clones reveal modes of chromosomal evolution in potato and tomato. Genetics 120:1095–1103
Del Rio A, Bamberg J, Huamán Z (2006) Genetic equivalence of putative duplicate germplasm collections held at CIP and US potato genebanks. Am J Potato Res 83:279–285
Devaux C, Lavigne C, Falentin-Guyomarc’h H, Vautrin S, Lecomte JE, Klein K (2005) High diversity of oilseed rape pollen clouds over an agro-ecosystem indicates long-distance dispersal. Mol Ecol 14:2269–2280. doi:10.1111/j.1365-294X.2005.02554.x
Doyle JJ, Doyle JL (1990) Isolation of plant DNA from fresh tissue. BRL Focus 12:13–15
Feingold S, Lloyd J, Norero N, Bonierbale M, Lorenzen J (2005) Mapping and characterization of new EST-derived microsatellites for potato (Solanum tuberosum L.). Theor Appl Genet 111:456–466. doi:10.1007/s00122-005-2028-2
Fenart S, Austerlitz F, Cuguen J, Arnaud J-F (2007) Long distance pollen-mediated gene flow at a landscape level: the weed beet as a case study. Mol Ecol 16:3801–3813. doi:10.1111/j.1365-294X.2007.03448.x
Ghislain M, Trognitz B, del Rosario Herrera M, Solis J, Casallo G, Vásquez C et al (2001) Genetic loci associated with field resistance to late blight in offspring of Solanum phureja and S. tuberosum grown under short-day conditions. Theor Appl Genet 103:433–442. doi:10.1007/s00122-001-0545-1
Ghislain M, Spooner DM, Rodríguez F, Villamón F, Núñez J, Vásquez C et al (2004) Selection of highly informative and user-friendly microsatellites (SSRs) for genotyping of cultivated potato. Theor Appl Genet 108:881–890. doi:10.1007/s00122-003-1494-7
Ghislain M, Andrade D, Rodríguez F, Hijmans RJ, Spooner DM (2006) Genetic analysis of the cultivated potato Solanum tuberosum L. Phureja Group using RAPDs and nuclear SSRs. Theor Appl Genet 113:1515–1527. doi:10.1007/s00122-006-0399-7
Kawchuk LM, Lynch DR, Thomas J, Penner B, Sillito D, Kulcsar F (1996) Characterization of Solanum tuberosum simple sequence repeats and application to potato cultivar identification. Am Potato J 73:325–335. doi:10.1007/BF02849164
Milbourne D, Meyer R, Collins A, Ramsay L, Gebhardt C, Waugh R (1998) Isolation, characterization and mapping of simple sequence repeat loci in potato. Mol Gen Genet 259:233–246. doi:10.1007/s004380050809
Moisan-Thiery M, Marhadour S, Kerlan MC, Dessenne N, Perramant M, Gokelaere T et al (2005) Potato cultivar identification using simple sequence repeats markers (SSR). Potato Res 48:191–200. doi:10.1007/BF02742376
Nei M (1973) Analysis of gene diversity in subdivided populations. Proc Natl Acad Sci USA 70:3321–3323. doi:10.1073/pnas.70.12.3321
Provan J, Powell W, Waugh R (1996) Microsatellite analysis of relationships within cultivated potato (Solanum tuberosum). Theor Appl Genet 92:1078–1084. doi:10.1007/BF00224052
Remington DL, Thornsberry JM, Matsuoka Y, Wilson LM, Whitt SR, Doebley J et al (2001) Structure of linkage disequilibrium and phenotypic associations in the maize genome. Proc Natl Acad Sci USA 98:11479–11484. doi:10.1073/pnas.201394398
Rios D, Ghislain M, Rodriguez F, Spooner DM (2007) What is the origin of the European potato? Evidence from Canary Island landraces. Crop Sci 47:1271–1280
Schneider K, Douches DS (1997) Assessment of PCR-based simple sequence repeats to fingerprint North American potato cultivars. Am Potato J 74:149–160. doi:10.1007/BF02851594
Sokal RR, Rohlf FJ (1995) Biometry, 3rd edn. Freeman, New York, 0 7167 2411 1
Spooner DM, Núñez J, Rodríguez F, Naik PS, Ghislain M (2005a) Nuclear and chloroplast DNA reassessment of the origin of Indian potato varieties and its implications for the origin of the early European potato. Theor Appl Genet 110:1020–1026. doi:10.1007/s00122-004-1917-0
Spooner DM, van Treuren RR, de Vicente MC (2005b) Molecular markers for germplasm and genebank management. Tech Bull 10. International Plant Genetic Resources Institute, Rome
Spooner DM, Núñez J, Trujillo G, del Rosario Herrera M, Guzmán F, Ghislain M (2007) Extensive simple sequence repeat genotyping of potato landraces supports a major reevaluation of their gene pool structure and classification. Proc Natl Acad Sci USA 104:19398–19403. doi:10.1073/pnas.0709796104
Stam P (1993) Construction of integrated genetic linkage maps by means of a new computer package: JoinMap. Plant J 3:739–744
Stich B, Melchinger AE, Frisch M, Maurer HP, Heckenberger M, Reif JC (2005) Linkage disequilibrium in European elite maize germplasm investigated with SSRs. Theor Appl Genet 111:723–730. doi:10.1007/s00122-005-2057-x
Veilleux RE, Shen LY, Paz MM (1995) Analysis of the genetic composition of anther-derived potato by randomly amplified polymorphic DNA and simple sequence repeats. Genome 38:1153–1162
Villamón FG, Spooner DM, Orillo E, Mihovilovich E, Pérez W, Bonierbale MW (2005) Late blight resistance linkages in a novel cross of the wild potato species Solanum paucissectum (series Piurana). Theor Appl Genet 111:1201–1214. doi:10.1007/s00122-005-0053-9
Acknowledgments
We thank Sergio Feingold and Jim Lorenzen for providing their data set of the map location of SSR markers. This research was supported by the Grant SP1C2-2004-5 from the Generation Challenge Program, the International Potato Center, and by U.S. National Science Foundation Division of Environmental Biology (NSF DEB) 0316614 entitled: A world-wide treatment of Solanum (http://www.nhm.ac.uk/research-curation/projects/solanaceaesource//). The authors are grateful to the conscious dedication of Luciano Fernandez for his technical support.
Open Access
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
Author information
Authors and Affiliations
Corresponding author
Electronic supplementary material
Rights and permissions
Open Access This is an open access article distributed under the terms of the Creative Commons Attribution Noncommercial License (https://creativecommons.org/licenses/by-nc/2.0), which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.
About this article
Cite this article
Ghislain, M., Núñez, J., del Rosario Herrera, M. et al. Robust and highly informative microsatellite-based genetic identity kit for potato. Mol Breeding 23, 377–388 (2009). https://doi.org/10.1007/s11032-008-9240-0
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11032-008-9240-0