Introduction

The origin of the “European” potato (cultivated potato first appearing in Europe and then spreading worldwide) has been controversial for nearly a century since the Russian potato scientists Juzepczuk and Bukasov (1929) supposed it as derived from landraces of Chilean origin, growing at Chiloé Island and the Chonos Archipelago to the south. The morphological resemblance and the capacity to tuberize under long-day conditions were the two main arguments for a Chilean origin. The tetraploid cultivated potato Solanum tuberosum has been long recognized as divided into two entities either referred to as species (Bukasov 1971; Lechnovich 1971), sub-species (Hawkes 1990; Ochoa 1990) or Cultivar Groups (Dodds 1962; Huamán and Spooner 2002; Spooner et al. 2007). S. tuberosum Andigenum Group is distributed from western Venezuela south to northern Argentina, and S. tuberosum Chilotanum Group, formerly referred to as the Tuberosum Group, from lowland Chile in Chiloé Island and the islands of the Chonos Archipelago to the south (Huamán and Spooner 2002). For simplicity of discussion in this paper we refer to S. tuberosum Andigenum Group as “Andigenum” and S. tuberosum Chilotanum Group as “Chilotanum” (referred to as Tuberosum in most potato breeding literature). Andigenum germplasm tuberizes poorly in the long days of the temperate latitudes, and the ability to rapidly select Andigenum for long-day length tuberization has been a big part of the long-standing controversy on the origin of the European potato (Ames and Spooner 2008).

In contrast, a number of potato scientists proposed or supported an Andigenum origin of the European potato (Salaman 1937; Salaman and Hawkes 1949; Glendinning 1975b; Hosaka and Hanneman 1988). This long-standing view was based partly on the production of Neo-Tuberosum potatoes, and on a range of other historical arguments (Ames and Spooner 2008). In 1959, Simmonds initiated a long-term experiment at the John Innes Institute to select modern potato characteristics from germplasm of exclusive Andigenum origin (Simmonds 1966). He started with 3,300 seedlings of about 300 Andigenum accessions, from geographically wide sources in Colombia, Peru, and Bolivia. He bred these through two sexual generations in a field with open pollination, and mass selected them for 5 years, primarily for day-length adaptation as expressed by greater tuber production in the long days of the UK. At the end of the experiment he produced a population he designated to be “half way” from the original Andigenum to Chilotanum relative to a range of characters that he referred to as the “day length syndrome.” These included earlier tuberization, fewer and shorter stems, less flowering, shorter stolons, greater yield, larger leaves, less pigmented, smoother tubers, fewer but larger tubers, fewer seeds per fruit, and late blight and virus X and Y resistance. Simmonds (1969) named this new advanced potato germplasm “Neo-Tuberosum”. The material was later transferred to the Scottish Crop Research Institute where it was further intercrossed (Glendinning 1975a) and characterized with respect to Andigenum and Tuberosum traits (Glendinning 1975b). Similar to previous observations by Simmonds (1966), Neo-Tuberosum appeared to have intermediate characteristics between Andigenum and Chilotanum. It was shown to be a highly diverse set of germplasm with a wide range of desirable traits that was thought to be lacking in existing advanced cultivars such as viruses X and Y and wart resistance provided by the Andigenum germplasm (Glendinning 1975c).

The ability to rapidly adapt Andigenum germplasm to a Chilotanum-like form and day length response was used as one part of a multi-component argument supporting Andigenum germplasm, rather than Chilotanum germplasm, as the progenitor of the European potato (Hawkes 1956; Simmonds 1978; Hosaka and Hanneman 1988). The Chilean origin hypothesis (Juzepczuk and Bukasov 1929) was relegated to history and the Andigenum origin was universally accepted for the past 40 years.

The late blight epidemic beginning in the UK in 1845 was seen as a turning point in shifting from the initial Andigenum introductions to Chilotanum introductions, which, through interbreeding, gave rise to modern cultivars being hybrids of the two Cultivar groups. There were many problems with this theory, including general lack of late blight resistance in Chilean germplasm (Bukasov 1933; Glendinning 1975d; Jansky 2000) and cytoplasmic male sterility of the Chilotanum serving mainly as female (Grun 1979). The incompatibility is especially significant because well over 99% of modern cultivars contain Chilotanum cytoplasm (Corriveau and Coleman 1988). However, this incompatibility is not absolute and many modern varieties produce pollen and can serve as a male breeding source.

Neo-Tuberosum development was quickly envisioned as a valuable pre-breeding program useful to gain access to the wide Andigenum diversity of traits not easily found in Chilotanum landraces or modern cultivars, by eliminating the unfavorable characteristics which had impaired its use in breeding in the past (Simmonds 1966, 1993). It inspired a new generation of potato breeders to practice the concept of germplasm enhancement by base broadening breeding using Neo-Tuberosum and the Andigenum landraces germplasm (Glendinning 1975c). In 1965, the potato breeding program at Cornell University took over the material from John Innes Institute and bred it with Andigenum landraces from South America (Plaisted 1972; Rasco et al. 1980; Muñoz and Plaisted 1981). After several years of breeding and selection, this program provided Neo-Tuberosum material to many breeding programs worldwide including the International Potato Center (CIP, in Lima Peru). The latter institution developed advanced breeding lines with resistance to early blight, late blight, viruses X and Y, and important progenitors of true potato seeds (Fernandez-Northcote et al. 1986). However, not all breeding programs, in particular private ones, adopted the concept of base broadening using Neo-Tuberosum and Andigenum germplasm shading already doubts about the real impact on varietal improvement.

The original purpose of our study was to quantify the degree of genetic base broadening of modern varieties and breeders’ lines with respect to Andigenum and Chilotanum landraces. We assessed this base broadening with a representative sample from the CIP potato collection that is currently grown world wide including Neo-Tuberosum original clones, and their derivatives. We assessed overall diversity with simple sequence repeats (SSRs or microsatellites) that represent a refinement (paper in review; Herrera et al. 2006) of a set of SSR markers chosen to represent all 12 potato linkage groups, highly polymorphic and with no selected function (Ghislain et al. 2004). To assess maternal contributions of Andean and Chilean germplasm we also screened these materials with a 241-bp plastid deletion marker in the trn V-UAC/ndhC intergenic region of the plastid DNA molecule, which is absent in 94% (or 95%) of the Andean tetraploid landraces and present in 86% (or 81%) of the tetraploid Chilean landraces, depending on the studies of Hosaka (2004) or Spooner et al. (2007).

Materials and methods

Plant material

A total of 688 tetraploid potato accessions were examined. These include: (1) six “pure” clones of Neo-Tuberosum that derive from the Neo-Tuberosum materials generated at Cornell University from the original populations developed by Simmonds, (2) 33 varieties or breeders’ lines developed with Neo-Tuberosum in their pedigree; of these, 25 had Neo-Tuberosum as a female parent and the other 8 had Neo-Tuberosum as a male parent, (3) 154 varieties or breeders’ lines without Neo-Tuberosum in their pedigree, chosen from the CIP genebank selected to cover a wide range of modern germplasm of cultivated potato from around the world; included here are three breeding clones [B1C4 clone Andigenum × Andigenum and ICA NEVADA Phureja × Andigenum, San Antonio Abad (Tuberosum × Andigenum) × Andigenum] serving as controls of Andigenum-based breeding and 15 unclassified putative tetraploid landraces that likely represent Andigenum × Chilotanum hybrids, (4) 305 of Andigenum landraces, (5) 190 Chilotanum landraces. The latter two groups of germplasm are those used in a previous study (Spooner et al. 2007) compared with 55 Andigenum and 164 Chilotanum accessions. The list of the 1–3 germplasm materials is deposited as a supplementary data file including the assignation of their taxonomic unit based on pedigree information available at CIP and in the literature.

DNA extraction, SSR primers, PCR conditions and electrophoresis

Genomic DNA was obtained by using standard protocols at CIP (Herrera and Ghislain 2000). DNA concentration was calculated by using PicoGreen dsDNA quantification reagent and a TBS-380 Fluorometer (Turner BioSystems). DNA dilutions were performed to achieve a final concentration of 3 ng/μl using 96-well plates. We used 24 SSR markers from a new Potato Genetic Identity kit (Table 1; paper under review; Herrera et al. 2006) that updates one by Ghislain et al. (2004). PCR reactions were performed and carried out as described in Spooner et al. (2007). PCR products were separated by electrophoresis on a LI-COR 4300 DNA Analyzer System.

Table 1 SSR markers used to genotype the 193 varieties and breeders’ lines (VBL), 305 accessions of Andigenum (ADG) and 190 accessions of Chilotanum (CHL), as well as number of alleles identified and polymorphic index content (PIC) per SSR locus

SSR allele scoring

Simple sequence repeat alleles were detected and scored using the SAGA Generation 2 software (LI-COR, USA). The SSR alleles were identified by their size in base pairs of the upper band of the allele and scored as present (1), absent (0), or missing (9). Missing data were less than 5% (3.8%) which is acceptable for cluster analysis.

Cluster analysis

Genetic analysis was performed by using the program DARwin 5.0 (Perrier and Jacquemoud-Collet 2006). We used Jaccard’s similarity coefficient and 50% of minimal proportion of valid data required for each unit pair for cluster analysis. The dendrogram was built by using the Neighbor-joining method.

Plastid DNA marker analysis

The 241 bp plastid deletion marker was analyzed for all 688 landrace accessions by using the PCR primers and protocols from Hosaka (2002). PCR products were separated by electrophoresis on a 1% agarose gel, and Lambda phage digested by PstI was used as a molecular weight marker. The 241 bp deletion polymorphism was determined for size in base pairs and scored as “T” (with the deletion) or “X” (lacking it) based on clearly separating bands.

Data repository

Simple sequence repeat marker data were deposited in two SSR databases available on line from the bioinformatics portal of the Generation Challenge Program web site (http://www.generationcp.org) and of CIP (http://research.cip.cgiar.org/confluence/display/IPD/SSR+Marker). These list full descriptions of each SSR marker, amplification and detection conditions, and the genotyping data of all potato landraces available to date from this and previous studies.

Results

The plastid DNA marker typical of Chilotanum germplasm was present in one of the six pure Neo-Tuberosum clones. Seven of the 12 varieties and breeders’ lines with Neo-Tuberosum as direct female parent present the plastid DNA marker typical of Chilotanum germplasm which is rare among Andigenum germplasm. SSR markers grouped the six pure Neo-Tuberosum clones and all Neo-Tuberosum derived varieties and breeding lines entirely with the Chilotanum cluster instead of the Andigenum cluster (Fig. 1). This is in contrast with the three breeding controls resulting of Andigenum-based breeding and selection which grouped with the Andigenum cluster as expected.

Fig. 1
figure 1

Dissimilarity tree using Neighbor-joining cluster analysis of (1) Six “pure” accessions of Neo-Tuberosum (Neo-tbr) that represent some of the original populations developed by Plaisted from Simmonds materials (green), (2) 33 varieties or breeders’ lines developed with Neo-Tuberosum in their pedigree (purple), (3) 154 varieties or breeders’ lines without Neo-Tuberosum in their pedigree chosen from the CIP genebank selected to cover a wide range of modern germplasm of cultivated potato around the world (orange), (4) 305 of Andigenum landraces (blue), (5) 190 Chilotanum landraces (gray). Also are indicated the three breeding clones serving as controls of Andigenum-based breeding

For the entire data set of 688 accessions examined, a total of 319 alleles were identified from 24 SSR markers with an average of 13.3 alleles per marker. Two hundred and twenty-nine alleles (71.8%) were identified within the variety and breeders’ lines with an average of 9.5 alleles per marker and the polymorphic index content (PIC) of all 688 accessions ranged from 0.550 to 0.854 with an average of 0.745 (Table 1).

Cluster analyses revealed a well-defined cluster of the varieties and breeders’ lines grouping together with the Chilotanum landraces, completely separate from the Andigenum landraces (Fig. 1).

Fourteen (6.1%) of the 229 SSR alleles were found only in the varieties and breeders’ lines. One had a frequency of 2.3% while the other was below 1%. The average PIC (0.743) is statistically indistinguishable from those of groups of native potatoes (PIC = 0.752 for Andigenum, and PIC = 0.732 for Chilotanum) which illustrates the high genotypic diversity of the breeding material from CIP.

The plastid DNA deletion marker characteristic of Chilotanum landraces was found in 46% of the varieties and breeders’ lines in comparison to 73.2% of the Chilotanum landraces and 4.6% in the Andigenum landraces.

Discussion

The finding that all six “pure” Neo-Tuberosum and the 33 Neo-Tuberosum derivatives group, without exception, with the Chilotanum rather than with the Andigenum cluster leads us to conclude that the Neo-Tuberosum materials are clearly not the product of strict inter-Andigenum breeding. The three Andigenum-based breeding controls falling in the Andigenum cluster and the fact that the SSR alleles are widely dispersed neutral markers not chosen for linkage to Chilotanum traits (Ghislain et al. 2004) validate this conclusion based on the analysis of the SSRs and the plastid marker.

This completely unexpected result has possibly two explanations which may or may not have occurred together. One explanation is supported by tracing the events which led to the original Neo-Tuberosum stocks. The initiation of the first Neo-Tuberosum populations was done “by some hundreds yards” away from modern cultivars (Simmonds 1966) which led Simmonds (1966) to state that out-pollination “was at worst infrequent and probably non-existent”. Potatoes require vibratile buzzing to release pollen, and bumblebees are their main pollinators (Johns and Keen 1986; Scurrah et al. 2008). Bumblebees typically forage over 70–631 m (Osborne et al. 1999), but pollen from one flower is usually deposited only across a limited number that are subsequently visited and gene flow is generally restricted to very near neighbors (McPartlan and Dale 1994; Scurrah et al. 2008). Only about 20% of seeds were cross-pollinated in the original Neo-Tuberosum experiment (Glendinning 1976). These and factors such as residence time in one crop favors highly localized cross-pollination of plants near the pollen source (Cresswell et al. 2002). Therefore, it seems unlikely that after only two recombination cycles in the field, the population would have become “half way” towards modern varieties by only pollen contamination from nearby modern variety plots. In addition to possible gene flow from Chilotanum adjacent fields, the Neo-Tuberosum generation plots could have gained genes from modern varieties within the field from volunteers of modern potato varieties which were found later to be present and recognized to be a possible source of contamination in the early cycles of the re-creation experiment (Glendinning 1976). Hence, this first possible explanation, gene flow from adjacent fields, and from volunteers within the Neo-Tuberosum fields cannot be excluded from the first years of the Neo-Tuberosum experiment. As this original and derived material was distributed to other partners later on who confirmed the Neo-Tuberosum theory, an early contamination with germplasm from modern varieties might have completely skewed their conclusions.

A second explanation may be the misidentification of the original genetic stocks. The plant material used in the original John Innes Institute studies in UK was from the Andigenum stocks from the Commomwealth Potato Collection. Described by Glendinning (1975a), this material is of 43% Bolivian, 35% southern Peruvian, and about 10% northern Peruvian and Colombian origin. This collection may have had misidentified accessions of Chilotanum as Andigenum. In the early days of potato collection, all tetraploid landraces from the highland Andes were automatically included into Andigenum. However, a closer look at some of Andean landrace accessions reveals mixed characteristics of Andigenum and Chilotanum (unpublished data at CIP) revealing the difficulties in identifying some accessions. These are referred sometimes as Andigenum × Chilotanum or vise versa and assumed to be natural hybrids although no evidences are provided except for intermediate morphological characteristics (unpublished data at CIP). These accessions could also be re-introductions from Chiloé or hybrids with modern potato varieties. The latter seems, however, the least probable due to frequent pollen sterility of modern potato varieties and limited, if any, introduction of modern varieties in southern Peru and Bolivia.

The genetic analysis of the variety and breeders’ lines reveal that these group with Chilotanum landraces rather than with Andigenum landraces, and that there is no statistically significant difference in genotypic diversity as assessed by SSRs (Table 1). Both modern varieties and breeders’ lines present a high level of diversity and interestingly few unique SSR alleles (14 out of 229) possibly reflecting the introgression of DNA from potato wild species. Indeed, SSR markers were shown to amplify also in several Solanum species (Merino et al. 2006). The high frequency of the plastid marker in variety and breeders’ lines reflects also the importance of the Chilotanum germplasm into their pedigrees. Hence, breeders who have incorporated Andigenum and wild species germplasm following the base-broadening approach have apparently applied a strong negative selection against the Andigenum type when identifying future successful clones. A plausible explanation could be that the desirable features of a cultivated potato, such as adaptation under long-day conditions, superficial eyes, oblong tubers, and possibly others, are present mainly in the Chilotanum germplasm and rarely in the Andigenum germplasm. Such observations reveal that the base-broadening approach using Andigenum germplasm for breeding modern potatoes does not result in major incorporation of genetic factors from the Andigenum germplasm.

Our data are parallel to those of Douches et al. (1991) who documented, with isozyme data, mistaken pedigrees of many nineteenth century potato cultivars. Like us, they ascribed some of these errors to unappreciated hybridization in open-pollinated fields. Apparently, gene flow is common enough that establishing pedigrees by breeding programs based on open pollination is not always reliable and one may question the pedigrees of potato varieties not produced by controlled pollination.