Introduction

Winter oilseed rape (WOSR, rapeseed, canola) (Brassica napus L. var. oleifera) is the result of spontaneous, interspecific hybridizations between genotypes of turnip rape (Brassica rapa L. syn. campestris; genome AA, 2n = 20) and cabbage (Brassica oleracea L.; genome CC, 2n = 18). Genetically, the rapeseed is an allopolyploid (genome AACC, 2n = 38) and contains full genomes of turnip rape and cabbage. From being a marginal species grown on a limited area in the mid-twentieth century, rapeseed has become the most important oil crop in countries with a moderate climate, including Poland. Next to soybean, it is the main global oil crop. The significant increase in interest in this plant is the result of intensive breeding, which has resulted in the elimination of erucic acid (C22:1), which is undesirable in human nutrition, from oil composition, and the nearly tenfold reduction in the content of sulfur compounds—glucosinolates in the seeds, which limit the use of post-extraction rapeseed meal as feed for livestock. Double low ‘00’ varieties, i.e., low-erucic, low-glucosinolates ones, have been commonly found in cultivation since the late 1980s. The most important achievements in rapeseed breeding include an improvement of yielding ability and, above all, the development and introduction into cultivation of hybrid varieties (Wittkop et al. 2009). The increase in rapeseed production need not only be a consequence of increased cultivation area, which is limited, but might primarily be the result of the implementation of new technologies of cultivation and biological progress in breeding techniques. The use of molecular/genetic markers enables the selection of economically important features at an early stage of plant development, giving the breeder an additional tool to increase the efficiency of the breeding process (Mikołajczyk 2013; Rakoczy-Trojanowska et al. 2017a, b).

Rapeseed is a relatively young species, which, as a cultivated plant, appeared in Europe only 500 years ago. Currently, it is primarily the natural variability observed within the species that is used. This is relatively narrow due to the limited range of cultivation, absence of wild progenitors acting as a natural source of variation to be used to increase genetic diversity, and the intensive breeding aimed at obtaining double low forms (low-erucic, low-glucosinolates) (Delourme et al. 2013). For this reason, the selection of parental components for combination is a particularly important issue. Such components should be as genetically diverse as possible. Numerous research centers have created the concept of the elaboration, for the purposes of cultivation, of separate genetic pools on the basis of the DNA polymorphism of breeding material. The process of breeding hybrid varieties requires a large number of crosspollination combinations of parental lines and their evaluation in multiple field experiments, preferably in multiple environments in terms of general and specific combining ability, with simultaneous very careful selection of the components based on a number of qualitative characteristics (Bocianowski et al. 2016). For this purpose, methods are sought that would allow the early selection of parental components of F1 hybrids, and thus the shortening of the breeding cycle (Liersch et al. 2016). Such studies performed in different species of plants have shown that one of the elements allowing a high heterosis effect to be obtained in the yield of seeds is the genetic distance (GD) of the parental forms used to make the F1 hybrid. According to the theory of heterosis, the effect of heterosis is greater in hybrids when parental lines are more distant genetically (Fu et al. 2014). A degree of GD between parents is necessary for significant heterosis but it is not sufficient to guarantee it. The relationship between GD and the heterosis effect has been demonstrated, among others, in cereals, corn and rapeseed (Reif et al. 2003; Krystkowiak et al. 2009; Girke et al. 2012a). These results indicate the possibility of the selection of parental lines of hybrids based on their polymorphism identified through observation of phenotypic traits and genotype analysis (Lefort-Buson et al. 1988; Yu et al. 2005). These studies initially involved phenotypic characterization (Ali et al. 1995), isoenzymatic markers (Chèvre et al. 1991), and the development of molecular genetics. Then, methods of genotyping such as AFLP, SSR, SNP and DArTseq led to the possibility of the quick evaluation of genetic variability independent of modifying environmental effects (Stępień et al. 2007; Irzykowska and Bocianowski 2008; Krzymińska et al. 2008; Li et al. 2011; Gao et al. 2014; Wu et al. 2014; Seidler-Łożykowska et al. 2014; Varghese et al. 2014; Rahman et al. 2015; Luo et al. 2016).

The objective of this study was to use AFLP markers to determine the extent of DNA polymorphism to enable estimation of the GD between parental lines of CMS ogura F1 hybrids, double lines and cultivars, as well as resynthesized (RS) B. napus lines and double low-quality semi-RS lines with the restorer gene (Rfo).

Materials and methods

Plant materials

The plant materials used in this study consisted of two collections of WOSR genotypes from the collection of Plant Breeding Company Strzelce Ltd., Poland, and resynthesized (RS) lines from the Plant Breeding and Acclimatization Institute—NRI. The former included 78 genotypes: 40 CMS ogura male sterile inbred lines, 26 restorer lines for ogura system (Rfo) and 12 cultivars and genotypes of Polish, Ukrainian and Asian origin. The second collection comprised nine CMS ogura male sterile lines, 33 restorer lines (Rfo), five RS WOSR lines, and a double low semi-RS line with the Rfo gene.

DNA isolation and molecular marker analysis

Genomic DNA was extracted from young leaves of ten plants from each line using a modified cetyltrimethylammonium bromide (CTAB) extraction protocol (Doyle and Doyle 1990). DNA quantity and purity were checked spectrophotometrically at 260 nm and 280 nm wave lengths (NanoDrop 2000, Thermo Scientific) and by 0.8% agarose gel electrophoresis for 1 h at 100 V in 1 × TBE, visualized under UV light after staining with Roti®-Safe GelStain (Carl Roth, Karlsruhe, DE). AFLP analysis followed standard methods described by Vos et al. (1995) and in accordance with the manufacturer’s instructions: Gibco BRL, AFLP Analysis Reagent Kit, AFLP Analysis System I for collection 1, while collection 2—AFLP® Plant Mapping Kit, Applied Biosystems, Foster City, CA. The primer combinations were selected from the linkage map for rapeseed described by Lombard and Delourme (2001) and the results in the analyses of collections of different WOSR genotypes found in Liersch et al. (2013).

Genomic DNA of both collections of oilseed rape were digested with the restriction enzymes EcoRI and MseI (AFLP Core Reagent Kit, Invitrogen). Eleven primer combinations (EcoRI, MseI) (Gibco BRL, AFLP Starter Primer Kit) were used in selective amplifications of WOSR collection 1. PCR products were resolved by 13.35% denaturing polyacrylamide gels, and the gels were stained with silver. Seven primer combinations were used in selective amplifications of WOSR collection 2. EcoRI primers were fluorescently labeled with blue (FAM), green (JOE) and yellow (NED) 5′ reporter dyes (AFLP Selective Amplification Primers, Applied Biosystems). Amplification products were separated using capillary electrophoresis on an ABI Prism 3130xl Genetic Analyzer (Applied Biosystems) and analyzed by PeakScanner 1.0 (Applied Biosystems).

Data scoring and statistical analysis

A peak was considered polymorphic if it was present in at least one accession and absent in others. Mostly, clear peaks (markers) with strong intensity were counted, and only clear-cut polymorphic peaks (markers) were scored, and the data were exported to a 1–0 matrix for statistical evaluation. Fragments smaller than 50 bp were excluded from the data matrix. Both collections were analyzed independently. The polymorphic information content (PIC) for each primer was calculated using the formula described by Wolko et al. (2015):

$$PIC = 1 - \sum\limits_{i = 1}^{n} {p_{i}^{2} } ,$$

where pi is the frequency of the i-th allele and n is the number of markers per primer. PIC values ranged from 0 (in the case of fixation of one allele) to 0.5 (when the frequencies of both alleles were equal). The data matrix for both collections was used to calculate pairwise GD based on Nei and Li (1979) coefficients. Genetic distance for each pair of genotypes was estimated based on the coefficient proposed by Nei and Li (1979), defined as GS = 2NXY/(NX + NY), where NXY is the number of bands shared by genotypes X and Y, NX is the number of bands in genotype X, and NY is the number of bands in genotype Y. The genotypes were grouped using the unweighted pair group method with arithmetic mean (UPGMA). Dissimilarities between genotypes were visualized on a dendrogram. On the basis of all markers, principal component analysis (PCA) was performed with the use of the GenStat v. 18 statistical package. This multivariate technique allowed us to find and plot the major pattern within the multiple loci data set. The resulting distance matrix was also subjected to an analysis of molecular variance (AMOVA) using GenAlEx 6.5 (Peakall and Smouse 2012). AMOVA estimated and partitioned the total molecular variance within and between groups of genotypes and then tested the significance of partitioned variance components using nonparametric permutational testing procedures with 999 permutations (Excoffier et al. 1992). The FST coefficient was used as a measure of population genetic structure and was calculated from the formula: FST = (HT − HS)/HT, where HT is the probability that two alleles drawn at random (with replacement) from the entire population differ in state (i.e., the probability of no population structure), and HS is the probability that two alleles drawn at random from a subpopulation differ in state (which, for a two-allele system, will always be 2piqi, where pi is the observed allelic frequency in subpopulation i), averaged over subpopulations. Groups for AMOVA were identified on the basis of the pedigree of the genotypes (Table 1).

Table 1 List of plant material (B. napus genotypes and RS lines) with their codes used in the study, group number for analysis of molecular variance, type and country of origin

Results

The first collection of 78 genotypes of WOSR was tested using 11 combinations of AFLP primers. 539 DNA amplification products were obtained, of which 262 were polymorphic (Table 2). The number of polymorphic markers for a single primer combination ranged from 14 to 36. In the reaction with the EAGG: MCTA primers the highest number of polymorphic markers was obtained—36, while the highest level of polymorphism was found with the primer combinations EAGG: MCTA—59.00% and EAGG: MCTC—71.43%. One primer combination generated an average of 23.8 polymorphic DNA fragments. The set of AFLP primers used in this study generated highly informative loci with PIC values ranging from 0.474 to 0.500, with a mean of 0.491 (Table 2). The GD data matrix based on the Nei and Li (1979) coefficients ranged from 0.08 to 0.74 and the lowest was for genotype pairs DG 25 and DG 26 (0.08) and DG 7 and DG 8 and DG 73 and DG 78 (0.09) and the highest for genotypes DG 9 and DG 52 (0.74) as well as DG 2 and DG 55—0.72 (Table 3). The dendrogram obtained by cluster analysis of the Nei and Li (1979) distance values identified 78 genotypes in two basic groups—A and B—each of which was divided into smaller clusters (Fig. 1). The first group (A) is partitioned into three smaller subgroups. In the first (A1) are three cultivars from Ukraine: ‘Szmaragd’, ‘Królowa Śniegu’ and ‘Kleopatra’ (DG 1–DG 3), 24 CMS ogura lines (DG 4–DG 27), and GMS line DG 28. The second small group (A2) contained two GMS lines (DG 29, DG 30), and nine CMS ogura lines (DG 31–DG 39). The third (A3) small cluster comprised only three CMS ogura lines (DG 40–DG 42). The second group (B) is also divided into three subgroups. The first subgroup (B1) is the most diverse, because it includes both “00” lines of WOSR (DG 44–DG 46), four lines with the restorer gene Rfo (DG 53–DG 56), the Chinese line resistant to Sclerotinia sclerotiorum (DG 52) and three CMS ogura lines (DG49–DG 51). The second small cluster (B2) contained only yellow seeded restorer lines Rfo– numbers from DG 69 to DG 78. The last (B3) small cluster included only 12 lines with the restorer gene Rfo– numbers from DG 69 to DG 78 (Fig. 1).

Table 2 Genetic diversity analysis of investigated WOSR genotypes collection with the use of AFLP primer combinations
Table 3 Minimum and maximum of genetic distance values calculated according to using Nei and Li (1979) measure
Fig. 1
figure 1

Genetic dissimilarity dendrogram of 78 genotypes (collection 1) of WOSR, based on 262 AFLP markers

AMOVA of WOSR of different genotypes (cultivars, double low, CMS and restorer lines for Ogura system, Table 1) as a grouping criterion (four groups) revealed that within-group variance accounted for 79% of the total variance, while variation between groups contributed 21% (Table 4). The significant differentiation (FST= 0.215; p = 0.001) between four groups was supported by the results of AMOVA (Tables 4 and 5). PCA was performed to confirm the structure and the diversity of genotypes of WOSR based on all markers (Fig. 2). The first two principal components explained a total of 23.18% of marker variation (14.71, and 8.47%, respectively). The first axis, which accounted for 14.71% of the variance, separated the genotypes into two clusters. The second axis accounted for 8.47% of the variance, and showed the diversity of two groups of genotypes.

Table 4 Results of analysis of molecular variance (AMOVA) for both genotype collections
Table 5 Molecular variance for particular groups of both genotype collections
Fig. 2
figure 2

Principal component analysis (PCA) of 78 genotypes winter oilseed rape (collection 1) based on 262 AFLP markers

AFLP fingerprinting of the 48 genotypes from collection 2 with seven EcoRI/MseI primer combinations detected a high level of polymorphism—423 polymorphic AFLP markers (Table 2). The number of polymorphic markers per primer combinations ranged from 24 (EACT FAM: MCTC) to 86 (EACC NED: MCTC), with an average of 60.4 markers. Primer combinations EAGG JOE: MCTA—93.44%, EACC NED: MCTC—93.48% and EACC NED: MCAG—96.0% were the most effective in detecting polymorphism among genotypes. The set of AFLP primers used for collection 2 generated highly informative loci with PIC values ranging from 0.423 to 0.500, with a mean of 0.484 (Table 2). The GD coefficient showed the lowest dissimilarity value: 0.14 for CMS ogura lines (DG 13/14 and DG 14/14), and the highest value was 0.74 between restorer line (DG 41/14) and resynthesized oilseed rape line RS48/14 (Table 3). Cluster analysis enabled differentiation of the investigated genotypes into three main groups: C, D, E (Fig. 3). The first group (C) included one CMS ogura line (DG 8/14) and 25 lines with the restorer gene (Rfo). The cluster D comprised 16 genotypes and was subdivided into two smaller clusters. Six CMS ogura lines (DG 11/14–DG 16/14) formed cluster D1, and eight restorer lines (Rfo) and two CMS ogura lines (DG 9/14, DG 10/14) cluster D2. The last group (E) included only resynthesized lines: RS 55/14, RS 60/14, RS 42/14, RS 50/14, RS 48/14, RS 67/14. Analyzing the results of molecular tests of genotypes from collection 2, specific markers appeared only in the resynthesized lines of WOSR. These markers did not occur in the analyzed double low WOSR genotypes, whereas they were present in a few of the resynthesized genotypes. The AFLP markers characteristic for RS genotypes were found in five of the seven AFLP primer combinations used in the study. AFLP markers AAC/CAC275, 276, ACC/CAC175, ACT/CTC93 and AGG/CTA499.8 were found in three resynthesized lines, and the AAC/CAC306, ACC/CAC201 and ACC/CAG277 markers in four RS lines. Seven AFLP markers were found in all analyzed resynthesized lines of WOSR—AAC/CAC344,416, ACC/CAC96,132, ACC/CAG278 and AGG/CTA83.3, 276.2, and six AFLP markers in five RS lines—AAC/CAC586, ACC/CAC102,127,170,202, and ACC/CAG333, respectively. AMOVA for the WOSR genotypes, using types (CMS or restorer lines for Ogura system, RS lines) as a grouping criterion (three groups), revealed that within-group variance accounted for 81% of the total variance, while variation between groups contributed 19% (Table 4). The significant differentiation (FST= 0.185; p = 0.001) between the three groups was supported by the results of AMOVA (Tables 4 and 5). Variations in individual groups were similar: 49.4 for CMS ogura lines, 54.2 for resynthesized lines and 55.5 for restorer lines (Table 5). PCA was performed to confirm the structure and to gain insight into the diversity of 48 genotypes from collection 2 based on the 423 AFLP markers (Fig. 4). The first two PCs explained a total of 18.79% of AFLP marker variation (11.79 and 7.00%, respectively). The first axis, which accounted for 11.79% of the variance, separated the genotypes which were assigned by UPGMA to the first and second clusters. It is evident that the results provided by these two methods applied independently are complementary.

Fig. 3
figure 3

Genetic dissimilarity dendrogram of 48 genotypes (collection 2) of WOSR, based on 423 AFLP markers

Fig. 4
figure 4

PCA of 48 genotypes (collection 2) of WOSR, based on 423 AFLP markers

Discussion

Research carried out in various species of crops has shown that one of the elements that allows a high effect of heterosis in the seed yield to be obtained is the GD of parental forms used to create F1 hybrids. Therefore, many breeding companies have formed gene pools based on the DNA polymorphism of breeding materials (Snowdon and Friedt 2004). Molecular analysis of two WOSR genotype collections originating mainly from the Plant Breeding Company Strzelce Ltd. and Plant Breeding and Acclimatization Institute—NRI performed with the use of AFLP molecular markers was to form the basis of the creation of gene pools for the development of the WOSR F1 hybrids. Previous studies on Brassica crops confirmed the usefulness of AFLP markers for analyses of genetic variability (Ecke et al. 2010; Faltusová et al. 2011; Liersch et al. 2013). The choice of the AFLP primer combinations based on restriction enzymes (EcoRI and MseI) was made on the basis of a genetic map of WOSR (Lombard and Delourme 2001) and studies of parental lines of CMS ogura hybrids (Liersch et al. 2013). The results confirmed the usefulness of selected AFLP primer combinations due to the large number of markers obtained for one primer combination—53.9 for the first collection, and 65.9 for the second one, as well as the high efficiency and repeatability of the AFLP method. The analysis of the first collection used the method of separating PCR products on polyacrylamide gel, and detection by silver staining, while in the second study a different method was used for analysis of PCR products using capillary electrophoresis. The use of such a modification made it possible to simplify and significantly accelerate research and reduce the labor intensiveness of the method. Jesske et al. (2013) showed the usefulness of AFLP analysis using fluorescently labeled markers and separation of amplification products on capillaries in studies of genetic diversity of 126 rapeseed genotypes. Nurhasanah and Ecke (2016) indicated that the AFLP method is one of the most efficient methods, generating a large number of DNA markers in the study of genetic variation and mapping. The authors also demonstrated the usefulness of AFLP and SSR markers for the creation of genetic maps, genomic analyses and quantitative traits loci mapping. 262 AFLP polymorphic markers for the first collection and 423 markers obtained in the studies of the second collection allowed the determination of the genetic diversity within each collection, providing a sufficient assessment of the variability of the examined winter rape genomes. The dendrograms created on the basis of the GD value divided the examined genotypes of both collections into separate groups according to the type of plant material and its origin/pedigree. CMS ogura lines and restorer lines (Rfo) form separate groups in both collections, as do Ukrainian cultivars in the first of the studied collections. Resynthetic lines of winter oilseed rape newly obtained in the Tissue Culture Laboratory of Plant Breeding and Acclimatization Institute—NRI of WOSR form a distinctly different group on the dendrogram (Fig. 3) from other double low WOSR. The genetic diversity of resynthesized (RS) and semi-resynthesized (semi-RS) lines has also been demonstrated in studies of 101 breeding lines and WOSR cultivars using 10 AFLP primer combinations (Liersch et al. 2016). Similar results were obtained by Sosnowska et al. (2017) in investigations of collections of spring and WOSR cultivars, B. rapa and Brassica oleracea, as well as nearly 50 RS lines and semi-RS lines. Confirmation of the distinctiveness of resynthesized lines by AFLP markers shows that these lines can be valuable material for creating separate gene pools, especially for hybrid breeding of WOSR (Szała et al. 2016; Sosnowska et al. 2017). Jesske et al. (2013) analyzed 126 genotypes including 55 natural B. napus genotypes (winter, spring, Asian), and vegetable/cabbage forms and 71 resynthetic lines using 20 AFLP primer combinations. The authors showed a high value of GD between these groups, and thus created a dendrogram of rapeseed groups to clusters according to their origin and type/form in contrast to the resynthetic lines (RS) that created completely separate clusters. Seyis et al. (2006), Girke et al. (2012a, b), Jesske et al. (2013), and Chatterjee et al. (2016) in their research confirmed the usefulness of new RS rapeseed genotypes created as a result of hybridizing B. rapa with B. oleracea diploid parents. Hybridization helped to broaden the variability of breeding material, and to create a gene pool for use by breeders (Tonosaki et al. 2016). These authors point to the high yield potential of hybrids created with rapeseed RS lines, as confirmed by results from field experiments conducted in several environments.

The presented results confirmed the usefulness of molecular markers from the AFLP technique for the study of the GD of various WOSR collections. This resulted from the large number of generated markers and the repeatability of the obtained results. In the case of the first collection, 11 combinations of AFLP primers were used for the study, and in the second collection seven combinations allowed the characterization of the rapeseed genotypes tested and the creation of separate heterotic groups. Determination of genetic variation based on molecular markers is particularly useful in breeding F1 hybrid cultivars, an example of which is a winter rapeseed restored hybrid—Poznaniak—developed on the basis of the CMS ogura system, whose parental components (MS 83 × BR 26) were selected from different clusters (data not published). Understanding the genetic variability of breeding materials is a valuable complement to classical breeding methods. It allows the selection of the best components for creating test hybrids, often eliminating, at the early stages of breeding, genotypes that are unsuitable from the point of view of breeding. This approach enables the comprehensive selection of CMS ogura lines and restorer lines which will be used for further crosses and then evaluated for general and specific combination ability in multi-experimental field trials (Nowosad et al. 2016, 2017; Bocianowski et al. 2018, 2019). In addition, demonstrating the significant genetic separateness of the resynthesized lines (RS) of rapeseed from commonly cultivated double low WOSR cultivars will enable a broadening of the variability within Brassica and the use of RS lines in breeding new winter rapeseed cultivars (Szała et al. 2016; Sosnowska et al. 2017).