Plant material and mating design
In this article, the successive maritime pine breeding populations were named as follows:
-
G0 trees, the “plus” trees mass selected from the Landes provenance; they constitute the base population of the French maritime pine breeding program (Illy 1966)
-
G1 trees, the selected progeny from G0 trees; they constitute the second generation of the breeding program
-
G2 trees, the progeny from G1 progeny trials.
Six polycross progeny trials for the maritime pine breeding program were established from 1994 to 2002 in southwestern France for the prediction of second-generation (G1) parental breeding values. In total, 960 G1 trees were evaluated (as seed donors) within these six polycross progeny trials, each of which took place on three sites. This study focuses on one of these trial sites, established in 1996 (at 44° 42′ 32″ N/0° 46′ 8″ W) for the evaluation of 166 G1 trees as seed parents. Two different pollen mixes were used: 98 seed parents pollinated with one polymix (47 G1 pollen donors, Ns = 19, four pollen donors were also used as females) and 76 seed parents pollinated with the other polymix (43 G1 pollen donors, Ns = 43, ten pollen donors were also used as females). There were no pollen donors common to both polymixes. Eight of the G1 seed parents were pollinated with both polymixes, but the families resulting from identical seed parents and different PMX were considered to be different. The progeny trial thus consisted of 174 half-sib families plus five checklots, planted in a randomized block design, corresponding to a total of 6440 trees (35 complete blocks with one tree plot per family and two trees per checklot in each block).
Progeny (G2 trees) of both polycrosses was phenotyped for selection criteria. Tree girth at breast height (GBH) and tree height (HT) were measured (in cm) at the age of 12 years, and stem sweep (SWE; stem deviation from verticality at 1.5 m from the ground, expressed in cm) was measured at the age of 8 years.
Breeding value prediction and genetic index
Breeding values for growth (height and girth) and stem sweep were estimated with the TREEPLAN genetic evaluation system (McRae et al. 2004), which includes a database of all available data from the genetic trials of the French maritime pine breeding program. The phenotypic data were first spatially adjusted within each trial. A joint multivariate analysis of all trials based on the best linear unbiased prediction (BLUP) method was then carried out, taking into account both the pedigree relationships between the trees and the correlations between traits. Estimated breeding values (EBVs) and their accuracy were calculated for GBH, HT, and SWE at both measurement ages (i.e., 8 and 12 years). EBVs were also estimated at harvest age (i.e., 35 years), on the basis of age-age correlations for SWE and volume (VOL). Age-age correlations were estimated with the Lambeth correlation model (Lambeth 1980) from multiage data available on other maritime pine trials (unpublished data). The Lambeth coefficient was set at 0.10 for SWE and 0.15 for VOL.
EBVs were obtained with two different pedigree models (before and after pedigree recovery): EBV_PP were calculated with the partial-pedigree model, in which only the theoretical seed donors were known, and EBV_FP were calculated with the full-pedigree model, which included the complete pedigree of the genotyped G2 trees. EBVs are expressed in units of additive standard deviation with the G0 population as the reference population.
Selection decisions were based on multiple-trait selection indexes, combining EBVs calculated at the harvest age. Index_PP and Index_FP were successively considered depending on pedigree information used to calculate the EBVs.
$$ Index\_ PP=\mathrm{EBV}\_\mathrm{PP}\_\mathrm{VOL}-\mathrm{EBV}\_\mathrm{PP}\_\mathrm{SWE} $$
where EBV_PP_VOL and EBV_PP_SWE are the EBVs estimated with the partial-pedigree model for volume and stem sweep at 35 years of age
$$ Index\_ FP=\mathrm{EBV}\_\mathrm{FP}\_\mathrm{VOL}-\mathrm{EBV}\_\mathrm{FP}\_\mathrm{SWE} $$
where EBV_FP_VOL and EBV_FP_SWE are the EBVs estimated with the full-pedigree model for volume and stem sweep at 35 years of age.
Sampling in the polycross trial, with two different preselection strategies
The G2 trees of the progeny polycross trial were ranked according to Index_PP. As this index includes no evaluation of major defects, G2 trees were also scored visually (binary score: 0 for trees with major defects, such as bad branching, forks, disease, or pest damage; 1 for trees without major defects). Trees with a score of 0 were excluded from the preselection process described below.
Two different options were used in the polycross trial to preselect candidates with high growth and low sweep for pedigree recovery. The two options differed in terms of the contribution of the maternal family:
-
In preselection 1 (PS1), no restriction was placed on relatedness. PS1 involved the preselection of trees with no major defects ranked among the 200 best individuals (based on Index_PP). In total, 153 G2 trees were sampled.
-
Preselection 2 (PS2) included a restriction on relatedness. PS2 involved preselection of the two top-ranking trees with no major defects from each of the 75 best families in the progeny trial. The families and the trees within each family were ranked according to Index_PP. Thus, 150 G2 trees (2 individuals × 75 families) were sampled.
Overall, 57 preselected individuals were common in PS1 and PS2 which means that 246 G2 trees were sampled in total. Young needles were collected from the preselected trees and their potential parents (seed donors and pollen donors of both polymixes) and stored at −80 °C for DNA extraction.
DNA extraction and fingerprinting
Frozen needle tissues were ground to a fine powder and used for DNA extraction with an Invisorb® DNA Plant HTS 96 Kit (Stratec Molecular, Berlin, Germany), according to the manufacturer’s instructions. The DNA was quantified with a NanoDrop microvolume spectrophotometer (Thermo Fisher Scientific Inc., Waltham, CA, USA). The sampled individuals were genotyped with SNP markers, in the Sequenom MassARRAY iPLEX Gold assay (Sequenom, San Diego, CA, USA), performed at the genotyping and sequencing facility of Bordeaux, France (http://www.pgtb.u-bordeaux2.fr/). The 80 SNPs used here were originally developed for paternity recovery in a maritime pine breeding population (Vidal et al. 2015). These SNPs were selected from a 12-k Infinium SNP-array (Illumina, San Diego, USA) developed by Chancerel et al. (2013), and each had a minor allele frequency greater than 0.45 and low levels of linkage disequilibrium (r
v
2 < 0.3).
Assignment of parentage for the preselected trees
Likelihood inference was carried out with Cervus 3.0 (Kalinowski et al. 2007; Marshall et al. 1998), both to check the identity of the maternal parent and to recover the identity of the paternal parent for each of the preselected G2 trees. Cervus was run assuming a 0.1% genotyping error rate. The female parent was confirmed if the LOD score (likelihood ratio estimated over all loci, Marshall et al. 1998) was positive, and only one mismatch allele was allowed for each progeny and its supposed female parent. For paternity recovery, 90% of the pollen donors were considered to have been sampled (Vidal et al. 2015). The delta score (i.e., the difference in LOD scores of the two most likely candidate parents) was used as a criterion for paternity assignment at the 99% confidence level. The critical values of delta scores were based on simulations of 100,000 progeny. One mismatch allele was allowed between a given progeny and its male parent.
Final selection for clonal seed orchard establishment
OPSEL 1.0 software (Mullin 2014) was used for the optimal selection of a production population (virtual CSO), maximizing genetic gains while imposing various constraints on coancestry within the selected population. Constraints on coancestry were based on the minimum status number Ns. The status number of a population describes the effective number of individuals, i.e., the corresponding number of unrelated and non-inbred individuals (Lindgren et al. 1997). Three levels of coancestry constraints were tested: either no restriction on Ns, Ns = 10, or Ns = 20.
The “optimum selection of seed orchard method” was used, allowing unequal numbers of ramets per genotype in the CSO.
The final selection strategies studied were as follows:
-
Forward (FOR) selection based on preselection PS1 or PS2: The candidate genotypes were G2 trees for which a complete pedigree had been recovered. Genetic evaluation was carried out with Index_FP (i.e., with EBVs estimated from the full pedigree model)
-
Backward (BACK) selection: all the 166 G1 seed donors evaluated in the polycross trial were candidates. Genetic evaluation was carried out with Index_PP (i.e., with EBVs estimated from the partial-pedigree model)
-
-Mixed (MIX) forward-backward selection: G2 trees for which a complete pedigree had been recovered, and all 166 seed donors were candidates. The genetic evaluation was carried out with Index_FP for G1 and G2 individuals.
The target number of selected ramets constituting the CSO was set at 600 (named “census size” in OPSEL). For logistical reasons, the number of ramets per genotype was set at a maximum of 50 for G2 trees and a maximum of 200 for G1 trees (several ramets of G1 trees were available from clonal archives, but this was not the case for G2 trees, with only one tree per genotype, limiting the number of available scions for grafting).
Estimation of genetic gain for seed orchards
The expected genetic gain (ΔG) was calculated as \( \Delta G={\mathrm{CV}}_{\mathrm{a}}{\sum}_{i=1}^n\mathrm{EBV} i\ \mathrm{p} i \), where CVa is the additive coefficient of variation of the base population (G0 trees), EBVi and pi are the estimated breeding value and the proportion of ramets in the CSO of genotype i, respectively, and n is the number of different genotypes in the CSO.
CVa values for height, girth, and stem sweep were extracted from the article by Bouffier et al. (2008) and were calculated as CVa = σ
a/μ, where σ
a is the square root of the additive genetic variance and μ is the mean value for the trait. Expected genetic gains are expressed as a percentage relative to G0 trees (plus trees) performances.