Introduction

The use of inorganic nitrogen (N) was a key driver of the Green Revolution in the mid-twentieth century and helped to dramatically improve the yields of major crops. Worldwide, the use of N fertilizers increased by 430 % from 1965 to 1998 (Mosier 2002) and new varieties were selected for their ability to respond to high N inputs and in particular for their improved resistance to lodging. However, massive fertilization has major environmental drawbacks, with nitrate leaching and greenhouse gas emissions, resulting in water and air pollution (Hirel et al. 2011). In addition, the high cost of N fertilizer also has a significant impact on farmer incomes. Therefore, reducing N inputs is a major issue for achieving sustainable agriculture at the agronomic, environmental and economic level in the future.

Oilseed rape (OSR) is a crop of high economic value, with Canada (15.4 MT in 2012; 1,840 kg/ha), China (14 MT; 1,920 kg/ha), France (5.4 MT; 3,400 kg/ha) and Germany (4.8 MT; 3,700 kg/ha) as the main producers (FAOSTAT 2012). It is grown mainly for its oil-rich seeds (~40–45 % of the seed dry matter) used for human consumption and in industrial applications. The seed cake contains ~30–35 % protein and is used as animal feed. Thus, grain yield as well as increased seed oil and protein are major targets for breeding programs.

N use efficiency (NUE), which is the seed yield achieved per N unit available to the crop, is the product of two components: the N uptake efficiency (NUpE), the proportion of available N that is taken up by the crop, and the N utilization efficiency (NUtE), the grain yield achieved by N unit absorbed by the crop (Moll et al. 1982). Compared to other crops, OSR has a low apparent NUE [15.3 kg seed/kg available N vs. 35–40 kg/kg for cereals (CETIOM 2011)]. This is partly due to the higher energy content of OSR seeds compared to cereals grains. Potential improvements of NUE in OSR would include optimisation of traits related to NUpE (e.g., rooting traits, duration of N uptake, total N accumulated during vegetative growth) as well as NUtE (e.g., N remobilization during leaf senescence, increased harvest index).

However, although NUE improvement has been considered as a major goal (Yau and Thurling 1987; Rathke et al. 2006; Schulte auf’m Erley et al. 2007; Berry et al. 2010; Miro 2010; Kessel et al. 2012), no varieties have been specifically generated to address this issue as yet, with the exception of a genetically modified OSR line overexpressing the barley alanine aminotransferase gene in the roots (Good et al. 2007). Therefore, better knowledge of the genetic adaptation of OSR to N constraints is needed. Unravelling the genetic control of the traits contributing to the yield of OSR grown under low N input is a way to improve NUE and to propose new N-efficient OSR cultivars (Brancourt-Hulmel et al. 2005; Berry et al. 2010). This will lead to (1) improved knowledge of the allelic diversity and the genetic and molecular determinism of yield and yield components under low N constraints and (2) optimal allele mining for pre-breeding.

Grain yield is a very complex trait in OSR compared to other crops. The complexity is mainly related to the potential of OSR for growth and branching after flowering which enable the crop to use one yield component to compensate for limitations in another one. As a consequence, a given final yield can result from different combinations of yield components (number of plants per m2, number of pods per plant, seed weight, seed quality…) (Diepenbrock 2000). All these components are impacted by developmental traits (flowering time, seed filling duration), and environmental conditions (climatic conditions, water and fertilizer availability). These traits are all under polygenic control and previous analyses identified several quantitative trait loci (QTLs) for seed oil content (Ecke et al. 1995; Gül 2002; Burns et al. 2003; Delourme et al. 2006; Qiu et al. 2006; Zhao et al. 2006; Mei et al. 2009; Zou et al. 2010), seed yield (Udall et al. 2006; Radoev et al. 2008; Basunanda et al. 2010; Chen et al. 2010), plant height (Basunanda et al. 2010), thousand seed weight (Basunanda et al. 2010; Ding et al. 2012), number of seeds per area (Ding et al. 2012), number of pods per area (Radoev et al. 2008) and flowering time (Chen et al. 2010; Honsdorf et al. 2010). However, only a small number of studies were carried out under abiotic stress, such as cold stress (Kole et al. 2002; Asghari et al. 2007), boron stress (Xu et al. 2001) or phosphorus stress (Yang et al. 2010; Ding et al. 2012). In the context of N stress, QTL analyses have been performed on major cereal crops, including barley (Kjaer and Jensen 1995; Gorny and Sodkiewicz 2001; Mickelson et al. 2003), maize (Hirel et al. 2001; Coque et al. 2008), rice (Lian et al. 2005; Cho et al. 2007) and wheat (Habash et al. 2007; Laperche et al. 2007) whereas there have been a couple of studies carried out on OSR (Gül 2002, Miro 2010).

The aim of the present study was to extend our knowledge of the genetic control of yield and its components under N constraints. For this, a genetic analysis of yield components in a winter OSR segregating population grown under two contrasting N conditions was undertaken. Our objectives were to (1) determine the main genomic regions(s) involved in yield and its components, (2) understand the organization of the intricate network of QTLs in the Brassica napus genome, and (3) assess their stability over years and contrasting N nutrition regimes in order to identify the regions with potential for use in breeding programs.

Materials and methods

Plant material and genetic map

A population of 112 doubled haploid (DH) lines was derived from an Aviso × Montego (AM) cross. The parental lines were chosen for their contrasted yield response to a change in the N nutrition regime: Aviso uses N more effectively for yield and shows a smaller difference in seed yield between the two N regimes than Montego (unpublished results). The parental lines were used as controls in the trials. The AM genetic map was described by Delourme et al. (2013) and comprises 2301 SNPs representing 831 unique loci, covering a total length of 1,947.3 cM, at a density of one marker every 2.34 cM. Chi square tests for goodness of fit (1:1; p value = 0.01) on the 2301 SNPs revealed that 20.6 % of the markers were in segregation distortion at the whole genome level. The linkage groups (LGs) with the highest proportion of loci in distortion were A2 (99.5 %), A7 (35.3 %), C5 (71.4 %) and C9 (86 %).

Field trials and trait measurements

The AM population was evaluated in Le Rheu (LR) located in Brittany (France) during the 2010–2011 (48°8′21.63″N–1°48′9.26″O), 2011–2012 (48°8′31.77″N–1°46′59.76″O) and 2012–2013 (48°8′21.76″N–1°46′56.23″O) cropping seasons. The LR station has a deep loamy soil (58 % silt, 24 % sand, 18 % clay, and depth >80 cm). Plants were grown under two N regimes (N1: low; N2: optimal) as described in detail below. In order to limit the amount of mineral N in soil in the experimental plots, no organic matter was spread on the fields for 3 years before the trials and the previous crops were grown under a low input management system (see Supplementary Data S1a).

Experimental design

All trials were designed as split-plots with N as the main plots and genotypes as subplots. The 2010–2011 trial (hereafter referred to as LR11) involved three replicates. The 2011–2012 and 2012–2013 trials (hereafter referred to as LR12 and LR13 respectively) involved four replicates. Plants were sown in 10.5 m2 plots at a density of 45 seeds/m2. Plants were sown in early-to-mid September and the entire plots were harvested at the beginning-to-mid July. Details of the crop management strategy used in LR11, LR12 and LR13 are shown in Supplementary Data S1b. In order to estimate the N mineralization and leaching, extra plots that were kept empty of plants were added on the borders of the experimental plots.

Characterization of the crop sites

Several soil and climatic variables were recorded throughout the crop cycle. The mineral N soil content was measured for each control under both N conditions and on the extra plots empty of plants at three dates: just before sowing, at the end of winter and after the seed harvest. Homogeneous samples of soil (50 g) were collected for three horizons (0–30, 30–60, 60–90 cm). Mineral N was estimated using the Kjeldahl method (Kjeldahl 1883) (NO3− and NH4+ ions were scored separately). The N soil values were used to calculate the required N fertilization (see Eq. 1).

Daily air temperature (in °C), rainfall (in mm), global radiation (in J/cm2) and Penmann evapotranspiration (in mm) were recorded throughout the whole crop cycle by the INRA meteorological station located at Le Rheu and endorsed by Météo France (station no 35240001). These data identified four climatic periods: autumn, winter, spring and grain filling period (GFP). The autumn period started with the sowing and ended on the first day that the air temperature was below 0 °C. The winter period lasted as long as the daily mean air temperature was below 0 °C. The spring period extended from the end of winter to the beginning of flowering, which was calculated as the mean of the flowering date for all the genotypes. Flowering was defined as 50 % of the plants showing 10 % open flowers on the primary inflorescence, defined as the 61 stage according to the BBCH scale for OSR (Lancashire et al. 1991). Finally, the GFP lasted from the beginning of flowering until seed harvest. For each period, the values of the cumulated temperatures (growing degree day, GDD), cumulated rainfalls (in mm), cumulated global radiation (in MJ/m2) and cumulated Penmann evapotranspiration (in mm) were calculated.

Management of N fertilization

N fertilization was calculated using the balance sheet method that is commonly used in France for the main arable crops (Rémy and Hébert 1977; Parnaudeau et al. 2009). The N doses were calculated as follows:

$$X = [(6.5 \times SY) \, - Ri + Rf - Pi - Mn)]$$
(1)

where SY is the yield objective (defined in our study at 3.5 t/ha for N2 and 2.0 t/ha for N1), Ri is the mineral N soil amount measured at the end of winter (as described above), Rf is the mineral N soil amount at seed harvest (estimated at 30 kg N/ha for deep loamy soils in Brittany according to the CETIOM reference values), Pi is the amount of N absorbed by the plants at the end of winter (determined as described below) and Mn is the estimated amount of N mineralized during spring from the soil organic matter (estimated at 50 kg/ha according to the CETIOM). The N1 regime corresponded to 40 kg N/ha for LR11 and 0 kg N/ha for LR12 and LR13. The N2 regime corresponded to 130 kg N/ha for LR11 and to 80 kg N/ha for LR12 and LR13. All applications were made using liquid fertilizer with a 39 % N solution (50 % urea, 25 % nitrate and 25 % ammonium) at two dates (the beginning of stem elongation and during spring elongation) as recommended for OSR crop management in Brittany. In this region, autumnal mineralization is sufficient to cover rapeseed N needs until the start of stem elongation, avoiding fertilizing during that period. Indeed, at the end of winter, the amount of N soil in the 0–90 cm profile ranged from 54 to 143.4 kg/ha under empty plots and from 11 to 52.5 kg/ha under the plots with plants (Supplementary Data S1c), reflecting huge amounts of N available in the soil at this period and a high N absorption by the plants.

The N nutrition index (NNI) was measured on the controls at the end of autumn (date 1), the end of winter (date 2) and during spring elongation (date 3). At dates 1 and 2, no N fertilizer had been applied yet, so all the plants were at the same N nutrition level. A surface of 1 m2 was harvested and above ground plant tissue was directly weighed to determine the fresh matter weights then dried (70 °C o/n) for dry matter measurements and finally ground and used to estimate the N content using the Dumas combustion method. The NNI values were then calculated according to Colnenne et al. 1998. The plants were considered stressed if the NNI values were below 0.8 (Colnenne et al. 1998). A t test was performed to compare the NNI values at date 3 between the N conditions.

Trait measurements

The traits measured were as follows. The days to flowering (DTF in days) was the number of days from the 1st January until the day when 50 % of the plants showed 10 % of open flowers on the primary inflorescence. The seed yield (SY in t/ha) was determined for each plot from a sample of 200 g of seeds, adjusted to 0 % water content and 0 % impurities. The thousand seed weight (TSW in g) was determined by weighing and counting an aliquot of fully dried seeds. The seed oil content (O in % of the seed dry matter) and the seed protein content (Pr in % of the seed dry matter) were estimated using near-infrared reflectance spectroscopy (Foss 6500 NIRS equipment) using commercial calibrations developed for OSR (P. Dardenne, Univ. Gembloux, Belgium; equation #5colz38.eqa). In addition, oil yield (OY = O × SY, in t/ha), protein yield (PrY = Pr × SY, in t/ha) and the seed number/m 2 (SN = (SY × 100000)/TSW) were calculated. In order to evaluate the response of the genotypes to a change in N regime, a number of ratios with values obtained under N1 and N2 nutrition levels were calculated for the seed yield, the seed number/m2, and the oil yield and protein yield traits. The ratio (N1/N2) estimated the deviation from the linear relationship between N1 and N2 (Laperche et al. 2007). The term (ΔN/N2, where ΔN = N2 − N1) expressed the QTL × N interaction adjusted to the value of the trait under optimal fertilization conditions (N2).

Phenotype data analysis

All statistical analyses, including QTL analyses, were carried out with R software (RCoreTeam 2013). The different mixed models were analyzed using the lme4 package (Bates et al. 2013).

A combined mixed linear model was fitted on the 112 DH lines and the parents for each trait (P) using the REML method, with all 3 years combined.

$$P_{ijkl} = \mu + G_{i} + N_{j} + Y_{l} + G_{i} \times N_{j} + Y_{l} (R_{k} ) + G_{i} \times Y_{l} + e_{ijklm}$$
(2)

where P ijkl is the phenotypic value, µ is the population mean, G i stands for the genotype i, N j for the N nutrition condition j, R k for the replicate k and Y l for the year l and e ijklm is the residual.

To test for genotype (G), N nutrition condition (N), year (Y), genotype × year (G × Y) and genotype × N (G × N) effects, these terms were first considered as fixed. The replicate (R) was considered random. In a second model, we considered only N j as the fixed term, in order to estimate G, Y, G × N and G × Y variances.

Based on the model 2, broad sense heritability was then calculated as:

$$h^{2} = \frac{{\sigma^{2}_{G} }}{{\sigma^{2}_{G} + \frac{{\sigma^{2}_{G \times N} }}{n} + \frac{{\sigma^{2}_{G \times Y} }}{y} + \frac{{\sigma^{2}_{e} }}{y \times n \times r}}}$$
(3)

where \(\sigma^{2}_{G}\) is the genetic variance, \(\sigma^{2}_{G \times N}\) the G × N variance, \(\sigma^{2}_{e}\) the residual variance, \(\sigma^{2}_{G \times Y}\) the G × Y variance, n the number of N conditions, y the number of years, and r the number of replicates per genotype per N and per year.

A second mixed linear model was applied to each year.

$$P_{ijkl} = \mu + G_{i} + N_{j} + R_{k} + G_{i} \times N_{l} + e_{ijkl}$$
(4)

where P ijkl is the phenotypic value, µ is the population mean, G i stands for the genotype i, N j for the N nutrition condition j, R k for the replicate k and e ijkl is the residual. R was considered random and G, N and G × N were tested as for model 2.

The corresponding heritabilities were assessed as follows:

$$h^{2} = \frac{{\sigma^{2}_{G} }}{{\sigma^{2}_{G} + \frac{{\sigma^{2}_{G \times N} }}{n} + \frac{{\sigma^{2}_{e} }}{n \times r}}}$$
(5)

A last mixed linear model was applied to each year × N combination:

$$P_{ijkl} = \mu + G_{i} + R_{k} + e_{ijk}$$
(6)

where P ijkl is the phenotypic value, µ is the population mean, G i stands for the genotype i, R k for the replicate k and e ijk for the residual. All terms were declared as random. The heritabilities were estimated for each N condition and each year with the following formula:

$$h^{2} = \frac{{\sigma^{2}_{G} }}{{\sigma^{2}_{G} + \frac{{\sigma^{2}_{e} }}{r}}}$$
(7)

The Pearson’s correlation coefficients between the different traits were also calculated.

Linkage analysis: multiple QTL mapping (MQM)

A multiple QTL mapping model was tested using the R/qtl package (Broman et al. 2003) for each trait in each year × N combination and for each N-responsive trait on the 2301 SNPs. For each trait and each genotype, the adjusted means were considered according to model 6. We performed a stepwise selection of QTL (forward and backward), allowing for QTL-pairwise interactions, using the Haley-Knott regression method (stepwiseqtl function). The Maximum QTL number (max.qtl) was set to five. Thresholds for incorporating new additive QTLs and epistatic interactions into the model were calculated using 100 permutations with α = 0.05 (calc.penalties function). The chosen multiple QTL model corresponded to the model with the largest penalized LOD score (pLOD). An ANOVA was fitted to the chosen multiple QTL model (fitqtl function). We retained QTLs in the final model when their effects were significant (α = 0.05). Based on the same ANOVA, the percentage of variation explained by the global model and the R2 of each QTL were assessed. The fitqtl function also provided the LOD value for each QTL. We finally assessed the confidence intervals of the QTL with a LOD drop of 1 (scaneone and lodint functions). QTL positions and marker names at these positions, confidence intervals, percentages of variation explained by each QTL (R2), and favorable alleles were scored.

Results

Characterization of the crop sites

Overview of the climatic data

The growing cycles lasted for 287, 307 and 315 days for LR11, LR12 and LR13 respectively (see Supplementary Data S2). The number of days in the four periods (autumn, winter, spring and GFP) were 73/19/100/95 days in LR11, 87/39/66/115 days in LR12 and 97/45/55/118 days in LR13. Due to the cool and wet oceanic climate in Brittany, there were only 8, 10 and 5 days with a mean air temperature below 0 °C for LR11, LR12 and LR13 respectively. Therefore, the winter period was too short to discriminate the 3 years. However, the difference of cumulative GDD in autumn between the years (819.5 GDD/1,096.2 GDD/1,058.3 GDD in LR11, LR12 and LR13 respectively) could explain these disparities. In addition, the start of spring growth differed between the years (approximately one month difference between LR11 and the two other years) and resulted in differences in DTF, in cumulated GDD in spring and in GFP phases (Supplementary Data S2), which could also explain the variations observed.

The cumulated temperature values for the 3 years were above the minimum requirement of 2330 GDD as defined by the CETIOM for rapeseed (Merrien and Landé 2009) with the lowest value in LR11 (2,721.4 GDD) and the highest value in LR12 (3,346 GDD). In addition, the lowest cumulated radiation in the overall cycle was in LR11 (3,007.31 MJ/m2 compared to 3,174.42 and 3,352.91 MJ/m2 for LR12 and LR13 respectively). The cumulated rainfall value was the highest in LR13 during the growing cycle (786.5 mm compared to 455.5 and 566 mm for LR11 and LR12 respectively). When the cumulated rainfall values were compared between the corresponding periods for the three years, LR12 had the lowest value during the spring period (85.5 mm vs. 148 and 174.5 mm in LR11 and LR13 respectively) and the highest value during the GFP (297 mm vs. 104 and 188 mm in LR11 and LR13 respectively). The cumulated Penman evapotranspiration values over the whole cycle were relatively constant between the three years (457, 440.3, and 489.9 mm for LR11, LR12 and LR13 respectively). Overall from rainfall and ETP values we can conclude that water was not limiting at any time during the 3 years. In summary, LR11 had a short winter period with overall lower average daily temperatures and global radiation over the whole growing cycle. The climatic periods for LR12 and LR13 were of similar duration, with shorter spring periods and longer GFP than LR11. In addition, LR12 and LR13 were characterized by high rainfall values (especially in LR13 with a rainfall value around 1.5 fold higher than the other two locations) and higher cumulated radiation values than LR11.

Characterization of N constraints

Assessment of the values of the NNI, the biomass dry matter (DM) and N accumulated in the aerial parts of the controls, as well as mineral N soil during the crop cycle gave an estimate of the N constraints (Fig. 1; Supplementary Data S1c, S3). At the end of autumn (date 1), no N deficiency was recorded, since the NNI mean values ranged between 0.91 and 1.18 and the biomass DM values ranged from 1.26 to 2.81 t/ha (Supplementary Data S3). At the end of winter (date 2), two scenarios were observed. On the one hand, plants were moderately stressed in LR11 (NNI values at date 2–0.8 for both genotypes), probably due to N leaching during autumn and early vegetative regrowth. On the other hand, there was an excess of N in LR12 and LR13 (NNI mean values were up to 1.15 and 1.26), which was confirmed by the high amounts of N soil recorded on empty plots (143.4 and 73.5 kg N/ha in LR12 and LR13 respectively, Supplementary Data S1c) and the high amount of N in the aerial parts of the plants (114.53 and 101.44 kg N/ha in LR12 and LR13 compared to 50.50 kg N/ha in LR11, Supplementary Data S3). This is probably due to a high rate of N mineralisation caused by the mild and humid conditions during the fall and winter seasons in LR12 and LR13.

Fig. 1
figure 1

NNI values measured on the Aviso and Montego genotypes at the end of autumn (date 1), the end of winter (date 2) and during the spring elongation (date 3), in LR11, LR12 and LR13. The average NNI values resulting from three (LR11) or four (LR12 and LR13) replicates are indicated on the plot along with the standard errors bars. The plants were considered stressed if the NNI values were below 0.8. Significant differences in the NNI values between N conditions are indicated as follows: ***p value < 0.001, **0.01 < p value < 0.001, *0.05 < p value, NS non-significant

At the flowering stage (date 3), NNI values between the two N conditions were significantly different for Montego in all years and only in LR13 for Aviso (p value < 0.05) (Fig. 1). When grown under the N1 condition, no N stress was recorded for Aviso, which always showed higher NNI values than Montego and N stress was observed for Montego in LR11 (NNI value at date 3 = 0.75) and LR13 (NNI value at date 3 = 0.77).

Analyses of phenotypic data

The phenotypic data are presented for each trait and each trial (year × N combination) in Table 1. Seed yield was higher in N2 than in N1, except for LR12 where no significant difference was observed (Fig. 2). Positive correlations were found between seed yield, seed number/m2, oil content and oil yield, with the highest correlation values observed between seed yield and oil yield (≥0.98) and between seed number/m2 and seed yield (≥0.86) (Supplementary Data S4). In contrast, no significant correlation was found between seed yield and the TSW (Supplementary Data S4), suggesting that seed yield depends primarily on the seed number/m2 and not on the weight of a grain, as illustrated in Fig. 2. A strong negative correlation was found between the oil and protein seed contents (correlation values ranged between −0.63 and −0.77).

Table 1 Phenotypic values for each trait in each combination year × N (mean ± standard error (minimum–maximum))
Fig. 2
figure 2

Distribution of the main yield components in the AM population evaluated 3 years under two N conditions. Boxplots represent the distribution of the seed yield (SY), the thousand seed weight (TSW) and the seed number/m2 (SN) values acquired in LR11, LR12 and LR13 trials for conditions N1 and N2. SY is expressed in t/ha, TSW in g and SN in number of seeds per m2

The results of the mixed model (Eqs. 2 and 4) along with the heritability values (h2, Eqs. 3, 5 and 7) are shown in Tables 2, 3 and 4. A significant genotype effect was found for all the traits in every year. Similarly, a significant effect of N nutrition was detected for all the traits except for the seed number/m2 in LR12. No genotype × N interaction was found except for the TSW trait in LR12. Heritability values (h2) for all the years (model 3) were high and ranged from 0.74 (protein yield) to 0.94 (seed oil content). The h2 values were not very different between the two N conditions (model 7, Table 4). When each year was compared separately with both N conditions combined (model 5), the h2 values were higher for the LR12 station for every trait studied except for TSW and seed protein content. This suggests a less stressful environment in LR12 compared to the other years, leading to a lower residual variance and thus to higher h2 values. The lowest h2 values were recorded for the protein yield in LR13, regardless of the model used.

Table 2 Results of the mixed linear model 2: \(P_{ijkl} = \mu + G_{i} + N_{j} + Y_{l} + G_{i} \times N_{j} + Y_{l} (R_{k} ) + G_{i} \times Y_{l} + e_{ijklm}\); mean square (MS) values along with significance of the genotype (G), the year (Y), the nitrogen level (N), the genotype × nitrogen interaction (G × N) and the genotype × year (G × Y) effects on every trait studied is indicated
Table 3 Results of the mixed linear model 4: \(P_{ijkl} = \mu + G_{i} + N_{j} + R_{k} + G_{i} \times N_{l} + e_{ijkl}\); significance of the genotype (G), nitrogen level (N) and the genotype × nitrogen interaction (G × N) effect is assessed for each year (LR11, LR12 and LR13) separately
Table 4 Heritability (h2) values are shown for each trait according to the models 3, 5, and 7

QTL analysis

One hundred and four QTLs were detected all over the Brassica napus genome

A total of 104 QTLs were detected when all traits and all environments (year × N) were considered. These QTLs were distributed all over the rapeseed genome with the exception of the A8, A9 and C7 LGs, to which no QTLs localized. The LGs A5 (15 QTLs), C3 (14 QTLs), A1 (13 QTLs) and C8 (10 QTLs) carried the highest number of QTLs. The number, position, LOD score and R2 of the QTLs detected for each trait and environment (year × N) are summarized in Table 5. The QTLs were mapped onto the AM genetic map (Fig. 3). Two QTLs were considered similar if their confidence intervals overlapped.

Table 5 Results of the MQM analysis for each combination site × N
Fig. 3
figure 3figure 3

Schematic representation of the 104 QTLs identified in this study on the “Aviso × Montego” genetic map. Each QTL is labeled with the name of the corresponding trait (DTF, SY, SN, TSW, O, Pr, OY, PrY), the year (LR11, LR12, LR13), the N condition (N1, N2) and the R2 value given in %. The N1-QTLs are written in blue, the N2-QTLs in red and the N-responsive QTLs in green. The QTLs found in LR11 are represented with plain lines, the QTLs found in LR12 by dashed lines and the QTLs found in LR13 by dotted lines. The DTF regions are indicated by pink bars on the left side of the LGs. The eleven critical regions involved in yield and yield components are indicated by: (1) green bars for regions containing QTLs for multiple traits and (2) orange bars for regions containing QTLs for one trait. Refer to Table 5 for exact positions of the QTLs and the names of the markers at these positions, and to Supplementary Data S5 for the description of the eleven critical regions. (Color figure online)

Most of the QTLs were stable across N conditions but not throughout the years

Among the 104 QTLs, 28 were related to flowering time (DTF) of which 24 were highly stable among the trials and the N nutrition conditions, and were located on four main genomic regions on LGs A1, A2, C2 and C6 (Table 5; Fig. 3). The four remaining DTF QTLs were found on LGs A6 (identified in LR12 under N2 condition), A7 (LR11, N1 and N2 conditions), and A10 (LR11, N1 condition). The DTF QTLs on the A1, A10 and C6 LGs were putatively co-localized with other QTLs for yield components: seed number/m2 and TSW on A1, seed number/m2 on A10 and seed protein content on C6. In contrast, those located on the A2, C2, A6 and A7 LGs were DTF specific loci.

Concerning the QTLs related to yield components, a total of 76 QTLs were detected of which 36 were revealed under the N1 nutrition condition, 37 under the N2 condition and three were N-responsive QTLs (Table 5; Fig. 3). Some QTLs were common to the two N conditions and others were specific to one N condition only (Fig. 4a). For instance, in LR11, 29 QTLs were found in total with 12 N1 specific QTLs, nine N2 specific QTLs and four QTLs found in both N nutrition conditions; in LR12, 22 QTLs were found in total with three N1 specific QTLs, three N2 specific QTLs and eight found in both N nutrition conditions; in LR13, 22 QTLs were found in total, with two N1 specific QTLs, six N2 specific QTLs and seven QTLs found in both N conditions. Very few QTLs were common to at least two out of the 3 years (Fig. 4b). Thus, when considering the N1 condition alone, no QTLs were common to the 3 years, four QTLs were identified in 2 years and 28 QTLs were specific to 1 year. Considering the N2 condition only, none of the QTLs were common to the 3 years, five were found in 2 years and 27 were specific to 1 year.

Fig. 4
figure 4

Number of QTLs detected for all traits. a Number of QTLs found in each N condition for each year. The numbers at the intersections of the circles correspond to the QTLs common to the two N conditions. b Number of QTLs found in each year for each N condition. The numbers at the intersections of the circles correspond to the QTLs common to 2 or 3 years of trial

Eleven main genomic regions are critical for the elaboration of oil yield

To obtain an overview of the key genomic regions involved in yield within the AM population, we tried to group the 76 QTLs controlling yield components using the following criterion: a critical region must carry overlapping QTLs for one or several traits that were stable in at least two environments (N and/or site). The confidence intervals of the critical genomic regions were defined as the overlapping of constitutive QTLs confidence intervals. As a result, we identified 11 critical genomic regions that were located on seven LGs as shown in Fig. 3 and Supplementary Data S5.

Seven of these 11 regions carried QTLs for only one trait of which four were for TSW (Region (R)-A4, R-A10-a, R-C1-b and R-C3-c), two for oil content (R-C1-a and R-C3-b) and the last one for seed number/m2 (R-A10-b). In addition, four of the seven mono-trait regions were specific to 1 year (R-A10-a, R-A10-b, R-C1-b and R-C3-b specific to LR13, LR11, LR13 and LR13 respectively). The four remaining regions (namely R-A1, R-A5, R-C3-a and R-C8) carried QTLs for multiple traits and were designated as yield-related regions. Three of these four regions were specific to one site (R-A1, R-C3-a and R-C8 specific to LR13, LR11 and LR12 respectively) whereas the last one (R-A5) carried QTLs found in 2 years (LR11 and LR12). The most promising region appears to be R-A5 (10–64 cM) to which 12 stable QTLs for seed yield, oil yield, protein yield and seed number/m2 were localized (QTLs found in N1, N2, LR11 and LR12).

Epistatic interactions

Only one epistatic interaction was detected in the N1 condition of the LR11 trial with an interaction between QTLs of TSW found on the C3 and C9 LGs. The percentage of variation explained by this interaction was 6.48 % where the main effects were 20.9 and 18.66 % for the C3 and C9 TSW QTLs, respectively.

Discussion

The aim of the present study was to determine the genetic regions involved in grain yield and yield components in OSR grown under contrasting N regimes over 3 years of trial. The high number of QTL detected in our study highlights the complexity of the genetic control of yield in OSR. Eleven critical genomic regions carrying stable QTLs across years and/or N conditions were detected and are promising potential candidates for OSR breeding programs. A particularly dense region on the A5 LG (R-A5) grouped 12 QTLs related to yield components. The potential for using these regions in breeding strategies is discussed.

Power of QTL detection in the AM population

Flowering time is a major developmental trait known to interact with yield components (Diepenbrock 2000). We investigated DTF QTLs and found 28 loci involved in the genetic control of flowering time. Most of the DTF QTLs defined four genomic regions located on A1, A2, C2 and C6 that were stable across the environments (year × N). The DTF region on C2 corresponded to the FLC4 gene (data not shown) previously cloned in B. napus (Tadege et al. 2001). In addition, DTF QTLs located on A1, A2 and C6 were also reported previously in different genetic backgrounds (Delourme et al. 2006). The DTF region on A1 showed a possible co-localization with the yield-related genomic region R-A1.

Seventy-six yield-related QTLs were identified which explained from 3.4 (QTL for protein content on C6) to 34.6 % (seed yield QTL on A5) of the total variation of the traits studied. Their confidence intervals ranged from 5.7 (oil yield QTL on A5) to 76.4 cM (QTL for protein content on A5). This can be explained by the relatively small size of the AM population (112 individuals) which results in a decreased power of QTL detection (low number of detected QTLs, overestimation of the QTL effects as well as the size of the confidence intervals) (Bernardo 2008). However, due to the experimental design which included two N conditions and several (three to four) replicates, a higher number of individuals would have been very difficult to manage and one hundred individuals should adequately balance the precision of QTL detection and the experimental constraints (Vales et al. 2005).

More than 20 % of the SNPs were in segregation distortion at the whole genome scale, which is commonly observed in DH populations compared to other kinds of segregating populations (Zhang et al. 2010). The impact of segregation distortions on QTL mapping have been reported (Liu et al. 2010) but appeared to be minimized with large populations and when the distance between the distorted markers and the QTLs is over 40 cM (Zhang et al. 2010). In our case, a total of 16 yield-related QTLs should be considered with care due to a possible effect of genetic distortion within their vicinities. These QTLs were located on A2, A3, A7 and C4 and in the two regions R-C1-a and R-C3-a. However, by deleting distorted markers from the analysis we would have run the risk of missing QTLs (Liu et al. 2010). The 60 other yield-related QTLs were far from distorted markers (>40 cM) and can therefore be considered reliable.

Yield-related QTLs were gathered on eleven critical genomic regions with a particular dense seed yield area on the A5 linkage group

We identified eleven critical genomic regions encompassing seven LGs where yield-related QTLs were stable across N conditions and/or years of trial. Seven regions corresponded to mono-traits and the four others were multi-traits. Except for the R-A1, the multi-trait regions carried QTLs for correlated traits. Indeed, QTLs of seed yield and seed number/m2 were often co-localized. This was particularly obvious for the R-A5 region where 12 QTLs for seed yield, seed number/m2, protein yield and oil yield co-localized. Previous studies also reported the co-localization of QTLs controlling yield and yield components in B. napus (Ding et al. 2012) or in other species such as in rice (Wei et al. 2012). This raises the question of whether these regions result from the genetic linkage of several independent QTLs or whether they carry master regulators with pleiotropic effects as suggested by Shi et al. (2009). Considering the complexity of seed yield, it appears more than likely that many genes contribute directly or indirectly to this trait (Slafer 2003) and that both the suggested mechanisms play a significant role. In addition, in our study several QTLs for traits that were negatively correlated also co-localized. For instance, QTLs controlling seed protein content or seed oil content co-localized on A2 although the two corresponding traits were strongly negatively correlated (Supplementary Data S4), thus confirming the literature (Jeuffroy et al. 2006). Here, the two QTLs displayed opposite allelic effects. This demonstrates the complexity of combining favorable alleles for two competitive traits. Focusing on QTLs that may be inherited independently may be a necessary strategy for obtaining high seed oil and protein content. Thus, QTLs of protein content on the A1, A3, A7 and C3 LGs and QTLs of seed oil content on the C1, C3 and C8 LGs could be considered for improving both traits simultaneously, after validation in other environments.

QTLs for yield components on A4 (QTL for TSW, Ding et al. 2012), A8 (QTLs for seed yield, TSW and seed number/area; Shi et al. 2009) and C3 (Zhao et al. 2012) were previously reported in rapeseed, and appear to correspond to the regions R-A4, R-A8 and R-C3-b described in the present study. In addition, several studies already reported the presence of QTLs for seed yield related traits on the A5 LG (Ding et al. 2012; Fan et al. 2010; Shi et al. 2009), which supports the hypothesis that this is a critical region for plant breeding purposes. Our study showed the importance of R-A5 effects on yield related traits (8.82 < R2 < 34.64 %) and its stability with time and environment, which, to our knowledge, had not yet been demonstrated. However, this region needs to be further characterized using additional genetic backgrounds before being considered as a serious candidate for plant breeding. Association mapping would be a relevant tool to both confirms R-A5 in a wide set of genotypes and to determine more precise data on its position and confidence interval.

QTLs were stable across N conditions but differed between trial years

Several genetic analyses conducted under abiotic constraints, including N stress, have been published for crops such as wheat (Campbell et al. 2003; Laperche et al. 2007) or rapeseed (Miro 2010) and reported interactions between QTLs and abiotic stress. However, (Gül 2002; Gül et al. 2003) showed that only few QTLs had a significant interaction with N nutrition regime in rapeseed, which was confirmed by our results. Indeed, in the present study, the QTLs were relatively stable between the N conditions and very few G × N and QTL × N interactions were significant. Only 17–18 QTLs were specific to one or the other N condition. In LR12 and LR13, many QTLs were found to be consistent between N conditions with only six and eight QTLs specific to one N condition in LR12 and LR13 respectively and only three QTLs which reacted to a change in N condition ((N1/N2) and (∆N/N2)) were found in LR12 site on the A5 LG. This could be due to the limited N stress induced in the N1 condition, especially in LR12 and LR13. Another explanation could lie in the ability of B. napus to compensate for a developmental phase that was not optimal (for example, poor N remobilization in the seeds during the GFP) by improving another step of development (for instance, an increase in the number of ramifications leading to an increased number of seeds per area). Furthermore, under low N, rapeseed seed yield is positively correlated with the NUpE, whereas it is more correlated with the NUtE under a high N (Berry et al. 2010; Schulte auf‘m Erley et al. 2011). This suggests that there are physiological differences between genotypes with contrasted NUE [root system, amount of N accumulated in the stems, for instance (Berry et al. 2010)] that were not studied here. Thus, refining the analysis of seed yield by studying physiological traits throughout the cycle would allow pinpointing the exact steps when OSR yield components development strategy is modified under N limitation. Although N stress was not pronounced in our experiments (NNI values close to 0.8 and small differences in seed yield between the two N conditions), the NNI values at date three in N1 were significantly lower than in N2 for Montego in all years and for Aviso in LR13, suggesting that the plants showed a difference in N nutrition between the two N conditions. To explore this diversity of N nutrition, the analysis could focus on the response of rapeseed to a gradient of NNI by examining the trait values as the difference in NNI values between the two N conditions instead of using the values in N1 and N2 per se. Nevertheless, 17 QTLs were found to be specific to the N1 nutrition condition (on LGs A2, A3, A7, C3, C5, C8 and C9) and could be interesting for the adaptation of cultivars to low N fertilization. These QTLs need to be validated in other experiments because they were only detected in 1 year of the trial.

When considering all the N conditions together, 55 QTLs were specific to 1 year, two were common to 2 years and only DTF-QTLs were common to all years. Despite the fact that the 3 years of trials were conducted within the same pedo-climatic area (oceanic climate, deep loamy soil), there may have been interactions between the QTLs and the years. Indeed, the climatic periods (autumn, winter, spring, and GFP) were substantially different from 1 year to another, with for example a mild winter in LR11 and cold spring periods in LR12 and LR13, which could have had an impact on yield and yield components and the determining QTLs. Many studies reported QTLs which could be detected in some environments but not in others in several species including rapeseed (Bernardo 2008; Shi et al. 2009), however, this does not mean that a significant QTL × environment interaction exists. Indeed, a higher value of error variance in an environment may lead to more trouble at detecting a QTL in that environment. Thus the genomic regions may not be reliable through a wide range of environments (Bernardo 2008).

Possible applications in plant breeding programs

Although our results still require further validation, they could already provide some clues for breeding strategies to improve OSR adaptation to low N inputs. The question of the efficiency of indirect versus direct selection under stressed environments has led to contradictory results in other studies depending on the species and the experimental conditions (Brancourt-Hulmel et al. 2005). Bänziger et al. (1997) showed that the genetic correlation between high N and low N condition for seed yield in maize increased with decreasing N stress intensity. In our experimental conditions, the low G × N interaction effects, the low number of N-responsive QTLs as well as the high heritability values of the traits suggested a high genetic correlation between the two N conditions, leading to a similar efficiency of direct versus indirect selection. In addition, our results demonstrated that many yield-related loci were not controlled by plant developmental loci such as flowering time, which opens the way to marker assisted selection (MAS).

MAS programs have increased dramatically in plant breeding since the late 1980s. Those programs concerned essentially traits controlled by a few genes with major effects and which were directly introgressed in the new varieties by marker assisted backcross for example. For complex quantitative traits like yield, the strategy used is to enrich the population with the desired alleles of the targeted QTLs through marker assisted recurrent selection (Bernardo 2008). However, to date, only a few programs were successful compared to the number of linkage analyses published (Bernardo 2008). To be successful, a MAS program should involve traits with high heritability values and include QTLs accounting for a large proportion of the variance.

In the case of a complex trait controlled by many QTLs such as yield, the ideal goal is to pyramid several QTLs of interest into a single cultivar. Depending on the strategy, the breeder might prefer to generate varieties adapted to a large set of growing environments or on the contrary adapted to specific climatic/stress conditions. On the one hand, to ensure yield stability over the years in Le Rheu site, R-A5 associated with R-C1-a would be good candidate regions. Indeed, R-A5 was a strong yield-related region (average R2 value of 23.1 % with Montego as the favorable allele), stable across N and years, and RC1-a, comprised QTLs of oil found in the three years of trial in N1 or N2 (average R2 value of 12.1 % with Montego as the favorable allele). On the other hand, the genomic regions controlling yield traits that were specific to one year of trial could also be exploited to confer adaptability to a wider range of climatic variations as occurred during our sets of trials. Hence, as R-C1-b, R-C3-a, R-C3-b and R-C8 were specific to LR13 (Montego as the favorable allele), LR11 (Aviso as the favorable allele), LR13 (Montego as the favorable allele) and LR12 (Montego as the favorable allele) respectively, they could be used in this strategy. However, before using these QTLs for MAS, we need to reduce their confidence intervals and validate them in different genetic backgrounds, for instance using association mapping methods and by testing them in elite lines.

Conclusion

In a context of reducing inputs in agriculture, there is a huge need for breeding new N efficient rapeseed varieties. The objective would be to introduce genomic regions involved in NUE under low N fertilization conditions in the new varieties. Our study did not highlight QTLs specific to low N conditions; however, 11 regions were found to be stable across N conditions and/or years of trial and could be used for further studies. R-A5 is of particular interest as a dense QTL area which was consistent in both N conditions and two out of the three sites. This region could be a good candidate for introducing stable seed yield and oil yield traits into rapeseed varieties in a breeding program.