Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection

Soto-Cerda, Braulio J.; Duguid, Scott; Booker, Helen; Rowland, Gordon; Diederichsen, Axel; Cloutier, Sylvie

doi:10.1007/s00122-014-2264-4

Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection

Original Paper
Open access
Published: 26 January 2014

Volume 127, pages 881–896, (2014)
Cite this article

Download PDF

You have full access to this open access article

Theoretical and Applied Genetics Aims and scope Submit manuscript

Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection

Download PDF

Braulio J. Soto-Cerda^1,2,6,
Scott Duguid³,
Helen Booker⁴,
Gordon Rowland⁴,
Axel Diederichsen⁵ &
…
Sylvie Cloutier^1,2

4457 Accesses
59 Citations
Explore all metrics

Abstract

Key message

The identification of stable QTL for seed quality traits by association mapping of a diverse panel of linseed accessions establishes the foundation for assisted breeding and future fine mapping in linseed.

Abstract

Linseed oil is valued for its food and non-food applications. Modifying its oil content and fatty acid (FA) profiles to meet market needs in a timely manner requires clear understanding of their quantitative trait loci (QTL) architectures, which have received little attention to date. Association mapping is an efficient approach to identify QTL in germplasm collections. In this study, we explored the quantitative nature of seed quality traits including oil content (OIL), palmitic acid, stearic acid, oleic acid, linoleic acid (LIO) linolenic acid (LIN) and iodine value in a flax core collection of 390 accessions assayed with 460 microsatellite markers. The core collection was grown in a modified augmented design at two locations over 3 years and phenotypic data for all seven traits were obtained from all six environments. Significant phenotypic diversity and moderate to high heritability for each trait (0.73–0.99) were observed. Most of the candidate QTL were stable as revealed by multivariate analyses. Nine candidate QTL were identified, varying from one for OIL to three for LIO and LIN. Candidate QTL for LIO and LIN co-localized with QTL previously identified in bi-parental populations and some mapped nearby genes known to be involved in the FA biosynthesis pathway. Fifty-eight percent of the QTL alleles were absent (private) in the Canadian cultivars suggesting that the core collection possesses QTL alleles potentially useful to improve seed quality traits. The candidate QTL identified herein will establish the foundation for future marker-assisted breeding in linseed.

Quantitative genomics-enabled selection for simultaneous improvement of lint yield and seed traits in cotton (Gossypium hirsutum L.)

Article Open access 26 May 2024

Association mapping of seed quality traits in Brassica napus L. using GWAS and candidate QTL approaches

Article 11 June 2015

Analysis of QTL for seed oil content in Brassica napus by association mapping and QTL mapping

Article 20 December 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Introduction

Oil crops have gained in importance worldwide over the past 20 years as indicated by the increase in total harvested area from 189.3 million hectares in 1992 to 272.7 million hectares in 2011 (FAOSTAT 2013). This increase hinges partly on the versatility of their fatty acid profiles which play a significant role in the nutritional properties and the end-use functionality of oil crops. In this regard, linseed (Linum usitatissimum L.), with its high content of alpha linolenic acid, is unique. With ~23 % of world production, Canada is the world’s largest linseed producer and exporter followed by China and the Russian Federation (FAOSTAT 2013).

Linseed is an annual, self-pollinated species with a genome size of ~370 Mb (Ragupathy et al. 2011). Domesticated in the Near East 9,000 years ago (Harris 1997), linseed is considered the oldest oilseed in the world. Its seed oil (~35–50 %) is composed of five main fatty acids (FAs): palmitic (PAL; C16:0, ~6 %), stearic (STE; C18:0, ~2.5 %), oleic (OLE; C18:1, ~19 %), linoleic (LIO; C18:2, ~13 %) and linolenic (LIN; C18:3, ~55 %) (Westcott and Muir 2003; Diederichsen et al. 2013). The high percentage of LIN distinguishes it from other oilseeds in the industrial, human food and animal feed markets. Its oxidative instability, ensuing in a soft and flexible film, and the absence of volatile organic compounds (formaldehyde, aldehydes and benzene), resulting in reduced environmental hazards (Green et al. 2008), makes linseed oil valuable in industry for paints, linoleum flooring, inks and varnishes (Cullis 2007). In addition, LIN is the precursor of the long chain polyunsaturated fatty acids eicosapentaenoic acid (EPA), docosapentaenoic acid (DPA) and docosahexaenoic acid (DHA) which are synthesized in the human body and recognized for their health benefits (Simopoulos 2000).

Linseed breeders have focused mainly on maintaining the high LIN content, while PAL, STE, OLE and LIO which correlate negatively with LIN tend to be selected against (Cullis 2007). High LIN (>65 %) germplasm is available (Friedt et al. 1995; Kenaschuk 2005), but agronomic improvement of many of these sources is needed to achieve adaptability. The first high LIN linseed cultivar NuLin™ 50 was registered in Canada by Viterra (http://www.viterra.ca). Altered FA profiles in linseed, for example low LIN (2–4 %) and high LIO (>50 %) obtained by mutation breeding, has proven effective in improving the oxidative stability and suitability of linseed oil for a variety of food uses (Green et al. 2008). Green and Marshall (1984) developed linseed lines with reduced LIN content (<29 %) using ethyl methane sulfonate (EMS)-mediated mutagenesis. Further reduction in LIN content to ~2 % was later achieved (Green 1986; Rowland 1991). Fatty acid desaturase 3 genes lufad3a and lufad3b had point mutations causing premature stop codons in one of the characterized EMS mutant lines resulting in non-functional FAD3 enzymatic activity (Vrinten et al. 2005). Additional variations in FA composition, including lines with elevated OLE and PAL content, have also been developed (Green, unpublished data; Rowland and Bhatty 1990).

Various aspects of the genetic control of storage oil biosynthesis in linseed have been studied (Green 1986; Fofana et al. 2004; Sørensen et al. 2005; Vrinten et al. 2005; Khadake et al. 2009; Banik et al. 2011) and new genes such as LuFAD2-2 (Khadake et al. 2009) and fad3c (Banik et al. 2011) encoding FA desaturases have been cloned, broadening the options for modifying linseed FA profiles for new end uses. Generally, oilseed breeding is a more complex undertaking than the breeding of cereals or legumes, as many oilseeds such as soybean, rapeseed, sunflower and linseed have the potential to be dual- or multi-purpose crops, which require the simultaneous manipulation of quality and agronomic traits (Vollmann and Rajcan 2009). Conventional breeding has been conducted in linseed for over a century and has been particularly successful in adapting crop phenology to regional growing seasons as well as providing yield stability across environments (Green et al. 2008). However, the phenotypic selection of quantitative traits, such as oil content and FA composition, is complicated by environmental effects (Cloutier et al. 2011) that significantly reduce breeding gain. In Canada, oil content can vary up to 15 % (range 35–50 %) in individual farm samples (Duguid 2009) and the percentage of LIN can be as much as 5 % higher in cool environments (Fofana et al. 2006).

Consumer awareness of oil quality is becoming an increasingly important variable that conditions shifts in the food ingredient selection process, thereby creating new market opportunities (Wilson 2012). Acceleration of breeding cycles could translate into the edge necessary to respond to these new market demands in a timely fashion.

The use of marker-assisted selection (MAS) for oil content and FA composition can improve the efficiency of traditional linseed breeding. However, MAS requires the development of genomic tools such as molecular markers and linkage maps (Cloutier et al. 2009, 2011, 2012a, b). These tools have been recently developed in linseed, establishing the foundation for the application of MAS (Roose-Amsaleg et al. 2006; Cloutier et al. 2009, 2011, 2012a, b; Deng et al. 2010, 2011; Ragupathy et al. 2011; Soto-Cerda et al. 2011a, b; Kumar et al. 2012; Wang et al. 2012).

Quantitative trait loci (QTL) mapping based on bi-parental crosses has been the most applied approach to map QTL associated with oil content and FA in crops such as rapeseed (Zhao et al. 2005; Hu et al. 2006; Qiu et al. 2006; Smooker et al. 2011), maize (Goldman et al. 1994; Wassom et al. 2008; Yang et al. 2010) and soybean (Chung et al. 2003; Bachlava et al. 2009; Qi et al. 2011; Xie et al. 2012). In linseed, however, only one QTL study related to oil content and FA composition has been carried out, positioning QTL for iodine value (IOD), PAL, LIO and LIN (Cloutier et al. 2011). While QTL mapping has been very successful in detecting QTL, the bi-parental nature of the populations often resulted in large confidence intervals for the QTL positions which, combined with a limited number of alleles at each locus, hindered their applications in MAS (Gupta et al. 2005; Ersoz et al. 2009; Myles et al. 2009).

Association mapping (AM) or linkage disequilibrium (LD) mapping has emerged as a complementary approach to QTL mapping (Myles et al. 2009). Its power relies on the utilization of a large population of individuals with a higher level of allelic diversity that improves the probability of QTL detection and the mapping resolution (Ersoz et al. 2009). AM has been useful in dissecting the complex genetic architecture of oil content and FA composition in oil crops such as rapeseed (Honsdorf et al. 2010; Zou et al. 2010), peanut (Wang et al. 2011), soybean (Li et al. 2011) and maize (Cook et al. 2012; Li et al. 2013). These AM studies not only validated previous results from QTL mapping showing the FA biosynthesis pathway similarity among oil crops, but also identified new QTL and candidate genes useful for improving oil content and quality.

In our previous study, we characterized the flax core collection of 407 accessions assembled from the Canadian flax world collection preserved by Plant Gene Resources of Canada (Diederichsen et al. 2013), and showed its abundant genetic diversity, weak population structure and familial relatedness, and a relatively fast LD decay, all positive attributes for AM studies (Soto-Cerda et al. 2013). In the present study, we conducted AM for oil content and FA composition traits on 390 accessions aiming to identify QTL underlying these seed quality traits, which could be used for accelerating linseed breeding through MAS and for identifying germplasm with desirable characteristics.

Materials and methods

Plant material, genotyping and field experiments

A core collection of 407 L. usitatissimum accessions assembled from the Canadian World collection of flax (~3,500 accessions) (Diederichsen et al. 2013) was genotyped with 460 microsatellite (SSR) markers (Roose-Amsaleg et al. 2006; Cloutier et al. 2009, 2012a; Deng et al. 2010, 2011) distributed across the 15 linkage groups of flax (Cloutier et al. 2012b). The amplification products were resolved on an ABI 3130xl Genetic analyzer (Applied Biosystems, Foster City, CA, USA). Output files were analyzed by GeneScan (Applied Biosystems) and subsequently imported into Genographer. Fragment sizes were estimated using GeneScan ROX-500 (Applied Biosystems) and MapMarker^® 1000 (BioVentures Inc., Murfreesboro, TN) internal size standards. The genotype of each locus was encoded based on its allele size in bp or as a null allele for dominant markers.

The flax core collection was assessed with 259 mapped neutral SSR loci which indicated that all accessions were organized into two major groups (G1 and G3) and one admixed group (G2) with a weak population structure (F _ST = 0.09) (Soto-Cerda et al. 2013). G1 included 90 % of the fiber flax accessions mostly from Western Europe and linseed accessions from South Asia and South America, while G3 included accessions from North America and Eastern Europe and was mostly oil type. A relatively fast genome-wide LD decay of ~1 cM (r ² = 0.1) was estimated (Soto-Cerda et al. 2013).

Phenotypic data were collected from 390 accessions including 381 accessions selected by Diederichsen et al. (2013) and nine accessions of relevance to recent Canadian flax breeding programs. The 390 accessions were evaluated during 3 years (2009, 2010 and 2011) in Morden, Manitoba (MB) and at the Kernen Farm located near Saskatoon, Saskatchewan (SK), Canada, which represent two mega-environments where most of the linseed is produced in Western Canada (http://www.canadagrainscouncil.ca/). A type 2 modified augmented design (MAD) (Lin and Poushinsky 1985) was used to phenotype oil content and FA composition traits. Main plots (2 m long, 2 m wide with 20 cm row spacing) were arranged in grids of ten rows and ten columns. Each main plot was divided into five parallel subplots of two rows each with a plot control (CDC Bethune replicated 100 times) located at the center. Additional subplot controls (Hanley and Macbeth) were assigned to five randomly selected main plots. The 4-m² plots were harvested, threshed and cleaned. Seeds of each plot were subsampled for oil content and FA composition analyses.

Oil content and FA composition analyses

OIL was measured by nuclear magnetic resonance calibrated against the FOSFA (Federation of Oils, Seeds and Fats Associations Limited) extraction method. Methyl esters of FA were prepared according to the American Oil Chemists’ Society (AOCS) (http://www.aocs.org/Methods/index.) Official Method Ce 2-66 (09) and FA composition was determined by capillary gas chromatography (GC), following the AOCS Official Method Ce 1e-91. IOD, a measure of the saturation level of lipids, was calculated from the GC-derived FA composition, following the AOCS Method Cd 1c-85.

Statistical analysis

Adjusted data were obtained for each trait as previously described based on the MAD (You et al. 2013). Normality of the adjusted data was tested using the Shapiro–Wilk test (Shapiro and Wilk 1965) and normal probability plots. The adjusted phenotypic values were used to estimate the variance components to determine the effect of year, location, genotype and their interactions on oil content and FA composition using the GLM procedure in SAS 9.1 (SAS Institute 2004) as described in You et al. (2013). As a measurement of the repeatability of the field trials across years within locations, broad sense heritability (H) on an entry mean basis for each seed quality trait was estimated as follows: \({H} = {\sigma}_{G}^{2} / \left[ {\sigma}_{G}^{2} + \left( {{\sigma}_{GE}^{2}} / {e} \right) + \left( {{\sigma}_{E}^{2}} / {e} \; {r} \right) \right] \) where \( {\sigma}_{G}^{2} \), \( {\sigma}_{GE}^{2} \), \( {\sigma}_{E}^{2} \), e and r correspond to the genetic variance, the genetic by environment interaction variance, the residual variance, the number of environments and the replications per environment, respectively. Pearson’s correlation coefficients (P < 0.001) were used to express the relationships between seed quality traits.

Linkage disequilibrium

An LD heat map was constructed using six linkage groups (LGs) and 158 SSR loci (mean = 1 locus/3.5 cM). The six LGs were selected based on their marker density and differences in size from the consensus linkage map of flax (Cloutier et al. 2012b). The heat map was produced with GGT 2.0 (van Berloo 2008) based on pairwise r ² estimates for all marker pairs with minor allele frequency (MAF) > 0.05 (Breseghello and Sorrells 2006). Allelic frequencies were calculated in GENALEX v.6.41 (Peakall and Smouse 2006) and MAF < 0.05 were set to “U” (missing data) and excluded from the LD analysis. This heat map verified the relationships between genomic regions harboring significant markers and large blocks of LD. The 95th percentile of the distribution of unlinked markers r ² = 0.09 (Soto-Cerda et al. 2013) was used to set the statistical r ² value to determine LD that resulted from physical linkage (Breseghello and Sorrells 2006). Markers on different linkage groups were considered unlinked.

Association mapping

The adjusted phenotypic values of the seed quality traits were used for AM. Five AM models were tested in TASSEL 2.1 (Bradbury et al. 2007) including two general linear models and three mixed linear models (MLMs). The first GLM incorporated the Q matrix as the fixed covariate, while the second used PCA (Price et al. 2006). The first MLM incorporated the kinship matrix (K) (Yu et al. 2006) as a random effect only, while the second and third used in addition the Q matrix and PCA as fixed covariates, respectively. The Q matrix was estimated using 259 mapped neutral SSRs (Soto-Cerda et al. 2013). The PCA matrix calculated in TASSEL 2.1 retained the first three components explaining 27 % of the variation. The K matrix was constructed on the basis of 448 SSRs using SPAGeDi (Hardy and Vekemans 2002). All negative values between individuals were set to zero (Yu et al. 2006). The most suitable AM model was selected using cumulative probability–probability (P–P) plots which indicate the extent to which the analysis produced more significant results than expected by chance. For the AM analysis, only MAF > 0.05 were retained (Breseghello and Sorrells 2006).

AM analyses for the seed quality traits were carried out for each year and location independently. Correction for multiple testing was performed using the qFDR value, which is an extension of the false discovery rate (FDR) method (Benjamini and Hochberg 1995). The q values were calculated with the QVALUE R package using the smoother method (Storey and Tibshirani 2003). Markers with qFDR < 0.01 in at least 2 years were considered significant within location. Further, markers with qFDR < 0.01 in at least four of the six environments were considered consistent associations. For markers significantly associated with a trait, a GLM with all fixed-effect terms was used to estimate the amount of phenotypic variation explained by each marker (R ²). Allelic effects of the significant marker loci were calculated as the difference between the average phenotypic values of the homozygous alleles with MAF > 0.05. The significant differences between the allele means were estimated by the Kruskal–Wallis non-parametric test (Kruskal and Wallis 1952) and visualized as box plots.

Candidate QTL were delineated using the estimated background LD (95th percentile) for unlinked markers r ² = 0.09 (Soto-Cerda et al. 2013) as suggested by Breseghello and Sorrells (2006). Thus, associated markers were considered linked and part of the same candidate QTL if they showed r ² > 0.09. Since markers in the same QTL were closely linked and in significant LD, the amount of phenotypic effect explained by the candidate QTL was estimated using the marker within the QTL with the highest P value as described above for the significant markers.

QTL/marker effects and stability

The QTL/marker effects were calculated as described above for the allelic effects. The stability of a candidate QTL and associated markers was estimated using the additive main effect and multiplicative interaction (AMMI) model (Zobel et al. 1988; Gauch 1992) in GenStat 14 (VSN International 2011). Candidate QTL/markers with a first interaction principal component (IPCA1) near zero are more stable, while those QTL/markers with IPCA1 either positive or negative are more unstable. The AMMI’s stability values (ASV) (Purchase 1997) were also calculated using the following formula:

\({\text{ASV}} = \sqrt {\frac{{{\text{SSIPCA}}1}}{{{\text{SSIPCA}}2}}({\text{IPCA}}1)^{2} + ({\text{IPCA}}2)^{2} }\), where SSIPCA1 is the sum of squares interaction of the first principal component (PC) analysis and SSIPCA2 is the sum of squares interaction of the second PC analysis. The smaller the ASV value, the more stable the candidate QTL/markers are across environments. The stability of QTL/markers based on their IPCA1 was defined as follows: 0 to ±0.5 highly stable; ±0.51 to ±1 stable; ±1.01 to ±1.5 moderately stable; and higher than ±1.51 unstable. The stability of QTL/markers based on their ASV values was defined as follows: 0–0.5 highly stable; 0.51–1 stable; 1.01–1.5 moderately stable; and higher than 1.51 unstable. The QTL/marker effects estimated were decomposed into PCs via singular value decomposition and the first two PCs were plotted for both QTL/markers and environments to form a QTL main effect and QTL by environment interaction (QQE) biplot (Yan and Tinker 2005) using GenStat 14 (VSN International 2011).

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

QTL/marker alleles were defined as alleles of the marker with the largest P value from a QTL or alleles of a significantly associated marker not part of a candidate QTL. With the aim of identifying new potentially favorable QTL/marker alleles absent in linseed Canadian cultivars, the observed number of alleles, the number of private alleles and the allelic richness were contrasted for the 30 linseed Canadian cultivars (Online Resource 1) present in the flax core collection with the remaining 377 of diverse origins (Diederichsen et al. 2013; Soto-Cerda et al. 2013). In addition to the QTL, stable associated markers not part of a QTL but that explained at least 1 % of the phenotypic variation were also included. The number of private QTL/marker alleles and QTL/marker allelic richness were corrected for sample size differences and estimated using the rarefaction method implemented in HP-RARE v.1.2 (Kalinowski 2005). This analysis included all alleles, even the rare ones (MAF < 0.05). The frequencies of the most favorable QTL/marker alleles were estimated in GENALEX v.6.41 (Peakall and Smouse 2006) and compared between the flax core collection and the 30 Canadian cultivars across all identified stable QTL/markers. Significant differences between the allele frequencies were ascertained by the Kruskal–Wallis non-parametric test (Kruskal and Wallis 1952).

Results

Phenotypic data

All seed quality traits showed significant genotype (G), location (L) and year (Y) effects (P < 0.001; Online Resource 2), although G explained a much larger percentage of the phenotypic variation (33.3–90.6 %) than L (1.2–26.5 %) and Y (0.5–7.3 %). Most of the genotype by environment (GE) interactions (G × L, G × Y, L × Y and G × L × Y) were significant and accounted for up to 10 % of the seed quality traits variation. The location means, standard deviations, ranges, H and the correlations exhibited by the seed quality traits are summarized in Table 1. In MB, H ranged from 0.87 to 0.99, while in SK, it ranged from 0.73 to 0.98, with a lower mean (0.89) than MB (0.95), indicating that the repeatability between years was more consistent in MB than in SK. LIN and IOD were highly correlated at both locations (MB = 0.87, SK = 0.76; P < 0.001). Highly significant negative correlations were observed between the other FAs and IOD. Most of the correlations between FAs were significant and negative. OIL was positively correlated with PAL at both locations and with STE and OLE in SK, but negatively correlated with LIO and IOD in SK.

Table 1 Mean ± standard deviation, range, broad sense heritability (H) and correlation of seven seed quality traits in the flax core collection evaluated in six environments

Full size table

Linkage disequilibrium

As shown in Online Resource 3, syntenic r ² (estimated LD for the loci on the same LG) was predominant on LGs 3, 8, 12 and 14, while LGs 1 and 10 showed r ² close to background level. Blocks of LD among unlinked loci, which can produce false-positive associations, were also identified suggesting that the kinship matrix used in the MLM could be used to control false-positive LDs (Yu et al. 2006).

AM analysis

The average relative kinship between any two genotypes was 0.023, and 80 % of the pairwise kinship comparisons ranged from 0 to 0.05 (Online Resource 4). As depicted by the cumulative P–P plots (Online Resource 5), numerous spurious associations for all traits were observed with the GLM (Q). This model was characterized by an excess of small P values indicating spurious associations. On the other hand, the GLM (PCA) overcorrected the majority of the small P values with few higher P values departing at the very end of the expected distribution. The MLMs (K) and (Q + K) performed similarly for the seven seed quality traits with their observed P values deviating the most from the expected ones for OIL, PAL, STE, OLE, LIO and IOD, indicating that inclusion of the Q matrix brought little or no improvement to the AM model. Nevertheless, they displayed a better distribution of P values for LIN (Online Resource 5). The MLM (PCA + K) had the smallest deviation from the expected distribution for all seed quality traits. The three first PCAs in combination with the K matrix were sufficient to control the majority of the potential false-positive associations created by population and family structure. As a result, the P values generated by the MLM PCA + K were retained for posterior analyses.

QTL contributing to seed quality traits

AM was conducted on OIL, PAL, STE, OLE, LIO, LIN and IOD across six environments of the Canadian Prairies. The genomic distribution and number of significant markers, candidate QTL and their phenotypic contribution to seed quality traits are summarized in Fig. 1, Tables 2 and 3 and Online Resource 6. Nine QTL were detected for five seed quality traits. The QTL with the largest effects were QIod-LG8.1, QLin-LG5.2 and QOil-LG9.1 for IOD, LIN and OIL, respectively (Table 3). No QTL were detected for PAL and OLE, but marker Lu2046 on LG2 and marker Lu2555 on LG6 explained 8.4 and 3.9 % of the variation, respectively, with one allele contributing significantly to PAL and OLE as described in the next section (Fig. 2b, d). Several QTL and markers co-located within the same chromosomal regions such as those for LIO and LIN on LGs 3, 5 and 12 and LIO, LIN and IOD on LG8 (Fig. 1).

Table 2 Summary of significant markers and candidate QTL associated with seven seed quality traits in linseed identified using the MLM (PCA + K)

Full size table

Table 3 Stable candidate QTL associated with seed quality traits identified at both Manitoba (MB) and Saskatchewan (SK) locations

Full size table

Allelic effects of stable associations

Some alleles were significantly associated with positive improvements of the traits. For example, the 270 bp allele of Lu181 significantly increased OIL by an average of 1.3 % (P < 0.001) across the six environments tested (Fig. 2a). For Lu2534, the 312 bp allele had the largest effect on PAL increasing the value by ~1 % over the average of the other alleles (P < 0.001; Fig. 2b). For STE, the 356, 358 and 360 bp alleles of Lu146 had significantly larger effect than the other two alleles (Fig. 2c). An increase of 2.3 % (P < 0.001) in OLE was associated with the 217 bp allele of Lu2555 (Fig. 2d). Lu3262 explained ~8 % of the variation for LIO with the 195 bp allele increasing the trait by 0.9 % (Fig 2e). The same allele was also associated with an increase in LIN of 1.3 % (Fig. 2f). A significant positive effect of the 241 bp allele of Lu2102 increased IOD by 9.5 units (Fig. 2g) (P < 0.001).

QTL stability and QTL main effect

The AMMI analysis revealed that four of the nine candidate QTL identified for five seed quality traits were highly stable with IPCA1 values lower than ±0.5 (Table 3). Also, all but three of the candidate QTL were stable or moderately stable with ASV in the range of 0.4–1.16.

The QQE biplot displays the average environment defined by the average PC1 and PC2 scores across environments (indicated by an open blue circle) (Fig. 3). The arrow passing through the biplot origin is called the AEC abscissa and points toward increasing QTL main effect. The AEC ordinate line, perpendicular to the abscissa and also passing through the biplot origin, indicates stability/instability. Highly unstable QTL have longer projections on the AEC ordinate irrespective of their direction. The LIN-related candidate QTL/markers were highly stable because most of them landed on or very close to the AEC abscissa (Fig. 3). The intersection of the two axes defines the average QTL/marker main effect, and, as such, Lu203b, Lu2102, Lu206b, Lu566, QLinLG12.3 and Lu585B had effects below average, while Lu2746, Lu2561a, QLin-LG3.1, QLin-LG5.2, Lu373 and Lu164 had the largest main effects on LIN across environments. In general, the QTL main effects showed by the QQE biplot were in agreement with the estimated phenotypic effect (Table 3, Online Resource 6).

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

Nine QTL/markers and 16 associated markers not part of a QTL but that explained at least 1 % of the phenotypic variation were included in the analyses, totaling 25 QTL/markers, where some of them were associated with more than one trait (Table 3, Online Resource 6). 43 QTL/marker alleles were present in the 30 lines representing the Canadian cultivars (Online Resource 1) and 102 were present in the remaining 377 lines of the core collection, while the observed number of private QTL/marker alleles, which are alleles exclusively present in a group and absent in the other, was 1 and 77, respectively. After adjusting for sample size differences, the QTL/marker allelic richness was estimated at 43 and 71 in the Canadian cultivars and the core collection respectively, while the number of private QTL/marker alleles was 4 and 32, respectively. In the core collection, 65 of the observed QTL/marker alleles were rare (MAF < 0.05), whereas in the Canadian cultivars only 2 fell in this category (data not shown).

The frequencies of the favorable QTL/marker alleles, i.e., alleles associated with increased OIL and LIN, were not statistically different between the core collection and the Canadian cultivars for the seven seed quality traits (Kruskal–Wallis P = 0.437; Online Resource 7). Nevertheless, for most favorable QTL alleles, the Canadian cultivars had higher frequencies, indicating that Canadian flax breeders have been successful in pyramiding the best QTL alleles for seed quality traits. Five favorable alleles were absent in the Canadian cultivars, but were also low in frequency in the core collection (Online Resource 7).

Discussion

Linseed oil and its FA profile define to a large extent its market end use and value. Genetic progress can be accelerated once genetic diversity for the traits of interest and QTL architecture knowledge are available to breeders. In the present study, we described the application of AM using a core collection of 390 L. usitatissimum accessions for the identification of QTL underlying seed quality traits. This study establishes a framework to understand the quantitative nature of OIL and FA composition in linseed.

Phenotypic analysis

Significant GE interaction was observed for all seven seed quality traits, suggesting genotypic sensitivities to differences in environmental conditions (Online Resource 2). In linseed, OIL and FA composition are affected by temperature during plant development (Casa et al. 1999; Fofana et al. 2006). Differences in planting dates and soil moisture can also affect OIL and FA composition in oil crops (van der Merwe et al. 2013). Fofana et al. (2006) showed that warmer and drier environmental conditions resulted in approximately 5 % lower LIN compared to OLE and suggested that the fatty acid desaturase FAD2, which converts OLE into LIO, was more sensitive to environmental variations and therefore rate limiting. QTL for FA composition had already been linked to the FAD2 enzymes in flax (Cloutier et al. 2011). Our results are in line with this report where OLE was 5.7 % higher in MB than in SK but LIN was higher in SK by the same percentage. Historical meteorological data (30 year period) indicate that the MB location is warmer than the SK location, particularly during the growing season in 2010 and 2011 (Agriculture and Agri-Food Canada; http://climate.weather.gc.ca/advanceSearch/searchHistoricData_e.html).

Broad sense heritability (H) estimates were moderate to high with the phenotypic means and ranges reflecting the broad variation of the core collection and also indicating that a large proportion of the phenotypic variation was genetic. Genetic gain could be achieved through phenotypic selection; however, the correlations among seed quality traits exhibited complex relationships. The development of linseed cultivars with specific FA profiles could be better achieved through MAS for which a clear understanding of the genetic architecture of seed quality traits is needed.

AM analysis

The advantages of AM in identifying QTL for multiple traits in a single diverse population have been outlined (Gupta et al. 2005; Myles et al. 2009; Rafalski 2010). However, this approach sometimes suffers from an inflation of false positives due to population structure (Pritchard et al. 2000) and familial relatedness (Yu et al. 2006). Several linear and mixed models have been proposed to correct for the effect of both confounding factors (Pritchard et al. 2000; Price et al. 2006; Yu et al. 2006). In general, when population and family structures are present, the MLM is superior to the GLM (Myles et al. 2009), but in many cases, the best fitting model will depend on the dynamics of the association panel chosen. The K matrix can account for subtle population structure caused by familial relatedness, while the Q and PCA matrices control factors such as growth habit, market classes, geography, etc. PCA axes of variation have been shown to better adjust for allele frequency differences between subpopulations (Price et al. 2006; Ma and Amos 2012). In our previous study, one of the two major STRUCTURE sub-groups clustered more than 90 % of the fiber flax accessions, indicating that the inferred Q matrix mostly accounted for plant morphotype differences (Soto-Cerda et al. 2013) and, hence, geographic differences present in the flax core collection might not be properly interpreted by the Q matrix fitted (Price et al. 2006). For all seven seed quality traits studied herein, the PCA + K model provided the best adherence to the expected cumulative distribution of P values (Online Resource 5), being superior to the K and Q + K models. This suggests that, in the case of linseed, the PCA matrix can better correct for population stratification, which turns out to also be computationally advantageous even with thousands of markers (Price et al. 2006).

Fatty acid QTL

Seed oils are composed primarily of triacylglycerols (TAGs), which are glycerol esters of FAs (Rao et al. 2008). The primary FAs in the TAGs of oilseed crops are 16–18 carbons in length and contain 0–3 double bonds where PAL, STE, OLE, LIO and LIN predominate (Rao et al. 2008). Only three FA-related QTL have been identified to date in flax: two co-located QTL, each associated with LIO, LIN and IOD, and one affecting PAL (Cloutier et al. 2011). In the present study, we validated one of them, i.e., the co-located QLio-LG12.3 and QLin-LG12.3 (Fig. 1; Table 3) located in the block of LD on LG12 (Online Resource 3). Several markers and candidate QTL mapped close to genes involved in the FA biosynthesis pathway. Marker Lu3150 (LG3), associated with STE, mapped 5.3 cM from the acyl-CoA:diacylglycerol acyltransferase A (dgatA) gene (Fig. 1). Cloutier et al. (2011) mapped the gene using the microsatellite markers present in the upstream region of the dgat1 gene which was characterized from a bacterial artificial chromosome (BAC) clone. Highly significant associations between DGTA1-2 and OLE and OIL have been reported in maize (Chai et al. 2012). A direct role for DGAT in STE is not obvious because DGAT-A and -B exert their main control in the final steps of oil assembly and are hypothesized to be a determining factor of OIL in higher plants (Weselake 2005). The associations with STE may be caused by the LD between the dgatA gene and the putative causative gene, a causal effect which could be resolved with a higher marker density. On the other hand, some of the oil assembly enzymes have been shown to have a preference for certain FAs (Sørensen et al. 2005). Such a selective mechanism could explain their indirect influence on the FA composition because most of the FAs will be assembled in TAGs.

Marker Lu566 (LG7) associated with LIO and LIN co-localized to the same region as the fad3A gene, overlapping with the previously published QTL QLio.crc-LG7 and QLin.crc-LG7 (Cloutier et al. 2011), thus being a major candidate gene for the control of LIN. Three fad3 genes have been identified in the flax genome: fad3a and fad3b from cultivar Normandy (Vrinten et al. 2005) and more recently fad3c (Banik et al. 2011). FAD3A and FAD3B are major enzymes controlling LIN content in linseed (Vrinten et al. 2005); they were mapped in a bi-parental population (Cloutier et al. 2011) and recently integrated into the consensus map of flax (Cloutier et al. 2012b). In linseed, DGATA has an enhanced specificity for α-18:3-CoA (Sørensen et al. 2005; Rao et al. 2008); hence, higher LIN could translate to higher OIL in favorable environments such as SK where LIN was 5.7 % higher and OIL was 1.7 % higher than at the MB location (Table 1).

The genetic architecture of the traits provides some insights into the detection of more QTL for FA composition as compared to OIL. Variations in FA composition are mainly determined by a small number of major genes including fatty acid elongases and desaturases, while the number of genes potentially involved in OIL is expected to be greater and also more sensitive to environmental variations (Honsdorf et al. 2010). The marker density also likely played a role. The 460 SSRs represent less than one-third of the 1,500 estimated minimum markers necessary to tag all QTL, indicating that potentially many QTL remained undetected. Likewise, the flax morphotype, i.e., oilseed and fiber flax, could negatively impact on the number of significant associations. When alleles segregate across multiple subpopulations, MLMs are more powerful, but when they segregate in only one or a subset of the subpopulations or, when different alleles are present in the subpopulations, MLMs will fail to detect the associations entirely (Zhao et al. 2011). We cannot discard the potential effect of the fiber morphotype on seed quality traits associations because it is likely that the favorable alleles associated with these traits do not segregate homogeneously across sub-groups, or they could even be totally absent in the fiber accessions which have not been selected for these traits, consequently underpowering the AM results. AM analysis conducted separately for the fiber and oilseed accessions could provide further insights in this regard.

The phenotypic correlations between traits were consistently reflected in the identification of common markers and candidate QTL (Fig. 1) as reported in other QTL studies (Bachlava et al. 2009; Honsdorf et al. 2010; Cloutier et al. 2011; Hamdan et al. 2012; Li et al. 2012). For example, the stable QTL defined by markers Lu2102 and Lu928 on LG8 (Fig. 1) was associated not only with IOD, but also with LIN which were positively correlated. Another candidate QTL between markers Lu206b and Lu765Bb on LG12 (Fig. 1), associated with both LIO and LIN, overlapped with the previously reported QTL QLio.crc-LG16 and QLin.crc-LG16 having significant negative correlations (Cloutier et al. 2011). Negative correlation between LIO and LIN has been observed in Brassica napus (Honsdorf et al. 2010) and common QTL affecting several FAs have also been reported in soybean (Bachlava et al. 2009; Xie et al. 2012) and safflower (Hamdan et al. 2012).

Marker/QTL effects and QTL stability

To maximize the initial impact of MAS in crops with a lack of molecular tools, such as linseed, the associated markers should be closely linked to the QTL and the mapped QTL should ideally have large effect and high stability. For example, the two QTL associated with LIO and LIN reported by Cloutier et al. (2011) were located in a confidence interval of 11.6 cM. In our study, we narrowed down those QTL to 3.2 cM and showed their high stability and high LD (Table 3). Improvement in linkage tightness translates into recombination probability reduction, thus creating better markers for MAS. Nevertheless, because large effect and highly stable QTL will be first fixed in breeding programs, large effect and environment-specific QTL should also be targeted by breeders. For example, QOil-LG9.1increased OIL by 1.3 % but exhibited higher instability than the other QTL (Table 3). Although our statistical threshold for linked LD was 0.1 which could be considered weak for effective MAS, seven of the identified candidate QTL showed moderate to high LD in the range of 0.22–0.93. However, the phenotypic variation explained by the same QTL differed between studies. In Cloutier et al. (2011), the QTL associated with LIO and LIN explained 20 % each of the variation, higher than the 13.6 and 6.1 % reported in the present study. Many AM studies in humans have reported low R ² values, labeling the remaining unexplained variation as the missing heritability (Myles et al. 2009). In Brassica napus, 57 significant markers explained up to 18 % of the phenotypic variation for OIL (Zou et al. 2010), while in maize 26 loci explained up to 83 % (Li et al. 2013). There are several reasons for this. First, insufficient marker coverage where the causal polymorphism is not in perfect LD with the genotyped markers affects the detection power of AM leaving unexplained a higher percentage of the variation (Myles et al. 2009). Second, rare alleles with large effects remained undetected because they were excluded for statistical reasons (Breseghello and Sorrells 2006; Rafalski 2010). Third, traits controlled by a large number of genes/QTL, each with small individual effects, may escape statistical detection (Manolio et al. 2009). Fourth, variation resulting from epistatic interactions between genes might also go undiscovered because epistasis can only be investigated practically in a sequential scan of major common loci (Storey et al. 2005). Finally, epigenetic variations are emerging as a major cause of the missing heritability (Rakyan et al. 2011). Epigenome-wide association studies are likely going to shed some light on the specific epigenetic mechanisms at play in phenotypic variation (Rakyan et al. 2011), and most interestingly their environmental and trans-generational stabilities. Bi-parental mapping has the power to detect the effects of rare alleles (Gupta et al. 2005). As such, high R ² values reported by Cloutier et al. (2011) using a bi-parental cross of high LIN with low LIN, providing an extreme range of FA profiles, likely correspond to the mutant parental line major fatty acid desaturase rare alleles of large effect, while in AM the smaller R ² values could correspond to common variants of small effects from the same locus. Allele frequency differences for the same underlying locus between bi-parental populations and AM panels affect the explained phenotypic variation (Stich et al. 2008). The maximum proportion of the variance explained by a marker is observed for allele frequencies of 0.5, as expected in bi-parental populations such as recombinant inbred lines or F₁-derived doubled haploids. For an AM panel, the allele frequencies are expected to be considerably different from 0.5, especially when multi-allelic markers such as SSRs are used (Stich et al. 2008). As a consequence, the proportion of the variance explained by a marker is notably lower despite the same underlying allelic effect (Stich et al. 2008). In our study, the majority of the associated markers and candidate QTL explained <5 % of the phenotypic variation. Nevertheless, some candidate QTL explained up to 19 % of the phenotypic variation, and major QTL for OIL (8 %), STE (19.6 %), LIO (6.6 %) and LIN (9.3 %) were stable, making them suitable for MAS (Table 3; Fig. 3).

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

Several reports indicate that Canadian linseed cultivars have been developed from a narrow genetic base (Fu et al. 2002, 2003; Cloutier et al. 2009) which is an impediment to further breeding progress. In the present study, the flax core collection showed abundant QTL allelic diversity with approximately eight times more unique (private) alleles than the Canadian cultivar subgroup. However, the majority of these novel QTL alleles were rare, limiting their exploitation in AM, hence requiring different strategies for their efficient utilization. Among these potential strategies, optimal bi-parental mapping populations could be designed using the comprehensive phenotypic and genetic characterization of the flax core collection. In addition, the joint use of linkage mapping and association models through the design of multiparent advanced generation intercross (MAGIC) or nested association mapping (NAM) populations can overcome the population structure issue (Rafalski 2010). These populations are advantageous from the point of view of increasing the frequency of rare alleles and balancing the overall allele frequencies, although the strong kinship relationships could be an impediment. However, the high kinship relationships among genotypes could be mitigated by MLM and exploited through genomic selection, a strategy complementary to AM which uses genome-wide marker information to model phenotypic traits and obtain estimated breeding values (Meuwissen et al. 2001).

Final remarks

The current study represents the first AM analysis in linseed. We identified nine consistent QTL across six environments for seed quality traits and several stable markers providing a basis for further AM and fine mapping efforts aiming to understand the genetic architecture of seed quality traits in linseed. Although this study was somewhat limited with respect to marker density, novel QTL were mapped and several previously reported were validated. To realize the full potential of AM and of the flax core collection, whole genome re-sequencing of the entire core collection is under way to saturate the genetic map with hundreds of thousands of single nucleotide polymorphism markers. Validation of candidate QTL in bi-parental populations will guide the development of marketable linseed cultivars using MAS.

References

Bachlava E, Dewey RE, Burton JW, Cardinal AJ (2009) Mapping candidate genes for oleate biosynthesis and their association with unsaturated fatty acid seed content in soybean. Mol Breed 23:337–347
Article CAS Google Scholar
Banik M, Duguid S, Cloutier S (2011) Transcript profiling and gene characterization of three fatty acid desaturase genes in high, moderate, and low linolenic acid genotypes of flax (Linum usitatissimum L.) and their role in linolenic acid accumulation. Genome 54:471–483
Article PubMed CAS Google Scholar
Benjamini Y, Hochberg Y (1995) Controlling the false discovery rate: a practical and powerful approach to multiple testing. J R Statist Soc B 57:289–300
Google Scholar
Bradbury PJ, Zhang Z, Kroon DE, Casstevens TM, Ramdoss Y, Buckler ES (2007) TASSEL: software for association mapping of complex traits in diverse samples. Bioinformatics 23:2633–2635
Article PubMed CAS Google Scholar
Breseghello F, Sorrells M (2006) Association mapping of kernel size and milling quality in wheat (Triticum aestivum L.) cultivars. Genetics 172:1165–1177
Article PubMed Central PubMed Google Scholar
Casa R, Russell G, Lo Cascio B, Rossini F (1999) Environmental effects on linseed (Linum usitatissimum L.) yield and growth of flax at different stand densities. Eur J Agron 11:267–278
Article Google Scholar
Chai Y, Hao X, Yang X, Allen WB, Li J, Yan J, Shen B, Li J (2012) Validation of DGAT1-2 polymorphisms associated with oil content and development of functional markers for molecular breeding of high-oil maize. Mol Breed 29:939–949
Article CAS Google Scholar
Chung J, Babka HL, Graef GL, Staswick PE, Lee DJ, Cregan PB, Shoemaker RC, Specht JE (2003) The seed protein, oil, and yield QTL on soybean linkage group I. Crop Sci 43:1053–1067
Article CAS Google Scholar
Cloutier S, Niu Z, Datla R, Duguid S (2009) Development and analysis of EST-SSRs for flax (Linum usitatissimum L.). Theor Appl Genet 119:53–63
Article PubMed CAS Google Scholar
Cloutier S, Ragupathy R, Niu Z, Duguid S (2011) SSR-based linkage map of flax (Linum usitatissimum L.) and mapping of QTLs underlying fatty acid composition traits. Mol Breed 28:437–451
Article CAS Google Scholar
Cloutier S, Miranda E, Ward K, Radovanovic N, Reimer E, Walichnowski A, Datla R, Rowland G, Duguid S, Ragupathy R (2012a) Simple sequence repeat marker development from bacterial artificial chromosome end sequences and expressed sequence tags of flax (Linum usitatissimum L.). Theor Appl Genet 125:685–694
Article PubMed Central PubMed CAS Google Scholar
Cloutier S, Ragupathy R, Miranda E, Radovanovic N, Reimer E, Walichnowski A, Ward K, Rowland G, Duguid S, Banik M (2012b) Integrated consensus genetic and physical maps of flax (Linum usitatissimum L.). Theor Appl Genet 125:1783–1795
Article PubMed Central PubMed Google Scholar
Cook JP, McMullen MD, Holland JB, Tian F, Bradbury P, Ross-Ibarra J, Buckler ES, Flint-Garcia SA (2012) Genetic architecture of maize kernel composition in the nested association mapping and inbred association panels. Plant Physiol 158:824–834
Article PubMed Central PubMed CAS Google Scholar
Cullis CA (2007) Flax. In: Kole C (ed) Genome mapping and molecular breeding in plants, vol 2. Springer, Berlin, pp 275–295
Google Scholar
Deng X, Long S, He D, Li X, Wang Y, Liu J, Chen H (2010) Development and characterization of polymorphic microsatellite markers in Linum usitatissimum. J Plant Res 123:119–123
Article PubMed CAS Google Scholar
Deng X, Long S, He D, Li X, Wang Y, Hao D, Qiu C, Chen X (2011) Isolation and characterization of polymorphic microsatellite markers from flax (Linum usitatissimum L.). Afr J Biotechnol 10:734–739
CAS Google Scholar
Diederichsen A, Kusters PM, Kessler D, Bainas Z, Gugel RK (2013) Assembling a core collection from the flax world collection maintained by Plant Gene Resources of Canada. Genet Resour Crop Evol 60:1479–1485
Article Google Scholar
Duguid SD (2009) Flax. In: Vollmann J, Rajcan I (eds) Oil crops, handbook of plant breeding 4. Springer, New York, pp 233–255
Google Scholar
Ersoz ES, Yu J, Buckler ES (2009) Applications of linkage disequilibrium and association mapping in maize. In: Kriz A, Larkins B (eds) Molecular genetic approaches to maize improvement. Springer, Berlin, pp 173–195
Chapter Google Scholar
FAOSTAT (2013) Production of crops: linseed: area harvested and production (tonnes). Available at http://faostat3.fao.org/home/index.html. Accessed March 2013
Fofana B, Duguid S, Cloutier S (2004) Cloning of fatty acid biosynthetic genes β-ketoacyl CoA synthase, fatty acid elongase, stearoyl-ACP desaturase and fatty acid desaturase and analysis of expression in the early developmental stages of flax (Linum usitatissimum L.) seeds. Plant Sci 166:1487–1496
Article CAS Google Scholar
Fofana B, Cloutier S, Duguid S, Ching J, Rampitsch C (2006) Gene expression of stearoyl-ACP desaturase and Δ12 fatty acid desaturase 2 is modulated during seed development of flax (Linum usitatissimum). Lipids 41:705–720
Article PubMed CAS Google Scholar
Friedt W, Bickert C, Schaub H (1995) In vitro breeding of highlinolenic, doubled haploid lines of linseed (Linum usitatissimum L.) via androgenesis. Plant Breed 114:322–326
Article Google Scholar
Fu YB, Diederichsen A, Richards KW, Peterson G (2002) Genetic diversity within a range of cultivars and landraces of flax (Linum usitatissimum L) as revealed by RAPDs. Genet Resour Crop Evol 49:167–174
Article Google Scholar
Fu YB, Rowland GG, Duguid SD, Richards K (2003) RAPD analysis of 54 North American flax cultivars. Crop Sci 43:1510–1515
Article Google Scholar
Gauch HG (1992) AMMI analysis of yield trials. In: Kang MS, Gauch HG (eds) Genotype-by-environment interaction. CRC Press, Boca Raton, pp 1–40
Google Scholar
Goldman IL, Rocheford TR, Dudley JW (1994) Molecular markers associated with maize kernel oil concentration in an Illinois high protein × Illinois low protein cross. Crop Sci 34:908–915
Article Google Scholar
Green AG (1986) A mutant genotype of flax (Linum usitatissimum L.) containing very low levels of linolenic acid in its seed oil. Can J Plant Sci 66:499–503
Article CAS Google Scholar
Green AG, Marshall AR (1984) Isolation of induced mutants in linseed (Linum usitatissimum L.) having reduced linolenic acid content. Euphytica 33:321–328
Article CAS Google Scholar
Green AG, Chen Y, Singh SP, Dribnenki JCP (2008) Flax. In: Kole C, Hall TC (eds) Compendium of transgenic crop plants: transgenic oilseed crops. Blackwell Publishing Ltd, Oxford, pp 199–226
Google Scholar
Gupta PK, Rustgi S, Kulwal PL (2005) Linkage disequilibrium and association studies in higher plants: present status and future prospects. Plant Mol Biol 57:461–485
Article PubMed CAS Google Scholar
Hamdan YAS, Garcia-Moreno MJ, Fernandez-Martinez JM, Velasco L, Perez-Vich B (2012) Mapping of major and modifying genes for high oleic acid content in safflower. Mol Breed 30:1279–1293
Article CAS Google Scholar
Hardy OJ, Vekemans X (2002) SPAGeDi: a versatile computer program to analyse spatial genetic structure at the individual or population levels. Mol Ecol Notes 2:618–620
Article CAS Google Scholar
Harris DR (1997) The spread of neolithic agriculture from the Levant to western-central Asia. In: Damania AB, Valkoun J, Willcox G, Qualset CO (eds) The origin of agriculture and crop domestication. Proceedings of Harlan Symposium, ICARDA, Aleppo, Syria, 10–14 May, pp 65–82
Honsdorf N, Becker HC, Ecke W (2010) Association mapping for phenological, morphological, and quality traits in canola quality winter rapeseed (Brassica napus L.). Genome 53:899–907
Article PubMed CAS Google Scholar
Hu X, Sullivan-Gilbert M, Gupta M, Thompson SA (2006) Mapping of the loci controlling oleic and linolenic acid contents and development of fad2 and fad3 allele-specific markers in canola (Brassica napus L.). Theor Appl Genet 113:497–507
Article PubMed CAS Google Scholar
Kalinowski ST (2005) HP-RARE 1.0: a computer program for performing rarefaction on measures of allelic richness. Mol Ecol Notes 2005(5):187–189
Article CAS Google Scholar
Kenaschuk EO (2005) High linolenic acid flax. US patent 6870077 issued on March 22, 2005
Khadake RM, Ranjekar PK, Harsulkar AM (2009) Cloning of a novel omega-6 desaturase from flax (Linum usitatissimum) and its functional analysis in Saccharomyces cerevisiae. Mol Biotechnol 42:168–174
Article PubMed CAS Google Scholar
Kruskal WH, Wallis WA (1952) Use of ranks in one-criterion variance analysis. J Am Stat Assoc 47:583–621
Article Google Scholar
Kumar S, You FM, Cloutier S (2012) Genome wide SNP discovery in flax through next generation sequencing of reduced representation libraries. BMC Genom 13:684
Article CAS Google Scholar
Li Y, Smulders MJM, Chang R, Qiu L (2011) Genetic diversity and association mapping in a collection of selected Chinese soybean accessions based on SSR marker analysis. Conserv Genet 12:1145–1157
Article Google Scholar
Li X, Yan W, Agrama H, Jia L, Jackson A, Moldenhauer K, Yeater K, McClung A, Wu D (2012) Unraveling the complex trait of harvest index with association mapping in rice (Oryza sativa L.). PLoS One 7:e29350
Article PubMed Central PubMed CAS Google Scholar
Li H, Peng Z, Yang X, Wang W, Fu J, Wang J, Han Y, Chai Y, Guo T, Yang N, Liu J, Warburton ML, Cheng Y, Hao X, Zhang P, Zhao J, Liu Y, Wang G, Li J, Yan J (2013) Genome-wide association study dissects the genetic architecture of oil biosynthesis in maize kernels. Nat Genet 45:43–50
Article PubMed CAS Google Scholar
Lin CS, Poushinsky G (1985) A modified augmented design (type 2) for rectangular plots. Can J Plant Sci 65:743–749
Article Google Scholar
Ma J, Amos C (2012) Principal components analysis of population admixture. PLoS One 7:e40115
Article PubMed Central PubMed CAS Google Scholar
Manolio TA, Collins FS, Cox NJ, Goldstein DB, Hindorff LA, Hunter DJ, McCarthy MI, Ramos EM, Cardon LR, Chakravarti A, Cho JH, Guttmacher AE, Kong A, Kruglyak L, Mardis E, Rotimi CN, Slatkin M, Valle D, Whittemore AS, Boehnke M, Clark AG, Eichler EE, Gibson G, Haines JL, Mackay TF, McCarroll SA, Visscher PM (2009) Finding the missing heritability of complex diseases. Nature 461:747–753
Article PubMed Central PubMed CAS Google Scholar
Meuwissen TH, Hayes BJ, Goddard ME (2001) Prediction of total genetic value using genome-wide dense marker maps. Genetics 157:1819–1829
PubMed Central PubMed CAS Google Scholar
Myles S, Peiffer J, Brown PJ, Ersoz ES, Zhang Z, Costich DE, Buckler ES (2009) Association mapping: critical considerations shift from genotyping to experimental design. Plant Cell 21:2194–2202
Google Scholar
Peakall R, Smouse PE (2006) GENALEX 6: genetic analysis in excel. Population genetic software for teaching and research. Mol Ecol Notes 6:288–295
Article Google Scholar
Price AL, Patterson NJ, Plenge RM, Weinblatt ME, Shadick NA, Reich D (2006) Principal components analysis corrects for stratification in genome-wide association studies. Nat Genet 38:904–909
Article PubMed CAS Google Scholar
Pritchard JK, Stephens M, Rosenberg NA, Donnelly P (2000) Association mapping in structured populations. Am J Hum Genet 67:170–181
Article PubMed Central PubMed CAS Google Scholar
Purchase JL (1997) Parametric analysis to describe G × E interaction and yield stability in winter wheat. PhD thesis. Department of Agronomy, Faculty of Agriculture, University of the Orange Free State, Bloemfontein, South Africa
Qi Z, Wu Q, Han X, Sun Y, Du X, Liu C, Jiang H, Hu G, Chen Q (2011) Soybean oil content QTL mapping and integrating with meta-analysis method for mining genes. Euphytica 179:499–514
Article Google Scholar
Qiu D, Morgan C, Shi J, Long Y, Liu J, Li R, Zhuang X, Wang Y, Tan X, Dietrich E, Weihmann T, Everett C, Vanstraelen S, Beckett P, Fraser F, Trick M, Barnes S, Wilmer J, Schmidt R, Li J, Li D, Meng J, Bancroft I (2006) A comparative linkage map of oilseed rape and its use for QTL analysis of seed oil and erucic acid content. Theor Appl Genet 114:67–80
Article PubMed CAS Google Scholar
Rafalski JA (2010) Association genetics in crop improvement. Curr Opin Plant Biol 13:174–180
Article PubMed CAS Google Scholar
Ragupathy R, Rathinavelu R, Cloutier S (2011) Physical mapping and BAC-end sequence analysis provide initial insights into the flax (Linum usitatissimum L.) genome. BMC Genom 12:217
Article CAS Google Scholar
Rakyan VK, Down TA, Balding DJ, Beck S (2011) Epigenome-wide association studies for common human diseases. Nat Rev Genet 12:529–541
Article PubMed Central PubMed CAS Google Scholar
Rao S, Abdel-Reheem M, Bhella R, McCraken C, Hildebrand D (2008) Characteristics of high α-linolenic acid accumulation in seed oils. Lipids 43:749–755
Article PubMed CAS Google Scholar
Roose-Amsaleg C, Cariou-Pham E, Vautrin D, Tavernier R, Solignac M (2006) Polymorphic microsatellite loci in Linum usitatissimum. Mol Ecol Notes 6:796–799
Article CAS Google Scholar
Rowland GG (1991) An EMS-induced low linolenic acid mutant in McGregor flax (Linum usitatissimum L.). Can J Plant Sci 71:393–396
Article CAS Google Scholar
Rowland GG, Bhatty RS (1990) Ethyl methanesulfonate induced fatty acid mutations in flax. J Am Oil Chem Soc 67:213–214
Article CAS Google Scholar
SAS Institute (2004) SAS Version 9.1. SAS Institute, Cary
Shapiro SS, Wilk MB (1965) An analysis of variance test for normality (complete samples). Biometrika 52:591–611
Article Google Scholar
Simopoulos AP (2000) Human requirement for N-3 polyunsaturated fatty acids. Poult Sci 79:961–970
Article PubMed CAS Google Scholar
Smooker AM, Wells R, Morgan C, Beaudoin F, Cho K, Fraser F, Bancroft I (2011) The identification and mapping of candidate genes and QTL involved in the fatty acid desaturation pathway in Brassica napus. Theor Appl Genet 122:1075–1090
Article PubMed CAS Google Scholar
Sørensen BM, Furukawa-Stoffer TL, Marshall KS, Page EK, Mir Z, Forster RJ, Weselake RJ (2005) Storage lipid accumulation and acyltransferase action in developing flaxseed. Lipids 40:1043–1049
Article PubMed Google Scholar
Soto-Cerda BJ, Carrasco RJ, Aravena GA, Urbina HA, Navarro CS (2011a) Identifying novel polymorphic microsatellites from cultivated flax (Linum usitatissimum L.) following data mining. Plant Mol Biol Rep 29:753–759
Article Google Scholar
Soto-Cerda BJ, Urbina Saavedra H, Navarro Navarro C, Mora Ortega P (2011b) Characterization of novel genic SSR markers in Linum usitatissimum (L.) and their transferability across eleven Linum species. Electron J Biotechnol. doi:10.2225/vol14-issue2-fulltext-6
Google Scholar
Soto-Cerda BJ, Diederichsen A, Ragupathy R, Cloutier S (2013) Genetic characterization of a core collection of flax (Linum usitatissimum L.) suitable for association mapping studies and evidence of divergent selection between fiber and linseed types. BMC Plant Biol 13:78
Article PubMed Central PubMed CAS Google Scholar
Stich B, Piepho HP, Schulz B, Melchinger AE (2008) Multi-traits association mapping in sugar beet (Beta vulgaris L.). Theor Appl Genet 117:947–954
Article PubMed Google Scholar
Storey JD, Tibshirani R (2003) Statistical significance for genomewide studies. Proc Natl Acad Sci USA 100:9440–9445
Article PubMed Central PubMed CAS Google Scholar
Storey JD, Akey JM, Kruglyak L (2005) Multiple locus linkage analysis of genomewide expression in yeast. PLoS Biol 3:e267
Article PubMed Central PubMed CAS Google Scholar
van Berloo R (2008) GGT 2.0: Versatile software for visualization and analysis of genetic data. J Hered 99:232–236
Article PubMed CAS Google Scholar
van der Merwe R, Labuschagne MT, Herselman L, Hugo A (2013) Stability of seed oil quality traits in high and mid-oleic acid sunflower hybrids. Euphytica 193:157–168
Article CAS Google Scholar
Vollmann J, Rajcan I (2009) Oil crops breeding and genetics. In: Vollmann J, Rajcan I (eds) Oil crops, handbook of plant breeding 4. Springer, Berlin, pp 1–30
Google Scholar
Vrinten P, Hu Z, Munchinsky MA, Rowland G, Qiu X (2005) Two FAD3 desaturase genes control the level of linolenic acid in flax seed. Plant Physiol 139:79–87
Article PubMed Central PubMed CAS Google Scholar
VSN International (2011). GenStat for Windows 14th Edition. VSN International, Hemel Hempstead, UK. http://www.GenStat.co.uk
Wang ML, Sukumaran S, Barkley NA, Chen Z, Chen CY, Guo B, Pittman RN, Stalker HT, Holbrook CC, Pederson GA, Yu J (2011) Population structure and marker-trait association analysis of the US peanut (Arachis hypogaea L.) mini-core collection. Theor Appl Genet 123:1307–1317
Article PubMed Google Scholar
Wang Z, Hobson N, Galindo L, Zhu S, Shi D, McDill J, Yang L, Hawkins S, Neutelings G, Datla R, Lambert G, Galbraith DW, Grassa CJ, Geraldes A, Cronk QC, Cullis C, Dash PK, Kumar PA, Cloutier S, Sharpe AG, Wong GK, Wang J, Deyholos MK (2012) The genome of flax (Linum usitatissimum) assembled de novo from short shotgun sequence reads. Plant J 72:461–473
Google Scholar
Wassom JJ, Wong JC, Martinez E, King JJ, DeBaene J, Hotchkiss JR, Mikkilineni V, Bohn MO, Rocheford TR (2008) QTL associated with maize kernel oil, protein, and starch concentrations; kernel mass; and grain yield in Illinois high oil × B73 backcross-derived lines. Crop Sci 48:243–252
Article Google Scholar
Weselake RJ (2005) Storage lipids. In: Murphy DJ (ed) Plant lipids: biology, utilization and manipulation. Blackwell Publishing, Oxford, pp 162–225
Google Scholar
Westcott ND, Muir ND (2003) Chemical studies on the constituents of Linum sp. In: Muir AD, Westcott ND (eds) Flax, the genus Linum. Taylor and Francis, New York, pp 55–73
Google Scholar
Wilson RF (2012) The role of genomics and biotechnology in achieving global food security for high-oleic vegetable oil. J Oleo Sci 61:357–367
Article PubMed CAS Google Scholar
Xie D, Han Y, Zeng Y, Chang W, Teng W, Li W (2012) SSR- and SNP-related QTL underlying linolenic acid and other fatty acid contents in soybean seeds across multiple environments. Mol Breed 30:169–179
Article CAS Google Scholar
Yan W, Tinker NA (2005) A biplot approach for investigating QTL-by-environment patterns. Mol Breed 15:31–43
Article Google Scholar
Yang X, Yan J, Shah T, Warburton ML, Li Q, Li L, Gao Y, Chai Y, Fu Z, Zhou Y, Xu S, Bai G, Meng Y, Zheng Y, Li J (2010) Genetic analysis and characterization of a new maize association mapping panel for quantitative trait loci dissection. Theor Appl Genet 121:417–431
Article PubMed Google Scholar
You FM, Duguid SD, Thambugala D, Cloutier S (2013) Statistical analysis and field evaluation of the type 2 modified augmented design in phenotyping of flax germplasms in multiple environments. Australia J Crop Sci 7:1789–1800
Google Scholar
Yu J, Pressoir G, Briggs W, Vroh Bi I, Yamasaki M, Doebley J, McMullen M, Gaut B, Nielsen D, Holland J, Kresovich S, Buckler E (2006) A unified mixed-model method for association mapping that accounts for multiple levels of relatedness. Nat Genet 38:203–208
Article PubMed CAS Google Scholar
Zhao J, Becker HC, Zhang D, Zhang X, Ecke W (2005) Oil content in a European × Chinese rapeseed population: QTL with additive and epistatic effects and their genotype–environment interactions. Crop Sci 45:51–59
Article CAS Google Scholar
Zhao K, Tung CW, Eizenga GC, Wright MH, Ali ML, Price AH, Norton GJ, Islam MR, Reynolds A, Mezey J, McClung AM, Bustamante CD, McCouch SR (2011) Genome-wide association mapping reveals a rich genetic architecture of complex traits in Oriza sativa. Nat Commun 2:467
Article PubMed Central PubMed CAS Google Scholar
Zobel RW, Wright MG, Gauch HG (1988) Statistical analysis of yield trial. Agron J 80:388–393
Article Google Scholar
Zou J, Jiang C, Cao Z, Li R, Long Y, Chen S, Meng J (2010) Association mapping of seed oil content in Brassica napus and comparison with quantitative trait loci identified from linkage mapping. Genome 53:908–916
Article PubMed CAS Google Scholar

Download references

Acknowledgments

The authors are grateful to Evelyn Loewen, Andrzej Walichnowski, Elsa Reimer, Natasa Radovanovic, Evelyn Miranda and the breeding teams at the Morden Research Station and the Crop Development Centre for technical assistance. This work was conducted as part of the Total Utilization Flax Genomics (TUFGEN) project funded by Genome Canada and co-funded by the Government of Manitoba, the Flax Council of Canada, the Saskatchewan Flax Development Commission, Agricultural Development Fund and the Manitoba Flax Growers Association. Project management and support by Genome Prairie are also gratefully acknowledged. Braulio J. Soto-Cerda was supported by Becas Chile—Comisión Nacional de Investigación Científica y Tecnológica (CONICYT).

Conflict of interest

The authors declare that they have no conflict of interest.

Ethical standards

The authors declare that the work in this manuscript was carried out in accordance with the current laws and regulations in Canada. The work is original except where indicated by special references in the text. This manuscript has not been submitted for publication elsewhere. Any views expressed in the manuscript are those of the authors.

Author information

Authors and Affiliations

Department of Plant Science, University of Manitoba, 66 Dafoe Road, Winnipeg, MB, R3T 2N2, Canada
Braulio J. Soto-Cerda & Sylvie Cloutier
Cereal Research Centre, Agriculture and Agri-Food Canada, 195 Dafoe Rd, Winnipeg, MB, R3T 2M9, Canada
Braulio J. Soto-Cerda & Sylvie Cloutier
Morden Research Station, Agriculture and Agri-Food Canada, Route 100, Morden, MB, R6M 1Y5, Canada
Scott Duguid
Crop Development Centre, College of Agriculture and Bioresources, University of Saskatchewan, 51 Campus Drive, Saskatoon, SK, S7N 5A8, Canada
Helen Booker & Gordon Rowland
Plant Gene Resources of Canada, Agriculture and Agri-Food Canada, 107 Science Place, Saskatoon, SK, S7N 0X2, Canada
Axel Diederichsen
Genomics and Bioinformatics Unit, Agriaquaculture Nutritional Genomic Center (CGNA), Km 10 Camino Cajón-Vilcún, Temuco, La Araucania, Chile
Braulio J. Soto-Cerda

Authors

Braulio J. Soto-Cerda
View author publications
You can also search for this author in PubMed Google Scholar
Scott Duguid
View author publications
You can also search for this author in PubMed Google Scholar
Helen Booker
View author publications
You can also search for this author in PubMed Google Scholar
Gordon Rowland
View author publications
You can also search for this author in PubMed Google Scholar
Axel Diederichsen
View author publications
You can also search for this author in PubMed Google Scholar
Sylvie Cloutier
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Sylvie Cloutier.

Additional information

Communicated by B. Hulke.

Electronic supplementary material

Below is the link to the electronic supplementary material.

Supplementary material 1 (DOCX 1394 kb)

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Reprints and permissions

About this article

Cite this article

Soto-Cerda, B.J., Duguid, S., Booker, H. et al. Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection. Theor Appl Genet 127, 881–896 (2014). https://doi.org/10.1007/s00122-014-2264-4

Download citation

Received: 25 June 2013
Accepted: 03 January 2014
Published: 26 January 2014
Issue Date: April 2014
DOI: https://doi.org/10.1007/s00122-014-2264-4

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Association mapping of seed quality traits using the Canadian flax (Linum usitatissimum L.) core collection

Abstract

Key message

Abstract

Similar content being viewed by others

Quantitative genomics-enabled selection for simultaneous improvement of lint yield and seed traits in cotton (Gossypium hirsutum L.)

Association mapping of seed quality traits in Brassica napus L. using GWAS and candidate QTL approaches

Analysis of QTL for seed oil content in Brassica napus by association mapping and QTL mapping

Introduction

Materials and methods

Plant material, genotyping and field experiments

Oil content and FA composition analyses

Statistical analysis

Linkage disequilibrium

Association mapping

QTL/marker effects and stability

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

Results

Phenotypic data

Linkage disequilibrium

AM analysis

QTL contributing to seed quality traits

Allelic effects of stable associations

QTL stability and QTL main effect

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

Discussion

Phenotypic analysis

AM analysis

Fatty acid QTL

Marker/QTL effects and QTL stability

Frequency of QTL/marker allele in the flax core collection and Canadian cultivars

Final remarks

References

Acknowledgments

Conflict of interest

Ethical standards

Author information

Authors and Affiliations

Corresponding author

Additional information

Electronic supplementary material

Supplementary material 1 (DOCX 1394 kb)

Rights and permissions

About this article

Cite this article

Share this article

Keywords

Search

Navigation