Introduction

Wheat was domesticated in the Fertile Crescent of the Middle East and today is a major source of sustenance for humans providing about 20 % of the caloric intake. The two primary cultivated forms of modern wheat are common (bread) wheat (Triticum aestivum L., 2n = 6x = 42, AABBDD genomes), and durum (macaroni) wheat (T. turgidum ssp. durum L., 2n = 4x = 28, AABB genomes). Common wheat, which is hexaploid, accounts for about 95 % of the global wheat crop and is used primarily to make breads, cookies, cakes, crackers, noodles, etc., and durum wheat, which accounts for about 5 % of the global wheat crop, is used to make pasta and other semolina-based products.

Durum and common wheat arose from the same evolutionary lineage. About a half million years ago (Huang et al. 2002; Chalupska et al. 2008), a hybridization event between the wild diploid wheat T. urartu Tumanian ex Gandylian (2n = 2x = 14; AA genome) as the donor of the A genome (Dvorak et al. 1993) and a close relative of the diploid goatgrass Aegilops speltoides Tausch (2n = 2x = 14, SS genome) as the donor of the B genome (Dvorak and Zhang 1990; Blake et al. 1999; Huang et al. 2002; Chalupska et al. 2008; Salse et al. 2008) resulted in the formation of tetraploid wild emmer wheat T. turgidum ssp. dicoccoides (Körn.) Thell (2n = 4x = 28, AABB genomes).

Wild emmer had a very brittle rachis, which provided a natural mechanism of seed dispersal because the spikelets would disarticulate and fall to the ground upon maturity. However, about 10,000 years ago, forms of emmer wheat with non-brittle rachises emerged in the Fertile Crescent (Nesbitt and Samuel 1996) due to genetic mutations in the brittle rachis (Br) genes on chromosomes 2A (Peng et al. 2003; Peleg et al. 2011; Thanh et al. 2013), 3A and 3B (Watanabe and Ikebata 2000; Nalam et al. 2006; Li and Gill 2006; Thanh et al. 2013). This non-shattering form is referred to as cultivated emmer wheat (T. turgidum ssp. dicoccum L., 2n = 4x = 28, AABB genomes), and was instrumental in spawning the Agricultural Revolution of the Neolithic Times due to the fact that the non-brittle rachis trait allowed early farmers to more efficiently harvest the grain (Tzarfati et al. 2013).

Although cultivated emmer had a non-brittle rachis, the rachis was still somewhat fragile and the seed hulled (non-free-threshing). Mutations in genes governing seed threshability (see below) led to the formation of free-threshing T. turgidum subspecies, such as durum wheat, with fully tough rachises. About 8,000 years ago (Huang et al. 2002), T. turgidum (most likely a free-threshing form) hybridized with the diploid A. tauschii Coss. (2n = 2x = 14, DD genome) leading to the formation of hexaploid common wheat (Kihara 1944; McFadden and Sears 1946).

Two major genes prohibit the free-threshing trait in wheat: the q locus on chromosome arm 5AL and the homoeologous tenacious glume (Tg) gene set on the short arms of homoeologous group 2 chromosomes. A mutation in the q allele on 5AL about 10,000 years ago resulted in the formation of the hyperfunctionalized Q allele (Simons et al. 2006; Zhang et al. 2011), which confers the free-threshing character and also influences a repertoire of other domestication-related traits including rachis fragility, glume tenacity, plant height, and heading time (Watkins 1940; Sears 1956; Muramatsu 1963, 1979, 1985, 1986; Kato et al. 1999, 2003; Faris and Gill 2002; Faris et al. 2003, 2005; Simons et al. 2006; Zhang et al. 2011). The molecular cloning of the Q gene revealed that it is a member of the AP2 class of transcription factors (Faris et al. 2003; Simons et al. 2006). For reasons argued in other articles (Matsuoka 2011; Dvorak et al. 2012; Faris 2014), the mutation from q to Q most likely happened in a tetraploid T. turgidum subspecies, which was then involved in the amphiploidization event with A. tauschii that resulted in the formation of hexaploid wheat thereby transferring the Q allele from tetraploid to hexaploid wheat (Simons et al. 2006).

Although the first hexaploid wheat probably had Q, it would have been non-free-threshing due to acquisition of the Tg-D1 gene from A. tauschii, which confers very tenacious glumes and is epistatic to Q (Kerber and Rowland 1974). This trait was first observed by Kerber and Dyck (1969) and later mapped by Kerber and Rowland (1974) to the short arm of chromosome 2D using cytogenetic stocks. More recent mapping experiments have confirmed the location of Tg-D1 on 2DS relative to molecular markers (Jantasuriyarat et al. 2004; Nalam et al. 2007; Sood et al. 2009; Dvorak et al. 2012).

Simonetti et al. (1999) evaluated a wild emmer (T. turgidum ssp. dicoccoides) × durum (T. turgidum ssp. durum) population for QTL associated with threshability and identified a major QTL on 5AL corresponding to the Q locus as well as a QTL on 2BS, which was suggested to be a homoeoallele of Tg-D1 on 2DS. Faris et al. (2014a) recently demonstrated that the gene conferring tenacious glumes on 2BS was indeed homoeologous to Tg-D1, and therefore designated Tg-B1. Faris et al. (2014a) also showed that chromosome 2A of at least some wild emmer accessions harbored factors causing reduced threshability, suggesting the possible existence of a Tg-A1 allele on 2AS as well. Dvorak et al. (2012) also provided evidence for the existence of Tg-A1 alleles. Therefore, in regards to seed threshability, wild emmer would have the genotype Tg-A1Tg-A1/Tg-B1Tg-B1/qq.

On the contrary, fully domesticated durum wheat would necessarily have mutations at all three loci and have the genotype tg-A1tg-A1/tg-B1tg-B1/QQ because it is completely free-threshing. Cultivated emmer wheat (T. turgidum ssp. dicoccum) is intermediate to wild emmer and domesticated durum wheat, both in terms of position on the evolutionary lineage and in degree of threshability, i.e., cultivated emmer is non-free-threshing, but not as difficult to thresh as wild emmer. Cultivated emmer is known to have the q allele, which itself confers the non-free-threshing trait and a somewhat fragile rachis, but it is not known if cultivated emmer might possess Tg-A1, Tg-B1, or both alleles. In other words, while it is known that the mutation of q to Q occurred during the transition of cultivated emmer to durum wheat, it is not known if the mutations from Tg-A1 to tg-A1 and Tg-B1 to tg-B1 occurred during the transition of wild to cultivated emmer, or from cultivated emmer to fully domesticated durum.

Other reports of the evaluation of domestication traits in tetraploid wheat mostly involved populations derived from crosses between wild emmer and durum wheat (Simonetti et al. 1999; Peng et al. 2003) or wild emmer and cultivated emmer (Thanh et al. 2013), but to our knowledge, studies investigating transitions in domestication traits between cultivated emmer and durum wheat have not been conducted. Here, we report the identification of QTL associated with domestication, spike morphology, and yield-related traits in a cultivated emmer × durum wheat recombinant inbred line (RIL) population. The results provide further insights into the genetic factors and events that shaped the domestication of both durum and bread wheat.

Materials and methods

Plant materials

A population consisting of 200 recombinant inbred lines (RILs) was developed by crossing the durum wheat variety Ben (PI 596557) with the cultivated emmer wheat accession PI 41025 and advancing by the single seed-decent method to the F7:8 generation. Ben is a North Dakota hard amber durum variety (Elias and Miller 1998) and PI 41025 was collected near Samara, Russia. Seed of PI 41025 was obtained from the USDA-ARS National Small Grains Collection in Aberdeen, ID. The population of 200 RILs will hereafter be referred to as the BP025 population.

Phenotypic and statistical analysis

The parents and BP025 population were evaluated for the number of days to heading (DTH), spike length (SL), number of spikelets per spike (SPS), spike compactness (CMP), kernels per spike (KPS), grain weight per spike (GWS), thousand kernel weight (TKW), rachis fragility (FRA), and threshability (THR). All plants were grown in the greenhouse in a completely randomized design with one plant per pot, and each pot was considered as an experimental unit. Plants were grown in six-inch clay pots with Metro Mix 902 soil (Hummert International, Earth City, MO) and supplemental fertilizer at a temperature of 21 °C and a 16/8 h light/dark photoperiod. Traits DTH, SL, SPS, CMP, FRA, and THR were evaluated in two greenhouse seasons with three replications, whereas data for KPS, GWS, and TKW were evaluated with two replications.

DTH was measured as the number of days from planting to emergence of the first spike beyond the flag leaf. Measurements for SL, SPS, and CMP were conducted as described in Faris et al. (2014). Briefly, four main spikes were harvested at maturity and placed in a dryer at 32 °C for at least 24 h. SL was determined by measuring the spikes from the base of the first spikelet to the tip of the most terminal spikelet excluding the awns. SPS was calculated by counting the number of spikelets, and CMP was determined by dividing the spike length by the number of spikelets. KPS was calculated by counting the number of seeds produced by the four main spikes and dividing the number by 4. Similarly, GWS was determined by weighing the total seed sample from the four main spikes and dividing that number by 4. TKW for each line was determined by weighing 100 seeds and multiplying the number by 10.

The same four spikes from each plant used to calculate SL, SPS, CMP, KPS, and GWS were used to measure FRA and THR. FRA was measured by manually breaking the spike near the middle of the spike and scoring the ease of breakage on a scale from 1 to 4, where 1 = most difficult to break (as difficult as the fully tough rachis parent Ben); 2 = somewhat difficult to break but not as difficult as Ben; 3 = somewhat easy to break but not as easy as PI 41025; and 4 = very easy to break (as easy as PI 41025). THR was scored precisely as described in Faris et al. (2014a) using the 1–4 scale for ease of threshing.

For traits SL, SPS, FRA, and THR, the scores for the four spikes from each plant were averaged to derive the mean score for each plant within the replicate. For all traits, the mean scores for each plant were subjected to Bartlett’s Chi squared test for homogeneity of error variances among replications using PROC GLM in the SAS program version 9.3 (SAS institute 2011). Error variances were homogeneous among replications for all traits with the exception of DTH (data not shown). Therefore, the scores of each replicate for traits SL, SPS, CMP, KSP, GWS, TKW, FRA, and THR were used to calculate the overall mean for each RIL, and the mean values were used in subsequent analyses. Because the DTH data were not statistically homogeneous, means were not calculated and each replicate was analyzed separately for associated QTL. Fisher’s Least Significant Difference (LSD) test was used to determine significant differences among means of the RILs at the 0.05 level of probability.

Genotyping and linkage mapping

The BP025 population along with the parental lines was genotyped using the iSelect array containing 9000 wheat single nucleotide polymorphism (SNP) markers (Cavanagh et al. 2013). The SNP markers were designated with “IWA” followed by their index number. The genotyping assay was carried out using Illumina’s Infinium assay following the manufacturer’s protocols. SNP clustering and genotype calling were performed using Illumina’s GenomeStudio software v.2011.1. The genotype callings were manually inspected to correct cluster shifts due to copy number differences and to ensure call accuracy for every SNP.

In addition, several simple sequence repeat (SSR) markers with known chromosome locations were selected from each chromosome to help anchor genetic linkage groups to specific chromosomes. These SSR primers were selected from BARC (Song et al. 2005), CFA (Sourdille et al. 2003), CFD (Sourdille et al. 2004), WMS (marker designations = ‘gwm’) (Röder et al. 1998), GDM (Pestsova et al. 2000), and WMC (Somers et al. 2004) sets and tested for polymorphism between Ben and PI 41025 using the PCR conditions described by Röder et al. (1998).

The SSR primer set FCP650, which detects an SSR within the last intron of the Q gene (Simons et al. 2006) and has the marker designation Xfcp650(Q), was used to map the Q gene on chromosome 5A. The primer sequences for the FCP650 set are 5′ GCACTAGCTAATTCAGTGGTTAGATTTGCTCA 3′ and 5′ ATTCAGTGGTAGCAACAGTTTCAGTAAGCTGG 3′ and an annealing temperature of 65 °C was used.

Four sequence-tagged site (STS) markers previously developed by Abeysekara et al. (2010) for mapping the Tsc2 gene on chromosome 2B were also mapped in the BP025 population. One of the four markers was derived from the expressed sequence-tag (EST) BF484320, and the other three were developed from the tentative consensus (TC) sequences TC454839, TC432379, and TC422669.

Linkage maps were assembled using the computer program MapDisto 1.7 (Lorieux 2012). Markers were first organized into groups using the ‘find groups’ command with a minimum LOD = 3.0 and a maximum theta of 0.30. The ‘order’ sequence command was used to establish the initial order of markers within a linkage group. Subsequent interrogation of the sequence using the ‘check inversions,’ ‘ripple order,’ and ‘drop locus’ commands was conducted to determine the best map. Map distances were calculated using the Kosambi mapping function (Kosambi 1944).

QTL analysis

The DTH data for individual replicates and the overall means for all other traits were used in QTL analysis. Composite interval mapping (CIM) was performed using the computer program QGene 4.3 (Joehanes and Nelson 2008). A 1,000-iteration permutation test revealed a critical LOD threshold of 3.2 for the 0.05 level of probability.

Results

Map construction

Genotyping of the BP025 population with the 9 K iSelect SNP chip resulted in 2,461 segregating markers that were used for genetic linkage mapping (Table 1, Online Resource 1). In addition to the SNP markers, 128 SSR markers and four EST-STS markers were mapped in the BP025 population for a total of 2,593 markers, which detected 967 unique loci and accounted for a total genetic map length of 2,444.4 cM. This resulted in an average whole-genome marker density of one marker every 0.94 cM.

Table 1 Chromosome assignment and distribution of markers, length of chromosome linkage groups, and marker density of maps generated in the Ben × PI 41025 (BP025) recombinant inbred population

The average number of markers per chromosome was 185.2 and the number ranged from 89 for chromosome 4B to 335 for chromosome 2B (Table 1). Similarly, the average genetic length over all chromosome maps was 174.6 cM, and the map lengths for individual chromosomes ranged from 118.5 cM for 4B to 237.2 cM for 2B. Chromosome 1A was the most densely covered with one marker every 0.53 cM, and chromosome 5A was the least dense with one marker per 1.43 cM.

The number of markers, genetic map lengths, and marker densities for the A and B genomes were nearly equal. The A genome had 1,288 markers spanning 1,218.4 cM for a density of one marker per 0.95 cM, and the B genome had 1,305 markers spanning 1,226.0 cM with a marker density of one per 0.94 cM (Table 1).

Of the 2,593 markers, a total of 721 (28 %) had segregation ratios that significantly (P < 0.05) deviated from the expected 1:1 ratio (Table 1, Online Resource 1). Nearly all the distorted markers occurred in large clusters (Online Resource 1). With the exception of chromosome 4B, all chromosomes harbored some distorted markers with chromosome 2B having the most (214, 64 %). Chromosome 5B possessed a large cluster of the most severely distorted markers (P < 0.00001), which coincides with a genomic region of segregation distortion in tetraploid wheat studied in detail by Kumar et al. (2007).

Trait evaluations

The DTH data were not homogeneous and therefore each replicate was evaluated separately in the QTL analysis. However, we used overall means from the three replicates to determine the mean and range of the BP025 population, and to evaluate correlations with other traits. The average DTH for the parents of the RIL population differed by only a single day with Ben and PI 41025 having values of 59.67 and 58.67, respectively, and were therefore not significantly different (Table 2). However, the population ranged from 42.67 to 87.0 DTH indicating that the two parents possess different genes governing heading time.

Table 2 Parental and population means, ranges, and least significant differences (LSD) at the 0.05 level of probability (P < 0.05) for the traits days to heading (DTH), spike length (SL), spikelets per spike (SPS), spike compactness (CMP), kernels per spike (KPS), grain weight per spike (GWS), thousand kernel weight (TKW), rachis fragility (FRA), and threshability (THR)

Seed of Ben and PI 41025 along with mature spikes of Ben, PI 41025, and representative spikes from several BP025 RILs are presented in Fig. 1. Ben had an average SL that was 0.66 cm shorter than the average SL of PI 41025, but they did not differ significantly (Figs. 1, 2; Table 2). The average SL among all the RILs in the BP025 population was 6.0 cm, which was the midpoint value between the average parental SL values. However, RIL population had SL values that ranged from 3.36 to 8.81 cm, again suggesting that the parents harbor different genes governing SL.

Fig. 1
figure 1

Seed and spike morphology of the parents and several recombinant inbred lines of the Ben × PI 41025 (BP025) population. (a) Seed of the durum variety Ben (left) and the cultivated emmer wheat accession PI 41025 (right). (b) Mature spikes of Ben, PI 41025, and several recombinant inbred lines of the BP025 population

Fig. 2
figure 2

Histograms of the Ben × PI 41025 (BP025) recombinant inbred population for traits days to heading (DTH), spike length (SL), spikelets per spike (SPS), spike compactness (CMP), kernels per spike (KPS), grain weight per spike (GWS), thousand kernel weight (TKW), rachis fragility (FRA), and threshability (THR). For DTH, histograms for each individual replication were generated because the replications were not statistically homogeneous and therefore the data was not combined

Ben and PI 41025 had average SPS values of 16.53 and 26.64, respectively (Fig. 2; Table 2). Therefore, on average, PI 41025 spikes had 10.11 more spikelets per spike than did Ben. The RILs of the BP025 population had an average of 20.68 SPS, and the range across the population was 13.03–26.03 suggesting that PI 41025 was the sole contributor of genetic factors conferring an increased number of SPS.

Spike compaction (CMP) was calculated by dividing the SL by the SPS. Therefore, CMP values are a function of SL and SPS. Spikes of PI 41025 were significantly more compact than those of Ben with average CMP values of 0.25 and 0.36, respectively (Fig. 2; Table 2). Therefore, although Ben had slightly (albeit insignificantly) shorter spikes than PI 41025, it had less compact spikes than PI 41025 due to the fact that PI 41025 had significantly more SPS than did Ben. The BP025 population had an average overall CMP value of 0.29 and a range of 0.18–0.42 suggesting that the increased SPS in PI 41025 was not the only factor contributing to CMP and that both parents likely harbor gene(s) that influence this trait.

Ben and PI 41025 averaged 29.13 and 36.0 kernels per spike (KPS), respectively (Fig. 2; Table 2). Therefore, although PI 41025 averaged 10.11 more SPS than Ben, it averaged only seven more kernels. The BP025 population averaged 33.64 KPS but had a range of 8.13–54.63 KPS suggesting that both PI 41025 and Ben harbored genes that confer increased KPS, but the genes were different.

Although PI 41025 produced more seed than Ben, the GWS for Ben (1.60 g) was more than that for PI 41025 (1.04 g) (Fig. 2; Table 2). The RIL population averaged 1.40 g and ranged from 0.16 to 2.35 g, again indicating that Ben and PI 41025 both contribute to increased GWS, but though different genes.

For TKW, Ben had much heavier seed at 55.12 g compared to only 28.84 g for PI 41025 (Fig. 2; Table 2). The RIL population averaged 40.84 g and ranged from 10.28 to 54.33 g. It is likely that PI 41025 does not harbor alleles that contribute to increased seed weight because no RIL had a heavier TKW than Ben.

Ben had a fully tough rachis and was completely free-threshing resulting in average FRA and THR values of 1.33 and 1.00, respectively (Fig. 2; Table 2). Conversely, PI 41025 had a relatively fragile rachis that broke easily and seed that was not free-threshing resulting in average FRA and THR values of 3.92 and 3.75, respectively. The BP025 population had average FRA and THR values of 2.36 and 2.26, respectively, and a range of 1.00–4.00 for both traits suggesting that Ben was responsible for contributing the genetic factors controlling a tough rachis and free-threshing seed.

The correlation coefficients among the six traits evaluated are presented in Table 3. DTH was negatively correlated with SL, CMP, GWS, and TKW. SL had a positive correlation with all traits. In addition to a positive correlation with SL, KPS, and GWS; SPS had a weak positive correlation with THR and a strong negative correlation with CMP. CMP had strong negative correlations with DTH and SPS, and strong positive correlations with SL, TKW, FRA, and THR as well as a weaker positive correlation with GWS. KPS was strongly correlated with SL, SPS, GWS, and TKW, and it had a weak negative correlation with FRA. GWS had weak negative correlations with DTH and FRA, and stronger positive correlations with SL, SPS, CMP, KPS, and TKW. FRA had positive correlations with SL, CMP, and THR, and weak negative correlations with KPS and GWS. THR was positively correlated with SL, SPS, CMP, and FRA.

Table 3 Correlation coefficients between the mean values of the traits days to heading (DTH), spike length (SL), spikelets per spike (SPS), spike compactness (CMP), kernels per spike (KPS), grain weight per spike (GWS), thousand kernel weight (TKW), rachis fragility (FRA), and threshability (THR)

QTL analysis

A total of four QTL associated with DTH were identified (Fig. 3; Table 4). The DTH QTL was identified on chromosomes 1B, 2B, 5A, and 5B, and designated QEet.fcu-1B, QEet.fcu-2B, QEet.fcu-5A, and QEet.fcu-5B, respectively. PI 41025 contributed the earliness effects of the 1B and 2B QTL, and they explained as much as 16.5 and 9.7 % of the variation in DTH, respectively. Ben contributed the earliness effects of the 5A and 5B QTL, which explained up to 19.2 and 17.6 % of the DTH variation for the different replicates, respectively.

Fig. 3
figure 3

Illustration of the chromosomal locations of the 37 quantitative trait loci (QTL) associated with the nine traits evaluated. A centiMorgan (cM) scale is shown on the left. The known positions of the QTg-B1, Vrn-A1, and Vrn-B1 loci are indicated in black. Chromosomes 6A and 6B are not shown because no QTL was detected on them in this research

Table 4 Quantitative trait loci associated with the traits evaluated in the Ben × PI 41025 (BP025) recombinant inbred population

Three QTL associated with SL were identified on chromosomes 4A, 4B, and 5A (Fig. 3), and they were designated QEl.fcu-4A, QEl.fcu-4B, and QEl.fcu-5A (Table 4). Ben contributed effects for decreased spike length at all three loci. The 4A, 4B, and 5A QTL had LOD values of 6.5, 7.0, and 26.5 and explained 9.1, 7.7, and 27.3 % of the variation in SL, respectively. QEl.fcu-5A peaked at marker Xfcp650(Q) on 5A indicating that the effects for decreased spike length for this QTL were conferred by the Q allele of Ben.

Four significant QTL were associated with SPS, and PI 41025 contributed the effects for increased SPS at all four loci (Fig. 3; Table 4). The four QTL, designated QSpn.fcu-1B, QSpn.fcu-3B, QSpn.fcu-7A, and QSpn.fcu-7B, were located on chromosomes 1B, 3B, 7A, and 7B, respectively. The LOD values for these QTL ranged from 4.7 to 7.8, and they explained from 6.2 to 11.8 % of the variation in SPS.

Nine QTL were found to be significantly associated with CMP (Fig. 3; Table 4). These consisted of single QTL on chromosomes 1B (QCmp.fcu-1B), 2B (QCmp.fcu-2B), 3B (QCmp.fcu-3B), 5B (QCmp.fcu-5B), and 7A (QCmp.fcu-7A), and two QTL each on chromosomes 2A (QCmp.fcu-2A.1 and QCmp.fcu-2A.2) and 5A (QCmp.fcu-5A.1 and QCmp.fcu-5A.2). With the exception of QCmp.fcu-5A.2, the effects for increased CMP at all the other QTL were contributed by PI 41025. These eight QTL had LOD values ranging from 4.2 to 6.8 and explained from 2.0 to 7.0 % of the variation in CMP. Ben contributed the effects for increased CMP at the QCmp.fcu-5A.2 locus. This QTL had an LOD value of 36.2 and explained 31.6 % of the variation in CMP. The marker Xfcp650(Q) defined the peak of this QTL indicating that the Q allele from Ben was the genetic factor underlying the effects of the QTL.

Three QTL associated with KPS were identified (Fig. 3; Table 4). The QTL QKps.fcu-2B was located on chromosome 2B with an LOD of 17.7 and explained 15.2 % of the variation. Effects for increased KPS at this locus were contributed by Ben. PI 41025 contributed increased KPS for the other two QTL, which were located on chromosomes 5B and 7A and designated QKps.fcu-5B and QKps.fcu-7A, respectively. These two QTL each explained 5.6 % of the variation.

The four QTL associated with GWS were located on chromosomes 1A, 2B, 5B, and 7A (Fig. 3; Table 4). The 2B QTL, QGws.fcu-2B, had the largest effects with an LOD of 14.0 and explaining 16.6 % of the variation. QGws.fcu-1A, QGws.fcu-5B, and QGws.fcu-7A had LOD values of 9.0, 4.6, and 5.0, and explained 6.4, 7.0, and 2.9 % of the variation, respectively. Ben contributed the effects for increased GWS at the 2B and 1A loci, and PI 41025 contributed increased GWS at the 5B and 7A loci. A survey of the mean GWS for all RILs indicated that five lines (BP025-41, BP025-50, BP025-114, BP025-115, and BP025-118) had significantly more GWS than either parent. Investigation of the genotypes harbored by these five RILs at each of the four QTL revealed that BP025-114 and BP025-115, the two RILs with the highest GWS among all RILs, harbored alleles for increased GWS at all four loci (Fig. 4). BP025-41 had the beneficial alleles at all loci except for QGws.fcu-7A and BP025-118 had harbored alleles for increased GWS at all loci except for QGws.fcu-1A. BP025-50 harbored only one of the four QTL alleles for increased GWS, which was QGws.fcu-7A.

Fig. 4
figure 4

Graphical genotypes of QTL regions of chromosomes 1A, 2B, 5B, and 7A, which harbor QTL for increased GWS. The graphical genotypes of Ben, PI 41025, and the RILs with significantly greater GWS than Ben including BP025-41, BP025-50, BP025-114, BP025-115, and BP025-118 are shown where open bars represent Ben chromatin and black represent PI 41025 chromatin. The QTL regions are indicated by the hatched boxes. A cM scale is indicated

Chromosomes 1A, 2B, and 3B carried QTL for TKW, and they were designated QTkw.fcu-1A, QTkw.fcu-2B, and QTkw.fcu-3B, respectively (Fig. 3; Table 4). The effects for increased TKW for all three loci were contributed by Ben, and they had LOD and R 2 values that ranged from 4.3 to 11.9 and 0.047 to 0.083, respectively.

QTL associated with FRA were identified on chromosomes 3B and 5A, and designated QFra.fcu-3B and QFra.fcu-5A, respectively (Fig. 3; Table 4). QFra.fcu-3B had an LOD score of 7.1 and explained 6.8 % of the variation, whereas QFra.fcu-5A, for which the Q gene defined the peak, had an LOD of 18.6 and explained 27.7 % of the variation in FRA. The effects of both QTL for rachis toughness were contributed by Ben.

The Q locus represented by marker Xfcu650(Q) was also the most significant marker associated with THR and defined the peak of the QTL QFt.fcu-5A on chromosome 5A (Fig. 3; Table 4). In addition to this QTL, two QTL associated with THR were identified on chromosome 2A (QFt.fcu-2A.1 and QFt.fcu-2A.2) and one QTL each on chromosomes 2B (QFt.fcu-2B) and 3A (QFt.fcu-3A). The 5A QTL had the largest effects on THR with an LOD of 28.9 and explained 30.9 % of the variation in THR. The 2B QTL was the second largest QTL associated with THR with an LOD of 16.9 and explained 17.2 % of the variation. The 2A and 3A QTL had LOD values ranging from 4.6 to 9.7 and explained from 2.2 to 5.7 % of the phenotypic variation. The parent Ben contributed effects for ease of threshability for all five of these QTL.

Overlapping QTL

Evaluation of the nine traits described in this research led to the identification of a total of 37 QTL. However, the 37 QTL were represented by 24 unique loci on 12 chromosomes due to some loci likely having pleiotropic effects on multiple traits or being tightly linked (Fig. 3). QTL for GWS (QGws.fcu-1A) and TKW (QTkw.fcu-1A) co-localized on chromosome 1A. On homoeologous group 2 chromosomes, QTL for CMP and THR on 2A (QCmp.fcu-2A.1 and QFt.fcu-2A.2) and 2B (QCmp.fcu-2B and QFt.fcu-2B) co-localized. Also, QTL for KPS (QKps.fcu-2B), GWS (QGws.fcu-2B) and TKW (QTkw.fcu-2B) coincided on chromosome 2B. The chromosome 3B QTL QSpn.fcu-3B, QCmp.fcu-3B, and QFra.fcu-3B all co-localized to the 58–62 cM position (Fig. 3; Table 4). QTL for CMP and SPN (QCmp.fcu-7A and QSpn.fcu-7A) co-localized on chromosome 7A, and QTL for KPS and GWS coincided at the same positions on chromosomes 5B (QKps.fcu-5B and QGws.fcu-5B) and 7A (QKps.fcu-7A and QGws.fcu-7A). Most remarkably, QTL for FRA, THR, CMP, and SL (QFra.fcu-5A, QFt.fcu.5A, QCmp.fcu-5A, and QEl.fcu-5A) all co-localized at the Xfcp650(Q) locus on chromosome 5A demonstrating the pleiotropic effects of the Q gene.

Discussion

Map construction

Most previous genetic linkage maps developed for durum or other tetraploid wheat subspecies have employed molecular marker platforms such as restriction fragment length polymorphisms (RFLPs), amplified fragment length polymorphisms (AFLPs), SSRs, diversity array technology (DArT), SNPs, or combinations thereof (Blanco et al. 1998; Elouafi and Nachit 2004; Mantovani et al. 2008; Chu et al. 2010; Buerstmayr et al. 2012; Marone et al. 2012; Colasuonno et al. 2013; Thanh et al. 2013). vanPoecke et al. (2013) recently conducted genotype-by-sequencing (GBS) demonstrating the utility of next generation sequencing for de novo SNP marker discovery and genotyping in durum wheat. The wheat iSelect 9 K array was developed for genotyping in hexapoid wheat (Cavanagh et al. 2013) and its use in tetraploid wheat has not been reported other than for marker discovery and linkage mapping of the short arm of chromosome 2B in a population of RILs derived from a cross between durum wheat and a durum-T. turgidum ssp. dicoccoides disomic chromosome 2B substitution line (Faris et al. 2014a). Here, we showed that the iSelect 9 K array originally developed for hexaploid wheat works very well for the development of high-density whole-genome linkage maps in tetraploid wheat as well.

Days to heading

The primary objective of this work was to evaluate domestication-related traits, i.e., rachis fragility and threshability, to gain insights regarding the transition events that occurred during the evolution of cultivated emmer to durum wheat, but also to evaluate spike morphology and yield-related traits as well as heading time. There are three genetic systems in wheat that affect heading/flowering time in wheat: vernalization, photoperiod sensitivity, and earliness per se (see Kamran et al. 2014 for review). Of the four QTL identified for DTH, two were on the long arms of chromosomes 5A and 5B in the vicinity of the vernalization genes Vrn-A1 and Vrn-B1, respectively. Therefore, it is probable that the effects of QEet.fcu-5A and QEet.fcu-5B are due to allelic variation between Ben and PI 41025 within the Vrn-A1 and Vrn-B1 loci. The QTL for DTH (QEet.fcu-2B) mapped to the short arm of 2B, where the photoperiod sensitivity gene Ppd-B1 is known to be located in common wheat (Mohler et al. 2004) making it possible that the earliness effects contributed by PI 41025 at QEet.fcu-2B are due to allelic variation in that gene. Finally, the DTH QTL on 1B (QEet.fcu-1B) mapped near the distal end of the long arm where Lin et al. (2008) detected a QTL for flowering time in common wheat thus suggesting the possibility that the 1BL QTL detected in our research is the same.

Spike morphology

QTL affecting SL were identified on 4A, 4B, and 5A. The QTL on 5A was due to the effects of the Q gene, which was expected because it is well known that the Q allele confers a shorter spike relative to the q allele (see Faris et al. 2005 for review). QTL affecting spike length have been previously reported in the vicinity of QEl.fcu-4A on chromosome 4A (Borner et al. 2002; Jantasuriyarat et al. 2004) and of QEl.fcu-4B on chromosome 4B (Wang et al. 2010) making it possible that the ones identified in this research are the same as those previously reported. It is interesting to note that although the effects for a shortened spike were contributed by Ben for all three QTL, Ben and PI 41025 did not differ significantly in SL and the RIL population exhibited strong transgressive segregation for this trait. This would suggest that PI 41025 harbors genes that contribute to a shortened spike that differ from those of Ben, and that they went undetected. Reasons for this are unclear, but it is possible that PI 41025 contains numerous genes with minor effects on SL. In this case it is worthy to note that QTL with effects for reduced SL contributed by PI 41025 that approached the significance threshold (LOD > 2.5) were detected on chromosomes 2A, 2B, 5A, and 6A, but were not considered because they did not reach the critical LOD threshold of 3.2. It is possible that these represent real QTL with effects contributed by PI 41025. It is also possible that epistatic interactions exist that contributed to the transgressive segregation.

Although SPS relates directly to yield, it also influences spike morphology. The number of SPS and other yield components have been evaluated in many previous studies leading to reports of QTL associated with SPS on most of the wheat chromosomes. Therefore, it is likely that the four QTL associated with SPS in the BP025 population have been reported before. For example, Peng et al. (2003) reported a QTL on 1BS associated with SPS that may be the same as QSpn.fcu-1B, and Jantasuriyarat et al. (2004) and Ma et al. (2007) have reported SPS QTL in the same vicinity of QSpn.fcu-7A. Many yield and yield-related QTL have been reported on 3B (Borner et al. 2002; Groos et al. 2003; Elouafi and Nachit 2004; Cuthbert et al. 2008; Graziani et al. 2014). However, to our knowledge, QTL associated with SPS have not previously been reported on 7B.

QTL for spike compactness have been reported on nearly all chromosomes as well, and they coincide many times with QTL for SL, but also coincide sometimes with QTL for SPS or both SL and SPS (Wang et al. 2010; Faris et al. 2014b). In this work, we found that the CMP QTL on chromosomes 3B and 7A coincided with SPS QTL, whereas the 5A CMP QTL coincided with a SL QTL due to the underlying effects of Q. The remaining six CMP QTL did not coincide with either SPS or SL QTL, however it is interesting to note that a CMP QTL on 2B was very near the 2B QTL for THR—which was most likely due to the Tg-B1 gene (see below)—and a CMP QTL on 2A nearly coincided with a 2A THR QTL, which could perhaps represent Tg-A1. It may be possible that the Tg genes on group 2 chromosomes have slight, albeit significant, pleiotropic effects on spike compactness as well as threshability.

Yield-related traits

QTL for yield and yield components have been identified on most, if not all, wheat chromosomes (Marza et al. 2005; Edae et al. 2014), and some potentially coincide with the QTL identified in this research for KPS, GWS, and TKW. For example, Borner et al. (2002) reported QTL for GWS on 1AS, 2BS, and 7AS, and a QTL on 3B for TKW that may correspond to QGws.fcu-1A, QGws.fcu-2B, QGws.fcu-7A, and QTkw.fcu-3B, respectively. Chromosome 5B has been reported to harbor QTL for various yield components as well (Gonzalez-Hernandez et al. 2004; Edae et al. 2014), which may indicate the previous identification of QTL associated with KPS and GWS as identified in this research.

The QTL on chromosomes 7A and 5B was associated with both KPS and GWS indicating that the underlying genes contributed by PI 41025 contribute primarily to increase the number of kernels per spike, and this increase leads to an increase in the weight of the spike. On the contrary, the 1A QTL is associated with GWS and TKW, but not KSP, indicating that the gene(s) underlying the 1A QTL and contributed by Ben enhance seed weight but not the number of kernels. The chromosome 2B QTL was associated with all three traits (KPS, GWS, and TKW) and had the largest effects. Therefore, the gene(s) underlying the 2B QTL contributed by Ben has pleiotropic effects on the number of seeds produced per spike and the weight of the seed.

Based on the results of this research it is evident that PI 41025 harbors alleles that confer an increased number of SPS, Ben contains alleles that contribute an increase in seed weight (TKW), and both Ben and PI 41025 harbor alleles that contribute to the number of KPS. It should therefore be possible to combine alleles for increased SPS and KPS from PI 41025 with alleles for increased KPS and TKW from Ben to achieve genotypes with significant yield increases compared to either parent. Indeed, the measure of GWS in the population revealed five RILs with GWS values significantly better than Ben. The best RIL, BP025-114, had a GWS value of 2.35 g/spike, which was 0.75 g/spike more than Ben (a BP025-114 spike is shown in Fig. 1). It is also interesting to note that, because BP025-50 had significantly higher GWS than Ben but carried alleles for increased GWS at only one of the four detected QTL (QGws.fcu-7A), it is likely that other QTL affecting GWS went undetected. It would be interesting to conduct large field-based experiments to evaluate more accurately yield and yield-related traits in this population, because some RILs, perhaps such as BP025-114, may be useful for incorporation into breeding programs to achieve advances in durum wheat yields.

Rachis fragility and seed threshability

The pleiotropic effects of the Q gene on morphological and domestication traits, including rachis fragility, are well documented (Watkins 1940; Sears 1956; Muramatsu 1963, 1979, 1985, 1986; Kato et al. 1999, 2003; Faris and Gill 2002; Faris et al. 2003, 2005; Simons et al. 2006; Zhang et al. 2011). Here, the profound effects of Q on FRA and THR were fully expected, because cultivated emmer wheat is known to possess the q allele and durum wheat the Q allele. The other QTL associated with FRA was on chromosome 3B. Whereas the effects of QFra.fcu-3B were relatively minor in comparison to those of the QTL governed by the Q locus (QFra.fcu-5A), it is interesting to note that the position of QFra.fcu-3B corresponds well with the map position of the brittle rachis gene Br-B1 (Nalam et al. 2006), which is considered to have undergone mutation along with other brittle rachis genes during the transition of wild emmer wheat (T. turgidum ssp. dicoccoides) to cultivated emmer wheat (T. turgidum ssp. dicoccum). It is possible that some residual effects of Br-B1 remain and yet contribute slightly to rachis fragility in cultivated emmer. But, it is also interesting to note that QFra.fcu-3B coincided with QTL for SPS and CMP, suggesting that, whatever the gene responsible for QFra.fcu-3B, it likely has pleiotropic effects on all three traits.

As expected, the Q gene had the largest impact on THR explaining over 30 % of the variation. The QTL with the second largest impact was QFt.fcu-2B, which corresponds well with the position of Tg-B1 determined by Faris et al. (2014a), who used a segregating mapping population derived from a Langdon durum–T. turgidum ssp. dicoccoides chromosome 2B disomic chromosome substitution line crossed with Langdon to map Tg-B1. In both this work and that of Faris et al. (2014a), the locus governing threshability mapped very near the SSR marker Xwmc154, indicating they are likely the same gene. The observance of Tg-B1 contributed by PI 41025 in this research thus indicates that the mutation in Tg-B1 giving rise to the free-threshing tg-B1 allele did not occur during the transition of wild to cultivated emmer, but instead must have happened during the transition of cultivated emmer to durum or another fully domesticated tetraploid subspecies.

Dvorak et al. (2012) and Faris et al. (2014a) both presented evidence of the existence of factors inhibiting threshability on chromosome 2A in polyploid wheat and speculated that it was most likely the homoeologue of Tg-B1 and Tg-D1, which would be referred to as Tg-A1. It is likely that one of the two THR QTL on chromosome 2A in the BP025 population represents Tg-A1. The genetic position of QFt.fcu-2A.1 along chromosome 2A is similar to that of the Tg-B1 locus (QFt.fcu-2B) on chromosome 2B, which might suggest that QFt.fcu-2A.1 is associated with Tg-A1. However, it is interesting to note that the second QTL on chromosome 2A (QFt.fcu-2A.2) is associated with CMP (QCmp.fcu-2A) as well as THR, and the Tg-B1 locus on 2B is also associated with both of these traits. Although the Tg loci have not previously been reported to be associated with spike compactness, it is possible that they might influence the trait in certain genetic backgrounds. If this was the case, then QFt.fcu-2A.2 might represent Tg-A1. In either case, more work is needed to determine which 2A QTL (QFt.fcu-2A.1 or QFt.fcu-2A.2) is homoeologous to QFt.fcu-2B and therefore representative of Tg-A1.

Insights into wheat domestication events

It has long been known that cultivated emmer harbors the q allele because the seed is hulled and the rachis somewhat fragile, however, it had not been determined if cultivated emmer possessed Tg-B1, Tg-A1, neither gene, or perhaps both genes. The results of the current study indicate that the cultivated emmer accession PI 41025 clearly harbors Tg-B1, and suggest that it contains Tg-A1 as well. Whereas mutations in genes conferring a brittle rachis led to the transition of wild emmer to cultivated emmer wheat, seeds of the latter remained hulled and difficult to thresh due to the likelihood of having the genotype Tg-A1Tg-A1/Tg-B1Tg-B1/qq. The evolution of the first free-threshing tetraploid wheat then required mutations at all three loci, i.e., Tg-A1, Tg-B1, and q to give rise to the genotype tg-A1tg-A1/tg-B1tg-B1/QQ. The mutations in these three loci likely occurred during the transition from cultivated emmer to the first free-threshing tetraploid. However, other cultivated emmer accessions should be evaluated for threshability QTL to further validate this hypothesis.

With regards to the formation of hexaploid common wheat (T. aestivum), it is most probable that a free-threshing tetraploid subspecies with genotype tg-A1tg-A1/tg-B1tg-B1/QQ was involved in the amphiploidization event (Matsuoka and Nasuda 2004; Dvorak et al. 2012; Faris 2014; Faris et al. 2014a) with A. tauschii in the Middle East. The first hexaploid amphiploid would have been non-free-threshing due to the acquisition of Tg-D1 from A. tauschii (McFadden and Sears 1946; Kerber and Rowland 1974). The amphiploid must have acquired a mutation in Tg-D1 very rapidly to give rise to the tg-D1 allele and fully free-threshing wheat, because true non-free-threshing hexaploid ancestors have not been observed in the archeological record (Nesbitt and Samuel 1996). This fact lends further support to the argument that the tetraploid parent of T. aestivum was free-threshing with genotype tg-A1tg-A1/tg-B1tg-B1/QQ, because if it was not, then it would have been necessary for the original hexaploid to undergo mutations at multiple loci to become free-threshing, which would be expected to take a significant amount of time. Therefore, it is likely that the original hexaploid wheat plant had the genotype tg-A1tg-A1/tg-B1tg-B1/Tg-D1Tg-D1/QQ and underwent a mutation in Tg-D1 to give rise to the fully domesticated T. aestivum ssp. aestivum with genotype tg-A1tg-A1/tg-B1tg-B1/tg-D1tg-D1/QQ.

Conclusions

Although cultivated emmer wheat was instrumental to the Agricultural Revolution of the Neolithic times, which led to the rise of human civilization, today it is the fully domesticated durum wheat that has replaced cultivated emmer as the primary form of tetraploid wheat that is grown worldwide and provides a significant source of calories for humans. Pronounced differences in agronomic, morphological, productivity-associated, and domestication-related traits exist between cultivated emmer and modern domesticated tetraploid wheat, and the current research provides insights regarding the transitional events that led not only to a fully tough rachis and free-threshing seed, but also to alterations in spike architecture and increased seed weight and yield. Although modern durum wheat is superior to cultivated emmer in agronomic performance and productivity, cultivated emmer provides a rich pool of genetic diversity from which to obtain genes for potentially desirable traits such as enhanced yield, quality, disease and pest resistance, and agronomic performance for the improvement of durum wheat varieties.

Author contributions

JF, QZ, SC, ZZ, and SX performed experiments and collected data. JF and SX analyzed and interpreted the data. JF, SC, and SX wrote the paper.