Introduction

Cultivated peanut (Arachis hypogaea L.) is an allotetraploid (AABB, 2n = 4× = 40) species that may be the product of a single hybridization event between A. duranensis (AA, 2n = 2× = 20) and A. ipaënsis (BB, 2n = 2× = 20) followed by chromosome duplication (Kochert et al. 1996). Peanut is not only used as human food and an edible oil but also as livestock fodder and green manure. In addition, peanut is cultivated in tropical and sub-tropical regions in more than 100 countries, with a global annual production of 38.6 Mt over an area of 21.8 Mha (http://faostat.fao.org/faostat/collections?subset=agriculture 2011). However, with the rapid increase of human populations worldwide, production has been unable to satisfy global demand. In China, the field area of 4.7 Mha was harvested with a total peanut production of 16.1 Mt, which is the largest area of production worldwide but not the highest yield per hectare (3.4 t ha−1), suggesting that there is great potential to further enhance peanut production through the genetic improvement of high-yield varieties.

Yield is one of the most important and complex traits in crops. Peanut yield is directly and indirectly influenced by agronomic traits, such as the height of the main stem and total branching number. Because the pod is one of the most important organs, pods play a paramount role in growth and development processes and serve to protect the developing seeds from biotic and abiotic stresses. Pod- and seed-related traits, such as pod length, pod width, seed length and seed width directly influence peanut yield. In addition, more than 60 % of the total peanut production in China is crushed for edible oil. Fatty acid composition influences the quality and storage stability of the total oil. Thus, oil content and fatty acid composition have also become increasingly important trait objectives in the breeding of high-yield peanut varieties.

Agronomic and quality-related traits are almost always quantitative traits in plant species. Linkage mapping or QTL mapping based on segregating populations derived from bi-parental crossing is the most popular and successful method of identifying QTLs that control complex traits in model plants (Miura et al. 2011; Xing and Zhang 2010). In non-model species, however, QTL mapping is hindered because the number of markers is not large enough to construct saturated genetic maps. SSR markers that feature codominant heritability, genome-wide dispersal and simple amplification have become the most valuable markers for diverse studies including genetic mapping, comparative genomics, molecular fingerprinting and marker-assisted selection (Gupta and Varshney 2000). The first SSR-based genetic linkage map for cultivated peanut was constructed by Varshney et al. (2009) with a recombinant inbred line (RIL) population derived from TAG24 × ICGV86031. Because of low genetic polymorphism between any two varieties of cultivated peanut, only 135 SSR loci were included on the genetic map and classified into 22 linkage groups (LGs). With abundant SSRs developed from various resources, including cDNA libraries (Koilkonda et al. 2012; Proite et al. 2007), SSR-enriched genomic DNA libraries (Naito et al. 2008), bacterial artificial chromosome (BAC)-end sequences (Wang et al. 2012) and transcript sequences (Zhang et al. 2012), a number of genetic linkage maps have been constructed based on F 2 and RIL populations (Gautami et al. 2012a; Qin et al. 2012; Shirasawa et al. 2012; Wang et al. 2012). A high-density genetic linkage map has recently been reported. The map is composed of 16 single-linkage maps and contains 3693 marker loci covering 2651 cM across 20 consensus LGs that include A and B genomes (Shirasawa et al. 2013). Furthermore, a number of linkage studies have been undertaken to identify QTLs for a wide range of various traits in peanut, including resistance to biotic stresses (Khedikar et al. 2010; Leal-Bertioli et al. 2009; Qin et al. 2012; Sujay et al. 2012) and abiotic stress (drought tolerance) (Gautami et al. 2012b; Ravi et al. 2011; Varshney et al. 2009), nutritional quality (protein content, oil content, and oleic acid) (Sarvamangala et al. 2011) and several agronomic traits (Shirasawa et al. 2012). Although great progress has been made for these traits, the QTL pattern of agronomic and quality-related traits remains unclear.

In this study, we developed a molecular genetic map of peanut based on an F 2 mapping population using published SSR markers. By linking with phenotypes across multiple environments, QTL mapping was performed to unravel the genetic basis of 18 agronomic and quality-related traits in peanut.

Materials and methods

Plant materials

An F 2 mapping population derived from Zhonghua 10 × ICG12625 (n = 232), was used to construct a linkage map. The female parent, Zhonghua 10, belongs to A. hypogaea var. vulgaris and is large-seeded with pink testa and two seeds per pod. The male parent, ICG12625, belongs to A. hypogaea var. aequatoriana and is small-seeded with dark purple testa and three seeds in each pod.

Trait measurement

Because of the low fertility of self-pollinated offspring, only 144 lines in the F 3 generation were obtained. The 144 F 3 lines and parental lines were planted in experimental fields of Wuhan, Nanchong and Zhumadian in China in 2011. Each accession was planted in a single-row plot with the one-replication randomized block design, and there were 8–10 plants in each row with 10 cm between plants within each row and 30 cm between the rows. Eight plants were selected randomly from each accession to investigate the phenotypes. Agronomic traits were investigated according to previously described standard procedures (Jiang et al. 2006), including height of main stem, total branching number, pod length, pod width, seed length, seed width, hundred-pod weight, hundred-seed weight and shelling percentage. The oil content and composition of fatty acids were tested with nuclear magnetic resonance and gas chromatography, respectively, using fresh-dried mature seeds with intact testa. To reduce the influence of environmental factors on phenotypic characterization, the mean value of the phenotypes across three environments for each trait was used in following analyses. A principal component analysis (PCA) of the RIL for the traits and the correlation analysis between the traits and inferred PCs (principal components) were performed using the R package (R Development Core Team 2012).

DNA extraction and SSR genotyping

Genomic DNA was extracted from young leaves collected from each line using a modified cetyltrimethyl ammonium bromide (CTAB) method. The integrity and quality of the DNA was evaluated on a 1 % agarose gel by comparison with uncut lambda DNA.

A total of 3371 SSR markers from different resources was used to screen the polymorphism between two parental genotypes (Table S1) and polymorphic SSRs were used to genotype the F 2 population. SSR markers with the prefixes pPGPseq, pPGSseq, TC, IPAHM, Ah, RI, EE, EM, GA, GM, GNB, AC, Ad, ARS, gi, AHBGS, PM, AHS, AHGS and HAS were obtained from the literatures (Bravo et al. 2006; Cuc et al. 2008; Ferguson et al. 2004; Gimenes et al. 2007; He et al. 2003; Hopkins et al. 1999; Liang et al. 2009; Macedo et al. 2012; Moretzsohn et al. 2005, 2009; Qin et al. 2012; Shirasawa et al. 2012; Wang et al. 2012; Zhang et al. 2012). SSR markers with prefixes XY and POCR were developed and published by our laboratory (Tang et al. 2012), and those with prefixes AGGS were developed and unpublished by our laboratory. Based on the origin of the SSRs, we classified the total published SSRs (3371) into four groups: genomic SSRs (1467), EST-SSRs (1589), BAC-end SSRs (155), and transcript-SSRs (160). The PCR reactions followed the protocol described by Chen et al. (2008), and the PCR products were visualized on a 6 % polyacrylamide gel followed by silver staining. The fragment sizes of the PCR products were estimated by comparison with a 50 bp DNA ladder.

Construction of the genetic linkage map

Pearson’s Chi square test was used to assess the goodness of fit to the expected 1:2:1 segregation ratio for each codominant marker or expected 3:1 segregation ratio for each dominant marker (P < 0.05). A genetic linkage map was constructed using JoinMap 3.0 (Van and Voorrips 2001) with the minimum logarithm of odds (LOD) of 4.0. The recombination ratio was converted to genetic distance by the Kosambi mapping function (Kosambi 1944). Linkage groups were aligned to a published linkage map based on common markers (Shirasawa et al. 2013), and the linkage groups belonging to the A and B genome were designated A1–A10 and B1–B10, respectively.

QTL analysis

QTL analyses for agronomic traits, oil content and fatty acids were performed separately. PC significantly related to agronomic traits and quality-related traits were also subjected to QTL analysis. The composite interal mapping method (Zeng 1994) implemented in the software windows QTL cartographer 2.5 (http://statgen.ncsu.edu/qtlcart/WQTLCart.htm) was used to conduct the QTL analysis. The forward regression method model 6 (default model) was selected to obtain covariates. The number of control markers, window size and walk space were set to 5, 10 and 2 cM, respectively. The threshold of LOD for declaring the presence of a QTL was determined by 1000 permutation tests. The nomenclature of QTL was similar to that described by Udall et al. (2006). QTLs are designated with initial letter q followed by the trait name and linkage group. An alphabetical letter was added if more than one QTL was detected for the same linkage group for one trait. For example, if two QTLs for seed width were detected on A2, they were named qSWA2.1 and qSWA2.2. Dominance degree (d) is defined as the absolute value of dominance divided by additive effect, which was used to classify the detected QTLs for all traits into the following three groups: (1) additive QTL, where 0 < d < 0.5; (2) dominance QTL, where 0.5 < d < 1; and (3) over-dominance QTL, where d > 1 (Zhou et al. 2012).

Results

Phenotypic variation of agronomic and quality-related traits

Eighteen agronomic and quality-related traits were investigated in the 144 F 2:3 lines in three locations in 2011. An analysis of variance (ANOVA) indicated that the genotypic effect (G) and environmental effect (E) had significant influences on agronomic and quality-related traits (Table 1). Therefore, the mean value of each trait across three environments was used in the subsequent analysis to reduce the instability of phenotypic characterization. Large variation was observed in the F 3 population for these traits (Table 1). Eicosenoic acid showed the largest change (6.0-fold), which varied from 0.3 to 1.8 % and averaged 0.8 %, whereas the oil content showed the lowest change (1.3-fold), which varied from 45.4 to 57.1 % and averaged 51.2 %. A similar phenomenon was observed for the coefficient of variation (CV), which ranged from 3.7 % for the oil content to 30.3 % for the total branching number. In addition, the phenotypic distribution for all of the traits revealed near normality (Fig. 1). The broad sense heritability of all traits was relatively high and ranged from 0.711 for the total branching number to 0.863 for eicosenoic acid (Table 1); however, the broad sense heritability for palmitic acid (0.642) was lower, which indicated that genetic factors were predominant in determining the phenotypes of these traits. A correlation analysis was conducted to determine the association between these traits (Table 2); oleic acid had the highest significant negative association with linoleic acid (r = −0.885, P < 0.001), whereas, the hundred-pod weight had the highest significant positive association with hundred-seed weight (r = 0.856, P < 0.001). For pod length, pod width, seed length, seed width, hundred-pod weight and hundred-seed weight, positive correlations were revealed between each pair of the six traits. Similarly, positive correlations were also revealed between each pair of the following traits: height of main stem, total branching number and oil content. Furthermore, negative correlations were revealed between each of the former six traits and each of the latter three traits.

Table 1 Phenotypic variation for agronomic and quality-related traits in the peanut panel
Fig. 1
figure 1

Phenotypic distribution of 18 agronomic and quality traits in the F 2:3 population

Table 2 Correlation analysis for the agronomic and quality-related traits

SSR polymorphisms and genetic map construction

From a total of 3371 SSR markers, a set of 341 Genomic SSRs (23.24 %), 82 EST-SSRs (5.16 %), 32 BAC-end SSRs (20.65 %) and 15 transcript-SSRs (9.38 %) were polymorphic between the parents ‘Zhonghua 10’ and ‘ICG12625’, indicating that random genomic SSRs had higher polymorphism than SSRs within coding regions. Four hundred-seventy polymorphic SSR primers were used to genotype the F 2 population of 232 individuals (Table 3). According to the SSR band segregation patterns in the F 2 population, 26 of 470 SSRs amplified two loci for each SSR, and the remaining 444 SSRs amplified a single locus for each SSR. Among these 496 SSRs, 461 SSRs were co-dominant loci and 35 were dominant.

Table 3 Polymorphic ratio of the investigated markers

A genetic linkage map containing 470 SSR loci was constructed, and it covered a total length of 1877.3 cM of the cultivated peanut genome, with an average distance of 4.0 cM between flanking markers (Fig. 2; Table 4). All of the loci were assigned to 20 linkage groups (LG) that were designated as A1–A10 for the A subgenome and B1–B10 for the B subgenome by aligning common markers to previously published maps. Out of 470 mapped SSR loci, the Chi square (χ 2) analysis identified 133 loci (28.3 %) that significantly deviated from expected ratios of 1:2:1 or 3:1 (P < 0.05). The number of segregation-distorted loci varied for the different linkage groups, ranging from 1 locus (5.0 %) in B1 to 16 loci (61.5 %) in B2 (Fig. 2; Table S2).

Table 4 Description of the peanut genetic linkage map in this study
Fig. 2
figure 2

Genetic linkage map and QTL distribution of the agronomic and quality-related traits in an F 2:3 population derived from Zhonghua 10 × ICG12625. The suffix “asterisk” indicates loci with segregation distortion in F 2 generation, and the red loci are common loci for assigning linkage groups to defined chromosomes of cultivated peanut based on a comparison with the published map (color figure online)

Comparative analyses of the inter-genome and intra-genome

The syntenic comparative analysis of the present linkage map with the published tetraploid linkage map (Shirasawa et al. 2013) revealed distinct similarities. Of the 470 mapped SSRs in the present linkage map, 333 SSRs (70.9 %) were common loci with the published linkage map, ranging from 6 common loci for A2 to 28 common loci for A7 (Table 4). A small fraction of inverted segments, which were defined as consecutively inverted common loci, were observed in the comparison analysis, indicating that our linkage map is essentially co-linear to the published linkage map (Fig. 3; Table 4).

Analyzing the duplicated loci amplified from the same SSR primer pair is a direct method of determining the homology in allotetraploid genomes. We detected 26 homologous SSRs that amplified 52 duplicated loci. Among the 26 homologous SSRs, 4 singleton SSRs were detected where only one homologous locus was mapped on the linkage map, indicating that the detection of homologous loci was highly likely, whereas the detection of allelic polymorphisms was not as likely. Moreover, 17 homoeologus SSRs (65.4 %) amplified two duplicated loci for each SSR and were mapped onto the A subgenome and B subgenome (Fig. 3). For example, SSR primer Ah11 amplified two loci, Ah11-1 and Ah11-2, which were mapped on chromosomes B1 and A1, respectively. The SSR primer pair AHGS2130 amplified two loci, AHGS2130-1 and AHGS2130-2, which were mapped on chromosome B8 and A9, respectively. Additionally, only one SSR showed a potentially homologous relationship within subgenomes. The SSR primer ARS729 amplified two loci ARS729-1 and ARS729-2, which were both mapped on the chromosome A1, suggesting the two loci may be originated from a local duplication event during genome evolution of A. duranensis.

Fig. 3
figure 3

Comparison of the linkage map with the published linkage map for cultivated peanut (Shirasawa et al. 2013). The suffix “asterisk” indicates the linkage group on the published linkage map. Co-linear SSR loci between two linkage maps and homoeologous loci between the A and B genomes are indicated by black lines. Red dots highlights the homoeologous relationships between subgenomes (color figure online)

Detection of QTLs for agronomic and quality-related traits

Genome-wide QTL mapping was performed using the mean value of each trait across three environments (Fig. 2; Table 5). For the agronomic traits, a total of 24 QTLs were identified that explained 1.69–18.70 % of the phenotypic variance. Three QTLs for the height of main stem, qHMSA3, qHMSB4 and qHMSB7, were detected on A3, B4 and B7, and they explained 8.31, 6.12 and 8.90 % of the phenotypic variance, respectively. Two QTLs for total branching number, qTBNA1 and qTBNA8, were detected on A1 and A8, and they explained 7.46 and 6.11 % of the phenotypic variance, respectively. One QTL for pod length, qPLB9, was detected on B9, and it explained 11.23 % of the phenotypic variance. Two QTLs for pod width, qPWA5 and qPWB3, were detected on A5 and B3, and they explained 18.70 % and 2.11 % of the phenotypic variance, respectively. Three QTLs for seed length, qSLA3, qSLB2 and qSLB3, were detected, and explained 9.86–10.48 % of the phenotypic variance. Four QTLs for seed width, qSWA2.1, qSWA2.2, qSWA3 and qSWA7, were detected and explained 6.39–12.20 % of the phenotypic variance. Three QTLs for hundred-pod weight, qHPWA7.1, qHPWA7.2 and qHPWB3, were detected and explained 8.02–15.39 % of the phenotypic variance. Three QTLs for hundred-seed weight, qHSWA8, qHSWB2 and qHSWB3, were detected and explained 1.69–17.88 % of the phenotypic variance. Three QTLs for shelling percentage, qSPA5, qSPA7 and qSPA9, were detected and explained 2.00–11.78 % of the phenotypic variance.

Table 5 QTL for the agronomic and quality-related traits

For the quality-related traits, 12 QTLs were identified and explained 1.72–20.20 % of the phenotypic variance. One QTL for oil content, qOCB3, was detected on B3, and it explained 14.36 % of the phenotypic variance. One QTL for palmitic acid, qPAA8, was detected on A8, and it explained 17.02 % of the phenotypic variance. Two QTLs for stearic acid, qSAA4 and qSAA8, were detected on A4 and A8, and they explained 18.31 and 2.52 % of the phenotypic variance, respectively. One QTL for oleic acid, qOAA3, was detected on A3, and it explained 1.72 % of the phenotypic variance. Two QTLs for arachidic acid, qAAA4 and qAAB9, were detected on A4 and B9, and they explained 8.10 % and 20.20 % of the phenotypic variance, respectively. Three QTLs for eicosenoic acid, qEAA1.1, qEAA1.2 and qEAA5, were detected and explained 3.80–7.51 % of the phenotypic variance. Two QTLs for behenic acid, qBAB1 and qBAB9, were detected on B1 and B9, and they explained 15.76 and 18.85 % of the phenotype variance, respectively.

A PCA was performed to summarize the phenotypic variation of 18 agronomic and quality traits. Two principal components, PC1 and PC9, were significantly correlated with agronomic and quality traits, respectively (Table S3), and designated as the synthetic phenotypes for QTL mapping. Three QTLs for PC1, qPC1A7.1, qPC1A7.2 and qPC1A10, were identified, and they explained 1.20–13.86 % of the phenotypic variance. Two QTLs for PC9, qPC9A1.1 and qPC9A1.2, were identified, and they explained 5.89 % and 5.09 % of the phenotypic variance, respectively. Among these five QTLs, qPC1A7.1 was co-localized with qHPWA7.1 for hundred-pod weight and qPC1A7.2 was co-localized with qSWA7 for seed width and qHPWA7.2 for hundred-pod weight.

Genetic effect of QTLs for agronomic and quality traits

The F 2:3 population is an ideal population that can be used to designate QTL genetic effects as either additive or dominant. The degree of dominance may reveal the degree of heterosis for the QTL (Table 5; Fig. 4, S1). Out of 36 QTLs for agronomic and quality traits, four QTLs (11.1 %) exhibited no apparent dominance effect over the additive effect. Seven QTLs (19.4 %) had the mid-parent dominance effect. Twenty-five QTLs (69.4 %) had dominance degree (d) that was more than one, exhibiting over-parent heterosis for these QTLs. The results suggested that most of the QTLs for agronomic and quality traits are determined by the dominance effect, which indicates the potential of heterosis applications in peanut breeding.

Fig. 4
figure 4

Additive and dominance patterns of three QTLs exhibited differential contributions to phenotypic variance. a qHMSA3 additive QTL with a weak dominance effect relative to the additive effect, b qPWA5 dominance QTL with a dominance effect that is equivalent to the additive effect, c qPWB3 over-dominance QTL with a strong dominance effect relative to the additive effect

Discussion

A genetic linkage map is a prerequisite to efficiently identify molecular markers associated with agronomically important traits. Because of a lack of polymorphism at the DNA level, the first SSR-based genetic linkage map for cultivated peanut, which was constructed by Varshney et al. (2009) only had 135 SSR loci mapped onto 22 linkage groups. As more SSRs have been developed for peanut, more SSR-based maps have been constructed (Qin et al. 2012; Shirasawa et al. 2013, 2012; Sujay et al. 2012; Wang et al. 2012). In the present study, an F 2 mapping population was used to construct a linkage map containing 470 loci, that covered a total length of 1877.3 cM with an average distance of 4.0 cM between adjacent loci. The density of our map was relatively higher than that of other studies, except for the synthetic genetic map which integrated the marker information of 16 linkage maps (Shirasawa et al. 2013). The proportion of SSR loci (29 %) observed to be segregating with distortion was significantly less than previously reported (Moretzsohn et al. 2005; Shirasawa et al. 2013; Wang et al. 2012). Generally, segregation distortion is a result of chromosomal structural differences or the presence of transmission ratio distorter factors on a chromosome (Rieseberg et al. 1995). Therefore, the lower degree of segregation distortion in the present study likely implies a lower degree of diversity in the parental genotypes. Additionally, we observed a lower segregation distortion proportion (19 %) in the 144 RIL lines with the same set of markers (Table S2), indicating the following QTL analysis will be reliable although population size decreased with the F 2:3 population. Interestingly, we found 26 SSR primers for which two duplicated loci in the linkage map exhibited features of homoeologous and paralogues in tetraploid species (Ravi et al. 2011; Varshney et al. 2009). A comparative analysis revealed a small degree of inverted segments between the present linkage map and published integrated map and indicated a high degree of consistency between our linkage map and published maps. Therefore, the linkage map in our study is a beneficial tool that will aid in the assembly of whole genome sequences for tetraploid peanut.

The genetic bases of yield-related traits have been determined in many crop species using QTL mapping (Maccaferri et al. 2008; Xing and Zhang 2010), and several major QTLs have been isolated by map-based cloning (Bommert et al. 2013; Li et al. 2014). In this study, a total of 36 QTLs were detected for 18 agronomic and quality traits. Furthermore, two PCs correlated with agronomic and quality traits were treated as synthetic traits for QTL mapping (Table 5). As a result, five QTLs were detected for the PCs, two of which co-localized with qHPWA7.1 and qSWA7 for hundred-pod weight and seed width, respectively. This result indicates that the QTLs of hundred-pod weight and seed width are critically important for determining the phenotypes of agronomic traits in cultivated peanut. Several QTLs for multiple traits co-localized with each other (Table 5), indicating that pleiotropic QTLs or several linked QTLs may be involved in regulating these traits. For example, the QTL qHPWB3 was detected for hundred-pod weight at the genetic location 120.6–126.3 cM on linkage group B3. Another two QTLs, qHSWB3 and qOCB3, were detected for hundred-seed weight and oil content at the genetic locations of 120.5–126 and 125.3–126.7 cM on linkage group B3, respectively. The co-localization of QTLs for three traits also reflects the phenotypic correlation between these traits (Table 2). For these overlapping QTLs, the allele derived from Zhonghua 10 increased the value of hundred-pod weight and hundred-seed weight but decreased the value of oil content (Table 5). Thus, it is necessary to fine-map these overlapping QTLs to determine causative genes for the overlapped QTLs caused by several linkage QTLs/genes. By utilizing PCR-based molecular markers within or closely linked to the target gene, the potential for effectively pyramiding favorable alleles for multiple traits into single varieties of peanut in selective breeding can be achieved. Furthermore, the majority of QTLs exhibited dominance or over-dominance effects on agronomic and quality traits; thus, determining the heterosis of key agricultural traits should be prioritized in efforts to enhance the elite peanut varieties.

Author contribution statement

LH and HFJ designed the experiment. LH and XPR established the mapping population. HYH and WGC performed SSR genotyping. LH, HYH, WGC, YLC, XJZ, YLX and XLW performed the agronomic traits measurements of the plant materials. XPR performed the composition of fatty acid measurements of the plant materials. XGJ performed the oil content measurements of the plant materials. LH constructed the genetic linkage mapping, performed the QTL analysis and wrote the manuscript. LH, BSL and HFJ revised the manuscript. All of the authors read and approved the final manuscript.