An intra-specific consensus genetic map of pigeonpea [Cajanus cajan (L.) Millspaugh] derived from six mapping populations

Abstract

Pigeonpea (Cajanus cajan L.) is an important food legume crop of rainfed agriculture. Owing to exposure of the crop to a number of biotic and abiotic stresses, the crop productivity has remained stagnant for almost last five decades at ca. 750 kg/ha. The availability of a cytoplasmic male sterility (CMS) system has facilitated the development and release of hybrids which are expected to enhance the productivity of pigeonpea. Recent advances in genomics and molecular breeding such as marker-assisted selection (MAS) offer the possibility to accelerate hybrid breeding. Molecular markers and genetic maps are pre-requisites for deploying MAS in breeding. However, in the case of pigeonpea, only one inter- and two intra-specific genetic maps are available so far. Here, four new intra-specific genetic maps comprising 59–140 simple sequence repeat (SSR) loci with map lengths ranging from 586.9 to 881.6 cM have been constructed. Using these four genetic maps together with two recently published intra-specific genetic maps, a consensus map was constructed, comprising of 339 SSR loci spanning a distance of 1,059 cM. Furthermore, quantitative trait loci (QTL) analysis for fertility restoration (Rf) conducted in three mapping populations identified four major QTLs explaining phenotypic variances up to 24 %. To the best of our knowledge, this is the first report on construction of a consensus genetic map in pigeonpea and on the identification of QTLs for fertility restoration. The developed consensus genetic map should serve as a reference for developing new genetic maps as well as correlating with the physical map in pigeonpea to be developed in near future. The availability of more informative markers in the bins harbouring QTLs for sterility mosaic disease (SMD) and Rf will facilitate the selection of the most suitable markers for genetic analysis and molecular breeding applications in pigeonpea.

Introduction

Pigeonpea [Cajanus cajan (L.) Millspaugh] is the fifth most important pulse crop in the world and represents an important component of semi-arid and sub-tropical farming systems (Shanower et al. 1999). Pigeonpea is a diploid species (2n = 2x = 22) and its genome comprises of 833.1 Mbp arranged into 11 pairs of chromosomes (see Varshney et al. 2012). Globally, it is cultivated in 4.6 Mha with a production of 3.49 Mt. Nearly 70 % of the pigeonpea production and 74 % of the pigeonpea area is in India. Pigeonpea is a hardy and drought tolerant crop assuring sustainable returns from marginal lands with minimal inputs, hence it is considered as a very suitable crop for subsistence agriculture. Pigeonpea seeds contain about 20–24 % protein and reasonable amounts of essential amino acids making it an important source of dietary protein, mainly in vegetarian-based diets.

Pigeonpea production has shown an increasing trend in worldwide harvested area from 2.7 Mha (1961) to 4.6 Mha (2009) (FAO 2009, http://faostat.fao.org/). However, no increase has been observed in its productivity, which in the past five decades remained stagnated at around 750 kg/ha. To overcome the existing yield barriers, cytoplasmic male-sterility (CMS)-based hybrid technology has been developed in pigeonpea (Saxena et al. 2010a). For instance, recently the ICPH 2671 hybrid developed using A4 cytoplasm has been released for commercial cultivation in India. The availability of a CMS system circumvents the need for manual emasculation and crossing, which is more suitable for commercial hybrid seed production. However, identification of a good restorer is cumbersome and time consuming as it requires extensive field evaluation.

Molecular breeding seems to be the next step for genetic improvement in pigeonpea. Molecular tools, such as DNA markers and genetic maps are essential prerequisites for undertaking any molecular breeding programme. Using these tools, QTLs or genes for traits of interest are identified and the markers linked with the QTLs/genes can be used to select the superior progenies in breeding programme. Among various kinds of markers systems available, simple sequence repeat (SSR) is preferred as the marker of choice for the plant breeding and genetics community (Gupta and Varshney 2000) and have been used successfully for genetic mapping and tagging of many agronomically important traits in several crop species. Advances in genomics, next generation sequencing (NGS) technologies and high-throughput (HTP) genotyping facilities, have provided automation-driven marker systems, such as single nucleotide polymorphism (SNP) markers. However, in the case of orphan legumes, such as pigeonpea, efforts are still underway to exploit the full potential of these technologies (Varshney et al. 2010a), while SSR markers have already proven of widespread value in molecular studies.

The low level of genetic diversity and less availability of DNA markers have hindered progress of development of saturated genetic maps in pigeonpea. Despite this, an SSR based genetic map derived from an inter-specific cross (Cajanus cajan × C. scarabaeoides) with moderate marker density has been reported in pigeonpea (Bohra et al. 2011). However, the genetic maps developed for cultivated pigeonpea so far (Gnanesh et al. 2011), are still suffering from the problem of poor map resolution due to the low polymorphism available between parental lines. For instance, the recently developed individual intra-specific genetic maps derived from the F2 populations viz. ICP 8863 × ICPL 20097 and TTB 7 × ICP 7035 have 120 and 78 SSR loci, respectively.

Considering the above, the construction of an integrated genetic map for cultivated pigeonpea offers a viable alternative to address the problem of low polymorphism through providing better genome coverage in comparison to population specific genetic maps. Apart from this, an integrated genetic map provides an excellent platform to target several important traits since individual mapping populations, may not segregate for many traits.

In this study, we report development of four genetic maps based on intra-specific F2 populations, of which three populations were segregating for fertility restoration. Subsequently, the first consensus genetic map after merging six SSR-based genetic maps has been developed. In addition, an attempt has been made to identify the genomic regions or QTLs associated with fertility restoration from three different genetic backgrounds.

Materials and methods

Mapping populations and DNA extraction

Four F2 mapping populations: ICPB 2049 × ICPL 99050, ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671 comprising of 188 individuals each, were used for construction of genetic maps. Phenotyping and QTL analysis for fertility restoration was done for the last three populations. The A-lines viz. ICPA 2039 and ICPA 2043, used in the three crosses were alloplasmic CMS lines based on A4 cytoplasm derived from wild progenitor C. cajanifolius (Saxena et al. 2010a). Genomic DNA from mapping parents and populations was isolated from leaf tissue and purified following Cuc et al. (2008).

Phenotyping of mapping populations for pollen fertility

For assessing pollen fertility, 10 fully grown but un-opened floral buds were collected from different parts of the plants between 9 and 11 a.m. to prepare microscope glass slides for examination. Anthers from the sampled flowers were removed and squashed in 1 % aceto-carmine solution. In each glass slide, three different microscopic fields were studied under light microscope. The pollen grains were considered fertile if they were stained with dye and sterile if they were not stained (Gulyas et al. 2006). Within each population, discrimination among the plants for male-fertility restorers and non-restorers was done on the basis of their pollen fertility data. Plants with ≥80 % stained pollen grains were classified as male-fertile; while those with ≤10 % pollen fertility were identified as male-sterile.

PCR and SSR analysis

Markers polymorphic between the parental lines as identified in Bohra et al. (2011) were used for genotyping the respective mapping population. Polymerase chain reactions (PCRs) for amplification of SSR loci were performed in a 384-well micro titre plate (ABgene, Rockford, IL, USA) using thermal cycler GeneAmp PCR System 9700 (Applied Biosystems, Foster City, CA, USA). The reaction volume consisted of 5 μl containing 0.5 μl of 10 × PCR buffer (SibEnzyme, Novosibirsk, Russia), 1.0 μl of 15 mM MgCl2, 0.25 μl of 2 mM dNTPs, 0.50 μl of 2 pmol/μl primer anchored with M13-tail (MWG-Biotech AG, Bangalore, India), 0.1 U of Taq polymerase (SibEnzyme, Novosibirsk, Russia) and 1.0 μl (5 ng/μl) of template DNA. A touch down PCR programme was used to amplify the DNA fragments: initial denaturation was for 5 min at 95 °C followed by five cycles of denaturation for 20 s at 94 °C, annealing for 20 s at 60 °C (the annealing temperature for each cycle being reduced by 1 °C per cycle) and extension for 30 s at 72 °C. Subsequently, 35 cycles of denaturation at 94 °C for 20 s followed by annealing for 20 s at 56 °C and extension for 30 s at 72 °C and 20 min of final extension at 72 °C. The PCR products were checked for amplification on 1.2 % agarose gel. Amplified products were separated on capillary electrophoresis using ABI 3730 (Applied Biosystems, Foster City, CA, USA) and allele calling was performed using GeneMapper software version 4.0 (Applied Biosystems, Foster City, CA, USA).

Construction of component genetic maps

Genotype data were assembled for all segregating makers on all 188 F2 individuals from four mapping populations and linkage analysis was performed using JoinMap version 3.0 using “Regression mapping algorithm” (Van Ooijen and Voorrips 2001). Before linkage analysis, marker segregations in all populations were subjected to goodness of fit test to assess deviations from the expected Mendelian segregation ratio of 1:2:1 at 5 % level of significance. “Locus genotype frequency” function was used to calculate the χ2 values for all the markers. Map calculations were performed with parameters like LOD value ≥3.0, recombination frequency ≤0.40 and a χ2 jump threshold for removal of loci = 5. Addition of a new locus may influence the optimum map order; hence, a “Ripple” was performed after adding each marker into the map. Map distances were calculated using Kosambi mapping function (Kosambi 1944) and a third round was set to allow mapping of optimum number of loci in the genetic map. Placement of markers into different linkage groups (LGs) was done with “LOD groupings” and “Create group using the mapping tree” commands. Mean χ2 contributions or average contributions to the goodness of fit of each locus were also checked to determine the best fitting position for markers in genetic maps. The markers showing negative map distances or a large jump in mean χ2 values were discarded. Final maps were drawn with the help of MapChart version 2.2 (Voorrips 2002).

Construction of consensus genetic map

Genotype data for six F2 mapping populations including four mapping populations in this study and two mapping populations reported earlier (Gnanesh et al. 2011) were used for developing a consensus genetic map using software JoinMap version 3.0. In this approach, segregation data from all mapping populations on all or some individuals are used to achieve a consensus order of loci to be used to develop the synthetic or integrated map (Wenzl et al. 2006). Map integration was accomplished by following three steps (Truco et al. 2007):

  1. 1.

    A priori identification of common loci among different mapping populations was carried out and their relative positions in different genetic maps were used to derive a consensus or framework order.

  2. 2.

    Finally “Combine groups for map integration” function from the “Join” menu was applied to synthesize an integrated LG.

  3. 3.

    The framework order of common markers obtained from step (1) was kept as fixed for map calculations of integrated LG using “Fixed order” command.

Problematic anchor loci in framework order, identified on the basis of mean χ2 statistics, were taken out from fixed order. To assess the amount of co-linearity in marker orders between consensus and component genetic maps, correlation coefficients (r) were calculated from marker positions in consensus and individual genetic map and their significance was tested. All the developed genetic maps were aligned together using a comparative mapping programme CMap version 1.01 to visually assess the congruency of marker orders.

QTL analysis for fertility restoration

QTL analysis of fertility restoration from three mapping populations (ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671) were undertaken employing composite interval mapping (CIM) in the WinQTL Cartographer version 2.5 (Wang et al. 2007). CIM analysis was performed applying the Standard Model 6, with a genome scan interval (walk speed) of 1 cM. The “forward–backward stepwise regression” was used to set number of marker cofactors as background control. A window size of 10 cM was used to block out signals within 10 cM on either side of the flanking markers or QTL test site. Thresholds were determined by permutation tests using 1,000 permutations and a significance level of 0.05.

Results

Marker genotyping and segregation

Screening of 3,072 SSR markers on 22 parental genotypes of 13 mapping populations provided a set of 842 polymorphic markers which consisted of markers exhibiting polymorphism at least within one parental combination (Bohra et al. 2011). Based on the marker polymorphism data, a genetic map based on the inter-specific mapping population (C. cajan ICP 28 × C. scarabaeoides ICPW 94) with 239 SSR loci (Bohra et al. 2011) and two genetic maps based on the intra-specific mapping populations that segregate for sterility mosaic disease (SMD) viz. ICP 8863 × ICPL 20097 and TTB 7 × ICP 7035, with 120 and 78 SSR loci, respectively, were developed (Gnanesh et al. 2011). Genotyping of four new intra-specific mapping populations: ICPB 2049 × ICPL 99050, ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671 was done in this study using polymorphic SSR markers identified by Bohra et al. (ESM Table 1). These mapping populations segregate for different traits, such as Fusarium wilt (FW) (ICPB 2049 × ICPL 99050) and fertility restoration (Rf) (ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671) (Varshney et al. 2010b).

In summary, segregation data were assembled for 104, 83, 166 and 145 polymorphic markers on populations ICPB 2049 × ICPL 99050, ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671, respectively. Marker segregation data from each population was subjected to goodness of fit tests to assess the deviation from expected Mendelian ratio of 1:2:1 at the threshold of p = 0.05 (ESM Table 1, ESM Fig. 1).

Component or individual genetic maps

Genotype data generated for all four intra-specific mapping populations were used to develop the components genetic maps for individual mapping populations. Two intra-specific genetic maps, reported earlier (Gnanesh et al. 2011), were also included for further analysis in this study. The percentage of markers, showing significant deviation from expected 1:2:1 ratio varied from 4.2 % (ICP 8863 × ICPL 20097) to 29.3 % (ICPA 2043 × ICPR 3467) (Table 1) in different populations. These distorted loci were scattered on all LGs, but LG02, LG03 and LG04 exhibited a higher proportion of distorted loci as compared to other LGs (ESM Fig. 1).

Table 1 Features of component genetic maps

In summary, the number of mapped loci across all the six intra-specific genetic maps ranged from 59 (ICPB 2049 × ICPL 99050) to 140 (ICPA 2043 × ICPR 3467) (Table 1). In all the genetic maps, 11 linkage groups (LGs) were obtained except for population ICPB 2049 × ICPL 99050 with 12 LGs. The maximum map length was shown by ICPA 2043 × ICPR 3467 (881.6 cM) genetic map while minimum of 467 cM was observed for TTB 7 × ICP 7035. Average inter-marker distance varied from 4.5 cM (ICP 8863 × ICPL 20097) to 9.9 cM (ICPB 2049 × ICPL 99050) (http://www.cmap.icrisat.ac.in/cmap/sm/pp/bohra/).

The consensus genetic map

The availability of a sufficient number of common markers on six intra-specific genetic maps facilitated the merging of six maps into one consensus map. While integrating different genetic maps, the nomenclature of common markers present on component genetic maps is crucial (Varshney et al. 2007). In the present study, however, there was no discrepancy in names of common markers, since 98.8 % of the markers used for linkage analysis came from the same source, i.e. BAC-end derived SSRs and designated as Cajanus cajan microsatellite (CcM) markers. Segregation data for 348 markers obtained on 6 different mapping populations was used for merging multiple genetic maps. Although 203 markers were unique to individual genetic maps, 145 markers were common among two (80 markers), three (43 markers), four (16 markers) and five (6 markers) mapping populations that served as anchor points for map integration (Table 2). Most of the LGs of component populations were successfully integrated into the consensus map. Details of the consensus map and markers contributed from different component genetic maps have been given in Table 3.

Table 2 Number of common markers among different component mapping populations
Table 3 Summary of consensus genetic map

All the common markers collectively led to the synthesis of a consensus map comprising 339 loci on 11 LGs and covering a map distance of 1,059 cM (Fig. 1; Table 3). In the consensus map, a total of 147 (43.4 %) markers were anchor markers and the percentage of these markers varied from 31.0 % (LG03) to 54 % (LG11) across different LGs. The remaining 192 (56.6 %) markers in the consensus map were unique to individual mapping populations. It is important to note that four markers namely CcM0492 (mapped on LG02 and LG09), CcM1110 (mapped on LG02 and LG05), CcM2379 (mapped on LG03 and LG08) and CcM2505 (mapped on LG01 and LG11) were mapped on different LGs in different crosses. Two of the anchor markers couldn’t integrate into consensus map and another four were mapped at two different loci hence the total number becomes 147 instead of 145.

Fig. 1
figure1

A consensus genetic map comprising 339 loci. Markers are shown on right side of the LG while map distances are indicated on left side. Each LG is divided into several bins based on 10-cM interval. The markers unique to mapping populations, common between two, three, four and five mapping populations have been shown by green, red, brown, blue and black colour, respectively. QTLs are indicated by bars with different colours. Blue, green, pink, white and yellow coloured bars were used to show the QTLs derived from populations TTB 7 × ICP 7035, ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 2671, ICPA 2043 × ICPR 3467 and ICP 8863 × ICPL 20097, respectively

The number of markers per linkage group on the consensus map varied form 11 (LG11) to 50 (LG06). The LG02 exhibited maximum map length of 135.2 cM while minimum map length (57.5 cM) was observed for the LG08. The average inter-marker distance ranged from 1.6 cM (LG06) to 11.2 cM (LG11) with an average of 3.1 cM. Non-uniform distribution of markers was evident in all LGs. Visual inspection of the consensus map resulted in identification of only 15 major gaps (> 10 cM) across all the LGs except for LG04 which did not show any major gap. The largest gap between two loci was found to be 35.8 cM between markers CcM0112 (at 0 cM) and CcM1045 (at 37.8 cM) on LG10 followed by 33.5 cM between CcM0834 (at 89.6 cM) and CcM2505 (at 123.1 cM) on LG11 (Fig. 1, ESM Fig. 1, http://www.cmap.icrisat.ac.in/cmap/sm/pp/bohra/).

In terms of SSR motifs, the majority (55.45 %) of the markers integrated into the consensus map, belonged to the di-nucleotide repeat category followed by compound type SSRs (28.90 %) (ESM Table 2). The lowest representation was from tetra and hexa-nucleotide repeat classes. More than 58 % of the markers in the consensus map exhibited polymorphism information content (PIC) values greater than 0.5, with 28 % having PIC values greater than 0.75. Average PIC value of individual LGs varied from 0.64 (LG04) to 0.72 (LG11) while average number of alleles ranged form 6.5 (LG08) to 7.2 (LG10). The consensus map was divided into several bins of 10 cM each to aid future genetic mapping and diversity analysis (Fig. 1, ESM Fig. 1). As expected, the SSR markers present in each bin have varied PIC values (ESM Table 2). Now the community can select the highly informative SSR markers from each bin that will best represent the genome in the germplasm to be analyzed.

With the objective to make the consensus map more informative, QTLs for fertility restoration identified in this study and for SMD resistance based on two mapping populations (TTB 7 × ICP 7035 and ICP 8863 × ICPL 20097) identified by Gnanesh and colleagues, were placed on the consensus map (Fig. 1). Placement of all these QTLs into a single genetic map will facilitate the adoption of the identified QTLs for SMD resistance and fertility restoration in pigeonpea breeding. For instance, a QTL associated with SMD resistance namely qSMD3, bracketed by markers CcM2149 (PIC value: 0.73) and CcM0468 (PIC value: 0.67), was identified on LG02 from one of the component population TTB 7 × ICP 7035 (Gnanesh et al. 2011). In the consensus map, five additional markers namely CcM0494, CcM0183, CcM1110, CcM0477 and CcM1238 were integrated into this QTL region. Among these new markers, CcM0494 and CcM0183 with PIC values of 0.86 and 0.78 respectively as compared to CcM2149 and CcM0468 identified originally, will be more valuable while screening the germplasm for resistance to SMD. Similarly, localization of all the three RF-QTL regions, identified on LG06, into a single genetic map provided a common region i.e. marker interval CcM2842–CcM1506, that may be associated with fertility restoration in all three genetic backgrounds.

Comparison of consensus map and component maps

Nomenclature of LGs in the consensus as well as in component genetic maps were given according to the reference genetic map of pigeonpea derived from an inter-specific F2 (ICP 28 × ICPW 94) population. Detailed comparison of the consensus map and population-specific genetic maps has revealed a very high degree of conservation in marker orders and marker groupings. For instance, a high degree of correlation (correlation coefficients varying from 0.64 to 0.99) was observed for all the LGs between consensus and population specific LGs. The highest amount of co-linearity with the consensus map was exhibited by the ICP 8863 × ICPL 20097 genetic map, which consistently showed correlation coefficients of 0.99 for the nine linkage groups merged into consensus map. Highly significant values of correlation coefficients showed a good agreement of both marker orders and markers positions or inter-marker distances between consensus and component genetic maps (Fig. 2). As an example, comparison of LG06 for all the maps using CMap version 1.01 has been shown in Fig. 3. A detailed comparison of all linkage groups across all the maps has been shown in ESM Fig. 2. CMap helps in assessing the congruency of marker positions and orders by making a pairwise comparison between different genetic maps. Considering only the common loci existing among various genetic maps, highly conserved marker orders were manifested.

Fig. 2
figure2

Scatter plots showing the extent of correlations among consensus genetic map and population-specific genetic maps. The marker integrated from different populations viz. ICP 8863 × ICPL 20097, ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 2671, ICPA 2043 × ICPR 3467, TTB 7 × ICP 7035 and ICPB 2049 × ICPL 99050 are shown by red triangles, pink triangles, purple squares, blue diamonds, light-green diamonds and yellow circles, respectively

Fig. 3
figure3

This depicts the marker-based correspondences for LG06, among consensus and individual genetic maps. Only common markers i.e. landmarks are included to visually asses the co-linearity of marker orders and marker positions. LGs are aligned together using comparative mapping programme CMap version 1.01. Figure can also be found at http://www.cmap.icrisat.ac.in/cmap/sm/pp/bohra/

Comparison of consensus map with the inter-specific genetic map

With the objective of assessing the consistency of marker orders and possible rearrangements between the intra-specific and inter-specific genetic maps, the consensus map (339 SSR loci) developed in this study was compared with a reference genetic map (ICP 28 × ICPW 94) developed by Bohra and colleagues (Fig. 4). Between these maps, a total of 38 markers were common and scattered on all 11 linkage groups. Out of these 38 common markers, six markers; namely, CcM2911, CcM0417, CcM0392, CcM1781, CcM0603 and CcM0752 had different positions. Three of these makers had significant segregation distortion (CcM2911: χ2 = 17.3, CcM0417: χ2 = 17.4. and CcM0392: χ2 = 78.5) in the inter-specific cross. Each of the six markers was mapped only in one of the six intra-specific mapping populations and therefore these markers were not used as anchor markers. Nevertheless, these markers were included in the consensus genetic map. However, some inconsistency was observed in the genetic mapping positions for these markers between the consensus map and the inter-specific genetic map that may be the result of mapping of two different loci/fragments in the inter-specific and intra-specific mapping populations.

Fig. 4
figure4

Comparison of marker order between the consensus and inter-specific genetic map based on ICP 28 × ICPW 94 mapping population. Consensus LGs are on left side while inter-specific LGs are on right side. Common loci are indicated by red colour, while unique loci are shown by blue colour

The remaining 32 markers were mapped to the same position on the LGs in both consensus map and inter-specific genetic maps. Marker positions were found to be fairly concurrent between these two genetic maps. Although five markers (CcM1232, CcM1647, CcM2855, CcM2639 and CcM0257) showed slight difference in their position along LG, most of these were consecutive pairs, so still found on the same genomic regions.

Phenotyping and QTLs for fertility restoration

Three of the mapping populations used in this study segregate for fertility restoration (ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPA 2043 × ICPR 2671) and were phenotyped for fertility restoration. The cross ICPA 2039 × ICPR 2447 belonged to the early maturing category while the latter two crosses were from the late maturing category. In all the crosses, fully fertile F1s with good pollen load were recovered indicating dominant nature of loci involved in fertility restoration. In the F2, the phenotypic segregation for fertility restoration was observed and data were recorded on 188 individuals of each the three crosses (Table 4).

Table 4 Descriptive statistics of phenotyping data on fertility restoration

QTL mapping for fertility restoration was done based on arc sine transformed values of mean phenotypic data of  percentage pollen fertility and genetic mapping data using CIM approach. CIM analysis revealed occurrence of a total of four major QTLs for fertility restoration across three different pedigrees (Table 5). These QTLs were designated as QTL-RF-1 to QTL-RF-4. Of the total QTLs identified, two QTLs namely QTL-RF-1 (flanked by CcM1821 and CcM1522) and QTL-RF-2 (flanked by CcM0047 and CcM2332) explaining 14.85 %, and 15.84 % of the PV respectively, were identified in ICPA 2039 × ICPR 2447 population. Similarly one major QTL viz. QTL-RF-3 (bracketed in CcM1277-CcM2542) explaining 20.89 % of PV was recovered from population ICPA 2043 × ICPR 2671. QTL analysis conducted on population I CPA 2043 × ICPR 3467 identified a single major QTL named as QTL-RF-4 (bracketed in CcM0374–CcM1506 region). This QTL contributing up to 24.17 % of PV was identified at a LOD value of 8.9. In terms of localization of RF-QTLs in linkage groups, the LG06 contained three QTLs (QTL-RF-1, QTL-RF-3 and QTL-RF-4) while the remaining single QTL viz. QTL-RF-2 was located on the LG11.

Table 5 Identification of QTLs for fertility restoration using CIM analysis

Discussion

Molecular markers and genetic maps are prerequisites for undertaking trait mapping and molecular breeding in any crop species. While significant progress has been made in cereals (Varshney et al. 2005) and a few legume species (Varshney et al. 2010c), in the case of pigeonpea, because of its narrow genetic base, together with the paucity of molecular markers and mapping populations, the crop did not have a genetic map until 2010 (Varshney et al. 2010b). Only recently, a set of 3,200 SSR markers and an inter-specific reference genetic map have become available (Bohra et al. 2011). However, as for breeding applications, intra-specific genetic maps are more useful, only two intra-specific genetic maps with few QTLs for SMD have been reported so far (Gnanesh et al. 2011). The present study focuses on construction of four genetic maps based on intra-specific mapping populations of which three populations segregate for fertility restoration. These maps contain only 78 (ICPA 2039 × ICPR 2447) to 140 (ICPA 2043 × ICPR 3467) SSR loci even after scanning 3,200 SSR markers on the parental genotypes of the mapping populations. This low level of polymorphism and the low-density genetic maps have been reported earlier and the intra-specific genetic maps contained 78 (TTB 7 × ICP 7035) and 120 (ICP 8863 × ICPL 20097) SSR loci respectively (Gnanesh et al. 2011).

Segregation distortion was observed in all the six intra-specific crosses with varying degree of deviation. Segregation distortion is a common phenomenon observed in intra as well as in inter-specific crosses, however the extent is more in case of inter-specific crosses. For instance, percentage of distorted markers ranged from 3.49 % (ICP 8863 × ICPL 20097) to 37.50 % (ICPB 2049 × ICPL 99050) in intra-specific crosses, about 63.5 % SSR showed segregation distortion in inter-specific cross (Bohra et al. 2011). Similar instances of segregation distortion were also reported for Medicago (Jenczewski et al. 1997), chickpea (Gaur et al. 2011) and mungbean (Lambrides et al. 2000). Some of the regions on LG02, LG03 and LG04 (in the crosses ICPA 2039 × ICPR 2447, ICPA 2043 × ICPR 3467 and ICPB 2049 × ICPL 99050) can be considered as “segments associated with skewed segregation” because these regions harboured four or more closely linked markers showing significant and consistent deviation from expected F2 ratio of 1:2:1 (Xu et al. 1997; Marcel et al. 2007). Segregation distortion may result from various factors such as residual heterozygosity, gametic or zygotic selections and genotyping errors (Liang et al. 2006).

The prime objective of this study was to construct a high density integrated genetic map from different pedigrees with highly conserved marker orders that can be used as reference genetic map for cultivated crosses. As a result, we present the first integrated genetic map for cultivated pigeonpea that may be regarded as a “consensus map” as suggested by Isobe et al. (2009). The good agreement of marker orders as well as inter-marker distances observed among different component genetic maps may be due to (1) fairly similar population size (~188), (2) type of mapping populations (all F2s) and (3) type of marker system (co-dominant), taken into consideration for linkage analysis. Such consensus maps were developed earlier in many plant species like wheat (Somers et al. 2004), barley (Varshney et al. 2007; Marcel et al. 2007), red clover (Isobe et al. 2009), sorghum (Mace et al. 2009), soybean (Hyten et al. 2010), groundnut (Hong et al. 2010) and chickpea (Radhika et al. 2007; Millan et al. 2010). Consensus genetic maps, consolidating genetic information contained in different genetic backgrounds, offer a valuable resource for genetic analysis and breeding.

The average marker density (3.1 cM) in the consensus map is higher than recorded for inter-specific genetic map (3.8 cM) (t = 2.1 and p = 0.03) (Bohra et al. 2011). However, the slight difference in marker order relative to inter-specific genetic map may be accounted to genotyping errors. Secondly, all of these markers are located on the same genomic regions and flipping is a common phenomenon for closely spaced markers (Feltus et al. 2006; Wu and Huang 2006) which may be accounted to genotyping imprecision rather than real rearrangements (Lombard and Delourme 2001). Similar findings were also observed by Winter et al. (1999) and Millan et al. (2010) while comparing intra- and inter-specific genetic maps in chickpea. Poor correlation observed between length of LGs and number of markers/LG in consensus genetic map suggested non-uniform distribution of markers along LGs. This non-uniform distribution is mainly because of the gaps existing in distal ends of LGs which may be due to deficiency of markers in these regions (Sewell et al. 1999).

Most of the markers integrated into the consensus map were highly informative since more than 50 % of the markers exhibited PIC values greater than 0.50. Similarly, the average number of alleles (6.27) and average PIC value (0.67) of all mapped markers were higher than reported earlier (Burns et al. 2001; Odeny et al. 2007; Saxena et al. 2010b). The bin-wise information on PIC values provided for all integrated markers will help geneticists and breeders to select a good set of markers that will represent the genome as well as display high degree of polymorphism and such a set of markers will be very useful for developing new genetic maps, trait mapping and diversity analysis.

Marker-trait association analysis in three mapping populations provided the candidate molecular markers and QTLs for fertility restoration in hybrid breeding of pigeonpea. All four QTLs detected for fertility restoration contributed more than 10 % of phenotypic variation and these QTLs, therefore, can be considered as QTLs playing major roles in restoring fertility in A4 cytoplasm in pigeonpea. The fertility restoration has been subjected to QTL analyses in F2 population of several other crop species where CMS systems are well established such as wheat (Zhou et al. 2005), rice (Tan et al. 1998), pepper (Wang et al. 2004) etc. These studies reported existence of large effect QTLs governing major proportions of the phenotypic variation. However, presence of minor QTLs/genes was also observed which can act as modifiers in restoring the fertility and hence increasing complexity in fertility restoration phenomenon.

Moreover, the QTL region flanked by the markers CcM1506 and CcM2542 were found in two different genetic backgrounds. This indicates the utility of these common markers and consistent QTLs for hybrid breeding in pigeonpea. It is interesting to note that majority of the QTLs identified were located on the LG06 in all the three mapping populations indicating the underlying importance of the LG06. This is the first study on the identification of QTLs for fertility restoration in pigeonpea. Identification of SSR markers tightly linked with fertility restoration will assist pigeonpea breeders in quick discrimination between maintainer (B-lines) and restorer lines (R-lines). Since the absence of fertility restorer in B- line is an essential prerequisite for maintenance of sterile lines (A-lines). Furthermore, recovery of a potential restorer for CMS based hybrid development is very labour intensive and cumbersome procedure as it requires extensive test crossing and field screening to assess the level of fertility restoration through various A × R combinations (Yue et al. 2010). Furthermore, identification of good R-lines cannot be done before onset of flowering in A × R progenies. Hence, SSR marker would facilitate not only rapid selection of restorer lines but also ensure precise introgression of fertility restorer loci into elite pigeonpea breeding lines. Apart from QTLs governing fertility restoration, QTLs imparting SMD resistance were also placed in the consensus genetic map which allowed integration of more informative markers into QTL harbouring regions. Inclusion of additional markers in the QTL regions of the consensus genetic map provides an opportunity for selecting reliable markers from the region together with allowing comparison of the region of interest in different pedigrees.

In summary, four new intra-specific genetic maps have been constructed based on BAC end sequence (BES) derived SSR markers. All these genetic maps together with the two intra-specific genetic maps reported in earlier study, allowed development of a consensus genetic map comprising 339 loci with an average marker density of 3.1 cM. This is the first instance of integrating multiple component genetic maps in pigeonpea. Furthermore, grouping of markers into bins and associating them with PIC values on the integrated genetic map will facilitate the selection of evenly distributed markers for various genetics and breeding studies including genetic mapping (for new populations), association or linkage disequilibrium (LD) studies, diversity analysis, or for practicing background selection in molecular breeding studies aimed at crop improvement in pigeonpea. In parallel, QTL analysis performed on fertility restoration data, detected a total of four major QTLs, representing this study as a pioneering step towards molecular dissection of fertility restoration in pigeonpea. The identification of major RF-QTLs would open new avenues for genomics-assisted hybrid breeding in pigeonpea.

References

  1. Bohra A, Dubey A, Saxena RK, Penmetsa RV, Poornima KN, Kumar N, Farmer AD, Srivani G, Upadhyaya HD, Gothalwal R, Ramesh R, Singh D, Saxena KB, Kavi Kishor PB, Singh NK, Town CD, May GD, Cook DR, Varshney RK (2011) Analysis of BAC-end sequences (BESs) and development of BES-SSR markers for genetic mapping and hybrid purity assessment in pigeonpea (Cajanus spp.). BMC Plant Biol 11:56

    Google Scholar 

  2. Burns MJ, Edwards KJ, Newbury HJ, Ford-Lloyd BR, Baggot CD (2001) Development of simple sequence repeat (SSR) markers for the assessment of gene flow and genetic diversity in pigeonpea (Cajanus cajan). Mol Ecol Notes 1:283–285

    Article  CAS  Google Scholar 

  3. Cuc LM, Mace ES, Crouch JH, Quang VD, Long TD, Varshney RK (2008) Isolation and characterization of novel microsatellite markers and their application for diversity assessment in cultivated groundnut (Arachis hypogaea). BMC Plant Biol 8:55

    PubMed  Article  Google Scholar 

  4. Feltus FA, Hart GE, Schertz KF, Casa AM, Kresovich S, Abraham S, Klein PE, Brown PJ, Paterson AH (2006) Alignment of genetic maps and QTLs between inter- and intraspecific sorghum populations. Theor Appl Genet 112:1295–1305

    PubMed  Article  CAS  Google Scholar 

  5. Gaur R, Sethy NK, Choudhary S, Shokeen B, Gupta V, Bhatia S (2011) Advancing the STMS genomic resources for defining new locations on the intraspecific genetic linkage map of chickpea (Cicer arietinum L.). BMC Genomics 12:117

    PubMed  Article  CAS  Google Scholar 

  6. Gnanesh BN, Bohra A, Sharma M, Byregowda M, Pande S, Wesley V, Saxena RK, Saxena KB, Kavi Kishor PB, Varshney RK (2011) Genetic mapping and quantitative trait locus analysis of resistance to sterility mosaic disease in pigeonpea [Cajanus cajan (L.) Millsp.]. Field Crops Res 123:56–61

    Article  Google Scholar 

  7. Gulyas G, Pakozdi K, Lee JS, Hirata Y (2006) Analysis of fertility restoration by using cytoplasmic male-sterile red pepper (Capsicum annuum L.) lines. Breed Sci 56:331–334

    Article  Google Scholar 

  8. Gupta PK, Varshney RK (2000) The development and use of microsatellite markers for genetic analysis and plant breeding with emphasis on bread wheat. Euphytica 113:163–185

    Article  CAS  Google Scholar 

  9. Hong Y, Chen X, Liang X, Liu H, Zhou G, Li S, Wen S, Holbrook CC, Guo B (2010) A SSR-based composite genetic linkage map for the cultivated peanut (Arachis hypogaea L.) genome. BMC Plant Biol 10:17

    PubMed  Article  Google Scholar 

  10. Hyten DL, Choi IY, Song Q, Specht JE, Carter TE, Shoemaker RC Jr, Hwang EY, Matukumalli LK, Cregan PB (2010) A high density integrated genetic linkage map of soybean and the development of a 1536 universal soy linkage panel for quantitative trait locus mapping. Crop Sci 50:960–968

    Article  CAS  Google Scholar 

  11. Isobe S, Kölliker R, Hisano H, Sasamoto S, Wada T, Klimenko I, Okumura K, Tabata S (2009) Construction of a consensus linkage map for red clover (Trifolium pratense L.). BMC Plant Biol 9:57

    PubMed  Article  Google Scholar 

  12. Jenczewski E, Ghérardi M, Bonnin I, Prosperi JM, Olivieri I, Huguet T (1997) Insight on segregation distortions in two intraspecific crosses between annual species of Medicago (Leguminosae). Theor Appl Genet 94:682–691

    Article  Google Scholar 

  13. Kosambi DD (1944) The estimation of map distances from recombination values. Ann Eugen 12:172–175

    Google Scholar 

  14. Lambrides CJ, Lawn RJ, Godwin ID, Manners J, Imrie BC (2000) Two genetic linkage maps of mungbean using RFLP and RAPD markers. Aust J Agric Res 51:415–425

    Article  CAS  Google Scholar 

  15. Liang SX, Zhen SX, Zhen ZT (2006) Segregation distortion and its effect on genetic mapping in plants. Chin J Agric Biotechnol 3:163–169

    Article  Google Scholar 

  16. Lombard V, Delourme R (2001) A consensus linkage map for rapeseed (Brassica napus L.): construction and integration of three individual maps from DH populations. Theor Appl Genet 103:491–507

    Article  CAS  Google Scholar 

  17. Mace ES, Rami JF, Bouchet S, Klein PE, Klein RR, Kilian A, Wenzl P, Xia L, Halloran K, Jordan DR (2009) A consensus genetic map of sorghum that integrates multiple component maps and high-throughput Diversity Array Technology (DArT) markers. BMC Plant Biol 9:13

    PubMed  Article  Google Scholar 

  18. Marcel TC, Varshney RK, Barbieri M, Jafary H, de Kock MJD, Graner A, Niks RE (2007) A high-density consensus map of barley to compare the distribution of QTLs for partial resistance to Puccinia hordei and of defence gene homologues. Theor Appl Genet 114:487–500

    PubMed  Article  CAS  Google Scholar 

  19. Millan T, Winter P, Ju¨ngling R, Gil J, Rubio J, Cho S, Cobos MJ, Iruela M, Rajesh PN, Tekeoglu M, Kahl G, Muehlbauer FJ (2010) A consensus genetic map of chickpea (Cicer arietinum L.) based on 10 mapping populations. Euphytica 175:175–189

    Article  CAS  Google Scholar 

  20. Odeny DA, Jayashree B, Ferguson M, Hoisington D, Cry LJ, Gebhardt C (2007) Development, characterization and utilization of microsatellite markers in pigeonpea. Plant Breed 126:130–136

    Article  CAS  Google Scholar 

  21. Radhika P, Gowda SJM, Kadoo NY, Mhase LB, Jamadagni BM, Sainani MN, Chandra S, Gupta VS (2007) Development of an integrated intraspecific map of chickpea (Cicer arietinum L.) using two recombinant inbred line populations. Theor Appl Genet 115:209–216

    PubMed  Article  CAS  Google Scholar 

  22. Saxena KB, Sultana R, Mallikarjuna N, Saxena RK, Kumar RV, Sawargaonkar SL, Varshney RK (2010a) Male-sterility systems in pigeonpea and their role in enhancing yield. Plant Breed 129:125–134

    Article  Google Scholar 

  23. Saxena RK, Prathima C, Saxena KB, Hoisington DA, Singh NK, Varshney RK (2010b) Novel SSR markers for polymorphism detection in pigeonpea (Cajanus spp.). Plant Breed 129:142–148

    Article  CAS  Google Scholar 

  24. Sewell MM, Sherman BK, Neale DB (1999) A consensus map for loblolly pine (Pinus taeda L.). 1. Construction and integration of individual linkage maps from two outbred three-generation pedigrees. Genetics 151:321–330

    PubMed  CAS  Google Scholar 

  25. Shanower TG, Romeis J, Minja EM (1999) Insect pests of pigeonpea and their management. Annu Rev Entomol 44:77–96

    PubMed  Article  CAS  Google Scholar 

  26. Somers DJ, Isaac P, Edwards K (2004) A high-density microsatellite consensus map for bread wheat (Triticum aestivum L.). Theor Appl Genet 109:1105–1114

    PubMed  Article  CAS  Google Scholar 

  27. Tan XL, Vanavichit A, Amornsilpa S, Trangoonrung S (1998) Genetic analysis of rice CMS-WA fertility restoration based on QTL mapping. Theor Appl Genet 5:994–999

    Article  Google Scholar 

  28. Truco MJ, Antonise R, Lavelle D, Ochoa O, Kozik A, Witsenboer H, Fort SB, Jeuken MJ, Kesseli RV, Lindhout P, Michelmore RW, Peleman J (2007) A high-density integrated genetic linkage map of lettuce (Lactuca spp.). Theor Appl Genet 115:735–746

    PubMed  Article  CAS  Google Scholar 

  29. Van Ooijen JW, Voorrips RE (2001) JoinMap 3.0, software for the calculation of genetic linkage maps. Plant Research International, Wageningen, The Netherlands

  30. Varshney RK, Graner A, Sorrells ME (2005) Genic microsatellite markers in plants: features and applications. Trends Biotechnol 23:48–55

    PubMed  Article  CAS  Google Scholar 

  31. Varshney RK, Marcel TC, Ramsay L, Russell J, Röder M, Stein N, Waugh R, Langridge P, Niks RE, Graner A (2007) A high density barley microsatellite consensus map with 775 SSR loci. Theor Appl Genet 114:1091–1103

    PubMed  Article  CAS  Google Scholar 

  32. Varshney RK, Glaszmann JC, Leung H, Ribaut JM (2010a) More genomic resources for less-studied crops. Trends Biotechnol 28:452–460

    PubMed  Article  CAS  Google Scholar 

  33. Varshney RK, Thudi M, May GD, Jackson SA (2010b) Legume genomics and breeding. Plant Breed Rev 33:257–304

    Article  Google Scholar 

  34. Varshney RK, Penmetsa RV, Dutta S, Kulwal PL, Saxena RK, Datta S, Sharma TR, Rosen B, Carrasquilla-Garcia N, Farmer AD, Dubey A, Saxena KB, Gao J, Fakrudin B, Singh MN, Singh BP, Wanjari KB, Yuan M, Srivastava RK, Kilian A, Upadhyaya HD, Mallikarjuna N, Town CD, Bruening GE, He G, May GD, McCombie R, Jackson SA, Singh NK, Cook DR (2010c) Pigeonpea genomics initiative (PGI): an international effort to improve crop productivity of pigeonpea (Cajanus cajan L.). Mol Breed 26:393–408

    PubMed  Article  Google Scholar 

  35. Varshney RK, Chen W, Li Y, Bharti AK, Saxena RK, Schlueter JA, Donoghue MTA, Azam S, Fan G, Whaley AM, Farmer AD, Sheridan J, Iwata A, Tuteja R, Penmetsa RV, Wu W, Upadhyaya HD, Yang SP, Shah T, Saxena KB, Michael T, McCombie WR, Yang B, Zhang G, Yang H, Wang J, Spillane C, Cook DR, May GD, Xu X, Jackson SA (2012) Draft genome sequence of pigeonpea (Cajanus cajan), an orphan legume crop of resource-poor farmers. Nat Biotechnol 30:83–89

    Article  CAS  Google Scholar 

  36. Voorrips RE (2002) MapChart: software for the graphical presentation of linkage maps and QTLs. J Hered 93:77–78

    PubMed  Article  CAS  Google Scholar 

  37. Wang LH, Zhang BX, Lefebvre V, Huang SW, Daubeze AM, Palloix A (2004) QTL analysis of fertility restoration in cytoplasmic male sterile pepper. Theor Appl Genet 109:1058–1063

    PubMed  Article  CAS  Google Scholar 

  38. Wang S, Basten CJ, Zeng ZB (2007) Windows QTL cartographer 2.5. Department of Statistics, 2007, North Carolina State University, Raleigh (http://statgen.ncsu.edu/qtlcart/)

  39. Wenzl P, Li H, Carling J, Zhou M, Raman H, Paul E, Hearnden P, Maier C, Xia L, Caig V, Ovesná J, Cakir M, Poulsen D, Wang J, Raman R, Smith KP, Muehlbauer GJ, Chalmers KJ, Kleinhofs A, Huttner E, Kilian A (2006) A high-density consensus map of barley linking DArT markers to SSR, RFLP and STS loci and agricultural traits. BMC Genomics 7:206

    PubMed  Article  Google Scholar 

  40. Winter P, Pfaff T, Udupa SM, Hüttel B, Sharma PC, Sahi S, Arreguin-Espinoza R, Weigand F, Muehlbauer FJ, Kahl G (1999) Characterization and mapping of sequence-tagged microsatellite sites in the chickpea (Cicer arietinum L.) genome. Mol Gen Genet 262:90–101

    PubMed  Article  CAS  Google Scholar 

  41. Wu YQ, Huang Y (2006) An SSR genetic map of Sorghum bicolor (L.) Moench and its comparison to a published genetic map. Genome 50:84–89

    Google Scholar 

  42. Xu Y, Zhu L, Xiao J, Huang N, McCouch S (1997) Chromosomal regions associated with segregation distortion of molecular markers in F2, backcross, doubled– haploid and recombinant inbred populations in rice (Oryza sativa L.). Mol Gen Genet 253:535–545

    PubMed  Article  CAS  Google Scholar 

  43. Yue B, Vick BA, Cai X, Hu J (2010) Genetic mapping for the Rf1 (fertility restoration) gene in sunflower (Helianthus annuus L.) by SSR and TRAP markers. Plant Breed 129:24–28

    Article  CAS  Google Scholar 

  44. Zhou W, Kolb FL, Domier LL, Wang S (2005) SSR markers associated with fertility restoration genes against Triticum timopheevii cytoplasm in Triticum aestivum. Euphytica 141:33–40

    Article  CAS  Google Scholar 

Download references

Acknowledgments

The authors are thankful to Indian Council of Agricultural Research (ICAR) and Generation Challenge Programme (GCP) of CGIAR for supporting this research. Thanks are also due to Trushar Shah for his suggestions on CMap analysis, Naresh Kumar for generating some genotyping data for ICPB 2049 × ICPL 99050 population published in an earlier article and to Abdul Gafoor, S Ramesh and Ms. G Srivani for their excellent technical support.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Affiliations

Authors

Corresponding author

Correspondence to Rajeev K. Varshney.

Additional information

Communicated by A. Schulman.

Electronic supplementary material

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and Permissions

About this article

Cite this article

Bohra, A., Saxena, R.K., Gnanesh, B.N. et al. An intra-specific consensus genetic map of pigeonpea [Cajanus cajan (L.) Millspaugh] derived from six mapping populations. Theor Appl Genet 125, 1325–1338 (2012). https://doi.org/10.1007/s00122-012-1916-5

Download citation

Keywords

  • Quantitative Trait Locus
  • Mapping Population
  • Simple Sequence Repeat Marker
  • Cytoplasmic Male Sterility
  • Quantitative Trait Locus Analysis