Pigeonpea genomics initiative (PGI): an international effort to improve crop productivity of pigeonpea (Cajanus cajan L.)
- First Online:
- Cite this article as:
- Varshney, R.K., Penmetsa, R.V., Dutta, S. et al. Mol Breeding (2010) 26: 393. doi:10.1007/s11032-009-9327-2
- 2k Downloads
Pigeonpea (Cajanus cajan), an important food legume crop in the semi-arid regions of the world and the second most important pulse crop in India, has an average crop productivity of 780 kg/ha. The relatively low crop yields may be attributed to non-availability of improved cultivars, poor crop husbandry and exposure to a number of biotic and abiotic stresses in pigeonpea growing regions. Narrow genetic diversity in cultivated germplasm has further hampered the effective utilization of conventional breeding as well as development and utilization of genomic tools, resulting in pigeonpea being often referred to as an ‘orphan crop legume’. To enable genomics-assisted breeding in this crop, the pigeonpea genomics initiative (PGI) was initiated in late 2006 with funding from Indian Council of Agricultural Research under the umbrella of Indo-US agricultural knowledge initiative, which was further expanded with financial support from the US National Science Foundation’s Plant Genome Research Program and the Generation Challenge Program. As a result of the PGI, the last 3 years have witnessed significant progress in development of both genetic as well as genomic resources in this crop through effective collaborations and coordination of genomics activities across several institutes and countries. For instance, 25 mapping populations segregating for a number of biotic and abiotic stresses have been developed or are under development. An 11X-genome coverage bacterial artificial chromosome (BAC) library comprising of 69,120 clones have been developed of which 50,000 clones were end sequenced to generate 87,590 BAC-end sequences (BESs). About 10,000 expressed sequence tags (ESTs) from Sanger sequencing and ca. 2 million short ESTs by 454/FLX sequencing have been generated. A variety of molecular markers have been developed from BESs, microsatellite or simple sequence repeat (SSR)-enriched libraries and mining of ESTs and genomic amplicon sequencing. Of about 21,000 SSRs identified, 6,698 SSRs are under analysis along with 670 orthologous genes using a GoldenGate SNP (single nucleotide polymorphism) genotyping platform, with large scale SNP discovery using Solexa, a next generation sequencing technology, is in progress. Similarly a diversity array technology array comprising of ca. 15,000 features has been developed. In addition, >600 unique nucleotide binding site (NBS) domain containing members of the NBS-leucine rich repeat disease resistance homologs were cloned in pigeonpea; 960 BACs containing these sequences were identified by filter hybridization, BES physical maps developed using high information content fingerprinting. To enrich the genomic resources further, sequenced soybean genome is being analyzed to establish the anchor points between pigeonpea and soybean genomes. In addition, Solexa sequencing is being used to explore the feasibility of generating whole genome sequence. In summary, the collaborative efforts of several research groups under the umbrella of PGI are making significant progress in improving molecular tools in pigeonpea and should significantly benefit pigeonpea genetics and breeding. As these efforts come to fruition, and expanded (depending on funding), pigeonpea would move from an ‘orphan legume crop’ to one where genomics-assisted breeding approaches for a sustainable crop improvement are routine.
KeywordsMolecular markers Genetic mapping Trait mapping Genomics Next generation sequencing Gene discovery Crop improvement
Pigeonpea (Cajanus cajan [L.] Millspaugh) is an important food legume (or pulse) crop that is predominantly cultivated in tropical and subtropical regions of the world. It is a diploid (2n = 22) crop with a genome size of 808 Mbp. Pigeonpea is a drought tolerant crop with large variation for days to maturity, ranging from extra short (90 days) duration to long duration (300 days). It is generally cultivated as a sole crop or as a mixed crop with short maturing cereals or legumes as well as with long duration crops like cotton and groundnut. Globally pigeonpea is cultivated on 4.64 M ha, with an annual production of 3.43 million tons and a mean productivity of 780 kg/ha. India is the primary pigeonpea growing country in the world, accounting for 3.53 M ha area and 2.51 million tons of production. Pigeonpea seeds have 20–22% protein and are consumed as green peas, whole grain or split peas. The seed and pod husks make a quality feed, whereas dry branches and stems serve as domestic fuel. Fallen leaves from the plant provide vital nutrients to the soil and the plant also enriches soil through symbiotic nitrogen fixation.
Pigeonpea belongs to subtribe Cajaninae of tribe Phaseoleae under sub-family Papilionoideae of family Leguminosae. C. cajan is the only domesticated species under sub-tribe Cajaninae. Within Phaseoleae, Cajaninae is well distinguished by the presence of vesicular glands on leaves, calyx, and pods. Currently, 11 genera are grouped under Cajaninae. The members of the earlier genus Atylosia closely resembled the genus Cajanus in major vegetative and reproductive characters but they were relegated to two separate genera, mainly on the basis of the presence or absence of seed strophiole.
In 1980s, van der Maesen revised the taxonomy of both the genera and merged the genus Atylosia in to Cajanus (van der Maesen 1980). The revised genus Cajanus currently comprises of 18 species from Asia, 15 species from Australia, and one species from West Africa. Of these, 13 are found only in Australia, 8 in the Indian subcontinent, and 1 in West Africa, with the remaining 14 species occurring in more than one country. Based on growth habit, leaf shape, hairiness, structure of corolla, pod size, and presence of strophiole, van der Maesen (1980) grouped the genus Cajan into six sections. The 18 erect species were placed under three sections: seven species in Atylosia, nine species in section Fruticosa, and two species in section Cajanus that consists of the cultivated species along with its progenitor, C. cajanifolius. Eleven climbing and creeping species were arranged in two sections, Cantharospermum (5) and Volubilis (6) and the remaining three trailing species were classified under Rhynchosoides. Three Cajanus species have been further subdivided into botanical varieties; C. scarabaeoides into var. pedunculatus and var. scarabaeoides; C. reticulatus into var. grandifolius, var. reticulatus, and var. maritimus; and C. volubilis into var. burmanicus and var. volubilis.
Breeding and production constraints in pigeonpea
In pigeonpea, plant growth as well as flowering is highly influenced by the environment. Hence, breeding for wider adaptation, a complex phenomenon is a major issue to be tackled. Although related wild species are a rich reservoir of not only resistance genes against various biotic and abiotic stresses but also of genes responsible for yield components such as pods per plant, length of fruiting branches, and number of primary branches per plant, use of inter-specifics in pigeonpea improvement have been limited. This is due to the poor crossability of cultivated Cajanus cajan to species other than the closest species, Cajanus cajanifolia and C. scaraboides. Biotechnology approaches, such as in vitro rescue and propagation of wide cross hybrids, in conjunction with the use of bridge crosses, may enable the transfer of novel genes from a wider range of germplasm within and outside the genus Cajanus. Ongoing efforts using molecular tools to examine taxonomic relationships within subtribe Cajaninae should clarify phylogenetic relationships within the subtribe, and may suggest parsimonious routes for trait introgression.
Despite the importance of pigeonpea in semi-arid regions of the world, little concerted research effort has been directed towards pigeonpea crop improvement. A number of factors are responsible for the poor productivity, including lack of improved cultivars, poor crop husbandry, pests, and diseases. Major diseases include Fusarium wilt (Fusarium udum Butler), sterility mosaic disease (Sterility mosaic virus) and phytophthora blight (Phytophthora drechsleri), and pests such as gram pod borer (Helicoverpa armigera), Maruca (Maruca vitrata), pod fly (Melanagromyza obtusa), plume moth (Exelastis atomosa) cause substantial reduction to pigeonpea production every year. Furthermore, sensitivity to abiotic stresses like water-logging, common in this rain fed crop during early stages, and stress from low water conditions in the later stages, and salinity also reduce pigeonpea production. Conventional breeding approaches for pigeonpea improvement have been in use for several decades but have had limited success in overcoming these biotic and abiotic challenges to stable crop production (Varshney et al. 2007; Saxena 2008).
Knowledge of genetic inheritance of yield and related traits plays an important role in deciding breeding strategies and methodologies for crop improvement. In comparison to other economically important crops, relatively less effort has been made to understand the genetics of important traits in pigeonpea. Both additive effects and dominant non-additive effects have been reported as being important in determining yield, plant height, and protein content (Saxena and Sharma 1990). Pleiotropic effects of genes, physiological changes, and highly sensitive nature of pigeonpea towards the environmental changes makes it difficult to interpret the inheritance of yield and associated characters (Byth et al. 1981). Like yield, restoration of male fertility in cytoplasmic-genetic male-sterility (CGMS) based hybrids is also critical and important trait in pigeonpea as it governs the viability of hybrid system.
Current status of pigeonpea breeding research
Breeding in pigeonpea has been more challenging compared to other food legumes due to various crop specific traits. Pigeonpea is an often cross pollinated crop, with an insect-aided natural out crossing range from 20 to 70% (Saxena et al. 1990) that has limited the use of efficient selection and mating designs possible in self-pollinating species. Pure line breeding, population breeding, mutation breeding, and wide hybridization have been used for development of new varieties in pigeonpea and have led to incremental improvements in the yield potential of this crop. To overcome this bottleneck, two genetic male-sterility (GMS) systems were discovered in pigeonpea (Reddy et al. 1978; Saxena et al. 1983). Despite a 30% yield advantage over the non-hybrids, the GMS based hybrids could not be commercialized due to high cost of hybrid seed production. The yield-jump observed in the GMS hybrids encouraged the development of the alternative and more efficient cytoplasmic-genetic male-sterility (CGMS) system (Tikka et al. 1997; Saxena and Kumar 2003; Wanjari and Patel 2003). As a result of intensive hybrid development programme at ICRISAT in collaboration with its partners, the first CMS- based hybrid GTH-1 was released in India in 2004. Another CMS-based pigeonpea hybrid, ICPH 2671 was developed using C. cajanifolius (A4 cytoplasm) at ICRISAT in 2005 (Saxena 2008), that has been released as “Pushkal” by Pravardhan Seeds for cultivation in several states of India such as Andhra Pradesh, Karnataka, Madhya Pradesh, and Maharashtra. Continued hybrid-technology based improvements in pigeonpea yield potential, together with on going efforts to breed for resistance to biotic and abiotic stresses (Fusarium wilt, sterility mosaic, pod borer, etc.) are likely to lead to increased area under pigeonpea hybrids, contribute to increased crop returns for farmers and sustainable pigeonpea production.
The pigeonpea genomics initiative
Although pigeonpea improvement through conventional breeding and hybrid technology (Saxena and Kumar 2003) is ongoing, molecular breeding should accelerate utilization of the substantial variability among the pigeonpea landraces and germplasm lines for various morphological, physiological, and agronomic traits. The genetic basis of most important traits in pigeonpea is not known and to date, no linkage map has been reported. This may be attributed to: (1) low levels of DNA polymorphism within the primary (cultivated) gene pool, and (2) very small number of molecular markers available (Burns et al. 2001; Yang et al. 2006; Odeny et al. 2007, 2009; Saxena et al. 2009a). To address the need for genomic tools in pigeonpea, the pigeonpea genomics initiative (PGI) has focused on the development of a robust set of polymorphic markers including microsatellite or simple sequence repeats (SSRs; Gupta and Varshney 2000), single nucleotide polymorphisms (SNP), and diversity arrays technology (DArT) markers. Use of these molecular markers in diverse mapping populations in pigeonpea will facilitate the construction of a genetic map, mapping, and map based cloning of disease resistance genes, quantitative trait loci (QTL) mapping, and the integration of phenotypic data across the different mapping populations. Simultaneously, there was a need to develop mutant lines and a large DNA-insert library e.g., bacterial artificial chromosome (BAC) library to enable map-based cloning and functional analysis of traits in pigeonpea.
To address these needs, the Indian Council of Agricultural Research (ICAR) and the Government of India, under the umbrella of Indo-US Agricultural Knowledge Initiative (AKI), floated the Pigeonpea Genomics Initiative in November 2006. Initial partners in the initiative were National Research Centre for Plant Biotechnology (NRCPB), New Delhi; Indian Institute of Pulses Research (IIPR), Kanpur; Dr Panjabrao Deshmukh Agricultural University (PDAU), Akola; University of Agricultural Sciences, Dharwad (UAS-D); Banaras Hindu University (BHU), Varanasi; and International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Patancheru from India, and the University of California, Davis (UC-Davis) from USA. Subsequently, as a result of funding from the Generation Challenge Program (GCP) of the Consultative Group on International Agricultural Research (CGIAR) or through informal collaborations, additional partners joined the PGI, including National Centre for Genome Resources (NCGR), Santa Fe, New Mexico; Tuskegee University, Tuskegee, Alabama; Purdue University, West Lafayette, Indiana; The J. Craig Venter Institute (JCVI), Maryland; Cold Spring Harbour Laboratory (CSHL), New York, from the USA, and Diversity Array Technology Pty Ltd., Yaramulla from Australia.
Achievements in pigeonpea genetics and genomics
The availability of appropriate genetic resources is a pre-requisite for the effective use of genomics derived tools in any crop species (Varshney et al. 2005a). Therefore the PGI consortium planned from the beginning to develop a suitable set of genetic resources. Significant progress has been made during the last <3 years in developing a large number of populations and for genetic mapping and reverse genetic analysis.
Current status on development of pigeonpea mapping populations at different collaborating centers
Size of population
Important segregating traits
International Crops Research Institute for the Semi Arid Tropics (ICRISAT), Hyderabad
ICPB 2049 × ICPL 99050
ICPL 20096 × ICP 332
Fusarium wilt and sterility mosaic
ICPL 20097 × ICP 8863
ICPL 87119 × ICPL 87091
Fusarium wilt and sterility mosaic
ICP 7035 × ICPL 332
ICPL 88034 × ICPL 84023
ICP 28 × ICPW 94
ICPA 2043 × ICPR 3467
ICPA 2043 × ICPR 2671
ICPA 2039 × ICPR 2447
ICPA 2039 × ICPR 2438
Dr. Panjabrao Deshmukh Agricultural University (PDAU), Akola
TAT10 × BSMR736
Fusarium wilt and sterility mosaic, morphological traits
Asha × TV1
Fusarium wilt and sterility mosaic
AKT 8811 × BSMR 736
Fusarium wilt and sterility mosaic
GT 288 × C 11
Fusarium wilt and morphological traits
University of Agricultural Sciences (UAS), Dharwad
Gullyal white × Maruti
Fusarium wilt, morphological traits, seed colour
Gullyal white × BSMR 736
Sterility mosaic, seed colour
Asha × Andola black
Asha × Gulyal red
Indian Institute of Pulses Research (IIPR), Kanpur
Asha × UPAS 120
For Reference map and Fusarium wilt
Bahar × 67B
IPA6-1 × UPAS 120
Banaras Hindu University (BHU), Varanasi
MAL 13 × MA Deo 74
NDA 1 × MA 6
MAL 13 × ICPL 9150
Features of the parental genotypes used for developing mapping populations
Susceptible to Fusarium wilt
Resistant to Fusarium wilt
Resistant to Fusarium wilt and sterility mosaic
Susceptible to Fusarium wilt and sterility mosaic
Resistant to sterility mosaic
ICP 8863 (Maruti)
Erect, mid late, highly resistant to Fusarium wilt and susceptible to SMD, an extensively grown variety in Northern Karnataka and Maharashtra region of India, red seeded genotype
ICPL 87119 (Asha)
A high yielding popular variety, matures late, red seeded, susceptible to terminal drought stress in the field; resistant to Fusarium wilt and sterility mosaic
Susceptible to Fusarium wilt and sterility mosaic
Resistant to sterility mosaic
Susceptible to water logging
Tolerant to water logging
Erect, extra early, susceptible to Fusarium wilt and sterility mosaic
Spreading, mid-late, green stem, red seeded with yellow flowers; resistant to Fusarium wilt and highly resistant to SMD
Semi spreading, early, susceptible to Fusarium wilt and sterility mosaic
Semi spreading, early, tolerant to Fusarium wilt and susceptible to sterility mosaic
Erect, early, susceptible to Fusarium wilt and sterility mosaic, white seeded
Spreading, mid late, resistant to Fusarium wilt and susceptible to sterility mosaic, red seeded
A local genotype highly susceptible for Fusarium wilt and SMD, flowers early, medium duration, light brown stem with light red flowers, white seeded; good dhal quality
A local genotype highly susceptible for Fusarium wilt and SMD, flowers early, medium duration, light brown stem with light red flowers, red seeded, good milling quality, known for field drought tolerance
A local genotype, flowers early, field tolerance to drought stress
Indeterminate; Early, susceptible to wilt
Compact; Late, yellow flower; flat and deep purple pods; brown seeds; susceptible to wilt
Determinate; dwarf, early (~100 cm), susceptible to wilt
Indeterminate; late, tall (>250 cm), resistant to wilt
Spreading; green stem; light yellow flowers; green pods; brown seeds; resistant to sterility mosaic
MA Deo 74
Compact; yellow flower with purplish streaks; green pods with brown seeds; susceptible to sterility mosaic
Compact; yellow flower with purplish streaks; green pods with brown oval seeds; susceptible to sterility mosaic
Spreading; yellow flower; dark purple pods; brown-slightly rectangular seeds, resistant to sterility mosaic
Compact, purple stem; yellow flower; green pods; creamy white seeds; moderately resistant to sterility mosaic
As mentioned earlier, ICRISAT in collaboration with various partners has been successful in developing hybrids in pigeonpea; ICRISAT is developing populations for mapping of the fertility restorer (Rf) gene for A4 cytoplasm. Identification of fertility restorer lines for a particular cytoplasm is an important requirement for sustainable pigeonpea hybrid production. In this context, additional eight mapping populations (BC1F1 and F2) have been developed at ICRISAT for the mapping of Rf gene (Table 1). Molecular markers tightly linked with Rf gene will help breeders for marker assisted introgression (MSI) of fertility restorer loci into other elite cultivars using marker assisted selection.
Rapid acquisition of genomic sequence data has elevated a new discipline, functional genomics, which focuses on determination of gene function. To facilitate functional studies in pigeonpea that would follow from genome sequencing, PGI has initiated the development of a TILLING (Targeted Induced Local Lesions in Genomes; McCallum et al. 2000)-based reverse genetic resource for pigeonpea. TILLING is a reverse genetic approach where a library of saturation mutagenized individuals, each with several hundred-low 1,000 s of point mutations, are screened by high-throughput genotyping to identify individuals harboring a range of single nucleotide induced variants in genes of interest. The reference genotype Asha (ICPL 87119) was mutagenized with ethyl methane sulfonate (EMS) mutagen to develop TILLING population at BHU (Banaras Hindu University). In the pilot experiments, BHU treated 3,000 seeds in each of four different concentrations of EMS (0.01, 0.02, 0.03, and 0.04 M) during the year 2007–2008. As expected, the germination (70%) and pollen fertility (87.9%) were higher with 0.01 M treatment of EMS and reduced down with the subsequent doses. To date, a total of 505 single plant M1 lines yielded fertile M2 seed. In the M2 generation, a number of chlorophyll and plant form (very dwarf, dwarf, fasciated and tall) mutants have been identified. Efforts to significantly expand such mutant populations to several 1,000–10,000 plant lines are underway.
A significant amount of genomic resources have become available for pigeonpea during last 3 years (see Varshney et al. 2009a), some of them are discussed below.
Large-insert genomic DNA library
cDNA libraries and expressed sequence tags
Transcriptome sequencing has been a popular approach to efficiently identify the transcribed portion of the genome (Sreenivasulu et al. 2002). With an objective to identify genes associated with Fusarium wilt and sterility mosaic diseases, a total of 16 cDNA libraries were generated from Fusarium udum and Sterility mosaic virus challenged root tissues of four genotypes (ICPL 20102 and ICP 2376 for FW; ICP 7035 and TTB 7 for SMD). Several thousand cDNA clones from these cDNA libraries were sequenced at ICRISAT (Raju et al., unpublished) to obtain a total of 5,680 expressed sequence tags (ESTs) from FW challenged and 3,788 ESTs from SMD challenged tissues and submitted to NCBI.
In addition to traditional Sanger sequencing to obtain these FW and SMD associated ESTs, next generation sequencing (NGS) technology (Varshney et al. 2009b), was employed to identify whole plant ESTs. cDNA prepared at ICRISAT from 15 tissues representing different developmental stages of the Pusa Ageti genotype were pooled, and normalized to minimize redundancy and enhance efficiency of capture of rare transcripts. ICRISAT in collaboration with JCVI sequenced the normalized cDNA pool using 454/FLX sequencing, a next generation sequencing technology that offers higher throughput relative to Sanger sequencing. Of 4,96,705 sequence reads 2, 87,766(>50%) were longer than 200 bp. Cluster analysis of these sequences, done at NCGR in collaboration with ICRISAT, provided 48,519 contigs. Similarly, NRCPB has employed 454/FLX sequencing on the cDNA pools from two cultivars, Asha and UPAS120. As a result of this a total of 1,696,724 sequence reads (566 Mb) with an average read length of 333 bp were generated from these two genotypes at NRCPB. Sequence analysis of these genotypes provided 42,000 unique sequences of which 25,000 were common between these two genotypes.
Together these transcript sequences represent a significant fraction of the pigeonpea transcriptome, and should be a useful resource for marker development as well as gene discovery and functional studies.
Microsatellite/simple sequence repeat markers
Simple sequence repeat markers are the markers of choice for plant genetics and breeding applications (Gupta and Varshney 2000). In case of pigeonpea, however, only 140 SSR markers were available in public domain before the establishment of PGI (Burns et al. 2001; Odeny et al. 2007, 2009). As <10% SSR markers show polymorphism in the intra-specific germplasm, development of an intra-specific map with moderate marker density (with about 300 markers in each intra-specific population) would require availability of at least 3,000 SSR markers. To develop the larger number of SSR markers, three approaches are being used at ICRISAT in collaboration with UC-Davis and other collaborating centers.
Advances in development of SSR markers in pigeonpea under PGI
SSR enriched library
Number of clones
Amount of sequence data (kb)
SSRs frequency (in kb)
Primer pairs designed
Primer pairs synthesized
BAC-end sequences derived SSRs
In species where BAC-end sequences are available, development of SSR markers from the BAC-end sequences (BESs) is very cost effective (Shultz et al. 2008; Temnykh et al. 2001). SSR development from BAC ends obviates the need for a priori assumptions regarding the nature of the repeat motifs within a species, and offers genome-wide coverage as all repeat types are systematically sampled in the randomly selected BACs. Therefore, all 87,590 pigeonpea BESs were screened with MISA (MIcroSAtellite) search module for identification of SSRs. In total, 18,149 SSRs were identified in 14,001 BESs representing 6,590 BAC clones. 3,124 BESs contained more than one SSR and 2,111 SSRs were present in compound form. Mono- and di-nucleotide were the most abundant classes of SSRs, followed by tri- and tetra-nucleotides SSRs; penta- and hexa-nucleotide SSRs occurred at lower proportions. From a total of 6,590 primer pairs designed 3,072 primer pairs were synthesized and tested. Amplified products were obtained for 2,565 primer pairs (Table 3) and are currently being used at ICRISAT to identify polymorphism in a set of 24 pigeonpea genotypes that are parents of different mapping populations.
Expressed sequence tags derived SSRs
With the availability of pigeonpea ESTs from transcriptome sequencing described above, it has been possible to identify SSRs from EST sequences. Although expressed sequence tags derived SSR (EST-SSR) markers have been useful for assaying functional genetic diversity in the germplasm, these markers display lower levels of polymorphism relative to SSRs derived from genomic sequences. In case of pigeonpea, the unigene set of 5,085 genes assembled from cluster analysis of the available Sanger ESTs, was searched for the presence of SSRs at ICRISAT to identify 3,583 EST-SSRs that occurred at a frequency of 1/800 bp. 698 ESTs contained more than one SSR and 1,729 SSRs were found as compound SSRs. The majority (3,498, 97.6%) of EST-SSRs were mono-nucleotide repeats, with only a limited number of SSRs of other repeat classes [di- (40), tri- (33), tetra- (9), penta- (2), and hexa (1)-nucleotide SSRs]. Primer pairs were designed for 383 SSRs including some mononucleotide SSRs as Saxena et al. (2009a) reported a moderate level of polymorphism for mononucleotide SSRs. Of 84 randomly selected EST-SSR targeting primer pairs, 52 (61.9%) primer pairs showed scorable amplification of which 15 (28.8%) markers showed polymorphism in a set of 40 genotypes. These 15 EST-SSRs identified 2–7 alleles, with the PIC value ranging from 0.20 to 0.70.
As large amount of transcript data have been generated from three genotypes by using 454 sequencing approach, SSR mining has been undertaken in these datasets. For instance, 87,314 SSRs have been identified in 188,741 sequences at ICRISAT. While 12,168 454 sequences contained more than one SSR, 12,679 SSRs were found as compound SSRs (Table 3). In addition to this, by using 454 sequences of two genotypes (Asha and UPAS120), a set of 400 potential polymorphic SSR markers has been identified at NRCPB. Thus in principle, a larger number of primer pairs can be synthesized for SSRs identified in short read sequences generated by 454/FLX machine sequences for enhancing the repertoire of SSR markers for pigeonpea. Although a large number of SSRs could be developed from transcript data, ongoing efforts are focused on the use of the >3,000 set (Table 3) that are predominantly BES-SSRs. Such genomic sequence derived SSRs are more polymorphic relative to EST-SSRs (Varshney et al. 2005b), and offer the additional advantage of allowing for the anchoring of the source BACs to physical and genetic maps.
In summary, the pigeonpea community has an access to >3,000 SSR markers (Table 3). Availability of these markers together with other classes of markers should be a good resource for developing genetic maps and assessment of genetic diversity (Varshney et al. 2009a).
Single nucleotide polymorphism markers
In recent years, the development and use of SNP markers in plant genetics and breeding has gained popularity compared to SSRs, as SNPs are more abundant and amenable for high-throughput genotyping (Varshney 2009).
Conserved orthologous sequence based SNPs
Next generation sequencing based SNPs
Recent developments in NGS technologies are catalyzing the development of SNP markers even in those crops with little or no sequence information (Varshney et al. 2009b). ICRISAT and NCGR have been working to use the Solexa NGS technology to sequence transcriptomes of ten pigeonpea genotypes that are parents of six mapping populations. The availability of reference genome sequence data vastly improves the effectiveness of NGS approaches. In pigeonpea, transcript assembly (~48,000 transcript contigs) developed from 454/FLX NGS and Sanger ESTs will facilitate alignment of Solexa sequence data. Multiple sequence alignment (MSA) of Solexa datasets from the ten genotypes, together with reference sequence, should provide a large number of high confidence SNPs for high frequency alleles. SNPs identified from such NGS approaches, together with re-sequencing of COS loci in additional genotypes should allow for the development of additional SNP sets (for example, a 1,536 GoldenGate SNP assay or even larger) for mapping of several hundred SNPs in different intra-specific mapping populations.
Diversity array technology markers
Diversity array technology provides a sequence independent and high throughput approach to develop dominant-type markers, and provides a cost-effective whole-genome profiling (Jaccoud et al. 2001). Further, as the same platform is used for discovery and scoring of polymorphic markers it is a quite cost effective and user friendly approach for genotyping of polymorphic markers. DArT has been developed as a technology for whole-genome profiling in over 40 crop species. In pigeonpea, a pilot DArT array, comprising of 5,376 features was developed by Yang et al. (2006). When this array was used to analyze 96 genotypes representing 20 species of Cajanus, cultivated genotypes did not show much polymorphism. Under the framework of a recent project sponsored by Generation Challenge Programme, DArT Pty Ltd in collaboration with ICRISAT, has upgraded the DArT array for pigeonpea that has >15,000 features. These DArT markers are being used to genotype several mapping populations to develop integrated and high-density genetic maps of pigeonpea.
Isolation and characterization of nucleotide binding site domains
Harnessing the potential of comparative genomics
Comparative genomics offers the promise of leveraging genomic information from related species to more rapidly advance genetics in species with fewer genetic/genomic resources (O’Brien et al. 1999). Development of COS markers, as mentioned above, is one of those examples of application of comparative genomics approaches. The nearest reference genome (i.e., one with extensive genome sequence) for pigeonpea is soybean (Glycine max), another phaseoloid legume. Despite this phylogenetic relationship, leveraging soybean to advance pigeonpea genomics may be hampered by polyploidy (Shoemaker et al. 1996; Doyle et al. 2004; Walling et al. 2006) and extensive whole or partial genome duplication and gene fractionation (Schlueter et al. 2006, 2007; Innes et al. 2008). Thus comparisons to any one segment may not be as informative as comparisons to both duplicated segments (McClean et al., unpublished data). Two other reference legume species Medicago truncatula and Lotus japonicus have extensive genome sequence data, but are estimated to have diverged from the phaseoloid clade ~45–55 my, compared to 15–20 my for the pigeonpea-soybean split (Lavin et al. 2005), which suggests that soybean genome sequencing which is nearing completion (http://www.phytozome.org/soybean), may more readily benefit genetics and crop improvement in pigeonpea.
Towards genome sequencing of pigeonpea
At the inception of the PGI, a clone-by-clone approach was proposed to sequence the pigeonpea genome in Phase 3. However, recent developments in sequencing technologies (Mardis 2008) suggest possible revisions to this approach. NGS technologies can “democratize” genome sequencing for crops with little political and/or research support (Varshney et al. 2009b) such as pigeonpea. Although complete genome sequence would require more extensive resources, we anticipate that existing genomics resources (BACs, BES, transcript sequences, high density molecular maps) together with NGS technology could allow the assembly of a significant fraction of the low copy genic portions of the genome (euchromatin) in the near term, which in itself should revolutionize molecular breeding in pigeonpea. Ongoing rapid advances in sequencing technologies are likely to make complete genome sequencing in pigeonpea achievable in the not too distant future.
Summary and outlook
In many crop species, genomic tools and approaches have enhanced the precision and efficiency of prediction of phenotype from genotype (Varshney et al. 2005a) and have been instrumental in developing superior genotypes and varieties (Varshney et al. 2006, 2009c). However, until recently appropriate genomic tools were not available in pigeonpea. The PGI consortium, during the last 3 years, has been very successful in developing a significant amount of genetic and genomic resources in pigeonpea, with the majority of the data already in the public domain, or nearly so. Generation of a variety of genetic and genomic tools such as mapping populations, mutant population, different kinds of molecular markers (e.g., SSRs, DArTs, SNPs, COSs) at moderately large scale, BAC library, Sanger, and 454/FLX ESTs, unigenes, NBS-LRR genes, etc. in pigeonpea, from the PGI should have tremendous impact on pigeonpea breeding. For instance, molecular markers together with mapping populations would provide the markers associated with the trait through linkage mapping approach. High-throughput marker genotyping platforms such as DArT markers, GoldenGate assays for SNPs will enable the community to undertake association mapping to identify the markers/gene(s) associated with complex traits by harnessing the genetic variation present in the natural populations and germplasm collections. BAC library and BESs will help develop physical maps to anchor genome sequence data, and to clone genes in concert with transcriptomics resources. Molecular markers identified from these approaches that are associated with traits of importance to breeders should accelerate pigeonpea improvement via marker assisted selection (MAS) or transgenic approaches. Largely through the efforts of the PGI, pigeonpea should move from its current status of an ‘orphan’ legume to a well-resourced crop species.
In summary, modern molecular breeding methods together with the power of genomics and genetic resources developed through the PGI should revolutionize pigeonpea crop improvement, and consequently benefit farmers and consumers of this important pulse crop of India and the semi-arid regions of the world.
Authors are thankful to several research scholars and post-doctoral fellows working in the laboratories of collaborating research institutes of PGI as well as a number of collaborators engaged directly or indirectly in research activities of PGI. Authors would like to extend their sincere thanks to Dr. Mangla Rai, Director General, Indian Council of Agricultural Research (ICAR) and Secretary, Department of Agricultural Research and Extension (DARE) for his support in initiating the PGI under the Indo-US Agricultural Knowledge Initiative (AKI). Thanks are also due to Dr. Satya Prakash Tiwari, Deputy Director General (Education), ICAR; Dr. Jean-Marcel Ribaut, Director, Generation Challenge Programme (GCP, www.generationcp.org); Dr. William Dar, Director General, ICRISAT and Dr. Dave Hoisington, Deputy Director General-Research, ICRISAT for their strong support for pigeonpea genomics activities. Research in the laboratories of authors presented in this article is supported by ICAR under the umbrella of Indo-US AKI, Generation Challenge Program (GCP), and the National Science Foundation (USA).
This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.