A physical map of the Chinese chestnut (Castanea mollissima) genome and its integration with the genetic map
- First Online:
- Cite this article as:
- Fang, GC., Blackmon, B.P., Staton, M.E. et al. Tree Genetics & Genomes (2013) 9: 525. doi:10.1007/s11295-012-0576-6
- 2.3k Downloads
Three Chinese chestnut bacterial artificial chromosome (BAC) libraries were developed and used for physical map construction. Specifically, high information content fingerprinting was used to assemble 126,445 BAC clones into 1,377 contigs and 12,919 singletons. Integration of the dense Chinese chestnut genetic map with the physical map was achieved via high-throughput hybridization using overgo probes derived from sequence-based genetic markers. A total of 1,026 probes were anchored to the physical map including 831 probes corresponding to 878 expressed sequence tag-based markers. Within the physical map, three BAC contigs were anchored to the three major fungal blight-resistant quantitative trait loci on chestnut linkage groups B, F, and G. A subset of probes corresponding to orthologous genes in poplar showed only a limited amount of conserved gene order between the poplar and chestnut genomes. The integrated genetic and physical map of Chinese chestnut is available at www.fagaceae.org/physical_maps.
KeywordsCastanea spp. Chestnut BAC library HICF fingerprinting Physical map Comparative genomics Chestnut blight resistance
Chestnuts (Castanea spp.) and their relatives, including oaks (Quercus spp.) and beeches (Fagus spp.), comprise the Fagaceae, a plant family that dominates much of the climax hardwood forests of the Northern Hemisphere (Manos et al. 2008). On the North American continent, the American chestnut (Castanea dentata) was once an abundant and important element of the eastern hardwood forests, serving significant roles, both ecologic and economic (Hill 1994). However, from 1905 to the 1950s, a devastating invasion of the chestnut blight fungus (Cryphonectria parasitica) eliminated the American chestnut as a dominant forest tree species (Anagnostakis 1987).
Several strategies to restore the American chestnut forests, including biological control and hybrid breeding programs (MacDonald and Fulbright 1991; Hebard 2006), have been explored. A biological control approach that involved the introduction of exotic hypovirulent strains of Cryphonectria parasitica into local fungal communities successfully reduced the virulence of Cryphonectria strains in Europe (Heiniger and Rigling 1994). In North America, this approach was unsuccessful primarily due to the complexity of the vegetative incompatibility genetic system that prohibited fungal anastomosis (mycelial fusion) required for efficient transmission of the hypovirulent virus (Anagnostakis et al. 1986). For this reason, groups in the USA have pursued an interspecies backcross breeding strategy to introgress blight-resistant genes from Chinese chestnut (Castanea mollissima) into the American chestnut (Hebard 2006). Multiple American chestnut germplasm populations have been sampled to preserve as much of the genetic diversity of the species as possible (Hebard 2006). The long generation time and space requirements for chestnut breeding are significant barriers to the rapid development of resistant trees; however, the introduction of molecular markers and marker-assisted breeding and genetic engineering could accelerate the resistance improvement process (Merkle et al. 2007; Wheeler and Sederoff 2009).
An integrated structural genomic resource comprised of substantial expressed sequence tags (ESTs) and integrated (i.e., cross-referenced) genetic map and physical maps is ideal for high-resolution mapping of important traits and map-based cloning of candidate genes. In rice, Arabidopsis, and poplar, the model systems for monocots, dicots, and woody plants, respectively, such integrated resources serve as a platform for comparative genomics studies and provide scaffolds for gene mining from related but less studied species (Flavell 2009; Koornneef and Meinke 2010).
The Fagaceae Genome Project (discussed in Wheeler and Sederoff 2009) was launched in 2006 to develop genomic research tools to identify genes for chestnut blight resistance, establish chestnut (Castanea spp.) as a model for the Fagaceae, and to compare it to northern red oak (Quercus rubra), white oak (Quercus alba), and American beech (Fagus grandifolia). Specifically, the four main objectives of the project were to (1) develop DNA sequence-based resources in the form of ESTs and SSR markers and EST-based SNP and SSR markers; (2) develop a high-density, high-resolution resolution genetic map of the Chinese chestnut genome, positioning quantitative trait loci (QTLs) for chestnut blight resistance; (3) develop a whole genome physical map anchored to the genetic map; and (4) discover and analyze candidate genes for blight resistance located with the QTL intervals.
Prior to this project, our understanding of the chestnut genome, like most forest trees, was limited. In comparison to annual field crops and other model plants such as Arabidopsis, availability of highly informative genetic maps in forest trees having defined intervals for important phenotypic traits is limited. Most forest trees take years to flower, and progeny trials require a significant commitment of time, space, and labor for genetic analysis. For chestnut, a small number of genetic maps are currently available. These include (1) a genetic map based on a three-generation pedigree of an interspecific cross of American chestnut and Chinese chestnut comprising 196 RAPD and RFLP markers covering 530.1 cM (Kubisiak et al. 1997); (2) an updated version of the interspecific map, adding 275 AFLP markers, 24 SSR markers, and the 5S rDNA locus (Sisco et al. 2004); (3) a new genetic map of Chinese chestnut created from 1,401 new SSR and SNP markers (Kubisiak et al., companion manuscript) that served as a reference map for anchoring the physical map described here; and (4) a genetic map of European chestnut (Castanea sativa Mill.) using a two-way pseudo-testcross on a population of 96 F1 full-sib individuals (Casasoli et al. 2001).
In parallel to these genetic mapping efforts, a whole genome physical map (i.e., bacterial artificial chromosome (BAC) contig assembly) of the Chinese chestnut was constructed and anchored to a genetic map (Kubisiak et al., companion manuscript) via marker hybridization to BAC clones arrayed on filters. Besides integrating the genetic and physical maps, the marker hybridizations verified and improved the physical map through manual editing. Here we report on the physical mapping efforts including BAC library construction and analysis, high information content fingerprinting (HICF) and BAC contig assembly, integration of the physical map with the genetic map through marker hybridization, and comparative analyses with the poplar genome.
Materials and methods
BAC library construction
Three Chinese chestnut BAC libraries were prepared. Two BAC libraries, CMCMBb and CMCMBd, were created at the Clemson University Genomics Institute. The BAC library CM_MBc was constructed by LGG and differences in protocol for that library are noted in Additional File 1. Also, CUGI generated the BAC library CDC_Ba from the American chestnut (Castanea dentata) to be used for Castanea mollissima versus Castanea dentata comparative analysis. The BAC library construction protocol at CUGI is described below. Briefly, nuclei were isolated from young leaves and embedded in agarose plugs by the method of Peterson et al. (2000). A twofold increase in antioxidants was amended to the extraction buffer due to the high phenolic content of chestnut leaf tissue. The agarose plugs were treated with proteinase K supplemented with 0.2 % beta-mercaptoethanol to reduce oxidation of phenolic compounds. To fractionate genomic DNA, plugs were macerated with a razor blade followed by partial digestion with HindIII (libraries CMCMBb and CDC_Ba) or EcoRI (library CMCMBd) and size-selected in a clamped homogeneous electrical field apparatus (CHEF-DR III, Bio-Rad, Berkeley, CA, USA) (Peterson et al. 2000). After two rounds of size selection, DNA was recovered from the agarose by electroelution and cloned into the vector pIndigoBAC536. The ligation reactions were heated at 65 °C for 20 min to inactivate enzymes and desalted on a Millipore membrane (type VSWP, 0.025 μm) on 5 % PEG (polyethylene glycol) 8,000 at room temperature for 2 h before transformation into DH10B competent cells (Invitrogen, Life Technologies, Carlsbad, CA, USA) by electroporation. Transformed cells were plated on LB agar supplemented with chloramphenicol (12.5 μg/ml), 0.5 mM IPTG, and 50 μg/ml X-gal at 37 °C for 24 h. Library clones were picked (Genetix QBot, Molecular Devices, New Milton, UK), grown in LB amended with 7 % glycerol and chloramphenicol (12.5 mg/l) in 384-well plates, and stored at −80 °C. To evaluate the average insert size, 96 clones were randomly selected from each library and grown in Terrific Broth (TB) supplemented with 12.5 μg/ml chloramphenicol before DNA extraction. BAC DNA was then digested with NotI and resolved by electrophoresis in a 1.0 % agarose gel in 0.5× TBE on the CHEF system (Peterson et al. 2000) using a 5–15-s ramping pulse and 6 V/cm running voltage for 15 h at 14 °C. The gels were stained with ethidium bromide. Both Lambda ladder and MidRange I PFG Markers (New England BioLabs, Ipswich, MA, USA) were used as size references. Whole library copies, individual clones, or macrofilters for the three libraries may be obtained through the Clemson University Genomics Institute (http://www.genome.clemson.edu/online_orders).
All fingerprinting was performed using the four-color HICF SNaPshot method of Luo et al. (2003). BAC DNA was isolated from 1.2 ml cultures in TB supplemented with 12.5 μg/ml chloramphenicol by alkaline lysis in a 96-well format. DNA was digested with five enzymes. Four 6-bp cutters, BamHI, HindIII, XbaI, and XhoI, were chosen to digest DNA over a wide range of GC content, allowing for an unbiased fragmentation of the BAC DNA. One 4-bp cutter, HaeIII, was chosen to reduce the DNA fragment size for HICF analysis. Restriction digestions were incubated at 37 °C for 2 h, and the DNA was labeled using a ABI PRISM SNaPshot multiplex kit and was subsequently loaded on an ABI 3730 DNA analyzer (Applied Biosystems, Foster City, CA, USA). Two rice BAC clones with a known fragment pattern were included as internal controls.
The fingerprints were processed using GeneMapper v4.0 (Applied Biosystems), FPPipeliner 4.0, and FPMiner 2.0 (Bioinforsoft, Beaverton, OR, USA) to call fragment sizes and remove spurious peaks, vector bands, and the internal control clones. Clones yielding less than 10 bands would be statistically unlikely to be accurately placed on the physical map and were excluded from further analysis due to lack of information. Clones with more than 250 bands were excluded because they are likely to have multiple clone contamination.
Physical map construction
The fingerprinting data were analyzed using FPC v9.3 (“FingerPrinted Contigs”; Soderlund et al. 2000; Nelson et al. 2007) for contig assembly. The initial contig assembly was created with high stringency parameters (tolerance 3 and Sulston score of 1e−50). All contigs with more than 10 % questionable (Q) clones were split by the “DQer” function of FPC to reduce misassembly. An incremental re-build strategy was used to sequentially lower the stringency of the build parameters. In each round of calculation, the Sulston score cutoff was reduced by 1e−5 and an “Ends to Ends” merge with parameters “match” 2 and “from end” 34 was performed. This was followed by “Singles to Ends” merge at the same stringency until the final assembly Sulston score was at 1e−30. The average restriction band size was calculated by the average BAC insert size divided by the average number of bands per fingerprint.
Sanger and 454 DNA sequencing were used to generate 450 million bases of EST sequence from Chinese chestnut, American chestnut, northern red oak, white oak, and American beech (http://www.fagaceae.org/sequences), including 847,952 reads from Chinese chestnut and 688,198 reads from American chestnut. Computational analysis of the transcriptomes generated from chestnut blight cankers and healthy tissues revealed ESTs preferentially expressed during canker development in both Chinese chestnut and American chestnut and provided a list of potential candidate genes for blight resistance (Barakat et al. 2009, 2012). The collection of ESTs was analyzed for both SSR and SNP markers to develop a highly informative genetic map. In addition, this rich unigene resource was mined for gene sequences to use for overgo probe design.
The protocol for the pooled overgo hybridizations discussed in the “Results” section is detailed below. A separate set of 23 overgo markers was hybridized against the CM_Mbc library by LGG using a slightly different protocol. The process of selection and the methods used for these markers are detailed in Additional File 1. Low complexity sequences and known repetitive elements were masked from the EST sequences by RepeatMasker (Smit et al. 1996–2010) and Tandem Repeat Finder (Benson 1999). Repeat libraries from RepBase (Jurka et al. 2005) were used by RepeatMasker, with the species set to “viridiplantae” to utilize all repeats from green plants in the database. Overgo probe (Ross et al. 1999) design was performed on the resulting masked sequences using OligoSpawn (Zheng et al. 2006). All oligomers were synthesized by IDT (Integrated DNA Technologies, Coralville, IA, USA).
High-density BAC filters were created on Hybond N(+) membranes (GE Healthcare Life Sciences, Pittsburgh, PA, USA) by an automated QBot (Genetix, New Milton, UK). Each filter contained a total of 18,432 clones double-spotted in a 4 × 4 pattern in a six-field grid. The experimental design consisted of a previously developed 3D pooling strategy that enables resolution of 125 probes through 15 hybridizations (Fang et al. 2010). A pool of probes was used for each of the 15 hybridizations, named X1, X2, X3, X4, and X5 in the X dimension; Y1, Y2, Y3, Y4, and Y5 in the Y dimension; and Z1, Z2, Z3, Z4, and Z5 in the Z dimension. Each probe was assigned to one X pool, one Y pool, and one Z pool, with each probe having a unique address such as (X1, Y1, Z1) or (X3, Y4, Z2), etc., indicative of the pools to which it was included. An automated setup script was used to create the pool addresses for each probe and is available online at http://www.genome.clemson.edu/software/hybdecon/exp_setup. Overgo probes were individually labeled with 32-P at 37 °C for 1 h (McPherson et al. 2001), and the labeled probes were mixed according to the pooling formula, denatured at 95 °C for 10 min and hybridized at 60 °C overnight. Filters were washed with 0.1 % SDS, 1× SSC at 60 °C and exposed to phosphor screens for 24 to 48 h.
The images were recorded by a Typhoon 9400 Imager (GE Healthcare Life Sciences, Uppsala, Sweden). All images were examined using the Hybdecon software package to score the addresses of positively hybridized clones, and the deconvolution function from the software was used to resolve the hits for each probe. Hybdecon is an in-house expanded version of the software HybSweeper (Lazo et al. 2005) and can be downloaded from http://www.genome.clemson.edu/software/hybdecon. A BAC with a successful hybridization in one X pool, one Y pool, and one Z pool was scored as a successful hybridization of the probe assigned to those same three original pools. A BAC with more than one hybridization result in a single dimension (X, Y, or Z) or no hybridization result in a dimension was discarded. This is a very stringent method of hybridization because the probe and BAC must have a successful hybridization three times, one for each dimension of the experiment. Loss of data occurs if the hybridization fails in one of the dimensions or if a BAC is hybridized by more than one probe.
Manual editing of physical map
Two types of evidence were used to merge physical map contigs. BAC clones on the ends of each contig that strongly matched BAC clones at the end of another BAC contig (“overlapping” BACs) were identified using the FPC software. Contigs with clones that hybridized to the same overgo probe were also considered candidates for merging because the majority of overgo sequences are expected to be a single copy. The initial round of editing used parameters of tolerance 3 and Sulston score cutoff at 1e−20. Contigs were merged if they fulfilled the following criteria: (a) overlapped by three or more BACs from both contigs or (b) overlapped by two BACs from both contigs with at least one pair matching at a cutoff value equal to or more significant than 1e−30. For a contig pair with only one shared BAC from one contig matching to multiple BACs in the other, a common overgo marker hybridized to both contigs must further support the merge. The second round of manual contig editing was done at a lower stringency of tolerance 3 and cutoff 1e−15 using the requirements of (a) multiple overlapped clones from both contigs, with the first contig sharing three or more clones with two or more clones from the second contig, and (b) consensus band (CB) map computation recalculated at tolerance 3, cutoff at 1e−20 or lower if the number of overlapping clones did not fulfill the minimal requirement of the first rule.
Comparative genome analyses
To identify poplar orthologs for the comparative genomics study, all chestnut unigenes (i.e., EST contigs, CCall unigene version 2) were compared using BLAST against the poplar (Populus trichocarpa) gene sequences (http://genome.jgi-psf.org/Poptr1_1/Poptr1_1.home.html). Putative orthologs were defined as reciprocal best matches between a chestnut unigene and a poplar gene sequence. The putatively orthologous chestnut unigene sequences were used to design overgos and hybridize as described above. The poplar paralog, expected from the recent whole genome duplication, was found using the VISTA-Point pairwise alignment browser with Poplar v2.0 aligned to Poplar v2.0(dupl.) (Frazer et al. 2004). If a physically mapped contig shared hybridizations to both an orthologous poplar marker and to one or more genetically mapped markers, the location of the poplar ortholog was inferred to be at the same location on the genetic map.
Genetic mapping of chestnut blight resistance identified three QTLs (referred to as Cbr1, Cbr2, and Cbr3 on linkage groups B, F, and G, respectively) (Kubisiak et al., companion manuscript). In order to identify chestnut BAC contigs originating from the QTL regions of the genome, three genetic map markers with high LOD scores (4.2, 4.3, and 3.5, for Cbr1, Cbr2, and Cbr3, respectively) were used for overgo design. The overgos were hybridized against the BAC libraries to identify BAC clones and physical BAC contigs corresponding to the QTLs.
BAC library construction
Construction of BAC libraries from two chestnut species
Average insert size (kb)
# of clones
The HICF method was chosen for fingerprinting because it provides superior sensitivity over the agarose method, makes use of automated sequencing technology to increase throughput, and is able to create a more contiguous physical map (Nelson et al. 2005). A total of 166,161 BACs were fingerprinted: 73,728 from CMCMBb, 92,160 from CMCMBd, and 273 from CM_MBc. Quality control filtering yielded a final set of 126,176 high quality fingerprints, representing an estimated 18× genome coverage.
Physical map construction
Summary of the Chinese chestnut physical map (BAC contig assembly)
Number of clones in physical map
Number of clones in contigs
Size of contigs
Number of contigs
Average length of contigs
Number of bands in contigs
Estimated total physical length in contigs (Mb)
1,311 (=1.7× genome size)a
Number of singletons
Number of overgo probes anchored
Genetic marker hybridization
A total of 1,125 probes were designed and tested against the Chinese chestnut BAC libraries CMCMBb (HindIII) and CMCMBd (EcoRI) libraries. In order to assess chestnut–poplar macrosynteny, 260 overgo probes were designed from Chinese chestnut unigene sequences orthologous to poplar genes. The rest of the probes were designed from Chinese chestnut unigene sequences that contain SSR and SNP markers that were being screened for placement on the genetic map (Kubisiak et al., companion manuscript). This included a set of 43 RFLP marker sequences kindly provided by Dr. Paul Sisco and a set of 23 sequences previously screened against the CM_MBc library (LLG unpublished data, see Additional File 1). The overgo probes were hybridized against the BAC libraries in nine pools of 125 probes each. For each pool, an average of 90 % of the overgo probes successfully hybridized to one or more BAC clones, providing an efficient transfer of genetic information to the physical map. The 10 % failure rate was primarily due to the loss of information inherent in the 3D pooling strategy. Additional factors include the following: the overgo sequence spanning an intron splice site, no or very low BAC coverage in a particular region of the genome, and uneven growth of BAC clones on the filters.
Summary of 831 overgo probes corresponding to genetic map markers assigned to the physical map in Chinese chestnut
Total number of overgo probes
# of markers hybridized to BAC singletons
# of markers hybridized to single BAC contig
# of markers hybridized to 2 BAC contigs
# of markers hybridized to 3 or more BAC contigs
Summary of mapped genetic markers by linkage group anchored to the physical map of Chinese chestnut
# of genetically mapped and physically anchored markersa
# of genetically mapped BAC contigsa
Mb spanned by mapped BAC contigs
Comparative genome organization at low resolution
Macrosynteny between poplar and chestnut was assessed utilizing a set of 260 overgos chosen from chestnut unigenes with reciprocal best hits to poplar genes. For each chestnut overgo, two poplar loci were examined to account for the recent whole genome duplication discovered in the poplar genome (Tuskan et al. 2006). The chestnut unigenes selected for this analysis corresponded to putative poplar orthologs spanning the 19 poplar genome linkage groups and 11 unmapped poplar scaffolds. Contigs that could be localized to a chestnut linkage group via a genetically mapped marker and also to two poplar locations via an ortholog probe were used to assess how the chestnut and poplar genomes compare in organization. Results from 229 successful probes provided comparative data for a total of 260 physical map contigs (Additional File 3). Among these was a set of 131 contigs that could be anchored to chestnut linkage groups by genetically mapped markers.
Chinese chestnut physical map contigs anchored to two or more closely spaced poplar orthologous locations
Phys map contig
Populus location 1
Populus location 2
Putative conserved region
Scaffold 8 from 10.2 to 10.2 Mb
Scaffold 7 from 9.5 to 14.5 Mb
Scaffold 1 from 9.8 to 9.9 Mb
Scaffold 10 from 7.8 to 8.0 Mb
Scaffold 8 from 11.9 to 12.1 Mb
Scaffold 12 from 4.0 to 5.6 Mb
Scaffold 7 from 1.2 to 4.8 Mb
Scaffold 9 from 4.8 to 7.5 Mb
Scaffold 1 from 24.5 to 27.7 Mb
Scaffold 4 from 22.5 to 22.6 Mb
Confirmation of these syntenic blocks and a more in-depth understanding of the evolutionary rearrangements between chestnut and poplar will require finer mapping and more sequence data. This analysis is unable to establish the relative size of conserved syntenic blocks between the two genomes. The size of the physical map contigs limits the effective block size that could be detected, and the small sample size of 260 overgos prevents the discovery of small conserved blocks because of the large distance between each gene. Further research utilizing the entire set of overgo markers and additional sequence resources is ongoing and may yield more comparative information.
Physical mapping of blight-resistant QTLs
The main focus of the genomic characterization of chestnut is to identify candidate genes for chestnut blight resistance and to introgress the resistance into American chestnut. Kubisiak et al. (companion manuscript) mapped three major blight-resistant QTLs (Cbr1, Cbr2, and Cbr3) with EST-based genetic markers. Probes designed for these markers identified BAC clones and their contigs resulting in an integrated genetic physical map for each QTL. For example, the probe for marker CmSNP00927 (Ccall_contig40076_v2) on linkage group B (Cbr1) hybridized to 15 BAC clones, all members of physical map ctg2730. Similarly, probes designed for markers CmSNP01333 (linkage group F, Cbr2) and CmSNP01477 (linkage group G, Cbr3) hybridized to eight BAC clones in ctg11964 and six BAC clones in ctg2356 (Fig. 2). Based on the estimation of 1.74 kb for 1 CB unit, the lengths of these three BAC contigs are 4.4 Mb for ctg2730, 4.4 Mb for ctg11964, and 1.2 Mb for ctg2356. Further comparative mapping of the QTL contigs using genetic map marker sequences against the poplar and peach genomes (http://www.phytozome.net/) identified microsynteny at all three of these regions. These microsyntenies are further described in the companion genetic map manuscript.
The new BAC-based genetically anchored physical map for Chinese chestnut is expected to facilitate map-based cloning and identification of genes involved in blight resistance. These genes can then be introduced through introgressive interspecies breeding strategies (Hebard 2006) or transgene technologies into American chestnuts (Merkle et al. 2007) with the eventual goal of restoring chestnut forests in North America. The physical map will also guide the whole genome sequence assembly and facilitate comparative genomics studies in other tree species. Because of the relatively conserved genome structure in the Fagaceae, detailed information from the chestnut genome is expected to serve as a model for the study of genome organization and the evolution of the Fagaceae as well as provide avenues for study of the genes and gene activity underlying the conservation and evolution of important traits such as disease resistance, phenological adaptation, and wood formation.
The collective span of the physical map contigs is 1,311 Mb, 1.6× of the estimated 794-Mb genome (Kremer et al. 2007). The inflated length from the physical map might be due to the obligate outcrossing nature of the species that resulted in a highly heterozygous genome, thus increasing the complexity of the fingerprints derived from alleles of any region. Consequently, BACs from variable regions of separate parental haplotypes, while often still overlapping enough to assemble into the same contig, would generate extra restriction fragments that appear to not be nonoverlapping and therefore contribute to the expanded size estimate from the physical contig assembly. Similar expansion in physical size estimated from physical maps was also reported from other heterozygous plant species such as grapevine (Moroldo et al. 2008) and poplar (Kelleher et al. 2007). Results from this physical map appear to indicate a relatively complex heterozygous genome. However, heterozygosity does not appear to interfere with marker hybridization in which more than 80 % of the genetic markers mapped preferentially to single contigs. The hybridization pattern confirms that heterozygous but overlapping BACs are usually correctly assembled into the same physical map contig. However, as the markers were derived from ESTs that in general are more conserved than nongenic regions in plant genomes, the effects of heterozygosity might be minimized in the genic regions where hybridizations were localized. Further investigation through the comparison with the whole genome sequence, which is under assembly (http://www.foresthealthinitiative.org/genomics.html), should provide more insights into the genome heterozygosity and provide references to improve the physical map.
The comparative mapping of the chestnut and poplar genomes provides a framework for further study of genome evolution across these lineages. It has been reported that many angiosperms have been through at least two rounds of whole genome duplication (WGD); an ancient WGD occurred early in the evolution of angiosperms followed by a second wave of large-scale genome duplication during further divergence into different lineages (Bowers et al. 2003; Paterson et al. 2004; De Bodt et al. 2005; Cui et al. 2006). In addition to the WGD, segmental duplications are commonly observed in plant genomes (Van de Peer et al. 2009). Collectively, the WGD and segmental duplications followed by translocation, substitution, and deletion play a critical role in the diversity of plant genomes and provide morphological and functional advantages in environmental adaptation (Wendel 2000; Adams and Wendel 2005). Results from the overgo hybridizations do not support a recent large-scale genome duplication in the chestnut lineage. Of the 831 overgo probes (representing 878 genetic markers), 83 % strongly preferentially hybridized to single physical map contigs, suggesting that these markers hybridized to unique loci. Results from more comparative genomics will provide informative clues to better understand the genome variation in this plant lineage.
A better understanding of the specific mechanism and genetic location of blight resistance is expected in the near future due to the robust resources now available for chestnut. Blight resistance is controlled by at least three major QTLs on linkage groups B, F, and G (Cbr1, Cbr2, and Cbr3, respectively), in blight-resistant Chinese chestnut (Kubisiak et al. 1997; Wheeler and Sederoff 2009). Markers flanking the QTL regions were used to design overgo probes to identify the physical map contigs containing potential genes for resistance. While the QTL regions and the corresponding physical map contigs are large, prior research into the biological response of the Chinese chestnut to canker blight will provide guidance for candidate gene selection. The physiologically complex disease resistance reactions have been studied in canker tissue and are known to involve signal sensing for programmed cell death. Expression of sets of “effector genes” carry out the synchronized cell death and activate other defense reactions such as lignin deposition from surrounding cells to restrain further enlargement of canker tissue and block growth of the fungal mycelial fans (Hebard et al. 1984; Anagnostakis 1987; Barakat et al. 2009). Genes found in the BAC sequences will be functionally profiled and those with activities in these particular cellular processes will be prioritized for further research.
Minimum tiling paths of the BAC contigs that span the three blight-resistant QTLs (Cbr1, Cbr2, and Cbr3) are being sequenced by the FHI initiative (J.E.C., M.E.S. unpublished data, http://www.foresthealthinitiative.org/). Strategies for utilizing the sequenced region to identify the actual source of resistance include (a) comparing the sequences to the corresponding homologous regions in peach, poplar, and grape genomes with particular attention to previously identified fungal disease resistance genes; (b) focusing on genes that are preferentially expressed in canker tissue; and (c) selecting, sequencing, and comparing the homologous regions of the American chestnut genome for allelic variation in these two chestnut species. This latter strategy will be enabled by the availability of the American chestnut BAC library. Further association mapping analysis with multiple Chinese chestnut genotypes exhibiting different levels of resistance may reveal the strength of each QTL for conferring resistance as well as reveal any regions controlling resistance that have been overlooked. Confirmation of the candidates for blight resistance may include expression profiling of the genes in different tissues, expression interference or gene disruption in Chinese chestnut followed by evaluation for disease responses, and transformation of American chestnut germplasm for improved disease resistance.
The physical map will continue to be a critical asset for clone-based mapping and sequencing of ecologically important phenotypes. Resistance to Cryphonectria parasitica is not the only trait required for restoration of the American chestnut. Phytophthora cinnamomi, an exotic fungal root pathogen also introduced from China, is a major threat to trees living in lower elevation regions (Jeffers et al. 2009). The chestnut gall wasp (Dryocosmus kuriphilus Yasumatsu) is another devastating pest (Stehli 2006). Natural sources for resistance to either the pest or pathogen can be found in existing germplasm and introgressed using the approach and resources pioneered for the canker blight. Given the discovery that Fagaceae species appear to be conserved in genome structure (Kremer et al. 2007), information from this genome resource can be transferred to other less studied Fagaceae species to improve the quality of other major forest trees as well.
This study was supported by the Plant Genome Research Program (PGRP) from the US National Science Foundation (NSF-PGRP 0605135) awarded to RRS. The Forest Health Initiative (FHI) awarded to AGA supported part of the map integration and comparative analyses. Plants used in this research were obtained from Greg Miller at the Empire Chestnut Company. The authors thank Dr. Paul Sisco for early support through The American Chestnut Foundation (TACF) and for markers in the earliest stages of this project and Dr. Bill Powell and Kathleen Baier for full-length cDNA sequence and in-depth discussion.
This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.