Introduction

Populations of homozygous recombinant inbred lines (RILs) have been widely used for quantitative trait locus (QTL) mapping (Addison et al. 2021; Arif et al. 2021). However, RIL construction requires at least six generations to approach homozygosity using conventional inbreeding or selfing procedures. In contrast, doubled haploidy achieves homozygosity in one generation. Because of the ability to generate homozygosity in such a short time, doubled haploid (DH) populations have been used extensively in plant breeding, especially in maize. The DH lines of maize exhibit high genetic variance or display notable transgressive segregation for specific traits (Bordes et al. 2006; Mayor and Bernardo 2009). Given these unique attributes, DH populations are used for mapping QTLs, backcross breeding (Pillen et al. 2003), bulked segregant analysis (BSA) (Ardiel et al. 2002), construction of genetic maps, elite crossing, and genomic research (Seymour et al. 2012; Lee et al. 2016).

Poplars (the genus Populus) have strong adaptability, wide distributions, and the advantages of rapid growth and easy reproduction. Poplars are also the main afforestation tree species in the world today (Douglas 2017; Jha 2018). Black cottonwood (Populus trichocarpa Torr. & A. Gray ex Hook.) was the first forest species to have a complete genome sequence (Tuskan et al. 2006), and consequently, poplars have become the model woody plant to investigate the molecular biology of trees. Most forest trees have high levels of genetic diversity, presumably due to outcrossing and their large population sizes (Hamrick et al. 1981; Porth and El-Kassaby 2014). Consequently, most have a high genetic load of deleterious or lethal mutations, so inbreeding is greatly limited (Namkoong and Bishir 1987).

Unlike crops that are monoecious and could produce homozygotes by several generations of selfing, most poplars are dioecious, and induction of haploid material from parthenogenetic reproduction or microspore culture may be the most effective method for producing homozygous genomes. Various methods, such as cross-pollination with stressed pollen, heat-treated pollen (Winton and Einspahr 1968) or pollen treated with toluidine Winton blue (Illies 1974), have been used to induce haploid plants. However, anther culture with phytohormone treatment in tissue culture is the most often used method for generating haploids in poplar species (Deutsch et al. 2004; Li et al. 2013; Yang et al. 2017).

Although numerous studies have contributed to the induction methods for generating haploid plants, woody plants are usually recalcitrant to standard procedures. In the last twenty years, there have been no reports of the production of DH populations of woody plants. Therefore, unlike in a wide variety of other plants, little effort has been made to study the genetics and breeding of DH woody plants. In the present study, we constructed a DH population from an F1 hybrid of P. simonii × P. nigra and provide some novel findings. Our results should encourage studies of haploids in poplar and other forest tree species.

Materials and methods

Plant materials

The anther donor tree of P. simonii × P. nigra was located on the Northeast Forestry University campus in Harbin, China. Twigs with flower buds were cut from the donor tree, and bud scales were removed. Catkins were surface sterilized for 30 s in 70% alcohol and then 5 min in 3% sodium hypochlorite (12% available chlorine). The anthers were cultured in Petri dishes on full-strength Murashige and Skoog (MS) medium (Murashige and Skoog 1962) and kept in a light-proof box in a plant growth room until callus derived from the anthers could be tested. The medium was supplemented with 2 mg L−1 2,4-dichlorophenoxyacetic acid (2,4-D), 1 mg L−1 kinetin, and 3 g L−1 Gelrite. Calluses of haploid origin were transferred to differentiation medium to induce adventitious shoots. When the shoots were approximately 2.0 cm tall, they were detached from the callus and transferred to a rooting medium containing 0.2 mg L−1 1-naphthaleneacetic acid (NAA). The growth room was maintained at 25 °C under a 16-h light/8-h dark cycle. After 20 − 25 d, the rooted plantlets were transplanted into soil.

Flow cytometry test

The nuclei for flow cytometry (FCM) were obtained from anther callus, and the young leaves of the donor tree were dissociated by mechanical damage in a nuclear extraction buffer containing 0.004% (w/v) MgCl2·6H2O, 0.025% (w/v) NaCl, 0.0035% (w/v) EDTANa2·2H2O, 0.0095% (w/v) Na2S2O5, 0.0315 M Tris–HCl (pH = 7.0), 0.05% (w/v) PVP-10 and 0.5% (v/v) Triton X-100. Approximately 1 g of anther callus or young leaves of the donor tree was added to 1 mL of nuclear isolation buffer and rapidly chopped using double-sided blades. The suspension was then filtered through a 70 μm nylon filter to remove fragments and large tissue debris. Nuclei were collected by centrifugation (1000 rpm, 8 min), resuspended in 500 μL of propidium iodide (PI) and chilled at 4 °C for 20 min. Samples were put into the flow cytometer, and data were collected for 20,000 cells with the same flow rate, voltage and dye and analyzed for ploidy.

Next-generation sequencing and k-mer analysis

Total genomic DNA was extracted from callus or plantlet leaves using the cetyl trimethyl ammonium bromide (CTAB) method (Porebski et al. 1997). DNA fragment size and homogeneity were visualized on 1% agarose gels, and quantification and absorbance were monitored using a NanoDrop 2000 spectrophotometer (Thermo Scientific, Waltham, MA, USA). Library construction and whole-genome sequencing were performed by the Beijing Genomic Institute (BGI, Wuhan, China) on the MGISEQ-2000 platform. To obtain high-quality reads, fastp (version 0.19.7) was used to filter raw reads, and FastQC (version 0.11.8) used to check the quality of the clean reads (Chen et al. 2018). The k-mer analysis and genome assessment were performed using Genomic Character Estimator (GCE, version 1.0.2) (Liu et al. 2013). The k-mer distribution was obtained using Excel 2019. The primers (1534-F: 5′-CGTGTAGCACTAGATGCCCAATTCAATC-3′, 1534-R: 5′-GCCTCCTCGTCACCTCTAAAGAAACC-3′) were used for the amplification of the flanking sequence of the SNP site, and the primers (1534-Alignment-F: 5′-GCCTTTGCACTTCTTTTGCC-3′, 1534-Alignment-R: 5′-TTGGAATTAGAAGTGGGC-3′) were used to perform Sanger sequencing alignment of the SNP site.

Genetic variation in tree height and basal diameter of DH clones and the F2 generation

Tree height (TH) and basal diameter (BD) of 49 annual F2 families and 7 DH lines were investigated. DH16, DH109, DH127, DH158, DH171, DH212 and DH316 were represented by 40, 13, 51, 53, 18, 16 and 15 individuals, respectively. The TH and BD were measured directly using a tape measure and a Vernier caliper. All data were analyzed using SPSS version 25.0. Family heritability was estimated as follows:

$$R = 1 - \frac{1}{F}$$
(1)

where, F was derived from analysis of variance.

The phenotypic coefficient of variation (PCV) was calculated using the formula:

$${\text{PCV}} = {\text{SD}}/{\text{X}} \times {1}00$$
(2)

where, X and SD are the phenotypic mean and standard deviation of the trait, respectively.

Phenotypic correlation analysis of trait x and trait y was calculated according to the following formula:

$$r_{p12} = \frac{{{\text{Cov}}_{{{\text{p}}12}} }}{{\sqrt {{\upsigma }^{2}_{{{\text{p}}1}} {\upsigma }^{2}_{{{\text{p}}2}} } }}$$
(3)

where, Covp12 was the phenotypic covariance between traits, and \({{\upsigma }^{2}}_{\mathrm{p}1}\) and \({{\upsigma }^{2}}_{\mathrm{p}2}\) were the phenotypic variances for trait x and trait y, respectively.

Selected single guide (sg) RNA and analysis of mutant cell lines

A gene editing system was established using CRISPR‒Cas9 (Ueta et al. 2017). Agrobacterium tumefaciens-mediated transformation of DH callus line L23 was performed as described for tobacco (Nicotiana tabacum L.) BY-2 cells (An 1985). A genomic DNA fragment of PsnPDS was PCR amplified with gene-specific primers (PsnPDS-F: 5’-ATGAGTGCATTGAACTTGAGCTGG-3’, PsnPDS-R: 5’-TTAAGTAATGGTTGCCTCAGTCAAC-3’) designed based on its homologous target gene (Potri.014g148700.8) sequence in Populus trichocarpa. According to the obtained genome sequence and coding mRNA sequence, the 20 bp sequence containing a protospacer adjacent motif (PAM) structure was selected as the sgRNA. The sequence was close to the 5′ end, and the GC content was greater than 40%. For detection of the mutation in the PsnPDS gene in the transgenic L23 cell line, genomic DNA was extracted from a stable transgenic and wild-type (WT) cell line following a typical CTAB extraction protocol. The genomic DNA extracted from the L23 cell line was then used as a template to amplify the endogenous PDS fragment by PCR. PCR was performed using specific primers for the target (PsnPDS-detect-F: 5’-TTAGAGGCAGTGAATCAATGGGC-3’, PsnPDS-detect-R: 5’-TATCGCACAATCTCTCAAGAGTACATG-3’). The PCR product was separated on a 1% agarose gel, and bands were recovered and cloned into the pEASY-T1 vector (TransGen Biotech, Beijing, China). The mutations were identified by Sanger sequencing of individual clones. All sequence results were compared with the reference sequence of the PsnPDS gene by alignment in BioEdit (Hall 1999).

Results

Haploid callus induction

The results show that when the anther bears microspores mostly in the mid-uninucleate and late-uninucleate stages (Figs. 1A, B), the haploid callus induction rate is the highest, and it is almost impossible to obtain haploid callus when microspores are after the late-uninucleate stage. Numerous potential haploid calluses were obtained from poplar anthers after five weeks of culture on induction medium (Fig. 1C). Finally, 330 callus lines of haploid origin were obtained, and the positive rate of haploid origin of all the calluses was approximately 10% (330/3340). The callus lines showed significant differences in growth rate, dispersibility (friability) and morphology, implying that there is significant genetic variance in the callus population (Figs. 1D − F).

Fig. 1
figure 1

Induction of haploid callus. A State of the floral stage used for anther culture. Bar = 0.5 cm. B DAPI staining of mid-to-late-uninucleate microspores. Bar = 20 µm. C The callus was obtained after anther culture for 5 weeks. Red arrows show callus regenerated from anthers. D Different haploid callus lines showed significant variance in morphology. 1–9: Haploid callus lines L2, L19, L34, L35, L65, L69, L91, L96, and L219. E Haploid callus line L158 showed a quick growth rate on solid hormone medium with a tenfold increase in biomass after culture for two weeks. Bar = 1 cm. F Haploid callus line L44 showed obvious rapid mortality. Bar = 1 cm

DH whole-plant population displayed a variety of phenotypes

To obtain whole plants, 116 callus lines of haploid origin were transferred to adventitious shoot-inducing medium under a 16-h light/8-h dark cycle at 25 °C for plant regeneration through organogenesis. When the adventitious shoots were two centimeters in length, they were transferred to rooting medium to generate complete plantlets (Fig. 2A). The rooted plantlets were transferred into soil for continued growth (Fig. 2B). Flow cytometry (FCM) tests showed that all the plantlet lines were DHs, suggesting that there was spontaneous chromosome doubling during the development of adventitious shoots and that haploid poplar plantlets survived at extremely low rates. Finally, 96 callus lines successfully regenerated to whole plants, at which point 20 haploid callus lines had failed to induce shoots.

Fig. 2
figure 2

DH plantlet regeneration and a variety of phenotypes. A Adventitious shoot induced from haploid callus. B Two-year-old DH plants cultured in soil. C DH21 showed a different bark morphology compared to that of the donor control after grafting for 3 years. D The stems of DH4 and DH5 plants showed distinct variation compared to that of the donor control. The transverse section indicates toluidine blue staining. Bar = 200 µm. E and F Two-year-old DH92 and DH171 showed a shrub-like morphology

After being planted in soil, most lines exhibited slow and weak growth compared with that of the control donor tree (Fig. 2B). The DH plants grown under field conditions showed exceptionally high genetic variance. Almost every plant line growing in soil exhibited at least one distinct morphological trait compared to that of other DH lines or the donor tree, such as leaf and stem shape (Figs. 2C − F) or other characters. Some lines such as DH92 and DH171 showed a shrub-like morphology (Figs. 2E, F). The genetic variance in plant height of an F2 population from the half-sib progeny of the donor tree was compared to that of the DHs, and the latter showed significantly higher genetic variance (Table 1). Under the same growth conditions, most DHs possessed weak resistance to biotic or abiotic stress such as drought and infection by spider mites or pathogenic bacteria and viruses. Some plant lines were successfully cultured under field conditions for more than two years.

Table 1 Genetic variation parameters for tree height (TH) and basal diameter (BD) of 7 DH clones and the F2 family

DH plant potential for multiple applications in genetic and breeding research

The DH plants showed homozygous genomes which are useful for genetic or breeding studies. To date, no Populus DH plant line has been widely used as a popular tool in research. Efforts were made to identify ideal DH lines over the past several years. Finally, we found that some DH lines that showed excellent potential as model plants. The cells of several callus lines are highly susceptible to Agrobacterium tumefaciens, and we performed knockout of the phytoene desaturase gene (PDS) with the CRISPR-Cas9 system using L23 line. The transgenic lines showed an albinism phenotype of callus under light culture conditions, suggesting that there was successful knockout of the PDS gene (Figs. 3A–C). We suggest that L23 may be an ideal tool for molecular biological studies. Although most DH lines showed low vigor and a high mortality rate under field conditions due to viral infection or other stresses, some individual lines showed a high survival rate. Some DH21 plants survived for three years under field conditions (Fig. 3D), and this line was previously used in a molecular genetic study (Liu et al. 2021). DH158 depicted ~ 100% survival after two years of cultivation and showed a high growth rate of 2 m in one year of culture (Fig. 3E). Thus, it has the potential to be an excellent candidate for cross breeding in the future.

Fig. 3
figure 3

Haploid potential for multiple applications. A − C CRISPR/Cas9-mediated gene editing in the transgenic L23 cell line. (A) Acquisition of kanamycin-resistant callus after culture for 4 weeks. B − C Phenotypic observation of the transgenic strain when callus was cultured under light conditions. psnpds: mutant strain; WT: Wild type. D DH21 plant (marked with a red arrow) that survived for three years under field conditions. E DH158 showed a high growth rate of 2 m in one year of culture

Protocol for confirmation of haploid origin

Although anther culture is a very effective way to produce haploid poplar plants, an efficient protocol for identifying callus with a haploid origin is not available. In previous studies, simple sequence repeat (SSR) analysis was used for genome homozygosity testing (Deutsch et al. 2004; Yang et al. 2017). However, the design of SSR marker primers and the SSR analysis procedure are time-consuming. To effectively identify calluses with a haploid origin, a simple improved-throughput protocol was established by combining FCM with the SNP identification method. Haploid callus lines were first obtained by determining the ploidy level using FCM (Fig. 4A). For the diploid and other polyploid lines, we checked for genome homozygosity by direct DNA sequencing of PCR products of SNP sites in the donor tree (Fig. 4B). In the present study, the results of de novo genome sequencing were screened, and two SNP sites selected. Forty-five callus lines of haploid origin were surveyed as determined by direct DNA sequencing of PCR products and performed k-mer analysis. Finally, we proved that our method showed 100% accuracy for each SNP site (Fig. 4C). This protocol was effective, as we were able to check more than 500 callus lines per day. The process for DH plant induction described in the present study is shown in Fig. 5.

Fig. 4
figure 4

Confirmation of haploid origin. A DNA ploidy level test using FCM. 1, Anther donor tree. 2, Haploid callus line L21. 3, Anther donor tree and haploid callus line merge. B Detection of heterozygosity of anther callus by SNP site. 1, Anther donor tree. 2, Doubled haploid whole plant of line DH158. 3, Haploid callus line L29. 4, Haploid callus line L171. C Distribution of 17-mer frequency. 1, Anther donor tree. 2, Doubled haploid whole plant of line DH158. 3, Haploid callus line L29. 4, Haploid callus line L171

Fig. 5
figure 5

Process for DH plant induction

Discussion

Heterozygosity of a plant genome can create difficulties in genomic research because, to a variable extent, a heterozygote represents two genome sequences that need to be analyzed simultaneously. For example, both RNA interference (RNAi) and CRISPR technology require that the guiding RNA be highly specific to the target gene sequence, and heterozygosity could reduce specific base pairing and seriously affect the efficiency of gene suppression (Gitlin et al. 2002; Zhou et al. 2015). Although poplars have been used as model woody plants for studying the molecular biology of trees, most have high genome heterozygosity (Ingvarsson and Bernhardsson 2020; Huang et al. 2021). Haploid induction would be the most effective method to achieve homozygosity. However, it is still difficult to obtain enough haploid individuals for poplar genetic research. In the present study, an efficient system for obtaining DH lines through poplar anther culture was developed, and 96 DH whole-plant lines were obtained, representing the first report of the construction of a DH population of woody plants. Haploid and DH poplars will be ideal plant materials for RNAi and CRISPR technology because there should be essentially no SNPs in the genomes of poplars of haploid origin. Homozygosity of haploid and DH poplars will provide high-quality genome assemblies, similar to those of other haploid plants such as apple (Daccord et al. 2017) and rose (Raymond et al. 2018). Therefore, some haploid and DH lines were identified that have the potential to be ideal tools for molecular biology and breeding studies in the future. Our research will contribute to the protocol of haploid and DH plant induction in woody plants.

During the first meiotic division, homologous chromosomes pair and undergo reciprocal crossing over, which recombines linked sequence variation (Lawrence et al. 2017) and generates genetic diversity in haploid gametes so that each haploid product has a unique genetic composition. The DH lines of maize exhibited higher genetic variance than selfing lines (Bordes et al. 2006; Mayor and Bernardo 2009). In theory, the heterozygosity of poplar will result in higher genetic variance among haploids. However, few studies have focused on the genetic variation of haploid poplars because it is difficult to construct DH poplar populations. In this study, both the haploid callus and the DH whole-plant population from P. simonii × P. nigra exhibited very high genetic variance. Because haploid or DH plants carry only half of the chromosomes from their diploid parent, the recessive genes expressed in the donor tree lead to unusual phenotypes in DH plants. In this way, it becomes possible to observe the phenotypes of rare recessive alleles that have never before been observed. DHs can also lead to the identification of recessive lethal or semi lethal genes. Our results also suggested that DH populations would be useful for genome-wide selection for traits such as timber yield and horticultural traits in many important woody plants.