Introduction

Many aspects of tree growth and development change with age, such as wood formation (Xu et al. 2016; Li et al. 2013a, 2014, 2017; Zeng et al. 2017), growth rate (Lai et al. 2014), rooting ability (Peer and Greenwood 2001), and the phase change (Brunner et al. 2017), and these traits are important in forestry. So, revealing the underlying mechanisms is not only helpful to understand the processes of tree growth and development, but also helpful for wood, cutting seedlings, and seed production. Seeds are used to propagate many forest trees, which is still the main means of producing seedlings, at least for larch in China. In addition, the seeds of some trees, such as Pinus koraiensis, are edible and sold commercially. So, shortening the time required to enter into the reproductive phase is of great relevance and economic value, but still poses a challenge. Maintaining the juvenile phase or delaying the onset of the reproductive phase could maintain a high rate of growth and wood production, as well as other traits important for clonal forestry (Wendling et al. 2014b).

Usually, many years are required for forest trees to be able to flower and produce seeds, and so far, the genetic mechanisms underlying this process of sexual reproductive development are still not clear compared with those in herbaceous plants (Bergonzi and Albani 2011; Khan et al. 2014). Mainly due to the prolonged juvenile phase, the efficiency of forest tree breeding is low and needs to be improved. So, investigating the effects of aging on the processes of reproductive development in forest trees is of great theoretical and practical importance.

The phase transition in trees occurs gradually with age and is controlled by genetic and environmental cues as well as silviculture (García-López et al. 2014; Khan et al. 2014; Brunner and Nilsson 2004). During the phase transition, many aspects of growth and development change (Wendling et al. 2014a): the capacity to produce seeds occurs after the transition from the juvenile vegetative to the adult vegetative phase, and reproductive organs appear after the transition from the adult vegetative to the adult reproductive phase. Depicting the molecular features of phase transition with age will help to understand these processes and provide more information about the other physiological changes.

Genes functioning to delay reproductive growth or maintain the state of vegetative growth have been identified from herbaceous plants, such as Arabidopsis, barley, and rice, including microRNA miR156 (Wang et al. 2009; Wu et al. 2009), miR171 (Curaba et al. 2013; Fan et al. 2015), DHD1 (DELAYED HEADING DATE1) (Zhang et al. 2019), EMF1 (Embryonic Flower 1) and EMF2 (Sung et al. 2003), TEM (TEMPRANILLO) (Sgamma et al. 2014), and TFL1 (Terminal Flower 1) (Mohamed et al. 2010; Kotoda and Wada 2005). In gymnosperm trees, genes functioning to change the timing of reproductive growth have also been identified; among them, DAL1, a MADS-box transcription factor from Picea abies, is expressed increasingly from age 5 years, and its over-expression in Arabidopsis results in early flowering (Carlsbecker et al. 2004); in contrast, the over-expression of LaAP2L1 (a heterosis-associated AP2/EREBP transcription factor from Larix) results in late flowering in Arabidopsis (Li et al. 2013b) and over-expression of the PEBP genes identified in conifers also represses flowering in Arabidopsis (Karlgren et al. 2011; Klintenäs et al. 2012). Furthermore, 38 members of the MADS family in Larix, including the putative homologue of DAL1, are also expressed increasingly from age 2 or 5 years (see Table S3 in Li et al. 2017), indicating their involvement in the transition from the juvenile to the adult phase. While the capacity to produce seeds occurs at about 20 years in Picea and 10 years in Larix, the molecular regulatory mechanism of the vegetative-to-reproductive phase transition in trees mediated by these genes is still unclear.

In our previous study of the Larix transcriptome, 10,604 transcripts were assigned to the GO term “vegetative to reproductive phase transition,” 2619 to “vegetative phase change,” 1072 to “regulation of timing of transition from vegetative to reproductive phase,” and 94 to “regulation of vegetative phase change” (Li et al. 2017), all showing genetic features of the control of the phase transition in Larix aging. In addition, based on GO enrichment analysis, the GO term “reproduction” was enriched in genes with upregulated expression in comparison (I) 2 vs 1 year (Fig. 1), indicating that the molecular aspects of reproductive processes occur very early in the life span of Larix, consistent with the findings in Olea europaea L. (García-López et al. 2014). Detailed analysis of these genes is needed if they are to be used as targets for the improvement of reproductive traits. Studying the regulation of their expression with age is helpful to understand the relationships between their expression and the timing of the phase change, which are still not clear for most trees.

Fig. 1
figure 1

Distribution of Gene Ontology (GO) terms associated with the vegetative-to-reproductive phase change after enrichment analysis of differentially expressed genes (DEGs) during aging of Larix kaempferi (Li et al. 2017). The enriched GO terms are highlighted in yellow (upregulated) and purple (downregulated). Five pairwise comparisons were performed in chronological order to identify DEGs: (I) 2 vs 1 year, (II) 5 vs 2 years, (III) 10 vs 5 years, (IV) 25 vs 10 years, and (V) 50 vs 25 years

As for larch, about 10 years is required to achieve the capacity to produce seeds. We hypothesized that the levels of regulators of the phase change increase or decrease gradually with age, and genes, including transcription factors, controlling the phase change have different levels in the juvenile (before 10 years) and mature stages (after 10 years). By comparing the transcriptomes of juvenile vegetative (1- and 2-year-old) and adult reproductive (25- and 50-year-old) phases, which were generated in our previous study (Li et al. 2017), we set out to identify these regulatory genes.

Materials and methods

Sample preparation

The transcriptome data for analysis were generated in our previous study (Li et al. 2017). The uppermost main stems produced in the current year were collected from 1-, 2-, 5-, 10-, 25-, and 50-year-old L. kaempferi trees in July 2011 for RNA extraction and transcriptome library construction and sequencing (Li et al. 2014, 2017). The trees were located in Dagujia seed orchard (42° 22′ N, 124° 51′ E), Liaoning province, in northeast China, from young (1-, 2-, 5-, and 10-year-old), middle-aged (25-year-old), and mature (50-year-old) stands. They were grown from seed and include the vegetative and reproductive phases of L. kaempferi. After removal of branches and needles, the stems from at least three trees from each age category were pooled, frozen in liquid nitrogen, and stored at − 80 °C until RNA extraction.

Identification of differentially expressed genes

Transcript quantification from RNA-Seq data was performed in our previous study (Li et al. 2017). Six sets of sequencing reads were mapped to the assembled reference transcripts using Bowtie (Langmead and Salzberg 2012). RNA-Seq by Expectation Maximization was used to estimate the transcript abundance (Li and Dewey 2011). The expression of genes was normalized with edgeR (empirical analysis of digital gene expression data in R) (Nikolayeva and Robinson 2014). Fragments per kilobase of transcript per million fragments was used to measure the normalized expression value. Based on the result of the quantification of transcripts (Li et al. 2017), here we made four pairwise comparisons to identify the differentially expressed genes (DEGs) in the juvenile vegetative (1- and 2-year-old) and adult reproductive (25- and 50-year-old) phases: (I) 25 vs 1 year old; (II) 50 vs 1 year old; (III) 25 vs 2 years old; and (IV) 50 vs 2 years old (Fig. 2). After pairwise comparison, the DEGs were obtained with stringent cutoffs: a false discovery rate (FDR)-corrected P value cutoff of 0.001 and a minimum expression fold change of 2.

Fig. 2
figure 2

Analytical flowchart of RNA-Seq analysis. 299,637 assembled transcripts generated from the uppermost main stems of 1-, 2-, 5-, 10-, 25-, and 50-year-old larch trees in a previous study (Li et al. 2017) were used to predict the transcription factors, and their expression levels in the juvenile vegetative (1- and 2-year-old) and adult reproductive (25- and 50-year-old) phases were compared. In total, 27 transcription factors shared in four comparisons: (I) 25 vs 1 year, (II) 50 vs 1 year, (III) 25 vs 2 years, and (IV) 50 vs 2 years, were identified

Transcription factor prediction and their expression patterns

The nucleic acid sequences of 299,637 assembled transcripts (Li et al. 2017) were input into a server to predict the transcription factors (http://planttfdb.cbi.pku.edu.cn/prediction.php) (Jin et al. 2017). When the nucleic acid sequences were input, ESTScan 3.0 (Iseli et al. 1999) was employed to identify the coding regions of the input nucleic acid sequences and translate them to protein sequences, which were used to predict transcription factors based on the family assignment rules (http://planttfdb.cbi.pku.edu.cn/help_famschema.php).

The expression patterns of the predicted transcription factors in the juvenile vegetative and adult reproductive phases were analyzed based on the identification of DEGs (Fig. 2). The Venny 2.1 tool was used to find the transcription factors shared among the four comparisons (http://bioinfogp.cnb.csic.es/tools/venny/index.html) (Fig. 2). Then the expression patterns of the shared transcription factors in six age categories were further analyzed to characterize the changes in their transcript levels during aging throughout an entire rotation period. Quantitative reverse transcription polymerase chain reaction analysis had been performed in our previous study to confirm the results of transcript quantification from RNA-Seq data, so this was not repeated here.

Interaction prediction

The translated protein sequences of the transcription factors shared among the four comparisons were input into the STRING server to analyze their interactions in the context of Arabidopsis thaliana (https://string-db.org/) (Szklarczyk et al. 2017).

Result and discussion

Prediction of Larix transcription factors

After input of 299,637 assembled transcripts, 142,475 protein sequences were obtained with the length longer than 50 amino acids; among them, 4377 (3.07%) were predicted to be transcription factors and classified into 55 families (Fig. 3). At least three members were identified in each family and the largest number of transcription factors (411) was in the basic helix-loop-helix family (Fig. 3).

Fig. 3
figure 3

Transcription factors predicted from the transcriptome generated from the uppermost main stems produced in the current year collected from 1-, 2-, 5-, 10-, 25-, and 50-year-old L. kaempferi trees (Li et al. 2014, 2017). In total, 4377 transcripts were predicted to be transcription factors and belonged to 55 families

Up to 8 July 2019, 320,370 transcription factors classified into 58 families were identified from 165 species in PlantTFDB 4.0 (http://planttfdb.cbi.pku.edu.cn/), which covers the main lineages of green plants (Jin et al. 2017), providing a powerful platform for predicting the transcription factors across green plants and studying the transcriptional control of plant growth and development. Genome-wide identification of transcription factors from P. abies 28,354 genes (Nystedt et al. 2013) and Pseudotsuga menziesii 47,874 genes (Neale et al. 2017) shows that there are 1107 (3.9%, classified into 54 families) and 1915 (4%, classified into 54 families) transcription factors, respectively (http://planttfdb.cbi.pku.edu.cn/index.php).

Here transcriptome-wide identification of transcription factors from L. kaempferi assembled transcripts showed that the ratio of transcription factors to genes from L. kaempferi was similar to that from P. abies and P. menziesii, and that almost the same number of transcription factor families existed in these three conifers. These results suggested the reliability and accuracy of the prediction of transcription factor with PlantTFDB 4.0 and conserved features of transcriptional control of conifers’ growth and development. The prediction of transcription factors from trees’ genomes and transcriptomes not only helps to study the transcriptional control of many aspects of tree growth and development, such as wood formation, phase change, and seasonal activity of the meristem, but also helps to identify the targets of larch microRNA to construct regulatory networks at the post-transcriptional level.

Transcriptional regulation of gene expression is a relatively stable mechanism in a time window of 50 years

To identify the transcription factors involved in the control of the phase change, we first made four pairwise comparisons: (I) 25 vs 1 year old; (II) 50 vs 1 year old; (III) 25 vs 2 years old; and (IV) 50 vs 2 years old (Fig. 2, Table 1). Based on this comparison, 12,789 transcripts were identified as DEGs (Table S1), among which 573 were transcription factors based on the prediction results (Table S2). Second, Venn analysis was used to identify the transcription factors shared in the four comparisons (Fig. 2). The results showed that 27 transcription factors from eight families were common, including 18 members of the MADS family (Fig. 4, Table 2). Third, we detected the expression patterns of 27 transcription factors in six age categories, finding that members of the AP2 and Dof families were expressed highly in 1- and 2-year-old trees, but weakly in 25- and 50-year-old trees, while members of another five families (C3H, G2-like, GRAS, MYB-related, and MADS) had the opposite patterns, and one member of the MADS family was only detected in 25- and 50-year-old trees (Table 2). Notably, the number of transcription factors highly expressed in 25- and 50-year-old trees was larger than that of highly expressed 1- and 2-year-old trees (Table 2) and the ratio of differentially expressed transcription factors to DEGs was higher in the upregulated genes (Table 1), suggesting more transcription factors take part in the positive regulation of the phase change.

Table 1 Statistics of differentially expressed transcription factors and transcripts in juvenile vegetative (1- and 2-year-old) and adult reproductive (25- and 50-year-old) phases in Larix kaempferi
Fig. 4
figure 4

Venn diagram of differentially expressed transcription factors in four comparisons: (I) 25 vs 1 year, (II) 50 vs 1 year, (III) 25 vs 2 years, and (IV) 50 vs 2 years in Larix kaempferi

Table 2 Expression patterns of 27 transcription factors in six age categories of Larix kaempferi shared in four comparisons: (I) 25 vs 1 year, (II) 50 vs 1 year, (III) 25 vs 2 years, and (IV) 50 vs 2 years

In our previous study, 12,927 transcripts were identified as DEGs during larch tree aging (Li et al. 2017). To obtain more information about the transcriptional regulation of tree aging, here we used the 4377 predicted transcription factors to annotate the 12,927 DEGs again, and found that 562 (4.35%) were transcription factors (Table S3). In our present study, 573 transcription factors were differentially expressed, having 421 in common with the 562 transcription factors identified previously, and the ratio of differentially expressed transcription factors to DEGs was 4.48% (573/12,789), almost the same as in our previous study. These findings indicated that changes in the expression of transcription factors might be part of, or responsible for, the extensive and complex transcriptome reprogramming during tree aging. They also suggested that the transcriptional regulation of gene expression is a relatively stable mechanism underlying tree growth and development at different stages, and is involved in the phase change at an early stage in the life span.

Transcriptional regulation of the phase change at an early stage in the life span

Modulation of gene expression with genetic transformation methods has been used to shorten the juvenile phase in woody plants (Brunner et al. 2017); for example, downregulation of TFL1 homologues in Populus (Mohamed et al. 2010), Malus (Kotoda et al. 2006; Flachowsky et al. 2012), and Pyrus (Yamagishi et al. 2016; Freiman et al. 2012) results in precocious flowering. Some homologues of members of the MADS family have been overexpressed to shorten the juvenile phase in Betula (Elo et al. 2007; Huang et al. 2014) and Malus (Flachowsky et al. 2007). These reports inspired us to further explore the genetic regulation of the timing of the transition from the vegetative to the reproductive phase in forest trees, and here identification of the transcription factors involved in the phase transition in larch offers new potential targets for genetic manipulation to change the length of the juvenile phase (Table 2).

Entry into the reproductive phase occurs earlier in larch than in spruce; however, an increase in the expression of some members of the MADS family, such as the putative homologue of DAL1 in both larch (Table 2) and spruce (Carlsbecker et al. 2004), is found at the same age, indicating that at that age (5 years), some conserved and unknown biological processes occur in both species. Among the 562 transcription factors identified as DEGs in our previous study, 38 were MADS family members (see Table S3 in Li et al. 2017), also showing the importance of the MADS family in the phase change. So, identification of the targets of MADS family members and their partners will help to reveal these processes, and meanwhile, studying the regulation of the expression of MADS family members with age will also help to understand the age control of tree growth and development.

Based on the expression patterns of the transcription factors identified here (Table 2), some could be used as molecular markers to distinguish the special stages of tree growth and development and determine the state of plant materials after rejuvenation. For example, members of the AP2 (comp128327_c0_seq16) and ERF (comp129386_c0_seq9) families were only detected in 1- and 2-year-old trees, while several members of the MADS family were undetectable in these trees (Table 2). Notably, one member of the MADS family (comp126977_c0_seq2) was only detected in 25- and 50-year-old trees (Table 2). Nevertheless, to dissect changes in their expression with age, examples from more ages or developmental stages are still needed.

Regulatory network of the phase change

Environmental control of tree growth and development occurs throughout the lifetime of a tree, and its effects might be evident in the tree at any time—once a certain level of a tree’s responses is reached, changes in anatomy and/or physiology appear. Several genes have been identified as both regulators of the reproductive phase change and integrators of environmental signaling in annual plants (Blümel et al. 2015; Song et al. 2013), such as SUPPRESSOR OF OVEREXPRESSION OF CONSTANS 1 (SOC1) and MIR172. Here, the expression of several putative homologues of SOC1 in larch were found to increase with age (Table 2); the expression of two putative homologues of AP2 family members, of which some are targets of miR172 (Zhu and Helliwell 2011), decreased with age (Table 2). These results indicated that photoperiod, ambient temperature, and age pathways take part in the phase change of larch, and conserved mechanisms governing growth and development in response to environmental cues occur in gymnosperms and angiosperms (Blümel et al. 2015; Song et al. 2013). Low levels of key regulators of flowering time at an early stage in the lifetime of larch might be the reason for no “flowering.”

In addition, we concluded that other transcription factors identified here also play roles in the phase change, because the functions of their putative homologues in other plants have been studied. For example, plants with elevated levels of a Dof transcription factor, CYCLING DOF FACTOR 1, flower late (Imaizumi et al. 2005); a member of the MYB family, ASYMMETRIC LEAVES 1, accelerates flowering by increasing the amount of gibberellin-4 and facilitating CONSTANS to induce the expression of FLOWERING LOCUS T (Song et al. 2012); in barley and rice, miR171 regulation of some members of the GRAS (GAI-RGA-SCR) family mediate the phase change from vegetative to reproductive development (Curaba et al. 2013; Fan et al. 2015), and in rice, another member of the GRAS family, DHD1, also controls the flowering time (Zhang et al. 2019).

Based on the expression patterns, annotations and predicted interactions of transcription factors identified here (Table 2), and the flowering time gene network in Arabidopsis thaliana (Blümel et al. 2015; Song et al. 2013), we propose a regulatory network model for the phase change from vegetative to reproductive development in larch (Fig. 5), which is helpful to understand the relationships of these transcription factors in the context of the current flowering pathways. However more work is still required to verify their functions and interactions with other molecular biology techniques.

Fig. 5
figure 5

Regulatory network model proposed for the Larix kaempferi phase change from vegetative to reproductive development based on the expression patterns, annotations, and predicted interactions of 27 transcription factors shared in four comparisons: (I) 25 vs 1 year, (II) 50 vs 1 year, (III) 25 vs 2 years, and (IV) 50 vs 2 years, and the flowering time gene network in Arabidopsis thaliana (Blümel et al. 2015; Song et al. 2013)

Taken together, we conclude that the phase change from vegetative to reproductive development during aging of larch is orchestrated by putative homologues of major developmental regulators in angiosperms, and they constitute a complex regulatory network (Fig. 5). Although the positions of some of these homologues in the network cannot be confirmed, their expression levels increase or decrease gradually year by year over the course of aging, characteristic of regulators of the phase change.