An Analysis of Poplar Growth and Quality Traits to Facilitate Identification of Climate-Adapted Plant Material for Sweden

Poplar plantations harbor large potential as a renewable source of biomass for bioenergy and other industrial applications. The overall aim of this study is to analyze growth, phenology, stem form, and branching characteristics of 32 poplar clones grown in a trial in southern Sweden for their suitability to be grown as industrial feedstock. In a linear mixed model, performed for diameter at breast height and stem volume, the precision was improved by the use of two competition indices. The significance of phenology and quality characteristics for growth performance and ranking of poplar clones was evaluated through genotypic correlations, and multivariate hierarchical cluster analysis used to group the material. All traits showed moderate to high broad sense heritability. In general, higher stem volume was positively correlated with later leaf senescence, and uncorrelated with spring phenology. Selection efficiency for stem diameter and height was greatly improved between age 3 and 6 years allowing a better precision in selecting a subset of clones to be further tested in production plots and pilot plantations. Two commercial Populus maximowiczii Henry × trichocarpa Torr. & Gray cultivars performed best, while some intraspecific hybrids of P. trichocarpa are considered useful to genetically diversify commercial plantations in Southern Sweden (Belgian clones) or establish plantations in north-central parts of Sweden (Swedish clones). The cluster analysis emphasized growth traits and the grouping of the clones corresponded to their origin (or parentage). The results will facilitate decisions on the use of studied material in breeding, further testing and commercial deployment of poplar plantations in Sweden.


Introduction
Transformation towards sustainable land and material use benefits from an increased availability of fossil-free and regionally produced raw materials such as woody biomass. Fast-growing Populus species represent one potentially important biomass source [1,2] with high biomass productivity [3,4] and multiple industrial and environmental uses [5,6]. Currently, Swedish poplar feedstock is mixed in the pulping process (particularly for textile pulp) with other tree species, while tops and branches are used as fuel in combined heat and power plants (Anders Ekstrand, Södra-personal communication). Nevertheless, the anticipated use of poplar wood is much broader and includes sawn wood, plywood, and refined fuel products for vehicles and aviation. In any case, the economic and technological viability of poplars rests heavily on the access to reliable feedstock supply and its quality characteristics. This precondition greatly depends on availability of highly productive and climate-adapted poplar varieties. Currently, in Sweden, commercially deployable poplar varieties are reduced to clone "OP-42" (Populus maximowiczii Henry × P. trichocarpa Torr. & Gray) crossed in the1920s by Stout and Schreiner [7][8][9]. This clone has been planted since the 1990s on some thousand hectares at latitudes 56°-59°N reaching an annual biomass production of up to 10 Mg ha −1 year −1 [4,10] within rotations of 18-25 years. However, operating with only one commercial variety represents an obstacle to further expansion of poplar plantations on the potentially available land area (0.5-2.5 million ha [11]), particularly in more harsh environments at higher latitudes.
Breeding and selection of Populus in Sweden was first focused on hybrid aspen, whereas poplars have been only occasionally tested [12]. Development of bioenergy market in the 1970s and 1980s attracted renewed interest to poplars as an economically interesting alternative for farmers on the land set-aside from an overproducing agricultural sector. Poplar germplasm, originally introduced mainly from collections in the Netherlands and Germany, was later expanded with Belgian material consisting of P. trichocarpa and its hybrids with P. deltoides Bartr. Ex Marsh. and with IUFRO [13] material imported mainly from Finland [14]. After initial evaluations, Belgian clones were considered unsuitable for central Sweden, but partly usable in southern Sweden [15,16]. In 1990s, a short poplar breeding program was started as a result of a growing insight into the importance of adaptation of material for Swedish climate conditions. The focus was entirely on P. trichocarpa using parentage from maritimecontinental and south-north transects at 45-60°N [17,18]. A relatively small portion of the resulting progeny population was saved and is presently under field evaluation across the whole Baltic Region. Six clones from this program were included in the trial presented here.
So far, poplar breeding programs have been mainly focused on producing material suitable for temperate climate regions at latitudes between 30°N and 50°N [19][20][21][22]. Such material lacks necessary hardiness when planted at higher latitudes (55°N-65°N), which has been confirmed in a number of north European field trials [15,16,18,[23][24][25]. This maladaptation has its cause in phenotypic mismatch, usually related to delayed autumnal transition from growth to dormancy [26,27]. In P. trichocarpa, this transition is mainly governed by photoperiod signals for growth cessation triggered at longer days in northern compared to southern ecotypes [26,28]. Thus, while highly productive varieties of southern origin tend to set buds too late or inadequately when grown at higher latitudes, there is also a clear trend of prolonged periods of shoot elongation to be correlated with higher biomass production [15,16,18,[23][24][25][29][30][31][32][33]. Such a trade-off between competitive advantages of a longer seasonal growth and survival due to timely termination of growth is under strong selection pressure in natural populations of Populus spp. [31,34]. Consequently, even the adaptation of poplars outside their natural range will rest on our ability to develop criteria for adequate balance of biomass productivity and phenology timing under the specific photoperiod and temperature conditions in different parts of Sweden.
In many studies on poplar clones, multiple traits have been used to evaluate poplar material using multivariate analytic tools [30,[35][36][37][38][39]. In the present study, we use some quality traits along with growth and phenology variables, adding thus the potential end use value as the evaluation criterion. The overall aim of this study was to analyze growth, phenology, stem form, and branching characteristics of 32 poplar clones grown in southern Sweden for their suitability as industrial feedstock. The specific aims were to (i) estimate the genotypic components of clonal variation for various traits, (ii) evaluate correlations between phenology and clonal growth performance, (iii) analyze age-dependent correlations of growth traits and their impact on selection efficiency, and (iv) group the material using multiple traits for growth performance, phenology, and quality characteristics. This study will improve our understanding of the importance of phenological and stem quality traits in the selection of poplar clones for Swedish climate conditions.

Climate and Soil
The trial was located at the Swedish west coast, outside the city of Halmstad (56°41′ N; 12°56′ E), a region with temperate climate, naturally dominated by deciduous broadleaved forests. During the study period, between 2007 and 2018, all important climate parameters corresponded to the average values for the standard reference period 1961-90. The mean annual temperature was 8.2°C, equaling an average value of 1783 growing degree days, whereas the total annual precipitation was 864 mm, with 371 mm falling during the period of intensive growth in May-August. The temperature-based average length of growing season (SMHI 2020-Open data, Swedish Meteorological and Hydrological Institute) is 210 days beginning in early April and ending in the first week of November. The mean temperature during the growing season was 13.1°C and the first autumn frosts occurred usually first between October 31 and November 15 or later.
The soil has developed on relatively deep postglacial sand (SGU-Geological Survey of Sweden) with sandy loam/loamy sand texture for 0-20 and 20-50-cm layers, respectively. Blocks I and II were located on the northern slope, whilst blocks III and IV were placed at the bottom of the slope where ground water table was occasionally high, decreasing the tree survival in a part of block III. The soil pH was 5.5; there was 2% of organic C within 20 cm topsoil, while the NO 3 and NH 4 nitrogen amounted to approximately 30 kg ha −1 .

Trial Design and Plant Material
After appropriate soil preparation and weed treatment the trial was established in June 2007 by planting 1-year-old, containerized seedlings of 32 poplar clones. A randomized 4-block design was applied with a 3 × 3 m planting spacing, 5-trees line plots, totaling 20 trees per clone. The poplar clones originated from four major sources; (i) P. trichocarpa and P. trichocarpa × deltoides bred at Geraardsbergen, Belgium (BELt and BELtd), preliminary evaluated by Christersson [16], (ii) six clones of P. trichocarpa bred for central Sweden (58°-62°N, SLUc-for central Sweden and SLUn-for the southern part of Swedish northern provinces) [14], (iii) 17 progenies of P. trichocarpa from Prince George (PG, 54°N) including two additional progenies of P. balsamifera from Forth Nelson (FN, 58°N), British Columbia, Canada, and finally (iv) two reference clones (REF) of P. maximowiczii × trichocarpa, "OP-42" and "OP-41" ( Table 1). The Kinship coefficient between the two reference clones is 0.464 (Pär Ingvarsson, SLU, Uppsala, Sweden-personal communication). Considering that the expected value for monozygotic twins is 0.5 and 0.25 for full-sibs, it appears that the clone identified as "OP-41" most probably is "OP-42." However, we decided to treat them as two different clones in our statistical analysis.

Measurements and Sampling
Diameter at breast height (D) and tree height (H) were measured repeatedly in 3-year intervals. In 2015, we visually assessed stem form and branching, and measured the occurrence and height of top dominance loss, crooks and forks (Table S1). Phenology of bud burst (BB) was inventoried in 4-6-day intervals between end of March and the beginning of May 2010 by observing six different developmental stages: BB0, closed bud; BB1, initial shoot emergence; BB2 and BB3, the two phases defined by the length of primordial expansion outside the bud scales; BB4, leaves half shed (still rolled), bud scales wide open or shed; and BB5, leaves completely unrolled. Julian day (doy) for bud burst phase BB3 was selected as the starting point of canopy growth. To compensate for inventor's bias and due to periodic oscillation of temperature between screening occasions, we used all the observations of bud burst to slightly correct timing of BB3 for each clone (Supplementary Information, S2). Changes in autumn crown color and defoliation were registered between the end of September and mid-November 2018. The color change was screened separately for 1-year-old shoots (TC), and for older parts of crown (CRC) using the 0-2 scale, i.e., 0, green; 1, yellow-green; and 2, yellow. The same scale was used for leaf shed (LS), i.e., 0, less than 33%; 1-33% to 66%; and 2, more than 66% leaves shed (Table 2). Bud burst was inventoried in 2010 and leaf senescence in 2018. Consequently, clone canopy durations (CADU), calculated between BB3 and different autumn variables were hypothetical. Here, we made assumption that relative comparison of CADU among different clones will be meaningful, and also usable to compare the correlations of CADU with growth traits. In some cases, doy of phenological stages was not recorded, so that linear estimates were made for each single tree between two consecutive observations, for example, between BB2 and BB4 or for CRC2 when it was not registered.

Calculations and Statistical Analysis
In January 2019, 39 trees of 13 clones were thinned out from the trial and used to model volume growth based on diameter measurements along the stem in 2-m segments. Obtained parameters were used to estimate stem volume of all clones in the trial at all ages according to following function: where V (dm 3 , i.e., m 3 /1000) is stem volume on bark, D (cm, i.e. mm/10) is the breast height diameter, H (m) is total tree Variance analysis of phenology variables was performed using mixed model procedure with the restricted maximum likelihood method to estimate the effects of fixed and random components. Z and F tests were used to determine significance of random and fixed effects respectively. The following linear model was used for analyses of all phenology variables: where Y jkl represents an observation on the l th line plot (average value of the five individuals in a line plot) on a j th clone within k th block, μ is the overall mean, and e jkl is random error. The random clonal component was assumed to have normal distribution with the expectation of zero and corresponding σ 2 c and σ 2 e . The analyzed variables are presented in Table 2.
In the analyses of D, H, and V, and the corresponding periodic increments (zD, zH, zV) additional random spatial components were introduced in the mixed model representing row-column within-block matrices. Two competition indices were also calculated and used as covariates, i.e., sum of line lengths (SLL) [40] and neighbor plots (NP). The competition indices compensated for differences in growing space caused by early mortality and the edge effects. SLL was calculated for every single tree, then averaged for each line plot entry: where SLL i is the competition index value of a target tree, D i is the breast height diameter of a target tree, D j is the breast height diameter of a neighbor tree, and RDist is the relative distance to the neighbor trees in a quadratic design, equaling 1 for perpendicular and 1.41 for diagonal neighbors. The NP index was emphasizing the competition effects of the two nearest neighboring line plots using the mean values per plot and was calculated as where NP tp is the index value for the target plot, D tp is the mean breast height diameter of the target plot, and D np1 and D np2 are the mean breast height diameters of the nearest line plots on both sides of the target line plot. Analysis of variance for growth variables at different ages was performed using a mixed model procedure with the following basic linear model: Table 2 List of variables and acronyms used to characterize growth, phenology, and quality of trees grown in a 12-year-old poplar trial in southern Sweden

Variables Definition
Growth traits D, H, V Cumulative growth of D, diameter at breast height (1.3 m); H, total tree height; V, stem volume at age 3, 6, 9, and 12, i.e., D 3 , D 6 zD, zH, zV Periodic growth increments of D, H and V for the period 3-6, 6-9 and 9-12 years, or for the period 6-12 years, i.e., zD 3-6 , zD 6-9 etc. where Y ijkl represents the estimated value of clone i, block j in row k(j), and column l(j), and e ijkl is random error, while C i , r k(j) , and c l(j) were entered as random variables in the model. The competition indices, SLL and NP, were entered as covariances always representing the relationships among neighbor trees and plots from the end of the previous 3-year period, i.e., in the model for D 6 (D at age 6) we used competition indices calculated for trees and plots at the age of 3 years. The model runs for the first 3-year period did not include competition indices as they were negligible at this stage with the 3 × 3 m spacing applied here. The competition indices were omitted from the analysis for H and zH, as the effect of competition on height increment could not be clearly anticipated. Block effects were omitted in the analysis of variance for stem form and branching as well as competition indices except for applying SLL for branchiness and branch thickness. Variance components of random effects were derived from the mixed models runs for each trait. Genetic relatedness ( Table 1) was not taken into account and clones were treated as independent genotypes. The genotypic clonal variance component, here corresponding to broad sense heritability, was calculated as where σ 2 c is the clonal variance component and σ 2 e is the residual variance. The genotypic coefficient of variation was calculated as follows: where b μ is the predicted grand mean in the mixed model. The repeatability of clonal means was estimated as follows: where k represents the harmonic mean number of replications per clone, i.e., harmonic mean number of line plots. The standard errors of repeatability estimates were calculated according to following formula [41]: where N is the number of clones and n the total number of entries. The Pearson correlation coefficient was calculated for a large number of traits, while the relative selection efficiency of a trait Y (RS y ) as related to the direct selection of the target trait X (S x ), in this case V 12 : where H y and H x equal square root of broad sense heritability for the indirect trait Y (σ GY ) and the directly selected trait X (σ GX ), whereas r G is the genotypic correlation between traits X and Y [42]. Hierarchical cluster analysis was used to visualize the structuring of the tested material according to selected traits. The analysis included four variables for growth performance (D 12 , H 12 , zD 6-12 , zH 6-12 ), two for phenology (BB3 and CRC2), one for stem form (STR) and one for branchiness (BRNS). Prior to analysis all eight variables were standardized at the range − 1 to 1. Euclidian distance was used as a measure of similarity that was fixed at 70 and complete linkage was applied as a clustering algorithm. The cluster analysis and all other statistical runs were performed in Minitab 18.

Superior Performance of Reference Clones
The two reference clones (REF) established their dominance during the period between age 3 and 6 years reaching at age 12 a mean diameter of 260 mm and a height above 20 m (Fig. 1). Belgian (BELt) and Swedish (SLUc) clones achieved on average 70% of REF stem volume by the age of 12 years. Across the 12-year period, BELt material was steadily closing the gap to REF, while SLUc decreased their relative performance from the initial 94% of REF at age 3 (Fig. 1d). The superior performance of REF clones is reflected by a sustained better diameter growth (Fig. 1a), along with good height growth performance except during the last 3-year period (age 9-12 years, Fig. 1c). The Belgian T × D (Table 1) clones (BELtd), Swedish SLUn, and four best PG clones (PG4best) reached only 40% of REF stem volume at age 12 years. Apart from REF, the best performing new clone was "21.9" (SLUc) with 18% lower mean stem volume compared to "OP-42." The broad sense heritabilities (σ 2 G ) for the growth variables were high (cumulative D, H, and V) or moderate to high (periodic growth increments zD, zH, and zV), ranging from 0.43 for zD 9-12 to 0.87 for H 12 . The (σ 2 G ) values increased slightly with age for cumulative growth and decreased for periodic increments. The exception from this trend was σ 2 G for zV that stayed unchanged during the period 3-12 years (Table 3).

Strong Age-Age Correlations Enabled Early Selection of Clones
The correlations between values of traits at different ages, known as age × age correlations (r G ), were 0.81 and 0.90 for D 3 × D 12 , and H 3 × H 12 respectively. Initial growth variables, i.e., D 3 and H 3 , were moderately to strongly correlated with subsequent periodic 3-year increments ranging from 0.57 to 0.90 (Table S2A). The merit of early selection in relation to the target trait (i.e., V 12 , stem volume at age 12) was calculated as selection efficiency (eq. 10). This value was high already after 3 years (0.80-0.83) and increased towards the target age, particularly for D and V that already at age 6 reached a value of Table 3 Results of the linear mixed model analysis of growth, phenology, stem form, and branching properties of poplar clones grown in southern Sweden. Variance components for random effects were divided between the clone effects (σ G 2 , corresponding to broad sense heritability) and error (σ e 2 ), which includes both random error and within-block variance component that was relatively small in most cases. R c 2 is repeatability of clonal means, CV G and CV e are coefficients of variation of σ G 2 and σ e 2 ; SLL and NP are the competition indices; F and p values are provided for the effects of block and two competition indices used as covariance in some models. The variables were described in Table 2 Trait   0.94 and 0.95 (Fig. 2). The selection efficiency calculated for the subset of 17 best performing clones had lower values initially (i.e., 0.59 for D 3 ), increasing to 0.88 for D 6 (Fig. 2). Thus, the precision of an early estimate increased substantially between age 3 and 6 years and was generally higher for D and V compared to H. However, despite a high selection efficiency at age 6 there were substantial changes in clonal rank within the subset of the 17 best clones between age 6 and 12 (Fig. 3). These changes were caused by differential growth development of different groups of clones (see SLUc and BELt in Fig. 1d) also reflected in lower correlation coefficients between cumulative and periodic growth traits (Table S3).

Advantageous Stem Form and Branching Can Be Combined with High Growth Performance
The stem straightness (STR) was > 2 for most of the PG and the two REF clones, and < 2 for the SLU and BELt clones. The score for branch angle (BRA) in the best producing clones was low (i.e., obtuse angle) except for "69.037/2" and "70.038/67" with higher scores of 2.4 and 2.3 respectively. Both branch thickness (BRTH) and branchiness (BRNS) were lowest for the SLU material ( Table 4). The variance components of clonal effect (σ 2 G ) were moderate to strong, being largest (0.80) for BRNS (Tables 3  and 4).
Crooks occurred frequently (26%), but only 8% and 10% of the trees had discontinuous stems (loss of apical dominance in the tree crown) and forks (Table 5). Crooks and forks occurred usually at 50-60% of tree height at age 9 (Fig. S1, supplement) and were more frequently found on Swedish and Belgian material compared to the reference clones. The Fig. 1 The periodic diameter, height and stem volume (a, b, and c) increment of poplar clones grown in southern Sweden. The standard error bars were omitted to increase the readability of the graph. The cumulative growths were presented as relative to REF clones (d). The clones were grouped according to Table 1 except for PG4best representing the values of the four best performing PG clones with respect to stem volume at age 12 (V 12 )  Table 3) by values of D, H, and V at age 3, 6, and 9 years crooks were particularly often recorded on clones "23.4" (87%), "70.038/67" (44%), and "70.038/20" (58%). Forks were most frequently found on Belgian T × D hybrid "70.045/1" (27%), on two reference clones (20% and 23%) and on some PG clones.
The assessed stem form and branching characteristics were in general positively correlated to each other indicating that trees with thicker and denser branching had less straight stems with branches attached at a sharper angle. The same correlation pattern was also found within the two subsets of clones. Out of all the branching characteristics, particularly BRNS was negatively (low score for BRNS = less branchy appearance) correlated to volume and height. Also, BRTH and BRA were negatively correlated with growth, but significantly only for some of the growth variables. The correlations between STR and growth were not significant. Weak to moderate negative correlations of stem form and branching to growth indicated that desirable quality characteristics were generally found in better growing clones (Table S3 A and B).

Canopy Duration and Leaf Senescence, but Not Bud Burst, are Strongly Correlated to Growth
Correlations among all analyzed variables were performed for (i) the whole set of 32 clones (Table S2 A and B), (ii) a subset of 17 best performing clones, and (iii) a subset containing only PG clones (Table S3 A and B). The same general pattern of positive, mostly significant, correlations between growth, leaf phenology, and canopy duration was present for the whole set and within the two subsets, where correlation coefficients were weaker and in many cases non-significant, particularly in the PG subset of clones. In contrast, growth and bud burst (BB3) were uncorrelated (Fig. 4). Coefficients for correlation of growth to autumn leaf phenology and canopy duration increased with age (Fig. 5). This trend was particularly consistent for both cumulative and periodic volume increment. The correlations between growth and autumn leaf phenology variables were stronger for the subset of 17 best clones compared to the PG subset (Fig. 5a, c). The strong correlations obtained between growth and phenology traits are presented by the slope of regression lines in Fig. 4a-e. For example, the values of diameter (D 12 ) and height (H 12 ) at age 12 increased by 3.5 mm and 2.8 dm (i.e., mm/100) respectively for each day of prolonged growth until the leaf coloring stage (CRC2, i.e., doy for leaves turning yellow). The slope of regression lines of growth traits in relation to canopy duration (CADU) were almost identical (see equations presented in Fig. 4a-e).

Hierarchical Cluster Analysis Visualizes the Links Between Genetic Background, Growth, and Quality Characteristics
The resulting dendrogram from hierarchical cluster analysis was presented in Fig. 6. It divides the studied poplar material into two main branches with high performing clones in one branch (clusters 1, 2, 5, and 7) and all PG material including the SLUn and BELtd clones contained within the other branch. The four clusters of the first branch were separated mainly on the basis of cumulative and periodic growth for D Fig. 3 The relationship between clone ranking for stem volume at age 12 and 6 years presented for the subset of 17 best growing poplar clones in the trial in southern Sweden. The dashed line represents the correlation coefficient of 1 between the ranks at different ages. The corresponding Pearson correlation coefficient between the two traits was 0.89 and the selection coefficient (RS y /S x ) for V 6 against V 12 was 0.88 (see Fig. 2). Clonal rank for clones above the 1/1 line decreases, while the opposite applies to the clones below the 1/1 line and H ( Table 6). The division of clones into clusters is based on their proximity of cluster centroids with a strong tendency to reflect the common genetic origin of the clones. However, clone "23.4" (one-clone cluster 2) was separated from its halfsibs in cluster 1 based on growth characteristics, but also by the differential influence of its phenology (CRC2, BB3) and stem form (STR). Clustering of PG material provided more specific insights into this group of clones. Cluster 10, for example, contains the clones "PG2-22" and "PG5-5" with relatively weak diameter growth, but a high values for H 12 and zH [6][7][8][9][10][11][12] . A low score for STR and BRNS in cluster 10 stands in a sharp contrast to the four highly producing clones (in terms of V 12 ) of clones in cluster 9 that had poor scores for the two quality variables.
Cluster 3 contains the full-sibs "44.7" and "44.11" (SLUn) characterized by a considerable height growth and the most favorable STR and BRNS. The two BELtd hybrids were separated within cluster 4 by late bud burst, a relatively early autumn leaf coloring, and a poor height compared to diameter growth, i.e., low H/D value.

Discussion
In this study, we compared two commercial P. maximowiczii × trichocarpa clones with new, potentially promising, material of P. trichocarpa and P. trichocarpa × deltoides bred (SLU) or selected (BEL) specifically for Swedish photoperiod and climate conditions. This clone evaluation presents results after 12 years of growth, whereas a majority of comparable studies on poplars are from younger trials. Growth data from 3-year periods allowed us to estimate agedependent correlation coefficients for growth performance, and to establish correlations between single-year phenological observations and growth at different ages. Growth performance was additionally emphasized by using 6-year periodic increments (zD 6-12 and zH 6-12 ) in a hierarchical cluster analysis. The correlation analysis performed for slower growing progenies from British Columbia revealed that some general correlation patterns present in the material of broader geographic origin can also be observed within a single geographic origin  (Table S3). Fast growth and a substantial phenological variation provided good prerequisites to reveal correlation patterns among growth, phenology, and branching traits.
In addition, the analysis of variations in stem form and branching represents an important, but rarely described aspect of the selection of poplar clones. Table 5 Occurrence of stem deviations in 9-year-old poplar clones tested in Arlösa, southern Sweden. For definition of clone groups see Table 1 Characteristics of stem deviations  values for CRC2, CADU, and BB3. The slope of increasing Y over X is provided in equations for CRC2 and CADU. Clones were grouped according to clustering from the multivariate analysis presented in Fig. 6 and Table 6 High Heritability Values and Strong Age-Age

Correlations Increase the Precision of Early Clonal Selection
First, it needs to be stressed that the high broad sense heritability values (σ 2 G , hereafter "heritability") obtained in this study for the growth traits were likely biased upward (i.e., overestimated). For instance, (i) heritability obtained from single-site trials lacks separation between genetic and genetic × environment effects present in the studies where the trial design includes replications on several locations. Thus, in this study the potential genetic × environment variance was The change with time of correlation coefficients between growth and phenology. Cumulative growth ("Cum") and periodic growth increment ("Per") were shown in separate panels in relations to average autumn leaf phenology ("COL&SH") and canopy duration ("CADU").
Other potential sources of upward bias were (ii) relatively homogeneous and favorable growing conditions allowing the full expression of genetic potential reducing the spurious environmental effects; (iii) large heterogeneity of tested material; (iv) possible competition effects, which we tried to accommodate for by using competition indices as covariates; (v) exclusion of genetic relatedness of clones as a term in the model; and (vi) the use of line plot values as entries in the analysis, thus disregarding the variation present among individual trees within line plots. The heritabilities calculated on the basis of line plots are difficult to compare with individual tree heritabilities but are useful for the purpose of clonal selection as it was applied in the present study [44]. We did not use family terms in the model due to a complexity already imposed by the inclusion of raw-column matrices within blocks as well as the competition indices. In this study, the heritability values (σ 2 G =0.74-0.87) for cumulative growth (D, H, V) were similar or higher than values reported from other single-site Populus trials. However, some authors reported repeatability of clonal means as a measure of heritability (R c , eq. 8, Table 3). The R c values calculated for growth variables presented in Table 3 were higher (0.81-0.97) than the corresponding values (0.65-0.91) reported from two 3year-old Lithuanian poplar trials planted with more than a 100 clones of different origin [18]. Heritabilities similar or lower (0.38-0.89) to those obtained in our study were reported from two Swedish poplar trials, 10 and 14 years old [23,24], as well as from several, 8-9 years old, hybrid aspen trials (0.23-0.63) [42]. For D and H, heritability increased between age 6 and 9 years, while staying constant for V after the initial increase between age 3 and 6 years (Table 3). Other authors compared heritabilities for cumulative growth variables in conifer species reporting either changing or approximately constant values with increasing tree age [45][46][47][48]. Stener and Karlsson [42] accounted the increase of heritability values with age in hybrid aspen trials exclusively to competition effects. On the other hand, heritabilities of periodic increments (zD and zH) in our study decreased with increasing age, while zV remained constant after an initial increase between age 3 and 6 years. If competition was the main source of increasing heritability for a cumulative growth trait, we would have expected this to be reflected also in increasing heritabilities of periodic volume increments. This was not the case in our analysis, and one possibility is that the competition effects for volume increments were compensated for by competition indices, SLL (sum of line lengths) and NP (neighbor plots). Brodie and DeBell [40] found competition indices particularly useful as a tool for reranking the clonal performance from mixed clonal plots to better fit the expected results in monoclonal plots. The linear character of zD, for example, implies that similar zD corresponds to different zV in the trees of different sizes. Consequently, a better estimate of heritability for periodic clonal growth should be achieved by analyzing zD 2 or, as in our cases, the stem volume increment (zV), for which heritability values did not decrease in our study, but remained constant for the period 3-12 years (Table 3). Eleven clusters were cut-off at similarity 70 (horizontal line) are indicated on dendrogram branches, as well as the six groups of clones compared for growth and stem and branching characteristics. The analysis was performed based on two cumulative growth variables (D 12 , H 12 ), two periodic growth variables for age 6-12 (zD 6-12 , zH [6][7][8][9][10][11][12], two phenology variables (BB3, CRC2), and two variables describing stem form (STR) and branchiness (BRNS) A higher heritability of a trait implies a better precision of clonal estimates and indicates good possibilities for an effective clonal selection [33]. A successful early selection depends also on traits at selection age being strongly correlated with the selection traits at the target age [43,44]. In our case, with V 12 as a target trait, both these components increased with age (i.e., D 9 and V 12 were better correlated than D 3 and V 12 ), but a relatively high selection efficiency could be reached already at age 6 ( Fig. 2). Diameter appeared to be a better early selection criterion than height, which can also be explained by its stronger allometric influence on volume compared to height [49]. The most important implication of a high selection efficiency should be contained in a relatively high accuracy at an early clonal selection. However, the correlations of clonal ranks for volume between age 6 and 12 show that substantial changes occurred during this period despite high age-age correlations and correlation efficiency. Clone "70038/24," for example, improved its rank from 13 to 9 between age 6 and 12, but was a top-three clone with regard to zD 9-12 and zH [9][10][11][12] . Also, two other BELt clones gained on SLUc clones during this period. Thus, it could be argued that an early clonal selection in this type of trials needs to target a larger proportion of the tested material and is most effective as a mean to select sets of clones for production trials and pilot plantations, rather than single out very few clones to be deployed commercially on a large scale.
The clonal variance component (σ 2 G Þ, i.e., heritability, of spring and autumn phenology traits was high in the present study, being highest for BB3, CRC2, and LS2. CRC2 and LS2 were similarly defined in the recent study on P. trichocarpa in central Sweden [50] showing also the highest heritability among other traits of autumn leaf phenology. Medium to high heritability values obtained for stem form and branching characteristics suggest that these traits are strongly genetically determined and can be selected for in order to improve the value of produced poplar biomass. Similar characteristics of stem form and branching were reported in 7-year-old poplar trials in southern Sweden [24], with heritability values between 0.35 for branch thickness and 0.59 for the number of branches. The corresponding values in this study were 0.60 and 0.80 (branchiness). Somewhat lower heritabilities (0.21-0.29) for these characteristics were reported in a study of 10-year-old hybrid aspen [42], while Karlsson et al. [23] obtained values of 0.50 and 0.30 for straightness and branching type in 14year-old poplar trials with a set of clones grown in central Europe.

Correlations and Hierarchical Cluster Analysis
Our study confirmed the positive correlations between autumn phenology and growth reported in other studies [18,29,33,[51][52][53][54][55]. High heritability values for both growth and autumn phenology traits indicate that there is enough variability to adapt poplars to photoperiods at higher latitudes in Sweden. If selection is entirely based on growth, there will be a tendency to deploy clones that run increased risk of frost damage if planted at harsher localities or higher latitudes. The selection for an earlier autumn phenology, on the other hand, will tend to decrease biomass production. As poplar plantations are based on the qualities of selected individual genotypes, the selection can be directed to the outliers from these general trends. Thus, to select suitable clones for higher latitudes, we would need to look for genotypes with high performance under shorter growing seasons or clones that partly can compensate an earlier autumn phenology with an earlier bud burst in spring. Based on our results, one such clone could be SLU:s "23.4," which had 15 days earlier leaf coloration and shedding compared to "OP-42" (Fig. 4). This clone still produced 69% of the stem volume of "OP-42" by the age 12 years and performed equally well as other SLU and BEL clones with autumn phenology similar or delayed compared to "OP-42." Contrary to autumn phenology the variation in spring phenology was lower and was not correlated with growth (Fig. 4). A tendency of weak negative correlations were found within "PG" subset indicating that PG clones with an early bud burst tended to perform better. Pliura et al. [18] reported an early bud burst to have significantly positive effect on poplar growth in a large material covering several poplar species and hybrids. Also, Yu et al. [56] found earlier-flushing hybrid aspen to achieve a better growth compared to a local and laterflushing aspen material. Other studies mostly report weak or non-existing direct correlations between bud burst and growth performance [32,54].
In the present study, the length of clonal canopy duration (i.e., effective growth season length) can only be regarded as hypothetical as it was calculated on bud burst and autumn leaf phenology from different years. In addition, the year of leaf phenology screening (2018) was unusual due to an extremely warm summer, which might have affected leaf phenology, exaggerating the differences between vigorous and more suppressed clones. Thus, the strong relationships between autumn phenology, canopy duration, and growth (Fig. 4) need to be considered in the light of the possibility that the timing of autumn leaf phenology did not only reflect the phenological trait, but might also have been affected by the tree (clonal) vigor in 2018. More frequent inventories of phenological traits are necessary to reveal the true strength of these correlations. However, the relative timing of phenology events in Populus seems to be reasonably stable across different years and sites. This is particularly true for bud burst [34], while reports on relative stability of autumn phenology vary in this respect [33,56,57]. We assumed that the projected length of canopy duration for a clone may hold for correlations with cumulative and periodic growth, i.e., general trends, and not for modeling of biomass growth in any particular year. Positive correlations with growth were the overall trend for canopy duration also present in the two subsets of clones (Fig. 5). Thus, our results confirm previous reports of moderate to strong correlations between growth and canopy duration (length of seasonal growth) in Populus [18,32,[58][59][60][61]. Correlation coefficients between cumulative growth, canopy duration, and autumn leaf phenology increased with time in our data (Fig. 5), which might be related to the cumulative nature of this growth trait. The increase of correlation coefficients of periodic increments can have several different causes such as increased competition, an improved ability with age to express genetic potential or the cumulative effect of previous biomass growth that was not accounted for in the mixed models. The increase of correlation coefficients between autumn phenology respective canopy duration and zV was much steeper for the full set of 32 clones compared to the subset for 17 best clones, which suggests that the composition of the material plays a substantial role for modeling of trends in this type of correlations. For the subset of PG clones, i.e., within the material of a single geographic origin, these correlations confirm that delayed autumn phenology provide a competitive advantage with regard to growth. It is thus expected that there will be a selection pressure at lower latitudes or a in a changed climate to favor genotypes adapted to warmer autumns allowing for prolonged period of growth. The lack of significant correlations between bud burst and autumn phenology indicates that different sets of genes regulate these traits, and that they may be bred for separately [43]. Different stages of leaf senescence were all well correlated in our data set and also correlated with canopy duration. It has been shown earlier that clones with late start in spring, initiated on high temperature sums [18], may lose a substantial part of growing season, which must be regarded disadvantageous at higher latitudes.
The correlations between growth, stem form, and branching characteristics were presented in Table S2 and S3. Initially, diameter (D 3 ) and the branching scores were negatively and significantly correlated suggesting that trees with more favorable branching characteristics grew better. By the age 12 years, however, these correlations weakened, while the correlations between height and branching stayed negative and significant throughout the 12-year period. It can be expected that an extensive branchiness or thick branches may have positive effect on diameter development. The quality aspects have been studied more often in other species where discordances between diameter growth and quality characteristics are often handled by putting additional emphasize on height growth [43]. This, presumably unfavorable correlation between growth and quality imposes additional difficulties on defining poplar ideotypes, with imperative to satisfy a broad spectrum of possible end uses. However, our data clearly confirmed that height growth and stem form (STR) were negatively affected by branch number (BRNS) and thickness (BRTH), suggesting that the selection for biomass performance needs to be balanced by quality characteristics.
As selection of suitable clones is dependent on growth, climate adaptation, and quality traits, we used multivariate analysis for grouping and evaluating clones. The hierarchical clustering is useful for selecting clones with similar characteristics, provides an overview of the material and represents a valuable aid for the decisions about further testing of clones or their deployment in pilot plantations. The clusters can also be useful for the composition of clonal mixtures to be deployed commercially, particularly when site-specific studies on clonal performance are not yet available. Deployment of clones in mixtures is regarded as advantageous in terms of risk avoidance and ecological services and has been applied in Sweden for both hybrid aspen and poplars [62]. Hierarchical clustering is more efficient when underlying traits are characterized by high heritability and clear correlation patterns. In our case, four out of eight variables characterized growth, a deliberate choice to emphasize growth where cumulative growth represented a feature of overall clonal performance, and the periodic increment at age 6-12 years characterizing clonal development after the anticipated time for the first selection at the age of 6 years. The use of bud burst and leaf coloring as phenological variables is governed by their high heritabilities (i.e., large variation in the studied material) and importance in the delimitating the length of canopy duration. Stem straightness and branchiness, on the other hand, have shown high heritability values in this study and are important for the characterization of clones and their suitability for use in different types of growing systems.
The results of present study implicate a superior performance of the two commercial clones "OP-41" and "OP-42" (Cluster 7). P. maximowiczii × trichocarpa hybrids have been frequently reported as high performing in other studies from central and northern Europe and in comparison with the material of varying origin [17,18,33,37,[63][64][65]. These hybrids are undoubtedly of interest for northern Europe and should be included in the future breeding programs targeting geographical regions at high latitudes. The BELt and SLUc clones performed less, but can be of interest for further testing in southern Sweden (particularly BELt clones, cluster 5) and as an alternative at latitudes 57-59°N (cluster 1). Clone "23.4" was separated into a single-clone cluster (cluster 2) due to its distinct characteristics of high diameter increment, lower height increment in the period between age 6 and 12 years, earlier autumn phenology, earlier bud burst, and less advantageous stem form (Fig. 6, Table 6). Thus, clone "23.4" belongs to the branch of high producing clones (SLUc, BELt, and REF clones; Fig. 6) while phenologically having potential to be transferred to the north at much lower risk compared to REF clones. In general, SLUc and BELt clones had more favorable stem form and branching traits compared to REF clones at the expense of productivity. However, high-producing clones with smaller crowns, such as "21.9," may potentially be interesting for growing systems established with denser initial planting distances and thinning schemes. The multivariate analysis was also useful for structuring PG clones, which potentially can be of interest in breeding for higher latitudes. For example, clones belonging to cluster 9 were best in terms of volume growth, but are also characterized unfavorable values for STR and BRNS. Thus, extensive branching may have contributed to overestimate the measured stem diameter and volume while signaling decreased value related to harvest and industrial processing of biomass. Contrary, the clones "PG2-22" and "PG5-5" separated within cluster 10 seem to be good candidates for inclusion in germplasm of Swedish poplar breeding population, as they have straight stems, relatively thin branches and the best height increment among PG material.

Conclusions
& The commercial clones "OP-42" and "OP-41" have shown superior performance, but unfavorable stem form and branching compared to the Swedish and Belgian P. trichocarpa clones selected for this study. & Some Swedish P. trichocarpa clones demonstrate potential to be deployed at higher latitudes, where growth of "OP-42" implies an increased economic risk. & Late autumn phenology and longer canopy duration correlate positively with productivity. Thus, the selection of clones for deployment at higher latitudes needs to be based on their relative phenological characteristics. & Stem straightness and height growth are negatively influenced by extensive branching. & Hierarchical cluster analysis provides a valuable tool for clonal evaluation based on growth, phenology, stem form, and branching characteristics. The material is separated in clusters that reflect the origin and parentage of the material, while providing a visual overview of differences and similarities among the material. The method is useful for decision making with regard to clonal deployment and building up a poplar breeding population for northern latitudes.