Introduction

The accurate prediction of breeding values and estimation of genetic parameters in tree breeding program rely on well-designed progeny trials. Progeny trials for forest tree species are usually characterized by large field size, heterogeneous global and microenvironmental variation and long-term observations. Various experimental designs were proposed to separate within-site micro-environmental variation from the treatment variation as much as possible (Cochran and Cox 1957). Randomized complete block (RCB), set-in-block, and noncontiguous plots were initially used in forestry field trials. Later, incomplete block designs including row-column design were introduced to reduce the size of replicated block (Libby and Cockerham 1980; Williams et al. 2002; Patterson and Thompson 1971). To further separate microenvironmental variation from treatment variation, single-tree plots were recommended over the usual multiple-tree row plots (Loo-Dinkins 1992).

To deal with spatial correlation between adjacent plots, nearest neighbor methods were first established to take such variation into account (Papadakis 1937; Bartlett 1938). In crops and forest trees, spatial or neighbor adjustment were proposed more than a quarter of century ago (Gleeson and Cullis 1987; Martin 1990; Magnussen 1990, 1993; Wilkinson et al. 1983). But a spatial model was only systematically applied in the last decades thanks to the development of software such as ASReml to deal with complex linear mixed models (Gilmour et al. 2009; Smith et al. 2005; Gilmour and Cullis 1995). Spatial analyses could improve the estimation of genetic effects by modeling the spatial distribution of error effects and separating spatially correlated global and microsite variation from independent random environmental variation in forestry progeny trials. Previous studies have demonstrated that site variation in many forest genetic trials is generally spatially continuous, and residuals from close locations are more related than distant locations. By introducing spatial models into the analysis of forest genetic tests for tree height, Magnussen (1990) reported a reduction of 5–20% in the standard error of the difference between full-sib families and an increase in the number of significant differences among treatments, relative to the RCB design (Magnussen 1990). Kusnadar and Galwey (2000) showed that a spatial model gave a significantly better fit than an incomplete block model for tree growth traits in maritime pine (Pinus pinaster Ait.). On the basis of 12 forest tree progeny trials, Costa e Silva et al. (2001) concluded that a spatial model increased REML (restricted maximum likelihood) log-likelihood and the prediction accuracy of genetic values. Dutkowski et al. (2002, 2006) presented one of most convincing evidence using 55 trials in 6 species that spatial analysis methods improved predicted genetic responses by more than 10% for 20 variables and recommended that spatial analysis should be routinely applied in forest genetic trial analyses where the spatial arrangement of trees can be determined. Ye and Jayawickrama (2008) reported that more than 97% of the data sets in 275 trials had significantly improved by spatial analysis, and 14–34% of the residual variances could be removed by fitting a spatial correlation model. This improved the accuracy of breeding value prediction up to 20%.

Chinese fir (Cunninghamia lanceolata (Lamb.) Hook) is the predominant commercial plantation species in China, accounting for about 25% timber production and 6.7% of the total area of forest resources. Breeding for growth traits in the first two generations has increased average growth rate by almost 30% (Shi 1994). In a quantitative genetics survey, several unusual and interesting unique genetic properties of Chinese fir’s breeding population such as low heritability of wood density and low negative genetic correlation between growth and wood density, a large difference between narrow-sense and broad-sense heritability, and regional-related G by E interactions were found (Bian et al. 2014). Several relatively large progeny trials with a family of sufficient size for genetic analyses revealed low heritability for DBH (e.g., h 2 = 0.09 in a trial of 112 families (Fang and Shi 1998); h 2 = 0.05 in another trial of 228 families (Yu et al. 2001)). The typical progeny test site for Chinese fir in southern China is planted on a hilly slope with a usual global environment gradient from the hill top to valley. Planting blocks are usually arranged on the similar contour along the hill, creating a potential spatial correlation among neighboring trees on the similar contour of the same block. Therefore, proper experimental design and an adequate analytical model to reduce correlated environmental variation are probably more important for progeny testing than planting on a flatter site. In this study, we will explore the utility of spatial analyses in reducing the environmental variation and increase of heritability estimates (e.g., increasing the efficiency of progeny analyses) in Chinese fir progeny trials in an approach similar to that used by Costa e Silva et al. (2001), Dutkowski et al. (2002, 2006), and Ye and Jayawickrama (2008). We also used post-blocking and fitted global trend to compare with spatial analyses and whether post-blocking and fitted global trend are similarly efficient in reducing environmental variance as the spatial model. The DBH from a single site in northern Fujian of China was used to evaluate the efficiency of a spatial two-dimensional auto-correlated model, post-blocking, and fitting global trend over incomplete block (IB) analysis by assessing the reduction in residual variance, changes in heritability, and genetic gain.

Materials and methods

Materials

The study was based on a progeny trial at Weimin Forest Farm (latitude 27°05′N; longitude 117°40′E), Fujian Province planted in February 2003 with 80 open-pollinated families and one control, using an incomplete block experimental design with 10 replications. There were nine incomplete blocks per replication with nine plots per block, and four trees in a row-plot along the contour (e.g., the row is perpendicular to the slope). Tree spacing was 2 m × 2 m with a buffer zone established around the trial. The half-sib families were derived from Nanjing Forestry University–Fujian Province Chinese Fir Cooperation second-generation breeding population at Guanzhuang Forest Farm (latitude 26°32′N; longitude 117°45′E). The planting site was on a relatively steep hill of 23° slope, and the previous crop was a Chinese fir plantation. The site was relative fertile with a layer of 20 cm humus above the base red soil about 90 cm deep before planting. Tree height and diameter at 1.3 m above ground (DBH) were measured during the growing season of 2005–2011, equivalent to a stand age of 3–9 years after planting. For facilitating spatial analysis, every tree was uniquely identified by a row and column position in a total of 56 rows and 171 columns (Fig. 1a).

Fig. 1
figure 1

a DBH distribution at age 9 along the row and columns (Numbers in the figure indicate replicates and the rows are perpendicular to the slope). b Independent residual distribution of DBH at age 9 after fitting spatial model and eight post-blocks are shown on the figure. Note the color of grey means missing values

Statistical model and analysis

The following linear mixed model was used in the genetic analysis of the trial:

$$ y = Xb + Zu + e, $$
(1)

where, y is the vector of individual tree observations; b is the vector of fixed effects with the overall mean as the first element; u is the vector of random effects of individual trees including replications, incomplete blocks, plots and additive genetic merits; e is the vector of random residual deviations of individual trees; X and Z are known design matrices for fixed and random effects, respectively. Separate model terms in u were assumed to be uncorrelated. Fixed and random effect solutions are obtained by solving the mixed model equations:

$$ \left[ {\begin{array}{*{20}l} {{\text{X}}^{ '}\, {\text{R}}^{ - 1} {\text{X}}} & {{\text{X}}^{ '}\, {\text{R}}^{ - 1} {\text{Z}}} \\ {{\text{Z}}^{ '}\, {\text{R}}^{ - 1} {\text{X}}} & {{\text{Z}}^{ '}\, {\text{R}}^{ - 1} {\text{Z}} + {\text{G}}^{ - 1} } \\ \end{array} } \right]\left[ {\begin{array}{*{20}c} {{\hat{\text{b}}}} \\ {{\hat{\text{u}}}} \\ \end{array} } \right] = \left[ {\begin{array}{*{20}c} {{\text{X}}^{ '} {\text{R}}^{ - 1} {\text{y}}} \\ {{\text{Z}}^{ '}\, {\text{R}}^{ - 1} {\text{y}}} \\ \end{array} } \right] $$
(2)

where, R is the variance–covariance matrix of residuals and G is the direct sum of the variance–covariance matrices of each of the random effects. Residuals are assumed to be independent, and R is denoted as \( {{\sigma }}_{\text{e}}^{2} {\mathbf{I}} \), but spatial analysis allows R to have a different structure based on a decomposition of e into spatially dependent \( ({\varvec{\upxi}}) \) and spatial independent \( ({\varvec{\upeta}}) \) residuals (nugget effect) to model a correlation structure in correlated residual variance. In our analyses, \( {\varvec{\upxi}} \) was modelled as a first-order separable autoregressive process (\( {\mathbf{AR}}1 \otimes {\mathbf{AR}}1 \)) with the variance–covariance matrix structured as:

$$ {{Var}}\left( {\varvec{\upxi}} \right) = \sigma_{{{\xi }}}^{2} \left[ {{\mathbf{AR}}1\left( {\rho_{\text{col}} } \right) \otimes {\mathbf{AR}}1\left( {\rho_{\text{row}} } \right)} \right] $$
(3)
$$ {\mathbf{AR}}1\left( {\rho_{\text{col}} } \right) = \left[ {\begin{array}{*{20}c} 1 & {\rho_{c} } & {\rho_{c}^{2} } & \cdots & {\rho_{c}^{{n_{c}^{ - 1} }} } \\ {\rho_{c} } & 1 & {\rho_{c} } & \cdots & {\rho_{c}^{{n_{c}^{ - 2} }} } \\ {\rho_{c}^{2} } & {\rho_{c} } & 1 & \cdots & {\rho_{c}^{{n_{c}^{ - 3} }} } \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ {\rho_{c}^{{n_{c}^{ - 1} }} } & {\rho_{c}^{{n_{c}^{ - 2} }} } & {\rho_{c}^{{n_{c}^{ - 3} }} } & \cdots & 1 \\ \end{array} } \right] $$
(4)
$$ \begin{aligned} {\mathbf{AR}}1\left( {\rho_{\text{row}} } \right) = \left[ {\begin{array}{*{20}c} 1 & {\rho_{r} } & {\rho_{r}^{2} } & \cdots & {\rho_{r}^{{n_{r}^{ - 1} }} } \\ {\rho_{r} } & 1 & {\rho_{r} } & \cdots & {\rho_{r}^{{n_{r}^{ - 2} }} } \\ {\rho_{r}^{2} } & {\rho_{r} } & 1 & \cdots & {\rho_{r}^{{n_{r}^{ - 3} }} } \\ \vdots & \vdots & \vdots & \ddots & \vdots \\ {\rho_{r}^{{n_{r}^{ - 1} }} } & {\rho_{r}^{{n_{r}^{ - 2} }} } & {\rho_{r}^{{n_{r}^{ - 3} }} } & \cdots & 1 \\ \end{array} } \right] \hfill \\ \hfill \\ \end{aligned} $$
(5)

where, \( \sigma_{{{\xi }}}^{2} \) is the spatial variance of the trend process and Eqs. 4 and 5 are the first order autoregressive correlation matrices with autocorrelation parameters \( \rho_{\text{col}} \) and \( \rho_{\text{row}} \) for n c columns and n r rows, respectively, and ⊗ is the Kronecker product. The elements of vector \( {\varvec{\upeta}} \) are assumed to be pairwise independent, so the residual variance–covariance matrix of the spatial model is defined as:

$$ {{Var}}\left( {\mathbf{e}} \right) = {\mathbf{R}} = \sigma_{\xi }^{2} \left[ {{\mathbf{AR}}1\left( {\rho_{\text{col}} } \right) \otimes {\mathbf{AR}}1\left( {\rho_{\text{row}} } \right)} \right] + \sigma_{\eta }^{2} {\mathbf{I}}_{{n_{c} *n_{r} }} , $$
(6)

where, \( \sigma_{{{\eta }}}^{2} \) is the variance of the independent residuals and I is an identity matrix. The residuals in \( {\varvec{\upeta}} \) reflect random environmental effects after accounting for correlated spatial variation.

Global model and post-blocking model were also fitted along with the design model to examine the efficiency at removing the environmental variation from the residual variance. We first examined whether the designed factor, such as replication, had significant effect to reduce environmental variation in the trial. Then we used post-blocking to examine its effectiveness in reducing environmental variation. Finally, we fitted the spatial model to further study the effectiveness of the spatial model in reducing environmental variation. Therefore, five models with various combinations of random effects and definitions of R and G were fitted for the trial: (1) a design base model (D) which fitted replication, incomplete block, and plot as random effects; (2) design base model plus post-blocking (DP) where a new random incomplete block term was fitted in addition to the base model (Eight post-blocks were selected based on the DBH pattern and the location of each natural block as illustrated in Fig. 1b); (3) spatial model with first-order autoregressive for both row and column (S); (4) design base model plus global trend (DG) in which a second degree of polynomial was fitted along the row and column; (5) spatial model plus global trend (GS).

For testing the significance of the random term, the ratio of the variance component to their corresponding standard error was used. If the ratio was larger than 2, then we regarded the random term as significant. If the ratio was less than 1, then we regarded the random term not significant. For ratios between 1 and 2, a likelihood ratio test (LRT) was applied assuming that log-likelihood ratio LLRT [= −2 × (difference of log-likelihood excluding and including the random term)] follows a \( \chi_{0.5}^{2} \) distribution with sufficient large size of sample (Gilmour et al. 2009). Spatial models were fitted from age 3–9 (after planting) for both height and DBH to examine a possible time trend of autocorrelation of spatial variation.

Assuming spatial variance could be separated as part of a correlated environmental variance, independent residual variances were used to estimate individual tree narrow sense heritability (h 2) for tree height and DBH:

$$ \hat{h}^{2} = \frac{{\hat{\sigma }_{A}^{2} }}{{\hat{\sigma }_{A}^{2} + \hat{\sigma }_{\text{e}}^{2} }} $$
(7)

where, \( \sigma_{A}^{2} \) is additive genetic variance. Genetic gain was estimated using Eq. 8:

$$ \Delta G = \frac{{i \times h^{2} \times \sigma_{p} }}{M} \times 100 $$
(8)

where, i is the selection intensity, h 2 is the narrow-sense heritability, σ p is the phenotypic standard deviation, and M is the mean value of trait.

Variance parameters were estimated by the restricted maximum likelihood (REML) method and an approximate standard error of statistics was obtained by Taylor expansion, implemented in the ASReml software package (Gilmour et al. 2009).

The Akaike information criterion (AIC) was calculated for model selection as:

$$ {\text{AIC}} = - 2\ln L + 2k $$
(9)

where, \( \ln L \) is the maximized value of the likelihood function for the mode, k is the number of estimated variance parameters in the model. And the preferred model is the one with the minimum AIC value.

Results

The mortality in the trial at the final measurement of age 9 after planting was around 25.2%. The average height and DBH reached 8.4 m and 13.5 cm with corresponding standard deviation of 1.4 m and 3.4 cm, respectively (Table 1). The histogram suggested that the data are approximately normally distributed.

Table 1 The mortality, overall mean value of height and DBH at different stand age with approximate standard deviation in parentheses

Five models were fitted for both DBH and tree height and all nonsignificant effects removed from the models except for post-block variances. Table 2 presents results for DBH for the designed factors (e.g., replicates, incomplete blocks, and plots) in five models. All design factors were significant (p < 0.05) for DBH from age 3–9 in the model using experiment design (Design). However, post-blocks were not significant at all ages for DBH. When an autoregressive spatial correlation and independent error were fitted together with the design factors, the other design factors were insignificant at most ages except at age 4, 5, and 7 for plots and at age 3, 4, and 5 for incomplete blocks. These results may indicate that the replicate, incomplete block and plot effect were absorbed by the spatial variances at most ages, particularly at late ages.

Table 2 Estimated variances of design factors using five models (D Design, DP Design plus Post-blocking, S Spatial, DG Design plus Global, GS Spatial plus Global) for DBH of Chinese fir trial from age 3–9

Distribution of individual tree growth at age 9 is shown in Fig. 1a. The spatial pattern for DBH had an apparent global trend of increasing tree size from the top to the bottom of the hill. The residual was more random after fitting the spatial model for the same data at age 9 (Fig. 1b). Figure 2 presents variograms for DBH and tree height at age 9 and shows typical highly correlated spatial variation for both row and column, with trees that are closer together being more similar than those that are farther apart.

Fig. 2
figure 2

Variogram of residual for DBH and tree height (Height) at 9 year-old after planting

Both spatial model and spatial plus global model increased additive genetic variances relative to the experimental design model, while post-blocking and adding global trend under the design model did not increase the additive genetic variance (Fig. 3a). Also the increase in the additive genetic variance was not obvious at age 3, but it increased as trees grew from age 4–9. Similarly, independent residual variances were reduced after fitting the spatial and the spatial plus global model while fitting of post-block and global model did not reduce the independent residual variances (Fig. 3b). As a consequence, heritability for DBH increased between 23 and 31% for ages 3–5, and between 31 and 64% for ages 6–9 using the spatial model relative to experimental design model (Fig. 3c). Models using post-blocking and global trend were found to have no effect on heritability relative to the design models.

Fig. 3
figure 3

Age trends of a Additive genetic variance, b Residual variance, c Heritability, d Log-likelihood, e Akaike information criterion (AIC) based on five models (D Design, DP Design plus Post-blocking, S Spatial, DG Design plus Global, GS Spatial plus Global) for DBH of Chinese fir trial

The log-likelihoods increased after fitting the spatial model between 50 and 80 and were highly significant (p < 0.001) relative to the base design model (Fig. 3d). Similarly, the AIC decreased between 98 and 157 relative to the base design model with significant effects for all ages (Fig. 3e).

The age trends for autocorrelation are shown in Fig. 4a. Columns had higher spatial correlation than rows. Fitting of the spatial plus global model slightly decreased autocorrelation with slightly more decrease in rows than in columns. In addition, the independent residual variance and spatial and additive variances all increased with age, with the residual and the spatial variance rising the quickest. The spatial variance in the global plus spatial model increased slower than the corresponding one in the spatial model as trees aged (Fig. 4b).

Fig. 4
figure 4

a Age trend of row (ρrow) and column (ρcol) auto-correlation under two models (S Spatial, GS Spatial plus Global) and b Spatial, additive and residual variance under Spatial model (S) and spatial variance under Spatial plus Global model (GS) for DBH of Chinese fir trial

Overall, age trends for the five models of tree height were similar to DBH. On average, heritability increased from 0.15 to 0.20 for DBH, and from 0.19 to 0.28 for tree height with a 35 and 52% increase, respectively.

The relative genetic gains from parental and mass selection also increased using the spatial model relative to the design model. For example, the gain for DBH increased about 3.2 and 5.2% from using the spatial analysis relative to the experimental design model when selection intensity was 10% for parents and 0.5% for mass selection.

Discussion and conclusion

The main objective of a progeny trial in a tree breeding program is to estimate genetic parameters and breeding values for selection. For reducing environmental variance due to environmental heterogeneity in the progeny trial, various experimental designs have been suggested. The simplest experimental design is a completely randomized design, which was adopted by northern Swedish tree breeders. However, these tree breeders successfully used post-blocking to reduce the environmental variation associated with large blocks (Dutkowski et al. 2002; Ericsson 1997). The most common experimental design is a randomized complete block design (Loo-Dinkins 1992; White et al. 2007). For further reducing environmental heterogeneity, an incomplete block design was introduced, particularly with a row and column design (Loo-Dinkins 1992). Incomplete block designs were adopted for controlling within-site variation in most Chinese-fir progeny trials planted in the 1990s and 2000s. Usually 50–120 families of 2–4 trees per row-plot were deployed in Fujian Chinese fir breeding testing. The incomplete block design was generally effective in Fujian’s hilly Chinese fir sites. These sites typically have environmental gradients from hill top to bottom (Bian et al. 2014). To reduce this global trend of environmental variation, blocks were usually employed along the contour of the site to keep the microenvironment within the same block as homogeneous as possible. Such sites may be well suited for spatial analyses to reduce correlated environmental variation along rows and columns.

Our analysis revealed that both adding a global trend and imposing post-blocking was not effective under the experimental design model using an incomplete block on this hilly Chinese fir site. However, spatial analyses increased efficiency and not only accounted for the effects of design factors (replicates and incomplete blocks), but also reduced residual variance by separate spatial variation from the independent residual variation. Independent residual variation was markedly reduced by the spatial autocorrelation analysis, which may indicate that part of the environmental variation was autocorrelated and that spatial analyses can separate it from the independent environmental variation. Furthermore, spatial analyses also accounted for the large global field variation from the hill top to the bottom. Replication is traditionally used to account for this global trend in the experimental design in hilly sites.

Spatial autocorrelation was significant at an early age, 3 years, in the Chinese-fir experiment, but all parameters (e.g., additive and residual variances, log-likelihood and AIC) indicated that spatial analyses were more effective for DBH from age 4 onward. There were higher autocorrelations among columns than rows, which is not surprising since adjacent trees among the columns were planted on the same contour, while adjacent trees among rows were planted along an altitudinal gradient. Therefore, the autocorrelation among rows may account for the global trend from hill top to bottom, while autocorrelation among columns may account for local spatial variation, e.g., the similarity in the environments on the same contour of the terraces.

All 14 spatial models for DBH and tree height improved heritability in this study, which was similar to observations in Douglas-fir (Pseudotsuga menziesii) in which 97% of the data sets showed significant model improvement using spatial analysis. An improvement in model fit using spatial analysis was also observed in 90% of the cases for other tree species such as Picea abies, Picea mariana, Picea sitchensis, Pinus pinaster, Pinus radiata, and Eucalyptus globulus (Dutkowski et al. 2006; Ye and Jayawickrama 2008).

In our experiment using an incomplete block design, residual variance was the dominant term. In the spatial model, the independent residual was reduced by 13% for DBH and 19% for tree height on average. These were similar to the Doulas-fir study for which, on average, spatial analyses removed 14–34% of the correlated variance due to spatial correlation (Ye and Jayawickrama 2008). Narrow-sense heritabilities for individual tree increased 35 and 51% on average by removing spatial variance from the residual variance, which was higher than the results for DBH summarized by Dutkowski et al. (2006). This result is not surprising considering that both high autocorrelations in rows and columns were observed in this hilly site, perhaps indicating that the spatial analyses are more useful or powerful for hilly sites such as the Chinese-fir sites. However, we only analyzed one site. If more sites are found with such high two-dimensional autocorrelations, then spatial analyses should routinely be applied to similar forest progeny testing sites in countries where most forest trials are on hilly sites, such as in southern China and other southeastern Asian countries.

A moderate genetic gain of about 3–5% for selection was determined in this study using spatial analyses, similar to the finding that extra genetic gain usually did not exceed 6% (Dutkowski et al. 2002). However higher gain was observed by Costa e Silva et al. (2001) in several progeny trials including radiate and maritime pines and Sitka spruce. In conclusion, fitting with global trend and post-blocking was not effective with the current incomplete block design used in Chinese-fir progeny trial, while spatial analyses reduced residual variance, increased heritability and genetic gain.