Study area and site index data
Study sites in three provinces in southern China (Fig. 1) included Fujian (eastern region in mid-subtropics), Sichuan (central region in mid-subtropics) and Guangxi (southern subtropical climatic zone). The soil type is mainly red soil developed on granite in Fujian and Guangxi and red soil developed on shale in Sichuan.
In each of these three regions, long-term positioning observation test plantations of Chinese fir established in 1982, with even-aged trees, were selected for this study. In each study area, 15 plots were installed in a random block arrangement with five initial planting densities: A (1667 trees ha−1, 2 m × 3 m), B (3333 trees ha−1, 2 m × 1.5 m), C (5000 trees ha−1, 2 m × 1 m), D (6667 trees ha−1, 1 m × 1.5 m), E (10,000 trees ha−1, 1 m × 1 m). Each treatment level was replicated three times for a total of 15 plots. Each plot was 20 m × 30 m and surrounded by a 2-row buffer zone, comprising similarly treated trees. Three experiments were established using bare-root seedlings, and all trees were tagged. After afforestation, each plot was surveyed in the winter at 1–3-year intervals. Height data for 18–20 years were obtained from each plot. The dominant height was computed as the average height of six tallest trees in each plot. The site index was calculated for each plot at the reference age of 20 years using the dominant height data (Fujian and Guangxi) or using the Richards model with three parameters (Sichuan).
Climatic data
ClimateAP is an online platform to generate annual, seasonal and monthly climatic data for historical and future periods in the Asia Pacific region (http://climateap.net/; Wang et al. 2012, 2017a, b). To explore the effects of climate factors on the site index of Chinese fir plantations, climatic data of each site was obtained using ClimateAP through spatially interpolated estimations based on site longitude, latitude, and elevation. In this study, eight climatic variables including mean annual temperature (MAT, °C), mean annual precipitation (MAP; mm), degree-days above 5 °C (DD5), degree-days below 0 °C (DD_0), July maximum mean temperature (Tmax07; °C), summer mean maximum temperature (Tmax_JJA; °C), spring precipitation (PPT_MAM; mm), annual heat–moisture index (AHM) were chosen as candidate variables for model fitting of the site index. What’s more, AHM integrates MAT and MAP data into a single parameter, as shown below:
$${\mathrm{AHM}}=\frac{\left({\text{M}}{\text{AT}}+10\right)}{{\text{M}}{\text{AP}}}\times 1000$$
(1)
where AHM is the annual heat–moisture index. MAT is the mean annual temperature. MAP is the mean annual precipitation. Lower \(AHM\) values indicate relatively wetter conditions. The climatic data for each study site are shown in Table 1.
Table 1 Summary of elevation, latitude, longitude and climatic variables at the three study areas Soil data
Soil samples were collected by digging a soil profile in mature Chinese fir plantations in Fujian, Guangxi and Sichuan. Three soil profiles were selected and diagonally distributed in each plot. A total of 135 soil profiles were manually dug at the three study sites. Each soil profile was 1 m deep and divided into five soil depths: 0–20, > 20–40, > 40 − 60, > 60–80 and > 80–100 cm. Bulk density was determined by inserting three cutting rings (5 cm height, 5 cm inner diameter and known volume) at each depth of the soil profile. At the same time, soil water content was determined from soil samples collected in three aluminum boxes placed at each depth. Approximately 1 kg of soil was sampled at each depth, stored in bags and transported to the laboratory. The soil samples were then air-dried, ground, sieved, then analyzed for pH, organic matter content (g kg−1), total N (g kg−1), alkali-hydrolyzable N (mg kg−1), total P (g kg−1), available P (mg kg−1), total K (g kg−1), available K (mg kg−1), bulk density (g cm−3), water content (%), C/N ratio, C/P ratio and N/P ratio. The soil pH was determined using the potentiometer method using a suspension of 1-part soil to 2.5 parts 1 M KCl. Soil organic matter content was measured using the K2Cr2O7-H2SO4 oxidation method. Total N was determined using the Kjeldahl method and alkali-hydrolyzable N using alkaline hydrolysis method. Total P was measured using the NaOH alkali solution–molybdenum antimony colorimetric method and available P using the NaHCO3 alkali solution–molybdenum antimony colorimetric method. Total K and available K were determined using flame photometry (Bao 2000; Venanzi et al. 2016). The soil data for the three provinces are summarized in Table 2.
Table 2 Soil characteristics for the three study areas in Chinese fir plantations in Fujian, Guangxi and Sichuan in China Data analyses
Analysis of variance (ANOVA) and multiple comparisons test were performed to compare the differences in site index among different regions. The relationship between site index and soil factors at the different soil depths in the three provinces and that between site index and climatic variables were examined by Pearson correlation analysis. To reveal the importance of site factors across regions with different climates, while taking into account the interdependency among independent variables, stepwise regression analysis was used to limit the number of explanatory variables for the three climatic regions, and the most important site quality variables affecting the site index of Chinese fir in each climatic region were determined.
Linear mixed effects model (LMM)
The linear mixed effects model was used to predict the site index as a function of the most important soil factors and climatic factors. Three regional models were established based on the determined soil factors for each climatic region, a climatic model also developed using the most dominant climatic variable. The following basic model was used for modeling site index related to regional models and climatic model:
$${S}_{\mathrm{I}}={\alpha }_{0}+\alpha X+\varepsilon$$
(2)
where SI is the site index, \({\alpha }_{0}\) is the intercept, α is a vector of coefficients, \({\text{X}}\) is a vector of independent variables, including various soil and climatic variables, and ε is the error term, ε ~ N(0, σ2).
Considering the differences among different climatic regions, we treated climatic region as a dummy variable. A global site index model for the whole study area was established, which used the dummy variable and the most important soil factors of the three climatic regions as predictors. The basic site index model could be given by:
$${S}_{\mathrm{I}}={\beta }_{0}+{S}_{1}\left({\beta }_{1}{X}_{1}\right)+{S}_{2}\left({\beta }_{2}{X}_{2}\right)+{S}_{3}\left(\beta {X}_{3}\right)+\varepsilon$$
(3)
where SI is the site index, β0 is the intercept, β1, β2, β3 are vectors of coefficients for Fujian, Guangxi and Sichuan, respectively; X1, X2, X3 are vectors of independent variables for Fujian, Guangxi and Sichuan, respectively; S1, S2, S3 represent dummy variables for Fujian, Guangxi and Sichuan, respectively; S1 = 1 indicates the Fujian and 0 indicates the other climatic regions; S2 = 1 indicates the Guangxi region and 0 indicates the other climatic regions; S3 = 1 indicates the Sichuan and 0 indicates the other climatic regions.
In addition, based on the important soil factors selected in the three climatic regions, the relationship between these soil factors and site index was explored in the whole study area. A polynomial relation between the soil factors and site index was obtained, from which the soil factors most related to site index in the whole study area were determined. Therefore, another global site index model was established by using the most important soil and climatic variables in the whole study area.
We selected 30 plots from the 45 plots as modeling data, and the remaining 15 plots were used as vertification data. Since data collected from three regions and five planting densities in each region, the random effects of region and planting density were added to the intercept of the site index model. In addition, we used the independent equal variance structure for describing the variance–covariance structure of random effects. Parameters in the LMM models were estimated through restricted maximum likelihood approach implemented in ForStat2.2 software (Tang et al. 2009).
Model evaluation
The model evaluation and testing were based on the mean absolute error (MAE), mean relative error (MRE), root mean square error (RMSE) and coefficient of determination (R2).
$${M}_{\mathrm{AE}}=\frac{1}{n}{\sum }_{i=1}^{n}\left|{y}_{i}-{\widehat{y}}_{i}\right|$$
(4)
$${M}_{\mathrm{RE}}=\frac{1}{n}{\sum }_{i=1}^{n}\left|\frac{{y}_{i}-{\widehat{y}}_{i}}{{y}_{i}}\right|$$
(5)
$${R}_{\mathrm{MSE}}=\sqrt{\frac{1}{n-1}{\sum }_{i=1}^{n}{\left|{y}_{i}-{\widehat{y}}_{i}\right|}^{2}}$$
(6)
$${R}^{2}=1\left[\frac{{\sum }_{i=1}^{n}{\left({y}_{i}-{\widehat{y}}_{i}\right)}^{2}}{{\sum }_{i=1}^{n}{\left({y}_{i}-\overline{y }\right)}^{2}}\right]$$
(7)
where yi is the observed value, \({\widehat{\text{y}}}_{\text{i}}\) is the predicted value, \(\stackrel{\mathrm{-}}{\text{y}}\) is the mean value of the observed value, and n is the number of the sample plots.
The independent sample data not used in the modeling were used to test the model, and the prediction performance of the global site index models was evaluated. Equation 8 (Jiang and Li 2014) was used to calculate the random parameter value in the LMM models:
$${\widehat{b}}_{k}=\widehat{D}{\widehat{Z}}_{k}^{T}{\left({\widehat{Z}}_{k}\widehat{D}{\widehat{Z}}_{k}^{T}+{\widehat{R}}_{k}\right)}^{-1}{\widehat{e}}_{k}$$
(8)
where \({\widehat{b}}_{k}\) is the random parameter, \(\widehat{D}\) is the variance–covariance matrix of the random effect parameters, \({\widehat{R}}_{k}\) is the variance–covariance structure in the model, and \({\widehat{Z}}_{k}\) is the design matrix of random effect. The actual value minus the predicted value calculated using the fixed effect parameters is \({\widehat{e}}_{k}\).