Introduction

Stand biomass is one of the most important measures of space and resource utilization in forest ecosystems. As carbon content in plant tissues is relatively constant, in comparison with the variability of stand biomass (e.g., Lehtonen et al. 2004; Martin and Thomas 2011; Jagodziński et al. 2012), assessment of biomass allows calculation of how much carbon has been sequestrated in an ecosystem. This utility of biomass calculation confers its high importance in carbon inventories, allowing estimation of carbon sequestration in forests. In the age of changing climate (Thuiller et al. 2011; Sohngen and Tian 2016) and increasing concentration of CO2 in the atmosphere (IPCC 2013), increasing accuracy of biomass estimation is an urgent need of science. This is especially important, as forests are one of the most important terrestrial pools of carbon (Pan et al. 2011) and may help to mitigate negative effects of climatic change (Chmura et al. 2010; Lindner et al. 2014; Dyderski et al. 2018).

Usually stand biomass may be assessed using the two main approaches: allometric equations at both tree and stand levels (e.g., Baskerville 1972; Zianis et al. 2005; Zasada et al. 2008; Xie et al. 2016; Forrester et al. 2017) and stand-level assessments based on biomass conversion and expansion factors (e.g., Lehtonen et al. 2004; Teobaldelli et al. 2009; Wojtan et al. 2011; Jagodziński et al. 2017). The former is a regression model between two variables expressed in units of different dimensions. The most common are tree-level models, allowing for biomass estimation based on dimensions of single trees. This tool uses general allometric rules of biomass scaling according to the organism’s dimensions (Weiner 2004; McCarthy and Enquist 2007; Poorter et al. 2015). Although these models usually have high accuracy (Zianis et al. 2005), their applicability is limited only to datasets containing single tree observations. For that reason usage of tree-level methods of biomass estimation is time and money demanding. BCEFs are coefficients which allow calculation of stand biomass using information on volume (Eggleston et al. 2006; Somogyi et al. 2007). As BCEFs are volume dependent, their application is only possible when this information is available. However, most forest inventories provide this data, as stand volume is the most important parameter from the economical point of view. For that reason, BCEFs are more useful for cases of large-scale analyses (Neumann et al. 2016). Nevertheless, BCEFs are developed using biomass calculated from tree-level inventories; thus, their accuracy is biased at the level of tree-level biomass estimation and development of stand-level models.

Stand biomass, as a function of resource availability, also depends on the other stand features. Allometric trajectories of biomass models are modified by stand age (Wirth et al. 2004; Lehtonen et al. 2004; Jagodziński et al. 2017). Biomass production and allocation also depend on stand density (Jagodziński and Oleksyn 2009a, b). Also, measures of stand density and dimensions, such as volume and basal area, are often used in modeling of stand biomass or BCEFs (Teobaldelli et al. 2009; Castedo-Dorado et al. 2012; Lehtonen et al. 2016; Jagodziński et al. 2017). Biomass allocation patterns (i.e., ratio of stem, branches, root and foliage masses), influencing stand biomass, also differ among habitat types. For example, site-specific allometric models were provided for peatlands (e.g., Laiho and Finér 1996; Hytönen and Aro 2012; Lehtonen et al. 2016), post-agricultural sites (e.g., Uri et al. 2007; Bijak et al. 2013; Jagodziński et al. 2014) or post-industrial sites (e.g., Pietrzykowski and Socha 2011; Kuznetsova et al. 2011; Jagodziński et al. 2014). The differences in growth of trees are also connected with provenances and genotypes, reflecting adaptations to soil and climate (e.g., Oleksyn et al. 1999; Bussotti et al. 2015; Chakraborty et al. 2016). For that reason, for large-scale inventories there is a need to provide generalized models (e.g., Wirth et al. 2004; Muukkonen 2007; Forrester et al. 2017), taking into account stand parameters which are usually available in forest inventory datasets.

Scots pine (Pinus sylvestris L.) is the most extensively distributed tree species in Eurasia (Houston Durrant et al. 2016). Its range covers an area from Spain to Eastern Siberia, from boreal to Mediterranean zones. This species typically dominates in poor habitats, connected with sandy soils and harsh climates; however, it has a broad ecological amplitude. Scots pine is able to colonize both extremely dry and extremely wet sites (Ellenberg 1988; Houston Durrant et al. 2016). For that reason, this species has high economic importance in Europe, with a 20% share of timber production (Houston Durrant et al. 2016). Nevertheless, our knowledge about its biomass production is disproportionally low relative to its geographical range and economic usage (Cienciala et al. 2006; Forrester et al. 2017). For Scots pine, Zianis et al. (2005) provided 205 allometric equations for the entirety of Europe. Forrester et al. (2017) found 107 allometric models useful for large-scale analyses, rejecting those not based on diameter at breast height. BCEFs and allometric equations for Scots pine biomass estimation were also reviewed by Neumann et al. (2016) and by Lakida et al. (1996). Numerous raw data of tree- and stand-level biomass are provided in Schepaschenko et al. (2017). Although accounting for large extents within Scots pine geographical range, as well as its variability in ecophysiology and patterns of biomass allocation (e.g., Oleksyn et al. 1999; Lehtonen 2005; Finér et al. 2007; Jagodziński and Oleksyn 2009a; Repola and Ahnlund Ulvcrona 2014; Bronisz and Zasada 2016), this number of biomass estimation models might still be insufficient for proper reporting of carbon sequestration in forest ecosystems. This insufficiency results from local specificities and biogeographic trends. Moreover, despite studies on stand biomass of Scots pine in Poland, at this time there are no published studies covering a whole chronosequence of this species biomass in Poland using both tree- and stand-level approaches. Thus, we aimed: (1) to develop biomass models for different aboveground biomass components at tree and stand levels, as well as biomass conversion and expansion factors (BCEFs) for Scots pine, (2) to assess the relationships between stand parameters and aboveground biomass and biomass conversion and expansion factors (BCEFs) and (3) to compare stand biomass obtained using BCEFs with models developed based on stand parameters (age, basal area, stand volume and mean height).

Methods

Study sites and material

We established 120 plots in pure even-aged Scots pine stands ranging from 3 to 117 years old and on all site types typical for this species, including long-term forest (forest growing on forest sites, in contrast to first-generation forests), post-agricultural and post-industrial sites (Table 1, S1). The plots were established during different studies; thus, their selection was usually connected with visual estimation as being representative of larger areas of neighborhood forests. Plot sizes varied from 0.008 to 0.6125 ha, to maintain sufficient numbers of trees to analyze stand features (at least 100 trees per plot). Sample stands grew in a wide range of habitats, mostly poor and mesic soils, which are potential sites of pine forests, and poor sites of oak forests, constituting optimal sites for Scots pine (Ellenberg 1988). Within our dataset, we also included stands established in habitats of fertile deciduous forests, where Scots pine was planted to increase its wood production. All plots were located in lowlands of Western and Central Poland, in a zone of transition between maritime and continental temperate climate (51.21–53. 92°N; 14.34–18.59°E).

Table 1 Overview of the study plots and stand characteristics (n=120)

Field and laboratory methods

Diameters of all trees and heights of at least 20% of trees were measured within each study plot. Within study plots, we selected and harvested four to twelve sample trees to obtain weight of the biomass components. We cut off branches of all trees and weighed branches with needles. After that, in the field we weighed subsamples of branches with needles (at least 5% for large trees and usually > 20% for smaller trees). Each subsample was divided into needles and branches in the laboratory and then dried and weighed. Sample trees were selected to be representative of the diameter distribution—one tree from each quantile. Samples of needles (FL), branches (BR) and stems (ST, including wood and bark) were oven-dried to a constant mass (65 °C). Then, plant material was weighed with an accuracy of 1 g. Using the proportion of dry and fresh masses of samples and total fresh masses of biomass components obtained in the field, we calculated total dry mass of each biomass component for a particular model tree. As biomass of dead branches and cones strongly varies among trees, study sites and growing seasons, we decided to exclude these tree components from analyses, focusing on biomass of FL, BR, ST and their sum—aboveground biomass (AB). For each plot, we measured stand density (N), mean height weighted by basal area (Hg), growing stock volume (V) and age (A).

Data analysis

We used site-specific Naslund’s models from the lmfor::ImputeHeights() function (Mehtatalo 2008) to impute heights for each tree (R Core Team 2017). We also calculated volume of the whole stem. We decided to take into account volume of whole stems instead of merchantable volume, as the youngest trees have no merchantable volume. We also did not account for merchantable branch biomass, as it was present only in 19 of 791 sample trees and constituted a maximum of 3.4% of total stem volume. Volume of each tree stem was calculated using diameter measurements of sections 1 m in length. We assumed the shape of each section as a cylinder and the shape of the last section as a cone. Due to different origins of datasets used, we did not measure volume for 219 young sample trees (< 20 years old) and 24 older sample trees (up to 47 years old). To estimate volume of these trees, we used a machine learning technique—the random forest model (Breiman 2001) implemented in the caret::train() function (Kuhn 2008) in R software (R Core Team 2017) for obtaining tree volume, based on the diameter, height and age of the trees. We are conscious that joining datasets with measured and predicted volume undermines assumptions of methods identity. However, in our study we did not focus on tree volume estimation, but we calculated it for the purpose of calculating stand volume. Thus, we decided not to remove 54 study plots to decrease uncertainty of our models. This model had RMSE = 0.002 and R2 = 0.98, and RMSE = 0.03 and R2 = 0.98 for younger and older trees, respectively. At the stand level, we calculated Hg, but for the 35 youngest stands we calculated mean height, as diameters at breast height were not available.

In our study, we provided three methods of Scots pine biomass estimation: individual tree biomass models, stand level biomass models and stand level BCEFs. For each biomass component, we calculated nonlinear regression models, using the allometric model, as in previous biomass studies (Zianis et al. 2005; Zasada et al. 2008; Bronisz and Zasada 2016; Bronisz et al. 2016; Jagodziński et al. 2018):

$$W = a \times D^{b} \left( {{\text{allometric}}\,{\text{model}}\,{\text{on}}\,D} \right),$$
(1)
$$W = a \times D^{b} \times H^{c} \left( {{\text{allometric}}\,{\text{model}}\,{\text{on}}\,D\,{\text{and}}\,H} \right),$$
(2)

where W—dry mass of the considered biomass component (kg), D—diameter (cm), H—height (m). For the youngest trees (up to 10 years old), we provided model accounting only for tree height, as the youngest trees may not reach breast height (130 cm):

$$W = a \times H^{b} \left( {{\text{allometric}}\,{\text{model}}\,{\text{on}}\,H} \right).$$
(3)

Similar models were used to calculate volume of each tree within stands.

We used the provided tree-level biomass models to estimate biomass of each stand studied. In stands with A < 11 years old, we used Eq. 3 and for older stands we used Eq. 2. For stand-level biomass analyses, we calculated biomass conversion and expansion factors (BCEFs) as BCEF = W/V, where W—dry mass of the considered biomass component (Mg ha−1), obtained as a sum of tree biomass components and V—growing stock volume (m3 ha−1). For assessment of relationships between BCEFs and stand characteristics, we also used power models:

$${\text{BCEF}} = a \times z^{b} \left( {{\text{e}} . {\text{g}} .\,{\text{Peichl}}\,{\text{and}}\,{\text{Arain}}\,2007} \right),$$
(4)

where z—stand characteristic (N, Hg, V or A), a and b—model coefficients. We used Eq. 4, with W instead of BCEF, in stand-level biomass models. To prevent lack of normality of residuals and problems with heteroscedasticity, we used iterative reweighted least squared regression implemented in the robustbase::nlrob() function (Myers 1986; Maechler et al. 2018). In this method, the regression curve is refitted in each iteration by weighting observations by the residuals. To check normality of standardized residuals and homoscedasticity, we examined diagnostic plots (standardized residuals versus fitted values). Furthermore, we compared them with volume-based functions and biomass estimations based on BCEFs. Because of variance inflation resulting from intercorrelations between predictors and problems with nonlinear regression convergence, we provided stand-level biomass models for only one independent variable. Although using multiple sample trees from multiple plots would justify application of mixed models (e.g., Repola 2009; Repola and Ahnlund Ulvcrona 2014; Bates et al. 2015), for purposes of model simplicity and applicability by practitioners we chose to use simple linear and nonlinear regression techniques. In all cases, we also calculated other measures of model quality—Akaike’s information criterion (AIC), mean error (ME), root mean squared error (RMSE) and modeling efficiency (MEf) using the following formulas:

$${\text{AIC}} = \frac{{2\log \left( {\hat{\pi }} \right)}}{n} + \frac{2k}{n}$$
(5)
$${\text{ME}} = \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \hat{y}_{i} } \right)}}{n}$$
(6)
$${\text{RMSE}} = \sqrt {\frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \hat{y}_{i} } \right)^{2} }}{n - k}}$$
(7)
$${\text{MEf}} = 1 - \frac{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \hat{y}_{i} } \right)^{2} }}{{\mathop \sum \nolimits_{i = 1}^{n} \left( {y_{i} - \bar{y}} \right)^{2} }}$$
(8)

where log—natural logarithm, \(\hat{\pi}\)—likelihood estimator of model fitness, k—number of model parameters, n—sample size, yiith observed dependent variable value, \(\hat{y}_{i}\)ith predicted dependent variable value, \(\bar{y}\)—mean value of the dependent variable. Although R2 for nonlinear models is biased and does not provide the proper amount of variance explained by the models, we decided to provide this measure as rough estimations of R2 of linear models (MEf).

For model development, we used nonlinear, least-square regression using the stats::nls() function (R Core Team 2017). For comparison of model quality, we presented AIC of BCEFs and stand biomass models and AIC0—AIC of the null model (intercept only), according to Mac Nally et al. (2018). All analyses were conducted using R software (R Core Team 2017).

Results

Biomass of stands

Tree-level biomass models had coefficients of determination (MEf) ranging from 0.836 to 0.990, with an average of 0.922 ± 0.016 (Table 2). The highest values were for total aboveground biomass and stem biomass, and the lowest for branches and foliage biomass. Models based on height only, obtained for young trees (up to 10 years old) had lower accuracy than those for older trees. Total aboveground biomass of the stands studied ranged from 0.11 to 291.25 Mg ha−1, with an average of 65.62 ± 7.12. Biomass of branches ranged from 0.01 to 32.89 Mg ha−1, with an average of 9.64 ± 0.68. Biomass of needles ranged from 0.05 to 17.06 Mg ha−1, with an average of 5.43 ± 0.32. Biomass of stems ranged from 0.08 to 256.10 Mg ha−1, with an average of 53.39 ± 6.37.

Table 2 Final allometric equations for biomass components developed for each age class

Total aboveground biomass of Scots pine stands increased with increasing stand height, growing stock volume and age, and decreased with increasing stand density (Fig. 1; Table 3). Similar trends were found for particular biomass components; however, increases in foliage and branch biomass quickly reached a plateau, with little to no increase in older stands. For total aboveground biomass and all compartments, the best model was based on stand volume (MEf from 0.322 in FL to 0.985 in ST). Other biomass components were also strongly correlated with stand height. The weakest predictor of biomass, regardless of the component considered, was stand density (MEf from 0.014 in FL to 0.633 in ST).

Fig. 1
figure 1

Relationships between stand characteristics and stand biomass components: total aboveground (AB), branches (BR), foliage (FL) and stem (ST). Parameters of nonlinear regression models are presented in Table 3

Table 3 Relationships between stand characteristics (predictors) and stand biomass components (Mg ha−1), modeled using Eq. 4

Biomass conversion and expansion factors

BCEFs of Scots pine for total aboveground biomass ranged from 0.3173 to 4.5022, with an average of 0.6207 ± 0.0423, for branch biomass from 0.0287 to 1.3933, with an average of 0.1427 ± 0.0138, for foliage biomass from 0.0087 to 1.1576, with an average of 0.1538 ± 0.0155, and for stem biomass from 0.1667 to 1.9146, with an average of 0.3984 ± 0.0201. BCEF values were constant for older stands, but in the youngest stands its values were decreasing with increasing stand age, height and growing stock volume, and increasing with increasing density (Fig. 2; Table 4). However, the rate of these dynamics was distinct only at low values of these parameters and reached a plateau after c.a. 30 years old. The best predictor of BCEFs was stand volume (MEf from − 0.044 to 0.459), except for branches where the best was stand density (MEf = 0.099). These relationships were weak; however, in most cases RMSE was lower than 0.001 Mg m−3, which was connected with low variability of BCEFs in the higher part of the range of the parameters studied. For that reason, in the older stands (above 10 years old), we may assume mean BCEFs for AB—0.4767 ± 0.00698, for BR—0.1224 ± 0.0116, for FL—0.0844 ± 0.0120 and for ST—0.3648 ± 0.0067.

Fig. 2
figure 2

Relationships between stand characteristics and BCEFs for biomass components: total aboveground (AB), branches (BR), foliage (FL) and stem (ST). Parameters of nonlinear regression models are presented in Table 4

Table 4 Relationships between stand characteristics (predictors) and BCEFs for particular biomass components (Mg m−3), modeled using Eq. 4

Discussion

Effect of stand age

Our results indicated the important role of age in differing relationships between stand features and biomass, as well as BCEFs. The most distinct effect of age was found in the youngest stands, where differentiation of BCEFs and biomass was the highest. This is similar to the results of Jagodziński et al. (2017) for young stands of Betula pendula, where the estimated age breakpoint was c.a. 5 years old. In our study, there was a strong decrease in BCEFs after c.a. 10 years of stand development and after that time BCEFs were more or less constant, similar to a study focused only on young trees (Jagodziński et al. 2018). Also Schepaschenko et al. (2018) revealed this trend using a large dataset from Eurasia. This trend was confirmed for age-class-specific biomass models based on stand parameters (Fig. 3), where strong relationships occurred in the younger stands.

Fig. 3
figure 3

Distributions of deviations between biomass calculated using stand parameter biomass models (Table 3; black dots) and using BCEF models using different stand parameters (columns) multiplied by growing stock volume (Table 4; open dots) for total aboveground biomass (AB), branches (BR), foliage (FL) and stem (ST) biomass. Lines indicate 1:1 proportions

Higher variability of younger stands is connected with different conditions of juvenile growth. This may result from management treatment or site conditions. Moreover, in smaller plants, there are higher proportions of foliage and branches (Mikšys et al. 2007; Uri et al. 2012; Poorter et al. 2015), the biomass of which is less uniform than stem biomass, which constitutes most of the biomass in older trees. Foliage biomass strongly depends on different site-specific factors (Poorter and De Jong 1999; Jagodziński and Kałucka 2008; Rademacher et al. 2009); thus, its variation affects model accuracy. A similar pattern was found for BCEFs by Lehtonen et al. (2004). However, our previous paper (Jagodziński et al. 2018), where for young stands we used site-specific allometric tree-level models, showed similar patterns. Therefore, this effect is consistent regardless of the accuracy of biomass estimation method. Similarly low coefficients of determination for BCEF models were provided by Lehtonen et al. (2004). As older trees are composed mainly of stem biomass, which is strictly related to growing stock volume, this constancy may suggest using constant BCEF values for these trees. Effects of stem mass ratio may be confirmed by higher accuracy of models for BCEFs without leaves than with leaves, provided by Teobaldelli et al. (2009). Constant lines for modeled BCEFs for older Scots pines were also provided by other authors (Lehtonen et al. 2004; Jalkanen et al. 2005; Teobaldelli et al. 2009; Wojtan et al. 2011).

Effect of other stand features

In our study, the most important features influencing biomass and BCEFs were stand height and growing stock volume. These two factors are mostly related to stem biomass, which constitutes the majority of aboveground biomass (Poorter et al. 2015). As these parameters within the same age are site-dependent, we may assume that these models allow predictions to overcome site-specific conditions at the stand level and may also be better than preparing models for different site indices, which are different in particular countries. Other studies revealed high importance of site index in shaping stand biomass (e.g., Shepashenko et al. 1998; Teobaldelli et al. 2009; Schepaschenko et al. 2018). However, when we compare heteroscedasticity of models based on height and growing stock volume, those based on height seem to be more homoscedastic (Fig. 3). Stand density was the weakest predictor of stand biomass, similar to Castedo-Dorado et al. (2012). Despite its importance in shaping biomass allocation (Jagodziński and Oleksyn 2009b), its primary impact is connected with shaping growth conditions, especially single tree dimensions—at higher density individual trees have lower diameters and higher heights (Jagodziński and Oleksyn 2009a); therefore, this parameter is also included in stand height and growing stock volume.

Accuracy of models and applicability

Our study confirmed higher accuracy of tree-level than stand-level methods. Our study also showed that biomass estimation based on stand-level biomass models and BCEFs gives similar accuracy, when the best models are taken into account. However, for branch and foliage biomass, tree-stand-based models give overestimated results, and BCEFs give underestimated results, compared to biomass calculated using tree-level approaches (Fig. 3). A similar trend, but with lower magnitude of bias, was found for volume-based models. However, due to high randomness, foliage and branch biomass are the most difficult to estimate, both at the levels of trees and stands, similar to other studies (e.g., Zianis et al. 2005; Teobaldelli et al. 2009; Wojtan et al. 2011; Castedo-Dorado et al. 2012).

Regional stand-level biomass models obtained in our study may be applied in large-scale inventories, including those using airborne laser scanning and other remote sensing techniques. These inventories can easily and quickly provide large amounts of data. However, they provide the highest accuracy for height measurements (Niemi et al. 2015; Kauranne et al. 2017). For that reason, high accuracy of models based on stand height may allow for increasing applicability of these methods. Moreover, it may be used for young stands or when data on stand volume is not available, due to lack of ownership interests or remote localities (Jagodziński et al. 2017, 2018).

Conclusions

Our study provided a comprehensive set of tree- and stand-level biomass models. We also described how stand biomass increases with increasing height, growing stock volume and age, and with decreasing stand density during stand development. Our study indicated highly dynamic increases of biomass and decreases of BCEFs in the youngest phase of stand growth and relative stabilization in later phases. Our models showed that accurate biomass assessment may be conducted using airborne methods, providing data on stand height (Jagodziński et al. 2018).