1 Introduction

Biomass is the mass of biological origin, living or dead, animal or plant, whereas the forest biomass is applicable to all existing plant mass in the forest or the arboreal fraction (Sanquetta and Balbinot 2002). The laborious and costly process of measuring tree biomass in the forest is often the motivation for using empirical models that allow biomass to be estimated from dendrometric variables. The study of the biomass modeling is associated with the subject of allometry. West et al. (1997) and Broad (1998) present a general theory for the allometric scale laws.

Scientists, in turn, are interested in knowing the partitioned biomass of the tree, namely the proportion distributed among its components: root, bole, bole bark, branches, and leaves. For this purpose, biomass measurements are performed on a per component basis and the total biomass is considered the sum of all masses. With the data in hand, the modeling study can be carried out to find the best performing equation for estimating biomass components and total biomass. Thus, the biomass of a given area can be estimated using inventory measurements of diameter at breast height and total height.

The problem proposed by the authors in this paper is to choose the best technique for building the allometric equations. When modeling the biomass components and total biomass, in most cases, the adjustment of models is performed independently. That is, isolated fitting is performed for each component. Using these equations, the sum of the biomass components does not produce the same result as that obtained using the total biomass equation. This generates inconsistent results; however, this approach was found in 80 % of 50 studies published in scientific journals in recent years.

The sum of the biomass component estimates should result in the same value as that estimated from the total biomass equation. This implies that the equations for biomass components and total biomass must be estimated together. Simultaneous estimation, considering the additivity principle of (Kozak, 1970; Reed and Green, 1985), is a technique that can be used to solve the inconsistency in these estimates.

Zellner (1962) proposed the seemingly unrelated regressions (SUR) approach, which utilizes simultaneous estimation. This technique provides a statistically correlated system of equations with restrictions (Parresol, 1999) and, in the present case, the fitting of the equations for the biomass components and for the total biomass imposes restrictions on the parameters, ensuring additivity. Specific methods to ensure additivity of regression functions are presented by Cunia (1979) and Jacobs and Cunia (1980). The SUR procedure is discussed at length by Reed (1986), Gallant (1987), and Srivastava and Giles (1987).

The plantations of Pinus elliottii Engelm in Brazil were established for timber production, aiming at various industrial purposes. The use of allometric equations has allowed the emergence of scientific papers related to estimates of tree biomass components (Witschoreck, 2008; Valerio, 2009; Giongo et al., 2011). However, it is not yet frequent to use equations with simultaneous adjustment for the genus Pinus. Recently, Dong et al. (2014) used the process nonlinear seemingly unrelated regressions (NSUR) in estimates of components of biomass for Pinus koraiensis Sieb. et Zucc, demonstrating success in applying this methodology.

The independent fitting accounts for the correlation between the biomass components, which ignores the principle of additivity and may result in inconsistencies in the overall estimate of the tree biomass. Thus, in this study, we tested the following hypothesis: Simultaneous fitting rather than independent fitting results in a more effective estimator for the equations of biomass components and total biomass.

The objective of this study is to evaluate the performance of simultaneous estimation (which considers the principle of additivity) for the independent modeling of individual components and the total biomass of Pinus elliottii Engelm trees.

2 Materials and methods

2.1 Data source

In this study, 30 individuals of the species Pinus elliottii Engelm from forest stands in the south central region of the state of Paraná, Brazil, were used. The plantations were established in the municipality of General Carneiro, coordinates 26° 25′ 44″ South, 51° 19′ 2″ West, south central state of Paraná, Brazil. The climate of the region, according to the classification of Köppen, is Humid Subtropical Mesothermal (Cfb), characterized by cool summers and winters with frosts, no dry season. Organic soils are characteristically hydromorphic, predominantly Entisols, Inceptisols, and Ultisols. Individual trees were selected in the diameter range of 12.73 to 30.88 cm, between the ages of 6 and 20 years.

The individual biomass was measured using the destructive method. Initially the diameter at breast height (1.30 meters from ground level - cm) was measured with a dendrometric tape (Forestry) and then individuals were felled, and the total height (m) of each tree was measured using a measuring tape (Richter).

The fresh biomass was obtained from weighing the five components of each tree separately: bole without bark, bole bark, branches, leaves, and roots. Each component was weighed using a dynamometer (Instruntemp) with a 100-kg capacity and accurate to within 100 g. To determine the root biomass, a trench was dug to the extent of the crown coverage, projected on the ground, and to a depth of 50 cm. All roots in the trench were collected greater than a diameter limit of 2 mm. The stump biomass was considered as root biomass.

A 500 g sample of each component and of each individual was collected to determine moisture content, which was obtained by measuring the fresh weight and dry weight on an analytical balance (Shimadzu). The dry weight was obtained after the samples were dried in a convection oven at 100 °C to a constant weight at 15 days. Thus, taking into consideration the moisture content of the samples, it was possible to establish a relationship to the fresh biomass, which resulted in the dry biomass of each tree’s components. The total biomass was the sum of each component’s masses.

A 4 cm thick disk was collected at the bole base, which was sanded to visually count growth rings and determine tree age. This information was also compared to the historical records of the plantations.

2.2 Estimation of parameters using generalized nonlinear regression model (EGNLS)

The model used to estimate biomass components were:

$$ {\widehat{y}}_{\mathrm{leaves}}={b}_0*{d}^{b_1}*{\left(\frac{1}{h}\right)}^{b_2}+{e}_i $$
(1)
$$ {\widehat{y}}_{\mathrm{branches}} = {b}_0*{\left(\frac{1}{d}\right)}^{b_1}*{\left(\frac{1}{h}\right)}^{\kern0.5em {b}_2}+{e}_i $$
(2)
$$ {\widehat{y}}_{\mathrm{bole}} = {b}_0*{\left({d}^2h\right)}^{b_1}+{e}_i $$
(3)
$$ {\widehat{y}}_{\mathrm{bark}} = {b}_0*{d}^{b_1}+{e}_i $$
(4)
$$ {\widehat{y}}_{\mathrm{root}} = {b}_0*{\left({d}^2h\right)}^{b_1}+{e}_i $$
(5)
$$ {\widehat{y}}_{\mathrm{total}} = {b}_0*{\left({d}^2h\right)}^{b_1}+{e}_i $$
(6)

where ŷ j  = estimates of a given component or total biomass (in kg); d = diameter at breast height (cm); h = total height (m); b i  = coefficients of the model.

These models were selected in a previous study for the preparation of this work. The selection of the best model for the biomass of each component and total was based on the quality of fit indicators: adjusted coefficient of determination (R 2adj ), standard error of estimate (S yx  (%)) (presented by Steel et al. 1996), graphical analysis of the residuals. The White test was applied to check the homogeneity of the residuals.

The first step is to calculate the partial derivative matrix (Z(β)) defined by:

$$ Z\left(\beta \right) = \frac{\partial f\left(X,\beta \right)}{\partial {\beta}^{\hbox{'}}} = \left[\begin{array}{ccc}\hfill \frac{\partial f\left({x}_1,\beta \right)}{\partial {\beta}_1}\hfill & \hfill \cdots \hfill & \hfill \frac{\partial f\left({x}_1,\beta \right)}{\partial {\beta}_k}\hfill \\ {}\hfill \vdots \hfill & \hfill \ddots \hfill & \hfill \vdots \hfill \\ {}\hfill \frac{\partial f\left({x}_t,\beta \right)}{\partial {\beta}_1}\hfill & \hfill \cdots \hfill & \hfill \frac{\partial f\left({x}_t,\beta \right)}{\partial {\beta}_k}\hfill \end{array}\right] $$
(7)

where X are the independent variables and β the coefficients of the models.

The β estimate is obtained, minimizing the residual sum of squares (S(β)):

$$ S\left(\beta \right) = \varepsilon^{\prime}\varepsilon ={\left[y-f\left(X,\beta \right)\right]}^{\prime}\left[y-f\left(X,\beta \right)\right] $$
(8)

where y is the observed dependent variable and f(X, β) is the estimated dependent variable.

The solution of this system is achieved by employing an algorithm; see Greene (1999) and Judge et al. (1988). This procedure was applied using proc model in Statistical Analysis System (SAS) software.

The confidence interval (CI) of the average value, at 95 % probability, on two side test, is given by:

$$ CI\left[\widehat{y}\pm {t}_{\alpha /2}\sqrt{\ \widehat{v}\left(\widehat{y}\right)}\right] = 1-\alpha $$
(9)

The variance (\( \widehat{v} \)) to the mean value (ŷ t ) is estimated by:

$$ \widehat{v}\left({\widehat{y}}_t\right)=z{(b)}_t^{\prime }\ {S}^2(b)\ z{(b)}_t $$
(10)

The variance-covariance matrix (S 2(b)) is given by:

$$ {S}^2(b)={\widehat{\sigma}}^2{\left[Z{(b)}^{\hbox{'}}\ Z(b)\right]}^{-1} $$
(11)

The variance is given by:

$$ {\widehat{\sigma}}^2 = \frac{S(b)}{T-K} = \frac{{\left[y-f\left(X,b\right)\right]}^{\hbox{'}}\left[y-f\left(\mathrm{X},\mathrm{b}\right)\right]}{T-K}, $$
(12)

where T is the number of observations and K is the number of parameters in the estimated model.

The prediction interval (PI) (new, i.e., n), at 95 % probability, on two side test, is calculated by:

$$ PI\left[{\widehat{y}}_{i(n)}\pm {t}_{\alpha /2}\sqrt{\ \widehat{v}\left({\widehat{y}}_n\right)}\right]=1-\alpha $$
(13)

The variance is given by:

$$ \widehat{v}\left({\widehat{y}}_{t(n)}\right)={\widehat{\sigma}}^2 + z{(b)}_t^{\hbox{'}}\ {S}^2(b)\ z{(b)}_t $$
(14)

2.3 Nonlinear seemingly unrelated regressions

To achieve simultaneous modeling, at first, it is necessary to select the best performing model for each component of the biomass. In the second step, the selected models for each component are adjusted simultaneously, so total biomass of the equation is a function of the independent variables in equations for each component i, thus placing restrictions on the parameters. This can be described as:

$$ \begin{array}{l}{y}_i={f}_i\left({X}_i,\ {\beta}_i\right) + {\varepsilon}_i,\ i = 1, \dots,\ k\hfill \\ {}{y}_{\mathrm{total}} = {f}_{\mathrm{total}}\left({X}_1,{X}_2,\dots, {X}_{k,\kern0.5em }\ {\beta}_1,\ {\beta}_2, \dots, \kern0.5em {\beta}_n\right) + {\varepsilon}_k\hfill \end{array} $$
(15)

Or alternatively:

$$ {y}_{\mathrm{leaves}} = {f}_{\mathrm{leaves}}\left(X,\ \beta \right) + {\varepsilon}_{\mathrm{leaves}}{y}_{\mathrm{branches}} = {f}_{\mathrm{branches}}\left(X,\ \beta \right) + {\varepsilon}_{\mathrm{branches}}{y}_{\mathrm{bole}} = {f}_{\mathrm{bole}}\left(X,\ \beta \right) + {\varepsilon}_{\mathrm{bole}}{y}_{\mathrm{bark}} = {f}_{\mathrm{bark}}\left(X,\ \beta \right) + {\varepsilon}_{\mathrm{bark}}{y}_{\mathrm{total}} = {f}_{\mathrm{total}}\left(X,\ \beta \right) + {\varepsilon}_{\mathrm{total}} $$
(16)

The estimate of the vector β employing NSUR that corresponds to the values of β, which minimize the residual sum of squares, is given by:

$$ R\left(\beta \right) = {\varepsilon}^{\hbox{'}}\left({{\displaystyle \sum}}^{-1}\otimes I\right)\varepsilon = {\left[y-f\left(X,\ \beta \right)\right]}^{\hbox{'}}\left({{\displaystyle \sum}}^{-1}\otimes I\right)\ \left[y-f\left(X,\ \beta \right)\right] $$
(17)

where R(β) = r to impose restrictions on the parameters.

The calculation is performed using iterative convergence to minimize the residual sum of squares and can be, for example, the minimization method using the Gauss-Newton, Marquardt (SAS 1999a, b, 2002). This procedure was performed in SAS software, and the technique consisted of applying the use of generalized least squares method called nonlinear seemingly unrelated regressions (NSUR) iteratively using the option ITSUR of the procedure MODEL (Institute Inc 1999a, b, 2002). The application in this case is presented in Appendix.

F(β) means the matrix of partial derivatives of the residuals on the same parameters such that:

$$ F{\left(\beta \right)}^{\hbox{'}} = \frac{\partial {\varepsilon}^{\hbox{'}}}{\partial \beta } = \left[\frac{\partial {f}_1^{\hbox{'}}}{\partial \beta },\ \frac{\partial {f}_2^{\hbox{'}}}{\partial \beta }, \dots,\ \frac{\partial {f}_M^{\hbox{'}}}{\partial \beta}\right] $$
(18)

It is assumed that the vector of random variables ε follows a distribution with zero mean (E(ε i ) = 0), constant variance and covariance, then:

$$ \left(E\left(\varepsilon {\varepsilon}^{\hbox{'}}\right)\right) = {\displaystyle \sum}\otimes IeE\left({\varepsilon}_i{\varepsilon}_j^{\hbox{'}}\right) = {\sigma}_{ij}I, $$
(19)

where ⊗ is the Kronecker product.

The unknown covariances from the model established by matrix system σ ij are estimated by the expression:

$$ {\widehat{\sigma}}_{ij} = \frac{1}{\sqrt{\left(T-{K}_i\right)}\ \sqrt{\left(T-{K}_j\right)}\kern0.5em }{e}_i^{\hbox{'}}{e}_j $$
(20)

where T is the number of observations. K i and K j are the number of parameters of each equation i and j.

Such expression provides constant variance-covariances estimates, in which e i  = y i  − f i (X i β i ) are the residuals obtained by estimated generalized nonlinear least squares (EGNLS)

The estimated variance-covariance matrix of the parameters is given by:

$$ {\widehat{{\displaystyle \sum}}}_b = {\left[F{(b)}^{\hbox{'}}\left({\widehat{{\displaystyle \sum}}}^{-1}\otimes I\right)\ F(b)\right]}^{-1} $$
(21)

The variance of the NSUR is given by:

$$ {\widehat{\sigma}}_{\mathrm{NSUR}}^2=\frac{R(b)}{MT-K} = \frac{e^{\hbox{'}}\left({\widehat{{\displaystyle \sum}}}^{-1}\otimes I\right)e}{MT-K} $$
(22)

where M is the number of equations used.

The estimated variance from the equation i of the system is given by:

$$ {S}_{{\hat{y}}_i}^2 = {f}_i\ {(b)}^{\hbox{'}}\widehat{{\displaystyle \sum }}{\sigma}_b{f}_i(b) $$
(23)

In which f i (b) is a column vector containing the partial derivatives in order to the parameters of the equation i.

The confidence interval (CI) of a mean value, at 95 % probability, on two side test, is given by:

$$ \mathrm{C}I\left[\widehat{y}\pm {t}_{\alpha /2}\sqrt{\ {S}_{{\hat{y}}_i}^2}\right] = 1-\alpha $$
(24)

The prediction interval (new, i.e., n) (PI), at 95 % probability, on two side test, is calculated by:

$$ PI\left[{\widehat{y}}_{i(n)}\pm {t}_{\alpha /2}\sqrt{\ {S}_{{\widehat{y}}_i}^2+{\widehat{\sigma}}_{\mathrm{NSUR}}^2{\widehat{\sigma}}_{ii}}\right] = 1-\alpha $$
(25)

2.4 Effect of independent and simultaneous adjustment in modeling the tree biomass

Comparison of the equations for the independent estimation (performed by EGNLS) and simultaneous estimation (performed by NSUR) was performed. Regarding independent estimation, the effect of lack of additivity on estimates of the overall biomass at individual and population level was evaluated. The estimate of the tree total biomass must be equal to the sum of the biomass components. This analysis was not performed for simultaneous estimation because the additivity property is already ensured.

The difference (absolute and relative) of the results generated by the total biomass equation and the sum of the results of the equations of the biomass components were verified at the individual tree level. This analysis was also performed at the stand level for ages 6, 10, 15, and 20 years. For that purpose, we have simulated the diameter at breast height and total height of the forest stands using the SisPinus software (Oliveira et al. 1989). The input data in the simulation were dominant height of 21 m, initial survival percentage of 95 %, and plantation with homogeneity of five. From the diameter at breast height, height, and population survival data, one can calculate the total and biomass components for an area of 1 ha.

In addition, the simulated data were used to evaluate the effect of the methods ENGLS and SUR on the estimates of the biomass components, confidence interval, and prediction interval, extrapolated to hectare.

3 Results

3.1 Estimation of parameters using nonlinear regression—estimated generalized nonlinear least squares (EGNLS)

White’s test revealed that the residuals have presented homogeneous distribution for all components (significant at 99 % probability). The equations and statistics resulting from these adjustments were:

$$ {\widehat{y}}_{\mathrm{leaves}}=0.070729*{d}^{2,042984}*{\left(\frac{1}{h}\right)}^{0,4153}{R}_{\mathrm{adj}}^2 = 0.74,\ {\mathrm{S}}_{\mathrm{yx}}=26.49\%,\ \mathrm{White}=13.{97}^{\mathrm{ns}} $$
(26)
$$ {\widehat{y}}_{\mathrm{branches}} = 0.008359*{\left(\frac{1}{d}\right)}^{-2.09814}*{\left(\frac{1}{h}\right)}^{-0,43851}{R}_{\mathrm{adj}}^2 = 0.71,\ {S}_{\mathrm{yx}}=45.00\%,\ \mathrm{White}=17.{17}^{\mathrm{ns}} $$
(27)
$$ {\widehat{y}}_{\mathrm{bole}} = 0.005143*{\left({d}^2h\right)}^{1.097758}{R}_{\mathrm{adj}}^2 = 0.96,\ {S}_{\mathrm{yx}}=19.30\%,\ \mathrm{White}=4.{26}^{\mathrm{ns}} $$
(28)
$$ {\widehat{y}}_{\mathrm{bark}}=\kern0.5em 0.006748*{d}^{2.591151}{R}_{\mathrm{adj}}^2 = 0,92,\ {S}_{\mathrm{yx}}=22.96\%,\ \mathrm{White}=6.{19}^{\mathrm{ns}} $$
(29)
$$ {\widehat{y}}_{\mathrm{root}} = 0.013898*{\left({d}^2h\right)}^{0.815147}{R}_{\mathrm{adj}}^2 = 0.90,\ {S}_{\mathrm{yx}}=23.15\%,\ \mathrm{White}=6.{96}^{\mathrm{ns}} $$
(30)
$$ {\widehat{y}}_{\mathrm{total}} = 0.0289*{\left({d}^2h\right)}^{0.9671}{R}_{\mathrm{adj}}^2 = 0.96,\ {S}_{\mathrm{yx}} = 16.77\%,\ \mathrm{White}=4.{68}^{\mathrm{ns}} $$
(31)

Incompatibility was noted between the estimations of the components and the total biomass when the equations for independent fitting were used, and consequently, one of the properties that ensure the appropriate biomass modeling was not assured. The absence of additivity represents the existence of inconsistencies in the overall estimate of the tree biomass, since the estimated values from the biomass equations for the components are not equal to that estimated from the equation for total tree biomass.

These differences inside a tree varied from −3,02 to 4.37 kg and occurred for the set of diameters at breast height and total height used when applying the equation. When the diameter at breast height and total height were simulated per hectare in pine forest stands of age 6 to 20 years, and after applying these data in the equations for total and biomass components (estimated from total biomass equations − estimated from components), the differences reached up to 7.63 % (Table 1).

Table 1 The difference (absolute and relative) of the result generated by the equation of total biomass for Pinus elliottii Engelm. and sum of the results of the equations for biomass components in an area of 1 ha

A solution to force additivity of the equations solved by EGNLS, is consider the total biomass as the sum of all biomass components, namely: ŷ total  = ŷ leaves + ŷ branches + ŷ bole + ŷ bark + ŷ root.

Take a tree with dbh = 21 cm and ht = 16 m, and by applying these values in the equations of 26–30, we have:

$$ {\widehat{y}}_{\mathrm{leaves}}=11.24\ \mathrm{kg} $$
(32)
$$ {\widehat{y}}_{\mathrm{branches}}=16.76\ \mathrm{kg} $$
(33)
$$ {\widehat{y}}_{\mathrm{bole}}=86.30\ \mathrm{kg} $$
(34)
$$ {\widehat{y}}_{\mathrm{bark}}=18.00\ \mathrm{kg} $$
(35)
$$ {\widehat{y}}_{\mathrm{root}}=19.06\ \mathrm{kg} $$
(36)
$$ {\widehat{y}}_{\mathrm{total}}=11.24+16.76+86.30+18.00+19.06=151.36\ \mathrm{kg} $$
(37)

The vector of partial derivatives resulted in:

$$ z{(b)^{\hbox{'}}}_{\mathrm{leaves}}=\left[\begin{array}{ccc}\hfill {d}^{b_1}{\left(\frac{1}{h}\right)}^{b_2}\hfill & {b}_0{d}^{b_1} \ln (d){\left(\frac{1}{h}\right)}^{b_2}& {b}_0{d}^{b_1} \ln \left(\frac{1}{h}\right){\left(\frac{1}{h}\right)}^{b_2}\end{array}\right]=\left[\begin{array}{ccc}\hfill 158.93\hfill & \hfill 34.22\hfill & \hfill -31.17\hfill \end{array}\right] $$
(38)
$$ z{(b)^{\hbox{'}}}_{\mathrm{branches}}=\left[\begin{array}{ccc}\hfill {\left(\frac{1}{d}\right)}^{b_1}{\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{\left(\frac{1}{d}\right)}^{b_1} \ln \left(\frac{1}{d}\right){\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{\left(\frac{1}{d}\right)}^{b_1} \ln \left(\frac{1}{h}\right){\left(\frac{1}{h}\right)}^{b_2}\hfill \end{array}\right]=\left[\begin{array}{ccc}\hfill 2005.49\hfill & \hfill -51.04\hfill & \hfill -46.48\hfill \end{array}\right] $$
(39)
$$ z{(b)^{\hbox{'}}}_{\mathrm{bole}}=\left[\begin{array}{cc}\hfill {\left({d}^2h\right)}^{b_1}\hfill & \hfill {b}_0{\left({d}^2h\right)}^{b_1} \ln \left({d}^2h\right)\hfill \end{array}\right]=\left[\begin{array}{cc}\hfill 16,779.77\hfill & \hfill 764.74\hfill \end{array}\right] $$
(40)
$$ z{(b)^{\hbox{'}}}_{\mathrm{bark}}=\left[\begin{array}{cc}\hfill {d}^{b_1}\hfill & \hfill {b}_0{d}^{b_1} \ln (d)\hfill \end{array}\right] = \left[\begin{array}{cc}\hfill 2667.29\hfill & \hfill 54.80\hfill \end{array}\right] $$
(41)
$$ z{(b)^{\hbox{'}}}_{\mathrm{root}}=\left[\begin{array}{cc}\hfill {\left({d}^2h\right)}^{b_1}\hfill & \hfill {b}_0{\left({d}^2h\right)}^{b_1}\hfill \end{array} \ln \left({d}^2h\right)\right]=\left[\begin{array}{cc}\hfill 1371.33\hfill & \hfill 168.89\hfill \end{array}\right] $$
(42)

The variance-covariance matrix b, obtained by EGNLS regression for biomass components, resulted in:

$$ {S}^2{(b)}_{\mathrm{leaves}} = \left[\begin{array}{ccc}\hfill 0.0020404\hfill & \hfill -0.0138537\hfill & \hfill -0.0053848\hfill \\ {}\hfill -0.0138537\hfill & \hfill\ 0.1596963\hfill & \hfill\ 0.1083941\hfill \\ {}\hfill -0.0053848\hfill & \hfill\ 0.1083941\hfill & \hfill\ 0.0930543\hfill \end{array}\right] $$
(43)
$$ {S}^2{(b)}_{\mathrm{branches}} = \left[\begin{array}{ccc}\hfill 0.0001112\hfill & \hfill 0.0039757\hfill & \hfill 0.0000622\hfill \\ {}\hfill 0.0039757\hfill & \hfill\ 0.4036055\hfill & \hfill - 0.280671\hfill \\ {}\hfill 0.0000622\hfill & \hfill - 0.280671\hfill & \hfill\ 0.3066558\hfill \end{array}\right] $$
(44)
$$ {S}^2{(b)}_{\mathrm{bole}}=\left[\begin{array}{cc}\hfill 0.0000111\hfill & \hfill -0.000222\hfill \\ {}\hfill -0.000222\hfill & \hfill 0.0044433\hfill \end{array}\right] $$
(45)
$$ {S}^2{(b)}_{\mathrm{bark}}=\left[\begin{array}{cc}\hfill 0.0000183\hfill & \hfill -0.0008231\hfill \\ {}\hfill -0.0008231\hfill & \hfill 0.0370452\hfill \end{array}\right] $$
(46)
$$ {S}^2{(b)}_{\mathrm{root}} = \left[\begin{array}{cc}\hfill 0.0000794\hfill & \hfill - 0.0005937\hfill \\ {}\hfill -0.0005937\hfill & \hfill\ 0.0044509\hfill \end{array}\right] $$
(47)

The variance for the estimation of biomass components resulted in:

$$ \widehat{v}\left({\widehat{y}}_{\mathrm{leaves}}\right)=0.3798\ {\mathrm{kg}}^2 $$
(48)
$$ \widehat{v}\left({\widehat{y}}_{\mathrm{branches}}\right)=3.9702\ {\mathrm{kg}}^2 $$
(49)
$$ \widehat{v}\left({\widehat{y}}_{\mathrm{bole}}\right)=26.4069\ {\mathrm{kg}}^2 $$
(50)
$$ \widehat{v}\left({\widehat{y}}_{\mathrm{bark}}\right)=0.8215\ {\mathrm{kg}}^2 $$
(51)
$$ \widehat{v}\left({\widehat{y}}_{\mathrm{root}}\right)=1.2656\ {\mathrm{kg}}^2 $$
(52)

The total biomass is given by the sum of the biomass components (ŷ total = ŷ leaves + ŷ branches + ŷ bole + ŷ bark + ŷ root), consequently the variance is given by:

$$ \widehat{v}\left({\widehat{y}}_{\mathrm{total}}\right)={\displaystyle \sum }v\left({\widehat{y}}_i\right)+2{\displaystyle \sum }{\displaystyle \sum }cov\left({\widehat{y}}_i,{\widehat{y}}_j\right) $$
(53)

In which cov(ŷ i , ŷ j ) is calculated by:

$$ cov\left({\widehat{y}}_i,{\widehat{y}}_j\right)={\widehat{\rho}}_{y_i{y}_j}\sqrt{v\left({\widehat{y}}_i\right)v\left({\widehat{y}}_j\right)}. $$
(54)

where \( {\widehat{\rho}}_{y_i{y}_j} \) means expected correlation between y i and y j , i.e.,:

Component

Leaves

Branches

Bole

Bark

Root

Total

Leaves

1

     

Branches

0.8019

1

    

Bole

0.7823

0.8928

1

   

Bark

0.8352

0.8584

0.9666

1

  

Root

0.7907

0.8162

0.9357

0.9191

1

 

Total

0.8155

0.9166

0.9968

0.9725

0.9438

1

The application of the Eq. 54 resulted in:

$$ \begin{array}{l}\widehat{v}\left({\widehat{y}}_{\mathrm{total}}\right)=0.3798+3.9702 + 26.4069+0.8215+1.2656 + 2\\ {}*0.8019\ \sqrt{0.3798+3.9702}+2\\ {}*0.7823\ \sqrt{0.3798+26.4069}+2\\ {}*0.8352\sqrt{0.3798+0.8215}+2\\ {}*0.7907\ \sqrt{0.3798+1.2656}+2\\ {}*0.8928\ \sqrt{3.9702+26.4069}+2\\ {}*0.8584\sqrt{3.9702+0.8215}+2\\ {}*0.8162\ \sqrt{3.9702+1.2656} + 2\\ {}*0.9666\sqrt{26.4069+0.8215}+2\\ {}*0.9357\ \sqrt{26.4069+1.2656}+2\\ {}*0.9191\sqrt{0.8215+1.2656} = 88.0684\ {\mathrm{kg}}^2\end{array} $$
(55)

Estimates of confidence intervals at 95 % probability level resulted in:

$$ {CI}_{\mathrm{leaves}}=11.24\pm 2.04\sqrt{\ 0.3798}=11.24\pm 1.26\ \mathrm{kg} $$
(56)
$$ {CI}_{\mathrm{branches}}=16.76\pm 2.04\sqrt{\ 3.9702} = 16.76\pm 4.06\ \mathrm{kg} $$
(57)
$$ {CI}_{\mathrm{bole}}=86.30\pm 2.04\sqrt{26.4069}=86.30\pm 10.48\ \mathrm{kg} $$
(58)
$$ {CI}_{\mathrm{bark}}=18.00\pm 2.04\sqrt{\ 0.8215}=18.00\pm 1.85\ \mathrm{kg} $$
(59)
$$ {CI}_{\mathrm{root}}=19.06\pm 2.04\sqrt{\ 1.2656}=19.06\pm 2.29\ \mathrm{kg} $$
(60)
$$ {CI}_{\mathrm{total}}=151,36\pm 2.04\sqrt{88.0684} = 151,36\pm 19.14\ \mathrm{kg} $$
(61)

Estimates of prediction intervals at 95 % probability level resulted in:

$$ {PI}_{\mathrm{leaves}}=11.24\pm 2.04\sqrt{\ 9.4117}=11.24\pm 6.26\ \mathrm{kg} $$
(62)
$$ {PI}_{\mathrm{branches}}=16.76\pm 2.04\sqrt{\ 72.0528} = 16.76\pm 17.32\ \mathrm{kg} $$
(63)
$$ {PI}_{\mathrm{bole}}=86.30\pm 2.04\sqrt{449.3069}=86.30\pm 43.24\ \mathrm{kg} $$
(64)
$$ {PI}_{\mathrm{bark}}=18.00\pm 2.04\sqrt{\ 20.9645}=18.00\pm 9.34\ \mathrm{kg} $$
(65)
$$ {PI}_{\mathrm{root}}=19.06\pm 2.04\sqrt{\ 24.7059}=19.06\pm 10.14\ \mathrm{kg} $$
(66)
$$ {PI}_{\mathrm{total}}=151.36\pm 2.04\sqrt{811.3462} = 151.36\pm 58.11\ \mathrm{kg} $$
(67)

3.2 Nonlinear seemingly unrelated regressions

White’s test revealed that the residuals have presented homogeneous distribution for all components (significant at 99 % probability). The equations and statistics resulting from these adjustments were

$$ {\widehat{y}}_{\mathrm{leaves}}=0.101855*{d}^{1.523562}*{\left(\frac{1}{h}\right)}^{- 0.02921}{R}_{\mathrm{adj}}^2 = 0.73,\ {S}_{\mathrm{yx}}=26.91\%,\ \mathrm{White}=15.{69}^{\mathrm{ns}}. $$
(68)
$$ {\widehat{y}}_{\mathrm{branches}} = 0.003479*{\left(\frac{1}{d}\right)}^{-2.77207}*{\left(\frac{1}{h}\right)}^{0.009156}{R}_{\mathrm{adj}}^2 = 0.72,\ {S}_{\mathrm{yx}}=44.45\%,\ \mathrm{White}=13.{59}^{\mathrm{ns}}. $$
(69)
$$ {\widehat{y}}_{\mathrm{bole}} = 0.003247*{\left({d}^2h\right)}^{1.144185}{R}_{\mathrm{adj}}^2 = 0.96,\ {S}_{\mathrm{yx}}=19.16\%,\ \mathrm{White}=4.0{6}^{\mathrm{ns}} $$
(70)
$$ {\widehat{y}}_{\mathrm{bark}}=\kern0.5em 0.005036*{d}^{2.679999}{R}_{\mathrm{adj}}^2 = 0.92,\ {S}_{\mathrm{yx}}=22.65\%,\ \mathrm{White}=5.{65}^{\mathrm{ns}} $$
(71)
$$ {\widehat{y}}_{\mathrm{raiz}} = 0.012858*{\left({d}^2h\right)}^{0.82377}{R}_{\mathrm{adj}}^2 = 0.90,\ {S}_{\mathrm{yx}}=22.77\%,\ \mathrm{White}=6.{88}^{\mathrm{ns}} $$
(72)
$$ {\widehat{y}}_{\mathrm{total}}={\widehat{y}}_{\mathrm{leaves}}+{\widehat{y}}_{\mathrm{branches}}+{\widehat{y}}_{\mathrm{bole}} + {\widehat{y}}_{\mathrm{bark}}+{\widehat{y}}_{\mathrm{root}}{R}_{\mathrm{adj}}^2 = 0.90,\ {S}_{\mathrm{yx}}=19.24\%,\ \mathrm{White}=28.{53}^{\mathrm{ns}} $$
(73)

Take the same tree with dbh = 21 cm and ht = 16 m, and by applying these values in the equations of 68–73, we have:

$$ {\widehat{y}}_{\mathrm{leaves}}=11.42\ \mathrm{kg} $$
(74)
$$ {\widehat{y}}_{\mathrm{branches}}=15.69\ \mathrm{kg} $$
(75)
$$ {\widehat{y}}_{\mathrm{bole}}=82.21\ \mathrm{kg} $$
(76)
$$ {\widehat{y}}_{\mathrm{bark}}=17.70\ \mathrm{kg} $$
(77)
$$ {\widehat{y}}_{\mathrm{root}}=19.03\ \mathrm{kg} $$
(78)
$$ {\widehat{y}}_{\mathrm{total}}=11.42+15.69 + 82.21+17.70 + 19.03=145.95\ \mathrm{kg} $$
(79)

The vector of the partial derivatives is given by:

$$ \begin{array}{c}\hfill f{(b)^{\hbox{'}}}_{\mathrm{leaves}}=\left[\begin{array}{cccccccccccc}\hfill {d}^{b_1}{\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{d}^{b_1} \ln (d){\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{d}^{b_1} \ln \left(\frac{1}{h}\right){\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \\ {}\hfill =\left[\begin{array}{cccccccccccc}\hfill 112.11\hfill & \hfill 34.77\hfill & \hfill -31.66\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \end{array} $$
(80)
$$ \begin{array}{c}\hfill f{(b)^{\hbox{'}}}_{\mathrm{branches}}=\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {\left(\frac{1}{d}\right)}^{b_1}{\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{\left(\frac{1}{d}\right)}^{b_1} \ln \left(\frac{1}{d}\right){\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill {b}_0{\left(\frac{1}{d}\right)}^{b_1} \ln \left(\frac{1}{h}\right){\left(\frac{1}{h}\right)}^{b_2}\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \\ {}\hfill =\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 4510.86\hfill & \hfill -47.78\hfill & \hfill -43.51\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \end{array} $$
(81)
$$ \begin{array}{c}\hfill f{(b)^{\hbox{'}}}_{\mathrm{bole}}=\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {\left({d}^2h\right)}^{b_1}\hfill & \hfill {b}_0{\left({d}^2h\right)}^{b_1} \ln \left({d}^2h\right)\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \\ {}\hfill =\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 25,319.96\hfill & \hfill 728.55\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \end{array} $$
(82)
$$ \begin{array}{c}\hfill f{(b)^{\hbox{'}}}_{\mathrm{bark}}=\left[\begin{array}{ccccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {d}^{b_1}\hfill & \hfill {b}_0{d}^{b_1} \ln (d)\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \\ {}\hfill =\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 3495.80\hfill & \hfill 53.60\hfill & \hfill 0\hfill & \hfill 0\hfill \end{array}\right]\hfill \end{array} $$
(83)
$$ \begin{array}{c}\hfill f{(b)^{\hbox{'}}}_{\mathrm{root}}=\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill {\left({d}^2h\right)}^{b_1}\hfill & \hfill {b}_0{\left({d}^2h\right)}^{b_1} \ln \left({d}^2h\right)\hfill \end{array}\right]\hfill \\ {}\hfill =\left[\begin{array}{cccccccccccc}\hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 0\hfill & \hfill 1480.22\hfill & \hfill 168.66\hfill \end{array}\right]\hfill \end{array} $$
(84)

The variance-covariance matrix of b ij (where i refers to coefficients in each equation and j refers to coefficients among equations) for the biomass components in the regressions, obtained by EGNLS resulted in:

$$ \begin{array}{cc}\hfill \begin{array}{c}\hfill {\widehat{{\displaystyle \sum}}}_b=\hfill \\ {}\hfill {b}_{11}\hfill \\ {}\hfill {b}_{12}\hfill \\ {}\hfill {b}_{13}\hfill \\ {}\hfill {b}_{21}\hfill \\ {}\hfill {b}_{22}\hfill \\ {}\hfill {b}_{23}\hfill \\ {}\hfill {b}_{31}\hfill \\ {}\hfill {b}_{32}\hfill \\ {}\hfill {b}_{41}\hfill \\ {}\hfill {b}_{42}\hfill \\ {}\hfill {b}_{51}\hfill \\ {}\hfill {b}_{52}\hfill \\ {}\hfill \hfill \\ {}\hfill {b}_{11}\hfill \\ {}\hfill {b}_{12}\hfill \\ {}\hfill {b}_{13}\hfill \\ {}\hfill {b}_{21}\hfill \\ {}\hfill {b}_{22}\hfill \\ {}\hfill {b}_{23}\hfill \\ {}\hfill {b}_{31}\hfill \\ {}\hfill {b}_{32}\hfill \\ {}\hfill {b}_{41}\hfill \\ {}\hfill {b}_{42}\hfill \\ {}\hfill \begin{array}{c}\hfill {b}_{51}\hfill \\ {}\hfill {b}_{52}\hfill \end{array}\hfill \end{array}\hfill & \hfill \left|\begin{array}{cccccc}\hfill {b}_{11}\hfill & \hfill {b}_{12}\hfill & \hfill {b}_{13}\hfill & \hfill {b}_{21}\hfill & \hfill {b}_{22}\hfill & \hfill {b}_{23}\hfill \\ {}\hfill 0.003751\hfill & \hfill -0.012720\hfill & \hfill -0.001400\hfill & \hfill 0.000090\hfill & \hfill 0.006978\hfill & \hfill 0.000972\hfill \\ {}\hfill -0.012720\hfill & \hfill 0.061885\hfill & \hfill 0.024932\hfill & \hfill -0.000356\hfill & \hfill -0.029000\hfill & \hfill -0.001197\hfill \\ {}\hfill -0.001400\hfill & \hfill 0.024932\hfill & \hfill 0.022481\hfill & \hfill -0.000092\hfill & \hfill -0.008714\hfill & \hfill 0.001891\hfill \\ {}\hfill 0.000090\hfill & \hfill -0.000356\hfill & \hfill -0.000092\hfill & \hfill 0.000009\hfill & \hfill 0.000797\hfill & \hfill -0.000023\hfill \\ {}\hfill 0.006978\hfill & \hfill -0.029000\hfill & \hfill -0.008714\hfill & \hfill 0.000797\hfill & \hfill 0.143586\hfill & \hfill -0.081261\hfill \\ {}\hfill 0.000972\hfill & \hfill -0.001197\hfill & \hfill 0.001891\hfill & \hfill -0.000023\hfill & \hfill -0.081261\hfill & \hfill 0.086361\hfill \\ {}\hfill -0.000011\hfill & \hfill -0.000011\hfill & \hfill -0.000046\hfill & \hfill 0.000000\hfill & \hfill 0.000027\hfill & \hfill -0.000004\hfill \\ {}\hfill 0.000356\hfill & \hfill 0.000162\hfill & \hfill 0.001322\hfill & \hfill -0.000008\hfill & \hfill -0.000780\hfill & \hfill 0.000120\hfill \\ {}\hfill 0.000011\hfill & \hfill -0.000060\hfill & \hfill -0.000031\hfill & \hfill -0.000001\hfill & \hfill -0.000088\hfill & \hfill 0.000024\hfill \\ {}\hfill -0.000622\hfill & \hfill 0.003327\hfill & \hfill 0.001580\hfill & \hfill 0.000046\hfill & \hfill 0.005442\hfill & \hfill -0.001518\hfill \\ {}\hfill -0.000049\hfill & \hfill 0.000168\hfill & \hfill 0.000023\hfill & \hfill -0.000004\hfill & \hfill -0.000196\hfill & \hfill -0.000132\hfill \\ {}\hfill 0.000399\hfill & \hfill -0.001362\hfill & \hfill -0.000172\hfill & \hfill 0.000029\hfill & \hfill 0.001542\hfill & \hfill 0.001105\hfill \\ {}\hfill {b}_{31}\hfill & \hfill {b}_{32}\hfill & \hfill {b}_{41}\hfill & \hfill {b}_{42}\hfill & \hfill {b}_{51}\hfill & \hfill {b}_{52}\hfill \\ {}\hfill -0.000011\hfill & \hfill 0.000356\hfill & \hfill 0.000011\hfill & \hfill -0.000622\hfill & \hfill -0.000049\hfill & \hfill 0.000399\hfill \\ {}\hfill -0.000011\hfill & \hfill 0.000162\hfill & \hfill -0.000060\hfill & \hfill 0.003327\hfill & \hfill 0.000168\hfill & \hfill -0.001362\hfill \\ {}\hfill -0.000046\hfill & \hfill 0.001322\hfill & \hfill -0.000031\hfill & \hfill 0.001580\hfill & \hfill 0.000023\hfill & \hfill -0.000172\hfill \\ {}\hfill 0.000000\hfill & \hfill -0.000008\hfill & \hfill -0.000001\hfill & \hfill 0.000046\hfill & \hfill -0.000004\hfill & \hfill 0.000029\hfill \\ {}\hfill 0.000027\hfill & \hfill -0.000780\hfill & \hfill -0.000088\hfill & \hfill 0.005442\hfill & \hfill -0.000196\hfill & \hfill 0.001542\hfill \\ {}\hfill -0.000004\hfill & \hfill 0.000120\hfill & \hfill 0.000024\hfill & \hfill -0.001518\hfill & \hfill -0.000132\hfill & \hfill 0.001105\hfill \\ {}\hfill 0.000001\hfill & \hfill -0.000038\hfill & \hfill 0.000000\hfill & \hfill -0.000019\hfill & \hfill 0.000001\hfill & \hfill -0.000011\hfill \\ {}\hfill -0.000038\hfill & \hfill 0.001183\hfill & \hfill -0.000010\hfill & \hfill 0.000595\hfill & \hfill -0.000042\hfill & \hfill 0.000339\hfill \\ {}\hfill 0.000000\hfill & \hfill -0.000010\hfill & \hfill 0.000009\hfill & \hfill -0.000509\hfill & \hfill -0.000012\hfill & \hfill 0.000098\hfill \\ {}\hfill -0.000019\hfill & \hfill 0.000595\hfill & \hfill -0.000509\hfill & \hfill 0.030594\hfill & \hfill 0.000730\hfill & \hfill -0.005895\hfill \\ {}\hfill \begin{array}{c}\hfill 0.000001\hfill \\ {}\hfill -0.000011\hfill \end{array}\hfill & \hfill \begin{array}{c}\hfill -0.000042\hfill \\ {}\hfill 0.000339\hfill \end{array}\hfill & \hfill \begin{array}{c}\hfill -0.000012\hfill \\ {}\hfill 0.000098\hfill \end{array}\hfill & \hfill \begin{array}{c}\hfill 0.000730\hfill \\ {}\hfill -0.005895\hfill \end{array}\hfill & \hfill \begin{array}{c}\hfill 0.000065\hfill \\ {}\hfill -0.000524\hfill \end{array}\hfill & \hfill \begin{array}{c}\hfill -0.000524\hfill \\ {}\hfill 0.004239\hfill \end{array}\hfill \end{array}\right|\hfill \end{array} $$
(85)

The variance for the estimate of biomass components resulted in:

$$ {S}_{{\hat{y}}_{\mathrm{leaves}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b) = 0.3713\ {\mathrm{kg}}^2 $$
(86)
$$ {S}_{{\hat{y}}_{\mathrm{branches}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b) = 2.1319\ {\mathrm{kg}}^2 $$
(87)
$$ {S}_{{\hat{y}}_{\mathrm{bole}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b)\kern0.5em =10.0356\ {\mathrm{kg}}^2 $$
(88)
$$ {S}_{{\hat{y}}_{\mathrm{bark}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b) = 1.1723\ {\mathrm{kg}}^2 $$
(89)
$$ {S}_{{\hat{y}}_{\mathrm{root}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b)\kern0.75em =1.4035\ {\mathrm{kg}}^2 $$
(90)
$$ {S}_{{\hat{y}}_{\mathrm{total}}}^2 = {f}_i\ {(b)}^{\hbox{'}}{\widehat{{\displaystyle \sum}}}_b{f}_i(b) = 15.1666\ {\mathrm{kg}}^2 $$
(91)

Estimates of confidence intervals at 95 % probability level resulted in:

$$ {CI}_{\mathrm{leaves}}=11.42\pm 2.04\sqrt{\ 0.3713}\kern0.5em =11.42\pm 1.24\ \mathrm{kg} $$
(92)
$$ {CI}_{branches}=15.69\pm 2.04\sqrt{\ 2.1319} = 15.69\pm 2.98\ \mathrm{kg} $$
(93)
$$ {CI}_{\mathrm{bole}}=82.21\pm 2.04\sqrt{\ 10.0356}=82.21 \pm 6.46\ \mathrm{kg} $$
(94)
$$ {CI}_{\mathrm{bark}}=17.70\pm 2.04\sqrt{\ 1.1723} = 17.70\pm 2.21\ \mathrm{kg} $$
(95)
$$ {CI}_{\mathrm{root}}=19.03\pm 2.04\sqrt{1.4035} = 19.03\pm 2.42\ \mathrm{kg} $$
(96)
$$ {CI}_{\mathrm{total}}=145.95\pm 2.04\sqrt{15.1666} = 145.95\pm 7.94\ \mathrm{kg} $$
(97)

Estimates of prediction intervals at 95 % probability level resulted in:

$$ {PI}_{\mathrm{leaves}}=11.42\pm 2.04\sqrt{1.7371} = 11.42 \pm 2.69\ \mathrm{kg} $$
(98)
$$ {PI}_{\mathrm{branches}}=15.69\pm 2.04\sqrt{31.2246} = 15.69 \pm 11.40\ \mathrm{kg} $$
(99)
$$ {PI}_{\mathrm{bole}}=82.21\pm 2.04\sqrt{345.1596}=82.21\pm 37.90\ \mathrm{kg} $$
(100)
$$ {PI}_{\mathrm{bark}}=17.70\pm 2.04\sqrt{5.8521} = 17.70 \pm 4.93\ \mathrm{kg} $$
(101)
$$ {PI}_{\mathrm{root}}=19.03\pm 2.04\sqrt{6.0311} = 19.03\pm 5.01\ \mathrm{kg} $$
(102)
$$ {PI}_{\mathrm{total}}=145.95\pm 2.04\sqrt{639.4817} = 145.95\pm 51.59\ \mathrm{kg} $$
(103)

3.3 Effect of independent and simultaneous adjustment in the statistics of the equations

The equations presented for estimating the biomass of individual components such as leaves, branches, bole, bark, root, and total biomass of the tree showed satisfactory levels of accuracy, regardless of the fitting method. The implementation of the simultaneous estimation method did not lead to appreciable changes on the fit’s statistics, residual distribution or regression constraints compared with those produced by independent estimation (Table 2). The values of the coefficient of determination have not presented major differences, seen only in biomass of leaf, branches, and total. The estimated standard error resulted in variations of the order of −14.73 to 1.64 %, keeping the same trends when comparing the two methods. In absolute terms, these differences were smaller than 2.47. The distribution of residuals showed the same trend when two methods for estimation of coefficients were compared.

Table 2 Effect of independent (ENGLS) and simultaneous (NSUR) fitting on statistical estimates using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total

The largest differences were observed in the confidence intervals and prediction interval when the adjustment SUR was conducted to estimate the coefficients in a more consistent manner, since the intervals were smaller (Table 2). Similar results were observed for the estimates extrapolated to hectare values (Tables 3, 4, 5, 6, and 7). The results show that more accurate estimates are obtained by adjusting SUR, especially for the total biomass, in which the differences in confidence intervals ranged up to 71.71 % and the prediction intervals to 10.95 %. For the other biomass components, the SUR method also provided better results, especially in the prediction intervals.

Table 3 Simulation the effect of independent (ENGLS) and simultaneous (NSUR) fitting on confidence intervals and prediction intervals using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total
Table 4 Simulation the effect of independent (ENGLS) and simultaneous (NSUR) fitting on confidence intervals and prediction intervals using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total
Table 5 Simulation the effect of independent (ENGLS) and simultaneous (NSUR) fitting on confidence intervals and prediction intervals using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total
Table 6 Simulation the effect of independent (ENGLS) and simultaneous (NSUR) fitting on confidence intervals and prediction intervals using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total
Table 7 Comparison the effect of independent (ENGLS) and simultaneous (NSUR) fitting on confidence intervals and prediction intervals using models for biomass components and total biomass of individuals of Pinus elliottii Engelm.: leaves, branches, bole without bark, bole bark, root and total

4 Discussion

The worst statistical evaluation of goodness of fit (adjusted coefficient of determination and standard error of the estimate) was observed for the biomass of leaves and branches, while the best was observed for the bole, bark, and roots. The biomass of leaves and branches is naturally more variable compared to other biomass components, since they suffer greater influences of internal and external factors, as highlighted by Satoo and Madgwick (1982) and António et al. (2007). Considering these statistics, the equations presented here for estimating the biomass of individual components such as leaves, branches, bole, bark of the bole and roots, and also for the total biomass, yielded satisfactory accuracy level. Simultaneous fitting was found not to lead to appreciable changes in the fit statistics of the equations when compared to independent fitting. It was found that the simultaneous adjustment (NSUR) did not lead to appreciable variations in the statistics of adjustments by equations, compared with independent fitting (EGNLS), with exception of the confidence intervals and prediction intervals, where the NSUR adjustment was more efficient.

Modeling biomass components and total biomass with an isolated equation for each one, as happens when using the EGNLS procedure, should not be the main concern in studies of modeling of biomass. The equation for estimation of each biomass component refers to a single mathematical relationship and that proved to be inappropriate, since the estimates obtained from equations for biomass components are not equal to the estimated ones from the equation for the total tree biomass, that is, they are not addictive. In the present work, the incompatibility of the equation results for the components and for the total biomass was observed when using independent adjustment and also the spread of the error, when expanded to the area of one hectare (item 3.1).

One solution to force additivity in these equations would be to consider the total biomass as sum of the biomass of all components. However, this is not appropriate because the estimation in conditional models is the average value of the dependent variable as a function of the independent variables, while the value of any dependent variable should consider all the variables involved.

The NSUR procedure was effective in achieving an additive system of equations. This procedure considers the contemporaneous correlations between the biomass components, considering the interdependence of the components so as to admit a set of allometric relations and also impose restrictions to the coefficients, which ensures the additivity of the equations. Thus, the calculation of the total biomass is provided by three factors: use of the same independent variables for the estimate of each biomass component, the addition of the best regression functions of each component and finally force the regressions for individual components estimate the total biomass (Cunia and Briggs 1985; Parresol, 1999).

This method ensures consistency between estimation of total biomass and the estimates of biomass components, thus is the appropriate way to model the total biomass and of the biomass components of the tree. This approach has been discussed for about half a century (Kozak, 1970; Chiyenda and Kozak, 1984; Cunia and Briggs, 1984, 1985, Reed and Green, 1985), although often it has been ignored. Few researchers who adopt the technique, such as: Parresol (1999, 2001), Bi et al. (2001), Návar et al. (2002), Carvalho and Parresol (2003), Bi et al. (2004), Cháidez et al. (2004), Návar et al. (2004), Lambert et al. (2005), Návar (2009) Bi et al. (2010), Goicoa et al. (2011), Ruiz-Peinaldo et al. (2012) and Menéndez-Miguélez et al. (2013).

Despite the mathematical complications of NSUR estimates, it can be easily done using the SAS software and brings the following advantages: more consistent biomass estimates for the total and components, more consistent coefficients, and with smaller confidence and prediction limits (Cunia and Briggs, 1984; Parresol, 1999; Carvalho and Parresol, 2003). The positive effects of decreases on confidence intervals and of prediction are associated with smaller variances obtained by applying the method of simultaneous estimation, considering the contemporaneous correlation between the components. Thus, the smaller the confidence intervals for the equations, the better are the intervals of the predictions, thus generating more efficient estimators. This results in important implications in the forest, especially in terms of biomass inventory and preparation of management plans.

In this work, it was found that the simultaneous estimation (NSUR) presents the best properties for building allometric equations for estimation of total and biomass of components, when compared with the independent estimation (EGNLS). These properties are related to the compatibility of estimated values for the biomass components and for the total (the additivity property) and generate more efficient estimators, as was discussed earlier. Consequently, the hypothesis tested in the present study was confirmed.

This study is useful for understanding the basic principles of this method and also on the used analytical procedures. Meanwhile, for a more comprehensive explanation of the statistical theory involved, one should consult Zellner (1962), Kozak (1970), Gallant (1975), Cunia (1979), Jacobs and Cunia (1980), Chiyenda (1983), and Chiyenda and Kozak (1984), Cunia and Briggs (1984, 1985), Amemiya (1985), Reed and Green (1985), Srivastava and Giles (1987), Judge et al. (1988), Greene (1999), Parresol (1999, 2001).

5 Conclusions

Modeling total biomass and components should not be treated separately, as is the case of independent estimation. Such approach has been shown to be inappropriate.

Rather, the appropriate approach to model biomass components and total biomass of the trees should take into account the interdependence of the components by including a series of allometric relationships. This is the case when using simultaneous fitting.

In modeling biomass components and total biomass, the additivity of equations must be considered, so that the equation for total biomass is a function of the sum of the regression components.

Simultaneous estimation in modeling the total biomass and of the biomass components brings several advantages, notably the compatibility between the estimates of total biomass and components. Additionally, the coefficients can gain greater consistency.

Simultaneous fitting was found not to lead to appreciable changes in the fit statistics of the equations when compared to independent fitting, yet another reason to use this method.

Simultaneous estimation presents properties best suited for building allometric equations to estimate the biomass of the components and total biomass over independent estimation. It is therefore the method that should be used.