Background

Androstachys johnsonii Prain (A. johnsonii) stands, known as mecrusse, are very important woodlands. Almost entirely restricted to Mozambique [1], it has an important socioeconomic value to local communities, that sell and use stakes and poles of A. johnsonii in the construction of homes, shelters, and furniture; and it is the main source of income in the Funhalouro and Mabote districts [2, 3]. On the global scale, mecrusse forests form part of the woodland belt that stretches over large portions of southern Africa and are reported to be a tipping point in regional ecological and socioeconomic development [4], hence, their importance in the mitigation of greenhouse gas emissions.

Forest biomass is a key variable employed when making estimates of carbon pools in forests, and for studying other biochemical cycles [5]. In the past, only the aboveground portion of trees was the desired products from forests [6]. However, with the increased significance of biomass estimation since the Kyoto Protocol was adopted in 1997 [7], and thus, the enhanced awareness of the sequestration functions of trees, climate change issues, have made belowground biomass (BGB) more relevant.

Despite the recent advances in examining root distribution and biomass with ground-penetrating radar [812], the belowground component of trees is still poorly known because, traditionally, it requires labour- and time-intensive in situ measurements [13]. Yet, BGB constitutes a major share of total forest biomass. Cairns et al. [14] and Litton et al. [15] have maintained that BGB may represent up to 40% of the total biomass. In Mozambique, Magalhães and Seifert [16, 17] found that approximately 20% of the forest biomass of mecrusse woodlands was allocated to the root system, and so highlighting the need to study this carbon pool.

Besides the share of BGB in whole tree forest biomass, BGB happens to be a unique carbon pool because after exploitation, the root system, along with the stump, are left in the forest and, in some tree species, are then allowed to sprout, continuing the carbon sequestration process or decompose, releasing CO2 and nutrients. Therefore, BGB can be used to estimate the carbon that will be transferred to the soil and the nutrients that will be reclaimed by the site.

BGB is often estimated indirectly, using root-to-shoot ratios (R/S) [1721], root system biomass expansion factors (BEFs) [17], and by using regression equations of BGB versus aboveground biomass (AGB) [18, 22, 23] or versus easily measured variables (diameter at breast height (DBH) and tree height (TH)) [23, 24]. However, whatever the method utilised to estimate BGB (R/S, BEFs, equations), it is necessary that the root system is directly measured to develop those methods.

Based on the fact that measuring BGB is difficult and time-consuming, the root system is often partially removed from the soil [2530], depths of excavation are predefined [2931], and fine roots are excluded [19, 32, 33]. However, the depths of excavation and the definition of fine roots are not standardised [18, 34], but the depth selected in a given study is assumed to capture a large proportion of the roots [18]. Yet, according to Mokany et al. [20], sampling to what may be deemed an insufficient soil depth to capture the majority of the roots, while not sampling the root collar or fine roots, as well as sampling with inadequate replication, are a few of the methodological pitfalls associated with sampling root biomass, and can lead to underestimation.

In other cases, a root sampling procedure is applied where only a fraction of roots from each root system are fully excavated, and then the information from the excavated roots is employed to estimate biomass for the roots not excavated [6, 23, 35]. The disadvantage of relying on sampling procedures is that the observed biomass value for each individual root system is less accurately determined compared to excavating in full [6].

Very few allometric biomass models exist for Mozambican forests; exceptions include Magalhães and Seifert [16, 17], Ryan et al. [33], Mate et al. [36], and Sitoe et al. [37]. As is best present known, the only studies that have included BGB are those by Magalhães and Seifert [16, 17], Ryan et al. [33], and Magalhães and Seifert [38]. However, the study by Ryan et al. [33] was based on only several sample trees (23) within a limited geographical range (27 ha) and the root system was not completely excavated (fine roots were not included). Although Magalhães and Seifert [16, 17, 38] considered relatively large sample trees and an expanded geographical area (93 trees harvested in 5 districts), besides including the entire root system, their allometric models were limited by not considering the different root components (e.g. taproot, root crown, lateral roots) and, therefore, the BGB allocation patterns were not analysed. Root component biomass models and BGB allocation patterns analyses are scarce worldwide, Litton et al. [15] being the only reference available in the literature.

Studying BGB allocation patterns is very important for understanding root anchorage as both root anchorage and BGB allocation patterns depend on root architecture, branching patterns, and size and depth of the roots [3942]. In turn, those factors affecting tree anchorage and BGB allocation patterns depend on tree species and soil types [43] and resources [4448]. The anchoring capacity of a tree is a critical factor for survival regarding external abiotic stresses [49].

It has been suggested by Herrel et al. [49] that for a fixed amount of biomass, a network of several small roots is more resistant to tension than a few large structural roots. This means that a root system with a larger taproot and several smaller lateral roots, as in the case of A. johnsonii trees [50], is more resistant to tension and therefore may have a better anchoring capacity than otherwise.

Parametric studies have shown that the number of lateral ramifications and their diameter were both major components affecting the resistance to pull-out for a given soil pressure [49]. On the other hand, the biomass allocated to a certain root component is also a function of the ramifications and/or their diameter, suggesting that studying biomass assigned to the different root components can help to identify which component affects the resistance to pull-out and anchorage more. Ennos and Fitter [51] showed that various anchorage strategies (plate-like and tap-like morphology) have an impact on biomass allocation patterns, therefore stressing the necessity of studying BGB allocation patterns to understand anchorage strategies.

Bila et al. [52] have demonstrated that silvicultural treatments positively influenced the health and growth of A. johnsonii and suggested that this species can be grown for commercial and urban forestry purposes. Knowledge of the extent and distribution of tree root systems is essential for managing trees in the constructed environment [53]. Conversely, the performance of urban trees depends upon the ability of their root systems to acquire resources and provide anchorage [53]. This emphasises the requirement to study BGB allocation patterns, though the anchorage of open grown trees and of those grown in the woods may be different.

Hence, as direct estimation of BGB is labour- and time-intensive, developing root component biomass models based on easily measurable variables is crucial for studying BGB allocation patterns and therefore anchoring functions of the different root components.

The present study was aimed at analysing the BGB allocation patterns, fitting and validating root component and root system models for A. johnsonii. A general model of the root system was also fitted using the best root component models using weighted non-linear seemingly unrelated regression (WNSUR) and critically compared against independent root system models. The excavation depth range that can capture more than 75% of the BGB was also estimated.

Results

Description of the data

Diameter distributions of the phase-1 and phase-2 trees are given in the Figure 1 and Table 1, respectively, and show that the phase-2 sample trees (outside the brackets in the Table 1) are representative of those from phase-1. On the other hand, it is also noted that the testing sample trees (inside the brackets in the Table 1) are also representative of those from phase-2.

Figure 1
figure 1

Diameter distribution histogram of phase-1 sampled A. johnsonii trees.

Table 1 Diameter distribution of the training sample trees (outside the brackets) and of the testing sample trees felled inside the study area (inside the brackets)

The testing sample collected outside the study area (in Chicualacuala district) was distributed according to diameter classes in the Table 1, as follows: 4, 3, 3, 3, 2 and 1, respectively. Note that, although the study area comprised 5 districts (Chibuto, Madlakaze, Panda, Funhalouro and Mabote), during the randomization (Figure 2), none of the plots fell in Chibuto and Panda districts, and those districts have almost a negligible share of mecrusse woodlands of the study area.

Figure 2
figure 2

Area of occurrence of A. johnsonii in the districts of Gaza and Inhambane Provinces (a) and its soil types (b).

BGB allocation patterns

On average, the percentage of the root system biomass attributed to taproot, root collar, and lateral roots biomasses was 48.36, 30.79 and 51.64%, respectively; and the percentage of the taproot attributed to root collar was 64.89% (Table 2). The percentage of the root system biomass found at 20% of the taproot depth, which is equivalent to 9.6–61.2 cm in depth from the ground level, was 81.20%.

Table 2 Belowground biomass allocation patterns

Table 3 shows that BGB allocation patterns vary with tree size (DBH, RCD, and TH), except the proportion of root system biomass allocated to the root collar (RC/RS), which is found to be independent on tree size by either Pearson´s correlation test or dcov test of independence.

Table 3 Dependence of BGB allocation patterns on tree size

Modelling

The laterals roots and root system biomass models showed that more than 99% of the BGB variation was explained by the predictor variables (Tables 4, 5). The root collar and taproot biomass models showed that more than 96 and 98% of the BGB variation, respectively, were explained by the predictor variables. The CVr varied from, approximately, 23 to 46%; the smallest and highest CVr values were verified for the root system and the root collar biomass models, respectively.

Table 4 Coefficients of regression (± standard error) for independently fitted models
Table 5 Goodness of fit statistics, and heteroskedasticity and normality tests of residuals for independently fitted models

All the root component models presented statistically insignificant bias (MR) as tested by Student’s t-test and their residuals showed homoscedasticity (p-value >> 0.05) and normal distribution (p-value > 0.05) (except for the model form 3 of the lateral roots) (Table 5). The plots of the residuals (not shown) presented no particular trend; the cluster of points was contained in a horizontal band, with the residuals evenly distributed under and over the axis of abscissas, meaning that there were not model defects. All the models performed almost equally.

Forcing additivity of the taproot and lateral roots biomasses into root system biomass

Because the different model forms performed almost equally (Table 5), the WNSUR with parameter restriction was applied to models using DBH and TH as predictors; and to those using either DBH or RCD only. This ensured that the additivity could be achieved either using two variables (DBH and TH) or using only one variable (DBH or RCD). The latter case was included because TH is difficult and time-consuming to measure in natural forests.

The relatively better taproot and lateral roots model forms were the model forms (5) and (6) (refer to “Methods”), respectively, as judged by Adj.R2 and CVr, as other statistics were equally insignificant. Therefore, the root system model form is a function (sum) of the predictors of the models (5) and (6). The structural system of equations (including the root system biomass model) obtained by combining the best taproot and lateral roots model forms under parameter restriction is given in Eq. (1). Using the same principle for the model forms with either DBH or RCD only as predictors, the structural systems of equations for WNSUR are given in Eqs. (2) and (3).

$$\begin{aligned} &\hat{Y}_{Taproot} = b_{10} DBH^{{b_{11} }} TH^{{b_{12} }} \hfill \\ &\hat{Y}_{Lateral - roots} = b_{20} \left( {DBH^{2} TH} \right)^{{b_{21} }} \hfill \\ &\hat{Y}_{Root - system} = b_{10} DBH^{{b_{11} }} TH^{{b_{12} }} + b_{20} \left( {DBH^{2} TH} \right)^{{b_{21} }} \hfill \\ \end{aligned}$$
(1)
$$\begin{aligned} &\hat{Y}_{Taproot} = b_{10} DBH^{{b_{11} }} \hfill \\ &\hat{Y}_{Lateral - roots} = b_{20} DBH^{{b_{21} }} \hfill \\ &\hat{Y}_{Root - system} = b_{10} DBH^{{b_{11} }} + b_{20} DBH^{{b_{21} }} \hfill \\ \end{aligned}$$
(2)
$$\begin{aligned} &\hat{Y}_{Taproot} = b_{10} RCD^{{b_{11} }} \hfill \\ &\hat{Y}_{Lateral - roots} = b_{20} RCD^{{b_{21} }} \hfill \\ &\hat{Y}_{Root - system} = b_{10} RCD^{{b_{11} }} + b_{20} RCD^{{b_{21} }} \hfill \\ \end{aligned}$$
(3)

Note that, in Eqs. (13), the coefficients of regression of each regressor in each root component model (taproot or lateral roots) are forced (constrained, restricted) to be equal to coefficients of the equivalent regressor in the root system model, allowing additivity.

For the WNSUR in Eqs. (13) the range of Adj.R2 is 83.30–93.46, 82.59–93.32, and 82.92–92.90%, respectively (Tables 6, 7). It was observed that by forcing additivity the Adj.R2 decreased, and the normality of the residuals was lost (p-value < 0.05). However, the bias (MR) kept insignificant and the residuals showed homoscedasticity (p-value >> 0.05). The models in Eqs. (13), fitted under WNSUR, performed almost equally.

Table 6 Coefficients of regression (± standard error) for simultaneously fitted models
Table 7 Goodness of fit statistics, and heteroskedasticity and normality tests of residuals for simultaneously fitted models

The t-test results for the restrictions imposed on WNSUR (Table 8) were insignificant (p-value ≈1), indicating that the data were consistent with the restriction and that the models fit as well with the restriction imposed.

Table 8 t test for the restrictions imposed for WNSUR

Validation

The aggregate difference (AD) for independent models (Table 9) varied from −6.06 to 0.21% in the study area and from 0.98 to 6.0 outside the study area. For the whole testing sample (including both the inside and outside samples) the AD varied from −2.9 to 5.1%. For simultaneous models the range of AD was from −5.4 to 0.68% and from 3.42 to 5.90%, inside and outside the study area, respectively (Table 10). For the whole testing sample the AD ranged from −2.87 to 5.1 %. The Wilcoxon signed rank test revealed that for both independent and simultaneous models (Tables 9, 10) the observed BGB did not differ statistically from the predicted BGB values (p-value >0.25); hence, the models can be used reliably inside and outside the study area.

Table 9 Validation of independently fitted models
Table 10 Validation of simultaneously fitted models

Discussion

BGB allocation patterns

Considering the fact that 90% of the lateral roots of A. johnsonii trees are located in the first node, which is located close to the ground level [16, 17, 38, 50] it can be inferred that the 81.20% of the root system (found up to 61.2 cm in depth) is composed by root collar and lateral roots and therefore, the remaining portion of the taproot constitutes less than 20% of the root system biomass. This can be verified by summing the average taproot and lateral roots biomasses, which is equal to 82.43%, very close to the percentage of the root system biomass found at 20% of the taproot depth (81.20%). The difference of those percentages (1.13%) represents the lateral roots found at depths above 20% of the taproot depth (above the depth range of 9.6–61.2 cm from ground level).

Depths of excavation of the roots are not standardized [18, 34], but the depth selected in a given study is assumed to capture a large portion of the roots [18]. For example, Green et al. [19], Kuyah et al. [23], Sanquetta et al. [30], and Paul et al. [31], used excavation depths of 50–120, 40–200, 50, and <200 cm, respectively. Schenk and Jackson [54] found that globally, 95% of all roots are within the upper 200 cm of the soil profile. In this study, the excavation range (9.6–61.2 cm) that captured 81.20% of root biomass fall in the excavation range by those authors.

However, BGB estimates by Green et al. [19], Kuyah et al. [23], Sanquetta et al. [30], and Paul et al. [31] might have been underestimated, as according to Mokany et al. [20], sampling to insufficient soil depth to capture the majority of the roots, not sampling the root collar, not sampling fine roots, and sampling with inadequate replication are some methodological pitfalls associated with sampling root biomass, and can lead to underestimation. In this study, the root system was excavated to total depth and removed, including the root components generally ignored as stated by Mokany et al. [20].

The proportion of BGB allocated to the root collar (average = 30.79%, CV = 36.44%) is lower than that reported by Mokany et al. [20] (average = 41%, CV = 3.10%). Inclusion of different terrestrial biomes by Mokany et al. [20] may justify this discrepancy.

It is noted from Table 3 that as the tree increased in DBH, RCD, and TH, the proportion of BGB allocated to taproots (TR/RS) decreased and that allocated to lateral roots (LR/RS) increased. This is presumably because as the trees grow the larger is the need for its anchorage in the soil; and that the anchorage function in larger A. johnsonii trees is much attributed to the lateral roots as they hold a larger amount of soil than the taproots. In fact, there are trees without taproots but hardly a tree can sustain itself in the soil without lateral roots, especially the larger ones. As maintained by Crook and Ennos [55] only relatively small tree species can rely solely on the taproot for anchorage and that this is the reason that most large trees develop thick lateral roots. Moreover, Bailey et al. [56] have verified that lateral roots play an important role in anchorage.

It has also been noted by Dupuy et al. [39] that heart like root systems (those root systems that possess large lateral roots originating from the centre of the bole) generally had the most efficient anchorage. The heart like root system is determined by lateral roots, implying that the lateral roots influence greatly the anchorage efficiency. A. johnsonii trees do not have larger lateral roots than the taproot; however, they have many lateral roots (up to 11 lateral roots per node) which make them to contain a larger proportion of biomass than the taproot (Table 2). In fact, Dupuy et al. [40] maintained that the number of lateral roots is determinant for anchorage, as they determine the root’s ability to bear a large amount of soil and the area of soil mobilized during pull-out.

Trees are also known to respond to wind stress by increasing the number of lateral roots, which provide the most resistance to overturning [57]. This emphasizes the importance of lateral roots for tree anchorage, hence larger allocation of BGB biomass to them (Table 2), especially as trees grow (Table 3).

The decreasing BGB allocation to taproot with tree size might be related to decreasing anchorage function of the taproot with tree size. Khuder et al. [58] argued that “taproots may play an important role in anchoring young trees, but in adult trees, their growth is often impeded by the presence of a hard pan layer in the soil and the taproot becomes a minor component of tree anchorage”.

Modelling, additivity, and validation

The homoscedasticity observed for the independent and simultaneous models implies that the derived weight functions were efficient in addressing the heteroskedasticity.

Because all the fitted independent models performed almost equally, any model can be used accurately to estimate BGB and carbon stocks in A. johnsonii stands (mecrusse woodlands), which, along with mopane and miombo are the most important woodlands in Gaza and Inhambane Provinces. However, as including tree height improved the fit statistics of the models negligibly, and sometimes worsened them, the models using either DBH or RCD only should be preferred, as tree height is difficult to measure in natural forests. The model using the RCD only should be preferred when the tree is found cut down, as besides RCD being relatively difficult to measure compared to DBH, it is sometimes influenced by root buttress. The same holds true for simultaneous models, as they also performed equally.

The mean biomass per tree (MB) obtained using the WNSUR root system model based on DBH and TH [Eq. (1)], based on DBH [Eq. (2)], and based on RCD [Eq. (3)] was only 0.40, 0.31 and 0.23% larger than the MB obtained using the independent root system model based on DBH and TH (model form 6, found to be the best), based on DBH, and based on RCD, respectively. The differences between the MB of the root components (taproot and lateral roots) obtained using simultaneous and independent models were also negligible (>0.50%). These results are in line with those found by Repola [7].

As the differences between MB obtained using simultaneous and independent models were negligible, the independent models should be preferred to avoid unnecessary complexity of the models, and also because the residuals for simultaneously fitted models were not normally distributed.

This study is distinguished from other studies that include BGB (e.g. [6, 15, 19, 23, 29, 31, 33]) for five reasons: (1) in this study a very large sample size and geographical range was covered (93 trees of a single species harvested in 5 districts); (2) the root system was excavated to total depth and removed (including fine roots); (3) the root system was divided into three root components (taproot, root collar, and lateral roots) allowing, therefore, fitting the models for each root component and analysing BGB allocation patterns; (4) the error variance was modelled to derive the weight functions and address heteroskedasticity, which led to increases in Adj.R2 and general improvement of the models´ performance and; (5) the models were validated inside and outside the study area, therefore, the predictive capacity of the models was checked.

Mugasha et al. [6] used a large number of sample trees (80) distributed over 60 tree species, making up an average of 1.34 trees per species, which might not have been representative. Moreover, when modelling tree biomass, the use of species-specific equations are preferred because trees of different species may differ greatly in architecture and wood density [59], and architecture can influence biomass allocation and allometry [41, 42].

Mugasha et al. [6] and Kuyah et al. [23] did not fully excavated the root system, relied on sampling procedures, which might have led to less accurate estimates when compared to cases where all the root system is removed. Green et al. [19], Kuyah et al. [23], Ruiz-Peinado et al. [29], Paul et al. [31], and Ryan et al. [33] used predefined excavation depths and/or did not include fines roots; which leads to underestimation of the BGB [20].

Mugasha et al. [6], Litton et al. [15], Green et al. [19], Ruiz-Peinado et al. [29], and Ryan et al. [33] did not validate their models, therefore, the predictive capacity of the models was not checked. Paul et al. [31] checked the predictive capacity of the models only in the study area. Kuyah et al. [23] used a very small testing sample (6 trees) to validate the models which obviously did not cover all variation ranges.

The fit statistics for BGB model by Magalhães and Seifert [38] using the same model form as the one in Eq. (3) (Adj.R2 = 94.94%; CVr = 21.79%) were different from those obtained here (Adj.R2 = 99.80%; CVr = 23.16%). These differences might be because Magalhães and Seifert [38] considered the stump as part of the root system and because the weight functions were derived interactively; the weight functions were, therefore, just approximations.

Independent BGB (root system) models of this study (Adj.R2 range 99.73–99.80%; bias range 0.02–1.38%) performed better than those by Kuyah et al. [23] (R2 range 91.90–96.10%; bias range −49.60 to 35.40%) and Mugasha et al. [6] (R2 range 89.00–94.00%; bias range 0.12–5.98%). Models of this study also showed superiority to the BGB models by Paul et al. [31] (R2 range 64.00–95.00%) and Ryan et al. [33] (R2 = 94.00%).

Husch et al. [5] suggested that the aggregate difference should not exceed \({{2 \times CV_{r} } \mathord{\left/ {\vphantom {{2 \times CV_{r} } {\sqrt n }}} \right. \kern-0pt} {\sqrt n }}\), where n is the number of trees used in the test. In this study the lowest value of \({{2 \times CV_{r} } \mathord{\left/ {\vphantom {{2 \times CV_{r} } {\sqrt n }}} \right. \kern-0pt} {\sqrt n }}\) was 11.96%, which was almost twice as large as the largest aggregate difference; therefore, the models can be applied inside and outside the study area without requiring any corrections, according to this criterion.

The dataset of this study comprised a training and testing sample of 93 and 37 trees, respectively, with DBHs varying from 5 to 32 cm and from 5.5 to 32 cm, respectively. A. johnsonii trees hardly can exceed 35 cm in DBH. Magalhães and Soto [60] (unpublished data) found only 13 A. johnsonii trees per ha with DBH ≥ 30 cm, corresponding to only 5% of the total trees per ha. Here, the number of trees with DBH ≥ 32.5 cm were only 19 per ha, equivalent to 1.54% of the total trees per ha. This implies that no serious bias will be added when extrapolating the models outside the DBH range used to fit the models since very few trees are found outside that DBH range.

A. johnsonii stands occur mainly in Ferralic Arenosols and Stagnic soils (Figure 2b) [61], which cover 410,144 ha (74%) and 108,960 ha (20%), respectively, of the total area of occurrence of A. johnsonii stands (Figure 2a, b) in Gaza and Inhambane Provinces; the remaining 6% are covered by other soil types [61]. Of the 23 sampled plots, 20 were located in Ferralic Arenosols and 3 (plots 8, 9 and 22 in Figure 2b) were located in Stagnic soils. The 20 plots from Ferralic Arenosols accounted with 81 felled trees of the training sample, the remaining 12 trees were from Stagnic soils. The testing sample trees from outside the study area (16 trees from Chicuacala district) were all from Ferralic Arenosols (Figure 2b). The climate in the districts where A. johnsonii occurs is dry and humid tropical [2, 3, 6171], however, A. johnsonii occurs only in dry tropical climates [61]. Humid tropical climate occurs only along the coast, where A. johnsonii stands do not occur [2, 3, 6171]. This implies that the fitted models can also be safely applicable and valid over a vast range of soils and regions where A. johnsonii occurs and outside the study area.

Therefore, besides the fact that the models were validated inside and outside the study area, the study area covered almost the entire range of soil and climate variations where A. johnsonii occurs (despite the apparent lack of large variations), and a wide range of DBHs and THs, therefore, the models can be applied in all A. johnsonii stands in Mozambique.

Conclusions

In this study, it was found that more than half and almost one-third of BGB is allocated to the lateral roots and to the root collar, respectively. More than 80% of the BGB was found at a depth range of 9.6–61.2 cm from the ground level. As the tree size increased, the proportion of BGB allocated to taproots decreased and that allocated to lateral roots increased. Consequently, it was hypothesised that BGB allocation patterns is a response of the anchorage functions of the tap and lateral roots and therefore, root component biomass models can be used as a methodology to predict anchoring functions of the different root components.

Because all fitted independent models performed almost equally, the models using either DBH or RCD exclusively are preferred as tree height is difficult to measure in natural forests. The model using RCD only as a predictor variable should be further preferred when the tree is found cut down, as sometimes the RCD is affected by root buttress. As a result of the differences between the mean biomasses obtained using independent and simultaneous models being negligible, the independent models should be preferred to avoid unnecessary complexity in the models.

The fitted independent models were based on a very large sample size (93 trees) and a wide geographical range (5 districts) and exhibited that, on average, 98% of the variation in BGB is explained by the predictor variables and were validated inside and outside the study area. Therefore, the models presented here could be applied to all A. johnsonii stands in Mozambique.

Methods

Study area

The study was conducted in Mozambique. The study area comprised 5 districts (Mabote, Funhalouro, Panda, Mandlakaze, and Chibuto) of 2 provinces (Inhambane and Gaza) with an extension of 4,502,828 ha [61], of which 226,013 ha (5%) were A. johnsonii stands (Mecrusse). Mecrusse is a forest type where the main tree species, many times the only one, in the upper canopy is A. johnsonii. Detailed description of the species, forest type and study area can be found in Magalhães and Seifert [16, 17, 38, 50].

Data acquisition

The data were collected in 2012 and 2014. In 2012, a two-phase sampling design was used to determine BGB. In the first phase, diameter at breast height (DBH), root collar diameter (RCD) and total tree height (TH) of 3574 trees were measured in 23 randomly located circular plots (20-m radius). Only trees with DBH ≥ 5 cm were considered. In the second phase, 93 trees (DBH range 5–32 cm; TH range 5.69–16 m), 2 to 6 per plot, were randomly selected from those analysed during the first phase for destructive measurement of BGB along with the variables from the first phase. In 2014, additional 37 trees (DBH range 5.5–32 cm; TH range 7.3–15.74 m) were felled outside sampling plots, 21 (DBH range 6.0–31 cm; TH range 9.37–15.74 m) inside and 16 (DBH range 5.5–32 cm; TH range 7.3–15.05 m) outside the study area (in Chicualacula district, Figure 2). The 93 trees collected in 2012 were used to fit BGB models (training sample) and those collected in 2014 (37 trees) were used to validate the models (testing sample).

Trees (both from 2012 and 2014) were cut down at 20 cm from the ground level. Thereafter, the root system was excavated and sampled as follows. First, the root system was partially excavated to the first node, using hoes, shovels, and picks; to expose the primary lateral roots. The primary lateral roots were numbered and separated from the taproot with a chainsaw and removed from the soil, one by one. This procedure was repeated in the subsequent nodes until all primary roots were removed from the taproot and the soil. Finally, the taproot was excavated and removed.

The removal of the root system to the total depth was relatively easy because 90% of the lateral roots of A. johnsonii are located in the first node, which is located close to ground level; the lateral roots grow horizontally to the ground level, do not grow downwards; and because the taproots had, at most, only 4 nodes and at least 1 node (at ground level). The root system was removed completely, so the depth of excavation depended on the depth of the taproot. For images illustrating the excavation process, refer to Magalhães and Seifert [16, 17, 38].

The root system was divided into following root components: lateral roots (fine and coarse), root collar, and taproot. These root components are not additive, as the taproot includes also the root collar (Figure 3); therefore, the root system is the sum of lateral roots and taproot. The remaining portion of the taproot, obtained after removing the root collar, was not considered as an independent component because it would be an artificial component, as the taproot includes the root collar as well; moreover, there is no name for such a portion (Figure 3). Lateral roots with diameters at the insertion point on the taproot <5 cm were considered as fine roots and those with diameters ≥5 cm were considered as coarse roots.

Figure 3
figure 3

Scheme of the root system and its components.

Most of the studies on BGB (e.g. [15, 19, 23, 31, 7275]) considered fine roots those with a diameter at insertion point on the taproot <2 mm and did not include them in the root samples. In this study, fine roots were defined as those with a diameter at insertion point on the taproot <5 cm and were included in the root samples. Therefore, although the definition of fine roots in this study is distinguished from that of most studies, the definition of fine roots by these authors is included in this study, and all dimensions of roots were considered here. Therefore, the definition of fine and coarse roots did not affect the estimates, as both definition categories were considered.

Fresh weight was obtained for the taproot, root collar, each coarse lateral root and for all fine lateral roots. A sample was taken from each root component, fresh weighed, marked, packed in a bag, and taken to the laboratory for oven drying. For the taproot, the samples were two discs, one taken on the top of the root collar and another from the middle of the taproot. For the coarse lateral roots, two discs were also taken, one from the insertion point on the taproot and another from the middle of it. For fine roots the sample was 5–10% of the fresh weight of all fine lateral roots. Oven drying of all samples was done at 105°C to constant weight, hereafter, referred to as dry weight.

Dry weights of the taproot, root collar, and lateral roots were determined by multiplying the ratio of oven-dry- to fresh-weight of each sample by the total fresh weight of the relevant component. The dry weight of the root system was obtained by summing the dry weights of taproot and lateral roots (fine and coarse ones). The disc taken on the top of the root collar was used to estimate its dry weight and the one taken in the middle of the taproot was used to estimate the dry weight of the remaining portion of the taproot.

Data analysis

Possible variations of BGB allocation patterns with tree size (DBH, RCD, and TH) were studied by investigating the dependence of the ratios of lateral roots to root system (LR/RS), taproot to root system (TR/RS), root collar to root system (RC/RS), and root collar to taproot (RC/TR) biomasses on tree size, using Pearson’s correlation coefficient test of significance and distance covariance (dcov) test of independence. The first test was performed using the cor.test function of R software [76] and the second using the dcov.test function of energy package [77] in R software [76].

The Pearson’s correlation coefficient measures only linear dependencies. Although, the dcov test of independence measures all types of dependences (linear, non-linear and non-monotone) between random vectors X and Y in arbitrary dimension [78], the Pearson’s correlation coefficient was also considered, because unlike dcov test, it shows the direction of variation between two variables.

Biomass models were fitted using weighted non-linear regression. Non-linear models were preferred over linear ones because biomass is a non-linear function of stem diameter and height [32, 7982]. Weighted least squares (WLS) were preferred over ordinary least squares (OLS) to address the heteroskedasticity as, quite often, the error variance is functionally related to the independent variables in regression [83], that is, the variability of the biomass increases with tree independent variables [84]. The weight functions were obtained by modelling the error structure as described by Parresol [83, 85]. For that, the squares of the OLS residuals were fitted against the different combination of the independent variables. Thus, it was assumed that the squares of the OLS residuals are representative of the error variance.

The tested model forms for all root components are given below.

$$\hat{Y} = b_{0} \times DBH^{{b_{1} }}$$
(4)
$$\hat{Y} = b_{0} \times DBH^{{b_{1} }} \times TH^{{b_{2} }}$$
(5)
$$\hat{Y} = b_{0} \times \left( {DBH^{2} \times TH} \right)^{{b_{1} }}$$
(6)
$$\hat{Y} = b_{0} \times RCD^{{b_{1} }}$$
(7)

where Ŷ is expressed in Kg, DBH and TH are expressed in cm and m, respectively.

A model form using the RCD only as a predictor variable was also considered to allow the estimate of BGB when trees are found already cut down and only stump dimensions are available. Estimation of BGB of exploited trees is very important, as in that case, BGB can be used to estimate the carbon that will be transferred to the soil and/or the nutrients that will be reclaimed by the site.

However, the sum of the biomass predictions for the root components will not equal the biomass prediction for the root system, and a desired and logical feature of the root component regression equations is that the sum of biomass predictions of the root components equals the prediction for the root system. To cope with that, a new root system biomass model form was obtained as a function of the predictor variables of the best taproot and lateral roots biomass model forms. Then, the new root system model form and the best taproot and lateral roots biomass model forms were fitted again, simultaneously, using weighted non-linear seemingly unrelated regression (WNSUR) with parameter restriction, to achieve additivity.

Root component models were fitted with the statistical software R [76] and the function nls using the Gauss–Newton algorithm. The simultaneous models (WNSUR models) were fitted using PROC MODEL statement of SAS software [86], using the ITSUR option. Restrictions on the regression coefficients were imposed by using RESTRICT statement. The start values of the parameters in nls function of the R software and in PROC MODEL statement of the SAS were obtained by fitting the logarithmized models of each component in Microsoft Excel.

The following criteria were used to evaluate the models (independent and simultaneous ones): adjusted coefficient of determination (Adj.R2), mean residual (MR), standard deviation of residuals (Sy.x), test for heteroskedasticity of residuals, test for normality of residuals, and graphical analysis of residuals. The mean residual and the standard deviation of residuals were expressed as relative values, hereafter referred to as percent mean residual [MR (%)] and coefficient of variation of residuals [CVr (%)], respectively, which are more revealing. MR measures the average model bias, describing the directional magnitude, the size of expected under and overestimates. The ideal value is zero. CVr measures the dispersion between the observed and the estimated values of the model. It indicates the error that the model is subject to when is used for predicting the dependent variable. The ideal value is zero.

The heteroskedasticity of residuals was evaluated using the White’s test for heteroskedasticity, with the aid of the package het.test [87] of the R statistical software [76], under the null hypothesis of homoscedasticity. For that, the residuals were used as dependent variables and the predicted root component biomass as independent variable. The normality of residuals was evaluated using the Lilliefors normality test under the null hypothesis of normality, using the lillie.test function of R statistical software [76].

The models were then validated inside and outside the study area using aggregate difference in percentage (AD) [5, 88] and by comparing the observed and predicted BGB using Wilcoxon signed rank test in R [76] as recommended by Philip [89], under the null hypothesis of no difference between the observed and predicted BGB values. Aggregate difference is the prediction error of the models using an independent sample of trees (e.g. testing sample; trees not included in the sample used to fit the models).

All the statistical analyses were performed at α = 0.05.