Introduction

The bamboo species Guadua angustifolia Kunth has its natural habitat in Colombia, Ecuador, and Venezuela and was introduced to other countries in Central and South America, Europe and Asia (Londoño 2001). It is one of the 3 largest growing bamboo species (with diameters up to 18.5 cm) and one of the economically most important in the world (Londoño et al. 2002). It has a wide distribution all over Colombia and covers an estimated area of 51,500 ha, of which about 46,250 ha are natural stands and 5.250 ha are plantations (Londoño 1998). Within the coffee region, where this study was done, a recent inventory yielded an estimated of about 40,000 ha of Guadua stands (Kleinn and Morales 2006), which are highly fragmented and follow mainly the courses of rivers and creeks as “gallery bamboo forest”. For numerous rural communities, this bamboo resource plays an important role for their livelihoods as raw material for manifold construction purposes and also for handicraft. For many families, Guadua products are the main source of income (Riaño et al. 2002).

While research into the use of Guadua culms and in particular into the technological properties of the Guadua wood is relatively well established (Londoño et al. 2002; Camargo 2006), this is less so for the management of the Guadua resource. This management has been following largely traditional approaches, where the experiences in the communities were passed to the following generations, and this refers practically exclusively to natural Guadua stands. It is only for a relatively short period of time that systematic scientific approaches are being pursued as to learn, for example, about the growth and regeneration pattern of Guadua bamboo, about the potential impact of harvesting schemes and/or site characteristics on wood quality, about the establishment of artificial Guadua plantations, and, summarizing the above, about optimizing the production systems.

One of the basics of the management of a renewable natural resource is “to get the numbers right”, that is to develop approaches for the procurement of the information that is required for informed management decisions; these information requirements embrace both biophysical features such as area, density, volume etc. and use-related features such as the harvesting schemes and utilization purposes. For Guadua bamboo in Colombia, actually, there is quite some scattered information, but still the basic mensuration relationships have not been comprehensively worked out and transparently presented. This input is not only required for immediate practical applications (when, for example, marketing and selling the resource “Guadua culms”) but also to foster research towards optimal management systems for Guadua stands.

Diameter at breast height (dbh) and total height are variables that are commonly observed in forest inventories. They are excellent indicators for growing stock, site quality and other stand characteristics important for management and planning. Although the biological growth patterns of bamboo and trees are very different, the basic issues that we need to address in a bamboo inventory are not so different: we are in the first place interested in the number of culms per hectare, in the diameter distribution and also in culm length and volume information. For commercially utilized trees height and length are usually very close to identical. For bamboo, however, which is highly bended in the upper sections, height and length are considerably different (Camargo 2006). For bamboo, the variable of interest is culm length, because it is directly related to the eventual commercial product.

The height of the diameter measurement must also be clearly defined for Guadua bamboo. In forestry, it is common to use a fixed height for the default diameter measurement (breast height = 1.3 m as an international standard applied in many regions). This concept, however, is not immediately suitable for bamboo mensuration because diameter varies considerably with proximity to the internodes. Therefore, it is sensible to define here the reference diameter in the middle between those internodes where the conventional breast height is 1.3 m . It means that the diameter is taken at slightly varying heights but all close to 1.3 m.

For bamboo species, regression models to predict culms length from diameter are rarely used (for example (FU 2001). A very simple model predicts culm length (L) in metres from the perimeter (C) in centimetres at breast height times a constant K, which is to be determined empirically for the specific areas of interest. Among the published constants for G. angustifolia are K = 0.56 (Arbeláez 1996) and K = 0.5837 (Cruz 1994).

Culm volume is one of the central attributes when analysing and comparing stand productivity (García 2004; Camargo et al. 2007). There are not many studies on volume functions for bamboo (Camargo 2006, Reid et al. 2004). In this context, it is also important to stress the differences between the assessment of the development of tree stands and bamboo stands. For bamboo, the development of the diameter of the individual culm is of lesser relevance as bamboo has no vascular cambium and consequently no secondary diameter growth: each culm emerges with a fixed diameter (Judziewicz et al. 1999), which it maintains over its entire life period. Culm height and length are an important factor of culm development. After having reached the maximum elongation, however, which may be as soon as after 6 months, it takes much more time for the wood to reach maturity (Liese 1995). It is one of the most important and difficult aspects of the management of Guadua bamboo to determine the earliest point in time of full maturity of the culms. Culms that are harvested early have much worse physical–mechanical properties and are much less durable.

In addition, many bamboos species (Guadua included) have hollow internodes with a lumen and a wall thickness that varies considerably from species to species (Judziewicz et al. 1999). The variable wood volume as a productivity measure for bamboo stands requires, therefore, a very clear definition (FU 2001).

For volume prediction in forest inventories, diameter is usually the most relevant predictor variable (Sharma et al. 2002). However, most of the bamboo stands are relatively homogenous with respect to diameter, culm length and culm volume. Therefore, integration of length as a second predictor should be straightforward. We define here two concepts of volume: the culm volume is the gross volume of the entire culm, including the hollow sections; the woody volume (=net volume) may then be estimated by a conversion factor or a more elaborated statistical model.

The objective of this study is to analyse and evaluate different regression models for length and volume prediction of G. angustifolia culms in the coffee region of Colombia, which is one information component that is required for the sustainable management of this resource.

Materials and methods

Study area and sampling

The study sites are located in the coffee region of Colombia, which has a total area of 1.029.524 ha and elevations between 900 and 2000 m.a.s.l. This vertical range corresponds to the elevation range of G. angustifolia there. Mean annual temperature is between 20° and 27°C and precipitation between 950 and 2,500 mm per year with a bimodal distribution. Most of the soils are Andisols from volcanic ashes, are slightly acid and have good physical properties such as structural stability, high effective depth, low apparent density and high porosity (García 2004).

The sample culms came from an inventory that had been carried out in a research project on Guadua bamboo (Kleinn and Morales 2006). Systematic two-stage sampling was applied. In the first stage, the entire study area was subdivided into primary units of 10 km × 10 km. Out of these primary units, a number of 103 were randomly selected. In the centre of each selected primary unit, a square plot of 3 km × 3 km was established on aerial photographs. In these aerial photo sample plots, all G. angustifolia stands were mapped. A number of 16 aerial photo sample plots were selected for field sampling, and a total of 101 field plots of 10 m × 10 m were randomly selected within these mapped G. angustifolia stands.

In each 10 m × 10 m field plot, diameter at breast height (at middle of the internode at breast height) of all culms was tape measured. Culm length was measured at 3 culms that were randomly selected on the plot and cut. These culms were then also used to calculate apparent and net volume by the Smalian equation (Dieguez et al. 2003), where “apparent volume” includes all hollow parts of the culm, and “net volume” refers to the volume of the woody parts of the stem only. Culm diameter and wall thickness were measured for that purpose at 3 sections of the culm: at the height of the lowest internode, in the middle of culm and at the height of the last internode. Per-hectare stand variables were extrapolated from the sample plot data such as basal area (based on the diameter measurement), number of culms and the quadratic mean diameter.

Regression models

A large number of height curves and volume functions have been reported in the forest mensuration literature, mostly species- and/or region-specific. For this particular study, for the length models, we consider those most commonly used in forest mensuration and those that were used for other bamboo species as Phyllostachys pubescens for predicting height (Fu 2001).

The following notation is used throughout this paper: h = total length of the culm [m]; h int = height at lowest internode [cm]; d = diameter at internode at breast height [cm]; d g = quadratic mean diameter of the stand [cm]; d int = diameter at the height of the lowest-internode [cm]; N = number of culms per hectare; t = mean culm wall thickness, e = base of natural logarithm and b i = estimated regression coefficients.

A total of 15 models for culm length and 7 for culm volume were fitted and validated (Tables 1 and 2). It is important to remark that most of the models analysed for both variables have been applied by different authors exclusively for predicting tree height (Pearl and Reed 1920; Winsor 1932; Richards 1959; Curtis 1967; Yang et al. 1978; Schreuder et al. 1979; Bates and Watts 1980; Schnute 1981; Sibbesen 1981; Wykoff et al. 1982; Buford 1986; Larson 1986; Ratkowsky and Reedy 1986; Larsen and Hann 1987; Zeide 1989; Ratkowsky 1990; Huang et al. 1992; Prodan et al. 1997) and tree volume Sharma et al. (2002) and Prodan et al. 1997.

Table 1 Functions used to model culm length from culm diameter
Table 2 Functions used to model culm volume from culm diameter and culm length

Statistical analysis

The majority of the models described in Tables 1 and 2 are non-linear regression models. Analysis was done with PROC NLIN (SAS 1999) of SAS/STAT® using the Gauss–Newton algorithm (Hartley 1961). The initial values for the iterative procedure were obtained by previously linearizing the model equations and fitting it to the data by ordinary least squares.

The estimated models were compared by graphical and numerical analysis of the residuals and by four statistics that describe model fit and quality: the bias B, the mean square error MSE, the coefficient of determination and the adjusted coefficient of variation, R² and \( R_{\text{adj}}^{2} \), respectively, and Akaike’s information criterion differences (Δ j ).

A cross-validation was carried out for each model, estimating the regression coefficients from a random subset of the data and taking the remaining data to test the performance of the resulting model. This performance was evaluated from the prediction residuals by the same statistics that were used to assess model fit (see preceding paragraph). In addition, \( {\text{MEF}}_{\text{adj}}^{2} \) was calculated, a measure of model precision (López et al. 2003).

To check for bias, the linear model (Hui and Gadow 1999) was fitted between the observed (and assumed true) value and the predicted value; a bias is likely to be absent with a given error probability if the estimated slope coefficient does not differ significantly from 1 and the estimated intercept not from 0. Values of culm apparent and net volume estimated with the form factor were compared with the regression models by bias and mean square error.

Results

Models for culm length

Regression models 13 and 15 performed best for culm length prediction. The adjusted coefficient of determination (\( R_{\text{adj}}^{2} \)) is around 0.5, and the bias is small. Best precision (MSE) is achieved with the modified model 13 (López et al. 2003) and also Akaike’s criterion and \( {\text{MEF}}_{\text{adj}}^{2} \) indicate its best performance. This model also resulted slightly better than model 15. Cox I (López et al. 2003) that showed similar values (Tables 3 and 4). The modified model 13 Cox I Mod 1 (López et al. 2003) includes the stand variable d g, and the good results obtained with this model are consistent with those obtained for Pinus radiata in Chile (Cox 1994) and in Spain (López et al. 2003). However, no references were found for applying these models to Guadua or other bamboo species.

Table 3 Regression statistics for the culm length models
Table 4 Statistics of cross-validation for culm length models

Graphical analysis of bias and mean square error against diameter classes were done for all models. The plot of residuals versus predicted culm length showed that there was no reason to reject the hypotheses of normality, homogeneity of variance and independence of residuals (Fig. 1) for model 13. Cox I Mod 1 (López et al. 2003). The comparison of observed and predicted culm length values exhibits relatively wide variation, but a major systematic deviation is not observed (see Fig. 2).

Fig. 1
figure 1

Residuals versus predicted culm length for model 13. Cox I Mod. 1 (López et al. 2003)

Fig. 2
figure 2

Observed versus predicted culm length for model 13. Cox I Mod. 1 (López et al. 2003)

Bias and mean square error were calculated and plotted against diameter. Additionally, other models are also depicted so as to demonstrate the contrast to the best model 13 Cox I Mod. 1 (López et al. 2003). In general, the models show a trend towards underestimation in particular for the higher diameter classes; however, model 14 Cox II Mod. 2 (López et al. 2003) is always upwards biased (Figs. 3 and 4).

Fig. 3
figure 3

Deviations between true and predicted value (bias) per diameter class for models 13 Cox I Mod. 1 (López et al. 2003), 1 (Linear), 6. (Larson 1986) and 14 Cox II Mod. (López et al. 2003)

Fig. 4
figure 4

Mean square error (MSE) per diameter class for models 13 Cox I Mod. 1 (López et al. 2003), 1 (Linear), 6. (Larson 1986) and 14 Cox II Mod. (López et al. 2003)

Models for culm volume

The culm volume models showed an overall good performance where, in general, apparent culm is more precisely predicted than net volume; inclusion of culm length as predictor variable improves the models considerably (Tables 5 and 6).

Table 5 Regression statistics for the volume models
Table 6 Statistics for cross-validation of volume models

Model 4 Schumacher-Hall (Dieguez et al. 2003) for apparent volume and model 5 (allometric III) for net volume perform best here. The plot of residuals versus predicted culm volume gives no reason to reject the hypotheses of normality, homogeneity of variance and independence of residuals (Figs. 5 and 6).

Fig. 5
figure 5

Residuals versus predicted apparent volume for model 4 Schumacher–Hall (Dieguez et al. 2003)

Fig. 6
figure 6

Residuals versus predicted net volume for model 5 allometric III

The regression and the plot of observed versus predicted culm volumes (Figs. 7 and 8) show a relatively strong relationship with moderate variability only and no indications for systematic deviations.

Fig. 7
figure 7

Observed culm apparent volume values versus predicted culm apparent volume model 4 Schumacher-Hall (Dieguez et al. 2003)

Fig. 8
figure 8

Observed versus predicted culm net volume for model 5 allometric III

To analyse the performance of the best models, bias and mean square error were plotted against diameter classes. For comparison, model 2 (allometric I) was included in this analysis. The precision of prediction in terms of absolute values tends to decrease for higher diameter classes for all statistics depicted in Figs. 9, 10, 11, 12.

Fig. 9
figure 9

Deviations between true and predicted value (bias) of culm apparent volume per diameter class for the model that showed best performance, model 4 Schumacher-Hall (Dieguez et al. 2003) contrasted with model 2 (allometric I)

Fig. 10
figure 10

Mean square error (MSE) of predicted culm apparent volume per diameter class for the best model 4 Schumacher-Hall (Dieguez et al. 2003) and the much poorer performing model 2 (allometric I)

Fig. 11
figure 11

Differences between true and predicted value (bias) of culm net volume per diameter class for the best model 5 (allometric III) and the much poorer performing model 2 (allometric I)

Fig. 12
figure 12

Mean square error (MSE) of predicted culm net volume per diameter class for the best model 5 (allometric III) and the much poorer performing model 2 (allometric I)

The relationship between culm apparent volume and culm net volume is based on culm features such as diameter and culm length, which are used to calculate both variables from culms cut. Hence, this relationship may be used to estimate culm net volume from culm apparent volume. A simple model results as

$$ V_{\text{net}} = 0.001059 + 0.2840.V_{\text{app}} $$

(\( {R}_{\text{adj}}^{2} \) = 0.9398 MSE = 1.5824E−05).

Discussion

Models for culm length

Model 13 Cox I Mod. 1 (López et al. 2003), which includes two independent variables, gives an overall better performance than the others. This may be due in part to the inclusion of the stand-characterizing variable d g that may reflect stand conditions. It is likely that inclusion of further independent variables would help increasing model fit and precision of prediction.

This is also seen with models 15 Cox I (López et al. 2003) and 14 Cox II Mod. 2 (López et al. 2003), despite their larger bias. However, including more variables means increasing the measurement effort at the same time. Model 13 Cox I Mod. 1 (López et al. 2003) offers a balance between model accuracy and measurement effort, because the value of d g is readily obtained from d and N.

The basic regression models used are those that are successfully also used for trees in forest inventories. However, growth pattern of Guadua and of trees is fundamentally different: One single individual of G. angustifolia has usually numerous culms. One encounters even stands which practically consist of one single individual. Consequently, culm length might depend not only on diameter but also on stand and site variables (including those which are influenced and codetermined by the management of the stand). The inclusion of stand variables is also used for trees (Dieguez et al. 2003; Prodan et al. 1997), and the inclusion of stand variables into the models helped improving the predictions (López et al. 2003).

For bamboo species, a—though relatively weak—relationship between total number of culms in a stand and culm length has been described (Camargo 2006; Fu 2001). This relationship may be among the reasons for the better performance of models that include stand variables.

Models for culm volume

For obvious geometric reasons, both culm diameter and culm length are important factors determining culm volume, and the precision of models that included both variables resulted higher. However, also model 2 (Allometric I) with the predictor variables diameter at internode at breast height (d) and diameter at lowest-internode height (d int) shows an acceptable level of precision and may be used for volume estimation. Model 2 is recommendable in situations where culm length and/or wall thickness were not observed.

Volume models fitted with only the diameter as predictor variable yield good predictions if the relationship between culm diameter and culm height is strong (Prodan et al. 1997), and this holds mainly for application in smaller area inventories. Wall thickness is an important variable for culm net volume, and it is included in the best model 5 (allometric III). Absolute wall thickness decreases as does the diameter from the base to the top, and this tendency may contribute to explain why models to estimate apparent volume can also be used for estimating net volume.

Conclusions

Regression models commonly used in forest inventory proved flexible enough also for Guadua bamboo culm length and culm volume prediction. Our data are from the coffee region of Colombia, and the model performance found here holds for that region in the first place. Transfer and application to other regions would require careful validation. All data were from natural Guadua stands, which is the by far most common stand type in the study region. Guadua plantations established recently (about eight years ago) where the bamboo clumps have not yet reached their maximum growth did not appear suitable for models that are predominantly used to estimate the harvestable resource.

The models resulting from this study will serve as a useful tool for those who manage Guadua stands in the coffee region of Colombia. The methods applied in this study will be helpful also for bamboo inventories in other regions and do also support the implementation of large area studies such as the special study on bamboo resources envisaged in the Global Forest Resource Assessment FRA2010 of FAO (Lobovikov et al. 2007). An important immediate application of these models will be to bring more light into the overall biomass and carbon production and dynamics of Guadua stands.