Introduction

Site-index models relate height, age, and site quality (potential productivity) in even-aged single-species stands. They are used for predicting stand height development and for assessing site quality. The principles can be traced back to the 18th Century (Batho and García 2006), and various approaches are described in books such as Belyea (1931); Spurr (1952); Assmann (1970); Clutter et al. (1983); von Gadow and Hui (1999); Pretzsch (2009). General reviews have been published by Jones (1969); Carmean (1975); Hägglund (1981); Ortega and Montero (1988); Grey (1989). Some of the details are subtle. Assumptions and interpretations are not always clear and explicit, leading to misunderstandings and controversy. We focus here on some implications of natural variability of stand development. Mathematical and modelling aspects are stressed, but detailed statistical procedures are beyond the scope of this article.

Three sources of variability may be identified: (a) between sites, (b) within sites, and (c) observation error. These are illustrated in Fig. 1. The between-sites variation gives rise to a family of site curves, each representing a height–age trajectory for a certain site quality. It is discussed briefly in the next Section. For a given site, individual stands will deviate from the nominal site curve due to weather fluctuation and other factors. The implications of this within-site variability are the main topic here. In addition, height observations (and sometimes age) are subject to sampling and measurement error. These errors can be important in devising appropriate estimation and statistical inference procedures, but will not be discussed in detail.

Fig. 1
figure 1

Nominal site-index curve (dashed), actual trajectory (solid), and observations (isolated dots). A and B indicate different site-index definitions (see text)

Over time, site-index modelling has developed its own methods and terminology. The article makes an effort to link these to standard mathematical concepts. It is hoped that tapping into a wider pool of knowledge may facilitate future progress.

Variation between sites

Differences in height growth across sites are the basis of the site-index concept. Ignoring some of the natural variation, the main ideas are not difficult to understand. It is assumed that stands follow height–age trajectories characteristic of each site quality, the site-index curves (or site curves) drawn with dots in Fig. 1. The curves do not intersect, except possibly at the origin, and greater heights at any particular age indicate higher site quality.

Equation-based models usually start with a growth curve function H = g(A), where A is stand age and H is some measure of stand height. The growth curve is made to vary with site quality by including a site-dependent parameter, say q, so that H = g q (A) = g(Aq). For instance, with the Schumacher (1939) function \(H = a\exp(-b/A)\), typically one of the parameters a or b is taken as site-dependent (or local), while the other is assumed to be common to all sites and stands (global). Thus, one may have a model where the site curves differ by an H-scale factor, \(H = q\exp(-b/A)\) (called anamorphic), or by an A-scale factor, \(H= a \exp(-q/A)\). More generally, both original parameters might be assumed to be site-dependent, as in \(H = \alpha \beta^q \exp(-q/A)\), where q is local, and α and β are new global parameters.

The local parameter q serves only as a label for the individual site curves and can be chosen in different ways. Curves may be labeled simply by discrete quality classes, often with roman numerals. The most common continuous labeling scheme uses a site index, S, defined as the curve height at some reference base age A b . The site index is related to any other site-dependent parameter q through \(S = g_q(A_b)\).

Variation within sites

Clearly, real stands will inevitably deviate from the curve specified by any deterministic model. The model does not necessarily ignore this and the curve may be interpreted as a point estimate, a predicted, expected, or most likely height–age trajectory. We shall not be specific about the differences among these (mean, mode, median, etc.) and will say “predicted” or “nominal”.

Which site index?

A first consequence of this within-site variability is the existence of different definitions of site index. Some authors have explicitly or implicitly defined site index as the actual height reached by a particular stand at the base age. This is a property of the stand and is different from the definition based on predicted height given in the previous Section, which is a property of the site. “Stand site index” corresponds to point A in Fig. 1 and “site site index” to point B.

Definitions cannot be right or wrong, but the proper statistical treatment differs, and lack of clarity on this point can (and has) lead to misunderstanding and controversy. Under the stand site index view, models appropriate for predicting height may differ from those for assessing site quality, and statistical analysis typically involves error-in-variables situations (Curtis et al. 1974; Goelz and Burk 1992). The site site index approach is more abstract, although it may be closer to the original idea.

Focusing instead on local and global parameters sidesteps these issues.

Dynamics

The nominal model height–age trajectories correspond to a prediction “at birth”. By analogy to a person’s life expectancy, and for similar reasons, a future height predicted at birth can be expected to be different from that predicted later in life. Generally, for a stand growing in site quality q, one knows (or has an estimate of) its height H 1 at age A 1 and wants to predict the height H 2 at some other age A 2. Writing \(t = A_2 - A_1\) for the prediction interval, the predicted height is some function

$$ H_2 = F_q(A_1, H_1, t). $$
(1)

It is assumed that this function is continuous and smooth (differentiable). These assumptions are implicit in the use of growth functions like those in Sect. 2, even though the model does not usually pretend to reproduce the seasonal growth fluctuations within a year (but see García 1979, 1999, for one way of doing this).

The function (1) is special in that it must satisfy two consistency conditions: prediction over a zero-length interval must return the starting value, i.e. \(F_q(A_1, H_1, 0) = H_1\), and two predictions over consecutive intervals must give the same result as a single prediction over the whole: \(F_q[A_1 + t, F_q(A_1, H_1, t), u] =F_q(A_1, H_1, t + u)\) (Sullivan and Clutter 1972; Clutter et al. 1983, p. 123; García 1979, 1994)Footnote 1. Such a function, or more precisely (1) together with the obvious age “prediction” function \(A_2 = A_1 + t\), is known as a (global) transition function (Padulo and Arbib 1974; García 1994, and references therein) or a flow (Arnold 1973, Chap. 1) (a semi-flow if predictions back in time, \(t\,<\,0\), are not allowed). In forestry, these have been called also difference equations (Clutter et al. 1983), algebraic difference equations (ADE, Cieszewski and Bailey 2000), or self-referencing functions (Northway 1985), at least when independent of q (see below).

Any flow (1) can be obtained as the solution of an ordinary differential equation (ODE)

$$\frac{{\hbox{d}}H}{\hbox{d}t} = f_q(A, H) $$
(2)

(and \(\hbox{d}A/\hbox{d}t = 1\)), which is sometimes more convenient (Arnold 1973). In fact, modelling often starts with an ODE formulation, and the transition function arises through integration. The reversal of this classical view by the Russian school of Anosov, Arnold, and others, starting from flows as the more primitive concept, is attractive because it is often argued that ODE’s are not appropriate for forest modelling. It is also closely related to the ADE ideas introduced by Clutter (1963) and Bailey and Clutter (1974).

Alternatively, flows may be described by an invariant, or first integral, an expression that remains constant over trajectories of the flow. A trajectory is the curve generated by varying t in (1), for fixed A 1 and H 1. This relates also to ADE’s, as shown below. For multivariate generalizations, see García (2010).

Site curves and ADE’s

The algebraic difference approach (ADA/GADA)Footnote 2 produces a flow or ADE compatible with a given site equation \(H\,=\,g(A, q)\) (Bailey and Clutter 1974; Clutter et al. 1983; Cieszewski and Bailey 2000). The usual procedure consists of solving for the local parameter, \(q = \varphi(A, H)\), and equating the value at two height–age points:

$$ \varphi(A_1, H_1) = \varphi(A_2, H_2). $$

Solving for H 2, one obtains the flow equation (ADE). For example, with the anamorphic Schumacher model \(H = q \exp(-b / A); q =H\exp(b / A), H_1 \exp(b / A_1) = H_2\exp(b / A_2) , \text{and}\,H_2 = H_1 \exp(b / A_1) / \exp(b /A_2) = H_1 \exp[b (1/A_1 -1/A_2)] \;\).

Note that \(\varphi(A, H)\) is an invariant, constant for all the points on a trajectory. Alternatively, one may differentiate the invariant and obtain the ODE for the flow:

$$ \frac{{\hbox{d}}H \exp(b / A)}{{\hbox{d}}A} = 0\\\frac{{\hbox{d}}H}{\hbox{d}A} \exp(b / A) + H \exp(b / A) (-b /A^2) = 0, $$

from where,

$$ \frac{{\hbox{d}}{H}}{\hbox{d}A} = b \frac{H}{A^2}. $$

This ODE was actually the starting point of Schumacher (1939).

The ADE is a relationship between any two points lying on the same site curve. The corresponding ODE is called the ODE of a one-parameter family of curves (e.g. Agnew 1960, Chap. 4); in this case, the family of site curves parameterized by q.

The ADE, or its ODE, predicts that any stand of site quality q that currently lies on the site-q curve will continue to follow that curve, as one might expect. However, that is not the only flow and ODE with this property. One could use any other invariant in the derivation, not just the one that corresponds to the local parameter. A few possibilities for the anamorphic Schumacher are shown in Table 1. The ADE, first row, is the (only) flow that does not depend on site quality. The one on the second row is obtained through solving for b; the others cannot be obtained by the method of equating parameters. The ODE in row 3 is a function of A, and that in 4 is a function of H. There is an infinity of ODE’s depending on both A and H (and q) that produce the same growth curve, starting from a point on the curve. In other words, a site equation H = g(Aq) is not sufficient by itself to predict future dynamics when the stand deviates from the nominal curve; hypotheses about the growth rate are needed.

Table 1 Some flows and ODES that generate \(H = q \exp(-b/A)\)

Prediction

Consider a stand for which we know, or have estimates of, the current age A 1, height H 1, and the site quality q. In the absence of within-site variability, \((A_1, H_1)\) would be on the site-q curve and, as just explained, any of the flows or ODE’s discussed in the previous section then give the same predictions, following the nominal site curve. In reality, the stand would almost certainly have deviated from the nominal curve, and then the different flow and ODE equations produce different predictions.

This is illustrated in Fig. 2, for an anamorphic Schumacher site model with \(b\,=\,18\). Representative site curves are drawn with dashes and labeled with the site index for base age 25. The Clutter-Bailey ADE always follows the site curve passing through the current point, regardless of site quality. On the other hand, the solid curves show the trajectories predicted by the flow on the forth row of Table 1 for stands of site index 20 that have deviated above or below the nominal curve.

Fig. 2
figure 2

Dashed: Site-index curves and ADE trajectories, anamorphic Schumacher model with \(b\,=\,18\). Continuous: predicted trajectories for site index 20 according to the fourth row of Table 1. Site indices for base age 25

ADE’s describe site curves well and, by eliminating local parameters, make it possible to fit site models using standard nonlinear regression packages. But their representation of growth dynamics in the presence of within-site variation is questionable. A site-index 20 stand that happens to be on the site-index 18 curve can be expected to grow faster than a site-index 18 stand that so far has stayed the course.

Among the alternatives, there are good biological and other reasons to prefer the one where the growth rate (2) depends on size (H), but not on age (e.g. Peñuelas 2005). For the Schumacher, it is the one on row 4 of Table 1 and in Fig. 2.

Figure 3 displays similar predicted trajectories from the model of Hu and García (2010). It is based on an age-invariant Bertalanffy-Richards ODE. For another example, see the EasySDE User Guide at http://forestgrowth.unbc.ca/sde.

Fig. 3
figure 3

From model by Hu and García (2010). Dashed: site-index curves for sites 19, 20, 21. Solid: projected heights for stands growing in site quality 20, currently at the dot locations. ADE predictions follow the dashed curves

One situation where the various predictions coincide is where the only information available about a stand is the current age and height, without any knowledge of site quality or previous measurements. Then q must be inferred from the starting point, and it can be seen that the resulting trajectories will be the same. This might be a common occurrence when applying these systems in practice. During model development, however, there are usually multiple measurements in a sample plot, and ADA essentially derives site quality separately for each pair of measurements, ignoring the restriction of a common value for the plot. The question of if discarding some information is compensated by avoiding the complications of having to deal with local parameters would depend on the data and other considerations, and seems difficult to answer in general.

Conclusions

The site modelling methods pioneered by Clutter and others deviated from the classical ODE-centered approaches to dynamical systems common in other fields. Although they might have seemed ad hoc, it is remarkable how they actually paralleled to some extent more recent developments in Mathematics (Arnold 1973; Anosov et al. 1997).

Given current stand conditions, the growth rate in ADE/ADA/GADA models is independent of site quality. This might be seen as a conceptual flaw. Deviations from the nominal curves, however, are unlikely to be large, so the practical implications are not entirely clear. Certainly, ADA made feasible the development of good site models at a time when computing resources were limited. The computational and flexibility advantages of ADA/GADA techniques might well carry into the future.

Only essentially deterministic aspects of growth forecasting have been discussed, dealing with predicted or nominal trajectories. A superimposed stochastic structure is important for hypothesis testing and in the search for good estimators. One natural extension is to include environmental perturbations in the ODE’s, possibly adding also observation errors (Hotelling 1927; Seber and Wild 2003, Sect. 7.5). It is tempting to use hierarchical modelling for the sources of variation, treating the local parameter as “random” (Snijders 2003; Hall and Bailey 2001); but it should be remembered that height–age data is rarely a random sample from the target population, and the effects of violating this assumption are unclear.