# Dynamical implications of the variability representation in site-index modelling

- 556 Downloads
- 10 Citations

## Abstract

Issues in the development and formulation of forest site-index models are examined, linking the forestry terminology and methods to standard mathematical concepts. Variability complicates interpretation. Three sources of variation are distinguished: between sites, within sites, and observation error, with the article focusing mainly on the second one. Two site-index definitions arising from different views about the variability are contrasted. Modelling based on algebraic difference equations (ADE’s) is analyzed in detail, relating it to concepts of state space flows used in modern dynamical systems theory. It is shown that, given a stand current state, ADE’s predict growth rates that are independent of site quality.

## Keywords

Forest growth and yield Site productivity Algebraic difference equations ADA GADA Differential equations Dynamical systems## Introduction

Site-index models relate height, age, and site quality (potential productivity) in even-aged single-species stands. They are used for predicting stand height development and for assessing site quality. The principles can be traced back to the 18th Century (Batho and García 2006), and various approaches are described in books such as Belyea (1931); Spurr (1952); Assmann (1970); Clutter et al. (1983); von Gadow and Hui (1999); Pretzsch (2009). General reviews have been published by Jones (1969); Carmean (1975); Hägglund (1981); Ortega and Montero (1988); Grey (1989). Some of the details are subtle. Assumptions and interpretations are not always clear and explicit, leading to misunderstandings and controversy. We focus here on some implications of natural variability of stand development. Mathematical and modelling aspects are stressed, but detailed statistical procedures are beyond the scope of this article.

Over time, site-index modelling has developed its own methods and terminology. The article makes an effort to link these to standard mathematical concepts. It is hoped that tapping into a wider pool of knowledge may facilitate future progress.

## Variation between sites

Differences in height growth across sites are the basis of the site-index concept. Ignoring some of the natural variation, the main ideas are not difficult to understand. It is assumed that stands follow height–age trajectories characteristic of each site quality, the site-index curves (or *site curves*) drawn with dots in Fig. 1. The curves do not intersect, except possibly at the origin, and greater heights at any particular age indicate higher site quality.

Equation-based models usually start with a growth curve function *H* = *g*(*A*), where *A* is stand age and *H* is some measure of stand height. The growth curve is made to vary with site quality by including a site-dependent parameter, say *q*, so that *H* = *g* _{ q }(*A*) = *g*(*A*, *q*). For instance, with the Schumacher (1939) function \(H = a\exp(-b/A)\), typically one of the parameters *a* or *b* is taken as site-dependent (or *local*), while the other is assumed to be common to all sites and stands (*global*). Thus, one may have a model where the site curves differ by an *H*-scale factor, \(H = q\exp(-b/A)\) (called *anamorphic*), or by an *A*-scale factor, \(H= a \exp(-q/A)\). More generally, both original parameters might be assumed to be site-dependent, as in \(H = \alpha \beta^q \exp(-q/A)\), where *q* is local, and α and β are new global parameters.

The local parameter *q* serves only as a label for the individual site curves and can be chosen in different ways. Curves may be labeled simply by discrete quality classes, often with roman numerals. The most common continuous labeling scheme uses a *site index*, *S*, defined as the curve height at some reference *base age* *A* _{ b }. The site index is related to any other site-dependent parameter *q* through \(S = g_q(A_b)\).

## Variation within sites

Clearly, real stands will inevitably deviate from the curve specified by any deterministic model. The model does not necessarily ignore this and the curve may be interpreted as a point estimate, a predicted, expected, or most likely height–age trajectory. We shall not be specific about the differences among these (mean, mode, median, etc.) and will say “predicted” or “nominal”.

### Which site index?

A first consequence of this within-site variability is the existence of different definitions of site index. Some authors have explicitly or implicitly defined site index as the actual height reached by a particular stand at the base age. This is a property of the stand and is different from the definition based on predicted height given in the previous Section, which is a property of the site. “Stand site index” corresponds to point A in Fig. 1 and “site site index” to point B.

Definitions cannot be right or wrong, but the proper statistical treatment differs, and lack of clarity on this point can (and has) lead to misunderstanding and controversy. Under the *stand site index* view, models appropriate for predicting height may differ from those for assessing site quality, and statistical analysis typically involves error-in-variables situations (Curtis et al. 1974; Goelz and Burk 1992). The *site site index* approach is more abstract, although it may be closer to the original idea.

Focusing instead on local and global parameters sidesteps these issues.

### Dynamics

*q*, one knows (or has an estimate of) its height

*H*

_{1}at age

*A*

_{1}and wants to predict the height

*H*

_{2}at some other age

*A*

_{2}. Writing \(t = A_2 - A_1\) for the prediction interval, the predicted height is some function

The function (1) is special in that it must satisfy two consistency conditions: prediction over a zero-length interval must return the starting value, i.e. \(F_q(A_1, H_1, 0) = H_1\), and two predictions over consecutive intervals must give the same result as a single prediction over the whole: \(F_q[A_1 + t, F_q(A_1, H_1, t), u] =F_q(A_1, H_1, t + u)\) (Sullivan and Clutter 1972; Clutter et al. 1983, p. 123; García 1979, 1994)^{1}. Such a function, or more precisely (1) together with the obvious age “prediction” function \(A_2 = A_1 + t\), is known as a (global) *transition function* (Padulo and Arbib 1974; García 1994, and references therein) or a *flow* (Arnold 1973, Chap. 1) (a semi-flow if predictions back in time, \(t\,<\,0\), are not allowed). In forestry, these have been called also difference equations (Clutter et al. 1983), algebraic difference equations (ADE, Cieszewski and Bailey 2000), or self-referencing functions (Northway 1985), at least when independent of *q* (see below).

Alternatively, flows may be described by an *invariant*, or first integral, an expression that remains constant over trajectories of the flow. A trajectory is the curve generated by varying *t* in (1), for fixed *A* _{1} and *H* _{1}. This relates also to ADE’s, as shown below. For multivariate generalizations, see García (2010).

### Site curves and ADE’s

^{2}produces a flow or ADE compatible with a given site equation \(H\,=\,g(A, q)\) (Bailey and Clutter 1974; Clutter et al. 1983; Cieszewski and Bailey 2000). The usual procedure consists of solving for the local parameter, \(q = \varphi(A, H)\), and equating the value at two height–age points:

*H*

_{2}, one obtains the flow equation (ADE). For example, with the anamorphic Schumacher model \(H = q \exp(-b / A); q =H\exp(b / A), H_1 \exp(b / A_1) = H_2\exp(b / A_2) , \text{and}\,H_2 = H_1 \exp(b / A_1) / \exp(b /A_2) = H_1 \exp[b (1/A_1 -1/A_2)] \;\).

The ADE is a relationship between any two points lying on the same site curve. The corresponding ODE is called the ODE of a one-parameter family of curves (e.g. Agnew 1960, Chap. 4); in this case, the family of site curves parameterized by *q*.

*q*that currently lies on the site-

*q*curve will continue to follow that curve, as one might expect. However, that is not the only flow and ODE with this property. One could use any other invariant in the derivation, not just the one that corresponds to the local parameter. A few possibilities for the anamorphic Schumacher are shown in Table 1. The ADE, first row, is the (only) flow that does not depend on site quality. The one on the second row is obtained through solving for

*b*; the others cannot be obtained by the method of equating parameters. The ODE in row 3 is a function of

*A*, and that in 4 is a function of

*H*. There is an infinity of ODE’s depending on both

*A*and

*H*(and

*q*) that produce the same growth curve, starting from a point on the curve. In other words, a site equation

*H*=

*g*(

*A*,

*q*) is not sufficient by itself to predict future dynamics when the stand deviates from the nominal curve; hypotheses about the growth rate are needed.

Some flows and ODES that generate \(H = q \exp(-b/A)\)

| Invariant | Flow equation | ODE |
---|---|---|---|

1 | \(H \exp(b / A)\) | \(H_2 = H_1 \exp(b/A_1 - b/A_2)\) | \(\hbox{d}H/\hbox{d}{A} = b H/A^2\) |

2 | \(A \ln(H / q)\) | \(H_2 = q (H_1/q)^{A_1/A_2}\) | \(\hbox{d}{H}/\hbox{d}{A} = H \ln(q/H) / A\) |

3 | \(H - q \exp(-b / A)\) | \(H_2 = H_1 + q[\exp(-b/A_2) - \exp(-b/A_1)]\) | \(\hbox{d}{H}/\hbox{d}{A} = bq \exp(-b/A) / A^2\) |

4 | \(A + b / \ln(H/q)\) | \(H_2 = q \exp[b / (A_1 - A_2 + b / \ln(H_1/q))]\) | \(\hbox{d}{H}/\hbox{d}{A} = H \ln(H/q)^2 / b\) |

5 | \(q/H - \exp(b/A)\) | \(H_2 = q / [\exp(b/A_2) - \exp(b/A_1) + q/H_1]\) | \(\hbox{d}{H}/\hbox{d}{A} = (b/q) H^2 \exp(b/A) / A^2\) |

### Prediction

Consider a stand for which we know, or have estimates of, the current age *A* _{1}, height *H* _{1}, and the site quality *q*. In the absence of within-site variability, \((A_1, H_1)\) would be on the site-*q* curve and, as just explained, any of the flows or ODE’s discussed in the previous section then give the same predictions, following the nominal site curve. In reality, the stand would almost certainly have deviated from the nominal curve, and then the different flow and ODE equations produce different predictions.

ADE’s describe site curves well and, by eliminating local parameters, make it possible to fit site models using standard nonlinear regression packages. But their representation of growth dynamics in the presence of within-site variation is questionable. A site-index 20 stand that happens to be on the site-index 18 curve can be expected to grow faster than a site-index 18 stand that so far has stayed the course.

Among the alternatives, there are good biological and other reasons to prefer the one where the growth rate (2) depends on size (*H*), but not on age (e.g. Peñuelas 2005). For the Schumacher, it is the one on row 4 of Table 1 and in Fig. 2.

*EasySDE User Guide*at http://forestgrowth.unbc.ca/sde.

One situation where the various predictions coincide is where the only information available about a stand is the current age and height, without any knowledge of site quality or previous measurements. Then *q* must be inferred from the starting point, and it can be seen that the resulting trajectories will be the same. This might be a common occurrence when applying these systems in practice. During model development, however, there are usually multiple measurements in a sample plot, and ADA essentially derives site quality separately for each pair of measurements, ignoring the restriction of a common value for the plot. The question of if discarding some information is compensated by avoiding the complications of having to deal with local parameters would depend on the data and other considerations, and seems difficult to answer in general.

## Conclusions

The site modelling methods pioneered by Clutter and others deviated from the classical ODE-centered approaches to dynamical systems common in other fields. Although they might have seemed ad hoc, it is remarkable how they actually paralleled to some extent more recent developments in Mathematics (Arnold 1973; Anosov et al. 1997).

Given current stand conditions, the growth rate in ADE/ADA/GADA models is independent of site quality. This might be seen as a conceptual flaw. Deviations from the nominal curves, however, are unlikely to be large, so the practical implications are not entirely clear. Certainly, ADA made feasible the development of good site models at a time when computing resources were limited. The computational and flexibility advantages of ADA/GADA techniques might well carry into the future.

Only essentially deterministic aspects of growth forecasting have been discussed, dealing with predicted or nominal trajectories. A superimposed stochastic structure is important for hypothesis testing and in the search for good estimators. One natural extension is to include environmental perturbations in the ODE’s, possibly adding also observation errors (Hotelling 1927; Seber and Wild 2003, Sect. 7.5). It is tempting to use hierarchical modelling for the sources of variation, treating the local parameter as “random” (Snijders 2003; Hall and Bailey 2001); but it should be remembered that height–age data is rarely a random sample from the target population, and the effects of violating this assumption are unclear.

## Footnotes

- 1.
In forestry, the property has been called

*path invariance*. Mathematically, this is a one-parameter (*t*) continuous group of transformations. An example of Lie group, named after the Norwegian mathematician Sophus Lie (pronounced “Lee”). - 2.
This terminology should not be confused with the standard mathematical meanings. In mathematics, difference equations deal with sequences of uniformly spaced values. An algebraic equation may contain elementary operations and rational exponents, excluding exponentials and other transcendental functions (James, 1992).

## Notes

### Acknowledgments

This work owes much to extensive discussions over the years with Keith Rennolls, and with Chris Cieszewski. Encouragement and suggestions from Mike Strub are gratefully acknowledged.

### Open Access

This article is distributed under the terms of the Creative Commons Attribution Noncommercial License which permits any noncommercial use, distribution, and reproduction in any medium, provided the original author(s) and source are credited.

## References

- Agnew RP (1960) Differential equations, 2nd edn. McGraw-Hill, New YorkGoogle Scholar
- Anosov DV, Aranson SK, Arnold VI, Bronshstein IU, Grines VZ, Il’yashenko YS (1997) Ordinary differential equations and smooth dynamical systems. Springer, New YorkGoogle Scholar
- Arnold VI (1973) Ordinary differential equations. The MIT Press, Cambridge, MAGoogle Scholar
- Assmann E (1970) The principles of forest yield study. Pergamon Press, Oxford, EnglandGoogle Scholar
- Bailey RL, Clutter JL (1974) Base-age invariant polymorphic site curves. For Sci 20:155–159Google Scholar
- Batho A, García O (2006) De Perthuis and the origins of site index: a historical note. For Biom Model Inform Sci 1:1–10Google Scholar
- Belyea HC (1931) Forest measurement. Wiley, New YorkGoogle Scholar
- Carmean WH (1975) Forest site quality evaluation in the United States. In: Brady NC (eds) Advances in agronomy, vol 27. Academic Press, New York, pp 209–269CrossRefGoogle Scholar
- Cieszewski CJ, Bailey RL (2000) Generalized algebraic difference approach: theory based derivation of dynamic site equations with polymorphism and variable asymptotes. For Sci 46(1):116–126Google Scholar
- Clutter JL (1963) Compatible growth and yield models for loblolly pine. For Sci 9:354–371Google Scholar
- Clutter JL, Fortson JC, Pienaar LV, RLBailey GHB (1983) Timber management: a quantitative approach. Wiley, New YorkGoogle Scholar
- Curtis RO, DeMars DJ, Herman FR (1974) Which dependent variable in site index-height-age regressions? For Sci 20:74–80Google Scholar
- García O (1979) Modelling stand development with stochastic differential equations. In: Elliott DE (ed) Mensuration systems for forest management planning. New Zealand Forest Service, Forest Research Institute Symposium No. 20, pp 315–334. http://web.unbc.ca/~garcia/publ/sym20.pdf
- García O (1994) The state-space approach in growth modelling. Can J For Res 24:1894–1903CrossRefGoogle Scholar
- García O (1999) Height growth of
*Pinus radiata*in New Zealand. N Z J For Sci 29(1):131–145Google Scholar - García O (2010) A parsimonious dynamic stand model for interior spruce in British Columbia. For Sci (to appear)Google Scholar
- Goelz JCG, Burk TE (1992) Development of a well-behaved site-index equation: Jack pine in north central Ontario. Can J For Res 22:776–784CrossRefGoogle Scholar
- Grey DC (1989) Site index—a review. S Afr For J 148):28–32Google Scholar
- Hägglund B (1981) Evaluation of forest site productivity. For Abstr 42:515–527Google Scholar
- Hall DB, Bailey RL (2001) Modelling and prediction of forest growth variables based on multilevel nonlinear mixed models. For Sci 47:311–321Google Scholar
- Hotelling H (1927) Differential equations subject to error, and population estimates. J Am Stat Assoc 22:283–314CrossRefGoogle Scholar
- Hu Z, García O (2010) A height-growth and site-index model for interior spruce in the sub-boreal spruce biogeoclimatic zone of British Columbia. Can J For Res 40(6):1175–1183CrossRefGoogle Scholar
- James RC (1992) Mathematics dictionary. 5th edn. Chapman & Hall, New YorkGoogle Scholar
- Jones JR (1969) Review and comparison of site evaluation methods. Research Paper RM-51, USDA Forest Service, 27 pGoogle Scholar
- Northway SM (1985) Notes: fitting site index equations and other self-referencing functions. For Sci 31:233–235Google Scholar
- Ortega A, Montero G (1988) Evaluación de la calidad de las estaciones forestales. Revisión bibliográfica. Ecología 2:151–184Google Scholar
- Padulo L, Arbib MA (1974) System theory. Hemisphere Pub. Co, Washington, DCGoogle Scholar
- Peñuelas J (2005) Plant physiology: a big issue for trees. Nature 437(7061):965–966. doi: 10.1038/437965a PubMedCrossRefGoogle Scholar
- Pretzsch H (2009) Forest dynamics, growth and yield: from measurement to model. Springer, BerlinCrossRefGoogle Scholar
- Schumacher FX (1939) A new growth curve and its application to timber-yield studies. J For 37:819–820Google Scholar
- Seber GAF, Wild CJ (2003) Nonlinear regression. Wiley-Interscience, New YorkGoogle Scholar
- Snijders TAB (2003) Multilevel analysis. In: Lewis-Beck M, Bryman A, Liao T (eds) The SAGE encyclopedia of social science research methods, vol II. Sage, Thousand Oaks, CA, pp 673–677Google Scholar
- Spurr SH (1952) Forest inventory. Ronald Press, New YorkGoogle Scholar
- Sullivan AD, Clutter JL (1972) A simultaneous growth and yield model for loblolly pine. For Sci 18(1):76–86Google Scholar
- von Gadow K, Hui G (1999) Modelling forest development. Forestry sciences, vol 57. Kluwer Academic Publishers, Dordrecht, The NetherlandsGoogle Scholar