Background

Understanding and quantifying how environmental factors influence tree growth and site quality is of considerable importance to forest managers. Knowledge of site quality serves as an important tool for operational planning and is a major contributing factor for the formulation of management regimes, the prediction of harvest yields and the determination of the economic value of a forest. Site quality can be estimated directly from environmental factors, but it is generally more accurate to infer site quality from some characteristic of the current tree crop on a given site.

Site Index is the most common measure of forest site productivity used worldwide. This variable, which expresses the height of dominant and/or co-dominant trees at a given age (Skovsgaard and Vanclay 2008), is a useful measurement as it is relatively unaffected by management intervention. Despite the prevalent use of Site Index by forest managers and researchers, a site quality metric based on volume, rather than height, may more accurately reflect site quality as volume directly links to stand productivity and integrates both height and diameter growth. This contention is reinforced by previous analysis of stand density trials covering a wide range in Site Index. These analyses showed that, at a common age and stand density, Site Index is only weakly related to basal area, demonstrating that Site Index provides only a partial measure of site productivity for Pinus radiata D.Don (P. radiata) (Kimberley et al. 2005).

The 300 Index is a volume productivity index for P. radiata that provides a more accurate means to ascertain site productivity than Site Index. The 300 Index is calculated by adjusting field plot measurements for age, stand density and silvicultural history to give the mean annual increment (expressed as m3 ha−1 year−1) at age 30 for a reference regime where all crop trees are pruned to 6 m in a timely fashion, thinned to the final stocking density (300 stems ha−1) at the completion of the final pruning. Research within New Zealand P. radiata plantations has shown that accurate and unbiased values for the 300 Index can be obtained using measurements taken from stands differing in age or stand density from those of the 300 Index standard regime (30 years and 300 stems ha−1, Kimberley et al. 2005). The mean 300 Index for New Zealand is 27.4 m3 ha−1 year−1.

Stand-level estimates of the 300 Index are commonly made by averaging plot values, and these stand estimates are used to guide forest management. The resulting estimate aggregations may not be adequate to detect local growing variations within stands resulting from interactions between climate, soil, topography and genetic factors (Véga and St-Onge 2009; Saremi et al. 2014). Consequently, predictive modelling methods should be developed to produce spatial layers that provide a finer spatial resolution for estimates of the 300 Index.

National layers of the 300 Index with moderate accuracy have been developed from site descriptors that can easily be viewed and analysed in a geographic information system (GIS). This approach has considerable merit as forest managers generally have access to GIS and can use these layers to understand the future yields of their resource. The last two decades have seen rapid increases in the number of spatial layers covering a diverse range of climatic (Leathwick and Stephens 1998; Leathwick et al. 2003; Tait et al. 2006) and edaphic variables (Watt and Palmer 2012). These environmental layers have been successfully used to develop spatial layers describing the 300 Index for P. radiata at a resolution of 100 m2 (Palmer et al. 2009a).

Three-dimensional point cloud data acquired by airborne laser scanning (ALS, also referred to as light detection and ranging (LiDAR)) provides a valuable means of characterising forest canopy structure. Since the early applications of LiDAR to forestry management (Nilsson 1996), the technology has been widely used to spatially quantify variation in stand structure at a range of resolutions (e.g. Holmgren et al. 2003; Hyyppä et al. 2001; Lim and Treitz 2004; Popescu et al. 2004; Watt et al. 2013a). Several studies have shown that data acquired through ALS surveys can also be used to predict Site Index (Rombouts et al. 2010; Watt et al. 2015; Packalén et al. 2011). This is to be expected as ALS data provides highly accurate information on canopy height. However, no research has investigated the utility of ALS for predicting the 300 Index. As the 300 Index is a composite of both stand height and diameter increment, the utility of ALS data is less certain.

Data from satellite imagery typically has a coarser resolution and contains less structural detail of the forest canopy than ALS. However, the use of satellite-derived metrics provides a cost-effective means of predicting vegetation attributes in many cases (Shamsoddini et al. 2013). Numerous studies have shown that multispectral satellite data can be used to map important forest attributes, such as tree height and volume when the data include various combinations of spectral information, vegetation ratios and texture metrics. Results are improved when imagery of a greater resolution is applied (McRoberts and Tomppo 2007; Donoghue and Watt 2006; Shamsoddini et al. 2013; Ozdemir 2008; Kayitakire et al. 2006; Watt et al. 2013b). Previous research has shown that moderate-resolution satellite imagery (5 m) can be useful for monitoring Site Index (Watt et al. 2015) but we are unaware of any research that has investigated the utility of such data for predicting the 300 Index.

Managers of planted forests are likely to have access to a wide variety of spatial information. Although stand age is an important determinant of the 300 Index, this variable is often not easily available at a national scale. Consequently, there is considerable interest in determining the utility of a variety of different sources of information, in the development of productivity layers such as the 300 Index. A review of the literature indicates that no previous research has compared the precision of models of the 300 Index created from satellite imagery or ALS data, or various combinations of these data sources, supplemented by auxiliary environmental data. It is considered that a comprehensive comparison using these three different sources of information should be undertaken using common parametric and non-parametric modelling approaches. Models that include or exclude age as a determinant of the 300 Index should also be developed to test whether the 300 Index can be predicted in the absence of stand age.

This study used data obtained from a P. radiata plantation located in the central North Island of New Zealand. The objective of the research was to compare the accuracy of parametric and non-parametric models of the 300 Index models that included explanatory variables extracted from combinations of ALS data and satellite imagery supplemented with auxiliary environmental data. It was hypothesised that the supplementary environmental data may contain useful information on factors that influence site productivity that could not be detected using remote sensing. This study was designed to identify the most precise model for forest managers to predict the 300 Index based on data from remote sensing and auxiliary data.

Methods

Study site

Data was acquired from Kaingaroa forest which is located in the central North Island of New Zealand (Fig. 1). Kaingaroa is New Zealand’s largest contiguous planted forest covering around 180,000 ha. The majority of the forest occupies the pumice plateau within the central North Island and has generally flat topography. The northern part of the forest is characterised by rolling hills and areas of steeper terrain. The terrain gradually slopes upwards towards the forest’s southern extent leading to a notable gradient in productivity. The forest soils are classified as Orthic Pumice belonging to the Kaingaroa series (Hewitt 1993) with those in the north of the forest derived from Tarawera ash. The dataset was restricted to stands of P. radiata, which cover 92 % of the total forested area.

Fig. 1
figure 1

Map showing the location of the plots used for fitting and validation of the tested models

Field measurement

Grid sampling was used to locate 493 field plots throughout the study forest. Plots were located at the intersections of a grid that had a randomised start point and orientation. The sampling unit was a 0.06-ha bounded, circular field plot. A survey-grade global positioning system (GPS) was used to fix the centre of each plot. This fix was differentially corrected using local base stations following acquisition. Diameter at breast height (dbh) was measured for all trees within each field plot. Tree height was measured for a subset of plot trees that were free from excessive lean or malformation and were selected from across the dbh range. The silvicultural history and stand age for each plot was extracted from the stand records maintained by the forest manager.

Derivation of the 300 Index

The 300 Index can be estimated from plot measurements of basal area, mean top height and stand density at a known age when the stand silvicultural history has been recorded (Watt et al. 2010). The productivity index is estimated using numerous models including a stand-level basal area growth model, a height/age function, a mortality function, a stand-volume function and a thinning function. These models have been embedded within a software package (West et al. 2013) that was used to estimate the 300 Index value for each field plot. These values served as the response variable for the modelling work in this study.

Candidate predictor variables

The methods used to derive candidate predictor variables for this study were the same as those described in Watt et al. (2015).

ALS data

An ALS survey was completed in early 2014 using an Optech Pegasus scanner to collect a discrete, small footprint dataset. The data were collected with a pulse rate frequency of 100 kHz, a maximum scan angle of 12° off nadir and a swath overlap of 25 %. This resulted in a dataset with a footprint size of 0.25 m and an average pulse density of 11.46 points m−2. Returns were classified according to the ASPRS standard LiDAR point classes including 2—ground, 3—low vegetation, 4—medium vegetation, 5—high vegetation, 6—building and 7—low point (noise). Classification was automated using the TerraScan module of the TerraSolid software product (TerraSolid Ltd, Helsinki, Finland). Subsequent manual inspection and reclassification, where required, was used to improve the classification accuracy. The automated classifications were only adjusted when, following inspection by a technician, they were deemed to have been clearly classified erroneously.

Within the field plot boundaries, LiDAR metrics were extracted including height percentiles (P5ht, P10ht, P20ht,…, P95ht, m); the mean (H mean, m) and maximum height (H max, m); several metrics describing the LiDAR height distribution through the canopy (skewness, coefficient of variation, standard deviation (SD) and kurtosis); and measures of canopy density such as the percentage of returns reaching within 0.5 m of the ground (P zero, %) and the percentage returns above 0.5 m (P cover, %).

RapidEye data

The RapidEye satellite system is a constellation of five satellites carrying identical sensors, all of which were launched at the end of 2008 (RapidEye AG 2011). Each sensor collects electromagnetic radiation data in five wavelength bands that include band 1—blue (440–510 nm), band 2—green (520–590 nm), band 3—red (630–685 nm), band 4—red edge (690–730 nm) and band 5—near infrared (NIR) (760–850 nm) at a spatial resolution of 6.5 m, which is resampled to 5 m during pre-processing prior to delivery.

A level 3A ortho product provided by RapidEye was used in this study. The level 3A ortho product underwent a range of pre-processing stages including the application of radiometric, sensor and geometric corrections. It was also aligned to a cartographic map projection (the Universal Transverse Mercator (UTM)) with the default geometric correction based on Ground Control Points derived from DigitalGlobe 2-m satellite imagery and the New Zealand 25-m digital elevation model (DEM). The intention of the ortho-correction process was to remove impacts of topographic distortions in the imagery. The process ensured that the satellite image conformed to a map projection and included corrections for terrain displacement.

RapidEye images that covered the study area were acquired from January 16 and 28, 2014. The images were delivered as 16-bit digital numbers, and reflectance values were calculated from each band of the RapidEye imagery using the ENVI 4.7 image processing software.

Once converted, the various vegetation indices (Table 1) were calculated from the reflectance and applied to the extent of the images. Texture metrics were calculated using four different window sizes—3 × 3, 5 × 5, 15 × 15 and 25 × 25 pixels, a consistent displacement of 1 pixel and direction of 135°. The following summary provides an overview of the indices and metrics calculated.

Table 1 Vegetation indices used within the modelling process

The following vegetation indices were computed from the RapidEye data: Normalised Difference Vegetation Index (NDVI); Enhanced Vegetation Index (EVI); Simple Ratio (SR); Green Ratio (GR); Red Edge Ratio (RE); Vegetation Index (VI); and Brightness. Equations for these ratios are given in Table 1.

Textural attributes, developed by Haralick et al. (1973), are commonly utilised in remote sensing studies. We used the most relevant grey-level co-occurrence matrix (GLCM) textural attributes for remote sensing applications as described in the literature (Baraldi and Parmiggiani 1995; Solberg 1999; Lu 2005; Tuominen and Pekkarinen 2005; Kayitakire et al. 2006). These attributes included the mean (ME); variance (VAR); standard deviation (SD); contrast (CON); angular second moment (ASM); entropy (ENT); homogeneity (HOM); energy (EN); correlation (COR); and dissimilarity (DISS). All textural metrics were calculated using the glcm package in the R statistical environment (Zvoleff 2014).

Mean values for spectral (individual bands and indices) and textural metrics from the pixels that were spatially coincident with the plots were extracted for the analyses.

Auxiliary environmental data

Auxiliary environmental data were extracted from GIS layers that included terrain attributes (Palmer 2008); environmental layers (Leathwick et al. 2003); and monthly and annual climate data (Mitchell 1991; Leathwick et al. 2002). Environmental variables that served as candidate predictor variables for inclusion in predictive models included mean annual and monthly air temperature, relative humidity, solar radiation, vapour pressure deficit and rainfall. These surfaces had a spatial resolution of 100 m2 or less. A spatial soil water balance model developed for P. radiata (Palmer et al. 2009c) was used to determine mean annual and seasonal root-zone water storage (W) for all plot locations. Fractional available root-zone water storage was determined from these data and the maximum available root-zone water storage, W max, as W/W max. Soil fertility was represented by soil C:N ratio, which provides a useful index of nitrogen mineralisation (Watt and Palmer 2012).

Analyses

Overview

Some of the analytical methodology used here was similar to that in a previous study comparing parametric and non-parametric methods for predicting Site Index for P. radiata using combinations of data derived from environmental surfaces, satellite imagery and airborne laser scanning (Watt et al. 2015). From the full dataset of 493 plots, 60 plots were randomly selected and used for model validation. The remaining 433 plots were used for the model-fitting process. Separate models were created from the following categories of data that all included age as a predictor: (i) RapidEye band spectral values; (ii) RapidEye vegetation indices; (iii) RapidEye textural metrics; (iv) all RapidEye metrics (a combination of categories (i) to (iii)); (v) data from environmental layers; (vi) all RapidEye metrics and data from environmental layers; (vii) Site Index predicted from LiDAR (as Site Index is likely to be related to 300 Index); and (viii) Site Index predicted from LiDAR, along with all other LiDAR metrics in combination with environmental layers and all RapidEye metrics.

Models without stand age were also created using predictors from categories (i) to (vi), but not categories (vii) and (viii) as it is not practical to predict Site Index using LiDAR information without age. These 14 different models were created using both parametric and non-parametric methods.

Predictions of Site Index using LiDAR for models (vii) and (viii) were made for each field plot. This was achieved by first predicting plot mean top height using the 99th LiDAR height percentile, P 99, from the following equation:

$$ \mathrm{Mean}\ \mathrm{top}\ \mathrm{height} = 2.138 + 0.9927\ {P}_{99} $$
(1)

An appropriate transformation of a height age function was then used to convert this estimate of plot mean top height, along with plot age from the stand record system, into an estimate of Site Index.

Model precision was assessed for parametric and non-parametric models from the validation dataset using the squared Pearson correlation coefficient (R 2) and root mean square error (RMSE). Model bias was assessed through the mean error, with error for each observation in the validation dataset defined as the measured value minus the predicted value. Bias was also assessed through examination of plots between measured and predicted values.

Parametric modelling

All parametric analyses were undertaken using the general linear model procedure within the SAS software package (SAS-Institute-Inc. 2008). Multiple regression models were used to develop the 14 models of the 300 Index, described above. For each model, variables were introduced sequentially starting with the variable that exhibited the strongest correlation, until further additions were not significant, or did not substantially improve the model precision. Variable selection was undertaken manually, one variable at a time, and plots of residuals were examined prior to variable addition to ensure that the variable was included in the model using the least biased functional form. The number of independent variables in each model ranged from two to five for the models without age and three to five for the models with age. The most precise model had four independent variables including age.

Model bias was examined through plotting predicted against measured values of the 300 Index and residual values of the 300 Index (measured 300 Index − predicted 300 Index) against predicted values and all independent variables in the model.

The most important variables were identified through comparison of F-values across all of the 14 models. Where a variable was included in more than one model, these F-values were averaged for that variable.

Non-parametric modelling

Non-parametric k-nearest neighbour (k-NN) and k-most similar neighbour (k-MSN) modelling techniques were used to impute the 300 Index for the validation plots based on the measurements in the reference dataset. Nearest neighbour approaches are widely used statistical techniques that have gained considerable popularity for use in forest assessment (e.g. McRoberts 2012) since their first application to forestry (Tomppo and Katila 1991). Under nearest neighbour estimation, variables of interest (Y) are imputed for target elements, commonly pixels covering an area of interest, where Y has not been measured. Imputation of Y is based on auxiliary variables (X) that are known for all elements in the population (N) and are correlated with Y. A subset of the elements in N have paired observations of both X and Y and are referred to as the reference dataset. Imputation for a target element is estimated as a function of k Y values in the reference dataset that have X values that are closest, using a measure of statistical proximity, to the X values of the target element (Magnussen and Tomppo 2014).

All k-NN and k-MSN modelling was completed in the yaImpute package (Crookston and Finley 2008) of the R statistical software environment (R Development Core Team 2014). The k-NN models were developed using the random forest distance metric. Under the random forest approach, observations are considered similar if they tend to converge in the same terminal node in a suitably constructed collection of classification and regression trees (Breiman 2001; Liaw and Wiener 2012). The metric used to define statistical distance is calculated as one minus the proportion of trees where a target observation is in the same terminal node as a reference observation (Crookston and Finley 2008). An alternative approach to assigning donor observations is through using the k-MSN distance metric, which is derived using canonical correlation analysis to produce a weighting matrix for the selection of donors from the fitting dataset (Moeur and Stage 1995). The value of k was varied for each imputation, and the value that minimised the RMSE within the validation dataset was selected for prediction. The imputed value (Y) for a given target was calculated using the distance-weighted average of the k value nearest to that from reference observations. Models developed using a k value of 1 were also produced for comparison in this analysis.

A measure of variable importance was estimated by taking a scaled composite of the importance scores outputted by random forests. Importance was estimated by permuting the values for each predictor variable and assessing the impact of the permutation on the model error. Importance scores were estimated for all predictors that were included in the final non-parametric models of the 300 Index. To account for the random element, 200 iterations were completed and the importance scores for each variable were averaged.

Results

Variation in data

The mean and range were relatively similar between the fitting and validation datasets for key explanatory variables used in the models. Stand age averaged 16.6 years, ranging from 2.8 to 37.8 years (Table 2). Of the five RapidEye spectral bands, the green band exhibited the greatest variation, ranging 40-fold within the fitting dataset. Variation in the ratios was greatest for the GR with values ranging 31-fold.

Table 2 Variation in stand age, RapidEye reflectance and ratios, key environmental variables and key LiDAR metrics for the fitting (n = 433) and validation datasets (n = 60)

Environmental variation was relatively high across the study site. Mean annual air temperature averaged 11.2 °C ranging from 9.27 to 14.1 °C, while total annual rainfall averaged 1420 mm ranging from 1119 to 2216 mm. The majority of the terrain over which measurements were made was relatively flat with slope averaging 5.24°. The slope ranged from 0° to 31.9° (Table 2) indicating that some measurements were taken on much steeper sites.

LiDAR metrics ranged widely, with P 95 averaging 21.1 m and ranging 16-fold from 0.79 to 43.4 m (Table 2). Mean values for P 05 and P 50 were, respectively, 0.35 m (range 0.015–6.81 m) and 14.0 m (range 0.16–33.8 m).

Model comparison

For all 14 non-parametric models, the use of k-NN with an optimised value of k produced a RMSE that was considerably lower (Fig. 2) than for models where k = 1 (mean gains in RMSE = 1.21 m3 ha−1 year−1, range = 0.75–2.01 m3 ha−1 year−1), or the use of the k-MSN with k = 1 (mean gains = 1.83 m3 ha−1 year−1, range = 1.10–3.08 m3 ha−1 year−1) or an optimised value of k was used (mean gains = 0.37 m3 ha−1 year−1, range −0.056–0.711 m3 ha−1 year−1) (Fig. 2). The mean value of optimised k for models using random forest was 18 (range 8–48) while for k-MSN models, the mean value of k was 35 (range 9–58). Given the substantial increases in precision obtained using k-NN with an optimised value for k, further results were compared using this non-parametric method to parametric models for prediction of the 300 Index. There is a small computational cost associated with larger values of k, but this was easily offset by the increased model precision.

Fig. 2
figure 2

Variation in root mean square error (RMSE) for parametric models (black open triangles) and non-parametric models using the most similar neighbour with k = 1 (red open circles) and optimised k (pink crosses) and k-nearest neighbour (k-NN) using k = 1 (black filled squares) and k-NN using an optimised value of k (filled cyan circles) for models that a do not include age and b include age. Note that model precision is not given for use of LiDAR without age (panel a) as these models are not feasible

Of the three sources of information used, models generated from environmental data were the least precise in general, with RMSE values averaging 3.71 m3 ha−1 year−1 (range 3.55–3.85 m3 ha−1 year−1). The most important environmental predictor was mean air temperature during spring (Fig. 3).

Fig. 3
figure 3

Predictors of most importance for a parametric and b non-parametric models listed in the order of greatest to least importance. The F-values and importance scores shown were extracted from all 14 models and averaged where a variable occurred in more than one model. Following the texture measures, the value of 5 and 25 refers to a window size of, respectively, 5 × 5 and 25 × 25 pixels

Models created from satellite imagery generally had a slightly lower RMSE than those created from environmental data (Fig. 2). Of the satellite imagery-derived models, the most precise were generated using individual band values (mean RMSE 3.64 m3 ha−1 year−1) followed by vegetation indices (mean RMSE 3.67 m3 ha−1 year−1). Textural metrics produced the least precise models (mean RMSE 3.79 m3 ha−1 year−1) (Fig. 2). The use of all available metrics from satellite imagery resulted in models with equivalent precision to those created using individual band values (mean RMSE = 3.64 m). For parametric models, the most important bands were NIR, red reflectance and green reflectance (Fig. 3). Addition of age to these models markedly improved the RMSE of parametric models (mean reductions in RMSE of 0.28 m3 ha−1 year−1) with lesser effect on non-parametric models (mean reductions in RMSE of 0.03 m3 ha−1 year−1).

Further modest gains occurred when metrics derived from satellite imagery were added to environmental variables (Fig. 2). These models had mean RMSE of 3.62 m3 ha−1 year−1. The addition of age as a variable markedly improved the RMSE of parametric (mean reductions in RMSE of 0.64 m3 ha−1 year−1) and to a lesser extent non-parametric models (mean reductions in RMSE of 0.23 m3 ha−1 year−1).

Of the data sources investigated, models developed with LiDAR data that included age had the highest precision (Fig. 2). The mean RMSE for the two model types was 2.52 m3 ha−1 year−1 (range 2.48 − 2.55 m3 ha−1 year−1). Both F-values and model importance scores show that Site Index derived from LiDAR to be the most useful predictor of the 300 Index of the LiDAR metrics (Fig. 3). The addition of environmental variables and metrics derived from satellite imagery to these models provided modest precision gains, with RMSE declining by on average 0.04 m3 ha−1 year−1. The model with the highest precision was a parametric model using stand age that included variables derived from all three data sources and had an R 2 of 0.790 and RMSE of 2.45 m3 ha−1 year−1.

For all six models that did not include age, non-parametric models were more precise than parametric models (Fig. 2). Expressed as a percentage of the parametric RMSE, the RMSE for non-parametric models was on average 94.6 % (range 92.9–97.0 %). For models that did include age, non-parametric models were more precise for five of the eight models, but overall, the mean RMSE between the two model types was very similar (Fig. 2).

Non-parametric models were more biased than parametric models for all but two of the constructed models (Fig. 4). For models without age, mean error averaged 0.530 and 0.384 m3 ha−1 year−1, respectively, for non-parametric and parametric models while for models with age, mean error averaged 0.613 and 0.451 m3 ha−1 year−1, respectively, for non-parametric and parametric models.

Fig. 4
figure 4

Variation in mean error for parametric models (filled black circles) and non-parametric models using k-NN with an optimised value of k (open circles) for models that a do not include age and b include age. Note that model precision is not given for use of LiDAR without age (panel a) as these models are not feasible

Inclusion of stand age as an explanatory variable was far more effective when included within parametric models than non-parametric models. Expressed as a percentage of the RMSE for models without age, the RMSE for models with age averaged 94.5 % (range 88.2–98.8 %) for parametric models and averaged 101.0 % (range 93.2–107.0 %) for non-parametric models. Results show that age was the eighth most important predictor for parametric models and fourth most important for non-parametric models (Fig. 3).

The R 2 and RMSE (in brackets) for various parametric or non-parametric models that included stand age were, respectively, (i) 0.582 (3.46 m3 ha−1 year−1) or 0.506 (3.81 m3 ha−1 year−1) for all metrics derived from satellite imagery; (ii) 0.495 (3.80 m3 ha−1 year−1) or 0.569 (3.55 m3 ha−1 year−1) for environmental layer variables; (iii) 0.614 (3.32 m3 ha−1 year−1) or 0.586 (3.48 m3 ha−1 year−1) for variables derived from a combination of satellite imagery and environmental layers; (iv) 0.772 (2.55 m3 ha−1 year−1) or 0.789 (2.48 m3 ha−1 year−1) for LiDAR metrics; and (v) 0.790 (2.45 m3 ha−1 year−1) or 0.786 (2.50 m3 ha−1 year−1) for all available variables. Plots of measured against predicted values for these ten models generally show parametric models to be relatively unbiased although there was slight underprediction of the 300 Index at higher values for the five models (Fig. 5ae). This underprediction was also evident for the non-parametric models, and bias for these models was greater than that for parametric models (Fig. 5).

Fig. 5
figure 5

Relationship between measured and predicted 300 Index for parametric models (ae) and non-parametric k-NN (with optimised values of k) models (fj) with stand age that include a, f variables derived from satellite imagery; b, g variables derived from environmental layers; c, h variables derived from satellite imagery and environmental layers; d, i Site Index predicted from LiDAR; and e, j variables selected from all data sources. The 1:1 line is shown on all panels as a dashed line

Discussion

Of the three data sources considered in this study, results clearly show that LiDAR provides the most precise predictions of the 300 Index. The use of variables extracted from environmental layers was found to provide the least precise estimates of the 300 Index while the use of variables derived from satellite imagery was of intermediate precision. Combining different data sources only resulted in modest gains in predictive precision of the 300 Index. Results clearly show the importance of age as a predictive variable and demonstrate non-parametric models to be more precise than parametric models if age is not available as an independent variable.

The use of LiDAR as a technology for predicting stand attributes is widely accepted within forestry. Since the first application of LiDAR in forestry almost three decades ago, LiDAR data have been used to accurately predict stand height and volume (Coops et al. 2007; Means et al. 2000; Næsset 2002; Means et al. 1999; Watt et al. 2014; Dash et al. 2016; Melville et al. 2015). Correlations of moderate to high strength have been found between LiDAR metrics and basal area (Næsset 2002, 2004, 2005; Nord-Larsen and Schumacher 2012; Means et al. 1999; Means et al. 2000), diameter (Næsset 2002) and green crown height (Næsset and Økland 2002). Given that canopy height is the dimension predicted with the most precision by LiDAR, it is not surprising that recent research has shown that Site Index can be predicted with high precision by LiDAR in plantation species such as Eucalyptus urograndis (Packalén et al. 2011) if stand age is available.

Our results extend this research by showing that LiDAR can be used to precisely (R 2 = 0.79) predict the 300 Index when stand age is available. The use of predicted Site Index, which was based on stand age and a LiDAR height-based metric, was most useful for predictions of the 300 Index. This result is consistent with previous research as most of the variation in tree volume in LiDAR-based models is typically attributable to metrics describing tree-height percentiles using LiDAR (Watt and Watt 2013).

Variables derived from satellite imagery provided moderately precise estimates of the 300 Index. Although we are unaware of any research that has used satellite imagery to predict the 300 Index, previous research does show a moderate correlation between P. radiata stand volume and textural attributes (R 2 = 0.64) derived from high-resolution satellite imagery (Shamsoddini et al. 2013). The utility of models that use solely spectral information, or vegetation ratios, is often limited because as the canopy reaches closure, the spectral response flattens and is unresponsive to continued forest growth. This reduces the predictive ability of satellite-based models to estimate changes in stand volume (Donoghue and Watt 2006) and is particularly problematic in highly productive plantations.

Models generated using environmental layers were the least precise of those developed. However, the precision range for these models (RMSE range of 3.55–3.85 m3 ha−1 year−1) was similar to that of previous national New Zealand models of P. radiata 300 Index created from environmental layers (Palmer et al. 2009b) where the most precise model had RMSE of 3.65 m3 ha−1 year−1. This consistency does highlight the utility of satellite imagery and LiDAR as an alternative more precise means of predicting the 300 Index.

Air temperature was the environmental variable with the greatest influence on the 300 Index. This is consistent with previous research that showed air temperature to be the most important determinant of P. radiata growth in New Zealand (Jackson and Gifford 1974; Hunter and Gibson 1984; Watt et al. 2010), with air temperature in most locations in New Zealand, including the central North Island, sub-optimal for growth under current climatic conditions (Kirschbaum and Watt 2011). Models of Site Index for plantation species growing outside of New Zealand have also frequently found air temperature to be an important determinant of Site Index (Sharma et al. 2012), but rainfall is at least as important as air temperature in many drier regions (Mohamed et al. 2014; Sabatia and Burkhart 2014).

Although non-parametric models were slightly more biased, our results show these models to be more precise than parametric models when stand age is not available. This is consistent with previous models developed for predicting Site Index. Such models demonstrate the utility of non-parametric methods particularly when there are a large number of predictors with complex model forms (Aertsen et al. 2010). Prediction of other stand metrics, such as height, biomass, volume and diameter distribution from remotely sensed data, also often show non-parametric methods such as k-NN to outperform parametric regression (Packalen and Maltamo 2006; Zhou et al. 2011; Maltamo et al. 2009; Bollandsås et al. 2013) but not always (Mora et al. 2013; Penner et al. 2013). Pierce et al. (2009) found that a variant of k-NN, the gradient nearest neighbour, generally performed better than linear regression models or classification trees in temperate marine forests but that regression performed better in the more complex forests of adjacent temperate steppe and Mediterranean forests. This result suggests that the best technique may be reliant on the characteristics of the particular forest or landscape. In a recent review, Brosofske et al. (2014) outline a number of important considerations when selecting the most appropriate model type for prediction of forest inventory attributes from remotely sensed data.

These results confirm that the selection of an appropriate value of k is an important decision made by the analyst, which is dependent on the end uses of the model outputs. Small values of k are more susceptible to noise in the training data. The error rate decreases as the value of k increases, but both the likelihood of distant observations being included and the over-smoothing of model outputs increase. Finding an appropriate balance between these two is a recurring issue and the focus of substantial interest (Dash et al. 2015; Ghosh 2006; McRoberts 2012; McRoberts et al. 2015). Numerous algorithms for selecting suitable values of k are available, but selecting k to optimise a model criterion such as RMSE is a logical approach.

Stand age was a useful determinant of the 300 Index. Without stand age, LiDAR data is of little use in predictions of the 300 Index as height percentiles cannot be adjusted to the age at which the 300 Index is determined. Results show that if age is not available, then the combination of data derived from satellite imagery and environmental variables provides a more precise means of estimating the 300 Index (R 2 = 0.65 and RMSE = 3.21 m3 ha−1 year−1 for the non-parametric model).

Acquiring LiDAR data is very expensive from an operational perspective. Results suggest that a combination of satellite imagery and available auxiliary environmental data may provide a cost-effective alternative for assessing the spatial variability of the 300 Index across planted forests. This methodology is likely to be particularly useful for regional- or national-scale predictions of the 300 Index or under circumstances when stand age is not readily available.

Conclusions

In conclusion, our results demonstrate that LiDAR is the most useful data source for predicting the 300 Index when stand age is available. Using these data, the most accurate model had an R 2 of 0.789 and RMSE of 2.48 m3 ha−1 year−1. In the absence of stand age, non-parametric models that used variables derived from both satellite imagery and auxiliary environmental data were found to be the most precise.