Background

Leaf Area Index (LAI) is one of the most important parameters of tree foliage, mediating many physical and biological processes in forest ecosystems (Sidabras and Augustaitis 2015). For example, processes such as evapotranspiration, transpiration, and photosynthesis are a function of LAI (Wulder et al. 1998). It is a critical parameter needed in developing environmental and ecological models such as growth and yield models, soil-water balance models, energy balance models, and climate change models (Peng et al. 2002; Wang et al. 2011; Mason et al. 2012). It is used in a large variety of studies including global change, nutrient cycling, and carbon and hydrology (Vilà et al. 2003; Liang et al. 2005; Chianucci et al. 2015).

The LAI, is a dynamic parameter, which depends on management practices, seasonality, site conditions, developmental stage, and species composition. Number of methods have been developed to determine LAI (Jonckheere et al. 2004). LAI can be predicted by direct and indirect methods (Chen et al. 1991). Direct methods are precise but tremendously hard and time-consuming and more expensive than the indirect methods; they require tree falling in some cases (Weiss et al. 2004), and they are impractical and dangerous in hardly accessed localities (Macfarlane et al. 2007). Therefore, these disadvantages of direct methods resulted in indirect measurements of LAI by ground based optical instruments to gain importance in recent decades (Chianucci et al. 2015). Indirect methods have been utilized since 1960s in forest ecology and management (Chianucci and Cutini 2012). Hemispherical photography is a valuable indirect method for estimating LAI (Jonckheere et al. 2004; Sidabras and Augustaitis 2015) beside it’s used for evaluating canopy features (Hale and Edwards 2002).

Hemispherical photographs have been widely used for determination of LAI in forest ecology and management studies (e.g. Chen et al. 1991; Walter and Torquebiau 2000; Frazer et al. 2001; Hale and Edwards 2002; Leblanc et al. 2005; Macfarlane et al. 2007; Hu and Zhu 2009; Mason et al. 2012). Many studies showed that there are significant associations between LAI and forest productivity (Vose and Allen 1988), net primary productivity (Gholz 1982), evapotranspiration (Swank et al. 1988), water balance (Grier and Running 1977), and spatiotemporal variations in forest ecosystems (Chianucci and Cutini 2012). There are also some other studies analyzing the relationships between LAI and forest stand parameters such as biomass (Nowak 1996; Madugundu et al. 2008; Jagodzinski and Kalucka 2008), density (Sidabras and Augustaitis 2015), canopy closure (Jeleska 2004), diameter at breast height (Turner et al. 2000; Arias et al. 2007; Khosravi et al. 2012; Sidabras and Augustaitis 2015), basal area (Khosravi et al. 2012), mean stand height (Khosravi et al. 2012; Sidabras and Augustaitis 2015), stand volume (Sidabras and Augustaitis 2015), and stand age (Jagodzinski and Kalucka 2008; Pokorny and Stojnic 2012).

The aim of this study was to evaluate the capability of usability of the artificial neural network models for predicting the LAI by comparing the regression analysis models as the classical method in these pure and even-aged Crimean pine forest stands. The LAI was determined by operating an indirect method depend on the hemispherical photography. Forest stand parameters related to each sample plot were estimated by examining the inventory data obtained from forest inventory. The partial correlation analysis was firstly used to examine the associations between the stand LAI values and stand parameters. To model the relationships between LAI and forest stand parameters such as the number of trees of stand, stand age, site index, quadratic mean diameter, density index, stand basal area, the multivariate linear regression as the classical prediction technique and the artificial neural network models being as artificial intelligence model were used in this study.

Methods

Study site

In this study, Yenice Forest Management Unit, located in the middle-north region of Turkey (Fig. 1), was selected as case study area. The study area is mostly mountainous. The climate is mainly Black Sea type. Diurnal temperature differences are high. The annual average precipitation is 474 mm. The forests in the study area are mainly coniferous, comprising approximately 1700 ha of pure even-aged Crimean pine, Pinus nigra subsp. pallasiana (Lamb.) Holmboe, forest stands.

Fig. 1
figure 1

Case study area

Measurements

A total of 108 temporary sample plots were taken, considering different site classes, crown closures, and stand development stages in pure Crimean pine forest stands. Ground measurement of the sample plots was carried out in August 2014. The position of each sample plot was first determined by a hand type of a satellite-based navigation system and a global positioning system (GPS). The sizes of the sample plots were 400, 600 and 800 m2 depending on the stand crown closures. Within each sample plot, all the trees with diameters ≥8 cm were identified and measured. Tree ages in 4–5 trees near the medium diameter in each sample plots were measured by an increment borer, and the mean was calculated to determine stand age (ages). Stand age and height (in meters) were calculated from number of trees determined according to 100 trees method in hectare. Stand density, stand basal area (m2·ha− 1), stand age (yr), site index (m) and height (m), number of trees and quadratic mean diameter (cm) of each sample plots were used for determination of LAI. The summary statistics, such as the range, mean, standard deviation, variance, minimum, and maximum for the LAI and stand variables are given in Table 1.

Table 1 Descriptive statistics on LAI and some stand attributes

LAI was predicted by the Hemiview system at 108 temporary sample plots of Crimean pine forest stands. Hemispherical photographs were taken by a fisheye lens mounted on a color film-mounted Cannon (EOS 50D) camera with color film. The center was set to the peak of canopy, and photographs were taken at the correct automatic pose. The camera was mounted on a tripod at 1.2–1.5 m height to avoid interference with forest soil and understory vegetation. The LAI values were quantified using the photographs, taken at the middle of each sample plot, with the Hemiview software program.

Partial correlation analysis

The partial correlation analysis (PCA) was used to evaluate the relationships between the stand LAI and stand parameters. The PCA provides the correlations between two variables by controlling other variables that have important effect on the relation among these two variables. The PCA yields the partial correlation coefficients, which depict spurious and hidden correlations, including the effects of other variables. The partial correlation coefficients were obtained using PROC CORR procedures of the SAS/ETS v9 software (SAS Institute Inc. 2012).

Multivariate linear regression

While the partial correlation analysis explains individual relationships between the LAI and some stand parameters, the multivariate linear regression predicts LAI exploiting the relationships between LAI and some stand parameters such as the stand number of trees, stand age, site index, quadratic mean diameter, density index, basal area. The Ordinary Least Squares (OLS) technique was used to obtain the parameters and other statistics of the multivariate linear regression model. The parameters were predicted using PROC REG and RSQUARE procedure, available in SAS/ETS® 9 software (SAS Institute Inc. 2012). The stepwise variable selection regression technique (SVSRT) was used to select the stand variables, which contribute LAI predictions significantly (p < 0.05). The SVSRT, a modification of the forward-selection technique, is based on the following procedure: a variable is introduced into the model and significance of its contribution in predicting the dependent variable is controlled by the partial F-test. If the variable has a significant contribution, it’s included in the model, otherwise it’s released. The SVSRT was conducted across complete fitting stages. In this study, the following linear relationship was assumed:

$$ \mathrm{LAI}={\beta}_0+{\beta}_1{X}_1+{\beta}_2{X}_2+\dots +{\beta}_n{X}_n+\varepsilon $$
(1)

where LAI is the leaf area index, X1,…, Xn are parameter vectors (independent variables) corresponding to stand parameter values, β1,…, βn represent model coefficients and ε is the additive error term.

Artificial neural network modeling

The artificial neural network (ANN) technique was used to predict the stand LAI. The ANN technique includes training, verifying, and testing stages using data-subsets, which are constituted by randomly selected data values from the dataset. We partitioned total sample plots into training (75% of all data), verification (15% of all data) and testing (the remaining 10% of all data) subsets (Fausett 1994). In these network models, input variables were best predictive stand parameters, in which these independent variables were selected by the stepwise variable selection technique in regression analysis, and target variable was the stand LAI value obtained from the terrestrial measurements. Two artificial neural network model categories including the Multilayer Perceptron (MLP) and the Radial Basis Function (RBF) architectures were trained by using STATISTICA® software (Statsoft Inc, 2007). A total of 800 in MLP and 50 trained networks in RBF were trained and used to obtain the LAI predictions. In ANNs training process with MLP, the number of neurons of the input layer ranged from 1 to 50 alternatives, four activation functions including the identity, logistic, tanh and exponential functions in hidden layer, correspondingly these four activation functions were used in output layer. In RBF, the number of neurons of the input layer ranged from 1 to 50 neuron alternatives and the hidden layer had always the activation function as being on isotropic Gaussian basis and identity function was used in output layer, in which the STATISTICA software does not present any activation function alternatives for RBF. Thus, the differentiation in number of training alternatives for MLP and RBF occurred for training the ANN models, because the MLP included sixteen alternatives (four activation functions in hidden layer × four activation functions in output layer) and the RBF comprised one activation function alternative (the isotropic Gaussian basis in hidden layer and the identity function in output layer). Because of the 50 neurons of the input layer for MLP and RBF, a total of 800 network alternatives in MLP (16 × 50) and a total of 50 network alternatives in RBF (50 × 1) were trained to have the predictions of the ANN models.

In MLP, feed-forward neural network architecture include input, hidden and output layers with a bias term; the training algorithm is Broyden-Fletcher-Goldfarb-Shanno (BFGS), which is a robust training algorithm with very fast convergence with the Hessian matrix. After training these network models, five best predictive networks for MLP and RBF for the stand LAI values presented, including some evaluations of the magnitudes and distributions of models’ residual in the STATISTICA software. From these network models, six goodness-of-fit statistics, including sum of squared errors (SSE), Akaike’s information criterion (AIC), Bayesian information criterion (BIC), Root Mean Square Error (RMSE), Mean Squared Error (MSE) and Adjusted Coefficient of Determination (R2adj), were calculated to evaluate and compare the performance of the predictive network models. The formulae for these statistical values are provided below (2 to 7).

$$ \mathrm{Mean}\ \mathrm{squared}\ \mathrm{errors}\ \left(\mathrm{MSE}\right)={\sum}_{i=1}^n\frac{{\left({\mathrm{LAI}}_i-{\widehat{\mathrm{LAI}}}_i\right)}^2}{n-p} $$
(2)
$$ \mathrm{Root}\ \mathrm{mean}\ \mathrm{squared}\ \mathrm{error}\ \left(\mathrm{RMSE}\right)=\sqrt{\sum_{i=1}^n\frac{{\left({\mathrm{LAI}}_i-{\widehat{\mathrm{LAI}}}_i\right)}^2}{n-p}} $$
(3)
$$ \mathrm{Sum}\ \mathrm{of}\ \mathrm{Squared}\ \mathrm{error}\kern0.5em \left(\mathrm{SSE}\right)={\sum \limits}_{i=1}^n{\left({\mathrm{LAI}}_i-\widehat{{\mathrm{LAI}}_i}\right)}^2 $$
(4)
$$ \mathrm{Adjusted}\ \mathrm{cofficient}\ \mathrm{of}\ \mathrm{determination}\ \left({R}_{\mathrm{adj}.}^2\right)=1-\frac{\sum_{i=1}^n{\left({\mathrm{LAI}}_i-\widehat{{\mathrm{LAI}}_i}\right)}^2\left(n-1\right)}{\sum_{i=1}^n{\left({\mathrm{LAI}}_i-\overline{\mathrm{LAI}}\right)}^2\left(n-p\right)} $$
(5)
$$ \mathrm{AIC}=-2\log L+2p $$
(6)
$$ \mathrm{BIC}=-2\log (L)+p\log (N) $$
(7)

where, LAIi represents the LAI values; \( \widehat{{\mathrm{LAI}}_i} \) represents the predicted stand LAI values, \( {\overline{\mathrm{LAI}}}_i \) represents average LAI value, n represents the number of data, L represents maximum value of the log likelihood function and p represents the number of parameters within the model.

The graphs including the residuals against predicted LAI values and the predicted LAI values against the observed values were presented to compare the predictions obtained by the predictive regression model, MLP model, and RBF model. Independent stand parameters, contributing the prediction of LAI significantly at the level of 0.05, were determined by SVSRT. The LAI predictions for regression model were acquired by using the best predictive regression model based on the included independent stand parameters that were selected by the stepwise variable selection regression technique in SAS software, in which are significant (p < 0.05) in predictions. The LAI predictions for ANN models based on the MLP and RBF prediction technique that included the best predictive stand parameters selected by the regression technique as input variables were achieved by using STATISTICA neural network simulation module.

Results

The correlation coefficients for the relationships between parameters of number of trees in stands, the stand density, the basal area, the stand age, and the quadratic mean diameter with the LAI values were significant at the 0.01 level (Table 2). The number of trees of stands (r = 0.564) showed the highest correlation with the LAI values. Similarly, stand density (r = 0.512) and stand basal area (r = 0.371) were positively correlated with the LAI. However, the quadratic mean diameter (r = − 0.294) and stand age (r = − 0.294) were negatively correlated with the LAI. The correlation coefficients for relationships between the quadratic mean height and site index with the LAI values were not significant at the level of 0.05. Figure 2 shows linear relationships between LAI and the stand parameters.

Table 2 Partial correlation coefficients between the LAI and some stand attributes
Fig. 2
figure 2

The relationships among whole variables such as the LAI and the stand variables with ordinary linear equations

After verifying linear relationships between LAI and stand parameters by linear correlation and regression techniques, the multivariate linear regression models were developed to predict the LAI as a function of the selected stand parameters (stand basal area, site index, number of trees and stand age), which are significant at the 0.05 level (Table 3). In this regression model for LAI values, the F statistics and its coefficients were significant at a probability level of 95% (p < 0.05). In Table 3, the standardized coefficients for independent variables were presented to display the contribution of stand parameters to the explained variance by the regression model. The number of trees has the highest contribution values (42.09%) of stand parameters to the explained variance, and other stand parameters such as site index (23.01%), age (19.39%) and basal area (15.51%) were ranked by the contribution values for the prediction of the LAI values.

Table 3 Parameters and other statistics of the ‘Best fit’ Regression models predicting LAI as functions selected stand parameters including the stand basal area, site index, number of trees and stand age

Values of goodness-of-fit statistics of SSE, AIC, BIC, RMSE, MSE and \( {R}_{\mathrm{adj}}^2 \) for the regression equation and five best predictive networks of MLP and RBF are presented in Table 4. SSE was between 12.1040 and 15.3269, MSE between 0.1223 and 0.1548, RMSE was between 0.3497 and 0.3935, AIC was between 10.44 and 0.2690, and BIC was between 77.7310 and 64.5107, and Radj2 between 0.4864 and 0.6392 in all fitting techniques. These goodness-of-fit statistics show that the neural network architecture based on the RBF 4–19-1 with Gaussian activation function in hidden layer and the identity activation function in output layer showed better fitting ability with SSE (12.1040), MSE (0.1223), RMSE (0.3497), AIC (0.1040), BIC (− 77.7310) and R2 (0.6392) than the other studied ANN prediction techniques. Also, this best predictive artificial neural network models provided better results based on the SSE, AIC, BIC, RMSE, MSE and \( {R}_{\mathrm{adj}}^2 \) than multivariate linear regression model.

Table 4 The goodness-of-fit statistics of number of trees predictions for the ANNs types and multiple regression model

Figures 3, 4, and 5 show the residuals for predicted LAI-values (a) and the predicted and observed LAI-values (b). The LAI predictions were obtained from three prediction methods: (1) the multivariate regression analysis (Fig. 3), (2) the ANN technique based on the MLP 4–13-1 (Fig. 4), and (3) the ANN technique based on the RBF 4–19-1 (Fig. 5). These LAI prediction techniques presented casual trend of residual around zero and no important relations, suggesting that there is no serious failure of homoscedasticity, violations of the assumption of constant variance, in the LAI prediction of ANN models based on the RBF and MLP and the best predictive regression model.

Fig. 3
figure 3

The residuals against predicted LAI values (a) and predicted LAI values against observed LAI values (b) for the best predictive regression model

Fig. 4
figure 4

The residuals against predicted LAI values (a) and predicted LAI values against observed LAI values (b) for MLP 4–13-1

Fig. 5
figure 5

The residuals against predicted LAI values (a) and predicted LAI values against observed LAI values (b) for RBF 4–19-1

Discussion

We studied the performance of the artificial neural network (ANN) models and compared them with classical regression models to predict leaf area index (LAI) in pure-even aged Crimean pine stands. The results showed that the ANN models outperformed the classical regression models in predicting LAI, which may be attributed to ability of the ANN models to describe complex and nonlinear relationships between various variables. The ANN models have been successfully used to predict several stand attributes, for example, Hasenauer et al. (2001); Diamantopoulou (2005); Özçelik et al. (2008); Diamantopolou and Milios (2010); Özçelik et al. (2010); Leite et al. (2011); Diamantopolou and Özçelik (2012); Özçelik et al. (2013); Ashraf et al. (2013) and Özçelik et al. (2017). This study presents better predictive leaf area index results with the ANN models than the ones with the regression models. This finding may be explained by the ability of the ANN model for predicting complex and nonlinear relationships between various variables, which it is provided that this capability of the ANN in many studies. This ability of the ANN models depends on modeling such complexity and nonlinearity that overlooked by the traditional statistical regression models by the architecture of an ANN by allowing highly correlated inputs to be used to enhance its modelling capability. Thus, ANN models are becoming a widespread prediction method, because of the unnecessary of some statistical assumptions that are important for accurately predicting regression models that may fit the observed data in nowadays.

In studies about modeling LAI from stand parameters, Kashian et al. (2005) reported a significant but weak relationship between LAI or BAI and stand density (r2 < 0.20) at stand ages characterized by canopy closure. DeRose et al. (2010) developed some nonlinear regression equations, which indicated that LAI was significantly related to relative density, site quality, and stand top height, and concluded that LAI increased nonlinearly with increasing relative density holding stand top height constant. Sidabras and Augustaitis (2015) found indirect relationships between LAI and stand mean diameter and height in a multi-aged mixture of coniferous forest, while they found no significant relationships between LAI and stand sum square basal area. In the same study, the authors found a significant relationship between LAI and stand density and volume of the pure pine forest stands. Dantec et al. (2000) found that there were significant relationships between LAI and stand basal area and number of trees of stands. Özbayram et al. (2015) reported positive correlations between LAI and stand age, mean diameter, top height, green tree height, basal area, and litter in Turkish red pine stands, but negative correlation between LAI and stand age and mean diameter in black pine stands. The ANN models have been used for predicting LAI from remotely sensed data, for example, Bacour et al. (2006), Shoemaker and Cropper (2008), Verger et al. (2008), Omer et al. (2016), and Wang et al. (2017). Beyond these studies about the LAI predictions from the remote sensing data by the ANN models, the present study has underlined that the ANN models may be used alternatively to traditional regression techniques for predicting LAI predictions from terrestrial measurements such as some stand parameters.

The mechanisms underlying the relationships between LAI and the included stand parameters depend on the positive or negative effect of these stand parameters on the LAI. Particularly, the stand basal area, site index, and stand number of trees have positive effect, while the stand age has negative effect on the LAI. This positive effect of the stand basal area and number of trees on the LAI can be explained by the increase or crowding of trees in any forest stands, because increasing basal area and number of trees result in further trees in a forest stand and so increasing tree crown cover with further LAI values in these stands. In addition to stand parameters including the stand basal area and number of trees, the site index has a significantly positive effect on the LAI, since forest areas of high site quality with higher site index values are more favorable areas for growth of forest species, especially tree crown growth, and thus more LAI in these stands. However, the stand age showed a negative relationship with the LAI, since the number of trees decreases with increasing age due to age-related tree mortality and thus these decreases of stand number of tress result in less LAI values in forest stands. Khosravi et al. (2012) tried to determine the relations between LAI values and some stand parameters and found that there were moderate relationships (R2 = 0.36 for stand mean diameter, R2 = 0.36 for stand basal area, R2 = 0.45 for stand mean height) when the results obtained were evaluated. Some studies showed that LAI values decreased as the stand age increased (Sonohat et al. 2004; Homolova et al. 2007; Pokorny and Stojnic 2012). Bequet et al. (2012) also showed that the stand age was an important parameter in estimating stand LAI.

Among different ANN structures, the neural network architecture, based on the Radial Basis Function (RBF) architectures, showed greater performance in predicting LAI from stand parameters. These results may depend on the tree crown characteristics of a species growing in different regional and local forest conditions. Certain ANN models with different modeling architectures will provide better information for calibration response than other alternatives at various forest sites and for various tree species. Our results revealed that the use of hemispheric photography for LAI estimation in pure pine forest stands is rather appropriate. Because of time-consuming and labor-intensive nature of direct LAI determination methods, analysis of hemispherical photographs, which are fast, cheap and accurate can be successfully used to characterize forest canopy structure and to find relationships between LAI and forest stand parameters.

Conclusion

We evaluated the performance of ANN models for predicting LAI from stand parameters in pure-even-aged Crimean pine forests in Turkey. The ANN modeling proved useful in predicting LAI in study forests, suggesting that the ANN may be used in lieu of classical regression equations in cases the regression equations fails to perform adequately. However, weights of ANN models are specific to the study species and site conditions. Further studies should be conducted with different plant and site conditions to generalize the results across forest ecosystems. Especially, the ANN model provided better information for decision-making in management of forest resources, likely fulfilling the needs of field foresters to assess LAI as a measure of site potential when suitable site trees are not available in forest areas. The LAI predictions are reliable and accurate within the ecological ranges of the original data. However; these results may not be extrapolated outside the study area, as the relationships found in this study between LAI and stand parameters and these weights of ANN model are specific for this species and this study area.