Impact of data model and point density on aboveground forest biomass estimation from airborne LiDAR

Garcia, Mariano; Saatchi, Sassan; Ferraz, Antonio; Silva, Carlos Alberto; Ustin, Susan; Koltunov, Alexander; Balzter, Heiko

doi:10.1186/s13021-017-0073-1

Impact of data model and point density on aboveground forest biomass estimation from airborne LiDAR

Research
Open access
Published: 15 February 2017

Volume 12, article number 4, (2017)
Cite this article

Download PDF

You have full access to this open access article

Carbon Balance and Management Aims and scope Submit manuscript

Impact of data model and point density on aboveground forest biomass estimation from airborne LiDAR

Download PDF

Mariano Garcia ORCID: orcid.org/0000-0001-6260-5791^1,2,
Sassan Saatchi¹,
Antonio Ferraz¹,
Carlos Alberto Silva^1,3,4,
Susan Ustin⁵,
Alexander Koltunov⁵ &
…
Heiko Balzter^2,6

5159 Accesses
33 Citations
2 Altmetric
Explore all metrics

Abstract

Background

Accurate estimation of aboveground forest biomass (AGB) and its dynamics is of paramount importance in understanding the role of forest in the carbon cycle and the effective implementation of climate change mitigation policies. LiDAR is currently the most accurate technology for AGB estimation. LiDAR metrics can be derived from the 3D point cloud (echo-based) or from the canopy height model (CHM). Different sensors and survey configurations can affect the metrics derived from the LiDAR data. We evaluate the ability of the metrics derived from the echo-based and CHM data models to estimate AGB in three different biomes, as well as the impact of point density on the metrics derived from them.

Results

Our results show that differences among metrics derived at different point densities were significantly different from zero, with a larger impact on CHM-based than echo-based metrics, particularly when the point density was reduced to 1 point m⁻². Both data models-echo-based and CHM-performed similarly well in estimating AGB at the three study sites. For the temperate forest in the Sierra Nevada Mountains, California, USA, R² ranged from 0.79 to 0.8 and RMSE (relRMSE) from 69.69 (35.59%) to 70.71 (36.12%) Mg ha⁻¹ for the echo-based model and from 0.76 to 0.78 and 73.84 (37.72%) to 128.20 (65.49%) Mg ha⁻¹ for the CHM-based model. For the moist tropical forest on Barro Colorado Island, Panama, the models gave R² ranging between 0.70 and 0.71 and RMSE between 30.08 (12.36%) and 30.32 (12.46) Mg ha⁻¹ [between 0.69–0.70 and 30.42 (12.50%) and 61.30 (25.19%) Mg ha⁻¹] for the echo-based [CHM-based] models. Finally, for the Atlantic forest in the Sierra do Mar, Brazil, R² was between 0.58–0.69 and RMSE between 37.73 (8.67%) and 39.77 (9.14%) Mg ha⁻¹ for the echo-based model, whereas for the CHM R² was between 0.37–0.45 and RMSE between 45.43 (10.44%) and 67.23 (15.45%) Mg ha⁻¹.

Conclusions

Metrics derived from the CHM show a higher dependence on point density than metrics derived from the echo-based data model. Despite the median of the differences between metrics derived at different point densities differing significantly from zero, the mean change was close to zero and smaller than the standard deviation except for very low point densities (1 point m⁻²). The application of calibrated models to estimate AGB on metrics derived from thinned datasets resulted in less than 5% error when metrics were derived from the echo-based model. For CHM-based metrics, the same level of error was obtained for point densities higher than 5 points m⁻². The fact that reducing point density does not introduce significant errors in AGB estimates is important for biomass monitoring and for an effective implementation of climate change mitigation policies such as REDD + due to its implications for the costs of data acquisition. Both data models showed similar capability to estimate AGB when point density was greater than or equal to 5 point m⁻².

LiDAR Data Fusion to Improve Forest Attribute Estimates: A Review

Article Open access 21 June 2024

Mapping and monitoring peatland conditions from global to field scale

Article Open access 10 October 2023

Fire activity and fire weather in a Lower Mekong subregion: association, regional calibration, weather–adjusted trends, and policy implications

Article 21 June 2024

Background

Forests provide essential ecosystem services at a range of scales and represent a major sink of atmospheric carbon, yet can turn into a significant carbon source due to deforestation and forest degradation. Therefore, identifying the role of forests as carbon sinks or sources is key to understanding the carbon cycle [1]. Likewise, development of precise forest monitoring systems is essential for the effective implementation of climate change mitigation policies such as REDD + (reducing emissions from deforestation and degradation), which require accurate mapping of aboveground biomass (AGB) and its changes.

Numerous studies have proved the ability of LiDAR data to provide accurate estimations of field-measured AGB across different ecosystems [2–5] given its capability of providing detailed 3D measurements of forest structure. Nevertheless, the accuracy of the LiDAR estimation is subject to the accuracy of the field measurements and allometric equations used to derive AGB, which are subsequently used to calibrate LiDAR-based models [6].

Sensor characteristics and flight planning parameters affect LiDAR measurements of the spatial distribution of canopy components and therefore of the vegetation structure metrics derived from them. Similarly, digital elevation models (DEM) and digital surface models (DSM) derived from the LiDAR data are also affected by acquisition parameters. These effects will be propagated to the canopy height model (CHM) obtained by subtracting the DEM from the DSM. The effect of LiDAR survey parameters on the derivation of biophysical properties from airborne LiDAR data has been investigated in different studies. For example [7, 8], concluded that the use of different sensors or variation of flying altitude and pulse repetition frequency (PRF) between different acquisitions result in significant differences of LiDAR metrics sensitive to the vertical distribution of vegetation and canopy density. However, Hopkinson [9] found that laser pulse peak power concentration was the most important factor in the variation of intensity and frequency distribution of returns, although with different effects over short and tall vegetation. Scan angle has also been shown to affect fractional cover (FC) estimates, yet for small scan angles the effect is less evident [7].

In most of these studies, the effect of survey configuration on the LiDAR information was assessed by collecting new data with varying survey parameters. However, variation of acquisition settings like flying height or PRF results in a simultaneous variation of more than one LiDAR parameter like footprint size, point density or pulse power. This makes it difficult to generalize the effect of changing a single parameter on the resulting point cloud. In order to isolate the effect of each survey characteristic on the resulting point clouds and the height estimated from them, Disney et al. [10] simulated different point clouds for different scenarios defined by modifying a single parameter at a time, using a Monte Carlo Ray trace (MCRT) model of canopy scattering. Some of these studies have shown a general increase in the retrieved vegetation height with an increase flying height or reduced PRF [8, 10] whereas the opposite effect has also been reported [7, 9] as a result of a reduction of the pulse energy per unit area.

LiDAR vegetation measures can be represented using two different data models, the echo-based and the CHM raster model. The former represents forest structure by means of 3D point cloud whereas the latter summarizes this information into a raster where each pixel represents the maximum height of the points contained within it. The CHM approach significantly decreases the volume of the data at the expense of loss of information provided. Some studies have evaluated the effect of echo- and CHM-based models on the retrieval of canopy gaps [11] or more recently, on the estimation of AGB [12]. Nevertheless, these studies did not evaluate the impact of varying acquisition parameters on the metrics derived from each data model.

In the context of carbon monitoring, which requires repeated acquisitions at a certain interval, it is likely that each survey will be carried out using different sensors or flight configurations. In addition, in order to maintain cost-efficiency of LiDAR data for REDD + MRV (measuring, reporting and verification), optimum survey configurations should be planned. Point density, along with the footprint size, determines the spatial resolution of LiDAR datasets. It is probably the most important parameter when planning a LiDAR acquisition, with a significant impact in acquisition costs, as it is common to target a minimum point density for the study area in order to maximize spatial coverage. Therefore, the evaluation of the effect of both point density and the data model used on the estimation of AGB becomes an important issue in the MRV process.

This study aims at evaluating the potential of the echo-based and the CHM data models for AGB estimation over three forests across different biomes, and how they are affected by the point density. The specific objectives were to: (1) evaluate the effect of point density on the metrics derived from each data model; (2) evaluate the impact of plot size on the metrics; (3) evaluate the potential of these data models to estimate AGB in different forests with very different vegetation types; and (4) evaluate the impact of point density on the derived empirical models.

Results

Effect of point density on the metrics

Tables 1, 2 and 3 show the results of the two-sided Wilcoxon signed rank test for the null hypothesis that there were no statistically significant differences in medians between the metrics derived from the original and the thinned data. In all three sites, the reduction of point density resulted, for most of the metrics, in significant differences between the metrics derived from the original and the thinned datasets. These results were also supported by a two-sided one-sample t test of the differences in means between the original and the thinned datasets (results not shown). Point density reduction had larger impact on the metrics derived from the CHM than on those derived from the echo-based model. The effect of point density on the metrics also generally showed similar behavior for the different plot sizes tested, from 0.09 to 1 ha. Although similar patterns were observed at the three study sites, some differences exist among them, reflecting differences in their vegetation structure. For instance, the area under the canopy waveform (AUCW) obtained from the echo-based model showed significant differences in the Sierra Nevada Mountains in California (SNM herein after) and on Barro Colorado Island, Panama (BCI herein after), whereas in the Sierra do Mar in Brazil (SdM herein after) differences were not statistically significant. Similarly, while differences in fractional cover (FC) or the standard deviation of the height (StdH) were significantly different from zero for any point density or data model in SNM, in BCI and SdM the differences were only significant for the lowest point density (1 point m⁻²).

Table 1 Two-sided Wilcoxon signed rank test results of the point density effect on LiDAR metrics for the Sierra Nevada Mountains study site

Full size table

Table 2 Two-sided Wilcoxon signed rank test results of the point density effect on LiDAR metrics for the Barro Colorado Island study site

Full size table

Table 3 Two-sided Wilcoxon signed rank test results of the point density effect on LiDAR metrics for the Serra do Mar study site

Full size table

Although the statistical test resulted in significant differences for the metrics derived at different point densities, the magnitude of these differences was generally very low. In all three sites, canopy height values decreased as the point density was reduced. The mean difference between the maximum height estimated from the highest point density and the thinned data was negligible, except for the lowest point density (1 point m⁻²), for which it could be larger than 1 m. In addition, in most cases the standard deviation of the differences was larger than the mean. Thus, the mean differences (±standard deviation) in maximum canopy height ranged between −0.02 m (±0.20 m) and 1.16 m (±0.87 m) in SNM, −0.03 m (±0.66 m) and 0.84 m (±1.73 m) in BCI and 0 (±0.94 m) and 0.97 m (±1.37 m) in SdM. The same trends were observed regardless of the data model used, although differences from the CHM-derived metrics were larger. The same pattern was observed for other metrics related to the vertical distribution of vegetation (mean and percentiles of the height) derived from the echo-based model. Differences ranged between −0.09 m (±0.10 m) and 0.06 m (±0.20 m) in SNM, between −0.08 m (±0.37 m) and 0.11 m (±0.48 m) in BCI and between −0.52 m (±0.56 m) and 0.96 m (±0.66 m) in SdM. Height metrics derived from the CHM were more affected by point density with differences ranging between 0.29 m (±0.23 m) and 5.09 m (±3.49 m) in SNM, 0.25 (±0.50 m) and 5.19 m (±1.93 m) in BCI and 0.14 m (±0.12 m) and 4.82 m (±1.20 m) in SdM. These differences were statistically significant and unlike for the metrics derived from the echo-based model, the standard deviation was smaller than the mean. In all cases, the largest differences were attained at the lowest point density (1 point m⁻²). In the case of the coefficient of variation of the height, differences were generally not significant for all three sites when it was derived from the echo-based model but became significant when derived from the CHM-based model. In the case of the standard deviation of the height, different behavior was observed for each study site, with significant differences observed in SNM but not in SdM. When the metric was derived from the CHM, the differences were significant at all three sites. Moreover, while the differences in the standard deviation of the height were less than 15 cm when derived from the echo-based model, they were larger than 1 m when derived from the CHM in SNM and BCI. Regarding the AUCW, smaller values were obtained as the point density was reduced, particularly when derived from the CHM. Finally, in the case of FC, differences were less than 2% in all three sites when derived from the echo-based model, despite being statistically significant for the SNM study site. Slightly larger differences were obtained when FC was derived from the CHM, with values reaching 5% in SNM, 14% in BCI and 2% for the SdM.

The boxplots in Fig. 1 show a summary of the variation of mean canopy height and FC subsequently used to model AGB, derived from each data model at each study site as a function of point density. These variables were used to model AGB from the LiDAR data.

Effect of plot size on the metrics

The effect of structural variability associated with the plot size varied among the study sites and the data models used. Variables like AUCW, coefficient of variation and StdH, showed different behavior in each study site. The same pattern of the effect of plot size on the metrics was observed for the different point densities evaluated (Tables 4, 5 and 6). FC and height related metrics, except maximum height and P25H, did not show statistically significant differences (p value >0.05). Maximum canopy height showed significant differences at all sites as it could be expected with absolute differences ranging between 3.15 m (±3.55 m) and 8.13 (±4.54 m); 1.67 m (±3.26 m) and 6.64 m (±6.19 m); and 2.57 m (±2.08 m) and 7.24 m (±0.44 m) for SNM, BCI and SdM, respectively. P25H showed different behavior for SNM than for BCI and SdM, which could be a result of a more open canopy. Whereas for SNM the differences ranged between 0.45 m (±1.78 m) and 0.90 m (±0.28 m), for BCI the range was from −0.05 m (±1.42 m) to −0.46 (±3.60 m) and for SdM they spanned from −0.01 m (±1.60 m) to −0.48 m (±1.49 m). A monotonic effect of plot size on the metrics was observed at all point densities, i.e. an increase or decrease as the plot size varied, regardless of the data model used or the study site.

Table 4 Two-sided Wilcoxon signed rank test results of the plot size effect on LiDAR metrics for the Sierra Nevada Mountains study site

Full size table

Table 5 Two-sided Wilcoxon signed rank test results of the plot size effect on LiDAR metrics for the Barro Colorado Island study site

Full size table

Table 6 Two-sided Wilcoxon signed rank test results of the plot size effect on LiDAR metrics for the Serra do Mar study site

Full size table

The CHM-derived metrics generally showed a greater dependence on the plot size than the echo—based metrics, especially for the SNM. Although for BCI and SdM the mean differences of the metrics were similar, with the exception of AUCW and P25H, the standard deviation for the metrics derived from the CHM was higher for all metrics and study sites.

The boxplots in Fig. 2 show a summary of the variation of mean canopy height and FC derived from each data model at each study site as a function of plot size.

Aboveground biomass modeling

Table 7 presents the results of the power models adjusted to estimate AGB for each study site and data model. It also presents the effect of point density on the model derived at the original point density. Both data models performed similarly in all study sites and no effect of point density was observed for the echo-based model. This was expected due to the small changes observed in mean height and FC when the metrics were derived at different point densities. Although differences between the AGB estimates from the thinned datasets were statistically significant (p value <0.05), except for SdM, the largest error for SNM was only 4% of the mean AGB derived from the model trained with the highest point density. For BCI and SdM the largest errors represented less than 1 and 5%, respectively. In all three sites, the CHM-based model showed a remarkable decrease in performance when applied to the lowest point density (1 point m⁻²). This effect was not reflected in terms of R² but in the RMSE. Moreover, the largest error represented up to 48, 23 and 15% of the mean AGB derived from the model trained with the highest point density for SNM, BCI and SdM, respectively. The inclusion of FC in the model slightly improved results in SNM and SdM but had no effect in BCI.

Table 7 Model (echo-based and CHM) evaluation for each study site and power model fitted

Full size table

Figure 3 shows the scatter plot of the estimated AGB from the different models and resolutions compared to the field measurements. Points almost overlap when the model is calibrated using echo-derived metrics whereas higher discrepancies are observed in the CHM-based models. This trend is observed in the three study sites.