Background

Tropical deforestation has been estimated to occur at a rate of 13 million ha per year [1], with an associated net loss of forest biomass of 1.3 ± 0.7 Pg C year−1 [2]. Remote sensing has been successful at mapping global deforestation [3, 4]. However, deforestation presents a simplified view of forest disturbance that ignores the many graduations of lower intensity, but often widespread, forest disturbance and degradation [5]. Forest disturbance refers to the mechanisms which limit biomass by causing its destruction [6]. The impact of forest disturbance is highly variable, leading to the total or partial loss of biomass through a diverse range of natural (e.g. disease, droughts, fires, herbivory and windstorms) and/or anthropogenic processes (e.g. urbanisation, agriculture, selective logging, and fires). Remotely sensed information on the spatial extent and intensity of forest disturbance would be extremely useful for managers and ecologists in attempts to develop a mechanistic understanding of forest degradation and the processes of forest disturbance [7, 8].

Unfortunately, whilst satellites have the coverage to provide global information on forest disturbance, very few studies have attempted to do so [9]. The focus of remote sensing has remained on mapping deforestation (e.g. [3]) and arises from a number of technological limitations which include, amongst other factors, observation precision and the relative “directness” of the observation [1012]. Ideally remote sensing measurements would have a direct dependence on the quantity being estimated; however current optical satellite derived estimates of forest disturbance are not able to achieve this ideal. Instead, optical satellite measurements of forest disturbance rely on observing physical properties that are expected to correlate with disturbance (e.g. changes in leaf area that can be detected by satellites due to a change in the absorption in the chlorophyll spectral bands). When these correlations change, indirect measures cannot be expected to provide a robust estimate of disturbance.

As Synthetic aperture radar (SAR) is sensitive to the structural properties of forests it can be thought of as a more direct measure of forest biomass than the passive optical alternatives [8, 13, 14]. SAR provides one of the only viable data sources currently available for global monitoring of forest disturbance at moderate spatial resolutions. However, the low signal to noise ratio of SAR leads to poor precision and large uncertainty in the estimated biomass of individual pixels [13] and this equates to high levels of uncertainty when detecting biomass change at the pixel level. Filtering can be used to improve the signal to noise ratio of SAR, but this improvement comes at the expense of a reduction in the effective spatial resolution of the biomass change estimates. Therefore remote sensing can provide regional estimates of biomass change or fine scale maps of deforestation, but it cannot yet be said to truly determine the intensity of forest degradation across scales [13].

In this study we set out a novel framework for quantifying the intensity of forest disturbance from successive remotely sensed biomass maps. This approach can describe both focused high intensity forest loss and low intensity, widespread degradation. We provide an example methodology to exploit this framework using SAR data with realistic observation errors. We explore the strengths and limitations of this approach using an observation system simulation experiment.

Results and discussion

A framework for quantifying the intensity of forest disturbance

The framework that we propose describes forest disturbance for an area in which all satellite observation pixels are assumed to experience the same disturbance regime. That is, whilst each pixel might or might not be disturbed, each pixel within each area has the same probability of being disturbed and the same relative loss of biomass when disturbed. We consider an ensemble of n remote sensing pixels, which are observed on two successive dates (t = 1 and t = 2). This approach builds on an earlier framework set out in Williams [15] that used the biomass distribution at a single point in time. The first advantage of this new approach is that it allows ensemble statistics to be calculated, negating the limitations of poor signal to noise of individual biomass pixels by estimating ensemble’s mean fractional loss of biomass per year (E M). The second advantage of the framework is that it permits E M to be split into two factors for the ensemble: the probability of disturbance per year for each ensemble pixel member (E P) and the fractional loss of biomass per disturbance for each disturbed ensemble pixel member (E I), Eq. 1.

$$E_{\text{M}} \approx E_{\text{P}} E_{\text{I}}$$
(1)

Both E M and its factors E P and E I can take values ranging from 0–1. Where, for example, an E P = 0.05 would imply a 1 in 20 chance of each pixel being disturbed in a year. An E I = 0.2 implies that, if disturbed, a pixel will lose 20 % of its biomass. Combining these example factors would lead to the expectation of E M = 0.01, or a 1 % reduction in mean biomass for the ensemble. The ensemble size can be picked to balance the competing requirements of high precision on estimates of E M, E P and E I and meeting the assumption of a continuous disturbance regime.

The inclusion of the factors E P and E I allows a flexible description of biomass loss, without the arbitrary distinction between deforestation and lower intensity forest disturbance. Forest disturbance (other than deforestation) can be represented by the parameter space 0 < E M < 1, 0 < E I < 1 and 0 < E P ≤ 1 (Fig. 1). There are also several special cases: the total deforestation of the ensemble area (E M = 1, E I = 1 and E P = 1); partial deforestation within the area (0 < E M < 1, E I = 1 and 0 < E P < 1); and no disturbance (E M = 0).

Fig. 1
figure 1

The relationships between the magnitude of disturbance (i.e. the mean fractional disturbance of a region per year, E M), the disturbance probability (i.e. the probability of a pixel being disturbed each year, E P), and the disturbance intensity (i.e. the fraction of biomass lost if disturbed, E I) are shown. Indicated within the figure are regions corresponding to widespread forest degradation, localised deforestation and widespread deforestation

Describing forest disturbance using this framework has a number of advantages over traditional descriptors as it avoids the need for an arbitrary threshold for a forest cover loss used in other studies, e.g. [16]. In turn, this allows for a more nuanced description of forest disturbance than is possible with categorical land cover classes, or measures of forest cover.

Using the framework to estimate biomass loss and disturbance intensity

Using the disturbance framework, our observation system simulation experiments (OSSE) show it is possible to robustly estimate the mean biomass loss for the ensemble (E M) and also the disturbance regime, as described by E P and E I (Fig. 2; Table 1). E M is better constrained than either E I or E P, reflecting the direct impact of E M on the mean of the biomass distribution versus the more variable impacts of the E I or E P on the second and third order moments: standard deviation and skew. With larger mean disturbances the constraints on E I or E P improve, presumably due to the larger number of pixels effected and/or greater impact on the observed biomass.

Fig. 2
figure 2

Estimates from the biomass difference approach for nine different combinations of disturbance intensity and disturbance probability. Crosses indicate the actual E P and E I used for each synthetic analysis. Coloured areas indicate the most likely combinations of E I and E P as estimated by the difference approach. The filled areas encompass the first 95 % of the cumulative likelihood, L. Low intensity cases are shown in blue, high intensity cases are shown in red and the special case of no disturbance is shown in grey

Table 1 The maximum (97.5 % CI), minimum (2.5 % CI) and 95 % CI range (maximum–minimum) for estimates of E M , E I and E P are shown for each of the test

How the framework is affected by observation bias

It is highly likely that biomass maps used with the framework will not only have random noise, but also systematic observation errors [13], therefore we test the framework’s sensitivity to bias. The inclusion of a realistic observation bias of ±160 gC m−2 (±1.6 tC ha−1) [13] had the largest impacts on the estimates of E M, which showed a bias of 0.004 (0.4 %). However E I and E P were less affected (Fig. 3; Table 1) and it was still possible to distinguish high and low intensity disturbance regimes. The bias in the estimate for E M is consistent with change in the mean biomass that is implied by a bias of 160 gC m−2, and would therefore apply to any other approach to estimating biomass loss.

Fig. 3
figure 3

As for Fig. 2, but showing the impact of observation bias on estimated E P and E I. Biases of 160, 0 and −160 gC m−2 are applied to the first of the biomass images. For each bias two example cases with the same E M are shown: (1) High intensity disturbance, E I = 0.9 and E P = 0.05. (2) Low intensity disturbance, E I = 0.05 and E P = 0.9

What is the optimal ensemble size?

The accuracy of predictions using the framework are best when considering an ensemble size of n ≥ 1600, whilst for n ≤ 400, the precision of our estimates drops rapidly (Fig. 4; Table 1). Therefore, for the 25 m pixels typical of current SAR biomass estimates, the recommended ensemble size covers an area of at least 1 km2 or 100 ha. The implication is that it must be reasonable to assume that the disturbance regime is constant across areas of at least 100 ha. We expect this minimum area to be robust, at least for ALOS PALSAR data, but the sensitivity to ensemble size should checked in each new study. It is worth noting that there is no requirement for these areas to be rectangular, or even contiguous. The grainy texture evident in the estimates from the analyses is due to the simulation of stochastic disturbances (Figs. 2, 3, 4). This graining is reduced with increasing ensemble size, but can be further reduced, at significant computational expense, by averaging repeat runs of the maximum likelihood estimation.

Fig. 4
figure 4

The effect of ensemble size (the number of pixels) on the analysis. The cross shows the synthetic truth and the coloured areas indicate the most likely E I and E P

Alternative formulations of the framework

We formulated the framework in terms of a fractional change in biomass for both E M and E I. This is not the only possible formulation, nor is it necessarily the optimal for all situations; however, it is a mathematically simple approach to maintaining a positive (i.e. plausible) biomass for each pixel. The implicit assumptions of the formulation are: 1) that all pixels are equally likely to be disturbed, and 2) that when disturbed a fixed fraction is lost, irrespective of the starting biomass. It is possible to imagine scenarios that would not be well described by our scheme where (say) selective logging only targets the largest trees (i.e. E P is high for pixels above a threshold biomass and zero for all others). These scenarios do not contradict our first assumption of common disturbance regime for the ensemble and it should be possible to restate the parameters used in the framework to accommodate a particular set of assumptions about the disturbance regime. However the inclusion of more complicated mathematical representations and, specifically, more parameters, can be expected to increase the challenge of estimating the parameters of any new framework formulation and decrease its general applicability.

Limitations to the framework

Finally there are two notable caveats: Firstly, the success of the approach is strongly tied to the ability to characterise the random error of biomass estimates. The design of the analysis mitigates some of the errors as biomass errors that are consistent between the two biomass maps will be removed by this differencing. Whilst Ryan et al. (2012) report normal errors in the biomass domain, other studies assume errors will actually be normal in the dB (i.e. log10) domain [17] which would result in log-normal, asymmetric errors on biomass estimates. We therefore reran the synthetic experiments with 0.5 dB error and achieved similar results. The second caveat is that the biomass differencing approach assumes the carbon model (A-DALEC) is unbiased. This assumption is, by definition, valid in a observation system simulation experiment (OSSE). However in practice the productivity of the ecosystem is not known perfectly, and so estimates of E M can be expected to be biased. However the assumption is not as crude as it at first might seem: ignoring the production term (as is implicitly done by most deforestation algorithms) makes the assumption that the forest otherwise in steady state. This issue is likely to be more severe in field sites where less information is available on which to provide independent estimate of the rates of aboveground biomass accumulation, e.g. [18].

Conclusions

From a management and policy perspective it is important to be able to distinguish between the different intensities of biomass loss as they may be associated with different disturbances mechanisms (e.g. low intensity disturbances are likely driven by a need for timber or fuel, and high intensity disturbances driven by a need for agricultural land) [19]. However, current estimates of deforestation and biomass loss are not adequate for estimating lower intensity forest disturbance. Theoretically, high resolution biomass loss estimates could provide fine-scale estimates of forest degradation, but the current precision is not adequate and there is no immediate prospect of this changing, partly due to the speckle and other noise in SAR imagery [20]. The framework that we have described is a pragmatic representation of forest degradation that uses ensemble statistics to mitigate the poor precision of current SAR biomass estimates [13, 17, 20]. SAR is expected to remain a key technique for global mapping of forest biomass, and the framework we propose is compatible with the upcoming BIOMASS and new L-band satellites [14]. Using an OSSE we have shown that is possible to robustly estimate the parameters of the framework to describe forest disturbance, provided that two successive biomass maps, separated by at least one year, are available. We suggest that using a similar framework will allow remote sensing studies to provide more relevant constraints on estimates of land-use change.

Methods

Estimating the framework parameters

A number of approaches could be taken to estimating E M, E I and E P, we chose to use maximum likelihood estimation. We identify the combinations of E I and E P that allow a simulations to most closely match the observed changes in the biomass distribution from time t = 1 to time t = 2, for an ensemble of n pixels.

The observed biomass (O t i ) of the ith pixel at time t is sum of the actual biomass (B t i ) and the measurement noise (N t i ), Eq. 2. The observed biomass at time t + 1, follows the same logic (Eq. 3). The actual biomasses are related via the growth G t i and disturbance D t i of each pixel in the ensemble (Eq. 4).

$$O_{i}^{t} = B_{i}^{t} + N_{i}^{t} \;\;\,{\text{for }}\;i = \, 1, \, 2, \, 3, \ldots ,n$$
(2)
$$O_{i}^{t + 1} = B_{i}^{t + 1} + N_{i}^{t + 1} \;\;\,{\text{for}}\;\;i = { 1},{ 2},{ 3}, \ldots ,n$$
(3)
$$B_{i}^{t + 1} = B_{i}^{t} + G_{i}^{t} - D_{i}^{t} \;\;\,{\text{for}}\;\;i = { 1},{ 2},{ 3}, \ldots ,n$$
(4)

Similarly we are able to simulate the observed biomass at time t (Os t i ) based on the simulation biomass (Bs t i ) added to the simulated noise (Ns t i ), Eq. 5. The observed biomass at time t (Os t+1 i ) is calculated from the simulation biomass (Bs t i ) through the sum of the simulated growth (Gs t i ), the simulated disturbance (Ds t i ), and the simulated noise (Ns t+1 i ), Eq. 6.

$$Os_{i}^{t} = Bs_{i}^{t} + Ns_{i}^{t} \;\;\,{\text{for}}\;\;i = { 1},{ 2},{ 3}, \ldots ,n$$
(5)
$$Os_{i}^{t + 1} = Bs_{i}^{t} + Gs_{i}^{t} - Ds_{i}^{t} + Ns_{i}^{t + 1} \;\;\,{\text{for}}\;\;i = { 1},{ 2},{ 3}, \ldots ,n$$
(6)

where Ds t i is related to the disturbance parameters in Eq. 7.

$$Ds_{i}^{t} = \left\{ {\begin{array}{*{20}c} 0 & {{\text{if }}U([0,1]) \ge E_{p} } \\ {E_{I} Bs_{i}^{t} } & {otherwise} \\ \end{array} } \right.\;\;{\text{for}}\;\;i = { 1},{ 2},{ 3}, \ldots ,n$$
(7)

We model Gs t i using the A-DALEC model, a biogeochemical model of carbon cycling in forests, which runs at an annual time step (see Additional file 1 A for full details) [15, 21]. Ns t i and Ns t+1 i are based on observed pixel uncertainty in SAR biomass estimates [13]. We cannot base Bs t i directly on the observed biomass O t i , as the observation noise N t i cannot be determined (and thus removed) for each pixel, and so the A-DALEC model is used to spin-up an estimate of Bs t i , see Additional file 1 section B for full details. The equations above assume that t = 1 and t = 2 are separated by 1 year. When the separation is more than 1 year, as in the case of our OSSE, the base Bs t i should have the annual growth and disturbance applied once for each year of separation.

95 % confidence intervals for the factors E P and E I are found using a maximum likelihood estimation routine, see Additional file 1 C for full details. This routine finds the most likely combinations of E P and E I based on minimising the differences between the observed and simulated ensemble distributions; ΔO = O t+1 − O t and ΔOs = Os t+1 − Os t.

Testing the framework parameter estimates

To allow the technique to be assessed against a known ‘truth’ we use a set of observation system simulation experiment (OSSE). In the OSSE we simulate two successive (noisy) biomass observations which have observation noise and a definable disturbance regime applied. The OSSE observations are derived by spinning up the A-DALEC model to match ALOS observations within the Gorongosa and Nhamatanda districts of Sofala province in central Mozambique. The area is dominated by dry miombo woodland, the dominant woodland type in Southern Africa, see Additional file 1 B. The first OSSE observation, t = 1, is taken at the end of the 1st year, the second observation, t = 2, is taken at the end of the 5th year, i.e. separated by 4 years. Separate spin-ups are performed for the ΔO and ΔS simulations. Three tests are performed; the first to assess the ability of the approach to identify E M, E I and E P, the second to assess the impact of observation bias and the third to determine impact of ensemble size.

Determining the accuracy of the framework parameter estimates: test 1

A set of nine analyses were used to test the ability of our approach to identify E M, E I and E P. Each analysis was performed for the same size ensemble of n = 1600 pixels, which equates to an area of 1000 by 1000 m given the 25 by 25 m pixel size of the ALOS biomass maps [13]. We perform OSSEs with four levels of mean disturbance E M = 0, 0.018, 0.045, 0.090 and 0.180. These disturbance fractions equate to annual biomass loss rates of 0, 1.8, 4.5, 9 and 18 %. For each nonzero E M we simulate a high intensity (E IE P) and low intensity (E IE P) disturbance scenario. In the high intensity cases only a few pixels are disturbed, but when disturbed 90 % of the biomass is lost (i.e. E I = 0.9). In the low intensity cases, 90 % of the pixels are disturbed (i.e. E P = 0.9), but each disturbance results in a small loss of biomass at the pixel level. The various combinations of factors are shown in Table 1.

Determining the impact of observation bias on the framework parameter estimates: test 2

To simulate the impact of observation bias we perturb the synthetic experiments using the bias observed in the actual biomass maps [13]. Only the first biomass observation (i.e. at time t = 1) is biased by −160, 0, and 160 gC m−2. The impact of the bias is assessed using both a low intensity case (i.e. E I = 0.05 and E P = 0.9), and a high intensity case (i.e. E I = 0.9 and E P = 0.05). In both cases the OSSE has an E M = 0.045 and an ensemble size of n = 1600 pixels.

Determining the impact of ensemble size on the framework parameter estimates: test 3

The impact of varying the ensemble size is tested for n = 6400, 1600, 400, and 100. These ensembles equate to areas of 2000 by 2000 m (6400 pixels), 1000 by 1000 m (1600 pixels), 500 by 500 m (400 pixels), and 250 by 250 m (100 pixels). The areas are centred on the same location. We analyses these areas for a single low intensity disturbance regime E M = 0.045, E I = 0.05 and E P = 0.9.