1 Introduction

Shrubland ecosystems are globally distributed and highly heterogeneous in structure and plant species composition. Under certain conditions, fires in shrubland ecosystems can spread fast with high intensity [1, 2]. These fires have the potential to burn extensive areas under extreme fire conditions, threatening human life, property and a result in broad range of environmental impacts [3]. Recent advances in computational models have shown the potential to predict fire behaviour effectively [1, 4, 5]. These models are used to characterise fire behaviour under specific fuel and weather conditions and predict fire spread during a fire event [1].

In semi-arid environments, fuels are often organised in stratified layers and discrete clumps separated by bare ground [1, 6]. Discontinuities in the horizontal distribution of canopy fuels associated with large-scale heterogeneous fuel complexes cause discontinuous fire behaviour. For example Oak chaparral in California [7], sage brush in the interior Western US and Mallee-heath [8] and hummock grasslands [1, 9] in Australia. A small change in the drivers of the fire can lead to a large change in resulting fire behaviour, including transitions from surface to a crown fire, with a corresponding increase in the rate of spread [2].

Given the influence of fuel characteristics on the behaviour of fire, it is essential to explicitly account for the effects of vegetation structure when exploring the interaction between forest structure and fire behaviour [4, 10,11,12,13,14,15]. Parsons et al. [13] suggest that fine-scale canopy fuel spatial patterns can affect both mean and variability in fire behaviour outcomes and canopy cover is one of the strongest drivers of changes in fire metrics.

It is a time-consuming challenge for managers to collect accurate field information throughout the lifecycle of wildfires. However, recent development in remote sensing technology provides researchers with an opportunity to quantify and map patterns of variation in forest cover and health assessment across space and time [16, 17]. Satellites and aircraft are the main platforms used to collect such data. Satellites are suitable for covering a large area, and for conducting multi-temporal analysis. However, due to limited penetration capability of optical imagery in vegetated areas, satellite imagery is currently unable to provide detailed measurements of forest structure often required for use in fire simulation models [18], although space based laser ranging technologies, for example NASA’s Global Ecosystem Dynamics Investigation, ‘GEDI’ are progressively bridging this gap [19, 20].

Airborne Laser Scanning (ALS) is an active remote sensing technique which utilises reflections from laser pulses emitted at known locations to determine the 3D properties of the environment [21]. Ongoing research into the processing and analysis of ALS data has allowed for the development of an extensive range of ALS data products from which a wide range of forest metrics can be derived [22,23,24]. Studies have shown that ALS can provide reliable estimates of 3D forest canopy structure [21, 25,26,27]. Due to its ability to penetrate through forest canopy, ALS provides superior canopy cover estimates to airborne and satellite optical imagery [28, 29].

Traditionally, canopy cover was determined by field measurements such as Cajanus tube and line intersect sampling method [30, 31]. However, these measurements are labour-intensive, highly time-consuming and expensive. Thus, it becomes difficult to collect sufficient field measurements for estimating canopy cover over large areas [32]. Recently various algorithms have been developed to estimate canopy cover from ALS data based on both the geometry of the captured scene and the properties of the returns within that scene [21, 32,33,34]. Furthermore, previous studies have indicated that ALS based estimations of canopy cover can be affected by many factors, such as LiDAR scan angle [35], ALS point density [21], and forest composition [36]. However, we currently lack a thorough understanding of how the properties of these algorithms, data capture and forest structure across a range of different forest types influence canopy cover estimations. Any differences in canopy cover, due to the choice of algorithm, are likely to result in unknown variations of predicted fire behaviour from the empirical models which use this metric. Given the potential of ALS data and the ability to derive fuel maps at high spatial resolutions, a greater understanding of how different algorithms to derive canopy cover interact with existing fuel models is required.

For improving predictions of fire behaviour, it is critical to understand the sensitivity of fire behaviour models to the variation in canopy cover descriptions. The increasing availability and diversity of new methods for deriving canopy cover for consumption into fire behaviour models require an understanding of how different methods for estimating the same metric can affect model outputs as a first step towards improving predictions.

In this study, we investigate the effect of ALS-derived canopy cover estimates from different ALS processing algorithms on the outcomes (burned area, unburned area ratio and fire extent) of a Mallee-heath fire spread model. To date, a generalised understanding of how algorithms to derive fuel related input data affect fire behaviour in this environment is lacking. This study therefore fills an essential knowledge gap by providing information on (1) how five different canopy cover estimate algorithms in Mallee-heath vegetation types produce cover estimates and (2) the impact of these different canopy cover estimates on fire behaviour model outputs such as burned area, unburned area ratio and fire extent.

2 Materials and Methods

2.1 Study Area

The study site for this experiment is located 25 km north of Renmark in South Australia on Calperum station (33\(^{\circ }\)44\(^\prime\)49\(^{\prime \prime }\)S, 140\(^{\circ }\)52\(^\prime\)22\(^{\prime \prime }\)E) (Fig. 1). It is one of the supersites of the Australian Terrestrial Ecosystem Research Network (TERN). TERN is Australia’s land ecosystem observatory, where key terrestrial ecosystem attributes over time are measured from continental scale to field sites at hundreds of representative locations [37]. This site was chosen due to the availability of a published empirical fire spread model for the fuel type at the site (Section 2.6) and high-resolution remote sensing data (Section 2.2). The Calperum area is important because of presence of several threatened bird species, wetlands adjacent to the Murray river and intact Mallee vegetation [38].

Fig. 1
figure 1

Study area. a Location of the study area in southeast of the state of South Australia. b Vegetation present in the study area (Source: https://supersites.tern.org.au/supersites/clpm). c Plot design used for the study area showing the location of 10 ignition points in red superimposed on a canopy height map. The wind direction at each ignition point is shown as black arrows. The points were chosen to be 1.5 km from the centre of the area and at 36\(^\circ\) intervals from each other

The vegetation in the area is dominated by eucalypt trees of four species (Eucalyptus dumosa, Eucalyptus incrassata, Eucalyptus oleosa and Eucalyptus socialis). Their distribution is influenced by a number of factors, including rainfall, soil type and fire history, at different spatial and temporal scales. These multi-stem trees are typically sparsely distributed across the landscape and grow between 3.5 and 7 m [38]. Eremophila, Hakea, Olearia, Senna and Melaleuca genera are sparsely distributed in the mid-storey. Across the site hummock and tussock grasses are also present as an understorey.

The climate is semi-arid and characterised by extremely hot, dry summers and cool winters. The mean annual rainfall ranges between 200 and 340 mm (annual extremes are 72–694 mm). Daily maximum temperatures can reach up to 40 \(^\circ\)C in the summer and winters are generally dry and mild with an average maximum of 16 \(^\circ\)C. The landscape is an extensive plain with the River Murray, its anabranches and floodplain wetlands forming the major topographic feature [38].

The central and southeast areas of the site are characterised by sand dune and swale systems that run approximately east-west, with undulation from swale to crest up to 8 m in elevation [38]. Dunes support a canopy of Mallee-heath with a Triodia understorey [39]. These areas present the greatest fire risk in Mallee-heath systems due to the extreme flammability of Triodia. This risk increases with seasonal conditions such as above-normal rainfall [1]. A strong response of ephemeral herbs and grasses to wet years creates continuous fuel conditions which can lead to extensive fires in this vegetation [6, 40]. Essentially, fire behaviour in Mallee-heath is a function of complex interactions between vegetation types and the seasonal conditions.

2.2 ALS Data and Pre-processing

The ALS data used in this study was obtained from TERN AusCover [37]. A Riegl LMS-Q560 full-waveform laser scanner was flown at a height of 300 m above ground level to capture small footprint data over 5 km \(\times\) 5 km study site in February 2012. Data was captured using regular north-south oriented flight lines with a spacing of approximately 125 m. The swath overlap between flight lines was 50%. The scanner has a beam divergence of 0.5 mrad resulting in a laser footprint of 30 cm on the ground. The scan angle was limited to -22.5\(^\circ\) to +22.5\(^\circ\) in the across-track direction resulting in 45\(^\circ\) field-of-view.

The full waveform data was processed by the data supplier to produce a discrete return point cloud. This was achieved using Riegel RiAnalyze (version 4.1.2) to identify discrete peaks in the back-scattered signal using a Gaussian Pulse Estimation technique [41], resulting in a maximum of 7 returns per output pulse.

Discrete returns were classified into either ground or non-ground using the Cloth Simulation Filter (CSF). A complete description of the CSF algorithm and the various parameters can be found in [42]. In brief, this filter identifies points that are most likely belonging to the ground through the simulation of a rigid cloth draped over the point set. The filter is parameterised using class threshold, resolution, time step and rigidness parameters. In this case values of 0.02, 5, 2 and 3 were applied for class threshold, resolution, time step and rigidness respectively. Once the ground points were identified, linear interpolation was used to generate a 1 m resolution Digital Elevation Model (DEM). Subsequently, the Above Ground Height (AGH) of all non-ground points was calculated by subtracting the value of the DEM elevation at each horizontal location.

2.3 Canopy Metrics Derivation

The fire model described in Section 2.6 requires both canopy height and canopy cover as input. Both canopy cover and height metrics were derived at 10 m resolution grid. A scale of 10 m was used as Taneja et al. [43] showed that both canopy height and cover are spatially correlated up this distance in Mallee vegetation. Canopy height was calculated as the 95th percentile of the AGH of the non-ground points falling within each cell. Mahoney et al. [44] suggest that 95th percentile is the logical height percentile for validating Australia-wide predictions of canopy height. Furthermore, the typical accuracies associated with ALS height information with respect to ground estimates, 95th percentile is assumed to be an adequate estimate of true canopy height across Australia [45]. Canopy cover was calculated using the approaches described in the following section.

2.4 Canopy Cover Calculation

Five different methods were used to calculate canopy cover from the ALS data. These were:

  1. 1.

    First Cover Index (FCI)

  2. 2.

    Last Cover Index (LCI)

  3. 3.

    Solberg Cover Index (SCI)

  4. 4.

    Alpha Cover Index (ACI)

  5. 5.

    Grid Cover Index (GCI)

The details of these methods are described in the following section. These can be broadly grouped into two categories: (1) return-based metrics (FCI, SCI and LCI), where canopy cover is based on the height and type of return (i.e. first, only or last return from an outgoing pulse) and (2) geometric-based metrics (ACI and GCI) where canopy cover is based on the geometry of the returns above a certain height. In all cases we consider canopy returns to be any returns that occur greater than 1.35 m above the ground (Fig. 2), as this height has been shown to efficiently separate the tree crowns from the understorey and ground vegetation [46].

Fig. 2
figure 2

Diagram showing how returns used in the return-based cover estimates are generated from ALS pulses. The blue line shows a pulse that has been partially intercepted by the canopy. Generating a \(First_{canopy}\) return (used in the First Canopy index (FCI), this is also a \(First_{all}\) return used in the Solberg Canopy Index (SCI)), a \(Last_{canopy}\) return (used in the Last Canopy Index (LCI)), several intermediate returns and a \(Last_{all}\) return (used in LCI and SCI). The red line shows a pulse that was not intercepted by the canopy, which generates \(First_{all}\) (used in FCI) and \(Last_{all}\) (used in LCI and SCI) returns

2.4.1 Return-Based Metrics of Canopy Cover

Three return-based indices utilise the properties of laser pulses to determine a measure of canopy cover. These laser pulses are emitted from the ALS system and reflect from objects both on and above the ground surface including trees, other vegetation (grasses and shrubs), buildings and the ground. Pulses passing through the environment may be partially or fully intercepted by any objects it encounters. When only one return is triggered per pulse this is termed as single return, whereas multiple returns occur when several objects reflect enough light back to the sensor from a single pulse. The returns from a multiple-return pulse can be categorised as first returns, intermediate returns and last returns as shown in Fig. 2. For example a pulse may reflect from upper canopy first (first return), then the lower tree canopy, stems, understorey vegetation (intermediate returns) as finally the ground (last return), resulting in multiple measurable returns from that pulse.

The first return laser pulse is the most significant return and will be associated with the highest feature in the landscape like a treetop as shown in Fig. 2. The first return can also represent the ground in an area which has no vegetation or any other feature. In this case only single return will be detected by the ALS system. It is important to note that the last return will not always be from a ground return. For example, when a pulse hits a thick branch on its way to the ground and the pulse does not actually reach the ground, the last return is not from the ground but from the branch that reflected the entire laser pulse. The intermediate returns, in general, provide an indication of vegetation structure.

The first method, the First Cover Index (FCI) described in Morsdorf et al. [47], uses the proportion of first (\(First_{canopy}\)) and single (\(Single_{canopy}\)) returns from the canopy to the total number of first (\(First_{all}\)) and single (\(Single_{all}\)) returns within an area to provide an estimate of canopy cover as follows:

$$\begin{aligned} FCI = \frac{\sum {Single_{canopy}} + \sum {First_{canopy}}}{\sum {Single_{all}} + \sum {First_{all}}} . \end{aligned}$$
(1)

The second method, the Last Cover Index (LCI) given in Korhonen et al. [21] is based on the proportion of last return (\(Last_{Canopy}\)) and single (\(Single_{canop y}\)) returns from the canopy to the total number of last (\(Last_{all}\)) and single (\(Single_{all}\)) returns within an area, to provide an estimate of canopy cover as follows:

$$\begin{aligned} LCI = \frac{\sum {Single_{canopy}} + \sum {Last_{canopy}}}{\sum {Single_{all}} + \sum {Last_{all}}} . \end{aligned}$$
(2)

The third method, the Solberg’s Cover Index (SCI) given in Solberg et al. [48], which takes into account both single first and last returns from the canopy and the ground as follows:

$$\begin{aligned} SCI = 1 - \frac{\sum {Single_{ground}} + 0.5(\sum {First_{ground}} + \sum {Last_{ground}})}{\sum {Single_{all}} + 0.5 (\sum {First_{all}} + \sum {Last_{all}})}. \end{aligned}$$
(3)

The coefficient 0.5 is a weight for pulses that produce both first and last echoes, and it can be adjusted if necessary [48]. For instance, a pulse that produces both canopy-first and ground-last echoes is assigned penetration of 0.5. For a single ground echo the penetration is 1, and for a single canopy echo it is 0.

2.4.2 Geometric-Based Metrics of Canopy Cover

The grid cover index (GCI) introduced by Korhonen et al. [21] uses the vertical projection of cover points defined across a 2D binary grid. Following Korhonen et al. [21] a binary 0.1 m resolution grid was created within cells containing a canopy return given a value of 1, whilst others were labeled with 0. Morphological closing is then applied to the canopy map to remove small gaps in the data. This is followed by morphological opening which allows any unnecessary detail and outliers to be removed. Cover is then calculated as the percentage of 1 m cells within each 10 m grid cell area containing labeled with a 1.

The alpha cover index (ACI) applied to ALS data [34] uses alpha shapes to map the crown area based on the 2D projection of crown strikes. The parameter \(\alpha\) is used to tune the "tightness" of the shape around the points. For a very large value of \(\alpha\), the shape is equivalent to the convex hull. For a very small value of \(\alpha\), the \(\alpha\)-shape forms holes and pockets with the shape clustering around the original points. Cover is then calculated as the percentage of area of the 10 m grid cell covered by an alpha shape. To accurately capture the variations within the crown and eliminate any unnecessary noise, \(\alpha\) was chosen as a function of the point density. As per the recommendation from [34], value of \(\alpha\) was chosen to be 1 m and 0.5 m for this study. ACI1 represents alpha cover index with \(\alpha\) as 1 m and ACI05 represents alpha cover index with \(\alpha\) as 0.5 m.

2.5 Ground Estimates of Canopy Cover

The ground estimates of canopy cover were downloaded from the TERN data portal [49] and compared with cover estimates derived from ALS data. These estimates were collected following the TERN supersite monitoring protocols in May 2012.

The ground estimates of canopy cover were obtained using Digital Canopy Photography method (DCP) [50, 51] for a 100 by 100 m area immediately to the North-West of the flux tower present in the study area. Photographs were taken at the intercepts along ten 100 m east-west transects with transects at 10 m North-South intervals [49]. This approach was used to characterise canopy cover for a 1 ha area. At each point a single upward looking digital image was captured. The image was then classified into canopy and sky pixels. Sky pixels were further classified as those occurring between crown elements and those occurring outside of the crown area. Canopy cover (fc) is defined as the fraction of canopy pixels including sky pixels that lie in between small crown gaps [50].

Ground observation data did not contain accurate location coordinates therefore preventing direct comparison with ALS data. In lieu of direct comparison, a comparison between the properties of the population of canopy cover estimates from the field with ALS based estimates within the 140 \(\times\) 140 m area directly North-West of the flux tower. A two-sample t-test was then made between each set of ALS metrics and the ground estimates to determine if mean of ground estimates is same or significantly different from mean of each ALS metrics.

2.6 Fire Spread Model

Fire behaviour in semi-arid Mallee-heath is modelled by an empirical Mallee-heath fire spread model developed by Cruz et al. [1]. This model predicts the likelihood of fire propagation, type of fire (surface or crown fire), and forward rate of spread. The probability of successful fire spread (\(P_s\)) is given by

$$\begin{aligned} P_s = \frac{1}{1+exp[-(14.62+0.207 U_{10} - 1.872 MC - 0.304 Cov_o)]}, \end{aligned}$$
(4)

where \(U_{10}\) is 10-m wind speed (kmh\(^{-1}\)), \(Cov_o\) is the canopy cover percentage and MC is the moisture content (%) of the dead litter fuels given by [1]:

$$\begin{aligned} MC = 4.79 + 0.173 RH - 0.1(T - 25) - \Delta (0.027 RH) , \end{aligned}$$
(5)

where RH is the relative humidity (%), T is the air temperature (\(^\circ\)C) and \(\Delta\) is solar radiation variable. Note Eq. (5) only holds during daytime hours (\(\Delta = 1\)) and 0 otherwise.

If \(P_s < 0.5\), then fire will be self-extinguishing. If \(P_s \ge 0.5\), then fire will result in sustained fire spread and for spreading fires, the probability of crown fire propagation (\(P_c\)) is given by

$$\begin{aligned} P_c = \frac{1}{1+exp[-(11.138+1.4054 U_{10} - 3.4217MC)]} . \end{aligned}$$
(6)

Rather than assuming the fire is a surface or crown, the model uses a probability of crowning, \(P_c\), to determine overall rate of spread, R, (mmin\(^{-1}\)) is given by

$$\begin{aligned} R = (1-P_c) \dot{R}_s + P_c \dot{R}_c, \end{aligned}$$
(7)

where \(\dot{R}_s\) and \(\dot{R}_c\) are rates of spread of surface fire and crown fire respectively. \(\dot{R}_s\) (mmin\(^{-1}\)) is given by

$$\begin{aligned} \dot{R}_s = 3.337 \; U_{10}\;exp (-0.1284\;MC) \; H^{-0.7073}, \end{aligned}$$
(8)

where H is canopy height (m) and \(R_c\) (mmin\(^{-1}\)) is given by

$$\begin{aligned} \dot{R}_c = 9.5751 \; U_{10}\;exp(-0.1795\;MC) \; (Cov_o/100)^{0.3589}. \end{aligned}$$
(9)

Noting that from Eq. (8) the rate of spread of surface fire is independent of canopy cover.

2.7 Fire Spread Modelling Environment and Parameterisation

The ‘Spark’ wildfire simulation framework was used for modelling hypothetical wildfires over our study area using the Mallee-heath rate-of-spread model [52]. Multiple rate-of-spread models can be implemented in Spark using user-defined scripts based on the different fuel types and conditions required for each model [53]. The framework supports standard geospatial data types for fuel layers and meteorological conditions. The system requires the ignition conditions of the fire in the form of points, lines or polygons. Typically, topography information is also required as the speed of a fire is dependent on the slope of the terrain. However, topography was held constant for the simulations as the focus of this study is to determine the impact of canopy cover calculations on the fire spread model.

The area for which the fire could be modelled was limited by the extent of the airborne ALS capture (5 km \(\times\) 5 km). As such, and in order to remove any spatial bias in the fuel distribution pattern (e.g. horizontal striations), 10 ignition points were distributed evenly (at 36\(^{\circ }\) intervals) around the edges of a 1.5 km radius circle centred within the data capture as shown in Fig. 1. At each ignition point, the wind direction was set such that the fire would pass through the centre of the study area. This allowed for a model duration of 1 h and ensured each simulated fire extent remained within the footprint of the ALS data. Furthermore, this also served to remove any directional bias in the results as the fires originated from different start points spread equidistant around the study area for each of the 10 simulations.

To determine the effect of cover calculations on fire spread model, other parameters such as weather and topography affecting fire propagation, were held constant for each simulation. Empirical models developed from experimental data with sound functional forms have been shown to work well for wildfire situations where the environmental drivers are well over the range of the experimental datasets [54]. In this study we simulate fire under following fire weather conditions: wind speed was set to 30 kmh\(^{-1}\), temperature was fixed at 25 \(^{\circ }\)C and relative humidity was set to 10%. Under these conditions fire can transition from surface to crown in this fuel type [1].

The Spark simulations were run for each canopy cover grid as derived from each of the different canopy cover calculation methods, at each ignition point. The raster resolution of the model simulations was set to 1 m \(\times\) 1 m, with all input and output raster layers resampled to this resolution. Once the simulated fire reached the cell’s centroid, the cell was considered ignited (and burned by the end of the simulation) and the current time was recorded as the arrival time of fire at that particular location. After the simulation was complete, isochrones denoting fire spread over time were generated.

2.8 Fire Behaviour Metrics

For this study, all simulations were summarised by the total area burned by fire and the unburned area ratio. Area burned is a commonly used metric when fire behaviour and effects are being examined. It has been used in addressing many ecological and Earth science challenges, including characterising wildfires and evaluating their impacts [55,56,57]. However, area alone often ignores the existence of unburned area within a fire perimeter (depending on the level of detail in the observational data); discriminating between burned/unburned area is an important component of the burn mosaic [58,59,60]. The unburned area ratio is defined as the ratio between the unburned area within the fire perimeter and the total area of the fire perimeter. The fire perimeter was defined by an alpha shape (alpha=1.5) created around the centre point of all burned cells and includes unburned areas inside this polygon. The area burned was estimated as the summed area of the grid cells burned at the end of the simulation.

For operationally predictive fire behaviour model, it is assumed that fuels are uniform and continuous for the period of application of the model [61]. A simulation was conducted for a homogeneous fuel distribution, in which canopy cover percentage and height were assumed to be constant. The canopy cover varies between coastal shrublands to sparse heathlands (10–30%) in semi-arid environments Cruz et al. [61]. Furthermore, whilst evaluating the impact of wind speed and moisture content on the likelihood of fire prediction in semi-arid Mallee-heath model, Cruz et al. [61] assumed a canopy cover of 33%. With this in mind, canopy cover for homogeneous fuel distribution case was fixed at 30%. Canopy height was calculated by taking the average of canopy heights calculated as the 95th percentile of the AGH of the non-ground points falling within 10-m cell. These values have been selected to be representative of the fuel conditions only. The outcome of this simulation (referred to as ‘hom’ throughout this study) serves as a point of inter-comparison between the outcomes of simulations using the different cover methods.

The Jaccard Similarity Index (J) was used to compare the area burned for each simulation with area burned for homogeneous landscape simulation. Several studies have used J to compare actual fire events with simulated fires [62]. This index simply expresses the proportion of burned cells common between two fire simulations and is given by

$$\begin{aligned} Jaccard\;Similarity\;Index\;(J) = \frac{\vert A \cap B \vert }{ \vert A \cup B \vert }, \end{aligned}$$
(10)

where A = number of grid cells burned in simulations running with different cover methods and B = number of grid cells burned in homogeneous landscape.

3 Results

3.1 Canopy Cover Estimates

In comparison to the ground based field estimates only the ACI1 method was found to produce similar estimates within the 1 ha area (at a 5% significance level) (Fig. 3b). Within this area, the mean of the ground estimates of canopy cover, fc, was 54% ± 24%. In comparison, ACI1 estimated a mean cover of 51% ± 16%. The null hypothesis of the two-sample t-test at the 5% significance level was rejected for all ALS based cover calculation approaches. The grid-based approach overestimated the mean canopy cover in 1 ha area, whilst ACI05 and the return-based approaches underestimated canopy cover (Fig. 3b).

The 5 methods for deriving canopy cover from ALS data produced different estimates across the 5 km \(\times\) 5 km study area (Fig. 3a). Geometric-based canopy cover estimates were found to be greater than the return-based canopy cover estimates. With the GCI metric showing the highest mean cover estimate of 60.26% and the LCI metric shows the smallest mean cover estimate of 2.77% (Fig. 3). Geometric-based methods also show greater variation in cover estimates when compared to the return-based methods (reported in Table 1). The GCI is the only metric which reported maximum canopy cover percent (100%) (i.e. cells covered completely with vegetation). The canopy cover estimates from the LCI metric were found to be least correlated with other cover metrics. The correlation between different cover metrics with each other is reported in Table 1.

Fig. 3
figure 3

Boxplot showing canopy cover estimates obtained from different cover methods for (a) 5 km \(\times\) 5 km study area and (b) 1 ha area. Canopy cover estimates from ground estimates are shown as fc

Table 1 Summary statistics from the 10 simulations of canopy cover percentage and coefficient of correlation for the different methods to derive canopy cover from ALS data for 5 km \(\times\) 5 km study area

Despite the differences in magnitude of estimated cover the spatial distribution of canopy cover calculated using different methods is similar, as shown in Fig. 4. Fire barriers such as roads are distinguishable in geometric-based methods (Fig. 4a–c) as compared to return-based methods (Fig. 4h–j).

In general, the trends shown by different cover estimates for 1ha area and 5 km \(\times\) 5 km study area are same (Fig. 3a, b). Thus, we can hypothesise that these results can be representative of vegetation present in this ecosystem.

Fig. 4
figure 4

Canopy cover maps produced from ALS data at 10 m grid resolution over 5 km \(\times\) 5 km study area. Parts (d, e, f, k, l and m) show the zoomed-in portion (white rectangle) of a GCI, b ACI1, c ACI05, h FCI, i SCI and j LCI cover estimates obtained from different algorithms

3.2 Effect of Canopy Cover Methods on Fire Simulations

The predicted fires from the model were affected by the methods used to calculate canopy cover estimates. Simulations running with return-based cover estimates show slow progression of fire as compared to the simulations running with geometric-based cover estimates. In general, simulations running with single canopy cover method and for all ignition points reported similar predictions irrespective of the wind direction and ignition point combination as shown in Fig. 1. This was observed for each of the canopy cover estimate method, and ignition point and wind direction combinations.

Figure 5 shows the progression of the fire over time for the canopy cover grids calculated using different methods at ignition point 1. Result (total area burned and unburned area ratio within the fire perimeter) from this ignition point follows the similar trend to all other ignition points figures provided in the Appendix. The patterns of burned area indicated in Fig. 5 are similar across all methods except LCI. A similar pattern for ratio of unburned area to total area in fire perimeter was observed. Notably each method showed the roads within the study area (seen as a linear feature at approximately half way through the burn) acting to slow the fire and resulting in areas being unburned. In return-based methods, the road appears to have a greater effect on the FCI method, despite a similar burned area estimate to that of SCI for all ignition points (#1 and ignition points in the Appendix).

Fig. 5
figure 5

Isochrones and arrival time plot for simulations from fire spread model at ignition point 1 using different cover methods over a duration of 1 h. Black curve in each plot shows the fire simulated for a constant landscape. Whole area within black ellipse was burned. Grey contours shown in each plot represent the isochrones at 6-min intervals (10 increments in total for each plot). North arrow shows the actual north direction

The progression of a sustained surface fire was observed for simulations running with geometric-based cover estimates and the return-based methods FCI and SCI at all ignition points (Fig. 5 and figures in the Appendix). This is also evident in the probability maps shown in Fig. 6. These maps were generated using Eq. (4) and show the probability of surface fire for each cover method. Figure 6 shows that more than 80% of the area is predicted to carry (probability > 50%) a surface fire in simulations running using geometric-based cover estimates and the return-based methods FCI and SCI. Such conditions could result in a transition of surface fire to crown fires in this fuel type [1].

Simulations running with return-based cover estimates also show progression of sustained surface fire in some areas except the one running with LCI cover estimates (Fig. 5). Model simulations running with LCI cover estimates result in slowing down or extinguishing of the fire spread. Failure to spread surface fire using LCI cover estimates is evident in probability maps (Fig. 6j) where only 8% of the area is predicted for surface fire.

From Table 2, it can be seen that J (= 0.84) is highest for simulations running with ACI1 metric, indicating that fires simulated with ACI1 metric become increasingly similar to the fire simulated for homogeneous landscape. Fire simulations with the GCI approach result in a greater burn area and a lower J (Table 2). Whilst all return-based methods result in a lower burn area also resulting in a lower value of J. In general, once the fire had reached the same extent across the methods the shape of the burned area was similar, meaning that the interpretation of the Jaccard Index mostly relates to area burned in the simulated time.

Table 2 Summary results showing Mean (\(\mu\)) and standard deviation (\(\sigma\)) of burned area, unburned area ratio and Jaccard similarity index (J) for different cover methods aggregated for all ignition points
Fig. 6
figure 6

Maps showing probability of surface fire over 5 km \(\times\) 5 km study area. Zoomed-in portion (white rectangle) in parts (d, e, f, k, l and m) show probability of surface fire predicted from Mallee-heath fire spread model by using cover estimates obtained from a GCI, b ACI1, c ACI05, h FCI, i SCI and j LCI methods. Blue represent ‘self extinguishing/no-go’, white represent transition from self-extinguishing to sustained fire spread and red represent sustained fire spread conditions

3.3 Modelled Burned Area

The total area simulated to be burned by fire is strongly affected by the method used to calculate canopy cover estimates. This is seen in Fig. 7 which shows the total area burned by fire over a period of 60 min for each canopy cover grid obtained from different methods (also reported in Table 2) aggregated for all 10 ignition points, where the shading shows the 95% confidence interval. Simulations running with return-based cover estimates show less area covered by progression of fire as compared to the simulations run with geometric-based cover estimates. The cover estimates based on grid-based method show maximum mean area (419.60 ha) burned by fire and cover estimates based on LCI method show minimum mean area (29.89 ha) burned by fire. The confidence band is narrow for return-based cover estimates as compared to geometric-based cover estimates, although width of the band, representing the variation in the burned area, increases with time for all cover calculation algorithms. The area burned by fire simulated with ACI1 cover estimates is close to the area burned by a fire for homogeneous landscape (Fig. 7).

Fig. 7
figure 7

Area burned by fire simulated with canopy cover percentage grids obtained from different canopy cover methods over an hour. ‘hom’ shows area burned by fire for homogeneous landscape. Shaded area shows 95% confidence interval for total burned area

3.4 Unburned Area Ratio

The ratio of unburned area to total area in fire perimeter, aggregated for all of the ignition points (also reported in Table 2) is strongly affected by different cover calculation algorithms (Fig. 8). Simulations running with LCI method show maximum mean unburned area ratio and it could be attributed to the fact that roads were not encountered due to the low modelled fire extent. This ratio shows minimum variation in simulations running with grid-based methods. These minimum variations could be linked to the fact that irrespective of wind direction, simulations running with GCI method show maximum burned area for all ignition points.

Fig. 8
figure 8

Boxplot showing the ratio between the unburned area within the fire perimeter and the total area within the fire perimeter for all cover methods

4 Discussion

Recent studies have shown the importance of incorporating high spatial resolution fuel datasets [43] and high-resolution fuel fidelity and heterogeneity information [4] to capture local features such as fire and fuel break that improve fire behaviour forecasts. However, to further improve prediction of fire behaviour it is important to understand how the precision and accuracy of different methods to estimate fuel input such as canopy cover impact the output from the fire behaviour models. In this study, we used five different published methods for deriving canopy cover estimates from ALS data. Our results indicate that different methods produce different canopy cover estimates in the Mallee-heath environment and in turn affect the outcomes of fire simulations. Primarily, estimates of canopy cover from return-based algorithms were found to be lower than the estimates from geometric-based algorithms. However, geometric-based estimates, particularly using the ACI1 approach, were found to be more similar to field based estimates of canopy cover in comparison to the return-based estimates. These results are consistent with results found in literature in other forest types [21, 34].

The geometric-based methods produce canopy cover estimates based on the 3D vegetation geometry represented within the point cloud and take into account all returns above 1.35 m rather than the complex interactions between the laser and the canopy. These methods do, however, make use of parameters which can result in different canopy cover estimates. These parameters are selected to account for variations in the pulse density of the ALS data. For example, a lower value of \(\alpha\) in the ACI method will result in more canopy gaps and therefore a lower estimate of canopy cover. This will be exacerbated when the pulse density of the ALS data is lower than the chosen value of \(\alpha\). In this case using an alpha value of 1 provided the similar estimates of cover to an infield digital canopy photography approach. Similarly, the GCI method relies on the resolution of the initial grid (in this case 1 \(\times\) 1 m) and the size and shape of the kernel used in the opening and closing process. In this case, gaps which could be true canopy gaps or a result of low pulse density, are often neglected if they approach the grid size.

Of all the return-based canopy cover methods, the LCI method produces the lowest canopy cover estimates. As for all canopy cover metrics derived from ALS, it is a function of the complex interactions between canopy properties (i.e. canopy vegetation density, canopy length) and how returns are triggered in the scanner (i.e. the time between returns and amount of energy required to trigger a return). Within the Mallee-heath, trees are multi-stemmed and sparsely distributed across the landscape. The multi-stemmed structure of the trees contributes towards a number of last returns, with other last returns coming from the hummock and tussock grasses present in this understorey layer, or the ground (Fig. 2). In such environments, most of the LiDAR pulses are expected to be able to penetrate through the canopy and reach the understorey components and/or the ground directly without interacting in intermediate or upper canopy components which contribute almost negligible last returns in canopy as shown in Fig. 2 within this study explaining the low canopy cover estimates produced using LCI.

In landscapes such as the Mallee-heath, which consist largely of discontinuous fuels in surface and canopy layers, with an absence of elevated and mid-storey fuel layers, the method used to compute canopy cover can result in significantly different fire behaviours observed, and this is shown in results of this study. For example: simulations running with LCI based cover estimates show slow progression or extinguishing behaviour of fire. This is due to minimum canopy cover estimates obtained using the LCI method. In simulations running with LCI derived cover layers, local features of landscape where fuels are organised in discrete clumps separated by bare ground acted as fire breaks, effectively stopping the fire, and resulting in small burned area and large unburned area ratio as reported in Table 2. Simulations running with LCI cover estimates do not reach the road irrespective of location of ignition points shown in Figs. 9, 10, 11, 12, 13, 14, 15, 16, and 17 in the Appendix. However, simulations running with geometric-based methods show progression of sustained surface fire (Fig. 6) with maximum area burned due to large estimation of canopy cover as shown in Fig. 7. GCI cover estimates neglect canopy gaps and provide connectivity between fuel elements used in the model allowing the fire to propagate. Surface fire spread can lead to crown fire propagation and resulting high rates of spread. For some ignition points the area covered by the fire becomes patchy due to the presence of roads and discontinuities in the canopy. These act as breaks in the canopy, resulting in unburned areas.

These results have two implications for simulating fires in a forest with heterogeneous fuels using empirical modelling: (1) there can be significant differences in representing the canopy fuel as a homogeneous layer for ecosystems that include gaps; and (2) The method used to derive fuel metrics for models also influences modelled fire behaviour. In this context there is clear indication that cover metrics derived from LCI method cause the fire to effectively stop. The sensitivity of fire behaviour models to the variation in canopy cover descriptions highlighted in this study suggests the need to choose an appropriate method to derive input fuel data for fire behaviour models. Substantial gains in modelling fire behaviour could be made by incorporating a suitable method to derive inputs for fire behaviour models. Such developments could increase the application and accuracy of data-driven wildfire models [63, 64].

In addition to canopy cover estimates, the Mallee-heath empirical fire model requires other inputs that could also be derived from ALS data, such as canopy height and terrain. Careful consideration of how these metrics are calculated may also be required. For example, methods to calculate canopy height may include the percentile height, Lorey’s height, maximum height and mean height of points within an area. These will result in difference in height values both in absolute (across the modelled area) and relative (between nearby cells) that may also result in modelled differences in fire spread. Furthermore, the resolution at which inputs for fire models are calculated [43] and terrain properties that may affect fire behaviour and predicting models derived from geospatial data.

Canopy cover metrics based on different methods have been shown in this study to result in different results when modelling fire behaviour, and it is conceivable that different vegetation types may also produce different results depending on the canopy cover method used. Consistent documentation of methods is recommended to prepare correct inputs for fire simulation models. Doing this would enable fire behaviour phenomena to be documented and analysed, and ensures that results are reproducible. Ideally, the method used to derive the cover metrics in a particular environment should be tested with field measurements to ensure the accuracy of observations and their description of structure, before using them in a fire behaviour model. The choice of method used should be guided by assessing the relative accuracy of derived data against reference data collected (using similar the infield approaches which were employed to measure the variable when developing the empirical model). For example, if a model was developed based on field measures of LAI or angular canopy cover, using vertical canopy cover derived from the ALS data as an input to fire model will not result in consistent results. As such it is important that the methods used to collect and describe fuel when developing a model are fully described and remotely sensed metrics are related to that physical attribute before using them as input into the model.

5 Conclusions

Empirical fire models are routinely used in fire management operations to predict fire spread and associated risks, and these models require fuel data. Whilst remote sensing approaches such as ALS offer opportunities to provide input into these models, there are currently a wide variety of methods used to derive estimates of fuel attributes from such data. This study assessed the impact of different methods to calculate canopy cover estimates on the performance of an empirical fire behaviour model in a Mallee-heath. Our results show that different algorithms to derive cover from ALS data produce different canopy cover estimates and, in turn, different fire behaviour predictions (including extent and pattern). Simulations running with geometric-based cover estimates resulted in higher estimates of burned area and maximum number of unburned patches as compared to simulations running with return-based cover estimates. Grid-based cover estimates neglect canopy gaps and provide connectivity between fuel elements used in the model allowing the fire to propagate. When the canopy cover estimate was lower (for example in using the return-based approach) features such as gaps and patchy data in the fuel led to the predicted fire simulations being slowed or stopped. These results demonstrate that methods for determining fuel inputs into fire behaviour models should be chosen based on the properties of the data, the vegetation structure present in an area and how the input was determined in developing the model. Without this level of rigour, the use of remote sensing data as inputs into a fire behaviour model will likely not provide the expected results or performance from that model.