Introduction

Knowledge about the above-ground volume of single trees and complete forest stands is of high importance as basis for multiple decisions for management as well as for varied scientific questions. For example, volume estimates of stands can be used to calculate thinning intensities and sustainable harvesting approaches. In recent years, especially in the context of climate change mitigation, information about biomass-derived sequestered carbon has gained more and more attention. Both volume and biomass are traditionally determined by destructive sampling of whole trees or parts of trees (Seifert and Seifert 2014). These methods, however, are very laborious, time-consuming, costly and inherent of their destructive nature, render repeated measurements over time impossible. A non-destructive method for estimating tree biomass is the use of species-specific allometric functions that are based on simple geometric measurements such as tree diameter at breast height (DBH) and tree height (Picard et al. 2012; Roxburgh et al. 2015; Paul et al. 2016). These functions, however, are still usually parameterised based on measurements of destructively harvested trees. As such, they are only able to provide reliable estimates within clear boundaries: (1) comparability of site and stand conditions (such as climate, soil, stand density, and species composition) between the sample and the subject trees, (2) the estimated trees need to be within the calibration range according to the dimensions of the predictor variable and must be of the same species and (3) the estimated trees shall not be of an atypical growth form or with defects such as with a low fork or with a hollow trunk. A recent study by Calders et al. (2022) has shown how neglecting the underlying assumptions of allometric functions can lead to substantial bias in the estimation of above-ground biomass in forests. Furthermore, the choice of analysis methods, model selection and ignoring model assumptions can lead to inaccurate and biologically implausible allometric functions (Sileshi 2014; Picard et al. 2015). For this reason, allometric functions cannot be considered a universal tool for estimating tree volume or biomass for any tree or stand alone.

An alternative to destructive sampling and allometric modelling are the estimation of volume or biomass utilising remote sensing. Satellite data or laser scans from aerial platforms or unmanned aerial vehicles (UAVs) can be used to estimate stand volume (e.g. Maack et al. 2016; Puliti et al. 2020). However, corresponding individual tree estimates are subject to high uncertainty. One of the most promising methods for assessing the above-ground volume and several more single tree parameters is terrestrial laser scanning (TLS) (Vonderach et al. 2012; Astrup et al. 2014; Calders et al. 2015; Hackenberg et al. 2015; Raumonen et al. 2015; Sheppard et al. 2017; Disney et al. 2018; Bohn Reckziegel et al. 2022; Schindler et al. 2023a, 2023b). TLS is a LiDAR (Light Detection And Ranging) based technique that produces high-resolution 3D point clouds of trees with up to several million points per tree. Since the early 2000s, this technology has been used within forestry applications for various purposes, such as extracting forest inventory parameters (Liang et al. 2016), assessing carbon stocks (Schindler et al. 2023b), quantifying forest structure (Ehbrecht et al. 2016), estimating leaf area index and light variability (Antonarakis et al. 2010; van der Zande et al. 2010; Rosskopf et al. 2017; Bohn Reckziegel et al. 2021), measuring crown structure (Pretzsch et al. 2011), quantifying tree growth (Sheppard et al. 2017; Yrttimaa et al. 2023), classifying tree species (Haala et al. 2004; Othmani et al. 2013), assessing wood quality (Pyörälä et al. 2019), estimating leaf angles (Stovall et al. 2021) and mapping microhabitats (Frey et al. 2020) and bark surface structure (Kretschmer et al. 2013). Additionally, TLS has been utilised for ecological applications such as assessments of biodiversity (Knuff et al. 2020; Helbach et al. 2022; Rappa et al. 2022) and microclimate (Ehbrecht et al. 2019; Kolbe et al. 2022). Often, the collected 3D data are used as basis for the calculation of geometric primitives such as spheres, cones or cylinders for better handling and eased calculation. From these quantitative structural models (QSMs), geometric and volumetric information, as well as topological properties, can be derived (Raumonen et al. 2013; Hackenberg et al. 2014; Åkerblom et al. 2015). Combining the volumetric data with density and carbon content information enables the estimation of above-ground biomass and carbon stocks (Calders et al. 2015; Disney et al. 2018).

Many previous studies show the good agreement of QSM-derived volume or biomass estimates with manual measurements of stem diameters and tree height (Liang et al. 2016; Pyörälä et al. 2019), dependent on scanning technology, number of scans, species and stand structure (Brede et al. 2017). While some studies indicated an underestimation of volume or biomass (Gonzalez de Tanago et al. 2018), other studies found QSMs to overestimate these (Calders et al. 2015; Raumonen et al. 2015). Previous studies often evaluated the accuracy of their TLS derived volume or biomass estimates based on single tree or even stand level without regarding different branches and stem diameter classes separately. Recently, Demol et al. (2022) investigated the accuracy of QSM volume estimates on two trees at the level of branch detail. They found the QSMs to provide a total volume overestimation of 38% up to 52%. Branches with less than 5 cm in diameter contributed to about 80% of the respective overestimation in the study. They argue the overestimation to be mainly caused by misaligned point clouds and scattering errors due to partial returns. Nevertheless, they emphasise the need for further research to uncover the mechanisms leading to inaccuracies. In a study by Abegg et al. (2023), the effect of object diameters on tree volume estimates was investigated using simulated point clouds. They found the volume of small branches with a diameter below 7 cm to be severely overestimated and suggest that this effect might be caused by noise resulting from the scanning distance. While these small tree parts were overestimated by 110% for small trees, their volume was underestimated by 50% for larger trees, probably due to occlusion effects. Such a compensation of volume overestimation by occlusion was as well observed by Kükenbrink et al. (2021) in a TLS dataset of 55 destructively sampled urban trees. It is very likely, though, that this effect does not generalise well across different tree species and growing conditions.

Since the fitting process of QSMs is mostly data-driven and currently depends on 3D point coordinates only, the geometric primitives are fitted to all points, regardless of their plausibility (Raumonen et al. 2013; Wilkes et al. 2021; Raumonen and Åkerblom 2022). Consequently, that means that the QSMs are only as good as the point clouds they are derived from. To obtain perfect QSMs, perfect point clouds without erroneous points are a prerequisite. The quality of point clouds obtained using LiDAR devices is influenced by many factors, e.g. laser scanning technique and features of the scanned object (Boehler et al. 2003). In addition to these factors, other sources of error can occur when scanning in forest conditions, such as unfavourable weather (wind and precipitation), occlusion by plants and large scanning distances due to the size of the objects. For most of these factors, there is currently only insufficient knowledge about their impact on volume estimates and the underlying mechanisms. Most previous studies have been conducted on whole trees and under uncontrolled conditions, making it difficult to isolate the effects of the individual factors.

In conclusion, despite frequent application in various research fields and despite technological progress, there are still unexplored uncertainties and unresolved inaccuracies when using TLS and QSMs. For example, when laser beams only partially hit objects, recorded points may be shifted away from the object towards the centre of the emitted light pulse and not adequately represent the object surface (Wilkes et al. 2021). As the distance between the scanner and the object increases, the size of the laser footprint and thus the proportion of partial hits increase, while the total number of laser pulses per area decreases at the same time (Abegg et al. 2021). Partial hits and large footprints are known to affect point cloud quality, but the impact of scanning distance on volume estimates derived from TLS and QSMs is yet to be quantified in real data. In a previous study by Abegg et al. (2023), the effect of distance noise on volume estimates was quantified. However, they did not evaluate the effects as a function of scanning distance and used simulated point clouds only.

With increasing tree height, branch diameter generally decreases along the vertical axis. Branches are positioned progressively higher in the crown and more distant to the ground, and thus also from the TLS device. This leads not only to more self-occlusion by the trees themselves (Abegg et al. 2017), but also to increasing distance-dependent errors, especially for small-diameter branches (Abegg et al. 2021).

In this paper, we aim to investigate the effect of the distance between TLS scan positions and scanned branches on the QSM-based estimates of branch volume and length. We chose to use European beech (Fagus sylvatica L.) branches for our study, as it is not only one of the most important European tree species, but also has exceptionally thin branches with minimum diameters of only a few millimetres. Moreover, since European beech can grow up to around 40 m in height (Packham et al. 2012), this, in combination with the fine branching structure, is especially challenging for the TLS technique. For our experiment, ten beech branches without leaves were scanned from different distances simulating vertical height differences between scanner and branches as part of the canopy. We used leafless branches, as deciduous trees are usually scanned in winter to reduce occlusion effects arising from leaves. To exclude any disturbing weather conditions such as wind or rain, which can lead to measurement errors (Krok et al. 2020), the branches were scanned under controlled, near-optimal conditions within a sports hall. Using different sampling distances, a comparison of QSM estimates with the estimates of the best possible point clouds obtained by multiple close-range scans is given to make a solid assessment of possible errors. To exclude influencing factors such as wind and precipitation, we scanned indoors.

Our research hypotheses, formulated as alternative hypotheses, are:

  1. (1)

    With increasing distance between TLS scan position and scanned branches, cumulative branch length estimates decrease due to a lower point cloud density.

  2. (2)

    With increasing distance between TLS scan position and scanned branches, branch volume estimates increase.

Methods

TLS data collection

The ten sample branches were collected in a nearby beech-dominated forest under leafless conditions. We concentrated on the last metres of branches containing the smallest twigs. On average, the sample branches had a width of 2.1 m, a length of 2.6 m and a maximum branch diameter of 2.2 cm. The branches were secured in a vertical position using trees stands to simulate the vertical scanning direction that will usually occur when scanning tree crowns from the ground (Appendix, Fig. 7). The branches were lined up in two sets of five branches with sufficient spacing not to touch or occlude any other.

To obtain the best practicable point clouds as a benchmark dataset for later comparison, for each branch set, 12 close-range scans with a scanning distance of less than 2 m were conducted around the branches (Fig. 1, turquoise crosses). In the following step, the branches were scanned from a distance of 5–45 m at 5-m intervals. The total distances between scanner and scanned branches in the horizontal axis can be assumed to represent vertical distances (tree height) in the field. For each distance, three scans with 7 m between the scan positions were performed from a line parallel to the branch line (Fig. 1, black crosses). Laser scanning was also used to procure the benchmark data, this method was carried out instead of manual measurements, as this enabled us to process all data using the same methods to isolate the distance effect. If we were to use manually measured data, the difference between benchmark and experimental data would potentially also include differences introduced by inaccuracies stemming from the TLS/QSM approach.

Fig. 1
figure 1

Scanning setup for one set of five branches. The scanning setup shows the centres of the branches (grey circles), the scan positions from different distances (black crosses) and the 12 close-range scans around the branches that were used to obtain the benchmark data for comparison (turquoise crosses)

The scans were performed using a RIEGL VZ-400i laser scanner (RIEGL Laser Measurement Systems GmbH, Horn, Austria). This time-of-flight scanner is capable of recording multiple returns for each emitted laser pulse. Besides recording the coordinates and the reflectance, which is the range-corrected intensity, the pulse shape deviation of the received pulses is recorded. The laser beams exit the device with a diameter of 7 mm (Demol et al. 2022) and have a beam divergence of 0.35 mrad (RIEGL Laser Measurement Systems GmbH 2022). The range accuracy of this scanner is 5 mm, the range precision is 3 mm and the 3D position accuracy is 5 mm, all at a distance of 100 m (RIEGL Laser Measurement Systems GmbH 2022). We set the pulse repetition rate to 1200 kHz and the angular resolution to 0.04°. This angular resolution was chosen to represent a typical setup commonly used to scan forest environment (Abegg et al. 2021, 2023; Calders et al. 2022) sometimes even a coarser resolution is used (Lau et al. 2019).

Due to the technical requirements of the scanner, some important features of the system with increasing scanning distance can be stated. In Fig. 2a, the beam radius that is increasing from 7 mm at the device to 23 mm at a distance of 45 m can be seen as well as the key features of the device for an target area of 9 cm × 9 cm including the beam diameter, the beam distance and the number of full hits (Fig. 2b).

Fig. 2
figure 2

a Laser beam radius and diameter (mm) of the RIEGL VZ-400i depending on the distance to the scanner. b Laser beam diameter (mm) and distance between the centres of the laser beams (mm) and number of full hits on a target area of 9 cm × 9 cm with increasing scanning distance (left: 5 m, centre: 20 m, and right: 45 m), assuming a horizontal and vertical resolution of 0.04°. The red circles show the size of single laser footprints at the respective scanning distance on a 9 cm × 9 cm flat surface

Point cloud pre-processing

The scans were pre-processed with the software RiSCAN PRO v2.14.1 (RIEGL Laser Measurement Systems GmbH, Horn, Austria). The scans were registered automatically, and their registration was visually verified. RiSCAN PRO uses the internal and external position of the scanner as well as the point clouds themselves to register the scans without any targets. The point clouds were filtered for isolated points (< 5 neighbours within a 10 cm radius), as well as for points with low brightness (reflectance ≤  − 15 dB) or high pulse shape deviation (deviation ≥ 15). As suggested by Demol et al. 2022), the multi-station adjustment (MSA2) in RiSCAN PRO was performed to reduce misalignment errors. Subsequently, to preserve as much information as possible, the point clouds of the different scanning distances were combined and exported without any down-sampling. Afterwards, the point clouds were manually segmented using CloudCompare v2.11.3 (2022). From the close-range scan data serving as a benchmark, noise was removed both manually and using the “Statistical Outlier Removal” tool. The tool removed points that were located at a greater distance \(({\text{dist}}>{{\text{mean}}}_{{\text{dist}}}+2\bullet {{\text{SD}}}_{{\text{dist}}})\) from their nearest neighbours (k = 10) than specified. On average, the tool removed 16,172 points per point cloud, which corresponds to about 5%.

Reconstructing QSMs

For each branch scanned from each of the distances, several QSMs were derived using TreeQSM v2.4.1 (Raumonen et al. 2013; Raumonen and Åkerblom 2022). TreeQSM is a common tool for creating QSMs and has been validated multiple times (Calders et al. 2015; Gonzalez de Tanago et al. 2018; Kükenbrink et al. 2021). It segments point clouds into non-bifurcated segments and fits cylinders to these. With TreeQSM, it is necessary to pre-define a set of hyperparameters prior to computation. Here, the hyperparameters were optimised for each branch scanned from each distance by computing five QSMs each for 24 different hyperparameter combinations (2, 4 and 3 values for PatchDiam1, PatchDiam2Min, and PatchDiam2Max, respectively; Appendix, Table 3) defined by an in-built function recommended in the software's manual (Raumonen and Åkerblom 2022). The function selects parameter ranges based on the stem base radius and point cloud density. Since some of the point clouds did not converge, a minimum stem diameter of 1 cm was implemented in that function to prevent the selection of unrealistic hyperparameters. TreeQSM does not produce deterministic results; therefore, multiple models per point cloud must be computed to obtain robust estimates (Raumonen and Åkerblom 2022). For each branch and experimental scanning distance, 50 QSMs were computed using the respective optimised hyperparameter set that previously yielded the smallest average distance between points and cylinders (in the following “optimised”). The same procedure was conducted on the data from the close-range scans to produce the benchmarking dataset (in the following “benchmark”). A further 50 QSMs were computed for each branch and experimental scanning distance, using a set of pre-defined parameters selected based on prior experiences with TreeQSM (in the following “fixed”, PatchDiam1 = 0.05, PatchDiam2Min = 0.005, PatchDiam2Max = 0.02, BallRad1 = 0.07, BallRad2 = 0.03). For all QSMs, the minimum cylinder radius was manually set to 1 mm to account for the small minimum diameters of the sampled branches (MinCylRad = 0.001).

Statistical analysis

All statistical analyses were conducted in R v4.2.1 (R Core Team 2022). For each branch, scanning distance and hyperparameter set, the variables derived from the QSMs were averaged using the arithmetic mean. In the following, we refer to the averaged total cumulative length and volume of all cylinders per branch as “total length” and “total volume”. To analyse the effect of scanning distance on the lengths and volumes of the branches, GAMs (generalised additive models) were derived using the “mgcv” package (Wood 2011). The total length and volume per branch were modelled as functions of scanning distance using a thin plate spline and branch ID as a random effect. The benchmark data were not included in these models since the pre-processing was not consistent with the experimental data. For each hyperparameter set, a separate set of models for length and volume was modelled. To handle the heteroscedasticity of the data, which showed higher variance with increasing scanning distance, the response variables were log-transformed. The residuals of all four GAMs were successfully checked for normality both visually and using Kolmogorov–Smirnov tests. To account for the bias introduced by log-transformation, correction factors were derived and applied to the back-transformed predictions (Baskerville 1972; Sprugel 1983). Confidence intervals were obtained by bootstrapping each model 1000 times and calculating the 2.5% and the 97.5% quantiles of the respective back-transformed and bias-corrected predictions. To facilitate the interpretation of the results, the cumulative cylinder length and the volume of the branches were divided into 1 cm-diameter classes.

Results

Selected branch characteristics for the benchmark data and for the experimental scans at distances of 20 and 45 m are given in Table 1. The presented values show the arithmetic mean ± standard deviation for the ten branches. Values used for calculation are the averaged length and volume of the respective 50 QSMs per scanning distance and branch (see Sect. 2.4). Therefore, the standard deviation reflects the variance between the branches caused by different branch architectures and sizes. In general, the dataset based on fixed QSM hyperparameters leads to results closer to the benchmark results. While for the length estimates, the two parameterisations performed similarly well, the volume estimates differed considerably.

Table 1 Branch characteristics for the benchmark data and for the scans conducted at a distance of 20 m and 45 m

The point cloud resolution of the point clouds varied depending on the different scanning distances (Fig. 3). Displaying a near linear relationship between the scanning distance and the logarithm of the number of points, the number of points per branch decreased logarithmically with increasing scanning distance. At the first scanning distance of 5 m, the point clouds of the branches consisted of 27,681 points on average. At a scanning distance of 45 m, the average number of points decreased to 607. The benchmark data contained an average of 343,448 points per sample branch. As the visual inspection of Fig. 3 shows, the high quality of the benchmark data makes it possible to derive QSMs that model even fine structures such as twigs of a few millimetres in diameter or buds. However, as the scanning distance increased, fine structures could no longer be detected. At greater distances, even coarse structures were no longer fully detected, and the pattern of the laser footprint (see also Fig. 2b) became visible due to the low point cloud density (Fig. 3).

Fig. 3
figure 3

Point clouds and QSMs of one sample branch scanned from varying distances. The top row shows the point clouds, and the bottom row shows the QSMs of the fixed parameter dataset for the respective distances. The plots on the left show the benchmark data derived from 12 close-range scans, the plots in the centre show the branch scanned from a distance of 20 m, while the plots on the right show the branch scanned from 45 m distance. The points are transparent to show overlapping points. The QSMs are coloured according to branch order

The cumulative length of the branches decreased with increasing scanning distance (Fig. 4). While most of the length in the benchmark data was originally in the smallest diameter class (≤ 1 cm), the thickest diameter class shifted to higher diameters with increasing distance. Although the results are slightly different for the two different parameterisations (fixed and optimised), the overall trend and the changing distribution of the length in the diameter classes along the distances are generally similar.

Fig. 4
figure 4

Total cumulative length of all branches combined (Y-axis) depending on the scanning distance (X-axis) and the cylinder diameter classes (colour). The “BM” bar shows the benchmark data from the 12 close-range scans

The largest mean underestimation of cumulative branch length occurred at 45 m scanning distance, where the length was underestimated by 75%, regardless of the parametrisation (Table 2).

Table 2 Average length and volume under/overestimation for the different scanning distances and parametrisations

Both GAMs on cumulative branch length as a function of distance showed highly significant (p < 0.001) effects of scanning distance and branch ID. The model based on the fixed parameter dataset achieved an adjusted R2 (R2adj) of 0.98, explaining 99% of the deviance. The second model achieved a marginally lower R2adj of 0.97 and explained 98% of the deviance. In both models, the predicted length decreased almost linearly with increasing scanning distance (Fig. 5). For both the raw data and the GAMs, no major differences between the two datasets are apparent.

Fig. 5
figure 5

Regression models predicting the over/underestimation of length (left) and volume (right) of single sample branches as a function of the scanning distance. Separate models were fitted for data derived from QSMs using fixed hyperparameters (pink, solid line) and for data derived from QSMs using optimised hyperparameters (turquoise, dashed line). The shaded areas indicate the bootstrapped confidence intervals. The effect of the individual branch ID was excluded for the plots. The grey line indicates the mean of the benchmark data

The total volume of the branches generally increased with scanning distance (Fig. 6). However, the dataset based on the fixed QSM hyperparameters shows a decrease of volume at 45-m scanning distance. While the smallest diameter class (≤ 1 cm) originally accounted for most of the total volume, there is almost no volume within this diameter class at the largest scanning distance. With increasing distances, the dominant diameter class changes. In contrast to the length estimates, there are large differences between the estimates of the two parametrisations. The dataset using fixed parameters outperformed the dataset using the optimised parameters. The largest average overestimation of branch volume occurred with the optimised parameter dataset and a scanning distance of 45 m, resulting in an overestimation of 539% (see also Table 2) (Fig. 7).

Fig. 6
figure 6

Total cumulative volume of all branches (Y-axis) depending on the scanning distance (X-axis) and the cylinder diameter classes (colour). The “BM” bar shows the benchmark data from the 12 close-range scans

Similar to the previous models, the GAMs regarding branch volume show highly significant (p < 0.001) effects of scanning distance and branch ID. The GAM for the fixed parameter dataset reached an R2adj of 0.83 and explained 85% of the deviance. Here, the model for the optimised parameter dataset performed slightly better with an R2adj of 0.86, explaining 88% of the deviance. While both models show narrow confidence intervals and an almost linear relationship between estimated volume and scanning distance up to a scanning distance of 35 m, they afterwards increasingly diverge (Fig. 5). Above a distance of 35 m, the model based on the fixed parameter dataset shows decreasing volume with increasing distance, while the other model exhibits a monotonic trend of increasing volume. Furthermore, the confidence intervals expand with the scanning distance.

Discussion

In this study, we investigated the effect of scanning distance on the estimated cumulative branch length and total branch volume of ten European beech branches using TLS data and QSMs. Excluding disturbing external factors such as wind and occlusion by the study design, we could extract the influence of distance on point cloud quality. Additionally, the artificial environment with many horizontal and vertical planes facilitated the coregistration of the individual scans, reducing errors resulting from inaccurate coregistration. Given the results of our experiment, we can confirm the initial research hypotheses: As the distance between scanner and branches increases, the cumulative branch length decreases, and the total branch volume increases. While for one of the QSM parameterisations, the volume increased monotonically, the other parameterisation showed an overestimation of the volume at all scan distances, but at 35 m scanning distance, the overestimation culminated and decreased at higher distances. The GAMs for the volume estimates show that uncertainty increases with scanning distance, as shown by the broadening confidence intervals.

This study is an experiment under partially controlled conditions and therefore deviates from reality in some respects. While we depicted the branches in similar distances as in a natural environment, the change from vertical to horizontal distances that should simulate the scanning process in forest surrounding is artificial. Nevertheless, this should not alter the way; the algorithm fits cylinders into the point clouds. Probably, the biggest difference between our study and scanning in forests is the disturbance regime. In natural environments, one must expect wind, humidity and dust particles which all might cause different artefacts (Boehler et al. 2003; Krok et al. 2020). Even slight wind, which is not even detectable from below the canopy, might move branches in the upper crowns (Oliver 1971). Dust particles, insects and pollen can cause returns floating between the branches. Water drops on the TLS device can lead to erroneous data due to reflections and refractions of the outgoing and incoming laser beams (Culvenor et al. 2014; Portillo-Quintero et al. 2014). Moisture on branches might lead to missing returns, as water reduces the amount of reflected light considerably (Kaasalainen et al. 2010), if near-infrared wavelengths are used. Additionally, in our experiment, there was no occlusion by other plant material, which is one of the major problems when trying to depict trees without obstruction (Heinzel and Huber 2017; Schneider et al. 2019; Wang et al. 2019). Depending on stand diameter distribution and density, occlusion can vary and deteriorate point cloud quality accordingly (Liang et al. 2018), which can be alleviated through a well-chosen scanning setup (Abegg et al. 2017). Occlusion in complex stands is known to affect various derived tree attributes such as height and volume (Liang et al. 2018). All these environmental factors have been excluded in our experimental setup, which was chosen to isolate the distance effect from other environmental influences in the best possible way.

We chose to scan the branches from one side only, as this is the typical perspective of horizontal tree branches scanned with TLS in a real-world setting from an orthogonal perspective. During TLS campaigns, one rarely has the opportunity to scan from a tower or similar structures to capture trees from above. Conversely when using Airborne Laser Scanning (ALS) or UAV Laser Scanning (ULS), trees are usually only scanned from above, with the ground view lacking (Brede et al. 2017; Wang et al. 2019).

Another source of error is the scanning system used. According to our results, large laser footprints resulting from large distances between scanner and object lead to poor point cloud quality and consequently to poor estimates of the length and volume of fine branch structures. The system used in this study already has a relatively narrow opening angle compared to other TLS devices (Abegg et al. 2023). Furthermore, as the distance between the scanner and the object increases, gaps between individual laser footprints arise, making it more difficult to capture objects from one scanning position entirely (Fig. 2b). Although the RIEGL VZ-400i can operate at higher angular resolutions of up to 0.0007° vertically and 0.0005° horizontally (RIEGL Laser Measurement Systems GmbH 2022), these are generally not used in forestry applications for reasons of time efficiency and practicality. Scanning with higher resolution currently increases the scanning time considerably, which makes data acquisition much more time-consuming and increases the likelihood that the data are negatively affected by environmental factors such as wind. Currently, the availability and quality of LiDAR scanners, regardless of platform, are developing rapidly, and prices are falling. Compared to TLS, mobile laser scanners offer more homogeneous coverage, but also introduce another source of error in the reconstruction of the movement trajectory. ALS and ULS are known to produce lower density point clouds (Calders et al. 2020), which are therefore currently less suitable for QSM reconstruction for fine structures. For this reason, it can be assumed that TLS currently remains the most precise measurement method and that the underestimation of length and overestimation of volume could be even more serious with less precise LiDAR devices as well as with LiDAR measurements from other platforms.

During the pre-processing, we were able to avoid down-sampling, which is commonly applied but may cause additional artefacts. This technique is usually used to reduce the computational effort, which presents no issue for the small number of branches as in our experiment, but increases rapidly for whole forest stands with mature trees. Down-sampling is sometimes used to homogenise the point cloud density, which is usually heterogeneous in TLS data due to occlusion effects and the distance between laser footprints that vary with scanning distance. Since we did not encounter any occlusion and since each of the point clouds contained only points from similar distances, we assume that our point clouds nevertheless exhibit an approximately uniform point cloud density.

Our results are consistent with those of Abegg et al. (2023), who observed overestimation of small tree volume of more than 100% in a simulation study in a similar scanning environment and using the same QSM reconstruction algorithm. The study by Abegg et al. (2023) concentrates on providing more insights into different scanning systems, without focusing on the cause of the misestimations. Our study sheds light on the observation that sparse data coverage of distant objects leads to missing branch segments and that laser beam diameters exceeding branch diameters lead to a cylinder inflation effect that causes an overestimation of the volume (Demol et al. 2022; Abegg et al. 2023). This cylinder inflation effect is probably caused by partial hits of laser beams. When emitted laser pulses partially hit objects and enough light is reflected to trigger a return, the resulting points may be shifted to the centre of the laser footprint, away from the object (Wilkes et al. 2021). As the scanning distance increases, the size of the footprint increases, potentially increasing the number of partial hits and the introduction of ghost points between objects (Abegg et al. 2021). This is probably the reason why volume estimates deteriorate with increasing scanning distance.

Another factor deteriorating volume estimates is the combination of scatter noise with the lack of data for the branches on the side facing away from the scanner. Since the scanner cannot estimate the angles to the branches in the same accuracy as the distances and since the rear facing surfaces are not depicted, the points might not resemble the curved surface of the branch cylinder, but rather a flat plane. This could lead to an overestimation of cylinder radii caused by minimising the distance between the points and the cylinder surface, as the surface of a large cylinder also locally increasingly resembles a flat plane.

Nevertheless, previous studies found field-based measurements to agree well with TLS-based volume and biomass estimates (Calders et al. 2015; Gonzalez de Tanago et al. 2018). This can probably be explained by the fact that these studies investigated the volume or biomass of mature trees at the tree or stand level. We assume that the overestimation effects were cancelled out by occlusion effects which lead to an underestimation of volume which has already been observed by Kükenbrink et al. (2021). This is a plausible explanation, as occlusion is typically an important issue especially in the upper crown (Wang et al. 2019), where much of the fine branch structure is located. However, one cannot rely on the assumption that different sources of error will always compensate for each other, as this might not always be the case. Depending on the complexity of a stand, the occlusion and the resulting underestimation vary and might not coincide with the overestimation due to cylinder inflation effects.

As can be seen well in Fig. 3, the errors seen are mainly caused by insufficient data and not by the TreeQSM algorithm itself. The observation that the detection of objects with small diameters is limited and deteriorates with scanning distance is in line with the results of Abegg et al. (2021), who investigated the influence of the laser beam diameter on point cloud quality in a virtual environment. However, as can be seen in Figs. 5 and 6, the errors in the volume estimates vary considerably for different hyperparameters, with the automatically optimised parameters, leading to greater overestimation for distances of about 10 m and more. Apparently, the choice of hyperparameters is a source of uncertainty that needs to be resolved in order to obtain more robust estimates. However, since the errors are mainly attributable to the point cloud quality, there is no point in optimising the algorithm as long as the scanning procedure is not optimised and standardised. Narrower scanning patterns and smaller scanning distances could increase the accuracy of the point cloud at the expense of more labour-intensive fieldwork. However, this approach is limited as it is currently not possible to deploy a TLS close to the canopy or an ULS below the canopy. Therefore, larger scan distances, which are inevitable due to tree height, are unavoidable. The only other option seems to be the use of multi-platform approaches combining top-down and bottom-up scans to provide full coverage through close-range scans.

It should be emphasised that this study focuses on particularly small branches. Volume estimates from TLS data of trunks or lager branches might be much closer to reality (Abegg et al. 2023). Still, identifying and exploring the limitations of the technology might provide a better understanding for which questions could be useful and which applications should be avoided. Currently, we are not aware of any other solution to estimate the volume of trees without destruction, apart from allometric equations, which have their own limitations (Disney et al. 2018), or destructive sampling, which is extremely labour-intensive, as well as species- and site-specific. Therefore, it is crucial to further our understanding of this technology as it is likely to be used in the future owing to its advantages over other methods.

Outlook

It is important to note that European beech as a species is especially challenging to scan due to its very fine branch structure and its large tree heights, which can reach more than 40 m (Packham et al. 2012). However, the branches of beech trees have a relatively straight growth compared to other tree species (e.g. Quercus spp.), which is probably an advantage for reconstruction using straight cylinders. We assume that the reconstruction quality might vary considerably between different tree species due to growth differences such as minimum branch diameter, branch curvature, tree height and branch density within the crown. These aspects should be explored in future research to improve our understanding of the mechanisms that influence the accuracy of the estimates between tree species. Taking this into account, it is quite a complex task to provide a calibration to compensate for the severe biases we observe. Such a calibration would have to consider for each cylinder its distance from the scanner, its diameter and length, as well as information about the tree species and the scanning setup, including the scanning device itself as well as its placement and the surrounding environment. In summary, it would be necessary to estimate the errors separately due to the scanner limitations, the tree characteristics and the errors caused by occlusion or prevailing circumstances such as the weather. This is very challenging with a point cloud that is assembled from multiple scans. Even if distance-dependent bias was successfully quantified taking these aspects into account, efforts to derive calibration curves would be complicated by the fact that many cylinders are likely to be fitted to points acquired from different scan distances. Furthermore, correcting the diameter for the individual cylinders while greatly underestimating the branch length would still lead to incorrect estimates of the volume and vice versa.

Conclusions

Our experiment revealed severe misestimations for branch length and volume, which were biased by up to − 75% and + 539%, respectively, and deteriorated with increasing distance between the scanner and the branches. It should be mentioned that we scanned the branches under near-optimal conditions to minimise the effects of external environmental factors, as the branches were scanned under leaf-off conditions in an enclosed building. Therefore, the branch length and volume estimates are not necessarily applicable under real forest conditions where the environmental conditions will further decrease the quality of the data. As our results show, the combination of TLS and QSMs is not suitable for reconstructing fine branch structures below 3 cm in diameter even at short distances. This can mainly be attributed to insufficient data quantities due to the scanner characteristics and is most severe for thin branches. This problem could be partially solved by improved software and hardware. Using current technology, higher angular resolution can already be employed to improve data quality, but at the cost of higher operation time and arising disadvantages. Until scanners are available that offer at least an order of magnitude higher accuracy, precision and angular resolution at similar operation times, all studies and applications using this technology should be aware of these limitations and uncertainties. In future research, we need to gain a better understanding of the influences of different tree structures and environmental conditions on the reconstruction to enable a better assessment of the uncertainties associated with the use of QSMs. Better estimates could be obtainable by combining different sensors and platforms (e.g. TLS & ULS), better pre-processing algorithms that take laser properties into account (more confidence in closer and less divergent laser pulses) and partial calibrations. However, all these solutions have their own individual difficulties that must be overcome.