How to account for temporal correlations with a diagonal correlation model in a nonlinear functional model: a plane fitting with simulated and real TLS measurements

To avoid computational burden, diagonal variance covariance matrices (VCM) are preferred to describe the stochasticity of terrestrial laser scanner (TLS) measurements. This simplification neglects correlations and affects least-squares (LS) estimates that are trustworthy with minimal variance, if the correct stochastic model is used. When a linearization of the LS functional model is performed, a bias of the parameters to be estimated and their dispersions occur, which can be investigated using a second-order Taylor expansion. Both the computation of the second-order solution and the account for correlations are linked to computational burden. In this contribution, we study the impact of an enhanced stochastic model on that bias to weight the corresponding benefits against the improvements. To that aim, we model the temporal correlations of TLS measurements using the Matérn covariance function, combined with an intensity model for the variance. We study further how the scanning configuration influences the solution. Because neglecting correlations may be tempting to avoid VCM inversions and multiplications, we quantify the impact of such a reduction and propose an innovative yet simple way to account for correlations with a “diagonal VCM.” Originally developed for GPS measurements and linear LS, this model is extended and validated for TLS range and called the diagonal correlation model (DCM).


Introduction
Terrestrial laser scanners (TLS) capture a large number of three-dimensional (3D) points rapidly, with high precision and spatial resolution. The point clouds obtained can be visualized in specific commercial software or approximated with parametric models, among which are the increasingly popular and flexible B-spline curves or surfaces (e.g., Bureick et al. 2016;Koch 2010;Neitzel et al. 2019). The main advantage of B Gaël Kermarrec kermarrec@gih.uni-hannover.de Michael Lösler michael.loesler@fb1.fra-uas.de 1 point cloud modelization strategies over visualization comes from the possibility of performing rigorous test statistics, for example, for deformation analysis purposes Kermarrec et al. 2020a). The mathematical model can be simplified for basic objects, such as ellipsoids, cylinders or planes (e.g., Ahn 2004; Fang et al. 2015;Lehmann 2019). A linearized Gauss-Helmert (Lenzmann and Lenzmann 2004;Neitzel 2010) or Gauß-Markov model (GHM and GMM, respectively, see (Koch 1999) is generally used to estimate the parameters of the corresponding geometric primitives. Among them, planes have a simple geometry and are easy to manufacture and handle. For that reason, they are often used for calibration purpose, i.e., to estimate the uncertainty of the sensors or optimize measurements settings. Exemplarily, Wujanz et al. (2017Wujanz et al. ( , 2018 or Schmitz et al. (2019) derive a stochastic model for TLS range measurements using approximations of planar surfaces with different colors and material properties. Heinz et al. (2019) investigated the TLS parameter settings for optimization purposes by scanning a plane. TLS point clouds were further approximated with planes in order to improve the registration in Theiler and Schindler (2012). Plane fitting and segmentation of target surfaces are an important step in applications such as the monitoring of structures (Bolkas and Martinez 2018).
Unfortunately, a bias is introduced when the estimates of a nonlinear problem-such as a popular plane fitting-are derived by a linearized or first-order substitute problem. The second-order effect was obtained analytically by expanding the nonlinear functional model by its second-order Taylor series in Lösler et al. (2020). The authors analyzed the bias between the first-and second-order solution rigorously for a plane fitting and also studied the impact of a large number of poor measurements versus small and precise samples. The empirical results of Heinz et al. (2019) were confirmed mathematically: Both contributions highlighted the advantages of a high discretization of the point clouds regarding an increase in the signal-to-noise ratio, balancing the computation effort and the need for accurate parameter estimation. The simulated measurements in Lösler et al. (2020) were assumed to be spatially correlated by means of a rotational-invariant Taylor-Karman structured matrix (Grafarend and Schaffrin 1979). Modeling correlations as being spatial focuses on the resolution capability of TLS or their effective number of measurements (Kauker and Schwieger 2017;Schmitz et al. 2020). In this contribution, we will adopt a different approach to correlations and come back to the source of the measurements: Range from TLS-phase or time of flight-is a measure of time (Rüeger 1996); their correlations should logically be modeled as being time dependent since they are mainly linked with physical temporal effects such as laser or atmospheric noise, see Lösler et al. (2016) for investigations with total station measurements. For the sake of simplification, we will assume-and shortly justify-that angles are non-correlated and further consider that the measurements are heteroscedastic, i. e., with different variances.
Neglecting correlations in LS adjustment leads to a simplification of VCM and is known to give unrealistic and mainly underestimated a priori or a posteriori dispersion (Jäger et al. 2005, p. 214ff) see Klos et al. (2018) for GPS velocity uncertainty and Kermarrec and Schön (2014) for coordinate estimation using double differences from GPS phase observations. This discrepancy can be linked with the particular structure of the inverse of the VCM (Kermarrec and Schön 2016). Unfortunately, the VCM is involved in the computation of the aforementioned second-order solution (Lösler et al. 2020); we are entitled to think that temporal correlations will affect the corresponding bias and raise the following questions: 1. What is the impact of the correlations of TLS range measurements on the second-order solution regarding the first-order one?
2. Does the scanning configuration affect the parameter and dispersion biases? 3. Will the diagonal VCM that neglects correlations lead to results comparable to the one obtained with fully populated VCM? If not, is it possible to account for temporal correlations by an equivalent diagonal model?
By answering these questions, we aim to give the practitioner some indications about the impact of TLS range correlations on the second-order solution. Our goal is to weigh the benefits of this improved solution against the additional computational burden, in the presence of correlated noise. We will place ourselves, without loss of generality in the controlled framework of simulated TLS point clouds from a plane, which is a popular scanned object.
We intent to draw the attention of the reader to the risk of neglecting correlations in the first-or second-order solution, and propose a powerful yet simple alternative to the use of fully populated VCM in LS adjustment. To that aim, we will investigate the extent to which the DCM proposed for GPS correlations together with a strictly linear functional model can be applied to the nonlinear problem of a plane fitting from TLS measurements. This new model was said, ironically, to account for correlations in an "hidden" way (Kermarrec and Schön 2017a). Up to now it has never been introduced, tested nor validated in engineering geodesy for nonlinear functional models. The DCM avoids computational demanding matrix inversion, which is a non-negligible advantage when the second-order solution is worth computed, for example, regarding small objects and/or a high scanning rate. In this contribution, we propose to validate the use of the DCM in the light of simulations by analyzing real data from a scanned plane. This necessitates to choose a model for temporal correlations of range measurements from TLS: We will adopt to the Matérn model (Matérn 1960). Our proposal is inspired by the work of Kermarrec and Schön (2014) for GPS and based on a first investigations for TLS correlations . The Matérn covariance function is widely used in geostatistics (Gelfand et al. 2010) for modeling spatiotemporal patterns. For some applications of the model, the interested reader is refereed to Lilly et al. (2017). Its three parameters, i. e., the smoothness, the correlation parameter, and the variance, make the covariance function extremely flexible and able to model all kinds of noise, from Gaussian to flicker or random walk noise. It has been recently implemented for the analysis of GPS coordinate time series in the dedicated software Hector (Bos et al. 2012), highlighting the high interest of this specific community for this process.
The paper is organized as follows: We summarize in Sect. 2 the first-and second-order solution when a linearization of the functional model is performed. We further describe in Sect. 3 the stochastic model chosen for TLS measurements and its simplification with the DCM. In Sect. 4, we present the results derived from both simulated and real TLS measurements from a plane. Sect. 5 concludes our investigations with some recommendations on how and when to account for correlations in LS plane fitting.
which leads to the necessary condition whereÂ = A + x T H i i is the Jacobian evaluated atx (Box 1971). Substituting Eqs. 2, 3 into Eq. 9 and neglecting the quadratic term yields the Jacobian For the quadratic term, we finally obtain According to Eq. 7, the bias of the estimated parameters and the related dispersion yields where x = F e F T = A T −1 e A −1 is the well-known first-order dispersion of the parameters (Box 1971). It should be emphasized that the Hessian G is not required in explicit representation to derive the bias of the estimates.
As shown by Lösler et al. (2020), the Jacobian in Eq. 10 of an implicit functional model f (x, e) = f (u) = 0, which is solved by minimizing the Lagrangian function, is given by where B is the Jacobian of f evaluated atẽ. The first-order dispersion of the estimated parameters x and the Lagrangian k are given by (cf. Höpcke 1980, p. 161f) In some application, nullity A T N −1 A = r > 0 occurs and, therefore, the inverse in Eq. 14a does not exist. To solve the resulting ill-posed problem, r further parameter constraints are usually introduced and Eq. 14a becomes (e. g. Pázman and Denis 1999) Here, R is the Jacobian of the further parameter constraint equations and The extended Jacobian of an implicit functional model f containing r parameter constraints to eliminate the rank deficiency reads (Lösler et al. 2020) To derive the bias of the estimates having an implicit functional model, the Jacobian F u and the Hessian H u of the function f evaluated atũ T = x TẽT are substituted into Eq. 12, i. e., where u = F u e F u T . A detailed derivation can be found in Lösler et al. (2020).

Special case of a plane fitting
From now on, we will place ourselves in the specific case of estimating a plane from a TLS point cloud recorded in polar coordinates. The nonlinear implicit functional relation of a plane reads where n T = n 1 n 2 n 3 is the normalized normal vector, d is the perpendicular distance from the estimated plane to the origin of the scanner's local coordinate system. P T i,cart = X i Y i Z i is an arbitrary point of the plane. Eq. 18 can be written as n 1 X i + n 2 Y i + n 3 Z i − d = 0 by considering the constraint n T n = 1 or n 2 1 + n 2 2 + n 2 3 = 1 for the length of the normal vector.
The raw measurements of a TLS are not Cartesian but polar coordinates: The range r expressed in meter [m], a vertical and a horizontal angle called θ and φ, respectively, expressed in degree [ • ] or gradian [gon]. They are depicted for one point P i in Fig. 1. The origin of the coordinate system is the center of the TLS. The Cartesian coordinates in Eq. 18 are expressed by means of polar measurements as with P T i,pol = θ i φ i r i . The first-and second-order dispersion corresponding to a plane fitting are derived from Sect. 2.1 and described in P r φ θ TLS Fig. 1 Polar and Cartesian coordinates. The center of the TLS is the origin of the coordinate system detail by Lösler et al. (2020). The estimates and the a priori dispersions-whether first or second order-depend on the VCM e from Eq. 17. We base our investigations on the approximation that e ≈ y , where y is the VCM of the raw TLS measurements. In this contribution, we chose to model the covariance function of the measurements to set y . Other approaches than modeling are, for example, based on variance component estimation (Teunissen and Amiri-Simkooei 2008). Such iterative implementations are computational demanding and may lead to numerical inaccuracies-up to the non-symmetry of the estimated VCM. In this contribution, we avoid this drawback by following the work of Kermarrec and Schön (2014) and suppose that a physical plausible modeling of the correlation structure is available. Our results can be, thus, read from two different perspectives when the parameters of the chosen model are varied in a plausible range: How (i) an inaccurate stochastic model and/or (ii) correlations affect the biases. We will use the notation y = pol for the sake of readability.

Heteroscedasticity
The raw TLS measurements are expressed in polar coordinates, they are heteroscedastic and have different variances σ 2 r , σ 2 θ , and σ 2 φ .
1. The angle measurements are coming from the mechanical rotation of internal mirrors: It is justified to consider their correlation structure as a combination of flicker and white noise (Hooge 1994). Our first investigations with maximum likelihood estimation have confirmed this assumption and shown that the white noise component makes more than 70% of the total noise variance for the scanning scenario used in this contribution. Based on these results, we allowed ourselves to consider them as uncorrelated in a first approximation. Their variance is taken from the manufacturer's datasheet (Böhler and Marbs 2002). In this contribution, we will use the specification of the Z+F 2016 (Zoller & Fröhlich GmbH, Wangen im Allgäu, Germany) for both, the simulations and the real data analysis. 2. The range variance σ 2 r will depend on different factors such as the properties of the scanned object (e. g. color, roughness), eventually atmospheric transmission, and the range r or the scanner itself (Soudarissanane et al. 2011). This value can be constant based on the manufacturer's indications or follow empirical models. Exemplarily, Wujanz et al. (2017) modeled the point-wise range variance depending on the intensity of the reflected signal. Using the 1D mode of a TLS, a power function could be fitted to the range variance versus intensity: σ r = c int + β int I nt α int , where the parameters α int , β int and c int were determined empirically by regression analysis for different terrestrial laser scanners and I nt is the raw intensity expressed in [Inc]. Unfortunately, the intensity value may not be accessible for all TLS. In order to cope with this challenge, further investigations were carried out (e.g., Schmitz et al. 2019). In this contribution, we place ourselves in a simulated framework and vary σ 2 r independently of simulated intensity values.

Temporal correlations
As aforementioned, we consider the temporal range correlations only and neglect the correlations for angle measurements. This approximation seems justified in a first approximation and will be the topic of further investigations.

Why the Matérn model: a justification
Range measurements are a measure of time independent of the TLS under consideration (phase or time of flight) (Rüeger 1996): We will, therefore, consider the TLS range measurements as temporally correlated. Many effects act on correlating the measurements, such as atmospheric turbulence due to the propagation of the laser in a random medium (Wheelon 2001) or to the sensor itself (Llopis et al. 2011;van der Ziel 1970). Phase noise related to fluctuations of the optical phase of the output, quantum noise due to the random phase of photons added by spontaneous emission and optical fiber with scattering phenomena inside the fiber should be considered as factors acting on correlating the measurements, to name but a few.
Such noise, also called stochastic processes, is well described by power-law slopes in the frequency domain, socalled colored noises. By means of differential equations, the different sources of noise can be modeled, leading to different power laws; the resulting "global" noise is a combination of all of them. Exemplarily, a widely assumed model for GPS coordinate time series analysis is a combination of flicker or power-law noise with white noise (Bos et al. 2020). However, many noises exhibit a high-frequency slope with a low-frequency plateau. Such stationary processes are better modeled by a Matérn model (Matérn 1960;Guttorp and Gneiting 2005), which accounts for the short-range dependence-as the fractional Brownian motion (fBm, Mandelbrot and Ness (1968))-but is damped at low frequencies.
The property of the process is intuitively related to a common feature of many physical systems: a pressure to grow together with a resistance on that growth (Lilly et al. 2017); this phenomenon leads to a balance or equilibrium between the two forces. The Matérn process accounts for that effect and, as the length of the time series increases, the stochastic model is more adequately modeled with a Matérn model than a purely fractional noise (see, e.g., He et al. 2019). Such processes are widely used in spatial geostatistic (Gneiting et al. 2010;Handcock and Wallis 1994;Stein 1999). The Matérn model is less popular for modeling noise of time series, although its flexibility permits a wide range of applications.
In this contribution, we will model the temporal correlations of TLS range measurements with this process, following the work of the first author on GPS phase correlation and the strong parallel between TLS and GPS measurements and processing . First investigations based on residual analysis and the empirical mode decomposition have empirically confirmed the theoretical expectation about the noise structure of TLS range measurements (Kermarrec et al. 2020b) (please note that in Tab. 1 of this contribution, the smoothness is given by the third and not the second value; the correlation parameter is the fourth value and should be 1.18, see erratum). The authors could successfully justified the use of the Matérn model. In the next section, we propose introducing the Matérn model in detail, which is a mandatory step to understand the concepts of smoothness and correlation length and their impact on the biases defined in Sect. 2.1.

Statistical description
A Matérn process z is stationary, and its spectrum is of the form The parameter A sets the spectral level, and ν is called the smoothness of the process and is related to the slope of the power spectral density (psd) as the frequency ω → ∞. For ν = 1 2 the Matérn model is identical to the exponential model; the limit case as ν → ∞, is the Gaussian model. ν = 1 was used exemplarily in Rodríguez-Iturbe and Mejía (1974) to model the volume of rainfall. A smoothness larger than 1 is linked to the mean-square differentiability of the process (Stein 1999), i. e., with the smooth decrease in the covariance function at the origin. The parameter α is a positive range parameter which has units of angular frequency. It acts as a damping of the psd as ω → 0 and, thus, controls the transition between two regimes: To illustrate this property, Fig. 2 shows the psd of different processes corresponding to different set of values [α, ν]. Without lack of generality, we assumed a sampling of 1 s. s] by varying the smoothness parameter and yielding the correlation parameter constant to 1.5. The smoothness affects the mean-squared differentiability of the function at the origin strongly. The corresponding legend is depicted in b, which represents the psd: The slopes change with the smoothness. c The covariance function by varying the correlation parameter and keeping the smoothness parameter to 1. d The psd. The slopes are constant, α affects the cutoff frequency of the psd from which it becomes constant. e The psd by adding white noise to the correlated noise (CN). The CN gives the part of correlated noise regarding white noise; CN=1 corresponds to no white noise. f psd of processes where the variance of the total noise is not scaled Changing the sampling does not affect the figure due to the scaling effect. Clearly, the Matérn psd is constant for low frequencies and decreases as a power law at higher frequencies, similarly to a damped fBm. A high smoothness (exemplary ν = 3, magenta curve) is linked to a strong decay of the psd as |ω| α and a constant regime as |ω| α. This latter case corresponds to a white noise having the same power level for all frequencies.
For a high correlation parameter (exemplary α = 1.5), Fig. 2b shows a longer plateau regarding a noise generated with a low α = 0.5. This latter has a psd similar to the one of a fBm.
The covariance C zz of a second-order stationary process is related to the spectrum through the inverse Fourier rela- We introduce the Matérn function as where denotes the gamma function and K ν− 1 2 the decaying modified Bessel function of the second kind of order ν − 1 2 (Abramowitz and Stegun 1972). The variance of the process is denoted by σ 2 . Figure 2a, c, e shows the covariance function for the same parameter values as Fig. 2b, d, f. Clearly, for ν = 3, the function decreases slowly at the origin compared to the exponential model ν = 1 2 . The Matérn covariance has asymptotic behavior for large and small timescales (Lilly et al. 2017).

Our proposal to model TLS range temporal correlations
Following the proposal of Kermarrec et al. (2019), we model the temporal correlations of TLS range measurements with (e) (f) a separable covariance function (Gelfand et al. 2010). This function is built as follows: 1. We assume that the variance of the range measurements will account for or "catch" all spatial effects, i. e., the impact of surface properties and atmospheric transmission on the signal-to-noise ratio by means of the intensity model proposed by Wujanz et al. (2017). 2. The temporal correlations are modeled independently with a Matérn covariance function.
The resulting function is a product of the modeled variance and the chosen covariance function. In this contribution, we assume that the variance of additional white noise is small compared to the correlated one. Increasing the impact of white noise decreases the one of correlations in the LS adjustment: Such cases are less relevant to study, as the results obtained will be similar to the one corresponding to a diagonal VCM. We illustrate this behavior in Fig. 2e, where we show exemplarily the psd of a simulated noise defined as a combination of white noise and a Matérn process. In this example, the total variance is the same for all simulated time series (the green line corresponds to a pure white noise and the blue one to a pure correlated noise). In Fig. 2f, we increased the power of white noise gradually as well as the total variance. From these two figures, we see that only the high-frequency region of the psd is impacted by the additional source of noise, provided that the latter has approximately the same variance as the correlated one (yellow line in Fig. 2f). If the white noise has a smaller variance (red line), its impact on the shape of the psd is less pronounced than in the previous case. Thus, additional white noise will have a similar effect as decreasing the smoothness.
The Matérn parameters can be estimated from time series such as LS residuals with the Whittle maximum likelihood estimator, which we briefly explain in "Appendix A" for the sake of completeness.

Building the VCM
In building the VCM of the raw measurements, we propose to account for the scanning time and model the correlation "linewise": For a given object extracted from the whole point cloud, we consider that the last point of a scanned line (Fig. 3 for t = 10), is not directly recorded before the first point of the second line (Fig. 3, t = 11) after the horizontal angle has been incremented.
We resume the stochasticity of the TLS measurements in a matrix form,ˆ pol being the VCM of the polar measurements defined as: where the diagonal block matricesˆ θ ,ˆ φ are sorted timewise. Their elements are given by the corresponding variances σ 2 θ , σ 2 φ .ˆ r is the fully populated VCM of the range, and is filled using the proposed Matérn covariance model of Eqs. 22 and 23 with σ 2 = σ 2 r as the range variance. The time increment between two measurements is given from the scanning setting chosen, e. g. high, superhigh or extremely high. The corresponding matrixˆ r is visualized in Fig. 4, for which [α, ν] = [0.05, 1.5]. Clearly, the number of elements of the covariance function that are non-null depends on the chosen set of Matérn parameters.
We denote furthermoreˆ pol,diag , the diagonal Matrix corresponding toˆ pol , where diag states for diagonal, i. e., without accounting for temporal correlations.

Impact of the measurement configuration
Each coordinate based functional model can be expressed in polar and Cartesian representation. According to Suchomski (1999), the transformation of the polar coordinates into their Cartesian representations yields biased results, if a linearized substitute problem is used. In this contribution and following Sect. 2, we avoid this drawback: No transformation of the VCM of the raw measurements by means of the propagation law is needed. However, the biases defined in Sect. 2.1 will still depend on the scanning configuration through the Jacobian and Hessian matrices (Eq. 17). The orientation of the scanned object in space will affect the biases of second order described in Sect. 2.1.

The equivalent diagonal model or DCM
Using fully populated VCM in LS adjustment may be computational demanding. Based on the work of Luati and Proietti (2011), Kermarrec and Schön (2016) developed an alternative based on diagonal VCM: the so-called DCM. This model was successfully used in LS adjustment with GPS phase observations for the specific use of ambiguity resolution or coordinates estimation to avoid introducing fully populated matrices (Kermarrec and Schön 2017b). In this section, we propose to shortly summarize the main idea behind this simplified yet powerful stochastic model.
We callˆ equi the equivalent diagonal matrix toˆ pol .The conditions under which the LS estimate withˆ equi is identical to the one given byˆ pol are given in Luati and Proietti (2011) and are based on a decomposition of the design matrix of the linear LS adjustment as a product of the eigenvectors of polˆ −1 equi . In the case of a mean estimator, a necessary and sufficient condition for the equivalence between the generalized LS and the DCM to hold is that each element of the diagonal matrixˆ −1 equi is the sum of the row elements of the inverse of the fully populated covariance matrixˆ −1 pol . Kermarrec and Schön (2016) tested empirically the generalization of this equivalence for more sophisticated linear functional models: They showed that matrix multiplication involving non-sparse fully populated VCM could be avoided. Figure 5 explains graphically how to compute the equivalent matrix. We point out that the semi-equivalence holds true for the estimates and their a priori cofactor matrices in a lin- When the inverse of the VCM exists explicitly, the VCM can be replaced by a simple factor, which we call the Variance Inflation Factor (VIF), see Kermarrec et al. (2020a) and "Appendix B." An explicit formulation of the inverse exists when the correlation structure can be modeled with a Matérn model corresponding to ν = 1 2 , which corresponds in one dimension to a so-called autoregressive model of first order AR(1), see (Rao and Toutenburg 1995) or (Rasmussen and Williams 2006). Thus, the use of a VIF will be justified if the simulations show that misspecifying the smoothness do not affect strongly the bias of second order.

Simulated observation
We simulate TLS raw measurements from a plane to study the impact of both scanning configuration and temporal correlations on the first-order and second-order bias in a controlled framework, see Eq. 17 and cf. Sect. 2.2. In the following sections, we propose describing the setup that was adopted to make the simulations as close as possible to a real scenario. Fig. 6 Example of two simulated planes corresponding to two scanning configurations. The scanning rate for the representation of the TLS plane generated is constant and taken here to dt = 1 × 10 −5 s: This corresponds to a value for which the covariance function reaches the 0 value inside one line and not to the reference value in both cases. Please note that we consider that the main axis of the TLS crosses the plane in its center in all simulated cases

General configuration
The optimal scanning configuration for which the regular points lie on a regular grid corresponding to a null horizontal and vertical tilt regarding the TLS main axis is depicted in Fig. 6. For this reference configuration, we have [θ, φ] = [0, 0] • . All planes are generated having the size 1 m × 1 m and scanned at a distance of 10 m. We chose a temporal spacing between the points of dt = 5 × 10 −5 s so that the reference number of points per scanning line reaches 25. This value corresponds to the setting "premium, high" or "preview, normal" for the Z+F 2016 used in this contribution for the real data analysis.

Positive definite VCM
In order to study the impact of the scanning configuration on the biases, we generated different point clouds by adapting the horizontal and vertical tilts of the planes. Because we account for the slight increase in distance coming from the tilt, not all the scanning lines have 25 measurements, similarly to a real case scenario. For the sake of comparison-and in order to also ensure that all VCM are positive definitewe need to adapt the scanning rate to reach the reference number of 25 points per scanning line for all configurations. An example of this problematic is presented in Fig. 6: The reference scanning configuration shown in Fig. 6a is compared with the one corresponding to a horizontal tilt of and a vertical tilt of 40% (see Fig. 6b). A temporal spacing of dt = 1 × 10 −5 s was chosen for both cases and not the reference value of dt = 5 × 10 −5 s. Without decreasing dt accordingly, the number of points per scanning line is lower for the case corresponding to Fig. 6b than for the reference one. Unfortunately, when the number of measurements per scanning lines is low and the level of the chosen correlations high, the covariance function will not reach the 0 value inside one line: The corresponding VCM are badly conditioned and cannot be used in the adjustment, i. e., they are not positive definite. This effect is highlighted in Fig. 7a, for which α = 0.2. Clearly, the covariance function does not reach the 0 value inside one line of 25 values. Since this effect occurs when the number of points per scanning line is small regarding the correlation length, we avoid it by fixing the minimal number of points per scanning line equal to the number of points chosen for the reference configuration. Concretely dt may be smaller, for some scanning configurations, than the reference value. This allows to compare the results when α is varied, see Sect. 4.2. We note that this effect is not likely to happen with real data as the number of points per line is much higher. We intentionally chose in this contribution to follow the configuration used in Lösler et al. (2020)

Configurations
To answer the first and second scientific questions raised in the introduction, we simulated different configurations by changing the horizontal and vertical tilts of the plane. θ and φ are, thus, varied in the range [0, 20, 40, 60] • independently. An example of the planes obtained is depicted in Fig. 6, for which a distance of 10 m was chosen. The distance between the center of the TLS and the plane was held fixed.
We are aware that the scanning configuration also includes the edge length and the distance to the instrument. These effects on the first-and second-order solutions have already Based on their study, we highlight the following points: 1. Changing the distance is similar to acting on the a priori variance factor of the measurements. Exemplarily, increasing the distance corresponds to an increase in σ 2 r and thus of the bias (Wujanz et al. 2017). 2. Acting on the distance is, moreover, linked to varying the scanning rate but for a fixed distance: For a given edge length, increasing dt is related to decreasing the number of grid points, which increases the bias. 3. A high bias was found for small edge lengths.
These effects have already been investigated in Lösler et al. (2020). We do not aim to repeat their investigations and, thus, concentrate on the impact of the correlation structure on these biases only and not on their absolute values.

Stochastic model
We account for the heteroscedasticity and correlations of the TLS measurements as proposed in Sect. 3.1. The setup of the VCM of the measurements is as follows:

How to compare the results and what is expected?
We define the following ratios in order to provide an answer to the three main questions raised in the introduction: • To investigate the difference between the second-and first-order solution for the estimated ranged and the standard deviation σ d of the range of the plane: • To study how a simplification of the fully populated VCM with its diagonal counterpartˆ pol,diag will impact the second-order solution: • To test whether the DCM usingˆ equi can replace the fully populated VCM in the LS adjustment: We calld 2 nd ,d 2 nd ,diag ,d 2 nd ,equi the second-order solution computed withˆ pol ,ˆ pol,diag andˆ equi , respectively, for the range r of the TLS.d 1 st is the corresponding first-order solution. The dispersions σ d are defined similarly. All ratios will be given in %, with as many digits as necessary for the sake of comparison. Please note that we always consider the second-order solution as the reference one.
We propose to initially discuss the expected results intuitively to simplify the understanding of the simulations: 1. Since we consider having a fixed number of points per line, increasing the correlation parameter α will act as decreasing the impact of correlations: The solution will tend to the one obtained with a diagonal matrixˆ pol,diag . 2. Increasing the smoothness will increase the impact on the bias regarding the diagonal solution or for a given α. The diagonal solution is close to the one obtained for ν = 1 2 as α increases: Exemplarily Fig. 7a, b shows the decay of the covariance function in that specific case, as the time increases. 3. We expect that a larger σ r will increase the ratios. 4. A suboptimal scanning configuration is expected to decrease the impact of the temporal correlations on the ratios.
In the following, we choose intentionally to present in details the results for two configurations only. Further results will be discussed in text form in order not to overload our contribution.

Difference between the second-and first-order solution
Rd ,2/1 and R σ,2/1 . The corresponding results are presented in Fig. 8a, b (top). Under the reference scanning configuration, [θ, φ] = [0, 0] • and for σ r ,0 = 1 mm, Rd ,2/1 is nearly constant (around 0.003%) independently of the correlation structure. R σ,2/1 is between 1% and 2% for the range of values chosen. Both ratios tend to a constant value, which is reached for α > 2. We see clearly that for σ r ,0 = 5 mm an increase in correlations is linked to a slight decrease in the ratio, i. e., from 0.0045 to 0.004% for the parameter (magenta line versus blue line) and 0.25% for R σ,2/1 , for α = 0.5, ν = [0.5 − 1.5]: Accounting for correlations damped the difference between the first-and second-order solution for both parameter and dispersion. This effect can be interesting to avoid computing the second-order solution when small objects are scanned, for which the absolute bias is not negligible. However, we note that the ratios are under 1% and can be considered as small. To summarize, when temporal correlations are accounted for, the ratios decrease so that it becomes less interesting to use the second-order solution. This effect is at the same time a risk when correlations are underestimated and the first-order solution is computed. For that reason, it is still always preferable not to compute the first-order solution for small objects. Please note that we were not considering the absolute value of the differences-which depends on the edge length-and only analyzed how correlations affected the difference. 2. Simplification of the fully populated VCM with its diagonal counterpart Rd ,2/2diag and R σ,2/2diag The results are presented in Fig. 8a, b (middle). The impact of the fully populated VCM at the parameter level regarding its diagonal counterpart is nearly negligible. For R σ,2/2diag , the expected high difference between fully populated VCM and diagonal VCM can be confirmed: for σ r ,0 = 1 mm and [α, ν] = [0.5, 1.25], the ratio R σ,2/2diag reaches nearly 40% and 60% for σ r ,0 = 5 mm. For a lower smoothness and high correlation parameter, we have less significant values: For the exponential covariance function [α, ν] = [0.5, 0.5], the ratio reaches 20% and 50% for σ r ,0 = 1 mm and σ r ,0 = 5 mm, respectively. We note further that the correlation parameter α should be high to reach R σ,2/2diag = 0, particularly for σ r ,0 = 5 mm: Even low correlations lead to a high ratio. A strong smoothness ν increases the ratio for a given α, which is a logical effect as the VCM is less sparse. This reference case highlights that correlations should not be disregarded for computing the dispersion. This conclusion would be the same if the first-order solution had been used, due to the aforementioned small difference between the first-and second-order solution (see previous point). Not accounting for correlations and using a diagonal VCM is linked to a risk of making a high error in terms of dispersion. 3. Simplification of the fully populated VCM with its diagonal counterpart Rd ,2/2equi and R σ,2/2equi derived by the DCM approach Figure 8a, b (bottom) shows the good adequation between the results given with the DCM and the fully populated VCM. At the parameter level, the ratio Rd ,2/2equi is not higher than 0.002% for the case corresponding to [α, ν] = [0.5, 1.25] and σ r ,0 = 5 mm. R σ,2/2equi highlights the strength of the approximation with the DCM regarding using the diagonal VCM: R σ,2/2equi reaches a maximum value of approximately 4% for [α, ν] = [0.5, 1.25] and σ r ,0 = 1 mm, about one order of magnitude better than R σ,2/2diag for the same configuration. We note that R σ,2/2equi is getting smaller as σ 0,r increases ( Fig. 8b right bottom), which we interpret as coming from the closeness of the range variance to the angle one. Additional tests by increasing the angle variance to 0.01 • instead of 0.007 • led to an increase in R σ,2/2equi comparable with the values given for σ r ,0 = 1 mm and the reference angle variance chosen. This study confirms empirically that the equivalence holds true for a nonlinear LS plane fitting: The DCM can be used with high confidence to replace the fully populated VCM. We also mention an important point for computational efficiency: The difference by varying the smoothness for a fixed α is below 0.1%. This result is promising. Indeed, even if the estimated smoothness is higher than 0.5, we can replace it by ν = 0.5 and slightly decrease the estimated α from 0.5 without impacting the results for the parameters and dispersion. Alternatively, one could directly assume that the temporal correlation model follows an AR(1) process and estimate or model the correlation length only: The AR(1) model for onedimensional time series is corresponding to a Matérn model with ν = 1 2 (Rasmussen and Williams 2006). As aforementioned in Sect. 3.3.3, the inverse of the VCM from an AR(1) model can be replaced by a scaled identity in the LS adjustment, see Kermarrec et al. (2020a) for a detailed explanation. The authors recommend using the AR(1) model due to the potential numerical instability of the DCM for high smoothness and low α (see, e. g., Fig. 8, bottom, yellow line). These computational inaccuracies are linked to the high number of relevant values, i. e., not close to zero, on one line of the fully populated VCM. Additionally combined with suboptimal scanning configurations, they can lead to a ratio of up to 1% for [θ, φ] = [60, 20] • and [α, ν] = [0.5, 1.25]. This effect-although of a small magnitude-gives weight to the use of a more stable AR(1) model in the DCM, besides the fact that for that model a close formula for the inverse of the VCM exists.

Impact of the scanning configuration
We further analyzed different scanning configurations following Table 1 for the sake of completeness. Without loss of generality, we chose to present the case [θ, φ] = [40, 0] • graphically in Fig. 9. Further results are given in text form.

Difference between the second-and first-order solution
Rd ,2/1 and R σ,2/1 From Fig. 9a (top), the impact of the scanning configuration chosen is shown to affect Rd ,2/1 negatively, which increases by a factor of 10 regarding the reference case ( Fig. 8): The difference between the second-and firstorder solution still decreases with increasing correlations and tends to 0.03%. On the contrary, R σ,2/1 reaches approximately 0.075% in mean over all simulated correlation structure. It is lower than with [θ, φ] = [0, 0] • ; the scanning configuration lead to a decrease in the impact of the temporal correlations on the second-order solution compared to the first-order one. These results were confirmed for other scanning configuration. Exemplarily for [θ, φ] = [60, 20] • , we got a mean of Rd ,2/1 = 0.06% and R σ,2/1 = 0.08%, for [θ, φ] = [40, 40] • , Rd ,2/1 = 0.07% and R σ,2/1 = 0.06%. We conclude, therefore, that the non-optimality of the scanning configuration leads to ratios close to 0: The first-and second-order solutions are similar in terms of parameter and dispersion when temporal correlations are additionally considered. 2. Simplification of the fully populated VCM with its diagonal counterpart Rd ,2/2diag and R σ,2/2diag Figure 9a, b (middle) highlights further the non-necessity of accounting for correlations as the scanning con- As the configuration deviates from the optimal condition, the fully populated VCM can be replaced by its diagonal counterpart. 3. Simplification of the fully populated VCM with its diagonal counterpart Rd ,2/2equi and R σ,2/2equi derived by the DCM approach The results obtained with the DCM are similar to the previous case. Once more, the equivalent model can replace the fully populated VCM with a high confidence. Even if at the parameter level, Rd ,2/2equi is higher than Rd ,2/2diag , it remains at a level close to 0. Similar to the reference case, R σ,2/2equi is approximately five times lower than R σ,2/2diag . As R σ,2/2diag , R σ,2/2equi decreases as the scanning configuration becomes worth more. These conclusions were similar for all other scanning configurations we simulated, and are not discussed further.

Using the DCM in a real case
To highlight how to use the DCM in real case, a white plane of size 1 m × 1 m was scanned at a distance of 10 m with a Z+F 2016 using the scanning modus "high" and "superhigh" (premium). This corresponds approximately to a scanning rate as chosen in the simulations. We chose two scanning configurations, i. e., the optimal one with no tilt (see Sect. 4) and a scanning configuration with a slight tilt of 20 • for θ . The point clouds were preprocessed to avoid edge effects and outliers, and finally cut using a free software. The point clouds obtained resulted in 30 lines of 30 points, similar to our simulated planes. We adopted a standard deviation of 0.004 • for the angles and 0.2 mm for the range. The values were taken from the manufacturer's datasheet and agree with the one of more accurate model based on the intensity value such as in Wujanz et al. (2017). The first-order solution was computed in a first approximation using the corresponding diagonal VCM built as explained in Sect. 3.3. The correlation structure of the range was estimated from the residuals using a Matérn model by fixing the smoothness to 1 2 : This corresponds to an autoregressive process of first order AR(1) in one dimension, see Rasmussen and Williams (2006). Due to the chosen parameterization of the Matérn covariance function, a balance between ν and α occurs, i. e., the estimated correlation parameter will be smaller than if a higher smoothness would have been estimated, see Stein (1999). As shown in Sect. 4.4.1 within a simulated framework, we are allowed-even for the suboptimal scanning configuration-to make this simplification. A second possibility is to estimate the parameters of a Matérn model and decrease α by 0.5 by keeping ν fixed to 1 2 : This approximation is valid for the use in the DCM and not for an accurate correlation modeling, it leads to less computational demanding matrix inversion. We make use of the first strategy in this contribution and further adopt a "global" correlation model, i. e., the white noise component is not independently estimated and considered to decrease the correlation length accordingly, see Kermarrec and Schön (2017b). We performed the estimation linewise using the MATLAB function ar. For the optimal scanning configuration, the mean values were 0.11 (respectively, 0.14 for the suboptimal scanning configuration) over the lines with a standard deviation of 1 × 10 −3 , respectively, 3 × 10 −3 for the correlation coefficient for the setting high. We found for the setting superhigh a higher correlation coefficient of 0.31 with a standard deviation of 0.7 × 10 −3 (respectively, 0.35 with a standard deviation of 1.5 × 10 −3 for the suboptimal scanning configuration). The correlation coefficients are nearly independent of the scanning configuration: This validates that temporal correlations can be estimated from the residuals, i. e., at a distance of 10 m under labor condition, we do not expect atmospheric effects to affect the correlation structure. We are aware that this estimation may be biased due to the small samples under consideration; however, the stable results are a validation that the bias is probably kept small: Otherwise some numerical instability in the parameter estimation would have been visible. An accurate determination of the parameters is beyond the scope of the present paper and would overload it, i. e., we wish to validate the use of the DCM so that a rough approximation is enough here to show its range of application. Simulations having shown that the parameter estimation is not affected by this approximation, we only concentrate on the dispersion for the real data analysis.

Optimal scanning configuration
For the optimal scanning configuration, the first-order dispersion of the range d using the estimated fully populated VCM reaches 0.101 mm for the setting high (respectively, 0.08 mm for the setting superhigh). We found an increase lower than 1% for both case using the second-order dispersion, which is coherent with the simulation results for the chosen standard deviation of the range; the value of the correlation coefficient corresponds approximately to α = 1. Using a diagonal VCM resulted in an underestimated dispersion of 0.076 mm (respectively, 0.068 mm), which also corresponds to the value found in the simulations (see Fig. 8b, left, middle). We made use of the DCM for the range by means of the V I F computed from the correlation coefficient (see "Appendix B"). The corrected dispersion value reached

Suboptimal scanning configuration
In that case, and following the simulations, the secondorder dispersion is not worth computing for the range standard deviation under consideration (see Fig. 9b, left, middle). Using the estimated fully populated VCM, a value of 0.35 mm for the setting high (respectively, 0.28 mm for the setting superhigh) was found. This is a slightly higher value than for the optimal scanning configuration. Using a diagonal VCM or the DCM changed the results by less than 0.2% and can replace with a high trustworthiness the fully populated VCM in the computation of the dispersion. Please note, the increased number of digits of the presented results is only used for the sake of comparability. We additionally performed the parameter estimation for all four cases. As shown in the simulation, an improved stochastic model has a low impact on the solution. This effect was confirmed in the real data analysis and is here not presented with the sake not to overload the present contribution.

Conclusion of the real data analysis
The DCM is a simplification based on the AR(1) model: a Matérn model with a fix smoothness, balanced by a corre-sponding "decrease" in the estimated correlation parameter. The methodology for using the DCM with real data is summarized in a flowchart form in Fig. 10. Simulations have shown that the DCM can be used with a high trustworthiness, which we validated by means of the real data analysis from a plane fitting. It allows one, in a first approximation, to correct the first-or second-order dispersion adequately, as well as for parameter estimation when needed. Further investigations will concentrate on how to estimate the AR(1) parameter and the white noise component, following, for example, Kargoll et al. (2018).

Conclusion
A knowledge of the stochastic properties of TLS errors is of great importance to avoid untrustworthy test decisions for deformation, and unrealistic dispersion or parameter estimation. When a linearization of the functional model is performed, biases are introduced in the LS approximation and a second-order solution may be more appropriate to get realistic estimates. Unfortunately, physical correlations of the raw TLS measurements combined with the scanning configuration are expected to impact this improved solution. We modeled the correlation structure of the TLS range measurements with the Matérn covariance function to analyze this impact. The three model parameters, i. e., smoothness, correlation length and variance, can be trustworthily estimated, even for small samples, by means of the de-biased Whittle likelihood. Alternatively, they could be fixed without further estimation based on empirical investigation or modeling. Accounting for an improved stochastic model for TLS measurements leads to fully populated VCM. The estimation of the second-order solution may become computationally demanding: Some investigations were needed to weight the corresponding benefits against computing the first-order solution only. Based on simulated TLS measurements from a plane scanned with different measurement configurations and considering various correlated noise structures, we have shown that the ratio between the first-and second-order solution was below 2% for the dispersion and 0.05% for the estimation of the range for a reference configuration without tilt of the plane. The ratio was found to decrease as the correlations increased, as well as for suboptimal scanning configurations: There is, thus, a high risk of miss-estimating the dispersion when the first-order solution is computed. This effect is getting higher when correlations are further underestimated. Indeed, we investigated whether a simple diagonal VCM without taking correlations into account would have led to the same LS results and found that the dispersion increased with the level of correlations to reach a ratio of up to 60% between the results obtained with fully populated and diagonal VCM for a range standard deviation of 5 mm. The ratio decreased as the scanning configuration deviated from the reference one. We concluded that accounting for correlations with a fully populated VCM in the LS adjustment is crucial when objects are scanned under optimal scanning conditions, particularly if the first-order solution is computed. This result has driven our research to answer our third scientific question: Is it possible to replace the fully populated VCM with a DCM or equivalent diagonal model for a LS plane fitting? In this contribution, we could successfully confirm the empirical results found for GPS adjustment by modeling various scanning conditions: A fully populated VCM can be resumed to a diagonal matrix, which accounts in a "hidden" way for correlations in a LS adjustment. The ratio differences DCM versus fully populated VCM were smaller than 2%, compared with values up to 60% for the corresponding diagonal matrix. We advise making use of an AR(1) model for the correlation structure with the DCM for numerical stability. This assumption further simplifies the computation of the inverse of the VCM, resuming it to a simple factor. Empirical estimations or physical modeling of an accurate correlation structure of TLS range measurements remain the topic of further contributions. a first approximation with the MATLAB function pwelch to estimate the psd of the process.

B Appendix: Variance inflation factor
The noise model of TLS range measurements can be approximated with the autoregressive model of the first order AR(1), which assumes a weak stationarity. The correlation matrix of the corresponding process has a Toeplitz structure (Rao and Toutenburg 1995), i. e., where ρ is the autocorrelation coefficient, which can be estimated or fixed a priori. The main advantage of the AR(1) over other correlation models is that an explicit inverse Q −1 AR (1) exists, i. e., Consecutively, the diagonal equivalent matrix, following Sect. 3.3.3 is given by Thus, assuming that for large matrices, the first and last terms of the diagonal that have different values than the rest can be neglected, we approximate Q AR(1),equ by Q AR(1),equi = V I FI, where I is the identity matrix and the variance inflation factor is defined as V I F = 1+ρ 1−ρ .
The corresponding VCMˆ r for the range is obtained bŷ r ,equi = σ 2 0,r V I FI.