# How to deal with the high condition number of the noise covariance matrix of gravity field functionals synthesised from a satellite-only global gravity field model?


## Abstract

The posed question arises for instance in regional gravity field modelling using weighted least-squares techniques if the gravity field functionals are synthesised from the spherical harmonic coefficients of a satellite-only global gravity model (GGM), and are used as one of the noisy datasets. The associated noise covariance matrix appeared to be extremely ill-conditioned, with a singular value spectrum that decayed gradually to zero without any noticeable gap. We analysed three methods to deal with the ill-conditioned noise covariance matrix: Tikhonov regularisation of the noise covariance matrix in combination with the standard formula for the weighted least-squares estimator, a formula of the weighted least-squares estimator which does not involve the inverse noise covariance matrix, and an estimator based on Rao’s unified theory of least-squares. Our analysis was based on a numerical experiment involving a set of height anomalies synthesised from the GGM GOCO05s, which is provided with a full noise covariance matrix. We showed that the three estimators perform similarly, provided that the two regularisation parameters each method requires were chosen properly. As standard regularisation parameter choice rules do not apply here, we suggested a new parameter choice rule, and demonstrated its performance. Using this rule, we found that the differences between the three least-squares estimates were within noise. For the standard formulation of the weighted least-squares estimator with regularised noise covariance matrix, this required an exceptionally strong regularisation, much larger than one would expect from the condition number of the noise covariance matrix. The preferred method is the inversion-free formulation of the weighted least-squares estimator, because of its simplicity with respect to the choice of the two regularisation parameters.

## Keywords

Local quasi-geoid · Least-squares · Global gravity field model · Noise covariance matrix · Regularisation · Unified theory of least-squares · Spherical radial basis functions

## 1 Introduction

In local gravity field modelling, a global gravity field model (GGM) may be considered as another noisy dataset next to the local datasets such as terrestrial and shipboard gravity anomalies, airborne gravity disturbances, radar altimeter deflections of the vertical or along-track height anomaly differences. In Stokes-based approaches, part of the information in the form of noise degree variances is routinely used for the modification of the Stokes kernel (e.g. Sjöberg 1980, 1981; Wenzel 1981). Sjöberg (2005, 2011) extended the formalism, which now allows to include a full noise covariance matrix of the spherical harmonic coefficients.

To the authors’ knowledge, no publication other than Klees et al. (2017) has so far used a full noise covariance matrix of the GGM in local gravity field modelling. This may be explained, among other reasons, by the fact that in the past a full noise covariance matrix was not available or was not considered reliable enough. This has changed recently with the latest generation of GGMs, which are mainly based on low-low satellite-to-satellite (ll-SST) tracking data of the Gravity Recovery and Climate Experiment (GRACE) mission, satellite gravity gradiometry (SGG) data of the Gravity field and steady-state Ocean Circulation Explorer (GOCE) mission, and high-low satellite-to-satellite (hl-SST) tracking data of GRACE, GOCE and many other low-earth orbiters. Post-fit residual analysis (e.g. Farahani et al. 2013) has become a powerful tool to improve the noise model of the satellite data. Numerically efficient algorithms were developed, which propagate the full data noise covariance matrices into the estimated spherical harmonic coefficients of the GGM. One example is GOCO05s (Mayer-Gürr et al. 2015), which is complete to degree 280 and provided with a full noise covariance matrix. This matrix has been propagated using the law of covariance propagation from the noise covariance matrices of the individual datasets used to compute GOCO05s. The noise covariance matrices of the individual datasets are based on a post-fit residual analysis and modelled using empirical covariance functions or ARMA models, depending on the dataset.

When estimating a local model of the disturbing potential using least-squares techniques, there are basically two possibilities to include ll-SST, hl-SST, and SGG data: (i) using these data directly as observations (e.g. Eicker 2008; Eicker et al. 2014; Bucha et al. 2015; Naeimi and Bouman 2017) or (ii) using the spherical harmonic coefficients of the GGM (e.g. Schmidt et al. 2007; Klees et al. 2017). In the former case, the noise covariance matrices of the satellite datasets are well-conditioned. However, dealing with original data may increase the numerical complexity of the parameter estimation significantly, in particular when using ll-SST data of the GRACE satellite gravity mission with a proper noise model. The latter option avoids this complexity, but it also comes at a price. The spherical harmonic coefficients cannot be used directly as observations in local gravity field modelling (Klees et al. 2017). Instead, gravity field functionals (e.g. disturbing potential values, height anomalies, or gravity disturbances) need to be synthesised from the spherical harmonic coefficients at the Earth’s surface or at altitude. The associated noise covariance matrix has to be computed from the noise covariance matrix of the spherical harmonic coefficients using the law of covariance propagation.

Propagating the noise covariance matrix of spherical harmonic coefficients into a set of gravity field functionals over a local area at the Earth’s surface or at altitude provides a noise covariance matrix that has a gradually decreasing singular value spectrum without any noticeable gap (cf. Sect. 2). Depending on the point density, the noise covariance matrix may be extremely ill-conditioned, meaning that the spectral norm condition number is much larger than \(\varepsilon ^{-1}\), where \(\varepsilon \) is the relative rounding error unit of IEEE 754 double precision arithmetic. In this study, we investigate three approaches to deal with the ill-conditioned noise covariance matrix: (i) applying Tikhonov regularisation (Tikhonov 1963) to the noise covariance matrix in combination with the standard formula for the weighted least-squares estimator; (ii) using an alternative formula for the weighted least-squares estimator, which does not require inverting the noise covariance matrix (Grafarend and Schaffrin 1993); and (iii) using an estimator based on the theory of unified least-squares (Rao 1971, 1973, 1978), which was designed, among other things, to deal with a rank-deficient noise covariance matrix.
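The threshold \(\varepsilon ^{-1}\) mentioned above can be made concrete with a few lines of Python. This is only an illustrative sketch: the helper name `inversion_is_hopeless` is hypothetical, and NumPy's `eps` is the spacing between 1.0 and the next double (\(2^{-52}\)), whereas the unit roundoff is half of that.

```python
import numpy as np

# Spacing between 1.0 and the next representable double (2**-52).
eps = np.finfo(np.float64).eps
# Maximum relative rounding error of a single operation (2**-53).
unit_roundoff = eps / 2

print(f"eps = {eps:.3e}, 1/eps = {1 / eps:.3e}")


def inversion_is_hopeless(cond_number: float) -> bool:
    """Rule of thumb (hypothetical helper): once the condition number
    reaches 1/eps, a computed inverse may have no correct digits left."""
    return cond_number >= 1.0 / eps
```

With this rule of thumb, a condition number of order \(10^{8}\) is harmless, whereas one of order \(10^{16}\) or larger is not.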

The remainder of the paper is organised as follows: in Sect. 2, we investigate several parameters which have an influence on the condition number of the noise covariance matrix of gravity field functionals when propagated from a full noise covariance matrix of spherical harmonic coefficients of a state-of-the-art GGM. It appears that the condition number of the noise covariance matrix increases exponentially with the density of the points at which the gravity field functionals are synthesised. In Sect. 3, we investigate the minimum point density which is required to reduce the functional model error below the noise level in the data. In Sect. 4, we introduce the three methods investigated in this study. The experimental setup and the choice of the regularisation parameters that each method requires are the subject of Sect. 5. In Sect. 6, we present and discuss the results of the numerical experiments. Section 7 provides a summary and the conclusions.

## 2 The condition number of the noise covariance matrix of a gravity field functional synthesised from a satellite-only GGM

When propagating the full noise covariance matrix of a spherical harmonic model of the Earth’s gravity field into gravity field functionals using the law of covariance propagation, the condition number of the gravity field functionals’ noise covariance matrix essentially depends on a number of parameters, among others, the density of the data points, the size of the data area, the maximum degree of the GGM, the altitude of the data points, and the type of gravity field functional.
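The law of covariance propagation and the spectral norm condition number referred to above can be sketched as follows. The matrices are small synthetic stand-ins (not GOCO05s), and the function names are hypothetical; the sketch only shows the mechanics for a linear map \(\mathbf y = \mathbf F \mathbf x\).

```python
import numpy as np


def propagate_covariance(F: np.ndarray, C_x: np.ndarray) -> np.ndarray:
    """Law of covariance propagation for y = F x: C_y = F C_x F'."""
    C_y = F @ C_x @ F.T
    return 0.5 * (C_y + C_y.T)  # remove asymmetry caused by rounding


def spectral_condition_number(C: np.ndarray) -> float:
    """Ratio of the largest to the smallest singular value."""
    s = np.linalg.svd(C, compute_uv=False)
    return float(s[0] / s[-1]) if s[-1] > 0 else float("inf")


# Synthetic example: 30 "coefficients" mapped to 20 "functionals".
rng = np.random.default_rng(0)
n_coeff, n_points = 30, 20
G = rng.standard_normal((n_coeff, n_coeff))
C_x = G @ G.T + n_coeff * np.eye(n_coeff)   # synthetic SPD covariance
F = rng.standard_normal((n_points, n_coeff))  # synthetic synthesis operator

C_y = propagate_covariance(F, C_x)
print(f"condition number: {spectral_condition_number(C_y):.2e}")
```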

### 2.1 Impact of the point density

To investigate the impact of the point density, we located height anomalies on Reuter grids. A Reuter grid is controlled by a single parameter *N*, which determines the distance between the grid nodes, i.e. the point density. The number of grid nodes is close to but does not exceed \(2 + {4 \over \pi } N^2\) over the whole surface of the sphere (Reuter 1982; Freeden et al. 1998). The grids were located on the Earth’s surface and covered an area bounded by \(44^\circ \)–\(68^\circ \)N and \(11^\circ \)W–\(15^\circ \)E. Each height anomaly noise covariance matrix was computed by covariance propagation from the full noise covariance matrix of the unregularised GOCO05s spherical harmonic model complete to degree \(L=200\). Truncating GOCO05s at degree 200 makes sense here as beyond that degree the commission error grows exponentially and may quickly attain values much larger than the uncertainty of local datasets (e.g. terrestrial gravity anomalies) used in local gravity field modelling. For instance, over the data area defined above, we found that the average height anomaly noise standard deviation from the unregularised GOCO05s GGM complete to degree 280 can be modelled as \(e^{0.03676 L - 6.5191}\) cm, which is 2.3 cm at \(L=200\), but already about 15 cm at \(L=250\) and more than 40 cm at \(L=280\). This exponential growth of the commission error implies that in local gravity field modelling, it does not make sense to use GOCO05s up to the maximum degree, except in some areas of the world where terrestrial gravity datasets have an even poorer quality.
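The exponential commission-error model quoted above is easy to evaluate. In the sketch below the result is taken in centimetres, because only then does the formula reproduce the quoted 2.3 cm at \(L=200\); the function name is hypothetical.

```python
import math


def height_anomaly_noise_sd_cm(L: int) -> float:
    """Fitted average height anomaly noise SD for GOCO05s truncated at
    degree L, in centimetres (exponential model from the text)."""
    return math.exp(0.03676 * L - 6.5191)


for L in (200, 250, 280):
    print(f"L = {L}: {height_anomaly_noise_sd_cm(L):.1f} cm")
```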

Table 1: Characteristic numbers of the Reuter grids used in Fig. 1 and the condition number of the noise covariance matrix of height anomalies synthesised on these grids from the GOCO05s noise covariance matrix of spherical harmonic coefficients complete to degree 200

| *N* | # Points | Mean point distance | Condition number |
|---|---|---|---|
| 200 | 435 | \(0^\circ 54'00''\) | \(9.7 \cdot 10^8\) |
| 220 | 516 | \(0^\circ 49'05''\) | \(6.3 \cdot 10^{12}\) |
| 240 | 617 | \(0^\circ 45'00''\) | \(1.4 \cdot 10^{16}\) |
| 260 | 729 | \(0^\circ 41'32''\) | \(6.6 \cdot 10^{19}\) |
| 280 | 834 | \(0^\circ 38'34''\) | \(7.0 \cdot 10^{22}\) |
| 300 | 958 | \(0^\circ 36'00''\) | \(1.9 \cdot 10^{27}\) |
| 350 | 1297 | \(0^\circ 30'51''\) | \(4.0 \cdot 10^{34}\) |

Figure 1 and Table 1 reveal that the condition number increases exponentially with increasing point density. It may easily exceed \(10^{15}\), which implies the loss of all significant digits in IEEE 754 double precision arithmetic when computing its inverse.
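The exponential growth can be checked directly from the numbers of Table 1 with a straight-line fit of \(\log _{10}\) of the condition number against *N*. This is a simple illustration using only the tabulated values; the critical value `N_crit` is an interpolation from that fit, not a result reported in the paper.

```python
import numpy as np

# Values copied from Table 1.
N = np.array([200, 220, 240, 260, 280, 300, 350], dtype=float)
cond = np.array([9.7e8, 6.3e12, 1.4e16, 6.6e19, 7.0e22, 1.9e27, 4.0e34])

# Linear fit of log10(cond) against N; polyfit returns [slope, intercept].
slope, intercept = np.polyfit(N, np.log10(cond), deg=1)

# Grid parameter at which the fitted condition number reaches 1/eps.
inv_eps = 1.0 / np.finfo(np.float64).eps
N_crit = (np.log10(inv_eps) - intercept) / slope

print(f"growth: {slope:.3f} decades per unit of N")
print(f"fitted condition number reaches 1/eps near N = {N_crit:.0f}")
```

According to this fit, the condition number grows by roughly one order of magnitude for every six units of *N*.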

The point density needs to be chosen with care. The results of Table 1 suggest that a low point density is to be preferred. On the other hand, the point density must be high enough (i) to preserve all information contained in the GGM over the area of interest, and (ii) to guarantee that the error of the local model of the disturbing potential is negligible compared to the effect of the data noise. Therefore, in Sect. 3, we will determine the minimum point density needed to reconstruct a GGM over a local area of interest with a model error significantly below the effect of the data noise.
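For readers who want to experiment with the point density, the following sketch generates a Reuter grid. It follows the construction commonly given for this grid (e.g. in Freeden et al. 1998) as recalled here, so the formula for the number of points per ring should be treated as an assumption to be checked against the reference. The function names and the area-selection helper are hypothetical.

```python
import numpy as np


def reuter_grid(N: int) -> np.ndarray:
    """Return an array of (colatitude, longitude) pairs in radians."""
    dtheta = np.pi / N
    points = [(0.0, 0.0)]  # north pole
    for i in range(1, N):
        theta = i * dtheta
        # Longitude step for which neighbours on the ring are separated
        # by a spherical distance of dtheta.
        arg = (np.cos(dtheta) - np.cos(theta) ** 2) / np.sin(theta) ** 2
        step = np.arccos(np.clip(arg, -1.0, 1.0))
        gamma = int(np.floor(2.0 * np.pi / step))  # points on this ring
        lon = (np.arange(gamma) + 0.5) * 2.0 * np.pi / gamma
        points.extend((theta, lam) for lam in lon)
    points.append((np.pi, 0.0))  # south pole
    return np.array(points)


def select_area(points, lat_min, lat_max, lon_min, lon_max):
    """Keep the points inside a latitude/longitude box given in degrees
    (longitudes in the range -180..180)."""
    lat = 90.0 - np.degrees(points[:, 0])
    lon = (np.degrees(points[:, 1]) + 180.0) % 360.0 - 180.0
    mask = (lat >= lat_min) & (lat <= lat_max) & (lon >= lon_min) & (lon <= lon_max)
    return points[mask]


grid = reuter_grid(240)
local = select_area(grid, 44.0, 68.0, -11.0, 15.0)
print(len(grid), "global nodes,", len(local), "nodes in the data area")
```

The paper reports 617 points for \(N=240\) over the data area; the sketch has not been verified against that number.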

### 2.2 Impact of the size of the data area

We also found that the condition number of the data noise covariance matrix depends on the size of the data area. Over the range of data areas we investigated (up to a size of \(34^\circ \times 36^\circ \)), we found that the larger the data area, the larger the condition number. For instance, if the height anomalies were located on a Reuter grid with \(N=240\), the condition number of the height anomaly noise covariance matrix was \(4.0\cdot 10^{10}\) for a data area of \(49^\circ {-}63^\circ \)N, \(6^\circ \)W–\(10^\circ \)E, and increased to \(1.4 \cdot 10^{16}\) and \(1.9 \cdot 10^{22}\) for a data area of \(44^\circ {-}68^\circ \)N, \(11^\circ \)W–\(15^\circ \)E and \(39^\circ {-}73^\circ \)N, \(16^\circ \)W–\(20^\circ \)E, respectively. Similar condition numbers were obtained when other areas on the globe were chosen. On the other hand, it is well known that if the data area is not global, the disturbing potential is distorted, with the largest distortions along the border of the data area. These distortions, which are usually referred to as edge effects, decrease towards the centre of the data area (e.g. Schachtschneider et al. 2010). Hence, to reduce the distortions, the data area has to be chosen larger than the area of interest. For instance, Naeimi (2013) suggested an extension of the data area beyond the area of interest by \({10{,}000 \over L_{\hbox {min}}}\) km, assuming that the data have no energy at spherical harmonic degrees \(\le L_{\hbox {min}}\). In our study, we found empirically that an extension of \(5^\circ \) reduces the distortions over the area of interest to a level below 2 mm. This result is independent of the size of the area of interest, as shown in Schachtschneider et al. (2010). Hence, only for data areas much smaller than the one considered in the numerical experiments of Sect. 5 may the condition number of the noise covariance matrix be small enough to allow for a stable computation of the inverse.

### 2.3 Impact of other parameters

Other parameters which may influence the condition number of the data noise covariance matrix comprise the maximum degree of the GGM, the type of gravity field functional, and the altitude of the data grid.

The dependency on the maximum degree of the GGM was found to be moderate. To understand this result, we must remember that when increasing the maximum degree of the GGM, we also have to use a denser dataset. For instance, when we used GOCO05s complete to degree 200 and located the data points on a Reuter grid with \(N=201\), the condition number was \(9.7 \cdot 10^{8}\). When we used GOCO05s complete to degree 280 and located the data on a Reuter grid with \(N=281\), the condition number increased to \(2.6 \cdot 10^{10}\). We obtained similar results when using larger values of *N*.

The effect of the type of gravity field functionals and the altitude of the data grid on the condition number of the data noise covariance matrix appeared to be marginal. For instance, when we used gravity disturbances instead of height anomalies, the condition number increased by a factor of only 4.6; when we used data at an altitude of 250 km instead of data on the Earth’s surface, the condition number increased by a factor of only 1.6.

Supported by these results, we will represent the GGM in terms of height anomalies at points located on the Earth’s surface, and will use a data area which is \(5^\circ \) larger than the area of interest in Sects. 3, 5, and 6.

## 3 Model error as function of the data point density

To investigate the model error as a function of the data point density, we first needed to select a local model of the disturbing potential. Without loss of generality, we used a spherical radial basis function (SRBF) model. SRBFs have been used successfully in many studies of local gravity field and (quasi-)geoid modelling (e.g. Klees et al. 2008; Eicker 2008; Wittwer 2009; Bentel et al. 2013; Naeimi 2013; Slobbe 2013; Lin et al. 2014; Bentel and Schmidt 2016; Lieb et al. 2016; Bucha et al. 2016; Naeimi and Bouman 2017).

The following experimental setup was chosen. The disturbing potential was set equal to the regularised GOCO05s spherical harmonic model from degree 151–200. The area of interest was bounded by \(49^\circ \)–\(63^\circ \)N and \(6^\circ \)W–\(10^\circ \)E (i.e. the size is \(1500 \times 1000\) km). Noise-free height anomalies were generated on a Reuter grid with control parameter *N* located on the Earth’s surface. The latter was represented by a smoothed version of the General Bathymetric Chart of the Oceans 2008 (GEBCO_08) grid, a terrain model for ocean and land with a spatial resolution of \(30''\) (www.gebco.net). The local model of the disturbing potential comprised Poisson wavelets of order 3 (Holschneider et al. 2003), which were truncated at the maximum degree 200 of the disturbing potential. The poles of the Poisson wavelets were located at a constant depth beneath the data points. Though working with truncated Poisson wavelets is not necessary for the experiments of Sect. 5, it is a prerequisite when combining the GGM dataset with high-resolution local datasets as shown in (Klees et al. 2017). It ensures spectral consistency between the GGM dataset and its noise covariance matrix and the SRBF model of the disturbing potential.

We selected a number of Reuter grids with different control parameters ranging from \(N=201\) to \(N=350\). For each grid, we generated noise-free height anomalies and estimated the SRBF model parameters using ordinary least-squares. We always applied Tikhonov regularisation with a unit regularisation matrix. The estimated SRBF model of the disturbing potential was used to synthesise height anomalies on an equal-angular control grid of width \(27'00''\) covering the area of interest, and comprising 1085 grid points. The differences between them and the height anomalies directly synthesised from the spherical harmonic coefficients of the disturbing potential are referred to as “model errors”.
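Ordinary least-squares with Tikhonov regularisation and a unit regularisation matrix, as used in this experiment, amounts to adding a multiple of the identity to the normal matrix. The sketch below uses a small synthetic design matrix instead of the SRBF design matrix; `tikhonov_ols` and the variable names are hypothetical.

```python
import numpy as np


def tikhonov_ols(A: np.ndarray, d: np.ndarray, lam: float) -> np.ndarray:
    """Solve (A'A + lam * I) c = A'd for the coefficient vector c."""
    m = A.shape[1]
    normal_matrix = A.T @ A + lam * np.eye(m)
    return np.linalg.solve(normal_matrix, A.T @ d)


# Synthetic, noise-free data from known coefficients.
rng = np.random.default_rng(2)
A = rng.standard_normal((40, 10))
c_true = rng.standard_normal(10)
d = A @ c_true

c_unreg = tikhonov_ols(A, d, lam=0.0)   # reproduces c_true here
c_reg = tikhonov_ols(A, d, lam=10.0)    # shrunk towards zero
print(np.linalg.norm(c_unreg - c_true), np.linalg.norm(c_reg))
```

In the paper, the regularisation parameter and the depth of the SRBF poles were selected among a set of candidates so as to minimise the model error; the sketch does not attempt that search.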

Table 2 shows the statistics of the model errors as a function of the Reuter grid control parameter *N*. For each *N*, the statistics refer to a least-squares solution obtained for a depth of the Poisson wavelets and a regularisation parameter providing the smallest model error among a set of candidate depths and regularisation parameters. As expected, the model error decreases with increasing point density. What model error is acceptable depends on the impact of data noise on the estimated quasi-geoid model. An indication of the latter is obtained when propagating the GOCO05s noise covariance matrix into height anomalies. For GOCO05s complete to degree 200, the height anomaly noise standard deviations range from 1.6 to 2.7 cm over the area of interest. The maximum absolute model error should be significantly smaller than 1.6 cm. Table 2 shows that the choice \(N = 240\) provides a maximum absolute model error of 2 mm, i.e. a factor of 8 below the smallest height anomaly noise standard deviation. Therefore, we used \(N=240\) in the numerical experiments of Sect. 5.

Table 2: SRBF model error in terms of height anomalies (in units of cm) as a function of the Reuter grid control parameter *N*

| *N* | # Points | Distance (km) | Min | Max | Mean | RMS |
|---|---|---|---|---|---|---|
| 201 | 435 | 99 | \(-\,6.66\) | 7.24 | \(9.6 \cdot 10^{-3}\) | 1.77 |
| 210 | 470 | 95 | \(-\,3.28\) | 2.93 | \(3.5 \cdot 10^{-2}\) | 0.87 |
| 220 | 516 | 91 | \(-\,0.84\) | 0.68 | \(7.82 \cdot 10^{-3}\) | 0.20 |
| 240 | 617 | 83 | \(-\,0.20\) | 0.19 | \(-\,6.4 \cdot 10^{-4}\) | 0.04 |
| 260 | 729 | 77 | \(-\,0.05\) | 0.05 | \(-\,3.0 \cdot 10^{-4}\) | 0.01 |
| 280 | 834 | 72 | \(-\,0.03\) | 0.03 | \(-\,1.7 \cdot 10^{-4}\) | 0.01 |
| 300 | 958 | 67 | \(-\,0.03\) | 0.03 | \(-\,2.0 \cdot 10^{-6}\) | 0.01 |
| 350 | 1297 | 57 | \(-\,0.02\) | 0.02 | \(-\,4.8 \cdot 10^{-6}\) | 0.01 |

The condition number of the height anomaly noise covariance matrix for data on a Reuter grid with \(N=240\) is \(1.4 \cdot 10^{16}\) (cf. Table 1). A straightforward inversion of this matrix would imply the loss of all significant digits in IEEE 754 double precision arithmetic. Reducing the SRBF model error further would require an even denser grid of height anomalies, which further increases the condition number according to Fig. 1 and Table 1. For instance, using a Reuter grid with \(N=350\) reduces the model error to 0.2 mm, but increases the condition number of the height anomaly noise covariance matrix to \(4.0 \cdot 10^{34}\). How to deal with extremely ill-conditioned noise covariance matrices of height anomalies is the subject of Sect. 4.

Note that the model error depends on various settings such as the type of the SRBF, the grid used to locate the poles of the SRBFs and the data points, respectively, the area of interest, the extension of the parameterisation area beyond the data area, etc. Hence, each choice may lead to a different model error as function of the data point density.

In our experiments, the parameterisation area was chosen identical to the data area. Several studies suggest extending the parameterisation area beyond the data area (e.g. Naeimi 2013; Bentel et al. 2013a; Eicker et al. 2014; Bucha et al. 2016). This raises the question whether in this case the model error can be made small enough to avoid any oversampling, with the benefit of a condition number small enough to allow for a direct inversion without regularisation. In “Appendix A”, we present the results of a series of experiments designed to investigate this question. They reveal that when extending the parameterisation area beyond the data area, the model error statistics improve, though we still need to oversample by at least a factor of 1.2 to obtain a maximum absolute model error which is comparable to the one obtained without an extension of the parameterisation area.

Next to the experiments presented in “Appendix A”, we did a series of experiments with different types of SRBFs [Shannon kernel (Freeden et al. 1998) and point mass kernel (Hardy and Göpfert 1975)] and different point distributions to locate the poles of the SRBFs and the data points, respectively [Reuter grid, Fibonacci grid (Gonzalez 2010), triangle vertex grid (Eicker 2008)]. For each chosen setup, we could find parameter settings, which provide model error statistics as function of the data point density similar to the ones shown in Table 2. Importantly, we always needed to oversample by at least a factor of 1.2 to reduce the model errors to a level below the effect of data noise by at least a factor of 5. Moreover, the condition numbers of the corresponding data noise covariance matrices were identical to within a factor of 5.

## 4 Dealing with the ill-conditioned noise covariance matrix

In this study, we investigate three approaches to deal with the high condition number of the noise covariance matrix: (i) apply Tikhonov regularisation (Tikhonov 1963) to the ill-conditioned data noise covariance matrix and use the standard formula for the weighted least-squares estimator, (ii) use a formula of the weighted least-squares estimator, which does not require the computation of the inverse of the noise covariance matrix (Grafarend and Schaffrin 1993), and (iii) use Rao’s generalised least-squares estimator (Rao 1971, 1973, 1978).

The three estimators refer to the linear model \(E(\mathbf d) = \mathbf A\,\mathbf c\), \(D(\mathbf d) = \mathbf C = \sigma ^2\,\mathbf Q\), where *E* is the expectation operator, *D* is the dispersion operator, \(\mathbf d\) is the vector of height anomalies, \(\mathbf c\) is the vector of SRBF coefficients, \(\mathbf C\) is the height anomaly noise covariance matrix, \(\mathbf Q\) is the cofactor matrix, and \(\sigma ^2\) is the variance factor. An element \(A_{ji}\) of the design matrix \(\mathbf A\) is equal to \(\varPsi (x_j,z_i)\), where \(\varPsi \) is the SRBF, \(z_i\) is the coordinate vector of the i-th SRBF pole, and \(x_j\) is the coordinate vector of the j-th data point.

The SRBF is a series in the spherical harmonic degree *l*, truncated at *L*, the degree up to which the GGM is used when synthesising the data. In this series, \(\{\lambda _l: l=1 \ldots L\}\) are the Legendre coefficients of the SRBF with respect to the surface \(\sigma _R\) of a sphere of radius *R*, \(\hat{x}_j = {x_j \over |x_j|}\) and \(\hat{z}_i = {z_i \over |z_i|}\) are points on the unit sphere, and \(Q_l\) is the reproducing kernel of the space of spherical harmonics of degree *l*. For the Poisson wavelets of order 3, it is \(\lambda _l = l^3\).

### 4.1 Regularisation of the noise covariance matrix

In the following, *n* is the number of columns and rows of the matrix \(\mathbf Q\). Here, we use Tikhonov regularisation with a unit regularisation matrix. It is equivalent to an approximation \(\mathbf Q_{\hbox { tikh}}\) of \(\mathbf Q\), which is defined as \(\mathbf Q_{\hbox { tikh}} = \mathbf Q + \lambda ' \mathbf I\), i.e. it replaces the *n* singular values \(\sigma _i\) of \(\mathbf Q\) with \(\sigma _i + \lambda '\). Then, the spectral norm condition number reduces from \({\sigma _1 \over \sigma _n}\) for \(\mathbf Q\) to \({\sigma _1 + \lambda ' \over \sigma _n + \lambda '}\) for \(\mathbf Q_{\hbox { tikh}}\). For \(\sigma _n \ll \lambda ' \ll \sigma _1\), the latter is approximately \({\sigma _1 \over \lambda '}\), which is much smaller than \({\sigma _1 \over \sigma _n}\). The inverse of the regularised cofactor matrix is then computed as the Cayley inverse, i.e. the regular matrix inverse, of \(\mathbf Q_{\hbox { tikh}}\).
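The effect of Tikhonov regularisation with a unit regularisation matrix on the spectrum can be illustrated with a synthetic cofactor matrix whose singular values decay gradually without a gap. The sketch assumes the regularised matrix is \(\mathbf Q + \lambda ' \mathbf I\); the matrix, its spectrum, and the value of \(\lambda '\) are made up for illustration only.

```python
import numpy as np


def regularise_cofactor(Q: np.ndarray, lam_prime: float) -> np.ndarray:
    """Tikhonov regularisation with a unit regularisation matrix."""
    return Q + lam_prime * np.eye(Q.shape[0])


# Synthetic symmetric matrix with singular values from 1 down to 1e-12.
rng = np.random.default_rng(1)
n = 50
U, _ = np.linalg.qr(rng.standard_normal((n, n)))  # random orthogonal basis
sigma = np.logspace(0, -12, n)
Q = (U * sigma) @ U.T
Q = 0.5 * (Q + Q.T)

lam_prime = 1e-6
Q_tikh = regularise_cofactor(Q, lam_prime)

s = np.linalg.eigvalsh(Q_tikh)           # ascending eigenvalues
cond_tikh = s[-1] / s[0]
expected = (sigma[0] + lam_prime) / (sigma[-1] + lam_prime)
Q_tikh_inv = np.linalg.inv(Q_tikh)        # now a stable computation

print(f"cond(Q) = {sigma[0] / sigma[-1]:.1e}, cond(Q_tikh) = {cond_tikh:.3e}")
```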

### 4.2 Inversion-free weighted least-squares estimator

Here, *n* is the number of observations, *m* is the number of parameters, and \(\lambda _{\hbox { eff}}\) is referred to as the “effective regularisation parameter”. This choice of \(\lambda \) and \(\lambda '\) implies that the amount of regularisation applied to \(\mathbf A' \mathbf A\) when computing \(\mathbf B_{\hbox { reg}}\) and applied to \(\mathbf A \mathbf A' + \mathbf Q \mathbf B_{\hbox { reg}} \mathbf Q\) when solving the normal equations, respectively, is the same.

### 4.3 Least-squares solution according to Rao’s unified theory of least-squares

Note that the g-inverse \(\mathbf T^-\) does not need to be a g-inverse of \(\mathbf Q\) (Rao 1978). In the strictly rank-deficient case, it can be computed as the pseudo-inverse of a truncated singular value decomposition of \(\mathbf T\), where the truncation index *r* is identical to the rank of \(\mathbf T\), with \(r < n\). In our case, the choice of the truncation index is not straightforward as the singular value spectrum of \(\mathbf T\) gradually decreases to zero without any noticeable gap.

For a given truncation index *q*, we compute the g-inverse \(\mathbf T^-\) as \(\mathbf T^- = \mathbf U_q\,\varvec{\Sigma }_q^{-1}\,\mathbf U_q'\), where \(\varvec{\Sigma }_q\) is the diagonal matrix of the *q* largest singular values of \(\mathbf T\), and \(\mathbf U_q\) is the associated matrix of singular vectors.
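A g-inverse based on a truncated singular value decomposition can be sketched as follows for a symmetric positive semi-definite matrix. The defect measure `ginverse_defect` checks the defining property \(\mathbf T \mathbf T^- \mathbf T = \mathbf T\); it is one plausible diagnostic for choosing the truncation index and is not necessarily identical to the relative error \(\kappa \) used in the paper. All names and the test matrix are hypothetical.

```python
import numpy as np


def tsvd_ginverse(T: np.ndarray, q: int) -> np.ndarray:
    """g-inverse of a symmetric PSD matrix from its q largest singular values."""
    U, s, _ = np.linalg.svd(T)
    U_q, s_q = U[:, :q], s[:q]
    return (U_q / s_q) @ U_q.T          # U_q diag(1/s_q) U_q'


def ginverse_defect(T: np.ndarray, T_minus: np.ndarray) -> float:
    """Relative violation of T T^- T = T in the spectral norm."""
    return np.linalg.norm(T @ T_minus @ T - T, 2) / np.linalg.norm(T, 2)


# Synthetic rank-deficient matrix: n = 30, rank r = 10.
rng = np.random.default_rng(4)
n, r = 30, 10
B = rng.standard_normal((n, r))
T = B @ B.T

for q in (r - 1, r):
    print(q, f"{ginverse_defect(T, tsvd_ginverse(T, q)):.2e}")
```

For an exactly rank-deficient matrix the defect drops to rounding level at \(q = r\); for a gradually decaying spectrum, as in the paper, no such sharp drop can be expected a priori.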

### 4.4 Dispersion matrices

## 5 Experimental setup, quality assessment, and parameter choice rule

### 5.1 Experimental setup

The performance of the aforementioned methods was investigated using numerical experiments. We used the experimental setup of Sect. 3. The height anomalies were synthesised on a Reuter grid with \(N=240\), comprising 617 data points over the data area. The condition number of the noise covariance matrix \(\mathbf C\) is \(1.4 \cdot 10^{16}\) (cf. Sect. 3). Contrary to the data used in Sect. 3, we added zero-mean Gaussian noise to the noise-free height anomalies. The noise was generated using an SVD of the height anomaly noise covariance matrix \(\mathbf C\). The noisy height anomalies form the elements of the vector \(\mathbf d\).
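Generating correlated Gaussian noise from an SVD of a covariance matrix can be sketched as below. The text does not spell out the recipe, so the factorisation \(\mathbf C = \mathbf B \mathbf B'\) with \(\mathbf B = \mathbf U \mathbf S^{1/2}\) is an assumption about how such noise is typically drawn; the covariance matrix in the example is synthetic and the function names are hypothetical.

```python
import numpy as np


def noise_factor_from_svd(C: np.ndarray) -> np.ndarray:
    """Return B with B B' = C for a symmetric PSD matrix C."""
    U, s, _ = np.linalg.svd(C)
    return U * np.sqrt(s)               # scale column k by sqrt(s_k)


def draw_noise(C: np.ndarray, rng: np.random.Generator, n_samples: int = 1) -> np.ndarray:
    """Draw zero-mean Gaussian noise vectors (as columns) with covariance C."""
    B = noise_factor_from_svd(C)
    z = rng.standard_normal((C.shape[0], n_samples))
    return B @ z


# Synthetic covariance with exponentially decaying correlations.
idx = np.arange(25)
C = 4.0 * np.exp(-np.abs(idx[:, None] - idx[None, :]) / 3.0)

rng = np.random.default_rng(5)
B = noise_factor_from_svd(C)
noise = draw_noise(C, rng, n_samples=3)
print(noise.shape)
```

Unlike a Cholesky factorisation, which requires a numerically positive definite matrix, this factorisation remains computable when the smallest singular values are at rounding level.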

### 5.2 Quality assessment

1. From every least-squares estimate \(\hat{\mathbf c}\) of the SRBF coefficients, we synthesise height anomalies on a control grid, \(\hat{\mathbf d}_s = \mathbf A_s\,\hat{\mathbf c}\). The true height anomalies, \(\mathbf d_s\), are synthesised from the spherical harmonic model of the disturbing potential. As a quality measure of a least-squares estimate \(\hat{\mathbf c}\), we use the RMS of the errors \(\{ \hat{d}_{s,i}-d_{s,i}: i=1 \ldots q\}\), i.e.

   $$\begin{aligned} \varepsilon _{\hbox { RMS} }:= { \Vert \hat{\mathbf d}_s - \mathbf d_s\Vert \over \sqrt{q} }. \end{aligned}$$ (25)

   For a good least-squares estimate \(\hat{\mathbf c}\), \(\varepsilon _{\hbox { RMS}}\) should not differ much from the noise SDs of the height anomalies at the control points, which are computed by covariance propagation from the full noise covariance matrix of spherical harmonic coefficients of the unregularised GOCO05s model complete to degree 200. Figure 2 shows a spatial rendition of the noise variances over the area of interest. The noise standard deviations range from 1.6 to 2.7 cm (cf. Sect. 3) and have a mean of \(2.3 \pm 0.2\) cm.
2. We compute the dispersion \(D(\hat{\mathbf d}_s)\) and compare it with the dispersion \(D(\mathbf d_s)\). The latter is computed by covariance propagation from the full noise covariance matrix of spherical harmonic coefficients of the unregularised GOCO05s model. As a quality measure, we use the relative error

   $$\begin{aligned} \varepsilon _{\hbox { rel}}:= { \Vert D(\hat{\mathbf d}_s) - D(\mathbf d_s)\Vert \over \Vert D(\mathbf d_s)\Vert }, \end{aligned}$$ (26)

   where \(\Vert \cdot \Vert \) is the spectral norm.
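The two quality measures of Eqs. (25) and (26) translate directly into code. The function names are hypothetical, and the inputs in the example are toy values chosen so that the results can be checked by hand.

```python
import numpy as np


def eps_rms(d_hat_s, d_s) -> float:
    """RMS of the errors at the q control points, Eq. (25)."""
    d_hat_s = np.asarray(d_hat_s, dtype=float)
    d_s = np.asarray(d_s, dtype=float)
    return float(np.linalg.norm(d_hat_s - d_s) / np.sqrt(d_s.size))


def eps_rel(D_hat: np.ndarray, D_ref: np.ndarray) -> float:
    """Relative error of a dispersion matrix in the spectral norm, Eq. (26)."""
    return float(np.linalg.norm(D_hat - D_ref, 2) / np.linalg.norm(D_ref, 2))


# Toy values: errors (3, 4) at two control points, and a dispersion
# matrix that is twice the reference.
print(eps_rms([3.0, 4.0], [0.0, 0.0]))
print(eps_rel(2.0 * np.eye(3), np.eye(3)))
```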

### 5.3 Parameter choice rule

Each of the three methods presented in Sect. 4 requires the choice of a suitable regularisation parameter \(\lambda \) to stabilise the normal matrix. Moreover, each method requires fixing a second parameter, i.e. \(\lambda '\) of Eq. (5) for WLS-reg, \(\lambda '\) of Eq. (14) for WLS, and *r* of Eq. (20) for GLS.

When computing the WLS-reg estimate (cf. Sect. 4.1), \(\lambda '\) of Eq. (5) is the regularisation parameter of the cofactor matrix. This parameter has an impact on the condition number of the normal matrix of Eq. (9). Therefore, we need to search a two-dimensional parameter space to find suitable values for the two regularisation parameters \(\lambda \) and \(\lambda '\). When computing the WLS estimate (cf. Sect. 4.2), the task of choosing the two regularisation parameters \(\lambda \) and \(\lambda '\) reduces to finding the single parameter \(\lambda _{\hbox { eff}}\) of Eq. (15). The computation of the GLS estimate (cf. Sect. 4.3) requires fixing \(\lambda \) and *r*. For the latter, we use the relative error \(\kappa \) of Eq. (20). This makes the search for *r* independent of the search for \(\lambda \). Therefore, fixing the two parameters \(\lambda \) and *r* reduces to two one-dimensional problems.

Finding suitable values for these parameters requires a parameter choice rule. Standard parameter choice rules such as the *L*-curve (Hansen and O’Leary 1993), generalised cross-validation (Wahba 1990) or variance component estimation (Koch and Kusche 2002) cannot be used here, as there are as many data as SRBF coefficients. In this study, we considered the quality measures \(\varepsilon _{\hbox { RMS}}\) of Eq. (25) and \(\varepsilon _{\hbox { rel}}\) of Eq. (26) as potential candidates. Numerical experiments revealed that \(\varepsilon _{\hbox { rel}}\) is more sensitive to the choice of these parameters than \(\varepsilon _{\hbox { RMS}}\). Therefore, we used the minimum of \(\varepsilon _{\hbox { rel}}\) as the parameter choice rule. In some cases, this choice also provided the smallest value of \(\varepsilon _{\hbox { RMS}}\). If not, we found that the parameter(s) which minimised \(\varepsilon _{\hbox { rel}}\) provided a value of \(\varepsilon _{\hbox { RMS}}\) which did not differ by more than 0.05 mm from the smallest \(\varepsilon _{\hbox { RMS}}\).

## 6 Results and discussion

Table 3: The WLS-reg, WLS, and GLS estimates, which minimise \(\varepsilon _{\hbox { rel}}\), Eq. (26)

| Estimator | \(\varepsilon _{\hbox { RMS}}\) (cm) | \(\varepsilon _{\hbox { rel}}\) | \(e_{\hbox { RMS}}\) (cm) | \(\lambda _{\hbox { eff}}\) | *RD* |
|---|---|---|---|---|---|
| WLS-reg, \(\lambda '_{\hbox { eff}} = 1.9 \cdot 10^{-2}\) | 2.71 | \(1.0 \cdot 10^{-3}\) | 0.013 | \(1.0 \cdot 10^{-10}\) | 0.852 |
| WLS | 2.72 | \(9.8 \cdot 10^{-4}\) | 0.010 | \(1.6 \cdot 10^{-12}\) | 0.866 |
| GLS, \(r=492\) | 2.72 | \(1.2 \cdot 10^{-3}\) | 0.082 | \(1.0 \cdot 10^{-9}\) | 0.797 |
| WLS-diag | 2.72 | \(9.5 \cdot 10^{-1}\) | 0.012 | \(1.0 \cdot 10^{-11}\) | 0.853 |

The last column of Table 3 shows the model resolution degree (*RD*), which is defined as \(RD = {\hbox {trace}(\mathbf R) \over n}\), where \(\mathbf R\) is the resolution matrix (i.e. the matrix \(\mathbf S \mathbf A\), if the least-squares estimate is \(\hat{\mathbf c} = \mathbf S\,\mathbf d\)), and *n* is the number of parameters (e.g. Aster et al. 2013). *RD* is an indication of the contribution of the data to the estimated SRBF coefficients in the presence of regularisation; the larger *RD*, the higher the contribution of the data to the estimated SRBF coefficients. According to Table 3, WLS-reg and WLS perform the same with a model resolution degree of 85–87%, whereas the model resolution degree for GLS is smaller, about 80%. The somewhat lower model resolution degree for GLS may be due to the choice of the g-inverse of the matrix \(\mathbf T\) of Eq. (17), which is a challenging task due to the gradually decreasing singular value spectrum of this matrix.
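A model resolution degree can be computed from any linear estimator \(\hat{\mathbf c} = \mathbf S\,\mathbf d\). The sketch below assumes the common definition as the trace of the resolution matrix \(\mathbf S \mathbf A\) divided by the number of parameters, and uses a Tikhonov-regularised ordinary least-squares estimator on synthetic data as a stand-in for the estimators of Sect. 4. All names are hypothetical.

```python
import numpy as np


def resolution_degree(S: np.ndarray, A: np.ndarray) -> float:
    """Trace of the resolution matrix R = S A divided by the number of
    parameters (assumed definition of the model resolution degree)."""
    R = S @ A
    return float(np.trace(R) / R.shape[0])


def tikhonov_estimator(A: np.ndarray, lam: float) -> np.ndarray:
    """Matrix S of the estimator c_hat = S d with S = (A'A + lam I)^-1 A'."""
    m = A.shape[1]
    return np.linalg.solve(A.T @ A + lam * np.eye(m), A.T)


rng = np.random.default_rng(6)
A = rng.standard_normal((40, 10))

rd_unreg = resolution_degree(tikhonov_estimator(A, 0.0), A)
rd_reg = resolution_degree(tikhonov_estimator(A, 10.0), A)
print(f"RD without regularisation: {rd_unreg:.3f}, with regularisation: {rd_reg:.3f}")
```

Without regularisation the resolution matrix is the identity and the degree equals one; regularisation pushes it below one.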

Figure 4 shows \(\varepsilon _{\hbox { rel}}\) as a function of \(\lambda _{\hbox { eff}}\) for the WLS estimate. The minimum is attained at \(\lambda _{\hbox { eff}} = 1.6 \cdot 10^{-12}\). However, the curve is very flat over a broad range of effective regularisation parameters between about \(10^{-12}\) and \(10^{-8}\). The corresponding least-squares estimates of the SRBF coefficients are very close to each other, and the estimates do not differ by more than 0.02 cm in terms of height anomalies over the area of interest. We consider this a positive result, as it makes it easy to find a suitable regularisation parameter.

The results for the GLS estimate are shown in Fig. 5. The left plot of Fig. 5 shows the relative error \(\kappa \) of Eq. (20) as a function of the truncation index *q*. A clear minimum is attained at \(q = r = 492\), where the relative error is \(\kappa (492) = 3.2 \cdot 10^{-9}\). This indicates that finding a good g-inverse \(\mathbf T^-\) using Eq. (19) may be possible with a truncated singular value decomposition despite the gradually decreasing singular value spectrum of \(\mathbf T\). Whether this also applies to other datasets and areas of interest remains open, and may be considered a weak point of the GLS estimator. The right plot of Fig. 5 shows \(\varepsilon _{\hbox { rel}}\) as a function of the effective regularisation parameter \(\lambda _{\hbox { eff}}\), which is used to regularise the normal matrix. The minimum is attained at \(\lambda _{\hbox { eff}} = 1 \cdot 10^{-9}\). Similar to what was found for the WLS estimate, the curve is very flat around the minimum over a broad range of effective regularisation parameters from \(10^{-10}\) to \(10^{-3}\). Again, the corresponding least-squares estimates of the SRBF coefficients are almost identical. In terms of height anomalies, the solutions do not differ by more than 1 mm.
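The search for a truncation index can be sketched as follows. Eqs. (19) and (20) are not repeated in this section, so the sketch assumes that the g-inverse is built from the *q* largest singular triplets and that the error measure is the relative violation of the defining property \(\mathbf T \mathbf T^- \mathbf T = \mathbf T\) in the Frobenius norm. The helper names and the toy matrix with a gradually decaying spectrum are hypothetical.

```python
import numpy as np

def tsvd_ginverse(T, q):
    """g-inverse of T built from the q largest singular triplets."""
    U, s, Vt = np.linalg.svd(T)
    return (Vt[:q].T / s[:q]) @ U[:, :q].T   # V_q diag(1/s_q) U_q^T

def ginverse_error(T, T_minus):
    """Relative violation of T T^- T = T (assumed stand-in for kappa)."""
    return np.linalg.norm(T @ T_minus @ T - T) / np.linalg.norm(T)

# Toy symmetric matrix whose singular values decay gradually from 1 to 1e-20
rng = np.random.default_rng(1)
n = 60
Q, _ = np.linalg.qr(rng.standard_normal((n, n)))
s_true = np.logspace(0, -20, n)
T = (Q * s_true) @ Q.T                       # Q diag(s_true) Q^T

# Scan all truncation indices and keep the one with the smallest error
errors = np.array([ginverse_error(T, tsvd_ginverse(T, q))
                   for q in range(1, n + 1)])
q_best = int(np.argmin(errors)) + 1
print(q_best, errors[q_best - 1])
```

In this toy case the error first decreases with *q*, because fewer singular values are discarded, and then increases again once the reciprocals of very small singular values amplify rounding errors. This produces an interior minimum even though the spectrum has no gap.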

Table 3 shows that the smallest value of \(\varepsilon _{\hbox { rel}}\) is about \(10^{-3}\) for the three methods investigated in this paper. Hence, the noise covariance matrix of the height anomalies at the control grid always agrees very well with the noise covariance matrix directly propagated from the noise covariance matrix of the spherical harmonic model of the disturbing potential. The same applies to the fit of each solution to the control data; the RMS misfit is \(\varepsilon _{\hbox { RMS}} = 2.7\) cm for all three methods. This is at the upper limit of the height anomaly noise standard deviations directly propagated from the noise covariance matrix of GOCO05s over the area of interest (which range from 1.6 to 2.7 cm, cf. Fig. 2). The estimator that ignores the data noise covariances (WLS-diag in Table 3) provides the same RMS misfit of 2.7 cm. However, its error \(\varepsilon _{ \text{ rel }}\) is 0.95, i.e. almost three orders of magnitude larger than for the other three estimators. This is in line with what we expect from theory: when applying weighted least-squares to a single dataset, errors in the data noise covariance matrix have a minor effect on the least-squares estimate, but a significant effect on the dispersion of linear functionals of the estimate.
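The theoretical expectation can be made explicit in an idealised special case. Assume, as a simplification not made in the experiment itself, that the design matrix \(\mathbf A\) is square and invertible and that no regularisation is applied. Let \(\mathbf C\) denote the true data noise covariance matrix and \(\tilde{\mathbf C}\) the one actually used for weighting (for instance its diagonal part); both symbols are introduced here for illustration only.

```latex
\begin{aligned}
\hat{\mathbf c}
  &= \left(\mathbf A^T \tilde{\mathbf C}^{-1} \mathbf A\right)^{-1}
     \mathbf A^T \tilde{\mathbf C}^{-1} \mathbf d
   = \mathbf A^{-1} \mathbf d, \\
\text{formal dispersion:}\quad
  &\left(\mathbf A^T \tilde{\mathbf C}^{-1} \mathbf A\right)^{-1}
   = \mathbf A^{-1}\, \tilde{\mathbf C}\, \mathbf A^{-T}, \\
\text{true dispersion:}\quad
  &\mathbf A^{-1}\, \mathbf C\, \mathbf A^{-T}.
\end{aligned}
```

In this special case the estimate does not depend on \(\tilde{\mathbf C}\) at all, whereas the formal dispersion inherits every error in \(\tilde{\mathbf C}\). With regularisation the estimate is no longer exactly independent of the weights, but the same asymmetry is consistent with the nearly identical RMS misfits and the very different values of \(\varepsilon _{\hbox { rel}}\) reported above.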

## 7 Summary and conclusions

In this study, we investigated three methods to deal with the high condition number of the noise covariance matrix of a state-of-the-art GGM after propagation into gravity field functionals over a local data area. This problem is relevant when estimating a local model of the disturbing potential considering all available datasets as being noisy.

We showed that the noise covariance matrix of height anomalies propagated from the full noise covariance matrix of GOCO05s has a gradually decreasing singular value spectrum without any noticeable gap and a high condition number. The latter depends, among other things, on the density of the points where the height anomalies are synthesised. The density has to be chosen high enough to guarantee that the error of the local SRBF model of the disturbing potential is negligible compared to the effect of the noise in the data. We showed that this requires a point density that is higher than the maximum degree of the GGM suggests. The associated height anomaly noise covariance matrix had a condition number larger than the reciprocal value of the relative rounding error unit of IEEE 754 double precision arithmetic. Therefore, a straightforward computation of the weight matrix would imply the loss of all significant digits.
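The comparison with the rounding error unit can be illustrated on a toy matrix. The sketch below builds a hypothetical covariance matrix with a prescribed, gradually decaying spectrum (it is not derived from GOCO05s), compares its condition number with the reciprocal of the double-precision machine epsilon, and shows how adding a multiple of the identity matrix, as in Tikhonov regularisation of the covariance matrix, bounds the condition number. The value of `alpha` is an arbitrary choice for illustration.

```python
import numpy as np

# Machine epsilon of IEEE 754 double precision (about 2.2e-16);
# the unit roundoff is half of this value.
eps = np.finfo(np.float64).eps

# Toy covariance matrix with singular values decaying from 1 to 1e-20
rng = np.random.default_rng(2)
n = 50
Q, _ = np.linalg.qr(rng.standard_normal((n, n)))
spectrum = np.logspace(0, -20, n)
C = (Q * spectrum) @ Q.T                     # Q diag(spectrum) Q^T

cond_true = spectrum[0] / spectrum[-1]       # 1e20 by construction
exceeds = cond_true > 1.0 / eps              # beyond double-precision range

# The numerically computed condition number is itself unreliable here,
# because the smallest singular values are lost in rounding noise.
cond_computed = np.linalg.cond(C)

# Regularised covariance: condition number is about (1 + alpha) / alpha
alpha = 1e-6
cond_reg = np.linalg.cond(C + alpha * np.eye(n))
print(exceeds, cond_computed, cond_reg)
```

The regularised matrix can be inverted safely in double precision, but at the price of modifying the noise model, which is why the choice of the regularisation parameter matters for the resulting estimate and its dispersion matrix.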

SRBF model error in terms of height anomalies (in units of cm) as a function of the extension of the parameterisation area beyond the data area

Extension | # SRBFs | # Data points | Min (cm) | Max (cm) | Mean (cm) | SD (cm) |
---|---|---|---|---|---|---|
\(0^\circ \) | 435 | 435 | \(-\,6.05\) | 5.85 | 0.05 | 2.20 |
\(1^\circ \) | 503 | 435 | \(-\,6.44\) | 6.07 | 0.05 | 2.07 |
\(2^\circ \) | 576 | 435 | \(-\,6.51\) | 5.70 | 0.03 | 2.29 |
\(3^\circ \) | 650 | 435 | \(-\,6.35\) | 6.96 | 0.01 | 2.36 |
\(4^\circ \) | 761 | 435 | \(-\,6.51\) | 6.32 | 0.00 | 2.34 |
\(5^\circ \) | 850 | 435 | \(-\,6.66\) | 6.06 | 0.01 | 2.32 |

We showed that all three methods provide least-squares estimates of the SRBF coefficients which were identical within noise. Moreover, the dispersion matrices of the estimated SRBF coefficients and of height anomalies at a set of control points agreed very well with each other and with the height anomaly noise covariance matrix directly propagated from the full noise covariance matrix of the spherical harmonic coefficients of the GGM. A prerequisite was that the two regularisation parameters involved in each of the three methods were chosen properly. We demonstrated that a parameter choice rule which uses the dispersion matrix of height anomalies at a control dataset allowed such a choice.

Among the three methods, we prefer the inversion-free weighted least-squares estimator. We showed that the choice of the two regularisation parameters can be reduced successfully to the choice of a single parameter, and the least-squares estimate and its dispersion matrix appeared to be quite robust against this choice in the numerical experiments. The fact that the inversion-free weighted least-squares estimator requires the solution of a system of linear equations of a size equal to the number of data does not pose numerical problems in real applications. We found that the weak point of Rao’s generalised least-squares estimator is the need to compute a g-inverse of a matrix with a gradually decreasing singular value spectrum without any noticeable gap. In this study, we computed such a g-inverse successfully with a truncated singular value decomposition. However, whether this applies to situations other than those considered in this study is an open question. Using Tikhonov regularisation of the data noise covariance matrix in combination with the standard formula for the weighted least-squares estimator required an exceptionally strong regularisation of the data noise covariance matrix to obtain a good least-squares estimate of the SRBF coefficients and an accurate dispersion matrix of the estimated SRBF coefficients. We found that the major drawback of this method is the need to search a two-dimensional space to find optimal values of the two regularisation parameters.
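The idea behind an inversion-free formulation can be sketched with a standard matrix identity, \((\mathbf A^T \mathbf C^{-1} \mathbf A + \lambda \mathbf I)^{-1} \mathbf A^T \mathbf C^{-1} = \mathbf A^T (\mathbf A \mathbf A^T + \lambda \mathbf C)^{-1}\), whose right-hand side uses \(\mathbf C\) only and leads to a linear system of a size equal to the number of data. The sketch assumes an identity regularisation matrix and a single parameter `lam`; it is not claimed to reproduce the exact formula used in this study. The function names and the well-conditioned toy covariance matrix are hypothetical and serve only to check that both forms agree.

```python
import numpy as np

def wls_standard(A, C, d, lam):
    """Regularised WLS via the normal equations; requires solves with C."""
    n = A.shape[1]
    Ci_A = np.linalg.solve(C, A)             # C^{-1} A
    Ci_d = np.linalg.solve(C, d)             # C^{-1} d
    return np.linalg.solve(A.T @ Ci_A + lam * np.eye(n), A.T @ Ci_d)

def wls_inversion_free(A, C, d, lam):
    """Same estimate from an m-by-m system in which C is never inverted."""
    return A.T @ np.linalg.solve(A @ A.T + lam * C, d)

# Toy problem (hypothetical): 40 data, 30 parameters
rng = np.random.default_rng(3)
m, n = 40, 30
A = rng.standard_normal((m, n))
B = rng.standard_normal((m, m))
C = B @ B.T / m + 0.1 * np.eye(m)            # well-conditioned toy covariance
d = rng.standard_normal(m)
lam = 1e-2

c_std = wls_standard(A, C, d, lam)
c_free = wls_inversion_free(A, C, d, lam)
print(np.max(np.abs(c_std - c_free)))
```

For a well-conditioned covariance matrix both routes give the same estimate up to rounding. The practical difference appears when \(\mathbf C\) is ill-conditioned, because the second form remains well defined without any regularisation of \(\mathbf C\) itself.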

## Notes

### Acknowledgements

This study was performed in the framework of the Netherlands Vertical Reference Frame (NEVREF) project, funded by the Netherlands Technology Foundation STW. This support is gratefully acknowledged. We also want to thank T Mayer-Gürr for providing us with two GOCO05s spherical harmonic models including the full noise covariance matrices.

## References

- Aster RC, Borchers B, Thurber CH (2013) Parameter estimation and inverse problems, 2nd edn. Academic Press, Amsterdam
- Bentel K, Schmidt M, Gerlach C (2013a) Different radial basis functions and their applicability for local gravity field representation on the sphere. Int J Geomath 4:67–96. https://doi.org/10.1007/s13137-012-0046-1
- Bentel K, Schmidt M (2016) Combining different types of gravity observations in regional gravity modeling in spherical radial basis functions. In: Sneeuw N, Novak P, Crespi M, Sanso F (eds) VIII Hotine-Marussi symposium on mathematical geodesy, IAG symposia, vol 142, pp 115–120. https://doi.org/10.1007/1345_2015_2
- Bucha B, Bezděk A, Sebera J, Janák J (2015) Global and regional gravity field determination from GOCE kinematic orbit by means of spherical radial basis functions. Surv Geophys 36:773–801
- Bucha B, Janák J, Papčo J, Bezděk A (2016) High-resolution regional gravity field modelling in a mountainous area from terrestrial gravity data. Geophys J Int 207:949–966
- Chambodut A, Panet I, Mandea M, Diament M, Holschneider M, Jamet O (2005) Wavelet frames: an alternative to spherical harmonic representation of potential fields. Geophys J Int 163:875–899
- Eicker A (2008) Gravity field refinement by radial basis functions from in-situ satellite data. PhD thesis, Institut für Geodäsie und Geoinformation der Universität Bonn, D98
- Eicker A, Schall J, Kusche J (2014) Regional gravity modelling from spaceborne data: case studies with GOCE. Geophys J Int 196:1431–1440
- Farahani HH, Ditmar P, Klees R, Liu X, Zhao Q, Guo J (2013) The static gravity field model DGM-1S from GRACE and GOCE data: computation, validation and an analysis of GOCE mission’s added value. J Geod 87:843–867. https://doi.org/10.1007/s00190-013-0650-3
- Förstner W, Moonen B (1999) A metric for covariance matrices. In: Krumm VS, Schwarze S (1999) Quo vadis geodesia...? Festschrift for Erik W Grafarend on the occasion of his 60th birthday, TR Dept of Geodesy and Geoinformatics, Stuttgart University
- Freeden W, Gervens T, Schreiner M (1998) Constructive approximation on the sphere. Oxford Science Publication, Clarendon Press
- Gonzalez A (2010) Measurement of areas on a sphere using Fibonacci and latitude-longitude lattices. Math Geosci 42:49–64. https://doi.org/10.1007/s11004-009-9257-x
- Grafarend EW, Schaffrin B (1993) Ausgleichungsrechnung in linearen Modellen. B.I. Wissenschaftsverlag, Mannheim
- Hansen PC, O’Leary DP (1993) The use of the L-curve in the regularisation of discrete ill-posed problems. SIAM J Sci Comput 14:1487–1503
- Hardy RL, Göpfert WM (1975) Least squares prediction of gravity anomalies, geoidal undulations, and deflections of the vertical with multiquadric harmonic functions. Geophys Res Lett 2:423–426
- Holschneider M, Chambodut A, Mandea M (2003) From global to regional analysis of the magnetic field on the sphere using wavelet frames. Phys Earth Planet Inter 135:107–124
- Klees R, Tenzer R, Prutkin I, Wittwer T (2008) A data-driven approach to local gravity field modelling using spherical radial basis functions. J Geod 82:457–471. https://doi.org/10.1007/s00190-007-0196-3
- Klees R, Slobbe DC, Farahani HH (2017) A methodology for least-squares local quasi-geoid modelling using a noisy satellite-only gravity field model. J Geod. https://doi.org/10.1007/s00190-017-1076-0
- Koch KR, Kusche J (2002) Regularization of geopotential determination from satellite data by variance components. J Geod 76:259–268
- Lieb V, Schmidt M, Dettmering D, Börger K (2016) Combination of various observation techniques for local modeling of the gravity field. J Geophys Res Solid Earth. https://doi.org/10.1002/2015JB012586
- Lin M, Denker H, Müller J (2014) Local gravity field modelling using free-positioned point masses. Stud Geophys Geod 58:207–226. https://doi.org/10.1007/s11200-013-1145-7
- Mayer-Gürr T, Kvas A, Klinger B, Maier A (2015) The new combined satellite only model GOCO05s. EGU General Assembly 2015, Vienna, Austria. https://doi.org/10.13140/RG.2.1.4688.6807
- Naeimi M (2013) Inversion of satellite gravity data using spherical radial base functions. Doctoral Dissertation, Leibniz University Hannover, Deutsche Geodätische Kommission, Reihe C, Heft Nr. 711, p 130
- Naeimi M, Bouman J (2017) Contribution of the GOCE gradiometer components to regional gravity solutions. Geophys J Int 209:559–569
- Rao CR (1971) Unified theory of linear estimation. Indian J Stat Ser A (1961–2002) 33(4):371–394
- Rao CR (1973) Unified theory of least squares. Commun Stat Part A Theory Methods 1:1–8
- Rao CR (1978) Least squares theory for possibly singular models. Can J Stat 6:19–23
- Reuter R (1982) Über Integralformeln der Einheitssphäre und harmonische Splinefunktionen. Veröff Geod Inst RWTH Aachen 33:1982
- Schachtschneider R, Holschneider M, Mandea M (2010) Error distribution in regional inversion of potential field data. Geophys J Int 181:1428–1440
- Schmidt M, Fengler M, Mayer-Gürr T, Eicker A, Kusche J, Sánchez L, Han SC (2007) Regional gravity modelling in terms of spherical base functions. J Geod 81:17–38
- Schuh WD, Müller S, Brockmann JM (2015) Completion of band-limited data sets on the sphere. In: Kutterer H, Seitz F, Alkhatib H, Schmidt M (eds) The 1st international workshop on the quality of geodetic observations and monitoring systems (QuGOMS’11), IAG symposia, vol 140. Springer, Berlin. https://doi.org/10.1007/978-3-319-10828-5_25
- Sjöberg LE (1980) Least squares combination of satellite harmonics and integral formulas in Physical Geodesy. Gerlands Beiträge zur Geophysik 89:371–377
- Sjöberg LE (1981) Least squares combination of satellite and terrestrial data in physical geodesy. Ann Geophys 37:25–30
- Sjöberg LE (2005) A local least-squares modification of Stokes’ formula. Stud Geophys Geod 49:23–30
- Sjöberg LE (2011) Local least squares spectral filtering and combination by harmonic functions on the sphere. J Geod Sci 1:355–360. https://doi.org/10.2478/v10156-011-0015-x
- Slobbe DC (2013) Roadmap to a mutually consistent set of offshore vertical reference frames. PhD thesis, Delft University of Technology, p 233. https://doi.org/10.4233/uuid:68e4e599-51ab-40df-8fb4-918f5f54a453
- Tikhonov AN (1963) Solution of incorrectly formulated problems and the regularisation method. Dokl Akad Nauk SSSR 151:501–504 = Soviet Math Dokl 4:1035–1038
- Wahba G (1990) Spline models for observational data. SIAM CBMS-NSF regional conference series in applied mathematics, vol 59
- Wenzel HG (1981) Zur Geoidbestimmung durch Kombination von Schwereanomalien und einem Kugelfunktionsmodell mit Hilfe von Integralformeln. Z Vermess 106:102–111
- Wittwer T (2009) Local gravity field modelling with radial basis functions. Doctoral Dissertation, Delft University of Technology, Delft, The Netherlands, p 191

## Copyright information

**Open Access** This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made.