Background

High-resolution regional gravity field recovery is of considerable importance not only for surveying and mapping, but also for research fields, such as oceanography (understanding ocean circulation and currents), geophysics (investigating the structure of seismic activities and the lithosphere), and geodynamics (Kuroishi 2009; Panet et al. 2011; Shih et al. 2015).

Typically, middle- and short-wavelength gravity field signals down to a few kilometers are extracted from high-resolution ground-based measurements, e.g., terrestrial and shipborne gravity data, which are only available in geographically limited regions (Wang et al. 2012; Odera and Fukuda 2014; Lieb et al. 2016). In contrast, long-wavelength signals from tens of kilometers or larger are often recovered using global geopotential models (GGMs) derived from satellite observations. Over the last 10 years, launches of the Gravity Field and Climate Experiment (GRACE) (Tapley et al. 2004) and Gravity Field and Steady-State Ocean Circulation Explorer (GOCE) (Rummel et al. 2002) missions have greatly contributed to improving the spatial resolution and accuracy of GGMs. Moreover, developing various observational techniques, e.g., GPS, airborne gravimetric measurements, and satellite altimetry missions, can further improve regional gravity fields (Hwang et al. 2006; Jiang and Wang 2016; Wu and Luo 2016). Combined, these data sets form a solid basis for modeling high-resolution and high-quality regional gravity fields. However, these data have heterogeneous spatial coverage and resolutions, various error characteristics, and different spectral contents, which make their use an open issue. Thus, the aim of this study is to adopt an approach that combines heterogeneous data and extracts different spectral contents from various observational techniques for regional gravity field recovery.

The Stokes/Molodensky integral makes it difficult to combine heterogeneous data, while the least squares collocation (LSC) is numerically inefficient in managing cases that involve a large number of point-wise data (Wittwer 2009). The new gravity field described in this study is parameterized using Poisson wavelets. Poisson wavelets are radially symmetric basis functions that have localizing properties in both the spatial and frequency domains, which have been used extensively in regional gravity field modeling and potential field analysis (Tenzer and Klees 2008; Hayn et al. 2012; Bentel et al. 2013).

We also investigate several aspects that affect the solution quality derived from Poisson wavelets. To begin with, as heterogeneous data have different spatial coverage and resolutions, the derived least squares system from Poisson wavelets is typically ill-conditioned, where regularization is mandatory for deriving reliable results (Wittwer 2009). One of the key points that affect the quality of regularization is the choice of regularization matrices (Chambodut et al. 2005). Unlike using diagonal regularization matrices in global gravity field modeling from spherical harmonics (Kusche and Klees 2002), the regularization matrices derived from various constraints in regional scale are no longer entirely diagonal. The choice of regularization matrices may affect the solution quality, which is investigated in this study.

Moreover, mainly due to commission errors in the GGMs and uncorrected systematic errors in the data, the computed gravimetric quasi-geoid/geoid usually deviates from local values observed from GPS/leveling data by a centimeter level or larger (Wu et al. 2016). Generally speaking, corrector surface (Featherstone 2000; Fotopoulos 2005; Nahavandchi and Soltanpour 2006) or more complicated algorithms, e.g., least squares collocation (Tscherning 1978) and boundary-value methodology (Klees and Prutkin 2008; Prutkin and Klees 2008), can be applied to reduce systematic errors and properly combine GPS/leveling data and gravimetric solutions. However, given the difficulty in choosing a proper corrector surface, as well as the associated algorithmic complexity for the least-squares collocation and boundary-value approach, we propose a direct methodology for removing the inconsistency between these two data sets. The GPS/leveling data are treated as an independent observation group and added to the functional model for the gravity field computation. Using this method, the quasi-geoid/geoid that fits the local leveling system is computed in one step, no systematic errors exist between the data, and post-processing procedures, e.g., corrector surface or LSC approaches, for calibrating systematic errors are not required.

The rest of the paper is organized as follows: the main principle of regional gravity modeling from Poisson wavelets is first introduced, and heterogeneous gravity-related observations are linked to the functional model parameterized by Poisson wavelets. The weights for different observation groups are determined through the variance component estimation (VCE) approach, and unknown coefficients are estimated through the least squares adjustment. Further, the Tikhonov regularization method is introduced to deal with the ill-conditioned system, where a rapid synthesis method for computing zero- and first-order regularization matrices is provided. In the following section, Hong Kong is selected as the study area and heterogeneous data are introduced. The numerical results are also shown in this part, where the new height reference surface, HKGEOID-2016, is determined. In addition, HKGEOID-2016 is compared with existing models, e.g., HKGEOID-2000 (Luo et al. 2005) and recently published GGMs, such as EIGEN-6C4 (European Improved Gravity Model of the Earth by New Techniques 6C4) and EIGEN-6C3STAT (Förste et al. 2012, 2014), for cross-validation. The last section contains the main summary and conclusions.

Methods

Functional model and parameters estimation

Following Holschneider and Iglewska-Nowak (2004), we denote Ω R as the sphere of radius R, Ω R  = {(u 1, u 2, u 3): u 21  + u 22  + u 23  = R 2}, and we also denote IntΩ R as the interior and ExtΩ R as the exterior:

$$ {\text{Int}}({\text{Ext}})\;\varOmega_{R} = \left\{ {\left( {u_{1} ,u_{2} ,u_{3} } \right){:}\;u_{1}^{2} + u_{2}^{2} + u_{3}^{2} < ( > )R^{2} } \right\} $$
(1)

Consider two points \( \varvec{y} \), \( \varvec{z} \), \( \left| \varvec{y} \right| < R < \left| \varvec{z} \right| \), and then the exterior Poisson wavelet of degree d in position \( \varvec{y} \) evaluated at \( \varvec{z} \) is defined by (Holschneider and Iglewska-Nowak 2004)

$$ W_{\varvec{y}}^{{{\text{ext}},d}} \left( \varvec{z} \right) = \frac{R}{{\left| \varvec{z} \right|}}\sum\limits_{l = 0}^{\infty } {l^{d} \left( {\frac{{\left| \varvec{y} \right|}}{{\left| \varvec{z} \right|}}} \right)}^{l} (2l + 1)P_{l} \left( {\hat{\varvec{z}}^{T} \hat{\varvec{y}}} \right) $$
(2)

where \( \varvec{z = }\left( {z_{1} ,z_{2} ,z_{3} } \right)^{\text{T}} \), \( \varvec{y = }\left( {y_{1} ,y_{2} ,y_{3} } \right)^{\text{T}} \), and \( \varvec{\hat{z} = }\frac{\varvec{z}}{{\left| \varvec{z} \right|}} \) and \( \hat{y} = \frac{\varvec{y}}{{\left| \varvec{y} \right|}} \) are the unit vectors of \( \varvec{z} \) and \( \varvec{y} \), respectively. \( P_{l} \left( {\hat{\varvec{z}}^{T} \hat{\varvec{y}}} \right) \) is the Legendre function.

In the framework of remove-compute-restore (RCR) methodology, only the residual disturbing potential is parameterized by Poisson wavelets, while the long- and short-wavelength parts of the gravity field are recovered using the global geopotential model (GGM) and residual terrain model (RTM), respectively (Omang and Forsberg 2000). Based on the Runge–Krarup theory, the residual disturbing potential, T res, is estimated using a linear combination of Poisson wavelets (Klees et al. 2008):

$$ T_{\text{res}} \left( \varvec{z} \right) = \sum\limits_{i = 1}^{K} {\beta_{i} W_{{\varvec{y}_{i} }}^{{{\text{ext}},d}} (\varvec{z})} $$
(3)

where \( \varvec{z} \) and \( \varvec{y} \) are interpreted as the 3-D coordinates of the observations and position of a Poisson wavelet, respectively. K is the number of Poisson wavelets, β i is an unknown coefficient, which should be estimated from the data. Moreover, Ω R is chosen as a sphere that is entirely located within the Earth, e.g., a Bjerhammar sphere. We note that the depth of the Poisson wavelet under the Bjerhammar sphere, computed as \( d = R - \left| \varvec{y} \right| \), is the key point to determining its property in the frequency domain (Wittwer 2009).

After the linearization and spherical approximation, the residual gravity anomalies, Δg res, can be linked to Poisson wavelets through (Klees et al. 2008)

$$ \Delta g_{\text{res}} \left( \varvec{z} \right) = \sum\limits_{i = 1}^{K} {\beta_{i} \left( { - \frac{\partial }{{\partial \left| \varvec{z} \right|}}W_{{\varvec{y}_{i} }}^{{{\text{ext}},d}} (\varvec{z}) - \frac{2}{{\left| \varvec{z} \right|}}W_{{\varvec{y}_{i} }}^{{{\text{ext}},d}} (\varvec{z})} \right)} \;\;\;\; $$
(4)

The computed model, together with the RTM reduction, derives the quasi-geoid instead of geoid, and the residual height anomaly, ςres, is derived from Poisson wavelets using the Bruns formula (Heiskanen and Moritz 1967):

$$ \varsigma_{\text{res}} \left( \varvec{z} \right) = \sum\limits_{i = 1}^{K} {\beta_{i} \frac{{W_{{y_{i} }}^{{{\text{ext}},d}} (\varvec{z})}}{{\gamma \left( \varvec{z} \right)}}} $$
(5)

where γ is the normal gravity value.

In order to properly combine the gravimetric quasi-geoid/geoid and GPS/leveling data, parts of the GPS/leveling data, ς pres , are treated as observations, which are added in the functional model for the regional gravity field computation:

$$ \varsigma_{\text{res}}^{p} \left( \varvec{z} \right) = \sum\limits_{i = 1}^{K} {\beta_{i} \frac{{W_{{y_{i} }}^{{{\text{ext}},d}} (\varvec{z})}}{{\gamma \left( \varvec{z} \right)}}} + \eta^{T} \chi $$
(6)

where η is the 7-parameter model, which is used to absorb systematic errors between these two data sets, and χ is a vector of unknown parameters. More specifically, η and χ are expressed as (Eshagh and Zoghi 2016)

$$ \eta = \left( \begin{aligned} { \cos }\varphi \,{ \cos }\lambda \hfill \\ { \cos }\,\varphi \,{ \sin }\lambda \hfill \\ { \sin }\varphi \hfill \\ { \cos }\varphi \,{ \sin }\varphi \,{ \cos }\lambda /\left( {1 - e^{2} { \sin }^{2} \varphi } \right)^{1/2} \hfill \\ { \cos }\,\varphi \,{ \sin }\,\varphi \,{ \sin }\lambda /\left( {1 - e^{2} { \sin }^{2} \varphi } \right)^{1/2} \hfill \\ { \sin }^{2} \varphi /\left( {1 - e^{2} { \sin }^{2} \varphi } \right)^{1/2} \hfill \\ 1 \hfill \\ \end{aligned} \right);\quad \chi = \left( \begin{aligned} \chi_{1} \hfill \\ \chi_{2} \hfill \\ \chi_{3} \hfill \\ \chi_{4} \hfill \\ \chi_{5} \hfill \\ \chi_{6} \hfill \\ \chi_{7} \hfill \\ \end{aligned} \right) $$
(7)

where φ and λ are the latitude and longitude of the GPS/leveling data, e is the second eccentricity of the reference ellipsoid.

Typically, observations \( \varvec{l}_{p} \) in group p are linked to the residual disturbing potential \( T_{\text{res}} \left( \varvec{z} \right) \) through

$$ \varvec{l}_{p} +\varvec{\varDelta}_{p} + b_{p} = \varvec{L}_{p} T_{\text{res}} \left( \varvec{z} \right) = \sum\limits_{i = 1}^{K} {\beta_{i} \varvec{L}_{p} \varPsi_{i} \left( {\varvec{z},\varvec{y}_{i} } \right)} ,\quad p = 1,2, \ldots ,J $$
(8)

where \( \varvec{\varDelta}_{p} \) are observation errors, b p is an unknown bias parameter, \( \varvec{L}_{p} \) is the specific function linking observations to the residual disturbing potential, J is the number of observation groups.

Assuming that the observation noise is white Gaussian with zero mean, Eq. (8) is rewritten as

$$ \varvec{l}_{p} - \varvec{e}_{p} = \varvec{A}_{p} \varvec{x},\;E\{ \varvec{e}_{p} \} = 0,\;\;D\{ \varvec{e}_{p} \} = \varvec{C}_{p} = \sigma_{p}^{2} \varvec{Q}_{p} $$
(9)

where \( \varvec{x} \) is the K′ × 1 vector of unknown coefficients, which consists of unknown parameters of Poisson wavelets and bias parameters. For example, \( \varvec{x} = \left[ {\beta_{1} ,\beta_{2} , \ldots ,\beta_{K} ,b_{1} ,b_{2} , \ldots ,b_{J} } \right]^{\prime } \) and K′ = K + J, \( \varvec{A}_{p} \) is the m p  × K′ design matrix of group p, \( \varvec{l}_{p} \) is the m p  × 1 corresponding observation vector, \( \varvec{e}_{p} \) is the m p  × 1 vector of corresponding stochastic errors, and m p is the number of observations in group p. E{·} and D{·} are the expectation and dispersion operators, respectively. \( \varvec{C}_{p} \) is the error variance–covariance matrix of group p, and σ 2 p and \( \varvec{Q}_{p} \) are the variance factor and cofactor matrix, respectively.

The observations in different groups are assumed to be uncorrelated, and the error variance–covariance matrix of all observation groups is expressed as

$$ \varvec{C} = \left( \begin{aligned} \varvec{C}_{1} \;\;0\;\;0\;\; \cdots \;\;0 \hfill \\ 0\;\;\;\varvec{C}_{2} \;0\;\; \cdots \;\;0 \hfill \\ \; \vdots \;\;\;\; \vdots \;\;\; \vdots \;\;\; \cdots \;\; \vdots \hfill \\ 0\;\;\;0\;\;\;0\;\; \cdots \;\varvec{C}_{J} \; \hfill \\ \end{aligned} \right) $$
(10)

Point-wise data can be directly linked to Poisson wavelets through the functional model described above. However, as these data are not homogeneously distributed and data gaps may exist in locally, the associated least squares system is typically ill-conditioned, and regularization is mandatory (Wittwer 2009). Usually, the Tikhonov regularization method is applied for tackling the ill-conditioned issues, because it does not need a priori information (Xu 1992). For a given α (regularization parameter) and κ (regularization matrix), the least-squares solution of Eq. (9) is (Klees et al. 2008)

$$ \hat{\varvec{x}} = \left( {\sum\limits_{p = 1}^{J} {\varvec{A}_{p}^{T} \varvec{C}_{p}^{ - 1} \varvec{A}_{p} } + \alpha \kappa } \right)^{ - 1} \left( {\sum\limits_{p = 1}^{J} {\varvec{A}_{p}^{T} \varvec{C}_{p}^{ - 1} \varvec{l}_{p} } } \right) $$
(11)

The widely used L-curve approach, which is the log–log scale plot of the residual norm versus solution norm, is used to estimate the regularization parameter in this study. For discrete ill-conditioned problems, this plot displays an ‘L-shape’ with a corner point, where a certain balance between the regularized solution and fit to the data is achieved (Hansen et al. 2007). Moreover, the variance component estimation (VCE) approach can be applied to determine the optimal variance factors of different observation groups (Koch and Kusche 2002; Kusche 2003):

$$ \hat{\sigma }_{p}^{2} = \frac{{\varvec{v}_{p}^{T} \varvec{Q}_{p}^{ - 1} \varvec{v}_{p} }}{{r_{p} }} $$
(12)

where \( \varvec{v}_{p} = \varvec{A}_{p} \hat{\varvec{x}} - \varvec{l}_{p} \) is the vector of residuals for group p and r p is the corresponding redundancy number. The estimation of the variance factor is iteratively computed until convergence is achieved.

The final quasi-geoid, ς, is computed by combining the residual part with the long- and short-wavelength components derived from the GGM (ςGGM) and RTM (ςRTM), respectively (Omang and Forsberg 2000):

$$ {\varsigma } = {\varsigma }_{\text{GGM}} + {\varsigma }_{\text{RTM}} + {\varsigma }_{\text{res}} $$
(13)

However, the geoid (N) instead of a quasi-geoid is needed in some cases, where their relationship is described by

$$ N - \varsigma = \frac{{\bar{g} - \bar{\gamma }}}{{\bar{\gamma }}}h = \frac{{\Delta g_{B} }}{{\bar{\gamma }}}h $$
(14)

where \( \bar{g} \) and \( \bar{\gamma } \) are the mean value of gravity and normal gravity, respectively, Δg B is the Bouguer anomaly, and h is topographical height.

Choice of regularization matrix

The regularization matrix is the key element controlling the quality of the regularized solutions, which describes the signal energy decreasing from large to small scale (Chambodut et al. 2005). For selecting the optimal regularization matrix, the zero- and first-order regularization matrices are investigated.

Zero-order Tikhonov regularization

With the addition theorem of spherical harmonics, the Poisson wavelets are rewritten as (Wittwer 2009)

$$ W_{\varvec{y}}^{{{\text{ext}},d}} (\varvec{z}) = \sum\limits_{l = 0}^{\infty } {\sum\limits_{m = - l}^{l} {\psi_{l} \bar{Y}_{lm} \left( {\hat{\varvec{z}}} \right)} \bar{Y}_{lm} \left( {\hat{\varvec{y}}} \right)} ,\;\;\psi_{l} = l^{d} \left( {\frac{\left| R \right|}{{\left| \varvec{z} \right|}}} \right)^{l + 1} $$
(15)

where \( \bar{Y}_{lm} \) are the fully normalized spherical harmonics, l and m are their degree and order, respectively.

If we assume Poisson wavelets are the target functions on the sphere, i.e., \( s = W_{\varvec{y}}^{{{\text{ext}},d}} (\varvec{z}) \), then the entries of the zero-order regularization matrix κ 0 P,Q are expressed as the inner product of different Poisson wavelets (Heiskanen and Moritz 1967):

$$ \kappa_{P,Q}^{0} = \left\langle {s_{P} ,s_{Q} } \right\rangle = \frac{1}{{4\pi R^{2} }}\int_{{\varOmega_{R} }} {W_{{\varvec{y}_{P} }}^{{{\text{ext}},d}} (\varvec{z})} W_{{\varvec{y}_{Q} }}^{{{\text{ext}},d}} (\varvec{z})d\varOmega_{R} $$
(16)

Equation (16) is also rapidly synthesized by Holschneider and Iglewska-Nowak (2004)

$$ \left\langle {W_{{\varvec{y}_{P} }}^{{{\text{ext}},d}} ,W_{{\varvec{y}_{Q} }}^{{{\text{ext}},d}} } \right\rangle = \frac{R}{{\left| {\varvec{y}_{P} } \right|}}W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d}} \left( {\varvec{y}_{P}^{*} } \right) = \frac{R}{{\left| {\varvec{y}_{Q} } \right|}}W_{{\varvec{y}_{P} }}^{{{\text{ext}},2d}} \left( {\varvec{y}_{Q}^{*} } \right) $$
(17)

where \( \varvec{y}_{P}^{*} = \frac{{R^{2} }}{{\left| {\varvec{y}_{P} } \right|^{2} }}\varvec{y}_{P} \) and \( \varvec{y}_{Q}^{*} = \frac{{R^{2} }}{{\left| {\varvec{y}_{Q} } \right|^{2} }}\varvec{y}_{Q} \).

First-order Tikhonov regularization

The zero-order Tikhonov regularization derived above is suitable for functions restricted to a sphere σ U belonging to L 2(σ U). However, this function space is quite large and includes many unsmooth functions. To introduce a scalar product suitable for smoother functions, a target function is selected as the first-order derivative of the Poisson wavelets, and the inner products of the target functions are used to derive the entries of the first-order regularization matrix.

Assuming the target function is a first-order radial derivative of the Poisson wavelets, the entries of the radially constrained regularization matrix κ r P,Q are (Heiskanen and Moritz 1967)

$$ \kappa_{P,Q}^{r} = \frac{1}{{4\pi R^{2} }}\int_{{\varOmega_{R} }} {\frac{{\partial W_{{\varvec{y}_{P} }}^{{{\text{ext}},d}} (\varvec{z})}}{{\partial \left| \varvec{z} \right|}}} \frac{{\partial W_{{\varvec{y}_{Q} }}^{{{\text{ext}},d}} (\varvec{z})}}{{\partial \left| \varvec{z} \right|}}{\text{d}}\varOmega_{R} $$
(18)

As \( \frac{\partial }{{\partial \left| \varvec{z} \right|}}\psi_{l} = - \frac{l + 1}{{\left| \varvec{z} \right|}}\psi_{l} \), in Eqs. (15), (18) is rewritten as

$$ \kappa_{P,Q}^{r} = \sum\limits_{l = 0}^{\infty } {\psi_{l}^{2} \left( {\frac{l + 1}{{\left| \varvec{z} \right|}}} \right)}^{2} (2l + 1)P_{l} \left( {\hat{\varvec{y}}_{P}^{T} \hat{\varvec{y}}_{Q} } \right) $$
(19)

According to Holschneider and Iglewska-Nowak (2004), κ r P, Q is rapidly computed as

$$ \kappa_{P,Q}^{r} = \frac{R}{{\left| \varvec{z} \right|^{2} \left| {\varvec{y}_{P} } \right|}}\left( {W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d + 2}} \left( {\varvec{y}_{P}^{*} } \right) + 2W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d + 1}} \left( {\varvec{y}_{P}^{*} } \right) + W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d}} \left( {\varvec{y}_{P}^{*} } \right)} \right) $$
(20)

Similarly, the entries of the horizontal constrained regularization matrix R h P,Q read

$$ \kappa_{P,Q}^{h} = \frac{R}{{\left| \varvec{z} \right|^{2} \left| {\varvec{y}_{P} } \right|}}\left( {W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d + 2}} \left( {\varvec{y}_{P}^{*} } \right) + W_{{\varvec{y}_{Q} }}^{{{\text{ext}},2d + 1}} \left( {\varvec{y}_{P}^{*} } \right)} \right) $$
(21)

Results and discussion

Study area and data

The study area is bounded by 113.83° to 114.5° in longitude and from 22.15° to 22.58° in latitude, as shown in the region enclosed in a black rectangle in Fig. 1, which covers the entire Hong Kong mainland, parts of Shenzhen, as well as the South China Sea. Figure 1 also shows the distribution of heterogeneous gravity data; the red dots represent terrestrial data, including measurements both in Hong Kong and parts of Shenzhen. The green dots indicate locations of shipborne data coverage, which are sparsely distributed. As a result, we also incorporate satellite altimetry-derived gravity data to fill these data gaps, indicated with yellow dots, obtained from DTU13 (DTU = Technical University of Denmark) that uses multi-satellite altimetry data (e.g., TOPEX/Poseidon, Jason-1, Jason-2, ERS-2) over 20 years. DTU13 has a spatial resolution of 1′ × 1′, and the altimetry gravity data precision is approximately several mGal globally (Andersen et al. 2013). We note that even the terrestrial data are not homogeneously distributed and data gaps exist, especially for mainland Hong Kong. This makes the study area a good location to test the feasibility of our adopted approach for managing data gaps. Because the currently used leveling system, Hong Kong Principal Datum (HKPD) over Hong Kong, is an orthometric height system (Chen and Luo 2004), the modeled quasi-geoid from Eq. (13) is converted to a geoid for further validation and comparison.

Fig. 1
figure 1

Study area and distribution of gravity data

Global geopotential model and digital terrain model

Earth Gravity Model 2008 (EGM2008) with a full degree and order (d/o) of 2190 (Pavlis et al. 2008, 2012) is chosen to recover long-wavelength signals of the regional gravity field. EGM2008 was developed by combining satellite gravity observations, ground-based gravity measurements and radar altimetry data; the accuracy of EGM2008 is 0.239 m when compared to globally distributed GPS/leveling data, which is one of the most accurate GGMs published in the recent years http://icgem.gfz-potsdam.de/ICGEM/evaluation/evaluation.html. A high-resolution and high-quality DTM (digital terrain model) is needed for applying RTM corrections to smooth the regional gravity field at short scales (Forsberg and Tscherning 1981). Two data sources, SRTM (Shuttle Radar Topography Mission) and GEBCO (General Bathymetric Chart of the Oceans), are combined to generate a DTM with a spatial resolution of three arc-seconds over land and sea. In order to reduce edge effects, a larger RTM computational area than study region is need; the derived DTM is shown in Fig. 2a. The mean elevation surface (MES) used in the RTM is computed from a high-order spherical harmonic expansion of Earth’s topography, the DTM2006.0 topography model (Pavlis et al. 2007, 2008). The truncated degree of DTM2006.0 is consistent with the GGM used in this study, EGM2008 with a full d/o of 2190. The derived mean elevation surface is shown as Fig. 2b. Considering the curvature of Earth, tesseroids instead of prisms are chosen as the integral elements (Heck and Seitz 2007). The density parameter is selected as ρ land = 2.67 g/cm3 and ρ sea = 1.64 g/cm3 given the difference between the mean crustal density and seawater (Forsberg 1984). We also show detailed topographical information over Hong Kong (Fig. 3).

Fig. 2
figure 2

Digital terrain model (DTM) and mean elevation surface (MES). a DTM, b MES

Fig. 3
figure 3

Digital terrain model over Hong Kong

GPS/leveling data

There are 168 high-quality GPS stations in Hong Kong, the ellipsoidal heights are referenced to the WGS84 ellipsoid (ITRF 96 @ 1998:121), and their heights were determined with precise geometric leveling and trigonometric leveling. The entire GPS/leveling dataset is divided into three groups (Fig. 4). The leveling heights of GPS/leveling data in the first group were derived from precise leveling measurement, while the heights of the next two groups were surveyed from precise leveling and trigonometric height measurements. Based on recent leveling network adjustments, the accuracy of the overall GPS/leveling data is approximately 1 cm. The first two groups, i.e., group I (58 points) and II (49 points), are used as observations, aimed at reducing inconsistency between the gravimetric solution and GPS/leveling data. The third group (61 points), group III, is applied to evaluating the quality of the computed model.

Fig. 4
figure 4

Three groups of GPS/leveling data. a Group I; b Group II; c Group III

Gravity data

The original gravity database in Hong Kong has 507 terrestrial and 133 shipborne point-wise gravity observations, the spacing on land is 2 and 2–4 km at sea. They were collected using a Lacoste and Romberg model ‘G’ land gravity meter and model ‘H/U’ seabed gravity meter from Electronic and Geophysical Services Ltd (Luo et al. 2005). The local gravity base is connected to International Gravity Standardization Net 1971 (IGSN 71) with an accuracy of 0.03 mGal (Evans 1990). Moreover, recent gravity measurements also provide 623 discrete terrestrial gravity observations. The heights of gravity data in Hong Kong are referenced to the Hong Kong Principal Datum. In addition, gravity measurements in the neighboring Shenzhen region are also available, 1312 points on land and 876 points at sea, with 1 km spatial resolution. They were measured with a Lacoste & Romberg model ‘G’ and ‘D’ land gravimeter and model ‘S’ sea gravimeter in 2001, and the accuracy of observed data is better than 0.1 mGal. The heights of gravity measurements over Shenzhen are referenced to the National Vertical Datum 1956 (Chen and Luo 2004). Further, due to poor coverage of the marine gravity observations, DTU13-derived altimetry gravity anomalies are also introduced. The DTU13 model uses the EGM2008-derived geoid as the vertical datum (Andersen et al. 2013). To unify these heterogeneous gravity data from various sources, transforming parameters are used for height datum unification, and the heights of gravity data from Shenzhen and the DTU13 model are unified to the local heights based on the Hong Kong Principal Datum. To quantify the DTU13 model quality in the target region, DTU13-derived gravity anomalies are compared with marine gravity data (Fig. 5). The comparison indicates large differences in the coastal areas, where the quality of the altimetry data is suspect (Hwang et al. 2006). The statistics in Table 1 show the accuracy of the DTU13 model for the seas around Hong Kong is approximately 4.3 mGal. The bias of the DTU13 model versus marine gravity data is at the level of 4.8 mGal, which needs to be estimated as unknown bias parameters in the functional model, as shown in Eq. (8). The imperfection in height transforming parameters and uncorrected systematic errors in the data may result in unreduced biases in the data after height unification. Thus, except for the terrestrial gravity data over Hong Kong, terrestrial data in Shenzhen and all shipborne and altimetry data are supposed to have biases. For data derived from specific source, we assume a constant bias parameter in the functional model, and three bias parameters in total are estimated using the least square adjustment.

Fig. 5
figure 5

Difference between DTU13-derived gravity anomalies and marine gravity data

Table 1 Statistics of the difference between altimetry and marine gravity data (Units: mGal)

Based on RCR methodology, residual gravity data are computed by subtracting the long- and short-wavelength parts from the original observations. Figure 6 shows the residual gravity data, where (a) and (d) represent the residual terrestrial data, (b) and (e) display the residual marine data, and (c) and (f) show the residual altimetry data. The top panels, (a), (b) and (c), show the residual observations obtained by only subtracting EGM2008-derived quantities from the original data. The bottom panels, (d), (e) and (f), demonstrate the residual data derived by subtracting EGM2008-derived quantities and RTM corrections. After incorporating RTM corrections, the residual gravity field is much smoother, and the most significant improvements are found south-east of Shenzhen, in New Territory, Lan Tau and Hong Kong land, regions with a tendency toward topographical variation (Fig. 3). As shown in Table 2, the standard deviation (SD) of the residual terrestrial data is reduced from 11.9 to 4.4 mGal, with approximately 63% improvement after incorporating the RTM reductions. For marine data, the improvement is not as much as the terrestrial data, the SD of shipborne data is only reduced from 7.2 to 6.0 mGal, mainly due to the low resolution of the bathymetry model, the GECBO model has a spatial resolution of 30″ × 30″, and the small variation in local bathymetry (Fig. 3). Similarly, the SD of altimetry data is only reduced from 2.3 to 1.4 mGal with RTM reductions.

Fig. 6
figure 6

Residual gravity observations. The top and bottom panels represent data without and with RTM corrections, respectively; a, d are terrestrial data, b, e are shipborne data, and c, f are airborne data

Table 2 Statistics of the residual gravity observations (Units: mGal)

Numerical results

Determination the regularization matrix and regularization parameter

We put the Poisson wavelets on a Fibonacci grid beneath the topography, keeping it parallel with the topography (Tenzer et al. 2012). However, as mentioned above, data gaps in the observations usually lead to an ill-conditioned least squares system. Thus, before computing a high-quality model, the reliability of the L-curve approach and performances of various regularization matrices need to be evaluated. The numerical experiment is designed as follows, the grid depth is selected as 40 km, and the number of Poisson wavelets is approximately chosen as 1600. Terrestrial, shipboard and altimetry gravity data, as well as the two groups of GPS/leveling data (i.e., group I and II), are combined for modeling. The normal matrix with this parameterization is highly ill-conditioned, which is the proper case study for investigating the performances of the Tikhonov regularization technique. Figure 7 shows L-curve plots for various regularization matrices, where the optimal regularization parameter is between 10−11 and 10−10 when zero-order regularization is applied. For the first-order regularization, both for the radial and horizontal constrained matrices, the optimal parameter is situated between 10−12 and 10−11. To further demonstrate the L-curve corner, the regularization parameters are densified around the intervals where optimal parameters are located (red circles in Fig. 7); the corresponding information can be found in Tables 3 and 4. The estimated optimal regularization parameters for zero- and first-order regularization matrices are α = 10−10.6 and α = 10−11.3, respectively. However, the L-curve method may lead to obtain an over-smoothed solution (Xu 1998). Therefore, we also provide the quality of various solutions using an external validation from the GPS/leveling data from group III, (Tables 3, 4). These results indicate that the optimal parameters estimated from the L-curve method are consistent with the direct evaluation results, suggesting the L-curve method in this study is reliable. In addition, there are no differences in the performances of different first-order regularization matrices, i.e., the radial and horizontal constrained matrices, and thus we do not distinguish these matrices in the following discussion. However, when comparing the quality of solutions derived from zero- and first-order regularization, we conclude the application of first-order regularization produces better results, and the accuracy of the corresponding model reaches 1.6 cm. In comparison, for solutions computed from zero-order regularization, the accuracy decreases to 1.8 cm. These results show the choice of regularization matrices has a non-negligible effect on the solutions, and the first-order regularization may be more preferable for local gravity field modeling.

Fig. 7
figure 7

Optimal regularization parameters computed from the L-curve methodology based on various regularization matrices. a Zero-order, b first-order (radial gradient constraint), c first-order (horizontal gradient constraint)

Table 3 Evaluation of different geoids with various regularization parameters when zero-order regularization is used (Units: cm)
Table 4 Evaluation of different geoids with various regularization parameters when first-order regularization is used (Units: cm)

Optimal network design for Poisson wavelets

Another key point affecting the final solution quality is the network design of the Poisson wavelets, where the optimal depth and number of Poisson wavelets need to be carefully estimated (Wu et al. 2016). The depth should not be chosen too shallow, which may lead to overfitting problem. However, too deep Poisson wavelets may cause a heavily ill-conditioned normal matrix, where strong regularization is need, and the associated regularization errors may corrupt the solution. A good depth choice should be a trade-off between data fit and smoothness of the solution (Slobbe 2013). In addition to depth, the number of Poisson wavelets also affects the stability and the quality of the solution. To obtain the optimal network design, we use a trial-and-error approach, i.e., preselected depths and number of Poisson wavelets are combined to form various networks; based on these networks, different solutions are computed. For selecting the best solution, GPS/leveling points serve as evaluation data, where the combination of the depth and number that obtains the best fit to the evaluation data is the optimal parameterization. Through trial and error, the depth of the grid was found to increase from 10 to 40 km with an interval of 5 km, and the number of Poisson wavelets in the computational domain changed from 400 to 2000 with an increment of 400. Similar to the case study designed in the previous section, the first two GPS/leveling groups, together with the heterogeneous gravity data, are incorporated for gravity field modeling. The third group of GPS/leveling serves as evaluation data for validating the different solutions. Finally, the L-curve approach with first-order Tikhonov regularization is applied for managing the ill-conditioned system. Figure 8 shows the quality of the different models parameterized using various networks, where a minimum SD of 1.1 cm is obtained when the depth is 10 km and the number is 1600. Adding more Poisson wavelets only increases the computational load, without gaining improvement, e.g., the accuracy of the geoid decreases to 1.2 cm when the depth is unchanged, but the number increases to 2000. We also note that there are other choices that provide slightly worse solutions with a SD value a few millimeters larger than 1.1 cm, which indicates the solution is quite robust with respect to minor changes in network parameterization.

Fig. 8
figure 8

Accuracy of different geoids with respect to various Poisson wavelet networks

A new height reference surface over Hong Kong

Based on the optimal network designed above, a new height reference surface, HKGEOID-2016, over Hong Kong is estimated. Figure 9 provides the estimated biases for the gravity data, with magnitudes of −0.6, −0.7 and −4.5 mGal for terrestrial, shipborne and altimetry data, respectively. However, the estimated bias in the altimetry data is not exactly consistent with the results shown in Table 1, where the bias between shipborne measurements and altimetry data is roughly −4.8 mGal. This also indicates the biases estimated from the functional model parameterized in this study may deviate from the true value, and additional efforts are needed for further improvements in the near future. Figure 10 displays the residuals of the gravity observations after the least squares adjustment. As shown in Table 5, the functional model fits the terrestrial and marine data much better than the altimetry data, and the SD of the residuals for terrestrial and marine data is approximately 1.2 and 1.1 mGal, respectively, while for altimetry data this value significantly increases to 4.8 mGal. In addition, the most prominent residuals in the altimetry data are located at coastal areas, which further demonstrates the poor DTU13 quality over these regions. Moreover, we conclude that the parameterization of the functional model with constant unknown bias parameters does not thoroughly remove systematic errors in the data. Future work should also incorporate densification of shipborne gravity measurements, which may further improve the solution quality over marine areas. Further, for the two groups of GPS/leveling data used to combine the gravimetric solution and GPS/leveling data, i.e., groups I and II, the residuals are displayed in Fig. 11, and the corresponding statistics are provided in Table 6. These results show the internal agreement between the model and GPS/leveling data. For comparison, we also provide results derived from the purely gravimetric geoid, i.e., the solution modeled only from gravity data using Poisson wavelets. The corresponding results display differences between the purely gravimetric solution and GPS/leveling data at the decimeter level, ranging from 6.7 to 22.3 cm. SD values are 4.1 and 3.6 cm, and mean values are 15.1 and 15.4 cm for groups I and II, respectively. These statistics indicate systematic errors exist between these two data sets. Because the GPS/leveling data are acquired using high-quality GPS measurements and spirit leveling observations, it is usually hypothesized that errors in the computed gravimetric model primarily account for the observed systematic errors; these are usually due to commission errors in the GGM, as well as uncorrected systematic errors in the data. However, as shown in Table 6, the incorporation of two groups of GPS/leveling data leads to a significant reduction in SD values for differences between the modeled geoid heights and observed ones, to 0.2 and 0.3 cm for groups I and II, respectively. More important, introducing these GPS/leveling data substantially reduces the mean difference between the model and GPS/leveling data. The zero mean values for both groups of data also indicate that the gravimetric model and GPS/leveling data can be properly combined through the direct approach we proposed. Furthermore, systematic errors, especially in the computed model, have been significantly reduced when compared to the case derived from a purely gravimetric solution.

Fig. 9
figure 9

Estimated biases for heterogeneous gravity data

Fig. 10
figure 10

Residuals of gravity observations. a Terrestrial, b shipborne, c altimetry

Table 5 Statistics of the residuals of gravity data (Units: mGal)
Fig. 11
figure 11

Residuals of geoidal heights at GPS/leveling points. a Group I; b Group II

Table 6 Statistics of the residuals of geoidal heights at GPS/leveling points (Units: cm)

Results from the GPS/leveling data for groups I and II only provide the internal agreement between the modeled geoid and GPS/leveling data, and cannot be used for assessing the solution quality because they are used as observations for modeling. Thus, the third group of GPS/leveling data, with independent 61 point-wise measurements, is used for external validation and the result is displayed in Fig. 12. All residuals are within 2.0 cm (Table 7), the SD of the differences between the HKGEOID-2016 and GPS/leveling points is approximately 1.1 cm, and the zero mean value also confirms the geoid model and GPS/leveling data have been combined properly. For further evaluation, we also compare HKGEOID-2016 with the existing model, HKGEOID-2000 (Luo et al. 2005). Figure 13 shows the differences between the observed and computed geoidal heights at all GPS/leveling points based on the various geoids. Compared to the solution derived from HKGEOID-2000, as shown in the Table 8, the SD of these differences is reduced from 2.4 to 0.6 cm when HKGEOID-2016 is incorporated. We mainly attribute these improvements to the incorporation of more gravity-related observations, as well as recent leveling network adjustments to reduce the systematic errors in the leveling system over Hong Kong. We also note that the mean value of the residuals is 0.8 cm when HKGEOID-2000 is used, which indicates systematic errors may exist in the original GPS/leveling data used by Luo et al. (2005).

Fig. 12
figure 12

External validation of HKGEOID-2016

Table 7 External accuracy of HKGEOID-2016 (Units: cm)
Fig. 13
figure 13

Differences between GPS/leveling data and geoidal heights derived from various geoids. a HKGEOID-2016; b HKGEOID-2000

Table 8 Statistical differences between GPS/leveling data and geoidal heights based on various geoids (Units: cm)

The performances of the recently published GGMs are also investigated in Hong Kong, i.e., EGM2008 with d/o 2190 (Pavlis et al. 2012), EIGEN-6C3STAT (d/o 1949) and EIGEN-6C4 (d/o 2190) (Förste et al. 2012, 2014) (Fig. 14). The statistics in Table 9 show the accuracy of these GGMs is not better than 4 cm, and the mean deviation between GGM and local geoid is larger than 17 cm, which is undesirable for engineering purposes or geophysical investigation. The main reason for the poor performances of these GGMs is their development was implemented without confidential gravity data from China, and the quality of these models in regions such as China is suspect, and local refinements are necessary. Moreover, differences between the HKGEOID-2016 and GGM-derived geoids are shown in Fig. 15, these differences are at the decimeter level, and the incorporation of locally distributed data primarily contributes to improving the fine structures of the regional gravity field at short scales.

Fig. 14
figure 14

Differences between GGM-derived geoidal heights and GPS/leveling data (note mean values are removed). a EGM2008; b EIGEN-6C4; c EIGEN-6C3STAT

Table 9 Statistical differences between GPS/leveling data and GGM-derived geoidal heights (Units: cm)
Fig. 15
figure 15

Differences between HKGEOID-2016 and a EGM2008, b EIGEN-6C4, and c EIGEN-6C3STAT

Conclusions

Poisson wavelets are used for modeling the regional gravity field using data from various observational techniques. The method combines data with different spatial coverage, various noise levels, and spectral contents. As a case study, terrestrial, shipborne, and satellite altimetry gravity data, as well as GPS/leveling measurements, are incorporated in the Poisson wavelet model for regional gravity field recovery over Hong Kong.

The Tikhonov regularization is introduced to manage the ill-conditioned least squares system; in particular, the performances of various regularization matrices are investigated. The numerical results show solutions with first-order regularization provide better results, where the accuracy of the local solution increases by 0.2 cm compared to that obtained from zero-order regularization. These results also indicate that first-order regularization may be more preferable in regional gravity field recovery using Poisson wavelets. Moreover, a direct approach is proposed to properly combine the gravimetric quasi-geoid/geoid and GPS/leveling data; a subset of the GPS/leveling data is treated as an independent observation group to formulate the new functional model, and the quasi-geoid/geoid that fits the local leveling system can be modeled in a single step. The results show the SD is approximately 0.3 cm for the residuals on the first two GPS/leveling groups, regarded as observations. The zero mean value indicates the gravimetric model and GPS/leveling data can be properly combined through this direct approach. In addition, an external validation with 61 independent GPS/leveling points shows that the accuracy of the new geoid, HKGEOID-2016, is approximately 1.1 cm. Compared with the original solution, HKGEOID-2000, the SD of the differences between observed and computed geoidal heights at all GPS/leveling points is reduced from 2.4 to 0.6 cm when HKGEOID-2016 is incorporated. This is a significant improvement. In addition, the performances of three recently published GGMs, EGM2008, EIGEN-6C3STAT and EIGEN-6C4, are investigated in Hong Kong. The corresponding results show the accuracies of these GGMs are all below 4 cm. The deviation from the local geoid at the decimeter level also indicates the GGM alone cannot recover a high-quality gravity field in this region, and local refinement is necessary.

Several issues should be carefully considered to make further improvements to the local geoid. Usually, the geoid is poorly modeled in the coastal areas due to unfavorable data coverage (Hipkin et al. 2004; Filmer and Featherstone 2012). In the study region, the distribution of shipborne data is quite sparse, and the quality of the altimetry data degrades in the vicinity of coastal areas. Therefore, the densification of shipborne data would further improve the local geoid in future work. In addition, the developments of appropriate altimeter waveform retracking approaches may also contribute to improving the geoid over coastal areas (Hwang et al. 2006; Andersen and Knudsen 2009). Errors in the GGM inevitably propagate into regional solutions because we usually consider the GGM as error-free data implemented in the remove-restore framework. The magnitude of these commission errors in the GGMs reaches the centimeter scale or larger (Pavlis et al. 2012), which should be carefully considered when computing the quasi-geoid/geoid at centimeter accuracy. These commission errors are actually calibrated with GPS/leveling data using the direct approach proposed in this study. However, the errors in the GGM could also be quantified if the full error variance–covariance matrix of the spherical coefficients is known, and a more realistic error variance–covariance matrix of the data could be estimated through error propagation. In this manner, the weights of different observation groups may be more properly determined, and the accuracy of the quasi-geoid/geoid, especially for the purely gravimetric one, may be further improved.