Introduction

Among the many factors determining processes in the Earth’s interior and exterior, the gravity field is of key importance. An accurate knowledge of the gravity field is also crucial for satellite orbits, which serve for precise geo-location of various measurements conducted in satellite missions, for example, for such missions as: the Challenging Mini-Satellite Payload (CHAMP) (Reigber et al. 2005), the Gravity Recovery and Climate Experiment (GRACE) (Tapley et al. 2004) and the Gravity Field and Steady-State Ocean Circulation Explorer (GOCE) (Rummel et al. 2011). Therefore, in recent years, the gravity field recovery was the subject of numerous works leading to many gravity field models. These models are mostly connected with the three aforementioned satellite missions. The estimated gravity models were validated and compared with each other, taking into account different aspects of their performance. Generally, these aspects can be divided into two main test areas: The first area is related to the Earth itself and the second area to satellite orbits. Concerning the first test area, for example, Gruber et al. (2011) assessed gravity models comparing the geoid heights obtained from the given gravity models with the corresponding ones estimated from the GPS levelling. Various authors compared the GOCE-related gravity models in terms of the gravity anomaly and the geoid height differences in local areas such as Germany (Voigt and Denker 2015), Norway (Šprlák et al. 2015), the Mediterranean area (Carrion et al. 2015) and on the continental scale—in South America (Matos et al. 2015). In turn, Hirt et al. (2015) evaluated the GOCE gravity models by comparing them with gravity models derived from the ET2014 topography. Yi and Rummel (2014) compared the GOCE-related gravity models with the EGM2008 model. This comparison was performed globally as well as in selected geographical regions using geoid root-mean-square (RMS) differences. Xu et al. (2017) compiled the GOCE-only gravity model GOSG01 and investigated its consistency with other GOCE-related satellite models, taking into account GPS-levelling data.

In the aforementioned second test area, Lejba et al. (2007), Sośnica et al. (2012) and Cheng and Ries (2015), taking into account the satellite laser ranging (SLR) observations, investigated the RMS of SLR residuals coming from the estimation of the Laser Geodynamics Satellite (LAGEOS) orbits using given gravity models. The three GOCE-related gravity models were tested by Gruber et al. (2011), who compared orbital residuals derived in the frame of orbit determination of the CHAMP satellite and both GRACE satellites. Tsoulis and Papanikolaou (2012) computed short arcs of the CHAMP and GRACE-A satellites using a numerical integration method and compared them with the corresponding Rapid Science Orbit (RSO) for different tested geopotential models. The authors indicated the band-limited behavior of gravity models in an orbit determination process. In their next two works, Tsoulis and Papanikolaou (2013, 2014) validated GRACE and CHAMP mission-related gravity models by the short-arc GOCE orbit propagation using the GOCE Rapid Science Orbit data as a reference and tested the GOCE-related gravity models fitting the adjusted daily GOCE orbits into positions in the precise science orbit (PSO) (Bock et al. 2011) and in the GRACE-A and GRACE-B satellite precise orbits. In these works, the authors emphasize the spectral contribution of individual gravity models in a degree-wise cumulative sense. Förste et al. (2014) described the performance of the high-degree EIGEN-6C4p model comparing the estimated GOCE orbits to the PSO. Baur et al. (2014) compared the relative performance of GOCE gravity field models which resulted in a comparison of different estimation approaches of these models. The adopted gravity fields were estimated using the GOCE kinematic orbits.

This work, through entering the trend of the aforementioned research, focuses on an evaluation of the eight chosen GRACE-related gravity models in terms of the dynamic orbit determination, by fitting obtained positions into the positions taken from the official PSO, consisting of the two types of orbits: kinematic orbits and reduced-dynamic orbits, which were delivered by the European Space Agency (ESA). Both types of GOCE orbit were validated using the satellite laser ranging (SLR) technique by means of residuals determined as differences between ranges based on SLR and ranges derived from the aforementioned kinematic and reduced-dynamic orbits (Strugarek et al. 2018). Finally, SLR validation results for the entire GOCE mission allowed an accuracy of 2.42 cm to be determined for the kinematic orbit and 1.84 cm for the reduced-dynamic orbit (Bock et al. 2014). However, taking into account the time range of the used GOCE orbits, these accuracies were slightly different in the years 2009 and 2010—1.91 cm, 1.84 cm for the kinematic orbit and 1.73 cm, 1.56 cm for the reduced-dynamic orbit (Bock et al. 2014).

The kinematic orbit is a purely geometric solution based on the satellite-to-satellite tracking (SST) measurements (Bock et al. 2011), whereas the reduced-dynamic orbit is derived by the reduced-dynamic approach, in which, besides the SST measurements, the a priori gravity model and additional dynamic models are used. Moreover, the dynamic part of this approach is limited by the estimation of set empirical parameters—in this case, pseudo-stochastic, piece-wise constant accelerations (Bock et al. 2011). The fitting of the estimated orbits to the kinematic orbit is motivated by independence of the GOCE kinematic orbit from any background dynamic models, especially from any a priori gravity model. On the other hand, the fit to the reduced-dynamic orbit was also performed because of its higher smoothness and greater resistance to measurement problems. The use of both GOCE orbit types to the fitting of estimated arcs was also an opportunity to compare the quality of these orbits. The GOCE orbit is particularly suitable for the presented study because of its extremely low altitude at about 255 km, where the gravitational signal is as large as possible (Tsoulis and Papanikolaou 2013). On the other hand, the selection of gravity models not related to the GOCE mission avoids possible correlations via the underlying SST measurements. Therefore, the gravity models derived from the observational data coming from the GOCE mission are not included in this study.

It is well known that the gravity field model is a key component in a satellite orbit determination process, strictly speaking in the dynamic orbit determination. However, existing numerous gravity models require an appropriate selection for aforementioned process. The gravity models being the products of the GRACE mission are known for their good quality and constitute a large group in the entire collection of available models (Drewes 2012). Therefore, this work focuses on comparing an orbital performance of the selected gravity models coming from the mentioned group. Due to the fact that all tested models come from the same mission, showing differences in their performance in an orbital aspect may be a demanding task. In order to reveal these differences, different variants of computations, including different numbers of orbital arcs, their different lengths and using background models or not, were adopted in the work. Finding these differences in the orbital performance and possible patterns associated with them is one of the goals of this work. The basic question is whether it is possible, and to what extent, to determine statistically significant differences in the performance of selected gravity models in an orbital aspect? Another purpose of the work resulting from the aforementioned goal may be the possibility of using the obtained numerical results to the aforementioned selection of gravity field models for the dynamic orbit determination process of a low Earth orbiting satellite—for example, the fully dynamic orbit determination of the GOCE satellite, as a low Earth orbiting satellite, was performed by Casotto et al. (2013). The centimeter-accuracy PSO GOCE satellite orbit, as an example of low Earth orbiter trajectory, can be a good reference in the context of selection of appropriate gravity models for a description of motion of a satellite orbiting at a low altitude. On the other hand, as mentioned above, the use of such a low satellite as GOCE for the research is particularly convenient due to the weakening of the gravitational signal with increasing altitude, which causes the orbit to be sensitive only to a certain range of spherical harmonic coefficients. This limits the comparison of the gravity model performance only to the long-wavelength parts. In the case of GOCE satellite, the limitation of long-wavelength part of gravity model is equal to about the degree and order of 120 spherical harmonic coefficients. This limitation is indicated by Bezděk et al. (2014) and Baur et al. (2014), who obtained static gravity field models from the kinematic positions of various satellites, such as GRACE A/B, CHAMP and GOCE.

In “Investigation procedure” section of this work, a short description of the research procedure, including the dynamic orbit determination process, and the characteristics of investigated gravity models are given. The results of the orbit determination process for the three chosen arc lengths and a performance comparison of gravity models are presented and described in “Research results” section. Finally, “Summary and conclusions” section contains a summary and concluding remarks with the highlighted performance differences for the selected gravity models.

Investigation procedure

All computations were performed using a software package called the Toruń Orbit Processor (TOP). One of the first versions of this software was presented in Drożyner (1995). The TOP package is based on the dynamic orbit determination algorithm, where a satellite orbit is integrated numerically using the Cowell 8th-order predictor–corrector component. The satellite orbital arcs obtained, including gravitational and non-gravitational perturbing forces, are in the form of a time series of state vectors consisting of position and velocity of the satellite. Additionally, the satellite orbit can be improved by incorporating a set of observations. As already mentioned, centimeter-level accuracy Cartesian coordinates (Bock et al. 2011) of the GOCE satellite along the kinematic and reduced-dynamic orbits were treated here as observations. These coordinates, originally expressed in the ITRF2005 reference frame, were transformed into the inertial reference frame (IRF) connected with the standard epoch J2000.0 (ESA 2010). This was possible by using a set of the ESA-delivered quaternions (ESA 2010) which described the transformation between both reference frames.

The GOCE orbital arcs were estimated in an iterative process using the least-squares method. In this process, the aforementioned quaternions were also used. Besides gravity models, a set of additional force models was optionally used. These background force models can be divided into two groups: The first one describes gravitational perturbing forces and the second one describes non-gravitational perturbing forces (Eshagh and Najafi-Alamdari 2007). In this case, the following additional forces were modeled: ocean tides and Earth tides—the Monitoring Earth Rotation and Intercomparison of Techniques (MERIT) standards model (Melbourne et al. 1983), third body effects—the planetary ephemerides DE200/LE200 (Standish et al. 1992), relativity effects—the Painleve formulation, taking into account spherical symmetrical space–time with the Schwarzschild metric, solar radiation pressure (direct and albedo effects)—models implemented in the TOP package (Drożyner 1995). The air drag pressure effect is omitted here due to its compensation in the drag-free mode flight of the GOCE satellite (Bock et al. 2011).

The computed orbital arcs were parametrized by components of the satellite initial state vector. Strictly speaking, the estimated unknowns were six corrections to the mentioned components, i.e., to the initial position and velocity. Therefore, the observation equation can be written in the form:

$$x_{ik}^{o} + v_{ik} = x_{ik}^{c} + \frac{{\partial x_{ik}^{c} }}{{\partial \varvec{p}_{0} }}\Delta \varvec{p}_{0}^{\text{T} } .$$
(1)

In this equation, \(x_{ik}^{o}\), \(x_{ik}^{c}\) (\(i\) = 1, 2, 3) are the observed and computed \(i\)-th coordinate of the satellite at the k-th epoch, respectively, \(v_{ik }\) is the correction to the observed \(i\)-th coordinate at the \(k\)-th epoch, \(\varvec{ p}_{0}\) is the initial state vector and \(\Delta \varvec{p}_{0}\) is the estimated correction vector to the initial state vector \(\varvec{p}_{0}\). The numerical integration of the variational equations in the orbit improvement process makes it possible to determine the partial derivatives \(\partial x_{ik}^{c} / \partial \varvec{p}_{0}\) at the \(k\)-th epoch in the Eq. (1).

The use of simplified parameterization in the adjustment process is motivated by the need to emphasize the contribution of the given gravity models to the orbit determination. In another case, additional estimated empirical parameters could absorb some part of the gravity model error (Tsoulis and Papanikolaou 2014). To further increase the gravity field model contribution, some variants of computations were performed without the background force models. On the other hand, the computation variants, including the background force models, made it possible to assess the degree of dependence of the obtained results on the specified set of these models. The taken lengths of determined arcs are equal to 45.2, 90.2 min and 1 day, which is close to the half, the one and near the sixteen GOCE revolution periods, respectively. The selection of the three different arc lengths is motivated by the need to investigate the question: To what extent do the differences of performance of the specified gravity models change when the length of the estimated orbital arc changes?

The obtained orbits for the specified gravity models were compared to the corresponding GOCE kinematic and reduced-dynamic orbits in the IRF. As a result, for the comparison of the performance of gravity models, the 3D RMS differences between both orbits were computed. These RMS values are based on the differences between the estimated and observed kinematic and reduced-dynamic positions of the GOCE satellite. The estimated arcs were included in the time range spanning between November 02, 2009, and February 11, 2010. The approximate values of the initial positions and velocities were adopted from both reference orbits, i.e., from the kinematic and reduced-dynamic orbits. The fifty initial epochs have been assigned to the fifty estimated arcs with the following dates: in 2009—November 02 to November 25, November 28 to December 02, December 18, December 29; in 2010—January 06, January 08 to January 09, January 12 to January 16, January 18 to January 26, February 05, February 10. For all dates, except February 05, 2010, the same moment of time (23 h 59 m 45.00 s UTC) is assigned. For February 05, 2010, this moment is 23 h 29 m 55.22 s UTC.

In Table 1, the GRACE-related gravity models taken for testing have been specified. Thanks to the International Center for Global Earth Models (ICGEM) at Deutsches GeoForschungsZentrum Potsdam (Drewes 2012), these models were obtained as sets of normalized dimensionless coefficients of spherical harmonic expansion. In this study, the EGM2008 gravity model was truncated at degree and order 360. For each model, the corresponding types of data used in its estimation and the data period are also given. Taking into account the data period, the chosen gravity models can be divided into two groups. The first group of older models for which the data used is from before 2010. These are the first four models from Table 1, i.e., EIGEN-5S, EIGEN-51C, EGM2008 and TONGJI-GRACE01. The second group of newer models, with the part of data acquired after 2010, includes the next four models: ITSG-GRACE2014K, ITSG-GRACE2014S, ITU_GRACE16 and HUST-GRACE 2016S.

Table 1 Gravity models investigated in this work and data used in their recovery

Research results

As mentioned above, taking into account the positions from the estimated orbital arcs and the GOCE kinematic and reduced-dynamic positions, the resulting set of RMS values was obtained (being a measure of fit of the mentioned arcs to the corresponding GOCE kinematic and reduced-dynamic arcs). Several computation variants were obtained. In each variant, a set of the ten or fifty RMS values corresponding to the fit of the estimated arcs to the ten or fifty selected GOCE reference arcs was determined for the given gravity model. Thus, the performance comparison of the selected gravity models was based on the averaged RMS values. Below, these averaged RMS values will be marked simply as RMS.

Generally, two basic computational variants were realized. In the first basic variant, the gravity models were tested in terms of the fit to the ten aforementioned short arcs of kinematic and reduced-dynamic orbits with lengths 45.2 and 90.2 min. For both arc lengths, the maximum differences between RMS values (not shown here) for the particular gravity models do not exceed 0.4 cm for the arc length of 45.2 min and 1.3 cm for the arc length of 90.2 min. Taking into account the accuracy of determining the GOCE PSO at the level of 2 cm, this means that the obtained differences of RMS values do not have statistical significance. In other words, the performance of all tested gravity models is the same within the accuracy of the GOCE PSO. However, the obtained RMS values averaged for all gravity models and the estimated ten orbital arcs can be compared for both lengths of arcs and for both types of the PSO when the background models are switched on and off. These RMS values (as averages of eighty values) are presented in Table 2. It is clearly seen that the twofold increase in the arc length in all cases is associated with more than a twofold increase in the RMS value.

Table 2 RMS values averaged for all compared gravity models and the ten estimated orbital arcs depending on the computation variant with the different estimated arc lengths—45.2 min and 90.2 min, the different reference orbits and with or without switching the background models

But what is characteristic for the variants without taking into account the background models (BM), the RMS values increase by about a factor of near 2.3, but while taking into account the BM, these values increase to about a factor of 2.6. On the other hand, the ratios obtained by dividing RMS values without the BM by RMS values with the BM are also shown in Table 2. These ratios can be treated as a measure of the relative effectiveness of using the BM. As one can see, the ratio values depend both on the arc length and on the type of the used reference orbit. The use of the BM is more effective for the shorter 45.2 min arc (ratios 4.42, 4.71) than for the longer 90.2 min arc (ratios 3.95, 4.11) and is also more effective for the reduced-dynamic PSO (RD) as the reference (ratios 4.71, 4.11) than for the kinematic PSO (KIN) as the reference (ratios 4.42, 3.95). The decrease in efficiency of the performance of the BM at the increasing length of the estimated arc may be related to the growing systematic error caused by the simplified parameterization of the orbit. In turn, the higher efficiency of the BM in the case of the fit to the RD orbit than to the KIN orbit is connected with a greater smoothness of the RD orbit compared to the KIN orbit due to the existence of a dynamic part in the process of the RD orbit determination (Bock et al. 2014). However, for the above first basic computational variant, the fit to the KIN orbits and to the RD orbits was associated with a 1-s and 10-s sampling interval, respectively. The higher sampling interval for the RD orbits may be the additional factor determining the better fit to these orbits.

As already mentioned above, the maximum differences between the computed RMS values for the particular gravity models are equal to 0.4 cm and 1.3 cm for the arc length of 45.2 min (arc 1) and of 90.2 min (arc 2), respectively. This indicates that these differences increase with the growing arc length. Therefore, in order to obtain the statistical significance of RMS values, the second basic variant of computation was performed. This variant includes the estimation of 1-day arcs with respect to both reference orbit types, i.e., the fit of determined 1-day arcs to the corresponding 1-day arcs of the KIN and RD orbits. For the fit to the 1-day reference orbits, a 60-s sampling interval was adopted. The results are presented in Tables 3 and 4 as RMS values computed as averages for estimated arcs fitted to the corresponding KIN and RD reference arcs. For the KIN reference orbit, RMS values were computed, taking into account the corresponding values for the nine estimated arcs. The value of RMS for the fit to the 1-day KIN reference arc with the initial epoch Dec 18 23 h 59 m 45.00 s UTC was at a level of 250 m due to data gaps—missing kinematic positions for a number of epochs.

Table 3 RMS values of the fit to the 1-day KIN orbits as averages of nine values for each investigated gravity modela
Table 4 RMS values of the fit to the 1-day RD orbits as averages of ten values for each investigated gravity modela

Thus, the results for this arc can be treated in some sense as outliers. Comparing these results to the results for the short arcs, it is clearly seen that RMS values increased from the level of centimeters and decimeters to the level of meters. Along with this increase, there was also an increase in the difference between the RMS values for the particular gravity models. These differences in the fit to the 1-day reference orbits reach over two meters, significantly exceeding the two-centimeter threshold associated with the accuracy of the reference orbits. Looking at the mean RMS values in Tables 3 and 4 for variants without using the BM, it can be seen that, similar to the case of the 90.2 min arc, the better fit of estimated arcs is achieved for the KIN orbit (mean RMS 6.70 m) than for the RD orbit (mean RMS 7.49 m). It seems that this may be connected with a greater level of the distance of estimated arcs with respect to the reference orbit without using the BM. However, enabling the BM, as for the short arcs, gives a better fit to the RD orbit—the mean RMS 2.82 m against 4.94 m for the KIN orbit.

This is reflected by the ratio of mean RMS values 2.65 for the fit to the RD orbit, which is, similarly as for the short arcs, clearly larger than the corresponding value of 1.36 for the fit to the KIN orbit. Thus, the greater efficiency in the use of BM in terms of the fit to the RD orbits is also confirmed for the longer 1-day arcs. On the other hand, taking into account the increasing orbital arcs, it is clearly seen that, in this aspect, the aforementioned efficiency decreases—the mean ratio is 4.56 and 4.03 (Table 2) for the 45.2 min arcs and 90.2 min arcs, respectively, whereas this ratio is 2.00 (averaged the ratio of the mean RMS values for both types of the reference orbits from Tables 3 and 4) for the 1-day arcs. In general, the above results indicate the clearly better fit of the estimated orbital arcs to the RD orbit than to the KIN one. As already mentioned, this could be associated with a greater smoothness of the RD orbit than for the KIN orbit, which is strongly dependent on the quality of GPS measurements. This could confirm the greater accuracy of the RD orbit.

Essentially in Tables 3 and 4, the relative performance of the selected GRACE mission-related gravity models is given in terms of RMS of the fit of estimated arcs based on these models. The investigated models can be divided into three groups. The first one includes the gravity models such as TONGJI-GRACE01, HUST-GRACE2016S and EIGEN-5S. These three models clearly dominate for all computational variants with smaller RMS values that differ from the RMS values for the remaining models by 11 cm to 65 cm (Tables 3 and 4). The characteristic pattern for this group can be noticed—in the computational variants for both reference orbits without the BM, the gravity model EIGEN-5S exhibits the best performance by 6 cm (Table 3) and 10 cm (Table 4). The two remaining models HUST-GRACE2016S and TONGJI-GRACE01 achieve the same RMS values for the fit to the KIN and RD orbits—6.37 m and 7.14 m, respectively. For the computational variants with the BM, the order of the gravity models is the same for the KIN and RD orbits: TONGJI-GRACE01, HUST-GRACE2016S, EIGEN-5S. However, the differences between RMS values are 2 cm for both computational variants (Tables 3 and 4). This means that these differences remain at a level of the reference orbit accuracy (Bock et al. 2014). In this case, it can be stated that the performance of these models is the same. Thus, the inclusion of the BM has led to the same performance of these three gravity models. The outstanding performance of EIGEN-5S is also not a surprise. This model truncated to the degree and order of 120 was the a priori gravity model in the estimation process of RD orbits (Bock et al. 2014). But at this point it is worth noting that this model also has outstanding performance in terms of the fit to the KIN orbits, which are directly independent of any dynamic models. In the case of the fit to the RD orbits without the BM, EIGEN-5S is the best-performing model (Table 4). This is especially important because the computational variant without using the BM is independent of the specific set of used BM and thus may be treated with a slightly greater weight than the variant using the given BM.

The second group of gravity models includes ITSG-GRACE2014K, EIGEN-51C, EGM2008. It is worth noting, for the three computational variants (without and with using the BM with respect to the KIN reference orbit and with using the BM with respect to the RD reference orbit), that the order of these models according to increasing RMS values is the same—as listed above. In the fourth variant (without the BM with respect to the RD orbit), ITSG-GRACE2014K moved to the third position while the order of the two remaining models is still the same, i.e., in this case, there is the following order: EIGEN-51C, EGM2008, ITSG-GRACE2014K (Table 4).

Finally, the third group of models with clearly higher RMS values includes ITU-GRACE16 and ITSG-GRACE2014S. The difference in RMS between the first of these models and the last model of the third group varies from 41 cm to 60 cm depending on the computational variant (Tables 3 and 4). As one can see, in all cases ITSG-GRACE2014S has the highest RMS values among all tested models. In comparing the performance of the two ITSG-GRACE2014-type gravity models, it is clearly seen that in all computational variants the regularized ITSG-GRACE2014K model (Mayer-Gürr et al. 2014) has the lower RMS values than the unconstrained ITSG-GRACE2014S model (Kvas et al. 2014).

Taking into account the results presented in Tables 3 and 4 for the computational variant without the use of the BM and for the variant with the use of the BM, one can see some partial impact of using the given set of BM on the order of gravity models according to the increasing RMS values. On the one hand, the same three groups of models can also be indicated after enabling the BM, but on the other hand: In the first group—the performance of all three models became equal, and in the second group—a partial change in the order took place—ITSG-GRACE2014K “jumped” from the third to the first position in the group. Generally speaking, after applying the BM for the fit to both types of the reference orbits, there were no significant changes in the order of the gravity models—the basic division of gravity models into the three groups remained unchanged with the two minor aforementioned changes.

In order to illustrate the dependence of the effectiveness of using the BM on the selected gravity models, the ratios of the RMS values were computed by dividing RMS for variants without the BM by RMS for variants with the use of the BM (Table 5). These results were also calculated for both types of the reference orbits. It is clear that with the gravity models: EIGEN-5S, HUST-GRACE2016S, TONGJI-GRACE01 (Tables 3 and 4), a higher effectiveness of using the BM in terms of the fit to both types of the reference orbits is connected. For the gravity models: EGM2008, ITU-GRACE16, ITSG-GRACE2014S, the clearly lower ratios were obtained (Table 5).

Table 5 Ratio of RMS values for the selected gravity models depending on the reference orbit

In Table 6, the growth rates of RMS values are presented. These values were obtained by dividing the mean RMS for the fit to both KIN and RD orbits by the length of the corresponding arc. The growth rates of RMS were also separately calculated for the variants without and with the use of the BM.

Table 6 Growth rates of the RMS values for the fit of estimated arcs to the KIN and RD orbits

From the different values of the RMS growth rates, the non-linear increase of RMS values (for both variants—without and with the BM) results. For variants without the BM, the RMS growth rates are clearly greater than in the case of the variants with the BM. But in the first case (without the BM), the RMS growth rates do not show any trend remaining on average on the level of 0.53 cm/min, whereas in the second case (with the BM) the RMS growth rates clearly increase for both the KIN and RD orbits (Table 6). This increase confirms the aforementioned decrease in the effectiveness of using BM with an increase in the estimated arc length. Lower rate values for the RD orbits also confirm the better fit of the estimated arcs for these orbits than for the KIN orbits, especially in the computational variants using the BM.

So far, all the results were based on the average RMS values for the fit of ten arcs for each gravity model. To strengthen the statistics presented, the fit of fifty 1-day orbital arcs to the corresponding RD reference arcs was performed for each tested gravity field model. Additionally, the BM were taken into account in this computation. The obtained set of RMS values for the gravity models are given in Table 7. The listed dates correspond to the begin of initial epochs at the moment of time 23 h 59 m 45.00 s UTC. At the bottom of Table 7, the mean values of RMS with the standard deviations for all models are presented. The minimal and maximal value of RMS is also listed for each model. It is clearly seen that the differences between RMS values for all models are at the level of decimeters and meters significantly exceeding the accuracy level of the reference orbit. Comparing the mean RMS values in Table 4 (estimation of 10 orbital arcs per model) with the corresponding values in Table 7 (estimation of 50 arcs per model), it is evident that the increase in the number of statistic data did not significantly change the relative differences in the performance of individual gravity models, on the contrary, a very similar image emerges from both sets of results. First of all, the division of tested models into the three groups still exists. The first group includes: EIGEN-5S, HUST-GRACE2016S, TONGJI-GRACE01. This time in all three cases the mean RMS is exactly the same—2.32 m. The two models of second group, i.e., ITSG-GRACE2014K and EIGEN-51C, have the same mean RMS value—2.52 m, whereas the RMS value is slightly larger (2.57 m) for EGM2008, similarly as in Table 4 (for the ten arcs per model). Finally, the gravity models ITU-GRACE16 and ITSG-GRACE2014S belong to the third group with the largest RMS values 2.96 m and 3.37 m, respectively. It is also noticeable that the differences in the performance of particular models are reduced when the number of estimated arcs per model is increased from 10 to 50.

Table 7 RMS values of the fit to the fifty 1-day RD orbits (initial epochs from the range November 2, 2009–February 10, 2010) for each investigated gravity modela

When analyzing the distribution of RMS values in Table 7, it is also possible to notice another division of the models into two characteristic groups. They differ in the dispersion of RMS values. The standard deviations in Table 7 are the measure of this dispersion. The models EIGEN-5S, HUST-GRACE2016S and TONGJI-GRACE01 have clearly smaller dispersion at the level of 0.5 m, but for these models the best RMS values are always above 1 m. The remaining models belong to the second group with clearly larger (by a factor of 2–3) standard deviations in the range from 0.98 m to 1.69 m. In this case, the best RMS values are below 1 m, reaching 0.70 m. However, this group can be divided further into two subgroups coinciding with the two groups mentioned above; the first subgroup consisted of ITSG-GRACE2014K, EIGEN-51C and EGM2008 with the standard deviations at the level of 1 m and the second one that consists of ITU-GRACE16 and ITSG-GRACE2014S with the standard deviations around 1.5 m. Therefore, the following pattern can be presented: the lower dispersion of RMS values about 0.5 m and somewhat larger minimum values of RMS up to 1.29 m (Table 7), and on the other hand the larger RMS dispersion from 0.98 m to 1.69 m and the lower minimum values of RMS from 0.70 m to 0.87 m. Regarding the maximum RMS values, the following pattern can also be indicated: the smaller the dispersion (smaller standard deviation), the lower the maximum RMS value. This pattern can be seen more clearly than in the case of the RMS minimum value, it occurs for all three groups of models divided according to the standard deviation values.

Taking into account RMS differences and all estimated 1-day orbital arcs, the loss of accuracy on average 0.46 m when using the models EIGEN-5S, HUST-GRACE2016S and TONGJI-GRACE01 in the domain of minimal RMS values (Min RMS values, Table 7) is smaller than the accuracy loss on average 1.73 m in the domain of maximal RMS values (Max RMS values, Table 7) in the case of the use of the remaining gravity models.

The different RMS values for the particular days in Table 7, being the basis for formulating the above observations, reflect first of all differences in the performance of the tested gravity models and to a much lesser extent the differences in the accuracy of corresponding 1-day arcs of the reference orbit. This statement is justified by the analysis of the values of SLR residuals of the PSO RD reference orbit for the period from November 1, 2009, until mid-January 2010 presented by Bock et al. (2011). As already mentioned, the RMS of these residuals is below 2 cm, while the maximum values in individual cases for single SLR stations reach up to 10 cm. In one isolated case, in the middle of December 2009, a residual of about 20 cm appears for Greenbelt station (Bock et al. 2011).

Summary and conclusions

The long-wavelength parts of eight GRACE-related gravity models were investigated and compared with each other by the fit of estimated orbital arcs to the official reduced-dynamic and kinematic orbits of the GOCE satellite. The Cartesian coordinates of both types of the orbit were observations in the dynamic orbit determination process using the least-squares adjustment. Different variants of estimated arcs, with three different arc lengths, ten and fifty arcs estimated for each gravity model and different options for additional background dynamic models—with and without these models in the estimation process, were obtained. As a measure of the fit of estimated arcs, the RMS difference between these arcs and the reference orbits was adopted. The results were obtained in the time span between November 02, 2009, and February 11, 2010.

Comparing the efficiency of using the background models, it is evident that for the increasing orbital arc, this efficiency decreases, for example, almost a double decrease for the 1-day arcs compared to the 90.2-min arcs. It is caused by increasing systematic errors due to the simplified parametrization of estimated orbital arcs.

Inclusion in the calculations (with ten estimated 1-day arcs for each model) of the background models did not significantly affect the relative performance of the tested gravity models compared to the option without considering these (background) models.

The obtained results confirm the better fit to the reduced-dynamic orbit than to the kinematic orbit, which may be connected with a greater smoothness of the reduced-dynamic orbit and, consequently, its greater accuracy.

Taking into account the short arcs of 45.2 min and 90.2 min (for the ten estimated arcs for each tested model), the performance differences of given gravity models were not resolved due to too small differences between the RMS values for the particular gravity models. These differences were under the accuracy level (about 2 cm) of the PSO reduced-dynamic and kinematic reference orbits. Thus, the performance of given gravity models is equivalent in the aspect of estimation of several tens of minute arcs within the reference orbit accuracy.

For the 1-day arcs, both for the ten and the fifty estimated arcs per each gravity model, without and with the background models, the performance differences between the investigated gravity models are visible in the following two aspects:

  1. (a)

    The differences of RMS of fit of estimated arcs for the tested gravity models reach the level of several meters, significantly exceeding the 2 cm threshold of the reference orbit accuracy.

  2. (b)

    The analysis of particular RMS values (variant with fifty arcs for each gravity model with the background models) reveals the characteristic pattern of the performance of the gravity models. The gravity models EIGEN-5S, HUST-GRACE2016S and TONGJI-GRACE01, with exactly the same smallest mean RMS values (for all 1-day arcs), have the slightly higher minimum RMS values (by a factor on average 1.6) with the smaller range of RMS variations compared to the smaller minimum RMS values for the remaining gravity models but in turn with the larger range of RMS variations. Consequently, for the three aforementioned models, the smaller maximum values of RMS are obtained, while in turn for the remaining models, i.e., for ITU-GRACE16, ITSG-GRACE2014S, ITSG-GRACE2014K, EIGEN-51C and EGM2008, the maximum RMS values are higher (by a factor on average 1.5).

Considering the aspect (b), it can be concluded that the described effects in a certain sense are mutually compensating in the domain of minimum and maximum RMS values. So, the performance of these both groups of gravity models in this respect can be regarded as complementary. However, taking into account the smallest mean RMS values and the smaller standard deviations of RMS values, the models of the first group, i.e., EIGEN-5S, HUST-GRACE2016S and TONGJI-GRACE01 should be highlighted. Due to the same mean RMS values and similar values of standard deviations, the performance of these models can be treated as equivalent. Considering the distribution of RMS values for the particular orbital arcs, these gravity models work stably and predictably, without showing such large changes in daily RMS values as in the case of other models. It is worth reminding here that the gravity model EIGEN-5S was the a priori model in the estimation process of the reduced-dynamic PSO (reference orbit) (Bock et al. 2011). In turn, another model from this group HUST-GRACE2016S is based on the longest-period GRACE data from all tested models. The length of this period is 13 years (Zhou et al. 2017).