1 Introduction

The terrestrial ionosphere is a large collection of free electrons and ions distributed at an altitude of about 60–1500 km above the Earth's surface (Bilitza et al. 2011; Schaer 1999). The dispersive nature of the ionosphere causes the phase advance and the group delay of radio waves. Similarly, total electron content (TEC) or electron density along the propagation path of radio wave is able to be inverted by using Global Navigation Satellite System (GNSS) dual-frequency observations (Alizadeh et al. 2011; Lanyi and Roth 1988). By using multi-system and multi-frequency observations provided by GNSS, high-precision global ionospheric models are established (Calabia and Jin 2020; Goss et al. 2019; Hernandez-Pajares et al. 2009; Li et al. 2020; Schaer 1999). In this way, GNSS has been one of the main technologies for modeling, monitoring and studying the ionosphere (Calabia et al. 2022; Feng et al. 2019; Mannucci et al. 1998).

In order to organize and coordinate the ionospheric analysis centers for routine calculation and provide stable and reliable ionospheric combined products, The International GNSS Service (IGS) established the IGS ionosphere working group (IWG) in 1998 (Feltens and Schaer 1998). At present, eight IGS Ionosphere Associate Analysis Centers (IAACs) independently adopt different approaches to estimate global ionospheric maps (GIMs). The Jet Propulsion Laboratory (JPL), Universitat Politècnica de Catalunya (UPC) and Deutsches Geodätisches Forschungsinstitut der Technischen Universität München (DGFI-TUM) employ spherical triangles with splines (Mannucci et al. 1998), tomography with splines and kriging (Hernández-Pajares et al. 1999; Orus et al. 2005) and B-spline expansion (Goss et al. 2019), respectively. The Center for Orbit Determination in Europe (CODE), European Space Agency/European Space Operations Center (ESA/ESOC), Natural Resources Canada (NRCan), Chinese Academy of Sciences (CAS) and Wuhan University (WHU) adopt the spherical harmonic expansion (Feltens 2007; Ghoddousi-Fard 2014; Li et al. 2014; Schaer 1999; Zhang and Zhao 2018).

Although the spherical harmonic expansion has been widely adopted in ionospheric modeling, it is not optimal for representing observations of heterogeneous density and quality (Schmidt et al. 2011, 2006). Since the ground-based GNSS stations are inhomogeneously distributed around the world and there are large data gaps in areas such as parts of the ocean, desert and polar region, the reliability of GIMs estimated with GNSS observations is significantly reduced. There are oversized and negative vertical TECs (VTECs) estimated with function-based models and flat VTECs estimated with grid-based models (An et al. 2019; Hernandez-Pajares et al. 2002; Zhang et al. 2013). Several solutions have been proposed to mitigate the inaccurate VTECs in GIMs, including the assimilation of climate models and empirical models such as the International Reference Ionosphere (IRI) model (Bilitza et al. 2017), the Bent model (Bent et al. 1975) and the NeQuick model (Hernandez-Pajares et al. 2002; Hochegger et al. 2000; Mannucci et al. 1998; Wang et al. 2016), attaching inequality constraint (Zhang et al. 2013) and integrating observations of the radio occultation, satellite altimetry and Doppler orbitography and radiopositioning integrated by satellite (DORIS) (Alizadeh et al. 2011; An et al. 2019; Li et al. 2014). However, the computing burdens and the quantity and timeliness of the measured observations restrict the effectiveness of the above methods.

The IRI model is a joint project sponsored by the Committee on Space Research (COSPAR) and the International Union of Radio Science (URSI) with the goal of developing and improving international standard for parameters in the Earth’s ionosphere (Bilitza et al. 2011; Rawer et al. 1978). The IRI is the official standard for the ionosphere recognized by International Standardization Organization, URSI, COSPAR and the European Cooperation for Space Standardization, and has been one of the most effective ionosphere models and widely recognized and accepted (Bilitza 2018). Furthermore, the accuracy of the IRI-2016 model has improved by integrating reliable observations from a variety of technologies such as ionosonde, incoherent scatter radar, rocket, topside sounder, in situ satellite, GNSS and GNSS radio occultation. The IRI model is capable of providing reliable ionospheric parameters from the lower boundary to a user-specified upper boundary on a global scale. Therefore, the big gaps of global ground-based GNSS stations can be well filled with IRI VTECs, and the reliability of GIM has been significantly improved (Hernandez-Pajares et al. 2002; Wang et al. 2016). However, the calibration methods are still subject to in-depth study, while GNSS is only used for validation (Bilitza and Reinisch 2021). Wang et al. (2016) pointed out that there is no very good agreement between the IRI VTECs and the observed values, especially during geomagnetic storms. The VTEC biases between the IRI-2016 and IGS final GIM (IGSG) mainly range from -10 to 10 TEC unit (TECu), and the maximum is larger than 30 TECu over the equatorial ionization anomaly regions (Chen et al. 2020). Therefore, it is not appropriate to directly integrate the VTECs from IRI and GNSS in ionospheric modeling. Since IRI is an empirical model, the method of artificially attaching prior noise to IRI VTEC is commonly adopted for integrating the VTECs of IRI and GNSS (An et al. 2019; Wang et al. 2016). However, since the biases between the IRI and GNSS are not very clear at present, and the ionosphere varies greatly in different times and spaces, it is difficult to accurately express the priori noises with a simple functional model.

In this study, in order to mitigate the inaccurate VTECs in GIMs caused by the inhomogeneous distribution of global GNSS stations, a method of automated assimilation of multi-GNSS and Virtual Observation Stations (VOSs) observations assisted by spatiotemporal varying parameters and priori constraints in global ionospheric modeling is proposed. This study consists of five parts. Section 2 introduces the algorithms of deriving and smoothing slant TECs (STECs) from GNSS observations, analyzes the functional and stochastic model of the VTEC biases between GNSS and IRI-2016, elaborates the strategies of establishing VOSs with the help of IRI-2016 model and generates biases for each VOS, and provides theories of spherical harmonic expansion and parameter estimation. Section 3 presents the effectiveness of attaching VOSs and biases by comparing the estimated Shanghai Astronomical Observatory final GIM (SHAG) with assessment stations, IGSG and the actual calculated VTEC biases between IGSG and IRI-2016. Section 4 validates the long-term reliability of SHAG spanned about one solar cycle with IGS final GIMs and of Jason-2/-3 VTECs. Section 5 summaries the study and draws conclusions.

2 Strategies of global ionosphere modeling

This section introduces the strategy of retrieving and smoothing TECs from GNSS observations, analyzes the functional and stochastic model of the GNSS and IRI-2016 VTEC biases, proposes the method of establishing VOS and introduces the spherical harmonic expansion and the least squares method used in SHAG estimation.

2.1 Deriving and smoothing STECs from GNSS observations

The pseudorange and carrier phase observations of GNSS on a specific frequency can be expressed as follows:

$$ \begin{array}{*{20}c} {\left\{ {\begin{array}{*{20}c} {P_{r}^{s} = \rho_{r}^{s} + c\left( {\Delta t^{s} - \Delta t_{r} } \right) + I_{r}^{s} + T_{r}^{s} + c\left( {B^{s} + B_{r} } \right) + \varepsilon^{P} } \\ {L_{r}^{s} = \rho_{r}^{s} + c\left( {\Delta t^{s} - \Delta t_{r} } \right) - I_{r}^{s} + T_{r}^{s} + c\left( {b^{s} + b_{r} } \right) + \lambda N_{r}^{s} + \varepsilon^{L} } \\ \end{array} } \right.} \\ \end{array} $$
(1)

where \(P_{r}^{s}\) and \({L}_{r}^{s}\) are pseudorange and carrier phase observations between satellite \(s\) and receiver \(r\); \({\rho }_{r}^{s}\) is the geometry distance; \(c\) is the speed of light; \({\Delta t}^{s}\) and \({\Delta t}_{r}\) are clock errors of satellite and receiver, respectively; \({I}_{r}^{s}\) is the ionospheric delay of pseudorange and advance of carrier phase along line of sight (LOS); \({T}_{r}^{s}\) is the tropospheric delay along LOS; \({B}^{s}\) and \({B}_{r}\) are pseudorange delays of satellite and receiver, respectively; \({b}^{s}\) and \({b}_{r}\) are carrier phase delays of satellite and receiver, respectively; \(N\) and \(\lambda \) denote the ambiguity and the wavelength of carrier phase, respectively; and \({\varepsilon }^{P}\) and \({\varepsilon }^{L}\) are other errors of pseudorange and carrier phase, such as observation noise and multi-path error.

Ignoring ionospheric higher-order terms and only preserving the first-order term, \({I}_{r}^{s}\) can be expressed as follows (Bassiri and Hajj 1993):

$$ \begin{array}{*{20}c} {I_{r}^{s} = \frac{{e^{2} }}{{8\pi^{2} \varepsilon_{0} mf^{2} }}{\text{STEC}}_{r}^{s} \approx 40.308\frac{1}{{f^{2} }} {\text{STEC}}_{r}^{s} } \\ \end{array} $$
(2)

where \(e\) and \(m\) are the electron charge and mass; \({\varepsilon }_{0}\) is the permittivity of the free space; \({\mathrm{STEC}}_{r}^{s}\) is the TECs along LOS; and \(f\) is the frequency of carrier.

Neglecting relatively small errors, such as the multi-path error, observation noise and carrier phase delay of satellite and receiver, the dual-frequency geometry-free linear combinations of pseudorange and carrier phase are expressed as follows (Lanyi and Roth 1988):

$$\begin{array}{c}\left\{\begin{array}{c}{P4}_{r,i}^{s}={P}_{r, i,{L}_{A}}^{s}-{P}_{r,i, {L}_{B}}^{s}=40.308\left(\frac{1}{{f}_{{L}_{A}}^{2}}-\frac{1}{{f}_{{L}_{B}}^{2}}\right){\mathrm{STEC}}_{r, i}^{s}+c\left({\mathrm{DCB}}^{s}+{\mathrm{DCB}}_{r}\right)\\ {L4}_{r, i}^{s}={L}_{r,i, {L}_{A}}^{s}-{L}_{r,i,{L}_{B}}^{s}=-40.308\left(\frac{1}{{f}_{{L}_{A}}^{2}}-\frac{1}{{f}_{{L}_{B}}^{2}}\right){\mathrm{STEC}}_{r,i}^{s}+{N}_{{L}_{A}}{\lambda }_{{L}_{A}}-{N}_{{L}_{B}}{\lambda }_{{L}_{B}}\end{array}\right.\end{array}$$
(3)

where \({P4}_{r,i}^{s}\) and \({L4}_{r,i}^{s}\) are geometry-free linear combined values of pseudorange and carrier phase at epoch of \(i\); \({L}_{A}\) and \({L}_{B}\) correspond to two frequency points of GNSS; and \({\mathrm{DCB}}^{s}\) and \({\mathrm{DCB}}_{r}\) are the differential code biases (DCBs) of satellites and receivers.

Because \({P4}_{r}^{s}\) contains relatively large observation noise and multi-path error, the carrier phase to code leveling (CCL) method (Mannucci et al. 1998, 1993) is used to reduce the effects of noise and multi-path errors:

$$\begin{array}{l}\left\{\begin{array}{l} \Delta {L4}_{r}^{s}=\frac{1}{n}\sum_{i=1}^{n}\left({P4}_{r,i}^{s}+{L4}_{r,i}^{s}\right) =\frac{1}{n}\sum_{i=1}^{n}\left(c\cdot {\mathrm{SPR}}_{r}^{s}+{N}_{{L}_{A}}{\lambda }_{{L}_{A}}-{N}_{{L}_{B}}{\lambda }_{{L}_{B}}\right) \\ \qquad \qquad \qquad \qquad \qquad\quad\qquad\qquad {\mathrm{SPR}}_{r}^{s}={\mathrm{DCB}}^{s}+{\mathrm{DCB}}_{r}\\{\widehat{L4}}_{r,i}^{s}=-{L4}_{r,i}^{s}+\Delta {L4}_{r}^{s}=40.308\left(\frac{1}{{f}_{{L}_{A}}^{2}}-\frac{1}{{f}_{{L}_{B}}^{2}}\right) {\mathrm{STEC}}_{r,i}^{s}+c\cdot {\mathrm{SPR}}_{r}^{s}\end{array}\right.\end{array}$$
(4)

where \(\Delta L4_{r}^{s}\) is the average of a continuous arc which contains the inter-frequency satellite-plus-receiver bias (SPR) and ambiguity information; \(n\) represents the epochs of a continuous arc; \({\text{SPR}}_{r}^{s}\) is the sum of satellite DCB and receiver DCB; and \(\widehat{L4}_{r,i}^{s}\) is the smoothed geometry-free linear combined value at epoch \(i\). Note that the effect of smoothing largely depends on the length of a continuous arc (Wang et al. 2015). Hence, only the GNSS satellites arcs with more than 30 min of continuous observation time are employed in ionospheric modeling.

IGS provides global multi-system and multi-frequency GNSS observations from over 500 ground-based GNSS stations (Montenbruck et al. 2014). Compared with global ionospheric modeling with GPS or GPS/GLONASS observations adopted by most of IGS Ionosphere Associate Analysis Centers, better coverage of ionospheric pierce points (IPPs) is achieved by integrating observations of multi-GNSS such as GPS, GLONASS, Galileo, BEIDOU and QZSS, which means higher reliability of ionospheric model, especially for high-spatiotemporal-resolution ionospheric modeling (Ren et al. 2016). Therefore, all IGS stations and GPS, GLONASS, Galileo and BEIDOU observations are employed in this study. The station distribution of IGS network and the corresponding tracking GNSS systems (GPS, GLONASS, Galileo and BEIDOU) on April 10, 2021, are shown in Fig. 1. We see that there are large gaps in parts of the ocean, desert and polar region.

Fig. 1
figure 1

Distribution of IGS stations tracking GPS (blue dots), GLONASS (green squares), Galileo (yellow diamonds) and BEIDOU (red hexagrams) on April 10, 2021

2.2 Establishing VOSs and attaching bias parameters

Ground-based GNSS stations are inhomogeneously distributed across the Earth’s surface and there are large data gaps, which result in a significant reliability reduction of GIMs. We establish fourteen VOSs with the IRI-2016 model to cover the large data gaps (corresponding to VOS-01–VOS-14 in Fig. 2). The location and serial number of VOSs and the IPP distribution of GPS, GLONASS, Galileo, BEIDOU and VOS observations on April 10, 2021, are shown in Fig. 2. Note that VOSs are only set up in the regions with large data gaps, and it is not necessary to set VOSs in regions with relatively small gaps. By entering the parameters such as the latitude, longitude, time and upper height (20,000 km is adopted in this study) of VOS’s IPP (shown as Table 1) into IRI-2016 model, the virtual observations of VOS are obtained.

Fig. 2
figure 2

Location and serial number of fourteen VOSs (VOS-01–VOS-14) and seven test regions (R01–R07), and the IPP distribution of GPS (blue dots), GLONASS (green dots), Galileo (yellow dots), BEIDOU (red dots) and VOS (black dots) observations on April 10, 2021

Table 1 Serial numbers of VOS and their coverage and interval of longitude, latitude and time

In order to assimilate the GNSS and VOS observations in ionospheric modeling, the functional and stochastic model of the deviation between GNSS and IRI-2016 should be studied in advance. We selected seven small regions in South America where GNSS stations are densely distributed as the test regions, and we calculated the VTEC biases between GNSS and IRI-2016 for each test region. Seven test regions are located in longitudes of 60–70°W and latitudes of 70°S–0° with the intervals of 10°, corresponding to R01–R07 in Fig. 2. Since the IGSG is generated by ranking, weighting and combining the GIMs provided by CODE, ESA/ESOC, JPL and UPC, and provides high accuracy and reliability (Hernandez-Pajares et al. 2009, 2017), the GNSS VTECs in the seven regions were obtained from IGSG. The IRI-2016 predicted VTECs in the corresponding IGSG grid points were calculated by IRI-2016 model. The VTEC biases of seven test regions between GNSS and IRI VTECs from January 1, 2009, to January 1, 2020, are shown in Fig. 3. The VTEC biases of R-01 (in high latitudes) and R-07 (in low latitudes) during solar comparatively active periods (from January 1, 2014, to April 1, 2014) and quiet periods (from January 1, 2019, to April 1, 2019) are shown in Fig. 4.

Fig. 3
figure 3

VTEC biases between IGSG and IRI-2016 from January 1, 2009, to January 1, 2020, for seven test regions, where Region-01–Region-07 correspond to R01–R07 in Fig. 2 (unit: TECu)

Fig. 4
figure 4

VTEC biases between IGSG and IRI-2016 in Region-01 (corresponding to R-01 in high latitudes) and Region-07 (corresponding to R-07 in low latitudes) from January 1, 2014, to April 1, 2014 (a), and from January 1, 2019, to April 1, 2019 (b) (unit: TECu)

Figure 3 shows that the difference between GNSS and IRI-2016 VTECs during solar active periods is generally greater than that during solar quiet periods, and the difference in low latitudes is greater than that in high latitudes. During solar quiet periods, the VTEC biases vary from − 20 to 20 TECu, and mainly range from − 5 to 5 TECu for regions located in high latitudes and from − 10 to 10 TECu for regions located in low latitudes. During solar active periods, the VTEC biases vary from − 30 to 60 TECu, and mainly range from − 10 to 20 TECu for regions located in high latitudes and from − 20 to 30 TECu for regions located in low latitudes. Considering that the reliability of IRI-2016 VTECs is comparatively lower than that of GNSS VTECs and the spatiotemporal distribution of these two kinds of observations have intersection, to mitigate the influence of IRI-2016 VTECs on GNSS VTECs when integrating these two kinds of observations in ionospheric modeling, the priori noises of IRI-2016 VTECs are set to different values according to the geographic latitudes and solar activity levels. The solar radio flux at 10.7 cm index (F10.7) is adopted to characterize solar activity levels. Referring to Fig. 3 and the F10.7 index (ftp://ftp.seismo.nrcan.gc.ca/spaceweather/solar_flux), the priori noise values of IRI-2016 VTECs are set to 5–30 TECu and the detailed values in different latitudes and F10.7 indices are list in Table 2. The priori noise of GNSS pseudorange observation is set to 0.3 m, its weight is calculated by the elevation of satellite, and then, the priori noise of CCL values are determined according to the variance–covariance propagation law (Wang et al. 2016).

Table 2 Priori noise values of IRI-2016 VTECs in different geographic latitudes and solar activity indices

As the biases between GNSS and IRI-2016 VTECs are relatively large and present significant diurnal variation under different circumstances (shown as Fig. 4), it is inappropriate to integrate GNSS and IRI-2016 VTECs in ionospheric modeling only with prior noises, and it is difficult to describe the biases by a simple functional model. Because the ionospheric activity is generally assumed to be static over a short time span in ionospheric modeling, a piecewise constant is introduced to VOSs to further assume the VTEC bias between VOSs and GNSS as a constant in a small region and over a short period. Therefore, on the basis of adding priori noise for GNSS and VOS observations, the spatiotemporal varying biases are attached to each VOS with a 1-h interval to automatically absorb the VTEC biases between VOSs and GNSS:

$$ \begin{array}{*{20}c} {{\text{VTEC}}_{{{\text{GNSS}}}}^{i} = {\text{VTEC}}_{{{\text{VOS}}}}^{i} + {\text{Bias}}_{{{\text{VOS}}}}^{i,t} } \\ \end{array} $$
(5)

where \(i\) is the serial number of VOS, \(t\) denotes the time, \({\mathrm{VTEC}}_{\mathrm{VOS}}\) represents the VTEC of VOS, \({\mathrm{VTEC}}_{\mathrm{GNSS}}\) is the VTEC derived from GNSS and \({\mathrm{Bias}}_{\mathrm{VOS}}\) is the VOS bias parameter.

2.3 Methods of global ionosphere modeling and parameter estimation

The spherical harmonic expansion is well suited to express the spatiotemporal distribution and variation of the ionosphere on a global scale (Li et al. 2014; Schaer 1999; Zhang and Zhao 2019). In this study, we employ the single-layer model (SLM) to parameterize the VTECs of the ionosphere (Mannucci et al. 1998), which assumes all free electrons are in a shell of infinitesimal thickness at a fixed altitude. The modified single-layer model (MSLM) mapping function is adopted to convert the STEC derived from GNSS shown in Eq. (4) to VTEC (Schaer 1999). The two-dimensional spherical harmonic expansion, the mapping function and the VTEC of VOS can be expressed as follows:

$$\begin{array}{l}\left\{\begin{array}{l}\begin{array}{l}{\text{VTEC}}\left(\beta ,\gamma \right)=\sum_{n=0}^{{n}_{\mathrm{max}}}\sum_{m=0}^{n}{\widetilde{P}}_{nm}\left(\mathrm{sin}\beta \right)\\ \left[{\widetilde{C}}_{nm}\mathrm{cos}\left(m\gamma \right)+{\widetilde{S}}_{nm}\mathrm{sin}\left(m\gamma \right)\right]\end{array}\\ {\text{STEC}}\left(\beta ,\gamma \right)={\text{VTEC}}\left(\beta ,\gamma \right)\cdot {\text {MF}} \left({z}^{^{\prime}}\right)\\ {\text {MF}} \left({z}^{^{\prime}}\right)=\frac{1}{\mathrm{cos}{z}^{^{\prime}}}\\ \mathrm{sin}{z}^{^{\prime}}=\frac{R}{\left(R+H\right)}\mathrm{sin}\left(\alpha z\right)\\ {\text{VTEC}}\left(\beta ,\gamma ,t,s\right)={\mathrm{VTEC}}_{\text {VOS}}\left(\beta ,\gamma ,t,s\right)+{\mathrm{Bias}}_{\mathrm{VOS}}(t,s)\end{array}\right.\end{array}$$
(6)

where \(\beta\) and \(\gamma\) are the geomagnetic latitude and sun-fixed longitude of IPP, respectively; \(n_{{{\text{max}}}}\) is the maximum of degree and is set to 15; \(\tilde{P}_{nm}\) is the normalized associated Legendre function of degree n and order m; \(\tilde{C}_{nm}\) and \(\tilde{S}_{nm}\) are the ionospheric model coefficients to be estimated; \({\text{MF}} \) is the mapping function; \(z^{\prime}\) and \(z\) are the corrected and uncorrected satellite zenith at a GNSS station, respectively; \(\alpha\) is the correction factor set to 0.9782; \(R\) is the mean radius of the Earth set to 6378 km; H is the height of MSLM set to 450 km; \(s\) denotes the serial number of VOS; \(t\) represents the time; and \({\text{Bias}}_{{{\text{VOS}}}}\) are estimated at 1-h interval to absorb the VTEC biases between GNSS and IRI-2016.

Since the ionospheric information derived from GNSS contains the satellite-plus-receiver biases, the biases are simultaneously estimated with the ionospheric model coefficients. To separate the satellite-plus-receiver biases into the DCBs of satellites and receivers, the zero-mean of satellite DCBs constraint condition (Schaer 1999; Wang et al. 2015) is imposed to each GNSS constellation:

$$ \begin{array}{*{20}c} {\mathop \sum \limits_{i = 1}^{N} {\text{DCB}}^{S, i} = 0 } \\ \end{array} $$
(7)

where the \(N\) denotes the number of satellites for the specific GNSS constellation and \(S\) stands for the GNSS constellation, such as GPS, GLONASS, Galileo and BEIDOU.

The least squares method is adopted to solve the parameters to be estimated. The solutions of the parameter vector \(\hat{X}\) and the covariance matrix \(D_{{\hat{X}\hat{X}}}\) are as follows:

$$ \begin{array}{*{20}c} {\left\{ {\begin{array}{*{20}c} {\hat{X} = \left( {\left. {\sum B^{T} PB} \right)} \right.^{ - 1} \left( {\sum B^{T} PL} \right)} \\ {V = B\hat{X} - L} \\ {\hat{\sigma }_{0} = \sqrt {\begin{array}{*{20}c} {\frac{{V^{T} PV}}{n - t}} \\ \end{array} } } \\ {D_{{\hat{X}\hat{X}}} = \hat{\sigma }_{0}^{2} \left( {\sum B^{T} PB} \right)^{ - 1} } \\ \end{array} } \right. } \\ \end{array} $$
(8)

where \(\hat{X} = \left[ {ION,{ }SPR,{ }VB{ }} \right]^{{\text{T}}}\) is the parameter vector to be estimated, among which the \(ION\) consists of the ionospheric model coefficients \(\tilde{C}_{nm}\) and \(\tilde{S}_{nm}\), and it is estimated with a 1-h interval; \(SPR\) and \(VB\) are the parameter vectors of SPRs and VOS biases, and these are estimated at 1-day and 1-h intervals, respectively; \(B\) denotes the design matrix, the design matrix of GNSS station is obtained from Eqs. (4) to Eq. (6), and the design matrix of VOS is obtained from Eq. (6); \(P\) is the weight matrix of observations; \(L\) denotes the vector of actual observations; \(B^{T} PB\) is the normal equation matrix and \(B^{T} PL\) is the right hand side of the normal equation system; the \(B^{T} PB\) and \(B^{T} PL\) are established for each GNSS and VOS station; \(V\) represents the vector of the observation corrections; \(\hat{\sigma }_{0}\) is the standard deviation of the unit weight; \(n\) is the number of observations; \(t\) is the number of parameters to be estimated; \(D_{{\hat{X}\hat{X}}}\) is the covariance matrix of \(\hat{X}\). After imposing the zero-mean of satellite DCBs constraint condition to SPRs, the DCBs of each satellite and receiver are obtained. The piecewise linear (PWL) is introduced to interpolate consecutive sets of \(ION\) (Schaer 1999). The GIM in the IONosphere map EXchange format (IONEX) is generated when the VTECs and the corresponding RMSEs on specific grid points are calculated with the spatiotemporal resolution of 1-h × 2.5° × 5.0° in time, latitude and longitude, respectively (Schaer et al. 1998).

3 Validation of VOSs and estimated biases

In this section, the effectiveness of attaching VOS in ionospheric modeling are validated with the VTECs of independent GNSS stations and IGSG. The estimated VOS biases are compared with the calculated biases between IGSG and IRI-2016 to verify the validity of adding VOS bias parameters.

3.1 Validation of VOSs with assessment stations

In order to validate the effectiveness of VOS and the performance of different intervals of VOS biases during different solar activity periods, two kinds of strategies are adopted for estimating the SHAG on the solar active day (April 10, 2014) and the solar quiet day (April 10, 2017). One was to only use ground-based GNSS observations (without VOS), the other was to attach VOSs and biases in the regions of large data gaps. The bias intervals of VOS are set to 1, 2, 4, 6, 8, 12 and 24 h.

The VTECs retrieved from independent assessment GNSS stations are compared with the SHAG estimated according to the above strategies to verify how much improvement VOSs will bring to the large gaps between the IGS stations, find the optimal interval of bias and validate whether VOSs have negative effects on the regions where the GNSS stations are homogeneously distributed. The distribution of modeling and assessment stations on April 10, 2017, are shown in Fig. 5. The modeling stations were selected from IGS network, and the assessment stations were obtained from University Navstar Consortium (UNAVCO) and Système d’observation du Niveau des aux Littorales (SONEL) networks. Note that ISPA (IGS station) and ILSG were unavailable since January 10, 2019, and December 12, 2017, respectively, and there were no other stations on that regions. Hence, ISPA and ILSG were used as assessment stations to evaluate the performance of VOSs in low-latitude areas. The STECs retrieved from assessment stations can be obtained from Eq. (4) and is expressed as follows:

$$ \begin{array}{*{20}c} {\begin{array}{*{20}c} \\ {{\text {STEC}}_{r,i}^{s} = \frac{{f_{{L_{A} }}^{2} \cdot f_{{L_{B} }}^{2} }}{{40.308 \cdot \left( {f_{{L_{B} }}^{2} - f_{{L_{A} }}^{2} } \right)}}\left[ {\widehat{L4}_{r,i}^{s} - c \cdot \left( {{\text {DCB}}^{s} + {\text {DCB}}_{r} } \right)} \right]} \\ \end{array} } \\ \end{array} $$
(9)
Fig. 5
figure 5

GNSS station distributions for modeling the global ionosphere (Modeling Station), assessing the impact of VOSs on the large gaps between IGS GNSS stations [Assessment Station (a)] and assessing the impact of VOSs on the areas with GNSS stations [Assessment Station (b)] on April 10, 2017. Assessment Station (a) is used for verifying the effectiveness of VOSs and biases in the large gaps and find the optimal time interval of bias, and Assessment Station (b) is used to validate whether VOSs negatively affect the model reliability in regions with GNSS modeling stations

The DCBs of the satellites and assessment stations were estimated together with the modeling stations and different bias intervals (including without VOS) to reduce the influence of DCB on the derived STEC. The STECs of assessment stations with different bias intervals are extracted from Eq. (9) with the corresponding DCBs. The VTECs of SHAG corresponding to the extracted STECs were obtained by interpolating VTEC maps (Schaer et al. 1998). After converting the extracted STECs to VTECs with the mapping function in Eq. (6), the VTECs of assessment stations were used to evaluate the effectiveness of attaching VOSs. The bias, the mean value, the STD and RMSE of VTEC in the study are calculated as follows:

$$ \begin{array}{*{20}c} {\left\{ {\begin{array}{*{20}c} {{\text{VTEC}}_{{{\text{bias}}}} = {\text{VTEC}} - {\text{VTEC}}_{{{\text{ref}}}} } \\ {{\text{VTEC}}_{{{\text{MEAN}}}} = \frac{1}{n} \cdot \mathop \sum \limits_{i = 1}^{n} {\text{VTEC}}^{i} } \\ {{\text{VTEC}}_{{{\text{STD}}}} = \sqrt {\frac{1}{n} \cdot \mathop \sum \limits_{i = 1}^{n} \left( {{\text{VTEC}}^{i} - {\text{VTEC}}_{{{\text{MEAN}}}} } \right)^{2} } } \\ {{\text{VTEC}}_{{{\text{RMS}}}} = \sqrt {\frac{1}{n} \cdot \mathop \sum \limits_{i = 1}^{n} \left( {{\text{VTEC}}_{{{\text{bias}}}}^{i} } \right)^{2} } } \\ \end{array} } \right.} \\ \end{array} $$
(10)

where the \({\text{VTEC}}_{{{\text{bias}}}}\), \({\text{VTEC}}_{{{\text{MEAN}}}}\), \({\text{VTEC}}_{{{\text{STD}}}}\) and \({\text{VTEC}}_{{{\text{RMS}}}}\) represent the bias, the mean value, the standard deviation (STD) and the RMSE of VTEC, respectively, \({\text{VTEC}}\) is the modeled value, \({\text{VTEC}}_{{{\text{ref}}}}\) is the reference value of \({\text{VTEC}}\) and n denotes the number of \({\text{VTEC}}\).

Four independent assessment stations [corresponding to Assessment Station (a) in Fig. 5] located in the sparse areas of GNSS stations were selected as references to validate the VOSs and biases, among which the ILSG and ISPA are located at low latitudes, the KEPA is located at middle latitudes and the THUR is located at high latitudes. The VTEC RMSEs of SHAG without VOSs and with VOSs and different bias intervals during solar active and quiet periods with respect to assessment stations are listed in Table 3. We can see that the VTEC RMSEs of assessment stations with VOSs and biases are all significantly better than those without VOS, especially for those station located in low latitudes (ILSG and ISPA) and during the solar active periods. Compared with the SHAG without VOS, the RMSEs of SHAG with VOSs and biases (1-h interval) are reduced by 2.28–4.68 TECu and 0.93–1.83 TECu, corresponding to 21.65–51.61% and 27.74–53.06% during the solar active and quiet periods, respectively.

Table 3 VTEC RMSEs of SHAG estimated without VOS and with VOSs and different bias intervals on a solar relatively active day (April 10, 2014) and on a solar relatively quiet day (April 10, 2017) with respect to the STECs retrieved from assessment stations (unit: TECu)

For SAHG estimated with different bias intervals, the VTEC RMSEs generally decrease with the shortening of bias intervals, and the improvement in reliability of SHAG is particularly noticeable at low latitudes and during solar active periods, with a maximum reduction of 0.95 TECu for SHAG estimated with 1-h interval relative to 24-h interval. However, in middle and high latitudes and during solar quiet periods, the discrepancy in SHAG estimated with different bias intervals is not significant. The RMSEs of SHAG estimated with 1-h interval are occasionally higher than those estimated with higher intervals, but the maximum difference is only 0.21 TECu at THUR during solar active periods. Based on the above analysis, we can see that attaching VOSs and biases in the large gaps noticeably enhance the reliability of SHAG in the large data gap areas, and it is optimal to set bias interval to 1 h.

Four validation stations [corresponding to Assessment Station (b) in Fig. 5] were selected to verify whether VOSs have negative effects on the areas where GNSS stations are homogeneously distributed, among which the APNT and TBTG are located at low latitudes and close to VOSs, the BELE is located at low latitudes and far away from VOSs, and the OSLS is located at middle and high latitudes and far away from VOSs. The VTEC RMSEs of four assessment stations with respect to SHAG with VOSs and without VOSs during solar active and quiet periods are listed in Table 3. We can see that the assessment stations close to VOSs (APNT and TBTG) are slightly affected by VOSs, with the maximum RMSE discrepancy of 0.19 TECu, while the assessment stations far away from VOSs (BELE and OSLS) are not affected by VOSs.

By validating SHAG with independent assessment stations, it can be seen that attaching VOSs are capable of enhancing the reliability of SHAG in the large GNSS data gaps, and attaching bias with 1-h interval to each VOS evidently improves the consistency of SHAG with respect to the retrieved STECs of assessment stations. Moreover, attaching VOSs and biases on the basis of GNSS has no effect on modeling in the regions where the GNSS stations are homogeneously distributed.

3.2 Validation of VOSs and biases with IGSG

In order to verify whether attaching VOSs and biases improves the reliability of SHAG, the SHAG without VOS and with VOSs and biases (1-h interval) on a solar relatively active periods (April 10, 2014) and on a solar quiet periods (April 10, 2021) at 22:00 UTC was estimated and compared with IGSG.

Figure 6 shows the VTEC maps of SHAG at 22:00 UTC without VOS and with VOSs during different periods of solar activity, and the corresponding VTEC bias maps with respect to IGSG are also depicted. It can be seen that due to the lack of observations, the SHAG without VOS has comparatively large biases with respect to IGSG in the large data gaps, with the maximum biases of 191.20 in the solar active periods and 54.30 TECu in the solar quiet periods, and a wide range of inaccurate VTECs appear in the large data gaps. After attaching VOSs and VOS biases, the inaccurate VTECs in the large data gaps are effectively mitigated. The VTEC morphology of the ionosphere in the equatorial ionization anomaly regions with VOSs and biases (see Fig. 6c, g) is more explicit than that without VOS (see Fig. 6a, e). With the help of VOSs and biases, the maximum biases of SHAG with respect to IGSG at 22:00 UTC are reduced from 191.20 to 44.80 TECu and from 51.30 to 12.40 TECu, and the RMSEs are reduced from 8.07 to 3.75 TECu and from 2.72 to 1.65 TECu during solar active and quiet periods, respectively.

Fig. 6
figure 6

Global VTEC maps of SHAG and the VTEC bias maps of SHAG with respect to IGSG. The first two rows represent the distributions of SHAG without VOS (a) and with VOSs (c) and the distribution of VTEC bias error with regard to IGSG without VOS (b) and with VOSs (d) on April 10, 2014 (corresponding to the day of year 100, in 2014), at 22:00 UTC. The last rows represent the distributions of SHAG without VOS (e) and with VOSs (g) and the distribution of VTEC bias error with regard to IGSG without VOS (f) and with VOSs (h) on April 10, 2021 (corresponding to the day of year 100, in 2021) at 22:00 UTC

3.3 Comparison of bias values and the calculated VTEC biases

In order to validate whether attaching VOS biases are capable of absorbing the VTEC biases, the VTEC biases between IGSG and IRI-2016 were calculated and compared to the bias values estimated with 1-h interval. Since the VTEC map interval of IGSG is 2 h, the bias values were collected at the same interval. Figure 7 shows the bias values in the large GNSS data gap region of VOS-03 in 2014 (solar active year) 2017 (solar quiet year) and the calculated VTEC biases between IGSG and IRI-2016 in the corresponding region. It can be seen that the trend of bias values and calculated VTEC biases are quite consistent during both solar active and quiet periods, the magnitudes of them both mainly range from − 10 to 20 TECu in 2014 and from − 5 to 5 TECu in 2017. Moreover, from the enlarged portions of Fig. 7 we can see that bias values show obvious diurnal variation, which is consistent with the calculated VTEC biases.

Fig. 7
figure 7

Estimated VOS bias (VB) values of VOS-03 in 2014 (a) and 2017 (c) and the VTEC biases between IGSG and IRI-2016 in VOS-03 in 2014 (b) and 2017 (d), and the enlarged portion of each figure represents the bias values from 0:00 on June 1 to 0:00 on July 1 of the corresponding year (unit: TECu)

We compared the VTEC values of estimated SHAG (with VOSs and biases), IGSG and IRI-2016 in the large GNSS data gap region corresponding to VOS-05 on a solar relatively active day (April 10, 2014) and a quiet day (April 10, 2017). Note that a GNSS station (ISPA) is located in this region, which is used in IGSG estimation and not used in SHAG estimation, and thus, IGSG is used as a reference. Figure 8a, b shows the estimated VOS bias values and calculated VTEC biases between IGSG and IRI-2016 in the region of VOS-05, and the VOS bias values and calculated VTEC biases show good agreement in both solar active and quiet days. The VTECs in VOS-05 on April 10, 2014, and April 10, 2017, are presented in Fig. 8c, d. The mean VTEC deviations of SHAG and IRI-2016 with respect to IGSG are listed in Table 4. We can see that with the help of attaching biases, SHAG is more consistent with IGSG than IRI-2016. The mean VTEC deviations of SHAG with respect to IGSG are 3.72 and 1.14 TECu in the solar active and quiet days, while those of IRI-2016 with respect to IGSG are 5.57 and 3.21 TECu, respectively. From the above analysis, it can be concluded that the VOS biases are capable of absorbing the VTEC biases between GNSS and IRI-2016.

Fig. 8
figure 8

VOS bias (VB) values of VOS-05 and the biases between IGSG and IRI-2016 on April 10, 2014 (a), and on April 10, 2017 (b), and the VTECs of SHAG, IRI-2016 and IGSG in VOS-05 on April 10, 2014 (c), and on April 10, 2017 (d) (unit: TECu)

Table 4 Mean VTEC deviations of SHAG and IRI-2016 relative to IGSG in VOS-05 on April 10, 2014, and on April 10, 2017 (unit: TECu)

4 Validation of SHAG

This section validates the long-term reliability of SHAG with IGS final GIMs and Jason-2/-3 VTECs, and compares the performance of SHAG without VOS and with VOS.

4.1 IGS final GIM validation

We employ our new methodology to estimate SHAG spanned about one solar cycle (2009–2019) and validate the long-term consistency of SHAG with the final GIMs of IGS (IGSG), CODE (CODG), JPL (JPLG), UPC (UPCG), ESA/ESOC (ESAG), NRCan (EMRG), CAS (CASG) and WHU (WHUG). Figure 9 shows the diurnal mean VTEC series of all GIMs and diurnal mean VTEC biases and VTEC RMSEs of GIMs with respect to IGSG from 2009 to 2019. It can be seen that although different model and strategies were adopted to generate GIMs, the diurnal mean VTECs of GIMs are consistent and are able to respond to different solar activities. The diurnal mean VTEC biases and RMSEs of SHAG with respect to IGSG are about -2.39–0.78 TECu and 1.09–4.99 TECu, respectively, which are comparable to other GIMs.

Fig. 9
figure 9

Diurnal mean VTEC series of GIMs (a), the diurnal mean VTEC bias of GIMs with respect to IGSG (b) and diurnal VTEC RMSEs (c) of GIMs with respect to IGSG from 2009 to 2019 (unit: TECu)

As the ionospheric activities vary significantly at different latitudes, in order to further verify the reliability of SHAG at different latitudinal bands, we divided the globe from 90° N to 90° S into 6 latitudinal bands spaced 30° apart and calculated the VTEC biases and RMSEs of GIMs (excluding EMRG due to incomplete data) with respect to IGSG at 6 latitudinal bands and global scales from 2009 to 2019, which are summarized in Tables 5 and 6, respectively. Table 5 shows that there is almost no VTEC bias between UPCG and IGSG on the global scale, but the biases vary greatly at different latitudinal bands, with a maximum difference of 2.53 TECu. The VTEC biases of SHAG with respect to IGSG are generally lower than other GIMs, especially in latitudinal bands of 30°N–0°, 0°–30°S and 30°S–60°S, which correspond to the main distribution regions of VOSs, indicating that VOS improves the consistency of SHAG and IGSG to some extent. As illustrated in Table 6, the VTEC RMSEs of GIMs fluctuates greatly at different latitudinal bands, with the maximum variation ranging from 1.38 to 3.48 TECu (ESAG), and the VTEC RMSE is usually better in the northern hemisphere than that in the southern hemisphere and is generally better in high latitudes than that in low latitudes. Compared with other GIMs, CODG has the best performance in VTEC RMSEs at both at different latitudinal bands and global scale. The VTEC RMSE of SHAG ranges from 1.57 to 2.90 TECu, which is in good agreement with the other GIMs.

Table 5 VTEC biases of GIMs with respect to IGSG at different latitudinal bands and on global scales (unit: TECu)
Table 6 VTEC RMSEs of GIMs with respect to IGSG at different latitudinal bands and on global scales (unit: TECu)

4.2 Jason-2/-3 VTEC validation

The dual-frequency (Ku-/C- band) observations of satellite altimetry such as TOPEX, Jason series can be used to estimate the VTEC from the Earth’s surface (mainly over the surface of ocean and ice) to the orbit altitude of the satellite (Imel 1994). Comparing to TEC from GNSS, satellite altimetry TEC provides an independent reference in the ocean, especially in large GNSS data gaps, and has been widely adopted to verify the reliability of GIM (Hernández-Pajares et al. 1999; Hernandez-Pajares et al. 2017; Mannucci et al. 1998; Roma-Dollase et al. 2017). The Level-2 high-precision geophysical data records (GDRs) of Jason-2 and Jason-3 are adopted to extract VTECs and validate SHAG. To obtain high-precision VTECs from Jason-2 and Jason-3 observations, the observations reflections on the ice, isolated and jump points are rejected, the observations are smoothed with a sliding window of n = 16 consecutive samples, and then, we convert the vertical phase ionospheric delay in Ku band (\(I_{ku}\), for \(f_{ku}\) = 13.575 GHz) to the VTEC of satellite altimetry (Roma-Dollase et al. 2017). Jason-2 and Jason-3 have the ground track with a 254 pass and 10-day cycle, and the distribution of IPPs and VTEC values of Jason-3 in period 143 (from December 26, 2019, to January 5, 2020) obtained by this strategy are shown in Fig. 10.

Fig. 10
figure 10

Distribution of IPPs and extracted VTEC values of Jason-3 from December 26, 2019, to January 5, 2020

After obtaining the VTEC from satellite altimetry, the VTEC from GIM is calculated by interpolating TEC maps (Schaer et al. 1998). The VTEC biases, STDs, and RMSEs of GIMs (including the SHAG without VOS) with respect to Jason-2 (from 2009 to 2019) and Jason-3 (from 2016 to 2019) were calculated using Eq. (10). The biases of GIMs with respect to Jason-2 and Jason-3 VTECs are shown in Fig. 11, and the STDs and RMSEs are summarized in Table 7.

Fig. 11
figure 11

VTEC biases of GIMs with respect to Jason-2 VTEC from 2009 to 2019 (a) and Jason-3 VTEC from 2016 to 2019 (b) on global scales (unit: TECu)

Table 7 STDs and RMSEs of GIMs with respect to Jason-2 VTECs from 2009 to 2019 and Jason-3 VTECs from 2016 to 2019 on global scales

Due to the different orbit altitudes of GNSS satellites (about 20,000 km) and Jason satellites (about 1336 km), there is a bias of a few TECu between VTECs inverted by GNSS and Jason satellites (Jee et al. 2010) and the biases are relatively large during solar active periods and small during quiet periods, which can been clearly seen in Fig. 11. Therefore, the STD, which reflects the stability of bias, is adopted to validate the reliability of GIMs. We can see that the STDs of IGS GIMs with respect to Jason-2 and Jason-3 VTECs range from 3.70 to 4.68 TECu and from 2.56 to 3.19 TECu, respectively, among which IGSG and UPCG perform better, and those of SHAG are 4.15 and 2.82 TECu, which are comparable to IGS GIMs. However, the biases of SHAG without the assistance of VOS are relatively less stable, especially during solar active periods, with the STDs of 4.66 and 3.03 TECu with respect to VTECs of Jason-2 and Jason-3, respectively, and the RMSE of SHAG with respect to Jason-2 VTEC is 4.68 TECu, which is also larger than that of SHAG with VOSs and most of IGS GIMs such as the IGSG, UPCG, WHUG, CASG and CODG.

In order to validate the reliability of SHAG in the large GNSS data gaps, we compared the GIMs in specific regions corresponding to the VOS-01, VOS-02 and VOS-03 with respect to Jason VTECs, and the biases of GIMs are shown in Fig. 12 and the STDs and RMSEs are summarized in Table 8. It can be seen that the bias of SHAG without VOS in the large GNSS data gaps is evidently larger than that of other GIMs, and the biases exceed to 20 TECu during solar active periods. The STDs of SHAG without VOS with respect to Jason-2 and Jason-3 VTECs are 6.84 and 3.77 TECu, respectively, which are significantly worse than those of SHAG with VOSs and IGS GIMs. By attaching VOSs and VOS biases in SHAG estimation, the STDs of SHAG with respect to Jason-2 and Jason-3 are reduced to 3.27 and 2.30 TECu, respectively, and the RMSEs of SHAG are also dramatically reduced, indicating that VOSs enhance the reliability of SHAG to the same level as that of IGS GIMs.

Fig. 12
figure 12

VTEC biases of GIMs with respect to Jason-2 VTECs (a) and Jason-3 VTECs (b) in the specific regions corresponding to VOS-01, VOS-02 and VOS-03 (unit: TECu)

Table 8 STDs and RMSEs of GIMs with respect to Jason-2 VTECs from 2009 to 2019 and Jason-3 VTECs from 2016 to 2019 in the specific regions corresponding to VOS-01, VOS-02 and VOS-03

5 Conclusions

For the purpose of mitigating the inaccurate VTECs in GIMs caused by the inhomogeneous distribution of GNSS stations, we proposed a method of attaching VOSs based on IRI-2016 model in global ionospheric estimation using GPS, GLONASS, Galileo and BIEDOU observations. Since there are relatively large and complicated spatiotemporal varying VTEC biases between GNSS and IRI VTECs, which makes it inappropriate to directly assimilate these two kinds of VTECs, we thoroughly analyze the characteristics of VTEC biases between GNSS and IRI-2016 in different latitudes from 2009 to 2019, and develop an improved functional and stochastic model of VTEC bias. The priori noises of IRI-2016 VTECs are set from 5 to 30 TECu depending on the geographic latitude of IPP and solar flux. The spatiotemporal varying biases, which are estimated as piecewise constants with an interval of 1 h, are attached to VOSs to mitigate the VTEC discrepancy between GNSS and IRI-2016.

The independent assessment GNSS stations, IGSG and the calculated biases between IGSG and IRI-2016 were adopted to verify the effect of attaching VOSs and biases in GIM estimation. Validation results show that:

  1. 1.

    By validating SHAG with the STECs retrieved from four independent assessment GNSS stations, it can be seen that attaching VOSs and biases with 1-h interval significantly enhances the reliability of SHAG in the large GNSS data gaps. Compared with the SHAG without VOS, the RMSEs of SHAG with VOSs are reduced by 21.65–51.61% and 27.74–53.06% during solar active and quiet periods, respectively.

  2. 2.

    Using two VTEC maps of IGSG at UTC 22:00 on April 10, 2014 (solar active), and on April 10, 2021 (solar quiet), as a reference, the SHAG without VOS has a wide range of inaccurate VTECs with the maximum biases up to 191.20 and 54.30 TECu during solar active and quiet periods, respectively. With the help of VOSs, the large wide range of inaccurate VTECs are effectively mitigated, the consistency of SHAG with respect to IGSG is significantly improved, and the RMSE is reduced from 8.07 to 3.75 TECu and from 2.72 to 1.65 TECu during solar active solar quiet periods, respectively.

  3. 3.

    In the regions of large GNSS data gaps, the estimated biases are in good agreement with the calculated VTEC biases between GNSS and IRI-2016, and The VOS bias values also show similar diurnal variation characteristics to the calculated VTEC biases. Because the VOS bias parameters absorb the VTEC biases, SHAG are closer to IGSG than IRI VTECs.

The IGS final GIMs and satellite altimetry VTECs of approximately one solar cycle (2009–2019) were used as the references to verify the long-term reliability of SHAG. Validation results indicate that:

  1. 1.

    The diurnal mean VTEC biases and RMSEs of SHAG with respect to IGSG are about − 2.39 to 0.78 TECu and 1.09–4.99 TECu, respectively, which are comparable to those of IGS GIMs on global scales. Moreover, the VTEC biases of SHAG with respect to IGSG are generally lower than that of other IGS GIMs in latitudinal bands of 30°N–0°, 0°–30°S and 30°S–60°S, which correspond to the main distribution regions of VOSs.

  2. 2.

    The STDs of IGS GIMs with respect to Jason-2 VTECs from 2009 to 2019 and Jason-3 VTECs from 2016 to 2019 range from 3.70 to 4.68 TECu and from 2.56 to 3.19 TECu, respectively, and those of SHAG are 4.15 and 2.82 TECu, respectively, which implies that the reliability of SHAG is comparable to that of IGS final GIMs. In particular, the STDs of SHAG without VOS with respect to Jason-2 and Jason-3 VTECs in the large data gaps are 6.84 and 3.77 TECu, respectively, which are evidently worse than those of IGS GIMs, and the biases exceed to 20 TECu during solar active periods. With the help of attaching VOSs and biases, the STDs of SHAG with respect to Jason-2 and Jason-3 are reduced to 3.27 and 2.30 TECu, respectively, which means that the reliability of SHAG is enhanced to the same level as that of IGS GIMs.

From the above analysis, we conclude that VOSs are capable of compensating for the inhomogeneous distribution of ground-based GNSS stations in global ionospheric modeling, and VOS biases are capable of absorbing the VTEC biases between GNSS and IRI-2016. The inaccurate VTECs in SHAG are significantly mitigated, and SHAG is in good agreement with IGS final GIMs. Furthermore, the methodology proposed in this study represents an efficient tool for automated global ionospheric modeling integrating multi-source data and is also applicable to solve the problem of abnormal results in routine estimation of ionospheric products when partial measured observations cannot be obtained on time. In the next step, we plan to integrate multi-source ionospheric observation data from the satellite altimetry, GNSS radio occultation and low earth orbit satellite into VOS to further improve the reliability of GIM in the large GNSS data gaps.