1. Introduction

A small fraction of the Earth’s magnetic field is generated by electrical currents within the conductive layers of the ionosphere, near 110 km altitude. On geomagnetically quiet days, at mid-to-low latitudes, the ionospheric magnetic field has an amplitude of 10 to 50 nT on the ground and is referred to as the “Sq” magnetic field (see, e.g., Richmond and Thayer, 2000). It undergoes a characteristic daily variation, visible in geomagnetic observatory recordings, where the Z component increases (in absolute value) from sunrise to noon and decreases from noon to sunset. In a ±5? latitudinal band centered on the geomagnetic dip-equator, the amplitude of this daily variation is even larger and can reach up to 100 nT on the ground around noon. The ionospheric field in this band is caused by the equatorial electrojet, a thin current flowing eastward along the geomagnetic dip-equator.

Global spherical harmonic models of the Sq field have classically been determined from observatory data, relying on the global observatory network. Such models (e.g., Schmucker, 1999a, b; Takeda, 2002) provide a description of the large-scale current system generating the Sq field, as well as its seasonal and solar cycle variability. Use of magnetic measurements from low-Earth orbiting satellites such as ϕrsted and CHAMP makes it possible to model the Sq field up to higher spherical harmonic degrees, and to also model the equatorial electrojet field (Sabaka et al., 2002, 2004). However, satellite data are collected above the ionosphere. They cannot separate the primary ionospheric field from the secondary, induced field generated by electrical currents within Earth’s mantle. Such a separation is possible, however, if a pre-determined mantle conductivity model is available.

The upcoming ESA Swarm satellite mission, to be launched in 2013, will provide measurements of the Earth’s magnetic field with unprecedented precision from three identical satellites orbiting at different altitudes and local times (Friis-Christensen et al., 2006). Several research institutions have teamed up with ESA to form the Swarm Satellite Constellation Application and Research Facility (SCARF), a distributed processing facility that will produce geomagnetic field models during the Swarm mission (Olsen et al., 2013). Two of these models will be spherical harmonic descriptions of the ionospheric magnetic field at mid-to-low latitudes, i.e., below 55? dipole latitude. One of them will be calculated as part of a comprehensive inversion, where all major sources of the geomagnetic field will be calculated simultaneously (Sabaka et al., 2013); the other will be calculated through a dedicated inversion, after removing the magnetic fields generated by other sources from the data.

In the present paper, we introduce the Swarm Dedicated Ionospheric Field Inversion (DIFI) algorithm (Section 2), which will be used for calculating the dedicated ionospheric field from Swarm data. The DIFI chain takes into account the seasonal and solar cycle variability of the field. It was thoroughly tested during its development phase using synthetic data. Results of these tests are presented in Section 3. The DIFI chain is one of the three chains developed by IPGP within SCARF; other chains are the Dedicated Lithospheric Field Inversion (DLFI) chain (Thébault et al., 2013) and the Equatorial Electric Field (EEF) inversion chain (with NOAA, Alken et al., 2013).

2. DIFI Algorithm

2.1 Data pre-processing

During the exploitation phase, the DIFI algorithm will be used to calculate two types of models: ionospheric field models calculated over at least one year of data, which will include a description of the seasonal variation of model coefficients, and ionospheric field models calculated over less than one year of data (typically three to six months), with coefficients assumed constant with respect to season. The second type of model will be calculated only during the first year of the mission, to provide a first model within a few months after the commissioning phase. In what follows, we describe the algorithm to be used for the calculation of a full model, including seasonal variation, assuming one full year of level 1b magnetic data is available as input.

By default, the DIFI algorithm reads all level 1b vector magnetic data from Swarm A, one of the two satellites orbiting side-by-side at about 460 km altitude, and from Swarm C, the satellite orbiting at a higher altitude of about 530 km, in a different local time sector. Several failure cases have been investigated as part of the preparation phase within SCARF, for example the complete failure of one satellite or the lack of vector data from one of the satellites. If such an unhappy event were to occur, the DIFI algorithm would still be able to ingest reduced level 1b dataset. In what follows, we describe the nominal scenario where level 1b vector magnetic data are available from both Swarm A and C satellites.

The first block of the DIFI algorithm consists in reading all level 1b Swarm magnetic field data available during the considered time interval, selecting data at magnetically quiet times, and correcting these data for non-ionospheric fields. This block is refered to as “Data pre-processing” in Fig. 1. Level 1b magnetic data are selected using standard geomagnetic indices: Kp, Dst and the interplanetary magnetic field (IMF) By and Bz components. The minimum and maximum acceptable values for these parameters will be adjusted for each real dataset, in order to maximize spatial, local time and seasonal data coverage, while minimizing the overall magnitude of disturbance fields. Also, in order to minimize the size of the dataset to be inverted, level 1b data are decimated to one sample every 15 s prior to the inversion.

Fig. 1.
figure 1

Flowchart of the DIFI algorithm.

The next step of the pre-processing block is data correction. It aims at removing non-ionospheric contributions from the total vector field recorded by each satellite. Three main sources are considered: the core, the lithosphere and the electrical currents in the magnetosphere. By default, SCARF dedicated (Hamilton, 2013; Rother et al., 2013; Thébault et al., 2013) and/or comprehensive (Sabaka et al., 2013) models are used to remove contributions from these sources. If needed, non-SCARF, auxiliary models will be considered during the exploitation phase to improve the quality of the final model.

Observatory data (hourly mean values, see Macmillan and Olsen, 2013) can also be used by the DIFI algorithm, although not in the default mode. These data are pre-processed in a similar way: first, they are selected according to geomagnetic Kp and Dst indices, as well as IMF By and Bz components; second, they are corrected for the core, lithospheric and magnetospherie fields using the same dedicated or comprehensive SCARF models. Observatory biases, i.e., the small-scale lithospheric field not described by these models, are left as variables to be determined by the inversion.

2.2 Model parameterization

The DIFI algorithm relies on the same ionospheric field model parameterization as the comprehensive models (see, e.g., Sabaka et al., 2000, 2002, 2013). In what follows, we briefly summarize the main features of this parameterization, using similar matrix notations as Sabaka et al. (2000).

Primary sources of the ionospheric magnetic field at mid-to-low latitudes are electrical currents flowing in the E-region of the ionosphere, at about h = 110 km altitude. The time-varying magnetic field generated by these currents induces secondary currents in the upper layers of the Earth’s electrically conducting mantle, which in turn contribute to the total ionospheric magnetic field. The spherical harmonic modeling of the ionospheric magnetic field relies on the assumption that Swarm satellites fly above these ionospheric sources. As a consequence, the ionospheric magnetic field B (i.e., observations minus contributions from the core, lithosphere and magnetosphere) may be expressed as B = − ΔV, where V is a magnetic potential.

Let us introduce the following set of basis functions:

((1))
((2))

where r is the radius, θ d the dipole colatitude, ϸ d the dipole longitude, t (expressed in yrs) the season counted from January 1st, at 00:00 universal time, t m (expressed in hrs) the magnetic universal time, a the mean Earth radius (a = 6371.2 km), the Schmidt normalized associated Legendre function of degree n and order m,ω s = 2π rad/yr the fundamental angular frequency for seasonal variation, ω p = 2À/24 rad/hr the fundamental angular frequency for diurnal variation, s and p the associated wavenumbers. The magnetic universal time is defined as

((3))

where ϸ d,s is the dipole longitude of the sub-solar point (defined as the point on the Earth’s surface closest to the sun), expressed in degrees. Then the potential V may be uniquely expressed in the dipole reference frame as:

((4))
((5))

where ϵ, ι and ϵ′ are vectors of complex coefficients and ( m nsp )*, respectively, and S e and S i are the vectors of the and respectively. Here Re{z} denotes the real part of complex number z, z* denotes its conjugate and AH denotes the conjugate transpose of vector or matrix A. At ground altitude, the basis functions and e coefficients describe the primary ionospheric field, while the basis functions and ι coefficients describe the secondary (i.e., induced) ionospheric field. At satellite altitude, all sources are internal and therefore the basis functions describe the total field. The ι coefficients are the same at ground and at satellite altitude.

The induced field is related to the primary field through a transfer function, which may be expressed in matrix form as

((6))

For a 1D mantle conductivity, the Q matrix is diagonal, while for a 3D conductivity it is dense for one single frequency, and block-diagonal for multiple frequencies. The DIFI algorithm can deal with both the 1D and 3D cases. Noting that the radial component of the ionospheric field is continuous through the current sheet at r = a + h, we necessarily have (by taking the radial derivative of and at r = a + h)

((7))

where C is a diagonal real matrix with the following elements

((8))

Therefore, (4) and (5) become

((9))
((10))

It is advantageous to model the ionospheric field in the quasi-dipole (QD) coordinate system (Richmond, 1995; Emmert et al, 2010), which follows the geometry of the Earth’s main magnetic field. This is mostly because the equatorial electrojet flows along the geomagnetic dip-equator, which is bended with respect to the dipole equator in the South American sector. Using QD coordinates helps minimizing the total number of parameters to be determined. The DIFI algorithm relies on the same QD basis function as Sabaka et al. (2000),

((11))
((12))

where (θ q , ϸ q ) are the QD colatitude and longitude, h = 110 km is the altitude of the ionospheric electrical currents, and are coefficients of the matrix relating spherical harmonics in dipole and quasi-dipole coordinates:

((13))

Here Nmax and Mmax are chosen so that the convergence of the above summation is sufficient (see numerical values in the text below and Fig. 2). Equations (11) and (12) may be expressed in matrix form:

((14))
((15))

where T e and T i are the vectors of the and respectively, D is the matrix of coefficients, U e is the diagonal matrix of (a/(a +h))n−1 values, and U i is the diagonal matrix of ((a +h)/a)n+1 values. It is worth noting that the spherical harmonics in QD coordinates are not orthogonal. By construction, the and functions have an exact QD geometry only on the sphere r =a + h. By analogy with Eqs. (4)(5), the DIFI algorithm expresses the ionospheric magnetic potential in quasi-dipole coordinates as

((16))
((17))

where and are vectors of complex coefficients and ∊̃ l ksp respectively. This leads to the following constraints on the ϵ, ι and ϵ′ vector of coefficients:

((18))
((19))
((20))

Combining (18)(20) with (9)(10) then leads to

((21))
((22))
Fig. 2.
figure 2

Modulus of the complex elements of the D matrix relating dipole and quasi-dipole spherical harmonics through Eq. (13), for Kmax = 45, Lmax = 5, Nmax = 60, Mmax = 12. The color scale is in powers of 10. There are N q = 475 quasi-dipole harmonics (abscissae) and N d = 1368 dipole harmonics (ordinates).

It is also assumed that the ionospheric magnetic field responds linearly to solar activity, parameterized by the solar radio flux index F10.7 (expressed in solar flux units, or SFU, where 1 SFU = 10−22 W m−2 Hz−1). Then is replaced by , where the so-called Wolf ratio N = 14.85 × 10−3 SFU−1 was determined by Olsen et al. (1993). It is worth noting that the Wolf ratio actually varies with season (see, e.g., Penquerc’h and Chulliat, 2009). This effect will be investigated during the Swarm mission and could lead us to develop a more sophisticated parameterization of the response to solar activity in later DIFI models.

The quasi-dipole vector of coefficients in (21)(22) is truncated for 1 ≤ kKmax,− min(k, Lmax ) ≤ l ≤ min(k, Lmax ), s min ≤ s ≤ s max and p min pp max . The dipole vector of coefficients ϵ in (9)(10) (as well as (18)) is truncated for 1 ≤ nNmax, − min(n, Mmax ) ≤ m ≤ min(n, Mmax ), s min ss max and p min ≤ p ≤ p max . This leads to a total of N q = Lmax ( Lmax + 2) + (KmaxLmax )(2 Lmax + 1) quasi-dipole coefficients and dipole coefficients for each pair of wavenumbers (p, s). In the tests reported in Section 3, the ionospheric field was modelled in quasi-dipole coordinates up to degree Kmax = 45 and order Lmax = 5 (hence N q = 475 coefficients), with diurnal variations from p min = 0 to pmax = 4 (i.e., down to a period of 6 hours) and seasonal variations from s min = −2 to s max = 2 (i.e., constant, annual and semi-annual variation). These parameters are the ones set in the original SCARF specifications (Swarm Level 2 Processing System Consortium, 2013). Note that p is arbitrarily taken positive, so that modes propagate westward for l > 0 (or m > 0) and eastward for l ≤ 0 (or m ≤ 0). Unlike Sabaka et al. (2000, 2002), we do not select only modes closest to the local time modes l = p. A numerical investigation of Eq. (13) shows that Nmax = 60 and Mmax = 12 (hence N d = 1368 coefficients) are sufficient to achieve convergence of the D matrix. The modulus of the obtained matrix is shown in Fig. 2.

2.3 Inversion

The “building and solving of normal equations” block of the DIFI algorithm (Fig. 1) uses a standard iterative least squares technique to find a model solution that minimizes the following objective function:

((23))

where is γ the data vector, the data estimate vector calculated from the model, C e the data covariance matrix, λ a damping parameter and C m a damping matrix. For the purpose of the tests based upon synthetic data, we used an identity matrix for the data covariance matrix, and set the damping parameter to zero. These choices will be revised during the exploitation phase, when real data will be considered. We anticipate that the actual data covariance matrix will be diagonal, assuming that the data noise is caused by stationary and uncorrelated processes. For each data type (satellite and observatory) and each component, variances will be iteratively determined, starting from an existing Sq field model. Regarding the damping matrix, a simple, diagonal matrix will first be used, to minimize those coefficients that will be found less constrained by the data. If needed, more sophisticated damping strategies will be used, such as minimizing night-time ionospheric currents (Sabaka et al., 2002).

2.4 Output

In order to facilitate the distribution and use of the ionospheric field models produced by the Swarm SCARF, it was decided to distribute them as sets of real Gauss coefficients in dipole coordinates. The same format is to be used for both the comprehensive and dedicated chains. Specifically, the scalar potential V1 of the primary ionospheric field is expressed as:

((24))
((25))

where and are real coefficients. Note that the (resp. the are obtained by taking the real part of the product ϵHS e in Eq. (4) (resp. the product ϵ′HS i in Eq. (5)).

The scalar potential V2 of the secondary (induced) ionospheric field is expressed as in Eq. (25), but with real coefficients ; these are obtained by taking the real part of the product ιHS i in Eqs. (4)(5).

Equations (7)(8) relate the coefficients below and above the ionosphere in Eqs. (24)(25). Therefore, only coefficients and are provided in the MIO-SHA product files. The exact format of these files is given in Table 1 (and can also be found in the product specification document, Swarm Level 2 Processing System Consortium, 2013).

Table 1.
figure Tab1

General structure of a MIO_SHA product file. All notations are defined in the main text (Subsections 2.2 and 2.4), except θ NGP and ϸ NGP which stand for the colatitude and longitude of the Northern pole of the geomagnetic dipole. Each file includes two models, describing the primary ionospheric field at ground and the secondary (induced) ionospheric field, as explained in Subsection 2.4. Each model is stored as a N d × (2N sp ) block of coefficients, where N d (defined in Section 2.2) is the number of dipole coefficients for a given pair of diurnal and seasonal wavenumbers and N sp = (pmaxpmin + 1)(smaxsmin + 1) is the number of such pairs. The first two columns contain the degree n and order m, respectively; positive (resp. negative) m values indicate that the line is that of coefficients to be multiplied by cos mϸ d (resp. sin mϸ d ) in the spherical harmonic expansion. In each line, coefficients are stored in columns 3 to 2N sp + 2, by iterating on s and p, in that order.

3. Test Results

During the preparation phase, the DIFI algorithm was tested using synthetic data. The way the synthetic orbits and magnetic data were generated is described in Olsen et al. (2013). We used only one year of synthetic data, from January 1, 2000 to December 31, 2000, and relied on real Kp,Dst, IMF and F10.7 values for the selected period.

The synthetic data were selected using the following criteria: K p ≤ 2 o , −20 nT ≤ Dst ≤ 20 nT, −8 nT ≤+ IMF B y ≤ 8 nT and −2 nT ≤ IMF B z ≤ 6 nT. Starting from synthetic Level 1b 1 Hz data, the selection process lead to 583705 data triples for each satellite A and C. The data were further decimated so that the time difference between two successive data never fell below 15 s, which corresponds to a minimum distance of about 105 km (since Swarm satellites orbit at about 7 km/s).

Removal of non-ionospheric contributions were implemented in different ways, depending on the tests carried out. During the so-called “AR1 test”, data were corrected using the same reference field models as the ones used for generating the synthetic data (see Olsen et al., 2013, for details about these models). As the random noise added to the synthetic data had a very small standard deviation (between 0.1 and 0.7 nT, depending on the component) compared to the typical amplitude of ionospheric fields, the data after correction were very close to the synthetic ionospheric field data themselves. Thus, the AR1 test essentially amounted to a closed-loop simulation of both the pre-processing and inversion blocks. During the second, so-called “AR2 test”, data were corrected using core, lithospheric and magneto-spheric field models derived as part of the test by SCARF partners, thus imperfectly reproducing the contribution of each source to the synthetic data, as would be the case with real data.

For both tests, we used the same Q matrix as the one used for generating the synthetic dataset (see Olsen et al., 2013, for details). This matrix was derived from a 3D electrical conductivity model of the mantle, including oceans, designed by Kuvshinov et al. (2006). We used the D matrix already mentioned in Section 2.2 and presented in Fig. 2.

3.1 AR1 test

Data residuals statistics for the AR1 test are presented in Table 2. The means of the data residuals are close to zero; the standard deviations are of the same order of magnitude or smaller than the noise added to the synthetic data, for all three components. This suggests that the closed-loop test was successful. Still, it is important to also check the maps of data residuals for various universal times (UT) and seasons, as these could reveal very localized (in space or time) discrepancies from the data. We divided the year in four seasons and the day in 8 UT intervals, plotting a total of 32 maps for each component. A careful examination of the maps did not reveal any anomaly. An example of data residual maps for the three components is shown in Fig. 3.

Fig. 3.
figure 3

Example of data residual maps, AR1 test. The parallels of dipole latitudes ±55° are represented as thick black lines.

Table 2. Residual statistics (in nT) for AR1 and AR2 tests.

Data residuals, however, do not provide all the information needed to check the validity of the DIFI-derived model. The obtained model has to be valid at ground level as well as at satellite altitude, and its separation into primary and induced fields has to be correct. This can be checked by plotting maps of differences between the field calculated from the reference model (used to produce the synthetic data) and the DIFI-derived model. We again plotted 32 maps at different seasons and UT for each component, each type of field (primary, induced and total) and at 0 and 400 km altitude. Examples of such maps showing the radial component of the ionospheric field and associated field differences are shown in Figs. 4 and 5. Note that the color scales were calculated from the absolute minima and maxima found in the 32 maps of the same kind. As expected, differences are smallest for the total field at satellite altitude, i.e., where the model is most constrained by the data. The total field differences at satellite altitude are slightly larger than the residuals for the same season (Fig. 3), which suggests some possible effect of incomplete data coverage. For seasons having better data coverage, e.g., Spring, the total field differences are indeed smaller (less than 1 nT). Interestingly, the differences in the primary and secondary fields are of opposite signs and partially cancel out each other; this could be due to the fact that no damping was applied in this run. Also, there is a prominent four-spot structure for the radial component, indicating some slight deformation of the Northern and Southern foci of the Sq field.

Fig. 4.
figure 4

Example of ionospheric field maps, B r component, AR1 test. The parallels of dipole latitudes ±55? are represented as thick black lines.

Fig. 5.
figure 5

Example of maps showing field differences, B r component, AR1 test. The parallels of dipole latitudes ±55° are represented as thick black lines.

A more synthetic view of the model error at the Earth’s surface is provided by Fig. 6, which shows the distribution of differences at non-polar latitudes (À90 − θ d À ≤ 55°). To produce this figure, differences were calculated on a regular grid, for 8 three-hour universal time bins and 40 seasonal bins. (Although one seasonal bin per day should be used to account for differences caused by day-to-day magneto-spheric field fluctuations, this number is not critical when using synthetic data and will be adjusted during the operational phase.) For each component, the standard deviation of these differences divided by the maximum field value (from all bins) is taken as a measure of the model performance. Numerical results are provided in Table 3. It can be checked that the performances are below 2%, which is well below the threshold requirement of 10% (see table 5 in Olsen et al., 2013). These numbers can be seen as the maximum performance achievable by the current version of the DIFI chain.

Fig. 6.
figure 6

Distribution of non-polar field differences, AR1 test. B is the modulus of the field.

Table 3. Model performances (as defined in the main text) for AR1 and AR2 tests.

3.2 AR2 test

For the AR2 test, data corrections relied on the dedicated core (Rother et al., 2013) and lithospheric (Thébault et al., 2013) field models, and the magnetospheric part of the comprehensive (Sabaka et al., 2013) field model, all of them derived from the AR2 dataset. Data residuals statistics (see Table 2) show that using imperfect field models for data correction leads to significant residuals after the inversion. This is confirmed by the analysis of residuals maps; see for example Fig. 7, which was selected for the same season and UT as Fig. 3 in order to facilitate comparison.

Fig. 7.
figure 7

Example of data residual maps, AR2 test. The parallels of dipole latitudes ±55° are represented as thick black lines.

Like for the AR1 test, we plotted maps of the field calculated from the DIFI-derived model and of the differences between that field and the reference field. Examples of such maps are shown in Figs. 8 and 9. Note that the color scales were again calculated from the extrema in the 32 maps of the same kind (at different seasons and UT). The imperfect data correction leads to much larger differences between the reference model and the DIFI-derived model than those obtained in the AR1 test. The differences for the total field are antisymmetrical with respect to the magnetic dip equator and are rather uniformly distributed in local time. This pattern is also observed at other local times and seasons, but the position of extrema is not always linked to the Sq foci, unlike in the example shown in Fig. 9. This probably reflects the leakage of the core and magnetospheric fields in the ionospheric field model, since these two fields are not, or only weakly, local-time-dependent. As in the AR1 case, the differences for the primary and secondary fields are larger than for the total field, which is probably due to the absence of damping. The spatial pattern of the differences for these fields greatly varies with season and local time and is therefore difficult to interpret.

Fig. 8.
figure 8

Example of ionospheric field maps, B r component, AR2 test. The parallels of dipole latitudes ±55° are represented as thick black lines.

Fig. 9.
figure 9

Example of maps showing field differences, AR2 test. The parallels of dipole latitudes ±55° are represented as thick black lines.

The imperfect data correction also affects the overall performance of the model at the Earth’s surface (see Table 3). The performance is below or slightly above the threshold requirement of 10% (Olsen et al., 2013). Plotting the distribution of non-polar differences at the Earth’s surface (Figure 10) reveals two bumps in the B r distribution, around 4 nT, as well as some bias of a few nT in B¸ and the scalar field. The bumps are also present when plotting the output of the pre-processing block; i.e., they are not caused by the inversion. We found that they originate in the imperfect correction for the core field.

Fig. 10.
figure 10

Distribution of non-polar field differences, AR2 test. B is the modulus of the field.

4. Concluding Remarks

In the present paper, we described the algorithm of the DIFI chain to be used during the upcoming Swarm mission to calculate global, spherical harmonic models of the quiet-time ionospheric field at mid-to-low latitudes. This algorithm relies on quasi-dipole coordinates to minimize the number of parameters, while describing the smallest spatial features of the average field such as the equatorial electro-jet. It can take advantage of a 3D conductivity model of the mantle and oceans when separating the primary and induced fields.

Closed-loop tests using synthetic data showed that the DIFI algorithm is able to reproduce the original primary and induced ionospheric fields to a very high accuracy, at ground and satellite altitudes. Tests relying on other SCARF models produced from the same dataset for data correction of the core, lithospheric and magnetospheric fields revealed that the pre-processing block of the algorithm is very sensitive to the quality of the correcting models. This problem is specific to the DIFI chain, as this chain is the last in the sequence of dedicated inversion chains to be processed. As a result, all errors made by previous chains in the sequence cascade down to the DIFI input. However, the inversion block proved remarkably stable with respect to these input errors; it is possible to calculate a meaningful ionospheric field, with performances within the threshold requirement, even when starting from imperfectly corrected data. During the exploitation phase, no reference field will be available to validate the DIFI chain output. The validation will be made by comparing the total field at ground with measurements from geomagnetic observatories.