Introduction

The electrical conductivity is an important geophysical parameter connected to the thermal, chemical, and mineralogical state of the Earth’s mantle. A traditional technique to study the distribution of electrical conductivity in deep regions of the Earth is the electromagnetic induction (EMI) method. On the global planetary scale, the natural sources of electromagnetic energy with the largest potential to probe the deep Earth structure are present in the magnetosphere, the ionosphere, and in the Earth’ oceans. An excellent overview of recent advances in the area of global EMI forward and inverse modeling is given by Kuvshinov (2015).

In the last two decades, the low-orbit satellite missions dedicated to measurements of the Earth’s magnetic fields, such as CHAMP, Ørsted, SAC-C, and Swarm, have provided invaluable insight into the structure and dynamics of various physical processes from the dynamo in the Earth’s core to the magnetosphere (Stolle et al. 2018). Recently, these datasets have been extended by careful processing and calibration of platform magnetometer records from other missions, such as CryoSat-2 (Olsen et al. 2020).

In the area of global EM induction, spherically symmetric (1-D) conductivity models based on satellite data were obtained (Kuvshinov and Olsen 2006; Velímský 2010; Püthe et al. 2015; Civet et al. 2015; Grayver et al. 2017, among others). The global three-dimensional (3-D) inverse problem represents a significant challenge in terms of input data processing, efficient implementation of the forward solver, and balanced regularization. So far, only ground observatory data were inverted in terms of 3-D mantle conductivity models (Kelbert et al. 2009; Tarits and Mandéa 2010; Semenov and Kuvshinov 2012; Sun et al. 2015). The main challenge in the 3-D inversion of satellite data is the separation of the primary, inducing components, and the secondary, induced fields observed by a small number of fast moving platforms with sufficient spatial and temporal resolution. One possible path that we follow here, is to isolate the time series of external and internal spherical harmonic (SH) coefficients by the process of comprehensive inversion (Sabaka et al. 2013). These time series are provided as a Swarm mission Level 2 product by the European Space Agency. Since the rather critical assessment of the early versions of this product by Martinec et al. (2018), the situation has improved significantly (Sabaka et al. 2018, 2020). In this paper, we apply the methodology developed by Velímský (2013) and Maksimov and Velímský (2017) to the latest version of satellite-derived SH coefficients, and obtain the first 3-D mantle conductivity models with contributions from Swarm and CryoSat-2 satellite data.

Methods

Detailed descriptions and tests of the forward solver, the inverse solver, and the sensitivity calculations in the context of global 3-D EM induction problem in the time domain are presented in Velímský and Martinec (2005), Velímský (2013), and Maksimov and Velímský (2017), respectively. Our approach to the 3-D mantle conductivity inversion builds on the process of comprehensive inversion (CI). Here we provide only a short summary of the method, and refer the reader to the detailed description in the works of the original authors (Sabaka et al. 2013, 2018, 2020), and in particular in the paper by Kuvshinov et al. (2020) in this issue.

The forward problem

In the spherical Earth G of radius a with 3-D distribution of electrical conductivity \(\sigma (\varvec{r})>0\), and under magneto-quasistatic approximation, the magnetic field \(\varvec{B}(\varvec{r};t)\) is governed by the EM induction equation,

$$\begin{aligned} \nabla \times \left[ \frac{1}{\sigma (\varvec{r})}\nabla \times \varvec{B}(\varvec{r};t)\right] + \mu _0\frac{\partial \varvec{B}(\varvec{r};t)}{\partial t} = \varvec{0}, \quad \mathrm {for}\; r\le a. \end{aligned}$$
(1)

Here \(\varvec{r}=(r,\Omega )= (r,\vartheta ,\varphi )\) stands for the position vector described by the radial coordinate r, colatitude \(\vartheta\), longitude \(\varphi\), t is time, and \(\mu _0\) is the magnetic permeability of vacuum. In the surrounding insulating atmosphere, the magnetic field is fully described by a scalar magnetic potential \(U(\varvec{r};t)\), satisfying the homogeneous Laplace equation,

$$\begin{aligned} \nabla ^2 U(\varvec{r};t) = 0, \quad \mathrm {for}\; r\ge a. \end{aligned}$$
(2)

On the surface \(r=a\), continuity of the magnetic field is required,

$$\begin{aligned} \varvec{B}(r=a,\Omega ;t) = - \left. \nabla U(r,\Omega ;t)\right| _{r=a}. \end{aligned}$$
(3)

The solution of the Laplace Eq. (2) can be written in terms of SH series,

$$\begin{aligned} U(\varvec{r};t)= \sum \limits _{j=1}^{\infty }\sum \limits _{m=-j}^j \left[ G_{jm}^{(\mathrm {e})}(t)\left( \frac{r}{a}\right) ^j +G_{jm}^{(\mathrm {i})}(t)\left( \frac{a}{r}\right) ^{j+1}\right] Y_{jm}(\Omega ), \end{aligned}$$
(4)

where \(G_{jm}^{(\mathrm {e})}(t)\) and \(G_{jm}^{(\mathrm {i})}(t)\) are the SH coefficients describing, respectively, the external (primary, inducing) and internal (secondary, induced) magnetic field. Here we use fully normalized, real spherical harmonics \(Y_{jm}(\Omega )\). The spherical harmonic-finite element approach to global time-domain EM induction introduced by Velímský and Martinec (2005) uses the external boundary condition. The time series of external field coefficients \(G_{jm}^{(\mathrm {e})}(t_i)\) is prescribed in the discretized linear system of equations corresponding to formulation (1, 2, 3). The SH expansion is truncated at a finite degree \(j_{\mathrm {max}}^{(\mathrm {e})}\), and constant sampling in time is assumed,

$$\begin{aligned} t_i=i\,\Delta t,\quad i=1\dots N. \end{aligned}$$
(5)

For a given conductivity distribution \(\sigma (\varvec{r})\), the method integrates Eq. (1) using the Crank–Nicolson scheme. Besides the complete solution \(\varvec{B}(\varvec{r};t)\), it provides also the series of internal field coefficients \(G_{jm}^{(\mathrm {i})}(t_i)\), using the same time sampling, and truncated at \(j_{\mathrm {max}}^{(\mathrm {i})}\ge j_{\mathrm {max}}^{(\mathrm {e})}\).

Observatory, Swarm and CryoSat-2 data

The CI is based on fitting of a complex parameterization of individual magnetic field constituents into observatory and satellite data, including the along-track and cross-track gradients. In the first phase of the CI process, the models of the core field and its secular variations, the lithospheric field, the ionospheric, magnetospheric, and oceanic tidal fields, are constructed from quiet-time signals. The EMI process is accounted for in the parameterization of the magnetospheric fields by means of RC index (Olsen et al. 2014), using a 1-D conductivity model, and in the parameterization of ionospheric fields by a transfer matrix \({\mathbf {Q}}(\omega )\), based on a 1-D mantle conductivity with a 2-D surface layer representing the conductivity contrast between the oceans and continents (Sabaka et al. 2013). The rotation angles between the common reference frame of each spacecraft, and the reference frame of individual magnetometers are also co-estimated. The data selection is restricted to night-side, quiet-time signals, defined by the limiting values of Kp index, and rate of change of the Dst index (Sabaka et al. 2013, 2018, 2020).

For our purpose, the second phase of the CI processing is crucial. Here, the observatory and satellite data from all times, including the disturbed periods, are assembled. The observatory dataset comprises 5 or 6 years of hourly mean vector measurements of magnetic field from 161 ground observatories (Macmillan and Olsen 2013). The Swarm dataset consists of the vector magnetic field measurements from Swarm Alpha and Bravo satellites at 1-min sampling, covering the same period. Finally, the CryoSat-2 dataset includes 5 years of calibrated magnetic field vectors measured by the platform magnetometers at 1-min sampling (Olsen et al. 2020).

The core, lithospheric, and in most cases also the ionospheric signals obtained by the CI are subtracted, and the resulting residua are further down-weighted, or completely removed in the polar areas. The SH coefficients of external and internal field up to a specified degree, \(j_{\mathrm {max}}^{\mathrm {(e)}}\), \(j_{\mathrm {max}}^{\mathrm {(i)}}\), and order are then obtained by SH analysis of data in bins of selected duration \(\tau\), typically several hours. Distinct values of \(\tau\) are used for the dipole terms (1, 0), and the remaining spherical harmonics. Table 1 gives an overview of various datasets from the CI, and respective settings. The particular choices of \(j_{\mathrm {max}}^{\mathrm {(e)}}\), \(j_{\mathrm {max}}^{\mathrm {(i)}}\), and \(\tau\) are subject to a trade-off, governed by the spatio-temporal resolution of the datasets. Increasing the width of the time bin would allow to increase formally also the SH truncation degree, but the field variations on shorter time scales would then yield spurious spatial variations. Moreover, the smaller spatial scales of magnetospheric and induced fields are both reduced at the altitude of low-orbit satellites by downward, and upward continuation, respectively (Kuvshinov et al. 2020). Although the 3-D mantle conductivity inversion was carried out for all of the datasets, we limit the graphic presentation of the results to only two cases in this paper. The dataset 0513 is based on 5 years of data, including the data from the CryoSat-2 platform magnetometers. The dataset 0604 extends the length of the time series to 6 years, however, without the inclusion of CryoSat-2 data.

Three additional remarks deserve the attention of the reader. First, the external and internal field SH coefficients provided by the CI use the traditional Schmidt’s semi-normalization, and must be converted to fully normalized values using formulas (5, 6) from Velímský (2013). Second, the higher sampling rate of the dipolar coefficients is not exploited in our modeling, as the forward and inverse solvers require the same constant time step for all spherical harmonics. Therefore, the dipolar coefficients are downsampled by averaging to the same sampling rate as the remaining coefficients. The renormalization and time-averaging can be combined into a single linear operator \(\Gamma\),

$$\begin{aligned} {\mathbf {G}} = \Gamma \, {\mathbf {g}}, \end{aligned}$$
(6)

where \({\mathbf {g}}\) represents a vector arrangement of the CI-based, Schmidt semi-normalized SH coefficients in the original time sampling, and \({\mathbf {G}}\) is a similar arrangement of the fully normalized, time-averaged harmonics. Figure 1 shows the time series of the external and internal field coefficients for the 0513 dataset, after such processing.

Fig. 1
figure 1

Time series of external (top) and internal (bottom) field coefficients for CI model 0513, after normalization and time-averaging

Thirdly, the CI also provides the inverse covariance matrices \({\mathbf {D}}_{\mathrm {MMA}}\), relating the information on the uncertainties of the SH coefficients. Without going into the detailed structure of \(\Gamma\), we can use it also to obtain the inverse covariance matrix of the fully normalized time-averaged coefficients as

$${\mathbf{D}} = \left( {\Gamma {\mkern 1mu} {\mathbf{D}}_{{{\text{MMA}}}}^{{ - 1}} {\mkern 1mu} \Gamma ^{{\text{T}}} } \right)^{{ - 1}} .$$
(7)

In the synthetic tests performed in Velímský (2013) only the diagonal terms of the inverse covariance matrix were used. Here we take the next step by including the terms relating the SH coefficients of different degree and order (jm), \((j',m')\) that belong to the same time bin of length \(\tau\). The elements of \({\mathbf {D}}\) then satisfy

$$\begin{aligned} D_{jmj'm'}^{ii'} \left\{ \begin{array}{ll} \ne 0 &{} \mathrm {for}\; t_i \backslash \tau = t_{i'} \backslash \tau ,\\ = 0 &{} \mathrm {otherwise,} \end{array}\right. \end{aligned}$$
(8)

where \(\backslash\) denotes the integer division operator. Since i and \(i'\) are used as the outer indices, the inverse covariance matrix has block-diagonal structure. Figure 2 shows an example of one such block for the 0513 dataset, corresponding to time level 2018.0. Note that the diagonal terms dominate. Although the matrix is positive definite, small negative weights do appear off the diagonal.

Fig. 2
figure 2

One block of the inverse covariance matrix corresponding to time level 2018.0, CI model 0513, after normalization and time-averaging

Table 1 Overview of various datasets for which the 3-D conductivity inversion is attempted

The inverse problem

The purpose of the inverse problem is to reconstruct the conductivity distribution \(\sigma (\varvec{r})\) from the time series of external and internal field coefficients \(G_{jm}^{(\mathrm {e,obs})}(t_i)\), \(G_{jm}^{(\mathrm {i,obs})}(t_i)\) obtained by processing of observatory and satellite data; the \(\mathrm {obs}\) supersript now differentiates the observation-based time series from the predictions of the forward solver. Because of inherent non-uniqueness of the inverse problem, the class of admissible conductivity models is restricted by regularization. In our case, the inversion is defined over a finite-dimensional manifold \({\mathcal {M}}\). Since the electrical conductivity is positive-valued, a natural choice is to parameterize its (decimal) logarithm

$$\begin{aligned} - \log \frac{\sigma (\varvec{r})}{\sigma _0} = \sum \limits _{k=1}^K \sum \limits _{j=0}^{j_{\mathrm {max}}^\rho }\sum \limits _{m=-j}^j \rho _{jm}^k\xi _k(r)Y_{jm}(\Omega ), \end{aligned}$$
(9)

where \(\sigma _0= 1\; {\hbox {S}}/{\hbox {m}}\) is used to preserve correct dimensionality, and the negative sign stems from the internal representation of the model by means of resistivity, rather than conductivity. The radial discretization is defined on K layers, where \(\xi _k(r)= 1\) in the kth layer \(r_k\le r \le r_{k+1}\), and zero otherwise. The lateral resolution is governed by the truncation degree \(j_{\mathrm {max}}^\rho \le j_{\mathrm {max}}^{\mathrm {(i)}}\). The real parameters \(\rho _{jm}^k\) are arranged into a vector \({\mathbf {m}}\in {\mathcal {M}}\) with dimension \(M=(K-1)(j_{\mathrm {max}}^\rho +1)^2\). The parameterization (9) is used for all \(r < a-\delta\). In the uppermost layer of thickness \(\delta = 13\ {\hbox {km}}\), the electrical conductivity is not included in the model vector \({\mathbf{m}}\), but fixed to an a priori distribution, taking into account the conductivities of igneous rocks, seawater, continental, coastal, and ocean sediments, along the lines of Everett et al. (2003).

Following Velímský (2013), we solve the regularized inverse problem by finding the minimum of a penalty function

$$\begin{aligned} F({\mathbf {m}};\lambda ) = \chi ^2({\mathbf {m}}) + \lambda R^2({\mathbf {m}}) \end{aligned}$$
(10)

over \({\mathcal {M}}\). Here, \(\chi ^2({\mathbf {m}})\) is the data misfit, measuring the distance between observed data and prediction of the forward problem. Let \(G_{jm}^{(\mathrm {i})}({\mathbf {m}};t_i)\) be the solution of the forward problem (1, 2, 3), excited by the series of external field coefficients \(G_{jm}^{(\mathrm {e})}(t_i)=G_{jm}^{(\mathrm {e,obs})}(t_i)\), and using the conductivity model \(\sigma ({\mathbf {m}};\varvec{r})\) obtained from a particular realization of \({\mathbf {m}}\in {\mathcal {M}}\) using Eq. (9). The data misfit is then defined as

$$\begin{aligned} \chi ^2({\mathbf {m}}) = \sum \limits _{ijm}\sum \limits _{i'j'm'} N_{jj'} \left( G_{jm}^{(\mathrm {i})}({\mathbf {m}};t_i)-G_{jm}^{(\mathrm {i,obs})}(t_i)\right) D_{jmj'm'}^{ii'} \left( G_{j'm'}^{(\mathrm {i})}({\mathbf {m}};t_{i'})-G_{j'm'}^{(\mathrm {i,obs})}(t_{i'})\right) , \end{aligned}$$
(11)

where

$$\begin{aligned} N_{jj'}=\frac{\sqrt{(2j+1)\,(j+1)\,(2j'+1)\,(j'+1)}}{8\pi \,N}. \end{aligned}$$
(12)

Note that in Velímský (2013), only the simplified formula with diagonal covariance matrix was used.

The regularization function \(R^2({\mathbf {m}})\) in the present work constrains the \(L_2\) norm of the first spatial derivative of the log-conductivity over the volume \(G_\delta\), which excludes the uppermost Kth layer,

$$\begin{aligned} R^2({\mathbf {m}})= & {} \frac{1}{a} \int \limits _{G_\delta } \left| \nabla \log \frac{\sigma ({\mathbf {m}};\varvec{r})}{\sigma _0}\right| ^2\mathrm {d}V\approx \frac{1}{a}\sum \limits _{k=2}^{K-1} \sum \limits _{j=0}^{j_{\mathrm {max}}^\rho }\sum \limits _{m=-j}^j h_k \nonumber \\&\times \left[ \frac{r_k^2\left( \rho _{jm}^k-\rho _{jm}^{k-1}\right) ^2}{h_k^2} + j(j+1)\left( \rho _{jm}^k\right) ^2\right] , \end{aligned}$$
(13)

where \(h_k= r_k-r_{k-1}\). Thus, the fixed near-surface conductivity map, and its interface with the underlying mantle do not contribute to the regularization value. Other options were also explored (e.g., constraining the second derivatives of log-conductivity, or using different weighting of radial and lateral parts of the smoothing operators), but the results do not not convey any substantially different information, and are not presented here.

The penalty function \(F({\mathbf {m}};\lambda )\) defined in Eq. (10) is minimized for a series of 15 regularization coefficients \(\lambda =\left( 5,2,1,0.5,0.2,0.1,\dots ,0.0001\right)\), providing a series of models

$$\begin{aligned} {\tilde{{\mathbf {m}}}}(\lambda ) = \mathrm {arg}\min \limits _{{\mathbf {m}}\in {\mathcal {M}}} F({\mathbf {m}};\lambda ). \end{aligned}$$
(14)

The first run for \(\lambda =5\) is started from \({\mathbf {m}}={\mathbf {0}}\), corresponding to a homogeneous sphere with unit conductivity, and each subsequent run is initiated at the minimum obtained for the nearest larger \(\lambda\). The minimization algorithm is based on the limited-memory quasi-Newton method with Broyden–Fletcher–Goldfarb–Shanno formula (Press et al. 1992, Section 10.7) used for gradual updates of the approximate Hessian. The gradient of the data misfit in the model space \(\nabla _{{\mathbf {m}}}\chi ^2({\mathbf {m}})\) is evaluated using the adjoint approach (Velímský 2013; Maksimov and Velímský 2017). Finally, the data misfits \(\chi ^2({\tilde{{\mathbf {m}}}}\left( \lambda )\right)\) are plotted versus the regularizations \(R^2\left( {\tilde{{\mathbf {m}}}}(\lambda )\right)\), in the form of L-curve (Hansen 1992). The optimal regularization parameter \({\tilde{\lambda }}\) and corresponding model \({\tilde{{\mathbf {m}}}}({{\tilde{\lambda }}})\) are then chosen visually near the maximum inflection point, thus balancing the data misfit and the regularization function. Figure 3 presents the L-curves obtained for the two inversion runs described in detail below.

Fig. 3
figure 3

Trade-off plots between data misfit and regularization (the L-curves) for CUP20-OS6 (left) and CUP20-OCS5 (right) inversions, respectively. The red numbers denote the respective values of the regularization parameter \(\lambda\)

Sensitivity analysis

We use the methodology described by Maksimov and Velímský (2017) to calculate the data Hessian matrix in the final, optimally regularized model \({\tilde{{\mathbf {m}}}}({{\tilde{\lambda }}})\),

$$\begin{aligned} H_\chi = \left. \nabla _{\mathbf {m}}\nabla _{\mathbf {m}}\chi ^2({\mathbf {m}}) \right| _{{\mathbf {m}}={\tilde{{\mathbf {m}}}}({{\tilde{\lambda }}})} \end{aligned}$$
(15)

for each dataset. The efficient algorithm based on the combination of the forward, adjoint, forward scatter, and adjoint scatter problems allows the assembly of the entire matrix at the cost equivalent to \(2M+2\) forward runs. The Hessian matrix provides a useful insight into the sensitivity of data to individual model parameters. Figure 7 shows the data Hessians for the models presented in this paper, and detailed discussion is presented in the next section. Note, that in the minimum of penalty function \(F({\mathbf {m}})\), only the full Hessian matrix \(H_F\), which includes also the second derivatives of regularization term, is positive definite. The data Hessian \(H_\chi\) can have small negative values on the diagonal, but we find it more instructive for discussion of data sensitivity.

Inversion settings

The input settings used for the CUP20-OSC5 and CUP20-OS6 runs are summarized in Table 2, together with the respective diagnostics results. The radial parameterization of the conductivity model is selected closely to the previous 3-D inversions by Semenov and Kuvshinov (2012, further referenced as ETH12) and Sun et al. (2015, further referenced as OSU15). The layer interfaces are placed at the depths of 40, 250, 410, 520, 670, 900, 1200, 1600, 2400, and 2891.2 km. The lateral resolution of the forward solver is set to \(j_{\mathrm {max}}=8\), and the layers from the inverse model are further sub-discretized into 192 radial nodes in total, comprising also the near-surface layer, and the core with constant conductivity of \(2\times 10^5 {\mathrm{S}}/{\mathrm{m}}\).

Table 2 The input settings, and the diagnostics outputs (regularization parameter, data misfit, and regularization value) of the 3-D runs presented in this paper

Results

Fig. 4
figure 4

Spherically symmetric electrical conductivity profiles. Models MIN1DM_0501, CUP20-OSC5, and CUP20-OS6 are shown, respectively, with black, red, and blue thick lines. The spherically symmetric parts of other models obtained by inversions of the remaining datasets in Table 1 are shown by thin lines of various colors without further distinction

We start this section by comparison of the 1-D, spherically symmetric part of the recovered 3-D models. It is governed only by the \(\rho _{00}^k\) components of the model vector \({\mathbf {m}}\). Figure 4 shows the 1-D profiles for all 3-D inversion runs of various CI datasets. The CUP20-OSC5 and CUP20-OS6 models are marked, respectively, by thick red and blue lines. In addition, we show by thick black line the 1-D conductivity profile MIN1DM_0501, which was obtained by joint regularized inversion of satellite-derived C-responses and \(M_2\) tidal signals (Grayver et al. 2017). Although this approach uses different datasets, model parameterization and regularization, and the forward modeling is based on the integral-equation method in the frequency domain, there is remarkable agreement between the 1-D profiles below 250 km. Our models fail to resolve the low-conductivity layer below the lithosphere–asthenosphere boundary, but this is expected as the periods shorter than 6 hrs are filtered out from our input datasets. Use of alternative settings of the data processing in the CI has the largest impact on the 1-D conductivity profiles in the upper mantle. Below 900 km, the spherically symmetric parts of all models are in good agreement.

Fig. 5
figure 5

Cross-sections of electrical conductivity models CUP20-OS6, CUP20-OSC5, OSU15-j3, and ETH12-j3 in the upper mantle

Fig. 6
figure 6

Cross-sections of electrical conductivity models CUP20-OS6, CUP20-OSC5, OSU15-j3, and ETH12-j3 in the lower mantle

Figures 5, 6 compare the lateral cross-sections of CUP20-OSC5, CUP20-OS6, OSU15-j3, and ETH12-j3 models in the upper and lower mantle, respectively. In order to compare only the large-scale features of all models, we have low-pass filtered the OSU15 and ETH12 models by least-squares fitting of the SH series up to degree \(j_{\mathrm {max}}^\rho = 3\) from Eq. (9) into the grid values of log-conductivity in each layer, hence the -j3 appendage. The lateral variations in the ETH12-j3 model start only below the depth of 410 km.

We can observe some systematic features in the CUP20-OSC5, CUP20-OS6, OSU15-j3, and ETH12-j3 models throughout the transition zone of the upper mantle. All four models prefer increased conductivity below Eurasia, the southern Pacific and the Indian Ocean, and lower conductivity below the western Pacific. The largest disagreement between the models is below South America. Here both CUP20 models, and to a lesser extent also the ETH12-j3 model, infer a conductivity increase from the Pacific coast, across the continent, and further into the South Atlantic, while the OSU15-j3 model shows rather weaker lateral dependence, and in the opposite direction. The ETH12-j3 model is significantly more conductive in the lowest part of the transition zone, and shows less lateral variations.

In Fig. 6, we summarize the results of the four conductivity models for the lower mantle. Note that the color scale changes with respect to the one used in Fig. 5. However, it remains consistent across the models. In the uppermost part of the lower mantle we observe good agreement between both CUP20 models and the ETH12-j3 model in the position of the lateral conductivity heterogeneities, although in the latter case more suppressed by regularization. Increased conductivity is observed below the South America, and Eurasia, and there is also a less distinct conductive patch below the southern Pacific. This feature also occurs in the OSU15-j3 model. On the other hand, a large negative conductivity anomaly is present below both Americas only in the OSU15-j3 model.

Proceeding below the depths of 1000 km, the OSU15-j3, and the ETH12-j3 models show almost no large-scale lateral conductivity variations. However, in the case of CUP20 models, it is interesting to compare the lower mantle conductivity features with the distribution of the large low shear-wave velocity provinces (LLSVP) observed by seismic tomography (Lekic et al. 2012; Doubrovine et al. 2016; Hosseini et al. 2018). A conductive feature similar in its shape and size to the African LLSVP, although partially shifted to the east, is present in both of our models, spanning from the Eurasia to the southern Indian Ocean. A second conductive feature overlaps well with the Pacific LLSVP, although it seems to extend further below the South America, which is not observed in the tomography models. A resistive area spanning the entire depth of the lower mantle in the CUP20 models is located below the Australia. This feature is consistent with the inversion of monthly mean values from geomagnetic observatories by Tarits and Mandéa (2010, not shown here).

A quantitative interpretation of the conductivity models in terms of lower mantle temperature and composition is beyond the scope of this paper. In general, where the conductivity increase coincides with lower seismic velocities, the temperature is probably the main control parameter (Verhoeven et al. 2009). However, other effects, such as the iron spin transitions and MORB presence can influence the lower mantle conductivity in both ways (Lin et al. 2007; Ohta et al. 2010).

Fig. 7
figure 7

Hessians (second derivatives of data misfit) obtained for the final models CUP20-OS6 (top) and CUP20-OCS5 (bottom), respectively. The parameters on the horizontal and vertical axes are ordered from left to right, and top to bottom, respectively. The outer index corresponds to layers and depths, with individual layers separated by the gridlines. The inner joint degree-order index is similar to Fig. 2, with the addition of (0, 0) term at the beginning of each layer block. Only the degree is annotated

Finally, the sensitivity of the datasets used in the inversions to individual model parameters of CUP20-OS6 and CUP20-OSC5 is assessed by studying the respective Hessians in Fig. 7. In both cases, the largest sensitivities are observed below the transition zone, and slowly decreasing down to 2400 km. A second, less distinct area of large second derivatives, is present just above the transition zone, especially for the CUP20-OS6 model. However, given the 6 hrs sampling rate of the external and internal SH coefficients, we do not believe that the data has significant resolution above the transition zone. The data sensitivity to individual model parameters is also influenced by the thickness of individual layers in a non-linear way. We have tested it in several inversion runs with alternative parameterization, using layers of constant thickness of 100 km (not shown here). In those cases, the relative amplitudes of Hessian elements above the transition zone are reduced. Throughout the models, there are significant correlations and anticorrelations between the SH coefficients of the same degree and order across neighboring layers, which quickly fall off for more distant layers. Within each layer, the diagonal sensitivities dominate, and they decrease with increasing SH degree.

Conclusions and outlook

Together with the paper by Kuvshinov et al. (2020), our work represents the first attempt to obtain the 3-D image of mantle conductivity using satellite data. The small number of satellites is the main limiting factor in these studies. As we deal with the transient signals originating in the magnetosphere, and their induced counterparts, the separation of temporal and spatial variations from observations on a moving platform becomes a challenging problem with some necessary trade-offs. So far, the process of CI has been able to recover the time series of external and induced field coefficients up to degree and order 3, with time resolution starting at 6 hrs. Hence, the 3-D conductivity inversion must be focused on the lower mantle, and only large-scale, averaged features can be reconstructed from the data. The 3-D conductivity models discussed in the preceding section have recovered robust features which have only small sensitivity to the changes in model parameterizations and data-filtering settings in the CI. The inclusion of CryoSat-2 platform magnetometer data had only small influence on the inversion results, it did allow for a slightly larger reduction of the data misfit in the CUP20-OSC5 inversion. However, it is not sufficient to significantly increase the spatial resolution of the inversion. It remains an open question, whether inclusion of additional platform magnetometer data with Swarm-derived calibration will enable us to progress to SH degree 5 or beyond in the near future.

The previous global 3-D inversions used the concept of local observatory responses. Hence, more details can be recovered in densely covered areas (e.g., below Europe) at the risk of obtaining spurious variations in poorly covered areas, unless corrected by regularization. Such approach is not suitable for satellite data, where global descriptions of the inducing and induced signals, or corresponding global transfer functions are needed. A combination of both approaches is one possible way for future studies. Moreover, the small-scale induced fields in the lower mantle, carrying the information on the small-scale conductivity variations, are subject to geometric attenuation and EM shielding throughout the Earth, just like the considerably stronger small-scale fields of the geodynamo.

While the 3-D EMI sounding of the upper mantle can benefit from additional energy sources in the ionosphere (Kuvshinov and Koch 2015; Guzavina et al. 2019) and oceans (Grayver et al. 2017; Zhang et al. 2019), the choice of the EM excitation of the lower mantle is more limited. The inversion of the geomagnetic jerks (Nagao et al. 2003) is burdened by the principal ambiguity in the recovery of source field and conductivity, and similar argument can be used for the exploitation of the length-of-day variations through the electromagnetic core–mantle coupling (Holme 2000).

The EMI sounding by magnetospheric sources thus remains the only viable approach to provide a geophysical constraint on the lower mantle conductivity distribution, and thus contribute to the knowledge of the thermochemical and mineralogical state of the mantle. The main challenge for the future lies in the sparsity of observations, and data processing methods to overcome it. The 3-D forward EMI problem requires a boundary condition, external or other, to be specified everywhere on the surface for sufficiently long time series, or large periods. The source model thus needs to be interpolated in space and time/frequency from the available datapoints to fill the gaps, while suppressing incoherent contributions of non-inductive origin. Several promising data-driven approaches are being developed. One direction is the use of an a priori conductivity model in the construction of the source model. The source then can be constrained by using only the horizontal component of magnetic field which is less sensitive to lateral conductivity variations (Kuvshinov 2008; Grayver et al. 2020). The principal component analysis, as applied for ionospheric sources by Sun et al. (2015), could be also exploited here. Another path calls for a more detailed pre-processing and cleaning of satellite data in the polar areas by accounting for the polar electrojets and field-aligned currents by means of optimized model parameterization (Martinec et al. 2018). These signals, while not able to induce significant currents inside the Earth due to their geometry, are a strong source of noise at high latitudes. Finally, the physics-based models of the magnetosphere can also be in principle used to provide the excitation signals, as was demonstrated by Honkonen et al. (2018) on forward 3-D modeling of EM fields induced by large geomagnetic storms. It remains to be seen, whether the additional complexity of this approach can help to overcome the problems of purely data-derived source models.