Abstract
Despite impressive progress in the development and application of electromagnetic (EM) deterministic inverse schemes to map the 3-D distribution of electrical conductivity within the Earth, there is one question which remains poorly addressed—uncertainty quantification of the recovered conductivity models. Apparently, only an inversion based on a statistical approach provides a systematic framework to quantify such uncertainties. The Metropolis–Hastings (M–H) algorithm is the most popular technique for sampling the posterior probability distribution that describes the solution of the statistical inverse problem. However, all statistical inverse schemes require an enormous amount of forward simulations and thus appear to be extremely demanding computationally, if not prohibitive, if a 3-D set up is invoked. This urges development of fast and scalable 3-D modelling codes which can run large-scale 3-D models of practical interest for fractions of a second on high-performance multi-core platforms. But, even with these codes, the challenge for M–H methods is to construct proposal functions that simultaneously provide a good approximation of the target density function while being inexpensive to be sampled. In this paper we address both of these issues. First we introduce a variant of the M–H method which uses information about the local gradient and Hessian of the penalty function. This, in particular, allows us to exploit adjoint-based machinery that has been instrumental for the fast solution of deterministic inverse problems. We explain why this modification of M–H significantly accelerates sampling of the posterior probability distribution. In addition we show how Hessian handling (inverse, square root) can be made practicable by a low-rank approximation using the Lanczos algorithm. Ultimately we discuss uncertainty analysis based on stochastic inversion results. In addition, we demonstrate how this analysis can be performed within a deterministic approach. In the second part, we summarize modern trends in the development of efficient 3-D EM forward modelling schemes with special emphasis on recent advances in the integral equation approach.
Similar content being viewed by others
References
Abubakar A, Habashy T (2013) Three-dimensional visco-acoustic modeling using a renormalized integral equation iterative solver. J Comput Phys 249:1–12
Avdeev D, Knizhnik S (2009) 3D integral equation modeling with a linear dependence on dimensions. Geophysics 74:89–94
Avdeev D, Kuvshinov A, Epova K (2002) Three-dimensional modelling of electromagnetic modelling of electromagnetic logs from inclined-horizontal wells. Phys Solid Earth 38:975–980
Avdeev D, Kuvshinov A, Pankratov O, Newman G (1997) High-performance three-dimensional electromagnetic modeling using modified Neumann series. Wide-band numerical solution and examples. J Geomagn Geoelectr 49:1519–1539
Avdeev D, Kuvshinov A, Pankratov O, Newman G (2000) 3-D EM modelling using fast integral equation approach with Krylov subspaces accelerator. In: 2nd EAGE conference and technical exhibition, vol 2, Scotland, Scotland
Avdeev D, Kuvshinov A, Pankratov O, Newman G (2002) Three-dimensional induction logging problems, part I: an integral equation solution and model comparisons. Geophysics 67(2):413–426
Bayes T (1763) An essay towards solving a problem in the doctrine of chances. Philos Trans R Soc Lond 53:370–418
Beck R, Hiptmair R, Hoppe RH, Wohlmuth B (2000) Residual based a posteriori error estimators for eddy current computation. Math Model Numer Anal 34:159–182
Bodin T, Sambridge M, Rawlinson N, Arroucau P (2012) Transdimensional tomography with unknown data noise. Geophys J Int 189:1536–1556
Börner RU (2010) Numerical modelling in geo-electromagnetics: advances and challenges. Surv Geophys 31:225–245
Brown V, Hoversten M, Key K, Chen J (2012) Resolution of reservoir scale electrical anisotropy from marine CSEM data. Water Resour Res 77(2):E147–E158
Bürg M (2000) A residual-based a posteriori error estimator for the hp-finite element method for Maxwells equations. Appl Numer Math 62:922–940
Bürg M (2013) Convergence of an automatic hp-adaptive finite element strategy for Maxwells equations. Appl Numer Math 72:188–206
Chen J, Hoversten GM, Key K, Nordquist G, Cumming W (2011) Stochastic inversion of magnetotelluric data using a sharp boundary parameterization and application to a geothermal site. Geophysics 77(4):E265–E279
Chen J, Hoversten GM, Vasco D, Rubin Y, Hou Z (2007) A Bayesian model for gas saturation estimation using marine seismic AVA and CSEM data. Geophysics 72(2):WA85–WA95
Chew W, Jin J, Lu C, Michielssen E, Song J (2014) Fast solution methods in electromagnetics. IEEE Trans Antenna Propag 45:533–543
Christensen N (1990) Optimized fast Hankel transform filters. Geophys Prospect 38:545–568
Epanomeritakis I, Akcelik V, Ghattas O, Bielak J (2008) A Newton-CG method for large-scale three-dimensional elastic full waveform seismic inversion. Inverse Probl 24:1–26
Ernst OG, Gander MJ (2011) Why it is difficult to solve Helmholtz problems with classical iterative methods. Numer Anal Multiscale Probl 83:325–361
Fainberg E, Zinger B (1980) Electromagnetic induction in a nonuniform spherical model of the Earth. Ann Geophys 36:127–134
Farquharson CG, Miensopust MP (2011) Three-dimensional finite-element modelling of magnetotelluric data with a divergence correction. J Appl Geophys 75:699–710
Fichtner A, Trampert J (2011) Hessian kernels of seismic data functionals based upon adjoint techniques. Geophys J Int 185:775–798
Frayss V, Giraud L, Gratton S, Langou J (2003) A set of GMRES routines for real and complex arithmetics on high performance computers. CERFACS technical report TR/PA/03/3
Geraskin A, Kruglyakov M, Kuvshinov A (2015) Novel robust and scalable 3-D forward solver based on contracting integral equation method and modern programming technologies. Comput Geosci (submitted)
Gilks WR, Richardson S, Spiegelhalter D (eds) (1996) Markov chain Monte Carlo in practice. Chapman and Hall, London, pp 1–485
Grandis H, Menvielle M, Roussignol M (1999) Bayesian inversion with Markov chains-I. The magnetotelluric one-dimensional case. GJI 138:757–768
Grandis H, Sumintaredja P, Irawan D (2012) A template for 1-D inversion of geo-electromagnetic data using MCMC method. EMSEV 2012, Gotemba Kogen Resort, Gotemba, Japan, October 14, 2012, pp 1–4
Grayver A, Burg M (2014) Robust and scalable 3-D geo-electromagnetic modelling approach using the finite element method. Geophys J Int. doi:10.1093/gji/ggu119
Grayver A, Kolev T (2015) Large-scale 3D geo-electromagnetic modeling using parallel adaptive high-order finite element method. Geophysics 80(6):277–291
Greenbaum A (1997) Iterative methods for solving linear systems. SIAM, Philadelphia, USA
Haber E, Ascher UM (2001) Fast finite volume simulation of 3D electromagnetic problems with highly discontinuous coefficients. SIAM J Sci Comput 22(6):1943–1961
Hastings WK (1970) Monte carlo sampling methods using Markov chains and their applications. Biometrika 57:97–109
Hiptmair R (2002) Finite elements in computational electromagnetism. Acta Numer 11:237–339
Hohmann G (1975) Three-dimensional induced polarization and electromagnetic modeling. J Geophys 40:309–324
Hursan G, Zhdanov M (2002) Contraction integral equation method in three-dimensional electromagnetic modeling. Radio Sci 37:2001R. doi:10.1029/S002513
Kaipio J, Somersalo E (2005) Statistical and computational inverse problems. Springer, New York
Kamm J, Pedersen L (2014) Inversion of airborne tensor VLF data using integral equations. Geophys J Int. doi:10.1093/gji/ggu161
Kelbert A, Kuvshinov A, Velimsky J, Koyama T, Ribaudo J, Sun J, Martinec Z, Weiss C (2014) Global 3-D electromagnetic forward modelling: a benchmark study. Geophys J Int 197:785–814
Kirk BS, Peterson JW, Stogner RH, Carey GF (2006) libmesh: a C++ library for parallel adaptive mesh refinement/coarsening simulations. Eng Comput 22:237–254
Kolmogorov AN (1956) Foundations of the theory of probability, 2nd edn. Chelsea, New York
Koyama T, Shimizu H, Utada H, Ichiki M, Ohtani E, Hae R (2006) Water content in the mantle transition zone beneath the North Pacific derived from the electrical conductivity anomaly. AGU Geophys Monogr Ser 168:171–179
Koyama T, Utada H, Avdeev D (2008) Fast and memory-saved 3-D forward modeling code for MT by using integral equation method. In: Abstract book. 19th workshop on electromagnetic induction in the Earth, China
Kuvshinov A (2008) 3-D global induction in the oceans and solid Earth: recent progress in modeling magnetic and electric fields from sources of magnetospheric, ionospheric and oceanic origin. Surv Geophys. doi:10.1007/s10712-008-9045-z
Kuvshinov AV, Utada H, Avdeev D, Koyama T (2005) 3-D modelling and analysis of Dst C-responses in the North Pacific Ocean region, revisited. Geophys J Int 160:505–526
Logg A, Wells GN (2010) Dolfin: automated finite element computing. ACM Trans Math Softw 37:20:1–20:28
Lu CC, Chew W (1994) A multilevel algorithm for solving boundary-value scattering. Microwave Opt Technol Lett 7:466–470
Mackie R, Smith J, Madden T (1994) 3-Dimensional electromagnetic modeling using finite-difference equation—the magnetotelluric example. Radio Sci 29(4):923–935
Martin J, Wilcox LC, Burstedde C, Ghattas O (2012) A stochastic Newton MCMC method for large-scale statistical inverse problems with application to seismic inversion. SIAM J Sci Comput 34(3):A1460–A1487
Metivier L, Brossier R, Virieux J, Operto S (2013) Full waveform inversion and the truncated Newton method. SIAM J Sci Comput 35:401–437
Metropolis N, Rosenbluth AW, Rosenbluth MN, Teller AH, Teller E (1953) Equation of state calculations by fast computing machines. J Chem Phys 21:1087–1092
Michielssen E, Boag A (1996) A multilevel matrix decomposition algorithm for analyzing scattering from large structures. IEEE Trans Antennas Propag 44:1086–1093
Millard X, Liu QH (2003) A fast volume integral equation solver for electromagnetic scattering from large inhomogeneous objects in planarly layered media. IEEE Trans Antennas Propag 51:2393–2401
Minsley BJ (2011) A trans-dimensional Bayesian Markov chain Monte Carlo algorithm for model assessment using frequency-domain electromagnetic data. GJI 187:252–272
Monk P (2003) Finite element methods for Maxwells equations. Oxford University Press, Oxford
Mulder W (2006) A multigrid solver for 3D electromagnetic diffusion. Geophys Prospect 54:633–649
Newman G, Alumbaugh D (2002) Three-dimensional induction logging problems, part 2: a finite-difference solution. Geophysics 61:484–491
Newman GA, Hoversten GM (2000) Solution strategies for two- and three-dimensional electromagnetic inverse problems. Inverse Probl 16:1357–1375
Nie X, Li LW, Yuan N, Yeo TS (2013) A fast integral equation solver for 3D induction well logging in formations with large conductivity contrasts. J Comput Phys 61:645–657
Pankratov O, Avdeev D, Kuvshinov A (1995) Electromagnetic field scattering in a homogeneous Earth: a solution to the forward problem. Phys Solid Earth 31:201–209
Pankratov O, Kuvshinov A (2010) General formalism for the efficient calculation of derivatives of EM frequency domain responses and derivatives of the misfit. Geophys J Int 181:229–249
Pankratov O, Kuvshinov A (2015) General formalism for the efficient calculation of the Hessian matrix of EM data misfit and Hessian-vector products based upon adjoint sources approach. Geophys J Int 200:1449–1465
Pankratov O, Kuvshinov A, Avdeev D (1997) High-performance three-dimensional electromagnetic modeling using modified Neumann series. Anisotropic case. J Geomag Geoelectr 49:1541–1547
Phillips JR, White J (1996) A precorrected-FFT method for electrostatic analysis of complicated 3-D structures. IEEE Trans Comput Aided Des Integr Circuits Syst 16:1059–1071
Prasolov VV (2004) Lobachevsky geometry. MCCME Publishing House, Moscow (in Russian)
Puzyrev V, Koldan J, de la Puente J, Houzeaux G, Vazquez M, Cela JM (2013) Efficient pre-conditioned iterative solution strategies for the electromagnetic diffusion in the earth: finite-element frequency-domain approach. Geophys J Int 193:678–693
Raiche A (1974) An integral equation approach to three-dimensional modelling. Geophys J R Astr Soc 36:363–376
Ren Z, Kalscheuer T, Greenhalgh S, Maurer H (2013) A goal-oriented adaptive finite-element approach for plane wave 3-D electromagnetic modelling. Geophys J Int 194:700–718
Rius JM, Parron J, Heldring A, Tamayo J, Ubeda E (2008) Fast iterative solution of integral equations with method of moments and matrix decomposition algorithm - singular value decomposition. IEEE Trans Anteenas Propag 56:2314–2324
Rokhlin V (1990) Rapid solution of integral equations of scattering theory in two dimensions. J. Comp. Phys. 36:414–439
Rosas-Carbajal M, Linde N, Kalscheuer T, Vrugt JA (2013) Two-dimensional probabilistic inversion of plane-wave electromagnetic data: methodology, model constraints and joint inversion with electrical resistivity data. GJI 196:1–17
Santosa F, Symes WW (1988) Computation of the Hessian for least-squares solutions of inverse problems of reflection seismology. Inverse Probl 4:211–213
Schwarzbach C, Börner RU, Spitzer K (2011) Three-dimensional adaptive higher order finite element simulation for geo-electromagnetics—a marine CSEM example. Geophys J Int 187:63–74
Singer B (1995) Method for solution of Maxwell’s equations in non-uniform media. Geophys J Int 120:590–598
Singer B (2008) Electromagnetic integral equation approach based on contraction operator and solution optimization in Krylov subspace. Geophys J Int 175:857–884
Singer B, Fainberg E (1995) Generalization of the iterative dissipative method for modeling electromagnetic fields in nonuniform media with displacement currents. J Appl Geophys 34:41–46
Singer B, Fainberg E (1997) Fast and stable method for 3D modeling of electromagnetic field. Explor Geophys 34:130–135
Smith TJ (1996) Conservative modeling of 3-D electromagnetic fields, part II: biconjugate gradient solution as an accelerator. Geophysics 61:1319–1324
Sun J, Egbert G (2012) A thin-sheet model for global electromagnetic induction. Geophys J Int 189:343–356
Sun J, Kuvshinov A (2015) Accelerating EM integral equation forward solver for global geomagnetic induction using SVD based matrix compression method. Geophys J Int 1200:1003–1009
Tarantola A (2005) Inverse problem theory and methods for model parameter estimation. Society for Industrial and Applied Mathematics, Philadelphia
Um ES, Commer M, Newman GA (2013) Efficient pre-conditioned iterative solution strategies for the electromagnetic diffusion in the earth: finite-element frequency-domain approach. Geophys J Int 193:1460–1473
Weidelt P (1975) Electromagnetic induction in three-dimensional structures. J Geophys 41:85–109
Acknowledgments
The authors would like to thank Alexander Grayver for many seminal discussions over the course of this work, and Alexey Geraskin and Alexander Grayver for their input to Sect. 11. We wish to thank William Lowrie who helped us to improve the English presentation of this paper. We extend our gratitude to Chester Weiss and an anonymous reviewer for constructive comments on the manuscript. This work has been supported by the European Space Agency through ESTEC contract No. 4000102140/10/NL/JA and in part by the Russian Foundation for Basic Research under Grant No. 13-05-12111. Oleg Pankratov acknowledges the support of ETH during his stay in Zurich as a visiting professor.
Author information
Authors and Affiliations
Corresponding author
Appendices
Appendix 1: The Misfit Renormalization
Quadratic approximation [\({ - {\textstyle \frac{1}{2}} \left( { 2{\mathbf{g}}_{k}^T{\mathbf{s}}+ {\mathbf{s}}^T{\mathtt{H}}_{{{k}}}{\mathbf{s}}}\right) }\), cf. Eq. (26)] of the misfit \({\beta _d}({\mathbf{m}})\) might not work well in the desired vicinity of the current conductivity model \({\mathbf{m}}^{(k)}\). In this case we suggest to use the following renormalization of the misfit
which tells us that the larger local uncertainty ellipsoid (explained later in this section) has more chances to catch the true minimum point as illustrated in Fig. 8.
Renormalization (72) leads to a modification in Eqs. (28), (32), (33), (58) and (37) as follows
and to corresponding equations for \(\tilde{\mathbf{F}}\) (recall that \(\tilde{\mathbf{F}}\) is a low-rank approximation for \(\mathbf{F}\), see Sect. 5).
We see that the renormalization leads to the similarity transformation of the uncertainty ellipsoid, but it does not change the Newtonian step \(\left( {-{\mathtt{H}}_{{{k}}}^{-1}{\mathbf{g}}_{k}}\right)\). We call here as uncertainty ellipsoid the following set
where \(B(\mathbf{0},1)\) is a unit sphere in \({\mathbb{R}}^{N_{\mathcal {M}}}\) centered at the origin.
Finally, the renormalization factor \(\kappa\) might be one of the parameters that are needed to keep the acceptance rate (percentage of the accepted models in the sample) to be in the desired interval; e.g., in the M–H method it is generally accepted that the rate should be between 20 and 70 %.
Appendix 2: Summary of Formulae to Calculate Data Misfit Gradient
This and follow-up appendices summarize the results presented in Pankratov and Kuvshinov (2015). But before we proceed with final formulae we introduce definitions to be used.
1.1 Green’s Operators
Let us define an operator \({\mathbf{G}}^{\cdot \cdot }{}\) in 3-D space \({\mathbb{R}}^3_{}\)
where \(\mathbf{E}\) and \(\mathbf{H}\) are electric and magnetic fields, \(\mathbf{j}^{\rm imp}\) and \(\mathbf{h}^{\rm imp}\) are impressed (extraneous) electric and magnetic sources, respectively, \(\mathbf{r}\in {\mathbb{R}}^3_{}\) is a position vector, \(i=\sqrt{-1}, {\omega }=2\pi /{\text {Period}}\) is an angular frequency, \(\sigma \left( {\mathbf{r}}\right)\) and \(\mu \left( {\mathbf{r}}\right)\) are electric conductivity and magnetic permeability distributions in an Earth’s model, respectively. In this appendix we assume that \(\sigma \left( {\mathbf{r}}\right)\) is a real-valued function. One can readily generalize the concept for complex-valued conductivity. The corresponding formulae are provided in the last appendix of Pankratov and Kuvshinov (2015). All fields, \(\mathbf{E}, \mathbf{H}, \mathbf{j}^{\rm imp}, \mathbf{h}^{\rm imp}\), are complex-valued functions of \({\omega }\) and \(\mathbf{r}\). In addition the fields \(\mathbf{E}\) and \(\mathbf{H}\) depend on \(\sigma\) and \(\mu\). We study the derivatives with respect to \(\sigma\) only. Green’s operator \(\mathbf{G}^{\cdot \cdot }\) depends on functional arguments \(\mathbf{j}^{\rm imp}\) and \(\mathbf{h}^{\rm imp}\). Hereinafter the dependence of Green’s operator on \(\sigma , \mathbf{r}\), and \({\omega }\) is omitted but implied. Time dependence of fields is accounted for by \(e^{-i{\omega }t}\), which reads, for example, for the electric field as \(\breve{\mathbf{E}}(\mathbf{r},t)=\int \mathbf{E}(\mathbf{r},{\omega })e^{-i{\omega }t}\hbox {d}\omega\). At this stage we do not specify the coordinate system in \({\mathbb{R}}^3_{}\); this means that \(\mathbf{r}\) can be, for example, a triplet of Cartesian coordinates, (x, y, z), or a triplet of spherical coordinates, \((r, \theta , \phi )\). As far as the column in the left-hand side (LHS) of Eq. (79) contains two fields, \(\mathbf{E}\) and \(\mathbf{H}\), operator \(\mathbf{G}^{\cdot \cdot }\) can be represented via operators \(\mathbf{G}^{e\cdot }, \mathbf{G}^{h\cdot } \, \mathbf{G}^{ee}, \mathbf{G}^{eh}, \mathbf{G}^{he}, \mathbf{G}^{hh}\) as follows
where operators \(\mathbf{G}^{e\cdot }\) and \(\mathbf{G}^{h\cdot }\) are electric and magnetic components of \(\mathbf{G}^{\cdot \cdot }\), operator \(\mathbf{G}^{ee}\) is a restriction of \(\mathbf{G}^{e\cdot }\) to electric sources etc.
Let us introduce an electromagnetic field, \(\mathbf{u}\), as
which is a complex-valued six-dimensional (6-D) vector. Let us denote the space of such vectors as \({\mathcal {U}}\cong {\mathbb{C}}^6\). Note that once we have chosen coordinates in 3-D space \({\mathbb{R}}^3_{}\) with the following basis
then we naturally and unambiguously have a coordinate system and basis \({\mathbf{e}}'_{1}, \cdots , {\mathbf{e}}'_{6}\) in 6-D complex space \({\mathcal {U}}\)
saying that \({\mathbf{e}}'_{1}, {\mathbf{e}}'_{2}, {\mathbf{e}}'_{3}\) are \({\mathbf{e}}_{1}, {\mathbf{e}}_{2}, {\mathbf{e}}_{3}\) for electric fields, whereas \({\mathbf{e}}'_{4}, {\mathbf{e}}'_{5}, {\mathbf{e}}'_{6}\) are \({\mathbf{e}}_{1}, {\mathbf{e}}_{2}, {\mathbf{e}}_{3}\) for magnetic fields, respectively.
1.2 Polarizations/Sources
Let
be a set of linearly independent distributions (in space and frequency) of the impressed sources, \(\mathbf{f}^{\rm imp}_p\). For example, in magnetotelluric (MT) studies, \(N_{\mathcal {P}}=2\), and \(\mathbf{f}^{\rm imp}_1\) and \(\mathbf{f}^{\rm imp}_2\) correspond to the plane waves of different orientations. Each \(\mathbf{f}^{\rm imp}_p\) produces electric, \(\mathbf{E}_p\), and magnetic, \(\mathbf{H}_p\), fields that constitute an EM field \(\mathbf{u}_p\) that can be written via \(\mathbf{G}^{\cdot \cdot }\) operator (104) as
1.3 Inversion Domain and Parameterization
As far as the inversion is usually done numerically, let the inversion domain, \(V^{\rm inv}\), be represented as
where \(\left\{ { V_l }\right\} _{{ l\in {\mathcal {M}}}}, \ \ {\mathcal {M}}= \left\{ { 1,\ldots ,N_{\mathcal {M}}}\right\}\), represent a set of elementary volumes \(V_l\), and within each volume \(V_l\) let the conductivity be a constant \(\sigma \left( {\mathbf{r}}\right) =\sigma _l\). We assemble this conductivity distribution in the following vector
and introduce model parameterization as
where function \({\mathbf{m}}= \mathbf{\nu }^{-1}({\varvec{\sigma }})\) can be implemented, for example, to preserve conductivity to be positive. Note that a popular choice is \({\mathbf{m}}=\ln {\varvec{\sigma }}\). We also remark that some volumes \(V_l\) might be cells (or combinations of cells) of the 3-D part of the model.
1.4 Observation Sites, Frequencies, Response Functions and Misfit
Let
be the experimental responses and their uncertainties, respectively, and let \(N_{{\mathcal {G}}}\) be the number of all responses. Let \(\mathbf{r}_g\), and \({\omega }_g\) be the spatial location and the frequency, respectively, at which the response \(\varPhi _g\) has been obtained.
Let \({\mathcal {S}}\) be a set of observation sites
where \({\mathbf{s}}_1,\ldots ,{\mathbf{s}}_{N_{\mathcal {S}}}\) are different observation sites, and \({N_{\mathcal {S}}}\) is the number of sites.
Let \({\varOmega }\) be a set of observation frequencies
where \(f_1,\ldots ,f_{N_{{\varOmega }}}\) are different observation frequencies, and \({N_{{\varOmega }}}\) is the number of frequencies. The definitions (89)–(91) are introduced in this specific way intentionally in order to stress the fact that in practice an actual set of experimental responses to be used for inversion varies with frequency and site.
For each \(g\in {\mathcal {G}}\), the predicted response, \(\theta _g\), can be written in the following form
Finally the misfit is introduced as
Following Pankratov and Kuvshinov (2015) we write the elements of data misfit gradient as
where an adjoint source \(\mathbf{J}^M_{p}\) is given by
where \(^*\) stands for complex conjugation. Table 2 summarizes the steps needed to calculate the misfit gradient. From the Eq. (94) it is seen that we need \(2N_{\mathcal {P}}{N_{{\varOmega }}}\) forward modellings in total to calculate the data misfit gradient.
Appendix 3: Summary of Formulae to Calculate the Hessian-Vector Products
We are interested to calculate \(\mathop {\text {Hess}}\nolimits _{{\beta _d}}\mathbf{a}_k, k=1,\ldots ,K\), where \(\mathbf{a}_k\) we represent as
where \({\displaystyle \mathbf{1}}_{\scriptscriptstyle V_l}\)(\(\mathbf{r}\)) is an indicator function given by
Following Pankratov and Kuvshinov (2015), the l-th element of the Hessian-vector product \(\mathop {\text {Hess}}\nolimits _{{\beta _d}}\mathbf{a}_k\) (which is a vector) has a form
where \({\mathcal {B}}^{A}_{kl}\) and \({\mathcal {B}}^{L}_{kl}\) are
where \(\mathbf{J}^M_{p}({\omega })\) is defined in Eq. (95) and \(\mathbf{G}^{e\cdot }\left( {\mathbf{J}^B_{p}(\mathbf{a}_k,{\omega })}\right)\) and \(\mathbf{G}^{e\cdot }\left( {\mathbf{J}^\varPsi _{p}(\mathbf{a}_k,{\omega })}\right)\) are as follows
We make here three notes.
-
Term \(\int \limits _{V_l}{{{\mathbf{E}_p}}}\cdot \mathbf{G}^{e\cdot }\left( {{\mathbf{J}^\varPsi _{p}({\displaystyle \mathbf{1}}_{\scriptscriptstyle V_k})}}\right) \hbox {d}v\) vanishes if the response \(\varPsi\) is a linear function of EM field \(\mathbf{u}\) (e.g., for most of the CSEM methods).
-
Term \(\int \limits _{V_l}\delta _{lk}\nu _l''{{\mathbf{E}_p}({\omega })}\cdot \mathbf{G}^{e\cdot }\left( {\mathbf{J}^M_{p}({\omega })}\right) \, \hbox {d}v\) vanishes if \({\varvec{\sigma }}={\mathbf{m}}\).
-
One can readily generalize the concept for complex-valued conductivity \(\sigma\). The corresponding formulae are provided in the last appendix of Pankratov and Kuvshinov (2015).
Table 3 provides a number of forward modellings needed to calculate the Hessian-vector product K times. As seen from the table, a single Hessian-vector product can be calculated for a price of \(O({N_{{\varOmega }}}{N_{\mathcal {S}}})\) forward problem runs. Moreover, if such a product is calculated multiple times the price drops down to \(2{N_{P} N_{\varOmega }}\) per product. Note also that computation of the Hessian itself is merely calculation of the Hessian-vector product for \(K=N_{\mathcal {M}}\) times with respective vectors \(\mathbf{a}_k = {\displaystyle \mathbf{1}}_{\scriptscriptstyle V_k}, ~ k=1,\ldots ,N_{\mathcal {M}}\).
Appendix 4: Contracting Integral Equation in a Nutshell
Let \(\sigma (\mathbf{r})\) be a desired 3-D model of complex-valued conductivity, including the real part term \(\gamma (\mathbf{r})=\mathop {\text {Re}}\nolimits \sigma (\mathbf{r})>0\) as well as the imaginary part term \(\eta (\mathbf{r}) = \mathop {\text {Im}}\nolimits \sigma (\mathbf{r})\) that describes displacement currents and/or induced polarization effects. Let the model be excited by electric source \(\mathbf{j}^{\rm imp}\). Let us search for the electric field excited by \(\mathbf{j}^{\rm imp}\) in the model \(\sigma\). This electric field is the electric field solution of Maxwell’s equations
Let \(\sigma _b(\mathbf{r})\) be any model where we can evaluate Green’s operator \(\mathbf{G}_{{\sigma _b}}^{ee}\) that is electric field solution of the following Maxwell’s equations
We refer to \(\sigma _b\) as a reference model, e.g., it could be a background (host) model. In further discussion we assume that \(\sigma _b\) describes one-dimensional (1-D) conductivity section, i.e. \(\sigma _b \equiv \sigma _b(z)\). With such introduced reference model, an action of operator \(\mathbf{G}_{{\sigma _b}}^{ee}\) on the field \(\mathbf{j}^{\rm imp}\) is represented by the following convolution integral
where \({\mathbb {V}}\) is the volume, occupied by \(\mathbf{j}^{\rm imp}\). Note, that analogously we can obtain “reference” magnetic field, \(\mathbf{H}_b\), via corresponding Green’s operator \(\mathbf{G}^{he}_{\sigma _b}\), namely, \(\mathbf{H}_b=\mathbf{G}^{he}_{\sigma _b}(\mathbf{j}^{\rm imp})\).
Let \(\sigma _a\) be an anomalous conductivity distribution
By trivial manipulations with Eqs. (103) and (104) one can arrive at the scattering (integral) equation with respect to \(\mathbf{E}\)
where
Note that the composition operator \(A=\mathbf{G}_{{\sigma _b}}^{ee}\circ \sigma _a\) acts on \(\mathbf{E}\) as \(A\mathbf{E}=\mathbf{G}_{{\sigma _b}}^{ee}(\sigma _a\mathbf{E})\). The solution of Eq. (107) can be written in the following form
where \({\mathbf {1}}\) is a unit operator. If operator A is contracting (which means that \(\Vert A\Vert <1)\) then \(\left( {{\mathbf {1}}- A}\right) ^{-1}\) can be represented as
and thus the solution of Eq. (109) reads as the following Neumann series
Generally, contracting properties of operator \(\mathbf{G}_{{\sigma _b}}^{ee}\circ \sigma _a\) are not known to us.
In Pankratov et al. (1995) it is shown that the energy inequality for Maxwell’s equations can be expressed as
or in alternative form
where
Here \(\gamma _b=\mathop {\text {Re}}\nolimits \sigma _b\) is the real part of the reference conductivity. Energy inequality (113) is sharp in the following sense: it turns into equality for a reference model with real-valued conductivity (\(\mathop {\text {Im}}\nolimits \sigma _b=0\)). In the presence of the imaginary part of conductivity, some part of the energy can radiate into the space, which makes (113) an inequality.
Using inequality (112) it is possible to obtain a new scattering equation with contracting operator. Let us first rewrite scattering equation (107) in the form
where \(x=\mathbf{E}\) and \(b=\mathbf{E}_b\). Let us renormalize it as
and
with some unknown multipliers \(P(\mathbf{r}), Q(\mathbf{r}), \lambda (\mathbf{r})\). Such renormalization modifies scattering equation (115) to a new scattering equation
with new linear operator
and new source term
Let us then require that new scattering operator B be expressed as a composition of \(\mathbf{K}_{{\sigma _b}}\) with some multiplication operator R
and hence we obtain
Here \(g\ne 0\) is an undefined constant, thus we can assign
as it is done in all works on CIE.
Deducing that \(\frac{1}{P}\frac{1}{1+\lambda } = \sqrt{\gamma _b}\) and substituting expressions (122)–(123) into Eqs. (116) and (120), we get the expressions for \(\mathbf{E}\) and \(\beta\) as follows
It is proven in Pankratov et al. (1995) that
for feasible conductivity distributions (\(\mathop {\text {Re}}\nolimits \sigma >0, \mathop {\text {Re}}\nolimits \sigma _b>0\)), thus together with Eq. (113) it implies that new scattering operator (121) is contracting
and the Neumann series for Eq. (118)
is always convergent. We call new scattering equation (118) the contracting integral equation (CIE). An optimum choice of the reference conductivity, as well as an estimate of condition number of CIE system operator, \({\mathbf {1}}- B\), is discussed in the next appendix.
Appendix 5: Choice of Optimal Reference Model and Estimate of Condition Number for CIE System Operator
In the previous appendix we considered always contracting series (128) to solve Maxwell’s equations (103). Now we are interested in a choice of “optimum” reference conductivity, i.e. the conductivity that delivers the fastest convergence of the series (128). Let us imagine that we want to obtain the solution with desired accuracy \(\varepsilon\), i.e.
where
Using the relation
we obtain that the number of iterations is governed by the following approximate equality
From this relation and Eq. (126) it follows that the minimum number of iterations is achieved for the reference conductivity \(\sigma _b(z)=\sigma _{\rm opt}(z)\) such that
where
Here \(\varPi\) stands for the right half plane of complex variable \(\sigma\)
and \(M(z)\subset \varPi\) is the range of values of \(\sigma (x,y,z)\) at horizontal plane \(z=Const\)
Under these assumptions, it turns out that
Now we notice that
where
is the Lobachevsky–Bolyai (Prasolov 2004) distance in the right half plane \(\varPi\). Using Lobachevsky–Bolyai geometry formalism, with some efforts, it can be shown that
where \({\mathcal {O}}(M(z))\) is the minimum-size Lobachevsky–Bolyai circle that contains the set M(z). Moreover it can be shown that
Here \(\mathop {\text {diam}}\nolimits (M)\) is the diameter of set M in the Lobachevsky–Bolyai half plane \(\varPi\)
In addition it can be proven that \(2\mathop {\text {arsinh}}\nolimits \left( {\frac{\mathop {\text {sinh}}\nolimits x}{\sqrt{3}/{2}}}\right)\) is monotonically increasing and convex upward (for \(x>0\)) function, from which the following inequality is valid
From Eqs. (140) to (143) it follows that the optimum number of iterations for the desired accuracy \(\varepsilon\) is specified by the following inequality
where
Here conductivity pair \(\sigma _1=\gamma _1+i\xi _1, \sigma _2=\gamma _2+i\xi _2\) from M(z) is the most mutually distant (in terms of Lobachevsky–Bolyai geometry) pair, i.e. a pair that delivers a maximum to expression in (145).
Using the developed formalism, it is also possible to estimate condition number \(\kappa\) of CIE system operator \({\mathbf {1}}-B\) which obeys the following inequality
Let us illustrate an application of formulae (144)–(145) for the Cole-Cole induced polarization (IP) conductivity model
with the following typically adopted parameters
Let us decompose the conductivity into real and imaginary parts as follows
and let us vary location (x, y, z) in a thin layer at depth z, thus denoting in this layer
and in the same manner for the imaginary part, thus getting that for all locations in the layer z, complex-valued conductivity values belong to the following (Euclidean) rectangle
Let us evaluate \(\tilde{E}(z)\) for the Euclidean rectangle (151) using formulae (144)–(145). Value of \(\tilde{E}(z)\) is then a maximum value of (145) for any pair of complex numbers \(\gamma _1+i\xi _1, \gamma _2+i\xi _2\) that belong to Euclidean rectangle (151) as follows
where
Next, taking into account Cole–Cole Eqs. (147)–(148) we get \(|\xi _{\max }| \sim 0.1|\gamma _{\max }|\) and thus
We see that for a high-contrasting inductive polarized z-layer the value of \(\tilde{E}(z)\) grows as the contrast of the real part of conductivity, \(K:=\frac{\gamma _{\max }}{\gamma _{\min }}\). The other consequence is that the IP contrast in Cole–Cole model plays significant role in finding the optimum model, if the contrast of the real part of conductivity is greater than 100. The latter value is a threshold: for \(K\ge 100\) we get \(\tilde{E}(z) \sim \frac{1}{20}K\), whereas for \(K\le 100\) we have \(\tilde{E}(z) \sim \frac{1}{2} \sqrt{K}\).
Final remark of this section is that if conductivity is a real-valued function (which is the most common case in EM studies), then for \(\tilde{E}(z)\) the following equality holds
where
Equation (155) means that the number of iterations and condition number for the optimal model are proportional to the square root of the maximum lateral contrast in the model. As for the optimum conductivity, \(\sigma _{\rm opt}(z)\), it is equal in this case to the conductivity of the host section outside the depths occupied by the inhomogeneities, but at depths with laterally inhomogeneous distribution of conductivity it has the form
Rights and permissions
About this article
Cite this article
Pankratov, O., Kuvshinov, A. Applied Mathematics in EM Studies with Special Emphasis on an Uncertainty Quantification and 3-D Integral Equation Modelling. Surv Geophys 37, 109–147 (2016). https://doi.org/10.1007/s10712-015-9340-4
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10712-015-9340-4