1 Introduction

Precision low-energy tests of quantum electrodynamics are not entirely pure QED tests. Even in the case of purely leptonic systems, theory requires a certain hadronic input and, in particular, an input from hadronic vacuum polarization (hVP).

There is a number of QED observables for which we need such hVP contributions. The most important one is the anomalous magnetic moment of a muon \(a_\mu \). A calculation of the leading hVP contribution to \(a_\mu \) at the level better than one percent is required for a comparison of theory and experiment (see, e.g., [1]). Other observables include the anomalous magnetic moment of an electron \(a_e\), muonium hyperfine (Mu HFS) interval \(\nu _\mathrm{Mu}\) (of the 1s state), the Lamb shift in hydrogen \(E_L(\mathrm{H})\) for various states, the most important of which is 1s, and the Lamb shift in muonic hydrogen \(E_L(\mu \mathrm{H})\) for the 2s state or, more accurately for the \(2p-2s\) interval. Contributions to the Lamb shift in other ordinary and muonic light atoms such as [\(\mu \)]D, [\(\mu \)]\(^3\)He\(^+\), and [\(\mu \)]\(^4\)He\(^+\) are also required for theoretical evaluation and interpretation of experimental data.

While the leading-order (LO) term for \(\varDelta a_\mu (\mathrm{hVP})\) has been recently accurately calculated in many papers, the LO contributions for \(\varDelta a_e (\mathrm{hVP})\) ([2, 3]) and for \(\varDelta \nu _\mathrm{Mu} (\mathrm{hVP})\) ([2]) are found in a few publications only. There are also not very accurate calculations of the related contributions to Lamb shift in H [4] and \(\mu \)H (see, e.g., [5]). A number of results on the next-to-leading-order (NLO) hVP contributions to \(a_\mu \) and LO and NLO hVP contributions to some other observables, which do not require a high accuracy, exist as well.

Even if the required accuracy of the other applications is not as high as that for \(a_\mu \), there is still a question of a consistent evaluation of all the hVP contributions once one makes a ‘grand QED evaluation’ with many observables. Such an evaluation would be helpful for a determination of values of fundamental constants (see, e.g., [6]) or for a ‘gross QED test’ to constrain new physics.

Here we consider relations between the leading hVP contributions to various QED observables. Some of the relations are exact, others are approximate. All the corrections can be presented in a generic form, since they are proportional to an integral

$$\begin{aligned} \mathcal{I}=\int {\mathrm{d}s\,} \rho (s)\,K(s) \end{aligned}$$
(1)

over a dispersion variable s. Here K(s) is a result of an integration over a QED part of the diagrams, usually expressed in terms of elementary functions, and

$$\begin{aligned} \rho (s)=\frac{R(s)}{3s} \end{aligned}$$
(2)

is a quantity experimentally determined from \(e^+e^-\)-annihilation to hadrons or extracted from similar phenomena. Accurate calculations may require interpolations or extrapolations of the existing data as well as theoretical partly model-dependent input in the area where the data are not very accurate or not available at all. Until recently the dispersion integration in (1) was considered as the only way to find LO hVP contributions, however recently lattice calculations started delivering results with comparable accuracy.

The purpose of our work here is not to find an hVP contribution by itself, say, to Mu HFS interval from scratch, but to check how a change in R(s), due to a possible update of the experimental data or corrections, could affect the existing results. We take advantage of exact and approximate relations between K’s for various observables which we derive in this paper. Such approximate relations may be not sufficient to find a contribution by itself, however for a small correction (compared to a previously obtained value) the approximate relations should be sufficient. They can be also used to consider the uncertainty due to a scatter of the data and discrepancies. One more application of the approximate relations is due to higher-order hVP contributions, for which the high accuracy is not required.

The variety of possible results for the R function is not entirely due to its successful improvement. There is a scatter of the experimental results obtained in different experiments and/or evaluated by different methods. Therefore, different existing parameterizations of the R function could be in part inconsistent.

There are two very different groups of problems in hVP calculations with the R function for various QED observables.

  • For pure leptonic cases (\(a_\mu \), \(a_e\), and \(\nu _\mathrm{Mu}\)), the hVP contribution should be taken into account. The required fractional accuracy for the leading hVP term differs from an application to application. The highest one is for \(\varDelta a_\mu (\mathrm{hVP})\). One of the problems is an internal theoretical problem—the R function, determined from different groups of data or with help of corrections being approached differently, has inconsistencies. That should increase the uncertainty. The spread is present in the calculation of \(\varDelta a_\mu (\mathrm{hVP})\) (see, e.g., [1]), but not for two other leptonic observables. For them the uncertainty due to the scatter of the results should be estimated.

    A bigger problem is a discrepancy between theory and experiment for \(a_\mu \). There are four options to resolve the discrepancy. One is ‘new physics’. Once a model of new physics is introduced to force agreement between theory and experiment for \(a_\mu \), it should be considered for the other observables. The second option is a possible problem with the experiment on \(a_\mu \) [7]. The third one is due to a possible incorrectness of the hVP calculations (questions about the required [raw] data for R(s) and corrections for measurements, say, radiative corrections). The fourth option is a problem with a contribution of hadronic light-by-light scattering (hLbL). Other possibilities are unlikely. For all the options without involvement of new physics, we can consider either corrections (one can correct various values to reach agreement or choose which of the values is a ‘correct’ one) or an increase in the related uncertainties. Note, in the case of Mu HFS interval we need not only \(\varDelta \nu _\mathrm{Mu}(\mathrm{hVP})\), but also a value of \(a_\mu \) to calculate the leading term, and a certain hLBL correction to find one of the higher-order hadronic contributions. Their treatment should be consistent with the interpretation of the \(a_\mu \) discrepancy.

    Let us consider one of the options—the \(a_\mu \) experiment is correct, while the hLbL contribution is reasonably estimated. Then, we can extract

    $$\begin{aligned} \varDelta a_\mu ^{\text {from expt}} (\text {LO hVP})=720.1(6.9)\times 10^{-10} \end{aligned}$$
    (3)

    from a comparison of theory (without the leading hVP term) [3] and experiment. Indeed, such an interpretation would affect \(\varDelta a_e ({\text {LO hVP}})\) and \(\varDelta \nu _\mathrm{Mu} ({\text {LO hVP}})\), probably increasing their values by a few percent. This example demonstrates the importance of the interpretation of the \(a_\mu \) discrepancy for other observables. The shift is definitely a few times larger than the uncertainty of any ‘direct’ theoretical calculation for \(\varDelta a_e({\text {LO hVP}})\), \(\varDelta \nu _\mathrm{Mu}({\text {LO hVP}})\) etc.

    Another alternative is to consider the scatter in determination of \(a_\mu \) (i.e., the very difference in experimental and theoretical values) as an estimation of the uncertainty in determination of \(\varDelta a_\mu ({\text {LO hVP}})\). In this case, we have to define the latter as

    $$\begin{aligned} \varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})=705(15)\times 10^{-10}\;, \end{aligned}$$
    (4)

    which would likely increase the other hVP values by a few percent (but less that for the related results ‘from experiment’) and set their uncertainty to a level of a few percent (essentially higher that for the results ‘from experiment’).

    Use of either of those two values of \(\varDelta a_\mu (\mathrm{hVP})\) for other observables should push the related hVP contributions for other observables beyond the present uncertainty of their direct calculations. A correct interpretation of any hVP contribution requires establishment of a relation between \(\varDelta a_\mu (\mathrm{hVP})\) and the hVP contribution to an observable quantity of interest.

  • Observables with hadronic nuclei, such as the Lamb shift in ordinary and muonic atoms, should be treated differently. We cannot measure the nuclear radii, but we can extract their value from the electric-charge form factors. Meanwhile, we basically measure not the electric form factor \(G_E(q^2)\), but a value \(G_E(q^2)/(1-\pi _{h}(q^2))\) (see, e.g., a discussion in [4]). Here \(\pi _{h}(q^2)\) is a certain factor related to hadronic vacuum polarization. That means that any ‘incorrect’ treatment of the hadronic VP means rather an effective redefinition of the electric form factor and the related rms charge radius. At the present level of accuracy, such a ‘redefinition’ of the charge radius affects neither a determination of \(R_\infty \), nor internal consistency of evaluation of data related to H, \(\mu \)H, D, \(\mu \)D, etc., as far as the hVP factor \((1-\pi _{h}(q^2))\) is introduced in all the observables in a consistent way and as far as all the related nuclei are characterized by the data on the elastic electron-nucleus scattering. In such a case, the hVP contributions can be completely ignored and that would not lead to any problem in the interpretation.

    However, the situation with compound nuclei is not that simple as for the hydrogen nucleus, a proton. For a proton, one has no alternative to the scattering data and the measured quantity is expressed through \(G_E(q^2)/(1-\pi _{h}(q^2))\). For heavier nuclei, starting with the deuteron, one can construct an efficient nuclear model. In this case, the model ‘speaks’ rather directly in terms of \(G(q^2)\) and the hVP correction should be added. Often the radius itself is obtained from scattering, while a model describes various higher-order nuclear-structure effects, but still should produce a reasonable value of the radius as a consistency test.

    Once the overall set of available data includes both H and D atoms and their muonic analogues, a certain consistent treatment of hVP is required.

    Numerically, the mentioned effects are of minor importance for an evaluation of the current data; however, their importance may change with increase in the accuracy and increase in the nuclear charge Z. It is also important to be in control of those small corrections to be on a safe side.

With a dispersion-integral substitution for the Euclidean momentum \(q^2\)

$$\begin{aligned} \frac{1}{q^2} \rightarrow \frac{\alpha }{\pi }\int {\frac{ \mathrm{d} s~ \rho (s)}{q^2+s}} \end{aligned}$$
(5)

one can present the result for a LO hVP contribution of the observable \(X_i\) as

$$\begin{aligned} \varDelta X_i(\mathrm{hVP})=C_i \int {\mathrm{d}s\;} \rho (s)\,K_i(s,m_i)\;. \end{aligned}$$
(6)

Here, \(C_i\) is a normalization constant (see Table 3) and \(m_i\) is a characteristic mass for the observable \(X_i\). For X being the Lamb shift for an ns state in hydrogen (or other H-like atoms) and \(a_e\), \(m_i=m_e\), while for \(a_\mu \), \(\nu _\mathrm{Mu}\) and the Lamb shift in \(\mu \)H (and other muonic atoms) \(m_i=m_\mu \). As mentioned, often a parameterization for the dispersion density \(\rho (s)\) in terms of the R function [see (2)] is utilized.

The most important property of the expression (5) is that the QED kernels \(K_i(s)\) are known [8,9,10,11] or can be relatively easily derived for the leading hVP contributions to various QED observables, while the R function can be expressed through a cross section of the \(e^+e^-\) annihilation to hadrons, which is measured (or through similar effects such as the rate of hadronic modes of \(\tau \) decay). That makes the outcome for different observables \(X_i\) correlated. Comparing various \(K_i\) we are to identify those correlations.

Fig. 1
figure 1

The LO hVP contribution to the anomalous magnetic moment of an electron (a muon)

The kernels for the leading-order hVP contributions are known. The result for the anomalous magnetic moment of a muon (\(m=m_\mu \)) or an electron (\(m=m_e\)) (see Fig. 1) is of the form [8,9,10]

$$\begin{aligned} K_\mathrm{a}(s) =- & {} \frac{\frac{s^2}{2m^4} - \frac{2s}{m^2} + 1}{\sqrt{1 - \frac{4m^2}{s}}} \ln {\frac{1 +\sqrt{1 - \frac{4m^2}{s}}}{1 - \sqrt{1 -\frac{4m^2}{s}}}}\nonumber \\+ & {} \biggl (\frac{s^2}{2m^4} - \frac{s}{m^2} \biggr ) \ln {\frac{s}{m^2}} -\frac{s}{m^2} + \frac{1}{2}\;; \end{aligned}$$
(7)

the kernel for the LO hVP contribution to the 1s Mu HFS interval (\(m=m_\mu \)) (see Fig. 2) reads [11] (see also [12])

$$\begin{aligned} K_\mathrm{Mu}(s)= & {} -\left( \frac{s}{4m^2}+2 \right) \sqrt{1-\frac{4m^2}{s}} \ln {\frac{1+\sqrt{1-\frac{4m^2}{s}}}{1-\sqrt{1-\frac{4m^2}{s}}}}\nonumber \\&~&+\left( \frac{s}{4m^2} +\frac{3}{2} \right) \ln {\frac{s}{m^2}} -\frac{1}{2}\;; \end{aligned}$$
(8)

while the result for the Lamb shift in H (\(m=m_e\)) and \(\mu \)H (\(m=m_\mu \)) (see Fig. 3) is obvious

$$\begin{aligned} K_\mathrm{L}(s)=\frac{m^2}{s}\;. \end{aligned}$$
(9)

(Here and throughout the paper we apply units in which \(\hbar =c=1\).)

Fig. 2
figure 2

The leading hadronic contribution to the muonium HFS interval

Fig. 3
figure 3

The LO hVP contribution to the Lamb shift in hydrogen and muonic hydrogen

Sometimes calculations (within a ‘big evaluation’) deal with different R functions for different observables \(X_i\) (see, e.g., [6]). (Indeed, nobody applies by themselves different R’s parameterizations within one paper, but frequently results, which have already been obtained in various original papers with different R’s for different X’s, are utilized within a single review or compilation without any reservations.) Such an evaluation is inconsistent. A consistent one would suggest the use of the same R for all the observables involved.

We intend to find the corrections to calculation of various hVP contributions. One of the purposes is to be able to correct and to make consistent the results for different X’s for the case when the original evaluation has been performed with different R’s. To do that we should be able to find what would happen with values of the hVP contributions to two observables if we [slightly] change the R function.

Table 1 LO hVP contribution to various QED observables. Here, \(\widetilde{E}_F=8/3\,(Z\alpha )^4m_r^3/(m_em_\mu )\) is the leading nonrelativistic (NR) HFS term, which includes only the Dirac’s values of the lepton magnetic moments (see Sect. 4); \(m_r\) is the reduced mass of the related two-body system (muonium \(e\mu \), hydrogen ep, muonic hydrogen \(\mu p\)). Z is the nuclear charge which is equal to unity for all the mentioned two-body atoms; however, the result may be generalized to other hydrogen-like atoms with \(Z\ne 1\) as we do for the Lamb shift in this paper

Having R(s), it is possible to find the LO hVP contribution to any observable directly. We intend to consider relations between the LO hVP contributions for two reasons. Indeed, the relations will simplify an adaptation of a result of a calculation on one observable to another. (We remind that there is quite a large number of different evaluations of LO hVP for \(a_\mu \) and a relatively few calculations for other observables. We prepare an ‘instrument’ to adjust any result for \(a_\mu \) to a calculation for another observable.) As we mentioned, a direct calculation is possible for all observables and such relations would only simplify the adaptation of direct results for \(a_\mu (\mathrm{LO}\;\mathrm{hVP})\) to other quantities. In the meantime, we also have to deal with indirect results on \(a_\mu ({\text {LO hVP}})\) which follow from a comparison of theory and experiment (see Eqs. 3 and 4)). Those indirect results are not related to any set of data on R (or related only in part). In this case, we need relations to convert a result on \(a_\mu \) to other observables. Similar situation exists with adaptation of lattice results \(a_\mu ({\text {LO hVP}})\). Those are direct results, but they are not based on a dispersion presentation (6).

With the so-called \(a_\mu \) puzzle still unresolved, there is no consensus choice of a value of \(a_\mu \). It is uncertain due to the discrepancy and the uncertainty is essentially larger than [claimed] uncertainties of individual theoretical or experimental determinations. Since we have no preferences on the resolution of the discrepancy, such an uncertainty should be considered as an uncertainty of all the involved key contributions including the one due to hVP. Once we consider \(\varDelta a_\mu (\mathrm{hVP})\) more uncertain than its individual determinations [(cf. (4)], we have to acknowledge that until a consensus on the solution of the \(a_\mu \) puzzle is reached the experimental part of the dispersion integrand for \(\varDelta a_\mu (\mathrm{hVP})\) [(cf. (6)] is compromised and an additional uncertainty should be assigned to it. That in its turn means an additional uncertainty for all calculations with R(s). We demonstrate below that certain relations between hVP contributions to different QED observables could serve this purpose. We stress that we consider an error in R(s) just as a possible solution of the problem on the same ground as the other possibilities (an experimental error in \(a_\mu \), a theoretical error in hLbL, or new physics).

Table 2 LO hVP contributions to \(a_e\) and Lamb shift in hydrogen and muonic hydrogen. The average has the same uncertainty as individual contributions since both original results are based on experimental data, a large portion of which is the same

2 Slope of the hadronic vacuum polarization \(\varPi '_h(0)\) and LO hVP contributions to various observables

As seen from Table 1, some kernels are of a simple form and in particular a few of them are \(\propto m^2/s\). Let us introduce a notation for the slope of hVP polarization at zero

$$\begin{aligned} \varPi '_h(0)=\int {\frac{ \mathrm{d} s~ \rho (s)}{s}}\;. \end{aligned}$$
(10)

The slope is given for the vacuum polarization as a function of the Euclidean momentum. The slope for the polarization as a function of the standard Minkovsky four-momentum has the opposite sign.

In the terms of the slope, we find

$$\begin{aligned} \varDelta a_e({\text {LO hVP}})= & {} \frac{1}{3} \left( \frac{\alpha }{\pi }\right) ^2 m_e^2\,\varPi '_h(0), \end{aligned}$$
(11)
$$\begin{aligned} \varDelta E_L({\text {LO hVP}}, 1s, \mathrm{H})= & {} -\frac{4\alpha (Z\alpha )^4m_r^3}{\pi }\,\varPi '_h(0),\end{aligned}$$
(12)
$$\begin{aligned} \varDelta E_L({\text {LO hVP}}, 2s, \mu \mathrm{H})= & {} -\frac{\alpha (Z\alpha )^4m_r^3}{2\pi }\,\varPi '_h(0)\;. \end{aligned}$$
(13)

The LO hVP for several other observables, such as the \(1s-2s\) frequency in positronium and its ground-state HFS interval, is also expressed in terms of \(\varPi '_h(0)\) (see Sect. A for detail). Being too small, the positronium LO hVP contributions are out of our interest here.

Results of the hVP contribution to the Lamb shift in ordinary and muonic hydrogen, published in various recent compilations, are consistent, but based on old and relatively inaccurate calculations such as [4]. Meanwhile, a few of recent and accurate evaluations of the hVP contribution to \(a_e\) have been available in the literature, which allows us to improve the Lamb shift results (see below). (We remind that our concern is not an accuracy of the calculations by itself, but their reliability (for the individual ones) and their overall consistency (for comparison of the results for different observables).)

To find a value of \(\varPi '_h(0)\), we use calculations of the hVP contribution to \(g_e-2\) in (11) The latter was recently found in a few publications [2, 3]. The results of those direct calculations of the integral over the experimental data for the \(a_e\) contribution are given in Table 2. In the same table, we give the extracted value for \(\varPi '_h(0)\) and the related results for the hVP contributions to the Lamb shift in H and \(\mu \)H.

The results from [2] to [3] are in agreement; however, they are based very much on the same data and we cannot treat them as uncorrelated. Because of that we choose their average as a central value and use their individual uncertainty as the uncertainty of the average.

Note, at this stage we discuss a result of ‘direct calculations’ only, i.e., the result performed with an explicit value of the R function and for a quantity directly related to \(\varPi '_h(0)\).

Table 3 The leading hVP contributions in the limit \(m_\mu , m_e\rightarrow 0\). Here we also include the cases where K(s) is simple and coincides with its asymptotics

We have studied above the most simple kernels from Table 1. The other are more complicated. To obtain simple relations, we may rely on their asymptotics. Even \(K_{a_e}(s)\) has a more complicated shape [see (7)] than it is given in Table 1. However, the dispersion integral startsFootnote 1 with \(s_0=(2m_\pi )^2\), which leads to a saturation of the asymptotics

$$\begin{aligned} K_\mathrm{a}(s)=\frac{m_e^2}{3s}\;, \end{aligned}$$
(14)

at \( s\gg m_e^2\), that is valid for the whole integration area.

The corrections beyond the leading term due to the further \(m_e^2/s\) expansion are below \(10^{-5}\) for the all area of the s integration, so we may practically consider the identity (14) as an exact one, since the uncertainty of R(s) is always much higher than \(10^{-5}\). For this reason, we have given the expression above as an exact one in Table 1.

A similar equation for the case of \(a_\mu \) is applicable, but as an approximate one (see Table 3). Its accuracy with actual s is roughly at 15% level for the most of the integration area (see the plot in Fig. 4). The lowest accuracy is at the lowest s, close to the \(2\pi \) threshold. The error there is about 35%. However, that area does not contribute too much to the s integral, which is related to both its central value and the uncertainty.

Fig. 4
figure 4

Kernel ratio \(r(s)=K_{a_\mu }(s)/K_{a_\mu :\,\mathrm lead}(s)\) as a function of \(\sqrt{s}\). We remind that \(m_\rho \simeq 0.78\;\)GeV, \(2m_\pi \simeq 0.28\;\)GeV

To understand the accuracy of an approximation for the kernel K(s), we have to look at the function R(s), a convolution with which we are to calculate. Roughly the behavior of the R function is presented in Fig. 5. The dominant contributions come from \(s\simeq m_\rho ^2\). Note, the \(\omega \) and \(\phi \) contributions are at \(s\approx m_\rho ^2\) from the point of view of the accuracy of the approximation with a smooth-behaving kernel K. Meanwhile \((m_\mu /m_\rho )^2\simeq 0.02\) which does not make the limit \(m_\mu \rightarrow 0\) sufficient for accurate calculations, but still makes it useful for a consideration of the correlations between different contributions.

Fig. 5
figure 5

The R-function determined from the electron-positron annihilation to hadrons as published in our paper [5]). The plot was kindly provided to us by Simon Eidelman and we are grateful to him for his permission to use it in [5] and in this paper.)

The contribution from the \(\rho \)-meson (\(m_{\rho ^0}=775.26\)(25) MeV; \(\varGamma _\rho =147.8(9)\) MeV [13]) as well as from the \(\omega (m_\omega =782.65(12)\) MeV; \(\varGamma _\omega =8.49(8)\) MeV) and \(\phi \) (\(m_\phi =1019.461(16)\) MeV; \(\varGamma _\phi =4.249(13)\) MeV) mesons, which come from the same area of s, strongly dominates in both the LO hVP contribution to \(a_\mu \) and its uncertainty (see, e.g., [1]).

Comparing the kernel \(K_{a_\mu }\) and its asymptotics (see Fig. 4), we find that except of the area essentially below the \(\rho \) meson mass, the difference between the kernel \(K_{a_\mu }\) and its asymptotics \(K_{a_\mu :\,{\mathrm{lead}}}\) can be described as

$$\begin{aligned} K_{a_\mu }(s)=(0.83\pm 0.13) K_{a_\mu :\,\mathrm{lead}}(s)\;. \end{aligned}$$
(15)

A use of \(K_{a_\mu :\,\mathrm{lead}}\) allows us to express any \(a_\mu \)\(({\text {LO hVP}})\) result in terms of \(\varPi '_h(0)\). However, the accuracy of (15) may be not entirely sufficient for the whole area of s integration. We have to specifically consider an area of low s where the approximation is not that accurate. Another area where 15% in (15) is not sufficient is related to very high s. The ratio at \(s\rightarrow \infty \) is unity; however, the essential result comes from s in a wide area around \(m_\rho ^2\) for which the unity is not achievable.

As is well known for a while, the experimental result on \(a_\mu \) [7] does not agree with theory (see [1] for detail). If any corrections to theory should be done, they are to be small. We intend to apply the relation in (15) rather for corrections to results (or differences between the results) than to the complete values. We are interested in the area below \(\sqrt{s}=2\) GeV. A reason for that is that above this value one can not only measure \(\rho (s)\), but also calculate it within pQCD [14, 15]. The results are in good agreement (see, e.g., [16, 17]). We do not expect any big corrections there.

Applying that relation to the corrections should only allow us to maintain a high accuracy of the results. Since we consider possible corrections to theory, we have to choose theoretical values which will be ‘corrected’, i.e., our starting point. There are somewhat different theoretical results, but for analyzing the shifts it is not important which one to use. We take advantage that the accurate calculations were performed in [3] on \(a_\mu ({\text {LO hVP}})\) and \(a_e({\text {LO hVP}})\) in a consistent way and utilize the related results from [3]

$$\begin{aligned} \varDelta a_\mu ({\text {LO hVP}})= & {} 689.5(3.3)\times 10^{-10}\;, \end{aligned}$$
(16)
$$\begin{aligned} \varDelta a_e({\text {LO hVP}})= & {} 1.849(11)\times 10^{-12} \end{aligned}$$
(17)

as our starting point. We remind that the latter one corresponds to \(\varPi '_h(0)\,m_e^2=1.028(6)\times 10^{-6}\) (cf. Table 2).

Denoting a shift (from the reference values) as \(\delta \), we find

$$\begin{aligned} \delta \mathcal{I}_{a_e}= & {} \bigl (1.20\pm 0.15\bigr )\left( \frac{m_e}{m_\mu }\right) ^2\delta \mathcal{I}_{a_\mu }\;,\nonumber \\ \delta \varPi '_h(0)\,m_e^2= & {} 3\,\delta \mathcal{I}_{a_e}\;. \end{aligned}$$
(18)

We intend to apply the relation (15) not to the complete contribution to \(a_\mu \), which is a convolution with R(s), but to its possible changes due to changes in R(s). Such an approximation on K(s) with focussing on area relatively close to \(m_\rho \) is valid for relatively large changes in R(s). ‘Large change’ means either a shift in \(a_\mu \) larger than or comparable with the uncertainty of \(a_\mu (\mathrm{hVP})\) or a serious reconsideration of the uncertainty by increasing or decreasing it by a factor, say, of two. Probability of all such changes should follow the value of the uncertainty of partial hVP contribution, i.e., by a product of K(s) and the [absolute] uncertainty R(s) [cf. (6)]. The area, which dominates in the uncertainty budget, is expected to be the most probable area from which the change would come. That validates the approximation. Once we consider a small change in \(a_\mu \), it may come from any area including those where the approximation in (15) is not applicable.

Fig. 6
figure 6

Direct, indirect, and lattice (with red squares) values for the LO hVP contribution to the anomalous magnetic moment of a muon. The lattice results are summarized in Table 5

Table 4 ‘Reconstruction’ of the LO hVP contributions. ‘Scatter’ stands for an evaluation based on \(\varDelta a_\mu ^{\text {from scat}}(\mathrm{hVP})\), while ‘experiment’ is for an evaluation based on \(\varDelta a_\mu ^{\text {from expt}} (\mathrm{hVP})\). A direct value for comparison of the final results is based on our average in Table 2. For technical evaluations, we have to start with a direct value for which we know \(\varDelta a_\mu ({\text {LO hVP}})\), which is based on results of [3]

3 The \(a_\mu \) discrepancy and its consequences for various LO hVP

The main motivation to obtain the approximate relations above is the discrepancy in determination of \(a_\mu \). One can interpret them as a discrepancy between a theoretical result, where the uncertainty comes from hadronic effects, and an experimental one [7], which may be considered as an indirect determination of the hadronic contribution.

Theoretical predictions for the LO hVP contribution to the anomalous magnetic moment of a muon have a certain scatter (see, e.g., [1, 3, 18]). The scatter is essentially smaller than the discrepancy of theory and experiment and we ignore it here. As a theoretical value, we use the result in (16) obtained in [3], which is chosen by us because in that publication a result on \(\varDelta a_e\)(LO hVP) is also presented. As mentioned above, a scatter of theoretical results on \(\varDelta a_\mu ({\text {LO hVP}})\) is not our concern in this paper, so we can start with any theoretical result on \(\varDelta a_\mu ({\text {LO hVP}})\).

In principle, the hadronic NLO includes an uncertainty from the hadronic light-by-light scattering (hLbL) contribution. In this sense, we can consider the result

$$\begin{aligned} \varDelta a_\mu ^\mathrm{from~expt} ({\text {LO hVP}})=720.1(6.9)\times 10^{-10}\;, \end{aligned}$$

as one extracted from experiment [7] assuming that hLbL is calculated correctly. (To extract \(\varDelta a_\mu ^\mathrm{from}\,\mathrm{expt}\)\( ({\text {LO hVP}})\) we utilize the QED, NLO hVP, NNLO hVP, and hLbL-scattering contributions as given in [3]).

The most realistic value, however, is neither theoretical nor experimental, but (4)

$$\begin{aligned} \varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})=705(15)\times 10^{-10}\;, \end{aligned}$$

found from scatter of the theoretical and experimental results (see Fig. 6). In the evaluations below, we give values of various hVP contributions, related to direct theoretical calculations; based on \(\varDelta a_\mu ^{\text {from expt}} ({\text {LO hVP}})\), extracted from experiment, and the one due to the scatter of the data. The latter is the estimation recommended in this paper.

On base of the approximate relations between the LO hVP contributions to \(a_\mu \) and \(a_e\) [see (15)] and exact relations between the LO hVP contributions to \(a_e\), \(\varPi '_h(0)\), and to the Lamb shift in H and \(\mu \)H [see (10), (11), (12), (13), and Table 2], we find the results for the LO hVP contribution to various observables as summarized in Table 4 and Fig. 7.

Fig. 7
figure 7

Direct (a from [2], b from [3], and average from Table 2), indirect (from experiment and from scatter of theory and experiment), and lattice (from [19]) values for the LO hVP contribution to the anomalous magnetic moment of an electron and to \(\varPi _h'(0)\). The lattice value is discussed below

Note that there is no consensus on the theoretical value. Results from scan over s in \(e^+e^-\) annihilation are currently less accurate (around the \(\rho \)-meson mass etc.). The data on \(\tau \)-decay should be corrected due to violation of the isotopic invariance (introduction of which in [20] explained in the principle then discrepancy between decay data and annihilation by a \(\rho ^{(0)}-\gamma \) mixing). However, there exist a few different parameterizations for such a correction, which lead to somewhat controversial results [21,22,23]. Overall analysis sometimes utilizes \(\tau \)-decay data (see, e.g., [3, 23]), while in other cases it does not (see, e.g., [18, 24]).

Use of recent and more accurate initial-state-radiation (ISR) data shows a certain discrepancy between different experiments (see, e.g., ISR data for the \(\pi ^+\pi ^-\) channel from KLOE [25,26,27], BaBar [28], and BESIII [29] and their comparison in [18]). Nevertheless, the scatter of theoretical data is essentially smaller than the discrepancy of theory with the experiment. Currently, it is not that important which theoretical value is used as input as far as eventually we plan to calculate the value of \(\varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})\).

In our analysis, in this paper we are limited to publications which consider the LO hVP contribution to [at least] two observables. One of such papers [3] presents results on the contributions to \(a_\mu \) and \(a_e\) while applying \(\tau \)-decay data, while the other paper [2], important for our consideration, gives results for \(a_e\) and Mu HFS interval ignoring \(\tau \) data. Consideration of both options made our analysis a representative one.

One has also to mention the result for LO hVP from lattice calculations. Until recently their accuracy was not comparable with that of theory and experiment. Significant progress in the field has produced a situation when the uncertainty of the lattice results is comparable with the uncertainty of \(\varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})\).

Table 5 Lattice results on \(\varDelta a_\mu ({\text {LO hVP}})\). They are to be compared with \(\varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})=705(15)\times 10^{-10}\) from (4)

The lattice results are summarized in Table 5. The results are obtained by independent groups but using somewhat similar methods. Below we do not apply the lattice results on \(\varDelta a_\mu ({\text {LO hVP}})\) to obtain the LO hVP contribution to other variables, but consider them as a confirmation that \(\varDelta a_\mu ^{\text {from scat}}({\text {LO hVP}})\), which we utilize here, is a reasonable estimation. Once the lattice results on \(a_\mu \) will reach a somewhat higher accuracy, they may be applied to other QED observables along with the technique that was developed in this paper.

There is also a lattice calculation for \(\varDelta a_e({\text {LO hVP}})\). The result [19] of \(1.89(6)\times 10^{-12}\) is in perfect agreement with a result from scatter in Table 4 (see also Fig. 7).

4 LO hVP contribution to muonium HFS interval

Tables 1 and 3 on the LO hVP contributions contain results for a number of observables considered above in detail. One of important QED observables has not yet been discussed. This remaining case is considered here. It is for the muonium 1s HFS interval. Prior a discussion of the LO hVP term, we have to mention the leading NR contribution to the HFS interval, which is called the Fermi energy. It has several slightly different definitions, such as

$$\begin{aligned} {E}_F= & {} \frac{8}{3}\,(Z\alpha )^4\frac{m_r^3}{m_em_\mu }\left( 1+a_e\right) \left( 1+a_\mu \right) \;, \end{aligned}$$
(19)
$$\begin{aligned} \widetilde{E}_F= & {} \frac{8}{3}\,(Z\alpha )^4\frac{m_r^3}{m_em_\mu }\;, \end{aligned}$$
(20)

where the upper version of the Fermi energy is a result of NR interaction of two full magnetic moments of an electron and a muon, while the one in the bottom is with their Dirac’s values. It is convenient to consider the former as the leading term, while expressing the recoil effect (including hVP which is one of them) in units related to the latter. (That is not important for LO hVP, but a difference in definitions of Fermi energy as a prefactor for hVP affects definition of NLO hVP.)

Indeed, the anomalous magnetic moments of leptons have their hVP contributions of order \(\alpha ^2\) and the related contributions to the muonium HFS interval would be \(\alpha ^2(Z\alpha )^4m_e\) with various suppression of the mass factors. However, once the leading NR term, \(E_F\), is calculated with using the actual values of \(a_e, a_\mu \), those contributions are included in \(E_F\) and we are interested in finding the hVP contributions beyond them.

Fig. 8
figure 8

Kernel ratios \(r=K_\mathrm{Mu}/K_\mathrm{Mu: lead}\) (red) and \(r_0=K_\mathrm{Mu}/K_\mathrm{Mu:\,0}\) (blue) as a function of \(\sqrt{s}\). The related asymptotics are defined in (21) and (22). We remind that \(m_\rho \simeq 0.78\;\)GeV, \(2m_\pi \simeq 0.28\;\)Gev

The related LO hVP contribution to the muonium 1s HFS interval is given in terms of (6) and (1) in Table 1 [see also (8)] [11]. It is of order \(\alpha (Z\alpha )^5m_e\) with a characteristic suppression mass factor of \((m_e/m_\rho )^2\). The related asymptotics at \(s\gg m\) is of the form

$$\begin{aligned} K_\mathrm{Mu: lead}(s)=\frac{m_\mu ^2}{s}\,\left[ \frac{9}{2}\ln \frac{s}{m_\mu ^2}+\frac{15}{4}\right] \;. \end{aligned}$$
(21)

The results at the limit of \(m_i\rightarrow 0\) are summarized in Table 3 and one can note that the asymptotics in (21) has a logarithmically enhanced term in contrast to all the other asymptotics considered above [cf. (14) and (9)]. That does not allow us to easily relate the results for the hVP contributions to \(a_\mu , a_e\) and to the Lamb shift with that for the Mu HFS interval. To find such relations, we need an additional simplifying approximation.

To make such, we note that the dominant contribution comes from an area around \(s=m_\rho ^2\) (see Fig. 5). Let us introduce

$$\begin{aligned} K_\mathrm{Mu:\,0}(s)=\frac{m_\mu ^2}{s}\,\left[ \frac{9}{2}\ln \frac{m_\rho ^2}{m_\mu ^2}+\frac{15}{4}\right] \;. \end{aligned}$$
(22)
Table 6 HVP integrals \(\mathcal{I}\) for \(a_\mu , a_e\) and Mu HFS interval. Here, \(\mathcal{I}_{a_\mu }^{(0)}=({m_\mu }/{m_e})^2\,\mathcal{I}_{a_e}\) is the integral related to \(K_\mathrm{a:lead}(m_\mu ,s)\). Values in roman follow the original calculations in [2, 3] or experimental result in [7]; the italic ones are from combinations of original data including the evaluation in this paper

The expression is less accurate as an approximation compared to the one in (21); however, it is easily related to the asymptotics of other kernels [cf. (14) and (9)]. As to the error of the approximation, it is plotted in Fig. 8. Near the \(2\pi \) threshold the error is about 30–35% because of higher-order terms of \(m^2/s\) expansion. At \(\sqrt{s}=2\;\)GeV the error also reaches that level because of the substitution of

$$\begin{aligned} \ln {s}\rightarrow \ln {m_\rho ^2}\;. \end{aligned}$$

The contribution from the area above \(\sqrt{s}=2\;\)GeV to the Mu HFS interval is not very large (about 20%) and the R function is well described by a step function obtained from perturbative QCD [14, 15].

The latter circumstance makes reliable the high-s end of the s integration in the dispersion integral, because of consistency between experimental data and perturbative QCD theory. Due to that, we expect that the major shift in R(s) (if any) may happen between the \(2\pi \)-threshold and, say, \(\sqrt{s}=2\) GeV. That allows us to write for the difference between a result of a direct calculation and one with a true R(s) function, based on certain value of \(a_\mu \) (which may be obtained from experiment, estimated from scatter of experiment and theory, or taken from a lattice calculation), as an indirect estimation, the following

$$\begin{aligned} \delta \mathcal{I}_\mathrm{Mu} =\left[ \frac{27}{2}\ln \frac{m_\rho ^2}{m_\mu ^2}+\frac{45}{4}\right] \left( 1.1\pm 0.4\right) \delta \mathcal{I}_{a_\mu } \,.~~~ \end{aligned}$$
(23)

We remind that we are interested in relatively large changes in \(a_\mu \) (see discussion above).

As ‘reference’ points (to calculate shift \(\delta \mathcal{I}_\mathrm{Mu}\)) we consider the results from [2] to [3]. In [2] they found \(a_e({\text {LO hVP}})\) and \(\nu _\mathrm{Mu}({\text {LO hVP}})\), which allows one to relate a result on \(\mathcal{I}_{a_e}\) to the corrected value of \(\mathcal{I}_\mathrm{Mu}\). Since (23) sets a relation between shifts in \(\mathcal{I}_{a_\mu }\) and \(\mathcal{I}_\mathrm{Mu}\), we have to find first a shift in \(\mathcal{I}_{a_\mu }\) from a known shift in \(\mathcal{I}_{a_e}\). The results of [3], which include \(\varDelta a_e({\text {LO hVP}})\) and \(\varDelta a_\mu ({\text {LO hVP}})\), are suitable for that (see the previous section for details). In terms of the \(\mathcal{I}\) integrals, the results of our evaluations are summarized in Table 6.

The results due to the \(a_\mu \)-discrepancy are

$$\begin{aligned} \varDelta \nu _\mathrm{Mu}({\text {LO hVP}})= & {} 240(4)\;\mathrm{Hz}~~~~~\text {for expt}\;,\nonumber \\ \varDelta \nu _\mathrm{Mu}({\text {LO hVP}})= & {} 236(5)\;\mathrm{Hz}~~~~~\text {for scatter}\;. \end{aligned}$$
(24)

5 NLO hVP contribution to the Lamb shift in ordinary and muonic hydrogen

The NLO hVP contributions to a number of QED observables are known, but not to all of them. In particular, the NLO contribution for \(a_\mu \) is known [23, 37] (see also, e.g., [1]) as well as the ones for \(a_e\) (see, e.g., [2, 3, 37]) and Mu HFS interval [38, 39]. (For some variables such as \(a_\mu , a_e\) even NNLO hVP is known [40]; for \(a_\mu , a_e\), and Mu hfs interval, the hLbL contributions are of the same orders as NLO hVP and needed for a complete NLO hadronic term (see, e.g., [1, 3, 23, 41, 42] for detail).)

Here we consider the NLO hVP contribution to the Lamb shift in ordinary and muonic hydrogen. NLO in H and \(\mu \)H includes a term, which is due to the two-loop vertex with one of those two loops being hVP (see Fig. 9). It is of relative order \(\alpha \) compared to the related LO hVP term. In the case of the Lamb shift of an ns state in hydrogen that is a complete NLO correction.

That NLO term is of order \(\alpha ^2(Z\alpha )^4m(m/m_\rho )^2\). It comes from a slope of the Dirac form factor \(F_1'(0)\) at zero momentum transfer and from the value of the Pauli form factor \(F_2(0)\) at zero momentum due to the mentioned two-loop vertex, depicted in Fig. 9. The latter, \(F_2(0)\), is the anomalous magnetic moment of the orbiting lepton. This contribution relates to LO hVP to the related anomalous magnetic moment and it is indeed known. However, since two contributions to the Lamb shift of an s state (due to \(F_1'(0)\) and \(F_2(0)\) ), which are technically very similar, we consider both of them as a single corrections within the same formalism (see below).

There is also a contribution to the Lamb shift at \(l\ne 0\) in ordinary and muonic hydrogen of the same order as the mentioned NLO hVP contribution for the s states. It is completely described by the anomalous magnetic moment in the same matter as the leading QED term for \(l\ne 0\), once we deal with a complete value of the anomalous magnetic moment of the orbiting lepton \(a_L\)

$$\begin{aligned} \varDelta E_L^{(a)}(nl)=a_L \frac{(Z\alpha )^4m_r^2}{m}\frac{j(j+1)-l(l+1)-3/4}{l(l+1)(2l+1)n^3}.~~~ \end{aligned}$$
(25)

To find the LO hVP contribution to the Lamb shift of \(l\ne 0\) states, which is of order \(\alpha ^2(Z\alpha )^4m(m/m_\rho )^2\), i.e., of the same order as the NLO hVP contribution for s states, it is sufficient to utilize the LO hVP contribution of the anomalous magnetic moment, \(\varDelta a_L({\text {LO hVP}})\), into the identity above. Since this contribution is known (see above), we do not consider it in this paper and focus on the Lamb shift in ns states.

Fig. 9
figure 9

The NLO hVP contribution to the Lamb shift in H and \(\mu \)H due to the two-loop vertex

The integral presentation (in terms of s) for the vertex contribution is of the same shape in H and \(\mu \)H. While that is a complete result for hydrogen, for its muonic analog we need to take into account other effects as well.

We remind that in H and \(\mu \)H the LO hVP contribution to the Lamb shift (see Fig. 3) is proportional to \(|\varPsi (0)|^2\), the squared value of the wave function at origin. In the muonic case, there is a number of different sources for the \(\alpha \) correction to \(|\varPsi (0)|^2\) in addition to the two-loop-vertex term discussed above. One of them is from an NR correction to \(|\varPsi (0)|^2\) due to the Uehling potential which is known in a semi-analytic form (see, e.g., [43]), while the other is due to an NR correction with electronic vacuum polarization (eVP) on the same photon as hVP. Those NR specific \(\mu \)-atom NLO hVP corrections are presented in Fig. 3. The results could be generalized for light two-body muonic atoms. The Uehling correction to the wave function at origin is given for the 1s and 2s states of atoms \(A\le 10\) in [44].

Fig. 10
figure 10

The Uehling-potential (eVP) correction to the LO hVP contribution. The double horizontal line is for the NR Coulomb Green function of a bound muon. The NLO correction is of order \(\alpha ^2(Z\alpha )^4m\)

Let us consider now all the mentioned effects in more detail, starting with the two-loop vertex. A two-loop vertex, where one loop is a VP one, delivers pure lepton contributions as well as hVP ones. The simplest leptonic one is the eVP contribution to the Lamb shift in hydrogen, which is well known [45,46,47,48]. A similar contribution is due to muonic vacuum polarization (\(\mu \)VP) in \(\mu \)H (which can be found with a simple scaling substitution \(m_e\rightarrow m_\mu \)). There are also asymmetric in masses pure leptonic contributions, such as eVP in \(\mu \)H and \(\mu \)VP in H, which are known only in part.

Let us find \(C_\mathrm{vert2}(m)\) and \(K_\mathrm{vert2}(m,s)\) for s states, defined in general in (6), where m is the mass of the orbiting particle. Following [49], we obtain for the Lamb shift of the ns state with an orbiting particle with mass m

$$\begin{aligned} C_\mathrm{vert2}(m)=24 \left( \frac{\alpha }{\pi }\right) ^2\,\frac{(Z\alpha )^4 }{n^3} \frac{m_r^3}{m^2} \end{aligned}$$
(26)

and

$$\begin{aligned} K_\mathrm{vert2}(m,s)= & {} \frac{1}{288}\int _0^1{\mathrm{d}x} \biggl [ \frac{m^2x}{D}\left( 6+15x-13x^2\right) \nonumber \\&+ \frac{m^4x^3}{D^2}\, \left( 10-7x+7x^2\right) \biggr ], \end{aligned}$$
(27)

where

$$\begin{aligned} D(x, s) = m^2x^2 + s(1-x)\;. \end{aligned}$$

The identities above are for the sum of two mentioned earlier contributions (due to \(F_1'(0)\) and \(F_2(0)\)).

Here s is a dispersion variable [cf. (5)], which can be introduced both for leptonic and hadronic VP loops. In the leptonic case, we use the parameterization

$$\begin{aligned} \frac{1}{q^2} \rightarrow \frac{\alpha }{\pi }\int {\frac{d v}{1-v^2}\;\frac{ v^2\left( 1-\frac{v^2}{3}\right) }{q^2+\frac{4m_l^2}{(1-v^2)}}}\;, \end{aligned}$$

where \(m_l\) is the lepton mass in the VP loop.

Performing the x-integration, we find

$$\begin{aligned} K_\mathrm{vert2}(m,s)= & {} -\frac{1}{72}\,\Biggl \{ \frac{s (5m^2 - 2s)}{m^2(s - 4m^2)} + \frac{s^2 + 2m^4}{m^4}\ln {\frac{s}{m^2}}\nonumber \\&- \frac{s(s^3 - 6s^2 m^2 + 8s m^4 - 6m^6)}{m^4(s - 4m^2)}\cdot J_{10}(s)\Biggr \},\nonumber \\ \end{aligned}$$
(28)

where

$$\begin{aligned} J_{10}(s)= & {} \int _0^1{\mathrm{d}x} \frac{m^2}{d}\\= & {} \frac{1}{\sqrt{s}\,\sqrt{s-4m^2}}\\&\times \biggl \{ 2\ln {\frac{s+\sqrt{s}\,\sqrt{s-4m^2}-2m^2}{s+\sqrt{s}\sqrt{s-4m^2}}} +\ln {\frac{s}{m^2}}\,\biggr \}. \end{aligned}$$

It is useful to consider asymptotic behavior at \(s\gg m^2\). The result reads

$$\begin{aligned} K_\mathrm{vert2:lead}(m,s) = \frac{m^2}{s}\biggl [\frac{1}{36}\ln {\frac{s}{m^2}} + \frac{1}{54}\biggr ]. \end{aligned}$$
(29)

Using the kernel obtained above, we can consider various leptonic VP contributions to the Lamb shift of s states in H and \(\mu \)H. Such contributions are eVP and \(\mu \)VP for H and eVP and \(\mu \)VP for \(\mu \)H. The contribution, for which the loop particle and the orbiting one are the same, agrees with the known theoretical result [48].

The eVP correction to the Lamb shift in \(\mu \)H reads

$$\begin{aligned} \varDelta E_\mathrm{vert:eVP}(ns,\;\mu \mathrm{H})= & {} \frac{\alpha ^2(Z\alpha )^4}{\pi ^2 n^3}\, \frac{m_r^3}{m_\mu ^2} \nonumber \\&\times \left[ \frac{1}{9} \ln ^2{\left( \frac{m_\mu }{m_e}\right) ^2}-\frac{10}{27} \ln {\left( \frac{m_\mu }{m_e}\right) ^2}\right. \nonumber \\&\left. ~~ + \frac{2\pi ^2}{27} + \frac{85}{162}\right] \;, \end{aligned}$$
(30)

which agrees with the result from [50].

Fig. 11
figure 11

An ‘asymmetric in masses’ contribution to the Lamb shift in H due to the two-loop vertex, where one of the loops is a \(\mu \)VP one

The \(\mu \)VP correction to the Lamb shift in hydrogen (see Fig. 11), which is a new result, is obtained as

$$\begin{aligned} \varDelta E_\mathrm{vert:{\mu }VP}(ns,\;\mathrm{H})= & {} \frac{\alpha ^2(Z\alpha )^4}{\pi ^2 n^3}\, \frac{m_r^3}{m_e^2} \left( \frac{m_e}{m_\mu }\right) ^2 \nonumber \\&\times \left[ \frac{2}{45}\ln {\left( \frac{m_\mu }{m_e}\right) ^2} + \frac{97}{675} \right] ,~~~ \end{aligned}$$
(31)

which contributes 27 Hz to the 1 s Lamb shift in hydrogen and deuterium.

In the case of the NLO hVP, the diagram in Fig. 9 allows us numerical evaluations only. In the case of hydrogen, the expression can be simplified (since \(m_e^2/s\ll 1\))

$$\begin{aligned} E_L^\mathrm{hVP: vert2}(ns)= & {} 24 \left( \frac{\alpha }{\pi }\right) ^2\,\frac{(Z\alpha )^4m_r^3}{n^3}\nonumber \\&\times \int {\frac{\mathrm{d} s~ \rho (s)}{s}}\left[ \frac{1}{36}\ln \frac{s}{m_e^2}+\frac{1}{54}\right] .~~~ \end{aligned}$$
(32)

There are different possibilities to obtain a fast preliminary estimation. In particular, we consider two simple methods below. One is to use a simplistic model for the dispersion function R(s), which we have previously used for other occasions (cf. [38, 39]). A related result is given in Table 7. For \(\mu \)H, we have applied an exact kernel, since the asymptotics in (32) is not very accurate for \(\mu \)H. Therefore, our estimation is obtained with an exact kernel K(s) and approximate function \(\rho (s)\).

Table 7 Calculation of the NLO: vert2 hVP contribution to the Lamb shift in H and \(\mu \)H. We give here no uncertainty, but illustrate the scatter of the data obtained by different methods. The uncertainty of NLO hVP can be neglected comparing with the uncertainty of LO hVP. The NLO term rather contributes to the central hVP value, but not to its uncertainty

Another estimation can be obtained with complete R(s) but very approximate K(s). To begin with, we use a rough approximation (cf. (32))

$$\begin{aligned} K_L^\mathrm{vert2:approx}(m, s)=\frac{m^2}{s}\left[ \frac{1}{36}\ln \frac{m_\rho ^2}{m^2}+\frac{1}{54}\right] \;. \end{aligned}$$

which leads to useful relations

$$\begin{aligned} \mathcal{I}_L^\mathrm{vert2:lead}(\mathrm{H})= & {} \left[ \frac{1}{12}\ln \frac{m_\rho ^2}{m_e^2}+\frac{1}{18}\right] \mathcal{I}_{a_e}\;,\nonumber \\ \mathcal{I}_L^\mathrm{vert2:lead}(\mu \mathrm{H})= & {} \left[ \frac{1}{12}\ln \frac{m_\rho ^2}{m_\mu ^2}+\frac{1}{18}\right] \mathcal{I}_{a_\mu }^{(0)}\;. \end{aligned}$$
(33)

For numerical estimations, which are given in the first line of Table 7, we utilize results from [3] for the right-hand side of the identities in (33).

We have obtained two independent estimations for the Lamb shift in both ordinary and muonic hydrogen, and their scatter characterizes the uncertainty of those estimations.

The estimation can be improved once we use an approximation of the kernel as a combination of kernels for Mu HFS interval and for \(a_e\) (or \(a_\mu \)) with the coefficients good for asymptotics \(s\gg m^2\). In this case, we can use the identity (32) with \(\ln (s/m^2)\) without a substitution \(s\rightarrow m_\rho ^2\), which is not very accurate for a broad range of s.

In particular, one can express the asymptotics of this kernel in terms of asymptotics for the kernels for \(a_\mu , a_e\) and for Mu HFS interval as

$$\begin{aligned} K_L^\mathrm{vert2:lead}(m_e, s)= & {} \frac{m_e^2}{m_\mu ^2}\left[ \frac{1}{162}K_\mathrm{Mu:lead}(m_\mu ,s)\right. \nonumber \\&+\left. \left( \frac{1}{6}\ln \frac{m_\mu }{m_e}-\frac{1}{72}\right) K_\mathrm{a:lead}(m_\mu ,s)\right] \;,\nonumber \\ K_L^\mathrm{vert2:lead}(m_\mu , s)= & {} \frac{1}{162}K_\mathrm{Mu:lead}(m_\mu ,s)\nonumber \\&-\frac{1}{72}K_\mathrm{a:lead}(m_\mu ,s)\;. \end{aligned}$$
(34)

Using the same coefficients, we apply now the relations for the complete kernels, which makes those relations approximate. The result for their integration over s is known (see above) and, following the scatter results, we obtain

$$\begin{aligned} \mathcal{I}_L^\mathrm{vert2}(\mathrm{H})\simeq & {} 4.1\times 10^{-7}\;,\nonumber \\ \mathcal{I}_L^\mathrm{vert2}(\mu \mathrm{H})\simeq & {} 6.1\times 10^{-3}\;. \end{aligned}$$
(35)

To check the accuracy of (35) and introduce corrections, we have studied the ratios of interest. In previous sections, we were interested in small corrections to known values. It was sufficient to study corrections to R(s) in a relatively narrow area. Here we are to calculate the whole NLO contributions and therefore we are rather interested in the complete area of s integration.

The accuracy for the contribution to the Lamb shift in hydrogen is determined by the ratio

$$\begin{aligned}&\frac{K_L^\mathrm{vert2}(m_e, s)}{\frac{1}{162}\frac{m_e^2}{m_\mu ^2}K_\mathrm{Mu}(m_\mu ,s)+\left( \frac{1}{6}\ln \frac{m_\mu }{m_e}-\frac{1}{72}\right) K_\mathrm{a}(m_e,s)}\nonumber \\&\quad =0.985\pm 0.012\, \end{aligned}$$
(36)

plotted in Fig. 12.

Fig. 12
figure 12

Kernel ratio r of (36)

For muonic hydrogen, the ratios of interest are (see Fig. 13)

$$\begin{aligned} \frac{K_L^\mathrm{vert2}(m_\mu , s)}{\frac{1}{162}K_\mathrm{Mu}(m_\mu ,s)-\frac{1}{72}K_\mathrm{a}(m_\mu ,s)}=0.92\pm 0.06\;, \end{aligned}$$
(37)

and

$$\begin{aligned} \frac{K_L^\mathrm{vert2}(m_\mu , s)}{\frac{1}{162}K_\mathrm{Mu}(m_\mu ,s)-\frac{1}{72}\frac{m_\mu ^2}{m_e^2}K_\mathrm{a}(m_e,s)}=0.93\pm 0.05\;. \end{aligned}$$
(38)
Fig. 13
figure 13

Kernel ratios r of (37) (red) and (38) (blue)

Using the relations above, we obtain central values given in Table 7. We still do not give any uncertainty. The most uncertain estimation can be obtained from the scatter value. Even its uncertainty is smaller that the uncertainty of the approximate relations we apply. The uncertainty of the NLO: vert2 term is much smaller than the one of LO hVP and can be ignored.

Finally, for the NLO hVP: vert2 terms we arrive at

$$\begin{aligned} \varDelta E_L^\mathrm{H}(1s)= & {} 18\;\mathrm{Hz}\;,\nonumber \\ \varDelta E_L^{\mu \mathrm H}(2p-2s)= & {} -0.02\;\mu \mathrm{eV}\;. \end{aligned}$$
(39)

6 The hVP contribution to the Lamb shift in hydrogen and muonic hydrogen

The NLO hVP contributions have been known for \(a_\mu \) (see, e.g., [18]), \(a_e\) (see, e.g, [2]), and Mu HFS interval [38, 39], but not for the Lamb shift in both ordinary and muonic hydrogen. In principle, any accurate evaluation of LO should be accompanied with an NLO calculation, since the LO uncertainty is often comparable to the NLO contribution. The NLO contributions to Lamb shift in H and \(\mu \)H have been already in part considered above.

In the case of hydrogen, NLO consists of the vert2 term only. It is of relative order of \(\alpha \) to the LO hVP term. A contribution with an extra \(Z\alpha \) exists, but it has an additional suppression factor of \(m_e/m_\rho \). Theory of hVP for the 1s Lamb shift in hydrogen is summarized in Table 8. We consider NLO not as the result by itself, but rather to compare it with the uncertainty of LO. For the 1s state in hydrogen it is 18 Hz.

Table 8 The hVP contribution in hydrogen. We use here the results based on scatter of \(a_\mu \) data
Table 9 LO and NLO hVP contributions to the \(2p-2s\) Lamb shift in \(\mu \)H, based on scatter estimations. See Sect. B in Appendix for detail

The NLO contribution is the same (in units of the LO term) for any s state. Since the NLO contribution has an additional factor of \(\alpha \), it is the same in units of the LO term for all the low-Z H-like atoms. In particular, for an arbitrary s state of a low-Z H-like atom it is

$$\begin{aligned} 18\, \mathrm{Hz}\times \frac{Z^4}{n^3}\;. \end{aligned}$$

The LO hVP term also scales as \(Z^4/n^3\).

Theory of the Lamb shift in muonic hydrogen is somewhat different from that in hydrogen. In particular, additionally to the vert2 term there are NR Uehling contributions. One of them is due to a modification of the wave function at origin, that has been considered for a number of occasions (see the left diagram in Fig. 10). The other one is with eVP on the same photon as hVP (see the right diagram in Fig. 10). Both are similar to the corrections in muonic atoms for HFS interaction and for finite-nuclear-size contributions (FNS) to the Lamb shift (cf., e.g., [44, 51]). We apply here the related results following [44]. The evaluation of soft contributions due to the Uehling potential is given in Appendix in Sect. B for the nuclear charge \(Z=1-10\).

Summary on theory of hVP for the \(2p-2s\) Lamb shift in \(\mu \)H is given in Table 9. Similarly to the case of hydrogen above, here we consider NLO not as the result by itself, but rather to compare it with the LO uncertainty. The results on other low-Z two-body muonic atoms are given in Appendix in Sect. B.

As mentioned in a number of publications (see, e.g., [4, 5, 52]), there is a certain interference in treatment of the FNS and hVP contributions to the Lamb shift in H and \(\mu \)H (and other atoms). In a number of situations, such as, e.g., in determination of the form factors, the hVP correction is not subtracted while evaluating the scattering data and therefore instead of form factors one measures rather

$$\begin{aligned} \frac{G(q^2)}{1-\frac{\alpha }{\pi }{\frac{\varPi _h(q^2)}{q^2}}}\;. \end{aligned}$$

For the expansion at low \(q^2\), that means that instead of the charge radius squared we effectively deal with

$$\begin{aligned} -\frac{1}{6} R_p^2+\frac{\alpha }{\pi }\varPi _h'(0)\;. \end{aligned}$$
(40)

If only the LO hVP contribution to the Lamb shift of either ordinary or muonic atom is important, it is more advantageous not to consider hVP explicitly at all, which means an effective redefinition of the rms nuclear charge radius following (40).

Mathematically speaking, FNS and hVP do not make a single block. While fitting, they have a somewhat different structure and it may be good to separate them. Corrections, e.g., a soft QED correction for \(\mu \) atoms, are also not of exactly the same form. The combinatoric coefficient is different (see Sect. B in Appendix). A similar situation is with the radiative corrections to electron-proton scattering, e.g., eVP corrections for one-photon exchange have a different (by a factor of two) coefficient for form factor and for hVP. Because of logarithmic enhancement for \(q^2\gg m_e^2\), such a correction can be more important for scattering than for the bound problem.

In the case of light compound nuclei (deuteron, triton, helion, \(\alpha \)-particle), the situation is more tricky. The radius can be determined either from muonic atoms or from scattering. In the meantime, there are various nuclear models (see, e.g., [53]) which may be efficiently used for calculation of the nuclear-structure contributions. A value of the radius used in theory of compound nuclei is usually not the one obtained from the models, but they still have to produce such a radius from the model for a consistency test. The hVP effects are definitely to be added to nuclear-physics-model calculations of the radius. However, at the present level of accuracy that is not really important. The most accurate applications relate H and \(\mu \)H and their isotopes, where the mentioned effects are rather marginal.

Here, we have calculated hVP corrections in ordinary and \(\mu \) atoms and it is up to those who consider their complete theory to include them or not. However, the decision should be explicitly explained in order to avoid confusion and to allow ones to maintain consistency with other calculations.

7 Conclusions

In area of precision low-energy tests of QED, there is a number of discrepancies, such as the proton-radius puzzle and a controversy with \(a_\mu \). These discrepancies create problems at two different levels. One that attracts most of the attention is roughly speaking a question, which of controversial results are right and which are wrong. In meantime, there is an additional problem in the shadow of the previous one. Presence of a controversy means that certain data and certain standard treatments are compromised. Meanwhile, they are not applied exclusively to the controversial quantities, but are often also used for a number of others.

For instance, the proton-radius puzzle tells us that probably the use of the data on the form factors of a proton in a wide region, as it is usually done, is incorrect. (If it is, it is not clear whether we have a problem with the data or their interpretation, including the fitting ‘as usual’.) That means not only that the related value of the proton charge radius is [probably] wrong, but also that the magnetic radius and various other quantities such as the so-called Zemach and Friar radii, cannot be considered as reliable, once they are deduced from the same evaluation that delivers a probably incorrect value of \(R_p\). The Zemach radius [54] is required to obtain theoretical predictions for the HFS interval in ordinary and muonic hydrogen, while the Friar radius [55,56,57] is a key element of a presentation of a higher-order FNS contribution, which plays an important role for a determination of \(R_p\) from the Lamb shift in \(\mu \)H. (The related higher-order FNS contribution is essentially smaller than a discrepancy for the radius from \(\mu \)H spectroscopy and \(e-p\) scattering, but essentially larger than the uncertainty of \(R_p\) due to the \(\mu \)H experiment [58] and of a pure QED part of the related theory (see, e.g., [5]).) See [59, 60] for more details.

A number of recent indications from hydrogen spectroscopy [61, 62] and electron-proton low-momentum-transfer scattering [63] have pointed out that the muonic value of the proton charge radius [58] is in general correct. (Not all the recent results support the muonic value of \(R_p\), e.g., the spectroscopic result [64] confirms the ‘old’ hydrogen value, which is consistent with a result of a [former] ‘usual’ evaluation from \(e-p\) scattering [65].) However, they do not help for understanding what to do with the overall scattering data and their fits for the electric form factor, that is required for a calculation of the Friar term in order to eventually obtain a muonic value of \(R_p\).

The problem with \(a_\mu \) is somewhat different. If theory is compromised, that means that a calculation of LO hVP is compromised at the first place. (The [dominant] pure QED part of theory [66] is much more accurate and reliable than the LO hVP contribution, while the higher-order hadronic effects are smaller than LO hVP and therefore also more reliable (see [1] for detail).)

Standard theoretical evaluations of the LO hVP term are based on the dispersion integral in (1) over experimental data (presented with the R(s) function). The same function R(s) is applied for other observables. The discrepancy for \(a_\mu \) should be considered as the most accurate test of calculations of the LO hVP contribution through a dispersion integral. The discrepancy indicates that the dispersion calculation is compromised. It is the most important not just to update various hVP results from an occasion to an occasion, but to check whether a more recent value is consistent with the previous one, and to be sure that all the results for different observables, applied within a certain analysis, are consistent.

Disagreement between experiment and theory on \(a_\mu \) opens a question, how the hVP results for \(a_\mu \) should be interpreted. There are four options: (i) experiment [7] is not correct; LO hVP theory (see, e.g., [3]) is not; the evaluation of hLbL is not; or certain new physics is present.

Technically, the option of new physics as a solution of the discrepancy between theory and experiment leads to the same consequences for the application as a possible [relatively] large error in the hLbL contribution. The experimental value of \(a_\mu \) is applicable in such a case as well as the R function and all the direct results on the LO hVP contributions to various observables; however, one has to consider an additional contribution to various QED observables, which has not been accounted for (or has not been correctly accounted for) previously. Such a contribution (e.g., due to a conceptual error in the hLbL term or due to new physics) for an observable of interest may happen to be larger than the uncertainties due to LO hVP (theoretical) and \(a_\mu \) (experimental). In both cases (hLbL and new physics), the contribution is to be model dependent. The required new physics should be introduced with a certain model or one has to consider a completely new approach to hLbL, drastically different from the previous calculations. Eventually, that leads to a bunch of model-dependent corrections to various observables.

Until such a model appears we have to focus our attention on two other options: either the \(a_\mu \) measurement [7] has a systematic error, or the data on R(s), applied for a standard calculation of LO hVP, have to be reconsidered. The data may involve the measurements errors, the errors in interpretation (such as radiative corrections), or models to extend the data beyond the experimental reach (to low-luminosity kinematic areas and/or channels).

Without new data or a hint, we are not capable to choose between hVP theory or experimentFootnote 2, and therefore as an estimation of the accuracy of LO hVP we consider the scatter of the data from its direct and indirect determination. The estimation, based on the scatter, pretty well agrees with the lattice results [19, 30,31,32,33,34,35,36] (see Fig. 6). As a consistent set of hVP contributions, we choose the values given in Table 10. They are based on scatter of \(a_\mu \) (see Fig. 6) and details of their evaluation have been given in the paper.

Table 10 The hVP contributions to various QED observables. We present LO hVP for most of the contributions, except for the Lamb shift, for which we present LO+NLO

The muonium 1s HFS interval deserves a somewhat more accurate consideration. Technically speaking, the largest hVP contribution to Mu HFS interval is not the one considered here as the LO hVP term, but another one due to \(a_\mu \). They are of the same order in \(\alpha \) but have different suppression mass factors, such as \(m_\mu ^2/m_\rho ^2\) (for \(a_\mu \)) and \(m_em_\mu /m_\rho ^2\) (for a ‘direct’ contribution to Mu HFS). A small LO hVP contribution to \(a_e\) is also of the same order in \(\alpha \) but has a suppression factor of \(m_e^2/m_\rho ^2\).

Once we define the leading NR term of the muonium HFS interval, the Fermi energy \(E_F\), as the result of interaction of full moments of electron and muon as given in (19), the \(a_\mu \)-related hVP contribution is a part of the Fermi energy, while the leading hVP correction to it is \(\varDelta \nu _\mathrm{Mu}({\text {LO hVP}})\). However, we have to consider the leading NR term and the LO hVP correction to the Mu HFS interval in a consistent way. Since QED theory of \(a_\mu \) is set, any uncertainty and error is understood as uncertainty or error of direct or indirect determination of the contribution of hVP effects. A simultaneous inclusion in the same theoretical expression of an experimental value of \(a_\mu \) from [7] and \(\varDelta \nu _\mathrm{Mu}({\text {LO hVP}})\) from a dispersion integration (1) (such as the result from [2]) is inconsistent.

Our recommended set of consistent values for the involved quantities, based on the scatter for \(a_\mu \), is

$$\begin{aligned} a_\mu= & {} 11\,659\,193.7(15.4)\times 10^{-10}\;,\nonumber \\ \varDelta \nu _\mathrm{Mu}(\mathrm{hVP})= & {} 241(5)\,\mathrm{Hz}\;,\nonumber \\ r\,\bigl (a_\mu ,\nu _\mathrm{Mu\; hfs}(\mathrm{hVP})\bigr )= & {} 0.87\,, \end{aligned}$$
(41)

where we also give a correlation coefficient.

In Table 10, we present for the Lamb shift in hydrogen and muonic hydrogen a sum LO+NLO hVP, since NLO contribution is an original result of this paper. One may be interested in LO and NLO separately and not only for ordinary and muonic hydrogen, but also for other atoms. The NLO hVP in ordinary atoms and a hard part of the contribution to muonic atoms is discussed in Sect. 5, while the soft part of the muonic-atom correction is present in Appendix in Sect. B. Here we summarize the results for \(A=1-4\) for ordinary and muonic atoms that are given in Tables 11 and 12, respectively.

One may also be interested in their intercorrelations. As concerns correlations for the Lamb shift of the ns states in ordinary and muonic atoms, all the LO hVP contributions are proportional to the same value, namely \(\varPi _h'(0)\) and therefore they are correlated with a correlation coefficient \(r=1\). (We remind that the LO hVP contribution vanishes for \(l\ne 0\).)

Table 11 The LO+NLO hVP contribution to the 1s Lamb shift in light (\(Z=1,2)\) [ordinary] hydrogen-like atoms
Table 12 The LO+NLO hVP contribution to the \(2p-2s\) Lamb shift in light (\(Z=1,2)\) muonic atoms

We do not consider here few-electron atoms, such as He, or simple molecules, such as HD\(^+\); however, we expect that LO hVP for most of low-energy variables is to be described with contact interactions (an electron-electron or electron-nucleus one) and therefore it is to be proportional to \(\varPi '_h(0)\), which makes our finding directly applicable for such atomic and molecular systems. The exception is the hyperfine interaction (cf. Sect. 4) for which correlations make a more complicated form.

Having in mind various projects on accurate measurements of pure leptonic properties, such as the anomalous magnetic moment of electron [68] and muon [67, 69], energy intervals in muonium [70, 71] and positronium [72, 73], progress in related theoretical QED calculations [74,75,76,77,78], and in determination of involved fundamental constants (such as the fine structure constant \(\alpha \) [79]), a better understanding of related hVP contributions, that is one of the limiting factor of theory of pure leptonic systems, is important.

After the work was completed a number of new publications become available [80,81,82,83,84,85,86]. They somewhat improve the situation on certain issues but do not affect the conclusions of the paper. One of the results of this paper is the relations between LO hVP contributions to different observable. The data may play a role of starting point. The most important problem with the data is a discrepancy of theoretical and experimental results on \(a_\mu \). A minor change of a dispersion result for the theoretical value is marginal comparing with the size of the discrepancy. Once we deal with a half sum of theoretical and experimental values as the central value of our prediction and their half difference as an estimation of the uncertainty, any change in actual theoretical values is negligible.

Still, new data-driven evaluations of the LO hVP contribution to \(a_\mu \) was presented in [80,81,82]. Two of them [80, 81] utilize \(e^+e^-\) annihilation data only, while the other [82] uses also the \(\tau \)-decay data. The consensus on use or non-use of \(\tau \) data has not yet reached.

There is a new result on LO hVP from lattice [83].

There has been also a progress in calculation of hadronic LbL contribution [84,85,86]. Within our logics the latter contributes to the indirect determination of the LO hVP by striping the experimental value of \(a_\mu \) out of all the other theoretical contributions. The hLbL one is a dominant contribution to the theoretical uncertainty of such a procedure.

Technically, the results from [80] could be useful for realization of our strategy line, because that seems to be the first paper with appropriate description of R(s) where the LO hVP contribution is calculated with the same R(s) for \(a_\mu \), \(a_e\), and Mu HFS. That should simplify the evaluations, however, the eventual results (see Table 10) would not change much.