Chiral perturbation theory of muonic-hydrogen Lamb shift: polarizability contribution

The proton polarizability effect in the muonic-hydrogen Lamb shift comes out as a prediction of baryon chiral perturbation theory at leading order and our calculation yields ΔE(pol)(2P-2S)=8-1+3μ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta E^{(\mathrm{pol})} (2P-2S) = 8^{+3}_{-1}\, \upmu $$\end{document}eV. This result is consistent with most of evaluations based on dispersive sum rules, but it is about a factor of 2 smaller than the recent result obtained in heavy-baryon chiral perturbation theory. We also find that the effect of Δ(1232)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Delta (1232)$$\end{document}-resonance excitation on the Lamb shift is suppressed, as is the entire contribution of the magnetic polarizability; the electric polarizability dominates. Our results reaffirm the point of view that the proton structure effects, beyond the charge radius, are too small to resolve the ‘proton radius puzzle’.


Introduction
The eight standard-deviation (7.9σ ) discrepancy in the value of proton's charge radius obtained from elastic electronproton scattering [1] and hydrogen spectroscopy [2] on one hand and from the muonic-hydrogen (µH) spectroscopy [3,4] on the other, a.k.a. the proton charge radius puzzle [5,6], is yet to meet its fully agreeable solution. One way to solve it is to find an effect that would raise the µH Lamb shift by about 310 µeV, and it has been suggested that proton structure could produce such an effect at O(α 5 em ), e.g. [7,8]. Most of the studies, however, derive an order of magnitude smaller effect of proton structure beyond the charge radius [9][10][11][12][13][14][15].
The O(α 5 em ) effects of proton structure in the Lamb shift are usually divided into the effect of (i) the 3rd Zemach moment, (ii) finite-size recoil, and (iii) polarizabilities. The first two are sometimes combined into (i ) the 'elastic' 2γ contribution, while the polarizability effect is often split between (ii ) the 'inelastic' 2γ and (iii ) a 'subtraction' term, a e-mail: alarcon@kph.uni-mainz.de cf. Table 1. The 'elastic' and 'inelastic' 2γ contributions are well constrained by the available empirical information on, respectively, the proton form factors and unpolarized structure functions. The 'subtraction' contribution must be modeled, and in principle one can make up a model where the effect is large enough to resolve the puzzle [8].
In this work we observe that chiral perturbation theory (χ PT) contains definitive predictions for all of the above mentioned O(α 5 em ) proton structure effects, hence no modeling is needed, assuming of course that χ PT is an adequate theory of the low-energy nucleon structure. Some of the effects were already assessed in the heavy-baryon variant of the theory (HBχ PT), namely: Nevado and Pineda [11] computed the polarizability effect to leading order (LO) [i.e., O( p 3 )], while Birse and McGovern [13] computed the 'subtraction' term in O( p 4 ) HBχ PT (with the caveat explained in the end of Sect. 4). Here, on the other hand, we work in the framework of a manifestly Lorentz-invariant variant of χ PT in the baryon sector, referred to as Bχ PT [16][17][18][19]. At least the LO results for nucleon polarizabilities are known to be very different in the two variants of the theory, e.g., the proton magnetic polarizability is (in units of 10 −4 fm 3 ): 1.2 in HBχ PT [20] vs. −1.8 in Bχ PT [21][22][23]. Thus, the LO effect of the pion cloud is paramagnetic in one case and diamagnetic in the other (see [24,25] for more on HBχ PT vs. Bχ PT). Due to these qualitative and quantitative differences it is interesting to examine the Bχ PT predictions for the 2γ contributions to the Lamb shift. Here we compute the polarizability effect at LO Bχ PT and indeed find it significantly different from the LO HBχ PT results of Nevado and Pineda [11]; see Table 1.
Our result for the 'subtraction' and 'inelastic' contributions differ from most of the previous works because we have neglected the effect of the nucleon transition into its lowest excited state-the (1232). We argue, however (in Sect. 3), that the latter effect cancels out of the polarizability contribution. Thus, even though the 'subtraction' and 'inelastic' Table 1 Summary of available calculations of the 'subtraction' (second row), 'inelastic' (third row), and their sum-polarizability (last row) effects on the 2S level of µH. The last column represents the χPT predictions obtained in this work; here the omitted effect of the (1232)-resonance excitation is missing in the first two ('subtraction' and 'inelastic') numbers, but it does not affect the total polarizability contribution where it is to cancel out (µeV) Pachucki [9] Martynenko [10] Nevadoand Pineda [11] Carlson and Vanderhaeghen [12] Birse and McGovern [13] Gorchtein et al. [14] LO-BχPT [this work] 5 ) a Adjusted value; the original value of Ref. [14], +3.3, is based on a different decomposition into the 'elastic' and 'polarizability' contributions b Taken from Ref. [12] values appear to be very different from the empirical values due to neglect of the (1232) excitation, the polarizability contribution is not affected by this neglect.
The details of our calculation and main results are presented in the following section. Remarks on the role of the (1232) excitation are given in Sect. 3. The heavy-baryon expansion of our results is discussed in Sect. 4. An "effectiveness" criterion is applied to the HBχ PT and Bχ PT results in Sect. 5. The conclusions are given in Sect. 6. Expressions for the LO χ PT forward doubly virtual proton Compton scattering (VVCS) amplitude and pion electroproduction cross sections are given in Appendices A and B, respectively.

Outline of the calculation and results
We begin with the leading order chiral Lagrangian for the pion and nucleon fields, as well as the minimally coupled photons; see e.g. [16]. After a chiral rotation of the nucleon field the Lagrangian resembles that of the chiral soliton model; see [26] for details. As the result, the pseudovector π N N interaction transforms into the pseudoscalar one, while a new scalar-isoscalar ππ N N interaction is generated. The original and the redefined pion-nucleon Lagrangians, expanded up to the second order in the pion field, take the form L (1) L (1) where N (x) and M N is the nucleon field and mass, respectively, π a (x) is the pion field; g A 1.27, f π 92.4 MeV.
Upon the minimal inclusion of the electromagnetic field, the two Lagrangians give identical results for the O( p 3 ) Compton scattering amplitude and the isovector term proportional to (g 2 A − 1) does not contribute. Working with the second Lagrangian, however, simplifies a lot the evaluation of the two-loop graphs needed for the Lamb-shift calculation. The resulting Feynman diagrams, omitting crossed and time-reversed ones, are shown in Fig. 1.
These graphs represent an O(α 2 em ) correction to the Coulomb potential and can be treated in stationary perturbation theory. Since the Coulomb wave function is O(α 3/2 em ), the first-order contribution of these graphs to the energy shift is O(α 5 em ) as requested. As any energy transfer in the atomic system brings in extra powers of α em , we neglect it, and hence consider strictly the zero-energy forward kinematics. In this case the Feynman amplitude M is a number in momentum space, corresponding to a potential equal to M δ( r ). Because of the δ-function only the S-levels are shifted: where φ 2 n = m 3 r α 3 em /(π n 3 ) is the hydrogen wave function at the origin, for m r = m M p /(m + M p ) the reduced mass of the lepton-proton system, and m , M p = M N the corresponding masses of the constituents.
It is customary for the 2γ contributions to be split into leptonic and hadronic parts, i.e., where e 2 = 4πα em is the lepton charge squared, and is the leptonic tensor, with and q the 4-momenta of the lepton and the photons, respectively; g μν = diag(1, −1, −1, −1) is the Minkowski metric tensor. The tensor T μν is the unpolarized VVCS amplitude, which can be written in terms Fig. 1 The two-photon exchange diagrams of elastic lepton-nucleon scattering calculated in this work in the zero-energy (threshold) kinematics. Diagrams obtained from these by crossing and time-reversal symmetry are included but not drawn of two scalar amplitudes: with P the proton 4-momentum, ν = P · q/M p , Q 2 = −q 2 , P 2 = M 2 p . Note that the scalar amplitudes T 1,2 are even functions of both the photon energy ν and the virtuality Q. Terms proportional to q μ or q ν are omitted because they vanish upon contraction with the lepton tensor.
Going back to the energy shift one obtains [12]: In this work we calculate the functions T 1 and T 2 by extending the Bχ PT calculation of real Compton scattering [26] to the case of virtual photons. We then split the amplitudes into the Born (B) and non-Born (NB) pieces: The Born part is defined in terms of the elastic nucleon form factors as in, e.g. [13,27]: In our calculation the Born part was separated by subtracting the on-shell γ N N pion loop vertex in the one-particlereducible VVCS graphs; see diagrams (b) and (c) in Fig. 1.
Focusing on the O( p 3 ) corrections (i.e., the VVCS amplitude corresponding to the graphs in Fig. 1) we have explicitly verified that the resulting NB amplitudes satisfy the dispersive sum rules [28]: with ν 0 = m π + (m 2 π + Q 2 )/(2M p ) the pion-production threshold, m π the pion mass, and σ T (L) the tree-level cross section of pion production off the proton induced by transverse (longitudinal) virtual photons, cf. Appendix B. We hence establish that one is to calculate the 'elastic' contribution from the Born part of the VVCS amplitudes and the 'polarizability' contribution from the non-Born part, in accordance with the procedure advocated by Birse and McGovern [13].
Substituting the O( p 3 ) NB amplitudes into Eq. (6) we obtain the following value for the polarizability correction: This is quite different from the corresponding HBχ PT result for this effect obtained by Nevado and Pineda [11]: We postpone a detailed discussion of this difference till Sect. 4.
It is useful to observe that a much simpler formulas can be obtained upon making the low-energy expansion (LEX) of the VVCS amplitude, assuming that the photon energy in the atomic system is small compared to all other scales. To leading order in LEX, we may neglect the ν dependence in the numerator of Eq. (6) and, after Wick-rotating q to Euclidean hyperspherical coordinates [i.e., setting ν = i Q cos χ, q = (Q sin χ sin θ cos ϕ, Q sin χ sin θ sin ϕ, Q sin χ cos θ)] and angular integrations, find the following expression: with the weighting function w(τ ) shown in Fig. 2 and given by Plugging in here the LO Bχ PT expressions for i.e., nearly the same as before the LEX, cf. Eq. (10). This comparison shows that the LEX is applicable in this case, i.e.: in the energy-shift formula of Eq. (6) the ν-dependence of the numerator can to an extremely good approximation be neglected. As shown in Sect. 4, this approximation works well in the case of the HBχ PT calculation too.
To estimate the uncertainty of the LO result, we first observe that for low Q the VVCS amplitudes go as where α E1 and β M1 are the electric and magnetic dipole polarizabilities of the proton (hence the name "polarizability contribution"). Given the shape of the weighting function plotted in Fig. 2, the main contribution to the integral in Eq. (12) comes from low Q's, and therefore β M1 cancels out. The dominant polarizability effect in the Lamb shift thus comes from the electric polarizability α E1 . The Bχ PT physics of α E1 is such that to obtain the empirical number of about 11 (in units of 10 −4 fm 3 ), 7 comes from LO (π N loops) and 4 from NLO (π loops), with uncertainty of about ±1 from the O( p 4 ) low-energy constant [26]. Since in the present calculation we include only the LO π N loops, we expect our value to increase in magnitude when going to the next order (i.e., including the π loops). As the result, we replace the usual uncertainty of 15 % ( m π /GeV ) due to the higher-order effects by an uncertainty of 30 % [ (M − M p )/GeV] toward the magnitude increase, anticipating in this way the effect of the π loops. The 15 % uncertainty remains toward the magnitude decrease. With the uncertainty thus defined, our result is This is the number given in the third row of the last column in Table 1, where it can be compared to some previous results.
Most of them agree on the polarizability contribution. As for the 'inelastic' and 'subtraction' contributions, their meaningful comparison can only be made together with discussing the role of the (1232)-resonance excitation.

Remarks on the (1232) contribution and 'subtraction'
Presently the most common approach to calculate the polarizability effect relies on obtaining the VVCS amplitude from the sum rules of (9). Unfortunately, even a perfect knowledge of the inclusive cross sections (or, equivalently, the unpolarized structure functions) determines the VVCS amplitude only up to the subtraction function T The total result is therefore divided into the 'inelastic' part which is determined by empirical cross sections, and the 'subtraction' term which stands for the contribution of the subtraction function.
We can also perform such a division and based on the low-energy version of the sum rules [i.e., Eq. (12)] obtain This looks very different from the dispersive calculation, cf. Table 1. The main reason for this is the (1232)resonance excitation mechanism shown by the graph in Fig. 3.
We have checked that the dominant, magnetic-dipole (M1), part of electromagnetic nucleon-to-transition is strongly suppressed here, as is the entire magnetic polarizability (β M1 ) contribution, cf. discussion below Eq. (15). It is not suppressed in the 'inelastic' and 'subtraction' contribution separately, but it cancels out in the total. Thus, even though it is well justified to neglect the graph in Fig. 3 at the current level of precision, the split into 'inelastic' and 'subtraction' looks unfair without it.
In most of the dispersive calculations the cancelation of the excitation, as well as of the entire contribution of β M1 , occurs too, because the subtraction function is at low Q expressed though the empirical value for β M1 . Even the HBχ PT-inspired calculation of the subtraction function [13], which does not include the (1232) explicitly, is not an exception, as a low-energy constant from O( p 4 ) is chosen to achieve the empirical value for β M1 . Even at O( p 3 ) HBχ PT, the chiral-loop contribution to β M1 is-somewhat counterintuitively-paramagnetic and not too far from the empirical value, leading to a reasonable result for the 'subtraction' contribution. We take a closer look at the HBχ PT prediction for the various Lamb-shift contributions in the following section.
The central value for the 'subtraction' contribution obtained by Gorchtein et al. [14] is negative, even though theexcitation is included in their 'inelastic' piece. The quoted uncertainty of their subtraction value, however, is too large to point out any contradiction of this result with the other studies.

Heavy-baryon expansion
The heavy-baryon expansion, or HBχ PT [20,29], was called to salvage "consistent power counting" which seemed to be lost in Bχ PT, i.e. the straightforward, manifestly Lorentz-invariant formulation of χ PT in the baryon sector [16]. However, as pointed out by Gegelia et al. [30,31], the "powercounting violating terms" are renormalization scheme dependent and as such do not alter physical quantities. Furthermore, in HBχ PT they are absent only in dimensional regularization. If a cutoff regularization is used the terms which superficially violate power counting arise in HBχ PT as well, and must be handled in the same way as they are handled nowadays in Bχ PT-by renormalization.
In this work for example, all such (superficially powercounting-violating) terms, together with ultraviolet divergencies, are removed in the course of renormalization of the proton field, charge, anomalous magnetic moment, and mass. We use the physical values for these parameters and hence the on-mass-shell (OMS) scheme. This is different from the extended on-mass-shell scheme (EOMS) [17], where one starts with the parameters in the chiral limit. The physical observables, such as the Lamb shift in this case, would of course come out exactly the same in both schemes, provided the parameters in the EOMS calculation are chosen to yield the physical proton mass at the physical pion mass.
Coming back to HBχ PT. Despite the above-mentioned developments the HBχ PT is still often in use. The two EFT studies of proton structure corrections done until now [11,13] are done in fact within HBχ PT. We next examine these results from the Bχ PT perspective.
One of the advantages of having worked out a Bχ PT result is that the one of HBχ PT can easily be recovered. We do it by expanding the expressions of Appendix A in μ = m π /M N , while keeping the ratio of light scales τ π = Q 2 /4m 2 π fixed. For the leading term the Feynman-parameter integrations are elementary and we thus obtain the following heavy-baryon expressions: The first expression reproduces the result of Birse and McGovern (cf. T 1 in the appendix of [13] 1 ). We have also verified that these amplitudes correspond to the ones 1 At subleading order in the heavy-baryon expansion, we obtain This expression reproduces the g 2 A terms of T (4) 1 in the appendix of Ref. [13], apart from the terms inside the square brackets. These terms of Nevado and Pineda [11] at zero energy (ν = 0), up to a convention for an overall normalization of the amplitudes. We have also reproduced their expressions for T 1 and T 2 (cf. Eq. (3.2) and (3.5) in Ref. [11]) for all ν and Q 2 .
Substituting these expressions into (12), we obtain the following value for the polarizability contribution to the 2Slevel shift in µH: This is slightly different from the result of Ref. [11] that we quote in Eq. (11), which is because of the neglected energy dependence, i.e., the use of the LEX in deriving Eq. (12) from (6). Still, the difference between the exact and LEX result is well within the expected 15 % uncertainty of such calculation and hence we conclude that the LEX approximation works well in this case too. Substitution to Eq. (17) yields the HBχ PT predictions for the 'inelastic' and 'subtraction' contributions: Neglecting for a moment the difference between τ π and τ μ , we obtain very simple closed expressions for the Lamb-shift contributions: where G 0.9160 is the Catalan constant. This should provide an impression of the parametric dependencies arising in χ PT for this effect. The resulting numbers are within the expected uncertainty for HBχ PT result, and they can in principle be easily improved in a perturbative treatment of the pion-muon mass difference. So far we have been discussing the O( p 3 ) result. At higher orders one in addition to the VVCS calculation needs to consider the appropriate operators from the effective leptonnucleon Lagrangian with corresponding low-energy constants fixed to, e.g., the low-energy lepton-nucleon scatter-Footnote 1 continued come from the expansion of the leading pion loop contribution to the term β M1 Q 2 in powers of m π and hence are part of δβ in that reference.
ing. Birse and McGovern [13] computed the VVCS amplitude T 1 (0, Q 2 ) to order O( p 4 ), but they evaded the consideration of the lepton-nucleon terms by introducing a "physical cutoff" in Q. Hence, their resulting calculation of the subtraction term is strongly cutoff dependent and lies, strictly speaking, outside the χ PT framework; we refer to it as "HBχ PTinspired" calculation.

"Effectiveness" of HBχ PT vs. Bχ PT
Although at high enough orders HBχ PT and Bχ PT are bound to yield the same results, at low orders this is not necessarily so and practice shows that especially at 'predictive' orders, where there are no free low-energy constants to absorb the differences, HBχ PT and Bχ PT results differ substantially, sometimes even in the sign of the total effect (cf. the order p 3 result for the magnetic polarizability of the nucleon [24,26]). The proton polarizability contribution to the Lamb shift is apparently such a case as well. So, having found the substantial differences between the HBχ PT and Bχ PT predictions the obvious question is: which one is more reliable, if any?
A rather common point of view is that, since HBχ PT neglects only the effects of "higher order", any substantial disagreement only signals the importance of higher-order effects and hence neither of the calculations should be trusted at this order. On the other hand, it is plausible that not all the higher-order effects are large, but only the ones present in the Bχ PT calculation and dismissed in the one of HBχ PT. In support of the latter scenario is the physical principle of analyticity-consequence of (micro-)causality, which in Bχ PT is obeyed exactly, while in HBχ PT it is obeyed only approximately, albeit improvable order by order.
Another, perhaps more quantitative criterion is the one put forward by Strikman and Weiss [32]. In the interpretation of Ref. [24], it requires that the high-momentum contribution of finite (renormalized) loop integrals over quantities which are invariant under redefinitions of hadron fields should not exceed the expected uncertainty of the given-order calculation. In other words, the contribution from beyond the scales at which the effective theory is applicable should not exceed a natural estimate of missing higher-order effects.
In our case the VVCS amplitudes are such quantities invariant under redefinitions of pion and nucleon fields and hence it makes sense to examine Fig. 4, where the polarizability effect is plotted as a function of the ultraviolet cutoff Q max imposed on the momentum integration in (12).
The figure clearly shows that the relative size of the highmomentum contribution in the HBχ PT case is substantially larger than in Bχ PT.
Assuming the breakdown scale for χ PT is of order of the ρ-meson mass, m ρ = 777 MeV, we can make a more quanti- In the Bχ PT case, the contribution from momenta above m ρ is less than 15 %, well within the expected uncertainty.

Conclusion and outlook
Is the proton polarizability effect different in muonic versus electronic hydrogen so as to affect the charge radius extraction? The answer is 'yes'. From the LEX formula in Eq. (12), one sees that the polarizability contribution not only affects the charge radius extraction from the Lamb shift but also that this effect is about m μ /m e ≈ 200 times stronger in µH than in eH. Indeed, the weighting function plotted in Fig. 2 for the two cases is much larger in the muon case. The lepton mass acts, in fact, as a cutoff scale. Nonetheless, the Bχ PT result obtained hereby demonstrates that the magnitude of this effect is not nearly enough to explain the 'proton radius puzzle', which amounts to a discrepancy of about 300 µeV.
As seen from Table 1, our Bχ PT result for the polarizability effect agrees with the previous evaluations based on dispersive sum rules, but it is substantially smaller in magnitude than the HBχ PT result of Nevado and Pineda [11]. This is of course not the first case when the Bχ PT and HBχ PT results differ significantly-the polarizabilities themselves provide such an example.
The differences between HBχ PT and Bχ PT results are often interpreted as the uncertainty of χ PT calculations. This interpretation is too naive as there are physical effects that distinguish the two. For example, the Bχ PT calculations obey analyticity exactly while the HBχ PT ones only approximately. Furthermore, we have checked that in HBχ PT the contribution from momenta beyond the χ PT applicability domain is somewhat bigger than the expected uncertainty of the calculation. The Bχ PT result is more "effective" in this respect, as the high-momentum contribution therein is well within the expected uncertainty.
Within the Bχ PT calculation, we have verified the dispersive sum rules given in (9) and confirmed the statement of Ref. [13] that the split between the 'elastic' and 'inelastic' 2γ contributions corresponds unambiguously to the split between the Born and non-Born parts of the VVCS amplitude, rather than between the pole and non-pole parts.
We have observed that the (1232)-excitation mechanism shown in Fig. 3 does not impact the Lamb shift in a significant way because the dominant magnetic-dipole (M1) transition is suppressed, as is the entire magnetic polarizability effect. The (1232)-excitation effect is, however, important for the dispersive calculation because it is prominent in the proton structure functions and hence must be included in the 'subtraction' contribution to achieve a consistent cancelation of the M1 (1232) excitation. In most of the models this is roughly achieved by using an empirical value for the magnetic polarizability which includes the large paramagnetic effect of the M1 (1232) excitation. In the HBχ PT-inspired calculation of the 'subtraction' term [13] the -excitation is not included; however, the situation is ameliorated by the low-energy constant from O( p 4 ), which is chosen to reproduce the empirical value of the magnetic polarizability.
Naive dimensional analysis shows that χ PT at leading order is capable of yielding predictions for the entire twophoton correction to the Lamb shift. The polarizability part of that correction has been considered in this work. The last row of the last column of Table 1 contains the O( p 3 ) Bχ PT prediction for the proton polarizability effect on the 2S-level of µH. One needs to add to it the 'elastic' contribution (or, alternatively, the third Zemach moment together with 'finitesize recoil'), to obtain the full O(α 5 em ) effect of the proton structure in µH Lamb shift. Using an empirical value for the 'elastic' contribution from Ref. [13] [i.e., −24.7(1.6) µeV], our result for the full 2γ contribution to the 2P -2S Lamb shift is in nearly perfect agreement with the presently favored value [5,13] of 33(2) µeV.