1 Introduction

Muonic-atom spectroscopy has been successful at determining the charge radii of proton, deuteron, helion and alpha-particle with unprecedented precision through Lamb shift measurements [1,2,3]. It also holds the potential to impact tests of ab-initio nuclear theories and bound-state QED [4]. The proton Zemach radius \(R_\textrm{Z}\) has been extracted from a measurement of the 2S hyperfine splitting (hfs) in muonic hydrogen (\(\mu \)H) [5] with a \(3.4\%\) uncertainty:

$$\begin{aligned} R_\textrm{Z}(\mu \textrm{H})=1.082(37)\, \textrm{fm}, \end{aligned}$$
(1)

by comparing to the theory prediction in Ref. [6] that relies on a data-driven evaluation of the proton polarizability contribution [7]:

$$\begin{aligned} E_\textrm{hfs}^\mathrm {pol.}(2S,\mu \text {H})=8.0(2.6)\, \upmu \textrm{eV}. \end{aligned}$$
(2)

Several collaborations are now preparing a measurement of the ground-state (1S) hfs in \(\mu \)H with ppm precision: CREMA [8], FAMU [9, 10] and J-PARC [11] (see Ref. [12] for a comparison of the different experimental methods). These future measurements hold the potential to extract the Zemach radius with a sub-percent uncertainty, thereby constraining the magnetic properties of the proton.

A precise theory prediction for the 1S hfs in \(\mu \)H is essential for the success of the experimental campaigns. Firstly, to narrow down the frequency search range, which is important given the limited beam time available to the collaborations at PSI, RIKEN-RAL and J-PARC. Secondly, for the interpretation of the results. One can either extract the Zemach radius given a theory prediction for the proton-polarizability effect in the \(\mu \)H 1S hfs, or vice versa, extract the proton-polarizability effect with input for the Zemach radius. Furthermore, one can combine the precise measurements of the 1S hfs in H and \(\mu \)H to disentangle the Zemach radius and polarizability effects, leveraging radiative corrections as explained in Ref. [13], and compare their empirical values to theoretical expectations.

The biggest uncertainty in the theory prediction comes from proton-structure effects, entering through the two-photon exchange (TPE). These contain the above-mentioned Zemach radius and polarizability effects. Presently, they are evaluated within a “data-driven” dispersive approach [14,15,16]. While the dispersive method itself is rigorous, it requires sufficient experimental data to map out the proton spin structure functions \(g_1(x,Q^2)\) and \(g_2(x,Q^2)\) as full functions of the Bjorken variable x and the photon virtuality \(Q^2\). This has been the aim of a dedicated “Spin Physics Program” at Jefferson Lab [17,18,19,20,21] that recently extended the previously scarce data for \(g_2\) [22, 23].

In this work, we use an entirely different approach – the chiral perturbation theory (\(\chi \)PT) [24,25,26] – which has been successfully used to give a prediction for the proton-polarizability effect in the \(\mu \)H Lamb shift [27]. To be precise, we work in the framework of baryon chiral perturbation theory (B\(\chi \)PT) – the manifestly Lorentz-invariant formulation of \(\chi \)PT in the baryon sector [26, 28, 29] (see also [30, 31] for reviews). We show that the leading-order (LO) B\(\chi \)PT prediction for the polarizability effect in the hfs is effectively vanishing, thereby, in substantial disagreement with the data-driven evaluations.

The paper is organized as follows. In Sect. 2, we discuss the forward TPE, and in particular, the polarizability effect in the hfs. A new formalism where one splits into contributions from the longitudinal-transverse and helicity-difference photoabsorption cross sections of the proton, \(\sigma _{LT}\) and \(\sigma _{TT}\), is introduced in Eq. (12). It will be shown that this decomposition is advantageous for both the dispersive, as well as the effective field theory (EFT) calculations as it gives a cleaner access to the uncertainties. More details are given in Appendix A. In Sect. 3, we present our LO B\(\chi \)PT prediction for the polarizability effect in the hfs of H and \(\mu \)H, together with a detailed discussion of the uncertainty estimate. In Sect. 4, we compare our results to data-driven dispersive and heavy baryon effective field theory (HB EFT) calculations. In Sect. 5, the Zemach radius is extracted from H and \(\mu \)H spectroscopy based on our prediction for the polarizability effect. In Sect. 6, we discuss the TPE effect in the \(\mu \)H hfs in view of the forthcoming experiments. Full details of the theoretical prediction for the 1S \(\mu \)H hfs are collected in Appendix C. We finish with an outlook and conclusions.

2 Two-photon exchange in the hyperfine splitting

The (muonic-)hydrogen hfs receives contributions from QED-, weak- and strong-interaction effects:

$$\begin{aligned} E_{\textrm{hfs}}(nS)=\frac{E_\textrm{F}}{n^3}\left( 1+\Delta _\textrm{QED}+\Delta _\textrm{weak}+\Delta _\textrm{strong}\right) , \end{aligned}$$
(3)

where the leading-order in \(\alpha \) contribution is given by the Fermi energy:

$$\begin{aligned} E_\textrm{F}=\frac{8Z\alpha }{3a^3}\frac{1+\kappa }{mM}, \end{aligned}$$
(4)

with \(\alpha \) the fine-structure constant, Z the charge of the nucleus (in the following \(Z=1\) for the proton), m, M the lepton and proton masses, \(\kappa \) the anomalous magnetic moment of the proton, and \(a^{-1}=\alpha m_r\) the inverse Bohr radius, with \(m_r=mM/(m+M)\) the reduced mass. The strong-interaction effects arise from the composite structure of the proton. They begin to enter at \(\mathcal {O}(\alpha ^5)\), see for instance Ref. [14], where they are split into the Zemach-radius, recoil, and polarizability contributions:

$$\begin{aligned} \Delta _\textrm{strong}=\Delta _\textrm{Z}+\Delta _\textrm{recoil}+\Delta _\mathrm {pol.}\,, \end{aligned}$$
(5)

which can all be attributed to the forward TPE shown in Fig. 1. For a first comprehensive theory summary of the Lamb shift, fine and hyperfine structure in \(\mu \)H, including proton-structure dependent effects, we refer to Ref. [32]. The Zemach and recoil terms (\(\Delta _\textrm{Z}\) and \(\Delta _\textrm{recoil}\)) are elastic contributions with a proton in the intermediate state, see Fig. 1a. The diagram in Fig. 1b contains excited intermediate states (\(\pi N\), \(\Delta \)-isobar, etc.) represented by the ‘blob’. It generates the polarizability effect (\(\Delta _\mathrm {pol.}\)) that shall be evaluated in this work.

Fig. 1
figure 1

Two-photon-exchange diagram in forward kinematics: a elastic contribution; b polarizability contribution. The horizontal lines correspond to the lepton and the proton (bold), where the ‘blob’ represents all possible excitations. The crossed diagrams are not drawn

The forward TPE contribution to the hfs can be expressed through the spin-dependent forward doubly-virtual Compton scattering (VVCS) amplitudes, \(S_1\) and \(S_2\), cf. Eq. (A2). The latter can be related to the proton structure functions \(g_1\) and \(g_2\) in a dispersive approach, cf. Eqs. (A12) and (A11). A full derivation of the well-known formalism for the TPE contribution to the hfs can be found in Appendix A.

The largest TPE effect is due to the Zemach radius contribution:

$$\begin{aligned} \Delta _\textrm{Z}=-2\alpha m_r R_\textrm{Z}. \end{aligned}$$
(6)

The recoil contribution is one order of magnitude smaller [7], and will not be considered in this paper. It has been recently updated in Ref. [33]. The hfs is therefore best suited for a precision extraction of the Zemach radius, defined as the following integral over the electric and magnetic Sachs form factors \(G_{E}(Q^2)\) and \(G_{M}(Q^2)\) [32]:

$$\begin{aligned} R_\textrm{Z}=-\frac{4}{\pi }\int _0^\infty \frac{\textrm{d}Q}{Q^2} \left[ \frac{G_{E}(Q^2)G_{M}(Q^2)}{1+\kappa }-1\right] , \end{aligned}$$
(7)

where \(q^2=-Q^2\) is the photon virtuality. Equivalently, we can write:

$$\begin{aligned} R_\textrm{Z}= & {} \langle r\rangle _E+\langle r\rangle _M-\frac{2}{\pi ^2} \int _0^\infty \frac{\textrm{d}t}{t}\frac{{{\,\textrm{Im}\,}}G_M(t)}{1+\kappa }\nonumber \\{} & {} \times \int _0^\infty \frac{\textrm{d}t'}{t'}\frac{{{\,\textrm{Im}\,}}G_E(t')}{\sqrt{t}+\sqrt{t'}}, \end{aligned}$$
(8)

where the linear electric and magnetic radii are defined as:

$$\begin{aligned} \left\langle r \right\rangle _{E,M}=\frac{2}{\pi } \int _0^\infty \frac{\textrm{d}t}{t^{3/2}}{{\,\textrm{Im}\,}}G(t), \end{aligned}$$
(9)

with \({{\,\textrm{Im}\,}}G(t)\) the imaginary part of the normalized electric or magnetic Sachs form factor, \(G_{E,M}(Q^2)/G_{E,M}(0)\). As one can see from Eqs. (7) and (8), a measurement of the Zemach radius gives access to the magnetic properties of the proton.

The polarizability effect in the hfs is fully constrained by empirical information on the proton spin structure functions \(g_1(x,Q^2)\) and \(g_2(x,Q^2)\), and the Pauli form factor \(F_2(Q^2)\), functions of \(Q^2\) and the Bjorken variable \(x=Q^2/2M\nu \), where \(\nu \) is the photon energy in the lab frame. This is in contrast to the Lamb shift, where the knowledge of a subtraction function, \(T_1(0,Q^2)\) or \(T_1(iQ,Q^2)\) [34], is needed.Footnote 1 It reads:Footnote 2

$$\begin{aligned} \Delta _\mathrm {pol.}&=\Delta _1+\Delta _2= \frac{\alpha m}{2\pi (1+\kappa ) M} \big ( \delta _1+\delta _2\big ), \end{aligned}$$
(10a)
$$\begin{aligned} \delta _1&=2\int _0^\infty \frac{\textrm{d}Q}{Q}\left\{ \frac{5+4v_l}{(v_l+1)^2}\Big [4I_1(Q^2)+F_{2}^2(Q^2)\Big ]\right. \nonumber \\&\quad -\frac{32M^4}{Q^4}\int _0^{x_0}\textrm{d}x\, x^2g_1(x,Q^2)\frac{1}{(v_l+ v_x)(1+ v_x)(1+v_l)}\nonumber \\&\quad \times \left. \left( 4+\frac{1}{1+ v_x}+\frac{1}{v_l+1}\right) \right\} , \end{aligned}$$
(10b)
$$\begin{aligned} \delta _2&=96M^2\int _0^\infty \frac{\textrm{d}Q}{Q^3}\int _0^{x_0}\textrm{d}x\,g_2(x,Q^2) \left( \frac{1}{v_l+ v_x}-\frac{1}{v_l+1} \right) , \end{aligned}$$
(10c)

with \(x_0\) the inelastic threshold, \(v_l=\sqrt{1+\nicefrac {1}{\tau _l}}\), \(v_x=\sqrt{1+x^2\tau ^{-1}}\), \(\tau _l=\nicefrac {Q^2}{4m^2}\), \(\tau =\nicefrac {Q^2}{4M^2}\), and the generalized Gerasimov–Drell–Hearn (GDH) integral:

$$\begin{aligned} I_1(Q^2)=\frac{2M^2}{Q^2}\int _0^{x_0}\textrm{d}x\, g_1(x,Q^2)={\bar{I}}_1(Q^2)-F_{2}^2(Q^2)/4. \nonumber \\ \end{aligned}$$
(11)

Here, \({\bar{I}}_1\) is the polarizability part of \(I_1\). For the origin of the Pauli form factor in the above equations, see discussion in Appendix A.

As we will show in Sect. 3.2, instead of decomposing into \(\Delta _1\) and \(\Delta _2\), it is convenient to decompose into contributions from the longitudinal-transverse and helicity-difference cross sections \(\sigma _{LT}\) and \(\sigma _{TT}\):

$$\begin{aligned} \Delta _\mathrm {pol.}= & {} \Delta _{LT}+\Delta _{TT}+\Delta _{F_2}\nonumber \\= & {} \frac{\alpha m}{2\pi (1+\kappa ) M}\left( \delta _{LT}+\delta _{TT}+\delta _{F_2}\right) , \end{aligned}$$
(12a)

where we define:

$$\begin{aligned} \delta _{LT}&=\frac{4M}{\alpha \pi ^2}\int _0^\infty \!\textrm{d}Q\int _0^{x_0}\!\textrm{d}x\,\frac{1}{v_l+v_x}\frac{1}{x^2+\tau }\nonumber \\&\quad \times \left[ 1-\frac{1}{(1+v_l)(1+v_x)}\right] \sigma _{LT}(x,Q^2), \end{aligned}$$
(12b)
$$\begin{aligned} \delta _{TT}&=\frac{4M^2}{\alpha \pi ^2}\int _0^\infty \frac{\textrm{d}Q}{Q}\int _0^{x_0}\frac{\textrm{d}x}{x} \frac{1}{1+v_l}\nonumber \\&\quad \times \left[ \frac{2\tau }{x^2+\tau }+\frac{1}{(v_l+v_x)(1+v_x)}\right] \sigma _{TT}(x,Q^2), \end{aligned}$$
(12c)
$$\begin{aligned} \delta _{F_2}&=2\int _0^\infty \frac{\textrm{d}Q}{Q}\frac{5+4v_l}{(v_l+1)^2}\,F_{2}^2(Q^2). \end{aligned}$$
(12d)

Or equivalently, in terms of the VVCS amplitudes, we can write:

$$\begin{aligned} \delta _{LT}&=\frac{8M}{\alpha }\frac{1}{(2\pi )^3} \frac{1}{i}\int _{-\infty }^\infty \!\textrm{d}\nu \!\int \!\textrm{d}\textbf{q}\,\frac{1}{Q^4-4m^2\nu ^2}\left\{ {\bar{S}}_1(\nu ,Q^2)\nonumber \right. \\&\quad \left. +\frac{\nu }{M}{\bar{S}}_2(\nu ,Q^2)\right\} \nonumber \\&\quad -\frac{4}{m^2}\int _0^\infty \textrm{d}Q\,Q\,(v_l-1)\,F_{2}^2(Q^2), \end{aligned}$$
(13a)
$$\begin{aligned} \delta _{TT}&=\frac{4M}{\alpha }\frac{1}{(2\pi )^3} \frac{1}{i}\int _{-\infty }^\infty \!\textrm{d}\nu \!\int \!\textrm{d}\textbf{q}\,\nonumber \\&\quad \times \frac{1}{Q^4-4m^2\nu ^2}\left\{ \frac{\nu }{M}{\bar{S}}_2(\nu ,Q^2)-\frac{\nu ^2}{Q^2} {\bar{S}}_1(\nu ,Q^2)\right\} \nonumber \\&\quad -2\int _0^\infty \frac{\textrm{d}Q}{Q}\frac{1}{(v_l+1)^2}\,F_{2}^2(Q^2). \end{aligned}$$
(13b)

Here, \({\bar{S}}_i\) denotes the non-Born part of the amplitudes. An advantage of the B\(\chi \)PT calculation in this work is that the non-Born amplitudes can be calculated directly, and need not be constructed through the dispersive formalism. Furthermore, at the present order of our calculation in the B\(\chi \)PT power counting, there are no contributions to the elastic form factors, and thus, \(I_1\) in Eq. (11) is given by the polarizability part only.

3 Chiral loops

Assuming B\(\chi \)PT is an adequate theory of low-energy nucleon structure, it should be well applicable to atomic systems, where the relevant energies are naturally small. In Ref. [27], the polarizability effect in the \(\mu \)H Lamb shift has been successfully predicted at LO in B\(\chi \)PT. Here, we extend this calculation to the polarizability effect in the hfs. This requires the spin-dependent non-Born VVCS amplitudes, \({\bar{S}}_1\) and \({\bar{S}}_2\), at chiral \(\mathcal {O}(p^3)\) in the B\(\chi \)PT power counting.

Figure 1 in Ref. [27] shows the leading polarizability effect given by the TPE diagrams of elastic lepton-proton scattering with one-loop \(\pi N\) insertions. For the Compton-like processes, it is convenient to use the chirally-rotated leading B\(\chi \)PT Lagrangian for the pion \(\pi ^a(x)\) and nucleon N(x) fields [38]:

(14)

where \(\gamma _5=i \gamma ^0 \gamma ^1 \gamma ^2 \gamma ^3\), \(g_A\simeq 1.27\) [39] is the axial coupling of the nucleon, \(f_\pi \simeq 92.21 \) MeV is the pion-decay constant, \(\tau ^a\) are the Pauli matrices, \(M_N\simeq 938.27\) MeV and \(m_\pi \simeq 139.57\) MeV are the nucleon and pion masses.Footnote 3 As described in Ref. [27], the Born part is separated from the \(\mathcal {O}(p^3)\) VVCS amplitudes by subtracting the on-shell pion-loop \(\gamma N N\)-vertex in the one-particle-reducible VVCS graphs, see diagrams (b) and (c) in Figure 1 of Ref. [27]. For more details on the B\(\chi \)PT framework, we refer to Refs. [40,41,42], where the complete next-to-next-to-leading-order (NNLO) in the \(\delta \)-expansion [43] B\(\chi \)PT calculation of the spin-independent and spin-dependent nucleon VVCS amplitudes can be found.Footnote 4

In practice, most results here were obtained based on our B\(\chi \)PT prediction for the \(\pi N\)-production channel in the structure functions \(g_i\), given in Ref. [42, Appendix B]. It has been verified that the results agree with the calculation based on the VVCS amplitudes \({\bar{S}}_i\).

3.1 Numerical results

Our LO B\(\chi \)PT prediction for the polarizability effect in the 1S hfs of H and \(\mu \)H amounts to:

$$\begin{aligned} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \textrm{H})= & {} 0.69 (2.03)\,\textrm{peV}, \end{aligned}$$
(15a)
$$\begin{aligned} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \mu \textrm{H})= & {} 6.8(11.4)\,\upmu \textrm{eV}. \end{aligned}$$
(15b)

The error estimate will be described and motivated in the subsequent sections. The corresponding contributions to the nS hfs are trivially obtained through a \(1/n^3\) scaling, as can be seen from Eqs. (3) and (4). Splitting into contributions from the spin structure functions \(g_1\) and \(g_2\), we obtain:

$$\begin{aligned}{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \textrm{H}, \Delta _1)=0.3(3.1)\,\textrm{peV},\nonumber \\{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \textrm{H}, \Delta _2)=0.4(1.0)\,\textrm{peV}, \end{aligned}$$
(16a)
$$\begin{aligned}{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \mu \textrm{H}, \Delta _1)=5.2(16.5)\,\upmu \textrm{eV},\nonumber \\{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \mu \textrm{H},\Delta _2)=1.6(5.2)\,\upmu \textrm{eV}. \end{aligned}$$
(16b)

Strikingly, the contributions from the longitudinal-transverse and helicity-difference cross sections \(\sigma _{LT}\) and \(\sigma _{TT}\):

$$\begin{aligned}{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \textrm{H}, \Delta _{LT})=5.1(1.5)\,\textrm{peV},\nonumber \\{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S, \textrm{H}, \Delta _{TT})=-4.4(1.3)\,\textrm{peV},\end{aligned}$$
(16c)
$$\begin{aligned}{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S,\mu \textrm{H}, \Delta _{LT})= 30.0(9.0)\,\upmu \textrm{eV},\nonumber \\{} & {} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\mathrm {pol.}}(1S,\mu \textrm{H},\Delta _{TT})=-23.2(7.0)\,\upmu \textrm{eV}, \end{aligned}$$
(16d)

are one order of magnitude larger than the total, and differ in their respective signs. This indicates a cancellation of LO contributions between \(\Delta _{LT}\) and \(\Delta _{TT}\).

Including in addition the correction due to electron vacuum polarization (eVP) in the TPE diagram, see Fig. 10 and discussion in Appendix B, gives a negligible effect within the present uncertainties:

$$\begin{aligned} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\text {pol. + eVP}}(1S, \textrm{H})= & {} 0.72 (2.07)\,\textrm{peV}, \end{aligned}$$
(17a)
$$\begin{aligned} E_\textrm{hfs}^{{\langle \textrm{LO}\rangle }\,\text {pol. + eVP}}(1S, \mu \textrm{H})= & {} 7.0(11.6)\,\upmu \textrm{eV}. \end{aligned}$$
(17b)

Nevertheless, it is important in view of the anticipated 1 ppm accuracy (corresponding to \(\sim 0.2\,\upmu \)eV) of the \(\mu \)H 1S hfs measurement by the CREMA collaboration [8]. We therefore include the additional \(\Delta _\mathrm {pol.}^\textrm{eVP} (\text {H})=0.01\) ppm and \(\Delta _\mathrm {pol.}^\textrm{eVP}(\mu \text {H})=1\) ppm on top of \(\Delta _\mathrm {pol.} (\text {H})=0.12(35)\) ppm and \(\Delta _\mathrm {pol.}(\mu \text {H})=37(62)\) ppm.

To understand why the contributions from \(\sigma _{LT}\) and \(\sigma _{TT}\) largely cancel in \(\Delta _\mathrm {pol.}\), we study the heavy-baryon (HB) limit of the spin-dependent VVCS amplitudes [48]. Expanding the LO B\(\chi \)PT expression for the \({\bar{S}}_1\) amplitude in \(\mu =m_\pi /M_N\) while keeping the ratio of the light scales \(\tau _\pi =\nicefrac {Q^2}{4m_\pi ^2}\) fixed, one obtains:

$$\begin{aligned} {\bar{S}}_1(0,Q^2)\!{\mathop {=}\limits ^{\text {HB}}}\!-\frac{3\,\alpha \, g_A^2}{16 f_\pi ^2} \,m_\pi \!\left[ 1-(1+\tau _\pi )\frac{\arctan \sqrt{\tau _\pi }}{\sqrt{\tau _\pi }}\right] \!.\nonumber \\ \end{aligned}$$
(18)

We then take a closer look at the first term in the low-energy polarizability expansion:

$$\begin{aligned} \frac{{\bar{S}}_1(0,Q^2)}{Q^2}\Big \vert _{Q^2\rightarrow 0}= & {} M_N\Big \{\gamma _{E1M2}-3\alpha M_N\Big [P^{\prime (M1,M1)1}(0)\nonumber \\{} & {} +P^{\prime (L1,L1)1}(0)\Big ]\Big \}. \end{aligned}$$
(19)

The HB\(\chi \)PT predictions for the proton polarizabilities [49,50,51,52,53,54] entering Eq. (19) read:

$$\begin{aligned} \gamma _{E1M2}= & {} \frac{\alpha g_A^2}{(4\pi f_\pi )^2}\frac{1}{m_\pi ^2}\frac{1}{6}\left[ 1-\frac{7\pi }{4}\frac{m_\pi }{M_N}\right] \!, \end{aligned}$$
(20a)
$$\begin{aligned} P^{\prime (M1,M1)1}(0)= & {} \frac{g_A^2}{(4\pi f_\pi )^2}\frac{1}{m_\pi ^2}\frac{1}{18M}\left[ -1+\frac{7\pi }{4}\frac{m_\pi }{M_N}\right] \!, \nonumber \\ \end{aligned}$$
(20b)
$$\begin{aligned} P^{\prime (L1,L1)1}(0)= & {} \frac{g_A^2}{(4\pi f_\pi )^2}\frac{1}{m_\pi ^2}\frac{1}{9M}\left[ 1-\frac{17\pi }{8}\frac{m_\pi }{M_N}\right] \!. \nonumber \\ \end{aligned}$$
(20c)

We can see that the leading terms in the chiral expansion are of \(\mathcal {O}(1/m_\pi ^2)\). They cancel among the different polarizabilities, thus, Eq. (21) becomes a subleading contribution:

$$\begin{aligned} \frac{{\bar{S}}_1(0,Q^2)}{Q^2}\Big \vert _{Q^2\rightarrow 0}{\mathop {=}\limits ^{\text {HB}}}\frac{\alpha g_A^2}{32 f_\pi ^2}\frac{1}{m_\pi }. \end{aligned}$$
(21)

Accordingly, one would expect the chiral loops in the hfs to be small. Indeed, the LO B\(\chi \)PT prediction in Eq. (15) is essentially vanishing, where the small number is mainly a remnant of higher orders in the HB expansion. This has to be taken into account in the uncertainty estimate.

Fig. 2
figure 2

The amplitude \({\bar{S}}_1(0,Q^2)\) at LO in B\(\chi \)PT (blue) and HB\(\chi \)PT (red). The dashed lines show the corresponding slope terms, i.e., the first terms in the expansion in powers of \(Q^2\). The B\(\chi \)PT slope has been calculated from the polarizabilities given in Ref. [55, Table I], the HB\(\chi \)PT slope is given in Eq. (21)

Note that the HB expansion above has been introduced for instructive purposes only, but is not entering our calculation of the polarizability effect. The HB\(\chi \)PT prediction of the \(S_1(0,Q^2)\) amplitude, Eq. (18), raises with Q, thus, its contribution to the hfs will be divergent. This can be seen from Fig. 2, where we compare the chiral-loop contribution to \({\bar{S}}_1(0,Q^2)\) as predicted by B\(\chi \)PT and HB\(\chi \)PT, respectively.

3.2 Uncertainty estimate

Fig. 3
figure 3

Polarizability effect on the 1S hyperfine splitting in H (left panel) and \(\mu \)H (right panel): Cutoff dependence of the leading-order \(\pi N\)-loop contribution. The total results, Eqs. (15a) and (15b), are indicated by the black arrows

B\(\chi \)PT is a low-energy EFT of QCD describing strong interactions in terms of hadronic degrees of freedom (pion, nucleon, \(\Delta (1232)\) resonance). An important requirement for a reliable B\(\chi \)PT prediction is that the contribution from beyond the scale at which this EFT is safely applicable, i.e., \(Q_\textrm{max}>m_\rho =775\) MeV, has to be small. For the LO B\(\chi \)PT prediction of the polarizability effect in the \(\mu \)H Lamb shift [27], the contribution from beyond this scale was less than \(15\%\), thus, within the expected uncertainty. Comparing the TPE master formulas for Lamb shift and hfs, Eqs. (A8) and (A7), the weighting function in the former has a stronger suppression for large \(Q^2\). It is therefore important to verify that the same quality criterion still holds for the hfs prediction presented here.

Let us consider the polarizability effect as a running integral with a momentum cutoff \(Q_\textrm{max}\), as shown in Fig. 3. The convergence of the \(\Delta _1\) (green line) contribution, as well as of the total \(\Delta _\mathrm {pol.}\) (black line), is poor. They display a sign change of the running integral at energies above \(Q_\textrm{max}\approx 2\) GeV (\(\mu \textrm{H}\)) and \(\approx 4\) GeV (\(\textrm{H}\)), respectively. \(\Delta _2\) (red line) converges better. Its contributions from above \(Q_\textrm{max}=m_\rho \) amount to \(42\%\) (H) and \(26\%\) (\(\mu \textrm{H}\)), respectively.

The bad high-momentum asymptotics indicated above are merely an artefact of the conventional splitting into \(\Delta _1\) and \(\Delta _2\). For the alternative splitting into \(\Delta _{LT}\) and \(\Delta _{TT}\), introduced in Eq. (12), the cut-off dependence improves considerably. For \(\Delta _{TT}\) (blue line), the contribution from above \(Q_\textrm{max}=m_\rho \) amounts to less than \(4\%\) for both hydrogens. For \(\Delta _{LT}\) (orange line), the high-energy contributions are less than \(35\%\) (\(\mu \textrm{H}\)) and \(32\%\) (\(\textrm{H}\)), respectively. In this way, our results are in agreement with the natural expectation of uncertainty for a LO prediction, \(30\%\) [\(\simeq (M_\Delta -M)/\textrm{GeV}\)], in B\(\chi \)PT with inclusion of the \(\Delta \) resonance. Based on this analysis, we decided to assign errors of \(30\%\) to the \(\sigma _{LT}\) and \(\sigma _{TT}\) contributions, and propagate them to \(\Delta _1\), \(\Delta _2\) and \(\Delta _\mathrm {pol.}\). It is interesting to note that in this way the uncertainty of \(\Delta _1\) is larger than the uncertainty of \(\Delta _\mathrm {pol.}\). This can be understood from the opposite signs of the \(\Delta _{i,j}\) contributions, where \(i=1,2\) and \(j=LT,TT\), on the example of \(\mu \)H:

$$\begin{aligned} \Delta _{1,\,LT}=&227\,\text {ppm},\quad \Delta _{1,\,TT}&=-198\,\text {ppm},\end{aligned}$$
(22a)
$$\begin{aligned} \qquad \Delta _{2,\,LT}=&-62\,\text {ppm},\quad \Delta _{2,\,TT}&=71\,\text {ppm}. \end{aligned}$$
(22b)

4 Comparison with other results

In this section, we compare our LO B\(\chi \)PT prediction for the polarizability effect in the \(\textrm{H}\) and \(\mu \textrm{H}\) hfs to other available evaluations. Furthermore, we study the contribution of the \({\bar{S}}_1(0,Q^2)\) subtraction function and the scaling of the polarizability effect with the lepton mass.

4.1 Heavy-baryon effective field theory

Let us start by comparing our B\(\chi \)PT prediction to other model-independent calculations using HB EFT [56,57,58].Footnote 5 First results for the elastic and inelastic TPE effects on the hfs in H and \(\mu \textrm{H}\) have been obtained in Ref. [56], where the contribution of the leading chiral logarithms, \(\mathcal {O}(m^3 \alpha ^5/M^2\times \left[ \ln m_{\pi }, \ln \Delta , \ln m\right] )\), was calculated in HB EFT matched to potential NRQED. At this order in the chiral expansion the polarizability effects in the hfs from pion-nucleon and pion-delta loops cancel each other in the large-\(N_c\) limit, while the \(\Delta \) exchange cancels part of the point-like corrections, see also Ref. [48]. The analytical results presented in Refs. [56, 59] motivate the relative size of the Zemach and polarizability corrections.

Updated HB EFT predictions for the TPE effects on the hydrogen spectra can be found in Refs. [57, 58, 60]. In Ref. [58], the difference between the pion-loop polarizability contributions in \(\textrm{H}\) and \(\mu \textrm{H}\) is quoted as

$$\begin{aligned} \Delta c_4\equiv c_{4,\textrm{pol}}^{\mu \textrm{H}}-c_{4,\textrm{pol}}^{\textrm{H}}=0.17(9), \; \end{aligned}$$
(23)

where \(c_4\) is a Wilson coefficient linked to the hfs in the following way:

$$\begin{aligned} E_\textrm{hfs}(nS)=\frac{E_\textrm{F}}{n^3}\frac{3\alpha }{2 \pi (1+\kappa )}\frac{m}{M}\,c_4. \end{aligned}$$
(24)

For comparison, we can evaluate the analogue of \(\Delta c_4\) from other theory predictions for the polarizability contribution. Within errors, our LO B\(\chi \)PT prediction agrees with this result:

$$\begin{aligned} \Delta c_4= 0.09(0.46). \end{aligned}$$
(25)

Here, the uncertainties of the H and \(\mu \)H predictions have been combined in quadrature to estimate the error on their difference. For comparison, from the data-driven dispersive evaluations of Carlson et al. [14], one can deduce:

$$\begin{aligned} \Delta c_4=-\,0.27(1.53), \end{aligned}$$
(26)

where we combined all errors quoted in Ref. [14] and estimated the error on \(\Delta c_4\) in the same way as done above.

4.2 Data-driven dispersive evaluations

Fig. 4
figure 4

Comparison of available results for the polarizability effect on the hyperfine splitting in H and \(\mu \)H (upper and lower panel) [14,15,16]

There is a clear discrepancy between the B\(\chi \)PT prediction, presented here, and the conventional data-driven dispersive evaluations. The dispersive evaluations rely on empirical information for the inelastic proton spin structure functions, the elastic Pauli form factor and polarizabilities. The discrepancy can be seen from Fig. 4, where our LO B\(\chi \)PT prediction for the polarizability effect in the \(\textrm{H}\) and \(\mu \textrm{H}\) hfs is compared to the available dispersive evaluations. Adding an estimate for the next-to-leading-order (NLO) effect of the \(\Delta (1232)\) resonance [61], obtained from large-\(N_c\) relations for the nucleon-to-delta transition form factors, to the model-independent LO B\(\chi \)PT prediction will improve agreement for \(\Delta _2\) but not for \(\Delta _\textrm{pol}\).

The origin of this discrepancy has to be understood in order to give a reliable prediction of the TPE effect in the \(\mu \)H hfs, needed for the forthcoming experiments. Part of the discrepancy might be due to underestimated uncertainties. An evaluation of the total polarizability effect suffers from cancellations in two places: firstly, between contributions from the cross sections \(\sigma _{LT}\) and \(\sigma _{TT}\), secondly, between the elastic Pauli form factor \(F_2\) and the inelastic structure functions in the low-Q region. Each of these cancellations reduces the result by an order of magnitude. In the calculation presented here, the former is taken into account by estimating the uncertainty due to higher-order corrections in the B\(\chi \)PT power counting based on the large \(\sigma _{LT}\) and \(\sigma _{TT}\) contributions, see discussion in Sect. 3.2. In the dispersive approach, it would be important to take into account correlations between parametrizations of the \(g_1\) and \(g_2\) structure functions, which both rely on measurements of \(\sigma _{LT}\) and \(\sigma _{TT}\). The latter cancellations in the low-Q region will be discussed in the following subsection.

4.3 Low-Q region and contribution of the \({\bar{S}}_1(0,Q^2)\) subtraction function

One major drawback of the data-driven dispersive evaluations is that they require independent input for the inelastic spin structure functions or related polarizabilities, and the elastic Pauli form factor. Our notation in Eq. (10b) conveniently illustrates how the zeroth moment of the inelastic spin structure function \(g_1\) and the elastic Pauli form factor \(F_2\) combine in the subtraction function:

$$\begin{aligned} {\bar{S}}_1(0,Q^2)= & {} \frac{2\pi \alpha }{M}\left[ F_{2}^2(Q^2)+4I_1(Q^2)\right] \nonumber \\= & {} \frac{8\pi \alpha }{M}{\bar{I}}_1(Q^2). \end{aligned}$$
(27)

At \(Q^2=0\), this is zero, because the Pauli form factor, \(F_2(0)=\kappa \), and the generalized GDH integral, \(I_1(0)=-\kappa ^2/4\), so the two terms cancel exactly. A NLO B\(\chi \)PT prediction of the slope amounts to: \([\overline{I}_1]'(0)=0.39(4)\,\textrm{GeV}^{-2}\) [42]. It can be expressed through a combination of lowest-order spin [\(\gamma _{E1M2}\)] and generalized polarizabilities [\(P^{\prime (M1,M1)1}\) (0) and \(P^{\prime (L1,L1)1}(0)\)], see Eq. (19). In the HB\(\chi \)PT expansion, we showed that the leading \(\mathcal {O}(\nicefrac {1}{m_\pi ^2})\) terms cancel among these individual polarizabilities, given in Eq. (20), turning the result subleading in \(\mathcal {O}(\nicefrac {1}{m_\pi })\), see Eq. (21). We can conclude that there is a strong cancellation between the elastic and inelastic contributions, which continues for higher \(Q^2\).

The contribution of \({\bar{S}}_1(0,Q^2)\) to the hfs is given by:

$$\begin{aligned} E_\textrm{hfs}^{\langle {\bar{S}}_1(0,Q^2)\rangle }(nS)= & {} \frac{E_\textrm{F}}{n^3} \frac{\alpha m}{\pi (1+\kappa ) M}\int _0^\infty \frac{\textrm{d}Q}{Q}\frac{5+4v_l}{(v_l+1)^2}\nonumber \\{} & {} \times \left[ 4I_1(Q^2)+F_{2}^2(Q^2)\right] .\qquad \end{aligned}$$
(28)

Evaluations of this subtraction function contribution with empirical parametrizations for \(g_1(x,Q^2)\) and \(F_2(Q^2)\) tend towards larger values than the LO B\(\chi \)PT prediction. A partial calculation of the TPE effect at NLO in B\(\chi \)PT, considering only the one-loop box diagram with intermediate \(\Delta (1232)\)-excitation, will lower the theoretical prediction for the polarizability contribution from B\(\chi \)PT further, and in fact, turn it into a negative contribution [61, 62]. Any imprecision in the empirical parametrizations, and thus in the cancellation between the elastic and inelastic moments, is enhanced by the 1/Q prefactor in the infrared region of the integral in Eq. (28). Therefore, the B\(\chi \)PT calculation, where the polarizability effect can be accessed directly through the non-Born part of the VVCS amplitudes and does not rely on input from separate measurements, has a clear advantage in this regard.

To illustrate this further, we reproduce the estimate for \(\Delta _1\) in the low-Q region from Ref. [14] (see references therein for the details on the input). In this region, no experimental data from EG1 [63, 64] exist and the integral is completed by interpolating data between higher \(Q^2\) and \(Q^2=0\), making use of empirical values for the static polarizabilities. For \(Q^2 \in \left\{ 0,Q_\textrm{max}^2\right\} \) with \(Q_\textrm{max}^2=0.0452\,\textrm{GeV}^2\), the approximate formulas read [14]:

$$\begin{aligned} \delta _1(\text {H})\sim & {} \left( \underbrace{-\frac{3}{4}\kappa ^2 r_\textrm{Pauli}^2}_{\rightarrow \,-2.19} +\underbrace{18M^2 c_{1B}}_{\rightarrow \,3.54} \right) Q_\textrm{max}^2=1.35(90),\nonumber \\ \end{aligned}$$
(29a)
$$\begin{aligned} \delta _1(\mu \text {H})\sim & {} \left[ \underbrace{-\frac{1}{3}\kappa ^2 r_\textrm{Pauli}^2}_{\rightarrow \, -1.45} +\underbrace{8 M^2 c_{1}}_{\rightarrow \, 2.13} \underbrace{-\frac{M^2}{3 \alpha } \gamma _0}_{\rightarrow \, 0.18} \right] \nonumber \\{} & {} \times \int _0^{Q_\textrm{max}^2} \textrm{d}Q^2 \beta _1(\tau _\mu )=0.86(69), \end{aligned}$$
(29b)

where

$$\begin{aligned} \beta _1(\tau _\mu )=-3 \tau _\mu +2 \tau _\mu ^2 +2 (2-\tau _\mu ) \sqrt{\tau _\mu (\tau _\mu +1)}. \end{aligned}$$
(30)

Note that the formulas for H and \(\mu \)H differ, because one sets \(m_e=0\). The first terms are related to the elastic Pauli form factor, where \(r_\textrm{Pauli}=-\nicefrac {6}{\kappa }\, \nicefrac {\textrm{d}}{\textrm{d}Q^2}\,F_2(Q^2)\vert _{Q^2=0}\) is the Pauli radius. The other terms are related to the \(g_1\) contribution. Considering the more general Eq. (29b), they are defined through:

$$\begin{aligned} I_1(Q^2)= & {} M \int _{\nu _0}^\infty \frac{\textrm{d}\nu }{\nu ^2}\,g_1(\nu ,Q^2)\nonumber \\= & {} -\kappa ^2/4+2M^2 c_1 Q^2+\mathcal {O}(Q^4),\end{aligned}$$
(31a)
$$\begin{aligned} \gamma _0= & {} \frac{2\alpha }{M}\int _{\nu _0}^\infty \frac{\textrm{d}\nu }{\nu ^4}\,g_1(\nu ,0). \end{aligned}$$
(31b)

The strong cancellation between elastic and inelastic contributions, observed in Eq. (29), can be a source of uncertainty.

In addition, the quality of the low-Q approximation is rather poor. We can test it at LO in B\(\chi \)PT. Recall that at this order in the B\(\chi \)PT power counting, there is no contribution to the elastic form factors. Therefore, only the inelastic structure function \(g_1\) enters. Our results are shown in Fig. 5. The approximate formulas in Eq. (29) give a \(50\%\) (\(67\%\)) larger value for \(\delta _1\) in the region of \(Q^2<0.0452\) GeV\(^2\) in the case of \(\mu \)H (H). Therefore, in the data-driven dispersive approach one has to properly account for the uncertainty introduced by the approximate formulas, as well as from cancellations between elastic and inelastic contributions.

Fig. 5
figure 5

The polarizability contribution \(\delta _1\) in the low-Q region for hydrogen (red) and muonic hydrogen (blue). The solid lines are the exact results according to Eq. (10b) with an upper cut on the Q integration. The dotted and dashed lines are evaluated with the approximate formulas for hydrogen and muonic hydrogen, respectively, see Eq. (29)

4.4 Scaling with lepton mass

It is customary to use the high-precision measurement of the 1S hfs in H [65, 66]:

$$\begin{aligned} E^{\,\text {exp.}}_{1S\text {-hfs}}(\textrm{H})= & {} 1\,420.405\,751\,768(1)\,\text {MHz}, \end{aligned}$$
(32)

to refine the prediction of the TPE in the \(\mu \)H hfs [16, 58] or the prediction of the total \(\mu \)H hfs [13]. We will do the same in Sect. 6. The strategies in Refs. [13, 16, 58] are slightly different, but all make statements about the scaling of various contributions to the hfs in a hydrogen-like atom when varying the lepton mass \(m_\ell \).

In Fig. 6, we study the scaling of the polarizability effect based on our LO B\(\chi \)PT prediction. In the left panel, we assume that the \(\Delta _i\) (with \(i=1,2,LT,TT\) and pol.) are scaling with the reduced mass \(m_r\). In the right panel we assume that the \(\delta _i\) are independent of the lepton mass, thus, \(\Delta _i\) would be scaling with \(m_\ell \). The curves in the upper (lower) panel are normalized for H (\(\mu \)H), so they are fixed to 1 at \(m_\ell =m_e\) (\(m_\ell =m_\mu \)). If the polarizability effect would scale according to our assumptions, i.e., \(\propto m_r\) or \(\propto m_\ell \), all curves would be constantly 1. We can see that the scaling works best for the contributions from \(\sigma _{LT}\) and \(\sigma _{TT}\), which are large in their absolute values. Considering the total, in which the contributions from \(\sigma _{LT}\) and \(\sigma _{TT}\) cancel by about one order of magnitude, the scaling violation is enhanced by about one order of magnitude in relative terms. The same enhancement of the scaling violation can be observed for the numerically small contributions from \(g_1\) and \(g_2\). Comparing left and right panels, the B\(\chi \)PT predictions seem to support the assumption that \(\Delta _{LT}\) and \(\Delta _{TT}\) are scaling with \(m_r\). For \(\Delta _{LT}\), the scaling is nearly perfect. For \(\Delta _{TT}\), we observe a violation of the scaling that is increasing with lepton mass. The approximation \(\Delta _{TT}(\mu \text {H}) \sim m_r(\mu \text {H}) / m_r(\text {H})\,\Delta _{TT}(\mu \text {H})\) holds at the level of \(10\%\). The approximation holds on a similar level after including an estimate for the NLO effect of the \(\Delta (1232)\) resonance [61].

Fig. 6
figure 6

Scaling of \(\delta _i\) and \(\Delta _i/m_r\) (with \(i=1,2,LT,TT\) and pol.), as a function of the lepton mass \(m_\ell \)

5 Extraction of the Zemach radius from spectroscopy

The TPE, entering the hfs, can be decomposed into Zemach radius, polarizability and recoil contributions, as described in Eq. (5). On top of the \(\mathcal {O}(\alpha ^5)\) TPE, we consider the leading radiative corrections given by eVP, see Fig. 10 and discussion in Appendix B. Our prediction for the polarizability effect in the hfs, which is smaller than the conventional results from data-driven dispersive evaluations, also implies a smaller proton Zemach radius as previously determined from spectroscopy, cf. Eq. (1). In the following, we will extract the Zemach radius from the precisley measured 1S hfs in H, see Eq. (32), and the 2S hfs in \(\mu \)H [5]:

$$\begin{aligned} E^{\,\text {exp.}}_\text {HFS}(2S, \mu \textrm{H})=22.8089(51) \,\text {meV}. \end{aligned}$$
(33)

We use the theory predictions for the 1S hfs in H [13]:

$$\begin{aligned} E_{\text {hfs}}(1S, \text {H})= & {} \Bigg [1\,420\,453.106(10) \underbrace{-54.430(7)\,\left( \frac{R_{\textrm{Z}}}{\text {fm}}\right) +E_\textrm{F}\,\Big (0.99807(13)\,\Delta _\textrm{recoil}+1.00002\,\Delta _\mathrm {pol.}\Big )}_{\text {TPE including radiative corrections}}\Bigg ]\,\text {kHz} \end{aligned}$$
(34)

and the 2S hfs in \(\mu \)H [13]:

$$\begin{aligned} E_{\text {hfs}}(2S, \mu \text {H})= \Bigg [22.9584(8)\underbrace{-0.16319(2)\left( \frac{R_{\textrm{Z}}}{\text {fm}}\right) +\frac{E_\textrm{F}}{8}\,\Big (1.01580(4)\,\Delta _\textrm{recoil}+1.00326\,\Delta _\mathrm {pol.}\Big )}_{\text {TPE including radiative corrections}}\Bigg ]\,\text {meV}, \end{aligned}$$
(35)

with the recently re-evaluated \(\mathcal {O}(\alpha ^5)\) recoil correction [33]:

$$\begin{aligned} \Delta _\textrm{recoil}(\textrm{H})= & {} 5.269\,{}^{+0.017}_{-0.004}\;\text {ppm}, \end{aligned}$$
(36a)
$$\begin{aligned} \Delta _\textrm{recoil}(\mu \textrm{H})= & {} 837.6\,{}^{+2.8}_{-1.0}\;\text {ppm}, \end{aligned}$$
(36b)
Table 1 Determinations of the proton Zemach radius \(R_{\textrm{Z}}\), in units of fm

up to a factor 3 more precise than the previous best determination [67] based on the electromagnetic form factors obtained from dispersion theory [68]. An itemized list of contributions to the 2S hfs in \(\mu \)H is given in Table 3 of Appendix C. From the LO B\(\chi \)PT prediction for the polarizability effect, including also the eVP in Eq. (17), we obtain:

$$\begin{aligned} R_\textrm{Z}(\textrm{H})= & {} 1.010(9)\,\textrm{fm}, \end{aligned}$$
(37a)
$$\begin{aligned} R_\textrm{Z}(\mu \textrm{H})= & {} 1.040(33)\,\textrm{fm}. \end{aligned}$$
(37b)

This can be compared to other determinations of the proton Zemach radius collected in Table 1.Footnote 6 The radii we find are in agreement with the proton form factor analysis from Ref. [69], which uses the proton charge radius from the \(\mu \)H Lamb shift [5] as a constraint for their fit.

Figure 7 shows how the Zemach and charge radius of the proton are correlated. It suggests that a “smaller” charge radius, as seen initially in the \(\mu \)H Lamb shift by the CREMA collaboration [5] (red line), comes with a “smaller” Zemach radius. The dashed black curve is calculated with a dipole form, \(G(Q^2)\propto (1+Q^2/\Lambda ^2)^{-2}\), for the electric and magnetic Sachs form factors, by varying \(\Lambda \). The light red and orange bands show \(R_Z\) as extracted by us, Eq. (37), based on the LO B\(\chi \)PT prediction for the polarizability effect in the hfs.

Fig. 7
figure 7

Correlation between the Zemach and charge radius of the proton. Our extractions based on LO B\(\chi \)PT are compared to results from Lin et al. [68], Borah et al. [69], CREMA [5], Distler et al. [70], Kelly [71], Bradford et al. [72], Arrington et al. [73], and Arrington and Sick [74]

6 Theory prediction for the ground-state hyperfine splitting in \(\mu \)H

The upcoming measurements of the 1S hfs in \(\mu \)H [8,9,10,11] crucially rely on a precise theory prediction. The limiting uncertainty is given by the TPE, which is conventionally split into Zemach radius, polarizability and recoil contributions [13]:

$$\begin{aligned} E_{\text {hfs}}(1S, \mu \text {H})= \Big [183.797(7)\underbrace{-1.30653(17)\left( \frac{R_{\textrm{Z}}}{\text {fm}}\right) + E_\textrm{F}\,\Big (1.01656(4)\,\Delta _\textrm{recoil}+1.00402\,\Delta _\mathrm {pol.}\Big )}_{\text {TPE including radiative corrections}}\Big ]\,\text {meV}, \end{aligned}$$
(38)
Fig. 8
figure 8

Two-photon-exchange effect on the 1S hyperfine splitting in \(\mu \)H [14, 16, 58]

Fig. 9
figure 9

Predictions for the 1S hyperfine splitting in \(\mu \)H [13, 16, 58], compared to the projected uncertainty of the planned CREMA measurement (red vertical line)

see Appendix C and Table 2 for an itemized list of the individual contributions. As explained in Sect. 4.4, it is customary to refine the theory prediction of 1S hfs in \(\mu \)H with the help of the high-precision measurement of the 1S hfs in H. We do so by combining our B\(\chi \)PT prediction for the polarizability effect in the \(\mu \)H hfs, Eq. (17b), and the Zemach radius extracted from H spectroscopy, Eq. (37a), based on the same prediction for the polarizability effect in the H hfs. We arrive at:

$$\begin{aligned} E_{\text {hfs}}(1S, \mu \text {H})= & {} 182.640(18)\,\textrm{meV}, \end{aligned}$$
(39a)
$$\begin{aligned} E^\text {TPE}_{\textrm{hfs}}(1S, \mu \text {H})= & {} -\,1.157(16)\,\textrm{meV}, \end{aligned}$$
(39b)

where \(E^\text {TPE}_{\textrm{hfs}}\) corresponds to the TPE including radiative corrections and recoil corrections from Ref. [33], as indicated by the curly brace in Eq. (38).

In Figs. 8 and 9, we compare our predictions to results from data-driven dispersive evaluations [14, 16] and HB EFT [58]. While almost all available predictions for the total hfs in \(\mu \)H are in agreement after the H refinement procedure, further improvements of the theory are required in order to compete with the anticipated experimental accuracy.

7 Conclusions and outlook

We have presented the LO B\(\chi \)PT prediction for the \(\mathcal {O}(\alpha ^5)\) polarizability effect on the hfs in H and \(\mu \)H, see Eq. (15). Contrary to the data-driven evaluations, the B\(\chi \)PT prediction is compatible with zero. This was expected from the HB\(\chi \)PT limit of the VVCS amplitudes, in particular \({\bar{S}}_1(0,Q^2)\), which partially display a cancellation of the leading order in the chiral expansion of small \(m_\pi \), see discussion in Sect. 3.1. The small polarizability effect is then mainly a remnant of higher orders in the HB expansion.

A new formalism where the polarizability effect is split into contributions from the longitudinal-transverse and helicity-difference cross sections, \(\sigma _{LT}\) and \(\sigma _{TT}\), instead of contributions from the spin structure functions, \(g_1\) and \(g_2\), has been introduced in Eq. (12). It was shown that these contributions, \(\Delta _{LT}\) and \(\Delta _{TT}\), cancel by one order of magnitude when combined into \(\Delta _\mathrm {pol.}\). Only \(\Delta _{LT}\) and \(\Delta _{TT}\) are good observables in the B\(\chi \)PT framework, for which the contributions from beyond the scale at which this EFT is safely applicable, \(Q_\textrm{max}>m_\rho =775\) MeV, are within the expected uncertainty. In addition, only \(\Delta _{LT}\) and \(\Delta _{TT}\) satisfy the conventionally assumed scaling with the reduced mass \(m_r\) of the hydrogen-like system to \(10\%\) relative accuracy, while the cancellations in \(\Delta _\mathrm {pol.}\) enhance any violation in the scaling by one order of magnitude.

As shown in Fig. 4, our model-independent LO B\(\chi \)PT prediction is substantially smaller than the data-driven dispersive evaluations. An estimate for the effect of the \(\Delta (1232)\)-resonance [61], obtained from large-\(N_c\) relations for the nucleon-to-delta transition form factors, shows that the discrepancy is likely to increase at the NLO. The smaller polarizability effect, in turn, leads to a smaller Zemach radius as extracted from the experimental 1S hfs in H and the 2S hfs in \(\mu \)H, cf. Eq. (37). Therefore, resolving the present discrepancy for the polarizability effect is crucial for the analysis of the forthcoming measurements of the 1S hfs in \(\mu \)H and the extraction of the Zemach radius.

The data-driven approach relies on empirical information on the inelastic spin structure functions, or the measured cross sections to be precise, as well as the elastic form factors and polarizabilities at \(Q^2=0\). Due to the large cancellations between \(\sigma _{LT}\) and \(\sigma _{TT}\), as well as \(g_1\) and \(F_2\), precise parametrizations of the former are needed, and the uncertainty of the TPE evaluation has to be estimated with great care, taking into account all correlations. Furthermore, due to a lack of data at low Q, one uses interpolation from \(Q^2=0\) to the onset of data [14]. As we showed in Sect. 4.3 based on LO B\(\chi \)PT, the quality of these approximations is rather poor and is yet another source of uncertainty. New data from the Jefferson Lab “Spin Physics Program” [17,18,19,20,21], including also the substantially extended dataset for \(g_2\) [23], will allow for a re-evaluation of the polarizability effect on the hfs in H and \(\mu \)H.

An accurate theoretical prediction of the 1S hfs in \(\mu \)H is crucial for the future measurement campaigns, since it allows to reduce the search range for the resonance in experiment. Thus, one might find the resonance faster and acquire more statistics during the allocated beam time, see discussion in Ref. [13]. The present discrepancy between predictions for the polarizability effect can be mended, if the high-precision measurement of the 1S hfs in H is implemented as a constraint. Applying this procedure, good agreement is found between all theory predictions for the total 1S hfs in \(\mu \)H hfs, see Fig. 9. Eventually, after a successful measurement of the 1S hfs in \(\mu \)H, one can combine it with the 1S hfs in H to disentangle the Zemach radius and polarizability effects, leveraging radiative corrections as explained in Ref. [13]. The empirical polarizability effect, obtained in this way, can reach a precision of \(\sim 40\) ppm [13]. That is sufficient to discriminate between the presently inconsistent theoretical predictions.