1 Introduction

The persisting discrepancy between the Standard Model evaluation and the experimental determination [1] of the muon anomalous magnetic moment \(a_\mu \) is one of the outstanding open problems in particle physics and is traditionally considered a harbinger of New Physics. Moreover, the forthcoming results from the Fermilab E989 experiment, which aim to improve the present accuracy by a factor of 4 to reach an uncertainty of about \(16 \times 10^{-11}\) (i.e. 0.14 ppm) [2], make it even more crucial and timely to further scrutinize and improve control over theory predictions.

Together with the hadronic vacuum polarization contribution, the hadronic light-by-light (HLbL) is the major source of theoretical uncertainty in the Standard Model [3,4,5]. In the last years, significant efforts have been devoted to improve the determination of \(a_\mu ^{\text {HLbL}}\) and reduce model dependence by using analytic approaches based on dispersion relations [6,7,8,9,10,11,12,13,14,15,16,17] as well as lattice QCD [18,19,20,21,22,23,24]. In particular, the dispersive framework for the HLbL tensor in Refs. [6,7,8,9,10,11] has enabled accurate data-driven determinations with controlled error estimates of the contributions from one- and two-pion intermediate states.

In this framework \(a_\mu ^{\text {HLbL}}\) is evaluated via a two-loop integral of dispersively reconstructed scalar functions against analytically known kernels. At sufficiently small space-like photon virtualities, contributions from low-mass states accessible to a dispersive treatment are enhanced. At higher virtualities such an enhancement does not occur leading to important effects from higher intermediate states, which are constrained by operator product expansions (OPEs) and perturbative QCD (pQCD).

More specifically, there are two relevant kinematic regimes concerning short-distance constraints (SDCs) on \(a_\mu ^{\text {HLbL}}\) for asymptotic values of (subsets of) the photon virtualities. Since one of the photons corresponds to the static electromagnetic source in the definition of \(g-2\), one asymptotic regime is realized when the remaining three space-like photon virtualities are comparable and much larger than \(\varLambda _\text {QCD}^2\), and the other when two space-like photon virtualities are much larger than both the third and \(\varLambda _\text {QCD}^2\). The latter SDC was first derived by Melnikov and Vainshtein (MV) [25] using an OPE that leads to relations involving longitudinal and transversal amplitudes of the correlator of two vector and one axial current (VVA) in the chiral limit. The former SDC was also discussed in Ref. [25] based on the quark-loop calculation at leading order in pQCD and its derivation was recently put on a firmer theoretical ground by means of an OPE in an external electromagnetic background field [26].

Tree-level resonance exchanges cannot make \(a_\mu ^{\text {HLbL}}\) comply with all SDCs unless an infinite number of states is included. This is due to the fact that the transition form factors (TFFs) describing the resonance couplings to off-shell photons are subject themselves to asymptotic QCD constraints [27,28,29], which make the full HLbL four-point function decay too fast at high virtualities.Footnote 1 MV proposed a model to satisfy the longitudinal and transversal OPE SDCs through a modification of the pion pole contribution [25, 31], which affects also the low-energy region. Recently, alternative model-dependent solutions have been investigated to fulfill both OPE and pQCD SDCs by instead adding degrees of freedom to the ground-state pseudoscalars. In this context, Refs. [32, 33] proposed the inclusion of infinite towers of excited pseudoscalar poles in large-\(N_c\)-inspired Regge models to satisfy longitudinal SDCs away from the chiral limit,Footnote 2 while the effect of summing over axial-vector contributions in holographic QCD was the subject of Refs. [40, 41].Footnote 3 Through the explicit summation of intermediate states, these models provide specific interpolations between the low-energy region and the asymptotic regimes for the scalar functions that determine \(a_\mu ^{\text {HLbL}}\).

The goal of this paper is complementary to these studies. We introduce an approach based on more general interpolating scalar functions, independent of the physical mechanism that is ultimately responsible for their actual form outside the low-energy region. The multi-parameter families of functions studied here satisfy all constraints rigorously derived from general principles: unitarity, analyticity and crossing in the low-energy domain, OPE and pQCD constraints in the mixed and high-energy regions. Here we focus on longitudinal SDCs since these are tightly related to the pseudoscalar poles for which accurate low-energy input is available and their implementation does not involve any mixing of OPE constraints among different scalar functions [33]. Error estimates as well as the role played by the various parameters and assumptions, can be easily and transparently addressed in our approach and are investigated in detail in our numerical study.

Crucial input for our analysis is provided by an accurate low-energy representation of the scalar functions. In the following we will mostly assume that this is given by the ground-state pseudoscalar poles. In this context, an important role is played by the lightest state (\(\pi ^0\)), whose contribution is under firm theoretical control thanks to a dispersive evaluation [12,13,14]. Improved determinations of the effects of SDCs can be obtained in a straightforward way within our approach once similarly precise, model-independent information about further relevant intermediate states in the energy region up to 1–2 GeV become available. In order to illustrate this aspect and compare against a different way to estimate the contribution from SDCs, we have applied our method also to the case where the lightest axial-vector meson is included in the low-energy region using input from holographic QCD [40, 41] and neglecting issues related to intrinsic model dependence.

The paper is structured as follows. In Sect. 2 we review the relevant constraints on HLbL and the assumptions made in their derivations. Section 3 describes our interpolation between the OPE and pQCD asymptotic constraints while in Sect. 4 we discuss its smooth connection with the low-energy region. In Sect. 5 we present our numerical analysis with particular emphasis on the error estimation. Conclusions are drawn in Sect. 6. Appendix A is devoted to the analysis of the convergence properties of our interpolants.

2 Longitudinal short-distance constraints on HLbL

2.1 Master formula for \(a_\mu ^\text {HLbL}\) and pseudoscalar pole contributions

In order to set up the notation, we start by summarizing the relevant definitions and results from Refs. [9, 11]. The HLbL contribution to \(a_\mu \) is governed by the fourth-rank vacuum polarization tensor for fully off-shell photon-photon scattering in pure QCD,

(1)

with momenta assigned as \(q_1+q_2+q_3 = q_4\). In this expression, the electromagnetic current for the light quark triplet is given by

$$\begin{aligned} j^\mu (x)= & {} {\bar{\psi }}(x) {{\mathscr {Q}}} \gamma ^\mu \psi (x), \nonumber \\ \psi= & {} (u,d,s)^T, \nonumber \\ {\mathscr {Q}}= & {} \frac{1}{3}\hbox {diag}(2,-1,-1)\, . \end{aligned}$$
(2)

By generalizing the procedure introduced by Bardeen and Tung [43] and Tarrach [44] in their studies of doubly-virtual Compton scattering, it is possible to derive a generating redundant “BTT” set of 54 Lorentz structures,

$$\begin{aligned} \varPi ^{\mu \nu \lambda \sigma } = \sum _{i=1}^{54} T_i^{\mu \nu \lambda \sigma } \varPi _i, \end{aligned}$$
(3)

which is manifestly gauge invariant, closed with respect to crossing relations and such that the scalar functions \(\varPi _i\) are free of kinematic singularities.

The HLbL contribution to \(a_\mu \) can be derived from the tensor \(\varPi ^{\mu \nu \lambda \sigma }\) by using projection operator methods [5, 45, 46] in the static limit \(q_4 \rightarrow 0\). After performing a Wick rotation to Euclidean momenta, angular averages [47, 48] lead to the master formula [11]

$$\begin{aligned} a_\mu ^\text {HLbL}= & {} \frac{2\alpha ^3}{3\pi ^2}\int _{0}^{\infty } \mathop {}\!\mathrm {d}{Q_{1}} \int _{0}^{\infty } \mathop {}\!\mathrm {d}{Q_{2}} \int _{-1}^{1} \mathop {}\!\mathrm {d}\tau \sqrt{1-\tau ^2}\,Q_1^3\, Q_2^3\,\nonumber \\&\times \sum _{i=1}^{12} T_i(Q_1,Q_2,\tau ) \,{\bar{\varPi }}_i(Q_1,Q_2,\tau ), \end{aligned}$$
(4)

where \(Q_{1,2}\) denote the magnitudes of the Euclidean loop four-momenta, \(Q_{1,2} = \sqrt{-q_{1,2}^2}\), and \(\tau \) is the cosine of the angle between these vectors. The scalar functions \({\bar{\varPi }}_i\) are linear combinations of the previous \(\varPi _i\) for \(q_4 \rightarrow 0\). The analytic expressions of the integration kernels \(T_i\) are given in Ref. [11].

Parameterizing the three-dimensional integration domain by the coordinates [49]

$$\begin{aligned} \varSigma \in [0,\infty )\ ,\quad r \in [0,1]\ ,\quad \phi \in [0,2\pi )\, , \end{aligned}$$
(5)

which are related to the non-vanishing photon virtualities by

$$\begin{aligned} Q_1^2= & {} \frac{\varSigma }{3}\left( 1 - \frac{r}{2} \cos \phi - \frac{r}{2} \sqrt{3}\sin \phi \right) , \nonumber \\ Q_2^2= & {} \frac{\varSigma }{3}\left( 1 - \frac{r}{2} \cos \phi + \frac{r}{2} \sqrt{3}\sin \phi \right) , \nonumber \\ Q_3^2= & {} Q_1^2 + 2 Q_1 Q_2 \tau + Q_2^2 = \frac{\varSigma }{3}\left( 1 + r \cos \phi \right) , \end{aligned}$$
(6)

will prove very useful in the following discussion about asymptotic constraints on HLbL. The master formula in Eq. (4) then takes the form

$$\begin{aligned} a_\mu ^\text {HLbL}= & {} \frac{\alpha ^3}{432\pi ^2}\int _{0}^{\infty } \mathop {}\!\mathrm {d}\varSigma \,\varSigma ^3 \int _{0}^{1} \mathop {}\!\mathrm {d}r\, r\sqrt{1-r^2} \int _{0}^{2\pi } \mathop {}\!\mathrm {d}\phi \nonumber \\&\times \sum _{i=1}^{12} T_i(\varSigma ,r,\phi ) {\bar{\varPi }}_i(\varSigma ,r,\phi )\, . \end{aligned}$$
(7)

In terms of the \(Q_i^2\) coordinates, the integration domain amounts to a cone with tip at the origin. In terms of \((\varSigma , r, \phi )\), a given point in this cone is specified by the distance \(\varSigma \) to the tip of the point’s projection on the symmetry axis (\(\varSigma = Q_1^2+Q_2^2+Q_3^2\)), and by the polar coordinates r and \(\phi \) on the plane containing the point and orthogonal to the symmetry axis, normalized such that \(r = 1\) corresponds to the surface of the cone.

In the master formula, a special role is played by the scalar functions \({\bar{\varPi }}_{1,2}\), which fulfill

$$\begin{aligned} {\bar{\varPi }}_2 = {\mathscr {C}}_{2,3}[{\bar{\varPi }}_1] \quad \text {and}\quad {\mathscr {C}}_{1,2}[{\bar{\varPi }}_1] = {\bar{\varPi }}_1, \end{aligned}$$
(8)

where the crossing operator \({\mathscr {C}}_{i,j}\) exchanges momenta and Lorentz indices of the photons i and j. These functions are the only ones describing the effects of pseudoscalar tree-level exchanges. For small values of \(\varSigma \), the pion pole dominates yielding the largest contribution to \(a_\mu ^{\text {HLbL}}\) and also \(\eta \)/\(\eta '\) poles yield sizable effects. Furthermore, distinctively, the OPE SDCs on \({\bar{\varPi }}_{1,2}\) do not involve other scalar functions [33].

The functional form of \({\bar{\varPi }}_{1,2}\) in specific kinematic regimes is constrained according to analytic QCD results, which we will fully exploit to estimate the impact of (longitudinal) SDCs on

$$\begin{aligned} a_\mu ^\text {long}\equiv & {} \frac{\alpha ^3}{432\pi ^2}\int _{0}^{\infty } \mathop {}\!\mathrm {d}\varSigma \int _{0}^{1} \mathop {}\!\mathrm {d}r \int _{0}^{2\pi } \mathop {}\!\mathrm {d}\phi \,\varSigma ^3\, r\sqrt{1-r^2} \nonumber \\&\times \left[ T_1(\varSigma , r, \phi ) + T_2\left( \varSigma , r, \phi + \frac{2\pi }{3}\right) \right] \nonumber \\&\times {\bar{\varPi }}_1(\varSigma , r, \phi ), \end{aligned}$$
(9)

where the shift in the variable \(\phi \) in \(T_2\) corresponds to the crossing operation on \({\bar{\varPi }}_1\). Thus for our analysis, we only need to study one BTT scalar function in the \(g-2\) kinematics.

For the purpose of later discussion, we stress here that a pole term in \({\bar{\varPi }}_1\) due to a single-particle intermediate state of mass M yielding the denominator \(Q_3^2 + M^2\) leads to a hierarchy among contributions in the space-like momentum region relevant for \(a_\mu ^\text {HLbL}\). For small values of \(Q_3^2\) larger masses get suppressed, while for \(Q_3^2\) comparable to the squared mass of the heavier state or larger, no suppression is expected.Footnote 4 This effect is of course modified by the numerator in \({\bar{\varPi }}_1\), which encodes information on the strength of the coupling to two (off-shell) photons, but it still helps us identify which states can be relevant at specific energy scales and which not, independent of the values of \(Q_{1,2}^2\).

The lightest state contributing to \({\bar{\varPi }}_1\) is \(\pi ^0\). The unitarity relation for a single pseudoscalar intermediate state yields

$$\begin{aligned} {\bar{\varPi }}_1^\text {PS-pole} = -\frac{F_{\text {PS} \gamma ^* \gamma ^*}(-Q_1^2, -Q_2^2) F_{\text {PS} \gamma ^* \gamma ^*}(-Q_3^2, 0)}{Q_3^2 + m_\text {PS}^2}\, , \end{aligned}$$
(10)

where the numerator is given by the product of a doubly-virtual and a singly-virtual transition form factor (TFFs) for an on-shell pseudoscalar meson (PS), which is defined by the matrix element

$$\begin{aligned}&{\text {i}}\int \mathop {}\!\mathrm {d}^4 x\, e^{\,{\text {i}}q_1\cdot x} \langle 0| {{\text {T}}}\{j_\mu (x) j_\nu (0)\}|\text {PS}(q_1+q_2)\rangle \nonumber \\&\quad = \varepsilon _{\mu \nu \alpha \beta } q_1^\alpha q_2^\beta \,F_{\text {PS}\gamma ^*\gamma ^*}(q_1^2,q_2^2) \end{aligned}$$
(11)

with \(\varepsilon ^{0 1 2 3}=+1\). If we set \({\bar{\varPi }}_1 = {\bar{\varPi }}_1^\text {PS-pole}\), then \(a_\mu ^\text {long}\) amounts to the pseudoscalar pole contribution \(a_\mu ^{\text {PS-pole}}\). In the \(\pi ^0\) case, this has been evaluated within a few percent accuracy via a data-driven dispersive approach [12,13,14],

$$\begin{aligned} a_{\mu ,\text {disp}}^{\pi ^0\text {-pole}} = 62.6^{+3.0}_{-2.5}\times 10^{-11}\, . \end{aligned}$$
(12)

This result agrees with other recent determinations based on lattice QCD [50], Canterbury approximants [51], Dyson–Schwinger equations [52, 53] and AdS/QCD models [54]. While a dispersive analysis of the doubly-virtual \(\eta \)/\(\eta '\) TFFs has not been completed yet,Footnote 5 the method of Canterbury approximants in Ref. [51] provides data-driven determinations and associated uncertainties also for the \(\eta \)/\(\eta '\) TFFs. In our numerical analysis of SDCs, we have used as input the dispersive \(\pi ^0\) TFF from Refs. [13, 14] and compared our final results against those with form factors from Canterbury and Dyson–Schwinger approaches, while for \(\eta /\eta '\) we have used the TFFs in Ref. [51] and compared against Ref. [52].

The asymptotic constraints on \(a_\mu ^\text {HLbL}\) [25, 26] (see also Refs. [42, 58]) that we are going to discuss in the next sections have been translated into the BTT framework in Refs. [26, 32, 33]. In this context, there are two distinct relevant kinematic regimes. The first (asymmetric) one is realized when one of the photon virtualities is much smaller than the other two, which are large and comparable, e.g. \(Q_1^2 \sim Q_2^2 \gg Q_3^2\). The second (symmetric) limit occurs when all the Euclidean non-vanishing photon virtualities are large and comparable in size (\(Q_1^2 \sim Q_2^2 \sim Q_3^2 \gg \varLambda _\text {QCD}^2 \)). Both asymptotic limits correspond to \(\varSigma \rightarrow \infty \) but for different values of r and \(\phi \): the asymmetric limit \(Q_1^2 \sim Q_2^2 \gg Q_3^2\) corresponds to \(r=1\) and \(\phi = \pi \) while the symmetric configuration occurs in a neighborhood of \(r=0\) (see Fig. 1).

In the following we will review the relevant constraints on \({\bar{\varPi }}_1\) at large \(\varSigma \) and describe in detail our method to provide general families of interpolants for \({\bar{\varPi }}_1(\varSigma , r, \phi )\) between low- and high-energy regions in the \(g-2\) integral.

Fig. 1
figure 1

The circle represents the boundary of the \(g-2\) integration domain for a fixed value of \(\varSigma \). The angles \(\phi = \pi /3\), \(\phi = \pi \) and \(\phi =5 \pi /3\) correspond to \(Q_2^2= Q_3^2\), \(Q_1^2 = Q_2^2\) and \(Q_1^2=Q_3^2\), respectively. The colored regions denote where SDCs on \({\bar{\varPi }}_1\) hold at large \(\varSigma \). The blue domains yield contributions to \({\bar{\varPi }}_1\) from the OPE expansion of the VVA correlator that are sub-leading compared to the green one, while the orange region corresponds to the pQCD constraint

2.2 The asymmetric asymptotic region: OPE constraints

For large Euclidean values of \({\hat{q}} \equiv (q_1 - q_2)/2\), one can expand the time-ordered product of two electromagnetic currents, which defines the tensor

$$\begin{aligned} \varPi ^{\mu \nu }(q_1, q_2) = {\text {i}}\int \mathop {}\!\mathrm {d}^4 x\, \mathop {}\!\mathrm {d}^4 y \,e^{-{\text {i}}(q_1\cdot x + q_2 \cdot y)}{{\text {T}}}\{j^\mu (x) j^\nu (y)\}, \nonumber \\ \end{aligned}$$
(13)

into a series of local operators. At leading order in \(\alpha _s\), by matching single-quark matrix elements and omitting the unit operator, which does not contribute to the connected HLbL tensor in Eq. (1), one obtains [59]

$$\begin{aligned}&\varPi ^{\mu \nu }(q_1, q_2)\nonumber \\&\quad = \int \mathop {}\!\mathrm {d}^4 z\, e^{-{\text {i}}(q_1 + q_2) \cdot z} \left( -\frac{2{\text {i}}}{{\hat{q}}^2}\varepsilon ^{\mu \nu \alpha \beta } {\hat{q}}_\alpha j_{5\beta }(z)\right) + \cdots \, , \nonumber \\ \end{aligned}$$
(14)

where the axial current \(j_5^\mu \) is defined by \(j_5^\mu (x)={\bar{\psi }}(x){\mathscr {Q}}^2 \gamma ^\mu \gamma _5 \psi (x)\) with charge matrix given in Eq. (2). The ellipsis denotes sub-leading terms suppressed by powers of \(\{|q_1+q_2|/|{\hat{q}}|,\,\varLambda _{\mathrm{QCD}}/|{\hat{q}}|\}\). This result implies that, at leading order in the OPE and at leading order in \(\alpha _s\), the HLbL tensor can be expressed in terms of the correlator of two vector currents with an axial current,

$$\begin{aligned} \varPi _{\mu \nu \lambda \sigma }(q_1,q_2,q_3)= & {} \frac{2 {\text {i}}}{{\hat{q}}^2} \varepsilon _{\mu \nu \alpha \beta } {\hat{q}}^\alpha \int \mathop {}\!\mathrm {d}^4 x\mathop {}\!\mathrm {d}^4 y\, e^{-{\text {i}}q_3\cdot x} e^{{\text {i}}q_4\cdot y} \nonumber \\&\times \langle 0| {{\text {T}}}\{j_\lambda (x) j_\sigma (y) j_5^\beta (0)\}|0\rangle + \cdots \nonumber \\ \end{aligned}$$
(15)

for \(Q_1^2 \sim Q_2^2 \gg \{Q_3^2,Q_4^2,\varLambda _\text {QCD}^2\}\). This three-point function, which also appears in the calculation of fermion loop electroweak contributions to \(a_\mu \) [60, 61], can be decomposed into Lorentz structures that are longitudinal and transversal with respect to the Lorentz index of the axial current (see e.g. Ref. [62]). The corresponding longitudinal scalar function determines the asymptotic behavior of \({\bar{\varPi }}_1\) in the asymmetric region and is fixed by the axial Adler–Bell–Jackiw anomaly up to chiral corrections and the gluon anomaly. Neglecting these effects, which will be discussed below, this translates into the following constraint [33] for the singlet and octet flavor components of \({\bar{\varPi }}_1(Q_1^2, Q_2^2, Q_3^2)\), defined by the decomposition of the axial current,

$$\begin{aligned} {\bar{\varPi }}_1^{(a), \mathrm{OPE}}(Q^2, Q^2, Q_3^2) = -\frac{2 N_c \,C_a^2}{\pi ^2 Q^2 Q_3^2}\quad \mathrm{for} \quad a=\{3,8,0\}, \nonumber \\ \end{aligned}$$
(16)

where \(C_a = {{\text {Tr}}}({\mathscr {Q}}^2 \lambda _a)/2\) in terms of the charge matrix \({\mathscr {Q}}\) and Gell–Mann matrices \(\lambda _a\). In particular,

$$\begin{aligned} C_3 = \frac{1}{6}\, ,\quad C_8 = \frac{1}{6\sqrt{3}}\, ,\quad C_0 = \frac{2}{3\sqrt{6}}. \end{aligned}$$
(17)

Since Eq. (16) relies on a perturbative calculation of the VVA correlator, it holds in the kinematic limit \(Q_1^2 \sim Q_2^2 \equiv Q^2 \gg Q_3^2 \gg \varLambda _\text {QCD}^2\). For the non-singlet components (\(a=3,8\)), since perturbative [63] as well as non-perturbative [64, 65] corrections are absent in the chiral limit, the hierarchy between \(\varLambda _\text {QCD}^2\) and \(Q_3^2\) can be dropped. In contrast, the singlet channel (\(a=0\)) is affected by the gluon anomaly, even in the chiral limit. This does not modify Eq. (16) for \(Q_3^2 \gg \varLambda _\text {QCD}^2\) [33], but the extrapolation to small \(Q_3^2\) is only valid if in addition the large-\(N_c\) limit is considered, where the anomaly vanishes. Furthermore, in the crossed kinematics (\(Q_2^2 \sim Q_3^2 \gg Q_1^2\) and \(Q_1^2 \sim Q_3^2 \gg Q_2^2\)), the leading-order OPE contributions to \({\bar{\varPi }}_1\) vanish.

Let us now compare against the pseudoscalar pole contributions, focusing on the pion pole first. In the chiral limit and using the fact that \(\lim \limits _{Q^2\rightarrow \infty } Q^2 F_{\pi \gamma ^*\gamma ^*}(-Q^2, -Q^2) = 4 C_3 F_\pi \) at leading order in \(\alpha _s\) [66, 67], one finds

$$\begin{aligned} \lim \limits _{Q^2 \rightarrow \infty } Q^2 {\bar{\varPi }}_1^{\pi ^0\text {-pole}}(Q^2, Q^2, Q_3^2) = -4 C_3 F_\pi \frac{F_{\pi \gamma ^*\gamma ^*}(-Q_3^2, 0)}{Q_3^2}\, .\nonumber \\ \end{aligned}$$
(18)

This expression has a pole at \(Q_3^2 = 0\) since \(F_{\pi \gamma ^*\gamma ^*}(0, 0) = 3 C_3 / (2\pi ^2 F_\pi )\). The location of this pole as well as its residue agree with in Eq. (16) (cf. Ref. [25]), which is consistent with the pion being the only massless isovector state in the chiral limit.

For finite quark masses, the pole in \({\bar{\varPi }}_1^{\pi ^{0}\text {-pole}}\) is shifted from \(Q_3^2 = 0\) to \(Q_3^2 = -m_\pi ^2\), which lies outside the integration domain for \(a_\mu ^{\text {HLbL}}\). The closest point in the integration region for fixed asymptotic \(\varSigma \) (see Fig. 1) is at \(Q_3^2 = 0\), where

$$\begin{aligned} \lim \limits _{Q^2 \rightarrow \infty } Q^2 {\bar{\varPi }}_1^{\pi ^{0}\text {-pole}}(Q^2, Q^2, 0)= & {} -4 C_3 F_\pi \frac{F_{\pi \gamma ^*\gamma ^*}(0, 0)}{m_\pi ^2} \nonumber \\= & {} - \frac{6 C_3^2}{\pi ^2 m_\pi ^2}\, . \end{aligned}$$
(19)

Here few percent chiral corrections to \(F_{\pi \gamma ^*\gamma ^*}(0, 0)\) [68,69,70,71,72] have been neglected. This is still close to the actual pole, which leads to the enhancement by \(m_\pi ^{-2}\). Since no other contribution receives the same enhancement, the last expression is expected to provide an excellent approximation to the true in the specified limit.Footnote 6 We observe that the OPE result, which is derived in the chiral limit, reproduces Eq. (19) if the pole position is shifted by the pion mass as dictated by the pion pole contribution

(20)

This is also consistent with the OPE result in Eq. (16) for \(Q^2 \gg Q_3^2 \gg \varLambda _\text {QCD}^2\), where chiral corrections are sub-leading. Thus, Eq. (20) is exact for \(Q_3^2 \gg \varLambda _\text {QCD}^2\), relies on the assumption of pion dominance at \(Q_3^2 \ll \varLambda _\text {QCD}^2\) and has the correct chiral limit Eq. (16) for all \(Q_3^2\). We extend it to the \(\eta \)/\(\eta '\) channels and write

(21)

Here \(C_\pi = C_3\) but \(C_{\eta / \eta '}\) cannot be directly identified with \(C_{0/8}\) due to \(\eta \)-\(\eta '\)-mixing. In analogy to the pion channel, we assume that ground-state single-pseudoscalar exchanges dominate , despite the fact that the \(\eta \)/\(\eta '\) poles are further away from \(Q_3^2 = 0\). This assumption implies that \(C_{\eta / \eta '}\) can be read off from the pole contributions

$$\begin{aligned} \lim \limits _{Q^2\rightarrow \infty } Q^2{\bar{\varPi }}_1^{{\eta / \eta '}\text {-pole}}(Q^2, Q^2, 0) = -\frac{6 C_{{\eta / \eta '}}^2}{\pi ^2 m_{{\eta / \eta '}}^2}. \end{aligned}$$
(22)

One can show that in the chiral limit and neglecting the gluon anomaly [33]

$$\begin{aligned}&\lim \limits _{Q_3^2\rightarrow 0} \, \lim \limits _{Q^2\rightarrow \infty } Q^2 Q_3^2 \nonumber \\&\qquad \times \left( {\bar{\varPi }}_1^{\eta \text {-pole}}(Q^2, Q^2, Q_3^2) + {\bar{\varPi }}_1^{\eta '\text {-pole}}(Q^2, Q^2, Q_3^2)\right) \nonumber \\&\quad = -\frac{6 (C_8^2 + C_0^2)}{\pi ^2}. \end{aligned}$$
(23)

At this point we note that, besides the \(\alpha _s\) corrections to the TFFs and OPE coefficient discussed in Sect. 2.3, which affect all ground-state pseudoscalars in the same way, the gluon anomaly induces a running of the flavor singlet decay constant [73,74,75]. This running leads to an incomplete cancellation between the decay constants in the symmetric asymptotic and the real photon limits, which has a sizable impact due to the large scale separation [76, 77].

Since \({\bar{\varPi }}_1^{{\eta / \eta '}\text {-pole}}\) can be expressed in terms of TFFs according to Eq. (10), assuming that corrections due to non-vanishing meson masses are negligible both in the real photon limit and in the symmetric asymptotic limit of the \({\eta / \eta '}\) TFFs, Eqs. (2123) together imply

$$\begin{aligned} C_\eta ^2 + C_{\eta '}^2= C_8^2 + C_0^2 \end{aligned}$$
(24)

up to the above-mentioned anomaly-induced scale- dependence, which leads to a violation of this equality (cf. Sect. 5.2).

For \(Q_3^2 \gg \varLambda _\text {QCD}^2\), the additional \(Q_3^2\)-suppression of the singly-virtual TFF leads to a mismatch between the pseudoscalar pole contributions and the OPE constraint. In Ref. [25] MV proposed to solve this issue by setting the singly-virtual TFF equal to a constant. This prescription is not compatible with the dispersive definition of the pole contributions in the framework summarized in Sect. 2.1, according to which, instead, an infinite tower of heavier intermediate states is needed to saturate the constraint (see e.g. Ref. [33]). For this purpose, summations of series of contributions from excited pseudoscalars [32, 33] and axials [40, 41] have been recently performed in the context of hadronic models. In Sects. 5.3 and 5.4, we will compare the outcome of our analysis against these estimates of the effects of longitudinal SDCs.

2.3 \(\alpha _s\) corrections to the OPE

The derivation of Eq. (14) has been performed at leading order in \(\alpha _s\). Since no other operator of dimension 3 can appear in that OPE, \(\alpha _s\) corrections only affect the OPE coefficient of the axial-vector current. In Refs. [78,79,80], this coefficient has been calculated to next-to-leading order (NLO). Including this contribution in Eq. (14) leads to

$$\begin{aligned}&\varPi ^{\mu \nu }(q_1, q_2) = \int \mathop {}\!\mathrm {d}^4 z \,e^{-{\text {i}}(q_1 + q_2) \cdot z} \nonumber \\&\quad \times \bigg (-\frac{2{\text {i}}}{{\hat{q}}^2} \left( 1 - \frac{\alpha _s}{\pi }\right) \varepsilon ^{\mu \nu \alpha \beta } {\hat{q}}_\alpha j_{5\beta }(z) + {\mathscr {O}}\left( {\hat{q}}^{-2}\right) \bigg ). \end{aligned}$$
(25)

It follows that the NLO version of Eq. (21) reads

(26)

The two-current operator product not only enters the HLbL tensor, but also the pion TFF (see Eq. (11)). Thus, any perturbative correction to the OPE Wilson coefficient automatically implies the same perturbative correction to the symmetric limit of the pion TFF and vice versa. In fact, the symmetric asymptotic pion TFF has been calculated to NLO in Refs. [14, 81],Footnote 7

$$\begin{aligned}&F_{\pi \gamma ^*\gamma ^*}(-Q^2,-Q^2) \nonumber \\&= \left( 1 - \frac{\alpha _s}{\pi } + {\mathscr {O}}\left( \alpha _s^2\right) \right) \frac{2 F_\pi }{3Q^2} + {\mathscr {O}}\left( Q^{-4}\right) , \end{aligned}$$
(27)

which is consistent with Eq. (25). The fact that the \(\alpha _s\) corrections agree between the HLbL tensor in the asymmetric asymptotic limit and the symmetric asymptotic pion TFF guarantees that the pion pole saturates at \(Q_3^2 = 0\) in the chiral limit also beyond leading order in \(\alpha _s\).

A comment on the renormalization scale dependence of the terms in Eq.(25) is in order here. The non-singlet components of the axial current are conserved (up to quark mass corrections) and thus their anomalous dimensions vanish. This is not true for the singlet component due to the gluon anomaly [82, 83], but we neglect this effect here because it starts at \({\mathscr {O}}(\alpha _s^2)\). Therefore, since the perturbatively expanded dimensionless part d of the Wilson coefficient is scale (\(\mu \)) independent,

$$\begin{aligned} d\left( -\frac{{\hat{q}}^2}{\mu ^2}, \alpha _s(\mu ^2)\right) = d(1, \alpha _s(-{\hat{q}}^2)) \end{aligned}$$
(28)

and the terms \(\alpha _s^n \ln ^{n-1} (-{\hat{q}}^2/\mu ^2)\) (\(n\ge 1\)) can be resummed using as input the \(\beta \)-function and the one-loop result with \(\alpha _s\) evaluated at the scale \(-{\hat{q}}^2\) (see also [84]).

2.4 The symmetric asymptotic limit: perturbative QCD constraints

In Ref. [26] it has been shown that the pQCD quark loop is the leading term of an OPE in the kinematic limit \(Q_1^2 \sim Q_2^2 \sim Q_3^2 \gg \varLambda _\text {QCD}^2\), where the fourth (external) photon has vanishing momentum in \((g-2)\)-kinematics. At leading order in this OPE and at leading order in \(\alpha _s\) [33],

$$\begin{aligned} {\bar{\varPi }}_1^\text {pQCD} (q_1^2, q_2^2, q_3^2)= & {} \frac{N_c {{\text {Tr}}}{\mathscr {Q}}^4}{16\pi ^2} \int _{0}^{1} \mathop {}\!\mathrm {d}x \int _{0}^{1-x} \mathop {}\!\mathrm {d}y \, I_1(x,y) \nonumber \\= & {} \frac{1}{24\pi ^2} \int _{0}^{1} \mathop {}\!\mathrm {d}x \int _{0}^{1-x} \mathop {}\!\mathrm {d}y\, I_1(x,y), \nonumber \\ I_1(x,y)= & {} -\frac{16 x(1-x-y)}{\varDelta _{132}^2} \nonumber \\&- \frac{16 x y(1-2x)(1-2y)}{\varDelta _{132} \varDelta _{32}}, \nonumber \\ \varDelta _{ijk}= & {} m_q^2-xy q_i^2 -x(1-x-y)q_j^2 \nonumber \\&- y(1-x-y) q_k^2, \nonumber \\ \varDelta _{ij}= & {} m_q^2-x(1-x)q_i^2 - y(1-y) q_j^2. \nonumber \\ \end{aligned}$$
(29)

In the symmetric limit, neglecting terms that are suppressed by powers of \(m_q^2/Q^2\),

(30)

where we have chosen to adopt the same flavor decomposition as for the asymmetric OPE case, Eq. (16). If higher-order perturbative corrections are small, the leading-order result above is expected to be a good approximation also away from the fully symmetric configuration as long as large logarithms of ratios of momenta are absent.

Since \({\bar{\varPi }}_1^\text {PS-pole}\) decays like \(Q^{-6}\), (towers of) hadronic contributions beyond ground-state pseudoscalar poles have to be responsible for the behavior shown by Eq. (30). Following the MV prescription in Ref. [25], the parametric dependence on Q can be reproduced but with an incorrect coefficient.

In order to saturate the pQCD result in the isosinglet channels, we need coefficients \(C_{\eta / \eta '}^\text {pQCD}\) satisfying

$$\begin{aligned} C_8^2 + C_0^2 = \left( C^\text {pQCD}_{\eta }\right) ^2 + \left( C^\text {pQCD}_{{\eta '}}\right) ^2. \end{aligned}$$
(31)

Since Eq. (24) is violated, we define

$$\begin{aligned} \left( C_{\eta / \eta '}^\text {pQCD}\right) ^2 = (1 + \delta _0) C_{\eta / \eta '}^2\, , \end{aligned}$$
(32)

where the parameter \(\delta _0\) is chosen such that Eq. (31) holds.

3 Interpolating between asymptotic constraints

We approximate the true \({\bar{\varPi }}_1(\varSigma , r, \phi )\) following a two-step procedure. We first select functional forms that are valid for asymptotic \(\varSigma \) and are compatible with the constraints discussed in the previous section. We then interpolate between this set of functions and various representations of \({\bar{\varPi }}_1\) at small \(\varSigma \) determined by single-particle intermediate states. Here we work at leading order in \(\alpha _s\). Perturbative corrections will be discussed in our numerical analysis in Sect. 5.

The relevant constraints on \({\bar{\varPi }}_1\) at large \(\varSigma \) are given by Eq. (21) for \(Q_1^2 = Q_2^2 \gg Q_3^2\) and Eq. (30) for \(Q_1^2 = Q_2^2 = Q_3^2\). Both expressions as well as the vanishing result of the leading-order OPE contribution in the crossed kinematics are compatible with

(33)

if \( C_\text {PS} = C_\text {PS}^\text {pQCD} \). Thus, Eq. (33) interpolates between symmetric and asymmetric asymptotic limits. According to Sect. 2.4, \(\delta _0\) parameterizes the anomaly corrections to the singlet VVA correlator and the resulting shift in \(C_\text {PS}^\text {pQCD}\) with respect to \(C_\text {PS}\). Since a term proportional to \(\varSigma ^{-2}\) and independent of \((r, \phi )\) does not change the leading behavior at \(Q_3^2 = 0\) and thus does not spoil compatibility with the OPE constraint, we subtract \(36 \delta _0 C_{\eta / \eta '}^2/(\pi ^2 \varSigma ^2)\) from Eq. (33) in the case of \(\eta \)/\(\eta '\).

Obviously, the choice made in Eq. (33) and the exact form of the singlet correction are not unique and we are free to add a generic function such that the interpolant still satisfies the constraints. In order to have a non-negligible effect at asymptotic values of \(\varSigma \), this additional function should also scale as \(\varSigma ^{-2}\) and we demand it to be finite and analytic for all \(r\le 1\).Footnote 8 Therefore it can be approximated by a Taylor series in \(r\cos \phi \) and \(r \sin \phi \) truncated after order M,

(34)

where

$$\begin{aligned} a_{0,0} = 0, \quad a_{i,2j+1} = 0 \end{aligned}$$
(35)

for integer j, due to the pQCD constraint and crossing symmetry.

Up to now, we have applied the quark-loop result only at \(r = 0\). However, the fact that Eq. (29) holds also in a neighborhood of this point can be used to fix the coefficients \(a_{i,j}\). To this end, we fitted Eq. (34) at fixed asymptotic \(\varSigma \) with \(M = 2\) to Eq. (29) for \(r<0.9\).Footnote 9 We chose a grid of equally separated points in this fitting region and minimized the sum of the relative squared differences between our interpolant and the leading-order quark-loop expression. The resulting 5 dimensionless fit parameters in the pion channel read

$$\begin{aligned} a_{1,0}= & {} {-0.170},\quad a_{2,0} = {0.094},\quad a_{0,2} = {-0.554},\nonumber \\ \quad a_{1,2}= & {} {-0.169},\quad a_{2,2} = {-0.756} \end{aligned}$$
(36)

and are all at most \({\mathscr {O}}(1)\), as expected since in Eq. (34) they parameterize relative corrections. This holds true also for the \({\eta / \eta '}\) channels, where the numerical values are different. In Sect. 5.1.2 we will discuss uncertainties due to the chosen fitting range, the number of parameters in the fit and \(\alpha _s\) corrections to the asymptotic constraints.

4 Interpolating between low and high energies

The next step is to smoothly connect our representation of \({\bar{\varPi }}_1\) for \(\varSigma \gg \{\varLambda _\text {QCD}^2,M_\text {PS}^2,\dots \}\) given by Eq. (34) to an accurate low-energy description. We achieve this by adding suitable terms to that are sub-leading at large \(\varSigma \). For each choice of r and \(\phi \), the coefficients of these terms are then matched onto an input low-energy representation of \({\bar{\varPi }}_1\) at a suitably defined surface \(\varSigma ^\text {match}(r,\phi )\). In Sect. 4.2 we will discuss how \(\varSigma ^\text {match}\) is related to the mass scale at which intermediate states beyond the ones explicitly considered start to affect \({\bar{\varPi }}_1\).

4.1 Interpolation functions and matching procedure

For \(\varSigma > \varSigma ^\text {match}\) we consider the following two interpolation functions

(37)

whose leading terms at asymptotic \(\varSigma \) are given in Eq. (34), whereas below the matching surface we set . In Appendix A we will show that these functions converge to the true in the limit \(N\rightarrow \infty \) when matched to exact low-energy input using the convergence property of a Taylor series. The two different forms given in Eq. (37) will be used to estimate the sensitivity of our numerical results on the specific choice of interpolation between low and high energies.Footnote 10

The coefficients \(b_i(r,\phi )\) are fixed from the requirement that the have the same value and the same \(N-1\) \(\varSigma \)-derivatives as the low-energy representation if evaluated at \(\varSigma = \varSigma ^\text {match}(r,\phi )\) for each \((r,\phi )\). No expansion is performed in r and \(\phi \), which is crucial to obtain a smooth transition to the low-energy regime.

Determining the optimal value of N is a non-trivial issue. On the one hand, larger values of N seem to be preferable since the true \({\bar{\varPi }}_1\) is analytic for space-like momenta and thus all derivatives are continuous. On the other hand, matching many derivatives leads to a function that is almost saturated by the low-energy input contribution up to considerably higher energies than \(\varSigma ^\text {match}(r,\phi )\). Since it is desirable to match at least one derivative in order to have \({\bar{\varPi }}_1\) differentiable at the matching point, we will use \(N \in \{2,3\}\) in order to estimate the dependence on N.

Interpolation functions with a logarithmic dependence on \(\varSigma \) are not forbidden. This can stem, for example, from non-perturbative corrections leading to terms like \(\ln {(Q_i^2/M^2)}\), where M is some non-perturbative mass scale. In fact, the Regge model considered in Refs. [32, 33] leads to interpolants containing terms like \(Q^{-4}\ln {(Q^2/\sigma ^2)}\) for \(Q_i^2 = Q^2 \rightarrow \infty \), where \(\sigma ^2\) could e.g. be the Regge slope of the excited pseudoscalar masses. In order to allow for such a logarithmic approach of the asymptotic expression, we additionally consider the alternative interpolant

(38)

and use again \(N \in \{2,3\}\).

4.2 The matching surface \(\varSigma ^\text {match}\)

The remaining crucial ingredient in our procedure is the function \(\varSigma ^\text {match}(r,\phi )\), which determines the value of \(\varSigma \) at which the matching is performed for given r and \(\phi \). Choosing it too low leads to important modifications of at low energies with consequent overestimation of .Footnote 11 Conversely, choosing \(\varSigma ^\text {match}\) too high assumes the low-energy input to dominate beyond what is expected according to mass and phase-space considerations and thus leads to underestimate .

For small values of \(Q_3^2\), the \(\pi ^0\), \(\eta \), \(\eta '\) poles are assumed to dominate independently of \(Q_{1,2}^2\), due to the pole at \(Q_3^2 = -m_\text {PS}^2\) (see Sects. 2.1 and 2.2). This implies that no matching is needed in this regime, i.e. \(\varSigma ^\text {match}(1,\pi ) = \infty \). The most general function that is analytic for all \((r,\phi )\) except for a (first-order) pole at \((r,\phi ) = (1,\pi )\) can be written as

$$\begin{aligned} \varSigma ^\text {match}(r,\phi ) = \frac{3m^2}{1+r \cos \phi }\left( 1 + P(r \cos \phi , r \sin \phi )\right) , \end{aligned}$$
(39)

where \(m^2\) determines the matching scale at \(r = 0\) and P is a polynomial with two arguments and no constant term. The transformation property of \({\bar{\varPi }}_1\) under crossing specified in Eq. (8) restricts P to contain only even powers of \(r \sin \phi \).

The parameter \(m^2\) sets the absolute mass scale of \(\varSigma ^\text {match}\) and should thus be related to the masses of the states affecting \({\bar{\varPi }}_1\) beyond the ones explicitly included, namely \(\pi ^0\), \(\eta \), \(\eta '\) here. In the following, we will assume that contributions to \({\bar{\varPi }}_1\) in the \(g-2\) kinematics stemming from multi-particle intermediate states are dominated by narrow resonances while non-resonant effects lead to negligible corrections to the matching procedure and can be simply added to our final results.Footnote 12 This is realized for example in the large-\(N_c\) limit of pure QCD: since the short-distance expressions for \({\bar{\varPi }}_1\) in both symmetric and asymmetric limits scale like \(N_c\), these can indeed be saturated by single-meson exchanges (see Ref. [85]). Non-resonant contributions from multi-hadron intermediate states (like \(2\pi \), 2K, \(\pi \eta \), \(3\pi \),...) are sub-leading for large \(N_c\) and thus cannot contribute to the SDCs. Since scalar mesons have no impact on \({\bar{\varPi }}_1\), the lightest states beyond the ground-state pseudoscalars that are the most relevant at small \(Q_3^2\) (see Sect. 2.1) are the axial mesons like \(a_1(1260)\) and the tensor mesons like \(f_2(1270)\), with masses in the 1–2 GeV region, whose effects on \(a_\mu ^\text {HLbL}\) can presently be estimated only using hadronic models.

For \(P(x,y) = 0\), \(\varSigma =\varSigma ^\text {match}\) corresponds to \(Q_3^2 = m^2\). Since a state of mass M ceases to be suppressed by the denominator \((Q_3^2 + M^2)\) compared to lighter states when \(Q_3^2\) approaches \(M^2\), \(m^2\) should be chosen well below \(M^2\). At the same time, it should not be taken too small, because we do not expect any large contribution to \({\bar{\varPi }}_1\) at \(Q_3^2 \ll M^2\). We thus regard \(m^2 = {0.5}{\,\text {GeV}^2}\) as a good starting point for our analysis. In Sect. 5.1.4 we will discuss a range of choices for this parameter as well as the effects of the polynomial

$$\begin{aligned} P(x,y) = \sum _{i=0}^{M} \sum _{j=0}^{M} \frac{1}{i!\ j!} p_{i,j}\ x^i y^j, \end{aligned}$$
(40)

which we have estimated by means of a Monte Carlo sampling over the coefficients \(p_{i,j}\).

5 Numerical results and error analysis

5.1 The isovector channel

The isovector channel is best suited to our method since it is characterized by a large contribution from the low-energy region dominated by the well-known pion pole, which does not mix (strongly) into the other flavor channels. The lightest one-particle intermediate state beyond the \(\pi ^0\) in this channel is the \(a_1(1260)\), whose effect at low energies is suppressed by the large mass gap. The numerical dominance of this channel at low energies, however, does not imply that the same holds true at intermediate and high energies. In fact, the values of \(C_a\) in Eq. (17) make the flavor singlet channel the numerically most important one in the asymptotic region where meson masses can be neglected, i.e. for \(Q_i^2 \gg \varLambda _\text {QCD}^2\). In Sect. 5.2 we will discuss the inclusion of \(\eta /\eta '\) and in Sect. 5.4 also the case of the isovector ground-state axial, which is however affected by a larger degree of model dependence.

We start by selecting a “reference” set of assumptions and input parameters. The impact of their modifications will be assessed in the next sections and will define the range of our predictions in the form of an uncertainty band. This procedure allows us also to examine how the estimate of the effects of SDCs would be improved by more precise information on the pion pole, the contributions from states with masses around 1 GeV and the asymptotic regime.

As low-energy reference input, we took the leading-order dispersive \(\pi ^0\) singly- and doubly-virtual TFFs [13, 14], while the corresponding \({\mathscr {O}}{(\alpha _s)}\) correction is included in the uncertainty. As reference interpolating function, we used with \(N=3\) (see Eq. (37)), which turned out to yield results that are central in the range spanned by the interpolants 1, 2 and 3 and \(N\in \{2,3\}\) (cf. Eq. (49) below). For the asymptotic function, we included information from pQCD away from \(r=0\) in the way explained in Sect. 3, while \(\alpha _s\) corrections contribute to the uncertainty. For the matching surface we used Eq. (39) with \(P(x,y) = 0\) and \(m_\text {ref}^2 = {0.5}~\text {GeV}^{2}\). The resulting function, which we call , is shown in Fig. 2 for \(r=0\) together with the uncertainty band for the interpolants that we are going to discuss in the next sections. Our reference outcome for the contribution to \(a_\mu ^\text {long}\) due to the longitudinal SDCs in the isovector channel is

(41)

where comes from using in the master formula Eq. (9), and \(a_{\mu ,\text {disp}}^{\pi ^0\text {-pole}}\) is given in Eq. (12) according to Refs. [13, 14]. In Sect. 5.1.5 we will argue that our final result does not strongly depend on the choice of the reference set of parameters.

Fig. 2
figure 2

The pion pole contribution and associated uncertainty from Refs. [13, 14] vs. the reference interpolant and its error band which includes all sources of uncertainty considered in the present analysis (see discussion in Sects. 5.1.15.1.4 below)

5.1.1 Pion TFF uncertainties

We shall now describe the effects of modifying the different ingredients of the reference configuration, one by one, starting from the pion TFF. By propagating the errors quoted in Refs. [13, 14] for the dispersive determination of the pion TFF and by summing the different contributions in quadrature, taking as well into account that a modification of the TFF affects both terms in Eq. (41), we obtained an asymmetric error band around the reference result with boundary values

(42)

which correspond to the asymmetric error for the dispersive \(\pi ^0\) TFF. Given the smallness of these uncertainties, the (negative) correlation between them and the uncertainties of \(a_{\mu ,\text {disp}}^{\pi ^0\text {-pole}}\) can be safely neglected.

In order to study the impact of different pion TFF parameterizations, we compared the previous results against the ones obtained using, both for the construction of the interpolant and the evaluation of \(a_{\mu }^{\pi ^0\text {-pole}}\), the \(C^1_2\) Canterbury approximant with \(a_{\pi ; 1,1} = 2 b_\pi ^2\) of Ref. [51] and the Dyson–Schwinger TFF from Ref. [52]. We obtained

(43)

which are both compatible with the reference result within the range given above. We conclude that the outcome of our analysis is very robust against changes in the TFF input and that the present knowledge of the pion TFF is sufficient for our purposes.

5.1.2 Asymptotic uncertainties

Here we focus on the uncertainties in (see Eq. (34)), which are related to

  • the choices made in the fit to the quark-loop result that lead to Eq. (36), namely the degree M of the polynomial and the radius \(r_\text {max}\) of the fitting domain;

  • \(\alpha _s\) corrections to the OPE constraint as given by Eq. (26);

  • \(\alpha _s\) corrections to the quark loop.

We start by discussing the fit to the quark-loop result. In Sect. 3, we chose \(M=2\), which leads to a strongly improved fit quality compared to \(M=1\). Considering a larger value of M gives an estimate of the errors made by approximating the pQCD result by a polynomial at \(r < r_\text {max}\) at fixed asymptotic \(\varSigma \) and by extrapolating to the regime \(r>r_\text {max}\), which is unknown except for the OPE constraint. Choosing \(M=3\) shifts the result for by only \({0.02\times {10^{-11}}}\) indicating that the truncation at \(M=2\) is sufficient. We also studied the effects of a substantial reduction of the radius, namely from \(r_\text {max} = 0.9\) down to \(r_\text {max} = 0.5\), where no logarithm of ratios of squared momenta is larger than 1. We found that this leads to a small shift \(({0.07\times 10^{-11}})\). We did not consider \(r_\text {max}>0.9\) since fixed-order pQCD is not expected to converge for r close to 1 due to large logarithms. Combining linearly the uncertainties from the choice of M and the fitting radius gives

(44)

with respect to the reference contribution of longitudinal SDCs to the pion pole input in Eq. (41).

Let us now focus on the estimate of the separate perturbative corrections to either the OPE or the pQCD result. Since those concerning the OPE should not be extrapolated into the domain of validity of pQCD, for asymptotic \(\varSigma \) we write (cf. Eq. (26))

(45)

Here the Heaviside step function \(\theta \) ensures that the perturbative correction only affects a region around \(Q_3^2 = 0\), whose size can be varied via the free parameter A. By setting \(A=1/29\), this region does not extend into the \(r<0.9\) domain. The choice of the renormalization scale \(\mu ^2\) is the same as in Ref. [14] and agrees with our discussion in Sect. 2.3 up to a factor of 2 since \(-{\hat{q}}^2 = (Q_1^2 + Q_2^2)/2 - Q_3^2/4 \approx (Q_1^2 + Q_2^2)/2\) in the relevant regime. In our numerical analysis, for the running of \(\alpha _s\) we used the three-flavor one-loop beta function and matched to \(\alpha _s(\mu ^{2} = M_{{\tau }^{2}}) = 0.35\).

According to the discussion in Sect. 2, the OPE constraint in the chiral limit is saturated by the pion pole at \(Q_3^2 = 0\) to all orders in perturbation theory. For this reason in a consistent analysis the OPE coefficient and the pion TFF in the symmetric limit should be taken at the same perturbative accuracy. Hence we replaced in Eq. (37) by and matched the correspondingly modified to the pion-pole contribution with TFFs including \({\mathscr {O}}(\alpha _{s})\) effects [14, 81].Footnote 13 Using this interpolant and this pion-pole result, our outcome for is larger than the reference result Eq. (41) by

(46)

For \(A=1/3\), the domain where the correction applies extends down to \(r=0.25\), but nevertheless the shift of turns out to be \(-0.05\times 10^{-11}\) and thus still negligible. The smallness of these shifts can be understood from the large values of \(\varSigma ^\text {match}\) in the region where these perturbative corrections apply. For this reason, the effect is almost completely included in the pion pole contribution, where it also has a small impact [14].

Since the NLO calculation of the quark loop has not been performed yet, we can only provide a rough estimate of the uncertainty related to unknown \({\mathscr {O}}(\alpha _s)\) corrections. We assumed in analogy with Eq. (45),

(47)

and as in the leading-order quark loop fit, we set \(r_\text {max}=0.9\). Using this expression in Eq. (37) for the matching to the pion-pole with leading-order dispersive TFF, we obtained a shift of \({-0.18\times 10^{-11}}\) compared to the reference result. Even when inflating this uncertainty by a factor of 2,

(48)

this effect is still sufficiently small compared to the current precision goal. We stress that once NLO calculations become available, should be constructed to analytically interpolate between the NLO expressions for the OPE and the quark loop. The discontinuous functions employed here only serve to provide a ballpark estimate of NLO effects.

5.1.3 Choice of interpolation functions

In Eqs. (37) and (38) we have introduced three different interpolation functions, characterized by two or three free parameters to be matched to the low-energy representation. The corresponding results for the contribution from longitudinal SCDs are

(49)

where given by Eq. (41) has been included for completeness.

We observe that the slower logarithmic approach to the asymptotic limits in the interpolant 3 leads to smaller results, especially when compared to the similar interpolant 1. Setting

(50)

all values listed above are within the range .

Fig. 3
figure 3

The figure displays the dispersive pion pole contribution, the reference interpolant and the (orange) band corresponding to the various choices of the parameter m. The blue line indicates the value of the matching surface for \(m^2 = m_\text {min}^2\). The green band shows the sum of the \(\pi ^0\)- and \(\pi (1300)\)-pole contributions, where the latter has been calculated using input from R\(\chi \)T and phenomenology, including errors

5.1.4 Choice of \(\varSigma ^\text {match}(r,\phi )\)

The function \(\varSigma ^\text {match}(r,\phi )\) in Eq. (39) contains the mass parameter m and the polynomial P(xy), which has been set equal to zero so far. We have argued in Sect. 4.2 that m should be chosen considerably smaller than the mass M of the lightest resonances contributing to \({\bar{\varPi }}_1\) in addition to the ground-state pseudoscalar mesons. For this reason, for the reference interpolant we set \(m^2={0.5}~\text {GeV}^{2}\). Here we discuss the effects of alternative choices for this parameter within a range between \(m_\text {min}\) and \(m_\text {max}\).

Since according to Sect. 4.2 a conservative choice for the upper end of the range is \(m_\text {max} \simeq M\), we set \(m_\text {max}^2 = {1}~\text {GeV}^{2}\). In order to determine an appropriate value for \(m_\text {min}\), one has to estimate isovector contributions beyond the \(\pi ^0\)-pole. Following our argument in Sect. 4.2, it is sufficient to restrict ourselves to single-particle intermediate states and focus on the one giving the largest effect at energies around m. We assumed this to be given by the pseudoscalar \(\pi (1300)\) for the following reasons. Models for tensor mesons around 1 GeV give similar or smaller contributions to \(a_\mu ^\text {HLbL}\) [16, 86, 87]. For ground-state axials, recent studies based on different approximations and hadronic models yield quite different numerical results, see e.g. Refs. [40, 41, 86, 88], leading to large uncertainties. If future model-independent analyses show that axial-meson exchanges are responsible for significant effects in also at relatively small momenta, then these contributions should be added to the pion pole before the matching is performed since our procedure relies on a sufficiently precise knowledge of below \(\varSigma ^\text {match}(r,\phi )\). Neglecting issues related to model dependence, in Sect. 5.4 we will discuss the inclusion in our procedure of information from holographic QCD on the lightest axial meson.

As for the light pseudoscalars, the interaction of \(\pi (1300)\) with two photons can be described by a TFF, which determines the contribution to \({\bar{\varPi }}_1\) as in Eq. (10). In our analysis we used as input the \(\pi (1300)\) TFF derived in Ref. [89] in the framework of Resonance Chiral Theory (R\(\chi \)T) [90]. We fixed the free parameters in Eq. (69) of Ref. [89] by requiring that (i) the \(\pi ^0\)-TFF satisfies the Brodsky–Lepage condition, i.e. \(F_{\pi ^0\gamma ^*\gamma ^*}(-Q^2,0) = 2 F_\pi /Q^2 +{\mathscr {O}}(Q^{-4})\) and (ii) the two-real-photon limit of the excited pion TFF is in the range \(F_{\pi (1300)\gamma ^*\gamma ^*}(0,0)\in [0,{0.0544}]~\text {GeV}^{-1}\), argued for in Ref. [33] based on experimental results [91, 92]. The upper boundary of this interval leads to the most conservative error estimate in our analysis, and is used in the following.

Our procedure to determine \(m_\text {min}\) can be illustrated by means of Fig. 3. Given a value of \(m_\text {min}\), at large enough \(\varSigma \), the range of interpolants (orange band) spanned by \(m\in [m_\text {min},m_\text {max}]\) safely includes the green band representing the sum of the contributions from \(\pi ^0\) and \(\pi (1300)\), including errors on the latter due to the range for \(F_{\pi (1300)\gamma ^*\gamma ^*}(0,0)\). Since this is not the case at small \(\varSigma \), the contribution to \(a_\mu ^\text {long}\) from this region is underestimated in our approach. To gauge this effect, we calculated the integral in Eq. (9) with

(51)

using the maximal \(\pi (1300)\) contribution and restricting the \(\varSigma \)-domain to the region below the point where the bands start to fully overlap (as a function of r and \(\phi \)). This integral gives the missed contribution at a fixed \(m_\text {min}\). Repeating this calculation for different values of \(m_\text {min}\) yields the function and by inverting this, we determined \(m_\text {min}\) by fixing to values well below the accuracy goal set by forthcoming experimental results. For we obtained \(m_\text {min, 1}^2 = {0.35}~\text {GeV}^{2}\) and for , \(m_\text {min, 2}^2 = {0.13}~\text {GeV}^{2}\).

Numerically, \(m_\text {max} = {1}\,\text {GeV}\) leads to the shift and the two values \(m_\text {min, 1}\) and \(m_\text {min, 2}\) yield and , respectively. If we add to the latter numbers, we obtain the conservative estimates

(52)

In the following we will use for the main results and keep as an alternative, even more conservative uncertainty.

We also considered a different parameterization for the \(\pi (1300)\) TFF, namely the one given by the Regge model in Refs. [32, 33]. Using the empirical \(m_{\pi (1300)} = {1.30}\,\text {GeV}\) instead of the Regge-model value of 1.36 GeV used in these references and following the same procedure discussed above, we obtained \(m_\text {min, 1}^2 = {0.53}\,\text {GeV}^{2}\) and \(m_\text {min, 2}^2 = {0.20}\,\text {GeV}^{2}\). This leads to

(53)

where we again added \(a_\mu ^\text {missed}\) to the uncertainties in the upward direction. For our final result we use the more conservative uncertainty estimates given in Eq. (52).

In order to study the effects of the polynomial in \(\varSigma ^\text {match}(r,\phi )\), Eq. (40), we set \(M=2\) and sampled the free parameters according to a standard normal distribution. Since the pion pole gives an excellent approximation of \({\bar{\varPi }}_1\) for very small \(\varSigma \) at any \((r,\phi )\), we only allowed for parameters giving \(\varSigma ^\text {match}(r,\phi )>\varSigma _t\) for all \((r,\phi )\), where \(\varSigma _t\) is defined as the smallest value of \(\varSigma \) such that

$$\begin{aligned} \frac{{\bar{\varPi }}_1^{\pi (1300)\text{- }pole}(\varSigma ,r,\phi )}{{\bar{\varPi }}_1^{\pi ^{0}\text {-pole}}(\varSigma ,r,\phi )} = 0.02 \end{aligned}$$
(54)

holds for some \((r,\phi )\). With \({\bar{\varPi }}_1^{\pi (1300)\text{- }pole}\) calculated using R\(\chi \)T, we obtained \(\varSigma _t = {0.57}\,\text {GeV}^{2}\). This condition ensures that there are no large contributions from our interpolation at points where R\(\chi \)T predicts a very small excited pion contribution. From this we calculated a distribution of results for , which features a Gaussian-like peak close to the reference result and asymmetric tails, and read off the 16% quantiles from both sides corresponding to the \(1\sigma \) errors for a Gaussian. This gives

(55)

We have checked that this result is stable against the inclusion of terms of order 3 in the polynomial and moderate changes in the value of the ratio in Eq. (54).

5.1.5 Estimate of the effects of longitudinal SDCs in the isovector channel

Table 1 The effects on \(a_\mu ^{\text {HLbL}}\) of longitudinal SDCs assuming that the low-energy region is dominated by ground-state pseudoscalar poles, whose contributions are taken as input. In each flavor channel the results are presented as the shifts \(\varDelta a_{\mu ,\text {ref}}\) with respect to the pole contributions for a specific reference set of parameters and a list of uncertainties corresponding to different choices for each of these parameters. In the last two rows, these uncertainties are added in quadrature and the final range is symmetrized. See main text for details
Fig. 4
figure 4

Relative contributions to the total uncertainty in the isovector channel. For asymmetric errors the mean of the squared errors is used

The \(\pi ^0\)-column of Table 1 collects all uncertainties in our estimate of the effects of longitudinal SDCs in the isovector channel, as described in the previous subsections. By combining them in quadrature we get

(56)

Since we do not regard the reference parameterization as the central value, we symmetrized the uncertainty to finally obtain the range

(57)

Using instead in Eq. (52), the final result would be \((3.8 \pm 2.7)\times 10^{-11}\). Notice that, despite the fact that it likely overestimates the range of longitudinal short-distance effects, this interval is still definitely compatible with the current precision goal.

Figure 4 shows the contributions to the quadratic error from the different sources discussed above. The vastly dominant effect stems from the interpolation between low and high energies, with an especially crucial role played by the choice of m, the scale at which the matching between the low-energy representation of \({\bar{\varPi }}_1\) and the interpolant is performed. The uncertainties \(\delta _\text {int}\), \(\delta _m\) and \(\delta _{P(x,y)}\) could be reduced by additional low-energy input concerning further intermediate states and higher-order terms in the symmetric and asymmetric OPEs, which would help constrain the coefficients \(b_i(r,\phi )\) in the interpolants in Eqs. (37) and (38). The uncertainties related to the perturbative corrections are considerably smaller. While we do not expect that calculations of \(\alpha _s\) corrections will crucially improve the final estimate, these perturbative results will definitely be important to better assess the regime of validity of the asymptotic constraints and thereby verify and sharpen some of our assumptions.

We have also checked that our results are robust against the choice of different reference sets of parameters. For example, if we set \(m_\text {ref}\) equal to the previous boundary values for the uncertainty in the reference configuration, namely \(m_\text {ref}^2 = {0.35}\,\text {GeV}^{2}\) and \(m_\text {ref}^2 = {1}\,\text {GeV}^{2}\) and choose \(m^2\in [0.35, 1]\,\text {GeV}^{2}\) as the range for the error estimation as before, we get

(58)

where all other sources of uncertainty are included. We obtained similar results by selecting as reference different interpolants or different values of the number of free parameters N contained therein.

5.2 The isoscalar contributions

In this section the procedure presented above for the isovector case is applied to the isoscalar channels with \({\eta / \eta '}\)-poles as low-energy input. In our analysis, we employed the Canterbury TFFs from Ref. [51] in the reference solution.Footnote 14 We determined the parameters encoding \(\eta -\eta '\)-mixing as explained in Sects. 2.2 and 2.4 and obtained

$$\begin{aligned} C_\eta = {0.164}, \quad C_{\eta '} = {0.219}, \quad \delta _0 = {0.110}, \end{aligned}$$
(59)

which shows that \(\delta _0\) is indeed sizable.

Following the same procedure for the construction of the reference interpolant as in Sect. 5.1, we found

$$\begin{aligned} \varDelta a_{\mu , \text {ref}}^\eta = {2.58\times 10^{-11}}, \quad \varDelta a_{\mu , \text {ref}}^{\eta '} = {3.91\times 10^{-11}}. \end{aligned}$$
(60)

The uncertainty estimation proceeds in the same way as in the isovector channel up to minor modifications. Since error bands for the doubly-virtual TFFs in all kinematic configurations are not available in the literature, we estimated uncertainties by considering another TFF representation, namely the one based on Dyson–Schwinger equations (DSE) [52]. This yields \(\varDelta a_{\mu , \text {ref}}^\eta = {2.11\times 10^{-11}}\) and \(\varDelta a_{\mu , \text {ref}}^{\eta '} = {4.20\times 10^{-11}}\). The fact that individual results for \(\eta \) and \(\eta '\) channels differ by 18 and 8%, but the sum only by 3% can be understood by comparing the mixing parameters

$$\begin{aligned} C_\eta ^\text {DSE} = {0.148}, \quad C_{\eta '}^\text {DSE} = {0.228}, \quad \delta _0^\text {DSE} = {0.127}, \end{aligned}$$
(61)

against those obtained from the Canterbury parameterization. These coefficients enter quadratically, which leads to a reshuffling between and . Due to Eq. (24) this effect drops out in the sum up to the anomaly correction affecting the OPE regime. As TFF contribution to the uncertainty on we took the absolute value of the differences between the results from the Canterbury and DSE TFFs.

Since NLO results are not available for the \({\eta / \eta '}\) TFFs, we estimated the NLO OPE uncertainty by simply rescaling the one in the pion channel by the ratio of the reference outcomes. Due to the smallness of this uncertainty, this is expected to be sufficiently accurate.

For the range \([m_\text {min}, m_\text {max}]\) and the minimal allowed value \(\varSigma _t\) for \(\varSigma ^\text {match}\) in the Monte Carlo simulation for P(xy), we took the results from the isovector channel. We rescaled the \(\pi (1300)\) term below the matching surface by the ratio of reference results when adding this contribution to . This is justified by the fact that the first excited pseudoscalars in the three flavor channels have similar masses, despite the large mass difference of the pseudo-Goldstone bosons.

All results are collected in Table 1 and our final estimate for the longitudinal short-distance effects in reads

(62)

The relative contributions to the uncertainties are similar to the pion case illustrated in Fig. 4. A more precise description of \(\eta -\eta '\)-mixing would of course help better separate the two isoscalar channels but would not play an important role in their sum leading to negligible shifts in the total contribution from longitudinal SDCs.

5.3 Sum over the flavor channels and comparison with literature

Combining the results from Sects. 5.1 and 5.2, obtained under the assumption that the ground-state pseudoscalar mesons dominate the low-energy region, our estimate for the total effect of the longitudinal SDCs on HLbL amounts to

(63)

where we have combined the three uncertainties linearly since they originate from the same sources in all three channels.

This result is remarkably close to what is expected based on flavor symmetry considerations. If the U(3) symmetry emerging in the combined chiral and large-\(N_c\) limit is assumed, then . Using our isovector uncertainty and adding linearly a standard 30% U(3) breaking effect, we obtain

$$\begin{aligned} \varDelta a_\mu ^\text {long}= (10.4 \pm 8.3)\times 10^{-11}. \end{aligned}$$
(64)

For this reason we do not expect that a more refined analysis of the subtler isosinglet contributions is going to change substantially our final results.

References [32, 33] have recently studied the possibility of saturating SDCs away from the chiral limit by including a tower of excited pseudoscalar states in the context of a Regge model matched to the pQCD quark loop. Their outcome is \(\varDelta a_\mu ^\text {long}= (13 \pm 6)\times 10^{-11}\), which is well compatible with ours within errors. For the \(\eta '\)-channel, the Regge model yields , which is somewhat larger than our result but still compatible within errors.Footnote 15 This can partly be explained by the different value for \(C_{\eta '}\) used in Refs. [32, 33], namely \(C_{\eta '} = 0.239\), which results from imposing that Eq. (24) holds exactly.

Fig. 5
figure 5

as a function of a lower limit on \(Q_3^2\) in Eq. (9): our reference result and corresponding error band against the tower of excited pseudoscalars in the large-\(N_c\) Regge model 1 of Refs. [32, 33] and the curve from the MV model [25] evaluated using the up-to-date dispersive pion TFF. At small non-vanishing \(Q_{3,\text {min}}^2\), our reference curve is constant due to the finite \(\varSigma ^\text {match}\), which for \(P(x,y)=0\) corresponds to \(m^2 = Q_3^2 = \text {const.}\), whereas the Regge model has a slope due to the absence of such a cutoff. The upper end of our error band shows a slope because of the inclusion of the \(\pi (1300)\) contribution in that region ss

Figure 5 shows as a function of a lower cutoff on \(Q_3^2\) in our approach as well as the large-\(N_c\) Regge model 1 of Refs. [32, 33]. In order to obtain this plot, we calculated the integral in Eq. (9) as a function of a lower limit on \(Q_3^2\) (which depends on \(\varSigma \), r and \(\phi \)) for both the full as well as the pion pole contribution (cf. Eq. (41)). The Regge model result lies within our error band for all \(Q_{3,\text {min}}^2\).

Our estimate of longitudinal short-distance effects as well as the one in Refs. [32, 33] are smaller than the shift obtained in Ref. [25], \(\varDelta a_\mu ^\text {long}= 23.5\times 10^{-11}\), which even increases to about \(38\times 10^{-11}\) if up-to-date TFF input is used [33]. These large values are due to two features of the MV model: the fact that the singly-virtual TFF is set to a constant over the whole integration region and not only in the OPE regime, and the fact that in the symmetric asymptotic limit the parametric momentum dependence is correct but its coefficient is too large. Both of these features can be clearly seen in Fig. 5 and are responsible for the discrepancies in the slope at small \(Q_{3,\text {min}}^2\) and the values at large \(Q_{3,\text {min}}^2\), respectively.

References [40, 41] have studied how the inclusion of an infinite tower of axial-vector mesons could help satisfy the OPE SDCs, focusing for this purpose on the relevant TFFs in the context of holographic QCD models. According to Ref. [40], the tower of axial-vector mesons contributes \((29{-}41)\times 10^{-11}\) to \(a_\mu ^\text {HLbL}\) of which (57–58)% are attributed to \(a_\mu ^\text {long}\). Using instead holographic QCD input only for the momentum dependence of the TFF and fixing its normalization from experiment reduces the estimate of the contribution to \(a_\mu ^\text {HLbL}\) from the tower of axials to \((22\pm 5)\times 10^{-11}\). Reference [41] finds \(14\times 10^{-11}\) for the effect of axials on \(a_\mu ^\text {long}\). Thus, the results of these studies appear to be at the high end of our range in Eq. (63). However, we stress that comparing these numbers with our result is not properly justified. Indeed, while in these models the parametric \(\varSigma \)-dependences implied by pQCD and the OPE in the respective limits are correctly reproduced, the coefficients thereof are typically too small. In addition, the lightest multiplet of axials significantly alters \({\bar{\varPi }}_1\) at small photon virtualities, which implies that in our approach its contribution should be included in the low-energy representation. This aspect will be discussed in the next section, also to show how information on additional states in the 1 GeV region can be incorporated in our analysis.

5.4 Including ground-state axial mesons at low energies

Here we adopt a model-dependent approach to illustrate the application of our procedure to the case of the inclusion in the low-energy region of the lightest of the axial-vector mesons, for which no dispersive treatment in the BTT formalism is available yet. According to the holographic QCD models in Refs. [40, 41] and using the notation of Ref. [40], the contribution to \({\bar{\varPi }}_1\) of an axial meson of mass \(M_A\) in the flavor channel a can be written as

(65)

where \(A(Q_1^2, Q_2^2)\) is the axial TFF.

Among the various scenarios analyzed in Ref. [40], the hard-wall model by Hirn and Sanz (HW2) [93], which was also studied in Ref. [41] with different parameters, reproduces best the measured mass, the measured equivalent two-photon decay width and the singly virtual momentum dependence measured by L3 for the lightest multiplet [94, 95]. Furthermore, it yields asymptotic axial TFFs whose momentum dependence is consistent with the behavior derived in Ref. [29]. The infinite tower of axials has the correct momentum scaling in the asymmetric asymptotic regime dictated by the OPE constraint, but the coefficient is 38% too small [40].

We focused on the isovector channel, which is sufficient for our illustrative purposes, and thus on the inclusion of the \(a_1\) meson. Based on the HW2 model, we obtained \(a_{\mu ,\text {HW2}}^{a_1} = 3.3\times 10^{-11}\) for the \(a_1\) contribution to \(a_\mu ^\text {long}\). The rest of the tower of isovector axial mesons in this model yields , implying that in this framework about 80% of the total effect comes from the lightest state.Footnote 16

By matching the interpolant in Eq. (37) to the contributions from the pion and the holographic \(a_1\) with the reference set of assumptions in Sect. 5.1, we obtainedFootnote 17

(66)

This result is more than twice as large as the resummed tower in HW2, , but the significance of this discrepancy could only be assessed by a more sophisticated analysis including uncertainties, which is beyond the scope of this work. However, in the holographic model the infinite tower of axials does not fully saturate the pQCD nor the OPE constraints, which suggests that additional degrees of freedom besides axials should be included in a more realistic model.

We then considered the choice of parameters made in Ref. [41] and referred to as HW2(UV-fit) in Ref. [40]. This model is constructed to obey the OPE constraint exactly, but fails to describe low-energy physics like the \(\rho \)-meson mass, the pion TFF and the axial TFFs measured by L3. The longitudinal contribution from \(a_1\) in this case amounts to \(a_{\mu ,\ \text {HW2(UV-fit)}}^{a_1} = 3.4\times 10^{-11}\) and the tower of states increases the value by . Our reference interpolant leads to , which is again larger than the model result. However, also in HW2(UV-fit) the pQCD constraint is not fully fulfilled by the tower of axials.

Neglecting issues related to intrinsic model dependence in the low-energy input, our method based on interpolants that by construction satisfy all constraints indicates that the effects of longitudinal SDCs are relatively small compared to the dominant low-energy contributions, and what is crucial in order to achieve higher precision is to gain control over the latter. We stress that a reliable prediction with a robust uncertainty estimate of the effects of axial meson exchanges would require model-independent input information.

6 Conclusions

In this paper we have introduced a novel approach to incorporate longitudinal SDCs into the calculation of the HLbL contribution to the muon \(g-2\). At variance with the previous estimates based on hadronic models, we have constructed general functions interpolating between low-, mixed- and high-energy regions, without resorting to specify which and how hadronic intermediate states are responsible for saturating the constraints. Furthermore, our method allows us also to study in detail the role played by parameters and assumptions in a transparent and numerically efficient way.

Our main premise is that an accurate low-energy representation of the longitudinal function \({\bar{\varPi }}_1(Q_1^2, Q_2^2, Q_3^2)\) entering the HLbL integral can be obtained by taking into account only intermediate states that are under good theoretical and numerical control. For the \(\pi ^0\), due to the location of its pole, the form of this low-energy representation can be straightforwardly extended even to large \(Q_1^2\) and \(Q_2^2\) as long as \(Q_3^2\) stays small. Using available input for the \(\pi ^0\)-pole term, we find that the shift due to longitudinal SDCs on the isovector part of \(a_\mu ^\text {HLbL}\) is in the range \((2.6\pm 1.5)\times 10^{-11}\). By including in the analysis also the isoscalar components, which the \(\eta \)- and \(\eta '\)-poles are assumed to dominate at low energies, we obtained that longitudinal SDCs increase \(a_\mu ^\text {HLbL}\) by \((9.1\pm 5.0)\times 10^{-11}\) in total. The quoted ranges encompass uncertainties in the low-energy input, perturbative corrections and fitting errors at asymptotic momenta, parametric variations of the functional form of the interpolants and of the matching surface, at which these functions are matched to the low-energy input, with the latter dominating the total uncertainty. Thus, according to our analysis, infinite towers of states heavier than 1 GeV, albeit crucial for the saturation of SDCs, give a relatively small contribution to \(a_\mu ^\text {HLbL}\) and this effect can be estimated with sufficient precision using our method. Conversely, states with masses around 1 GeV contributing significantly to the low-energy region play a decisive role also in a precision determination of short-distance effects.

Our result for the effects of longitudinal SDCs on \(a_\mu ^\text {HLbL}\) agrees with recent model estimates [32, 33], fulfills the accuracy goal set by the forthcoming experimental results and is significantly smaller than the earlier model result of Ref. [25], especially when up-to-date TFF input is used. Furthermore, neglecting issues concerning intrinsic model dependence and the fact that holographic QCD calculations in Refs. [40, 41] do not completely saturate the SDCs, we find in agreement with these studies that the infinite tower of axials has a relatively small impact on the longitudinal part of \(a_\mu ^\text {HLbL}\) if the lightest multiplet is treated explicitly as a low-energy contribution.

It will be straightforward to incorporate in our approach model-independent information on further intermediate states as well as higher-order corrections to asymptotic expressions once these become available. Furthermore, our method can be generalized to the case of transversal SDCs. Therefore, it paves the way for a combination of all available low- and high-energy information on HLbL into one model-independent, accurate numerical estimate of this contribution to the muon \(g-2\).