1 Introduction

SU(3) flavor symmetry and its breaking play an important role in our understanding of the strong interaction in the non-perturbative regime. In the limit of an exact SU(3) flavour symmetry, one can relate the magnetic moments of the octet baryons and the \(\Lambda \Sigma ^0\) transition to those of the proton and the neutron via the celebrated Coleman-Glashow formulae [1]. Nonetheless, in nature SU(3) flavor symmetry is broken. This must be properly taken into account in order to improve the description of the baryon magnetic moments by inducing a realistic SU(3)-breaking mechanism. Chiral perturbation theory (ChPT), the low-energy effective field theory of QCD (see e.g. [2,3,4,5,6]), provides an appropriate framework to tackle this problem in a systematic fashion. However, it was noticed long ago that the leading order (LO) chiral corrections are large and tend to worsen the results, as exemplified, e.g., in [7,8,9,10]. This issue has often been used to question the validity of the SU(3) baryon ChPT altogether, see e.g. [11].

It was shown in Ref. [12] that one can achieve, however, an order by order improvement in the description of the octet baryon magnetic moments with the extended-on-mass-shell (EOMS) formulation of baryon ChPT [13, 14].Footnote 1 Although it seems that the puzzle has been solved, a natural question is what happens at the next-to-next-to-leading order (NNLO). Because of the increased number of unknown low-energy constants (LECs), a clear answer to this question has not yet been provided.

In the last two decades several calculations of the octet baryon magnetic moments in heavy baryon (HB) ChPT up to NNLO have been performed both with [9, 18, 19] and without [20] the inclusion of the baryon decuplet. It was shown in Ref. [20] that at NNLO the convergence of the HBChPT is quite good, contrary to the pattern exhibited at NLO. One should note that, however, in Ref. [20] the contributions of the two NNLO LECs \(b_6^{D',F'}\) are absorbed into the two LO ones \(b_6^{D,F}\). This is legitimate as long as one works at the physical quark masses, as the primed LECs merely amount to a quark mass dependent shift of the unprimed ones that can not be disentangled.

Despite of all these studies, it remains unclear whether the convergence pattern of the HB [9], the infrared (IR) [10], and the EOMS formulation [12] observed in the description of the octet baryon magnetic moments, one of the cleanest observables, is particular to NLO and accidental, where no unknown LECs contribute, or it might be a more genuine feature of different formulations. Given that ChPT plays an indispensable role in our understanding of low energy strong interaction physics, it is of utmost importance to clarify this puzzling situation.

In this work we address this question by performing a study of the octet baryon magnetic moments at NNLO using the EOMS renormalization scheme. Since at this order the number of LECs is larger than that of the data, we first use convergence as a criterion to constrain the two LO LECs, \(b^{D,F}_6\), which cannot be distinguished from the two NNLO LECs \(b_6^{D',F'}\) at the physical point. We then predict the light quark mass dependence of the octet baryon magnetic moments and contrast them with the results of the HB and IR formulations, and the state of the art lattice QCD simulations. Second, we fit the LECs to the lattice QCD data and then predict the strange quark mass dependence of the magnetic moments. It is shown that depending on how one determines the LECs, the predicted dependence is rather different, which could be investigated in more detail by future lattice QCD simulations.

This paper is organized as follows. In Sect. 2, we present the theoretical framework. Results and discussions are given in Sect. 3, followed by a short summary in Sect. 4.

2 Theoretical framework

The octet baryon magnetic moments are defined via baryon matrix elements of the electromagnetic current \(J_\mu \) as follows

$$\begin{aligned} \left\langle {\bar{B}}|J_\mu |B \right\rangle =\bar{u}_B(p_f)\left[ \gamma _{\mu } F_{1}^{B}(t) +\frac{i\sigma _{\mu \nu }q^{\nu }}{2m}F_{2}^{B}(t)\right] u_B(p_i),\nonumber \\ \end{aligned}$$
(1)

where \(\bar{u}_B\) and \(u_B\) are Dirac spinors, m is the baryon mass, and \(F_1^B\) and \(F_2^B\) denote the Dirac and Pauli form factor, respectively. The four-momentum transfer is defined as \(q=p_f-p_i\) and \(t=q^2\). At \(t=0\), \(F_{2}^{B}(0)\) is the so-called anomalous magnetic moment, \(\kappa _{B}\), and the magnetic moment is \(\mu _{B}=\kappa _{B}+ q_{B}\), with \(q_B\) the charge of the baryon.

In ChPT, one can calculate the baryon magnetic moments order by order, i.e.,

$$\begin{aligned} \mu _B=\mu _B^{(2)}+\mu _B^{(3)}+\mu _B^{(4)}+\cdots , \end{aligned}$$
(2)

where the numbers in the superscripts are the chiral order, defined as \(n_\mathrm {ChPT}=4L-2N_M-N_B+\sum _k k V_k\) for a properly renormalized diagram with L loops, \(N_M\)(\(N_B\)) meson (baryon) propagators, and \(V_k\) vertices from the \(k^{\mathrm{th}}\) order Lagrangian. Because of the large non-zero baryon mass in the chiral limit, this power counting is broken in a naive application of the \(\overline{\mathrm {MS}}\) regularization scheme [3]. To recover the power counting, several approaches have been proposed, such as the HB method [21, 22], the IR approach [23] and the EOMS scheme [13, 14]. In recent years, it has been shown that the EOMS scheme has some advantages because it satisfies all symmetry and analyticity constrains and converges relatively faster in certain cases, see e.g. Ref. [24] for a short review.

Fig. 1
figure 1

Feynman diagrams contributing to the octet baryon magnetic moments up to NNLO. a Contributes to LO, while b and c depict the NLO corrections. All other diagrams represent the NNLO contribution. Solid, dashed and wiggly lines denote baryons, Goldstone bosons and photons, respectively. The small and medium solid dots refer to vertices obtained from \({\mathcal {L}}_{MB}^{(1)}\) and \({\mathcal {L}}_B^{(1)}\), in order. The heavy dots refer to vertices from \({\mathcal {L}}_{MB}^{(2)}\) . The circles refer to vertices from \({\mathcal {L}}_{M}^{(2)}\). The crosses refer to mass insertions \(\sim \) \(b_{D,F}\). The square refers to a vertex from \({\mathcal {L}}_{MB}^{(4)}\). The partners of the wave function renormalization (g) and baryon mass insertion (i) are not shown explicitly

The Feynman diagrams needed to calculate \(\mu _B\) up to NNLO are shown in Fig. 1. The LO contributions are provided by the following Lagrangian [25],

$$\begin{aligned} {\mathcal {L}}_{MB}^{(2)}=\frac{b_{6}^{D}}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left\{ F_{\mu \nu }^{+},B\right\} \right\rangle +\frac{b_{6}^{F}}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left[ F_{\mu \nu }^{+},B\right] \right\rangle ,\nonumber \\ \end{aligned}$$
(3)

where \(\sigma ^{\mu \nu }=\frac{i}{2}\left[ \gamma ^{\mu } ,\gamma ^{\nu }\right] \), \(F_{\mu \nu }^{+}=(u^{\dag }QF_{\mu \nu }u+uQF_{\mu \nu }u^{\dag })\), \(Q=|e|\mathrm {diag}(2,-1,-1)/3\) is the quark charge matrix with e the electron charge, \(u= \mathrm {exp}\left[ i\Phi /2F_\phi \right] \), with \(\Phi \) the unimodular matrix containing the pseudoscalar nonet, \(F_\phi \) the pseudoscalar decay constant, and \(F_{\mu \nu }=\partial _{\mu }A_{\nu }-\partial _{\nu }A_{\mu }\) is the conventional photon field strength tensor. Moreover, \(\langle \ldots \rangle \) denotes the trace in flavor space. At LO, there are only two LECs, \(b_{6}^{D}\) and \(b_{6}^{F}\).

The leading SU(3) breaking corrections come from loop diagrams. They arise at order \({\mathcal {O}}(p^3)\) in the chiral counting and are determined completely in terms of the lowest order LECs from \({\mathcal {L}}^{(1)}_B+{\mathcal {L}}^{(2)}_{M} + {\mathcal {L}}^{(1)}_{MB}\), namely,

$$\begin{aligned} {\mathcal {L}}_{B}^{(1)}= & {} \left\langle \bar{B}i\gamma ^{\mu } D_\mu B - m_0 {\bar{B}} B\right\rangle , \end{aligned}$$
(4)
$$\begin{aligned} {\mathcal {L}}_{M}^{(2)}= & {} \frac{F_\phi ^{2}}{4}\left\langle u_{\mu }u^{\mu }+\chi ^{+}\right\rangle , \end{aligned}$$
(5)
$$\begin{aligned} {\mathcal {L}}_{MB}^{(1)}= & {} \frac{D}{2}\left\langle \bar{B}\gamma ^{\mu }\gamma ^{5}\left\{ u_{\mu },B\right\} \right\rangle +\frac{F}{2}\left\langle \bar{B}\gamma ^{\mu }\gamma ^{5}\left[ u_{\mu },B\right] \right\rangle , \end{aligned}$$
(6)

where \(m_0\) denotes the baryon mass in the chiral limit, \(D_\mu B=\partial _\mu B+[\Gamma _\mu , B]\), \(\Gamma _\mu =\frac{1}{2}(u^\dagger \partial _\mu u + u\partial _\mu u^\dagger )-\frac{i}{2}(u^\dagger v_\mu u + u v_\mu u^\dagger )\), with \(v_\mu \) the vector source, \(u_{\mu }=i(u^{\dag }\partial _{\mu }u-u\partial _{\mu }u^{\dag })+(u^\dagger v_\mu u- uv_\mu u^ \dagger )\) and \(\chi ^{\pm }=u^{\dag }\chi u^{\dag }\pm u\chi ^\dagger u\). Here, \(\chi =2B_{0}{\mathcal {M}}\), with \({\mathcal {M}}\) the quark mass matrix \({\mathcal {M}}=\mathrm {diag}(m_l,m_l,m_s)\). In what follows, we work in the isospin limit \(m_l=(m_u+m_d)/2\). Further, \(B_{0}=|\langle 0|\bar{q}q|0\rangle |/F_\phi ^{2}\). The axial vector couplings D and F are determined from hyperon decays to be \(D=0.8\) and \(F=0.46\), and these values will be taken throughout.

The explicit expressions of the LO and NLO results can be found in Ref. [12]. In the following, we focus on the NNLO contributions. At this order one has to include one-loop diagrams with exactly one vertex from \({\mathcal {L}}^{(2)}_{MB}\) as well as additional tree contributions from \({\mathcal {L}}^{(4)}_{MB}\). The fourth order contribution to \(\mu _B\) is given as (see Fig. 1):

$$\begin{aligned} \mu ^{(4)}= & {} \mu ^{(4,d)}+\mu ^{(4,e)}+\mu ^{(4,f)}+\mu ^{(4,g)}\nonumber \\&\quad +\,\mu ^{(4,h)}+\mu ^{(4,i)}+\mu ^{(4,j)}. \end{aligned}$$
(7)

The terms contributing to \(\mu ^{(4,d)}_{B}\) collect the tree contributions with exactly one insertion from the following chiral Lagrangian [20, 26],

$$\begin{aligned} {\mathcal {L}}_{MB}^{(4)}= & {} +\frac{b_6^{D'}}{8m_0}\left\langle \chi ^{+}\right\rangle \left\langle \bar{B}\sigma ^{\mu \nu }\{F_{\mu \nu }^{+},B\}\right\rangle \nonumber \\&+\frac{b_6^{F'}}{8m_0}\left\langle \chi ^{+}\right\rangle \left\langle \bar{B}\sigma ^{\mu \nu }[F_{\mu \nu }^{+},B]\right\rangle \nonumber \\&+\frac{\alpha _1}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left[ \left[ F^{+}_{\mu \nu },B\right] ,\chi ^{+}\right] \right\rangle \nonumber \\&+\frac{\alpha _2}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left\{ \left[ F^{+}_{\mu \nu },B\right] ,\chi ^{+}\right\} \right\rangle \nonumber \\&+\frac{\alpha _3}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left[ \left\{ F^{+}_{\mu \nu },B\right\} ,\chi ^{+}\right] \right\rangle \nonumber \\&+\frac{\alpha _4}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }\left\{ \left\{ F^{+}_{\mu \nu },B\right\} ,\chi ^{+}\right\} \right\rangle \nonumber \\&+\frac{\beta _1}{8m_0}\left\langle \bar{B}\sigma ^{\mu \nu }B\right\rangle \left\langle \chi ^{+}F_{\mu \nu }^{+}\right\rangle , \end{aligned}$$
(8)

where \(b_6^{D'},\) \(b_6^{F'},\) \(\alpha _{1,2,3,4}\) and \(\beta _1\) are LECs.

At this order, we also have to consider double derivative operators at the meson-baryon vertex with the photon hooking on to the meson loop, see Fig. 1h. The corresponding terms of the dimension two Lagrangian read [20, 26]

$$\begin{aligned} {\mathcal {L}}_{MB}^{(2')}=\frac{i}{2}\left\{ b_{9}\left\langle \bar{B}\sigma ^{\mu \nu }u_\mu \right\rangle \left\langle u_\nu B\right\rangle +b_{10,11}\left\langle \bar{B}\sigma ^{\mu \nu }\left( \left[ u_\mu ,u_\nu \right] ,B\right) _{\pm }\right\rangle \right\} ,\nonumber \\ \end{aligned}$$
(9)

where \(b_{9,10,11}\) are LECs. They are estimated via resonance saturation in Ref. [20] and re-evaluated at \(m_0=0.94\) GeV, yielding \(b_9=0.43\) GeV\(^{-1}\), \(b_{10}=0.86\) GeV\(^{-1}\), and \(b_{11}=0.45\) GeV\(^{-1}\). We will call these values of the LECs set I. An improved determination has been given in Ref. [10], the corresponding values are \(b_9=1.36\) GeV\(^{-1}\), \(b_{10}=1.24\) GeV\(^{-1}\), and \(b_{11}=0.46\) GeV\(^{-1}\). We will refer to these as set II.

At NNLO, one also needs the LO chiral corrections to the baryon masses, which are provided by the following Lagrangian [25]:

$$\begin{aligned} {\mathcal {L}}_{MB}^{(2'')}=b_D\left\langle \bar{B}\left\{ \chi _{+},B\right\} \right\rangle +b_F\left\langle \bar{B}\left[ \chi _{+},B\right] \right\rangle . \end{aligned}$$
(10)

The two LECs \(b_D\) and \(b_F\) are fixed by fitting to the octet baryon mass splittings at \({\mathcal {O}}(p^2)\), yielding \(b_D=0.066\) GeV\(^{-1}\) and \(b_F=-0.21\) GeV\(^{-1}\).

The NNLO tree-level contributions can be obtained rather straightforwardly. The results are shown in the Appendix. Following the EOMS prescription to restore the power counting, we obtain the following NNLO loop results,

$$\begin{aligned} \mu _B^{(4,e)}(M_\phi )= & {} C_B^{(4,e)}(\phi )\frac{M_\phi ^2}{16\pi ^2 F_{\phi }^2}\log \left( \frac{M_\phi ^2}{\lambda ^2}\right) , \end{aligned}$$
(11)
$$\begin{aligned} \mu _{B}^{(4,f)}(M_\phi )= & {} \frac{-C_{B}^{(4,f)}(\phi )}{16 \pi ^2 F_{\phi }^2 m_0^2 \sqrt{4 m_0^2-M_\phi ^2}}\nonumber \\&\times \left\{ \phantom {\left. \left. -M_\phi ^4\log \left( \frac{M_\phi ^2}{m_0^2}\right) \right\} \right\} }\left( 2 M_\phi ^5-4 m_0^2 M_\phi ^3\right) \cos ^{-1} \left( \frac{M_\phi }{2m_0}\right) \right. \nonumber \\&\left. +\sqrt{4 m_0^2-M_\phi ^2} \left\{ m_0^2M_\phi ^2 \left[ \log \left( \frac{M_\phi ^2}{\lambda ^2}\right) +2\right] \right. \right. \nonumber \\&\left. \left. -M_\phi ^4\log \left( \frac{M_\phi ^2}{m_0^2}\right) \right\} \right\} , \end{aligned}$$
(12)
$$\begin{aligned} \mu _{B}^{(4,g)}(M_\phi )= & {} \frac{C_{B}^{(4,g)}(\phi ) M_\phi ^2}{16\pi ^2 F_\phi ^2 m_0^2 \left( 4 m_0^2-M_\phi ^2\right) } \nonumber \\&\times \left\{ \left( 4 m_0^2-M_\phi ^2\right) \left\{ m_0^2 \left[ \log \left( \frac{\lambda ^2}{m_0^2}\right) -4\right] \right. \right. \nonumber \\&\left. \left. +\left( 2 M_\phi ^2-3 m_0^2\right) \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right\} \right. \nonumber \\&\left. \phantom {\left\{ \left( 4 m_0^2-M_\phi ^2\right) \left\{ m_0^2 \left[ \log \left( \frac{\lambda ^2}{m_0^2}\right) -4\right] \right. \right. }+4 M_\phi \left( 3 m_0^2-{M_\phi }^2\right) \sqrt{4 m_0^2-M_\phi ^2} \cos ^{-1}\left( \frac{M_\phi }{2m_0}\right) \right\} , \nonumber \\\end{aligned}$$
(13)
$$\begin{aligned} \mu _{B}^{(4,h)}(M_\phi )= & {} C_{B}^{(4,h)}(\phi )\frac{m_0 M_{\phi }^2 \log \left( \frac{M_{\phi }^2}{\lambda ^2}\right) }{8 \pi ^2 F_{\phi }^2}, \end{aligned}$$
(14)
$$\begin{aligned} \mu _{B}^{(4,i)}(M_\phi )= & {} \frac{C_{B}^{(4,i)}(\phi )}{4 \pi ^2 m_0^3F_{\phi }^2 \left( 4 m_0^2-M_\phi ^2\right) ^{3/2}}\nonumber \\&\times \left\{ \phantom {\left. \left. -\left( M_\phi ^3-4 m_0^2 M_\phi \right) ^2 \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] \right\} } 2M_\phi \left( -16 m_0^6+30 m_0^4M_\phi ^2-10 m_0^2 M_\phi ^4+M_\phi ^6\right) \right. \nonumber \\&\left. \times \cos ^{-1}\left( \frac{M_\phi }{2m_0}\right) \right. \nonumber \\&\left. +\sqrt{4 m_0^2-M_\phi ^2} \left[ \phantom {\left. -\left( M_\phi ^3-4 m_0^2 M_\phi \right) ^2 \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] }4m_0^6-13 m_0^4 M_\phi ^2+2 m_0^2 M_\phi ^4\right. \right. \nonumber \\&\left. \left. -\left( M_\phi ^3-4 m_0^2 M_\phi \right) ^2 \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] \right\} , \end{aligned}$$
(15)
$$\begin{aligned} \mu _{B}^{(4,j)}(M_\phi )= & {} \frac{C_{B}^{(4,j)}(\phi )}{4\pi ^2F_\phi ^2m_0^3\left( 4m_0^2-M_\phi ^2\right) ^{3/2}}\nonumber \\&\times \left\{ \sqrt{4m_0^2-M_\phi ^2}\left[ \phantom {\left. \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] }-20m_0^6+15m_0^4M_\phi ^2-2 m_0^2 M_\phi ^4\right. \right. \nonumber \\&\left. \left. -\left( 4 m_0^6-21m_0^4 M_\phi ^2+9 m_0^2 M_\phi ^4-M_\phi ^6\right) \right. \right. \nonumber \\&\left. \left. \times \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] \right. \nonumber \\&\left. -2M_\phi \left( -32 m_0^6+37 m_0^4M_\phi ^2-11 m_0^2 M_\phi ^4+M_\phi ^6\right) \right. \nonumber \\&\left. \phantom {\left\{ \sqrt{4m_0^2-M_\phi ^2}\left[ \phantom {\left. \log \left( \frac{M_\phi ^2}{m_0^2}\right) \right] }-20m_0^6+15m_0^4M_\phi ^2-2 m_0^2 M_\phi ^4\right. \right. }\times \cos ^{-1}\left( \frac{M_\phi }{2m_0}\right) \right\} , \end{aligned}$$
(16)

where the coefficients \(C_B^{(4,e,f,h,i,j)}(\phi =\pi , K, \eta )\) are tabulated in the Appendix and \(\lambda \) is the renormalization scale. We have checked that our results agree with those of Ref. [20] in the heavy mass limit up to analytical terms.

3 Results and discussions

At LO and NLO, the two LECs \(b_6^{D,F}\) can be determined by fitting to the seven experimental data \(\mu _{p,n,\Lambda , \Sigma ^+,\Sigma ^-, \Xi ^{-},\Xi ^{0}}\). An extensive discussion of the EOMS results in comparison with the HB and IR results is given in Ref. [12]. At NNLO, however, there are nine LECs, two from the LO contribution and seven from the NNLO contribution. As a result, the experimental data alone can not uniquely determine all the nine LECs. In Ref. [20], the two NNLO LECs \(b_{6}^{D',F'}\) are absorbed by the two LO LECs, \(b_{6}^{D,F}\), while in the present work we keep explicitly \(b_6^{D',F'}\). Note that at the physical point, only the combinations \(\bar{b}^{D,F}_6=b_{6}^{D,F} + \langle \chi _+\rangle b_6^{D',F'}\) are relevant. In our numerical analysis, the decay constant and the chiral limit value of the baryon masses are chosen to be \(F_{\phi }=0.108\) GeV, an average of the pion, kaon, and eta decay constants, and \(m_0=0.94\) GeV, following the argument in Ref. [20]. The dimensional regularization scale is set at \(\lambda =1.0\) GeV. We also have performed calculations allowing for slight variation of \(m_0\) or \(\lambda \) about these values, e.g., \(\lambda =0.9\sim 1.1\) GeV and \(m_0=0.8\sim 1.1\) GeV, and found that such changes have negligible effects on our results. It should be noted that in principle at the order we are working, one should use the chiral limit values for D, F, and \(F_\phi \). Using the physical/average values for them will lead to differences of higher chiral order. We checked that using the chiral limit values for D, F, and \(F_\phi \), such as those determined in Ref. [27, 28], affect little our results and conclusions.

Fig. 2
figure 2

Left panel: dependence of the convergence rate \(C_R=0.6\) on \(b_6^{D,F}\) in the EOMS scheme. Right panel: convergence rates, \(C_R\), of the HB, IR, and EOMS scheme as a function of \(b_6^{D,F}\). The green shaded area in both plots are the same. The blue stars denote the optimal \(b_6^{D,F}\), located in the center of the shaded area and later used to study convergence, i.e., \((b_6^{D}, b_6^{F})=(3.54,1.17)_\mathrm {EOMS}\), \((4.22, 2.30)_\mathrm {HB}\), \((3.96, 1.00)_\mathrm {IR}\)

Table 1 Contributions of different chiral orders of the HB, IR, and EOMS schemes up to NNLO with the LECs of Table 2
Table 2 LO and NNLO low-energy constants denoted by the blue stars in Fig. 2, where \(\bar{b}^D_6\) and \(\bar{b}^F_6\) are linear combinations of the two LO and NNLO LECs, \(\bar{b}^D_6=b_6^{D}+\langle \chi _+\rangle b_6^{D'}\), \(\bar{b}^F_6=b_6^{F}+\langle \chi _+\rangle b_6^{F'}\)
Fig. 3
figure 3

Octet baryon magnetic moments as functions of the pion mass squared. The central values of the LECs are given in Table 2, while the shaded bands represent higher order contributions not considered (see text for details). The half filled square indicates that we have chosen a positive sign for \(\mu _{\Lambda \Sigma ^0}\)

3.1 Using convergence to constrain the low-energy constants

We assume that BChPT has a reasonable convergence rate in the uds three flavor sector, namely, higher order contributions are suppressed compared to lower order ones, in terms of \(M_K/\Lambda _\mathrm {ChPT}\approx 0.5\). Under this assumption, we can use the convergence criterion and the experimental data to constrain the nine LECs. More specifically, we define the convergence rate (\(C_R\)) as

$$\begin{aligned} C_R= & {} \mathrm {max}(\mu _B^{(3)}/\mu _B^{(2)},\mu _B^{(4)}/\mu _B^{(3)})\quad \mathrm {with}\nonumber \\ B= & {} p,n, \Lambda , \Sigma ^+,\Sigma ^0,\Sigma ^-,\Lambda \Sigma ^0, \Xi ^{-},\Xi ^{0}. \end{aligned}$$
(17)
Table 3 \({\tilde{\chi }}^2\) between the ChPT results obtained in the HB, IR, and EOMS schemes and the LQCD data of Refs. [30,31,32] with the optimal \(b_6^{(D,F)}\) denoted by the blue stars in Fig. 2

When fitting, we can set the convergence rate to a particular value, e.g., 0.5, and search for a combination of LECs that can satisfy such a requirement. In the left panel of Fig. 2, we show how one can fix the range of \(b_6^{D,F}\) by requiring \(C_R\le 0.6\) in the EOMS scheme. It can be seen that indeed there exist some combinations of \(b_6^{D,F}\) which can satisfy the requirement, namely the green shaded area. In the right panel of Fig. 2, we show the best convergence rate achievable in the EOMS scheme, in comparison with those in the HB and IR schemes. One can see that only in the EOMS scheme a convergence rate about 0.6 can be achieved,Footnote 2 while the convergence rates in the HB and IR schemes are relatively larger. This is consistent with the findings at NLO [12]. On the other hand, from a typical convergence pattern of the three renormalization schemes shown in Table 1 with the corresponding LECs given in Table 2, one notices that the contributions of different chiral orders in the HB ChPT are more moderate, though not as small as one naively expects.

It should be noted that in the numerical study we have taken the values given above for the LECs D, F, \(b_D\), \(b_F\), \(b_9\), \(b_{10}\), and \(b_{11}\). In principle, there are some uncertainties. These can originate either from the data used to fix them, or from the chiral orders at which they are determined, or from the validity of the assumption adopted, such as resonance saturation for the case of \(b_9\), \(b_{10}\), and \(b_{11}\). Unfortunately, there is no easy way to quantify these uncertainties. We have checked that using set II instead of set I, we find some quantitative differences but the overall trends are not affected.

Table 4 Low-energy constants determined by fitting to the lattice QCD data of Ref. [30] and the corresponding \({\tilde{\chi }}^2\) in the IR, HB, and EOMS BChPT up to NLO
Fig. 4
figure 4

Fits to the lattice QCD data of Ref. [30] at \({\mathcal {O}}(p^3)\) with the EOMS, HB, and IR BChPT. The bands represent estimated higher order corrections as explained in the text

Fig. 5
figure 5

Fits to the lattice QCD data of Ref. [30] at \({\mathcal {O}}(p^4)\) with the EOMS, HB, and IR BChPT. The bands represent higher order contributions as explained in the text

Table 5 Low-energy constants obtained by fitting to the lattice QCD data of Ref. [30] and the corresponding \({\tilde{\chi }}^2\) in the IR, HB, and EOMS BChPT up to NNLO
Table 6 Contributions of different chiral orders of the HB, IR, and EOMS schemes up to \({\mathcal {O}}(p^4)\) with the LECs obtained by fitting to the lattice QCD data
Fig. 6
figure 6

Predicted dependence of the octet baryon magnetic moments on the strange quark mass with the LECs determined by fitting to the lattice QCD data of Ref. [30] at \({\mathcal {O}}(p^4)\) with the EOMS, HB, and IR BChPT. The u / d quark mass is fixed at their physical value while the strange quark mass is proportional to \(2M_K^2-M_\pi ^2\) according to leading order ChPT. The bands represent higher order contributions

Fig. 7
figure 7

The same as Fig. 6, but with the LECs given in Table 2

One way to test the LECs determined above and also to distinguish different formulations of BChPT is to study the light quark mass dependence of the magnetic moments. Fixing the strange quark mass to its physical value with the LO ChPT relation \(2 B_0 m_s=(2 M_K^2-M_\pi ^2)|_\mathrm {phys.}\), we show the pion mass dependence of the magnetic moments in Fig. 3. The lines are obtained with the LECs tabulated in Table 2, while the bands denote higher order contributions not considered in the NNLO study. They are obtained according to Ref. [29]:

$$\begin{aligned} \delta \mu _{B}^{(i)}=\mathrm{{max}} \left( Q^{i-1}|\mu _B^{(2)}|, Q^{i+1-j}|\mu _B^{(j)}| \right) ,\quad 2\le j\le i ,\nonumber \\ \end{aligned}$$
(18)

where \(\delta \mu _B^{(i)}\) are the uncertainties of chiral order i, \(Q=M_\pi /\Lambda _{\mathrm{{ChPT}}}\), \(M_\pi =0.138~\mathrm{{GeV}}\), and \(\Lambda _{\mathrm{{ChPT}}}\sim 1\) GeV is the chiral symmetry breaking scale. In addition, the requirement [29]

$$\begin{aligned} \delta \mu _B^{(i+1)}\ge Q\delta \mu _B^{(i)} \end{aligned}$$
(19)

is also satisfied. It is clear that these contributions reflect partly the convergence pattern, i.e., the slower the convergence, the larger the higher order contributions.

One can see that the three schemes display rather different pion mass dependence. For comparison, the state of the art lattice QCD results [30,31,32] are shown as well. Note that such a comparison is only meant to be qualitative, since these simulations are performed with the strange quark mass close to its physical value but not exactly at the physical point and furthermore current lattice QCD simulations still contain systematic uncertainties not under complete control. It is clear that the EOMS results are in better agreement with the lattice data, as corroborated by the unweighted \({\tilde{\chi }}^2=\sum \left( \mu _\text {th}-\mu _\text {lattice}\right) ^2\) between the results of each scheme and the lattice QCD data [30,31,32] shown in Table 3. One should note that only at relatively large pion masses, e.g., \(M_\pi >200\) or 300 MeV, one can distinguish the results from different formulations of BChPT using lattice QCD simulations.

3.2 Fitting to the lattice QCD data

Now we take a more practical attitude, forgetting about the convergence constraint, and determine the relevant LECs by fitting to the lattice QCD data directly. Among the three lattice QCD studies we considered, the one of Ref.  [30] features the largest number of simulation points. Therefore, we fit to these data to determine the LECs. At \({\mathcal {O}}(p^3)\), one has only two LECs. Their values from the best fit together with the corresponding \({\tilde{\chi }}^2\) are tabulated in Table 4. The predicted pion mass dependence is shown in Fig. 4, where the lattice data from Refs. [31, 32] are also shown. One can see that only the EOMS formulation can describe the lattice QCD data reasonably well, consistent with the finding in the SU(2) sector [33]. Furthermore, as shown in Table 4, the LECs of the EOMS formulation determined from the fit to the lattice QCD data are similar to those determined by the experimental data [12].

The situation becomes different, however, if we fit to the lattice data with the \({\mathcal {O}}(p^4)\) BChPT results, all the three formulations can describe the lattice data with similar quality, as shown in Fig. 5 and Table 5. On the other hand, it seems that with the LECs determined from the best fit, the predicted \(\mu _{\Lambda \Sigma _0}\) by the IR formulation is in better agreement with the experimental value (if a positive sign is taken). Nevertheless, one should note that the convergence pattern, particularly those of the IR and HB schemes, is destroyed (see Table 6).

One way to distinguish the different formulations in the present case is to study the strange quark mass dependence of the magnetic moments, shown in Figs. 6, and 7. One can see that depending on how one determines the LECs, either by fitting to the lattice QCD data or to the experimental data with the convergence constraint, the dependences on the strange quark mass are quite different.Footnote 3 It is clear that one needs more lattice data with varying strange quark mass to check which scenario is more realistic.

4 Summary

We studied the octet baryon magnetic moments in baryon chiral perturbation theory with the extended-on-mass-shell renormalization scheme up to next-to-next-to-leading order. We determined the low-energy constants following two different strategies, either by fitting to the experimental data with convergence as a further constraint or by fitting to lattice QCD data directly. It was shown that in the first case the extended-on-mass shell formulation seems to describe better the lattice QCD data, while in the second case, although all three formulations of baryon chiral perturbation theory can describe the lattice QCD data, they predict rather different strange quark mass dependence. Clearly more lattice QCD simulations are needed to better understand the situation and the convergence pattern of baryon chiral perturbation theory.