1 Introduction

The up- and down-quark masses are fundamental parameters in Quantum Chromodynamics (QCD), and they play important roles in the phenomenological study of the Standard Model. However, they are not physical observables, due to the confinement nature of QCD. Usually one needs to adopt some QCD-based non-perturbative methods, such as lattice QCD, QCD sum rules, effective field theories, etc., to determine the quark masses. A recent review of the determinations of the light quark masses from lattice QCD can be found in Ref. [1].

The QCD sum rules, developed more than forty years ago by Shifman, Vainshtein and Zakharov (SVZ) [2,3,4,5], has some peculiar advantages in the study of hadron phenomenology. Its starting point in evaluating the properties of the concerned hadron is to construct the current, which possesses the important information about the concerned hadron, like quantum numbers, constituent quarks and gluons. With the currents one can then construct the two-point correlation function, which matches the QCD and phenomenological sides of sum rules. Within the QCD sum rules, in recent years there are remarkable progresses on the determination of the light quark masses [6,7,8,9,10,11,12,13,14,15,16]. Many of these works rely on the pseudoscalar condensates or the divergences of the axial-vector currents to determine the light quark masses from QCD sum rules. The golden cases to determine the \(m_d+m_u\) and \(m_d-m_u\) would be the divergence of the isovector axial-vector current and the divergence of isovector vector current, respectively [13, 17]. Alternatively the isoscalar scalar channel can be also used to calculate the \(m_d+m_u\) [14, 18, 19]. In this situation, on the phenomenological side one needs the isoscalar scalar spectral functions, which can contain the contributions from the scalar resonances \(f_0(500)\)(\(\sigma \)), \(f_0(980)\), \(f_0(1370)\) and other heavier ones. Indeed there are serve debates on the roles of the different resonances played in the scalar QCD sum rules [14, 20,21,22], specially on the low lying states \(f_0(500)\) and \(f_0(980)\). In the present work, we attempt to address this issue by adopting the phenomenological spectral functions calculated within the unitarized U(3) resonance chiral theory [23,24,25]. The spectral functions are constructed by the unitarized scalar form factors, which are pure predictions once the scattering amplitudes are determined through the fits to scattering observables. In the complex energy plane, the isoscalar scalar resonance poles for the \(f_0(500)\), \(f_0(980)\), \(f_0(1370)\) are found, whose contributions are coherently included in the spectrum functions, together with two-meson continuum effects from the chiral effective field theory. The prescription of the unitarized chiral spectrum functions improves the Breit–Wigner description of the broad scalar \(\sigma \) resonance [19]. Furthermore, there is no free parameter in our spectrum functions, since they can be predicted from the scattering observables [23,24,25].

Despite the success of QCD sum rules, there remain several subtle issues concerning their application, such as the way to determine the continuum threshold parameter \(s_0\) and the uncertainties of the results. In Refs. [19, 26,27,28], a Monte-Carlo-based QCD sum rule is introduced, which starts from the theoretical formula of the SVZ QCD sum rule and uses Monte-Carlo based uncertainty analysis to proceed the study. The Monte-Carlo-based QCD sum rule has some obvious advantages: (a) the continuum threshold \(s_0\) can be automatically determined via the fitting procedure; (b) the effects from the uncertainties of the nonperturbative OPE parameters and the phenomenological spectral functions can be more reliably taken into account. Consequently the Monte-Carlo-based QCD sum rule is considered to be able to quantitatively improve the uncertainty analysis, comparing with the conventional SVZ sum rule, which relies on some empirical ways to obtain the platform of the \(s_0\) and Borel transformation parameters.

In this work, we aim at using the both the conventional SVZ and Monte-Carlo-based QCD sum rules in the isoscalar scalar channel to determine the average of the up- and down-quark masses \((m_u+m_d)/2\), which will be shortly denoted as \(m_q\) in later discussions. Comparisons from both methods will be analyzed in detail. Another novelty of this work is to implement the sophisticated chiral spectrum functions in the phenomenological side, where one does not need to manually separate the background and the resonance contributions. Furthermore the unitarized chiral spectrum functions are considered to be more suitable for the broad scalar resonances than the Breit–Wigner forms [19].

The paper is organized as follows. We elaborate the essential formulas of the QCD Sum Rules and the chiral spectrum functions in Sect. 2. The numerical analysis and results are given in Sect. 3. The last part is left for a short summary and conclusions.

2 Theoretical formalism

Within the QCD sum rule approach, the connection between quarks/gluons and hadrons is through the two-point correlation function, constructed by two currents, i.e.

$$\begin{aligned} \Pi (q^2) = i \int d^4 x e^{i q \cdot x} \langle \Omega | T\{ j_S(x), j_S^\dagger (0)\}| \Omega \rangle , \end{aligned}$$
(1)

where \(j_S\) is the relevant hadronic current, the subscript S represents the scalar current, and \(\Omega \) denotes the physical vacuum. In this work, we focus on the current in the isoscalar scalar channel

$$\begin{aligned} j_S(x) = m_q \frac{1}{\sqrt{2}} \bigg ({\bar{u}}(x) u(x) + {\bar{d}}(x) d(x)\bigg ), \end{aligned}$$
(2)

where \(m_q = \frac{1}{2} (m_u + m_d)\).

It is stressed that our primary aim in this work is to determine the light-quark mass by using the tool of the QCD sum rule, instead of scrutinizing the internal structures of the scalar resonances. For the latter purpose, it can be crucial to delicately choose the proper currents in the sum-rule analyses. For example, the \(\sigma \) resonance has been extensively studied in the QCD sum rule approach by using the mixed currents of the \({\bar{q}}q\) type and the gluons [29, 30], and also the various tetraquark currents [5, 31,32,33,34,35,36,37,38,39]. However, for the purpose to extract the light-quark mass, it is adequate to analyze the \(j_S\) current of Eq. (2) within the framework of QCD sum rule, as long as one can properly calculate the hadronic contributions to two-point correlation functions of the current, which will be discussed in detail in the following.

The correlation function has two equivalent representations: the QCD side and the phenomenological one. The QCD representation of this function can be calculated by using the Operator Product Expansion (OPE) method. In addition, it has been noticed that the instanton effects could be important in the pseudoscalar and scalar channels [14, 19, 21, 22, 40, 41]. Therefore we also include the instanton contributions in the QCD side, which are calculated within the instanton liquid model [40, 41]. The correlation function from the QCD side then includes both the OPE and instanton parts

$$\begin{aligned} \Pi ^{\text {(QCD)}} (q^2) = \Pi ^{\text {(OPE)}}(q^2) + \Pi ^{ \text {(Inst)} }(q^2). \end{aligned}$$
(3)

After performing the Borel transformation of Eq. (3), one can obtain [14, 19, 21, 22, 40, 41],

$$\begin{aligned}&R^{\text {(QCD)}}(\tau ,{{\hat{m}}}_q)=R^{\text {(OPE)}}(\tau ,{{\hat{m}}}_q)+R^{\text {(Inst)}}(\tau ,{{\hat{m}}}_q)\nonumber \\&\quad =m_q^2(1/\sqrt{\tau }) \left\{ \frac{3}{8\pi ^2} \frac{1}{\tau ^2} \left[ 1 + 4.821098 \frac{\alpha _s(1/\tau )}{\pi } \right. \right. \nonumber \\&\left. \left. \qquad + 21.97646 \left( \frac{\alpha _s(1/\tau )}{\pi } \right) ^{2}\right. \right. + 53.14197 \left. \left( \frac{\alpha _s(1/\tau )}{\pi } \right) ^{3} \right] \nonumber \\&\qquad + \frac{\langle \alpha _s G^2\rangle }{8\pi } \left( 1 + \frac{11}{2}\frac{\alpha _s(1/\tau )}{\pi } \right) + 3 \langle m_q {{\bar{q}}}q \rangle \left( 1 + \frac{13}{3} \frac{\alpha _s(1/\tau )}{\pi } \right) \nonumber \\&\qquad - \frac{176}{27} \pi \kappa \alpha _s\langle {{\bar{q}}}q\rangle ^2 \left[ \frac{\alpha _s(1/\tau )}{\alpha _s(\mu ^2_0)} \right] ^{1/9} \tau \nonumber \\&\qquad \left. + \frac{3}{8\pi ^2} \frac{e^{\frac{-\rho ^2}{2\tau }}\rho ^2}{\tau ^3} \left[ K_0 \left( \frac{\rho ^2}{2 \tau } \right) + K_1 \left( \frac{\rho ^2}{2\tau }\right) \right] \right\} , \end{aligned}$$
(4)

where the last two terms with the modified Bessel functions \(K_0\) and \(K_1\) stand for the contributions from the instanton, and \(\rho \) represents the instanton size in the instanton liquid model [40, 41]. In Eq. (4), \(\tau \) is the Borel parameter, \(\kappa \) denotes the vacuum factorization violation factor that parameterizes the deviation of the four-quark condensate from a product of two-quark condensates, and \(\mu _0\) is the renormalization scale for condensates. The running coupling constant and running mass have been calculated up to four-loop order in the perturbative corrections [22, 42]. The four-loop result with three-flavor quarks for the running coupling constant at scale \(1/\sqrt{\tau }\) takes the form [22, 42]

$$\begin{aligned} \alpha _s(1/\tau )= & {} \pi \left\{ \frac{4}{9 } \frac{1}{L} - \frac{256 \ln L}{729L^{2}} + \frac{1}{L^3} \left[ \frac{ 16384 \ln ^{2} L }{59049} \right. \right. \nonumber \\&\left. \left. - \frac{16384 \ln L}{59049} + \frac{6794}{59049}\right] \right\} , \end{aligned}$$
(5)

with \(L = \ln \left( \frac{1}{\tau \Lambda _{\text {QCD}}^2}\right) \). The QCD scale is taken as \(\Lambda _{\text {QCD}}=0.353 \,\text {GeV}\) throughout this work [43]. The explicit scale \(1/\sqrt{\tau }\) dependence of the light-quark mass in Eq. (4) up to four-loop order is [22, 42]

$$\begin{aligned}&m_q(1/\sqrt{\tau }) = {\hat{m}}_q \frac{1}{\left( \frac{1}{2} L\right) ^{4/9}} \left\{ 1 + \frac{290}{729}\frac{1}{L} - \frac{256}{729}\frac{\ln L}{L} \right. \nonumber \\&\quad + \left( \frac{550435}{1062882} - \frac{80\zeta (3)}{729} \right) \frac{1}{L^2} \nonumber \\&\quad - \frac{388736}{531441}\frac{\ln L}{L^2} + \frac{106496 }{531441}\frac{\ln ^2 L}{L^2} \nonumber \\&\quad + \left( \frac{2121723161}{2324522934} + \frac{8}{6561}\pi ^{4} - \frac{119840 }{531441}\zeta (3) \right. \nonumber \\&\quad \left. - \frac{8000 }{59049} \zeta (5) \right) \frac{1}{L^3} + \left( - \frac{611418176}{387420489} + \frac{112640}{531441} \zeta (3) \right) \frac{\ln L}{L^3} \nonumber \\&\quad \left. + \frac{335011840}{387420489} \frac{\ln ^2 L}{L^3} - \frac{149946386}{1162261467} \frac{\ln ^3 L}{L^3} \right\} , \end{aligned}$$
(6)

where \({{\hat{m}}}_q\) is the renormalization-group (RG) invariant light-quark mass and \(\zeta \) stands for the Riemann zeta function.

On the phenomenological side, we need to provide the phenomenological spectral function which is related to the correlation function through the dispersive integral. It is typical to write the phenomenological spectral function as

$$\begin{aligned} \mathrm{Im}\Pi ^{\text {(Phen)}}(s)= \mathrm{Im}\Pi ^{\text {(Res)}}(s) + \mathrm{Im}\Pi ^{\text {(ESC)}}(s), \end{aligned}$$
(7)

where in the conventional QCD sum rule the first term in the right-hand side usually includes the resonance contributions via the Breit–Wigner or \(\delta \)-function forms, and the second term corresponds to the excited states and continuum. The \(\mathrm{Im}\Pi ^{\text {(ESC)}}(s)\) part incorporates the excited states and higher energy continuum above the energy scale \(s_0\), that should be determined via the sum-rule method. In this work we use the unitarized chiral form factors [23, 24] to construct the resonant part \(\mathrm{Im}\Pi ^{\text {(Res)}}(s)\) of phenomenological spectral function, which coherently include resonances and the two-meson continuum in the low energy sector.

In the present convention, the spectral function or the imaginary part of the two-point correlator is related to the two-meson form factors through

$$\begin{aligned}&\mathrm{Im}\Pi ^{\text {(Res)}}(s)=\frac{3 m_\pi ^4}{32\pi }\sum _{k=1}^{5} \frac{q_k(s)}{\sqrt{s}}\big |F_k^{\mathrm{{S}}}\big |^2\theta (s-s_k^{\mathrm{{threshold}}}), \end{aligned}$$
(8)

where the subscript k runs over the relevant two-meson channels: \(\pi \pi , K{\bar{K}}, \eta \eta ,\eta \eta '\) and \(\eta '\eta '\), \(F_{k}^{\mathrm{S}}(s)\) denotes the isoscalar-scalar two-meson form factor of the kth channel for the \(j_S\) current in Eq. (2), and \(q_k(s)\) stands for the center of mass three-momentum of the two particles in the kth channel. We will keep the same conventions of the form factors as those in Refs. [23, 24] and accordingly introduce other relevant coefficients in Eq. (8) to match the normalization of the correlator of \(j_S\) in Eqs. (1) and (4). At low energies the intermediate \(\pi \pi \) state is the most important hadronic channel in the isoscalar scalar case. In the meanwhile, we also include the \(K{\bar{K}}, \eta \eta ,\eta \eta '\) and \(\eta '\eta '\) channels in this work. The scalar form factors \(F_{PP'}^{\mathrm{S}}(s)\) are calculated in the framework of the unitarized U(3) resonance chiral theory [23, 24]. For the sake of completeness, we recap the construction of the chiral phenomenological spectrum function here. The leading order (LO) U(3) chiral Lagrangian is

$$\begin{aligned} {\mathcal {L}}^{(\mathrm{LO})}=\frac{ F^2}{4}\langle u_\mu u^\mu \rangle + \frac{F^2}{4}\langle \chi _+ \rangle + \frac{F^2}{3}M_0^2 \ln ^2{\det u}, \end{aligned}$$
(9)

where the first two terms coincide with the conventional SU(3) chiral perturbation theory [44] and the last one parameterizes the effect of the QCD \(U_A(1)\) anomaly, giving rise to the LO mass \(M_0\) to the singlet \(\eta _0\) meson. The definitions of the basic chiral tensors \(u_\mu \) and \(\chi _+\) can be found in Refs. [23,24,25] and references therein. Regarding the higher order effects beyond the LO, one could include the higher order local operators [44] or alternately one can also introduce explicitly the heavier degrees of freedom, such as the bare resonance states [45]. The latter method is employed to calculate the two-meson scattering amplitudes and the form factors in Refs. [23,24,25] within the framework of resonance chiral theory [45]. Three types of resonances, including scalar (S), vector (V) and pseudoscalar (P) ones, are considered and the relevant interacting Lagrangians read

$$\begin{aligned} {\mathcal {L}}_{S}&= c_d\langle \,S_8 u_\mu u^\mu \,\rangle + c_m \langle \,S_8 \chi _+ \,\rangle +{\widetilde{c}}_d S_1 \langle \,u_\mu u^\mu \,\rangle \nonumber \\&\quad + {\widetilde{c}}_m S_1 \langle \,\chi _+ \,\rangle , \end{aligned}$$
(10)
$$\begin{aligned} {\mathcal {L}}_{V}&= \frac{i G_V}{2\sqrt{2}}\langle V_{\mu \nu }[ u^\mu , u^\nu ]\rangle , \end{aligned}$$
(11)
$$\begin{aligned} {\mathcal {L}}_{P}&= i d_m \langle \,P_8 \chi _- \,\rangle + i {\widetilde{d}}_m P_1 \langle \,\chi _- \,\rangle . \end{aligned}$$
(12)

Two relevant local operators at the next-to-leading order that cannot receive contributions after integrating the resonances in Eqs. (10)–(12), are also taken into account

$$\begin{aligned} {\mathcal {L}}_{\Lambda }&= \frac{F^2\, \Lambda _1}{12} \partial ^\mu X \partial _\mu X -\frac{F^2\, \Lambda _2}{12} X \langle \,\chi _- \,\rangle , \quad (X=\ln \det U). \end{aligned}$$
(13)

The chiral one-loop calculation with explicit tree-level resonance exchanges in the U(3) resonance chiral theory is completed in Ref. [25]. Later on the one-loop calculation of the strangeness conserving scalar and pseudoscalar form factors is also carried out in a similar theoretical framework in Refs. [23, 24]. An algebraic approximation of the N/D method is used to perform the unitarization of the scattering amplitudes and the scalar two-meson form factors. Under this assumption, the unitarized scalar form factor takes the form

$$\begin{aligned} {\mathcal {F}}(s) = \big [ 1 + N(s) \cdot g(s) \big ]^{-1}\cdot R(s), \end{aligned}$$
(14)

with

$$\begin{aligned} N(s)= & {} {T(s)}^{\mathrm{LO+Res+Loop}} + T(s)^{\mathrm{LO}}\cdot g(s)\cdot T(s)^{\mathrm{LO}}, \end{aligned}$$
(15)
$$\begin{aligned} R(s)= & {} {F(s)}^{\mathrm{LO+Res+Loop}} + N(s)^{\mathrm{LO}} \cdot g(s)\cdot F(s)^{\mathrm{LO}}. \end{aligned}$$
(16)

Here T(s) and F(s) stand for the S-wave two-meson scattering amplitudes and the scalar form factors from the perturbative calculations, respectively. The superscripts LO, Res, Loop denote the contributions from the leading order, resonance exchanges (also including the \(\Lambda _1\) and \(\Lambda _2\) terms) and the chiral loops, in order. By construction, the quantities N(s) and R(s) do not contain any right-hand cut. The latter effect is included in the g(s) function, which is given by

$$\begin{aligned} g(s)= & {} \frac{1}{16\pi ^2}\bigg [ a_{SL}(\mu )+\log \frac{m_2^2}{\mu ^2}\nonumber \\&-x_+\log \frac{x_+-1}{x_+} -x_- \log \frac{x_--1}{x_-} \bigg ], \nonumber \\ x_{\pm }= & {} \frac{s+m_1^2-m_2^2}{2s}\pm \frac{1}{-2s}\sqrt{ -4 m_1^2 s+(s+m_1^2-m_2^2)^2}. \nonumber \\ \end{aligned}$$
(17)

In coupled-channel case, it becomes a diagonal matrix spanned in the scattering channel space and for the kth channel one should replace \(m_1\) and \(m_2\) in Eq. (17) by the masses of the two particles in that channel. The subtraction constants \(a_{SL}\) in Eq. (17) are usually free parameters that need to be fixed in the phenomenological fits. In our unitarization procedure, all the unknown parameters appearing in the unitarized scalar form factors also enter in the unitarized scattering amplitudes. As a consequence, the phenomenological fits to the various experimental scattering observables [23,24,25] enable one to fix all the unknown parameters in the scalar form factors. The uncertainties of the resonance couplings \(G_V\), \(c_d\), \(c_m\), \({\tilde{c}}_d\), \({\tilde{c}}_m\), the subtraction constants and also the bare masses of the resonances have been estimated in Refs. [24, 25] by using the error bars of the various experimental data, including the phase shifts and inelasticities from the \(\pi \pi \rightarrow \pi \pi \), \(\pi \pi \rightarrow K{\bar{K}}\), \(K\pi \rightarrow K\pi \) processes with different isospin and angular momentum numbers and also the \(\pi \eta \) event distributions. Such uncertainties of the relevant parameters can be also used to make the errorbar analyses of the form factors and phenomenological spectrum functions, which will be explicitly shown in the next section. The spectral function used here clearly improves the one in Ref. [19], with \(\sigma \) and \(f_0(980)\) included in the Breit–Wigner formalism, which is likely to be improper for the broad resonance. An additional free parameter is introduced as the weight of the two resonances in their spectral function. In Ref. [14], the phenomenological spectral density for the isoscalar scalar channel is assumed to be solely contributed by the \(\pi \pi \) channel. In our study, the heavier-mass channels, such as the \(K{\bar{K}}, \eta \eta \), \(\eta \eta '\) and \(\eta '\eta '\), are explicitly included within the U(3) chiral framework.

For the ESC contributions in the phenomenological spectral density in Eq. (7), the traditional ESC model is utilized in this work [19, 22],

$$\begin{aligned} \frac{1}{\pi }\mathrm{Im} \Pi ^{\text {(ESC)}}(s)= & {} m_q^2 (1/\sqrt{\tau }) \left\{ \frac{3s}{8\pi ^2} \left[ 1 + \frac{\alpha _s}{\pi } \Big ( \frac{17}{3} - 2 \ln \big ( s \tau \big ) \Big )\right. \right. \nonumber \\&\left. \left. \quad + \Big ( \frac{\alpha _s}{\pi }\Big )^{2} \Big ( 31.8640 \right. \right. \nonumber \\&\quad - \frac{95}{3} \ln \big ( s \tau \big ) + \frac{17}{4} \ln ^{2} \big (s \tau \big ) \Big ) \nonumber \\&\quad + \Big ( \frac{\alpha _s}{\pi }\Big )^{3} \big ( 89.1564 - 297.596 \ln \big ( s \tau \big ) \nonumber \\&\quad \left. + \frac{229}{2} \ln ^{2} \big (s \tau \big ) - \frac{221}{24} \ln ^{3} \big ( s \tau \big ) \right] \nonumber \\&\left. \quad - \frac{3}{4\pi } s J_1(\sqrt{s} \rho ) Y_1(\sqrt{s} \rho ) \right\} \theta (s - s_0), \end{aligned}$$
(18)

where \(J_1\) and \(Y_1\) are Bessel functions of the first and second kind, respectively, and \(s_0\) is the continuum threshold separating the contributions from higher excited states and continuum. It should be noted that we include in Eq. (18) the perturbative contribution from the quark-hadron duality up to four-loop order [22].

By performing the Borel transformation of Eq. (7), one can obtain the phenomenological representation for the correlation function

$$\begin{aligned} R^{\text {(Phen)}}(\tau ,s_0,{{\hat{m}}}_q)= & {} \frac{1}{\pi } \int _{4 m_\pi ^2}^\infty e^{-s\tau } \mathrm{Im}\Pi ^{\text {(Phen)}}(s) \, ds \nonumber \\= & {} R^{\text {(Res)}}(\tau , s_0) + R^{\text {(ESC)}} (\tau , s_0, {{\hat{m}}}_q).\nonumber \\ \end{aligned}$$
(19)

The master equation for QCD sum rules can be established by matching the results from the QCD calculation in Eq. (4) with its counter part of the phenomenological one in Eq.  (19), i.e.,

$$\begin{aligned} R^{\text {(QCD)}}(\tau , {{\hat{m}}}_q) = R^{\text {(Phen)}}(\tau , s_0, {{\hat{m}}}_q). \end{aligned}$$
(20)

Within the SVZ QCD sum rules, in order to extract the light quark mass \(m_q\), we firstly derive the expression of \(\hat{m_q}\) from Eq. (20). After an analytical manipulation, we obtain

$$\begin{aligned} {\hat{m}}_q = \frac{m_\pi ^2\sqrt{3}}{8\pi } \sqrt{\frac{ \int _{4m_\pi ^2}^{s_0} e^{-s \tau } \mathrm{Im}\Pi ^{\text {(phen)}}(s) ds}{f^2(\tau ) g(\tau )- \int _{s_0}^\infty e^{-s \tau } f^2(\tau ) h(\tau , s) ds}}\, ,\nonumber \\ \end{aligned}$$
(21)

where the functions \(f(\tau ), \, g(\tau )\), and \(h(\tau , s)\) are defined as

$$\begin{aligned}&f(\tau ) \equiv \frac{1}{\left( \frac{1}{2} L\right) ^{4/9}} \left\{ 1 + \frac{290}{729}\frac{1}{L} - \frac{256}{729}\frac{\ln L}{L}\right. \nonumber \\&\quad + \left( \frac{550435}{1062882} - \frac{80\zeta (3)}{729} \right) \frac{1}{L^2} \nonumber \\&\quad - \frac{388736}{531441}\frac{\ln L}{L^2} + \frac{106496 }{531441}\frac{\ln ^2 L}{L^2}\nonumber \\&\quad + \left( \frac{2121723161}{2324522934} + \frac{8}{6561}\pi ^{4} - \frac{119840 }{531441}\zeta (3) \right. \nonumber \\&\quad \left. - \frac{8000 }{59049} \zeta (5) \right) \frac{1}{L^3} + \left( - \frac{611418176}{387420489} + \frac{112640}{531441} \zeta (3) \right) \frac{\ln L}{L^3}\nonumber \\&\quad \left. + \frac{335011840}{387420489} \frac{\ln ^2 L}{L^3} - \frac{149946386}{1162261467} \frac{\ln ^3 L}{L^3} \right\} , \end{aligned}$$
(22)
$$\begin{aligned}&g(\tau ) \equiv \frac{3}{8\pi ^2} \frac{1}{\tau ^2} \left\{ \left[ 1 + 4.821098 \frac{\alpha _s (1/\tau )}{\pi }\right. \right. \nonumber \\&\left. \quad + 21.97646 \left( \frac{\alpha _s(1/\tau )}{\pi } \right) ^{2} + 53.14197 \left( \frac{\alpha _s(1/\tau )}{\pi }\right) ^{3} \right] \nonumber \\&\quad + \frac{\langle \alpha _s G^2 \rangle }{8\pi } \left( 1 + \frac{11}{2} \frac{\alpha _s(1/\tau )}{\pi }\right) \nonumber \\&\quad + 3\langle m_q {{\bar{q}}}q \rangle \left( 1 + \frac{13}{3} \frac{\alpha _s(1/\tau )}{\pi }\right) \nonumber \\&\quad - \frac{176}{27} \pi \kappa \alpha _s \langle {{\bar{q}}}q \rangle ^2 \left[ \frac{\alpha _s(1/\tau )}{\alpha _s(\mu ^2_0)} \right] ^{1/9} \tau \nonumber \\&\left. \quad + \frac{3}{8\pi ^2}\frac{e^{\frac{-\rho ^2}{2\tau }}\rho ^2}{\tau ^3} \left[ K_0 \left( \frac{\rho ^2}{2 \tau } \right) + K_1 \left( \frac{\rho ^2}{2\tau }\right) \right] \right\} , \end{aligned}$$
(23)
$$\begin{aligned}&h(\tau , s) \equiv \left\{ \frac{3s}{8\pi ^2} \left[ 1 + \frac{\alpha _s(1/\tau )}{\pi } \big (\frac{17}{3} - 2 \ln \big ( s \tau \big ) \big )\right. \right. \nonumber \\&\quad + \big ( \frac{\alpha _s(1/\tau )}{\pi }\big )^{2} \big ( 31.8640 - \frac{95}{3} \ln \big ( s \tau \big )\nonumber \\&\quad + \frac{17}{4} \ln ^{2} \big ( s \tau \big ) \big ) + \big (\frac{\alpha _s(1/\tau )}{\pi }\big )^{3} \big (89.1564 - 297.596 \ln \big ( s \tau \big )\nonumber \\&\quad \left. + \frac{229}{2} \ln ^{2} \big ( s \tau \big ) - \frac{221}{24} \ln ^{3} \big ( s \tau \big ) \big ) \right] \nonumber \\&\left. \quad - \frac{3}{4\pi } s J_1(\sqrt{s} \rho ) Y_1(\sqrt{s} \rho ) \right\} \theta (s-s_0). \end{aligned}$$
(24)

Then we can obtain the scale-running light-quark mass \(m_q (1/\sqrt{\tau })\) from Eq. (6).

In this work, we will follow another version of the QCD sum rules, that is the so-called Monte-Carlo-based QCD sum rules [26, 46], to determine the threshold parameter \(s_0\) and the RG-invariant light quark mass \({\hat{m}}_q\). To be more specific, the values of \(s_0\) and \({\hat{m}}_q\) will be determined by minimizing the following quantity

$$\begin{aligned} \chi ^2_{k,\xi }=\sum _{j=1}^{n_B}\frac{[R^{\text {(QCD)}}_k(\tau _j,{{\hat{m}}}_{q}^{k,\xi }) -R^{\text {(phen)}}_\xi (\tau _j,s_{0}^{k,\xi }, {{\hat{m}}}_{q}^{k,\xi })]^2}{\sigma _{\text {QCD}}^2(\tau _j)}.\nonumber \\ \end{aligned}$$
(25)

In order to quantitatively estimate the error bars of the resulting \(s_0\) and \({\hat{m}}_q\) caused by the uncertainties from the input QCD parameters, we generate large amount of \(R^{\text {(QCD)}}_k\) by randomly varying the OPE condensate parameters and the instanton size \(\rho \) within ten percent uncertainties and randomly sampling the \(\kappa \) parameter in the range \(2 \sim 4\). Meanwhile, we also use the results from Ref. [24] to randomly generate large samples of phenomenological spectral functions. The index k in Eq. (25) corresponds to the kth configuration among the randomly generated QCD-parameter samples, the index \(\xi \) runs over different samples of the phenomenological spectral functions. The Borel parameter \(\tau _j = \tau _{\text {min}} + (\tau _{\text {max}} - \tau _{\text {min}}) \times (j-1)/(n_B-1)\), runs over the appropriate window \([\tau _{\text {min}}, \tau _{\text {max}}]\), which range will be explained later in next section. We will take \(n_B=21\) in Eq. (25) as in Refs. [26, 46]. In fact, we have tried to increase the values of \(n_B\) in the calculation, and it turns out that larger values of \(n_B\) barely affect the results. The large samples from the randomly generated \(R^{\text {(QCD)}}_k\) and \(R^{\text {(phen)}}_\xi \) will then be implemented in Eq. (25) to repeatedly determine the \(s_0\) and \({\hat{m}}_q\). The large amount of the fitted \(s_0\) and \({\hat{m}}_q\) will be used for uncertainty studies.

The quantity \(\sigma _{\text {QCD}}^2\) in Eq. (25) corresponds to the variance of the \(R^{\mathrm{{(QCD)}}}_{k=1,2,\ldots n_S}\). In our study we generate a large sample of \(R^{\mathrm{{(QCD)}}}_{k=1,2,\ldots n_S}\) with \(n_S=20000\) at each Borel parameter \(\tau _j\). At the jth Borel parameter \(\tau _j\) the variance \(\sigma _{\text {QCD}}^2\) takes the standard definition

$$\begin{aligned} \sigma _{\text {QCD}}^2 (\tau _j) = \frac{1}{(n_S - 1)} \sum _{k=1}^{n_S} [R_k^{\text {(QCD)}} (\tau _j)-\overline{R^{\text {(QCD)}}}(\tau _j)]^2,\nonumber \\ \end{aligned}$$
(26)

where k denotes the kth set of the randomly generated \(n_S\) QCD-parameter samples, and \(\overline{R^{\text {(QCD)}}}(\tau _j)\) corresponds to the mean value of the huge samples \(R^{\mathrm{{(QCD)}}}_{k=1,2,\ldots n_S}\). It is pointed out that we estimate the values of \(\sigma _{\text {QCD}}^2(\tau _j)\) by taking into account the explicit dependences of \(R^{\text {(QCD)}}\) on the OPE condensates, the instanton size \(\rho \), and the \(\kappa \) in Eq. (4). Since the quantity \({\hat{m}}_q\) is implicitly dependent on the condensates and the parameters \(\rho \) and \(\kappa \), we do not explicitly take the uncertainties of \({\hat{m}}_q\) when estimating \(\sigma _{\text {QCD}}^2\) and \({\hat{m}}_q\) will be fixed at the value obtained by taking the central values of the relevant input QCD parameters. To be more specific, we firstly calculate the random configurations of the \(R^{\text {(QCD)}}\) by assigning ten-percent uncertainties to the OPE condensate parameters and the instanton size \(\rho \), and also generating random samples of \(\kappa \) between 2 and 4. All the random samples of the input parameters are generated with Gaussian distributions. Afterwards, we evaluate the standard variance \(\sigma _{\text {QCD}}^2\) of the large data sets of \(R^{\text {(QCD)}}\). Next, we use the \(\sigma _{\text {QCD}}^2\) in the expression of \(\chi ^2\) to fit the optimal results \(s_0\) and \({\hat{m}}_q\). The resulting curves of \(R^{\text {(QCD)}}(\tau )\), together with the uncertainties \(\sigma _{\mathrm{QCD}}(\tau _{j=1,2,\ldots ,21})\), are explicitly shown in Fig. 1.

Fig. 1
figure 1

\(R^{\text {(QCD)}}\) in Eq. (4), together with its uncertainties \(\sigma _{\mathrm{QCD}}(\tau _{j=1,2,\ldots ,21})\) in Eq. (26), as functions of the Borel parameter \(\tau \). See the text for details about the estimation of the uncertainty \(\sigma _{\mathrm{QCD}}\)

3 Numerical results

For the numerical analysis, we follow Ref. [19] to implement ten-percent uncertainties for the OPE parameters evaluated at \(\mu _0=1~\text {GeV}\), whose explicit values read

$$\begin{aligned} \begin{aligned} \langle \alpha _s G^2\rangle&= (0.070 \pm 0.007) ~\text {GeV}^4 \\ \langle m_q {\bar{q}} q \rangle&= - (0.10 \pm 0.01 ) ~\text {GeV}^4,\\ \kappa \alpha _s \langle {\bar{q}} q \rangle ^2&= (1.49\pm 0.15)\times 10^{-4} \kappa ~\text {GeV}^6 \\ \rho&= (1.67\pm 0.17) \; \text {GeV}^{-1}, \end{aligned} \end{aligned}$$
(27)

where the factor \(\kappa \) is introduced to quantify the violation of factorization hypothesis in estimating the dimension-6 quark condensates. The values of \(\kappa \) have been determined in the range of 2–4 in different channels [43, 47, 48]. The influences of the \(\kappa \) parameter and the instanton on the OPE expansion, will be discussed in a moment.

For the input phenomenological spectral functions, we will take the analyses from Refs. [24, 25]. In the former reference two different multiplets of bare scalar resonances and one multiplet of pseudoscalar resonances are included in the study of the spectral functions, while in the latter reference only one multiplet of bare scalar states are considered. As a result, somewhat different values of the parameters are also obtained in Refs. [24, 25]. In the following discussions, the values obtained by using the phenomenological spectral functions of Ref. [24] will be considered as the preferred central results. The deviations resulting from the use of spectral functions of Ref. [25] will be considered as systematic uncertainties to the central results. In addition, we also randomly generate large samples of phenomenological spectral functions according to the results from Ref. [24] (see also the discussions in the last section), which will be used to estimate the statistical uncertainties of the \(s_0\) and \({\hat{m}}_q\). The phenomenological spectral functions after including the Borel transformation factor \(e^{-\tau s}\) are explicitly shown in Fig. 2. It is obvious that the contributions in the energy region below 1 GeV are dominant and the effects from the high energy tails beyond the 1.4 GeV are tiny and negligible.

Fig. 2
figure 2

The phenomenological spectrum functions \(\mathrm{Im}\Pi (s)\) multiplied by the Borel transformation factor \(e^{-\tau s}\) with \(\tau =1.0\) GeV\(^{-2}\) as a function of energy. The red line corresponds to the spectral function by using the best fit parameters and the corresponding amplitudes given in Ref. [24] and the surrounding gray areas are randomly generated according to the uncertainties estimated in the same reference. The blue line stands for the spectral function by taking the best fit parameters and related amplitudes from Ref. [25]. In both cases, the effects in the energy region beyond 1 GeV are found to be small

3.1 Effects of the parameter \(\kappa \) and the instanton in \(R^{\text {(QCD)}}\)

The sizes of the dimension-six four-quark condensates have been intensively discussed in the literature [14, 28, 43, 47,48,49,50,51,52,53,54,55,56], and there is still no consensus on their precise values. It is plausible that the magnitudes of the four-quark condensates are underestimated in the vacuum saturation assumption [2, 3], meaning that the \(\kappa \) parameter can be obviously larger than one. Indeed a wide range of \(\kappa \) between 2 and 4 [14, 28, 43, 47, 48, 52], is determined from various phenomenological processes, including the \(e^+ e^-\rightarrow hadrons\), the \(\tau \rightarrow hadron\) decays, the baryon spectra, etc. A recent analysis by combing the electron-positron annihilation and the inclusive \(\tau \) decay processes gives the preferred value of \(\kappa =3.0 \pm 0.2\) [43], which fits well in the wide range \(2 \sim 4\). In this work, we make a very conservative estimate of the \(\kappa \) by taking its value as \(3.0\pm 1.0\), and assess its influence on the determination of the light-quark mass.

In order to clearly show the effects of the \(\kappa \), the instanton and other nonperturbative terms in the QCD spectral function (4), we give in Fig. 3 the ratios of the various contributions with respect to the perturbative QCD result by taking \(\tau =1.0~\text {GeV}^{-2}\). It should be noted that the contribution of the four-quark condensate is negative and we take its absolute value in Fig. 3 in order to clearly show the sizes of different nonperturbative contributions.

Fig. 3
figure 3

The ratios of the various nonperturbative contributions with respect to the perturbative QCD part in \(R^{(\text {QCD})}\). The two vertical dashed lines represent the conservative range of the parameter \(\kappa \) used in our study

According to Fig. 3, it is obvious that the magnitude of the four-quark condensate is larger than the two-quark and the two-gluon condensates, but is smaller than the instanton contribution for \(2<\kappa <4\). It is clear that the contribution from the perturbative QCD term dominates \(R^{\text {(QCD)}}\) in Eq. (4). Furthermore, we also study the dependence of the various terms in the OPE with the Borel parameter \(\tau \), and the results are given in Fig. 4. To be definite, we show the curves by taking \(\kappa =3.0\) and verify that the results by taking different values of \(\kappa \) are quantitatively similar. As in Fig. 3, the perturbative QCD term is the dominant part, while the contributions from the nonperturbative parts, including the instanton, and the quark and gluon condensates, slightly vary when the Borel parameter \(\tau \) is increased up to 1.2 GeV\(^{-2}\). The sensitivity of the parameter \(\kappa \) is related to the relative size of the four-quark condensate to the perturbative QCD term, which will give part of the uncertainties of the Borel window \(\tau \), \(s_0\) and \({\hat{m}}_q\) in our analysis. Detailed studies on the error-bar analyses will be given in the next section.

Fig. 4
figure 4

The \(\tau \) dependence of the ratios of various terms with respect to the full \(R^{(\text {QCD})}\) in Eq. (4). The results by taking \(\kappa =3.0\) are explicitly given here. The curves by taking different values of \(\kappa \) between 2.0 and 4.0 are quantitatively similar, and hence are not explicitly shown

Fig. 5
figure 5

a The OPE convergence \(\text {ratio}_{\langle {\bar{q}} q \rangle ^2}\) and the resonance contribution \(\text {ratio}^{\text {RC}}\) as functions of the Borel parameter \(\tau \) with \(s_0 = 3.10~\text {GeV}^2\) and \(\kappa = 3.0\). The blue and red lines represent \(\text {ratio}_{\langle {\bar{q}} q \rangle ^2}\) and \(\text {ratio}^{\text {RC}}\), respectively. b The RG-invariant light-quark mass \({\hat{m}}_q\) as a function of the Borel parameter \(\tau \) at different values of \(s_0\) with \(\kappa = 3.0\). The black dots on each line indicate the lower/upper limits of the valid Borel window

3.2 Results from the conventional SVZ sum rules

In the conventional SVZ QCD sum rules, the parameters \(\tau \) and \(s_0\) are fixed according to two well accepted criteria [2,3,4, 57]. The first one is required by the convergence of the OPE. One needs to find a proper region for \(\tau \) to guarantee that the contribution from the highest dimension operator (HDO) included is less than \(10\%\) in the OPE side. The degree of convergence can be defined as

$$\begin{aligned} \text {ratio}_{\langle {\bar{q}} q\rangle ^2} = \frac{R^{\text {(OPE)}}_{\langle {\bar{q}} q\rangle ^2}(\tau , {\hat{m}}_q)}{R^{\text {(OPE)}}(\tau , {\hat{m}}_q)}, \end{aligned}$$
(28)

where \(R^{\text {(OPE)}}(\tau , {\hat{m}}_q)\) represents the overall OPE contributions in Eq. (4), and the subscript \(\langle {\bar{q}} q \rangle ^2\) refers the HDO condensate here.

Table 1 Results from the SVZ QCD sum rules. The uncertainties of the Borel windows \(\tau _{\mathrm{min,max}}\), \(s_0\) and \({\hat{m}}_q\) are obtained by taking into account the error bars of the OPE parameters in Eq. (27) and \(\kappa =3.0\pm 1.0\)

The second criterion requires that the portion of resonance contribution (RC), i.e., the contributions from the energy region below \(s_0\), should be over 50% [4], which can be formulated as

$$\begin{aligned} \text {ratio}^{\text {RC}} = \frac{R^{\text {(QCD)}}(\tau ,{\hat{m}}_q,s_0 )}{R^{\text {(QCD)}}(\tau , {\hat{m}}_q)}, \end{aligned}$$
(29)

being \(R^{\text {(QCD)}}(\tau ,{\hat{m}}_q,s_0 )\) the effects from the QCD side below \(s_0\). Under this prerequisite, the contributions of higher excited and continuum states will be suppressed.

The standard procedure of the SVZ sum rules is to mutually vary the \(s_0\) and \(\tau \) to find their proper values, in order to satisfy the two aforementioned criteria and find a smooth plateau for the \({\hat{m}}_q\). On the smooth plateau, the RG-invariant light quark mass \({\hat{m}}_q\) should be in principle independent of the Borel parameter \(\tau \), or at least only shows weak dependence. During this procedure, the preferred value of \(s_0\) is found to be 3.10 GeV\(^2\) for \(\kappa = 3.0\). We will tentatively assign a several-percent uncertainty for the \(s_0\) as \(3.10\pm 0.20\) GeV\(^2\) in later discussions [58, 59]. The fulfillment of the two criteria and smooth plateaus are shown in Fig. 5. The dependences of two ratios of Eqs. (28) and (29) with the Borel parameter \(\tau \) at the preferred \(s_0 = 3.10 \) GeV\(^2\) are given in the left panel, from which one can easily identify the valid region of \(\tau \). In the right panel of Fig. 5, we show the rather weak dependence of \({\hat{m}}_q\) with the Borel parameter \(\tau \). By taking into account of the error bars of the OPE condensates, the instanton size and the \(\kappa \) parameter, we repeatedly take the standard procedure to determine the values of \(s_0\) and \(\tau \). The results of the \(s_0\) and \(\tau \) with their uncertainties extracted from the SVZ sum rules are summarized in Table 1.

A slightly different treatment of the phenomenological spectral function from Ref. [25], provides another source of uncertainty to \(s_0\) and \(m_q\), compared to the results in Table 1 obtained by using the spectral functions from Ref. [24]. The parameters by taking the phenomenological spectral functions from Ref. [25] with \(\kappa =3.0\) are found to be

$$\begin{aligned} s_0= 2.90~\mathrm{GeV^2}, \quad {\hat{m}}_q= 4.12~\mathrm{MeV}, \quad m_q= 3.13~\mathrm{MeV}.\nonumber \\ \end{aligned}$$
(30)

The deviations between the results in Eq. (30) and the central values in Table 1 are considered as systematic uncertainties caused by the different treatments of spectral functions. In this way, our determination of the RG-invariant light-quark mass \({\hat{m}}_{q}\) via the SVZ QCD sum rules reads

$$\begin{aligned} {\hat{m}}_{q}= 4.54^{+0.21}_{-0.29} \pm 0.42 ~\text {MeV}, \end{aligned}$$
(31)

where the first error bar stems from the uncertainties of Borel parameter \(\tau \), threshold parameter \(s_0\), the condensates, the instanton size \(\rho \) and the factor \(\kappa \), and the second one is originated by taking two different sets of spectral functions [24, 25]. Similarly for the scale dependent light-quark mass, our determination takes the value

$$\begin{aligned} m_q(2 ~\text {GeV}) = 3.46^{+0.16}_{-0.22} \pm 0.33 ~\text {MeV}. \end{aligned}$$
(32)

3.3 Results from the Monte-Carlo-based sum rules

As an complementary study, we also analyze the RG-invariant light-quark mass via the Monte-Carlo-based QCD sum rule, which is expected to give a more solid quantitative estimation of the uncertainty. In this approach, the values of \(s_0\) and \({\hat{m}}_q\) are obtained by minimizing the \(\chi ^2\) functions of Eq. (25) for a given \(R^{\mathrm{(QCD)}}\) and \(R^{\mathrm{(phen)}}\). According to the results from the SVZ approach shown in Fig. 5, we will take a conservative range for \(0.37\,\mathrm{GeV}^{-2}<\tau <1.05\,\mathrm{GeV}^{-2}\) in the Monte-Carlo-based sum rules. We have explicitly verified that the two criteria of the SVZ sum rules in the conservative ranges of \(\tau \) with the solutions of \(s_0\) from the minimization procedures are generally well satisfied.

Fig. 6
figure 6

\(\chi ^2\) as a function of \(s_0\) and \({\hat{m}}_q\), where the central condensate values, the central phenomenological spectrum and \(\kappa = 3.0\) are taken. A strong correlation between the \(s_0\) and \({\hat{m}}_q\) can be clearly seen

Fig. 7
figure 7

The histogram distributions of the resulting parameters \(s_0\) and \(m_q\) (obtained from the RG-invariant \({\hat{m}}_q\)) from the repeated \(\chi ^2\) minimization procedures by taking the large amount of random samples of the OPE condensates, the instanton size \(\rho \), the parameter \(\kappa \) and the spectral functions from Ref. [24]

To quantitatively estimate the uncertainties of \(s_0\) and \({\hat{m}}_q\), we generate huge amounts of random configurations of the \(R^{\mathrm{(QCD)}}\) by assigning ten-percent uncertainties to the OPE condensate parameters and the instanton size \(\rho \). The value of \(\kappa \) is allowed to randomly float between 2.0 and 4.0. In addition, we also generate large random samples of the spectral functions according to the results from Ref. [24], as explicitly shown in Fig. 2. In Fig. 6, we show the typical dependence of the \(\chi ^2\) function with \(s_0\) and \({\hat{m}}_q\) for a given set of \(R^{\mathrm{(QCD)}}\) and \(R^{\mathrm{(phen)}}\). There exists a minimum valley of the \(\chi ^2\) function in the \(s_0\) and \({\hat{m}}_q\) plane, which implies the clear correlation of the two parameters in a single minimization procedure. Although the shape of the minimum valley sometimes looks moderately smooth, it is not difficult to find the minimum point in the valley, which is exactly the solutions of \(s_0\) and \({\hat{m}}_q\) that we are looking for. Repeating the minimization procedures by using the random configurations of \(R^{\mathrm{(QCD)}}\) and \(R^{\mathrm{(phen)}}\) gives us a huge amount of solutions for \(s_0\) and \({\hat{m}}_q\).

In Fig. 7, we show the histogram distributions of the resulting parameters \(s_0\) and \(m_q\) (obtained from \({\hat{m}}_q\) via Eq. (6)), which seem somewhat deviating the Gaussian-like distributions. After a scrutinized investigation, we find that the non-Gaussian distributions in Fig. 7 are mainly caused by the \(\rho \) parameter that describes the instanton size, although the Gaussian uncertainties are assumed for the \(\rho \) during the error-bar analyses. However, by looking at the nontrivial dependence of the \(\rho \) parameter in \(R^{\mathrm{(QCD)}}\) (4), it is not surprised to see the somewhat non-Gaussian outputs in Fig. 7. The resulting values of the \(s_0\), \({\hat{m}}_q\) and \(m_q\) with their statistical uncertainties contributed by the OPE parameters, instanton size, the \(\kappa \) parameter and spectral functions from Ref. [24] obtained in the Monte-Carlo-based sum rules are summarized in Table 2.

To make a conservative estimation of the uncertainties, we also take the spectral function with the parameters and amplitudes from Ref. [25] to determine \(s_0\) and the light-quark masses. The explicit results are found to be

$$\begin{aligned} s_0= 2.94~\mathrm{GeV^2}, \quad {\hat{m}}_q= 4.10~\mathrm{MeV}, \quad m_q= 3.12~\mathrm{MeV}.\nonumber \\ \end{aligned}$$
(33)

As in the SVZ sum-rule case, the deviations by using the different spectral functions calculated with the different parameters and amplitudes from Refs. [24, 25] are considered as systematic uncertainties, which can be quadratically added to the statistical ones in Table 2. Therefore our final determination of the \(s_0\) is

$$\begin{aligned} s_0= 3.10\pm 0.11 \pm 0.16~\mathrm{GeV^2}, \end{aligned}$$
(34)

where the first error bar is the statistical one from Table 2 and the second error bar, obtained by taking the difference of the central values in Table 2 and Eq. (33), stands for the systematical uncertainty caused by different spectral functions. Similar rules are also applied in the following discussions. The combined result for the RG-invariant light-quark mass \({\hat{m}}_q\) is

$$\begin{aligned} {\hat{m}}_q= 4.52\pm 0.18 \pm 0.42 ~\mathrm{MeV}, \end{aligned}$$
(35)

and the scale-dependent light-quark mass obtained at 2 GeV reads

$$\begin{aligned} m_q= 3.44\pm 0.14 \pm 0.32 ~\mathrm{MeV}, \end{aligned}$$
(36)

which perfectly agrees with the determination from the SVZ sum rule in Eq. (32) and is also nicely consistent with the PDG value [60]

$$\begin{aligned} m_q^{\text {PDG}}(2~\text {GeV}) = \frac{1}{2} (m_u + m_d) = 3.45 ^{+0.55}_{-0.15}~\text {MeV}. \end{aligned}$$
(37)

As shown in Fig. 4, the instanton term gives the largest nonperturbative contribution in the QCD spectral function. It is interesting to further study its roles in the determinations of the light-quark masses. If the contribution of the instanton is turned off, we obtain in the Monte-Carlo QCD sum rule \(s_0=2.28 \text {GeV}^{2}\), \(m_q=3.77\) MeV for \(\kappa = 3.0\), which can be compared with the values in Eqs. (34) and (36) by taking all the contributions. In the SVZ QCD sum rule approach, we obtain similar values \(s_0=2.25 \text {GeV}^{2}\), \(m_q=3.78\) MeV for \(\kappa = 3.0\) by simply turning off the instanton contributions. It is clear that the instanton tends to reduce the light-quark mass \(m_q\) to the PDG average value.

Table 2 Results from the Monte-Carlo based QCD sum rules: \(s_0\), \({{\hat{m}}}_q\) and the scale-dependence light quark masses \(m_q\). A conservative Borel window \(0.37~\mathrm{GeV}^{-2}<\tau <1.05~\mathrm{GeV}^{-2}\) is taken here. See the text for details. The statistical uncertainties are contributed by the OPE condensates, the instanton size \(\rho \), the parameter \(\kappa \) and the spectral functions of Ref. [24], i.e., the shaded areas shown in Fig. 2

Comparing with the results from the Breit–Wigner-type spectral functions used in Ref. [19], our predictions with more sophisticated scalar spectral functions and higher order QCD corrections reduce the magnitudes of \(m_q\) and turns out to be perfectly consistent with the PDG average value [60], which is mainly based on several lattice determinations. It is worthy pointing out that our results in Eqs. (32) and (36) obtained from the sum rules with the scalar currents are compatible with the recent determinations from the pseudoscalar channels [13].

4 Conclusions

In this work, we determine the average light-quark mass \(m_q=(m_u+m_d)/2\) in the isoscalar scalar channel both from the SVZ and the Monte-Carlo-based QCD sum rules, where sophisticated phenomenological spectral functions from the chiral effective theory are used in our analysis. The phenomenological spectral functions, that are determined from the unitarized U(3) chiral theory, coherently include the contributions from the \(f_0(500), f_0(980)\) and \(f_0(1370)\), together with the two-meson continuum effects. The present study gives an independent determination of the light-quark mass from the scalar QCD sum rules, compared to the lattice calculations and the sum rules in the pseudoscalar channels.

The prediction of the scale-dependent light-quark mass from the conventional SVZ sum-rule method is \(m_q(2 ~\text {GeV})=(3.46^{+0.16}_{-0.22} \pm 0.33 )~\text {MeV}\), where the first part of the uncertainties are caused by the OPE condensates, the instanton size, the \(\kappa \) parameter, the Borel parameter \(\tau \) and the threshold parameter \(s_0\), and the second part of uncertainties are obtained by taking the two different sets of phenomenological spectral functions. Similarly our prediction from the Monte-Carlo-based QCD sum rule turns out to be \(m_q (2~\text {GeV}) = (3.44 \pm 0.14 \pm 0.32) ~\text {MeV}\), which is nicely consistent with the result from the SVZ approach and also remarkably agrees with the PDG value \(3.45_{-0.15}^{+0.55}\) MeV. Our work provides a solid proof that a proper way to include \(f_0(500)\) and \(f_0(980)\) in the spectral functions is important to obtain sensible results in the QCD sum rules. The present theoretical formalism is also expected to be extended to the strange sector to determine the strange-quark mass.