1 Introduction

The celebrated Feynman–Hellmann theorem [1, 2] addresses the situation when a quantum-mechanical Hamiltonian \(H(\lambda )\) of a given system depends on a some external parameter \(\lambda \). The energy spectrum \(E_n(\lambda )\) and the wave functions \(|\Psi _n(\lambda )\rangle \) will then depend on this parameter as well. The theorem relates the \(\lambda \)-dependence of the energy spectrum to the matrix element of the operator \(\mathrm{d}H(\lambda )/\mathrm{d}\lambda \):

$$\begin{aligned} \frac{\mathrm{d}E_n(\lambda )}{\mathrm{d}\lambda } =\biggl \langle \Psi _n(\lambda )\biggl |\frac{\mathrm{d}H(\lambda )}{\mathrm{d}\lambda }\biggr |\Psi _n(\lambda )\biggr \rangle . \end{aligned}$$
(1)

In the context of QCD, one often identifies the abstract parameter \(\lambda \) with the quark masses \(m_q\) and studies the dependence of the hadron spectrum on the quark masses. Since the quark-mass-dependent part of the QCD Hamiltonian takes the form \(H_m=\sum _qm_q{\bar{q}}q\), the dependence of, say, the nucleon mass on the quark masses is given by

$$\begin{aligned} \frac{\mathrm{d}m_N}{\mathrm{d}m_q}=\frac{1}{2m_N}\,\langle N|{\bar{q}}q|N\rangle . \end{aligned}$$
(2)

Here, the factor \(1/(2m_N)\) emerges from the relativistic normalization of the one-particle states, \(\langle N'|N\rangle =(2\pi )^32E_N\delta ^3(\mathbf{p}'_N-\mathbf{p}^{}_N)\).

Below, we shall explicitly consider only the three light quark flavors \(q=u,d,s\) and, for simplicity, assume that isospin is conserved: \(m_u=m_d={\hat{m}}\). The non-strange and strange \(\sigma \)-terms of the nucleon are defined, respectively, as

$$\begin{aligned} \sigma _N=\frac{{\hat{m}}}{2m_N}\langle N|{\bar{u}}u+{\bar{d}}d|N\rangle ,\quad \sigma ^s_N=\frac{m_s}{2m_N}\langle N|{\bar{s}}s|N\rangle , \end{aligned}$$
(3)

and the strangeness content of the nucleon is given by

$$\begin{aligned} y=\frac{2\langle N|{\bar{s}}s|N\rangle }{\langle N|{\bar{u}}u+{\bar{d}}d|N\rangle }. \end{aligned}$$
(4)

These \(\sigma \)-terms contain important information as regards the effect of the explicit chiral symmetry breaking (\(m_q\ne 0\)) on the hadronic observables. In addition, the nucleon \(\sigma \)-term is an important input for the estimates of WIMP cross sections in dark matter direct detection experiments see, e.g., [3,4,5,6,7,8,9], as well as in searches for the lepton flavor violation [10, 11] and electric dipole moments [12,13,14,15,16,17]. The strange \(\sigma \)-term of the nucleon is relevant for the kaon condensation and the formation of the neutron stars, as well as the study of the heavy ion collisions [18,19,20], etc. The extraction of the \(\sigma \)-terms from the experimental data is a very delicate issue since, in particular, it implies an analytic continuation of the amplitudes below threshold (to the Cheng-dashed point). In this respect, we note that, from a thorough theoretical analysis of the problem on the basis of dispersion relations and using the input from chiral perturbation theory (ChPT), in Ref. [21] the value \(\sigma _N\simeq 45~\text{ MeV }\) was obtained for the non-strange \(\sigma \)-term. The most recent and comprehensive analysis, carried out in Ref. [22], is based on the Roy–Steiner equations for \(\pi N\) scattering and yields a larger value \(\sigma _N=(59.1\pm 3.5)~\text{ MeV }\). As explained in detail in Ref. [23], most of the difference can be traced back to the new and improved values of the pion–nucleon scattering lengths deduced from pionic hydrogen and deuterium. This conclusion has been strengthened by a recent reanalysis of the low-energy pion–nucleon scattering data [24].

In recent years, the \(\sigma \)-terms have also been measured on the lattice [25,26,27,28,29,30,31]. In general, two methods are employed in these measurements: a direct measurement of the matrix element and extracting the \(\sigma \)-terms from the quark mass dependence of the hadron masses with the use of the Feynman–Hellmann theorem. The results obtained with the use of both methods are compatible with each other.Footnote 1 The above example is a nice demonstration of the fact that the lattice provides us with additional tools to study the structure of hadrons: while the quark masses on the lattice are free parameters, the experimental data correspond to the physical values of the quark masses, which cannot be varied.

The strangeness content of the nucleon, \(y_N\), which was defined in Eq. (4), is closely related to the \(\sigma \)-term and measures the contribution of the strange quarks to the nucleon mass. It can be shown that a large magnitude of the non-strange \(\sigma \)-term might signal a large violation of the Gell-Mann–Okubo rule for the baryon octet, and/or a large admixture of the \({\bar{s}}s\) state to the physical nucleon state that, in its turn, would mean that the properties of the nucleon must differ significantly from those obtained in a simple quark model.

In this paper, we shall demonstrate that the knowledge of the \(\sigma \)-terms allows one to answer the question as regards the nature of the given hadronic states—namely, whether these states are standard quark-model states (qqq for baryons, \({\bar{q}}q\) for mesons), or contain a sizable exotic admixture (pentaquarks, tetraquarks, \(\ldots \)). This criterion, which is a straightforward extension of the argument with the strangeness content of the nucleon, was first proposed in Ref. [33]. In the present article, we extend the discussion to the resonance case. It will be shown that the quark mass dependence of the measured hadron masses in the multiplets obeys certain constraints, if these states are well described by the quark model. Very different constraints emerge, say, for tetraquarks, pentaquarks, etc. We verify this statement by explicit calculations in ChPT, as well as by using large-\(N_c\) arguments. Note that, at present, a different strategy for identifying the exotic multiquark states is used on the lattice. Namely, one picks up a large set, including both the standard three-quark (quark–antiquark) operators, and the multiquark operators, and calculates the overlap of a given state with the states produced by these operators from the vacuum. A large overlap with the multiquark state would then signal that a given hadron contains a large exotic component and vice versa. Albeit intuitively crystal clear, the argument has to be taken with a grain of salt. The overlap integrals contain information as regards the short-range physics (e.g., about the smearing used in the construction of the operators) and are therefore not observable quantities. Stated differently, all operators which have the same quark content but use different smearing, describe the same state but lead to different overlaps. Consequently, the statements about the nature of the states, which were made on the basis of the calculation of the overlaps, are not completely unambiguous. On the contrary, the measured masses of the hadrons on the lattice are observable quantities and, thus, the constraints on the quark mass dependence, considered in the present article, are devoid of such short-distance ambiguities.

Furthermore, note that all candidates for exotica in QCD are resonances. Consequently, we shall need to extend the derivation of the Feynman–Hellmann theorem to the unstable states in quantum field theory (note that, in Ref. [34], a generalization for the so-called Gamov states has been considered within the framework of non-relativistic quantum mechanics; see also Ref. [35]). This extension is the one of the main results of the present article. Note also that it is not clear, how the overlaps could be used in this case—even in principle. As is well known, resonances do not correspond to a single energy level on the lattice but rather can be associated to a group of a close-by levels. Introducing a novel method to distinguish the exotic states on the lattice seems to be inevitable from this viewpoint as well.

The layout of the present paper is as follows. In Sect. 2, we consider the quark mass dependence in the flavor multiplets and derive the above-mentioned constraints. Section 3 contains the discussion of this dependence within ChPT and large-\(N_c\) QCD. In Sect. 4, we perform the analysis of the available lattice data from the QCDSF collaboration on the quark mass dependence of the low-lying mesons, baryon octet and baryon decuplet (only stable states) and demonstrate that, as expected, these contain a reasonably small admixture of exotica. Finally, in Sect. 5, the Feynman–Hellmann theorem for the resonance states is derived. Section 6 contains our conclusions.

2 Distinguishing the quark-model states from exotic states

The very notion of an exotic state in QCD needs the quark model as a reference point. Below, we shall generalize the notion of the strangeness content of the nucleon and define observable quantities, which characterize a given hadronic multiplet. In the quark model, these quantities are given by exact group-theoretical factors. Then the closeness of the measured values of these quantities to the quark-model values will be interpreted that the hadrons in this multiplet are predominately non-exotic.

We explain the method for the example of the baryon octet. Let \(|B\rangle ,~B=N,\Sigma ,\Xi ,\Lambda \), denote the eight different states of the octet. The Gell-Mann–Okubo type relations for the \({\bar{q}}q\) matrix elements take the form

$$\begin{aligned} y_B\doteq & {} \langle B|\bar{s}s|B\rangle =a_B+b_BY+c_B\biggl (I(I+1)-\frac{1}{4}\, Y^2-1\biggr ),\nonumber \\ x_B\doteq & {} \langle B|\bar{u}u+\bar{d}d|B\rangle =a_B'+b_B'Y\nonumber \\&+\,c_B'\biggl (I(I+1)-\frac{1}{4}\,Y^2-1\biggr ), \end{aligned}$$
(5)

where Y and I stand, respectively, for the hypercharge and the isospin of the state \(|B\rangle \). From the Feynman–Hellmann theorem, one gets

$$\begin{aligned} y_B=\frac{\mathrm{d}m_B^2}{\mathrm{d}m_s},\quad x_B=\frac{\mathrm{d}m_B^2}{\mathrm{d}{\hat{m}}}. \end{aligned}$$
(6)

The following relations are straightforwardly obtained:

$$\begin{aligned} \gamma _B\doteq & {} \frac{c_B}{a_B-c_B}= \frac{2y_\Sigma -y_N-y_\Xi }{2(y_N+y_\Xi )-y_\Sigma },\nonumber \\ \beta _B\doteq & {} \frac{2b_B}{3(a_B-c_B)}= \frac{y_N-y_\Xi }{2(y_N+y_\Xi )-y_\Sigma },\nonumber \\ \gamma '_B\doteq & {} \frac{c'_B}{a'_B-c'_B}= \frac{2x_\Sigma -x_N-x_\Xi }{2(x_N+x_\Xi )-x_\Sigma },\nonumber \\ \beta '_B\doteq & {} \frac{2b'_B}{3(a'_B-c'_B)}= \frac{x_N-x_\Xi }{2(x_N+x_\Xi )-x_\Sigma }. \end{aligned}$$
(7)

Note that the quantities \(\gamma _B,\beta _B,\gamma _B',\beta _B'\) are scale-invariant in QCD and are therefore devoid of short-range ambiguities.

Further, these ratios can be straightforwardly calculated in the quark model, where the matrix element of the operator \({\bar{q}}q\) is merely given by the total number of quarks and antiquarks with flavor f present in the state B. A simple calculation gives

$$\begin{aligned} \gamma _B=\gamma '_B=0,\quad \beta _B=-\frac{2}{3},\quad \beta '_B=\frac{1}{3}. \end{aligned}$$
(8)

The same method can be applied, e.g., to the pseudoscalar meson octet. In this case, \(\beta _P=\beta '_P=0\) from the beginning, and the remaining coefficients are equal to

$$\begin{aligned} \gamma _P\doteq & {} \frac{2(y_\pi -y_K)}{4y_K-y_\pi },\nonumber \\ \gamma '_P\doteq & {} \frac{2(x_\pi -x_K)}{4x_K-x_\pi } , \end{aligned}$$
(9)

where

$$\begin{aligned} y_P=\frac{\mathrm{d}M_P^2}{\mathrm{d}m_s},\quad x_P=\frac{\mathrm{d}M_P^2}{\mathrm{d}{\hat{m}}},\quad P=\pi ,K. \end{aligned}$$
(10)

Note that the above relations remain valid even in the presence of \(\eta \)\(\eta '\) mixing. The matrix elements with \(\eta ,\eta '\) are merely not present there. The quark-model values of the above coefficients are easily calculated:

$$\begin{aligned} \gamma _P=-\frac{1}{2},\quad \gamma '_P=1, \end{aligned}$$
(11)

whereas, for example, using the wave functions for the tetraquark octet from Ref. [36], one gets completely different values: \(\gamma _P=1,~\gamma '_P=-\frac{1}{5}\).

The generalization to the other multiplets is straightforward. Below, we present the formulas for the baryon decuplet (note that, using SU(3) symmetry, one may rewrite the following formulas in different equivalent forms):

$$\begin{aligned} \gamma _\Delta =\frac{y_\Omega -y_{\Sigma ^*}}{2y_{\Sigma ^*}},\quad \gamma '_\Delta =\frac{x_\Omega -x_{\Sigma ^*}}{2x_{\Sigma ^*}}. \end{aligned}$$
(12)

The quark-model values are \(\gamma _\Delta =1\) and \(\gamma '_\Delta =-\frac{1}{2}\).

The argument then goes as follows. On the lattice, one may obtain the quark mass dependence of the various members of the multiplets and thus extract the scale-independent quantities \(\gamma ,\gamma ',\beta ,\beta '\) for a given multiplet. Any statistically significant deviation of these quantities from their quark model values can be interpreted as the effect coming from the SU(3) breaking and/or from the significant admixture of the non-quark-model states. Moreover, on the lattice one could determine the derivatives of the hadron masses with respect to the quark masses in the vicinity of the SU(3)-symmetric point \(m_s={\hat{m}}\) as well. On the basis of this analysis one could then unambiguously judge about the exotic content of the multiplets in the vicinity of this point.

Up to this point, there is little new information. The arguments like given above have been used in the past already (see, e.g., [37]). However, as mentioned in the introduction, one would like to generalize these arguments to the case of unstable states, which do not correspond to a single energy level on the lattice. Note that the different levels may have different quark mass dependence, so the quantities \(\gamma ,\gamma ',\beta ,\beta '\) cannot be defined unambiguously in this case. However, prior to considering the problem of the resonances, we would like to validate our arguments in the large-\(N_c\) limit, as well as through direct calculations in ChPT.

3 Validation of the method

3.1 Large-\(N_c\)

In the context of QCD, the quark-model relations are reproduced in the limit \(N_c\rightarrow \infty \), which corresponds to the quenching of the virtual quark loops in the path integral. One immediately arrives at the answer in the meson sector. First of all, note that \(y_\pi =0\) and, consequently, \(\gamma _P=-\frac{1}{2}\) (without assuming the exact SU(3) symmetry). Further, one gets \(x_\pi =2x_K\) and the quark-model value \(\gamma '_P=1\) is reproduced.

The situation with the baryons is more subtle. Of course, in the quenched theory, one immediately gets \(y_N=0\) (without assuming the exact SU(3) symmetry) and \(y_\Xi =2y_\Sigma \), \(x_N=3x_\Xi ,~x_\Sigma =2x_\Xi \), so that the quark model values \(\gamma _B=0,~\beta _B=-\frac{2}{3},~\gamma '_B=0,~\beta '_B=\frac{1}{3}\) are again reproduced. However, in the large-\(N_c\) limit, the baryons consist of \(N_c\) quarks and not just of three quarks—so, the above arguments do not apply straightforwardly. In fact, it was shown that, in this limit, baryons represent static objects (their mass grows like \(N_c\)), which at leading order in \(N_c\) can be described by using, e.g., a constituent quark model [38,39,40,41,42,43]. For any value of \(N_c\), the baryons, containing \(N_c\) quarks, belong to a completely symmetric irreducible representation of the SU(6) spin-flavor group. Counterparts of the “usual” baryons \(N,\Sigma ,\Xi ,\ldots \) are those members of the larger multiplets, which have the same spin-flavor quantum numbers. For example, the proton has spin \(s=s_z=\frac{1}{2}\), isospin \(I=I_z=\frac{1}{2}\), hypercharge \(Y=1\), charge \(Q=1\) and baryon number \(B=1\) for any given \(N_c\).

Let us now obtain the values of \(\gamma _B,~\beta _B,~\gamma '_B,~\beta '_B\) in the large-\(N_c\) limit. As known, for an arbitrary \(N_c\), the generators corresponding to the baryon number and the hypercharge are given by

$$\begin{aligned} {\hat{B}}=\frac{1}{N_c}\,\text{ diag }(1,1,1),\quad {\hat{Y}}=\frac{1}{N_c}\,\text{ diag }(1,1,1-N_c). \end{aligned}$$
(13)

It is straightforward to check that the mass term in the Lagrangian is given by

$$\begin{aligned} \mathcal{L}_m={\hat{m}}{\bar{\psi }}((N_c-1){\hat{B}}+{\hat{Y}})\psi +m_s{\bar{\psi }}({\hat{B}}-{\hat{Y}})\psi . \end{aligned}$$
(14)

From the above equation one may read off the values for \(x_B,y_B\) (up to an overall normalization factor)

$$\begin{aligned} x_N= & {} (N_c-1)+1,\quad x_\Sigma =(N_c-1),\nonumber \\ x_\Xi= & {} (N_c-1)-1,\quad y_N=0,\quad y_\Sigma =1,\quad y_\Xi =2, \end{aligned}$$
(15)

and, finally,

$$\begin{aligned} \gamma _B=\gamma _B'=0,\quad \beta _B=-\frac{2}{3},\quad \beta '_B=\frac{2}{3(N_c-1)}. \end{aligned}$$
(16)

As we see, the quantity \(\beta _B\) stays finite in the large-\(N_c\) limit, whereas the quantities \(\gamma _B,\gamma _B',\beta _B'\) all vanish in this limit. On the other hand, at \(N_c=3\) the quark model results are reproduced. Note also that our results are in complete agreement with Ref. [44] at the leading order in \(1/N_c\).

3.2 Chiral perturbation theory

3.2.1 Goldstone boson octet

At one loop, the pion and kaon masses in 3-flavor ChPT are given by (see, e.g. [45])

$$\begin{aligned} M_\pi ^2= & {} {\mathring{M}}_\pi ^2\biggl \{1+\mu _\pi -\frac{1}{3}\,\mu _\eta +2{\hat{m}} K_3+K_4\biggr \}\nonumber \\ M_K^2= & {} \mathring{M}_K^2\biggr \{1+\frac{2}{3}\,\mu _\eta +({\hat{m}}+m_s)K_3+K_4\biggr \}, \end{aligned}$$
(17)

where

$$\begin{aligned} \mu _P= & {} \frac{\mathring{M}_P^2}{32\pi ^2 F_0^2}\,\ln \frac{\mathring{M}_P^2}{\mu ^2},\quad P=\pi ,K,\eta , \nonumber \\ {\mathring{M}}_\pi ^2= & {} 2{\hat{m}}B_0,\quad \mathring{M}_K^2=({\hat{m}}+m_s)B_0,\nonumber \\ \mathring{M}_\eta ^2= & {} \frac{2}{3}\,({\hat{m}}+2m_s)B_0,\quad K_3=\frac{8B_0}{F_0^2}\,(2L^r_8-L^r_5),\nonumber \\ K_4= & {} (2{\hat{m}}+m_s)\frac{16B_0}{F_0^2}\,(2L^r_6-L^r_4), \end{aligned}$$
(18)

with \(B_0,F_0,L_i^r\) the parameters of the ChPT Lagrangian, \(\mathring{M}_P^2\) are the pseudoscalar meson masses squared at leading order, and \(\mu \) denotes the scale of dimensional regularization. Calculating the parameters \(\gamma _P,\gamma '_P\) from these expressions, we get

$$\begin{aligned} \gamma _P= & {} -\frac{1}{2}\,\biggl \{ 1+\frac{{\mathring{M}}_\pi ^2}{96\pi ^2 F_0^2}\biggl ( \ln \frac{\mathring{M}_\eta ^2}{\mu ^2}+1\biggr ) \nonumber \\&-\frac{12 {\mathring{M}}_\pi ^2}{F_0^2}\,(2L^r_6-L^r_4)\biggr \},\nonumber \\ \gamma '_P= & {} 1+\frac{3{\mathring{M}}_\pi ^2}{16\pi ^2F_0^2}\, \ln \frac{{\mathring{M}}_\pi ^2}{\mu ^2} -\frac{6\mathring{M}_\eta ^2+{\mathring{M}}_\pi ^2}{48\pi ^2F_0^2}\ln \frac{\mathring{M}_\eta ^2}{\mu ^2} \nonumber \\&+\frac{7{\mathring{M}}_\pi ^2-3\mathring{M}_\eta ^2}{96\pi ^2F_0^2} +\frac{36({\mathring{M}}_\pi ^2-\mathring{M}_\eta ^2)}{F_0^2}\,(2L^r_8-L^r_5) \nonumber \\&-\frac{24(3\mathring{M}_\eta ^2-{\mathring{M}}_\pi ^2)}{F_0^2}\,(2L^r_6-L^r_4). \end{aligned}$$
(19)

If we recall now that, in the large-\(N_c\) limit, \(F_0=O(N_c^{1/2})\), \(L_5^r,L_8^r=O(N_c)\), \(L_4^r,L_6^r=O(1)\) and the meson masses are of order 1 as well, it is seen that \(\gamma _P,\gamma '_P\) tend to the quark-model values in the limit \(N_c\rightarrow \infty \) and \(m_s={\hat{m}}\). Further, one could estimate the deviation of these parameters in the real world from their exact quark-model values. To this end, we replace \(F_0\) by the pion decay constant \(F_\pi =92.2~\text{ MeV }\) and \({\mathop {M}\limits ^\mathsf{o}}_\pi \), \({\mathop {M}\limits ^\mathsf{o}}_K\) by the physical meson masses (\({\mathop {M}\limits ^\mathsf{o}}_\eta \) is determined from the Gell-Mann–Okubo relation). Using the central values for the low-energy constants (LECs) \(L_i^r\) from Ref. [45] at \(\mu =770~\text{ MeV }\)

$$\begin{aligned} L_4^r=L_6^r=0,\quad L_5^r=2.2\cdot 10^{-3},\quad L_8^r=1.1\cdot 10^{-3}, \end{aligned}$$
(20)

we get

$$\begin{aligned} \gamma _P=-\frac{1}{2}\,(1+9\cdot 10^{-4}),\quad \gamma '_P=1+4\cdot 10^{-2}. \end{aligned}$$
(21)

In order to estimate the uncertainty, we present the results obtained by using the LECs from Ref. [46]. There are different sets of LECs in this paper. We use the \(O(p^4)\) fit from Table 1: \(L_4^r=L_6^r=0,~L_5^r=1.2\times 10^{-3},~L_8^r=0.5\times 10^{-3}\), as well as the fit to lattice data: \(L_4^r=0.04\times 10^{-3},~L_6^r=0.07\times 10^{-3},~L_5^r=0.84\times 10^{-3},~L_8^r=0.36\times 10^{-3}\); see Table 5 of this paper. The results are

$$\begin{aligned} \gamma _P= & {} -\frac{1}{2}\,(1+9\times 10^{-4}),\quad \gamma '_P=1+0.30 \quad [O(p^4)],\nonumber \\ \gamma _P= & {} -\frac{1}{2}\,(1-2\times 10^{-3}),\quad \gamma '_P=1-7\times 10^{-2} \quad [\text{ lattice }],\nonumber \\ \end{aligned}$$
(22)

The SU(3)-symmetric point is achieved from the physical point by varying the quark masses such that the sum of all quark masses \(2{\hat{m}}+m_s=\text{ const }\). This corresponds to \(M_\pi =M_K=M_\eta =413~\text{ MeV }\). At this point, using the LECs from Ref. [45], we get

$$\begin{aligned} \gamma _P=-\frac{1}{2}\,(1-5\times 10^{-3}),\quad \gamma '_P=1-2\times 10^{-2}, \end{aligned}$$
(23)

whereas the use of the LECs from Ref. [46] gives the following answer:

$$\begin{aligned} \gamma _P= & {} -\frac{1}{2}\,(1-5\times 10^{-3}),\quad \gamma '_P=1-2\times 10^{-2} \quad [O(p^4)],\nonumber \\ \gamma _P= & {} -\frac{1}{2}\,(1-3\times 10^{-2}),\quad \gamma '_P=1-0.12 \quad [\text{ lattice }].\nonumber \\ \end{aligned}$$
(24)

As we see, the corrections to the quark model values for the pseudoscalar octet are reasonably small. The uncertainties, which stem from the LECs, are, however, sizable and the correlations between various LECs should be taken into account.

3.2.2 Ground-state baryon octet

The case of the baryon octet is another illustration of the method. One could use the result on the baryon masses in ChPT, which are available in the literature; see e.g. Refs. [47,48,49]. In this article, we restrict ourselves to the \(O(p^2)\) calculations. The relevant part of the effective Lagrangian is given by

$$\begin{aligned} \mathcal{L}^{(2)}= & {} b_D\langle {\bar{B}}\{\chi _+,B\}\rangle +b_F\langle {\bar{B}}[\chi _+,B]\rangle \nonumber \\&+\,b_0\langle {\bar{B}}B\rangle \langle \chi _+\rangle +\cdots , \end{aligned}$$
(25)

where the matrix B is the baryon octet field and \(\chi _+=u^\dagger \chi u^\dagger +u\chi u\), with \(\chi =2B_0\mathcal{M},~\mathcal{M}=\text{ diag }({\hat{m}},{\hat{m}},m_s)\). Further, \(u=\exp (i\phi /(2F_0))\) is the Goldstone boson field and \(b_D,b_F,b_0\) are the pertinent LECs. The expressions for the coefficients \(\gamma _B,\gamma '_B,\beta _B,\beta '_B\) can be readily obtained from the calculated baryon masses at this order

$$\begin{aligned} \gamma _B= & {} -\frac{2b_D}{3b_0+4b_D},\quad \gamma '_B=\frac{b_D}{3b_0+b_D},\nonumber \\ \beta _B= & {} -\frac{2b_F}{3b_0+4b_D},\quad \beta '_B=\frac{b_F}{3b_0+b_D}. \end{aligned}$$
(26)

Unlike the Goldstone boson case, these expressions do not reduce automatically at lowest order to the quark-model values given by Eq. (8). The quark-model results are reproduced, if \(b_D/b_0=0\) and \(b_F/b_0=1\). This statement seems to be supported by phenomenological values of these LECs [47] (all in \(\text{ GeV }^{-1}\) units)

$$\begin{aligned} -0.79\le & {} b_0\le -0.70,\quad 0.01\le b_D\le 0.07,\nonumber \\ -0.61\le & {} b_F\le -0.48. \end{aligned}$$
(27)

It could be interesting to consider the constraints on the higher-order LECs, which emerge in a similar fashion and, possibly, from the observables other than the baryon masses as well. However, since the convergence in the SU(3) baryon ChPT is a somewhat painful issue [50, 51], we do not consider any further constraints here. The results of Ref. [52, 53] support this statement. For example, the NLO results for these constants are \(b_0=-0.273(6),~b_D=0.0506(17),~b_F=-0.179(1)\), whereas at the NNLO one has \(b_0=-0.886(5),~b_D=0.0482(17),~b_F=-0.517(7)\) (all in \(\text{ GeV }^{-1}\) units). One sees that the general pattern persists, albeit the values of the LECs vary quite a bit.

In Ref. [44], comparing the chiral Lagrangians at arbitrary \(N_c\) and at \(N_c=3\), it was shown that \(b_D/b_0=0\) and \(b_F/b_0=1\) hold in the limit of large \(N_c\). Therefore, the quark-model values are exact at \(N_c=3\) up to the corrections that vanish in the large-\(N_c\) limit. Note also that in Eq. (26) the large-\(N_c\) limit cannot be performed straightforwardly, because it is obtained from the Lagrangian given in Eq. (25), which is defined at \(N_c=3\) only. In fact, as easily seen, the above constraints on \(b_0,b_D,b_F\) are incompatible with \(\beta '_B\rightarrow 0\) and \(\beta _B\ne 0\) in the large-\(N_c\) limit.

3.2.3 Ground-state vector meson octet

The vector meson octet contains unstable particles. So, strictly speaking, our formulas are not directly applicable there. However, in order to get an intuitive understanding of the problem, one could still use an effective Lagrangian with vector mesons (see, e.g., Ref. [54]) and evaluate the pertinent coefficients \(\gamma _V,\gamma '_V\), defined through the Eqs. (9, 10) with the replacements \(\pi \rightarrow \rho \) and \(K\rightarrow K^*\). The quark-mass-dependent part of the effective Lagrangian is given by [54]

$$\begin{aligned} \mathcal{L}_m= & {} a_1\langle \{W_\mu ^\dagger ,W^\mu \}\chi _+\rangle +a_{13}\langle W_\mu ^\dagger W^\mu \rangle \langle \chi _+\rangle \nonumber \\&+\,c_1\langle W_\mu ^\dagger \chi _+W^\mu \chi _+\rangle +c_2\langle \{W_\mu ^\dagger ,W^\mu \}\chi _+\chi _+\rangle , \end{aligned}$$
(28)

where the LEC \(a_{13}\) is suppressed at \(O(N_c^{-1})\) with respect to the other LECs (we do not display the \(1/N_c\)-suppressed terms at order \(p^4\)).

As is well known, the Goldstone boson loops are suppressed by a factor \(N_c\). For this reason, we drop them in the expression of the masses completely. The contact term contribution to the masses is given by

$$\begin{aligned} M_\rho ^2= & {} \mathring{M}_V^2+8a_1B_0{\hat{m}} +4a_{13}B_0(2{\hat{m}}+m_s)\nonumber \\&+16(c_1+2c_2)(B_0{\hat{m}})^2,\nonumber \\ M_{K^*}^2= & {} \mathring{M}_V^2+4a_1B_0({\hat{m}}+m_s)+4a_{13}B_0(2{\hat{m}}+m_s)\nonumber \\&+16c_1B_0^2{\hat{m}}m_s+16c_2B_0^2({\hat{m}}^2+{\hat{m}}_s^2), \end{aligned}$$
(29)

with \(\mathring{M}_V^2\) the vector meson octet mass in the chiral limit. From this, one can immediately read off the expressions for \(\gamma _V,\gamma '_V\):

$$\begin{aligned} \gamma _V= & {} \frac{2(y_\rho -y_{K^*})}{4y_{K^*}-y_\rho }= -\frac{1}{2}\, \frac{4a_1}{4a_1+3a_{13}}\nonumber \\&+O(p^4)+O(p^2N_c^{-1}),\nonumber \\ \gamma '_V= & {} \frac{2(x_\rho -x_{K^*})}{4x_{K^*}-x_\rho }= \frac{a_1}{a_1+3a_{13}}-\frac{12c_1(M_K^2-M_\pi ^2)}{a_1+a_{13}}\nonumber \\&+O(p^4) +O(p^2N_c^{-1}). \end{aligned}$$
(30)

It is seen that the corrections to \(\gamma _V,\gamma '_V\) vanish in the large-\(N_c\) and SU(3) symmetry limit. From this example, it becomes clear, how the effective theory for the exotic multiplets (e.g., the scalar mesons) could be constructed. The operator basis, whose form is dictated by the symmetries, is the same in the case of the standard and exotic particles. The difference emerges at the level of the LECs: certain LECs in the effective Lagrangian with exotic particles are not suppressed in the large-\(N_c\) limit.

4 Testing with lattice data

The QCDSF collaboration has studied the quark mass dependence of hadron masses in various meson and baryon multiplets. This analysis provides us with an ideal input to test our predictions. In order to make the comparison straightforward, both for the bare and renormalized quark masses, we define

$$\begin{aligned} {\bar{m}}= & {} \frac{1}{3}\,(2{\hat{m}}+m_s),\quad m_1={\hat{m}}-m_s,\nonumber \\ {\bar{m}}^r= & {} \frac{1}{3}\,(2{\hat{m}}^r+m_s^r),\quad m_1^r={\hat{m}}^r-m_s^r, \end{aligned}$$
(31)

The bare and renormalized quark masses are related by

$$\begin{aligned} {\bar{m}}^r=Z_m^S{\bar{m}},\quad m_1^r=Z_m^{NS}m_1,\quad Z_m^S/Z_m^{NS}=1+\alpha _Z. \end{aligned}$$
(32)

Calculating the derivatives with respect to the renormalized masses, we get

$$\begin{aligned} y_a\doteq & {} \frac{\partial M_a^2}{\partial m_s^r}\biggr |_{{\hat{m}}^r=\mathsf{const}} =\frac{1}{3}\,\frac{\partial M_a^2}{\partial {\bar{m}}^r}\biggr |_{m_1^r=\mathsf{const}} -\frac{\partial M_a^2}{\partial m_1^r}\biggr |_{{\bar{m}}^r=\mathsf{const}}, \nonumber \\ x_a\doteq & {} \frac{\partial M_a^2}{\partial {\hat{m}}^r}\biggr |_{m_s^r=\mathsf{const}} =\frac{2}{3}\,\frac{\partial M_a^2}{\partial {\bar{m}}^r}\biggr |_{m_1^r=\mathsf{const}} +\frac{\partial M_a^2}{\partial m_1^r}\biggr |_{{\bar{m}}^r=\mathsf{const}}.\nonumber \\ \end{aligned}$$
(33)

Here, the \(M_a\) denote the hadron masses in a given multiplet. One may relate this to the derivative with respect to the bare masses

$$\begin{aligned} \frac{\partial M_a^2}{\partial {\bar{m}}^r}= & {} \frac{1}{Z_m^S}\,\frac{\partial M_a^2}{\partial {\bar{m}}} =\frac{1}{Z_m^{NS}(1+\alpha _Z)}\,\frac{\partial M_a^2}{\partial {\bar{m}}},\nonumber \\ \frac{\partial M_a^2}{\partial {\bar{m}}_1^r}= & {} \frac{1}{Z_m^{NS}}\,\frac{\partial M_a^2}{\partial m_1}. \end{aligned}$$
(34)

SU(3) symmetry in the vicinity of the symmetric point introduces additional constraints on the derivatives with respect to \(m_1\). Namely, retaining only linear terms in \(m_1\), the hadron masses in the multiplets are given by [55]

pseudoscalar octet:

$$\begin{aligned} M_\pi ^2= & {} M_P^2+\frac{2}{3}\,\alpha _P m_1,\quad M_K^2=M_P^2-\frac{1}{3}\,\alpha _P m_1,\nonumber \\ M_{\eta _s}^2= & {} M_P^2-\frac{4}{3}\,\alpha _P m_1. \end{aligned}$$
(35)

vector octet:

$$\begin{aligned} M_\rho= & {} M_V+\frac{2}{3}\,\alpha _V m_1,\quad M_{K^*}=M_V-\frac{1}{3}\,\alpha _V m_1,\nonumber \\ M_{\phi _s}= & {} M_V-\frac{4}{3}\,\alpha _V m_1. \end{aligned}$$
(36)

baryon octet:

$$\begin{aligned} M_N= & {} M_B+A_1m_1,\quad M_\Lambda =M_B+A_2m_1,\nonumber \\ M_\Sigma= & {} M_B-A_2m_1,\quad M_\Xi =M_B-(A_1-A_2)m_1. \end{aligned}$$
(37)

baryon decuplet:

$$\begin{aligned} M_\Delta= & {} M_D+Am_1,\quad M_{\Sigma ^*}=M_D, \nonumber \\ M_{\Xi ^*}= & {} M_D-Am_1,\quad M_\Omega =M_D-2m_1. \end{aligned}$$
(38)

The expansion coefficients depend on the variable \({\bar{m}}\). Calculating the derivatives with respect to the variables \({\bar{m}},\,m_1\) and introducing the notation

$$\begin{aligned} \lambda _P= & {} \frac{1}{1+\alpha _Z}\,\frac{dM_P^2}{d{\bar{m}}},\quad \lambda _A=\frac{1}{1+\alpha _Z}\,\frac{dM_A}{d{\bar{m}}},\nonumber \\ A= & {} V,B,D, \end{aligned}$$
(39)

one gets

$$\begin{aligned} \gamma _P= & {} \frac{-2(\alpha _P/\lambda _P)}{1+2(\alpha _P/\lambda _P)},\quad \gamma '_P=\frac{(\alpha _P/\lambda _P)}{1-(\alpha _P/\lambda _P)},\nonumber \\ \gamma _V= & {} \frac{-2(\alpha _V/\lambda _V)}{1+2(\alpha _V/\lambda _V)},\quad \gamma '_V=\frac{(\alpha _V/\lambda _V)}{1-(\alpha _V/\lambda _V)},\nonumber \\ \gamma _B= & {} \frac{(3A_2/\lambda _B)}{1-(3A_2/\lambda _B)},\quad \gamma '_B=\frac{-(3A_2/\lambda _B)/2}{1-(3A_2/\lambda _B)/2},\nonumber \\ \beta _B= & {} \frac{-2A_1/\lambda _B+A_2/\lambda _B}{1-(3A_2/\lambda _B)},\quad \beta '_B=\frac{A_1/\lambda _B-A_2/(2\lambda _B)}{1+(3A_2/\lambda _B)/2},\nonumber \\ \gamma _D= & {} \frac{3A}{\lambda _D},\quad \gamma '_D=-\frac{3A}{2\lambda _D}. \end{aligned}$$
(40)

Note that the above relations are valid exactly in the SU(3)-symmetric point, where \(m_1=0\) and the masses of all hadrons in the same multiplet are equal.

The quark-model values are

$$\begin{aligned} \frac{2\alpha _P}{\lambda _P}= & {} \frac{2\alpha _V}{\lambda _V}=1,\quad \frac{3A_1}{\lambda _B}=1,\nonumber \\ \frac{3A_2}{\lambda _B}= & {} 0,\quad \frac{3A}{\lambda _D}=1, \end{aligned}$$
(41)

whereas, e.g., the tetraquark value is completely different:

$$\begin{aligned} \frac{2\alpha _T}{\lambda _T}=-\frac{1}{2}. \end{aligned}$$
(42)

That is, even the sign is different from the case of the the ordinary \({\bar{q}}q\)-mesons, allowing one to clearly distinguish between both QCD configurations.

Having set the definitions, we proceed with the verification of our method using the lattice data from Ref. [55]. We will only consider results corresponding to the largest lattice size \(32^3\times 64\), hence avoiding the discussion of finite-size lattice corrections. Bare quark masses are defined as

$$\begin{aligned} am_q=\frac{1}{2}\left( \frac{1}{\kappa _{q}}-\frac{1}{\kappa _{0,c}}\right) . \end{aligned}$$
(43)

On the symmetric line, i.e. for \({\hat{m}}=m_s={\bar{m}}\) and \(\kappa _l=\kappa _s=\kappa _0\), the QCDSF results for the pseudoscalars, vectors, octet and decuplet baryons are given in Table 1. Note that the results corresponding to the second row, \(\kappa _0=0.12092\), have been slightly updated with respect to the values in Ref. [55].

Table 1 Updated values of hadron masses on the symmetric line from the \(32^3\times 64\) lattice [55]. The input in the second row is updated as compared to Ref. [55] (this is marked by the \(*\))
Fig. 1
figure 1

Linear fit for pseudoscalar and vector mesons and octet and decuplet baryon along the symmetric line. In the case of the vectors, only the first three points have been considered in the fit, since the last one corresponds to an unstable particle

Table 2 Pseudoscalar meson masses \(aM_\pi \), \(aM_K\) and \(aM_{\eta _s}\) and vector meson masses \(M_\rho \), \(aM_{K^*}\) and \(aM_{\phi _s}\) on the \(\bar{m}=\text{ constant }\) line from the \(32^3\times 64\) lattice [55]. The input in the fourth row is updated as compared to Ref. [55] (this is marked by the \(*\))
Table 3 Octet baryon masses \(aM_N\), \(aM_\Lambda \), \(aM_\Sigma \) and \(aM_\Xi \) and decuplet baryon masses \(M_\Delta \), \(aM_{\Sigma ^*}\), \(aM_{\Xi ^*}\) and \(aM_\Omega \) on the \(\bar{m}=\text{ constant }\) line for the \(32^3\times 64\) lattice [55]. The input in the fourth row is updated as compared to Ref. [55] (this is marked by the \(*\))

Chiral symmetry requires the pion mass to vanish in the chiral \({\bar{m}}\rightarrow 0\) limit, which allows one to determine the critical hopping parameter \(\kappa _{0,c}\) by extrapolating the pseudoscalar masses through the symmetric line. Thus, considering at first order

$$\begin{aligned} M_\pi ^2= \frac{a_\pi }{\kappa _0}+b_\pi =a_\pi \left( \frac{1}{\kappa _0}-\frac{1}{\kappa _{0,c}}\right) =2 a_\pi {\bar{m}},\;\kappa _{0,c}{=}-\frac{a_\pi }{b_\pi }, \end{aligned}$$
(44)

and fitting the data in Table 1, one can directly extract the value of the \(a_\pi \) and \(b_\pi \) coefficients, namely

$$\begin{aligned} a_\pi =\frac{1}{2}\frac{dM_P^2}{d{\bar{m}}}=2.249\pm 0.037,\quad b_\pi =-18.57\pm 0.31,\nonumber \\ \end{aligned}$$
(45)

so that the critical hopping parameter reads

$$\begin{aligned} \kappa _{0,c}=0.12098\pm 0.00283. \end{aligned}$$
(46)

The uncertainties have been computed using a bootstrap with a normally distributed sample of 1000 points, with a standard deviation defined from the 68% of the distribution.

Once the critical hopping parameter \(\kappa _{0,c}\) has been obtained, Eq. (43) allows one to study the behavior of the vector and octet and decuplet baryons at the symmetric line. Considering again a linear fit to the data in Table 1, we obtain

$$\begin{aligned} \frac{\mathrm{d}M_V}{\mathrm{d}{\bar{m}}}= & {} 7.66\pm 2.47,\quad \frac{\mathrm{d}M_B}{\mathrm{d}{\bar{m}}}=19.33\pm 2.3,\nonumber \\ \frac{\mathrm{d}M_D}{\mathrm{d}{\bar{m}}}= & {} 12.28\pm 4.13, \end{aligned}$$
(47)

where the errors have been computed using again a bootstrap method.

The fit results, together with the lattice data in Table 1 are plotted in Fig. 1, showing a nice linear behavior and hence the fit quality. Note that for the vector mesons we have got rid of the last data point, which lies quite well above the \(2m_\pi \) threshold and hence corresponds to an unstable particle.

Beyond the symmetric line, no reference to \(\kappa _{0,c}\) is needed, since

$$\begin{aligned} a m_1=\frac{1}{2}\left( \frac{1}{\kappa _l}-\frac{1}{\kappa _s}\right) . \end{aligned}$$
(48)

The QCDSF collaboration results for pseudoscalar and vector mesons, and octet and decuplet baryons corresponding to the \(32^3\times 64\) lattice size can be found in Tables 2 and 3. Once more, the results of the fourth row, i.e. for \(\kappa _l=0.121145\) and \(\kappa _s=0.120413\), are also updated as compared with Ref. [55].

Fig. 2
figure 2

Fit results for pseudoscalar (left up panel) and vector (right up panel) mesons and octet (left down panel) and decuplet (right down panel) baryons along the non-symmetric line

Using these data together to Eq. (48), we can perform a linear fit to compute the coefficients in Eqs. (36)–(38), getting

$$\begin{aligned} \alpha _P&=1.242\pm 0.003,\quad \alpha _V=1.64 \pm 0.05,\nonumber \\ A_1&=2.26\pm 0.09,\quad A_2=0.51\pm 0.05,\\ A&=1.33\pm 0.07,\nonumber \end{aligned}$$
(49)

where, once more, the errors have been computed using a bootstrap with a normally distributed sample of 1000 points, and the fit results have been plotted in Fig. 2. As in the symmetric case, we have not included the last two points for the \(\rho \) and \(K^*\) mesons, which correspond to mass values well above the \(\pi \pi \) and \(\pi K\) threshold, respectively, and hence to unstable particles. As one can see in Fig. 2, the linear fits work extremely well for pseudoscalar mesons, and relatively well for vectors and octet baryons. On the contrary, the decuplet results show deviations from a linear behavior. Nevertheless, their description does not improve with a quadratic function either and a linear behavior will be assumed in order to test our criterion for the decuplet.

4.1 Pseudoscalar mesons

Since the data on the pseudoscalar meson masses are almost linear in the quark masses—in other words, the Gell-Mann–Oakes–Renner relation is obeyed very well, we will assume an exact linear dependence to extract the renormalization constant \(\alpha _Z\),

$$\begin{aligned} \alpha _Z=\frac{1}{2\alpha _P}\frac{\mathrm{d}M_P^2}{\mathrm{d}{\bar{m}}}-1=0.811\pm 0.029, \end{aligned}$$
(50)

so our criterion turns into an identity for the pseudoscalar mesons

$$\begin{aligned} \frac{2\alpha _P}{\lambda _P}=1. \end{aligned}$$
(51)

In the case of the vector mesons as well as for the octet and decuplet baryons, using the results in Eqs. (47) and (49), together with the renormalization constant in (50), one gets for the ratios in Eq. (41) the values:

4.2 Vector mesons:

$$\begin{aligned} \frac{2\alpha _V}{\lambda _V}=0.778\pm 0.014, \end{aligned}$$
(52)

4.3 Baryon octet:

$$\begin{aligned} \frac{3A_1}{\lambda _B}=0.635\pm 0.080,\quad \frac{3A_2}{\lambda _B}=0.144\pm 0.023,\quad \end{aligned}$$
(53)

4.4 Baryon decuplet:

$$\begin{aligned} \frac{3A}{\lambda _D}=0.590\pm 0.202. \end{aligned}$$
(54)

These results confirm a behavior for vectors and octet and decuplet baryons close to the quark model prediction, with deviations compatible with a \(1/N_c\simeq 30\%\) correction, as expected from our analysis in Sect. 3. In particular, the results for vector mesons are far away from a tetraquark prediction,Footnote 2 confirming their ordinary \({\bar{q}}q\) nature, as it has been known for long from vector meson dominance models [56, 57] or the \(1/N_c\) expansion [58,59,60,61,62,63]. These results are hence a check of the consistency of our method, opening the way to its application to non-ordinary meson candidates, as for instance the light scalar mesons. Nevertheless, this requires a generalization of the Feynman–Hellman theorem for resonances. That is what we will address in the next Section.

5 Unstable particles

In this section, we present a derivation of the analog of the Feynman–Hellmann theorem for resonances in quantum field theory. To this end, we first consider stable (w.r.t. the strong interactions) particles again (say, the Goldstone bosons) and give a derivation in the language of the Green functions, which does not refer to the eigenstates and eigenvalues of the QCD Hamiltonian at all.

Let \(P^a(x)={\bar{q}}(x)\,i\gamma _5\frac{1}{2}\,\lambda ^a q(x)\) be a composite field that describes the Goldstone bosons. Here, \(\lambda ^a,~a=1,\ldots 8\), are the Gell-Mann flavor matrices. The two-point function of these fields

$$\begin{aligned} D^{ab}(p^2)=i\int d^4xe^{ipx}\langle 0|TP^a(x)P^b(0)|0\rangle \end{aligned}$$
(55)

contains a single pole at the physical Goldstone boson mass

$$\begin{aligned} D^{ab}(p^2)\rightarrow \frac{\delta ^{ab}Z_a}{M_a^2-p^2}+\cdots ,\quad a=1,\ldots 8. \end{aligned}$$
(56)

Here, the ellipses stand for the terms which are regular at \(p^2\rightarrow M_a^2\). For simplicity, we do not consider the case \(a=8\) any further, as \(\eta \)\(\eta '\) mixing would have to be taken into account. The constant \(Z_a\) can be expressed via the matrix element of \(P^a\) between the vacuum and the one-particle state:

$$\begin{aligned} Z_a^{1/2}=\langle 0|P^a(0)|P,a\rangle . \end{aligned}$$
(57)

Next, we shall use the fact that, in perturbation theory, one can shuffle the quark mass term between the free Lagrangian and the interaction part. We shall use this freedom to put all of the mass term into the interaction.Footnote 3 In this case, the two-point function is given by

$$\begin{aligned} \langle 0|TP^a(x)P^b(0)|0\rangle= & {} U_0^{-1} \bigg \langle \Omega _0\bigg |TP_0^a(x)P_0^b(0)\nonumber \\&\times \exp \bigg (i\int d^4y\mathcal{L}_{\mathrm{int}}(y)\bigg )\bigg |\Omega _0\bigg \rangle , \nonumber \\ U_0= & {} \biggl \langle \Omega _0\biggl |T\exp \biggl (i\int d^4y\mathcal{L}_{\mathrm{int}}(y)\biggr |\Omega _0\biggr \rangle ,\nonumber \\ \end{aligned}$$
(58)

where the interaction part of the QCD Lagrangian is split into the quark-mass-dependent and quark-mass-independent parts according to

$$\begin{aligned} \mathcal{L}_{\mathrm{int}}(x)=\mathcal{L}_0(x)-\sum _q Z_F^{-1}Z_m m_q^r{\bar{q}}_0(x)q_0(x). \end{aligned}$$
(59)

Here, \(q_0(x)\) is the free massless quark field, \(P^a_0(x)={\bar{q}}_0(x)\,i\gamma _5\frac{1}{2}\,\lambda ^a q_0(x)\), and \(|\Omega _0\rangle \) denotes the vacuum in the theory with massless quarks. The quark-mass-independent part of the Lagrangian is not shown explicitly. Further, \(Z_F,Z_m\) denote the quark wave function and quark mass renormalization constants, respectively, and \(m_q^r\) are the renormalized quark masses. For simplicity, the minimal subtraction scheme is implied, where \(Z_F,Z_m\) do not depend on the quark flavor and masses.

Let us now ask the question: how do the Goldstone boson masses depend on the renormalized quark masses \(m_q^r(\mu )\), with the scale \(\mu \) and the renormalized coupling constant \(g^r(\mu )\) fixed? In order to answer this question, we shall first differentiate Eq. (56) with respect to the quark mass \(m_q^r\):

$$\begin{aligned} \frac{\partial D^{ab}(p^2)}{\partial m_q^r} {\rightarrow }{-}\frac{Z_a\delta ^{ab}}{(M_a^2{-}p^2)^2}\,\frac{\partial M_a^2}{\partial m_q^r} +\text{ single } \text{ pole }+\text{ regular }.\nonumber \\ \end{aligned}$$
(60)

On the other hand, differentiating Eq. (58) with respect to \(m_q^r\), one obtains

$$\begin{aligned}&\frac{\partial D^{ab}(p^2)}{\partial m_q^r} =\int \mathrm{d}^4xe^{ipx} \biggl \{U_0^{-1}\biggl \langle \Omega _0\biggl |TP_0^a(x)P_0^b(0)\nonumber \\&\qquad \times \exp \biggl (i\int \mathrm{d}^4y\mathcal{L}_{\mathrm{int}}(y)\biggr )\nonumber \\&\qquad \times \int \mathrm{d}^4z\, Z_F^{-1}Z_m{\bar{q}}_0(z)q_0(z) \biggr |\Omega _0\biggr \rangle \nonumber \\&\qquad -\,U_0^{-2}\frac{\partial U_0}{\partial m_q^r} \biggl \langle \Omega _0\biggl |TP_0^a(x)P_0^b(0)\nonumber \\&\qquad \times \exp \biggl (i\int \mathrm{d}^4y\mathcal{L}_{\mathrm{int}}(y)\biggr )\biggr |\Omega _0\biggr \rangle \biggr \} \nonumber \\&\quad =\int \mathrm{d}^4x \mathrm{d}^4z e^{ip(x-z)} \biggl \{U_0^{-1}\biggl \langle \Omega _0\biggl |TP_0^a(x)P_0^b(z) \nonumber \\&\qquad \times \exp \biggl (i\int \mathrm{d}^4y\mathcal{L}_{\mathrm{int}}(y)\biggr )\nonumber \\&\qquad \times Z_F^{-1}Z_mS_0(0) \biggr |\Omega _0\biggr \rangle \nonumber \\&\qquad -\,U_0^{-1}\biggl \langle \Omega _0\biggl |TP_0^a(x)P_0^b(z) \exp \biggl (i\int \mathrm{d}^4y\mathcal{L}_{\mathrm{int}}(y)\biggr )\biggr |\Omega _0\biggr \rangle \nonumber \\&\qquad \times U_0^{-1}\biggl \langle \Omega _0\biggl |TZ_F^{-1}Z_mS_0(0) \exp \biggl (i\int \mathrm{d}^4y\mathcal{L}_{\mathrm{int}}(y)\biggr )\biggr |\Omega _0\biggr \rangle \biggr \}.\nonumber \\ \end{aligned}$$
(61)

Here, \(S_0(x)={\bar{q}}_0(x)q_0(x)\) is the (unrenormalized) scalar density. It is now seen that the above equation can be written in the form

$$\begin{aligned} \frac{\partial D^{ab}(p^2)}{\partial m_q^r} =\Gamma ^{ab}(p,p), \end{aligned}$$
(62)

where

$$\begin{aligned}&\Gamma ^{ab}(p,q)=\int d^4x\, d^4z\, e^{ipx-iqz} \langle 0|TP^a(x)P^b(z)S^r(0)|0\rangle _\mathsf{conn}, \nonumber \\&\quad \langle 0|TP^a(x)P^b(z)S^r(0)|0\rangle _\mathsf{conn} =\langle 0|TP^a(x)P^b(z)S^r(0)|0\rangle \nonumber \\&\qquad -\langle 0|TP^a(x)P^b(z)|0\rangle \langle 0|TS^r(0)|0\rangle , \end{aligned}$$
(63)

and \(S^r(x)=Z_F^{-1}Z_mS_0(x)\) denotes the renormalized scalar density.

Further, inserting a complete set of states in the above equation, it is straightforward to see that the quantity \(\Gamma ^{ab}(p,q)\) contains a double pole in the variables \(p^2,q^2\):

$$\begin{aligned}&\Gamma ^{ab}(p,q)\rightarrow \nonumber \\&\quad -\frac{\delta ^{ab}\langle 0|P^a(0)|P,a\rangle \langle P,a|S^r(0)|P,b\rangle \langle P,b| P^b(0)|0\rangle }{(M_a^2-p^2)(M_a^2-q^2)} +\cdots ,\nonumber \\ \end{aligned}$$
(64)

where the ellipses denote the less singular terms. Comparing the coefficients in front of the double pole in Eqs. (60) and (64), we finally arrive at the Feynman–Hellmann theorem,

$$\begin{aligned} \frac{\partial M_a^2}{\partial m_q^r}=\langle P,a|S^r(0)|P,a\rangle . \end{aligned}$$
(65)

As already mentioned, the advantage of this derivation as compared to the standard one is that it does not refer to the (real) eigenvalues of the Hamiltonian \(E_n(\lambda )\) from the beginning, dealing instead with the Green functions in QCD. Hence, this derivation can be directly generalized to the resonances.Footnote 4 In the latter case, the two-point function is defined by an expression similar to Eq. (55) where, instead of \(P^a(x)\), one may use any operator with the quantum number of a given resonance. Then the two-point function has a pole on some unphysical Riemann sheet of the complex \(p^2\)-plane and not on the real axis. By the same token, the matrix element of the operator \(S^r(x)\) between the resonance “states” is defined similarly to Eq. (64), through the residue of the three-point function at the double pole.Footnote 5 Consequently, the only difference between the stable states and the resonances boils down to the question whether a pole is real or not. This difference is inessential for the derivation of the Feynman–Hellmann theorem, which is given above. Consequently, it still holds, if one interprets \(M_a^2\) as a resonance pole position in the complex plane and not as the energy of an isolated energy level on the lattice. Furthermore, the large-\(N_c\) and SU(3) symmetry arguments apply to the Green functions irrespective of whether they have a real or a complex pole. Consequently, the physical meaning of the parameters \(\gamma ,\gamma ',\beta ,\beta '\) remains the same, albeit they become complex for resonances. Namely, if these happen to be close to the (real) quark model values for a given multiplet, then this multiplet has a little admixture of exotica and vice versa.

One may also wonder whether the Gell-Mann–Okubo formula, which was extensively used above, is applicable in the case of the resonances. The answer to this question can be found along a similar pattern. The quark mass term in the Lagrangian can be rewritten as (to ease notations, the renormalization constants are suppressed)

$$\begin{aligned} \mathcal{L}_m= & {} {\hat{m}}({\bar{u}}u+{\bar{d}}d)+m_s{\bar{s}}s={\bar{m}}({\bar{u}}u +{\bar{d}}d +{\bar{s}}s)\nonumber \\&+\,\lambda \frac{m_1}{3}\,({\bar{u}}u +{\bar{d}}d -2{\bar{s}}s), \end{aligned}$$
(66)

where \(\lambda =1\) in the real world. Suppose now that, for any \(\lambda \), one has a multiplet of poles \(M_a^2\) (real or complex) in the two-point function of the operators with appropriate quantum numbers. One could now differentiate with respect to the parameter \(\lambda \) and get

$$\begin{aligned} \frac{\partial M_a^2}{\partial \lambda }=\langle P,a|O_8^r(0)|P,a\rangle , \end{aligned}$$
(67)

where \(O_8^r\) denotes the renormalized operator proportional to \({\bar{u}}u +{\bar{d}}d -2{\bar{s}}s\). In the case of the resonances, the matrix element in the right-hand side is understood, as the residue of the pertinent three-point function at the complex pole.

The above relation is written for any \(\lambda \). For \(\lambda =1\), we are back to the real world. One may first consider it for \(\lambda \rightarrow 0\), where it yields the first-order correction to the SU(3)-symmetric limit. The Lagrangian is explicitly SU(3) symmetric, as \(\lambda \rightarrow 0\), and both the interpolating particle fields and the operator \(O_8^r\) transform as irreducible tensor operators of SU(3). The group-theoretical analysis applies directly to the three-point function, and the Gell-Mann–Okubo formula holds—even for the resonances.

Continuation to \(\lambda =1\), i.e., back to the real world, is a subtle issue. The (approximate) validity of the Gell-Mann–Okubo formula is, in fact, equivalent to the statement that the linear term in \(\lambda \) describes the spectrum well up to \(\lambda =1\). In case of stable particles, there exists no internal contradiction in assuming this. Consider, however, the situation, when all particles in the multiplet are stable at the SU(3)-symmetry point. Increasing \(\lambda \) introduces the mass splitting, and some of the particles become unstable at \(\lambda =\lambda _{\mathrm{crit}}<1\). Assuming the analyticity in \(\lambda \) then leads to the controversy since, as it is well known, the real and imaginary parts of the pole position have cusps at threshold. Consequently, the assumption that the Gell-Mann–Okubo formula approximately holds for the resonance masses as well implies that the cusp effects are small, and hence this approximation should work better for the resonances with higher spin.

Finally, a few words about testing the exotic content of the resonances with the use of the lattice simulations. As is well known, the resonances do not correspond to the individual energy levels of the lattice QCD spectrum. In order to extract the position of the resonance pole, one has to first determine the phase shift at a given energy by use of the Lüscher equation and, at the next stage, find the pole position through the extrapolation into the complex energy plane. Recent years have seen some progress in this direction; see, e.g., Refs. [69, 70], which is summarized in the recent review in Ref. [71]. More work is, however, necessary to perform a full-fledged investigation of exotic resonances on the lattice.

6 Conclusions

  1. 1.

    We propose a criterion which allows one to judge whether the hadrons in a given multiplet are predominately quark-model states or exotic states. The quantities \(\gamma ,\gamma ',\beta ,\beta '\) for different multiplets are observable quantities, expressed through the \(\sigma \)-terms. In the quark model, these quantities are exactly given by the group-theoretical factors. Should it turn out that the values of these quantities for some multiplet in the real world significantly differ from the quark model values, then one would interpret this as a signature of the exotic character of a multiplet in question.

  2. 2.

    The above criterion has been verified, using Chiral Perturbation Theory and large-\(N_c\) arguments. It has been shown that the quark model values for \(\gamma ,\gamma ',\beta ,\beta '\) emerge in QCD at the leading order in \(1/N_c\), both in the meson and in the baryon sector.

  3. 3.

    Using the hadron mass values, measured at different input values of the quark masses in lattice QCD simulations, we have verified our criterion in the case of the pseudoscalar and vector meson octets, as well as the low-lying baryon octet and decuplet. As expected, the fit to the lattice data gives results close to the quark model predictions. In some parameters, the difference of order of \(1/N_c^{}\simeq 30\%\) is observed. It will be extremely important to apply the same criterion to the lightest scalar meson octet, which is the most obvious candidate for the low-lying exotic multiplet.

  4. 4.

    The Feynman–Hellmann theorem, which has been used above to calculate the quantities \(\gamma ,\gamma ',\beta ,\beta '\), has been generalized for the resonance states. A field-theoretical proof is provided. The criterion for the exotic multiplets does not change its form. Such a generalization is necessary, because all candidates for QCD exotica are resonances and not stable particles.