1 Introduction

Lattice QCD has been used to calculate nonperturbatively hadronic matrix elements needed for many aspects of Standard Model (SM) phenomenology [1, 2]. Simulating heavy quarks in lattice QCD is especially challenging [3], because the lattice spacing may be of similar size to, or even larger than, the quark Compton wavelength. In this paper we examine a way to control discretization effects for massive (i.e., \(m_Q\gg \varLambda _{{\mathrm{{QCD}}}}\)) fermions in lattice gauge theory, building on the Fermilab action [4]. The key is to note that the dynamics of a quark confined by QCD takes place at shorter distances: of order mv and \(mv^2\) in quarkonia [5,6,7,8,9] and of order \(\varLambda _{{\mathrm{{QCD}}}}\) in heavy–light systems [10,11,12,13]. This physics permits an understanding of heavy-quark discretization effects via heavy-quark effective theory (HQET) [14,15,16] or nonrelativistic QCD (NRQCD) [17].

These tools have been used to extend the Fermilab action [4] to an improvement (in the Symanzik [18] sense) known as the Oktay–Kronfeld (OK) action [19]. Whereas NRQCD [5,6,7,8,9] and HQET [10,11,12,13] are nonrelativistic effective field theories treating heavy quarks as two-component spinors, these actions are based on the Wilson clover action [20, 21] with four-component spinors. To retain a suitable remnant of Lorentz symmetry in physical quantities, the lattice action must have a time-space axis-interchange asymmetric form [4]. The action is then tuned with explicitly (bare) mass dependent couplings \(c_i(m_0a)\) to connect smoothly the large \(m_Qa>1\) and small \(m_Qa\ll 1\) mass limits. In this way, the discretization errors can be controlled for arbitrary quark mass. In addition, a continuum limit without fine tuning is possible with the Fermilab action, which is not the case with direct discretizations of NRQCD.

The OK action [19] incorporates all dimension-six and certain dimension-seven bilinear operators. Although there are many such operators, many can be eliminated as redundant. Furthermore, only six are needed for tree-level matching to QCD through \(\mathrm {O}(\lambda ^3)\) (\(\lambda \sim \varLambda _{{\mathrm{{QCD}}}}/m_Q\) or \(\varLambda _{{\mathrm{{QCD}}}}a\)) in HQET power counting for heavy–light mesons and \(\mathrm {O}(v^6)\) (v being the relative velocity of the quark and antiquark) in NRQCD power counting for quarkonium. When \(m_0a\ll 1\), OK improvement reduces to \(\mathrm {O}(a^2)\) with some \(\mathrm {O}(a^3)\) terms in Symanzik [18] power counting. It is expected that the bottom- and charm-quark discretization errors could be reduced below the \(1\%\) level with the OK action [19].

The aim of this paper is to carry out two critical numerical tests to see whether the theoretical improvement [19] is achieved in practice. As we show below, the results are encouraging enough to proceed with related work: improving electroweak currents [22, 23] and computing renormalization factors. The tests examine combinations of heavy–light meson and quarkonium masses, designed to isolate discretization errors. For spin-independent improvements, we compute a combination of spin-averaged heavy–light and quarkonium masses discussed in Refs. [24, 25]. For spin-dependent improvements, we compute the hyperfine splittings in each system.

The veracity of the spin-independent test is based on a nonrelativistic description of binding energies via the Breit equation [25, 26]. This analysis homes in on errors in hadronic rest and kinetic masses. A by-product is an interesting wrinkle on the Fermilab “nonrelativistic interpretation,” which forgoes the tuning of the rest mass. Other work with (essentially) the Fermilab action requires that the rest mass equal the kinetic mass [27, 28]. Our analysis reveals that this step propagates errors in the kinetic binding energy to the rest mass, a point that may underappreciated; see Sect. 5.

The lattice data for this work have been generated from a MILC asqtad-staggered \(N_f=2+1\) ensemble with \(a\approx 0.12\) fm [29], which is coarse enough to possess clear heavy-quark discretization effects with the Fermilab action. We use an optimized conjugate-gradient inverter [30] to calculate quark propagators with both the tadpole-improved Fermilab and the OK actions over ranges spanning the b and c quarks. For the b-like and c-like quarks, the data are generated with four different values of the hopping parameter for the OK action, and two for the Fermilab action. Tadpole improvement of all terms is fully implemented in the conjugate-gradient program, as in an early, preliminary study [31]. Preliminary reports of the analysis reported here can be found in Refs. [32, 33].

The outline of this paper is as follows. In Sect. 2, we briefly review the OK action and discuss its tadpole improvement. In Sect. 3, we describe the correlators we generate and the fits we use to extract the lightest meson energies in each channel at each momentum. In Sect. 4, we describe the fits needed to extract the rest and kinetic masses of the pseudoscalar and vector mesons. In Sect. 5, we assess the improvement from higher-order kinetic terms in the OK action with the mass combination of Refs. [24, 25]. In Sect. 6, we assess the improvement from higher-order chromomagnetic interactions in the OK action by inspecting the difference of the hyperfine splittings of the rest and kinetic masses. We conclude in Sect. 7.

2 Oktay–Kronfeld action

The Oktay–Kronfeld (OK) action [19] \(S_{\text {OK}}\) is an improved version of the Fermilab action [4] \(S_{\text {FL}}\), which takes the clover action [21] for Wilson fermions [20], but chooses time-space asymmetric couplings:

$$\begin{aligned} S_{\text {FL}}= & {} S_0 + S_B + S_E, \end{aligned}$$
(1)
$$\begin{aligned} S_{\text {OK}}= & {} S_{\text {FL}} + S_{\text {new}}. \end{aligned}$$
(2)

The explicit form of each piece \(S_0\), \(S_B\), \(S_E\), and \(S_{\text {new}}\), including many redundant operators and several operators not needed for tree-level matching, can be found in Ref. [19]. Here, we give an appropriate form for numerical simulations with tree-level tadpole improvement [34]. We write the action with gauge covariant translation operators \(T_{\pm \mu }{\psi }(x)=U_{\pm \mu }(x){\psi }(x\pm a{\hat{\mu }})\) with \(U_{{-\mu }}(x) = U^{\dag }_\mu (x-a{\hat{\mu }})\).

We write the action with dimensionless fields \(E_{x,i}\), \(B_{x,i}\), \(\psi _x\), and \(\bar{\psi }_x\),

$$\begin{aligned}&E_{x,i} = a^{2} E_{i}(x), \end{aligned}$$
(3)
$$\begin{aligned}&B_{x,i} = a^{2} B_{i}(x),\end{aligned}$$
(4)
$$\begin{aligned}&\psi _{x} = a^{3/2} \psi (x), \end{aligned}$$
(5)
$$\begin{aligned}&\bar{\psi }_{x} = a^{3/2} \bar{\psi }(x), \end{aligned}$$
(6)

where the chromomagnetic field \(B_{i}(x)\) and chromoelectric field \(E_{i}(x)\) are

$$\begin{aligned} B_{i}(x)= & {} \frac{1}{2} \sum _{j,k=1}^{3} \varepsilon _{ijk} F_{jk}(x),\end{aligned}$$
(7)
$$\begin{aligned} E_{i}(x)= & {} F_{4i}(x), \end{aligned}$$
(8)

and

$$\begin{aligned} F_{\mu \nu } = \frac{1}{8a^2} \sum _{\rho = \pm \mu } \sum _{\sigma = \pm \nu } \mathop {{\mathrm {sgn}}}(\rho \sigma ) \left[ T_{\rho }T_{\sigma }T_{-\rho }T_{-\sigma } - \text {H.c.} \right] \end{aligned}$$
(9)

is the four-leaf-clover field strength.

Once the action is written in terms of the translation operators \(T_\mu \), tadpole improvement can be carried out with the following steps [4]: Rewrite the gauge covariant translation operators \(T_{\pm \mu }\) in terms of tadpole-improved ones \(\tilde{T}_{\pm \mu }\) by using \(T_{\pm \mu }=u_0\tilde{T}_{\pm \mu }\); replace the couplings \(c_i\rightarrow \tilde{c}_i\), \(r_s\rightarrow \tilde{r}_s\), \(\zeta \rightarrow \tilde{\zeta }\), and \(m_0\rightarrow \tilde{m}_0\) (the \(u_0\) factors are absorbed into the couplings \(\tilde{c}_i\), \(\tilde{r}_s\), \(\tilde{\zeta }\), and \(\tilde{m}_0\)); and finally, multiply each term of the action by the factor \(u_0\) to recover the original form of the action. Then we arrive at the tadpole-improved OK action in terms of the unimproved translation operators, explicit tadpole factors, the tadpole-improved couplings \(\tilde{c}_i\), and the hopping parameter \(\kappa \):

$$\begin{aligned}&\kappa = \tilde{\kappa }/u_0, \end{aligned}$$
(10)
$$\begin{aligned}&\frac{1}{2\tilde{\kappa }} = \tilde{m}_0 a + ( 1 + 3\tilde{r}_{s} \tilde{\zeta } + 18 \tilde{c}_4 ). \end{aligned}$$
(11)

The OK action then is

$$\begin{aligned} S_{\text {OK}}= & {} \frac{1}{2\kappa } \bar{\psi }_x \psi _{x} \nonumber \\&-\,\frac{1}{2} \bar{\psi }_x (1-\gamma _{4}) T_{4} \psi _{x} -\frac{1}{2} \bar{\psi }_x (1+\gamma _{4}) T_{-4}\psi _{x} \nonumber \\&-\,\frac{1}{2} \bar{\psi }_x \sum _{i=1}^{3} ( \tilde{r}_{s}\tilde{\zeta } + 8\tilde{c}_{4} -\gamma _{i}(\tilde{\zeta } - 2\tilde{c}_{1} - 12\tilde{c}_{2}) ) T_{i} \psi _{x} \nonumber \\&-\,\frac{1}{2} \bar{\psi }_x \sum _{i=1}^{3} ( \tilde{r}_{s}\tilde{\zeta } + 8\tilde{c}_{4} +\gamma _{i}(\tilde{\zeta } - 2\tilde{c}_{1} - 12\tilde{c}_{2}) ) T_{-i} \psi _{x} \nonumber \\&+\,\frac{(\tilde{c}_{1} + 2\tilde{c}_{2})}{2u_0} \bar{\psi }_x \sum _{i=1}^{3} \gamma _{i} (T_{i}T_{i} - T_{-i}T_{-i}) \psi _{x} \nonumber \\&+\,\frac{\tilde{c}_{4}}{u_0} \bar{\psi }_x \sum _{i=1}^{3} (T_{i}T_{i} + T_{-i}T_{-i}) \psi _{x} \nonumber \\&+\,\frac{\tilde{c}_{2}}{2u_0} \bar{\psi }_x \sum _{i=1}^{3} \sum _{j \ne i}^{3} \gamma _{i} \{ (T_{i}-T_{-i}), (T_{j}+T_{-j}) \} \psi _{x} \nonumber \\&+\,i\frac{\tilde{c}_{5}}{4 u_0^2} \bar{\psi }_x \sum _{i=1}^{3} \varSigma _{i} T^{(3)}_{i} \psi _{x} -\frac{\tilde{c}_E \tilde{\zeta }}{2u_0^3} \bar{\psi }_x \sum _{i=1}^{3} \alpha _{i} E_{x,i} \psi _{x} \nonumber \\&-\,i\frac{(\tilde{c}_B \tilde{\zeta } + 16\tilde{c}_{5})}{2u_0^3} \bar{\psi }_x \sum _{i=1}^{3} \varSigma _{i} B_{x,i} \psi _{x} \nonumber \\&+\,\frac{\tilde{c}_{EE}}{2u_0^4} \bar{\psi }_x \sum _{i=1}^{3} \gamma _{i} [ (T_{4} - T_{-4}), E_{x,i} ] \psi _{x} \nonumber \\&+\,i\frac{\tilde{c}_{3}}{2u_0^4} \bar{\psi }_x \sum _{i=1}^{3} \gamma _{i} \varSigma _{i} \{ (T_{i}-T_{-i}), B_{x,i} \} \psi _{x} \nonumber \\&+\,i\frac{\tilde{c}_{3}}{2u_0^4} \bar{\psi }_x \sum _{i=1}^{3} \sum _{j \ne i}^{3} \gamma _{i} \varSigma _{j} [ (T_{i}-T_{-i}), B_{x,j} ] \psi _{x} \nonumber \\&+\,i\frac{\tilde{c}_{5}}{u_0^4} \bar{\psi }_x \sum _{i=1}^{3} \left( -\frac{1}{4} \varSigma _{i} T^{(3)}_{i} \phantom {\sum _{j \ne i}^{3}} \right. \nonumber \\&+ \left. \sum _{j \ne i}^{3} \{ \varSigma _{i} B_{x,i}, (T_{j} + T_{-j}) \} \right) \psi _{x}, \end{aligned}$$
(12)

where

$$\begin{aligned} T^{(3)}_{i} \equiv \sum _{j,k=1}^{3} \varepsilon _{ijk} \left[ T_{-k}(T_{j}-T_{-j})T_{k} - T_{k}(T_{j}-T_{-j})T_{-k} \right] \nonumber \\ \end{aligned}$$
(13)

is the three-link part of the \(c_{5}\) term (which is composed of three- and five-link terms), \(i=1,2,3\) denote the spatial directions, and a sum over sites \(x\in \mathbb {Z}^4\) is implied. The matrices \(\alpha _i\) and \(\varSigma _i\) are defined by

$$\begin{aligned} \sigma ^{\mu \nu }= & {} \frac{i}{2} [ \gamma ^{\mu },\gamma ^{\nu } ], \end{aligned}$$
(14)
$$\begin{aligned} \sigma ^{4i}= & {} i\sigma ^{0i} = i\alpha ^i, \end{aligned}$$
(15)
$$\begin{aligned} \sigma ^{ij}= & {} -\varepsilon ^{ijk}\varSigma _{k}, \end{aligned}$$
(16)

where \(\gamma ^\mu \) are the Dirac matrices, and the totally antisymmetric tensor component \(\varepsilon ^{123}=1\). We have coded this form of the tadpole-improved OK action with the USQCD [35] software QOPQDP and the MILC code, with the optimization scheme discussed in Ref. [30].

Table 1 Input hopping parameters \(\kappa _{\text {OK}}\) for the OK action and \(\kappa _{\text {FL}}\) for the Fermilab action. Note that these inputs are exact and are not chosen by tuning with a physical observable. The resulting kinetic meson masses cover the region of physical heavy–light and quarkonium masses at this lattice spacing, falling in two regions, referred to as b-like and c-like, respectively. Vertically aligned values yield approximately equal heavy–light meson masses

The tadpole-improved couplings \(\tilde{c}_i\) are obtained by applying the matching conditions in Ref. [19], substituting \(\tilde{m}_0a\) for \(m_0a\), \(\tilde{r}_s\) for \(r_s\), \(\tilde{\zeta }\) for \(\zeta \), and using Eq. (11); in practice, \(\tilde{r}_s = r_s\) and \(\tilde{\zeta }=\zeta \). The redundant coefficient of the Wilson term, \(r_s\), is set to one to lift the unwanted fermion doublers as usual. The tuning parameter \(\zeta =\kappa _s/\kappa _t\) plays the following role. Recall the definitions of the rest mass \(m_1\) and the kinetic mass \(m_2\), from the energy \(E(\varvec{p})\):

$$\begin{aligned}&m_1 = E(\varvec{0}), \end{aligned}$$
(17)
$$\begin{aligned}&\frac{1}{m_2}= \left. \frac{\partial ^2 E}{\partial p_i^2} \right| _{\varvec{p}=\varvec{0}}. \end{aligned}$$
(18)

At the tree level [4, 19],

$$\begin{aligned}&m_1a = \ln (1 + m_0a), \end{aligned}$$
(19)
$$\begin{aligned}&\frac{1}{m_2a} = \frac{ r_{s} \zeta }{ 1+m_0a } + \frac{ 2 \zeta ^{2} }{ m_0a(2+m_0a) }, \end{aligned}$$
(20)

for both the Fermilab and the OK actions. Substituting \(\tilde{m}_0a\) from Eq. (11) for \(m_0a\) in Eqs. (19) and (20) yields tadpole-improved tree-level masses, denoted \(\tilde{m}_1\) and \(\tilde{m}_2\). One can arrange for \(m_2=m_1\) (or \(\tilde{m}_1=\tilde{m}_2\)) by tuning \(\zeta \). In fact, this tuning can be carried out nonperturbatively with hadron masses – denoted in this paper \(M_1\) and \(M_2\). Unfortunately, in that case the discretization errors of \(M_2\) are passed on to \(M_1\).

In this paper, we set \(\zeta =1\), which is just the Fermilab nonrelativistic interpretation [4], in which \(m_2\) is taken as the physically relevant mass. In the splittings of hadron rest masses, the quark contribution \(m_1\) cancels; the combinations of masses of the OK action presented in Sects. 5 and 6 are based on splittings and, hence, are immune to this choice. (The quark rest mass also does not affect matrix elements of local operators [14].) The couplings of the OK action in Eq. (12) – \(\tilde{c}_B\), \(\tilde{c}_E\), \(\tilde{c}_1\), \(\tilde{c}_2\), \(\tilde{c}_3\), \(\tilde{c}_4\), \(\tilde{c}_5\), and \(\tilde{c}_{EE}\) – are obtained by substituting \(\tilde{m}_0a\) for \(m_0a\) on the right-hand sides of Eqs. (4.72)–(4.79) in Ref. [19], with \(\zeta =r_s=1\) [and \(r_E=r_{EE}=0\) in Eqs. (4.78) and (4.79) of Ref. [19]].

Note that the rest mass \(m_1\) and kinetic mass \(m_2\) have the bare-mass dependences in Eqs. (19) and (20) for both the OK and the Fermilab actions. However, the hopping parameter \(\kappa \), Eq. (11), differs by the \(c_4\) term in Eq. (12), which, along with the \(c_1\) and \(c_2\) terms, improves the \(\mathrm {O}((a\varvec{p})^4)\) terms in the dispersion relation [19].

3 Meson correlators

3.1 Data description

We use the MILC asqtad-staggered \(N_f=2+1\) gauge ensemble that has dimensions \(N_L^3\times N_T=20^3\times 64\), \(\beta =6.79\), tree-level tadpole factor \(u_0=0.8688\) from the plaquette, and lattice spacing \(a\approx 0.12\) fm [29]. The asqtad-staggered action [36,37,38,39,40] is used for the light degenerate sea quarks with mass \(am_l=0.02\) and strange sea quark with mass \(am_s=0.05\). For the tests reported here, we use \(N_{\text {cfg}}=500\) of the approximately 2000 available configurations.

On each configuration, we use \(N_{\text {src}}=6\) sources \((t_i,\varvec{x}_i)\) for calculating valence quark propagators. The source time slices \(t_i\) are evenly spaced along the lattice with a randomized offset \(t_0\in [0,20)\) for each configuration. The spatial source coordinates \(\varvec{x}_i\) are randomly chosen within the spatial cube.

We compute two-point correlators

$$\begin{aligned} C^M(t,\varvec{p}) = \sum _{\varvec{x}} e^{\mathrm {i}\varvec{p} \cdot \varvec{x}} \langle {\mathscr {O}}^{M\dag }(t,\varvec{x}) {\mathscr {O}}^M(0,\varvec{0})\rangle , \end{aligned}$$
(21)

for heavy–strange mesons \(M=\bar{Q}s\) and quarkonia \(M=\bar{Q}Q\) with momentum \(\varvec{p}\) and heavy quarks \(Q\) in the b- and c-like regions. The interpolating operators \({\mathscr {O}}^M(x)\) are

$$\begin{aligned}&{\mathscr {O}}^{\bar{Q}q}_{\mathsf {t}}(x) = \bar{\psi }_\alpha (x) \varGamma _{\alpha \beta } \varOmega _{\beta {\mathsf {t}}}(x) \chi (x), \end{aligned}$$
(22)
$$\begin{aligned}&{\mathscr {O}}^{\bar{Q}Q}(x) = \bar{\psi }_\alpha (x) \varGamma _{\alpha \beta } \psi _{\beta }(x), \end{aligned}$$
(23)

for heavy–light mesons and quarkonium, respectively. Here the heavy-quark field \(\psi \) is the Wilson-type fermion appearing in the OK or Fermilab action, while the light-quark field \(\chi \) is the staggered fermion appearing in the asqtad-staggered action. The taste degree of freedom for the staggered fermion is obtained from the one-component field \(\chi \) with \(\varOmega (x) = \gamma _1^{x_1} \gamma _2^{x_2} \gamma _3^{x_3} \gamma _4^{x_4}\) [26, 41]. We generate pseudoscalar and vector-meson correlators with the spin structures \(\varGamma = \gamma _5\) and \(\gamma _i\) (for \(i=1,2,3\)), respectively.

We generate data with ten meson momenta \(a\varvec{p} = 2\pi \varvec{n}/N_L\): \(\varvec{n}=(0,0,0)\), (1, 0, 0), (1, 1, 0), (1, 1, 1), (2, 0, 0), (2, 1, 0), (2, 1, 1), (2, 2, 0), (2, 2, 1), (3, 0, 0), including all permutations of the components. These momenta all satisfy \((a\varvec{p})^2<0.9\).

The hopping parameter values chosen for this study span the b- and c-like regions for \(a\approx 0.12\) fm and are given in Table 1. We fix the valence light-quark mass for the heavy–light meson correlators to the strange sea quark mass \(am_s\). In anticipation of tuning runs for the OK action, we generate heavy–light and quarkonium correlators by using the OK action with four different values for the hopping parameter \(\kappa _{\text {OK}}\). For purposes of comparison, we simulate the b- and c-like regions with two values each for the hopping parameter \(\kappa _{\text {FL}}\) of the Fermilab action. The numerical values of the OK couplings, which are obtained from the formulas in Ref. [19] with \(\tilde{m}_0a=u_0[1/2\kappa -1/2{\kappa _{\text {crit}}}]\), are given to four digits in Table 2.

Table 2 Values for the couplings in the OK action, computed from the input hopping parameters in Table 1, the critical hopping parameter \(\kappa _{\text {crit}}=0.05354\), and the tadpole factor \(u_0=0.8688\). The errors on \(\kappa _{\text {crit}}\) and \(u_0\) are unimportant for this study and, thus, neglected. Our code evaluates the improvement coefficients directly from the tree-level matching result as described in the text. Here, we present truncated values. The OK couplings \(\tilde{c}_B=r_s=1\), \(\tilde{c}_4=9/24\), and \(\tilde{c}_5=1/4\) do not depend on the quark mass \(\tilde{m}_0a\)

3.2 Correlator fits

The ground-state energies are extracted from correlator fits to the function

$$\begin{aligned} f(t)= & {} g(t) + g(T-t), \end{aligned}$$
(24)
$$\begin{aligned} g(t)= & {} \sum _{i=0,1} \left[ A_i e^{-E_i t} - (-1)^t A_i^p e^{-E_i^p t} \right] \nonumber \\= & {} A_0 e^{-E_0t} \sum _{i=0,1} \left[ R_i e^{-\varDelta E_it} -(-1)^t R_i^p e^{-\varDelta E_i^pt} \right] , \end{aligned}$$
(25)

where \(A_0\) is the ground-state amplitude, and \(A_1\) is the first excited-state amplitude. We also incorporate the staggered parity partner state with amplitude \(A_i^p\) and energy \(E_i^p\) into the fit function. In practice, we take an amplitude ratio \(R_i^{(p)}=A^{(p)}_i/A_0\) and energy difference \(\varDelta E^{(p)}_i=E^{(p)}_i-E^{(p)}_{i-1}\) as fit parameters instead of \(A^{(p)}_i\) and \(E^{(p)}_i\). By definition \(R_0=1\), \(\varDelta E_0=0\), \(E^p_{-1}\equiv E_0\), and \(R_0\) and \(\varDelta E_0\) are not fit parameters. The parity partners are not present for quarkonium, because they do not contain (valence) staggered fermions, so we set \(R_i^p=0\) for quarkonium. The fits are carried out with the Bayesian priors given in Table 3.

Table 3 Bayesian priors used for two-point correlator fits. For heavy–light mesons, the same priors are used for both heavy quarks \(Q=c,b\). For bottomonium, we present a fit without the excited state, i.e., \(R_1=0\) in Eq. (25)

We carry out correlated fits with a statistical estimate of the covariance matrix \(\mathrm {cov}(t,t^\prime )\) between different time slices t and \(t^\prime \),

$$\begin{aligned} {\mathrm {cov}}(t,t^\prime ) = \frac{1}{\mathscr {N}} \sum _{i=1}^{N_{\text {cfg}}} \left[ C_i(t) - C(t)\right] \left[ C_i(t^\prime ) - C(t^\prime )\right] , \end{aligned}$$
(26)

where the normalization factor \(\mathscr {N}=N_{\text {cfg}}(N_{\text {cfg}}-1)\), \(C_i(t)\) represents the average over the \(N_{\text {src}}\) sources for the \(i^{\text {th}}\) gauge field, and the two-point correlator \(C(t)=C^M(t,\varvec{p})\) for a given meson type M and momentum \(\varvec{p}\) is estimated with the ensemble average

$$\begin{aligned} C(t) = \frac{1}{N_{\text {cfg}}} \sum _{i=1}^{N_{\text {cfg}}} C_i(t). \end{aligned}$$
(27)

To increase statistics, we take advantage of the structure of Eq. (24) and average the data for time separations t and \(T-t\). Then we take the fit interval \([t_{\text {min}},t_{\text {max}}]\), where \(0<t_{\text {min}}<t_{\text {max}}<T/2\), equal to [7, 15] for the heavy–light meson with the OK action and [9, 15] with the Fermilab action. For charmonium, the fit interval is [9, 14] for both actions. For bottomonium, the fit interval is [12, 17] for the OK action and [14, 19] for the Fermilab action. We fix these intervals for fits to all correlators, independent of hopping parameter \(\kappa \), momentum \(\varvec{p}\), and pseudoscalar vs. vector channel.

The \(t_{\text {min}}\) are chosen by observing the effective mass, defined by

$$\begin{aligned} m_{\text {eff}}(t) = \frac{1}{2} \ln \left( \frac{C^M(t)}{C^M(t+2)}\right) \end{aligned}$$
(28)

as well as comparing the fit results E with the \(m_{\text {eff}}(t)\). Figure 1 shows the effective mass \(m_{\text {eff}}(t)\) and correlator fit residual r(t)

$$\begin{aligned} r(t) = \frac{C^M(t)-f(t)}{|C^M(t)|} \end{aligned}$$
(29)

for a pseudoscalar heavy–light meson correlator generated with the OK action, \(\kappa _{\text {OK}}=0.041\), and \(\varvec{p}=\varvec{0}\). The pseudoscalar bottomonium correlator fit results obtained with the same action and parameters are given in Fig. 2. To estimate the statistical errors, we use a single-elimination jackknife.

Fig. 1
figure 1

Correlated fit results for a pseudoscalar heavy–light meson correlator generated with the OK action at \(\kappa =0.041\) and \(\varvec{p}=\varvec{0}\). a Residual defined in Eq. (29) and its statistical error from a fit over \(7\le t\le 15\); the kinks for \(t<7\) are a remnant of the oscillating, wrong-parity states, seen in the right-hand side of Eq. (25). b Effective mass (points) defined in Eq. (28) with fit result for the ground-state energy E (horizontal red line plotted over the chosen fit interval)

Fig. 2
figure 2

Correlated fit results for a pseudoscalar bottomonium correlator generated with the OK action at \(\kappa =0.041\) and \(\varvec{p}=\varvec{0}\). a Residual defined in Eq. (29) and its statistical error from a fit over \(12\le t\le 17\). b Effective mass (points) defined in Eq. (28) with fit result for the ground-state energy E (horizontal red line plotted over the chosen fit interval)

A feature of our b-like heavy–light correlators is that they are somewhat more precise for the OK action than for the Fermilab action. At the same time b-like quarkonium and both types of c-like correlators are similarly precise for the two actions. We do not understand the reason for this behavior. Of course, it propagates to the meson masses and their combinations examined in the next sections.

4 Meson masses

The dispersion relation of the mesons has the same form as that of the quarks with the quark masses \(m_1\) and \(m_2\) given in Eqs. (17) and (18), respectively, replaced by meson masses \(M_1\) and \(M_2\). The mismatch between the meson rest and kinetic masses can be exploited to test the improvement of nonrelativistically interpreted actions.

We fit the ground-state energy \(E(\varvec{p})\) in Eq. (25) for each momentum \(\varvec{p}\) to the nonrelativistic dispersion relation to obtain the rest mass \(M_1\) and kinetic mass \(M_2\) of each meson. Including terms up to \(\mathrm {O}\!\left( \varvec{p}^6\right) \), the dispersion relation is

$$\begin{aligned} E(\varvec{p})= & {} M_1 + \frac{\varvec{p}^2}{2M_2} - \frac{(\varvec{p}^2)^2}{8M_4^3} + E_4^\prime + E_6 + E_6^\prime , \end{aligned}$$
(30)
$$\begin{aligned} E_{4}^\prime= & {} - \frac{a^3W_4}{6} \sum _i p_i^4, \end{aligned}$$
(31)
$$\begin{aligned} E_{6}= & {} \frac{(\varvec{p}^2)^3}{16M_6^5}, \end{aligned}$$
(32)
$$\begin{aligned} E_{6}^\prime= & {} \frac{a^5W_6^\prime }{2} \varvec{p}^2 \sum _i p_i^4 - \frac{a^5W_6}{3} \sum _i p_i^6. \end{aligned}$$
(33)

The \(M_{4,6}\) are generalized masses; the rest mass \(M_1\) and these generalized masses \(M_{4,6}\) are expected [4, 19] to approach the kinetic mass \(M_2\) in the continuum limit. The O(3) rotation-symmetry-breaking terms are \(E_4^\prime \) and \(E_6^\prime \). In the continuum limit, the coefficients \(a^3W_4\), \(a^5W'_6\), and \(a^5W_6\) vanish. We fit the simulation data for \(E(\varvec{p})\) to the right-hand side of Eq. (30), taking the full covariance matrix among the different momentum channels, and investigate variations by excluding some or all of the higher-order terms \(E_4^\prime \), \(E_6\), and \(E_6^\prime \). We do not introduce priors here. The seven fit parameters are \(M_1\), \(M_2^{-1}\), \(M_4^{-3}\), \(M_6^{-5}\), \(W_4\), \(W'_6\), and \(W_6\). We find that we do not need all seven terms in the dispersion relation. In many cases, it suffices to keep only the first five, while some fits give better p values with the first six.

For the pseudoscalar heavy–light meson, as the spectrum becomes more relativistic, \(M_2\) approaches \(M_1\), and including the \(E_6\) term results in a better fit. Because the vector heavy–light meson spectrum has a larger statistical error than the pseudoscalar-meson spectrum, the \(E_6\) term is not only determined to be statistically zero, but also does not improve the fit. The \(E_6\) term also improves the fit for the charmonium spectrum with the Fermilab action.

The \(W_6^\prime \) term in addition to the \(E_6\) term improves the fit for the bottomonium spectrum with the Fermilab action. Note that the correlator fit for bottomonium does not include any excited states, although including a single excited state is statistically consistent, when fitting Eq. (30) without the \(E_6\) and \(E_6^\prime \) terms. We use the fits with excited states to cross-check the single-state fits. We observed the same behavior with the two-state fit for the bottomonium spectrum with the OK action. Note that the bottomonium spectrum for the OK action still results in a good fit without the \(E_6\) and \(E_6^{\prime }\) terms, when the correlator fit only accounts for the ground state.

Fig. 3
figure 3

The pseudoscalar heavy–light meson (circle) and quarkonium (square) subtracted energies \(\widetilde{E}\) [Eq. (34)] as a function of \((a\varvec{p})^2\) for a the OK action at \(\kappa =0.041\) and b the Fermilab (clover) action at \(\kappa =0.083\). The lines show fits to Eq. (30). The errors are from a single-elimination jackknife. In these plots, the errors for the bottomonium data points and fit lines are scaled by a factor 10. The behavior for vector mesons is similar, apart from the larger statistical errors

Figure 3 shows the dispersion relation fits to the pseudoscalar heavy–light meson and quarkonium data generated with the OK (Fermilab) action with \(\kappa _{\text {OK}}=0.041\) (\(\kappa _{\text {FL}}=0.083\)). We plot the dispersion relation fit results after subtracting the rest mass and the higher-order term \(E_4^\prime \). We thus define

$$\begin{aligned} \widetilde{E} = E - M_1 - E_4^\prime \end{aligned}$$
(34)

to draw the plot. As one can see from Fig. 3, the slope – and, hence, the kinetic mass \(M_2\) – is reliably determined from fits to Eq. (30). Interestingly, the OK action leads to statistical errors noticeably smaller than those from the Fermilab action, especially for the heavy–light masses.

The fit results for the heavy–light mesons are summarized in Table 4a, b. They are obtained from correlated fits. From these tables for the heavy–light mesons, one can see that the higher-order generalized mass \(M_4\) approaches the kinetic mass \(M_2\) as the kinetic mass decreases. The rest mass \(M_1\) also approaches the kinetic mass \(M_2\). Note that \(M_4\) does not necessarily agree better with \(M_2\) for the OK action, even though the OK action tunes the action such that \(m_4=m_2\). A possible explanation is that the binding energy \(M_4-m_4\) stems from higher-dimension effects not yet incorporated into the OK action.

The correlated fit results for quarkonia are summarized in Table 5a, b. For the charmonium results obtained with the OK action, \(M_4\) is closer to the kinetic mass \(M_2\) than to the rest mass \(M_1\), but for the Fermilab action \(M_4\) is closer to \(M_1\). In the bottomonium region, \(M_4\) is closer to the rest mass \(M_1\) than the kinetic mass \(M_2\) for both actions. The difference \(M_2-M_1\) is large for the Fermilab action at \(\kappa _{\text {FL}}=0.083\), 0.091, but it is diminished with the OK action at the comparable values \(\kappa _{\text {OK}}=0.039\), 0.040, 0.041, 0.042. However, even with the OK action, the difference between \(M_4\) and \(M_2\) in the bottomonium spectra is larger than that in the heavy–light meson spectra.

Table 4 Dispersion fit results for heavy–light mesons with masses near those of the \(D_s^{(*)}\) and \(B_s^{(*)}\). The energies E are from the correlator fits to the function in Eq. (30). The first column indicates the lattice action and the (approximate) heavy-quark flavor. The second column is the hopping parameter \(\kappa \). The following columns are the rest mass \(M_1\), the kinetic mass \(M_2\), the generalized masses \(M_4\) and \(M_6\), and the coefficient of an O(3) symmetry breaking term \(W_4\). The last two columns are the \(\chi ^2\) divided by the degrees of freedom (dof) \(N_{\text {data}}-N_{\text {param}}\), p-value. All ten momenta data points are included in the dispersion relation fit; these spectra are obtained from the correlator fits to Eq. (25) which include all correlations. Fits to the dispersion relation include all correlations among different momenta. Errors are all from a single-elimination jackknife
Table 5 Dispersion fit results for quarkonium. The energies E are from the correlator fits to the function in Eq. (30). The first column indicates the lattice action and the (approximate) heavy-quark flavor. The second column is the hopping parameter \(\kappa \). The following columns are the rest mass \(M_1\), the kinetic mass \(M_2\), the generalized masses \(M_4\) and \(M_6\), and the coefficient of an O(3) symmetry breaking terms \(W_4\) and \(W_6^\prime \). The last two columns are the \(\chi ^2\) divided by the degrees of freedom (dof) \(N_{\text {data}}-N_{\text {param}}\), p-value. All ten momenta data points are included in the dispersion relation fit; these spectra are obtained from the correlator fits to Eq. (25), setting \(R_{i}^{p} =0\), which include all correlations; \(R_1=0\) for bottomonium fits. Fits to the dispersion relation include all correlations among different momenta. Errors are all from a single-elimination jackknife

5 The “inconsistency” [24]

To study how the Fermilab method works, Ref. [24] introduced the quantity

$$\begin{aligned} I = \frac{2{\delta }M_{\overline{Q}q} - ({\delta }M_{\overline{Q}Q} + {\delta }M_{\overline{q}q})}{2M_{2\overline{Q}q}}, \end{aligned}$$
(35)

where

$$\begin{aligned} \delta M_{\overline{Q}q} = M_{2\overline{Q}q} - M_{1\overline{Q}q}. \end{aligned}$$
(36)

The combination I should vanish, even when the quark rest masses \(m_1\) are mistuned. If I does not vanish, it means that the action contains nontrivial discretization effects at higher order in the HQET or NRQCD power counting, so I is often called the “inconsistency.” In fact, as we reproduce below, \(I\ne 0\) for the Fermilab action [24] (unless \(m_2a\ll 1\)), as one can see in Eq. (40).

The explanation is as follows [25]. The meson masses \(M_i\) \((i=1,2)\) can be written as a sum of the quark masses \(m_i\) and the binding energy:

$$\begin{aligned} M_{i\overline{Q}q} = m_{i\overline{Q}} + m_{iq} + B_{i\overline{Q}q}, \end{aligned}$$
(37)

which defines \(B_i\). Upon substituting Eq. (37) into Eqs. (36) and (35), the quark masses cancel out, and the inconsistency becomes a relation among the binding energies,

$$\begin{aligned} I = \frac{2{\delta }B_{\overline{Q}q} - ({\delta }B_{\overline{Q}Q} + {\delta }B_{\overline{q}q})}{2M_{2\overline{Q}q}}, \end{aligned}$$
(38)

where

$$\begin{aligned} \delta B_{\overline{Q}q} = B_{2\overline{Q}q} - B_{1\overline{Q}q}. \end{aligned}$$
(39)

The quantities in Eqs. (36), (37), and (39) for heavy (\(\overline{Q}Q\)) and light (\(\overline{q}q\)) quarkonium are defined similarly. Because light quarks always have \(ma\ll 1\), the \(\mathrm {O}((ma)^2)\) distinction between rest and kinetic masses is negligible for them, so we omit \(\delta M_{\bar{q}q}\) (or \(\delta B_{\bar{q}q}\)) when computing I. The rest-mass binding energy \(B_1\) is sensitive to effects of \(\mathrm {O}(\lambda )\) or \(\mathrm {O}(v^2)\), while the kinetic-mass binding energy \(B_2\) is sensitive to effects of \(\mathrm {O}(\lambda ^3)\) or \(\mathrm {O}(v^4)\), with their larger relative discretization errors (for the clover and Fermilab actions). The discretization errors can be studied in the nonrelativistic limit via the Breit equation, yielding [25, 26]

$$\begin{aligned} \delta B_{\overline{Q}q}= & {} \frac{5}{3} \frac{\langle \varvec{p}^2\rangle }{2\mu _2} \left[ \mu _2 \left( \frac{m_{2\overline{Q}}^2}{m_{4\overline{Q}}^3} + \frac{m_{2q}^2}{m_{4q}^3} \right) - 1 \right] \nonumber \\&+\, \frac{4}{3} a^3 \frac{\langle \varvec{p}^2\rangle }{2\mu _2} \mu _2 \left[ w_{4\overline{Q}} m_{2\overline{Q}}^2 + w_{4q} m_{2q}^2 \right] + \mathrm {O}(\varvec{p}^4)\nonumber \\ \end{aligned}$$
(40)

in the S wave, where the reduced mass \(\mu _2^{-1} = m_{2\overline{Q}}^{-1} + m_{2q}^{-1}\), and \(m_2\), \(m_4\), and \(w_4\) are defined through the quark analog of Eq. (30). Here, \(\varvec{p}\) is the relative momentum of \(\overline{Q}\) and q in their center of mass. The OK action matches \(m_4=m_2\) and \(w_4=0\) at the tree level, so the two expressions in square brackets are of order \(\alpha _s\). Because we use a tadpole-improved action, these one-loop errors are expected to be small.

Fig. 4
figure 4

Inconsistency I for a the spin-averaged mass \(\overline{M}=\frac{1}{4}(M+3M^*)\), b pseudoscalar-meson mass, and c vector-meson mass. Data labels denote the value of \(\kappa \times 10^3\). The purple circles (green squares) represents the OK (Fermilab) action. The errors are from the jackknife. Vertical lines with bands represent the physical masses from the PDG [42] with experimental and (asymmetric) lattice-spacing errors added in quadrature. For the OK action I almost vanishes (cf. horizontal red line), but for the Fermilab action it does not

We calculate the inconsistency I from the pseudoscalar and vector masses presented in Sect. 4. The derivation of Eq. (40) does not include spin-dependent effects, which should contribute to I for both pseudoscalar and vector channels. To separate spin-dependent effects, which are the subject of Sect. 6, from spin-independent ones, we form the spin-averaged mass,.

$$\begin{aligned} \overline{M}=\frac{1}{4}(M+3M^*), \end{aligned}$$
(41)

writing M (\(M^*\)) for pseudoscalar (vector) masses. We calculate the quantity I with such spin-averaged rest and kinetic masses and plot the result in Fig. 4a. For orientation, we also show the physical \(B_s\) and \(D_s\) masses (for this ensemble) with vertical bands. We find that I is close to 0 for the OK action even in the b-like region, whereas the Fermilab action leads to a very large deviation \(I\approx -0.6\) from the continuum value, \(I=0\), as in Ref. [24]. This outcome provides good numerical evidence that the improvement of spin-independent effects of the OK action is realized in practice. The small I remaining stems from still higher-dimension kinetic operators of \(\mathrm {O}(\lambda ^4)\) in HQET, or \(\mathrm {O}(v^7)\) in NRQCD, power counting, which are not addressed by the OK action. For completeness, Fig. 4b, c show I for the pseudoscalar and vector channels, demonstrating that spin-dependent effects do not alter the conclusions.

Let us end this section with the “tuning wrinkle” mentioned in the introduction. By definition, Eq. (37), \(M_2\) contains a binding energy, which, via \(m_4\) in Eq. (40), is sensitive to the higher-dimension discretization errors addressed by the OK action. With the Fermilab action (as sometimes implemented [27, 28]), a hadron-mass-based tuning of \(\zeta \) transmits these errors from \(M_2\) to \(M_1\). This nuance has gone unappreciated; see, for example, Ref. [43].

6 Hyperfine splittings

The hyperfine splitting is the difference in the masses of the vector and pseudoscalar mesons:

$$\begin{aligned} \varDelta _1&= M_1^{*} - M_1 , \end{aligned}$$
(42)
$$\begin{aligned} \varDelta _2&= M_2^{*} - M_2 . \end{aligned}$$
(43)

From Eq. (39), one has

$$\begin{aligned} \varDelta _2 - \varDelta _1 = \delta {B^{*}} - \delta {B}. \end{aligned}$$
(44)

Spin-independent contributions cancel in this binding-energy difference, so the hyperfine difference \(\varDelta _2-\varDelta _1\) diagnoses the improvement of the spin-dependent \(c_3\) and \(c_5\) terms of \(\mathrm {O}(\lambda ^3)\) in HQET power counting, or \(\mathrm {O}(v^6)\) in NRQCD.

Fig. 5
figure 5

Hyperfine splitting \(\varDelta _2\) obtained from the kinetic masses vs. \(\varDelta _1\), obtained from the rest masses. The square (green) represents the Fermilab action data, and the circle (purple) represents the OK action data. The labels are \(\kappa \times 10^3\), corresponding to kinetic masses close to the physical \(B_s\) (83, 41) and \(D_s\) (49, 127) masses, as shown in Fig. 4. The continuum limit is represented by the line (red) \(\varDelta _2=\varDelta _1\). Errors are from the jackknife

As one can see in Fig. 5a, the OK action shows clear improvement for quarkonium. The data points from the OK action lie much closer to the continuum value \(\varDelta _2 = \varDelta _1\) (the red line) for all simulated values of \(\kappa _{\text {OK}}\); in the charmonium region, they remain consistent with the continuum line within the error. The heavy–light results in Fig. 5b also show clear improvement in the region near the \(D_s\) mass. The results with the OK action remain consistent with the continuum limit throughout the \(B_s\) mass region, but the improvement is not yet statistically significant. Higher statistics would resolve the issue. All in all, the hyperfine splittings show the improvement from the higher-dimension chromomagnetic interactions – those with couplings \(c_3\) and \(c_5\).

For both quarkonia and heavy–light mesons, the hyperfine splitting of the kinetic mass, \(\varDelta _2\), has a larger error than that of the rest mass, \(\varDelta _1\), because the kinetic mass requires correlators with \(\varvec{p}\ne \varvec{0}\), which are noisier than those with \(\varvec{p}=\varvec{0}\). As the rest mass \(M_1\) and the kinetic mass \(M_2\) are determined with smaller error with the OK action than the Fermilab action (see Sect. 4), the statistical errors shown for the hyperfine splittings \(\varDelta _1\) and \(\varDelta _2\) in Fig. 5 are smaller with the OK action, especially in the case of \(\varDelta _2\).

7 Conclusions

Our tests of the Fermilab improvement program are based on the two-point correlators for mesons generated with the Fermilab [4] and OK [19] actions. Looking ahead to phenomenological applications [22, 23, 44], we have chosen four hopping parameters for the OK action in each of the b- and c-quark mass regions. For the Fermilab action, we have simulated two b-like and two c-like hopping parameters. Although the lattice data span the physically interesting regions, the central aim of this paper is to test the improvement theory of Refs. [4, 19] against simulation data, independent of the phenomenological interpretation of the action’s parameters.

We focus on tests of both the spin-independent and the spin-dependent terms in the OK action. The results for the (spin-averaged) quantity I, known as the inconsistency [24], show that the OK action succeeds in improving the effects that generate the kinetic-mass binding energy. The hyperfine splitting shows that the OK action significantly improves the higher-dimension spin-dependent, chromomagnetic effects, at least in quarkonium. For the heavy–light system, the data show a clear improvement for smaller c-like masses, but in the b-like region, large statistical errors prevent us from reaching firm conclusions.

This study has yielded some noteworthy byproducts. As \(am_0\) is reduced, here with fixed \(a\approx 0.12\) fm, the meson masses \(M_1\), \(M_2\), and \(M_4\) approach each other, verifying expectations for the Fermilab and OK actions. The difference between \(M_4\) and \(M_2\), for \(am_2\not \ll 1\), is not much smaller for the OK action than the Fermilab action. By analogy with the inconsistency I [24, 25], this feature is probably explained by the associated binding energy \(M_4-m_4\), which stems from higher-dimension effects not improved by the OK action. Finally, in the b-like region, \(am_2\sim 3\), the OK action produces statistically more precise results than the Fermilab action for the heavy–light correlator energies and, hence, the masses \(M_2\) and \(M_4\).

As an application of the OK action, a calculation of the form factors for the \(B\rightarrow D^{(*)}\ell \nu \) semileptonic decays is under way, with the aim of determining the CKM matrix element \(|V_{cb}|\). To achieve the desired sub-percent precision on the relevant form factors, it will be necessary to derive the analog of the OK action for currents [22, 23], and to calculate the renormalization. Meanwhile some of us are extending the present work to a full-fledged tuning run [44].