1 Introduction

Lattice QCD has been shown to be a powerful tool to determine scattering quantities from first principles. The standard approach is the Lüscher method [2], which relates the finite-volume spectrum obtained from the lattice to the infinite-volume scattering amplitude. It has been applied to many physical systems, including results at the physical point—see Ref. [3] for a review. The formalism has also been recently extended to three particles with three different but conceptually equivalent formulations available in the literature at present [4,5,6,7,8], see Refs. [9, 10] for recent reviews.

In Ref. [1], the authors propose a new strategy to extract scattering quantities. Henceforth, this will be referred to as the BH method. This approach is based on the usage of four-point functions rather than energy levels. The hope is that this approach can be generalised more easily to multi-hadron processes.

As pointed out by the authors, the case of threshold kinematics is particularly favourable as it allows for a direct extraction of the scattering length, with the \(\pi N\) channel being one concrete example.

In this letter we test this novel approach in a scalar \(\phi ^4\) theory. Using this theory has proven to be an excellent test bed for novel scattering studies, as shown in Refs. [11,12,13]. In order to mimic the \(\pi N\) case, we consider two mass nondegenerate real scalar particles. We explore the necessary techniques, and the optimal approach to use the BH method at threshold. Moreover, we compare to the standard Lüscher approach and find good agreement.

2 Description of the Model

The Euclidean model used here is composed by two real scalar fields \(\phi _i, i=0,1\) with the Lagrangian

$$\begin{aligned} \mathcal{L}= \sum _{i=0,1} \left( \frac{1}{2} \partial _\mu \phi _i\partial _\mu \phi _i +\frac{1}{2}m_i \phi _i^2 +\lambda _i \phi _i^4\right) +\mu \phi _0^2 \phi _1^2\, , \end{aligned}$$
(1)

with nondegenerate (bare) masses \(m_0<m_1\). The Lagrangian has a \(Z_2\otimes Z_2\) symmetry \(\phi _0\rightarrow -\phi _0\otimes \phi _1\rightarrow -\phi _1\), which prevents sectors with even and odd number of particles to mix.

To study the problem numerically, we define the theory on a finite hypercubic lattice with lattice spacing a and a volume \(T \cdot L^3\), where T denotes the Euclidean time length and L the spatial length. We define the derivatives of the Lagrangian (Eq. 1) on a finite lattice as the finite differences \(\partial _\mu \phi (x)=\frac{1}{a}\,(\phi (x+a\mu )-\phi (x))\). In addition, periodic boundary conditions are assumed in all directions. The discrete action is given in Ref. [12] for the complex scalar theory, but it is trivial to adapt to this case. We set \(a=1\) in the following for convenience.

3 Observables

In Ref. [1], Bruno and Hansen derived a relation between the scattering length \(a_0\) and the following combination of Euclidean four-point and two-point correlation functions at the two-particle threshold:

$$\begin{aligned} C_4^\mathrm {BH}(t_f,t,t_i) \equiv \frac{\langle \tilde{\phi }_0(t_f)\tilde{\phi }_1(t)\tilde{\phi }_1(t_i) \tilde{\phi }_0(0)\rangle }{\langle \tilde{\phi }_0(t_f) \tilde{\phi }_0(0)\rangle \langle \tilde{\phi }_1(t)\tilde{\phi }_1(t_i) \rangle } -1, \end{aligned}$$
(2)

with the time ordering \(t_f> t> t_i > 0\), and \(\tilde{\phi }_{i}(t)=\sum _\mathbf{x}\phi _{i}(t,\mathbf{x})\) being field projected to zero spatial momentum. The relation of \(C_4^\mathrm {BH}\) to the scattering length reads

$$\begin{aligned} C_4^\mathrm {BH}(t_f,t,t_i) \xrightarrow [t\gg t_i\gg 0]{T\gg t_f\gg t} \frac{2}{ L^3}&\bigg [ \pi \frac{a_0 }{\mu _{01} }(t-t_i) \nonumber \\ - 2a_0^2\sqrt{\frac{2 (t-t_i)}{\mu _{01}} }&+ O\left( (t-t_i)^0\right) \bigg ], \end{aligned}$$
(3)

where \(\mu _{01}=(M_0 M_1)/(M_0+M_1)\) is the reduced mass. It is defined in terms of the renormalized masses \(M_0\) and \(M_1\) of the two particles. These masses can be extracted as usual from an exponential fit at large time distances of the two-point correlation functions \(\langle \tilde{\phi }_{i}(t)\tilde{\phi }_{i}(0) \rangle \approx A_{1,i} \left( e^{ - M_{i} t} + e^{ - M_{i} (T - t)}\right) \) for \(i=0,1\). To reduce the statistical error we average over all points with the same source sink separation.

4 Numerical result

4.1 BH method

We generate ensembles using the Metropolis-Hastings algorithm with bare masses \(m_0=-4.925\) and \(m_1=-4.85\), and for simplicity we choose \(\lambda _0=\lambda _1=2\mu =2.5\). The list of ensembles generated in this work with their corresponding measured values of the masses \(M_0\) and \(M_1\) are compiled in Table 1. In this model, as observed in previous investigations of the scalar theory [13], we do not see relevant effects of excited states in the two-point correlators, i.e., they are dominated by the ground state from the first time slice.

In the following we discuss three different strategies to extract the scattering length:

  1. 1.

    We attempt a direct fit of Eq. (3) the the data.

  2. 2.

    We include an overall constant in the fit to account for the \(O\left( (t-t_i)^0\right) \) effect.

  3. 3.

    We make use of a shifted function at fixed \(t_i\) and \(t_f\), \(\Delta _t C_4^\mathrm {BH}(t_f,t,t_i)= C_4^\mathrm {BH}(t_f,t+1,t_i)-C_4^\mathrm {BH}(t_f,t,t_i)\), which cancels the constant term. We then determine \(a_0\) by fitting to

    $$\begin{aligned} \begin{aligned}&\Delta _t C_4^\mathrm {BH}(t_f,t,t_i) \approx \frac{2}{L^3}\Bigg [\pi \frac{a_0}{\mu _{01}}-2a_0^2\sqrt{\frac{2}{\mu _{01}}} \\&\quad \times \Big (\sqrt{t+1-t_i} - \sqrt{t-t_i}\,\Big )\Bigg ]. \end{aligned} \end{aligned}$$
    (4)

The three methods are compared in Fig. 1 for one of our ensembles. The black triangles represent the correlator of Eq. (3) divided by \((t-t_i)\) with \(t_i=3\) and \(t_f=16\). This representation is convenient, as it converges towards a constant when \((t-t_i) \rightarrow \infty \). From the monotonic increase of the data points, it is clear that the effect of the \(\left( (t-t_i)^0\right) \) term in Eq. (3) is still sizeable even at large time separations. A fit in the time region [10, 14]—the black band—is reasonable (\(\chi ^2/d.o.f\sim 0.7\)) but results in large uncertainties. The quality of the fit deteriorates very quickly if the fit range is extended: a fit in the time region [6, 14] yields a \(\chi ^2/\mathrm {dof}\sim 5\).

With the second strategy—the red band in Fig. 1—one is able to start fitting at significantly smaller t-values. The data are well described with a \(\chi ^2/\mathrm {dof}\sim 0.2.\)

For the third approach, we study \(\Delta _t C_4^\mathrm {BH}(t)\). This is shown in Fig. 1 as blue circles, and the blue band represents the best fit result with error. The main advantage of the last strategy is that it allows us to extract the physical information at smaller t without introducing extra parameters in the fit. Indeed, the data looks almost constant over the complete t-range available. Only very close to \(t_i\) the square root term might become visible.

Fig. 1
figure 1

Four-point function of Eq. (3) multiplied by \(L^3/2\), for \(L=22\) and \(T=96\) with \(t_i=3\) and \(t_f=16\) divided by \((t-t_i)\) black triangles. the dashed vertical lines represent the fit interval, the black band represent the result of the fit Eq. (3) and the red band is the same fit with an extra constant term. The blue circles and band represent the discrete derivative of the correlator Eq. (4) and the corresponding fit

For this third strategy, which looks most promising from a systematic point of view, we also investigate the dependence on the choice for \(t_{i}\) and \(t_{f}\). This is shown in Fig. 2 for the same ensemble as in Fig. 1. We do not observe any significant systematic effect stemming from excited state contributions when changing \(t_i\) or \(t_f\). However, we clearly see significantly smaller statistical uncertainties with smaller \(t_i\) and \(t_f\) values.

Fig. 2
figure 2

Plot of the discrete derivative of the correlator Eq. (4) for different values of \(t_i\) and \(t_f\). We do not observe any systematic shift and all correlators are compatible. The points with smaller \(t_i\) and \(t_f\) tend to have smaller error

4.2 Comparison to the Lüscher method

In this section we compare the BH method described above with the Lüscher threshold expansion [14, 15]. The latter relates the two-particle energy shift, defined as \(\Delta E_{2}=E_{2}-M_0-M_1\), to the scattering length \(a_0\) via

$$\begin{aligned} \Delta E_{2} =-\frac{2\pi a_{0}}{\mu _{01} L^3}\left[ 1 + c_1 \frac{a_{0}}{L} + c_2\left( \frac{a_{0}}{L}\right) ^2 \right] +O\left( L^{-6}\right) , \end{aligned}$$
(5)

with \(c_1=-2.837297\), \(c_2= 6.375183\) and \(E_2\) being the interacting two-particle energy. \(E_2\) can be extracted from \(C_2(t) = \langle \tilde{\phi }_1(t)\tilde{\phi }_0(t) \tilde{\phi }_1(0)\tilde{\phi }_0(0) \rangle \), whose large-t behaviour is

$$\begin{aligned} C_2(t)&\xrightarrow [T-t\gg 0]{t\gg 0} A_2 e^{-E_2 \frac{T}{2}} \cosh {\left( E_2 (t-\frac{T}{2})\right) } \nonumber \\ +&B_2 e^{-(M_0+M_1) \frac{T}{2}} \cosh {\left( (M_1-M_0) (t-\frac{T}{2})\right) }. \end{aligned}$$
(6)

Note that the last term is a known thermal pollution due to finite T in the presence of periodic boundary conditions. Using \(M_0\) and \(M_1\) as input determined from the corresponding two-point functions, the only additional parameter is \(B_2\).

Alternatively, it is possible to eliminate the second term defining

$$\begin{aligned} \tilde{C}_2(t)=C_2(t)/\cosh {\left( (M_1-M_0) \left( t-\frac{T}{2}\right) \right) }, \end{aligned}$$
(7)

and then taking the finite derivative

$$\begin{aligned} \begin{aligned} \Delta _{t}{\tilde{C}}_2(t)&={\tilde{C}}_2(t+1)-{\tilde{C}}_2(t). \end{aligned} \end{aligned}$$
(8)

The two-particle energies obtained from Eq. (6) are compatible to those from Eq. (8). The results are reported in Table 1, along with the values for the scattering length \(a_0\) computed from \(E_2\) using Eq. (5). We have calculated the correlated difference between our two estimates of \(a_0\) obtained within the Lüscher method, column 7 and 8 of Table 1. We find that the difference is always compatible with zero within one sigma, with the exception of the ensembles L20T48 and L32T96 where it is within two sigma.

A comparison between the BH and the Lüscher method is depicted in Fig. 3 for all our ensembles. The values are compatible with each other, however the BH method gives systematically larger values for \(a_0\). We do not observe a clear statistical correlation or anti correlation between the BH and the Lüscher method. We find that the correlation coefficient varies within the range [− 0.25, 0.6] in all ensembles. We notice that exponentially suppressed finite-volume errors are in principle a source of systematic error. However, in Fig. 3, the determination of \(a_0\) does not show any systematic effect varying L from 20 to 32.

Fig. 3
figure 3

Comparison of \(a_0\) computed with BH method Eq. (4) with \(t_i=2\) and \(t_f=10\) (blue circles), with \(t_i=3\) and \(t_f=16\) (red triangles) and Lüscher method Eq. (5) (black squares) in the top panel. In the bottom panel we plot the correlated difference between Lüscher method and BH method. The horizontal bands correspond to the weighted average of each method

Table 1 Values of \(a_0\), \(M_0\), \(M_1\) and \(E_2\) measured. The column \(\Delta _t C^\mathrm {BH}_4\) corresponds to the value of \(a_0\) fitted with Eq. (4) fixing \(t_i=3\) and \(t_f=16\) or \(t_i=2\) and \(t_f=10\), the column \(C_\mathrm {BH}+c\) is the result of the fit Eq. (3) adding a constant term. The two-particle energy \(E_2\) is computed from \(C_2\) with the fit of Eq. (6) and from \(\Delta \tilde{C}_2\) with Eq. (8). The corresponding value of \(a_0\) computed with the Lüscher method is reported in the corresponding columns. We used \(2\cdot 10^{7}\) configurations for each ensemble, generated from 200 replicas each of \(10^5\) thermalized configurations, we bin the configurations in blocks of \(10^5\) (the entirely replica) and we resample the resulting 200 configurations with jackknife. For the light mass \(M_0\) we measured the integrated autocorrelation time \(\tau _{int}\sim 1.5\) , while \(\tau _{int}\sim 0.5\) for \(M_1\), we skip 1000 configurations in each replica for thermalization

5 Conclusion

In this letter, we have investigated the BH method, proposed in Ref. [1], using a scalar theory on the lattice. We have indeed verified that it is a viable method to obtain the scattering length, and that it produces results that are compatible with those of the Lüscher method [14]. The most reliable strategy to analyse the four-point function is found to be the use of finite differences in time to remove an overall constant term.

We observe a systematic difference between the Lüscher and BH method. Interestingly, for each ensemble separately both determinations appear compatible. The systematic trend becomes evident only after averaging over all runs, as shown in the bands of Fig. 3. This might be attributed to different lattice artefacts, since both methods represent different estimators for \(a_0\). We are not able to check this hypothesis here, because we cannot take the continuum limit. However, the different systematics of the two methods offer in general a useful opportunity for cross checks.

The statistical error is similar in both approaches. Also the scaling in L appears to be similar, with maybe a slight advantage for the BH method. However, any advantage of one method compared to another one will in general depend on the theory considered. We conclude that it seems promising to use the BH method in lattice QCD for instance for \(\pi N\) scattering, where also the lattice spacing dependence could be investigated.