1 Introduction

Hypernuclei are bound states of one or two hyperons together with a core composed of nucleons. They extend the nuclear chart into a third dimension, augmenting the usual two dimensions of proton number and neutron number. We will use the notation Y for a \(\varLambda \) or \(\Sigma \) hyperon and N for a nucleon. Due to the scarcity of direct hyperon–nucleon (YN) and hyperon–hyperon (YY) scattering data, these unusual forms of baryonic matter play an important role in pinning down the fundamental baryon–baryon forces. This requires on the one hand an effective field theory (EFT) description of the underlying forces, as pioneered in Refs. [1, 2], and on the other hand a numerically precise and consistent method to solve the nuclear A-body problem, such as nuclear lattice EFT (NLEFT) [3, 4]. For calculations combining these chiral EFT forces at LO and NLO [5, 6] with other many-body methods, see e.g. Refs. [7,8,9,10,11,12].

In view of the success of NLEFT in the description of nuclear spectra and reactions, it seems natural to extend this method to hypernuclei. However, this is not quite straightforward. While one can extend the four spin-isospin degrees of freedom comprising the nucleons to include the \(\varLambda \) and \(\Sigma \) states [13], this has not been done because there is no longer an approximate symmetry such as Wigner’s SU(4) symmetry [14] that protects the Monte Carlo (MC) simulations against strong sign oscillations when using auxiliary fields.Footnote 1 The physics of hypernuclei therefore requires a different approach, and in this paper we show how the computational problems are solved using the impurity lattice Monte Carlo (ILMC) method.

The ILMC method was introduced in Ref. [16] in the context of a Hamiltonian theory of spin-up and spin-down fermions, and applied to the intrinsically non-perturbative physics of Fermi polarons in two dimensions in Ref. [17]. The ILMC method is particularly useful for the case where only one fermion (of either species) is immersed in a “sea” of the other species. Within the standard auxiliary field Monte Carlo method, such an extreme imbalance would lead to unacceptable sign oscillations in the Monte Carlo probability weight. In the ILMC method, the minority particle is “integrated out”, resulting in a formalism where only the majority species fermions appear as explicit degrees of freedom, while the minority fermion is represented by a “worldline” in Euclidean projection time. The spatial position of this worldline is updated using Monte Carlo updates, while the interactions between the majority fermions are described by the auxiliary field formalism [4].

Here, we apply the ILMC method to the inclusion of hyperons into NLEFT simulations. We identify the \(\varLambda \) hyperon as the minority species, which we represent by a worldline in Euclidean time. This \(\varLambda \) worldline is treated as immersed in an environment consisting of some number of nucleons. We focus on the Monte Carlo calculation of the binding energy of light hypernuclei, by means of a simplified YN interaction, consisting of a single contact interaction, tuned to a best description of the the empirical binding energies of the s-shell hypernuclei with \(A=3,4,5\).Footnote 2 For the NN interaction, we use a simple leading order interaction similar to that described in Ref. [18]. We benchmark our ILMC results against Lanczos calculations of transfer matrix and exact Euclidean projection calculations with initial/final states and number of time steps that match the ILMC calculations. We note that our Monte Carlo method is free from any approximation about the nodal structure of the many-body wave function. This is the first application of such unconstrained Monte Carlo simulations to hypernuclei.

This paper is organized as follows. In Sect. 2, we present the path integral formalism for our system of nucleons and one hyperon. We first write the nucleon–nucleon interaction first without auxiliary fields and then with auxiliary fields. In Sect. 3 we present the equivalent system using normal-ordered transfer matrices. In Sect. 4, we derive the impurity worldline formalism for the chosen YN interaction, and introduce the concept of the “reduced” transfer matrix operator, which acts on the nucleons only. In Sect. 5, we discuss the Monte Carlo updating of the hyperon worldline and the auxiliary fields, which encode the interactions between nucleons. In Sect. 6, we present results for the ground state energies of the s-shell nuclei and hypernuclei. In Sect. 7, we conclude with a discussion of future improvements and applications of the impurity lattice Monte Carlo method to hypernuclei and other quantum many-body systems.

2 Path integral formalism

We develop the ILMC formalism following Ref. [16], who considered a system of spin-up and spin-down fermions, with a contact interaction which operates between fermions of opposite spin. The situation here is completely analogous, we have one majority species, the nucleons, and one impurity, the \(\varLambda \). As usual in NLEFT, we consider positions on a spatial lattice denoted by \(\mathbf {n}\) and lattice spacing a. We also assume that Euclidean time has been discretized, such that slices of the Euclidean time are denoted by \(n_t\) with temporal lattice spacing \(a_t\). The partition function can be expressed in terms of the Grassmann path integral

$$\begin{aligned} {\mathcal {Z}} = \int \left[ \prod _{\begin{array}{c} \mathbf {n}, n_t \\ s=N,Y \end{array}} d\zeta _s^{}(\mathbf {n}, n_t^{}) d\zeta _s^*(\mathbf {n}, n_t^{}) \right] \exp (-S[\zeta , \zeta ^*]), \end{aligned}$$
(1)

where the subscripts N refer to all nucleon spin and isospin components and Y refers to all hyperon spin components. In this study we consider only \(\varLambda \) hyperons. In future work we will also consider \(\Sigma \) hyperons or account for their influence via three-baryon interactions involving a \(\varLambda \) and two nucleons. We also make the simplifying assumption that the hyperon–nucleon and nucleon–nucleon interaction are spin-independent and neglect Coulomb interactions. Because of the spin-independent interaction and the fact that we have only one Lambda hyperon, from this point onward we can restrict our attention to only one spin component of the hyperon.

Assuming that the exponent of the Euclidean action in Eq. (1) is treated by a Trotter decomposition, we find

$$\begin{aligned} S[\zeta , \zeta ^*]&\equiv \sum _{n_t} \bigg \{ S_t^{}[\zeta , \zeta ^*,n_t^{}] + S_Y^{}[\zeta ^{}, \zeta ^*,n_t^{}] \nonumber \\&\quad + S_N^{}[\zeta ^{}, \zeta ^*,n_t^{}] + S_{YN}^{}[\zeta , \zeta ^*,n_t^{}] \nonumber \\&\quad + S_{NN}^{}[\zeta , \zeta ^*,n_t^{}] \bigg \}, \end{aligned}$$
(2)

where the component due to the time derivative is

$$\begin{aligned} S_t^{}[\zeta , \zeta ^*,n_t^{}]&\equiv \!\!\! \sum _{\mathbf {n},s = N,Y} \zeta _s^*(\mathbf {n},n_t^{}) \nonumber \\&\quad \times [ \zeta _s^{}(\mathbf {n},n_t^{}+1) - \zeta _s^{}(\mathbf {n},n_t^{}) ], \end{aligned}$$
(3)

while \(S_Y^{}\) and \(S_N^{}\) describe the kinetic energies of the hyperons and nucleons, respectively. Further, \(S_{YN}\) provides the YN interaction, and \(S_{NN}\) the NN interaction, which we shall consider next.

2.1 The hyperon–nucleon interaction

For the hyperons, we take for simplicity the lowest-order (unimproved) kinetic energy

$$\begin{aligned} S_Y^{}[\zeta , \zeta ^*,n_t^{}]&\equiv 6h \sum _{\mathbf {n}} \zeta _Y^*(\mathbf {n},n_t^{}) \zeta _Y^{}(\mathbf {n},n_t^{}) \nonumber \\&\quad - h \sum _{\mathbf {n}} \sum _{l = 1}^3 \, \zeta _Y^*(\mathbf {n},n_t^{}) [ \zeta _Y^{}(\mathbf {n}+{\hat{e}}_l^{},n_t^{})\nonumber \\&\quad + \zeta _Y^{}(\mathbf {n}-{\hat{e}}_l^{},n_t^{}) ], \end{aligned}$$
(4)

with

$$\begin{aligned} h \equiv \frac{\alpha _t^{}}{2m_Y^{}}, \end{aligned}$$
(5)

where \(m_Y\) is the hyperon mass, and we have defined \(\alpha _t^{} \equiv a_t^{}/a\) as the ratio of temporal and spatial lattice spacings.

The YN interaction is given by

$$\begin{aligned}&S_{YN}^{}[\zeta , \zeta ^*,n_t^{}] \equiv \alpha _t^{} C_{YN}^{} \sum _{\mathbf {n}} \rho _N^{}(\mathbf {n}, n_t^{}) \rho _Y^{}(\mathbf {n}, n_t^{}), \end{aligned}$$
(6)

where

$$\begin{aligned} \rho _N^{}(\mathbf {n}, n_t^{}) \equiv \sum _{i,j} \rho _{i,j}^{}(\mathbf {n}, n_t^{}) \equiv \sum _{i,j} \zeta ^*_{i,j}(\mathbf {n}, n_t^{}) \zeta ^{}_{i,j}(\mathbf {n}, n_t^{}), \end{aligned}$$
(7)

and

$$\begin{aligned} \rho _Y^{}(\mathbf {n}, n_t^{}) \equiv \zeta _Y^*(\mathbf {n}, n_t^{}) \zeta _Y^{}(\mathbf {n}, n_t^{}), \end{aligned}$$
(8)

are nucleon and hyperon densities, respectively, with spin \(i=0,1\) (up, down) and isospin \(j=0,1\) (proton, neutron). The tuning of the coupling constant \(C_{YN}\) is discussed in Sect. 6.

Note that this is a simplified version of the pionless EFT calculation of Ref. [19], which also included a three-body interaction at LO. Such an interaction is sub-leading in chiral EFT approaches (such as NLEFT). See also the recent work in Ref. [20].

2.2 The nucleon–nucleon interaction

For the kinetic energy of the nucleon degrees of freedom, we likewise use the lowest-order expression

$$\begin{aligned} S_N^{}[\zeta , \zeta ^*,n_t^{}]&\equiv \frac{3\alpha _t^{}}{m_N^{}}\sum _{\mathbf {n}} \rho _N^{}(\mathbf {n}, n_t^{}) \nonumber \\&\quad -\frac{\alpha _t^{}}{2m_N^{}} \sum _{\mathbf {n}} \sum _{l = 1}^3 \bigg [\rho _N^{}(\mathbf {n}, \mathbf {n}+{\hat{e}}_l^{}, n_t^{}) \nonumber \\&\quad + \rho _N^{}(\mathbf {n}, \mathbf {n}-{\hat{e}}_l^{}, n_t^{})\bigg ], \end{aligned}$$
(9)

where

$$\begin{aligned}&\rho _N^{}(\mathbf {n}, \mathbf {n}^\prime , n_t^{}) \equiv \sum _{i,j} \zeta ^*_{i,j}(\mathbf {n}, n_t^{}) \zeta ^{}_{i,j}(\mathbf {n}^\prime , n_t^{}), \end{aligned}$$
(10)
$$\begin{aligned}&\rho _N^{}(\mathbf {n}, n_t^{}) \equiv \rho _N^{}(\mathbf {n}, \mathbf {n}, n_t^{}), \end{aligned}$$
(11)

and \(m_N^{}\) is the nucleon mass. Here, the \({\hat{e}}_l^{}\) are unit vectors in lattice direction l.

The Wigner SU(4)-symmetric part of the leading-order (LO) NN interaction of Refs. [21,22,23] is used for the present work. This is an approximate symmetry [14] of the low-energy nucleon–nucleon interactions, where the spin and isospin degrees of freedom of the nucleons can be rotated as four components of an SU(4) multiplet. We have

$$\begin{aligned} S_{NN}^{}[\zeta , \zeta ^*,n_t^{}]&\equiv \frac{\alpha _t^{} C_{NN}^{}}{2} \! \! \! \sum _{\mathbf {n}, \mathbf {n}^\prime , \mathbf {n}^{\prime \prime }} \rho _N^s(\mathbf {n}^\prime ,n_t^{}) f_{s_\mathrm{L}^{}}^{}(\mathbf {n}^\prime - \mathbf {n}) \nonumber \\&\quad \times f_{s_\mathrm{L}^{}}^{}(\mathbf {n} - \mathbf {n}^{\prime \prime }) \rho _N^s(\mathbf {n}^{\prime \prime },n_t^{}), \end{aligned}$$
(12)

where

$$\begin{aligned} \rho _N^s(\mathbf {n}, n_t^{}) \equiv \sum _{i,j} \zeta ^{s_\mathrm{NL}^{}*}_{i,j}(\mathbf {n}, n_t^{}) \zeta ^{s_\mathrm{NL}^{}}_{i,j}(\mathbf {n}, n_t^{}), \end{aligned}$$
(13)

is the smeared nucleon density, and the (local) smearing function \(f_{s_\mathrm{L}}\) is defined as

$$\begin{aligned} f_{s_\mathrm{L}^{}}(\mathbf {n})&\equiv 1 \; \mathrm{for} \; | \mathbf {n} | = 0, \nonumber \\&\equiv s_\mathrm{L}^{} \; \mathrm{for} \; | \mathbf {n} | = 1, \nonumber \\&\equiv 0 \; \mathrm{otherwise}, \end{aligned}$$
(14)

and the (non-locally) smeared Grassmann fields are given by

$$\begin{aligned} \zeta ^{s_\mathrm{NL}^{}}_{i,j}(\mathbf {n}, n_t^{}) \equiv \zeta _{i,j}^{}(\mathbf {n}, n_t^{}) + s_\mathrm{NL}^{} \sum _{|\mathbf {n}^\prime | = 1} \zeta _{i,j}^{}(\mathbf {n} + \mathbf {n}^\prime , n_t^{}), \end{aligned}$$
(15)

and

$$\begin{aligned} \zeta ^{s_\mathrm{NL}^{}*}_{i,j}(\mathbf {n}, n_t^{}) \equiv \zeta ^*_{i,j}(\mathbf {n}, n_t^{}) + s_\mathrm{NL}^{} \sum _{|\mathbf {n}^\prime | = 1} \zeta ^*_{i,j}(\mathbf {n} + \mathbf {n}^\prime , n_t^{}), \end{aligned}$$
(16)

where the values of the parameters \(C_{NN}\), \(s_\mathrm{L}\) and \(s_\mathrm{NL}\) used for the present work are discussed in Sect. 6 (see also Ref. [18] for a full treatment).

For the NN interaction we can reduce the expressions quadratic in the nucleon densities using the relation

$$\begin{aligned}&\exp \left( -\frac{\alpha _t^{} C_{NN}^{}}{2} {\tilde{\rho }}^2 \right) \nonumber \\&\quad = \frac{1}{\sqrt{2\pi }}\int _{-\infty }^\infty d\phi \, e^{-\frac{\phi ^2}{2}} \exp \left( \sqrt{-\alpha _t^{} C_{NN}^{}} \, \phi {\tilde{\rho }} \right) , \end{aligned}$$
(17)

where

$$\begin{aligned} {\tilde{\rho }} \equiv \sum _{\mathbf {n}^\prime } f_{s_\mathrm{L}^{}}^{}(\mathbf {n} - \mathbf {n}^{\prime }) \rho _N^s(\mathbf {n}^{\prime }, n_t^{}), \end{aligned}$$
(18)

for each lattice site (\(\mathbf {n}, n_t^{}\)), such that \(\phi (\mathbf {n}, n_t^{})\) is treated as a scalar auxiliary (Hubbard-Stratonovich) field. The NN action then becomes

$$\begin{aligned}&\exp (-S_{NN}^{}[\zeta , \zeta ^*,n_t^{}]) \nonumber \\&= \int \prod _{\mathbf {n}} \left[ \frac{d\phi (\mathbf {n},n_t^{})}{\sqrt{2\pi }} e^{-\frac{1}{2} \phi ^2(\mathbf {n},n_t^{})}\right] \exp (-S_{\phi N}^{}[\zeta , \zeta ^*,n_t^{}]), \end{aligned}$$
(19)

for Euclidean time slice \(n_t^{}\), where

$$\begin{aligned}&S_{\phi N}^{}[\zeta , \zeta ^*,n_t^{}] = -\sqrt{-\alpha _t^{} C_{NN}^{}} \nonumber \\&\qquad \times \sum _{\mathbf {n}, \mathbf {n}^\prime } \phi (\mathbf {n}, n_t^{}) f_{s_\mathrm{L}^{}}^{}(\mathbf {n} - \mathbf {n}^\prime ) \rho _N^s(\mathbf {n}^\prime , n_t^{}), \end{aligned}$$
(20)

for \(C_{NN} < 0\).

In the ILMC calculations, the path integral over the auxiliary field \(\phi \) is evaluated using either local Metropolis algorithm updates or global lattice updates using the hybrid Monte Carlo (HMC) algorithm. See Ref. [18] for more details on efficient Monte Carlo algorithms.

3 Transfer matrix formalism

Derivations of Feynman rules are usually easier to perform in the Grassmann formalism. However, actual NLEFT calculations are performed using the transfer matrix Monte Carlo method. As noted in Ref. [16], the Grassmann and transfer matrix operator formulations are connected by the exact relationship

$$\begin{aligned}&\mathrm {Tr} \big \{ : f_{N_t-1}^{}[a_s^{}(\mathbf {n}),a_{s^\prime }^\dagger (\mathbf {n}^\prime )] : \cdots : f_0^{}[a_s^{}(\mathbf {n}),a_{s^\prime }^\dagger (\mathbf {n}^\prime )] : \big \} \nonumber \\&\quad = \int \Bigg [ \prod _{\begin{array}{c} \mathbf {n}, n_t \\ s=N,Y \end{array}} d\zeta _s^{}(\mathbf {n}, n_t^{}) d\zeta _s^*(\mathbf {n}, n_t^{}) \Bigg ]\nonumber \\&\quad \exp \left( -\sum _{n_t} S_t^{}[\zeta , \zeta ^*, n_t^{}]\right) \nonumber \\&\qquad \times \prod _{n_t=0}^{N_t-1} f_{n_t}^{}\big [\zeta _s^{}(\mathbf {n}, n_t^{}), \zeta _{s^\prime }^*(\mathbf {n}^\prime , n_t^{})\big ], \end{aligned}$$
(21)

where f is an arbitrary function, \(a_s^\dagger \) and \(a_s^{}\) denote creation and annihilation operators for the fermion degrees of freedom, and the colons signify normal ordering. Using this identity, we can write the partition function in Eq. (1) as

$$\begin{aligned} {\mathcal {Z}} = \mathrm {Tr}({\hat{M}}^{N_t^{}}), \end{aligned}$$
(22)

where \({\hat{M}}\) is the (normal-ordered) transfer matrix operator.

We can use Eq. (21) to define the full transfer matrix operator as

$$\begin{aligned} {\hat{M}} = \, : \exp (-\alpha _t^{}{\hat{H}}) :. \end{aligned}$$
(23)

with Hamiltonian

$$\begin{aligned} {\hat{H}} \equiv {\hat{H}}_0^N + {\hat{H}}_0^Y + {\hat{H}}_{NN}^{} + {\hat{H}}_{YN}^{}. \end{aligned}$$
(24)

We now go through each of these terms. The nucleon kinetic energy term is

$$\begin{aligned} {\hat{H}}_0^N&\equiv \frac{3}{m_N^{}}\sum _{\mathbf {n}} {\hat{\rho }}_N^{}(\mathbf {n}) \nonumber \\&\quad -\frac{1}{2m_N^{}} \sum _{\mathbf {n}} \sum _{l = 1}^3 \left[ {\hat{\rho }}_N^{}(\mathbf {n}, \mathbf {n}+{\hat{e}}_l^{}) + {\hat{\rho }}_N^{}(\mathbf {n}, \mathbf {n}-{\hat{e}}_l^{})\right] , \end{aligned}$$
(25)

with

$$\begin{aligned}&{\hat{\rho }}_N^{}(\mathbf {n}, \mathbf {n}^\prime ) \equiv \sum _{i,j} a^{\dagger }_{i,j}(\mathbf {n}) a^{}_{i,j}(\mathbf {n}^\prime ), \end{aligned}$$
(26)
$$\begin{aligned}&{\hat{\rho }}_N^{}(\mathbf {n}) \equiv {\hat{\rho }}_N^{}(\mathbf {n},\mathbf {n}). \end{aligned}$$
(27)

The hyperon kinetic energy term is

$$\begin{aligned} {\hat{H}}_0^N&\equiv \frac{3}{m_Y^{}}\sum _{\mathbf {n}} {\hat{\rho }}_Y^{}(\mathbf {n}) \nonumber \\&\quad -\frac{1}{2m_Y^{}} \sum _{\mathbf {n}} \sum _{l = 1}^3 \left[ {\hat{\rho }}_Y^{}(\mathbf {n}, \mathbf {n}+{\hat{e}}_l^{}) + {\hat{\rho }}_Y^{}(\mathbf {n}, \mathbf {n}-{\hat{e}}_l^{})\right] , \end{aligned}$$
(28)

with

$$\begin{aligned}&{\hat{\rho }}_Y^{}(\mathbf {n}, \mathbf {n}^\prime ) \equiv \sum _{i,j} a^{\dagger }_Y(\mathbf {n}) a^{}_Y(\mathbf {n}^\prime ), \end{aligned}$$
(29)
$$\begin{aligned}&{\hat{\rho }}_Y^{}(\mathbf {n}) \equiv {\hat{\rho }}_Y^{}(\mathbf {n},\mathbf {n}). \end{aligned}$$
(30)

The NN interaction is

$$\begin{aligned} {\hat{H}}_{NN}^{}&= \frac{C_{NN}^{}}{2}: \! \! \! \sum _{\mathbf {n}, \mathbf {n}^\prime , \mathbf {n}^{\prime \prime }} {\hat{\rho _N^s}}(\mathbf {n}^\prime ) f_{s_\mathrm{L}^{}}^{}(\mathbf {n}^\prime - \mathbf {n}) \nonumber \\&\quad \times f_{s_\mathrm{L}^{}}^{}(\mathbf {n} - \mathbf {n}^{\prime \prime }) {\hat{\rho _N^s}}(\mathbf {n}^{\prime \prime }):, \end{aligned}$$
(31)

with

$$\begin{aligned} {\hat{\rho _N^s}}(\mathbf {n}) \equiv \sum _{i,j} a^{s_\mathrm{NL}^{}\dagger }_{i,j}(\mathbf {n}) a^{s_\mathrm{NL}^{}}_{i,j}(\mathbf {n}), \end{aligned}$$
(32)

and the operators \(a^{s_\mathrm{NL}^{}\dagger }_{i,j}(\mathbf {n})\) and \(a^{s_\mathrm{NL}^{}}_{i,j}(\mathbf {n})\) are defined in terms of the (non-locally) smeared annihilation and creation operators

$$\begin{aligned} a^{s_\mathrm{NL}^{}}_{i,j}(\mathbf {n}) \equiv a_{i,j}^{}(\mathbf {n}) + s_\mathrm{NL}^{} \sum _{|\mathbf {n}^\prime | = 1} a_{i,j}^{}(\mathbf {n} + \mathbf {n}^\prime ), \end{aligned}$$
(33)

and

$$\begin{aligned} a^{s_\mathrm{NL}^{}\dagger }_{i,j}(\mathbf {n}) \equiv a^{\dagger }_{i,j}(\mathbf {n}) + s_\mathrm{NL}^{} \sum _{|\mathbf {n}^\prime | = 1} a^{\dagger }_{i,j}(\mathbf {n} + \mathbf {n}^\prime ). \end{aligned}$$
(34)

The YN interaction is

$$\begin{aligned} {\hat{H}}_{YN}^{} = C_{YN}^{} \sum _{\mathbf {n}} {\hat{\rho }}_N^{}(\mathbf {n}) \hat{\rho }_Y^{}(\mathbf {n}). \end{aligned}$$
(35)

When rewriting the nucleon-nulceon interaction with auxiliary fields, the partition function takes the form

$$\begin{aligned} {\mathcal {Z}} = \int&\prod _{\mathbf {n},n_t} \left[ \frac{d\phi (\mathbf {n},n_t^{})}{\sqrt{2\pi }} e^{-\frac{1}{2} \phi ^2(\mathbf {n},n_t^{})}\right] \mathrm {Tr}[{\hat{M}}^{(N_t-1)} \cdots {\hat{M}}^{(0)}], \end{aligned}$$
(36)

where

$$\begin{aligned} {\hat{M}}^{(n_t)} \equiv \, : \exp (-\alpha _t^{}{\hat{H}}^{(n_t)}) :, \end{aligned}$$
(37)

with

$$\begin{aligned} {\hat{H}}^{(n_t)} \equiv {\hat{H}}_0^N + {\hat{H}}_0^Y + {\hat{H}}^{(n_t)}_{\phi N} + {\hat{H}}_{YN}^{}, \end{aligned}$$
(38)

and

$$\begin{aligned} {\hat{H}}^{(n_t)}_{\phi N}&= -\sqrt{-\alpha _t^{} C_{NN}^{}} \nonumber \\&\quad \times \sum _{\mathbf {n}, \mathbf {n}^\prime } \phi (\mathbf {n}, n_t^{}) f_{s_\mathrm{L}^{}}^{}(\mathbf {n} - \mathbf {n}^\prime ) {\hat{\rho }}_N^s(\mathbf {n}^\prime , n_t^{}). \end{aligned}$$
(39)

4 Impurity worldlines and reduced transfer matrices

We now integrate out the hyperon degree of freedom and derive a “reduced” transfer matrix operator , which acts on the nucleon degrees of freedom only. Let us consider the transfer matrix between time slices \(n_t\) and \(n_t+1\). Let \(\mathinner {|{\mathbf {n}}\rangle }\) represent the state with the hyperon at lattice site \(\mathbf {n}\). We first consider the case when the hyperon hops from lattice site \(\mathbf {n}\) to \(\mathbf {n}\pm {\hat{e}}_l\). We then have

(40)

where is the reduced transfer matrix operator acting on only the nucleons with

(41)

where

$$\begin{aligned} {{\hat{H}}}^{(n_t)}_{\mathbf {n}\pm {\hat{e}}_l,\mathbf {n}} = H_0^N + {\hat{H}}_0^Y + {\hat{H}}^{(n_t)}_{\phi N}. \end{aligned}$$
(42)

Next we consider the case when the hyperon remains at lattice site \(\mathbf {n}\) between time slices \(n_t\) and \(n_t+1\). We then have

(43)

where the reduced transfer matrix is

(44)

with

$$\begin{aligned} {{\hat{H}}}^{(n_t)}_{\mathbf {n},\mathbf {n}} = H_0^N + {\hat{H}}_0^Y + \hat{H}^{(n_t)}_{\phi N}+\frac{C_{YN}^{}}{1-6h} {\hat{\rho }}_N^{}(\mathbf {n}) + \cdots . \end{aligned}$$
(45)

The ellipses refers to terms with higher powers of \({\hat{\rho }}\) and additional factors of \(\alpha _t\). These are lattice artifacts that disappear in the limit \(\alpha _t \rightarrow 0\). They are needed to cancel the higher-order powers of the \(C_{YN}\) term when expanding the exponential in Eq. (44) beyond the linear term. In the full transfer matrix such terms vanish upon normal ordering of the hyperon field since we have only one hyperon in our system. However, when we integrate out the hyperon worldline, such terms no longer vanish since the hyperon is no longer a dynamical field.

In our simulations here we drop all such higher-order terms from our ILMC simulations. This choice constitutes a redefinition of our starting interaction to include some small higher-body interactions between the hyperon and more than one nucleon. Since we will take \(\alpha _t\) to be very small, the most important induced higher-body interaction is a small three-body interaction. The three-body interaction has the form

$$\begin{aligned} {\hat{H}}_{YNN}^{} = - \frac{\alpha _t^{} C_{YN}^2}{2(1-6h)} \sum _{\mathbf {n}} {\hat{\rho }}_N^{}(\mathbf {n}) {\hat{\rho }}_N^{}(\mathbf {n}) {\hat{\rho }}_Y^{}(\mathbf {n}), \end{aligned}$$
(46)

We see explicitly that this term is a lattice artifact that disappears when \(\alpha _t^{} \rightarrow 0\).

5 Monte Carlo calculation

We now describe how ILMC calculations are performed using the Projection Monte Carlo (PMC) method. Let us first assume that the impurity has been fixed at a given spatial lattice site, and that no “hopping” of the impurity occurs during the Euclidean time evolution. We shall then relax this constraint, and discuss a practical algorithm for updating the configuration of the hyperon worldline.

5.1 Stationary impurity

For a stationary hyperon impurity, the reduced transfer matrix is given by Eq. (44), and for the purposes of the PMC calculation, we define the Euclidean projection amplitude

(47)

for a product of \(N_t^{}\) Euclidean time slices, where j and k denote different initial cluster states. As usual, this is expressed as a determinant of single-particle amplitudes, which gives

$$\begin{aligned} Z_{jk}^{}(N_t^{}) = \mathrm {det} \, M_{p\times p}^{jk}, \end{aligned}$$
(48)

where

(49)

for p nucleons. By means of the projection amplitudes (48), we construct

$$\begin{aligned}{}[{\hat{M}}^a_{}(N_t^{})]_{qq^\prime }^{} \equiv \sum _{q^{\prime \prime }}Z_{qq^{\prime \prime }}^{-1}(N_t^{}) Z_{q^{\prime \prime }q^\prime }^{}(N_t^{}+1), \end{aligned}$$
(50)

which is known as the “adiabatic transfer matrix”. If we denote the eigenvalues of (50) by \(\lambda _i(N_t)\), we find

$$\begin{aligned} \lambda _i^{}(N_t^{}) = \exp (-\alpha _t^{}E_i^{}(N_t^{}+1/2)), \end{aligned}$$
(51)

such that the low-energy spectrum is given by the “transient” energies

$$\begin{aligned} E_i^{}(N_t^{}+1/2) = -\frac{\log (\lambda _i^{}(N_t^{}))}{\alpha _t^{}}, \end{aligned}$$
(52)

at finite temporal lattice spacing \(a_t\). For the case of a single trial cluster state with p nucleons, Eq. (48) reduces to

$$\begin{aligned} Z(N_t^{}) = \mathrm {det} \, M_{p\times p}^{00}, \end{aligned}$$
(53)

for the case of a single trial state. The ground-state energy is obtained from

$$\begin{aligned} E_0^{}(N_t^{}+1/2) = -\frac{\log (Z(N_t^{}+1)/Z(N_t^{}))}{\alpha _t^{}}, \end{aligned}$$
(54)

in the limit \(N_t \rightarrow \infty \), where the exact low-energy spectrum of the transfer matrix will be recovered. Note that the argument \(N_t+1/2\) is conventionally assigned to the transient energy computed from the ratio of projection amplitudes evaluated at Euclidean time steps \(N_t+1\) and \(N_t\).

As an example, for the hypertriton we have \(p = 2\) nucleons after the impurity hyperon has been integrated out. We start the Euclidean time projection with a single initial trial cluster state (\(j = k = 0\)) consisting of a spin-up proton and a spin-up neutron. As there are no terms that mix spin or isospin, the other components of each single-particle state are set to zero, and remain so during the PMC calculation. For the spatial parts of the nucleon wave functions, we may choose, for example, the zero-momentum state

$$\begin{aligned} | \phi _{0,0}^{} \rangle = | \phi _{1,0}^{} \rangle = \langle 0,0,0 \rangle , \end{aligned}$$
(55)

in the notation of Ref. [16], which denotes plane-wave orbitals in a cubic box. In principle, we may also choose any other plane-wave state with non-zero momentum (see Table 1 of Ref. [16]), or any other more complicated trial state. For the heavier nuclei, it is indeed better to choose an initial state where the nucleons are clustered together. In this case we sum over all possible translations of the cluster in order construct an initial state with zero total momentum.

5.2 Hopping impurity

If the hyperon impurity is allowed to hop between nearest-neighbor sites (from one Euclidean time slice to the next), the Euclidean projection amplitude becomes a sum over hyperon worldline configurations. This gives

(56)

where the product

(57)

is expressed in terms of the reduced transfer matrices (44) and (41). Here, \(\mathbf {n}_j\) denotes the spatial position of the hyperon impurity (which has been integrated out) on time slice j. The expressions for the projection amplitude and determinant are generalized to

$$\begin{aligned} Z_{jk}^{}(N_t^{}) = \sum _{\mathbf {n}_0^{}, \ldots , \mathbf {n}_{N_t}^{}} \mathrm {det} \, M_{p\times p}^{jk}, \end{aligned}$$
(58)

where

(59)

such that the determinant is now to be computed over all possible hyperon wordline configurations.

We note that the worldline configuration is to be updated stochastically using a Metropolis algorithm. Thus, proposed changes in the impurity worldline are accepted or rejected by importance sampling with \(|Z_{jj}(N_t)|\) as the probability weight function. Here, j denotes one of the initial trial nucleon cluster states.

5.3 Worldline updates

The updating of the impurity worldline is handled in two steps: The generation of a new proposed worldline, and a Metropolis accept/reject step to determine whether to use the generated worldline. For this work, the worldline \(W(\mathbf {n}, n_t^{})\) is a function of only the lattice site \(\mathbf {n}\) and the Euclidean time step \(n_t^{}\), and is equal to 1 where the impurity is present, and 0 at all other lattice points. From the expressions of the reduced transfer matrices, the worldline at two adjacent time steps, \(W(\mathbf {n}^\prime ,n_t^{})\) and \(W^\prime (\mathbf {n}^\prime , n_t^{}+1)\) must obey the relation \(|\mathbf {n} - \mathbf {n}^\prime | \le 1\). For an illustration of the impurity (hyperon) worldline, see Fig. 1.

Fig. 1
figure 1

Illustration of the hyperon worldline. In the reduced transfer matrix formalism, the hyperon has been “integrated out”, and the interaction between the hyperon and the nucleons is mediated by an effective “background field” generated by the hyperon worldline

For the non-interacting worldline, we can generate new configurations from the free probabilities, as determined from the reduced transfer matrices. In this case, \(P_h = h\) is the hopping probability, and \(P_s = (1 - 6h)\) is the probability to remain stationary. When initializing the worldline at the beginning of the MC simulation, we may start from a configuration where the worldline is completely stationary (“cold start”) or one where the worldline either hops or remains stationary at each time step according to the probabilities \(P_h\) and \(P_s\) (“warm start”).

At the beginning of every sweep through the lattice, we propose a new worldline to use for that sweep. This is done by taking the previous worldline and choosing a random time at which we cut the worldline and regenerating it either in the forwards and backwards time direction. The new worldline is then accepted or rejected using a Metropolis accept or reject condition to preserve detailed balance associated with the absolute value of the amplitude.

6 Results

For the results presented in what follows, we use a spatial lattice spacing \(a = 1/\)(100 MeV) and temporal lattice spacing of \(a_t = 1/\)(300 MeV). The non-local smearing parameter is chosen to be \(s_\mathrm{NL} = 0.2\), and the local smearing parameter is set to \(s_\mathrm{L} = 0.0\). Since we only consider s-shell nuclei and hypernuclei in this study, the local attraction provided by \(s_\mathrm{L}\) for heavier nuclei is not needed [21]. The coupling constant \(C_{NN}\) is set to \(-7.5\times 10^{-6}\) \(\hbox {MeV}^{-2}\), and this combination of parameters yields a nucleon–nucleon scattering length \(a_{NN} = 6.86\) fm and effective range \(r_{NN} = 1.77\) fm. The scattering length and effective range are calculated using Lüscher’s finite volume method [24], as described in the Appendix of Ref. [25]. We find that these parameters produce good results for the average S-wave phase shifts as well as the three- and four-nucleon binding energies. The exact transfer matrix calculation of the three-nucleon system and the Monte Carlo calculation of the four-nucleon system are both described in the following paragraphs. As stated previously, in this study the spin-dependent terms of the nucleon–nucleon interaction are not accounted for.

For the YN interaction, we set \(C_{YN}\) according to the best overall fit to the light hypernuclei. Fitting to the \(\varLambda \) separation energies for \(^3_\varLambda \)H, \(^4_\varLambda \)H/He, and \(^5_\varLambda \)He, we find \(C_{YN} = -1.6 \times 10^{-5}\) \(\hbox {MeV}^{-2}\). This gives \(a_{YN} = -0.45\) fm for the scattering length and \(r_{YN} = -0.45\) fm for the effective range. In Table 1, we present benchmark calculations of the ILMC results for \(^3_\varLambda \)H in comparison with exact transfer matrix calculations. We show the results for the energy as a function of Euclidean projection time.

Table 1 ILMC results for the energy of \(^3_\varLambda \)H versus Euclidean time in comparison with exact transfer matrix results for periodic box length 15.8 fm

We see that the agreement is quite good. The initial/final nucleon trial states for these calculations are taken to be spatially constant functions, which correspond to single-particle states of zero momentum in a periodic cubic box. The hyperon initial/final wave function is also taken be a constant function. Since we use a constant initial/final state wave function for the hyperon, the initial/final positions for the hyperon worldline are irrelevant in the Monte Carlo updating process. These exact transfer matrix calculations include the induced three-baryon interaction described in Eq. (46).

In Table 2, we present exact Lanczos transfer matrix calculations of the ground state of \(^2\)H, \(^3_\varLambda \)H, and separation energy \(B_\varLambda ^{}\), as a function of periodic box length. In this work, we also present the exact Lanczos transfer matrix calculation wherever it is computationally possible and using Monte Carlo for cases where it is not. Given the extremely small \(\varLambda \) separation energy, it is necessary to go to very large volumes in order to remove finite volume artifacts. Interestingly, \(B_\varLambda ^{}\) is found to be relatively constant with the periodic box size L. This suppression of the finite volume dependence is an indication that the asymptotic normalization coefficient of the hypertriton wave function is small [26, 27].

Table 2 Exact transfer matrix results for \(^2\)H, \(^3_\varLambda \)H, and the separation energy \(B_\varLambda ^{}\) versus periodic box length

In Fig. 2, we present ILMC results for the \(^4_{\varLambda }\)H/He energy versus Euclidean time. These calculations use a periodic box size of \(L = 15.8\) fm with up to \(N_t = 300\) Euclidean time steps. In order to extract the ground state energy, we use the extrapolation ansatz

$$\begin{aligned} E(t) = E_0^{} + c \exp (- \varDelta E t), \end{aligned}$$
(60)

which takes into account the residual dependence of the first excited state that couples to our initial/final states. For this calculation, we use an initial/final state where the nucleon states have a spatially decaying exponential form with respect to the nucleus center of mass, while the initial/final hyperon wave function is a constant function. It suffices to have an initial/final state with some overlap with the ground state wave function, and we find that these choices work very well.

Fig. 2
figure 2

ILMC results for the \(^4_{\varLambda }\)H/He energy versus Euclidean projection time in a periodic box size of \(L = 15.8\) fm. We extract the ground state energy using an exponential ansatz for the asymptotic time dependence

In Fig. 3, we show lattice Monte Carlo (LMC) results for the \(^4\)He energy versus Euclidean time. As there are no hyperons in this system, these are auxiliary field Monte Carlo calculations without impurity worldlines. These calculations use a periodic box size of \(L = 9.9\) fm with up to \(N_t = 150\) Euclidean time steps. In order to extract the ground state energy, we again use the exponential ansatz in Eq. (60). For this calculation, we again use an initial/final state where the nucleons have a spatially-decaying exponential form with respect to the nucleus center of mass.

Fig. 3
figure 3

LMC results for the \(^4\)He energy versus Euclidean projection time in a periodic box size of \(L = 9.9\) fm. We extract the ground state energy using an exponential ansatz for the asymptotic time dependence

In Fig. 4, ILMC results are shown for the \(^5_\varLambda \)He energy versus Euclidean time. These calculations use a periodic box size of \(L = 9.9\) fm with up to \(N_t = 250\) Euclidean time steps. We again use the exponential ansatz from Eq. (60) to extract the ground state energy. Similar to the \(^4_{\varLambda }\)H/He calculation, here we use an initial/final state where the nucleons have a spatially decaying exponential form with respect to the nucleus center of mass, while the initial/final hyperon wave function is a constant function.

Fig. 4
figure 4

ILMC results for the \(^5_\varLambda \)He energy versus Euclidean time in a periodic box size of \(L = 9.9\) fm. We extract the ground state energy using an exponential ansatz for the asymptotic time dependence

In Table 3, we present the lattice results for all of the s-shell nuclei and hypernuclei. The exact transfer matrix results are shown without error bars, while the ILMC and LMC results are shown with error bars that take into account stochastic errors and extrapolation errors. There is also a residual systematic error due to finite volume effects. For a box size of \(L = 29.6\) fm, the finite volume error on \(^2\)H is 0.04 MeV, and the estimated finite volume error for \(^3_\varLambda \)H is also \(\simeq 0.04\) MeV. As both corrections are in the same direction (with more binding at finite volume), the resulting finite volume error on the separation energy is \(<0.002\) MeV.

For a box size of \(L = 15.8\) fm, the finite volume error on \(^3\)H/He is \(\simeq 0.10\) MeV, and the estimated finite volume errors for \(^4_\varLambda \)H/He are also \(\simeq 0.10\) MeV. For a box size of \(L = 9.9\) fm, the finite volume error on \(^4\)He is \(\simeq 1.5\) MeV, and the estimated finite volume errors for \(^4_\varLambda \)H/He are \(\simeq 2.0\) MeV.

Table 3 Summary of lattice results (exact transfer matrix, ILMC and LMC) for the energies of light nuclei and hypernuclei, and for separation energies. Comparisons with experimental separation energies are given where such data exists. These comparisons are averaged over Wigner SU(4) and \(\varLambda \) spin components. For the case of \(^4_\varLambda \)H/He, we average over the \(0^+\) and \(1^+\) separation energies for \(^4_\varLambda \)H and \(^4_\varLambda \)He weighted by number of spin components. More data can be found in the review Ref. [33]

For the comparison with the experimental results, we average over Wigner SU(4) and \(\varLambda \) spin components where data exists. We see that while the \(B^\mathrm{exp}_\varLambda \) is larger than the experimental values for \(^3_\varLambda \)H and \(^5_\varLambda \)He, the separation is smaller than experimental value for \(^4_\varLambda \)H/He. This is an indication that there are deficiencies in our very simple treatment of the YN and NN interactions. However, this serves as a good starting point for determining the essential features of the YN interactions needed to describe the structure and properties of hypernuclei.

7 Discussion

We have shown, as a proof of principle, how state-of-the-art NLEFT calculations can be extended to include hyperons. As the number of hyperons in realistic hypernuclei is small (typically one or two) relative to the number of nucleons, we have applied the ILMC method whereby the hyperon “impurity” is integrated out and represented by a hyperon “worldline”, the position of which is updated during the MC calculation. Effectively, the standard NLEFT calculations for nucleons are augmented by a “background field” induced by the hyperon worldline. We have benchmarked the ILMC method by presenting preliminary MC results for the s-shell hypernuclei, using a simplified interaction similar to pionless EFT.

One of the most promising aspects of this work is the fact that the ILMC simulations scale very favorably with the number of nucleons. We have found that nearly all of the computational effort is consumed in calculating single-nucleon amplitudes as a function of the auxiliary field. As this part of the code scales linearly with the number of nucleons, it should be possible to perform calculations of hypernuclei with up to one hundred or more nucleons. We note also that the particular set of interactions that we have used here can also be directly applied to studying the properties of a bosonic impurity immersed in a superfluid Fermi gas. By modifying the included P-wave interactions of the impurity, we would also be able to describe the properties of an alpha particle immersed in a gas of superfluid neutrons. The possible applications of this method clearly go well beyond hypernuclear structure calculations and have general utility for numerous quantum many-body systems.

Returning to hypernuclear systems, the obvious next extension of this work is to include spin-dependent YN interactions. The importance of the spin-dependence of the YN interaction can be seen clearly in the splittings between the \(0^+\) and \(1^+\) states in \(^4_\varLambda \)H and \(^4_\varLambda \)He [34]. One should also include explicit \(\varLambda N\)-\(\Sigma N\) transitions, see e.g. Ref. [35], as well as one-meson exchange interactions that would put the YN interaction in the same EFT formalism [5, 6] as currently used for the NN interaction in NLEFT [23].

The number of adjustable parameters in the YN interaction will then increase. The most natural approach, in line with the treatment of the NN interaction, would be to fit such parameters to \(\varLambda N\) scattering phase shifts. However, due to the paucity of such data (especially at low energies), we expect to need at least the hypertriton binding energy as an additional constraint, as it is also done in continuum chiral EFT, see e.g. Ref. [6]. As the effects of \(\varLambda N\)-\(\Sigma N\) transitions are included, it may be necessary to use further empirical data on other light hypernuclei to constrain the relevant LECs. A further extension concerns the extension to \(S=-2\) hypernuclei, which on the one hand would involve the YY interactions [36,37,38] and on the other hand a modified ILMC algorithm for two interacting worldlines. Work along these lines is underway.