1 Introduction

While the microscopic origin of confinement in quantum chromodynamics (QCD) remains elusive due to the lack of analytic methods to solve QCD at small energies, the formation of a region of strong chromomagnetic flux, a flux tube, between quark and antiquark provides a heuristic explanation for quark confinement. Strong evidence for flux tube formation has been found in numerous simulations of lattice QCD, both in the quenched approximation, e.g. [1], and in simulations with dynamical fermions, e.g. [2] (for reviews and more detailed lists of references, see [3, 4]).

For large quark–antiquark distances R, the flux tube is expected to resemble a thin energy string, so that its dynamics will be governed by an effective (bosonic) string theory (EST). Here and in the following, we neglect the effects of dynamical quarks, which enable the formation of a light quark–antiquark pair from the vacuum and a breaking of the string. Following two decades of tremendous progress, a number of features of the EST, including the spectrum up to \(O(R^{-5})\) [5,6,7],Footnote 1 are by now rather well understood. See Refs. [4, 8] for recent reviews.Footnote 2 Most of the parameters in the EST are constrained by Lorentz symmetry and take universal values. The first nonuniversal parameter in the EST, denoted as \(\bar{b}_2\) (in its dimensionless version), appears at order \(O(R^{-4})\) and is the coefficient of a boundary term.

The spectrum of the flux tube excitations can be computed in numerical simulations of pure gauge theories, where dynamical fermions are absent. Typically good agreement between the EST predictions and the lattice results has been observed down to \(q\bar{q}\) separations where the EST is expected to break down (for a compilation of results see [4]). Since the nonuniversal coefficients appear in subleading terms, their extraction requires very accurate results for the energy levels up to large values of R. So far sufficient accuracy has only been achieved for the coefficient \(\bar{b}_2\) in three dimensions (3d) [15,16,17,18,19], where the simulations are at least an order of magnitude less expensive than in the 4d case. First hints for a nonvanishing \(\bar{b}_2\) in 4d \({\text {SU}}(3)\) gauge theory, however, have recently been obtained from both the groundstate energy—the static potential—and the flux tube profile at nonzero temperature [20, 21]. Despite a parametric suppression, the most precise computation of \(\bar{b}_2\) comes from the static potential, which can be obtained with much higher accuracy than the excitation spectrum. The consistency between the boundary coefficient extracted from the static potential [18] and the excited states has so far only been checked for a comparably coarse lattice spacing [15, 19]. In this article, we will extend and improve on our initial studies of the excitation spectrum in 3d \({\text {SU}}(2)\) gauge theory, Refs [15, 22,23,24]. In particular, we aim at investigating the continuum approach of the excited states at large R, which has not been possible with sufficient accuracy previously, and compare our results to the EST predictions using the EST parameters extracted in Ref. [18].

Currently, the main challenge from the theory side concerns the inclusion of corrections due to the vorticity of the flux tube, possibly showing up as massive modes on the worldsheet (e.g. [25,26,27,28]; see also the more detailed discussion and list of references in Ref. [18]). Candidate states with contributions from massive modes have been seen in 4d \({\text {SU}}(N)\) gauge theories [29,30,31,32]. For closed strings, the results for such an anomalous state are in good agreement with the contribution of a massive pseudoscalar particle on the worldsheet, known as the worldsheet axion [33, 34] (see also [35]). Similar anomalous states do not appear in 3d, which might be related to the absence of the topological coupling term and, consequently, less sensitivity of the energy levels to massive modes. In 3d, the presence of massive modes leads to an additional term which mixes with the boundary correction term at \(O(R^{-4})\) and, consequently, impacts the numerical result for \(\bar{b}_2\). Since the 3d massive mode contributions to the excited state energies are unknown, we cannot perform a direct comparison of the cases with and without massive modes as done for the static potential in [18] and leave this for future studies.

2 Excited states of the QCD flux tube

In this section, we will focus on the extraction of the excited states of the flux tube in 3d \({\text {SU}}(2)\) gauge theory for multiple lattice spacings. The comparison with the EST is left for the next section.

2.1 Computation of Wilson loops for excited states

While for the extraction of the static potential the ideal observables are Polyakov loop correlation functions (e.g. [18, 36, 37]), spatiotemporal Wilson loops of extents R and T, including operators coupling to particular quantum number channels on the spatial lines, are suitable observables for the extraction of excited states. The main difficulty is the need for large loops for which the signal-to-noise ratio decreases exponentially with the area of the loops. At a given value of the \(q\bar{q}\) distance R, the reliable high-precision extraction of the energy state demands the sufficient suppression and control of the contributions of excited states to the Wilson loop expectation value (cf. Eq. (2)). In the first comprehensive study, large correlation matrices including highly optimized operators in combination with smearing and anisotropic lattices have been used to extract the excitation spectrum in different gauge theories and three and four dimensions [30, 31, 38]. After careful diagonalization, excited state contributions are suppressed in the individual eigenvalue correlators,Footnote 3 so that the convergence to the \(T\rightarrow \infty \) limit is enhanced for the low-lying states. An alternative strategy has been pursued in [22,23,24], using an improved version of the Lüscher–Weisz multilevel algorithm [40] for error reduction to reliably extract large loops for a small number of spatial operators and temporal extents. The residual excited state contaminations have been removed using suitable fit functions.

Both strategies give reliable results for intermediate values of R, but for large values of R the contamination due to excited states becomes more and more severe, since the energy gaps to the excited states decrease. Consequently, there is doubt about the sufficient suppression of the excited contributions in this regime. A combination of the two methods, using the improved multilevel algorithm in combination with a large set of operators, correlation matrices and fits including excited state contributions, has been used in Ref. [15], leading to accurate results for the excited states in 3d \({\text {SU}}(2)\) gauge theory at large values of R, albeit at a single and comparably large lattice spacing. In this study, we use this strategy, further improving on the analysis by employing improved fits and more temporal extents in the analysis, to extract the spectrum at smaller lattice spacings.

In 3d, the string energy levels can be classified by the quantum numbers of charge conjugation and parity (CP). The individual combinations are denoted as channels. Using suitable sets of spatial operators \(S_i^j\), the correlation matrices with respect to these quantum numbers can be transformed to a block-diagonal form using the linear combinations

$$\begin{aligned} \begin{array}{l} S_{i}^{++} = S_{i}^{1} + S_{i}^{2} + S_{i}^{3} + S_{i}^{4} \\ S_{i}^{+-} = S_{i}^{1} + S_{i}^{2} - S_{i}^{3} - S_{i}^{4} \\ S_{i}^{--} = S_{i}^{1} - S_{i}^{2} - S_{i}^{3} + S_{i}^{4} \\ S_{i}^{-+} = S_{i}^{1} - S_{i}^{2} + S_{i}^{3} - S_{i}^{4} \; . \end{array} \end{aligned}$$
(1)

To extract excited states in a given (CP) channel, we use the eight different operator sets introduced in Ref. [15]. Setting up a generalized eigenvalue problem for these correlation matrices for the reliable extraction of the eigenvalues in the limit \(T\rightarrow \infty \) is problematic, since the correlation matrices might be ill-conditioned already for the smallest temporal extent available. As in Ref. [15], we thus diagonalize the correlation matrices for each value of T separately utilizing the QR reduction method. The resulting eigenvalues are denoted as \(\lambda ^{CP}_{n}(R,T)\) with \(n=0,\,\ldots ,\,7\). To check that we are identifying the right eigenvalues with increasing T, we look at the associated eigenvectors. Those belonging to the same states for different T should be almost parallel, whereas those belonging to different states perpendicular. In the following, we will mostly focus on the ground states in the individual channels, for which the identification of the eigenvalues is unambiguous. The only excited state which we consider is the first excited state in the \((+,+)\)-channel, for which the identification becomes more difficult at smaller lattice spacings. In contrast to the eigenvalues obtained from a generalized eigenvalue problem, the resulting eigenvalues might include contaminations from other states in the channel, albeit with overlaps which are strongly suppressed for \(T\rightarrow \infty \).

2.2 Removal of excited state contaminations

The eigenvalues of the correlation matrices generically obey the spectral representation [41]

$$\begin{aligned} \lambda ^{CP}_{n}(R,T) = \beta ^{CP}_n(R) e^{-E^{CP}_{n}(R)\,T} \Big [ 1 + \sum _{k=0\ne n}^{\infty } \widetilde{\alpha }_{k,n}^{CP}(R) \, e^{-|\Delta E^{CP,CP}_{kn}(R)|\,T} \Big ] . \end{aligned}$$
(2)

Here, \(E^{CP}_{n}\) are the energies in the (CP)-channel,

$$\begin{aligned} \Delta E^{CP;CP'}_{nm}(R) \equiv E^{CP}_n(R)-E^{CP'}_m(R) \end{aligned}$$
(3)

are the energy differences, \(\beta ^{CP}_n\) the overlap of the eigenstate with energy state n and \(\widetilde{\alpha }_{k,n}^{CP}(R)\) the ratio of overlaps of the eigenstate with energy states k and n. At finite T, it depends on the size of the energy gaps and the reduction of overlaps which state dominates the sum on the right hand side of Eq. (2).

At large T and with sufficient suppression of contributions from other states, the energies can be extracted using the asymptotic \(T\rightarrow \infty \) formula

$$\begin{aligned} -\ln \left( \lambda ^{CP}_{n}(R,T)\right) = \bar{E}^{CP}_{n}(R)\,T - \ln \left( \beta ^{CP}_{n}(R)\right) \; . \end{aligned}$$
(4)

In practice, however, contaminations from other states are not negligible due to the high accuracy for the eigenvalues, so that Eq. (4) cannot be used to extract the energies reliably. To remove contaminations from other states, we perform a simultaneous fit to the results for the eigenvalues for all available \(T_a\) and \(T_b\) to the leading-order formula (see also [15, 22, 24])

$$\begin{aligned} \begin{array}{rl} - \frac{1}{T_{b}-T_{a}} \, \ln \left[ \frac{ \lambda ^{CP}_{n} ( R , T_{b} ) }{ \lambda ^{CP}_{n} ( R , T_{a} ) } \right] = &{} E^{CP}_{n} ( R ) + \frac{1}{T_{b}-T_{a}} \, \alpha _n^{CP}(R) \, e^{ - \delta _n^{CP}(R) \, T_{a} }\\ &{} \times \left( 1 - e^{ - \delta _n^{CP}(R) \, ( T_{b} - T_{a} ) } \right) \, . \end{array} \end{aligned}$$
(5)

Here, \(T_a<T_b\), \(\alpha _n^{CP}(R)\equiv \widetilde{\alpha }_{n+1,n}^{CP}(R)\) and \(\delta _n^{CP}(R)\) is the energy gap to the closest state (or the one with the dominant contribution to the sum on the right-hand-side of Eq. (2)) in the channel. In most of the cases, these fits lead to accurate results with acceptable control over the systematic effects. In some cases, however, these fits become unstable with respect to statistical fluctuations of the eigenvalues. This is particularly true for large values of R and higher excited states. To further constrain the fits, we include the data for the individual eigenvalues in the global fit, leading to the additional relations

$$\begin{aligned} - \frac{1}{T_{b}} \, \ln \left[ \lambda ^{CP}_{n} ( R , T_{b} ) \right] = E^{CP}_{n} ( R ) - \frac{1}{T_{b}} \, \left( \gamma _n^{CP}(R) + \alpha _n^{CP}(R) \, e^{ - \delta _n^{CP}(R) \, T_{b} } \right) \, , \end{aligned}$$
(6)

where \(\gamma _n^{CP}(R)=\ln \big (\beta ^{CP}_{n}(R)\big )\) is an additional fit parameter. Despite this additional parameter, the joint fits using Eqs. (5) and (6) are generically more stable. A particular example for such a fit is shown in Fig. 5 in “Appendix 1”. To control the systematics in these fits, we implement further checks which are also described in “Appendix 2”.

The energy differences \(\Delta E^{CP;CP'}_{nm}(R)\) from Eq. (3) can be extracted independently from the total energies, so that they might serve as independent crosschecks. Using Eq. (2), one can derive the analogue to Eq. (5) for the energy differences

$$\begin{aligned} \begin{array}{rl} &{}- \frac{1}{T_{b}-T_{a}} \, \ln \left[ \frac{ \lambda ^{CP}_{n} ( R , T_{b} ) \, \lambda ^{CP'}_{m} ( R , T_{a} ) }{ \lambda ^{CP}_{n} ( R , T_{a} ) \, \lambda ^{CP'}_{m} ( R , T_{b} ) } \right] \\ \quad &{}= \Delta E^{CP;CP'}_{nm}(R) + \frac{1}{T_{b}-T_{a}} \, \overline{\alpha } \, e^{ - \overline{\delta } \, T_{a} } \left( 1 - e^{ - \overline{\delta } \, ( T_{b} - T_{a} ) } \right) \; , \end{array} \end{aligned}$$
(7)

where \(\overline{\alpha }=\overline{\alpha }_{nm}^{CP;CP'}(R)\) is a suitable combination of the overlaps and \(\overline{\delta }=\overline{\delta }_{nm}^{CP;CP'}(R)\) corresponds to the energy gap to the closest state in the \(CP'\) channel for eigenvalue m (which is equivalent to the gap in the CP channel for eigenvalue n to leading order in 1/R). In analogy to the fits for the total energies, we can improve and stabilize the fits by including the analogue of Eq. (6) for the energy differences in the global fit,

$$\begin{aligned} - \frac{1}{T_{b}} \, \ln \left[ \frac{\lambda ^{CP}_{n} ( R , T_{b} )}{\lambda ^{CP'}_{m} ( R , T_{b} ) } \right] = \Delta E^{CP;CP'}_{nm}(R) - \frac{1}{T_{b}} \, \left( \overline{\gamma } + \overline{\alpha } \, e^{ - \overline{\delta } \, T_{b} } \right) \, , \end{aligned}$$
(8)

where \(\overline{\gamma }=\overline{\gamma }_{nm}^{CP;CP'}(R)=\beta ^{CP}_{n}(R)/\beta ^{CP'}_m(R)\).

In the analysis, we considered fits including all loops, fits for which we left out the data from loops with the smallest temporal extents, which was beneficial when for the smallest temporal extents the contaminations from higher excited states were still sizeable, and, in very few cases, fits where the data from the largest temporal extent have been excluded. The latter was beneficial when the results from the largest temporal extents showed large fluctuations, but has only been considered when none of the other fits obeyed the quality criteria of “Appendix 2”. In the following, we will only use results for which the removal of the excited states has been successful, i.e. those energies and differences for which at least one of the aforementioned fits passed the checks of “Appendix 2”. We estimated the systematic uncertainty associated with the removal of the excited states from the difference of the best fit result—being the one with data from the maximal number of temporal extents included for which the fit passed the checks—with the result from a fit where the data from the loop with the smallest temporal extent of the best fit have been excluded, whenever the latter fit was considered to be reliable.

2.3 Results for the flux tube spectrum

Table 1 Parameters of the simulations

We perform simulations of 3d \({{\text {SU}}}(2)\) gauge theory employing the Wilson plaquette action with the common mixture of heatbath [42] and three overrelaxation [43] updates. The simulation parameters together with the parameters of the multilevel algorithm are collected in Table 1. We set the scale using the Sommer parameter \(r_0\) [44], which has been determined for the present parameters with high accuracy in Ref. [18]. The energies include a lattice spacing dependent additive normalization. We get rid of this normalization by subtracting the normalization constant \(V_0\) obtained in Ref. [18] from fitting the potential to the EST prediction. Both \(r_0\) and \(V_0\) are listed together with other EST parameters in Table 2. Note that the ensembles at \(\beta =5.0\) have already been analysed in Ref. [15]. Here, we reanalyse the data using the improved fits described above.

Table 2 Results for the Sommer parameter \(r_0\), the string tension \(\sigma \), the normalization constant \(V_0\) and the boundary coefficient \(\bar{b}_2\) for different \(\beta \) values and in the continuum limit from Ref. [18]

We show the results for the energies in Fig. 1. Ancillary files including the jackknife bins of the energies and energy differences are provided along with the preprint version of this paper on the arXiv. Unfortunately, the removal of contaminations from other states becomes more and more problematic when going to finer lattices. This is in particular true for results from excited states in the individual channels, here in particular the first excited state in the \((+,+)\) channel, so that we could not extract results for this state on our \(\beta =10.0\) ensemble. Where results for three lattice spacings are available, we see that lattice artefacts are small (see also Figs. 2 and 3), so that the continuum results are not expected to differ significantly from the results of the individual lattice spacings.

Fig. 1
figure 1

Results for the flux tube spectrum for the lowest three energy levels. To fix the normalization, we have subtracted the value of \(V_0\) for the individual lattice spacings. We show only results for which the contamination from excited states has been removed. The coloured open symbols for the \(n=2\) states are the results for the first excited state in the \((+,+)\)-channel. The grey open symbols for the ground state are the results for the potential from Ref. [18] for comparison. The dotted lines are the predictions from the LC spectrum

3 Comparison to the EST spectrum

The energy levels and differences discussed in the previous section can now be compared to the predictions of the EST.

3.1 EST predictions for the spectrum

We first discuss the EST predictions for the energy levels of the flux tube relevant for this study. For an in-depth discussion of the EST properties, we refer the reader to the reviews [4, 8] and the extended discussion of section 2 in [18].

Using the action up to six derivatives order and the constraints for the coefficients [16, 45,46,47] from Lorentz invariance, the spectrum of a flux tube (open string) of length R to \(O(R^{-5})\) is given by [5, 6]

$$\begin{aligned} E^{\mathrm{EST}}_{n,l}(R) = E^{\mathrm{LC}}_{n}(R) - \bar{b}_2\frac{\pi ^3}{\sqrt{\sigma ^3} R^4} \Big ( B_n^l + \frac{d-2}{60} \Big ) - \frac{\pi ^3 (d-26)}{48 \sigma ^2 R^5} C_n^l + O(R^{-\xi }) . \end{aligned}$$
(9)

Following the arguments from [15, 32, 34, 48], the first term on the right-hand-side is the full light-cone spectrum [49]

$$\begin{aligned} E^{\mathrm{LC}}_{n}(R) = \sigma \, R \, \sqrt{ 1 + \frac{2\pi }{\sigma \,R^{2}} \, \left( n - \frac{1}{24} \, ( d - 2 ) \right) } \;. \end{aligned}$$
(10)

\(\bar{b}_2\) is the dimensionless leading-order boundary coefficient and \(B_n^l\) and \(C_n^l\) are dimensionless coefficients tabulated in Table 3 for the lowest few string states. The B and C coefficients depend on the representation of the state with respect to rotations around the string axis and lift the degeneracies of the light-cone spectrum. The lowest order correction term to Eq. (9) is expected to appear with an exponent \(\xi =6\) if the next correction originates from another boundary term.

Table 3 String states in the lowest four energy levels, their representation in terms of creation operators \(\alpha _m\) in the Fock space and the values for the coefficients \(B_n^l\) and \(C_n^l\)

The effective string theory is expected to break down for \(\sqrt{\sigma }R\lesssim 1\), where the energy of the degrees of freedom reaches the QCD scale. The EST does not account for several QCD processes, which are allowed generically in the microscopic theory. Among them are glueball emission and (virtual) exchange, as well as inner excitations of the flux tube. The latter are expected to appear as massive excitations on the worldsheet and are not included in the standard form of the EST. For the static potential, rigidity or basic massive mode contributions can be included in the EST analysis. They contaminate the extraction of the boundary coefficient \(\bar{b}_2\) and also add an additional term to the potential (see sections 3 and 5 in Ref. [18]). For excited states, the explicit form of such rigidity or massive mode corrections has not been computed so far, so that we cannot test the presence of such corrections in this analysis.

3.2 Comparison of the data to the EST predictions

To enable the visibility of small differences at large R, we from now on plot rescaled energies and differences following (see also [24])

$$\begin{aligned} E^{\mathrm{rsc}}_{n}(R) = \left( \frac{E_n(R) - V_0}{\sqrt{\sigma }} - \sqrt{\sigma }R \right) \, \frac{\sqrt{\sigma }R}{\pi } + \frac{1}{24}, \end{aligned}$$
(11)

for which the expansion of the LC spectrum, Eq. (10), to \(O(R^{-1})\) yields \(E^{\mathrm{rsc}}_n=n\) and \(\Delta E^{\mathrm{rsc}}_{nm}=n-m\).

Fig. 2
figure 2

Rescaled results for the energies associated with the first excited string state. The yellow band is the EST prediction including the boundary correction with continuum parameters, while the green solid line includes the parameters for \(\beta =5\). The curves with the boundary coefficients for \(\beta =7.5\) and 10.0 are very close to the continuum curve and therefore have not been included in the plot. The “LO” curve is the EST prediction to \(O(R^{-1})\) (\(E=n\)) and the “LC” curve the one from the light cone spectrum with continuum parameters

In Figs. 2 and 3, we show the rescaled energies in comparison with the predictions of the EST including continuum parameters. Note that differences in the LC curves for the different lattice spacings would not be visible in the figures. For the full EST prediction, containing also the boundary term, we also include the curve with the parameters of \(\beta =5.0\). For the other \(\beta \) values, the full EST curves lie between the \(\beta =5.0\) and continuum curves, but are very close to the latter. Generically, for both \(n=1\) and 2 the data qualitatively follow the LC curves down to very small values of R, where the EST is no longer expected to describe the flux tube dynamics. Quantitatively, small deviations are visible, which, however, appear to remain more or less constant with R. Note that this is to some extend an artefact of the rescaling, Eq. (11), which includes subtractions and a multiplication with R. Consequently, a decreasing deviation, like the one in Fig. 1, might appear constant in Figs. 2, 3 and 4.

Fig. 3
figure 3

Rescaled results for the energies associated with the second excited string state. The curves and points are as in Fig. 2. The open symbols and dashed lines are the results and full EST predictions for the first excited state in the \((+,+)\) channel

For \(n=1\), both the \(\beta =5.0\) and 7.5 data become consistent with the full EST prediction around \(R\approx 2.5 r_0\) (please note the difference in normalization and the reanalysis for \(\beta =5.0\) as compared to the results presented in Ref. [19]), even though the \(\beta =7.5\) data shows a slight tendency to overshoot the curve for \(R\gtrsim 3 r_0\). For \(\beta =10.0\) this trend continues and the data lies above the continuum curve for \(R\gtrsim 2.5 r_0\). This might hint to deviations from the curve in the continuum limit, but it could also be an artefact of insufficient removal of the excited states contributions which become more severe in this region. In case of the former, it could be a sign for a higher order corrections, or massive mode contributions, needed to describe the energies accurately at these distances, eventually leading to an approach to the \(O(R^{-5})\) EST prediction from above. For \(n=2,\) the filled points are expected to approach the solid curves, while the open symbols should approach the dashed ones. This seems to be the case for \(\beta =5.0\) and to some extent also for \(\beta =7.5\) and 10.0. For the latter, the data from the ground state in the \((-,-)\) channel (filled symbols, corresponding to \(\vert 2,2\big \rangle \) in Table 3) lie below the EST prediction, but seem to approach it asymptotically. The results for the first excited state in the \((+,+)\) channel (open symbols, corresponding to \(\vert 2,1\big \rangle \) in Table 3) lie below the data from the \((-,-)\) channel, in agreement with the EST predictions, and are much closer to the LC curve. Note, however, that the agreement of the \(\beta =5.0\) with the LC curve likely is accidental, arising through lattice artefacts, as indicated by the \(\beta =7.5\) results. The data approach the full EST curve for \(\beta =5.0\) and \(\beta =7.5\), even though final conclusions are difficult, since reliable results for large R values—and as such for \(\beta =10.0\)—are not available for this state.

Fig. 4
figure 4

Rescaled energy difference between the first excited state in the \((+,+)\) channel and the ground state in the \((-,-)\) channel. The dotted line corresponds to LO and LC predictions—which vanish for this difference—and the curves are the EST prediction including the boundary term for \(\beta =5.0\) and in the continuum as before

While conclusions concerning the agreement with the full EST from the first excited state in the \((+,+)\) channel are difficult, the difference between this state and the ground state in the \((-,-)\) channel might be more sensitive to the boundary term. Generically, the difference might be useful to investigate the form of correction terms in the EST as long as they lift the Nambu–Goto degeneracies, since all universal terms belonging to the \(n=2\) states cancel. Unfortunately, this difference is also very difficult to compute with control over the systematic effects and so far we have only been able to obtain reliable results at \(\beta =5.0\). Those results, obtained from the individual analysis of the energy differences, are shown in Fig. 4. In this normalization, the difference appears almost constant with R while in fact it decreases with \(R^{-1}\). It is unclear whether it eventually approaches the EST prediction for larger R due to the large uncertainties in this region. We note that the \(R^{-1}\) correction to the LC prediction could well be a sign for a massive mode being responsible for this difference to be nonvanishing. At the same time, it could be a combined contribution of higher order corrections which mimics such a \(R^{-1}\) correction.

4 Conclusions

We have extracted the spectrum of the open flux tube in 3d \({\text {SU}}(2)\) pure gauge theory up to the second excited state for multiple lattice spacings. The combination of the multilevel algorithm and a variational method allowed for precise results up to \(q\bar{q}\) distances of about \(4 r_0\). Excited state contaminations have been removed using a sophisticated fitting procedure with several checks for systematic effects. The results qualitatively follow the energy levels of the Nambu–Goto string theory in the light cone quantization, Eq. (10), down to small values of R where the EST is not expected to provide a valid description of the flux tube excitations. Quantitatively, however, we observe deviations which we compare to the predictions of the full EST, including a boundary term on top of the Nambu–Goto action with coefficients computed in Ref. [18]. We observe that lattice artefacts are small in general, confirming the findings from Ref. [24]. However, some lattice artefacts might be visible for the first excited state at large values of R.

While the results tend to agree with the EST predictions, in particular the results for the second exited state show the expected splitting predicted by the EST, we observe some deviations for the first excited state at smaller lattice spacings. This could be a sign for higher order or massive mode corrections becoming important in the continuum limit, a generic disagreement with the predictions at large R, or uncontrolled systematic effects. To verify either of these scenarios, further and more accurate results at large R are needed. For \(n=2,\) the data apparently tend to approach the full EST predictions asymptotically for all lattice spacings, even though the approach is slower on the finer lattices. A particularly interesting quantity with respect to correction terms to Nambu–Goto energy levels in the EST is the difference between the first excited state in the \((+,+)\) and the ground state in the \((-,-)\) channel, which is vanishing for the LC spectrum. Results with the current precision—we were only able to extract results on our coarsest lattice—decrease with \(R^{-1}\). It is, however, difficult to judge whether the results will first converge to the boundary correction at large R or quantitatively disagree with this term.

Despite the drastic increase in precision, results for larger values of R and better precision for the higher excited states are needed to fully confirm or falsify the agreement between spectrum and EST predictions for the excited states. So far we could not observe any unambiguous discrepancy between data and EST, despite the fact that some deviations become apparent on the finer lattices. However, these could still be remnants of systematic effects, which, generically, become harder to control for larger values of R. Of particular importance for future studies is the inclusion of possible corrections due to massive modes in the EST predictions, which in 3d so far have not been computed within the EST framework.