1 Introduction

Soon after the inception of Quantum Chromodynamics (QCD) Mandelstam [1, 4], ’t Hooft [2] and Nambu [3] suggested the formation of narrow color-electric flux tubes as the mechanism of confinement. This suggestion has since been essentially confirmed by lattice QCD in two-body \((Q {\bar{Q}})\) systems. But, its extension to three quarks allows two possibilities: the Y-string and the \(\Delta \)-string, with the topologies of the corresponding letters. The question of which type of string best describes the three-quark confinement potential in QCD has been open ever since.

In spite of three decades of efforts in lattice QCD, [5,6,7,8,9,10,11], the functional form of the three-heavy-quark potential remains unknown. Even the two most recent calculations [10, 11] have drawn incompatible conclusions. It must be emphasized that the analyses of all of the above lattice calculations used only single-variable fitting - three-body potentials depend on three independent variables, however.

A three-variable analysis naturally leaves more lattitude in the interpretation of lattice results. Consequently, the deviations (“error bars”) may – and indeed do – turn out larger than in a single-variable analysis. In Refs. [12, 13] we re-analysed the lattice data from [10, 11] in terms of three hyperspherical variables, and in the present Letter we compare them for the first time. This re-analyses graphically shows how different the chosen geometries were between the two calculations. There are only two (small) subsets of 3-quark geometries that are common to both [10] and [11]: (a) the isosceles, and (b) the right-angled triangles.

Our re-analyses showed that both calculations suffer from significant, unaccounted for deviations (“effective error bars”) from smooth potential curves in the same region of triangle shape space: near the equilateral triangle configuration. In this region, the two string potentials, the Y and the \(\Delta \), are indistinguishable.Footnote 1 We do not attempt to explain these (enhanced) deviations here – for these are a matter for the original authors. At any rate, these two are the latest state-of-the-art calculations. The deviations are much smaller and the resulting potential curves smoother in [11] than in [10]. The resulting functional form of [11] is generally within the deviations of [10], i.e., [11] curve may be viewed as the converged result, within two (small) subsets of their common 3-quark geometries: (a) isosceles, and (b) right-angled triangles. Here the “converged” data pass between the Y- and the \(\Delta \)-string predictions, thus excluding both. We re-iterate that the above statement is subject to the above proviso that in certain regions of shape space, both sets of results are subject to unaccounted for deviations from smooth curves that are largest near the equilateral triangle point in shape space.

In a different line of enquiry, Bornyakov et al. [14] calculated and graphically displayed the geometrical distribution of the color-flux density among three static quarks. In the equilateral triangle configuration three flux tubes meet at the unique triangle center, but in asymmetrical triangle configurations Bissey et al. [15] showed that this three-tube junction moves and stays away from the Fermat–Torricelli center required by the Y-string, thus leaving many open questions.

As stated above, we showed that neither the Y-string nor the \(\Delta \)-string model form acceptable descriptions of the lattice data [10, 11]. The lattice three-static-quark potential extracted from the small overlap of Refs. [10, 11] agrees with the potential resulting from three pieces of elastic string joined together at a skewed Y-junction, which lies on the Euler line, but is displaced from the Fermat–Torricelli center in the direction pointing away from the barycenter of the triangle. This skewed Y-string model is also in qualitative agreement with the results of the lattice calculation of SU(3) color flux-tube configurations connecting three static quarks, or SU(3) sources, by Bissey et al [15], and thus offers a potential end to this long-standing dilemma.

2 Lattice QCD data

The two calculations have several important differences in implementation: Koma and Koma used a \(24^4\) lattice at \(\beta = 6.0\) with a lattice spacing \(a= 0.093\) fm, 221 three-quark geometries and only one gauge configuration, using the “multilevel algorithm” technique and the Polyakov loop; Sakumichi and Suganuma did theirs using the Wilson loop at two \(\beta \) values: (a) \(\beta = 5.8\) on \(16^3 \times 32\) lattice (\(a= 0.148(2)\) fm) and (b) \(\beta = 6.0\) on \(20^3 \times 32\) lattice (\(a= 0.1022(5)\) fm), with 1000–2000 gauge configurations, and 101 and 211 three-quark geometries, respectively. The Sakumichi and Suganuma study [10] may be viewed as an update on the Takahashi et al. study [9] made some 13 years earlier: their choices of geometries, and the methods are quite similar, as are the conclusions.

Because of the differing implementations, the studies have different systematic and statistical error bars. The error bars estimated by the original authors have sometimes turned out insufficient to cover the apparent deviations from smooth continuous curves, particularly in the region near the equilateral configuration, see Sect. 4.2. That is a matter for the original authors to (re)consider, however; we neglect such troublesome regions of shape space.

Fig. 1
figure 1

Distributions of 3q configurations: a Ref. [10] \(\beta = 5.8\) data; b Ref. [10] \(\beta = 6.0\) data; and c Ref. [11] \(\beta = 6.0\) data. The color code denotes the number of different sizes for an identical configuration. Note the small overlap, and the complementarity of the chosen geometries in Refs. [10] and [11]

Our strategy is to compare the two studies [10] and [11] on a common baseline by (re)expressing both data sets in terms of three-body hyperspherical coordinates. We then compare the results in the regions of common shape-space – where the studies used three-quark triangles of the same shape, though not necessarily of the same size.

We then compare the two sets of extracted data against the predicted Y-string and \(\Delta \)-string potentials to see if and when, they agree, within their differing error bars, and if they suggest the Y-string or the \(\Delta \)-string model explanation.

3 Hyper-spherical coordinates

In the hyper-spherical coordinate system, a three-quark system is described by the hyper-radius, \(R {=} \sqrt{\frac{1}{3}\sum _{i<j}^3 (\mathbf{r}_i {-}{} \mathbf{r}_j)^2}\), which is proportional to the root-mean-square distance of the three particles from their geometrical barycenter and thus denotes the size of the system and two hyper-spherical angles, \(\alpha = \arccos \Big (\frac{2|{\varvec{\rho }} \times {\varvec{\lambda }}|}{{\varvec{\rho }}^2 + {\varvec{\lambda }}^2}\Big )\) and \(\phi = \arctan \Big (\frac{2{\varvec{\rho }} \cdot {\varvec{\lambda }}}{{\varvec{\rho }}^2 - {\varvec{\lambda }}^2}\Big )\), or the (xy) coordinates in the equatorial plane \(x = \Big (\frac{2{\varvec{\rho }} \cdot {\varvec{\lambda }}}{{\varvec{\rho }}^2 + {\varvec{\lambda }}^2}\Big )\) and \(y = \Big (\frac{{\varvec{\rho }}^2 - {\varvec{\lambda }}^2}{{\varvec{\rho }}^2 + {\varvec{\lambda }}^2}\Big )\), which define the shape of the three-quark triangle. Here \({\varvec{\rho }} = \frac{1}{\sqrt{2}}(\mathbf{r}_{1} - \mathbf{r}_{2})\), \({\varvec{\lambda }} = \frac{1}{\sqrt{6}}(\mathbf{r}_{1} + \mathbf{r}_{2}- 2 \mathbf{r}_{3})\) are the Jacobi relative vectors.

The hyperradius R scales linearly, \(R \rightarrow \lambda R\), with \(\lambda \) under dilations/contractions of spatial coordinates \(\mathbf{r}_i \rightarrow \lambda \mathbf{r}_i\) (\(i=1,2,3\)) and thus measures the size of the system; the two dimensionless (“shape”) hyperangles, or some functions thereof, see appendix A. The scaling symmetry properties reflect only on the hyperradius, whereas the permutation symmetry reflects only on the \(\phi \) hyperangular dependence.

Hyper-spherical coordinates have four key advantages over the binary separations \(r_{ij} = \sqrt{(\mathbf{r}_i - \mathbf{r}_j)^2}\) as the three-body variables of choice: Firstly, all of the symmetries of the system are accounted for. Secondly, hyper-spherical coordinates allow simple equations to be written to define the functional form of quark confinement potential, see appendix A. Thirdly, hyper-spherical coordinates allow the size dependence of the quark confinement potential to be separated from its shape dependence because the confining potential is homogenous. Fourth, permutation-adapted hyperspherical coordinates graphically display the dynamical symmetry of the Y-string potential, see Appendix A.

The advantages of hyper-spherical coordinates allow the results from [10] and [11] to be compared, in spite of their differing lattice sizes. The standard Ansatz, [5,6,7,8,9,10,11, 19,20,21] for the three-quark potential V in QCD

$$\begin{aligned} V_{\mathrm{data}}(\alpha , \phi , R) = -\frac{A(\alpha , \phi )}{R} + B(\alpha , \phi )R + C, \end{aligned}$$
(1)

henceforth referred to as the Coulomb+Linear potential, is readily implemented in these variables, and the size-dependence of the confining potential \(B(\alpha , \phi )R\) is removed; here \(-\frac{A(\alpha , \phi )}{R}\) is the Coulomb part, \(B(\alpha , \phi )R\) is the confining part and C is a constant term.

4 Re-analysis

4.1 Method

At fixed values of shape variables \((\alpha , \phi )\), the functions \(A(\alpha , \phi )\), \(B(\alpha , \phi )\) and \(C(\alpha , \phi )\) are (unknown) constants. In order to determine their values we have fitted the equilateral triangle data. With these constants fixed at one point in the shape-space, we can subtract the well-known Coulomb \(V_{\mathrm{Coul}}^{fit}\) and constant \(C_{fit}\) term in all of shape-space (see appendix B), and be left with the confining potential \(V^{*}(x,y)R = V_{\mathrm{data}} - V_{\mathrm{Coul}}^{fit} - C_{fit}\).

The distributions of Sakumichi and Suganuma’s, as well as of Koma and Koma’s three-quark geometries, represented as points in the shape-space disc, are shown in Fig. 1. Note the complementarity of the two sets, and the small overlap regions – two mutually orthogonal straight lines in one elementary “pizza slice” cell – between the two sets of chosen geometries.

The two data sets have been defined in terms of a common hyper-spherical coordinate system and a subset of points has been selected where the two data sets have geometric overlap. Results of Refs. [10] and [11] can now be compared to each other and to the Y-string and \(\Delta \)-string predictions.

4.2 Results

We showed in Ref. [12] that Koma and Koma’s [11] lattice results yielded a continuous, generally smooth functional dependence along these two (orthogonal) lines in the shape-space disc. In Ref. [13] we subjected Sakumichi and Suganuma’s lattice data [10] to the same analysis as Koma and Koma’s [11] in Ref. [12], with somewhat less convincing conclusions: the deviations from a unique, smooth curve are larger than in [11]. Nevertheless, the data in Ref. [10] show a marked improvement over the data in Ref. [9] in terms of reduced deviations.Footnote 2

The hyperangular dependences of Sakumichi and Suganuma’s [10] and Komas’ [11] confinement potentials are shown in Figs. 2 and 3 next to each other, so as to facilitate comparison. The first impression is unfavourable: Sakumichi and Suganuma’s isosceles data show large deviations from a smooth curve between \(-0.4 \le y \le 0.3\) at \(\beta = 5.8\) (Fig. 2a), and between \(-0.4 \le y \le 0.7\) at \(\beta = 6.0\) (Fig. 2b). A correspondingly large deviation in Koma and Koma’s isosceles data appears only below \(y \le - 0.4\) (Fig. 2c), however.

The right-angled triangle data of Ref. [10], on the other hand, shows similar deviations (only) in the \(-0.3 \le x \le 0.3\) at \(\beta = 5.8\) (Fig. 3a), and between \(-0.7 \le x \le 0.7\) at \(\beta = 6.0\) (Fig. 3b). A corresponding deviation in Ref. [11] right-triangle data appears only above \(|x| \ge 0.75\) (Fig. 3c). Note that in all cases the deviations of Ref. [10] data bracket those of Ref. [11].

One may therefore view Ref. [11] as having superior accuracy to that of Ref. [10].Footnote 3 Consequently, we shall use Ref. [11] as the benchmark result.

The discerning reader will also notice the hyper-radial dependence of the above bounds in Figs. 23. Our interactive web site [24] allows the interested reader to change the value(s) of the hyperradius R, and thus select the data to be observed through filters of one’s own choice.

Fig. 2
figure 2

Extracted values of the hyperangular part of the confining potential \(V^{*}(x,y) = 1/R[V_{\mathrm{data}} - V_{\mathrm{Coul}}^{fit} - C_{fit}]\) for isosceles triangles (\(x=0.5\)): a \(\beta =5.8\), Ref. [10]; b \(\beta = 6.0\), Ref. [10]; c \(\beta = 6.0\), Ref. [11]. The black points correspond to all values of size R, whereas the red ones correspond to sizes \(R \ge 7\). The blue lines represent the \(\Delta \)-string prediction, the green lines the Y-string

Fig. 3
figure 3

Extracted values of the hyperangular part of the confining potential \(V^{*}(x,y) = 1/R[V_{\mathrm{data}} - V_{\mathrm{Coul}}^{fit} - C_{fit}]\) for right triangles (\(y=0\)): a \(\beta =5.8\), Ref. [10]; b \(\beta = 6.0\), Ref. [10]; c \(\beta = 6.0\), Ref. [11]. The black points correspond to all values of size R, whereas the red ones correspond to sizes \(R \ge 7\). The blue lines represent the \(\Delta \)-string prediction, the green lines the Y-string

Note that the benchmark data consistently fall between the \(\Delta \)-string prediction (upper, blue) and the Y-string prediction (lower, green) in the aforementioned region. It ought to be clear that neither the Y- nor the \(\Delta \)-string can adequately describe the present lattice data.

5 Interpretation: skewed junction Y-string

This unexpected result calls for an interpretation in terms of an elastic string model. We define an infinite class of skewed Y-string potentials:

$$\begin{aligned} V_{\mathrm{sk. Y}}= & {} \sigma _{\mathrm{sk.Y}} \sum _{i=1}^3 |\mathbf{x}_{i} - \mathbf{x}_{0}(\alpha )| \ \end{aligned}$$
(2)

whereby three pieces of elastic string are joined at a junction, \(\mathbf{x}_{0}(\alpha )\), defined as

$$\begin{aligned} \mathbf{x}_{0}(\alpha )= & {} \mathbf{x}_{0}^{\mathrm{F.T.}} + \alpha \Delta \mathbf{x}_{0} . \ \end{aligned}$$
(3)

Here \(\alpha \in (-\infty ,+\infty )\) is a free parameter such that the barycenter \(\mathbf{x}_{0}^{\mathrm{CM}} = \mathbf{x}_{0}(\alpha =-1)\) corresponds to \(\alpha =-1\):, and the Fermat–Torricelli center \(\mathbf{x}_{0}^{\mathrm{F.T.}} = \mathbf{x}_{0}(\alpha =0)\) corresponds to \(\alpha =0\). Thus the junction lies on the Euler line,Footnote 4 defined by the vector \(\Delta \mathbf{x}_{0}\)

$$\begin{aligned} \Delta \mathbf{x}_{0}= & {} \mathbf{x}_{0}^{\mathrm{F.T.}} - \mathbf{x}_{0}^{\mathrm{CM}} \nonumber \\= & {} \frac{1}{2 l_Y^2} \sqrt{\frac{3}{2}} \left( \mathbf{V} + \frac{\mathbf{{W}} \times \mathbf{A}}{|{\varvec{\rho }} \times {\varvec{\lambda }}|}\right) \ \end{aligned}$$
(4)

where

$$\begin{aligned} \mathbf{V}&= {\varvec{\lambda }} ({\varvec{\lambda }}^2 - {\varvec{\rho }}^2 ) - 2 {\varvec{\rho }} ({\varvec{\rho }} \cdot {\varvec{\lambda }}) \end{aligned}$$
(5a)
$$\begin{aligned} \mathbf{A}&= {\varvec{\rho }} ({\varvec{\lambda }}^2 - {\varvec{\rho }}^2) + 2 {\varvec{\lambda }} ({\varvec{\rho }} \cdot {\varvec{\lambda }}) \end{aligned}$$
(5b)
$$\begin{aligned} \mathbf{{W}}&= {\varvec{\rho }} \times {\varvec{\lambda }} \end{aligned}$$
(5c)
$$\begin{aligned} l_Y^2&= \frac{3}{2} ({\varvec{\rho }}^2 + {\varvec{\lambda }}^2 + 2 |{\varvec{\rho }} \times {\varvec{\lambda }}|) \end{aligned}$$
(5d)

The junction determined by the lattice data is displaced from the Fermat–Torricelli point of the triangle in the direction away from the barycenter, i.e., at positive \(\alpha > 0\). Any point on the Euler line may be used as a junction of three strings, within a subset of the shape-space disc. This region of applicability of a string potential is determined by the position of the center at which the strings have their junction, and how the position of the center changes with the triangle shape. Of course, in an equilateral triangle all centers coincide. As a triangle shape becomes more obtuse, some of its centers move outside the triangle. When a center moves outside the boundary of the triangle, it stops being acceptable as a junction of three strings. The exit point on the boundary inherits the property of being the junction.

We find that such a string model potential can fit the data in Figs. 2 and 3 , i.e., the lattice results of both [10] and [11], though the spread of the data does not, as yet, allow a more precise determination than \(\alpha \simeq 0.5 \pm 0.2\).

The direction of the displacement of the junction from the Fermat–Torricelli center is well established, however, as being opposite to the barycenter. This leads to a reduction of the value of the critical angle, down from \(120^{\circ }\), which is sufficient to show that this elastic string model is in qualitative agreement with the results of the lattice calculation of SU(3) color flux-tubes connecting three quarks by Bissey et al. [15], who found flux-tubes in the shape of letters L and T, i.e., with a critical angle of \(90^{\circ }\).

6 Discussion

The displacement of the three-string junction from the Fermat–Torricelli point is perhaps the least expected feature of three-quark confinement. Neither Bissey et al. [15], who observed it first, nor anyone thereafter has offered an explanation of the T- and L-letter-shaped flux tubes.

To be fair, de Forcrand and Jahn [17] suggested a theoretical scenario wherein a transition from the \(\Delta \)-string, holding at shorter distances, to the Y-string holding at longer distances would take place at separations of around 0.8 fm. Putting aside, for the time being, theoretical arguments against this scenario advanced in Ref. [16], see also Appendix A, we note that there is no evidence for such a transition taking place in the data shown in Fig. 5, nor in any of the results in Refs. [10, 11].

Of course, our conclusions are only as good as the data they are based on, which left a number of things to be desired, so we can only re-iterate that the deviations from continuous curves must be ironed out. Therefore, all further checks, including refutations, corroborations and refinements by future lattice QCD studies will be welcome. A straightforward check would have to contain both sets of geometries (isosceles and right-angled) used so far, at as many different hyper-radii as possible, whereas a refinement would include new type(s) of geometries, again at least at four different values of hyper-radius.

Our result, if correct, has consequences for three-quark spectroscopy and the confinement potential for multiquark systems:

  1. 1.

    in baryon spectroscopy the three-quark force leaves a clear signature in the second odd-parity, and higher shells of baryon resonances, and even there only in a few select states [25,26,27]. However, these three-quark force effects can easily be confused with relativistic effects. Therefore the shifted junction is not likely to be observed soon in heavy baryon spectroscopy.

  2. 2.

    The displacement of the three-string junction from the Fermat–Torricelli point would dramatically influence the binding energies and confinement properties of multiquark systems. This is because in systems of four or more quarks the Y-string is replaced by a Steiner tree [28], which should be distorted due to the skewness of the junction.

Our results open new questions: firstly, what is the precise position of the string junction for three-quarks at zero temperature? Secondly, what happens to this junction at non-zero temperatures? Thirdly, what are the Casimir scaling properties of the confining 3-body potential? How does the three-body potential depend on the color SU(3) multiplet to which the three bodies belong? It has been suggested [29] that the color SU(3) dependence should be the symmetric structure constants \(d^{abc}\), but that implies its vanishing for self-conjugate multiplets, such as the octet, which would be a dramatic effect.