1 Introduction

The study of the \(\eta \rightarrow 3\pi \) decay process is interesting, first and foremost, in the context of the determination of the quark mass ratio

$$\begin{aligned} Q^2=\frac{m_s^2-{{\hat{m}}}^2}{m_d^2-m_u^2}\, ,\quad \quad {\hat{m}}=\frac{1}{2}\,(m_u+m_d). \end{aligned}$$
(1)

In order to extract the value of Q to high precision, it is very important to have a robust control on the final-state interactions in this decay, which lead to a strong effect in the width. To this end, one would like to have a non-perturbative framework, allowing the resummation of a certain class of the final-state interactions to all orders. Dispersion relations are the ideal tool for this. Incorporating 2-particle unitarity and crossing symmetry then leads to a system of coupled integral equations for the 3 isospin amplitudes in this decay. It took quite some time until these equations were written down in their final form. The development started with the pioneering work of Khuri and Treiman [1]. The mathematical structure of this type of equation was investigated in the following decade [2,3,4,5,6,7,8,9,10], see also the monograph [11]. Later, interest in the dispersive method waned. A revival occurred in the nineties, when it was demonstrated [12,13,14,15] how dispersion relations allow one to incorporate final state interactions of S- and P-waves in a reliable and calculable manner. Refs. [13, 14] contain a detailed discussion of the role of subtractions in the dispersive representation, while the uniqueness of the solutions is investigated in [14]. Further, current algebra [16] and chiral perturbation theory results [17] were used to relate the parameter Q to this framework (normalization of the dispersive amplitude), and to get a handle on the subtraction constants. For a review of the early developments until 1990 we refer the reader to Ref. [12], see also the lecture notes [18]. An improved representation of the \(\pi \pi \) phase shifts [19, 20], the evaluation of electromagnetic corrections [21, 22] as well as new experimental information on this decay [23,24,25,26,27,28,29,30] triggered new investigations of \(\eta \rightarrow 3\pi \) in the dispersive framework [31,32,33,34,35,36,37,38,39,40,41]. A very comprehensive analysis has recently been presented in Ref. [42]. For applications of the dispersive framework in other three-body decays, see Refs. [34, 35, 43,44,45,46,47]. Further attempts to incorporate a class of final state interactions in the \(\eta \rightarrow 3\pi \) amplitudes may be found in Refs. [17, 48,49,50,51,52].

The present article is devoted to a discussion of numerical methods to solve the integral equations that occur in the formulation of Ref. [42]. In this connection, the investigations [2,3,4,5,6,7,8,9,10] revealed the following two important aspects:

  1. (i)

    In the evaluation of the angular averages of the amplitudes, the integration path in \(z=\cos \theta \) must be deformed into the complex plane, in order not to destroy the holomorphic properties of the amplitudes. The deformation is fixed by providing the (eta mass)\(^2\) with an infinitesimal positive imaginary part: \(M_\eta ^2\rightarrow M^2=M_\eta ^2+i\delta \) , with \(\delta \rightarrow 0^+\) at the end of the calculation.

  2. (ii)

    Singularities emerge in the angular averaged amplitudes, at the pseudothreshold \(s=(M_\eta -M_\pi )^2\). Some of these singularities are of a non-integrable type in the Lebesgue sense. The (positive) imaginary part of the eta mass, however, acts as a regulator, that can be removed after the dispersive integral has been performed. Handling these singularities properly is a delicate affair, as is illustrated e.g. by the discussions in Refs. [13, 15, 31, 35, 41, 42, 45, 52].

We shortly mention several previous methods to obtain solutions of these equations. In Refs. [13, 15, 31,32,33, 42, 43], the equations were solved directly through iterations, with a careful treatment of the mentioned singularities. In Refs. [35, 41], integral kernels are introduced, which make the numerical procedure faster, and which identify the mentioned singularities at the pseudothreshold in a clear fashion. Through the interchange of the order of integrations, the so-called Pasquier inversion [9, 10], it is possible to carry out one integration and to obtain integral equations in one variable that are further solved by iterations (see Refs. [36, 39, 53, 54] for more details). Finally, it is possible to obtain integral equations for the angular averaged amplitudes instead of the amplitudes themselves. This technique has recently been invoked in Refs. [44, 45]. Last but not least, the convergence of the iterative procedure to solve the equations is not a priori guaranteed. The iterative procedure converges very well (typically, after 3–4 iterations) in the \(\eta \rightarrow 3\pi \) decays, but the convergence becomes an issue for decaying particles with heavier masses, or for equations with more subtractions [45]. For this reason, e.g., in Ref. [44], a matrix inversion method was used to find a solution beyond the iterative procedure. These enterprises are numerically very demanding, in particular for the reasons spelled out above.

In the following, we propose a numerical framework that avoids the difficulties (i) and (ii) altogether. We show that one may deform the integration path in the dispersion integral (not in the angular average) into the complex plane. After choosing a properly deformed path, the integrand becomes regular, even at \(\delta =0\) (except at threshold, where a mild, integrable square root singularity persists). In addition, the angular integration can be carried out in the original interval \(-1 \le z \le 1\). Further, discretizing the integrals through a Gauss–Legendre quadrature, the integral equations are transformed into a set of linear matrix equations, whose solution is easily found by iterations. The procedure is pretty straightforward, takes very little CPU time and does not lead to any spurious irregularities in the solutions.Footnote 1

The idea to avoid singularities in integration through path deformation is not new. An early reference is the work of Hadamard [56], whose method was rediscovered 50 years later by investigation of analytic properties of perturbation theory [57, 58], and then extended and extensively used in S-matrix theory [58]. The analogue of the procedure for the quantum-mechanical three-body problem is well known since decades [59, 60]. As already said, Aitchison and Pasquier mentioned this possibility for Khuri–Treiman-type equations in Ref. [55]. For the numerical evaluation of loop graphs in Quantum Field Theory, path deformations in momentum space [61] as well as in Feynman parameter space [62] are used. All these techniques, including the present one, may be summarized under the heading “Applications of Cauchy’s integral theorem”.

The layout of the article is as follows. In Sect. 2, we display the integral equations for the \(\eta \rightarrow 3\pi \) amplitudes in the form worked out recently in Ref. [42]. We describe in some details the technical difficulties that one encounters while solving the equations in Sect. 3, while in Sect. 4, we define the deformed path in the dispersion integral and show that in this manner, the singularities in the integral equations disappear. In Sect. 5 we describe the numerical procedure for solving the equations, whereas a summary and conclusions are given in Sect. 6. In Appendix A we comment on the phase shifts used and collect some notation. The holomorphic continuation of the integrand is discussed in Appendix B, path deformations in general are investigated in Appendix C, and the case \(\omega \rightarrow 3\pi \) is shortly discussed in appendix D.

2 The equations

We start from the integral equations for the three isospin amplitudes \(M_I(s)\) in the framework specified in [42],

$$\begin{aligned} M_I(s)= & {} \varOmega _I(s)\left[ P_I(s)+s^{n_I}\int _{4M_{\pi }^2}^\infty d\mu _I(x)\frac{{\hat{M}}_I(x)}{x-s-i\epsilon }\right] \,,\nonumber \\&I=0,1,2\,.\end{aligned}$$
(2)

The various quantities are defined as follows. The Omnès functions are

$$\begin{aligned} \varOmega _I(s)=\exp \left( \frac{s}{\pi }\int _{4M_{\pi }^2}^\infty \frac{dx}{x}\frac{\delta _I(x)}{x-s-i\epsilon }\right) , \end{aligned}$$
(3)

where \(\delta _I(x)\) denote the elastic \(\pi \pi \) phase shifts (S-wave for \(I=0,2\), P-wave for \(I=1\)). The hat-functions \({{\hat{M}}}_I\) are defined in terms of angular averages,

$$\begin{aligned} {\hat{M}}_I(x)=\frac{1}{\kappa }\sum _{m,I'}C^m_{II'}(x,\kappa )\langle z^mM_{I'}\rangle (x), \end{aligned}$$

with

$$\begin{aligned} \langle z^mM_I\rangle (x)= & {} \frac{1}{2}\int _{-1}^1 dz z^m M_I(h (x,z)); \nonumber \\&I,I',m \in \{0,1,2\}\,,\end{aligned}$$
(4)

where

$$\begin{aligned}&h(x,z)=\frac{1}{2}\left[ M^2+3M_\pi ^2-x+z\,\kappa (M^2,M_\pi ^2,x)\right] , \nonumber \\&\kappa (M^2,M_\pi ^2,x)=\sqrt{1-\frac{4M_\pi ^2}{x}}\sqrt{x-(M-M_\pi )^2} \nonumber \\&\quad \qquad \qquad \qquad \qquad \times \sqrt{x-(M+M_\pi )^2}\,.\end{aligned}$$
(5)

Finally, the measures are

$$\begin{aligned}&d\mu _I(x)=\frac{dx}{\pi x^{n_I}}\frac{\sin (\delta _I(x))}{|\varOmega _I(x)|}, \end{aligned}$$

and

$$\begin{aligned} \{n_0,n_1,n_2\}= & {} \{2,1,2\}\,; \nonumber \\ M^2= & {} M_\eta ^2 +i\delta \,,\quad \delta \rightarrow 0^+\,.\end{aligned}$$
(6)

The \(C^m_{II'}(x,\kappa )\) are polynomials in x and \(\kappa \), and \(P_I(s)\) denote subtraction polynomials, see [42]. The phase shifts \(\delta _I\) are needed as input to solve these equations. We relegate a discussion of them to the Appendix A, where we also collect some of the notation used below.

3 Technical hurdles

During the numerical, iterative solution of the Eqs. (2)–(6), two main problems occur.

(i) The angular averages \(\langle z^m M_I \rangle (x)\) amount to an integration along the path h(xz) of the argument of the amplitudes \(M_I\). For fixed x, the argument runs along a straight line, from \(h_-(x)\) to \(h_+(x)\), where

$$\begin{aligned}&h_\pm (x)=\frac{1}{2}\left[ M^2+3M_\pi ^2-x\mp \kappa (M^2,M_\pi ^2,x)\right] \,.\end{aligned}$$
(7)

The Kacser function \(\kappa (M^2,M_\pi ^2,x)\) [5] is holomorphic in the complex x-plane, cut along the real axis from \(x=0\) to \(x=4 M_\pi ^2\) and along a straight line from \(x=(M-M_\pi )^2\) to \(x=(M+M_\pi )^2\) , see Fig. 1.Footnote 2 Therefore, fixing its value at some point in the complex x-plane renders it unique in the cut plane. In the following, we use the convention

$$\begin{aligned} \kappa (M^2,M_\pi ^2,x)=x + O(1)\,,x \rightarrow \infty \,.\end{aligned}$$
(8)

In Fig. 2, we display the endpoints \(h_\pm (x)\) as the integration variable x runs from the threshold to infinity. Because \(\kappa (M^2,M_\pi ^2,x)\) is holomorphic in the cut x plane, the endpoints \(h_\pm \) move – for \(\delta > 0\) – along curves that are infinitely often differentiable with respect to the real variable x. The dash–dotted vertical line connects the endpoints \(h_\pm (x)\) with \(x_2\!<\!x\!<\!x_3\). It is seen that a straight line between \(h_-(x)\) and \(h_+(x)\) crosses the integration path in the dispersive representation (2). This is also the case for \(x_1\!<\!x\!<\!x_2\). The problem is discussed at many places in the literature – early references are [2, 4]. Its solution amounts to deform the path in the z integration (or in the integration over the variable h after a change of variables \(z\rightarrow h\)), such that this crossing is avoided. We do not discuss this point any further here.

Fig. 1
figure 1

Branch points (blue arrows) and cuts (solid blue) of the Kacser function \(\kappa (M^2,M_\pi ^2,x)\), and integration paths. We connect the branch points with cuts in the intervals \([0,4M_\pi ^2]\) and \([(M-M_\pi )^2,(M+M_\pi )^2)]\). The original path of integration for the dispersive representation (2) runs along the real line, from A to DEF (dotted black), the deformed path along ABCDEF (dashed black). See appendix A for \(x_i\) and \(A,B,\ldots F\). The figure is not drawn on scale

Fig. 2
figure 2

Real and imaginary parts of the endpoints \(h_\pm (x)\) in (7) in the angular averages (4), for \(x\in [4M_\pi ^2,\infty )\). For better visibility, we have kept in the eta mass a positive imaginary part. Solid (dashed) line: lower (upper) endpoint. The dash–dotted line connects \(h_-(x)\) with \(h_+(x)\), with \(x_2\!<\!x\!<\!x_3\). It is seen that this line crosses the original path of integration in the dispersive representation (2). The same happens for \(x_1\!<\!x\!<\!x_2\). See text for details

(ii) The second problem arises after the angular integration has been performed in one way or the other. The angular averages develop singularities of the type

$$\begin{aligned} {{\hat{M}}_I}(x) \sim \frac{K}{[(M - M_\pi )-x)]^{p} }\,,\end{aligned}$$
(9)

where \(p=(\frac{1}{2},\frac{3}{2},\frac{1}{2})\) for \(I=(0,1,2)\), and K denotes a constant. These singularities then show up in the dispersive integrals (2). It turns out that, after the integration over x has been performed, the limit \(M \rightarrow M_\eta \) exists. [See e.g. Refs. [13, 42], where the dispersive integral has been worked out explicitly.] We note that the case \(p=\frac{3}{2}\) corresponds to a non–integrable singularity at \(\delta =0\): one is not allowed to interchange the limit \(\delta \rightarrow 0^+\) with the integration over x. These singularities render the standard procedure to construct a numerical solution of the integral equations rather delicate and cumbersome. In the following, we present a method to solve these equations in an easy and straightforward manner, that avoids the problems (i) and (ii) altogether. [Including higher partial waves leads to even stronger singularities [63]. Our method also covers these cases, see the following section.]

Fig. 3
figure 3

Integration paths for the singular integral G(s) in (10). The original path runs along the real line, from \(x_1\) to \(x_3\) (solid line). The upper horizontal line (dotted) stands for the cut attached to the branch point at \(x=(M-M_\pi )^2\). The two singularities of the integrand are indicated with filled circles. Since these are located in the upper complex plane, the path may be deformed into the lower half plane, e.g., into the polygonal line \(x_1B_1C_1x_3\) (dashed line), without changing the value of the integral. See Appendix A for \(x_i,B_1,C_1\)

4 Avoiding singular integrals

There is a simple way to cope with the singularities of the angular averages. To illustrate, we consider the integral [13, 42]

$$\begin{aligned}&G(s)=\int _{x_1}^{x_3}\frac{g(x)\, dx}{(x-s-i\epsilon )((M-M_\pi )^2-x)^{3/2}}\,,\nonumber \\&g(x)=\frac{(x-x_1)(x_3-x)}{(x_2-x_1)(x_3-x_2)}\,.\end{aligned}$$
(10)

We display in Fig. 3 the singularities of the integrand with two black dots in the complex x-plane: one at \(x=s+i\epsilon \), the second at \(x=(M-M_{\pi })^2{\mathop {\rightarrow }\limits ^{\delta \rightarrow 0^+}}x_2\). We have also drawn a cut that emerges from the branch point at this second singularity, reaching out to \(x = \infty +i\delta \) (dotted line). The path of integration is indicated with a solid line, from \(x_1\) to \(x_3\). We now observe that the integrand is holomorphic in the complex half-plane Im\((x)\!<\!\epsilon ,\delta \). Therefore, we may deform the path of integration into the polygonal line \(x_1B_1C_1X_3\) (dashed), without changing the value of the integral. There are then no singularities anymore on the integration path, and we may set \(\epsilon =\delta =0\) before performing the integration. This proves that the limit \(\epsilon ,\delta \rightarrow 0^+\) of the original integral (10) exists [13, 42]. [It is obvious that this remains true if the exponent 3 / 2 in (10) is replaced by any \(p\in {}\mathbb {C}\).] On the other hand, approaching the real axis from below, one encounters a pinch singularity at \(s=(M-M_\pi )^2\), which results in a singular behaviour of the function G(s).

A numerical integration along the dashed path does not pose any problems, because the integrand is smooth as a function of x. To illustrate, we display in Fig. 4 the real part (solid line) and imaginary part (dashed line) of the function G(s) (analytic expression is given in [42]), in units of the pion mass. For comparison, the result of the numerical integration along the dashed polygonal line with 160 Gauss–Legendre points is shown with filled circles. It is seen that the agreement is perfect.

Fig. 4
figure 4

Analytical and numerical results for the function G(s) in Eq. 10. The solid and dashed lines denote the real and imaginary part of the analytical result [42], the filled circles denote the result from the Gauss–Legendre integration along the polygonal line \(x_1B_1C_1x_3\) in Fig. 3

Fig. 5
figure 5

Real and imaginary part of the endpoints \(h_\pm (x)\) in (7), if the integration in the dispersive integral (2) is performed along the deformed path ABCDEF in Fig. 1. Compare with Fig. 2, which shows the situation in the standard case

Fig. 6
figure 6

Displayed are some straight lines (dash–dotted) that connect the lower and upper ends \(h_\pm (x)\) in the angular averages (4), see also Fig. 5. Axes in units of \(M_\pi ^2\). The eta mass is real. It is seen that the dash–dotted lines do not cross the path ABCD and hence they stay within the holomorphy domain of the amplitudes \(M_I(s)\). One can therefore perform the angular averages in their original form

Fig. 7
figure 7

Solution of the isospin zero amplitude. Notation as in Ref. [42]

We now use the same method in the original equations (2). Suppose that the integrand in (2) is holomorphic in some region \(\hbox {Im}(x)\!<\! 0\) (we discuss this point in appendix B). Then we can avoid the singularities generated by the zeros in the function \(\kappa (M^2,M_\pi ^2,x)\) by deforming the path as shown in Fig 1. Aside from avoiding the singularities in \({\hat{M}}_I(x)\), this procedure has the following advantage. Consider the endpoints of the angular integration as they now occur on the deformed path, see Fig. 5. There, we display the two endpoints \(h_\pm (x)\). It is seen that the problem with leaving the holomorphy domain of \(M_I(s)\) does not occur anymore. See also Fig. 6, where we display some of the paths \(h(x,z),~ -1\le z \le 1\), that connect \(h_\pm (x)\). We conclude that one can solve the Eq. (2) in their original form, avoiding complicated path deformations and singular integrals, provided that the path of integration in (2) is deformed properly – e.g. according to Fig. 1.

The deformed path displayed in Fig. 1 is obviously not the only one with these properties – see Appendix C for a discussion of this point.

5 Solving the integral equations

Finally, we make use of the fact that now, the integrands are smooth along the path of integration for \(s<D\), except at the threshold \(s=4 M_\pi ^2\), where an integrable singularity of the type \(\text{ const }/\sqrt{x-4M_\pi ^2}\) occurs (note that \({{\hat{M}}}_I(x)\rightarrow \) const. as \(x\rightarrow 4M_\pi ^2)\). The singularity can be tamed with a variable transformation \(z=A+(B-A)\tau ^2\,,0\le \tau \le 1\). Therefore, an integration using the Gauss–Legendre method [64] is adequate. As we now show, this method has the further advantage that the integral equations boil down to a matrix equation, with matrix elements that need to be evaluated only once, before the iteration, which then becomes trivial.

We use Gauss–Legendre quadrature both for integration over x and over z. The integration path over x is split into linear pieces: AB, BC, CD and DF (the point F corresponds to the upper limit of integration: the phase shifts \(\delta _I(x)\) are equal either to 0 or to \(\pi \), if x moves right to F). The variable x from each interval is mapped to the interval [0, 1], and the Gauss–Legendre mesh points \(x_i\) and weights \(w^x_i\) are used to carry out the integration. The number of points on these intervals is \(N_{AB}\), \(N_{BC}\), \(N_{CD}\) and \(N_{DF}\), respectively. Further, the integral over z for a given x always runs from \(-1\) to 1. The mesh points and weights are denoted by \(z_a\) and \(w^z_a\), respectively, and the number of mesh points is chosen to be \(L_{AB}\), \(L_{BC}\), \(L_{CD}\) and \(L_{DF}\) for x belonging to the one of the above intervals.

It is useful to introduce a multi-index,

$$\begin{aligned} \alpha =(i,a)\, , \quad h_\alpha =h(x_i,z_a)\, ,\quad w_\alpha =\frac{1}{2}\,w^x_iw^z_a\, . \end{aligned}$$
(11)

Here,

$$\begin{aligned} \alpha= & {} 1,\cdots N\, , \nonumber \\ N= & {} N_{AB}L_{AB}+N_{BC}L_{BC}+N_{CD}L_{CD} +N_{DF}L_{DF}\, . \end{aligned}$$
(12)

Combining the Eqs. (2) and (4), we may write

$$\begin{aligned} M_I(s)=\varOmega _I(s)\biggl [P_I(s)+\sum _{\alpha ,I'}w_\alpha K_{II'}(s,h_\alpha ) M_{I'}^\alpha \biggr ], \end{aligned}$$
(13)

where

$$\begin{aligned} K_{II'}(s,h_\alpha )= & {} \sum _m \frac{s^{n_I}}{\pi \kappa (M_\eta ^2,M_\pi ^2,x_i) x_i^{n_I}}\,C^m_{II'}(x_i,z_a) \nonumber \\\times & {} \frac{\sin (\delta _I(x_i))}{|\varOmega _I(x_i)|}\frac{z_a^m}{x_i-s}\,. \end{aligned}$$
(14)

Finally, letting the variable s run over the set \(h_\alpha ,~\alpha =1,\ldots N\) and introducing the notations \(R_{II'}^{\beta \alpha }=w_\alpha \varOmega _I(h_\beta )K_{II'}(h_\beta ,h_\alpha )\) and \(G_I^\beta =\varOmega _I(h_\beta )P_I(h_\beta )\), we can write down Eq. (13) as a matrix equation

$$\begin{aligned} M_I^\beta =G_I^\beta +\sum _{\alpha ,I'} R_{II'}^{\beta \alpha }M_{I'}^\alpha \, ; \,\quad \quad \beta =1,\ldots ,N\,.\end{aligned}$$
(15)

This equation generates iteration series for the vectors \(M_I^\beta \), \(I=0,1,2\), which are rapidly convergent for \(\eta \rightarrow 3\pi \) (note that the quantities \(R_{II'}^{\beta \alpha }\) need to be evaluated only once for a given set of phase shifts). For \(s\not \in [D,F]\), the amplitudes can then be directly constructed from \(M_I^\beta \) by using the dispersive representation (2). For \(s\in [D,F]\), the angular averaged amplitude must be interpolated to perform the Cauchy integral. Moreover, if one manages to invert the large matrix \(R_{II'}^{\beta \alpha }\) numerically, Eq. (15) can be solved directly, without iterations, even if the latter do not converge [44]. Because there was no need to do so in our case, we sticked to the iteration procedure.

Finally in Fig. 7, we display the isospin zero amplitude for one particular choice of the subtraction polynomials, and for a specific choice of the \(\pi \pi \) phase shifts. The amplitude agrees with the one constructed by Lanz [65], except near the threshold, where the cusp structure is different, and near the pseudo-threshold in the \(I=1\) channel.

6 Summary and conclusions

  1. 1.

    We have considered numerical aspects of the integral equations Eqs. (2)–(6) that govern \(\eta \rightarrow 3\pi \) decays [42]. The standard approach to solve these equations numerically is confronted with two main technical hurdles: angular averages along complicated paths in the complex plane, and singularities of the integrand near the integration path in the dispersive representation.

  2. 2.

    Holomorphicity of the phase shifts in the low energy region allows one to deform the original path of integration in the dispersive representation (not in the angular averages). Both problems disappear [55]: The angular averages can be performed in their original form, and there are no nearby non–integrable singularities on the integration path.

  3. 3.

    As the integrands are smooth, a Gauss–Legendre integration becomes feasible. The integral equations turn into a matrix equation, whose iterative solution can be obtained very efficiently.

  4. 4.

    We have constructed the 6 fundamental solutions [42] of Eqs. (2)–(6) for several sets of phase shifts, in the region \(0\!<\!s\!<\!20 \, M_\pi ^2\). The results generally agree with the solutions obtained by Lanz [65].

  5. 5.

    Tables with the fundamental solutions for 8 different sets of phase shifts can be found as ancillary files in the archive version of the present article under the link https://arxiv.org/abs/1809.06399, together with tables for the phase shifts used.

  6. 6.

    As we show in Appendix D, the very same technique is expected to also work for \(\omega \rightarrow 3 \pi \) [43]. It remains to be seen to what extent it can be applied to other three-body decays and to the calculation of form factors [34, 35, 43,44,45,46,47].