Dispersive treatment of KS→γγ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S\rightarrow \gamma \gamma $$\end{document} and KS→γℓ+ℓ-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S\rightarrow \gamma \ell ^+\ell ^-$$\end{document}

We analyse the rare kaon decays KS→γγ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S \rightarrow \gamma \gamma $$\end{document} and KS→γℓ+ℓ-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S \rightarrow \gamma \ell ^+\ell ^-$$\end{document}(ℓ=eorμ)\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(\ell = e \text{ or } \mu )$$\end{document} in a dispersive framework in which the weak Hamiltonian carries momentum. Our analysis extends predictions from lowest order SU(3)L×SU(3)R\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$SU(3)_L\times SU(3)_R$$\end{document} chiral perturbation theory (χ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\chi $$\end{document}PT3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_3$$\end{document}) to fully account for effects from final-state interactions, and is free from ambiguities associated with extrapolating the kaon off-shell. Given input from KS→ππ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S \rightarrow \pi \pi $$\end{document} and γγ(∗)→ππ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma \gamma ^{(*)}\rightarrow \pi \pi $$\end{document}, we solve the once-subtracted dispersion relations numerically to predict the rates for KS→γγ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S \rightarrow \gamma \gamma $$\end{document} and KS→γℓ+ℓ-\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$K_S \rightarrow \gamma \ell ^+\ell ^-$$\end{document}. In the leptonic modes, we find sizeable corrections to the χ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\chi $$\end{document}PT3\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$_3$$\end{document} predictions for the integrated rates.


Introduction
In the study of kaon decays, our ability to obtain precise predictions from the Standard Model (SM) depends on whether the underlying physics is predominantly of short-or longdistance nature. At one end of a broad spectrum of possible decay channels, there are "golden modes" like K → πνν, where the amplitude factorises into a hadronic form factor and perturbative corrections-both of which are under excellent theoretical control [1]. In such cases, the resulting prediction can be at a level of precision that competes with (or even surpasses) current experimental measurements. This state of affairs can lead to powerful constraints on physics beyond the SM and drives much of the theoretical and experimental interest in these modes.
By contrast, non-leptonic decays such as K → ππ and K → πππ are dominated by long-distance contributions involving hadronic matrix elements of four-quark operators. The evaluation of these matrix elements is a notoriously difficult non-perturbative problem, and this hinders the comparison of theory with experiment. a e-mail: tunstall@itp.unibe.ch In between these extremes lies a range of decay modes in which a clean separation of the short-and long-distance physics can be achieved with varying degrees of success.
In view of current high-statistics kaon experiments such as NA62 [40], we believe it is timely to consider extending the scope of dispersive methods to S = 1 processes involving the effective weak Hamiltonian H w , and in particular to two-body decays. Such an extension was proposed sometime ago by Büchler et al. [41,42], who treated the decay K → ππ dispersively by allowing H w to carry momentum, thereby overcoming the difficulty that the kinematics in two-body decays are completely fixed. The advantage of this approach over χ PT 3 is that (a) only a few subtraction constants are required as input, and (b) ππ rescattering effects are fully accounted for in terms of Omnès factors and calculable dispersive integrals in crossed channels. Moreover, by allowing H w to carry momentum, the ambiguities associated with taking the kaon off-shell [42,43] are entirely avoided.
In this article, we extend the dispersive framework developed in [41] to the rare decays K S → γ γ and K S → γ + − ( = e or μ). In lowest-order (LO) χ PT 3 , the amplitudes for K S → γ γ ( * ) possess the well known feature of ultraviolet finite π ± , K ± one-loop diagrams coupled to the external photons. For the pure radiative decay, the chiral prediction [2,44,45] for the rate is in reasonable agreement with the experimental average [46] BR while the predictions [47] for the leptonic modes are typically expressed in terms of the ratios Although these decays have not yet been measured, they may lie within reach of the KLOE-2 experiment at DA NE [48], which is projected to be sensitive down to K S branching ratios of O(10 −9 ). Given these projections, it is clearly of interest to determine what impact ππ rescattering effects have on the χ PT 3 predictions (3).
The outline of this paper is as follows. In Sect. 2 we introduce the general formalism needed to analyse K S → γ γ * dispersively, and derive the decomposition of the decay amplitude into a basis of scalar functions that are free from kinematic zeros and singularities. In particular, we use this basis to extend the LO χ PT 3 calculation [47] to the case where H w carries non-zero momentum. Section 3 reviews the dispersive framework developed for K S → ππ [41], which forms a key input in our analysis of K S → γ γ * . In Sect. 4 we examine K S → γ γ and find that the inclusion of effects from FSI improves the agreement between theory and experiment. We also comment on how our results compare with previous work [49] based on extrapolating the kaon offshell. Section 5 concerns K S → γ + − , where we observe that FSI and the pion vector form factor lead to sizeable corrections of the LO χ PT 3 predictions. Our summary is given in Sect. 6.

Preliminaries
We begin by considering the radiative decay whose amplitude is given by where ε 1,2 are the polarization vectors of the photons. The tensor A μν is defined in terms of the pure I = 1/2 matrix element 2 where J μ is the electromagnetic current of the light quarks u, d, s, and we allow the weak Hamiltonian H w to carry non-zero momentum h μ = 0. Then the decay amplitude (5) becomes a function of the three Mandelstam variables which satisfy In what follows it is convenient to set h 2 = 0, while keeping h μ = 0 in general. Doing so does not result in a loss of generality, but does simplify several expressions derived in this paper. To recover the physical decay amplitude, one simply takes the limit h μ → 0, in which case the kinematic variables become fixed at the values

Tensor decomposition
To set up a dispersive framework for K S → γ γ * , the first step is to decompose A μν in a basis of independent tensors, whose scalar coefficients are free from kinematic singularities and zeros. This can be achieved by applying the prescription of Bardeen and Tung [50], and Tarrach [51]; our approach resembles the tensor decomposition of γ * γ * → ππ discussed in [52][53][54]. Let q i = {q 1 , q 2 , h − k} label the three independent momenta and observe that Lorentz covariance and C Pinvariance implies a decomposition in terms of ten tensors 3 The scalar functions {A 1 , A i j 2 } are not all independent since A μν is constrained by the electromagnetic Ward identities A convenient way to impose the constraint (11) is to introduce the gauge projector and let it act on both indices of A μν : By definition, this leaves the physical tensor A μν invariant and removes contributions that do not satisfy the Ward identities; with this procedure the set of scalar functions reduces to five. The new basis functionsĀ i are free from kinematic singularities, but contain zeros because the tensorsT i μν contain single and double poles in q 1 · q 2 . As shown in [52][53][54], the removal of these poles can be performed by adding suitable linear combinations ofT i μν with non-singular coefficients, followed by a rescaling in powers of q 1 · q 2 . In our case, contraction with ε 1 and setting q 2 1 = 0 imposes two additional constraints, so the final result is 3 The terms ∼ i, j ε μνρσ q ρ i q σ j A i j 3 are allowed by Lorentz covariance, but violate P and C P symmetry. Fig. 1 Lowest-order χPT 3 graphs for K S → γ γ * , where the weak Hamiltonian H w carries momentum where the scalar functions B i are free from kinematic zeros and singularities, and the corresponding tensors are At the physical point (9) there are only two independent momenta, so A μν reduces to T 1 μν times the coefficient Evidently, the determination of the scalar functions B i completely fixes the prediction for the K S → γ γ * amplitude (5).
Before discussing our dispersive treatment of the scalar functions B i , it is instructive to extend the LO χ PT 3 calculation of K S → γ γ * [47] to the case where H w carries momentum. In the conventions of [2], the graphs shown in Fig. 1 yield where G 8 = 9.1 × 10 −6 GeV −2 is the octet coupling at O( p 2 ), F π = 92.2 MeV is the pion decay constant [46], and the loop integral is where φ = π ± or K ± . The integral is ultraviolet finite and can be evaluated in terms of Feynman parameters: where the denominator is given by In (19), the second tensor vanishes upon contraction with ε 1 , so we find that only Here, the quantity is defined [2] in terms of the one-loop functions At the physical point (9), the expression in (22) agrees with the original χ PT 3 result [47], as it should.
As emphasised in [55], tadpole cancellation completely eliminates the weak mass operator at O( p 2 ) in the χ PT 3 expansion. The argument can be extended to O( p 4 ) [56] and remains valid when H w carries momentum.

Unitarity and ππ intermediate states
Let us now analyse the unitarity relation due to the intermediate ππ state. In the s-channel, this contribution reads where A ππ and W μν are the amplitudes for the subprocesses K S → ππ and γ γ * → ππ respectively. On the left-hand side of the cut, the Mandelstam variables are while on the right-hand side, W μν can be decomposed into a basis of three independent tensors [21,[52][53][54]: where and The phase space integration (25) must project each of the tensors t i μν onto linear combinations of T i μν . However, the integration is trivial if contributions from D waves and higher are neglected. This is because in this approximation, A ππ is independent of t and u , while the scalar functions W i can be expressed in terms of a single helicity partial wave [53,54], Since W 1 is independent of the pion momenta, the tensor t 1 μν can be pulled under the phase space integral (25). Equating the scalar coefficients then gives the analytical result where we have introduced the kinematic factor At higher energies, other intermediate states like 4π , KK etc. will contribute to the s-discontinuity of A μν . Moreover, for a complete dispersive treatment one should also consider discontinuities in the t-and u-channels. We will not consider any of these contributions to the dispersion relation for A μν , and we explain below on what grounds these approximations can be justified.

Dispersive framework for K → ππ
The construction of a dispersion relation for K S → γ γ ( * ) requires input from K S → ππ and γ γ ( * ) → ππ. It is well known that one-loop chiral corrections to the K S → ππ amplitude are substantial, and largely due to significant rescattering effects of pions in the final state [57][58][59]. An understanding of FSI is thus essential in order to make sense of puzzles such as the I = 1/2 rule or the SM prediction for ε /ε. As noted in Sect. 1, dispersive techniques are well suited to addressing FSI; here we review the dispersive framework [41,42] We begin with the standard isospin decomposition for the where A 1/2 is generated by the I = 1/2 component of H w , and we have omitted a term involving I = 3/2. 2 As in Sect. 2, we allow the effective weak Hamiltonian H w to carry momentum h μ = 0, so the amplitude reads where the corresponding Mandelstam variables are given in (26), and satisfy The physical K 0 → ππ decay amplitude is then obtained by taking the limit h μ → 0, at which point we have If contributions from the imaginary parts of D waves and higher are neglected, it is possible to decompose A 1/2 in terms of single-variable functions where the angular dependence is contained in and the explicit expressions for N i and R i can be found in [41]. As a result of this simplification, the dispersive treatment of the full amplitude A 1/2 is reduced to solving a coupled set of dispersion relations of the single-variable functions appearing in the right-hand side of (37). As shown in [41], these relations can be solved numerically, with a minimum of two subtraction constants 4 needed to ensure convergence of the dispersive integrals. One of these constants a ππ can be determined at the soft-pion point where A 1/2 is related to the on-shell K → π amplitude A π : Note that with both K and π on-shell, the weak operator H w in A π necessarily carries momentum. The relevance of lattice calculations of A π in connection with the I = 1/2 rule has recently been discussed in [62].
On the other hand, the second constant b ππ can be obtained by considering e.g. the derivative ∂ A 1/2 /∂s at the soft-pion point (39). Ideally, lattice techniques would be used to determine a ππ and b ππ , although such calculations remain to be undertaken. Thus the approach taken in [41] was essentially pragmatic: to illustrate the role of FSI, the value of b ππ was fixed by applying χ PT 3 , so that where the dimensionless parameter X controls the size of the expected NLO corrections; on the basis of the 3-flavour expansion it can be varied between X = ±0.3. We note that the relation (41) is not affected by the weak mass term in H w ; see Sect. 2.2.
From the solutions to the dispersion relations, it is a straightforward matter to reconstruct the K → ππ amplitude. For u fixed near the physical value m 2 π , it has been shown [63] that the contribution due to C(s, t , u ) is negligible relative to M 0 in the low-energy region s 1.5 GeV 2 . Thus to a good approximation, we can write where the quantity parametrises the NLO corrections, and is the Omnès function [64] subtracted at s = 0, with δ 0 0 the ππ scattering phase shift in the I = = 0 channel.
The K → ππ amplitude in (42) can be determined up to the unknown subtraction constant a ππ , modulo chiral corrections parametrised by X . As a result, a first principles prediction for K → ππ is not currently possible within this framework. Fortunately, this does not pose a problem for K S → γ γ * since we can eliminate the dependence on a ππ by matching to A 1/2 = A 0 e iδ 0 at the physical point (36): where is the empirical value of the I = 0 amplitude [2]. In this way, the dispersive representation of K S → γ γ * is largely determined in terms of measurable quantities, and as we show in Sects. 4 and 5 this leads to rather small uncertainties in our final results.

Dispersion relations for K S → γ γ
As a first application of our dispersive framework, here we consider the case where both photons are on-shell. A complete dispersive treatment of K S → γ γ (with H w carrying momentum) would require an analysis of all possible intermediate states in all three channels s, t and u-clearly a daunting task. A simplification which has proven to be particularly effective for other scattering processes at low energies is to neglect the contributions to discontinuities coming from D waves and higher. This leads to a dispersive representation of the scattering amplitude in terms of single-variable functions, much like in the case of the K → ππ amplitude discussed in Sect. 3. As in that case, we expect that at the physical point (9), the contributions to the S wave coming from discontinuities in the t and u channels are negligible, and so will not consider them. Effectively this means that we construct a dispersion relation of the form-factor type (i.e. with a right-hand cut only), and only for the S wave. Moreover, we will explicitly consider only the effect of ππ rescattering, which at low energies should be by far the most important one. Indeed, this expectation is borne out by the LO χ PT 3 result discussed in Sect. 2.2. Let us define A γ γ (s) = e 2 B 1 (s), whose imaginary part coincides with the s-discontinuity in (31) once we set q 2 2 = 0: Here α = e 2 /4π is the fine-structure constant, and h 0 0,++ is the projection of h 0 ++ onto the I = 0 channel. The real part then follows from a once-subtracted dispersion relation at s = s 0 : where a γ γ is the subtraction constant. The subtraction is necessary because the π ± , K ± loop contribution to the χ PT 3 amplitude vanishes at the point and, moreover, to ensure convergence of the dispersive integral. This feature can be deduced from the explicit form of the χ PT 3 amplitude in (22), with q 2 2 = 0. It follows that matching A γ γ (s 0 ) onto LO χ PT 3 fixes a γ γ = 0, although in general, a γ γ will receive SU (3) corrections due to terms at O( p 6 ) in the chiral expansion. It is important to note that by matching below the ππ threshold, we make use of χ PT 3 only in a kinematic region where the typically large corrections due to FSI are entirely absent, i.e. where the 3-flavour expansion should behave as expected.
To compute the integral in (48), we require input for h 0 0,++ and the S wave of the K S → ππ amplitude which, in our representation, is given by 0 0 . Concerning the latter, the dispersive representation of the single-variable functions in (37) is only valid in the elastic scattering region 4m 2 π < s < 16m 2 π , even though the first significant inelastic contribution is due to the KK intermediate state when s > 4m 2 K . Taking this into account would require a coupled-channel analysis of K S → ππ and K S → K K , which is beyond the scope of this work. Moreover, it is unclear whether this would lead to better precision, because there are no sources of experimental information on K S → K K , and we would have to rely completely on χ PT 3 to determine the subtraction constants, with correspondingly large uncertainties.
We will thus stick to a single-channel treatment and only consider the contribution to the imaginary part of the S wave specified in (47). This implies that the phases of h 0 0,++ and 0 0 have to match exactly in order for Im s A γ γ to be real, as it should be in a single-channel treatment. This is guaranteed in the elastic region, which effectively extends up to s = 4m 2 K , but above that threshold an ambiguity arises: do the phases of the K S → ππ partial waves continue to behave like the elastic scattering phase shifts δ I , or do they exhibit a sharp "dip" like the one observed [12] in the scalar form factor of the pion? 5 This ambiguity affects both quantities: h 0 0,++ as well as 0 0 . We take a pragmatic approach to the problem and follow Moussallam [21], who constructs a phase with the property where s π lies near the KK threshold and is the point where δ 0 0 crosses π . The corresponding Omnès function 0 0 [φ] thus displays a "dip" across the inelastic region. Another option is to evaluate the Omnès function with the phase of h 0 0,++ , 5 See also the discussion in [23] which shows how, in the coupledchannel treatment, the phase of the scalar form factor of the pion is sensitive to the input for the subtraction constants. We stress, however, that even in cases where the phase of the form factor continues to track δ 0 0 after the KK threshold, the modulus of the form factor still has a dip rather than a peak at s = 4M 2 K .

Fig. 3 Energy dependence of phase shift inputs (top) and magnitude of the corresponding Omnès functions (bottom)
as input. Watson's theorem ensures φ 0 0 = ψ 0 0 in the elastic region, and leads to two representations for 0 0 which are in very close agreement at low energy. A comparison of the two phases and corresponding Omnès factors is shown in Fig. 3.
As argued by Moussallam [21] and earlier by Morgan and Pennington [65] (see also the discussion in [12]), unless the operator which is responsible for the creation of the pion pair has a large overlap with the f 0 (980), one expects a weak coupling to the f 0 (980), and correspondingly a dip in the amplitude. The only known example of an operator whose amplitude would have a peak instead of a dip is that of thē ss operator.
We thus conclude that our preferred phase is the one given in Eq. (50) and with this we will obtain our central results. The phase in (51) will be used to estimate our systematic uncertainty. More extreme behaviours-like "Solution 1" in [33]-are deemed to be very unlikely and will not be considered.
Regarding the input for h 0 0,++ , we use data from the coupled-channel analysis of γ γ → ππ performed by García-Martín and Moussallam (GMM) [19]. Since the determination of h 0 0,++ in this analysis is expected to be reliable up to s 2 GeV 2 , 6 it is necessary to impose a cutoff Fig. 4 Cutoff dependence of the dispersive amplitude |Re A γ γ | at the physical point (9), where the blue band corresponds to the systematic uncertainty. For comparison, the PDG value of |Re A γ γ | and its 1σ uncertainties is shown by the green band, while the lowest-order prediction from χPT 3 is shown by the dashed red line in our dispersion integral (48). At the physical point s = m 2 K , a comparison of the cutoff dependence is shown in Fig. 4, where |Re A γ γ | is seen to exhibit a very mild sensitivity to variations in .
Taking = 1.2 as a benchmark value, the energy dependence of the real and imaginary parts of A γ γ is shown in Figure 5. As expected, the dispersive representation agrees with LO χ PT 3 below the ππ threshold. However, for s > 4m 2 π , the effects from FSI distort the amplitude, producing a significant enhancement (suppression) of the real (imaginary) part. These effects lead to an enhanced prediction for the branching ratio which brings the SM and experiment (2) into much better agreement. The uncertainty has been determined by considering the variation X = ±0.3, shifting the value of s 0 by 30%, the comparison of the two Omnès inputs (Fig. 3), and an estimate of contributions from the high-energy region > 1.2 GeV, where the phase of 0 0 is guided to π and the helicity partial wave is fixed to a constant value |h 0,++ | ≈ 4. Combined in quadrature, the final uncertainty has turned out to be remarkably modest.

Comparison to the literature
As shown in Fig. 5, the real part of A γ γ receives a significant enhancement in absolute value at s = m 2 K due to FSI. A similar observation has been made by Kambor and Holstein (KH) [49], who estimated the effects of ππ rescattering in K S → γ γ and K L → π 0 γ γ by extrapolating the kaon mass off-shell. Focusing on the former process, we can adapt their where B(s) is a scalar function whose definition is given in [49]. In our comparison, we have updated the input used in [49] to account for improved determinations [19] of the Omnès factor and helicity partial wave. The resulting predictions at the benchmark value of = 1.2 GeV are shown in Table 1, where we also list the pure octet 2 predictions from χ PT 3 . We note that although the KH formalism produces a branching ratio consistent with experiment, it relies on the assumption that one can extrapolate the kaon mass off the mass shell. As discussed in [42,43], this procedure suffers from an inherent ambiguity as there is no unique way in which to perform the off-shell extrapolation. By contrast, our framework always involves on-shell states, and is free from such ambiguities.

Dispersion relations for K S → γ + −
We now consider the case where the photon momentum in K S → γ γ * can remain off-shell q 2 2 = 0. As in Sect. 4, we focus on contributions from S waves and define A γ γ * (s, q 2 2 ) = e 2 B 1 (s, q 2 2 ). In the presence of ππ rescattering in the I = 0 channel, the s-discontinuity reads Table 1 Determinations of the K S → γ γ amplitude A γ γ and branching ratio at the physical point s = m 2 K . The numbers in the row labelled "This work" have been obtained with input from GMM
The Born contribution to the helicity partial wave 8 produces a double pole ∼ (s − q 2 2 ) 2 in disc s A γ γ * , so a decomposition of the integrand, 7 The absence of anomalous thresholds in K S → γ γ * follows from the same arguments used for γ γ * → ππ [21]; see also [66] for a general treatment. 8 The Clebsch-Gordan factor of √ 4/3 is due to the rotation from the charge basis to the isospin one [21].
is required in order to evaluate the dispersive integral numerically. In the above, F V π denotes the vector form factor of the pion, and is set to unity in LO χ PT 3 . Using the identity in (59), we get the Born part of the K S → γ γ * amplitude where we have defined Similarly, for the rescattering contribution, we use the identity where In the evaluation of (60) and (63), we use the two Omnès inputs discussed in Sect. 4, as well as the pion form factor and helicity partial waves h 0 0,++ obtained from Moussallam's single-channel analysis of γ γ * → ππ [21]. The range of validity of h 0 0,++ can be inferred by comparing the result from the single-channel analysis at q 2 2 = 0 with that from GMM's coupled-channel analysis of γ γ → ππ [19]. As shown in Fig. 6 Energy dependence of helicity partial waves obtained from dispersive analyses of γ γ ( * ) → ππ [19,21] Near √ s = 0.8, the phase is close to π/2, so small variations in the zero of Re h 0 0,++ can lead to a large variation in Im h 0 0,++ . From a conservative viewpoint, this suggests that the cutoff be fixed to 0.8 GeV. However, we have checked that increasing the cutoff to = 1.2 GeV does not lead to a difference of more than ≈ 7% in the resulting predictions for A γ γ * . Note that this 7% is the effect of a 100% uncertainty on our input between 0.8 and 1.2 GeV. Since this small change is covered by our estimate of the systematic uncertainty, we take the larger cutoff as a benchmark value in our numerics and stress that only a coupled-channel analysis for this process would allow one to better assess this source of uncertainty and push the cutoff to yet higher energies. As noted in Sect. 4, however, there are non-trivial difficulties in performing a coupled-channel analysis for two-body K decays. 9 B. Moussallam, private communication.
By combining (60) and (63), we obtain the desired result for the total K S → γ γ * amplitude: For fixed values of q 2 2 , we first compare the predictions arising from (66) against those of χ PT 3 . In Fig. 7, we show the energy dependence of the amplitude for three values of q 2 2 . As shown in the figure, when q 2 2 < 4m 2 π , the effect of FSI resembles that previously seen in K S → γ γ (Fig. 5), with the real (imaginary) parts enhanced (suppressed) relative to χ PT 3 . However, as q 2 2 increases above the ππ threshold, the pion form factor F V π becomes progressively more important, and both real and imaginary parts in the dispersive amplitude are enhanced relative to LO χ PT 3 . This feature can be clearly seen in Fig. 8, where we keep s = m 2 K fixed and vary q 2 2 within the physical region of the three-body decay. The effect of including the pion form factor in the χ PT 3 amplitude shows a moderate enhancement at large q 2 2 , especially for the real part. We also note that even for small values of q 2 2 , the dispersive amplitude differs from χ PT 3 due to the effects of FSI.
We now consider the predictions for the K S → γ + − decay rates. Here the differential decay rate is [47] where the electromagnetic spectral function is given by In Fig. 9, we compare the χ PT 3 prediction [47] for the differential decay rate involving muons against our dispersive result. Evidently, the corrections are large for q 2 2 0.05: again, this can be inferred from the q 2 2 behaviour shown in Fig. 8. We also see that, for this mode, the dominant source of the enhancement is due to the pion form factor.
The integrated rates (normalised to the total K S decay width) are shown in Table 2, where the uncertainties are determined as in Sect. 4, except for the subtraction constant: here we keep the subtraction point fixed and vary the χ PT 3 amplitude by 30%. In both cases, the corrections are sizeable: for the electron mode we see a shift of O(50%), while in the muon mode we have a shift of O(100%). The origin of these   Differential decay width for K S → γ μ + μ − , normalised to the total K S rate. Colour coding as in Fig. 8 shifts are different in each case. For the electron mode, the phase space is peaked near the origin q 2 2 = 0, so the role of F V π is suppressed and the dominant effect is due to FSI. On the other hand, the enhancement in the muon mode is predominantly due to the form factor (Fig. 9). Table 2 Predictions for the branching ratio of K S → γ + − . The second row indicates the effect of including the pion vector form factor F V π in the χPT 3 amplitude This work (4.38 ± 0.57) × 10 −8 (1.45 ± 0.27) × 10 −9

Summary
Current and near-future searches for rare kaon decays are reaching sensitivities where a better control over the longdistance contribution to the relevant amplitudes is needed. Chiral perturbation theory and lattice QCD are two of the main tools which allow a systematic calculation of these contributions, but getting FSI under good control in either of these approaches is challenging. Dispersion relations offer a different, complementary methodology to the previous two, which addresses specifically the treatment of FSI. If one can match the dispersive and the chiral representation, and solve the dispersion relation, one can usually obtain much better control over FSI effects. In this paper, we have taken a first step in this direction by introducing a dispersive framework for K S → γ γ and K S → γ + − .
A key feature of our analysis is that by allowing the weak Hamiltonian to carry momentum, there is no need to extrapolate the kaon mass off-shell. Moreover, the input for the sub-amplitudes K S → ππ and γ γ ( * ) → ππ provide a strong constraint on the dispersive amplitude, and when expressed in terms of measurable quantities we find rela-tively small uncertainties in our final predictions. In particular, the Born contribution to γ γ * → ππ has a negligible uncertainty because the pion vector form factor is known to high precision: for this particular contribution, going off-shell in the photon momentum does not lead to larger uncertainties.
In general, we find that the effects due to FSI provide sizeable corrections to the predictions from LO χ PT 3 . For K S → γ γ , these effects distort the amplitude such that the relative size of the real and imaginary parts are interchanged. That LO χ PT 3 predicts too large an imaginary part can be concluded on the basis of unitarity alone and by taking as input the experimental measurements of K S → ππ and γ γ → ππ at s = m 2 K : LO χ PT 3 overshoots the correct value by 21%. As for the real part, we need to rely on analyticity and on a dispersive treatment of both K S → ππ as well as γ γ → ππ, where the latter is also well constrained by data. The uncertainties involved here are larger, but still allow us to firmly conclude that the prediction of LO χ PT 3 has the correct sign (negative), but substantially underestimates the absolute value: we obtain an enhancement of about 70%. This feature has been observed earlier by Kambor and Holstein [49], who noted that the reasonable agreement between the rates from LO χ PT 3 and experiment should be not be viewed as a success of the effective theory, since unitarization methods produce nearly identical results. Our results confirm this observation and places it on a stronger footing since we do not rely on off-shell extrapolations.
For K S → γ + − , we found that the pion vector form factor produces an additional source of enhancement over LO χ PT 3 . Since the form factor is well known experimentally in both the timelike and the spacelike region, we can evaluate this particular correction very reliably, which is an important outcome of this analysis. Although less pronounced in the electron mode due to phase space suppression, we observed a particularly large increase in the rate for the muon mode. In view of this result, we believe the muon mode has good prospects of being observed at the projected sensitivities of KLOE-2.
In our analysis, we have restricted ourselves to the case where at most one photon is off-shell. It would be interesting to extend our dispersive framework to the doubly off-shell amplitude K S → γ * γ * , which provides the dominant contribution to the rare decay K S → + − . For the muon mode, LHCb [68] has recently placed an upper bound on the rate BR(K S → μ + μ − ) < 9 × 10 −9 , and future upgrades are expected to improve the sensitivity down to O(10 −10 ) [69]. Given that a signal well above 10 −11 has been claimed [70] to be clear evidence of physics beyond the SM, determining the role of FSI in this mode will be essential in order to draw definite conclusions regarding the SM background. Work in this direction is currently in progress.