1 Introduction

The present experimental status of the \(K \rightarrow \pi ^+\pi ^0\pi ^0 \gamma \) decays is rather meager [1]. So far the observation is claimed by the single experiment [2] with statistics of 5 events and \(BR=(7.6_{-3.0}^{+6.0}) \times 10^{-6}\). The radiative photon is generated predominantly via internal bremmstrahlung with infrared pole at \(E_{\gamma }=0\). Compared to the study of \(K^{+} \rightarrow \pi ^{+}\pi ^{+}\pi ^{-}\gamma \) decay reported by us earlier [3] the \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) decay has less charged particles in the final state, i.e the bremsstrahlung component is less pronounced and there is a chance to detect more interesting effects. In this article we present the observation and measurement of that decay with considerably improved precision. This decay has certain interest for the theory, in particular, for the chiral perturbation theory (ChPT). The calculations in the next-to-leading order in this framework are done in [1, 4] predicting the branching ratio at \(3.76 \times 10^{-6}\) for \(E_{\gamma }^*>10\) MeV. It is interesting to compare this result to the experimental data.

2 The OKA setup

The OKA is a fixed target experiment dedicated to the study of kaon decays using the decay in flight technique. It is located at NRC ”Kurchatov Institute”-IHEP in Protvino (Russia). A secondary kaon-enriched hadron beam is obtained by an RF separation with the Panofsky scheme. The beam is optimized for the momentum of 17.7 GeV/c with kaon content of about 12.5% and intensity up to \(5\times 10^{5}\) kaons per U-70 accelerator spill.

Fig. 1
figure 1

Schematic elevation view of the OKA setup. \(S_{1-4}\) – beam scintillation counters; \(C_{1,2}\) – gas Cherenkov counters; \(BPC_{1-4}\) – beam proportional chambers; \(M_1\) – beam analyzing magnet; GS – guard system; \(PC_{1-8}\) – proportional chambers for secondary track; SM – secondary track analyzing magnet; ST – straw tubes; DT – drift tubes; HODO – matrix hodoscope; \(S_{bk1},S_{bk2}\) – beam killer counters; BGD, GAMS – electromagnetic calorimeters; GDA – hadron calorimeter; \(\mu C\) – muon counter. See the text for details

The OKA setup (Fig. 1) makes use of two magnetic spectrometers along with an 11 m long decay volume (DV) filled with helium at atmospheric pressure and equipped with a guard system (GS) of lead-scintillator sandwiches mounted in 11 rings inside the (DV) for photon veto. It is complemented by an electromagnetic calorimeter BGD [5] with a wide central opening.

The first magnetic spectrometer measures momentum of the beam particles with a resolution of \(\sigma _{p}/p\) \(\sim 0.8\%\). It consists of a vertically (y) deflecting magnet M\(_{1}\) surrounded by a set of 1 mm pitch (beam) proportional chambers BPC\({}_{(}\) \({}_{1Y,}\) \({}_{2Y,}\) \({}_{2X,...,}\) \({}_{4Y}\) \({}_{)}\). It is supplemented by two threshold Cherenkov counters Č\(_{1}\)\(_{2}\) for kaon identification and by beam trigger scintillation counters S\(_{(1)}\), S\(_{(2)}\), S\(_{(4)}\). To measure the charged tracks from decay products the second magnetic spectrometer is used (with resolution of \(\sigma _{p}/p \sim \) 1.3–2% for momentum range of 2–14 GeV/c). It consists of a wide aperture 200 \(\times \) 140 cm\(^2\) horizontally (x) deflecting magnet SM with \(\int Bdl \sim 1\) Tm surrounded by tracking stations: proportional chambers PC\(_{(1,...,8)}\), straw tubes ST\(_{(1,2,3)}\) and drift tubes DT\(_{(1,2)}\). A matrix hodoscope HODO is used to improve time resolution and to link xy projections of a track.

At the end of the setup there are: an electromagnetic calorimeter GAMS\(_\mathtt{(ECAL)}\) of 15X\({}_{0}\) (consisting of \(\sim \) 2300 3.8 \(\times \) 3.8 \(\times \) 45 cm\(^3\) lead glass blocks) [6] with the energy resolution of \(\sigma _E/E = 0.015 + 0.1/\sqrt{E}\) and the space resolution of 2–8 mm, a hadron calorimeter GDA\(_\mathtt{(HCAL)}\) of 5\(\lambda \) (constructed from 120 20\(\times \)20 cm\({}^{2}\) iron-scintillator sandwiches with WLS plates readout) and a wall of \(4\times 1 \)m\({}^{2}\) muon counters \(\mu \)C behind the hadron calorimeter.

More details on the OKA setup can be found in [7, 8].

3 The data and the analysis procedure

Two sequential runs of data with beam momentum of 17.7 GeV/c recorded by OKA collaboration in 2012 and in 2013 are analysed in search for \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) decay.

The main trigger requires a coincidence of 4 beam scintillation counters, a combination of two Cerenkov’s (Č\(_{1}\) sees pions, Č\(_{2}\) pions and kaons) and, finally, anticoincidence of two scintillation counters S\(_{bk1}\), S\(_{bk2}\), located on the beam axis after the magnet to suppress undecayed beam particles: \(\mathtt{Tr_{Kdecay}=}\) \(\mathtt{S_{1}{\cdot }S_{2}{\cdot }S_{3}{\cdot }S_{4}{\cdot }\overline{\check{C}}_{1}{\cdot }\check{C}_{2}{\cdot }{\overline{S}}_{bk} }\). The trigger additionally requires an energy deposition in GAMS-2000 e.m. calorimeter higher than  2.5 GeV to suppress the dominating \(K^{+}\rightarrow \mu ^{+}\nu \) decay: \(\mathtt{Tr_{GAMS}=Tr_{Kdecay} \cdot (E_{GAMS}>2.5}\)  GeV).

The Monte Carlo (MC) statistics is generated with Geant-3.21 [9] program comprising a realistic description of the setup. The MC events are passed through full OKA reconstruction procedures.

For the estimation of the background to the selected data set, a sample of the Monte Carlo events with six main decay channels of charged kaon (\(\pi ^{+}\pi ^{0}\), \(\pi ^{+}\pi ^{0}\pi ^{0}\), \(\pi ^{+}\pi ^{0}\gamma \), \(\mu ^{+}\nu \gamma \), \(\pi ^{0}\mu ^{+}\nu \), \(\pi ^{0}e^{+}\nu \)) mixed accordingly to their branching fractions, with the total statistics \(\sim \) 10 times larger than that of the recorded data sample is used. Every MC event has a weight \(w \sim |M|^2\) where M is the matrix element of the decay. The weights for the 3-body decays are calculated from the data, presented in PDG [10], the matrix element of the \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) decay comes from [4]. The pileup and processes when the beam kaon scatters or interacts while passing the setup are also added.

3.1 Event selection

The total of \(\sim \) \(3.6\times 10^{9}\) events with kaon decays are logged in of which \(\sim \) \(8\times 10^{8}\) events are reconstructed with a single charged particle in the final state. The primary selection criteria are:

  • The secondary positive track is neither \(e^+\) (by E/p ratio) nor \(\mu ^+\) (no signal in \(\mu C\) counter).

  • The angle between the beam and the secondary track \(\Theta >2\) mrad (to suppress the beam background); the distance between the beam and secondary track \(<1\) cm.

  • The decay vertex is within the decay volume.

  • Only one segment of the charged track downstream the analysing magnet.

  • 5 \(\gamma \) with energy \(E > 0.5\,\textrm{GeV}\) detected.

  • Out of all possible combinations of 4 \(\gamma \)s the one with minimal value of \(R^2 = (m_{ij} - m_{\pi ^0})^2 + (m_{kl} - m_{\pi ^0})^2, \quad ij\ne kl\) is identified as \(\pi ^0\pi ^0\) and the remaining \(5\textrm{th}\), “stray” \(\gamma \) considered bremsstrahlung.

  • \(5\textrm{th} \gamma \) hits GAMS rather than BGD.

  • At least one out of 4 \(\gamma \), making \(\pi ^0\pi ^0\), hits GAMS. This has some influence since GAMS is in trigger and BGD is not.

  • \(5\textrm{th} \gamma \) energy in \(K^+\) rest frame is \(10<E_{\gamma }^*(measured)<50\,\textrm{MeV}\).

About 230k events selected at this stage. Per MC simulation the only background process surviving this selection is non-radiative decay \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\) of similar topology less one \(\gamma \). The extra “ghost” \(\gamma \) easily emerges due to the fluctuations of \(\pi ^+\) hadronic shower in GAMS e.m. cal. This background decay is \(\approx 5000\) times more frequent (\(1.76\%\)) than the radiative decay in question thus presenting a major challenge in this analysis. We employ a Radial Basis Function Network (RBFN) neural net [11] to suppress this background. MC events of \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\) and \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) decays passed the primary selection criteria are used to train the neural network (NN); the training set contains 100k events of each type. For the inputs to the NN we use the variables relevant to discrimination between two types of events:

  • \(\varDelta E = E_{\pi ^+} + \sum _{i=1}^{5} E_{\gamma _i} - E_{beam}\) – the energy balance in the event. Any extra \(\gamma \) piled up on top of major background process \(K^{+} \rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\) results in \(\varDelta E>0\).

  • \(E_{\gamma }\) – the \(5\textrm{th} \gamma \) energy.

  • \(d_{\gamma }\) – distance from the \( 5\textrm{th} \gamma \) to the track at GAMS plane.

  • \(\chi _{\gamma }^2\)\(\chi ^2\) of the \(5\textrm{th} \gamma \) shower shape fit. All 3 bullets above may help discriminating the events where the fluctuations of \(\pi ^+\) shower in GAMS may mimic an extra \(\gamma \).

  • \(\chi _{\pi ^+\pi ^0\pi ^0}^2\) - \(\chi ^2\) of 3C-fit to \(\pi ^+\pi ^0\pi ^0\). Good \(\chi ^2\) is likely to come from \(\pi ^+\pi ^0\pi ^0\).

  • \(\chi _{\pi ^+\pi ^0\pi ^0\gamma }^2\)\(\chi ^2\) of 3C-fit to \(\pi ^+\pi ^0\pi ^0\gamma \). Bad \(\chi ^2\) suggests something wrong with the event.

  • \(M_{\pi ^+\pi ^0\pi ^0}\) – mass of \(\pi ^+\pi ^0\pi ^0\) in 3C-fit. \(\pi ^+\pi ^0\pi ^0\gamma \)-event fitted to \(\pi ^+\pi ^0\pi ^0\)-hypothesis results in the mass shifted to the lower values due to disregarded \(\gamma \). ( In 3C fit the \(\gamma \) energies are fitted under 3 kinematic constraints: \(m_{\gamma _1 \gamma _2} = m_{\gamma _3 \gamma _4} = m_{\pi ^0}\) and the total energy of the secondaries equals to the beam energy.)

On its output the RBFN produces a real number. Moving the output threshold offers control over signal:background ratio (Fig. 2).

Fig. 2
figure 2

Neural net performance, the axes are normalized to the MC sample used for training. With the thresholds used in this analysis (shown with bullets) the NN can suppress the background 1000 times while retaining about 1/4 of good events

The sample of 230k events left after primary selection is processed by this neural net to obtain the mass spectra in Fig. 3. The mass is calculated per

$$\begin{aligned} m^2= & {} E^2 - \vec {p}^2; \nonumber \\ E= & {} \sqrt{p_{\pi ^+}^2 + m_{\pi ^+}^2} + E_{\pi _1^0} + E_{\pi _2^0} + E_{\gamma }, \nonumber \\ \vec {p}= & {} \vec {p}_{\pi ^+} + \vec {p}_{\pi _1^0} + \vec {p}_{\pi _2^0} + \vec {p}_{\gamma } \end{aligned}$$
(1)

where \(E_i, \vec {p_i}, m_i\) stand for the energies, momenta and mass of the particles in the final state. Clear peak is seen in the mass spectra for 3 different thresholds on RBFN output. The observed peak width is dominated by the GAMS energy resolution and is reproduced by MC.

3.2 Fitting mass spectra

The mass spectra in Fig. 3 are fitted in order to determine the number of events of the decay. We tried two different background shape models:

  • MC shape:

    $$\begin{aligned} Data= & {} \alpha \times MC(K^+ \rightarrow \pi ^+\pi ^0\pi ^0\gamma ) \nonumber \\{} & {} + \beta \times MC(K^+ \rightarrow \pi ^+\pi ^0\pi ^0) \end{aligned}$$
    (2)

    with free scaling parameters \(\alpha , \beta \);

  • Gaussian shape + second order polynomial, peak position and width being free parameters.

Fig. 3
figure 3

Mass spectra for different cuts on RBFN output; data are the points with error bars, the MC background and signal are the dark and light histograms respectively

The numbers for different fits came out close within errors (Table 1). We adopt the fit with MC background shape and \(RBFN>0.5\) as the basis to obtain the BR because in this case the background shape is more smooth due to better MC statistics thus the error is driven by limited data rather then the size of MC sample. Also the peak is more statistically significant in this case, the p-value for the fit to background hypothesis is \(p=9 \times 10^{-6}\). The other 5 fits are used to estimate possible systematic errors. In particular, the background shape in G+P2 fits do not rely on MC at all so the close numbers in columns 2 and 3 of Table 1 prove the result insensitive to the background shape model.

Table 1 Number of \(K^{+ }\rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) events in the peak and corresponding branching ratios in units of \(10^{-6}\) for different RBFN thresholds and background models

4 Branching ratio

The decay of similar topology, \(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\), used for normalization to cancel out most of uncertainties in efficiency calculation. We have about \(2\times 10^6\) of those at hand with no considerable background (Fig. 4).

$$\begin{aligned} BR_{\pi ^{+}\pi ^{0}\pi ^{0}\gamma } = \frac{n_{\pi ^{+}\pi ^{0}\pi ^{0}\gamma } }{n_{\pi ^{+}\pi ^{0}\pi ^{0}}} \times \frac{\varepsilon _{\pi ^{+}\pi ^{0}\pi ^{0}} }{\varepsilon _{\pi ^{+}\pi ^{0}\pi ^{0}\gamma }} \times BR_{\pi ^{+}\pi ^{0}\pi ^{0}} \end{aligned}$$
(3)

where n, \(\varepsilon \) and BR are the numbers of detected events, detection efficiencies and branching ratios. The detection efficiencies for both decays obtained through MC simulation. For \( RBFN>0.5 \quad \varepsilon _{\pi ^{+}\pi ^{0}\pi ^{0}\gamma } = 0.0205, \quad \varepsilon _{\pi ^{+}\pi ^{0}\pi ^{0}} = 0.141 \). Due to the infrared pole in bremmstrahlung amplitude we quote the BR for the \(\gamma \) energy in \(K^+\) rest frame \(E_{\gamma }^* > 10\) MeV. The choise of 10 MeV cutoff is driven by our experimental setup being insensitive to lower energy \(\gamma \)s. There is a slight \((\sim 1\,\textrm{MeV})\) difference between the true and observed \(\gamma \) energies due to finite resolution. It is known from the simulation that the sample in Fig. 3, left is not contaminated with events with \(E_{\gamma }^*(true) < 10\,\textrm{MeV}\). Then by normalizing the detection efficiency \(\varepsilon _{\pi ^{+}\pi ^{0}\pi ^{0}\gamma }\) to the number of simulated events with \(E_{\gamma }^*(true) > 10\,\textrm{MeV}\) we obtain the branching ratio not afected by the resolution on \(E_{\gamma }^*\).

Fig. 4
figure 4

Observation of \(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\) decay used for normalization: data (black) and MC (blue)

$$\begin{aligned}{} & {} \frac{BR(K^+ \rightarrow \pi ^+\pi ^0\pi ^0\gamma )}{BR(K^+ \rightarrow \pi ^+\pi ^0\pi ^0)} = (2.1 \pm 0.5) \times 10^{-4}; \nonumber \\{} & {} BR(K^+ \rightarrow \pi ^+\pi ^0\pi ^0\gamma ) = (3.7 \pm 0.9) \times 10^{-6}, \nonumber \\{} & {} E_{\gamma }^* > 10\,\textrm{MeV}. \end{aligned}$$
(4)

5 \(\gamma \) spectrum and angular distribution

The area around \(K^+\) mass \((0.485<m(\pi ^+\pi ^0\pi ^0\gamma )<0.505)\) of the spectrum in the leftmost frame of Fig. 3 (RBFN>0.5) selected for this study. Scaled MC background \(\gamma \) spectrum was then subtracted from the \(\gamma \) spectrum of selected events:

$$\begin{aligned} \frac{dn}{dE_{\gamma }^*} = \left( \frac{dn}{dE_{\gamma }^*} \right) _{exp} - \beta \times \left( \frac{dn}{dE_{\gamma }^*} \right) _{bkg MC}, \end{aligned}$$
(5)

with scaling factor \(\beta \) taken from the fit (2) of the \(RBFN>0.5\) mass spectrum. \(E_{\gamma }^*\) is the \(\gamma \) energy in \(K^+\) rest frame. The distribution over the angle between \(\gamma \) and \(\pi ^+\) obtained in the same way. The resulting energy spectrum and angular distribution agree with ChPT prediction although the errors are large (Fig. 5). Both spectra are severily distorted by detection efficiency and complex selection criteria.

Fig. 5
figure 5

Energy spectrum of \(\gamma \) and the angle between \(\gamma \) and \(\pi ^+\) in \(K^+\) rest frame

6 Systematic errors

The systematic errors in the measured BR are mostly cancel out because they are almost the same in the signal and normalization processes. That is the dominant contribution to the systematic error is due to the radiative photon measurement. The BR values derived from the mass spectra fits with different RBFN thresholds and background models mutually agree. Based on Table 1 we conclude that there are no systematics introduced by the fit beyond statistical errors. The MC estimate of the other sources comes out well below \(25\%\) statistical error:

  • Wrong \(\gamma \) combination. Per MC the processing algorithm picks the wrong \(\gamma \) combination for \(\pi ^0\pi ^0\) in \(4.5\%\) of events identified as \(\pi ^{+}\pi ^{0}\pi ^{0}\gamma \). In this case the mass peak broadens 3-fold contributing more to the background than to the peak.

  • \(\gamma \) detection threshold varied from 0.4 to 0.6 GeV, \(4\%\) variation in BR.

  • GAMS threshold curve. The triggering efficiency raises gradually from 0 to 1 with the GAMS energy deposition increase [12]. The MC study showed detection efficiencies for both decays being insensitive to particular shape of this curve down to \(1\%\).

  • Overall normalization uncertainty evaluated to \(4\%\) including selection criteria and \(BR(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^{0})\).

All these sources, being added in quadrature, result in \(\sigma _{syst} = 0.27\times 10^{-6} \approx 7.5\%\). So the overall error is driven by limited statistics.

7 Conclusions

The OKA data are analyzed in search for \(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^{0}\gamma \) decay. The major background source, the decay \(K^{+}\) \(\rightarrow \) \(\pi ^{+}\pi ^{0}\pi ^{0}\), is \(\approx 5000\) times more intense than the radiative decay in question. The RBFN neural network is employed to suppress the background down to Signal:Noise\(\approx 1:1\) level; about 60 events of the decay observed. The branching ratio obtained by normalization to the decay of similar topology \(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^0\), \(BR(K^{+}\rightarrow \pi ^{+}\pi ^{0}\pi ^0\gamma ) = (3.7 \pm 0.9(stat) \pm 0.3(syst)) \times 10^{-6} (E_{\gamma }^*>10\,\textrm{MeV})\) agrees with ChPT prediction of \(3.76 \times 10^{-6}\). The \(\gamma \) energy spectrum and angular distribution are also in agreement with ChPT although the errors are large. The observation of the decay proves feasibility of its detailed study on a larger statistical sample.