1 Introduction

The existence of sterile neutrinos has been an important issue in the field of neutrino physics for over 20 years. The experimental results from [1,2,3,4,5,6] could be interpreted as indications of the existence of sterile neutrinos with mass-squared differences of around 1 \(\mathrm {eV}^2\).

The JSNS\(^2\) experiment, proposed in 2013 [7], is designed to search for neutrino oscillations caused by such a sterile neutrino at the Material and Life science experimental Facility (MLF) in J-PARC. The facility provides an intense and high-quality neutrino beam with \(1.8 \times 10^{14}\,\nu \)/year/cm\(^2\) from muon decay-at-rest (\(\mu \)DAR) produced using a 1 MW proton beam with a 25 Hz repetition rate. The neutrino source is produced by impinging 3 GeV protons from a rapid cycling synchrotron on a mercury target in the MLF. The experiment uses a Gadolinium (Gd) loaded liquid scintillator (Gd-LS) detector with 0.1 w% Gd concentration placed at 24 m from the target.

The JSNS\(^2\) experiment aims to direct test the LSND observation [1] with improvements of the experimental technique. Observing \({\bar{\nu }}_{\mu } \rightarrow {\bar{\nu }}_{e}\) oscillation using a \(\mu \)DAR neutrino source via inverse beta decay (IBD) reaction, \({\bar{\nu }}_{e} + p \rightarrow e^{+} + n\), is the same experimental principle used by the LSND experiment [1]. On the other hand, there are several improvements offered by the JSNS\(^2\) experiment. In order to identify IBD events, a delayed coincidence between the positron signal (prompt signal: up to 53 MeV) and neutron capture signal is used for selection. Gd is used to identify neutron captures. After capturing thermal neutrons, Gd generates gamma-rays with higher energies and shorter capture times (8 MeV, \(\sim 30~\upmu \)s) than hydrogen (2.2 MeV, \(\sim 200~\upmu \)s). Therefore, backgrounds accidentally coincident in the delayed signal region can be reduced by \(\sim 6\) times compared to the hydrogen capture used in the LSND experiment, due to the shorter capture time. In addition, the short-pulsed beam, 100 ns width double pulses in 600 ns interval in each spill, enables us to set a timing window for the IBD prompt signal as 1.5–10 \(\upmu \mathrm {s}\)  from the proton beam collision so that the neutrinos from pion and kaon decay and fast neutrons generated at the target can be rejected efficiently. However, the efficiency for the \(\mu \)DAR neutrinos can be kept at 62% because of the muon lifetime (\(2.2~\upmu \mathrm {s}\)). The cosmogenic background is also reduced by a factor of \(10^{-4}\).

There are a number of correlated backgrounds to the IBD signal, characterized by a time-coincident prompt and delayed signal. The most concerning one is a cosmic-induced neutron whose prompt signal is made by a recoil proton and a delayed signal from the neutron capture on Gd after thermalization. Approximately 99% of this neutron background can be rejected by pulse shape discrimination (PSD) using the waveform shape difference between the prompt signal of IBD and neutron events. Detailed discussion about the signal detection principle and the background rejection technique can be found in [7, 8].

2 Commissioning run

The JSNS\(^2\) experiment commenced data taking with a single detector in June 2020. The data taking period was from June 5th to 15th – approximately 10 days. The proton beam power was 600 kW during this period. There was a one-day beam-off period due to facility maintenance, and thus we obtained beam-off data at that time. The integrated number of proton-on-target (POT) collected was \(8.9 \times 10^{20}\), corresponding to less than 1.0% of the required POT of the JSNS\(^2\) experiment. The expected number of IBD signal events from this run is estimated to be much less than 1 [8]. Therefore, all observed events are likely background.

2.1 Experimental setup

The JSNS\(^2\) detector is a cylindrical liquid scintillator detector with 4.6 m diameter and 3.5 m height placed at a distance of 24 m from the mercury target of the MLF. It consists of 17 tonnes of Gd-LS contained in an acrylic vessel, and 33 tonnes unloaded liquid scintillator (LS) in a layer between the acrylic vessel and a stainless steel tank. The LS and the Gd-LS consist of LAB (linear alkyl benzene) as the base solvent, 3 g/L PPO (2,5-diphenyloxazole) as the fluor, and 15 mg/L bis-MSB (1,4-bis(2-methylstyryl) benzene) as the wavelength shifter. The LS volume is separated into two independent layers by an optical separator that forms two detector volumes in the one detector. The region inside the optical separator, called the “inner detector”, consists of the entire volume of the Gd-LS and \(\sim \)25 cm thick LS layer. Scintillation light from the inner detector is observed by 96 Hamamatsu R7081 photomultiplier tubes (PMTs) each with a 10-in. diameter. The outer layer, called the “veto layer”, is used to detect cosmic-ray induced particles coming into the detector. A total of 24 of 10-in. PMTs are set in the veto layer whose inner surfaces are fully covered with reflection sheets in order to improve the collection efficiency of the scintillation light.

2.2 Data acquisition system and triggers

PMT signal waveforms from both the inner detector and the veto layer are digitized and recorded at a 500 MHz sampling rate by 8-bit flash analog-to-digital converters (FADCs). As a trigger, we utilize a 25 Hz periodic signal from the kicker magnet which directs the proton beam towards the MLF target, called the “kicker trigger”. The width of the acquired waveform in this trigger scheme is set to 25 \(\upmu \mathrm {s}\) , which thoroughly covers the prompt signal timing window of the IBD events. We mainly used this trigger and data acquisition to obtain the beam-on data for a background estimation within the sterile neutrino search.

The other trigger, called the “self trigger”, uses an analog sum of the PMT signals from the inner detector. The trigger threshold is set to 80 mV equivalent to 100 p.e. and its efficiency reaches approximately 99% above 7 MeV. The FADCs record 496 ns wide waveforms during the data taking using the self trigger. We utilized the self trigger during the beam-off period, in order to obtain cosmogenic events, and when taking calibration data using a 252-Californium (\(^{252}\)Cf) neutron source. A detailed description of the detector and the triggers are given in [9, 10], respectively.

3 Event selection and results

The data set used for the background estimation was obtained using the kicker trigger. The total number of beam spills was 8,092,503, equivalent to 3.5 days of data taking. Since the obtained waveforms, given the 25 \(\upmu \mathrm {s}\) width, contain multiple events, we therefore used an event definition based on the number of hit PMTs in order to extract each event. We constructed a hit time series at each trigger by accumulating hit information along the FADC window with 60 ns coincidence width over all the PMTs. The coincidence width was determined by considering a typical PMT pulse shape and a safety factor from timing calibration. The event discrimination threshold is set to 10 hits and 50 p.e., which corresponds to an energy well below 1 MeV.

The event vertex position and energy reconstruction is performed simultaneously based on a maximum-likelihood algorithm using the charge response of each PMT. Both vertex position and energy were calibrated by deploying a \(^{252}\)Cf source and using the 8 MeV peak in the energy spectrum resulting from thermal neutron capture on Gd (nGd). The reconstruction performance can be found in [11] for nGd events and [12] for events with up to 60 MeV using Michel electrons.

The selection criteria and estimated efficiency are given in Table 1 and a more detailed description is found in [8]. The selection criteria applied in this analysis are nearly identical to those in [8].

The prompt signal of IBD candidates is selected using a time difference from the beam collision timing (\(\Delta t_{\mathrm {beam-p}}\)) and its energy (\(E_{\mathrm {p}}\)). We applied the following requirements; \(1.5\le \Delta t_{\mathrm {beam-p}}\le 10\) \(\upmu \mathrm {s}\)  and \(20\le E_{\mathrm {p}}\le 60 \) MeV, in order to fully cover \(\mu \)DAR neutrinos from the mercury target. The timing selection rejects beam-induced fast neutrons in the beam on-bunch timing (\(0 \le \Delta t_{\mathrm {beam-p}}\le 1.5~\upmu \mathrm {s}\)) as well as neutrino backgrounds from kaon and pion decay whose lifetimes are 12 ns and 26 ns, respectively. Figure 1 demonstrates the energy and timing selection windows in a two-dimensional distribution of energy and timing. The red box represents the prompt signal region and the orange dashed box displays the region of beam-induced fast neutron events.

In order to identify neutron captures on Gd, the 8 MeV peak of the delayed signal energy (\(E_{\mathrm {d}}\)) is selected using a requirement of \(7\le E_{\mathrm {d}}\le 12 \) MeV [13]. In addition, the conditions on the time and spatial difference between the prompt and delayed signal, \(\Delta t_{\mathrm {p-d}} \le 100~\upmu \mathrm {s}\) and \(\Delta \mathrm {VTX}_{\mathrm {p-d}} \le 60\) cm, were used for correlated event selection. The range of digitized waveforms obtained using the kicker trigger limited the range of \(\Delta t_{\mathrm {p-d}}\), and thus the selection was replaced with \(\Delta t_{\mathrm {p-d}} \le 25~\upmu \mathrm {s}\).

There are nGd events associated with fast neutrons induced by the beam contributing to the IBD delayed signal as an accidental background [14]. Because these nGd events spatially correlate with an activity made by beam-induced fast neutrons, we can reject them the spatial correlation. In particular, we applied a requirement on the spatial difference between on-bunch event and delayed candidates, \(\Delta \mathrm {VTX}_{\mathrm {OB-d}}\ge 110\) cm. The distribution of \(\Delta \mathrm {VTX}_{\mathrm {OB-d}}\) and the efficiency estimation can be found in [13]. The on-bunch event tagging condition is set to \(0 \le \Delta t_{\mathrm {beam-OB}} < 1.5~\upmu \mathrm {s}\) and \(1 \le E_{\mathrm {OB}}\le 200\) MeV.

Note that a nitrogen purge of the Gd-LS/LS was not performed. Thus, the PSD was unavailable in this commissioning run data. The timing veto, a selection using a likelihood based on \(\Delta t_{\mathrm {beam-p}}\) and \(\Delta t_{\mathrm {p-d}}\) [13], was also not applied due to the limited waveform width.

Table 1 The IBD selection criteria and their efficiencies in the JSNS\(^2\) experiment [8]. Note that due to the data taking strategy in the commissioning run we applied \(\Delta t_{\mathrm {p-d}}\le 25\) \(\upmu \mathrm {s}\) to the commissioning run data for background estimation

The IBD selection results in the distributions of \(E_{\mathrm {p}}\) (a), \(E_{\mathrm {d}}\) (b) and \(\Delta \mathrm {VTX}_{\mathrm {p-d}}\) (c), \(\Delta t_{\mathrm {p-d}}\) (d) and \(\Delta \mathrm {VTX}_{\mathrm {OB-d}}\) (e) as shown in Fig. 2, respectively. The regions indicated by the red arrows represent the selected event candidates. As a result, \(59 \pm 8\) correlated event candidates remain within the IBD selection region for the sterile neutrino search.

Fig. 1
figure 1

A two-dimensional distribution of energy and timing before event selection, used to demonstrate the IBD selection region. The selected regions for the IBD prompt signal from positron (red box), the IBD delayed signal from gamma-rays resulted in thermal neutron capture on Gd (green box) and the beam on-bunch event (orange dashed box) are overlaid. Note that the events are shown around the prompt signal timing region. There are two event clusters within 0–1.5 \(\upmu \mathrm {s}\) , which reflect the proton beam structure of the MLF. They are caused by neutrons produced at the mercury target. One can see that the IBD prompt signal region is well separated from the on-bunch region. The events in the IBD delayed signal region must also satisfy \(\Delta t_{\mathrm {p-d}} > 0\) in the actual delayed coincidence

Fig. 2
figure 2

The distributions of the variables used in the IBD selection: a \(E_{\mathrm {p}}\), b \(E_{\mathrm {d}}\), c \(\Delta t_{\mathrm {p-d}}\), d \(\Delta \mathrm {VTX}_{\mathrm {p-d}}\) and e \(\Delta \mathrm {VTX}_{\mathrm {OB-d}}\). Each distribution has all of the selection criteria applied, except for the selection on the variable displayed. The red arrows show the selection criteria of each variable

Taking an efficiency difference between the beam-off data obtained by the self trigger and the beam-on data with a 25 \(\upmu \mathrm {s}\)  FADC window into consideration, the expected number of cosmic-induced fast neutron events in the IBD selection is \(55.9 \pm 4.3\). This is consistent with the number of the correlated events obtained above. The remaining background rates are mainly produced by cosmic-induced fast neutrons, with a measured rate of \((2.45 \pm 0.20) \times 10^{-5}\) /spill. By assuming 99% neutron rejection with the PSD capability, the cosmic-induced fast neutron event rate will be comparable with the expected event rate of the signal IBD at the best-fit oscillation parameter from the LSND experiment. Therefore, it is crucial to construct a particle identification technique with the PSD capability for fast neutron rejection.

Fig. 3
figure 3

Top: the likelihood score distributions of the Michel electron sample (blue) and the fast neutron sample (red). The Michel electron sample represents the distribution of the neutrino signal. Bottom: the CNN score distributions of the Michel electron sample (blue) and the fast neutron sample (red). The label “Central region only” indicates that the event vertices around the detector center (\(r < 60\) cm and \(|z| < 60\) cm) are used

4 Pulse shape discrimination

As described above, we have found a large number of background events caused by cosmic-induced fast neutrons for the sterile neutrino search. Thus, diisopropylnaphthalene (DIN, \(\mathrm {C}_{16}\mathrm {H}_{20}\)) was dissolved into the Gd-LS in order to enhance the PSD capability. DIN is commercially available and widely used as a base solvent of organic liquid scintillator. Several neutrino experiments using a liquid scintillator detector have adopted it and achieved good PSD capability [15, 16]. Approximately 8% by volume of DIN was dissolved into the Gd-LS at the beginning of the first physics run from January 2021. Nitrogen purging was performed before data taking which led to 10% increase in light yield as a result. The test data obtained using the self trigger in the first run was used to investigate the PSD capability.

We developed a PSD algorithm based on likelihood discrimination using probability density functions (PDFs) associated with the waveform. The PDFs were constructed by accumulating the ratio of a peak count to a count at each sample of digitized waveforms for each PMT. We defined a log-likelihood for PSD with a weight of the total charges representing photon statistics as follows:

$$\begin{aligned} \log {\mathcal {L}}_{\alpha } = \sum ^{96}_{i = 0} Q^{\mathrm {total}}_i \sum ^{248}_{j = 40} \log ( P^{\alpha }_i (Q^{\mathrm {ratio}}_{ij}) ) \quad (\alpha = \mathrm {sig, bkg}),\nonumber \\ \end{aligned}$$
(1)

where \(Q^{\mathrm {total}}_i\) represents the observed charge on the ith PMT, and \(P^{\alpha }_i\) is the probability of the charge ratio \(Q^{\mathrm {ratio}}_{ij}\) of the jth sample of the digitized waveform of the ith PMT. The first 39 samples were used for a baseline calculation and thus ignored in computing the likelihood value. Two likelihood values representing signal-like (\({\mathcal {L}}_{\mathrm {sig}}\)) and fast neutron background-like (\({\mathcal {L}}_{\mathrm {bkg}}\)) were computed, and the log-likelihood difference of them, defined as a score, was then used for the event selection.

In parallel, another method for PSD was developed which utilizes a Convolutional Neural Network (CNN), a deep learning algorithm commonly used in the computer vision area. The same data in the likelihood method are put in as a 96-channel image with \(1 \times 208\) pixels (FADC sampling points). A CNN structure composed of three blocks of convolution, max-pooling, ReLU, Batch Normalization and Dropout layers [17, 18] are used for the pattern recognition in the waveform image. Two fully-connected layers composed by ReLU and Dropout layers are followed after the convolution block for the signal-to-background classification. The model is trained to minimize the binary cross-entropy loss function, with the values closer to 1.0 corresponding to signals and 0.0 to the backgrounds. The kernel size of the first convolution layer was chosen to be 11 for optimal performance of the signal-to-background classification.

In this study, we used the data samples of Michel electron events associated with cosmic muon and cosmic-induced fast neutrons for the signal-like and the background control sample, respectively, instead of Monte-Carlo simulation. The selected energy range is identical in terms of the IBD prompt signal definition (20–60 MeV). As a starting point of the PSD study, the event vertices around the center of the detector (\(r < 60\) cm and \(|z| < 60\) cm) were selected. We selected data files with odd ID numbers for the PDF construction and training the CNN model, and even ones for a performance evaluation.

Figure 3 shows the result of the PSD performance of the likelihood (top) and the CNN method (bottom), respectively. The likelihood method achieved fast neutron rejection of (\(97.4 \pm 0.5\))% in taking the likelihood score \(\ge 0\) with a signal efficiency of (\(94.2 \pm 2.6\))%. The performance of the CNN method was similar to the result of the likelihood method with a cut on the CNN score \(\ge 0.17\). Therefore, we have obtained close performance to the experimental requirement (99% rejection), given an impurity of the fast neutron control sample coming from accidental coincidence caused by cosmic-induced gamma-ray leading to a signal-like pulse shape. An understanding of the impurity will improve the fast neutron rejection efficiency in maintaining the signal efficiency.

The JSNS\(^2\) experiment performed data taking with the improvement against the dominant background for the sterile neutrino search from the physics run (January to June 2021), and obtained high statistics data equivalent to \(1.45 \times 10^{22}\) POT. According to the study on the PSD capability as a function of DIN concentration [19], further improvement on the PSD capability beyond the neutron rejection efficiency of 99% can be anticipated by an increase in DIN concentration from 8 to 10%, performed in June 2021 [9].

5 Summary

JSNS\(^2\) is a sterile neutrino search experiment aiming to directly test the positive result of the LSND experiment using a decay-at-rest neutrino source at the MLF and a Gd-LS detector.

We performed the commissioning run of ten days in June 2020. As a result of the IBD selection to the commissioning run data, we observed \(59 \pm 8\) correlated event candidates in the signal region of the sterile neutrino search. This is consistent with the cosmic-induced fast neutron rate measured based on the beam-off data.

In order to reduce the cosmic-induced fast neutron background, 8% in volume of DIN was dissolved into the Gd-LS to improve the PSD capability for first physics run starting January 2021. We also developed two independent PSD methods based on a likelihood using waveform PDFs and machine learning with a CNN. Using a control sample of Michel electrons and fast neutron events in the central region of the detector, we achieved a fast neutron rejection of (\(97.4 \pm 0.5\))% with a signal efficiency of (\(94.2 \pm 2.6\))% if we require a likelihood score \(\ge 0\). The CNN method shows a similar performance to the likelihood method with a requirement of the CNN score \(\ge 0.17\). Further improvements on the PSD capability can be expected with an increase in DIN concentration as well as an understanding of the impurity in the control samples. The JSNS\(^2\) experiment will start data taking with the improvements from the coming physics run in 2021/2022.