1 Introduction

The SABRE experiment seeks to detect the annual modulation of the dark matter interaction rate in NaI(Tl) crystals with sufficient sensitivity to test the long standing DAMA result in a model independent way. Among the best-motivated candidates to solve the dark matter puzzle [1], Weakly Interacting Massive Particles (WIMPs) [2] are thought to exist as a halo within which our Galaxy resides. Despite the decades-long effort of direct detection experiment searches for interactions of such dark matter candidates with different target materials, the DAMA experiment (short for DAMA/NaI and DAMA/LIBRA) [3] is the only one reporting a positive observation. The DAMA evidence of a clear annual modulation, which satisfies the criteria for a model-independent WIMP-induced signal [4], has reached a strong statistical significance of \(12.9~\sigma \) [5], after 20 years of data taking at the Gran Sasso National Laboratory (LNGS), in Italy. While the null result of experiments based on different target materials and other direct detection techniques [6,7,8,9,10,11,12,13] cannot be reconciled with such modulation in the simplest scenario of WIMP-nucleon elastic scattering, a NaI(Tl) annual modulation search similar to DAMA is still crucial to shed light on the origin of the positive signal [14].

Currently running NaI(Tl) dark matter experiments are COSINE-100 [15] at the YangYang Laboratory in South Korea, and ANAIS-112 [16] at the Canfranc Laboratory in Spain. They both feature a mass of about 100 kg of NaI(Tl) scintillating crystals in an underground setup and have so far neither confirmed nor fully disproved the DAMA observation [17,18,19]. Given the very low count rate and energy featured in dark matter nuclear recoils, its detection is challenged by background from radioactive decays, where the dominating component arises from impurities in the crystals themselves. One key feature of DAMA that has never been rivalled by competing experiments is the background rate which is as low as \(\sim \) 0.7 count/day/kg/keV in the energy range where the modulation is observed: [1–6] keV [5]. The amplitude of the observed modulation is at the level of 0.01 count/day/kg/keV. Consequently, background levels about two or three times higher than those of DAMA/LIBRA, such as achieved by the ANAIS and COSINE experiments, negatively affect their sensitivity. Indeed these experiments, even after several years of operation, might not be able to resolve all possible scenarios in interpreting the DAMA signal as a dark matter signature.

The SABRE concept and ambitious goal is to obtain an ultra-low background rate in the energy region of interest, namely of the order of 0.1 count/day/kg/keV, that is several times lower than the DAMA/LIBRA level. This challenging goal is achievable by developing high-purity crystals and operating them inside a liquid scintillator veto for active background rejection. This SABRE veto concept has also been adopted by the COSINE experiment.

The SABRE scientific program is currently in a Proof-of-Principle (PoP) phase at LNGS, to assess the radio-purity of the crystals, as well as the acceptance of the liquid scintillator veto. In the longer term, twin detectors of at least 50 kg of NaI(Tl) will be located in the northern and southern hemispheres, namely at LNGS and at the Stawell Underground Physics Laboratory (SUPL) in Australia, located 240 km north-west of Melbourne, where a section of an active gold mine is being converted into a laboratory. The twin locations will help to identify any possible contribution to the modulation from seasonal or site-related effects. The SABRE concept and the PoP setup are described in more detail in Ref. [20].

In this paper we focus on the initial characterization of a relatively large crystal, named NaI-33, that marks a breakthrough after a technical effort of several years. In Sect. 2 we recall the SABRE production strategy for high purity crystals; in Sect. 2.1 we give more details about the production of the crystal NaI-33; in Sect. 3 we describe the underground facility used for the measurements and we detail the data taking campaign; in Sect. 4 we show the main results on the crystal parameters and backgrounds and we also outline the analysis strategy we developed.

2 SABRE high purity NaI(Tl) crystals

For many years, the quest for high-purity NaI(Tl) crystals has focused on removing common impurities such as \(^{238}\)U, \(^{232}\)Th, \(^{40}\)K and \(^{87}\)Rb. One of the \(^{40}\)K decay channels includes an Auger electron or X-ray cascade with a total energy of \(\sim \)3.2 keV, which happens to be in the energy region where the DAMA modulation is observed. Consequently, a large effort was devoted to reduce, or being able to identify, the \(^{40}\)K background. Eventually, the potassium content in the NaI(Tl) powder and crystals was progressively lowered to the point where the background is now dominated by \(^{210}\)Pb and cosmogenic-activated isotopes [21], in particular \(^{3}\)H.

To keep the intrinsic contaminants at a very low level, SABRE developed a method to obtain ultra-pure NaI powder and a clean procedure to grow crystals. More details on this method can be found in Refs. [20, 22]. Princeton University and industrial partner Sigma-Aldrich produced ultra-high purity NaI powder, so-called Astro Grade powder, with \(^{39}\)K levels consistently lower than 10 ppb. The NaI(Tl) crystals were then grown by Radiation Monitoring Devices, Inc. (RMD), using the vertical Bridgman technique [23]. In this method, the powder is placed inside a sealed ampule, reducing the possibility of contamination during the growth phase. In 2015, we successfully grew at RMD a 2 kg crystal with an average \(^{39}\)K level of \(9\pm 1\) ppb and with \(^{87}\)Rb upper limit of 0.1 ppb measured by Inductively Coupled Plasma Mass Spectrometry (ICP-MS) [24, 25]. Based on these results, a simulation study of the background contributions expected for the SABRE PoP phase can be found in Ref. [26]. The study demonstrates the importance of reducing the background contributions from radioactive decays inside the crystal. The same conclusion can be drawn from the background model presented by COSINE and ANAIS experiments in [27, 28], verified against experimental data. However ICP-MS only permits testing for a subset of the backgrounds, namely primordial parents such as U, Th, K and Rb. Cosmogenic isotopes produced after the crystal growth, \(^{210}\)Pb and other non-primordial parents, if present, can only be measured by direct counting of the crystal as a scintillator in an underground setup. Consequently, the focus in the past years has shifted to the growth of a NaI(Tl) crystal with a sufficient mass to perform a physics run. We have started to acquire signals with a SABRE grade NaI(Tl) crystal for a preliminary background characterization reported in this paper. In the next few months we aim at a more comprehensive characterization with the PoP setup.

2.1 SABRE crystal NaI-33

At the end of October 2018, a new NaI(Tl) crystal, named NaI-33, was grown. The powder and the crucible were prepared by the Princeton group, and the crystal was grown in an oven at RMD. The crystal was then cut into an octagon and polished, obtaining a final mass of 3.4 kg. In order to determine the concentration of \(^{39}\)K, and consequently of \(^{40}\)K, in the final crystal, samples from the tip and the tail of the ingot were analyzed using ICP-MS at Seastar Chemicals. The average \(^{39}\)K concentration within the crystal volume was extrapolated to be 4.3 ± 0.2 ppb, several times lower than 20 ppb, the upper limit reported for the DAMA/LIBRA crystals [29]. Further details on the powder preparation, the crystal growth and the results on internal radioactivity can be found in Ref. [22].

During May 2019, the crystal NaI-33 was assembled in the SABRE detector module, developed for the PoP phase and described in Ref. [20]. Two Photomultiplier Tubes (PMTs) were directly coupled to each end of the crystal, while its lateral surface was wrapped with reflector, for higher light collection acceptance. For NaI-33, we used Polytetrafluoroethylene (PTFE) as a reflector and 3-inch Hamamatsu R11065-20 PMTs, which were specifically developed to have a low intrinsic radioactivity.Footnote 1 We purchased selected units with a quantum efficiency as high as \(\sim \) 35% at 420 nm. The detector assembly was placed inside an air- and light-tight copper enclosure. The cylinder, end-caps, and support rods are made out of low-radioactivity copper. The holders of the crystal and of the PMTs are made out of high-purity Delrin. Being that NaI is highly hygroscopic, all parts were thermally treated prior to assembly, to remove any residual moisture. For the same reason, the assembly procedure was performed entirely under dry nitrogen atmosphere inside a glove box. After sealing, the detector module was flushed with high purity 5.5 N\(_{2}\) gas to avoid humidity and radon. The detector module before the sealing of the copper enclosure can be seen in Fig. 1. In order to limit cosmogenic activation, the crystal was then shipped by surface transportation and delivered on 6 Aug 2019 at LNGS, where it was immediately stored underground.

Fig. 1
figure 1

NaI-33 assembly in the glovebox, ready for the insertion into the copper enclosure

3 Experimental setup at LNGS

The experimental SABRE setup at LNGS consists of two facilities: a shielded counting system in Hall B and the PoP facility in Hall C. Upon arrival of the NaI-33 crystal, the PoP was not yet ready for operations: consequently, for the first crystal characterization reported in this paper, we used our facility in Hall B. The facility consists of a passive shielding to reduce the \(\gamma \) environmental background, made of low radioactivity copper and lead. The shielding is enclosed in a Lexan box that can be sealed and flushed with high-purity nitrogen. In August 2019, we commenced data taking with detector NaI-33 and continued for approximately a year. This time was divided into two periods denoted as Run 1 and Run 2, interleaved by the refurbishment of the shielding performed at the end of February 2020. Shielding during Run 1 featured 5 cm of copper and a layer of lead at least 17.5 cm thick on all sides. The experimental volume inside the shielding allowed for the insertion of two detector modules, which was useful to acquire data with a radioactive source (see Sect. 4.5). During Run 2, the thickness of low radioactivity copper was increased to 10 cm, effectively reducing the background due to the surrounding lead layer. However, this was done at the expense of the experimental volume, that then could only host one detector module. Figure 2 shows the two setups.

Fig. 2
figure 2

Open passive shielding in the SABRE Hall B area. Left: Run 1 shielding. Right: Run 2 shielding with increased copper layer and detector module in position. The the anti-radon box is not shown here

We define the presence of a PMT pulse by discriminating the signal over an amplitude threshold corresponding to approximately 1/3 of photoelectron. Data acquisition is triggered by the logical AND of the two PMT pulses occurring within a window of 125 ns. For each event, we acquire the waveforms of each PMT, together with their amplified (\(\times \) 10) copies. Amplified signals are used for the low energy analysis and saturate above 500 keV energy. The PMT signals without amplification instead provide sensitivity to \(\alpha \) decays. Waveforms are digitized at 12 bit and 250 MS/s by CAEN v1720 modules. The acquisition window was set at 5 \(\upmu \)s, including a 1.5 \(\upmu \)s pre-trigger for baseline evaluation. The trigger rate was \(\sim \) 0.48 \(\hbox {s}^{-1}\).

An offline reconstruction program processes each event and computes physical quantities. PMT pulses are isolated within the acquisition window; the baseline is computed on the pre-trigger samples and subtracted; then the pulse integral is computed. The same operations are performed for each channel, amplified and not, and for the software sum of the two PMT pulses. The event energy estimator is the integral of the sum channel. Pulse Shape parameters are also computed during this phase and are useful to separate scintillation events in the crystal from PMT noise, as well as to distinguish \(\alpha \) from \(\beta \)/\(\gamma \) interactions. Pulse Shape parameters are discussed in detail in Sect. 4.4.

4 Data analysis and results

Data acquired during Run 1 were used to measure light yield and energy resolution (Sect. 4.1), and to determine the \(\alpha \) rate (Sect. 4.2); however, for the analysis of its evolution over time, all datasets were used. Run 1 included a campaign with a \(^{108\mathrm{m}}\)Ag source (see Sect. 4.5). Run 2 data were used for the background study. In order to build the low energy spectrum, we used an additional dataset, denoted as Run 2-lowRn. These data were acquired while flushing the setup with high-purity nitrogen and, consequently, with a reduced background contribution from Rn and its daughters. A small fraction of the datasets was not included in the analysis due to noisy experimental conditions related to activity in the laboratory. The live time for each run is reported in Table 1.

The characterization of the NaI-33 will proceed within the PoP setup in Hall C, which has recently been commissioned, and it will be the subject of a separate publication.

Table 1 Runs setup details including live time

4.1 Light yield and energy resolution

To characterize the crystal performance, we used a \(^{241}\)Am calibration source positioned adjacent to the copper enclosure and longitudinally aligned with the crystal centre. We repeated these source runs a few times throughout the data taking. The spectrum acquired with the source is shown in Fig. 3, including a gaussian fit to the 59.5 keV line of \(^{241}\)Am, on the spectrum built from the sum of the two PMT channels. The crystal light yield and the energy resolution are defined as

$$\begin{aligned} {\mathrm{LY}}= & {} \frac{\mu [{\mathrm{photoelectrons}}]}{E_{peak}[{\mathrm{keV}}]} \end{aligned}$$
(1)
$$\begin{aligned} \frac{{\mathrm{FWHM}}}{E}= & {} \frac{2.355~\sigma }{\mu } \end{aligned}$$
(2)

where \(\mu \) and \(\sigma \) are the mean and standard deviation from the gaussian fit.

The low tail of the \(^{241}\)Am peak is longer than its high tail due to a small internal crystal contamination of \(^{210}\)Pb. In fact \(^{210}\)Pb decays with 80.2% probability emitting a beta with maximum energy 16.9 keV and gamma/electron emissions of 46.5 keV, producing an excess around 50 keV. In order to avoid this bias, we evaluate the energy resolution fitting in an asymmetric interval around the peak of [620, 730] photoelectrons (phe). The resulting energy resolution at this energy is 13.2%. In order to compute the light yield we independently acquired the S.E.R. (Single Electron Response) spectra of each PMT with a 12-bit, 2.5 GS/s oscilloscope. The ratio of the fitted \(^{241}\)Am peak position to the S.E.R. returns a light yield of \(11.1 \pm 0.2\) phe/keV. The error takes into account the systematic uncertainty due to the fact that among calibration runs we could not possibly place the source at exactly the same position. The contribution from the fit to the S.E.R. spectrum was found to be negligible in comparison. For reference, the light yield obtained by other NaI(Tl) experiments ranges as 6–10 phe/keV (DAMA-PhaseII [31]), 14–15.5 phe/keV (COSINE [32]), 12.7–16 phe/keV (ANAIS [33]).

Fig. 3
figure 3

NaI-33 calibration spectrum with \(^{241}\)Am source. The gaussian fit to the 59.5 keV peak is shown in red

4.2 Measurement of the \(\alpha \) rate

The identification and counting of \(\alpha \) particles has two main aims: (1) a measurement of the internal radioactivity of the crystal in terms of U/Th contamination and \(^{210}\)Pb contamination out of secular equilibrium; (2) a measurement of the \(\alpha \) quenching factor in NaI.

Identification of \(\alpha \) particles exploits the Pulse Shape discrimination capability of NaI(Tl) crystals, where signals induced by \(\alpha \) decays are faster than \(\beta /\gamma \) ones. In particular, the amplitude-weighted mean time \(\langle t\rangle _{600}\) (defined in Sect. 4.4), gives excellent separation between \(\alpha \) particles and \(\beta /\gamma \) particles. This can be seen in Fig. 4 where above 200 keV energy our \(\alpha \) selection cut is also shown and gives 100% acceptance. To this end, the calibration of the \(\beta \) spectrum was performed using a \(^{228}\)Th \(\gamma \) source. Specifically we exploited the 239 keV line of \(^{212}\)Pb, the 583 keV and 2615 keV lines of \(^{208}\)Tl.

Fig. 4
figure 4

Distribution of mean time vs. energy of \(\alpha \) (red) and \(\beta \)/\(\gamma \) (black) events. The blue dotted line shows our typical separation cut

In order to evaluate the content of primordial radioactive chains in the crystal, we searched for fast \(\beta \)-\(\alpha \) sequences, called Bismuth-Polonium (Bi-Po). These are present in \(^{232}\)Th and \(^{238}\)U chains with atomic mass numbers of 212 and 214, respectively, each with a characteristic decay time. The Bi-Po-212 sequences can be identified in the same digitization windows, because of the very short lifetime of \(^{212}\)Po (431 ns). Since the lifetime of \(^{214}\)Po is 237 \(\upmu \)s, correlated Bi-Po-214 decays are instead found in event pairs occurring in close succession. Considering the rate of singles is 0.02 \(\hbox {s}^{-1}\) we computed the probability of accidental coincidences to be \(\sim 10^{-7}\) (\(\sim 10^{-5}\)) for the Bi-Po-212 (Bi-Po-214) selection.

We corrected the rates of Bi-Po-214 and Bi-Po-212 sequences for the acceptance of the time-distance cut, respectively 93% and 81%, and we estimated the activity of \(^{226}\)Ra and \(^{228}\)Th inside the crystal. These are \(5.9 \pm 0.6~\upmu \)Bq/kg and \(1.6 \pm 0.3~\upmu \)Bq/kg, respectively. These uncertainties are statistical only, and are dominant due to the very low number of coincident counts. If the assumption of secular equilibrium for \(^{238}\)U and \(^{232}\)Th held, such values would correspond to a part-per-trillion (ppt) level contamination, which is very close to our target [26].

With very few counts ascribed to long-lived radioactive chains, the \(\alpha \) spectrum is dominated by the decay of \(^{210}\)Po. Generally, this originates from an out-of-equilibrium contamination of \(^{210}\)Pb, accidentally introduced sometime during the manufacture of the crystal (e.g. due to an exposure to radon). We applied a loose energy selection around the \(^{210}\)Po peak in the \(\alpha \) spectrum and measured its activity over a period of a several months with results shown in Fig. 5. We observe an increase well described by a build-up function with the half-life of \(^{210}\)Po (138.4 days). The fit also includes a constant contribution due to other \(\alpha \) decays that satisfy the selection requirement on the energy. This contribution is computed from the result of the two Bi-Po analyses to be 0.03 mBq/kg, including also \(^{210}\)Po in equilibrium with \(^{226}\)Ra, and fixed in the fit. The date of the initial \(^{210}\)Pb contamination is freely determined by the fit to be 9 September 2018 with an uncertainty of 10 days. This date corresponds to the last days before the beginning of the growth process. The activity of out-of-equilibrium \(^{210}\)Po, and hence of \(^{210}\)Pb, reaches a plateau of \(0.51 \pm 0.02\) mBq/kg (statistical uncertainty). While such activity is still higher than our target, it is amongst the lowest ever reported after DAMA/LIBRA [27, 28].

Measuring the ratio of the \(^{210}\)Po peak position in the \(\beta \)-calibrated spectrum over the nominal \(\alpha \) energy of 5304 keV, we can estimate the quenching factor for \(\alpha \) particles as \(0.63 \pm 0.01\).

Fig. 5
figure 5

\(\alpha \) activity in \(^{210}\)Po energy region as a function of time with 5-days binning. The best fit function to the data (red solid line) is the sum of the build-up function with the mean life of \(^{210}\)Po (blue dotted line) and a constant component independently determined (orange dotted line). The black dashed line shows the activity of \(^{210}\)Po at equilibrium

4.3 Study of cosmogenic activation

The crystal NaI-33 arrived at LNGS on August 6th and it was exposed to cosmic rays (at sea level) for a total of 279 days. To identify the contribution from cosmogenic activated isotopes, we compared experimental spectra acquired at different time periods, namely 29 days, 78 days and 104 days after the arrival of the detector underground. The comparison is done in the energy regions (20–45) keV and (55–80) keV. These are the only intervals where a contribution from cosmogenic isotopes is visible in our experimental spectra. The energy region between 45 and 55 keV was not considered as it is dominated by \(^{210}\)Pb. The plots in Fig. 6 clearly show a decay of the count rate in the energy regions considered. We attribute this to the decay of the cosmogenic isotopes \(^{125}\)I (half-life: 59.4 days, main emission: 67.3 keV) and \(^{126}\)I (half-life: 12.9 days, main emission: 31.8 keV).

Fig. 6
figure 6

Fit to the experimental spectra acquired at different time periods: 29 days (blue), 78 days (red) and 104 days (green) after the arrival of the detector underground. The peak at 67.3 keV (top) is attributed to the decay of \(^{125}\)I, while the peak at 31.8 keV (bottom) is attributed to the decay of \(^{126}\)I and is visible only in the measurement at 29 days, given the short decay time

To verify this and quantify the activity of such isotopes, we fitted the two peaks in the three spectra (whenever the peak was visible) with a gaussian plus linear background function and extracted the number of counts in the peaks. This was then compared to the corresponding counts in the Monte Carlo simulated spectra of the isotopes under study. The simulated spectra take into account the decay of both \(^{125}\)I and \(^{126}\)I.

Table 2 Upper limit on the activity of Tellurium cosmogenic isotopes in the NaI-33 crystal, at the time of the crystal arrival underground. For each isotope we indicate the energy of the gamma emission whose \(\pm 3 \sigma \) interval in the spectrum was used to extract the activity limit

Concerning \(^{125}\)I, the ratios of experimental to simulated data are well in agreement for the spectra at 78 days and 104 days, but less so with the spectrum at 29 days. Indeed the value of the ratio is not constant as expected and this might indicate that another isotope with a different decay time is contributing to the peak at 67.3 keV. Considering the spread of the ratios as a measure of the uncertainty with which we can assess the presence of the isotope, this study indicates a \(^{125}\)I activity at the time of the crystal arrival underground between 1.0 and 1.7 mBq/kg.

A similar estimation was done for \(^{126}\)I. For this isotope we can only rely on the measurement at 29 days, given the short decay time. The number of counts in the 31.8 keV peak in such spectrum was compared to the corresponding counts in the Monte Carlo spectrum and the resulting activity at the time of the crystal arrival underground is (3.4 ± 0.5) mBq/kg.

Cosmogenic activation in NaI can also produce isotopes of Tellurium, such as \(^{121\mathrm{m}}\)Te, \(^{123\mathrm{m}}\)Te, \(^{125\mathrm{m}}\)Te, and \(^{127\mathrm{m}}\)Te. We could not directly identify the presence of those isotopes, as their characteristic gamma lines are either not visible on our experimental spectrum or hidden by the above-seen contributions from Iodine cosmogenic activated isotopes. However, by summing the experimental counts in a \(\pm 3 \sigma \) interval around the energy of an expected gamma line, and assuming the square root of the number of counts as the \(1 \sigma \) limit on the observable events, we can then compare to the Monte Carlo expectations and extract an upper limit on the activity of a given isotope, at the time of the crystal arrival underground. Table 2 reports the limits we calculated with this method on the experimental spectrum at 29 days.

Converting the cosmogenic activities we derived for Iodine and Tellurium isotopes into production rates at sea level, we obtain values that are in rough agreement with those measured by the ANAIS [21] and COSINE [34] experiments.

4.4 Pulse shape parameters

As reported by all NaI(Tl) experiments, the low energy spectrum is populated by events that do not show the correct pulse shape of scintillation light in the crystal, and must be efficiently discarded as noise. We developed a twofold selection strategy for this purpose, detailed in the next sections. More specifically, we report a cut-based method (Sect. 4.5) and a more sophisticated Boosted Decision Tree approach (Sect. 4.6). The results of these analyses are given in Sect. 4.7.

We define here the pulse shape variables common to both approaches; some of them are the same or similar to the ones also exploited by other NaI experiments [31, 33, 35]. In the definitions of the variables, \(h_i\) and \(t_i\) are the pulse amplitude in mV and the time of the \(i-\)th sample (starting from the trigger, \(t_0\)), respectively. \(h_{max}\) is the absolute value of the maximum pulse amplitude in mV. \(C_{(t_i,t_f)}\) is defined as the pulse area between \(t_i\) and \(t_f\) in nanoseconds. \(E_0\) and \(E_1\) correspond to the energy measured using the two PMTs in the 1000 ns following the trigger.

$$\begin{aligned} \text{ Amplitude } \text{ weighted } \text{ mean } \text{ time }\, \langle t \rangle _{600}= & {} \frac{\sum \limits _{t_i< 600\mathrm{~ns}} h_i t_i}{\sum \limits _{t_i<600\mathrm{~ns}} h_i} \end{aligned}$$
(3)
$$\begin{aligned} \text{ Charge } \text{ over } \text{ maximum } \, \mathrm{CoM}= & {} \frac{C_{(0,1000)}}{h_{max}} \end{aligned}$$
(4)
$$\begin{aligned} \text{ Tail-to-total } \text{ pulse } \text{ shape } X_{1}= & {} \frac{C_{(100,600)}}{C_{(0,600)}} \end{aligned}$$
(5)
$$\begin{aligned} \text{ Head-to-total } \text{ pulse } \text{ shape } X_{2}= & {} \frac{C_{(0,50)}}{C_{(0,600)}} \end{aligned}$$
(6)
$$\begin{aligned} \text{ Asymmetry } \, A= & {} \, \frac{E_0-E_1}{E_0+E_1} \end{aligned}$$
(7)
$$\begin{aligned} \text{ Head-to-middle } \text{ pulse } \text{ shape }\, C_{2}/C_{1}= & {} \frac{C_{(200,400)}}{C_{(0,200)}} \end{aligned}$$
(8)
$$\begin{aligned} \text{ Middle-to-tail } \text{ pulse } \text{ shape } \,C_{3}/C_{2}= & {} \frac{C_{(400,600)}}{C_{(200,400)}} \end{aligned}$$
(9)
$$\begin{aligned} \text{ Time } \text{ Variance } \, \mathrm{Var}[t]= & {} \langle t^2 \rangle _{600} - \langle t \rangle ^2_{600} \end{aligned}$$
(10)
$$\begin{aligned} \text{ Skewness } \, \mathrm{Skw}[t]= & {} \frac{\langle t^3 \rangle _{2000}}{(\langle t^2 \rangle _{2000})^{3/2}} \end{aligned}$$
(11)
$$\begin{aligned} \text{ Kurtosis } \, \mathrm{Krt}[t]= & {} \frac{\langle t^4 \rangle _{2000}}{(\langle t^2 \rangle _{2000})^2} -3 \end{aligned}$$
(12)
$$\begin{aligned} \text{ Combination } \text{ of }\, X_1 \hbox {and} X_2\, \mathrm{ES}= & {} \frac{1-(X_{2}-X_{1})}{2} \end{aligned}$$
(13)
$$\begin{aligned} \text{ Combination } \text{ of } \text{ Skw[t] } \text{ and } \text{ Krt[t] } \, \mathrm{SK}= & {} (a \cdot \mathrm{Skw}[t] - b \cdot \mathrm{Krt}[t]- c)^2 \nonumber \\&\quad + (\mathrm{Krt}[t] - d)^2 \end{aligned}$$
(14)

where \(a = 30\), \(b = 5.2\), \(c = 50\), and \(d = 2\), as fixed from the analysis of the \(^{108m}\)Ag run data. In addition to the variables defined above, we define a Kolmogorov–Smirnov distance from an \(^{241}\)Am reference pulse, and the number of clusters (\(\mathrm{NC}\)). A cluster is defined by the waveform exceeding an amplitude threshold corresponding to \(\sim \)1 phe. Accepting only events with \(\mathrm{NC} > 2\) in each of the PMTs, we eliminate accidental coincidences of single photo-electrons in opposing PMTs and events originated in the PMTs due to their intrinsic radioactivity.

4.5 Cut-based analysis

We define here the selection criteria adopted for low energy noise rejection, tuned on data acquired with a \(^{108\mathrm{m}}\)Ag source.

The \(^{108\mathrm{m}}\)Ag source produces three \(\gamma \) rays of 434 keV, 614 keV, and 723 keV. We exploit these simultaneous emissions by placing a commercial NaI detector along NaI-33 inside the shielding (see Fig. 7). Requiring coincidence between the two detectors within 125 ns and using a copper layer as energy degrader between the source and the NaI-33 crystal, we obtain a continuum of low energy scintillation events. We have evaluated the contamination of the sample by accidental coincidences and found it to be negligible.

Fig. 7
figure 7

Schematic view of the experimental setup used for the \(^{108\mathrm{m}}\)Ag source campaign. It is shown the passive shielding of lead and copper, with the NaI-33 detector module and the commercial NaI detector. \(\gamma \) rays emitted from the \(^{108\mathrm{m}}\)Ag source, identified by a red circle, are degraded via a copper layer

We apply cuts on the variables defined in Eqs. 3101314, on the number of clusters and on the Kolmogorov-Smirnov distance. The selection criterion on each variable is chosen to conserve at least 95% of signal events with energy within (1–20) keV (with the exception of Eq. 14, where we preserve only 68% of events). We show in Fig. 8 the distributions of a few of the shape parameters in the source data. The values corresponding to the selection criteria are shown as red lines.

The combined acceptance of all the cuts as a function of energy and the resulting low-energy spectrum will be presented below (Sect. 4.7) in comparison with those obtained with the boosted decision tree analysis which will now be described.

Fig. 8
figure 8

Distributions of some of the shape parameters in the data with a \(^{108\mathrm{m}}\)Ag source. Red lines indicate the thresholds of the selection criteria

4.6 Boosted decision tree analysis

In order to improve the separation between noise and scintillation events, we implemented a multivariate analysis using Boosted Decision Trees (BDT) [36, 37], with a focus on low energy events up to 100 keV.

The BDT algorithm has to be trained using a sample of true scintillation events as “signal” and a sample of noise event as “background”. For the signal training we used the \(^{108\mathrm{m}}\)Ag data, selecting the events in coincidence with the commercial crystal, weighted in order to match the energy distribution expected for the crystal background.

For the background training we used NaI-33 data in the 1–10 keV energy region, which is dominated by noise. We evaluate from the cut-based analysis that the contamination of signal events in the background training set is about 5% and consider this bias reasonably small. As cross-check, we also trained the BDT with a background set obtained by inverting the selection described in Sect. 4.5, achieving comparable results.

The analysis was performed by training two separate BDTs, which we refer to as \(\hbox {BDT}_1\) and \(\hbox {BDT}_2\). The input variables for \(\hbox {BDT}_1\) are those defined in Eqs. 310, while for \(\hbox {BDT}_2\) we add also Skewness and Kurtosis (11 and 12), similarly to what is done in [15]. As for the cut-based analysis, we evaluate the selection on the BDT variables using the data with the \(^{108\mathrm{m}}\)Ag source.

The BDT classifier output is defined in the range \([-1,1]\) where lower scores are assigned to noise-like events and higher scores to signal-like events. Figure 9 shows the output as a function of the energy for the source data.

We define an energy-dependent threshold for the two BDTs defined as following:

$$\begin{aligned} \left\{ \begin{array}{@{}l@{ }l} BDT_{1(2)}> \frac{a_{1(2)}}{\log _{10}(20)} \log _{10}(E) - b_{1(2)} &{}: \text {if } (1\le E <20)\text { keV} \\ BDT_{1(2)} > c_{1(2)} &{}: \text {if } E \ge 20\text { keV} \end{array} \right. \nonumber \\ \end{aligned}$$
(15)

where \(a_i\), \(b_i\) and \(c_i\) are chosen in order to ensure an acceptance for each BDT of \(\sim \)50% (99.8%) in the energy range [1–3] keV ([20–100] keV).

Fig. 9
figure 9

\(\hbox {BDT}_1\) (top) and \(\hbox {BDT}_2\) (bottom) classifiers output for the \(^{108\mathrm{m}}\)Ag data, with superimposed energy dependent threshold in red

We then combine the two BDT algorithms by selecting as scintillation events only those passing both thresholds, with the results described in the next section.

4.7 Comparison of analysis procedures

The results of the cut-based analysis and the BDT analysis are shown in Figs. 10 and 11. Figure 10 compares the acceptance as a function of energy below 60 keV. The acceptance is measured using the \(^{108\mathrm{m}}\)Ag data, by computing the fraction of events passing the selection. Figure 11 shows the low energy spectrum before and after noise rejection. This spectrum corresponds to a live time of 58 days. The spectra obtained with the two methods are corrected for their respective acceptance. After noise rejection the average count rate in the 5–20 keV energy range is close to 1 count/day/kg/keV. The count rate below 5 keV will be evaluated operating the crystal in the PoP setup with more favourable noise conditions.

The multivariate analysis shows a promising performance: the acceptance is significantly higher with respect to the cut-based approach. At the same time, the energy spectra of events selected with the two methods, after being corrected for the acceptance, compare well above 8 keV whereas at lower energies the BDT approach grants a slightly higher noise rejection.

Fig. 10
figure 10

Acceptance vs energy for the cut-based analysis and the multivariate BDT analysis

Fig. 11
figure 11

NaI-33 low energy spectrum before (black dashed line) and after noise rejection. We show the result of the cut-based (blue points) and multivariate BDT (red points) analyses. The blue and red histograms are corrected by the respective cut acceptance

5 Conclusions

The NaI-33 crystal is the most radiopure crystal produced so far by the SABRE collaboration, with a size suitable for the PoP phase and close to the target of 5 kg for the full-scale experiment. The \(^{39}\)K content measured with ICP-MS is \(4.3\pm 0.2\) ppb, significantly lower than all values reported by experiments DAMA, which quotes a 20 ppb upper limit [29], COSINE-100 and ANAIS-112, which have both 17 ppb at best [27, 28]. We reported here the results from a first characterization performed using the SABRE passive shielding setup in Hall B of LNGS.

The crystal scintillation properties were measured with the 59.5 keV line of an \(^{241}\)Am source. Our detector shows a competitive light yield, as high as \(11.1 \pm 0.2\) phe/keV, and an energy resolution of 13.2% (FWHM/E).

The presence of trace elements from primordial radioactive chains was evaluated using Bi-Po delayed coincidence analysis and found to be at the part-per-trillion (ppt) level. However, \(^{210}\)Pb out of secular equilibrium is observed, and is one of the major sources of background at low energy for NaI(Tl) experiments. We studied the build-up of \(^{210}\)Po over time which allowed us to estimate the present activity of \(^{210}\)Pb to be \(0.51 \pm 0.02\) mBq/kg. We also measured an \(\alpha \) quenching factor of \(0.63\pm 0.01\) at the \(^{210}\)Po decay energy \(E_{\alpha }=5304\) keV.

We estimated the activity of Iodine cosmogenic isotopes \(^{125}\)I (1.0–1.7 mBq/kg) and \(^{126}\)I (\(3.4 \pm 0.5\) mBq/kg) at the time of the arrival of crystal NaI-33 underground, by studying the decay of the count-rate in the corresponding gamma lines and comparing with Monte Carlo simulations. Only upper limits on the activity of Tellurium cosmogenic isotopes could be derived, as no characteristic gamma emissions from these isotopes were visible in our spectra.

We employed two pulse-shape-driven analysis techniques to reject noise at low energy: a cut-based analysis and a multivariate Boosted Decision Tree. The latter shows a significantly higher acceptance. The acceptance-corrected energy spectra obtained with the two methods are found to be compatible above \(\sim \)8 keV. Below this energy, the spectra are similar though the multivariate analysis shows a slightly better noise rejection. Part of the rate can be attributed to residual noise events, leaving room for improvements in the rejection in the region around and below 5 keV.

We will continue the characterization of crystal NaI-33 in the PoP setup, equipped with the active veto system. The goal in this phase will be to perform direct counting of decays from \(^{40}\)K and possibly \(^{22}\)Na, and to build a complete background model. The commissioning was successfully completed in July 2020 and the measurement of crystal NaI-33 inside the liquid scintillator started in August.

The results already achieved in the passive shielding and presented in this work show that the average interaction rate in the 5–20 keV energy range is close to 1 count/day/kg/keV. This, and other results presented in this paper, show a breakthrough achievement for the SABRE project. The PoP setup is expected to further improve the current understanding of the NaI-33 background toward the SABRE full scale experiment design.