Abstract
CUORE Upgrade with Particle IDentification (CUPID) is a foreseen ton-scale array of Li2MoO4 (LMO) cryogenic calorimeters with double readout of heat and light signals. Its scientific goal is to fully explore the inverted hierarchy of neutrino masses in the search for neutrinoless double beta decay of 100Mo. Pile-up of standard double beta decay of the candidate isotope is a relevant background. We generate pile-up heat events via injection of Joule heater pulses with a programmable waveform generator in a small array of LMO crystals operated underground in the Laboratori Nazionali del Gran Sasso, Italy. This allows to label pile-up pulses and control both time difference and underlying amplitudes of individual heat pulses in the data. We present the performance of supervised learning classifiers on data and the attained pile-up rejection efficiency.
Similar content being viewed by others
Avoid common mistakes on your manuscript.
1 Introduction
Neutrinoless double beta decay (\(0\nu \beta \beta\)) is a posited lepton number violating process that would probe the Majorana nature of neutrinos. Sensitive searches for such decay have started exploring the inverted hierarchy region of neutrino masses [1,2,3,4,5]. To do so, they reached high mass of candidate emitter nuclei, low background radioactivity, good energy resolution. Cryogenic calorimeters yielded some of the best energy resolutions in \(0\nu \beta \beta\) searches to date [1, 6,7,8].
The CUORE experiment has reached the ton scale of detector mass, an energy resolution of \(\sim 7.8\,\)keV and a background index in the region of interest for \(0\nu \beta \beta\) of \(1.4\times 10^{-2}\) counts/(keV kg year), dominated by partly contained \(\alpha\) decays [1]. CUORE Upgrade with Particle Identification (CUPID) aims at fully exploring the inverted hierarchy region of neutrino masses, corresponding to half-life sensitivities of \(> 10^{27}\,\)year [9]. It will use the successful cryogenic infrastructure that hosts CUORE to deploy a ton-scale array of Li\(_2\)MoO\(_4\) (LMO) scintillating crystal bolometers equipped with Ge bolometer light detectors (LD) to achieve simultaneous readout of the phonon and scintillation signals [10]. The effectiveness of the double readout technique was proved by the CUPID-0 [6, 11, 12] and CUPID-Mo [7] experiments and will bring the expected background down to \(10^{-4}\) counts/(keV kg year). The nearly simultaneous occurrence of separate \(2\nu \beta \beta\) decay events in the same crystal (pile-up) within the time resolution of the detector is an irreducible source of background that needs to be mitigated in the design phase. Given the short half life of \(^{100}\)Mo of \(^{100}\mathrm {T}_{1/2} = 7.1 \times 10^{18}\) year and assuming \(\sim 300\) g mass, \(100\%\) enrichment in \(^{100}\)Mo and a 1 ms resolving time the expected background contribution from pile-up events is \(3.5\times 10^{-4}\) counts/(keV kg year) [9].
CUPID will use neutron-transmutation-doped (NTD) thermistors to read out the phonon signal on both LMOs and LDs. The rise time of thermal pulses is a function of the NTD working resistance, which is a function of temperature, and the capacitance of the readout line. A campaign of measurements was carried out in the Laboratori Nazionali del Gran Sasso (LNGS) to study the time resolution for pile up discrimination of LMO crystals. An analysis based on optimal filtering (OF) [13] of the reconstructed pulses showed a \(>90\%\) pile-up discrimination efficiency down to \(\Delta t \sim 1\) ms [14]. This work is aimed at exploring the effectiveness of alternative analysis approaches. We analyze the same data with a deep learning classification algorithm.
2 Measurement
An array of 8 cubic LMO detectors \(45 \times 45 \times 45\) mm\(^3\) arranged in a tower of 2 floors of 4 crystals each was operated at \(\sim 18\) mK temperature in the Hall C facility at LNGS between summer 2019 and spring 2020. Each crystal faced two light detector Ge disks; the crystals on the bottom floor were wrapped in a reflecting foil. The shape and size of LMO crystals resemble the envisioned CUPID design. Further details on the experimental setup can be found in [14]. Three LMO crystals were equipped with a functioning Si heater, which was driven by an arbitrary function generator (Tektronix AFG1062). All detectors were equipped with NTD thermistors, biased with a constant current. The working resistance of the NTDs was in the \((10{-}50)\) M\({\Omega }\) range. The typical rise time of pulses was \(\sim 15\) ms. Each NTD was low-pass filtered (Bessel-Thomson) with a cutoff frequency of 63 Hz and the voltage at its ends continuously digitized and stored on disk at 2 kHz sampling frequency. A derivative trigger algorithm was used to identify pulses. Triggered pulses were enclosed in 5 s windows, with 1 s time before the trigger (pre-trigger) used as a proxy for the temperature before the pulse. Random triggers were fired throughout the data taking to collect noise samples.
We performed reference heater pulses generation paying special attention to the accurate reproduction of the pulse rise time with respect to events originating from natural radioactivity (physics pulses). We excited the Si heaters with waveforms \(w(t;A,\tau )\) of square, triangular (sharp rise, linear fall), exponential shape with given voltage amplitude A and typical time \(\tau\). We found that despite their different origin, triangular-shaped heater excitations best reproduced the rising edge of physics pulses [14]. We tuned the A and \(\tau\) parameters of each waveform to reconstruct in the \((1{-}2.5)\) MeV energy range and as close as possible to the mean of the rise time distribution of physics pulses. The sample of reference (single) heater pulses mimics the detector response to single decay signal events. The time between one pulse generation and the following one was set to 15 s, in order to let the detector return to the initial condition. To generate heater pile-up pulses at a time distance \(\Delta t\), we superimposed excitations of the Si heater as
Let \(\alpha = A_1/A_2\) be the amplitude ratio between the excitations. In order to explore amplitude ratios in the range \(\alpha = 0.24{-}1.4\), we fixed \(A_1 = 170\) mV and varied \(A_2 = (40{-}240)\) mV in steps of 50 mV. We explored \(\Delta t = (1{-}40)\) ms. For each doublet \((\alpha ,\Delta t)\), we collected a sample of \(\sim 100\) pulses.
3 Data Analysis
The data analysis can be divided into two steps, a low-level processing and high-level analysis. The former is aimed at selecting a clean sample of heater pulses; the latter is the actual pile-up discrimination analysis. This study closely follows [8] as far as the low-level processing is concerned and implements machine learning algorithms in the high level analysis step.
The low-level processing acts on 5 s time windows (waveforms) around triggered events. Each waveform is filtered with an OF [13]. The energy of each event is obtained calibrating the amplitude of filtered pulses with a \(^{232}\)Th source. We reject waveforms that contain more than one triggered pulse, or where the pre-trigger slope is not compatible with 0. To discriminate heater from physics pulses, we first select events according to their timestamp. Then, for each pulser configuration, we further restrict our sample of heater pulses to those that fulfill
where E is the reconstructed energy, \(E_{\mathrm{med}}\) is the median reconstructed energy and \(E_{\mathrm{MAD}}\) is the median absolute deviation. Equivalent results can be obtained replacing the energy variable in Eq. (2) with the raw amplitude of the pulse before filtering, effectively decoupling the deep learning algorithm from other steps of the analysis. Since the rate of physics pulses is much lower than the one of heater pulses, this procedure is effective in removing the outliers of the heater pulse distribution due to either natural radioactivity, unidentified pile-up events or pulses produced in unstable detector conditions.
The high-level analysis is based on the Keras [15] implementation of a convolutional neural network (CNN) classifier algorithm. The CNN was designed to read full windows of raw pulses, before any OF is applied. We pre-process the data by subtracting the average sampled voltage in the pre-trigger part of the pulse (baseline) and linearly scaling individual pulses to obtain unit amplitude.
The CNN includes 10 deep convolutional blocks, each made of 8 filters 20 samples long with a ReLU activation function, followed by a MaxPooling layer with a factor 2 subsampling. A fully connected feed-forward classifier layer connects the output layer with a Softmax activation function. Training was performed minimizing a categorical cross-entropy loss function for 200 epochs with the Adam algorithm [16] and a constant learning rate of \(10^{-4}\). For a comprehensive review on machine learning, see [17] and references therein.
Due to the small size of the available dataset, we rely on cross-validation techniques [18] to assess the performance of the CNN model to classify pile-up events. We use a fivefold cross-validation. We randomize the data and split them into 5 non-overlapping subsets. From each of the subsets, we retain a \(20\%\) fraction of events to assess the model performance after training is completed (test set). We further split the remaining part to build a training set \((80\%)\) and a validation set \((20\%)\). We train independently 5 identical CNN models with the same choice of hyperparameters (learning rate, number of epochs, etc.) and architecture (layer structure). We optimize hyperparameters keeping the test sets blinded. The output layer of the CNN returns a classifier score in the (0, 1) range. For each data subset, we compute the predicted score on the test set using the corresponding CNN model. We compare the prediction with the label assigned to each test pulse according to its timestamp. We evaluate the classification efficiency as a function of \(\Delta t\) as the ratio between the number of correctly labeled events over the total number of events. We fit the classification efficiency curve (Fig. 1) with an exponential saturation function
and define the pile-up time resolution of each channel as the \(\Delta t > 0\) at which the \(90\%\) classification efficiency is reached.
4 Results
We present the performance of a new technique based on a deep learning classifier to discriminate pile-up events in cryogenic LMO bolometers. The classifier is trained on single-pulse and pile-up samples generated exciting a Si heater with a programmable function generator and tested on an independent sample of events generated with the same technique. We achieve a \(>90\)% reconstruction efficiency of signal events and a \(>90\)% rejection efficiency for \(\Delta t > 1{-}2\) ms depending on the channel. The deep-learning and OF based pile-up rejection approaches yield so far equivalent results [14].
This work is part of a technical optimization campaign of the CUPID detector design and data processing techniques. A detailed Monte Carlo simulation of the detector response incorporating pile-up rejection is underway to disentangle the interplay of parameters such as the noise level, sampling and Bessel cutoff frequencies, pulse rise time and signal-to-noise ratio. Preliminary results indicate that similar performances can be achieved for physics pulses. In future measurements, we plan to improve the cryogenic stability, noise conditions and excitation methods to better reproduce both the rising and falling edge of physics pulses and eventually explore smaller pile-up time differences.
Data Availability
The datasets generated during and/or analyzed during the current study are available from the corresponding author on reasonable request.
Change history
14 September 2022
Missing Open Access funding information has been added in the Funding Note.
References
D.Q. Adams et al. (CUORE Collaboration), Phys. Rev. Lett.124 (2020) 12, 122501. https://doi.org/10.1103/PhysRevLett.124.122501
M. Agostini et al., Final results of gerda on the search for neutrinoless double-\(\beta\) decay. Phys. Rev. Lett. 125, 252502 (2020). https://doi.org/10.1103/PhysRevLett.125.252502
S.I. Alvis et al., A search for neutrinoless doublebeta decay in 76Ge with 26 kg-yr of exposure from the MAJORANA DEMONSTRATOR. Phys. Rev. C 100, 5501 (2019). https://doi.org/10.1103/PhysRevC.100.025501. arXiv:1902.02299
G. Anton et al., Search for neutrinoless double-\(\beta\) decay with the complete EXO-200 dataset. Phys. Rev. Lett. (2019). https://doi.org/10.1103/PhysRevLett.123.161802. arXiv:1906.02723
Gando, A. et al. Search for Majorana Neutrinos near the Inverted Mass Hierarchy Region with KamLAND-Zen. Phys. Rev. Lett.117, 082503, https://doi.org/10.1103/PhysRevLett.117.082503 (2016). [Addendum: Phys.Rev.Lett. 117, 109903 (2016)], 1605.02889
O. Azzolini et al., Final result of CUPID-0 Phase-I in the search for the \(^{82}\)Se neutrinoless double-\(\beta\) decay. Phys. Rev. Lett. 123, 032501 (2019). https://doi.org/10.1103/PhysRevLett.123.032501
E. Armengaud et al., New limit for neutrinoless double-beta decay of \(^{100}\)Mo from the CUPID-Mo experiment. Phys. Rev. Lett. 126(18), 181802 (2021). https://doi.org/10.1103/PhysRevLett.126.181802
A. Armatol et al. [CUPID], Characterization of cubic Li\(_{2}^{100}\)MoO\(_4\) crystals for the CUPID experiment. Eur. Phys. J. C 81(2), 104 (2021). https://doi.org/10.1140/epjc/s10052-020-08809-8. arXiv:2011.13656 [physics.ins-det]
W.R. Armstrong et al. [CUPID], CUPID pre-CDR. arXiv:1907.09376 [physics.ins-det]
A. Armatol et al. [CUPID and CROSS], A CUPID Li\(_{2}\)\(^{100}\)MoO\(_4\) scintillating bolometer tested in the CROSS underground facility. JINST 16(2), P02037 (2021). https://doi.org/10.1088/1748-0221/16/02/P02037. arXiv:2011.13806 [physics.ins-det]
O. Azzolini et al., CUPID-0: a double-readout cryogenic detector for double beta decay search. Nucl. Instrum. Methods A 958, 162441 (2020). https://doi.org/10.1016/j.nima.2019.162441
O. Azzolini et al., Evidence of single state dominance in the two-neutrino double-\(\beta\) decay of \(^{82}\)Se with CUPID-0. Phys. Rev. Lett. 123(26), 262501 (2019). https://doi.org/10.1103/PhysRevLett.123.262501
E. Gatti, P.F. Manfredi, Processing the signals from solid state detectors in elementary particle physics. Riv. Nuovo Cim. 9(1), 1–146 (1986). https://doi.org/10.1007/BF02822156
A. Armatol et al. [CUPID], Novel technique for the study of pileup events in cryogenic bolometers. Phys. Rev. C 104(1), 015501 (2021). https://doi.org/10.1103/PhysRevC.104.015501. arXiv:2011.11726 [physics.ins-det]
F. Chollet et al., Keras (2015), https://keras.io
D.P. Kingma, J. Ba, Adam: a method for stochastic optimization. arXiv:1412.6980 [cs.LG]
I. Goodfellow, Y. Bengio, A. Courville, Deep Learning (MIT Press, Cambridge, 2016)
S. Arlot, A. Celisse, A survey of cross-validation procedures for model selection. Stat. Surv. 4, 40–79 (2010). https://doi.org/10.1214/09-SS054
Acknowledgements
The CUPID Collaboration thanks the directors and staff of the Laboratori Nazionali del Gran Sasso and the technical staff of our laboratories. This work was supported by the Istituto Nazionale di Fisica Nucleare (INFN); by the European Research Council (ERC) under the European Union Horizon 2020 program (H2020/2014-2020) with the ERC Advanced Grant No. 742345 (ERC-2016-ADG, project CROSS) and the Marie Sklodowska-Curie Grant Agreement No. 754496; by the Italian Ministry of University and Research (MIUR) through the grant Progetti di ricerca di Rilevante Interesse Nazionale (PRIN 2017, Grant No. 2017FJZMCJ); by the US National Science Foundation under Grants No. NSF-PHY-1401832, No. NSF-PHY-1614611 and No. NSF-PHY-1913374. This material is also based upon work supported by the US Department of Energy (DOE) Office of Science under Contracts No. DE-AC02- 05CH11231 and NO. DE-AC02-06CH11357; and by the DOE Office of Science, Office of Nuclear Physics under Contracts No. DE-FG02-08ER41551, No. DE-SC0011091, No. DESC0012654, No. DE-SC0019316, No. DE-SC0019368, and No. DE-SC0020423. This work was also supported by the Russian Science Foundation under Grant No. 18-12-00003 and the National Research Foundation of Ukraine under Grant No. 2020.02/0011. This research used resources of the National Energy Research Scientific Computing Center (NERSC). This work makes use of both the DIANA data analysis and APOLLO data acquisition software packages, which were developed by the CUORICINO, CUORE, LUCIFER and CUPID-0 Collaborations.
Funding
Open access funding provided by Università degli Studi di Roma La Sapienza within the CRUI-CARE Agreement.
Author information
Authors and Affiliations
Corresponding author
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Fantini, G., Armatol, A., Armengaud, E. et al. Machine Learning Techniques for Pile-Up Rejection in Cryogenic Calorimeters. J Low Temp Phys 209, 1024–1031 (2022). https://doi.org/10.1007/s10909-022-02741-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10909-022-02741-9