1 Introduction

Neutrinoless double-\(\beta \) (\(0\nu \beta \beta \)) decay is a hypothetical process in which two neutrons in a nucleus are transformed simultaneously into two protons with the emission of only two electrons. Such a process violates lepton number conservation and requires the neutrino to be its own antiparticle (Majorana particle). In combination with cosmological observations and direct neutrino mass measurement, a non-zero \(0\nu \beta \beta \) decay rate would highly constrain the standard light, left-handed neutrino exchange mechanism via the effective Majorana neutrino mass or shed light on alternative processes [1].

The highest half-life sensitivity to \(0\nu \beta \beta \) decay requires the experiments to achieve large target mass, high detection efficiency, good energy resolution and most complete elimination of background at the Q-value of the decay (\(Q_{\beta \beta }\)). The goal of the GERmanium Detector Array (Gerda) experiment was to realize a background-freeFootnote 1 experiment for the first time. Gerda was located at the underground Laboratori Nazionali del Gran Sasso (LNGS) of INFN, Italy. Gerda used up to 43 kg of high purity germanium (HPGe) detectors enriched in the candidate isotope \(^{76}\)Ge up to 88%. They ensure high detection efficiency, low intrinsic background and excellent energy resolution. The bare HPGe detectors were operated in liquid argon (LAr), which served as cooling medium and as active shield against environmental backgrounds at \(Q_{\beta \beta } = 2039\) keV. The details of the experimental setup and its upgrade from Phase I to Phase II can be found elsewhere [2, 3].

The Phase II data taking took place between December 2015 and November 2019 with an upgrade of the detector array and the surrounding LAr instrumentation in 2018. Gerda operated three types of enriched HPGe detectors arranged in an array of 7 strings: 7 semi-coaxial detectors, referred to as coaxial detectors for brevity, from Phase I with a total mass of 15.6 kg; 30 Broad Energy Germanium (BEGe) detectors (20.0 kg) [4, 5]; and 5 inverted coaxial (IC) detectors (9.6 kg) [6, 7], which were installed in summer 2018. The accumulated exposure, product of total detector mass and respective livetime, amounts to 103.7 kg year for Phase II. In order to avoid bias in the event selection criteria, Gerda followed a strict blinding strategy, where events within a \(Q_{\beta \beta }\pm 25\) keV energy window were processed only after the analysis had been finalized.

2 Signal formation, readout and processing in germanium detectors

Gerda used p-type HPGe diodes of three different geometries called coaxial, BEGe and inverted coaxial (Fig. 1). They all have a relatively thick ([0.8–2.6] mm) n\(^{+}\) electrode, formed by lithium diffusion, and a thin (\(\sim \)300 nm) p\(^{+}\) electrode created by boron implantation. The p\(^{+}\) and n\(^{+}\) electrodes are separated by a groove with non-conducting surface. Because of their small p\(^{+}\) electrode the BEGe and IC detectors belong to the class of point-contact detectors [12] which exhibits intrinsic performance advantages with respect to energy resolution and pulse shape discrimination.

The Ge detectors are operated under reverse bias voltage such that almost the entire volume is depleted of free charge carriers. An interaction in the active volume creates a number of electron-hole pairs proportional to the deposited energy. The charge carriers drift according to the electric field created by both the positive bias voltage applied to the n\(^{+}\) contact and the volume charge density due to the net bulk impurity concentration. The electrons are collected at the n\(^{+}\) contact, the holes at the p\(^{+}\) contact which is used for readout. The n\(^{+}\) layer covers most of the crystal surface and features a large Li concentration. Exhibiting zero electric field beyond the p-n junction till the outer surface, a charge created in this layer will only experience thermal diffusion with two possible outcomes: recombine (loss) or reach the depleted volume (collection). As a result, two generic regions can be identified, the dead layer with high probability of no charge collection and a transition layer with partial charge collection.

During the drift of charge carriers, charge is induced on the readout contact as described by the Shockley–Ramo theorem [13]:

$$\begin{aligned} Q(t) = -Q_0 \left[ \Phi ^w(\mathbf {r}_h(t)) - \Phi ^w(\mathbf {r}_e(t)) \right] , \end{aligned}$$
(1)

where \(Q_0\) is the total charge carried by the holes or electrons and \(\Phi ^w(\mathbf {r}_{h/e}(t))\) is the weighting potential along the drift path of holes or electrons. The weighting potential is shown in Fig. 1 for the three different detector geometries used in Gerda also indicating the geometry of the p\(^+\) and n\(^+\) surfaces. The weighting potentials have been calculated using the AGATA Detector Library [14] pulse shape simulation package.

Due to their small p\(^+\) contact, BEGe and IC detectors have a weighting potential distribution that is very small in most of the volume and sizable only close to the p\(^+\) contact. This results in similar waveforms Q(t) from interactions in a large part of the volume. Multiple energy deposits can be treated as a superposition of single interactions. In Fig. 2, normalized example pulses from a BEGe detector are shown for a SSE, a MSE, an event close to the p\(^+\) contact and an event near the dead layer of the n\(^+\) contact with incomplete charge collection. As shown, surface events produce characteristic pulse shapes being fast close to the p\(^+\) contact due to the strong electric field and slow near the n\(^+\) contact due to the weak electric field and the transition layer.

In coaxial detectors, both the hole and electron drift play a role in the pulse formation, which result in different pulse shapes throughout the volume of the detector. In Fig. 3, simulated example pulses from different parts of the detector are shown. Similarly to BEGe detectors, coaxial detectors also show special pulse shape characteristics in case of surface events. Indeed, energy deposits near the groove or the bottom of the borehole cause faster pulses.

The signals induced on the p\(^+\) contact of the Gerda detectors are read out by charge sensitive amplifiers located in the LAr about 35 cm above the detector array. The signals are digitized at 25 MHz for \(160~\upmu \)s and at 100 MHz for 10 \(\upmu \)s. Both traces are centered at the rising edge of the charge pulse Q(t). The offline analysis of the digitized signals follows the procedures described in [15, 16]. The 25 MHz traces used for the energy reconstruction ensure the excellent energy resolution, while the 100 MHz traces are used in the PSD methods presented in the following sections. The energy estimator E is reconstructed with a zero-area cusp filter [17], whose parameters are optimized for each detector and calibration run.

3 Overview of event samples and discrimination methods

Weekly calibration runs with \(^{228}\)Th sources are performed to determine the energy scale and resolution of the detectors [18] and to calibrate and train the PSD techniques. Figure 4 shows a calibration spectrum highlighting the different event samples used in pulse shape analysis. The most prominent feature is the full energy peak (FEP) at 2615 keV from \(^{208}\)Tl decay. Its double escape peak (DEP) at 1593 keV is used as a sample of SSEs as the electron and positron from pair production deposit their energy in a small volume and both annihilation \(\gamma \) rays leave the detector. The FEP at 1621 keV from \(^{212}\)Bi is used as a sample of MSEs that is sufficiently near in energy to the DEP in order to avoid noise dependent biases. To test the performance of MSE rejection, the FEP and single escape peak (SEP) of \(^{208}\)Tl, mostly featuring MSEs are used while the Compton continuum region around \(Q_{\beta \beta }\pm 35\) keV (CC(\(Q_{\beta \beta }\))) serves to estimate the background rejection in the \(0\nu \beta \beta \) decay signal region.

In physics data, the standard neutrino accompanied double-\(\beta \) (\(2\nu \beta \beta \)) decay provides another sample of signal-like events that is equally distributed in the whole detector volume and used for the investigations in the following. After applying the LAr veto cut [3], about 97% of the events in the 1000–1300 keV region originate from \(2\nu \beta \beta \) decays. Beside MSEs from \(\gamma \) rays, the physics data have a significant amount of surface events from \(\alpha \) and \(\beta \) decays that can be discriminated thanks to their specific pulse shape. The physics spectrum at low energies is dominated by \(\beta \) decays of \(^{39}\)Ar up to its Q-value of 565 keV. However, these events are not used in the pulse shape analysis due to their relatively high noise. A prominent background source at \(Q_{\beta \beta }\) is the \(\beta \) decay of \(^{42}\)K, which is produced as a progeny of the long-lived \(^{42}\)Ar and has a Q-value of 3525 keV. Beta particles deposit their energy in germanium within a few mm resulting in events partly in the dead and transition layers of the n\(^+\) surface. Such n\(^+\) surface events can induce slow pulse shapes with incomplete charge collection. Apart from possible HPGe bulk contamination, that have been shown to be insignificant [19, 20], \(\alpha \) particles can only reach the active volume of the detector at the thin p\(^+\) contact or at the non-conducting groove producing pulse shapes with fast rise as shown in Figs. 2 and 3. A clean sample of \(\alpha \) surface events is found in physics data above the \(^{42}\)K Q-value. The most prominent structure at these energies is a broad peak at 5304 keV, the \(\alpha \) energy of the \(^{210}\)Po \(\alpha \) decay (\(^{238}\)U decay chain) [19, 21].

Due to their different geometries, BEGe and IC detectors are treated separately from coaxial detectors in the pulse shape analysis. In the case of the BEGe and IC detectors one parameter, A/E, is used to classify background events, where A is the maximum current amplitude as indicated in Fig. 2 and E is the energy. As MSEs and surface events at the n\(^+\) contact are characterized by longer, i.e. wider current pulses, they feature a lower A/E value compared to SSEs, while surface events at the p\(^+\) contact show a higher A/E value [22]. Therefore, rejecting events on both sides of the A/E distribution of SSEs enhances the signal to background ratio. The details of the A/E analysis are presented in Sect. 4. In the case of coaxial detectors an artificial neural network (ANN [23]) is used to discriminate SSEs from MSEs similar to the approach applied in Phase I [10]. To discard events close to the p\(^+\) contact, a dedicated cut on the risetime of the pulses is applied. The training and optimisation of the ANN and risetime cuts for coaxial detectors is described in Sect. 5. An additional cut is applied to all detectors to remove events with slow or incomplete charge collection. These events arise from energy depositions in a non-depleted volume (n\(^+\) layer or insulating groove). These events are identified through the difference between two energy estimates performed using the same digital filter but different shaping times as summarized in Sect. 6. The signal efficiency of \(0\nu \beta \beta \) decay is estimated using the survival fraction of DEP and \(2\nu \beta \beta \) decay events by taking into account the energy dependence of the different PSD techniques. The details of this extrapolation to \(Q_{\beta \beta }\) including systematic uncertainties are found in Sects. 7 and 1.

4 The A/E method for BEGe and IC detectors

The amplitude of the current pulse A is computed after applying 3 times a moving window average (MWA) filter with 50 ns length and interpolating the pulse down to 1 ns sampling time. This filtering procedure optimizes the high frequency noise attenuation while preserving the pulse shape information. The energy estimator E is determined by a pseudo Gaussian filter with a shaping time of 10 \(\upmu \)s. A is then divided by E before calibration, providing the raw A/E for each pulse. The raw A/E is then corrected for time stability and energy dependence before a cut value is defined.

For each calibration run, the A/E distribution of events in the 1000–1300 keV region is fitted with a Gaussian (SSEs) and a low-side tail (MSEs) as described in [10]. The position of the Gaussian \(\mu _{A/E}\) from each calibration of the four years of data taking is used to define stable time periods where the raw A/E changes by less than its \(\sigma _{A/E}\) resolution. Instabilities are mostly related to hardware changes and a few detectors show a small systematic drift of the raw A/E. Physics data between the stable periods are removed from the analysis causing a few percent exposure loss. After normalizing the raw A/E by the average \(\mu _{A/E}\) within a given time period, the data of all calibrations are merged, only separating before and after the upgrade of 2018.

The A/E of SSEs in the Compton continuum of the merged calibration data show a small linear energy dependence of a few percent per MeV, due to the larger charge cloud size at higher energies that broadens the current pulse. The energy dependence of \(\mu _{A/E}(E)\) and \(\sigma _{A/E}(E)\) is described by a linear and a \(\sqrt{b+c/E^2}\) type of function, respectively, as shown in Fig. 5 for a BEGe (GD61A) and an inverted coaxial (IC74A) detector as examples. In addition to the correction for the energy dependence, A/E is normalized to the mean of the A/E distribution of the DEP, which lies about 0.25% above the SSE band.

The cut values for each detector are determined on the energy-independent A/E classifier defined as \(\zeta =([A/E]/\mu _{A/E}(E)-1)/\sigma _{A/E}(E)\). Its distribution is centered around zero and has a standard deviation of one for SSEs. The low-side A/E cut against MSEs and n\(^+\) surface events is chosen to yield a DEP survival fraction of 90%. The resulting cut values range from \(\zeta =-1.9\) to \(-1.2\) and from \(\zeta =-1.9\) to \(-1.7\) for BEGe and IC detectors, respectively. The high-side A/E cut against p\(^+\) surface events is chosen at \(\zeta =3.0\) for each detector, in order to reject all \(\alpha \) events in physics data above 3525 keV. It has been shown that the high-side A/E cut discards events, including degraded \(\alpha \) events, from a small volume around the p\(^+\) contact [24] causing the survival fraction of events after the high-side A/E cut to be proportional to the detector mass.

Fig. 1
figure 1

Geometry and weighting potential of a typical BEGe, an inverted coaxial and a coaxial detector. The p\(^+\) and n\(^+\) contacts are indicated schematically

Fig. 2
figure 2

Examples of different type of normalized charge pulses (red) along with the derived current pulses (blue) from a BEGe detector: single-site event, multi-site event, event near the p\(^+\) contact and event near the n\(^+\) contact with incomplete charge collection

Fig. 3
figure 3

Simulated charge pulses in indicated parts (bulk, groove, bottom/side of borehole) of a typical coaxial detector. The electronic response of the readout chain and noise are not included here

Figure 6 shows for BEGe and IC detectors the energy distribution of calibration data before and after the A/E cuts described above as well the corresponding survival fractions. As expected the survival fraction of events in the FEPs and SEP is much lower than in the DEP. Events in the Compton continuum are discarded with about the same probability independent of their energy but depending on the overall detector size, and more generally speaking from the detector type. IC detectors discard a higher fraction of events because of the higher probability of multiple scattering of \(\gamma \) rays due to their larger size.

The survival fractions of events in the DEP, FEP and CC(\(Q_{\beta \beta }\)) are shown in Fig. 7 for each detector before and after the upgrade. The DEP survival fractions are slightly smaller than 90% due to the high A/E cut. The rejection of MSEs shows a small dependence on the detector position in the string because of different electronic noise conditions. This effect was reduced after the upgrade. The IC detectors (detector numbers above 35) reject MSEs more efficiently than BEGe detectors.

Fig. 4
figure 4

Calibration spectrum highlighting the different event samples used in the pulse shape analysis

Fig. 5
figure 5

Distribution of A/E as a function of energy from \(^{228}\)Th calibration data for the BEGe detector GD61A (top) and the IC detector IC74A (bottom). The dashed lines indicate the linear energy dependence and the 1-\(\sigma \) width of SSEs

Fig. 6
figure 6

Energy distribution of events from calibration data before and after the A/E cut (top) and their ratio (bottom) for BEGe and IC detectors

Fig. 7
figure 7

Survival fraction of events in the \(^{208}\)Tl DEP, \(^{212}\)Bi FEP and CC(\(Q_{\beta \beta }\)) for each detector (see Table 3 for detector numbers and types). Open (filled) symbols show the calibration dataset before (after) the upgrade. The dashed lines separate the detector strings in the array. The uncertainties are only statistical and smaller than the markers

In order to check the validity of the A/E corrections and cuts in the whole dataset, survival fractions of the usual event samples are studied for each calibration run. The average survival fraction from all BEGe detectors is shown for each calibration in Fig. 8 as a function of time. The A/E cut shows, for both BEGe and IC detector types, a stable behaviour at the 3% relative level during the whole data collection period when applied to Compton continuum events at \(Q_{\beta \beta }\). Residual instabilities stem on one side from changes in the detector gain or leakage current and on the other side from statistical fluctuations.

Fig. 8
figure 8

Average survival fractions of events in the \(^{208}\)Tl DEP, \(^{212}\)Bi FEP and CC(\(Q_{\beta \beta }\)) for BEGe detectors as a function of time. Each data point represents a calibration run with its statistical uncertainty

The raw A/E is corrected in the same way for physics data and the same cut values are applied as in calibration data. In order to check the validity of the corrections and the cut values, Fig. 9 shows a comparison of the A/E classifier between DEP events from calibration data and \(2\nu \beta \beta \) events from physics data for BEGe and IC detectors. By construction, the A/E classifier peaks at 0 and has a width of about 1 for these SSEs. The agreement between physics and calibration data is satisfactory and confirms the applied correction procedure. For each detector, the residual difference of the average A/E, between calibration and physics data, is included in the systematic uncertainties by shifting the cut value accordingly.

Fig. 9
figure 9

A/E classifier distributions after LAr veto comparing events in the DEP from calibration data and \(2\nu \beta \beta \) decay events from physics data from BEGe (left) and IC (right) detectors. The histograms are normalized to their integrals

The energy distributions of events before and after the A/E cut are shown in Fig. 10 for the whole physics data corresponding to 61.8 kg year exposure from BEGe (53.3 kg year) and IC (8.6 kg year) detectors. Events in the \(2\nu \beta \beta \) decay region survive the cut with a high probability, while the \(^{40}\)K and \(^{42}\)K peaks at 1461 keV and 1525 keV, respectively, mostly featuring MSEs, are reduced significantly. High energy events above 3525 keV coming from p\(^+\) surface events are all discarded by the high A/E cut by definition. In a 240 keV wide window around \(Q_{\beta \beta }\), only 7 events in BEGe detectors and 1 event in IC detectors survive the LAr veto and A/E cuts. This results in the corresponding unique background indicesFootnote 2 of Gerda in the region of interest of \(5.5_{-1.8}^{+2.4} \cdot 10^{-4}\) counts/(keV kg year) and \(4.9_{-3.4}^{+7.3} \cdot 10^{-4}\) counts/(keV kg year) for BEGe and IC detectors, respectively.

Fig. 10
figure 10

Energy distributions of all Gerda Phase II physics data from the BEGe and IC detectors before (grey) and after (blue) the A/E cut. The position of \(Q_{\beta \beta }\) is indicated. The prominent \(^{40}\)K and \(^{42}\)K lines are located at 1461 keV and 1525 keV, respectively

5 The ANN and risetime methods for coaxial detectors

The 100 MHz waveform traces are used to compute the artificial neural network input variables (IVs) and the risetime. They are first filtered with a MWA of 30 ns width three times. They are subsequently baseline subtracted and normalized between 0 and 1 by the amplitude of the 25 MHz pole-zero corrected traces. This amplitude is provided by a trapezoidal filter with a typical precision of 0.2% at 2 MeV.

The ANN IVs are a list of the 50 times at which the resulting waveform reaches [1%, 3%, \(\ldots \), 99%] of its amplitude (see Fig. 11). An interpolation of the 10 ns wide gaps between data points allows for a more precise estimation. These IVs are computed for all physics and calibration events found in Gerda. However, given the degraded signal-to-noise ratio at low energy, only events above 1000 keV are considered in this pulse shape analysis to avoid loosely constrained energy dependence correction. Calibration runs are used to optimize the discrimination of SSEs from MSEs. The network is built on two hidden layers with 50 and 51 neurons, using the TMVA-MLPBFGS algorithm [23].

Figure 12 shows as an example the classifier distributions from the ANN training of the ANG5 detector with the events from the indicated DEP and FEP peaks. The distributions from the Compton events under the peaks are statistically subtracted using the distributions of the events in the energy side-bands of the peaks. The lower and upper side-bands are defined by selecting events falling in the [\(-9 \sigma \),\(-4.5 \sigma \)] and [\(4.5 \sigma \),\(9 \sigma \)] energy regions where \(2.355 \cdot \sigma \) is the full width at half maximum used to quantify the energy resolution of the Ge detectors. The indicated ANN cut keeps 90% of the events in the DEP peak.

Due to the significant change in hardware, the data taken before and after the 2018 upgrade periods have been trained separately. Finer splittings of the data have yielded signal efficiencies and background rejection values which agreed on the one-percent level. As a result of the limited statistics, a minimal number of two datasets was preferred. Typically, about 10,000 and 15,000 events enter the signal (DEP) and background (\(^{212}\)Bi FEP) samples, respectively, for the ANN training of each detector. Similarly to the A/E method, the ANN cut for each detector and each training period is set on the \(^{208}\)Tl DEP classifier distribution such that 90% of the events survive.

The risetime is estimated after interpolating the waveform with a 1 ns time step. From studies on the rejection of \(\alpha \) particles [25], the [10–90%] amplitude signal risetime was selected (see Fig. 11). This parameter is used to reject \(\alpha \) events on the p\(^+\) contact that develop faster signals (see Fig. 3). The cut definition relies on the maximization of the following figure of merit:

$$\begin{aligned} f(x) = \varepsilon ^2_{2\nu \beta \beta }(x) \cdot (1-\varepsilon _{\alpha }(x)), \end{aligned}$$
(2)

where \(\varepsilon _{2\nu \beta \beta }(x)\) is the \(2\nu \beta \beta \) survival fraction at risetime cut x and \(\varepsilon _{\alpha }(x)\) is the corresponding survival fraction of \(\alpha \) events. Only physics data after ANN-MSE and LAr veto, to increase purity of samples, are used for this figure of merit that allows to reject most of the \(\alpha \) particles while preserving a high \(2\nu \beta \beta \) decay signal survival fraction. Figure 13 depicts an example of such a cut definition. On average, about 90% of the high energy \(\alpha \) particles are rejected.

Fig. 11
figure 11

Zoom on a typical normalized 100 MHz trace of a coaxial detector. The 50 ANN input variables (red circles) and rise time estimates (dashed green) are indicated

Fig. 12
figure 12

Trained ANN classifier values for events in the \(^{208}\)Tl DEP (SSEs) and \(^{212}\)Bi FEP (MSEs) from the ANG5 detector. The histograms are normalized to their integrals

Fig. 13
figure 13

Survival fractions after the rise time cut for \(2\nu \beta \beta \) decay and \(\alpha \) particles events from the Phase II data before the upgrade of the ANG4 detector. Also shown are the figure of merit and the chosen cut value

The calibration energy spectrum of all coaxial detectors before and after applying the PSD cuts is shown in Fig. 14. As targeted, the ANN cut removes preferentially the regions highly populated by MSEs (FEPs and SEP in particular) and preserves 90% of the \(^{208}\)Tl DEP. On the contrary, the risetime cut deployed to reject events with fast risetime is insensitive to these types of events and hence has a high survival fraction for both SSEs and MSEs.

Fig. 14
figure 14

Energy distributions of calibration data events from the coaxial detectors before and after the ANN and risetime cuts (top) and the corresponding survival fractions (bottom)

Fig. 15
figure 15

Survival fractions of events in the \(^{208}\)Tl DEP, \(^{212}\)Bi FEP and CC(\(Q_{\beta \beta }\)) for each coaxial detector after ANN and risetime cuts. Open (filled) symbols show the calibration dataset before (after) the 2018 upgrade. The uncertainties are only statistical and smaller than the markers

Various survival fractions for all coaxial detectors, after applying both ANN and risetime cuts for the two data taking periods, are depicted in Fig. 15. In addition, the stabilities of these cuts in three calibration energy regions are plotted in Figs. 16 and 17. Apart from a slight improvement of the ANN rejection of the Compton continuum events at \(Q_{\beta \beta }\) after the 2018 upgrade due to an improved signal cable management, an overall 3% level stability in PSD performance is observed over the course of Gerda Phase II. Compared to Gerda Phase I [10], on average a 7% relative worsening is observed, mostly attributed to the different electronics scheme, hence different noise. The risetime cut also shows a very stable behavior during this period.

Fig. 16
figure 16

Average survival fractions of events after the ANN cut in the \(^{208}\)Tl DEP, \(^{212}\)Bi FEP and CC(\(Q_{\beta \beta }\)) for coaxial detectors as a function of time. Each data point represents a calibration run with its statistical uncertainty

Fig. 17
figure 17

Average survival fractions of events after the risetime cut in the \(^{208}\)Tl DEP, \(^{212}\)Bi FEP and CC(\(Q_{\beta \beta }\)) for coaxial detectors as a function of time. Each data point represents a calibration run with its statistical uncertainty

The result of the pulse shape analysis applied on the full 41.8 kg\(\cdot \)yr physics data exposure, after the LAr veto cut, is summarized in Fig. 18. The ANN is preserving 80.5(3)% of the \(2\nu \beta \beta \) decay event sample while removing 62(1)% of the \(^{40}\)K and \(^{42}\)K lines. The reduction of the \(^{214}\)Bi FEPs at 1806 keV and 2204 keV is also visible. However, it is in general inefficient at rejecting fast and degraded \(\alpha \) events from about 3500 keV to 5500 keV, as 58.4(8)% of them survive. After applying also the risetime cut, 67.8(4)% of \(2\nu \beta \beta \) decay events remain. The \(\alpha \) background is suppressed by a factor of \(\sim \)23 above 3500 keV, only 93 events out of 2169 survive the PSD cuts. The complementarity between the two cuts yields for the coaxial detectors a dataset that remains background-free in the region of interest with a background index of \(5.0^{+2.6}_{-2.0}~\cdot ~10^{-4}\) counts/(keV kg year).

Fig. 18
figure 18

Top: ANN classifier of physics events from coaxial detectors as a function of energy. Middle: Risetime of events surviving the ANN cut as a function of energy. Bottom: Energy distribution of events from coaxial detectors before and after the ANN and risetime cuts

6 Events with incomplete charge collection

Events from the n\(^+\) layer or the groove featuring slow or incomplete charge collection (see Fig. 2) have an uncertainty on energy reconstruction because the ZAC filter [17] is optimized for the FEP resolution with a relatively short integration time. These particular events, especially for coaxial detectors, can survive the ANN and risetime cuts. In order to discard these events with uncertain ZAC energy reconstruction, an additional classification based on energies reconstructed with pseudo Gaussian filters with different integration times is performed. The energy is reconstructed with an integration time of 4 \(\mu \)s (\(E_{short}\)) and 20 \(\mu \)s (\(E_{long}\)). The ratio \(E_{short}/E_{long}\) is then normalized to its average observed in events from the 2615 keV line in calibration data. The classifier is defined as:

$$\begin{aligned} \delta E = \left( \frac{ E_{short}/E_{long}}{ \langle E_{short}/E_{long} \rangle _{FEP} } - 1 \right) \cdot E, \end{aligned}$$
(3)

where E is the default ZAC-reconstructed energy for each event. With this definition, the classifier \(\delta E\) has an average of 0 keV.

The \(E_{short}\)/\(E_{long}\) distribution normalization is performed time dependently in order to account for possible instabilities of the readout electronics: for each calibration run the mean of the fitted Gaussian to the FEP distribution is used as a normalization factor for the following physics events.

Figure 19 shows the resulting \(\delta E\) distributions for calibration events in the \(^{208}\)Tl FEP and in the CC(\(Q_{\beta \beta }\)) as well as for physics events in the \(2\nu \beta \beta \) region for the ANG2 detector as an example. Events in the Compton continuum have a higher fraction with large negative \(\delta E\) values. This is due to the higher fraction of pulses with incomplete charge collection that is not present in a peak where the whole energy has been collected. Physics data in the \(2\nu \beta \beta \) region shows a similar Gaussian distribution as calibration data with no significant energy dependence.

Fig. 19
figure 19

Distribution of the \(\delta E\) classifier for calibration events in the \(^{208}\)Tl FEP and in the Compton region around \(Q_{\beta \beta }\) as well as for physics events in the \(2\nu \beta \beta \) region for the coaxial ANG2 detector. The histograms are normalized to their integrals

A cut value is applied to the lower side of the \(\delta E\) distribution and it is set for each detector separately \(3\sigma \) away from zero.Footnote 3 The cut value is loose enough that more than 99% of signal events are kept whereas those with uncertain energy (significant difference between energies reconstructed by the ZAC and Gauss algorithms) are rejected.

For the BEGe and IC detectors, most of the \(2\nu \beta \beta \) decay events rejected by the \(\delta E\) cut are also rejected by the A/E cut. For BEGe detectors out of 3477 events cut by either method only 7 are rejected by the \(\delta E\) cut (for energies above 1000 keV 12 events in total are cut by \(\delta E\)). For the IC detectors none of the 521 \(2\nu \beta \beta \) decay events is removed by the \(\delta E\) cut alone.

For the coaxial detectors the correlation between ANN, risetime and \(\delta E\) cuts is weaker. Analyzing the \(2\nu \beta \beta \) decay region for the full Phase II dataset, one gets 4970 events (out of 15,433) removed by either method, 1 event cut by all three methods and 57 by \(\delta E\) only. 83 events from coaxial detectors with \(E > 1000\) keV are cut by \(\delta E\) only. Many of these events show slow pulses that could originate from the detector surface but are not cut by ANN or risetime cuts.

The survival fractions for different energy regions of physics and calibration data have been studied for each detector separately. Survival fractions of \(2\nu \beta \beta \) decay events for the different detector types are presented in Table 1 before and after the other PSD cuts. Values for DEP, FEP, CC(\(Q_{\beta \beta }\)) and \(2\nu \beta \beta \) decay events for the two datasets before and after the 2018 upgrade are very close to 100%. The Compton region as well as the \(2\nu \beta \beta \) region show lower acceptance because of the contribution of slow pulses.

For BEGe and IC detectors the impact of the \(\delta E\) cut is negligible, while for coaxial detectors a small correction of the efficiency has to be taken into account.

Table 1 Survival fractions of \(2\nu \beta \beta \) decay events for the \(\delta E\) cut without and in combination with other PSD methods (A/E or ANN+risetime). The uncertainties are statistical only

7 PSD detection efficiencies at Q\(_{\beta \beta }\)

In the absence of signal proxies at \(Q_{\beta \beta }\) in calibration and physics data and due to a not sufficiently accurate modeling of the pulse shape response in simulation, \(\varepsilon _\mathrm{{PSD}}\) is estimated for point contact and coaxial detectors from the extrapolation of the survival fractions of the \(^{208}\)Tl DEP and of the \(2\nu \beta \beta \) decay events at 1593 keV and 1150 keV to \(Q_{\beta \beta }\) (see circles and squares in Fig. 20).

Fig. 20
figure 20

Extrapolation of reference survival fractions (circles) to \(Q_{\beta \beta }\) (squares) using the energy dependence deduced from indicated two samples of down-scaled waveforms (see text). Examples are given for BEGe (top), coaxial (middle) and IC detectors (bottom) for the A/E, ANN and \(\delta E\) methods, respectively. As to the BEGe-A/E example on top: the reference survival fraction is corrected for Compton events below the \(^{208}\)Tl DEP while for the rescaled distribution (red) this correction is missing

For that purpose, sets of down-scaled waveforms are used to evaluate the energy dependence of the various PSD methods. This is based on the observation that the energy dependence of PSD classifiers is dominated by the electronic noise in Gerda. Two samples from calibration data have been considered, namely \(^{208}\)Tl DEP and CC (\(Q_{\beta \beta }\)) events (see Fig. 4). The waveform \(\omega '(k,E)\) at energy E is produced by down-scaling the waveform \(\omega (k,E_S)\), of true energy \(E_S\), from one of these two samples and superimposing it to a baseline sample b(k) obtained from random triggers:

$$\begin{aligned} \omega '(k,E) = \omega (k,E_S) \cdot \frac{E}{E_S} + b(k) \cdot \left( \sqrt{1-\left( \frac{E}{E_S}\right) ^2}\right) \end{aligned}$$
(4)

where k is the index running from 0 \(\mu \)s to 160 \(\mu \)s and from 0 to 10 \(\upmu \)s for the low and high frequency traces introduced in Sect. 2, respectively. By applying the complete set of digital processing steps and analysis methods described earlier, such a procedure allows to estimate the contribution of the energy dependence, induced by the signal-to-noise ratio variation, on the PSD methods results (for more details see [25]).

Figure 20 shows the survival fractions of the \(^{208}\)Tl DEP and CC(\(Q_{\beta \beta }\)) datasets as a function of the down-scaled energy for the A/E, ANN and \(\delta E\) methods.Footnote 4 Each plot depicts the energy dependence of a given detector type, as a result of a weighted average of the individual detector responses, to illustrate the general trends. The reference efficiency is extrapolated to \(Q_{\beta \beta }\) using the average energy dependence of \(^{208}\)Tl DEP and CC(\(Q_{\beta \beta }\)) datasets.Footnote 5 Table 2 shows how the different PSD procedures contribute to the resulting average detection efficiencies \(\varepsilon _\mathrm{{PSD}}\) of the various detector types for each data taking period. The efficiencies of the point contact detectors are almost 30% larger than those of the coaxial detectors.

Table 2 Average signal detection efficiencies at \(Q_{\beta \beta }\) of individual PSD methods for the different detector types and data taking periods. The corresponding total PSD detection efficiencies \(\varepsilon _{\tiny {\text {PSD}}}\) and their systematic uncertainties are shown in bold; they are estimated via a MC sampling of individual values (see Table 3) and are thus different from the product of the individual PSD methods efficiencies reported here

The analysis of the final Gerda results [26] is based on time-dependent and detector-wise datasets. Table 3 (Appendix A) reports the overall PSD efficiency for each detector separately. Since the central values of the individual detector efficiencies exhibit significant shifts, the average overall PSD efficiencies \(\varepsilon _\mathrm{{PSD}}\) of Table 2 have been obtained, in fact, from an exposure weighted Monte-Carlo sampling, and not from a simple average.

The systematic uncertainty of the extrapolation is estimated from the difference of the slopes of the two down-scaled waveform samples. It is on average about 0.5% and 1.3% for point contact and coaxial detectors, respectively, the latter detector type being more sensitive to the noise due to its larger p\(^{+}\) contact capacitance.

In addition, two other systematic effects have been taken into account: (1) the difference between the calibration and physics data and (2) the difference between signal proxies and \(0\nu \beta \beta \) decay events. The former applies only to point contact detectors as the \(^{208}\)Tl DEP from calibration data is used as signal proxy. Shifting the A/E cut, for each detector, by the A/E distribution bias observed between the \(^{208}\)Tl DEP and \(2\nu \beta \beta \) decay (Fig. 9) leads to an average relative systematic uncertainty of 1.9%. The latter makes use of pulse shape simulation (see Appendix B) to quantify the PSD performance bias between \(2\nu \beta \beta \) decay and \(^{208}\)Tl DEP events and those coming from \(0\nu \beta \beta \) decay. Indeed, the signal proxies feature much lower energies hence different Bremsstrahlung contribution. In addition, \(^{208}\)Tl DEP events have a higher probability to happen close to the detector surface while \(0\nu \beta \beta \) decays would occur homogeneously throughout the detector bulk. A 2.3% and 1.5% absolute uncertainty has been estimated for BEGe and IC detectors, respectively, while it amounts to 4% for coaxial detectors. This last estimate is larger due to the difficulty to match the ANN training performance in simulation with the data.

8 Summary

Nowadays, running a background-free \(0\nu \beta \beta \) decay experiment is essential to boost the \(T^{0\nu }_{1/2}\) sensitivity on a reasonable time scale. Over the past years, the Gerda collaboration demonstrated the feasibility of such a program by upgrading its initial setup with additional point contact detectors (BEGe and IC), a LAr veto instrumentation and lower mass holders. As a consequence, the sensitivity linearly increased with the exposure. The interplay between passive and active shielding techniques has proven to be highly effective. In this paper, we focused on the ability offered by germanium detectors to analyze the topological structure of the recorded events. This topology, distinct for signal-like and background-like events at a 100 ns time scale, is best scrutinized with the high frequency based data acquisition system of Gerda. Using the Phase II dataset, we confirmed the superior discriminating power of point contact detectors (BEGe and IC) over the historical coaxial detectors thanks to the simple A/E parameter. Their \(0\nu \beta \beta \) decay PSD efficiency is 89% and 69%, respectively for a similar background index of about \(5\cdot 10^{-4}\) counts/(keV kg year) after all cuts. We also demonstrated the high and stable performance in LAr of the newly produced enriched inverted-coaxial point contact germanium detectors. The LEGEND collaboration will deploy these IC detectors for its future \(^{76}\)Ge \(0\nu \beta \beta \) decay search program.

The total detection efficiencies for \(0\nu \beta \beta \) decay are discussed and listed for each detector in Appendix A.