Improved EDELWEISS-III sensitivity for low-mass WIMPs using a profile likelihood approach

We report on a dark matter search for a Weakly Interacting Massive Particle (WIMP) in the mass range mχ∈[4,30]GeV/c2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m_{\chi } \in [4, 30]\,\mathrm{GeV/}c^2$$\end{document} with the EDELWEISS-III experiment. A 2D profile likelihood analysis is performed on data from eight selected detectors with the lowest energy thresholds leading to a combined fiducial exposure of 496 kg-days. External backgrounds from γ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma $$\end{document}- and β\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\beta $$\end{document}-radiation, recoils from 206Pb\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$^{206}\mathrm{Pb}$$\end{document} and neutrons as well as detector intrinsic backgrounds were modelled from data outside the region of interest and constrained in the analysis. The basic data selection and most of the background models are the same as those used in a previously published analysis based on boosted decision trees (BDT) [1]. For the likelihood approach applied in the analysis presented here, a larger signal efficiency and a subtraction of the expected background lead to a higher sensitivity, especially for the lowest WIMP masses probed. No statistically significant signal was found and upper limits on the spin-independent WIMP-nucleon scattering cross section can be set with a hypothesis test based on the profile likelihood test statistics. The 90 % C.L. exclusion limit set for WIMPs with mχ=4GeV/c2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$m_\chi = 4\,\mathrm{GeV/}c^2$$\end{document} is 1.6×10-39cm2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1.6 \times 10^{-39}\,\mathrm{cm^2}$$\end{document}, which is an improvement of a factor of seven with respect to the BDT-based analysis. For WIMP masses above 15GeV/c2\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$15\,\mathrm{GeV/}c^2$$\end{document} the exclusion limits found with both analyses are in good agreement.


Introduction
Through different astrophysical observations on a wide range of cosmological scales, it is well established that ∼27 % of the energy density in the Universe is made up of an unknown dark matter [2]. A well-motivated class of particles proposed to solve the dark matter problem are weakly interacting massive particles (WIMPs) with masses of the order of GeV/c 2 to TeV/c 2 and an extremely low scattering cross section with ordinary matter. Direct detection experiments search for the elastic scattering of a WIMP from the galactic dark matter halo in detectors on Earth-based experiments. The nuclear recoils from such interactions would have an exponentially falling energy spectrum up to a few keV, depending on the mass m χ of WIMPs. In addition, the expected rate is smaller than one interaction per kg of target material per year. To minimize the background for this rare event search, the EDEL-WEISS experiment is located in the Modane underground laboratory (LSM) in the French-Italian Alps, where a rock overburden of 4800 m w.e. reduces the cosmic muon flux down to 5 muons/m 2 /day. Remaining muons are tagged with an active muon veto system surrounding the experiment [3], followed by 50 cm of polyethylene and 20 cm of lead to suppress neutrons and gammas. Inside these layers of shielding a cryostat made of ultra-pure copper houses germanium monocrystals which are cooled down to 18 mK. A simultaneous measurement of the heat and ionization energies produced in a recoil allows to discriminate between the dominant electron recoils (ER) from radioactivity and nuclear recoils (NR), which at low energies are only caused by neutrons and the expected WIMP signal.
Other direct detection dark matter experiments use similar approaches based on the same principle to discriminate between different backgrounds and a possible signal from WIMPs. Exclusion limits on the WIMP-nucleon spin-independent scattering cross section from LUX [4] and SuperCDMS [5] are in strong tension with favoured parameter regions based on observations by DAMA/LIBRA [6], CoGeNT [7] and CDMSII-Si [8].
Almost all existing signal claims for low-mass WIMPs can be excluded at 90 % C.L. with the improved limits that were recently published by the EDELWEISS-III collaboration [1] considering standard assumptions about the WIMPnucleus interaction and the galactic halo model. Data from a 10-month WIMP search run were analysed in terms of low-mass WIMPs with masses m χ ∈ [4,30] GeV/c 2 using a method based on Boosted Decision Trees (BDT). No statistically significant excess of events was observed for eight selected detectors, resulting in exclusion limits up to a factor 40 stronger at m χ = 7 GeV/c 2 , compared to results from previous EDELWEISS-II [9] low-energy data. Such a cutbased analysis performs well when the separation of signal and background is sufficient, as is the case for higher WIMP masses. However, at low energy, the finite resolutions of the detectors cause the electron and nuclear recoils to have overlapping populations in the distributions of the variables that serve as discriminator. A separaration thus requires a cut at lower energy, resulting in a severely reduced efficiency. To overcome this problem, the analysis presented here uses an alternative approach which is based on the maximum likelihood, similar to e.g. [4,10]. It is an unblind analysis performed on a similar data sample that was recorded with the same detectors as in [1]. With its completely different analysis approach it improves the sensitivity for low-mass WIMPs and allows to cross-check the results of the BDTbased analysis. Instead of extracting limits without background subtraction from a smaller signal region with optimized signal-to-background ratio, the maximum likelihood method is used to model and fit the data in the entire region of interest (RoI). Thus, the remaining WIMP signal after detector efficiency corrections is not further reduced, while expected backgrounds are fitted and can be subtracted. The systematic uncertainties of the background predictions are taken into account by constraints in the likelihood fit and the calculation of exclusion limits.
The operating principle of the EDELWEISS-III detectors and the selection criteria for the analysed data are detailed in Sect. 2, while a description of the different background components is presented in Sect. 3. The formalism of the likelihood model for the analysis is explained in Sect. 4, both for fitting the data to individual detectors, as well as for a combined fit of a common signal to all detectors. We also detail how the exclusion limit is set using a hypothesis test based on the profile likelihood test statistics. A discussion of the fit results and a comparison with the result achieved with the BDT method follows in Sect. 5.

EDELWEISS-III detectors and selection of data
The detectors used in EDELWEISS-III are of the full interdigit FID800 type [11]. These are high-purity Germanium bolometers in a cylindrical shape of 7 cm diameter and 4 cm height with masses ranging from 820 to 890 g due to small variations both in diameter and height. Aluminium electrodes cover all sides of the detector in concentric rings to collect the charge carriers produced in a particle recoil. Glued on the top and bottom surfaces each is a neutron transmutation doped (NTD) Ge-sensor, which measures the micro-Kelvin temperature increase due to the energy deposit. The measurement of both heat and ionization signals on an event-by-event basis allows to discriminate rare NR events from the dominating ER events. The latter are mostly due to radioactive background outside the detector which produces γ -radiation. Their ionization yield Q, i.e. the ratio of ionization over total recoil energy, is defined as Q ER = 1 by the energy calibration with a 133 Ba γ -source. The ionization yield Q NR for nuclear recoils from neutrons and WIMPs is quenched and thus ∼3 times smaller. It depends on the recoil energy E r and can be parameterized as Q NR (E r ) = 0.16·(E r /keV) 0.18 for EDEL-WEISS Ge-detectors, which is consistent with the Lindhard theory [12]. The separation of particle types due to their different Q-value is only applicable if the produced charges are properly collected. For recoils close to the detector surface, where charge trapping is important, this is not guaranteed. Therefore, FID detectors are designed to discriminate between surface events and events originating in the bulk of the detector. Interleaved electrode rings on the detector surface are wired together and the resulting groups on the top half are biased with +4 V, −1.5 V and −4 V, +1.5 V on the bottom half. The electric field configuration divides the detector into a fiducial volume from which charges are drifted to the so-called fiducial electrodes (at ±4 V) and a near-surface volume, for which a signal is also seen on the veto electrodes (at ∓1.5V). This fiducialization is used to significantly suppress backgrounds from surface interactions and select only events with full charge collection efficiency. In this analysis, we consider the heat signal as the resolution weighted average heat energy of the two NTDs and the ionization signal as the averaged fiducial ionization energy of events coming from the fiducial electrodes. These two observables, named E heat and E ion in the following, are both in units of keV ee , as they have been calibrated to fiducial electron recoils from a 133 Ba γ -source. A signal on one of the two veto electrodes, i.e. E veto , is only used to reject surface events.
The data analysed in this work was taken between July 2014 and April 2015, when 24 FID800 detectors with full read-out were installed in the cryostat. WIMP search data was recorded for a total of 161 live days, together with an additional 25 days of calibration data from 133 Ba γ -and AmBe neutron-sources. As we are searching for WIMP signals at very low energies, a proper understanding of the trigger for such events is crucial. Out of the 24 installed detectors, eight were selected [1] because of their low trigger thresholds and good noise conditions. Only hourly periods satisfying requirements on the FWHM baseline resolution of the heat and ionization channels (FWHM heat < 1 keV ee and FWHM ion < 0.7 keV ee ) are considered, leading to a livetime ranging from 95 to 133 days per detector. More details on the trigger condition and the applied quality cuts, as well as the corresponding efficiencies, are given in [1]. The RoI for WIMP search is chosen such that the recoil spectrum for WIMPs with masses up to m χ ≈ 30 GeV/c 2 is included with good efficiency of up to 60 %. We define the RoI by requiring the ionization energy to be 0 ≤ E ion ≤ 15 keV ee and the heat energy of events to be below 15 keV ee . The analysis threshold in heat energy E min heat varies from detector to detector and depends on the efficiency of the online trigger. The EDEL-WEISS DAQ-system triggers events on each of the two heat channels, with a trigger threshold adapted on the scale of a few minutes to the instantaneous noise conditions. To ensure a high signal efficiency while minimizing systematic effects we define the analysis threshold in heat as the corresponding energy, for which the livetime-averaged trigger efficiency for each individual detector is above 80 %. This leads to values ranging from E min heat = 0.9 keV ee for the best detector up to E min heat = 1.5 keV ee for the one with lowest efficiency. The last cut applied on the data is the selection of fiducial events, which is of particular importance for this analysis. The two observables considered in the likelihood analysis are heat energy (black markers). Events before the fiducial cut and in the extended energy range are shown as gray points. Coloured lines indicate the detector-averaged positions that are expected for different background components depending on their ionization yields and collection voltage biases (see text). From top to bottom electron recoils from tritium decay as well as Compton and cosmogenic gammas in the fiducial volume (blue), surface gammas (dashed blue), nuclear recoils from neutron scattering (magenta), surface betas (dashed green) and 206 Pbrecoils (dashed brown). Heat-only events have only noise on the ionization channels and no ionization signal on average (red). The coloured contour indicates an m χ = 10 GeV/c 2 WIMP signal the heat and ionization energies. To select unambiguous fiducial events only, we require the signal on each of the two veto electrodes to be within ±1.64 σ veto of the energy-dependent Gaussian noise, where σ veto increases with increasing fiducial ionization energy E ion as described in [13]. The cut with a total acceptance of 81 % was chosen after initial tests on a subsample of data and combines a strong discrimination of surface events with an acceptable signal efficiency loss compared to the BDT analysis [1]. The fiducial efficiency was determined for each detector from the homogeneously distributed decay of cosmogenically activated isotopes in the crystal: the number of K-shell electron capture (EC) events giving a peak triplet at 10 keV was fitted with and without fiducial cut. The resulting effective 1 fiducial masses found with this method vary between 508 and 562 g for the eight detectors. The data after all cuts on data quality and noise periods, before and after the application of the fiducial cut is shown in Fig. 1. The total fiducial exposure is 496 kg-days. Different event populations can be observed and, at higher energies, are distinguishable via their ionization yield Q. Surface events from γ -and β-radiation as well as 206 Pb-recoils can be efficiently rejected with the fiducial cut. Remaining events between the electron recoils and heat-only populations are natural WIMP candidates. A likelihood analysis can assess the probability of these events to be of signal or background origin. As input to the likelihood model a probability density function (PDF) describing each different component is required.

Modelling of the signal and background components
Thanks to the effective shielding scheme of the EDELWEISS experiment, the residual background mainly originates from radioactive materials inside the cryostat such as connectors, holding structure and detector copper casings as well as from decays of cosmogenically activated isotopes within the detectors [14,15]. Each background component is modelled with a data-driven approach: unblinded data from outside the region of interest (sideband data), acquired in the same WIMP run, are fitted and extrapolated to the low-energy RoI considered in the analysis. In order to construct a likelihood model describing the data for each of the eight detectors, a PDF is calculated for each different background component i. This PDF P i describes a recoil spectrum ρ i (E r ) in the two observables heat and ionization energy. It takes into account the ionization yield Q i for each background, the efficiency of the trigger on the heat channel ε(E heat ) and the efficiency of the fiducial cut ε fid (E r ) as well as a Gaussian smearing due to the degraded, energy-dependent resolutions σ heat and σ ion of a given detector. In the energy range covered by this analysis, the intrinsic widths of the Q i -distributions of the different populations are small compared to the effect of σ heat and σ ion , and are neglected. Before normalization, the PDF can be written as: where the function f i (E r ) allows to calculate the observed heat signal of a given recoil energy. It includes the additional heating via the Neganov-Luke effect [16,17], produced by the scattering of charges which are collected by electrodes with a differential voltage U (in volts): The selected detectors have an electric potential of U ref = 8 V between the fiducial electrodes and bulk ERevents were used to calibrate the energy scale of all heat and fiducial ionization channels. For charges created in the nearsurface volume, the Neganov-Luke contribution to the heat energy is smaller, due to the reduced potential of only 5.5 V between fiducial and veto electrodes. It reduces the measured heat energy for surface events, as can be seen in Fig. 1 for the group of 10 keV cosmogenic peaks at the surface which are observed at E heat ≈ 7.7 keV ee . The measured average value of Q i for those events is 0.9. For surface backgrounds from β's and 206 Pb-recoils as well as so-called heat-only events, the spectrum in heat energy was directly extracted from sideband data. For those components the smearing in heat energy is already included and the PDF before normalization can be directly expressed as: where the average measured value of Q i for surface β's and 206 Pb-recoils are 0.4 and 0.1, respectively. The suppression of surface events via the fiducial cut decreases at low energies due to the finite resolution of the ionization channels: the veto energy of a surface event can be smaller than the noise on the veto electrode, thus the event will not be rejected. To a small extent, our data selection is therefore polluted by surface events with heat energies just above the analysis threshold.
To build the PDF for these events we take into consideration the efficiency ε fid (E r ) as a function of recoil energy. For surface events the survival probability after the fiducial cut is highly reduced, as is shown in Fig. 2 for different background components. It is calculated for each of the detector sides considering the baseline resolution σ veto and the measured energy E veto of the corresponding veto electrode: For events originating in the bulk of the crystal, no signal is measured on the veto electrodes and only noise is reconstructed. The efficiency of the fiducial cut is ε fid bulk = 81 % as described in Sect. 2. The fraction of surface nuclear recoils leaking into the acceptance below 5 keV (Fig. 2), and increasing further the WIMP efficiency, is neglected. With the definition of the PDF mentioned above, the WIMP signal and the following background components can be fully described.

WIMP signal
A signal PDF is constructed for each WIMP mass m χ independently, using Eq. 1. The parametrization for the ionization yield Q NR for nuclear recoils has been validated to a recoil energy (keV)  Fig. 2 Survival probability for different surface background components on the top side of detector FID824 as a function of initial recoil energy: electron recoils (blue) from Compton and cosmogenic γ 's as well as β's from the tritium beta decay, nuclear recoils from neutrons or WIMPs (violet), β's (green) and 206 Pb-recoils (brown) from external radioactivity. The dashed part of the efficiency curves is below the analysis threshold E min heat = 0.9 keV ee in heat energy (see Eq. 2). Only surface events from β's and 206 Pb-recoils are considered in the analysis as they have a significant contribution after all cuts. For bulk events, the survival probability (or efficiency for the WIMP signal) is considered as approximately constant (black line), neglecting the leakage of surface WIMPs into the acceptance precision of 5 % using neutron calibration data taken during the same run. In the description of the signal PDF, Q NR is a nuisance parameter and constrained with its systematic uncertainty. The recoil spectrum for the scattering of WIMPs on natural germanium with an average of A = 72.6 nucleons is calculated following [18]. For all astrophysical parameters we use values corresponding to the Standard Halo Model (SHM), i.e. ρ local DM = 0.3 GeV/c 2 /cm 3 , v 0 = 220 km/s, v earth = 230 km/s and v esc = 544 km/s. With the cuts described in Sect. 2 a potential WIMP signal is reduced to ∼60 % for m χ = 30 GeV/c 2 . Detector FID824 has the highest sensitivity for a WIMP signal due to its good baseline of the heat channel and the resulting low heat threshold E min heat = 0.9 keV ee . For this detector, the signal fraction after cuts decreases to 2 · 10 −4 for a m χ = 4 GeV/c 2 signal but is above 1 % for masses m χ > 5 GeV/c 2 .

Heat-only events
The dominant background in the EDELWEISS-III lowenergy data are heat-only events. They are present in all detectors with different intensity and constitute between 85 and 95 % of the events in the RoI after all cuts. For those events the data acquisition was triggered by a clear signal on one or both NTD heat sensors, while only noise can be seen on each of the four ionization channels, and the signals of the two NTDs are compatible. The heat energy spectrum of those events shows an exponential decrease (e.g. Fig. 3) heat energy (keVee)  Fig. 3 Heat energy spectrum of events with negative ionization energy ("heat-only") for detector FID824 after quality cuts. Below E heat ≈ 1 keV ee the data is dominated by random triggers on noise fluctuations, the efficiency of the heat trigger is given for reference (gray). The spectrum is modelled with a Kernel density estimation function (orange). Of the 63,400 events in the sideband, 5386 are above the 80 % efficiency analysis threshold of E min heat = 0.9 keV ee for this detector (dashed line). Inset: ionization energy spectrum of events above the analysis threshold fitted with a gaussian function of fixed width. The fitted mean of 0.021 ± 0.005 keV ee results in a systematic error of 14.9 % for the expected number of events in the RoI for all detectors and overlaps with randomly triggered noise fluctuations near the heat threshold of a detector. The variation of the heat-only event rate shows a common behaviour for all detectors: a simultaneous burst of the rate which coincides with a period of unstable operating conditions due to the cryogenic system followed by an exponential decay with a time constant of around 20 days which is not compatible with any of the known radioactive isotopes in the setup. A particle origin, e.g. from 206 Pb-recoils absorbed in one of the electrodes and producing no ionization signal, can be excluded due to the high rate and temporal behaviour. Internal radiation within the NTD heat sensors is rejected by a cut requiring a coincident signal in both NTDs described in [1]. The source of heat-only events is yet unknown, but possible explanations are the creation of phonons from friction of the detector with the holders, or stress near the NTD gluing spot. Several strategies are pursued to identify the origin of those events and to significantly reduce them in future runs. We use the sideband with negative ionization energy to model heat-only events in the RoI. In the absence of a theory to describe the shape of the heat-only energy spectrum, we use a Kernel density estimation (KDE) function of the data in this sideband to model this background. The ionization energy spectrum has a gaussian shape with a width given by the average baseline noise for the ionization channels. Fitting the distribution of sideband data in E ion with a gaussian indicates a small possible shift of the mean with respect to E ion = 0 keV ee . That shift is only statistically significant for some of the detectors and is related to a small fraction of <1 % uncorrected cross-talk between heat and fiducial ion-ization channels. The effect of a possible shift on the number of expected events for this background is taken as a systematic uncertainty and ranges between 0.4 and 14.9 % and is considered in the constraint for this background. For the most sensitive detector, FID824, Fig. 3 shows heat and ionization energy spectra of the sideband data with the respective models. In principle, the heat-only sideband can be contaminated by underfluctuations of the ionization energy from lowenergy event populations with small ionization yield, such as 206 Pb-recoils and β-particles. Considering the low number of expected events for these components (O(10) events above the analysis threshold per detector) compared to the high rate of heat-only events, the effect on the extracted spectrum is negligible. It was also checked that the number of events for a possible WIMP signal of mass m χ in the heat-only sideband is negligible for the cross section excluded in the following. The heat-only sideband data (E ion < 0) and modelled PDF in the RoI (E ion > 0) are shown in Fig. 4, together with WIMP signals for two different masses.

Electron recoils from Gammas and Betas
The energy spectrum of electron recoils in the fiducial volume up to 15 keV ee consists of a set of peaks on top of a continuous component. This component is due to the Compton scattering of gamma rays from external radioactive sources and to betas from the decay of 3 H inside the detectors [19]. The observed peaks are produced by mono-energetic gammas from electron capture reactions within the crystal and result from the activation of different isotopes due to cosmic rays or neutron calibration. The intensity of these peaks is different for each detector and depends on its age and exposure to cosmic rays before installation underground. In the energy range between 5 keV ee and 7.7 keV ee X-rays from the K-shell EC of the isotopes 49 V (E = 4.97 keV), 51 Cr (5.46 keV), 54 Mn (5.99 keV), 55 Fe (6.54 keV), 56,57,58 Co (7.11 keV) and 56 Ni (7.71 keV) are included in the fit as potential peaks. Around 10 keV ee a triplet of 65 Zn (8.98 keV), 68 Ga (9.66 keV) and 68 Ge (10.37 keV) can be resolved, which has corresponding L-shell peaks at 1.10, 1.19 and 1.30 keV (Fig. 5, light blue). While the K-shell peaks are well separated from a WIMP signal in the analysis parameter space, the 3 L-shell peaks can have significant overlap with a signal for the lowest WIMP masses probed. Depending on the analysis threshold E min heat of each detector, the fraction of those peaks in the RoI can vary significantly, from almost full coverage to only a tail of the gaussian peak. With the known L/K-shell ratio of 11 % [20] and the calculated peak fraction above threshold we extrapolate the rate of L-shell X-ray events. For this we perform a sideband fit of fiducial events in the electron recoil band 3 keV ee < E heat , E ion < 30 keV ee with a separate likelihood model including all K-shell peaks, Compton gammas and tritium β-events. We find the extrapolated rate of tritium decay for each detector to be in agreement within uncertainties with the rates found in [19]. Systematic uncertainties for all ERcomponents in the RoI are propagated from the errors of this sideband fit and are typically O(30 %).

Unrejected surface events
At higher energies, the fiducial cut allows the rejection of all surface events, as they would induce a clear signal E veto on one of the two veto electrodes. For low ionization energies however, the rejection can fail. If the ionization energy of a surface event is low enough, so that E veto < 1.64 σ veto , the event passes the cut. For particle types with low ionization yield Q i , the produced ionization energy is smaller, and therefore less charge is collected on the veto electrodes to reject surface events. The surface events in this analysis are mostly 206 Pb-recoils and β-particles originating from the 238 U decay chain of surrounding materials such as 222 Rn daughter isotopes [15]. Those particles have a small penetration depth or even scatter on the crystal surface. Another possible component would originate from the electron recoils described in Sect. 3.3, which are also produced in the near-surface volume. However, due to their high ionization yield of Q ER ≈ 1, the rejection of these surface events above the heat threshold E min heat is very efficient: the expected number of events in the RoI after the applied fiducial cut was calculated to be well below 10 −2 for all detectors and these events are therefore not considered in the analysis. For both β's and 206 Pb-recoils, the spectrum in heat energy is extracted from a clear selection of surface events with energies E veto > 5 σ veto and then extrapolated to the lower heat threshold within the RoI. The ionization yield of the events is fitted from the same sideband data. We do not include any uncertainty on the fitted

Nuclear recoils from neutrons
Neutron background can mimic a WIMP signal, as neutrons can produce single scatter nuclear recoils with the same ionization yield Q NR as WIMPs, according to an exponential energy spectrum. We distinguish between two different sources of neutrons in our detectors: muon-induced and radiogenic neutrons. Simulations showed that in the energy range of this analysis, the number of single scattering neutrons induced by muons is compatible with zero after vetoing [21]. For radiogenic neutrons coming from radioactivity due to (α, n) reactions and spontaneous fissions within the cryostat, Monte Carlo simulations have been performed with all known sources to derive their energy spectrum down to the lowest energies. The spectral shape of the radiogenic neutron background shows little dependence on the exact location of individual sources and can be fitted and parametrized by a double exponential law in the energy range of 2-20 keV nr , calibrated for nuclear recoil interactions. The normalization of the spectrum is derived from data taken with 17 detectors during the same EDELWEISS-III physics run. In the energy range of 10-100 keV nr , nine multiple scattering events are found in the 90 % C.L. nuclear recoil band for a fiducial exposure of 1309 kg-days. This number cannot be reproduced with the simulation of all known sources and hints at an additional neutron source in the experiment. The Monte Carlo simulation however is able to reproduce the measured single-overmultiple-ratio within uncertainties. We derive the normalized neutron spectrum for each detector by weighting it with corresponding exposure in the present data set, as well as the single-over-multiple-ratio of 0.45 from simulations. After all cuts and efficiency corrections, the expected background from single scatter neutrons in the RoI is similar for all detectors and has an average value of μ exp neutron = 0.20±0.07 events (Table 1). Expected rates for individual detectors have a combined uncertainty of 45 % coming from the single-overmultiple ratio uncertainty and the statistical error from the measurement of multiples.
With respect to the BDT based analysis [1], most of the background components listed above are identical. Deviations are mainly related to the different fiducial cut and the resulting survival probability of background components. The preselection applied before the BDT analysis accepts more surface beta and gamma events than the present stricter fiducial cuts, leaving the BDT a larger population of these events to optimize its multi-parametric selection. The present fiducial cut effectively removes most of them. For the same reason we do not include so-called triple events with a signal on both fiducial and one veto electrode. Lastly we intentionally differentiate between bulk events from Compton γ 's and tritium β's as two separate components in the likelihood analysis although their energy spectra are approximately degenerated in the RoI. An overview of the expected event rates summarized for different types of backgrounds is given in Table 1. The total background for both detector FID824 and all detectors combined is within 1-2 % agreement with the observed number of events.

Profile likelihood analysis
With the model of the different background components i as PDFs P i (E heat , E ion ), we can now define for each detector a total PDF P tot as the sum of all backgrounds and the signal for a given WIMP mass m χ : where the combined fitted rate of all components is ν = μ χ + i μ i , while the number of observed events is N . The rate of WIMP events μ χ (σ ) is a function of the parameter of interest, the WIMP-nucleon cross section σ , and is proportional to the integrated signal in the RoI. All other fit parameters are the event rates µ for the different background components i. The shape of all PDFs is fixed and given by the detector resolutions of heat and ionization energies, as well as the ionization yield and energy spectra for each component. Each of the background event rates μ i is a nuisance parameter and constrained in the fit with a Gaussian constraint term. The expected rate μ exp i for each background is calculated by integrating its unnormalized PDF, while the width σ i of the constraint term is given by the combined statistical and systematic error on this value and varies between all backgrounds and detectors. With this information, we can for each detector construct the extended likelihood function in heat and ionization energies: From these likelihood functions with detector specific PDFs and constraint terms, we can construct a joint likelihood function describing the data for all eight detectors combined: Here, each detector has its own, independent background and signal PDFs, as well as nuisance parameters and constraint terms. The only common fit parameter shared by all likelihood terms is the WIMP-nucleon scattering cross section σ .
If the fit for a given WIMP mass results in a maximum likelihood estimator (MLE) for the cross sectionσ compatible with zero within errors, we can set an exclusion limit. For this we follow [22] and perform a hypothesis test based on the profile likelihood ratio λ(σ ): whereμ are the MLE of the nuisance parameters when maximizing the likelihood for a fixed value of the cross section σ . The test statistics q σ used to reduce the data to a single value is defined as: The probability distribution functions f (q σ |H σ ) and f (q σ | H 0 ) under the signal hypothesis H σ and the backgroundonly hypothesis H 0 are used to find the cross section σ for which H σ can be excluded at 90 % C.L. while correcting for downward fluctuations of the background following the prescription in [23]. The parametrisation of the probability distribution functions f (q σ |H σ ) and f (q σ |H 0 ) with an approximation as described in [22] was found to yield limits with a C.L. less than 90 % for some of the probed WIMP masses. We use Monte Carlo generated datasets to derive all upper limits. Calculations of limits are performed with the RooStatspackage [24], based on the RooFit [25] framework with which all PDFs and likelihood functions are constructed.

Results
The likelihood fit of the selected data after all cuts is a twostep process: first, an unbinned fit to the data of the eight detectors is performed independently, using the constrained likelihood functions L j described in Eq. 6. It allows to find the best fit values for all of the nuisance parameters describing the different backgrounds for each detector. With these values as starting point, a combined fit over all detectors is performed, in which the individual signal PDFs P χ (m χ ) share a common WIMP-nucleon scattering cross section σ . The result of the individual fit of a WIMP signal with m χ = 4 GeV to data of detector FID824 is shown in Fig. 5. Fitted signal rates for three different WIMP signal masses are given in Table 2. For FID824, which is the most sensitive detector due to its low energy threshold E min heat = 0.9 keV ee , no signal is fitted for any of the probed WIMP masses m χ ∈ [4,30] GeV. The same is valid for detector FID839 for all WIMP masses. For several other detectors (FID827, FID838, FID841 and FID842) a strong degeneracy between the WIMP signal and heat-only events is observed. This degeneracy is only present up to WIMP masses of around 10 GeV/c 2 . Therefore, a sig- nal is fitted for these detectors while the rate of heat-only events is underestimated by the same magnitude. However, the resulting signals are always associated with large uncertainties, and they are compatible with zero within 2 σ . Three detectors (FID825, FID837 and FID838) have events above E heat = 2 keV ee which are close to the ionization yield Q NR expected for nuclear recoils, as shown in Fig. 1. For these events a degeneracy between WIMP signal and neutron component is observed, depending on the WIMP mass m χ . For masses above m χ ≈ 10 GeV/c 2 those events are better described by the signal component, as the expected rate of neutrons is constrained to much lower average values. The resulting excess is between 1 and 3 signal events which are fitted, depending on the detector and in all cases compatible with no signal. The best fit rates for all other backgrounds are in good agreement with the values expected from their constraint terms, independent of the probed WIMP mass. In the combined fit over all detectors with a common signal cross section σ , the degeneracies for individual detectors are alleviated and no signal is fitted up to a mass of m χ = 6 GeV/c 2 . For masses above, the aforementioned nuclear recoil candidate events lead to a positive but not significant signal, which peaks at m χ = 7 GeV/c 2 and then decreases again. Due to the similar exposure and the small influence of the different heat threshold E min heat for larger mass WIMP signals, the fitted cross section leads to a fairly similar signal rate which is fitted for all eight detectors. Again, all of the fitted signals in the combined fit are compatible with zero.
In the absence of a statistically significant signal for any of the probed WIMP masses, we set 90 % C.L. upper limits for the WIMP-nucleon cross section σ using the profile likelihood based test statistics described in Eq. 9. The resulting exclusion limit is shown in Fig. 6. For masses Fig. 6 Calculated 90 % C.L. exclusion limit on the spin-independent WIMP-nucleon scattering cross section σ as a function of WIMP mass m χ for the combined fit over all detectors (solid red). The green and yellow band represent the 1 and 2σ confidence band of the expected median sensitivity (dashed black). Shown for comparison is the result of the BDT based analysis [1] (dashed red). Contours show possible signals from CDMS-II (Si) [8] (blue), DAMA [26] (brown), CRESST-II [27] (pink) and CoGeNT [7] (orange). Other existing exclusion limits are from EDELWEISS-II [9] (small red dashes), CoGeNT [28] (orange), CRESST [29] (pink), SuperCDMS [5] (purple), XENON100 [30] (black), CDMSlite [31] (dashed violet) and LUX [4] (green) below m χ = 6 GeV/c 2 the observed limit is better than the expected median sensitivity due to an underfluctuation of background in the most sensitive detectors. For masses above m χ = 10 GeV/c 2 the limit is 2-3 sigma above the expected sensitivity, due to the presence of NR candidate events in multiple detectors which are in excess with respect to the expected neutron background. This excess is in good agreement with the observations quoted in [1], as both data sets contain these events. However, the 90 % C.L. limit of the analysis here presented is a factor of seven stronger for m χ = 4 GeV/c 2 due to the absence of cuts other than the fiducial selection and the resulting higher signal efficiency as well as a subtraction of the expected backgrounds. For higher masses above ∼15 GeV/c 2 limits from BDT and likelihood approaches are in very good agreement. For these masses the nuclear recoil spectrum of WIMPs extends to high energies, where it can be well discriminated from the dominating heat-only background and the electron recoil component. In this case, the BDT analysis can easily separate a signal with high efficiency and there is no gain in performing a likelihood analysis. The ∼15 % lower exposure due to the stricter fiducial cut with a total acceptance of 81 % is compensated by a higher signal efficiency, and the resulting limits are very similar. We also find a good agreement between the ratios of observed and expected exclusion limits for the two analyses, which we consider to be an inherent property of the data and a confirmation of the validity of this approach.

Conclusion
We have presented a search for low-mass WIMPs with the EDELWEISS-III experiment, using eight selected detectors and data taken with a total fiducial exposure of 496 kg-days after all cuts. A data-driven approach was used to model relevant backgrounds from sideband data. For each detector a likelihood function describing the data in heat and fiducial ionization energies was constructed, with constraint terms for each of the nuisance parameters taking into account systematic uncertainties. No statistically significant signal was found, neither for the fit of data from single detectors, nor for a combined fit over all detectors with a common signal cross section. Exclusion limits were set with a hypothesis test using a profile likelihood based test statistics, including corrections for under-fluctuations of the background. At 90 % C.L. limit we exclude spin-independent WIMP-nucleon scattering cross sections of σ = 1.6 × 10 −39 cm 2 (6.9 × 10 −44 cm 2 ) for a WIMP mass of m χ = 4 GeV (m χ = 30 GeV). Thanks to the higher signal efficiency and a subtraction of the expected backgrounds, the likelihood analysis shows an improvement of a factor of ∼7 for 4 GeV/c 2 WIMPs compared to a BDT based analysis while reproducing the limit at 15 GeV/c 2 and above. The results and achieved sensitivity underline the power of a maximum likelihood analysis based on detailed background models.