1 Introduction

In 1950, Robert W. Pringle observed that the light output of NaI(Tl) scintillation detector is not proportional to the incident photon energy [1]. This discovery sparked numerous studies on nonproportionality (nPR) between light output and the incident electron\(/\gamma \) energy using various scintillation detectors [2,3,4,5,6,7,8,9].

The nPR phenomenon has been observed in various scintillators, including NaI(Tl), but the detailed values were published differently in each experiment. To understand the physics behind the phenomenon and the discrepancies observed, numerous remarkable studies have been published over the decades. These studies include the improvement of measurement techniques [10,11,12,13] as well as theoretical interpretations [14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31], with a subset focusing on simulations.

In recent years, the dark matter (DM) community has shown increasing interest in the nPR of NaI(Tl), particularly in the low-energy region. It is driven by experiments searching for DM using NaI(Tl) detectors, inspired by the observations made by DAMA/LIBRA [32,33,34,35]. DAMA/LIBRA claimed to have detected an annual modulation signal compatible with DM interaction below 6 keV, which has motivated other experiments to analyze low-energy scintillation events in an attempt to replicate their results [36, 37]. Moreover, the search for low-mass DM signal in the low-energy region has intensified [38, 39], with consideration of interactions such as weakly interacting massive particle (WIMP) scattering on the nucleus [40,41,42,43,44,45], the Migdal effect [46,47,48], and DM-electron scattering [49,50,51,52,53,54,55]. Another phenomenon that relies on the analysis of low-energy scintillation is coherent elastic neutrino-nucleus scattering (CE\(\nu \)NS). Ongoing efforts aim to observe CE\(\nu \)NS of reactor neutrinos using NaI(Tl) detectors [56]. Additionally, a feasibility test of future NaI(Tl) detectors for CE\(\nu \)NS observations of solar and supernova neutrinos has been conducted [57]. These endeavors crucially require understanding extremely low-energy events.

While substantial efforts have been dedicated to lowering the energy threshold through hardware and software upgrades [36, 58, 59], there has been a relative neglect of the study on nPR of scintillation response. However, misconceptions regarding scintillation response could lead to incorrect interpretations of DM and \(\nu \) signals. It is also evident in the recent measurements and interpretations of nuclear recoil quenching factors, where taking nPR into account results in an energy scale difference of about 25% at 1 keV electron equivalent [60]. This is particularly relevant in the low-energy region, where light yield rapidly changes and can introduce significant biases. Unfortunately, previously measured nPR curves have not adequately covered the low-energy region, resulting in a scarcity of reported light-yield measurements at these energies, with only a few indirect or obsolete values available [3, 10].

To address the aforementioned challenges, this paper presents the results of our \(\gamma \)-ray calibration of NaI(Tl) crystals used in the COSINE-100 experiment. We utilized various mono-energetic peaks from internally contaminated isotopes, covering an energy range from 0.87 to 88 keV. Given the long-term nature of the DM search experiment, we extracted these energy peaks by simultaneously examining their decay times and peak positions. To complement the data, we also conducted \(\gamma \) spectroscopy using another NaI(Tl) crystal from the same manufacturer.

In Sect. 2, we describe the experimental setup, encompassing the COSINE-100 experiment, as well as the additional \(\gamma \) spectroscopy. COSINE-100 data, which forms the basis of this study, is summarized in Sect. 3, highlighting the observations and fitting procedures. Section 4 focuses on the \(\gamma \) spectroscopy methodology, outlining the measurements and analysis techniques to supplement COSINE-100 data. In Sect. 5, we consolidate the results and present the measured nPR. Additionally, we provide the resolution and validate it using a waveform simulation. Finally, in Sect. 6, we conclude by summarizing the key findings of our research and emphasizing the implication for future research.

2 Experimental setup

The COSINE-100 experiment employed eight high-purity NaI(Tl) crystals developed in collaboration with Alpha Spectra Inc. These crystals collectively served as the target for DM direct detection. They had a total mass of 106 kg and were equipped with one 3-in. photomultiplier tube (PMT) at each of the two ends, enabling the detection of scintillation light. Each crystal was housed within a copper encapsulation. The setup featured eight such encapsulations submerged in a 2200 L volume of LAB-based liquid scintillator (LS), which served as an active veto system [61]. An acrylic box contained the LS, with eighteen PMTs attached to its walls for scintillation light detection, respectively. Events were categorized as multiple-hit or single-hit events based on whether particles left signals in other detectors (LS or crystal). The acrylic box is surrounded by copper and lead, which provide passive shielding layers, while an outermost layer consists of plastic scintillator panels used as a muon veto [62].

Signals collected by the crystal PMTs were recorded, as waveforms, through a data acquisition (DAQ) system. The DAQ system comprised pre-amplifiers, analog-to-digital converters (ADC), and a trigger control board (TCB), with detailed specifications outlined in Sect. 7 of Ref. [63]. The charge deposition of an event, serving as an indicator of energy, is determined through the integration of the waveform over a 5 \(\upmu \)s time window following noise removal. Data taking for physics purposes commenced on October 20, 2016, at the Yangyang Underground Laboratory [63], and this study utilized data collected through June 9, 2022. Notably, three crystals were excluded from the analysis due to their low light yields or high noise levels [42, 44].

Despite the high purity of these NaI(Tl) crystals, radioactive decay signals were observed inside them [64]. Specifically, in the region of interest of this study, three long-lived isotopes – \(\mathrm {^{210}Pb}\), \(\mathrm {^{22}Na}\), and \(\mathrm {^{40}K}\) – were clearly identified as contamination inside the crystals. Moreover, several short-lived isotopesFootnote 1 were generated by cosmic rays prior to installation, including \(\mathrm {^{109}Cd}\), \(\mathrm {^{113}Sn}\), \(\mathrm {^{125}I}\), \(\mathrm {^{121\,m}Te}\), and \(\mathrm {^{127\,m}Te}\) [65]. Signal rates from these isotopes were characterized by the isotopes’ decay times and effectively modeled using Monte Carlo simulations. This paper reanalyzes the energy peaks associated with these isotopes, which serve as calibration points.

Apart from the crystals used in COSINE-100, a smaller-sized NaI(Tl) crystal from the same manufacturer was utilized for a \(\gamma \) spectroscopy experiment. The crystal had a cylindrical shape with a diameter of 3 inches and a height of 7 inches. It was enveloped in 2–3 mm of PTFE, enclosed by 2 mm of aluminum, and fitted with a quartz window. It featured a 3-inch PMT identical to those used in COSINE-100. The crystal was further shielded using 5-cm-thick lead bricks and installed at the IBS HQ ground laboratory in Daejeon, Korea. The DAQ system for this crystal also included a pre-amplifier, an ADC module, and a TCB, all with the same specifications as those in COSINE-100. Notably, a more stringent trigger condition was applied to reduce the high trigger rate arising from noise and background events at the low-energy region. This setup facilitated measurements of energy responses to external \(\gamma \) sources, such as \(\mathrm {^{241}Am}\), \(\mathrm {^{133}Ba}\), \(\mathrm {^{109}Cd}\), and \(\mathrm {^{137}Cs}\). Importantly, the peaks used for analysis remained independent of the trigger condition.

3 Extracting internal peaks

From COSINE-100 data, there are peaks resulting from the decay of isotopes contaminated in the crystals, which were used for calibration. Throughout the entire experimental period, three peaks from long-lived isotopes – 49 keV from \(\mathrm {^{210}Pb}\), 0.87 keV from \(\mathrm {^{22}Na}\), and 3.2 keV from \(\mathrm {^{40}K}\) – remained consistently visible. These peaks were extracted by modeling their shapes and the background components around them from the charge distributions stacked over the entire experiment duration. Short-lived isotopes also produced additional charge peaks that were visible only in the early data. To extract them, a time-dependent model for charge distributions was developed in accordance with the decay of each isotope. The following sections outline the formulation and results of the modeling.

3.1 Long-lived isotopes

The charge distribution of single-hit events in COSINE-100 revealed a distinct peak from \(\mathrm {^{210}Pb}\). It decays to \(\mathrm {^{210}Bi}\) through \(\beta \) decay, where 84% of them are excited at 46.5 keV. The subsequent deexcitation of this daughter nucleus emits an energy of 46.5 keV, while the energy of \(\beta \) ray is widely distributed, having an average of 4.2 keV. Due to the asymmetric shape of the \(\beta \) spectrum and the resolution-smearing effect, the peak position becomes approximately 49 keV.

To effectively represent this composition, the peak is modeled with the function \(f_{\textrm{Pb}}(q)\), defined as

$$\begin{aligned} \begin{aligned} f_{\textrm{Pb}}(q)&= \textrm{Q} \left( q;~ A,~ q_{\textrm{min}},~ q_{0} \right) *\textrm{N} \left( q;~ \mu =0,~ \sigma ^{2} \right) \\&\quad + \textrm{AG} \left( q;~ B,~ \mu ,~ \sigma ^{2}_{\textrm{l}},~ \sigma ^{2}_{\textrm{r}} \right) + Cq + D, \end{aligned} \end{aligned}$$
(1)

where q is the integrated charge from the signal waveform, while all the other parameters are free fitting parameters. In this equation, the first part (\(\textrm{Q} *\textrm{N}\)) represents the 49 keV peak from \(\mathrm {^{210}Pb}\), where \(\textrm{Q}\) is the quadratic distribution, defined as

$$\begin{aligned} \begin{aligned} \textrm{Q}&\left( q; ~ A, ~ q_{\textrm{min}}, ~ q_{0} \right) \\&~~ \equiv {\left\{ \begin{array}{ll} A \left( q - q_{0} \right) ^{2}, &{} \text {for } q_{\textrm{min}} \le q \le q_{0} \\ 0, &{} \text {otherwise} \end{array}\right. }, \end{aligned} \end{aligned}$$
(2)

where A is the normalization factor, \(q_{0}\) is the vertex position, and \(q_\textrm{min}\) is the minimum range of the distribution. In the context of the decay process of \(\mathrm {^{210}Pb}\), \(q_\textrm{min}\) represents the charge deposition associated with the energy of the \(\gamma \) ray, while \(q_{0}\) corresponds to the charge deposition at the maximum total energy. The symbol \(*\) denotes the convolution operator, and \(\textrm{N}\) is the normal distribution centered at 0 with its standard deviation given by the parameter \(\sigma \). The convolution of this normal distribution mimics the resolution-smearing effect in the charge distribution.

For the background distribution around this peak, we adopted a sum of a linear component (\(Cq + D\)) and an asymmetric Gaussian distribution \(\textrm{AG}\) defined as

$$\begin{aligned} \begin{aligned} \textrm{AG}&\left( q;~ B,~ \mu ,~ \sigma ^{2}_{\textrm{l}},~ \sigma ^{2}_{\textrm{r}} \right) \\&~ \equiv {\left\{ \begin{array}{ll} B \exp {\left[ -\left( q - \mu \right) ^2 \, / \, 2 \sigma ^{2}_{l} \right] }, &{} \text {for }q < \mu \\ \\ B \exp {\left[ -\left( q - \mu \right) ^2 \, / \, 2 \sigma ^{2}_{r} \right] }, &{} \text {for }q \ge \mu \\ \end{array}\right. }, \end{aligned} \end{aligned}$$
(3)

where B is the normalization constant, \(\mu \) is the peak position, and \(\sigma _{\textrm{l}}\) and \(\sigma _{\textrm{r}}\) are the standard deviations impacting the left and right tails. The distributions accurately represented the background components, as confirmed by the simulated distributions created using the Geant4 Monte Carlo simulation toolkit [66]. The fitting method produced a bias of less than 0.1% in the peak position, which suggests that it is reliable enough to handle other subdominant radioactive isotopes like \(\mathrm {^{121\,m}Te}\) or \(\mathrm {^{129}I}\). The data distribution and the fitted model are depicted in Fig. 1.

The position of the \(\mathrm {^{210}Pb}\) peak was determined as the position where the peak component (\(\textrm{Q} *\textrm{N}\)) is maximized. To interpret the position as the light amount for the energy, we assign a 1% systematic uncertainty to account for the unknown spatial distribution of \(\mathrm {^{210}Pb}\) within the crystal and for the poorly known spatial dependence of the detector response. The total uncertainty is visually depicted as a yellow band around the vertical line representing the peak position in Fig. 1. The stability of fitting was improved by assigning the peak position based on its maximum value. However, this also caused minor fluctuations in the true peak energies among the detectors due to their resolution difference. The bias term was determined to be about 0.5% and was considered when calculating the light yield.

Fig. 1
figure 1

The charge distribution around the 49 keV peak of \(\mathrm {^{210}Pb}\), as measured by a NaI(Tl) crystal in COSINE-100 (depicted by the black crosses). The solid orange curve represents the fitted model described in Eq. (1). The \(\mathrm {^{210}Pb}\) peak is drawn as the dashed yellow curve. The vertical yellow line indicates the position of the \(\mathrm {^{210}Pb}\) peak, along with its associated error. The dotted blue curve represents the summed spectrum of background components

In the charge distribution of multiple-hit events, we observed two distinct peaks associated with isotopes from internal contamination, specifically \(\mathrm {^{22}Na}\) and \(\mathrm {^{40}K}\). Approximately 10% of \(\mathrm {^{22}Na}\) decays to the 1275 keV level of \(\mathrm {^{22}Ne}\) through electron capture (EC), while a similar proportion of \(\mathrm {^{40}K}\) decays via EC to the 1461 keV level of \(\mathrm {^{40}Ar}\) [67, 68]. The daughter atoms undergo cascade processes, leaving characteristic peaks at their respective K-shell edges within the crystal where \(\mathrm {^{22}Na}\) and \(\mathrm {^{40}K}\) originally resided. Concurrently, high-energy \(\gamma \) rays are emitted due to the deexcitation of these daughter nuclei. These energetic \(\gamma \) rays can escape the crystal, potentially triggering the neighboring crystals or the LS veto system, leading to coincidences. Consequently, we clearly identified two low-energy peaks in the charge distribution of multiple-hit events, corresponding to 0.87 keV and 3.2 keV. These values represent the K-shell binding energies of \(\mathrm {^{22}Ne}\) and \(\mathrm {^{40}Ar}\) [69]. The charge distribution is shown in Fig. 2, where the observed peak positions and shapes align with our expectations.

For the 0.87 keV peak, a scaled Poisson distribution was used for modeling. The distribution is represented as

$$\begin{aligned} f_{\textrm{Na}} \left( q; ~ A, ~ \lambda , ~ s \right) \equiv A\,\frac{\lambda ^{q / s }\,e^{-\lambda }}{\Gamma \left( q / s \, + 1 \right) }, \end{aligned}$$
(4)

where A is the normalization constant, \(\lambda \) is the expected number of photo-electrons (\(N_{\textrm{pe}}\)), and s denotes the scaling factor originating from the PMTs. The 3.2 keV peak produced sufficient photo-electrons such that we modeled it using a normal distribution. The background components were attributed to Compton scattering of high-energy external \(\gamma \) rays, resulting in relatively smooth shapes. To account for the uncertainty in the background distribution’s shape, flat or linear-shaped distributions were adopted as background models, and the difference between results from these models was considered as a systematic uncertainty. All the parameters denoting the normalization, position, and width of the peaks, and the shape of background distribution are free in the fitting process. The charge distribution, fitted model, and the extracted peak positions are shown in Fig. 2, and the peak positions are summarized in Table 1.

Fig. 2
figure 2

The charge distribution of multiple hit events at the low-energy region, as recorded by a NaI(Tl) crystal in COSINE-100 (indicated by the black crosses). The solid orange curve corresponds to the fitted model. Additionally, the dashed yellow curve represents the scaled Poisson distribution for the 0.87 keV from \(\mathrm {^{22}Na}\), while the dashed pink curve illustrates the normal distribution for the 3.2 keV from \(\mathrm {^{40}K}\). The vertical lines and bands depict their peak positions and associated errors. The dotted blue curve represents the linear-shape model for the background components

Table 1 List of peak positions extracted from COSINE-100 data

3.2 Short-lived isotopes

Short-lived isotopes, such as \(\mathrm {^{109}Cd}\), \(\mathrm {^{113}Sn}\), \(\mathrm {^{125}I}\), \(\mathrm {^{121\,m}Te}\), and \(\mathrm {^{127\,m}Te}\), were initially activated by cosmic rays while the crystals were exposed on the ground and disappeared quickly due to their fast decay times. Their decay processes emit \(\gamma \) rays and X-rays at various energy levels, making them valuable for this study. Unfortunately, their energy levels overlap, and their activities are scarce, posing challenges for extracting their peaks using the method outlined in Sect. 3.1. To overcome this issue, we leveraged constraints based on their decay characteristics.

To observe the decays of these isotopes, we accumulated charge distributions over 475 days from the beginning of data acquisition, dividing it into 19 periods, each spanning 25 days. Figure 3 shows the distributions for three periods: 1st, 5th, and 9th. While decaying and constant components are visually identifiable, determining the position of each peak directly from the spectral shape is challenging. We built a model to extract the peaks from these spectra, with consideration of the decay characteristics of the short-lived isotopes.

Fig. 3
figure 3

The charge distributions from the initial COSINE-100 data and the extracted peaks attributed to short-lived isotopes. The data points, indicated by black crosses, represent measurements from each data collection period and have been simultaneously fitted by using Eq. (8). The time-normalized late distribution L(q) is depicted as gray histograms in each data collection period. Each isotope component is visualized as a stacked histogram in its own color

The time-dependent charge distributions for each short-lived isotope i can be expressed as

$$\begin{aligned} D_{i} \left( q, ~ t; ~ A_{i}, ~ \vec {\mu }_{i}, ~ \vec {\sigma }_{i} \right) = A_{i} \, e^{-t / \tau _{i}} \sum _{j} f_{ij} \, \textrm{N} \left( q; ~ \mu _{ij}, ~ \sigma ^{2}_{ij} \right) ,\nonumber \\ \end{aligned}$$
(5)

where t denotes time, \(A_{i}\) and \(\tau _{i}\) are the initial activity and lifetime of isotope i, and \(f_{ij}\) is the branching fraction for the jth peak of isotope i. Each peak is modeled as a normal distribution \(\textrm{N} (q)\) with peak position \(\mu _{ij}\) and resolution \(\sigma _{ij}\). For the \(\mathrm {^{210}Pb}\) component, the normal distribution in Eq. (5) was replaced with \(\textrm{Q} *\textrm{N}\), as used in Sect. 3.1.

The charge distribution accumulated between times \(t_{s}\) and \(t_{e}\) can be modeled as

$$\begin{aligned} \begin{aligned} S&\left( q, ~ t_{s}, ~ t_{e}; ~ \left\{ A_{i}, ~ \vec {\mu }_{i}, ~ \vec {\sigma }_{i} \right\} \right) \\&= \int _{t_{s}}^{t_{e}} \left[ \sum _{i} D_{i} \left( q, ~ t; ~ A_{i}, ~ \vec {\mu }_{i}, ~ \vec {\sigma }_{i} \right) + C \left( q \right) \right] s \left( t \right) dt, \end{aligned} \end{aligned}$$
(6)

where s(t) denotes the detector operational state (1 or 0) at time t. The term C(q) represents the time-independent component in the spectrum, consisting of isotopes with decay times orders of magnitude longer than the experiment’s duration. Instead of constructing a complex analytic model for the uninteresting C(q) term, we attempted to eliminate its explicit dependence. Since the term is common to all time periods, subtracting a charge distribution from another period with proper time-normalization could effectively remove the C(q) dependency.

For the subtraction, a late distribution L(q) was accumulated, expressed as \(L \left( q \right) = S \left( q, ~ t_{0}, ~ t_{1} \right) \) for \(t_{0}\) and \(t_{1}\) defined as 650 days and 1,150 days after the start of data acquisition. Due to its long time period, L(q) has a sufficiently small error to be treated in the model as a known distribution rather than fitted data. Replacing the C(q) term with L(q) from Eq. (6) yields

$$\begin{aligned}{} & {} S \left( q, ~ t_{s}, ~ t_{e}; ~ \left\{ A_{i}, ~ \vec {\mu }_{i}, ~ \vec {\sigma }_{i} \right\} \right) \nonumber \\{} & {} \quad = \int _{t_{s}}^{t_{e}} \sum _{i} D_{i} \left( q, ~ t \right) \, s \left( t \right) \, dt \nonumber \\{} & {} \qquad + \frac{\int _{t_{s}}^{t_{e}} s(t) \, dt}{\int _{t_{0}}^{t_{1}} s(t) \, dt} \left[ L \left( q \right) - \int _{t_{0}}^{t_{1}} \sum _{i} D_{i} \left( q, ~ t \right) \, s \left( t \right) \, dt\right] ,\qquad \end{aligned}$$
(7)

which depends only on the set of parameters related to the peaks. Consequently, the number of events for the kth 25-days-period spectrum in the lth charge bin is expected as \(S_{kl} = S \left( q_{l}, ~ (k - 1) \Delta T, ~ k \Delta T \right) \), where \(q_{l}\) is the central value of the lth charge bin, and \(\Delta T\) is the period interval, 25 days.

To extract peaks from the short-lived isotopes, we conducted a simultaneous fit on 19 distributions using a binned likelihood,

$$\begin{aligned}{} & {} {\mathcal {L}} \left( \left\{ A_{i}, \vec {\mu }_{i}, ~ \vec {\sigma }_{i} \right\} ; ~ {\textbf{O}}, ~ \vec {\mu }^{A}, ~ \vec {\sigma }^{A} \right) \nonumber \\{} & {} \quad = \prod _{k}^{19} \prod _{l}^{N_{\textrm{bin}}} \frac{ S_{kl}^{O_{kl}} e^{-S_{kl}} }{ O_{kl}! } \times \prod _{i} \exp \left[ - \frac{1}{2} \left( \frac{A_{i} - \mu _{i}^{A}}{\sigma _{i}^{A}} \right) ^{2} \right] , \nonumber \\ \end{aligned}$$
(8)

where \(O_{kl}\) is the observed number of events in the kth period spectrum at the lth charge bin. The initial activity \(A_{i}\), peak positions \(\mu _{ij}\), and resolutions \(\sigma _{ij}\) were treated as free parameters, while those for \(\mathrm {^{210}Pb}\) were kept constant at the values obtained in Sect. 3.1. Constraints for the initial activities, \(\mu ^{A}_{i}\) and \(\sigma ^{A}_{i}\) were obtained from our previous measurement [65]. As seen in Fig. 3, the extracted peaks effectively account for the charge distributions in different time domains simultaneously.

For verification, we applied the same fitting procedure to simulated distributions, allowing us to obtain the true mean energy value for each peak. Through this process, light yields were obtained for several points ranging from 25 to 88 keV, as summarized in Table 1. Crystal 2, cooled underground for about 2.75 years before the experiment started, had the lowest amount of short-lived cosmogenically induced isotopes [65]. It was the longest cooling time among the crystals, resulting in larger errors and missing elements in Table 1 for Crystal 2. The true energy estimation from the simulated distribution for Crystal 2 was also unstable due to the same reason, and these uncertainties were considered in the nPR analysis. Moreover, the nPR analysis excluded the peaks from isotopes \(\mathrm {^{121\,m}Te}\) and \(\mathrm {^{127\,m}Te}\) because their amounts were small and their peak positions overlapped with stronger isotopes.

4 Gamma spectroscopy

An independent experiment was conducted for \(\gamma \) spectroscopy to acquire complementary data and broaden the energy range of our analysis. Charge distributions were measured using four \(\gamma \) sources, \(\mathrm {^{241}Am}\), \(\mathrm {^{133}Ba}\), \(\mathrm {^{109}Cd}\), and \(\mathrm {^{137}Cs}\). These sources were placed near the crystal and aligned at its center during data taking. The energies of the \(\gamma \) rays and X-rays emitted from these sources, along with their exposure times, are listed in Table 2. Data without any sources was also collected over an extended period to obtain the background distributions before and after acquiring data with the sources. This allowed us to verify the stability of the PMT gain and background distribution with time. The pure peak for each source was identified by subtracting the background distribution, as illustrated in Fig. 4.

Table 2 List of sources used for \(\gamma \) spectroscopy, with measured peak positions. The energy was obtained by modeling the simulated spectra

The peak positions were determined by fitting the charge distributions. The charge distribution for isotope i can be modeled as

$$\begin{aligned} S_{i} \left( q \right) = \sum _{j} N_{ij} \, \textrm{CB} \left( q; ~ \mu _{ij}, ~ \sigma _{ij}, ~ \alpha _{ij}, ~ n_{ij} \right) , \end{aligned}$$
(9)

where \(N_{ij}\) represents the normalization constant of the jth peak, and \(\textrm{CB} (q)\) denotes a Crystal Ball function commonly used to model peaks measured by scintillation crystals with energy loss processes [70]. The function includes parameters for peak position \(\mu _{ij}\) and resolution \(\sigma _{ij}\), which are free during the fitting process. The \(\alpha _{ij}\) and \(n_{ij}\) parameters account for the threshold and the exponent of the power-law low-end tail part in the distribution, respectively. Flat and linear background components were added to the \(\mathrm {^{133}Ba}\) and \(\mathrm {^{137}Cs}\) distributions, respectively, to model Compton scattering of high-energy \(\gamma \) rays.

Fig. 4
figure 4

Charge distributions of external sources after subtraction of the background distribution, recorded by the sample crystal for \(\gamma \) spectroscopy. Black crosses represent the data, and orange curves depict the fitting results used to extract the peak positions. The peak positions extracted are shown as vertical lines with error bands

The model was fitted to data by minimizing the following chi-squared:

$$\begin{aligned} \chi ^{2}_{i} \equiv \sum _{l}^{N_{\textrm{bin}}} \dfrac{ \left[ D_{il} - \left( t_{i}/t_{b}\right) B_{l} - S_{i} \left( q_{l} \right) \right] ^{2} }{ D_{il} + \left( t_{i}/t_{b}\right) ^{2} B_{l} }. \end{aligned}$$
(10)

In this equation, \(D_{il}\) and \(B_{l}\) are the measured number of events in ith source and background data, respectively, at the lth charge bin, centered at \(q_{l}\). Exposure times for the ith source and background data are represented by \(t_{i}\) and \(t_{b}\), respectively. The extracted peak positions are displayed in Fig. 4 as vertical lines with their uncertainties.

The same fitting procedure was conducted with simulated distributions, as done in Sect. 3. All dimensions of the detector geometry were implemented based on measurements, except for the thickness of the PTFE wrapped inside the aluminum case. To account for the uncertainty, two different conditions were simulated with this thicknesses set to 2 mm and 3 mm. The simulation results were fitted separately, and the difference in peak positions was assigned to the systematic error of the true energy. One can see those errors and the extracted peak positions in Table 2.

5 Scintillation response of NaI(Tl) crystal

This section presents the nPR of NaI(Tl) crystals based on the measurements described in the previous sections. We also compare our results with previous references. In addition, the energy resolution for each peak was measured and validated using a waveform simulation developed, considering the characteristics of the NaI(Tl) scintillator [71].

5.1 Photon response measurements

The relative light yield obtained via peak extraction is depicted in Fig. 5. In the previous sections, we measured numerous charges of peaks from the data and estimated the true energy values from simulated spectra. The charge was divided by the true energy for each peak. To emphasize nonproportional behavior over the absolute values, the ratio of charge to energy was normalized by a constant for each crystal, termed the relative light yield. Normalization was chosen to set the value to unity at 49 keV for all crystals. The error bars include systematic errors resulting from the radioactive source’s position and the peak extraction method. These measurements exhibited consistency across the crystals and the measuring techniques within the associated uncertainties. The well-known K-shell dip of NaI(Tl) at 33 keV is also identifiable.

Fig. 5
figure 5

The relative light yield as a function of incident \(\gamma \)-ray energy, normalized to the value at 49 keV. The measurements are represented by colored dots with error bars, and the fitted function is drawn as the solid orange curve. The translucent orange band represents the uncertainty from the modeling at 68% confidence level

The data was fitted using an empirical function developed based on the Payne’s model [15,16,17,18,19], as shown by the orange curve in Fig. 5. An expectation arises for the presence of the L-shell dip structure around 4.5 keV, although there is an insufficient number of data points below 20 keV to constrain the detailed shape of the dip. We have checked the consistency of our curve with measurements from two references comparing quantities, such as the light-yield ratio between 4.5 and 20 keV, the ratio between 0.87 and 20 keV, and the slope around 10 keV. These were found to be consistent with previous reports [3, 10], as summarized in Table 3. Despite this consistency check, we acknowledge that our curve between 4.5 and 20 keV is less convincing, resulting in a relatively wider error band, with its width dependent on the selected model parameterization. The measurement of light yield in this energy region is left for future investigation.

To evaluate the nPR in the high-energy region above 200 keV, we examined its consistency with the data reported in Ref. [7]. We measured the light yields by fitting the prominent \(\gamma \) peaks from the high-energy spectrum, including 295 keV from \(\mathrm {^{214}Pb}\), 511 and 1274 keV from \(\mathrm {^{22}Na}\), 609 and 1764 keV from \(\mathrm {^{214}Bi}\), 1461 keV from \(\mathrm {^{40}K}\), and 2615 keV from \(\mathrm {^{208}Tl}\). The measured light yields demonstrate agreement with the nPR curve reported in Ref. [7], as illustrated in Fig. 6.

Table 3 Comparison of \(\gamma \) nPR measurements. \(\textrm{LY}_{\gamma } \left( E \right) \) is the light yield for a \(\gamma \) ray with incident energy E, and the relative light yield \(\textrm{LY}_{\gamma ,\,{49}} \left( E \right) \equiv \textrm{LY}_{\gamma } \left( E \right) / \textrm{LY}_{\gamma } \left( {49}\,\textrm{keV} \right) \)
Fig. 6
figure 6

The relative light yield (normalized at 662 keV) for the high-energy \(\gamma \) peaks is represented by black dots with error bars. The orange dots and solid orange curve represent the nPR measured from Ref. [7], where our data agrees well with the reference

5.2 Resolution and waveform simulation

Once a peak is extracted, the energy resolution \(\sigma _{ij}\) is naturally obtained along with the peak position \(\mu _{ij}\). As shown in the dashed gray curve in Fig. 7, the characteristic \(1/\sqrt{E}\)-like shape, originating from the Poisson fluctuations in \(N_{\textrm{pe}}\) for the PMT, is clearly visible. Also resolution effects due to the detector geometry and the distribution of the number of amplified electrons in the PMT are observed.

Fig. 7
figure 7

The relative resolution as a function of incident \(\gamma \)-ray energy for a NaI(Tl) crystal. The black dots with error bars represent the measurements, and the solid orange curve represents the model of the resolution curve, as described in Eq. (11), with an error band at 68% confidence level. The solid pink curve illustrates the \(\sqrt{a/q}\) component from the resolution model, which closely matches the waveform simulation (solid blue curve). The dashed gray curve represents the Poisson fluctuation in \(N_{\textrm{pe}}\)

To account for these effects, the relative resolution as a function of charge was modeled using the following equation,

$$\begin{aligned} \frac{\sigma (q)}{q} = \sqrt{\frac{a}{q} + b}. \end{aligned}$$
(11)

Here, a and b are fitting parameters, where a scales the Poisson resolution contribution and b typically arises from non-uniformity of detector response, which is not included in the simulation. In Fig. 7, the orange line represents the model fitted to the measurements, with the uncertainty shown as a band. The resolution \(\sigma (q)\) is transformed to \(\sigma (E)\) via the nPR curve shown in Fig. 5, leading to the emergence of non-analytic points at the 4 keV L-shell edge and the 33 keV K-shell edge. The pink line represents the function when \(b = 0\).

Given the focus on DM detection experiments, where the region of interest typically lies below 6 keV, it is crucial to investigate the resolution in the sub-keV range, particularly for future upgrades. Since the effect of electron amplification in PMTs and DAQ system is most dominant at low energies, except for that due to the Poisson fluctuations with \(N_{\textrm{pe}}\), a waveform simulation developed for the COSINE-100 experiment was utilized to validate the resolution in this region [71]. This simulation incorporates factors such as rise and decay times of NaI(Tl) scintillation, fluctuations in the single photo-electron shape, DAQ electronics, and charge accumulation processes. Therefore, by inputting \(N_{\textrm{pe}}\) that takes into account Poisson fluctuations, we can estimate the resolution with these effects included.

The blue line in Fig. 7 shows that the simulation successfully replicates the data. Despite not accounting for the intricate details of detector geometry and other mechanisms, the simulation exhibits good agreement with the measurements at the low-energy region. In addition, the agreement between the simulation and the pink curve, which is the measurement with the constant-term factor excluded, confirms the possibility of extrapolating the resolution curve to the low-energy direction. These findings hold particular significance for scintillator experiments focusing on low-energy ranges.

6 Summary and conclusion

In conclusion, this study has thoroughly examined the nPR characteristics of NaI(Tl) scintillators within the context of DM search experiments. Through a comprehensive calibration process using COSINE-100 data, complemented by \(\gamma \) spectroscopy, we have gained valuable insights into the behavior of NaI(Tl) crystals.

The measurement of light yields at various energies was successfully achieved by analyzing internal peaks originating from both long-lived and short-lived isotopes. Extraction of peaks from long-lived isotopes was easily accomplished due to stable experimental data acquisition. A significant milestone was reached by directly measuring the light yield at 0.87 keV, generated by the cascade following \(\mathrm {^{22}Na}\) EC decay, facilitated by the active veto system. This direct measurement serves as validation of an indirect measurement obtained by analyzing X-rays near the Iodine K-shell using synchrotron radiation [10]. Overcoming the challenge of extracting multiple overlaid peaks from short-lived isotopes was made possible through long-term data collection and time-dependent analysis.

In the nPR curve measured here, the distinctive K-shell dip structure was observed. The nPR curve was fitted using an empirical curve. The resulting fitted nPR curve exhibits consistency with previously reported measurements, particularly in the energy range below 20 keV. However, the lack of data points around the 10 keV region limits the precision of the calibration curve, leaving the error band relatively wider. This aspect remains a focus for future investigation.

The resolution analysis of NaI(Tl) scintillators revealed a characteristic resolution curve that follows the \(1/\sqrt{N_{\textrm{pe}}}\) trend, as expected for Poisson fluctuations in \(N_{\textrm{pe}}\). Waveform simulation provided compelling confirmation of the resolution at low energies, allowing for the extension of the resolution curve to lower energy ranges. This enhancement in our ability to detect low-energy DM recoil signals is very important.

The findings of this study hold significant implications for background modeling in NaI(Tl) experiments. Precise calibration and characterization of the nPR effect are essential for accurately modeling the background spectra resulting from radioactive interactions involving electrons and photons. To facilitate more detailed simulations, the integration of the nonproportional light production of NaI(Tl) into the Geant4 toolkit is under development, building on similar efforts reported previously [72, 73].

Furthermore, it is imperative to consider nPR calibration when interpreting signals that may come from DM interactions with electrons or photons. As the scientific community explores diverse scenarios involving boosted DM and light DM, the significance of these interactions is poised to grow. This study lays the foundation for more robust and accurate DM search experiments in the future.