1 Introduction

Detection of reactor antineutrinos with Liquid Scintillator (LS) detectors has played an important role in the development of neutrino physics [1,2,3,4,5,6]. It is also the approach used in Jiangmen Underground Neutrino Observatory (JUNO) [7] as a next-generation neutrino experiment. Reactor antineutrinos are usually detected via the Inverse Beta Decay (IBD) reaction, \(\bar{\nu }_e + p \rightarrow e^{+}+n \). The positron deposits its kinetic energy in the LS, then annihilates immediately with an electron and emits mostly two back-to-back 0.511 MeV gammas. The neutron scatters in the detector until it is thermalized and captured on a nucleus. Subsequently, the nucleus de-excites with the emission of gamma or alpha particles. Since the neutron capture happens about 30 \(\upmu \)s after the positron annihilation, an IBD reaction produces a prompt and a delayed signal in the detector. This time correlation of the prompt and delayed signals is powerful to distinguish the signal from the backgrounds. The kinetic energy of the neutron is negligible. The initial \(\bar{\nu }_e\) energy can be approximately calculated as \(E_{\bar{\nu }_e} \approx E_{\textrm{prompt}} + 0.784 {\textrm{MeV}}\) [8], where \(E_{\textrm{prompt}}\) is the sum of the positron kinetic energy and the annihilation energy.

In a liquid scintillator detector, charged particles interact with the liquid scintillator and release photons. The number of detected photons depends on the position of the charged particle. This is caused by the attenuation of the photons in the liquid scintillator, the total acceptance angle of the photon detectors, the reflection from the material surface, and so on [9]. The position dependence of the number of detected photons is called detector non-uniformity and is generally independent of the particle energy. In addition, the number of emitted photons is not proportional to the kinetic energy of the charged particle. This effect is referred to as the “physics non-linearity” and is mainly caused by the ionization quenching and Cherenkov radiation [8]. Non-uniformity and non-linearity could compromise the measurement of the antineutrino energy spectrum. Therefore, both effects need to be corrected by calibration.

The Taishan Antineutrino Observatory (JUNO-TAO, or TAO) is a satellite experiment of JUNO. It is located at 30 m from a reactor core of the Taishan Nuclear Power Plant and detects reactor antineutrinos with 2.8 tons Gadolinium-doped Liquid Scintillator (GdLS). The primary physics goal of JUNO is to determine the neutrino mass ordering by measuring the reactor antineutrino spectrum with an energy resolution of better than 3%/\(\sqrt{{\mathrm{E(MeV)}}}\) [7]. TAO will measure the reactor antineutrino spectrum with an energy resolution of better than 2% at 1 MeV to provide a model-independent reference spectrum for JUNO and a benchmark measurement to test nuclear databases. Achieving these goals requires less than 1% uncertainty in the physics non-linearity and less than 0.5% residual non-uniformity after correction [10]. In this paper, we present a calibration strategy that meets these requirements.

The organization of this paper is as follows. In Sect. 2, we introduce the TAO experiment and present the conceptual design of the calibration system. In Sect. 3, we discuss the details of calibration sources. In Sect. 4, we develop a method to correct the non-uniformity in the TAO detector and verify its effectiveness through simulations. In Sect. 5, we discuss the approach to calibrate the physics non-linearity and verify it through simulations. Finally, we give a conclusion in Sect. 6.

2 The calibration system of the TAO experiment

2.1 The TAO experiment

Fig. 1
figure 1

Schematic of the TAO detector, which consists of the central detector, the calibration system, the outer shielding, and the veto system. The Calibration system consists of an Automated Calibration Unit (ACU) and a Cable Loop System (CLS). A segment of the CLS cable is located in the GdLS with the start point marked as \({\textrm{P}}_{\textrm{0}}\). A coordinate system is defined with the origin at the center of the central detector and the Z-axis pointing upwards

As shown in Fig. 1, TAO consists of the Central Detector (CD), the calibration system, the outer shielding and the veto system. In the CD, there is a spherical acrylic vessel with an inner diameter of 1.8 m, which contains approximately 2.8 tons of GdLS. The acrylic vessel is viewed by about \(10\,{\text {m}}^2\) Silicon Photo-multipliers (SiPMs) with a photon detection efficiency of \(\sim \,50\)%. To reduce the dark noise of the SiPMs, the detector will operate at − 50 \(^\circ \)C. To fully contain the energy deposition of gammas from IBD positron annihilation, events within 25 cm to the edge of the acrylic vessel are excluded in the IBD selection, resulting in a fiducial volume (FV) with a diameter of 1.3 m. Full detector simulation shows about 4500 photoelectrons (PE) per MeV can be detected by SiPMs, leading to an energy resolution of about 2% at 1 MeV. TAO is able to achieve a vertex resolution of better than 5 cm using the recorded charge and timing information by the SiPMs and the electronics system [10].

2.2 Calibration system

The calibration system contains the Automated Calibration Unit (ACU), which is reused and modified from the Daya Bay experiment [11], and a Cable Loop System (CLS), as shown in Fig. 1. A segment of the CLS cable is located in the GdLS with the start point marked as \({\textrm{P}}_{\textrm{0}}\). A coordinate system is defined with the origin at the center of the central detector and the Z-axis pointing upwards. The ACU and CLS calibrate the energy response on and off the central axis (Z-axis), respectively.

As shown in Fig. 2a, the ACU consists of a turntable and three mechanically independent motor/pulley/wheel assemblies. The turntable comprises two plates (named as top and middle plates) and three assemblies mounted on the top plate. Each assembly is capable of deploying a source into the detector along the central Z-axis once the turntable rotates to a specific angle, similar to the applications in Daya Bay and JUNO [11, 12]. An ultraviolet (UV) light source, a \(^{68}\)Ge source, and a combined source that contains multiple gamma sources and one \(^{241}\)Am-\(^{13}\)C neutron source will be installed on three assemblies, one source for each assembly.

The \(^{68}{\textrm{Ge}}\) source and the combined source consist of radioactive materials and a stainless steel enclosure with Teflon coating, as shown in Fig. 2b. The Teflon coating reduces the absorption of light by the source with high reflectivity of 95% [13]. In addition, Teflon is compatible with GdLS while stainless steel is not. Each source has two counterweights with a mass of about 40 g, which allows the wire to maintain tension when the source is deployed into the liquid scintillator. When \(^{68}\)Ge and the combined source are parked in the ACU, they are placed within a stainless steel shield and a Borated Polyethylene (BPE) shield respectively as shown in Fig. 2a. These two shields are cylinders with a wall thickness of 2.25 inch and a height of 5 inch. Below the BPE shield, there is another BPE disk with a diameter of 3.25 inch and a height of 6.4 inch. These shields decrease the probability of the radiation of the sources going into the GdLS to a negligible level compared to other backgrounds in the TAO detector.

Fig. 2
figure 2

a Automated calibration system. b Design of \(^{68}\)Ge source (left) and combined radioactive source (right). The combined source contains multiple gamma sources (\(^{137}\)Cs, \(^{54}\)Mn, \(^{40}\)K, \(^{60}\)Co) and one neutron source (\(^{241}\)Am-\(^{13}\)C)

Fig. 3
figure 3

Schematic of the CLS. The CLS contains two motors, two anchors glued on the inner surface of the acrylic vessel, and one cable. The radioactive source (\(^{137}\)Cs) is plated on a small area of the stainless steel cable. The cable passes through the anchors and can be pulled in either direction by two stepper motors to deploy the radioactive source along the track of the cable in the detector

The UV light source is equipped with a high-OH silica optical fiber with a core diameter of 400 \(\upmu \)m and a diffuser made of ultrapure quartz that improves the isotropy of the source’s radiation. The wavelength of the UV light source is 265 nm by default. It can be changed to 420 nm or any other values by the envisaged simplified replacement of one or two LEDs in the optoelectronic unit, which is located outside the detector shielding. The UV light source is used to monitor the stability of the parameters of the TAO detector. This task includes monitoring the state of each SiPM channel and calibrating its timing, gain, and quantum efficiency. The UV light source can also be used to test the data acquisition and offline analysis pipeline and to study the event pileup in the CD.

The CLS allows us to calibrate the detector response in the off-central axis region. It adopts experience from the JUNO CLS [9, 14]. The CLS includes two stepper motors, two anchors, a stainless steel cable, load cells, and limit switches. All components can work at low temperatures down to \(-\,60\,^\circ \)C. The radioactive source (\(^{137}\)Cs) is plated on a small area of the stainless steel cable, and the cable is covered with a 0.1 to 0.2 mm thick Teflon coating along its entire length to prevent contamination of the GdLS. The anchors made by Teflon and acrylic are glued on the inner surface of the acrylic vessel in the CD. The cable passes through the anchors and can be pulled in either direction by two stepper motors to deploy the radioactive source along the track of the cable in the detector as shown in Fig. 3. The positions of the anchors are optimized so that we can use limited calibration positions along the cable to obtain comprehensive information about the non-uniformity response of the detector. When the calibration is completed, the radioactive source is pulled inside the ACU. The limit switch is used to align the radioactive source to the zero position. The load cell is used to monitor the tension of the stainless steel cable to avoid breaking or loosening.

2.3 Simulation software

The simulation software of TAO is based on the framework SNiPER [15] and the Geant4 (10.04.p02) [16] simulation software. It contains the geometry of the TAO detector, the optical parameters, and the list of physics processes. The optical parameters of GdLS, such as the absorption and re-emission probability, are taken from Daya Bay [17] since they use similar GdLS and these optical parameters have been measured to be unchanged at \(-\,50\,^\circ \)C compared to that at room temperature [18]. The light yield of the TAO GdLS is measured to be 96% of the JUNO LS at room temperature and increases by 9% at \(-\,50\,^\circ \)C [19]. The “Livermore Low Energy” model is used to describe the electromagnetic physical processes for photons, electrons, hadrons, and ions [20]. Electronics effects are not included in the simulation since the hardware has not yet been fully tested.

3 Calibration sources

The radioactive sources and reaction processes considered in the calibration are listed in Table 1. In this section, we discuss details about the calibration sources, such as the selection of the radioactive source activities, the selection criteria of calibration events, and the fitting algorithm for the visible energy spectra of the radioactive sources. The visible energy of an event is defined as

$$\begin{aligned} E_{\textrm{vis}} = N_{\textrm{PE}}/Y_{\textrm{0}}, \end{aligned}$$
(1)

where \(N_{\textrm{PE}}\) is the number of photo-electrons detected by the SiPMs, and \(Y_{\textrm{0}}\) is the photo-electron yield, which is determined to be 4445 PE/MeV by simulating the neutron capture on hydrogen at the CD center and dividing the average detected PEs by the gamma energy 2.22 MeV.

3.1 Calibration sources carried by ACU

Table 1 List of radioactive sources and processes used for the TAO calibration

As mentioned in Sect. 2.2, \(^{137}\)Cs, \(^{54}\)Mn, \(^{40}\)K, \(^{60}\)Co, and \(^{241}\)Am-\(^{13}\)C are put into one source enclosure, called the “combined source”, and \(^{68}\)Ge is put into another source enclosure.

The \(^{68}\)Ge can emit positron which then annihilates in the source enclosure to release two 0.511 MeV gammas. The \(^{68}\)Ge is separated from other radioactive isotopes because it has a half-life of only 271 days and needs to be replaced after three years. The simulated visible energy spectrum of \(^{68}\)Ge is shown in Fig. 4a. An energy loss tail arises in the visible energy spectrum because gamma particles may deposit some of their energy in non-scintillating materials such as the source enclosure, weights, etc. For this kind of single calibration source, we use a Gaussian function to model the fully absorbed peak and a complementary error function with a normalization parameter to model the energy loss tail. The fitting function is

$$\begin{aligned}{} & {} f(E_{\textrm{vis}};\eta _{0},\eta _{1},\mu ,\sigma ) \nonumber \\{} & {} = \eta _{0}\cdot \left( e^{-\frac{(E_{\textrm{vis}}-\mu )^{2}}{2\sigma ^{2}}} + \eta _{1}\cdot \textrm{erfc}\left( \frac{E_{\textrm{vis}} - \mu }{\sqrt{2}\sigma }\right) \right) , \end{aligned}$$
(2)

where erfc is the complementary error function, and \(\mu \) and \(\sigma \) represent the mean and standard deviation of the visible energy spectrum of the fully absorbed peak. \(\eta _{0}\) is the absolute amplitude of the fully absorbed peak while \(\eta _1\) is the relative amplitude of the energy loss tail.

Fig. 4
figure 4

a The simulated visible energy spectrum of \(^{68}\)Ge. The number of \(^{68}\)Ge events is equivalent to 100 s calibration data. The contributions of the fully absorbed peak and the energy loss tail are shown in the figure. The mean value of the fully absorbed peak is not equal to 1.022 MeV due to the physics non-linearity. b Gamma spectra and the fitting results of the radioactive isotopes in the combined source. The number of combined source events is equivalent to 10 h calibration data. The spectra of \(^{137}\)Cs and \(^{54}\)Mn are fitted separately from the spectra of \(^{40}\)K and \(^{60}\)Co

The \(^{241}\)Am-\(^{13}\)C source can release neutrons with kinetic energy less than 100 keV through the \(^{13}\)C(\(\alpha \),n)\(^{16}\)O reaction. The \(^{16}\)O may be in a second excited state and de-excites with the release of 6.13 MeV gamma [8]. When the combined source parks in the ACU, neutrons from \(^{241}\)Am-\(^{13}\)C may pass through the shield and enter the central detector. To reduce these neutron backgrounds to an acceptable level, the neutron rate of \(^{241}\)Am-\(^{13}\)C is chosen to be 2 Bq [21]. Signals from \(^{241}\)Am-\(^{13}\)C are selected as prompt-delayed pairs with the accidental backgrounds being removed by the offset-window method [22]. The \(^{16}\)O\(^{*}\) de-excitation gamma produces the prompt signal and the neutron capture peak gives the delayed signal. This prompt-delayed coincidence distinguishes the signals of the \(^{241}\)Am-\(^{13}\)C source from other signals generated by gammas in the combined source. In GdLS, neutron capture mainly happens on the Gd nucleus (n-Gd) and the H nucleus (n-H). Neutron capture on the Gd nucleus will emit multiple gammas whose energy spectra are complex [23], so only the n-H gamma is used for the non-linearity calibration. Since the energy loss tail of n-Gd events affects the peak of the n-H visible energy spectrum, a function as \(\eta _{2} +\) \( f(E_{\textrm{vis}};\) \(\eta _{0}, \eta _{1}, \mu , \sigma )\) is used to model the visible energy spectrum around the n-H fully absorbed peak, where \(\eta _{2}\) models the energy loss tail of the n-Gd spectrum.

The activities of \(^{137}\)Cs, \(^{54}\)Mn, \(^{40}\)K, \(^{60}\)Co are optimized so that the energy loss tail of one source won’t form a significant background for the fully absorbed peaks of other sources, as shown in Fig. 4b. Given the activities we choose, the tails of the \(^{40}\)K and \(^{60}\)Co spectra have little impact on the spectra of \(^{137}\)Cs and \(^{54}\)Mn. Therefore, the spectra of \(^{54}\)Mn and \(^{137}\)Cs are fitted together with one function to account for the overlapping in the spectra, and the spectra of \(^{40}\)K and \(^{60}\)Co are fitted together with another function as shown in Fig. 4b. The natural abundance of \(^{40}\)K is about 0.012% [24] so that enriched \(^{40}\)K is used to reduce the volume and mass of the radioactive source.

3.2 Calibration source carried by CLS

CLS carries a \(^{137} {\textrm{Cs}}\) source of about 500 Bq. We can not use \(^{68}\)Ge at CLS because the released positron will pass through the Teflon thin coating, and the visible energy of gammas produced by positron annihilation is mixed with the visible energy of positron kinetic energy. The main branch of \(^{137} {\textrm{Cs}}\) decay is through beta decay to the excited state of barium, which then de-excites and emits a gamma of 0.662 MeV. The mean half-life of the excited \(^{137}{\textrm{Ba}}\) is about 2.55 min, and the end-point kinetic energy of the \(\beta \) ray is about 0.518 MeV, so the 0.662 MeV gamma ray can be distinguished from the \(\beta \) ray.

3.3 Cosmogenic \(^{12}\)B

Cosmogenic \(^{12}\)B events can be used in non-linearity calibration. The simulation shows that the muon rate in the TAO CD is about 330 Hz with the input of the measured muon rate in the experimental hall which is about 1/3 of the ground level. The energetic cosmic muons and their daughters can interact with \(^{12}\)C in GdLS to produce unstable isotopes like \(^{12}\)B [7]. The half-life of \(^{12}\)B is 20.2 ms. About 98.2% of them decays to the ground state of \(^{12}\)C by \(\beta \)-emission with a Q value of 13.4 MeV [25]. Therefore, the \(^{12}\)B events can provide a constraint on the electron non-linearity model, especially in the high energy range. A showering muon which produces neutrons in the detector has much larger probability to produce a \(^{12}\)B together than those muons without a neutron followed. To obtain a clean \(^{12}\)B sample, we first select muons with a neutron followed (neutron-tagged muons). The neutron-tagged muon rate is reduced to less than 1 Hz. Then we select \(^{12}\)B candidates within a (5 ms, 50 ms) time window after the muon [8]. The accidental backgrounds can be evaluated by the offset-window method and then be subtracted. About \(10^5\) \(^{12}\)B events can be selected in the FV in three years. About 2.8% \(^{12}\)N decay events are mixed into the energy spectrum of \(^{12}\)B. The impact on the physics non-linearity is less than 0.05% when \(^{12}\)N events are taken into consideration. For simplicity, \(^{12}\)N events are ignored in this study.

4 Non-uniformity calibration

When particles interact with the GdLS in the CD at different positions, the detector responses are different. This is referred to as the non-uniformity of the detector. The detector non-uniformity \(g(r,\theta ,\phi )\) is defined as the photo-electron yield at a given position relative to the photo-electron yield at the center of the detector, where r, \(\theta \), and \(\phi \) are the radius, polar angle and azimuthal angle of the given position in spherical coordinates, respectively. The origin of the spherical coordinate is at the center of the CD, and the zenith points upwards, as shown in Fig. 1. The non-uniformity of the detector degrades the energy resolution and should be understood well.

To understand how well the non-uniformity \(g(r,\theta ,\phi )\) can be calibrated, we first simulate the detector responses with 1 MeV electrons at given positions in the detector to extract the true non-uniformity map as the reference. Then we simulate the detector responses with gamma sources deployed at a limited number of positions and calibrate the non-uniformity. The location of the CLS anchors and selection of the calibration points are optimized by comparing the non-uniformity calibration and the reference. Once the non-uniformity \(g(r,\theta ,\phi )\) is obtained, the visible energy can be evaluated with

$$\begin{aligned} E^{\textrm{prompt}}_{\textrm{vis}}(r,\theta ,\phi ) = \frac{N_\textrm{PE}}{g(r,\theta ,\phi ) \cdot Y_{\textrm{0}}}. \end{aligned}$$
(3)

4.1 Ideal non-uniformity map

To obtain the reference non-uniformity \(g(r,\theta ,\phi )\), electrons with a kinetic energy of 1 MeV are simulated at 1364 vertices. These vertices are located on a grid with 20 mm spacing in the r-direction and 6\(^{\circ }\) in the \(\theta \)-direction. The electron is nearly a point source because it deposits energy in a volume within a radius of a few millimeters, which is much smaller than gamma. Since the detector is approximately symmetric rotationally, we assume \(g(r,\theta ,\phi ) \approx g(r,\theta ) \). The full non-uniformity map shown in Fig. 5a is obtained with the Clough–Tocher two-dimension interpolation [26,27,28]. We refer to this reference non-uniformity map as \(g_\textrm{ideal}(r,\theta )\).

Fig. 5
figure 5

a 3D surface of \(g_{\textrm{ideal}}(r,\theta )\) obtained by simulations of the electrons with a kinetic energy of 1 MeV at many vertices in the detector. b \(E_{\textrm{vis}}\) spectrum of the positrons with a kinetic energy of 3 MeV. The blue line is the \(E_{\textrm{vis}}\) spectrum of the positrons at the detector center. The orange line is the \(E_{\textrm{vis}}\) spectrum of the uniformly distributed positrons. the red line is the \(E_{\textrm{vis}}\) spectrum corrected with the non-uniformity \(g_{\textrm{ideal}}(r,\theta )\) for the uniformly distributed positrons

Simulation shows that the \(g_{\textrm{ideal}}(r,\theta )\) can correct the non-uniformity of IBD positron well. We simulate the IBD events uniformly distributed within the CD and a set of the events at the CD center, then reconstruct the uniformly distributed events with Eq. (3). Figure 5b is an example for positrons with a kinetic energy of 3 MeV. The difference between the reconstructed positron energy spectrum and the positron energy spectrum at the detector center is small. The relative difference between their central values and the relative energy resolution difference (\(\Delta \sigma /E\)) are less than 0.15% and 0.01%, respectively. This conclusion is applicable to the positrons in the kinetic energy range from 0 to 8 MeV.

4.2 Optimizing the finite-point uniformity calibration

In this section, we optimize the layout of the CLS system and select a set of deployment calibration points to obtain a good approximation of \(g_{\textrm{ideal}}(r,\theta )\). The optimization results also feed back to the calibration system design.

4.2.1 Optimize the anchor locations

As shown in Fig. 5a, \(g_{\textrm{ideal}}(r,\theta )\) is almost symmetric with respect to the plane of \(\theta =90^{\circ }\). To be more specific, \(|g_{\textrm{ideal}}(r,\theta ) - g_\textrm{ideal}(r,180^{\circ } - \theta )|\) \(/g_{\textrm{ideal}}(r,\theta )\) is less than \(0.2\% \) within the FV thanks to the symmetric arrangement of the SiPMs and the symmetry of the acrylic vessel. A minor asymmetry comes from the different sizes of two holes at two poles of the acrylic vessel. It is safe to assume \(g(r,\theta )\) \( = g(r,180^{\circ } - \theta )\) within a precision of 0.2%.

When the locations of the two anchors are fixed, we select very dense points \(\vec {P}_{\textrm{calib}}\) that can be reached by the calibration system and calculate \(g_{\textrm{ideal}}(\vec {P}_\textrm{calib})\). Then, the Clough–Tocher two-dimension interpolator is applied to obtain \(g_{\textrm{calib}}(r,\theta ;\) \( \theta _{1},\theta _{2},\phi _{2})\), where \(\theta _{1}\), \(\theta _{2}\) and \(\phi _{2}\) represent the locations of two anchors. Since we assume that the detector is rotationally symmetric with respect to the z-axis, we set \(\phi _{{1}} = 0\) where \(\phi _{{1}}\) is the azimuthal angle of one anchor. In order to optimize the locations of the anchors, a penalty function

$$\begin{aligned} L(\theta _{1},\theta _{2},\phi _{2}) = \int \limits _{S_{R}}(g_\textrm{ideal}(r,\theta ) - g_{\textrm{calib}}(r,\theta ; \theta _{1},\theta _{2},\phi _{2}))^2\cdot dV \nonumber \\ \end{aligned}$$
(4)

is defined to evaluate the difference between \(g_\textrm{ideal}(r,\theta )\) and \(g_{\textrm{calib}}(r,\theta ; \theta _{1},\theta _{2},\phi _{2})\), where \(S_{R}\) means a spherical volume whose radius is smaller than R. R is set to be 700mm to include the FV with a radius less than 650 mm. Besides, limitations of \(\theta _1 > 90^{\circ }\) and \(\theta _2 > \theta _1\) are added to allow the installation and positioning of the anchors to be more convenient. We obtain \(\theta _{1} \approx 102.5^{\circ }\)\(\theta _{2} \approx 155.2^{\circ }\)\(\phi _{2} \approx 151.7^{\circ }\) by minimizing \(L(\theta _1,\theta _2,\phi _2)\).

4.2.2 Determine suitable calibration points

Once the locations of the two anchors are determined, the track of the CLS cable is fixed. Only limited calibration points along the track of the CLS cable can be used, considering the limitation of the calibration time. The criterion to select suitable calibration points is to set more calibration points where the modulus of the gradient of \(g(r,\theta )\) is larger. For the ACU system, we set a calibration point every 100mm in the area with a radius less than or equal to 500mm, and set a calibration point every 50 mm in the area with a radius of 500 mm to 850 mm. For the CLS system, to begin with the starting point of the CLS cable (\(\vec P_0\)), we add a calibration point (\(\vec P_i\)) when conditions \(0.01 \le |g_{\textrm{ideal}}(\vec P_{i}) - g_\textrm{ideal}(\vec P_{i-1})|\) and 25 mm \(\le |\vec P_i - \vec P_{i-1}|\) are met at the same time, or when condition 50 mm \(\le |\vec P_i - \vec P_{i-1}|\) is met. The starting point \(\vec P_0\) is marked in Fig. 1. In total, 110 points for non-uniformity calibration are selected and shown as the solid points in Fig. 6a.

Fig. 6
figure 6

a \(g_{\gamma }(r,\theta )\) interpolated from the \(^{68}\textrm{Ge}\) and \(^{137}{\textrm{Cs}}\) calibration data. Solid circles are real calibration points. Hollow circles are the symmetry points of the solid circles with respect to \(\textrm{z}=0\) plane. The point at the detector center is shown as a line from \(\theta = 0^\circ \) to \(\theta = 180^\circ \) at \(\textrm{R}=0\) mm. b \(g_{\gamma }(r,\theta ) - g_\textrm{ideal}(r,\theta )\). The red line is the boundary of the FV

4.3 Verification of the calibration

We plan to use the \(^{68}\)Ge and the \(^{137}\)Cs on the ACU and the \(^{137}\)Cs on the CLS to perform the non-uniformity calibration. We can obtain the photo-electron yield relative to the center for a given calibration point and radioactive source on the ACU or CLS. For example, for a given calibration point on the CLS, we place the \(^{137}{\textrm{Cs}}\) there and obtain the photo-electron yield, then divide it by the photo-electron yield of \(^{137}{\textrm{Cs}}\) at the detector center. Then we use the Clough–Tocher two-dimension interpolation to obtain the \(g(r,\theta )\), as shown in Fig. 6a. Since this non-uniformity map is obtained using gamma sources, it is marked as \(g_{\gamma }(r,\theta )\). Near two poles of the acrylic vessel, the total absorption energy peaks can not be fitted due to the large energy loss effect. This results in missing values near the regions corresponding to the two poles of the acrylic vessel in Fig. 6a. The difference between \(g_{\gamma }(r,\theta )\) and \(g_{\textrm{ideal}}(r,\theta )\) is less than 0.01 in most areas, and the difference is larger near the two poles of the acrylic vessel, as shown in Fig. 6b. This difference prevents us from perfectly correcting the effects of non-uniformity, resulting in a residual non-uniformity. The residual non-uniformity, denoted as RN, is defined as

$$\begin{aligned} \textrm{RN} = \sqrt{\frac{1}{V_{\textrm{FV}}}\cdot \int _{\textrm{FV}} (g_{\gamma }(r,\theta ) - g_{\textrm{ideal}}(r,\theta ) - \overline{\Delta g})^{2}\cdot dV},\nonumber \\ \end{aligned}$$
(5)

where \(V_{\textrm{FV}}\) is the total volume of FV and

$$\begin{aligned} \overline{\Delta g} = \frac{1}{V_{\textrm{FV}}}\cdot \int _\textrm{FV}(g_{\gamma }(r,\theta ) - g_{\textrm{ideal}}(r,\theta ))\cdot dV. \end{aligned}$$
(6)

\(\overline{\Delta g}\) is about − 0.14% and will cause bias of reconstructed visible energy. RN leads to degradation in the resolution of reconstructed visible energy, but it is only about 0.2%, satisfying the requirement of less than 0.5%. Thus, the energy non-uniformity can be well calibrated with the radioactive sources on ACU and CLS.

4.4 Energy resolution

Fig. 7
figure 7

a Energy resolution of reconstructed IBD prompt events for detector central events, and uniformly distributed events corrected by different non-uniformity maps. For central events, the energy resolution is quite close to the result corrected by \(g_{\textrm{ideal}}\) causing some overlaps. b \(E_{\textrm{vis}}\) bias of reconstructed \(e^{+}\) spectrum. \(E^{\textrm{prompt}}_{\textrm{vis}}\) here means visible energy of central IBD prompt events. \(E^{\textrm{rec}}_\textrm{vis}\) is the visible energy of reconstructed IBD prompt events which are distributed uniformly within the FV. Vertex smearing here takes into account a 5 cm vertex resolution

In order to get the energy resolution of the TAO, IBD events are simulated at the center of the CD. For mono-energetic IBD events, we can calculate the standard deviation (\(\sigma \)) and mean of their visible energies (\(E^{\textrm{prompt}}_{\textrm{vis}}\)). The energy resolution of IBD events is defined as \(\sigma / E^{\textrm{prompt}}_{\textrm{vis}}\) and is shown in Fig. 7a.

Residual non-uniformity and electronics effects such as cross talk, dark noise, charge resolution can cause the energy resolution degradation (\(\Delta \sigma / E^{\textrm{prompt}}_{\textrm{vis}}\)). The energy resolution degradation due to electronics effects is less than 0.23% at 1 MeV, see Ref. [10] for details. To study the energy resolution degradation due to residual non-uniformity, Eq. (3) and \(g_{\gamma }(r,\theta )\) are used to reconstruct uniformly distributed IBD events. Taking the vertex reconstruction resolution into account, a 5 cm Gaussian vertex smearing is added to the true vertex of each simulated event. The reconstructed visible energy spectrum of the IBD events whose reconstructed vertices are within FV is compared with the visible energy spectrum of the central IBD events. It can be seen from Fig. 7a that the energy resolution degradation is less than 0.05%. Figure 7b shows that the bias of reconstructed energy is smaller than 0.3%.

5 Non-linearity calibration

In this section, we first introduce the non-linearity model. Then the systematic biases and uncertainties of the visible energy of these calibration sources are discussed one by one. Finally, we apply the model to fit the calibration data and obtain the calibration performance.

5.1 Model of physics non-linearity

The visible energy of the prompt event of the IBD reaction can be decomposed as

$$\begin{aligned} E_{\textrm{vis}}^{\textrm{prompt}} = E^{\textrm{e}}_{\textrm{vis}} + E^{\textrm{anni}}_{\textrm{vis}}, \end{aligned}$$
(7)

where \(E^{\textrm{e}}_{\textrm{vis}}\) is the visible energy associated with the positron kinetic energy and \(E^{\textrm{anni}}_{\textrm{vis}}\) is that of the annihilation gammas. \(E^{\textrm{e}}_{\textrm{vis}}\) should be equal to the visible energy of an electron with the same kinetic energy, thus can be calibrated with the \(\beta \) decay sources [8]. \(E^{\textrm{anni}}_{\textrm{vis}}\) can be calibrated with the \(^{68}\)Ge source. The physics non-linearity for an electron or a positron is defined by

$$\begin{aligned} f_{\textrm{nonlin}}^{\textrm{e}} = E^{\textrm{e}}_{\textrm{vis}}/E^{\textrm{e}}, \end{aligned}$$
(8)

where \(E^{e}\) is the true kinetic energy of the electron or positron.

The physics non-linearity is caused by ionization quenching and the emission of Cherenkov radiation.

Ionization quenching In general, when the particles deposit energy in the GdLS, the solvent molecules are excited, then the energy is transferred to the fluorescent molecules through dipole-dipole interactions [8]. The energy transfer efficiency is reduced, which leads to a non-linearity between the energy converted to scintillation photons (\(E_{\textrm{scint}}\)) and the kinetic energy of the electron or positron \(E^e\). This is known as the quenching effect and can be described by the empirical formula known as Birks’ law [29]:

$$\begin{aligned} E_{\textrm{scint}}(E^{\textrm{e}},k_{\textrm{B}}) = \int \limits _{0}^{E^{\textrm{e}}}\frac{dE}{1+k_{\textrm{B}}\cdot \frac{dE}{dx}}, \end{aligned}$$
(9)

where \(k_{\textrm{B}}\) is the Birks’ constant and dE/dx is the stopping power. dE/dx is obtained from an ESTAR calculation [30] using the TAO liquid scintillator properties.

Cherenkov radiation Cherenkov photons are produced if the phase velocity of light in the medium is less than the velocity of a charged particle [31]. We assume that the number of photo-electrons produced by the Cherenkov radiation is a function of \(E^{e}\). The function, namely \(f_{\textrm{C}}\), is obtained with the Geant4 simulations as shown in Fig. 8a. \(f_{\textrm{C}}\) is set to 1 at 1 MeV while the absolute contribution of the Cherenkov radiation can be determined by the calibration data.

Considering these two contributions, the total non-linearity for the energy deposited by an electron or a positron is defined as:

$$\begin{aligned} f_{\textrm{nonlin}}^{\textrm{e}}(E^{\textrm{e}};A,k_{\textrm{B}},k_{\textrm{C}}) = A\cdot \left( \frac{E_{\textrm{scint}}(E^{\textrm{e}},k_{\textrm{B}})}{E^e} + k_{\textrm{C}}\cdot \frac{f_{\textrm{C}}(E^{\textrm{e}})}{E^{\textrm{e}}}\right) , \nonumber \\ \end{aligned}$$
(10)

where A is a normalization factor, and \(k_{\textrm{C}}\) is the relative contribution of the Cherenkov light to the scintillation light, both to be determined from the fitting.

Fig. 8
figure 8

a Number of photo-electrons produced by the Cherenkov radiation. \(f_{\textrm{C}}\) is set to 1 at 1 MeV and the absolute contribution of the Cherenkov radiation can be determined by the calibration data. b The kinetic energy distributions of secondary electrons or positrons for gamma sources

A gamma particle deposits its energy in the LS via secondary electrons. The physics non-linearity of gamma can be written as

$$\begin{aligned} \begin{aligned} f_{\textrm{nonlin}}^{\gamma }&(E^{\textrm{e}};A,k_{\textrm{B}},k_{\textrm{C}}) \equiv \frac{E_{\textrm{vis}}^{\gamma }}{E^{\gamma }} \\&= \frac{\int \nolimits _0^{E_{\gamma }}P(E^{\textrm{e}})\cdot f_{\textrm{nonlin}}^{\textrm{e}}(E^{\textrm{e}};A,k_{\textrm{B}},k_{\textrm{C}})\cdot E^{\textrm{e}}\cdot dE^{\textrm{e}}}{\int \nolimits _0^{E_{\gamma }}P(E^{\textrm{e}})\cdot E^{\textrm{e}}\cdot dE^{\textrm{e}}}, \end{aligned} \nonumber \\ \end{aligned}$$
(11)

where \(P(E^{\textrm{e}})\) denotes the probability density function of a given gamma converting to secondary electrons or/and positrons of kinetic energy \(E^{\textrm{e}}\) via Compton scattering, photoelectric effect, or pair production [9]. \(P(E^{\textrm{e}})\) is determined with simulations as shown in Fig. 8a.

As for a continuous spectrum like \(^{12}\)B \(\beta \)-decay, we can also calculate the expected visible energy distribution as

$$\begin{aligned} P_{\textrm{v}}(E_{\textrm{vis}}) = \left( P_{\textrm{k}}(E^{\textrm{e}}(E_{\textrm{vis}})) \cdot \left| \frac{dE^{\textrm{e}}}{dE_{\textrm{vis}}} \right| \right) \otimes \textrm{Res}(E_{\textrm{vis}}), \end{aligned}$$
(12)

where \(P_{\textrm{v}}(E_{\textrm{vis}})\) and \(P_{\textrm{k}}(E^{\textrm{e}})\) mean the visible energy and the kinetic energy distributions of the continuous \(\beta \) spectrum respectively, and Res\((E^{\textrm{e}})\) is the energy resolution of the detector. \(E^{\textrm{e}}(E_{\textrm{vis}})\) can be calculated with Eqs. (8) and (10).

This physics non-linearity model is similar to the model used in the energy calibration of Daya Bay [8] which uses similar GdLS.

5.2 Systematic biases and uncertainties

In this section, we analyze the systematic biases of the visible energy of the calibration sources used in non-linearity calibration. The effects that lead to systematic biases are analyzed one by one. The approach used here is similar to that used in the Daya Bay [8] and JUNO [9] experiments.

5.2.1 Energy loss effect

Gamma particles may deposit some of their energy in non-scintillating materials such as the source enclosure, weights, etc. This results in a tail in the visible energy distribution and might cause a bias in the fit to the fully absorbed peak. For such gamma source, we can apply Eq. (2) to fit its visible energy spectrum as shown in Fig. 4a. The fitted \(\mu \) value is referred to as \(E_{\textrm{vis}}^{\textrm{fit}}\). For comparison, events with energy fully absorbed in the GdLS are selected with the Monte Carlo truth information and fitted with a Gaussian. The fitted mean is referred to as \(E_{\textrm{vis}}^{\textrm{ideal}}\). We define \((E_\textrm{vis}^{\textrm{fit}} - E_{\textrm{vis}}^{\textrm{ideal}})/E_{\textrm{vis}}^{\textrm{ideal}}\) as the fitting bias to measure the deviation caused by energy loss.

Fig. 9
figure 9

The fitting bias due to the energy loss effect

As shown in Fig. 9, the fitting bias is less than 0.2% for radioactive sources. The energy loss in the non-scintillating materials is less for higher energy gamma. However, the differences for the gamma sources used here are negligible. We assume that the fitting biases between these gammas are correlated.

5.2.2 Shadowing effect

Scintillation photons could be absorbed or reflected by the source enclosure or the weights. The impact on the calibration is referred to as “shadowing effect”. To reduce the absorption, the surfaces of the enclosure and the weights are covered by Polytetrafluoroethylene (PTFE) with high reflectivity [13]. In this study, we assume the reflectivity is 95%.

In order to decouple the shadowing effect from the energy loss effect, only events with energy fully deposited in the GdLS are selected in the simulations. The visible energy spectrum is modeled by a Gaussian function. The fitted mean of the Gaussian is marked as \(E_{\textrm{vis}}^{\textrm{shadow}}\). For comparison, we also simulate the naked radioactive sources without the enclosure and weights to avoid shadowing effect. The fitted mean is marked as \(E_{\textrm{vis}}^\textrm{nos}\). The bias caused by the shadowing effect is defined as \((E_{\textrm{vis}}^{\textrm{shadow}} - E_{\textrm{vis}}^{\textrm{nos}})/E_{\textrm{vis}}^\textrm{nos}\). As shown in Fig. 10, the shadowing bias of each radioactive source is smaller than 0.1%. We assume that the shadowing biases are correlated among different radioactive sources because they are caused by the similar enclosures and weights.

Fig. 10
figure 10

The shadowing bias due to the absorption of scintillation photons by the surfaces of the source enclosure and weights

5.2.3 \(^{16}\textrm{O}^{*}\) 6.13 MeV gamma bias

When the \(^{241}\textrm{Am}\)-\(^{13}\)C source releases a gamma of 6.13 MeV, it also releases a neutron of a kinetic energy of less than 100 keV [8]. While the neutron capture signal is delayed due to the neutron thermalization process thus separated from the 6.13 MeV gamma, the proton recoil caused by this neutron produces a small amount of scintillation light and overlaps with the gamma. The amount of the scintillation light relies on the neutron scattering process and varies event by event. This introduces about 0.4% bias to the calibration using the 6.13 MeV gamma.

5.2.4 Residual bias after non-uniformity correction

The detector response is not uniform. The vertexes of the \(\beta \) decays of the cosmogenic \(^{12}\)B distribute uniformly in the detector. When they are used for calibration, the detector non-uniformity must be corrected. Gammas coming from the calibration sources may not deposit their energy at the position of the sources since a gamma in the MeV energy region has an attenuation length of \(\sim \, 20\) cm in the liquid scintillator. And the energy is not deposited at a single point. Therefore, the detector non-uniformity also affects the visible energy spectra of gammas. The non-uniformity effect can be corrected but not perfectly. As described in Sect. 4.4, the residual bias after non-uniformity correction can be controlled within \(0.3\%\). It is conservatively taken as a fully correlated bias among different energies.

5.2.5 Instrumental non-linearity

The non-linearity caused by the SiPMs and the electronics readout can be neglected. The scintillation light is detected with 4024 tiles of SiPMs in the TAO CD. Each tile contains 32 SiPMs of \(12\times 6\) mm\(^2\) in dimension and is read out with two electronics channels. Each SiPMs contains 12800 Single Photon Avalanche Diodes (SPADs) of \(75\times 75\) \(\upmu \)m\(^2\) in dimension. There are \(1.65\times 10^9\) SPADs in total. They collect about 4500 photo-electrons per MeV for electrons. Therefore, the response of the SiPMs is linear for events in the MeV energy region since the photo-electron multiplicity on a SPAD is a small quantity. The electronics readout of the SiPMs has the good resolution to distinguish the photo-electron multiplicity for several photo-electrons. In average, each electronics channel reads 0.55 photo-electrons per MeV. Therefore, the electronics readout is expected to be linear.

5.2.6 Summary of systematic bias and uncertainty

The effects discussed above are summarized in Table 2. We handle the uncertainties as the calibration strategy in JUNO [9]. We assume that all the biases can be corrected, but with a 100% uncertainty. This means that the uncertainty is assumed to be 100% of the absolute value of the corresponding bias. The uncertainties are further separated into either “correlated” at different energies, or “single point”, depending on how they would move individual energy points up and down. For example, the uncertainties due to the shadowing effects are correlated among different sources. On the other hand, the bias due to 6.13 MeV gamma is independent from the others (single point). Different types of uncertainties have different impacts on the non-linearity fitting in the next section.

Table 2 A list of systematic biases and uncertainties. The uncertainties are assumed to be 100% of the absolute values of the biases. The last column indicates whether the biases and uncertainties are correlated between different sources or energies

5.3 Non-linearity fitting

As planned, a non-linearity calibration will be performed once a month. At each calibration time, the \(^{68}\)Ge source will be placed in the center of the detector for 100 s, and the combined source will be placed in the center of the detector for 10 h due to the low activity of the neutron source. Therefore, we simulated one set of calibration data to verify the effectiveness of the calibration strategy. Besides, \(10^5\) \(^{12}\)B events that can be collected in FV in three years are simulated. Equation (3) is used to reconstruct the visible energy of each event. \(^{12}\)B events below 3 MeV are not used due to high background contamination in this range. For simplicity, \(^{12}\)B events from 12 MeV to 14 MeV are also ignored because there are about 30% \(^{12}\)N events in this energy range [8].

Fig. 11
figure 11

a The best fit points of the radioactive gamma sources compared with the simulated data points. There are two fitting cases on the plot, namely with and without the \(^{12}\)B constraint. The fitted data points are shifted horizontally for visibility. b Best fit and simulated \(^{12}\)B spectra. The total number of \(^{12}\)B events is about \(10^5\), which can be collected in FV in 3 years of data taking

Fig. 12
figure 12

Electron non-linearity fitting results. \(E^{e}\) is the kinetic energy of the electron or positron. The black line is the true inherent non-linearity. The green line and band are the best fit and the \(68\%\) confidence interval without the constraint from the \(^{12}\)B spectrum. The red line and band are the best fit and the \(68\%\) confidence interval with the constraint from the \(^{12}\)B spectrum

To fit the detector energy non-linearity with the model shown in Eq. (11), a \(\chi ^{2}\) is defined as:

$$\begin{aligned} \chi ^{2} = \sum _{i=1}^{7}\left( \frac{M_i^{\gamma } - P_i^{\gamma }}{\sigma _i}\right) ^{2} + \sum _{j=1}^{90}\frac{\left( M_{j}^{^{12}\textrm{B}} - P_{j}^ {^{12}\textrm{B}}\right) ^{2}}{M_{j}^{^{12}\textrm{B}}}, \end{aligned}$$
(13)

where \(M_i^{\gamma }\) and \(P_i^{\gamma }\) are measured and predicted visible energy peaks of gamma sources, respectively. \(\sigma _i\) contains statistical and systematic uncertainties of \(M_i^{\gamma }\). The systematic uncertainties are listed in Table 2. In the energy range from 3 MeV to 12 MeV, the energy spectrum of \(^{12}\)B is equally divided into 90 bins. \(M_{j}^{^{12}\textrm{B}}\) and \(P_{j}^{^{12}\textrm{B}}\) are the numbers of \(^{12}\)B events measured and predicted with Eq. (12) in the j-th bin, respectively. In the regular calibration of each month, only the first term in Eq. (13) is used in the fitting, and when \(10^5\) \(^{12}\)B events are accumulated, the second item is included. Minimizing the \(\chi ^2\), we obtain the best fit values of the non-linearity model parameters A, \(k_{\textrm{B}}\) and \(k_{\textrm{C}}\) in Eq. (11). The fitting results are shown in Fig. 11. The difference between the fitted \(E^{\gamma }_{\textrm{vis}}\) and the simulated \(E^{\gamma }_{\textrm{vis}}\) is less than 0.2%.

To calculate the 68% confidence interval of the best fit non-linearity, we sample the visible energy of \(\gamma \)s and \(^{12}\)B \(\beta \) decays according to the statistical uncertainty and the systematic uncertainties listed in Table 2. For a correlated uncertainty, a random number that obeys the normal distribution is generated and is multiplied by the uncertainty to calculate the offsets for the visible energies of all relevant calibration sources. For the single point uncertainty, the data point is shifted independently. We repeat this procedure 5000 times and fit each set of data to get the electron non-linearity curves. Finally, we get the 68% confidence interval for the physics non-linearity as shown in Fig. 12.

For comparison, mono-energetic electrons are simulated at the CD center to obtain the true inherent non-linearity, as shown in Fig. 12. For the situation without the \(^{12}\)B data constraint, the best fit curve and the inherent non-linearity agree within 0.2% in the energy range from 0.5 MeV to 8 MeV. The uncertainty of the best fit curve is less than 0.6% in the same energy range, better than our requirement of less than 1% uncertainty. For the situation with the constraint of three years of \(^{12}\)B data, the uncertainty is less than 0.4%.

6 Conclusion

A calibration strategy for the TAO detector has been developed to understand its non-uniformity and physics non-linearity. The TAO detector contains two independent calibration systems called the ACU and CLS. The ACU is capable to carry two radioactive sources and one LED source and deploys one of them into the detector along the central vertical axis at each time, while the CLS is designed to carry a single radioactive source that can be deployed to off-axis positions. For non-uniformity, we utilize the ACU and CLS to deploy radioactive sources to 110 positions to study the detector response, then generate a map to correct the detector non-uniformity. After the correction, the residual non-uniformity is less than 0.2%. The energy resolution degradation and energy bias caused by the residual non-uniformity can be controlled within 0.05% and 0.3% respectively. For the physics non-linearity, we utilize several gamma sources with energies ranging from a few hundred keV to several MeV to control the uncertainty within 0.6% for electron or positron with kinetic energy greater than 0.5 MeV. \(^{\textrm{12}}\)B events can be used to reduce the uncertainty of physics non-linearity down to 0.4% assuming statistics collected in three years of data taking. With this calibration strategy, TAO is projected to measure high-precision reactor antineutrino energy spectrum.