1 Introduction

High-energy collisions at the large hadron collider (LHC) create a suitable environment for the production of light (anti-)nuclei. In ultra-relativistic heavy-ion collisions light (anti-)nuclei are abundantly produced [1,2,3], but in elementary pp collisions their production is lower [1, 4,5,6]. As a consequence, there are only few detailed measurements of (anti-)nuclei production rate in pp collisions. However, with the recently collected large data sample it is now possible to perform more differential measurements of light (anti-)nuclei production as a function of multiplicity and transverse momentum. In this paper, we present the detailed study of the multiplicity dependence of (anti-)deuteron production in pp collisions at \(\sqrt{s} =\) 13 TeV, the highest collision energy so far delivered at the LHC.

The production mechanism of light (anti-)nuclei in high-energy hadronic collisions is not completely understood. However, two groups of models have turned out to be particularly useful, namely statistical hadronisation models (SHM) and coalescence models. The SHMs, which assume particle production according to the thermal equilibrium expectation, have been very successful in explaining the yields of light (anti-)nuclei along with other hadrons in Pb–Pb collisions [7], suggesting a common chemical freeze-out temperature for light (anti-)nuclei and other hadron species. The ratio between the \(p_{\mathrm {T}}\)-integrated yields of deuterons and protons (d/p ratio) in Pb–Pb collisions remains constant as a function of centrality, but rises in pp and p–Pb collisions with increasing multiplicity, finally reaching the value observed in Pb–Pb [1, 8, 9]. The constant d/p ratio in Pb–Pb collisions as a function of centrality is consistent with thermal production, suggesting that the chemical freeze-out temperature in Pb–Pb collisions does not vary with centrality [10]. Assuming thermal production in pp collisions as well, the lower d/p ratio would indicate a lower freeze-out temperature [10]. On the other hand, the ratio between the \(p_{\mathrm {T}}\)-integrated yields of protons and pions (p/\(\pi \) ratio) does not show a significant difference between pp and Pb–Pb collisions [11, 12]. Also, for p–Pb collisions the freeze-out temperature obtained with SHMs using only light-flavoured particles is constant with multiplicity and its value is similar to that obtained in Pb–Pb collisions [13]. Thus, the increase of the d/p ratio with multiplicity for smaller systems cannot be explained within the scope of the grand-canonical SHM as is done in case of Pb–Pb. It is also not consistent with a simple SHM that the d/p and p/\(\pi \) ratios behave differently as a function of multiplicity even though numerator and denominator differ in both cases by one unit of baryon number. Nonetheless, a process similar to the canonical suppression of strange particles might be worth considering also for baryons. A recent calculation within the SHM approach with exact conservation of baryon number, electric charge, and strangeness focuses on this aspect [14].

In coalescence models (anti-)nuclei are formed by nucleons close in phase-space [15]. In this approach, the coalescence parameter \(B_2\) quantitatively describes the production of \(\text {(anti-)deuterons}\). \(B_2\) is defined as

$$\begin{aligned} B_{2}\left( p_{\mathrm {T}}^p\right)= & {} E_{d}\frac{\mathrm {d}^3N_{d}}{\mathrm {d}p_{d}^3}\bigg /\left( E_{p}\frac{\mathrm {d}^3N_{p}}{\mathrm {d}p_{p}^3}\right) ^2\nonumber \\= & {} \frac{1}{2\pi p_{\mathrm {T}}^d}\frac{\mathrm {d}^2N_{d}}{\mathrm {d}y\mathrm {d}p^{d}_{\mathrm {T}}} \; \bigg / \left( \frac{1}{2\pi p_{\mathrm {T}}^p}\frac{\mathrm {d}^2N_{p}}{\mathrm {d}y\mathrm {d}p_{\mathrm {T}} ^{p}}\right) ^2 , \end{aligned}$$
(1)

where E is the energy, p is the momentum, \(p_{\mathrm {T}}\)  is the transverse momentum and y is the rapidity. The labels p and d are used to denote properties related to protons and deuterons, respectively. The invariant spectra of the \(\text {(anti-)protons}\) are evaluated at half of the transverse momentum of the deuterons, so that \(p_{\mathrm {T}}^p = p_{\mathrm {T}}^d / 2\). Neutron spectra are assumed to be equivalent to proton spectra, since neutrons and protons belong to the same isospin doublet. Since the coalescence process is expected to occur at the late stage of the collision, the parameter \(B_2\) is related to the emission volume. In a simple coalescence approach, which describes the uncorrelated particle emission from a point-like source, \(B_2\) is expected to be independent of \(p_{\mathrm {T}}\) and multiplicity. However, it has been observed that \(B_2\) at a given transverse momentum decreases as a function of multiplicity, suggesting that the nuclear emission volume increases with multiplicity [2, 9, 16]. In Pb–Pb collisions the \(B_2\) parameter as a function of \(p_T\) shows an increasing trend, which is usually attributed to the position-momentum correlations caused by radial flow or hard scatterings [17, 18]. Such an increase of \(B_2\) as a function of \(p_{\mathrm {T}}\) has in fact also been observed in pp collisions at \(\sqrt{s}=7\) TeV  [6]. However, if pp collisions are studied in separate intervals of multiplicity, \(B_2\) is found to be almost constant as a function of \(p_{\mathrm {T}}\)  [8]. Similarly, \(B_2\) does not depend on \(p_{\mathrm {T}}\) in multiplicity selected p–Pb collisions [9]. Moreover, the highest multiplicities reached in pp collisions are comparable with those obtained in p–Pb collisions and not too far from peripheral Pb–Pb collisions. Therefore, the measure of \(B_2\) as a function of \(p_{\mathrm {T}}\) for finer multiplicity intervals in pp collisions at \(\sqrt{s} =\) 13 TeV gives the opportunity to compare different collision systems and to evaluate the dependence on the system size.

The paper is organized as follows. Section 2 discusses the details of the ALICE detector. Section 3 describes the data sample used for the analysis and the corresponding event and track selection criteria. Section 4 presents the data analysis steps in detail, such as raw yield extraction and various corrections, as well as the systematic uncertainty estimation. In Sect. 5, the results are presented and discussed. Finally, conclusions are given in Sect.  6.

2 The ALICE detector

A detailed description of the ALICE detectors can be found in [19] and references therein. For the present analysis the main sub-detectors used are the V0, the inner tracking system (ITS), the time projection chamber (TPC) and the time-of-flight (TOF), which are all located inside a 0.5 T solenoidal magnetic field.

The V0 detector [20] is formed by two arrays of scintillation counters placed around the beampipe on either side of the interaction point: one covering the pseudorapidity range \(2.8< \eta < 5.1\) (V0A) and the other one covering \(-3.7< \eta < -1.7\) (V0C). The collision multiplicity is estimated using the counts in the V0 detector, which is also used as trigger detector. More details will be given in Sect. 3.

The ITS [21], designed to provide high resolution track points in the proximity of the interaction region, is composed of three subsystems of silicon detectors placed around the interaction region with a cylindrical symmetry. The silicon pixel detector (SPD) is the subsystem closest to the beampipe and is made of two layers of pixel detectors. The third and the fourth layers consist of silicon drift detectors (SDD), while the outermost two layers are equipped with double-sided silicon strip detectors (SSD). The inner radius of the SPD, 3.9 cm, is essentially given by the radius of the beam pipe, while the inner field cage of the TPC limits the radial span of the entire ITS to be 43 cm. The ITS covers the pseudorapidity range \(|\eta |<0.9\) and it is hermetic in azimuth.

The same pseudorapidity range is covered by the TPC [22], which is the main tracking detector, consisting of a hollow cylinder whose axis coincides with the nominal beam axis. The active volume, filled with a Ne/CO\(_2\)/N\(_2\) gas mixture (Ar/CO\(_2\)/N\(_2\) in 2016), at atmospheric pressure, has an inner radius of about 85 cm, an outer radius of about 250 cm, and an overall length along the beam direction of 500 cm. The gas is ionised by charged particles traversing the detector and the ionisation electrons drift, under the influence of a constant electric field of \(\sim \) 400 V/cm, towards the endplates, where their position and arrival time are measured. The trajectory of a charged particle is estimated using up to 159 combined measurements (clusters) of drift times and radial positions of the ionisation electrons. The charged-particle tracks are then formed by combining the hits in the ITS and the reconstructed clusters in the TPC. The TPC is used for particle identification by measuring the specific energy loss (\(\mathrm {d} E/\mathrm {d} x\)) in the TPC gas.

Table 1 Summary of the relevant information about the multiplicity classes and the fits to the measured transverse momentum spectra of anti-deuterons. \(\langle \mathrm {d}N_{ch}/\mathrm {d}\eta \rangle \) is the mean pseudorapidity density of the primary charged particles [25]. n and C are the parameters of the Lévy–Tsallis fit function [27]. \(\mathrm {d}N/\mathrm {d}y\) is the integrated yield, with statistical uncertainties, multiplicity-uncorrelated and multiplicity-correlated systematic uncertainties (see the text for details). \(\langle p_{\mathrm {T}} \rangle \) is the mean transverse momentum

The TOF system [23] covers the full azimuth for the pseudorapidity interval \(|\eta |<0.9\). The detector is based on the multi-gap resistive plate chambers (MRPCs) technology and it is located, with a cylindrical symmetry, at an average distance of 380 cm from the beam axis. The particle identification is based on the difference between the measured time-of-flight and its expected value, computed for each mass hypothesis from track momentum and length. The overall resolution on the time-of-flight of particles is about 80 ps.

A precise starting signal for the TOF system can be also provided by the T0 detector, consisting of two arrays of Cherenkov counters, T0A and T0C, which cover the pseudorapidity regions \(4.61< \eta < 4.92\) and \(-3.28< \eta < -2.97\), respectively [24]. Alternatively, the start time can be provided by the TOF itself or the bunch-crossing time can be used, as described in [24].

3 Data sample

The data samples used in this work consist of approximately 950 million minimum bias pp events collected during the LHC proton runs in 2016 and 2017. The data were collected using a minimum bias trigger requiring at least one hit in both the V0 detectors. Moreover, the timing information of the V0 scintillators is used for the offline rejection of events triggered by interactions of the beam with the residual gas in the LHC vacuum pipe. To ensure the best possible performance of the detector, events with more than one reconstructed primary interaction vertex (pile-up events) were rejected.

The production of primary \(\text {(anti-)deuterons}\) is measured around mid-rapidity. In particular, the spectra are provided within a rapidity window of \(|y| < 0.5\). To ensure that all tracks have the maximal length, only those in the pseudorapidity interval \(|\eta | < 0.8\) are selected. In order to guarantee good track momentum and \(\mathrm {d} E/\mathrm {d} x\) resolution in the relevant \(p_{\mathrm {T}}\) ranges, the selected tracks are required to have at least 70 reconstructed points in the TPC and two points in the ITS. In addition, at least one of the ITS points has to be measured by the SPD in order to assure for the selected tracks a resolution better than 300 \(\mu \)m on the distance of closest approach to the primary vertex in the plane perpendicular (DCA\(_{xy}\)) and parallel (DCA\(_{z}\)) to the beam axis [19]. Furthermore, it is required that the \(\chi ^2\) per TPC reconstructed point is less than 4 and tracks originating from kink topologies of weak decays are rejected.

Data are divided into ten multiplicity classes, identified by a roman number from I to X, going from the highest to the lowest multiplicity. However, in this analysis classes IV and V are merged into a single class to achieve a better statistical precision. The multiplicity classes are determined from the sum of the V0 signal amplitudes and defined in terms of percentiles of the INEL\(>0\) pp cross section, where INEL > 0 events are defined as collisions with at least one charged particle in the pseudorapidity region \(|\eta |<\)1 [25]. The mean charged particle multiplicity \(\left\langle {\mathrm {d} N_{ch}/\mathrm {d} \eta } \right\rangle \) for each class is reported in Table 1.

4 Data analysis

4.1 Raw yield extraction

The identification of \(\text {(anti-)deuterons}\) is performed with two different methods, depending on their transverse momentum. For \(p_{\mathrm {T}} < \) 1 GeV/c, the identification is done using a measurement of the \(\mathrm {d} E/\mathrm {d} x\) in the TPC only. In particular, for each \(p_{\mathrm {T}}\) interval the number of \(\text {(anti-)deuterons}\) is extracted through a fit with a Gaussian with two exponential tails to the \(n_{\sigma }\) distribution. Here, \(n_{\sigma }\) is the difference between the measured TPC \(\mathrm {d} E/\mathrm {d} x\) and the expected one for \(\text {(anti-)deuterons}\) divided by the TPC \(\mathrm {d} E/\mathrm {d} x\) resolution. However, for \(p_{\mathrm {T}} \ge \) 1 GeV/c it is more difficult to separate \(\text {(anti-)deuterons}\) from other charged particles with this technique. Therefore, the particle identification in this kinematic region is performed using the TOF detector. The squared mass of the particle is computed as \(m^{2} = p^{2}\left( t_{\mathrm{TOF}}^2/L^2 - 1/c^2\right) \), where \(t_\mathrm{TOF}\) is the measured time-of-flight, L is the length of the track and p is the momentum of the particle. In order to reduce the background, only the candidates with a dE/dx measured in the TPC compatible within 3\(\sigma \) with the expected value for a (anti-)deuteron are selected. The squared-mass-distributions are fitted with a Gaussian function with an exponential tail for the signal. A significant background is present for \(p_{\mathrm {T}} \ge \) 1.8 GeV/c and is modelled with two exponential functions. In the range where the background is negligible, the raw yield is extracted by directly counting the candidates. Otherwise, the squared-mass distribution is fitted with the described model, using an extended-maximum-likelihood approach. The \(\text {(anti-)deuteron}\) yield is then obtained by a fit parameter.

4.2 Efficiency and acceptance correction

A correction for the tracking efficiency and the detector acceptance must be applied to obtain the real yield. The correction is evaluated from Monte Carlo (MC) simulated events. The events are generated using the standard generator PYTHIA8 (Monash 2013) [26]. However, PYTHIA8 does not handle the production of nuclei. Therefore, in each event it is necessary to inject \(\text {(anti-)deuterons}\). In each pp collision one deuteron or one anti-deuteron is injected, randomly chosen from a flat rapidity distribution in the range \(|y|<\) 1 and a flat \(p_{\mathrm {T}}\) distribution in the range \(p_{\mathrm {T}} \in [0,10] \) GeV/c. The correction is defined as the ratio between the number of reconstructed \(\text {(anti-)deuterons}\) in the rapidity range \(|y|<0.5\) and in the pseudorapidity interval \(|\eta |<0.8\) and the number of generated ones in \(|y|<0.5\). The correction is computed separately for deuterons and anti-deuterons and for the TPC and TOF analyses.

Another correction is related to the trigger efficiency. All the selected events are required to have at least one charged particle in the acceptance, i.e. in the pseudo-rapidity region \(|\eta |<1\) (INEL > 0) [25]. Due to the imperfection of the trigger, some INEL > 0 events are wrongly rejected (event loss). Consequently, all the \(\text {(anti-)deuterons}\) produced in the erroneously rejected events are lost as well (signal loss). Therefore, it is necessary to correct the spectra for the event and the signal losses. Event loss is more relevant at low multiplicity and almost negligible at high multiplicity (\(\sim \) 12% for multiplicity class X and < 1‰ for multiplicity class I). The corrections are computed from MC simulations, because both the number of rejected events and the number of \(\text {(anti-)deuterons}\) produced in those same events are known. However, it is not possible to count the number of lost \(\text {(anti-)deuterons}\) directly, because the artificial injection of one (anti-)deuteron per event will bias the number of lost candidates that can be extracted from this MC data set. Instead, the number of lost pions, kaons and protons are extracted from a different MC data set and then these values are extrapolated to the deuteron mass. The standard transport code used in ALICE simulations is GEANT3. However, it is known from other ALICE analyses on nuclei that GEANT4 provides a more realistic transport of (anti-)nuclei. The GEANT3 response is hence scaled to the GEANT4 one to take into account this effect. Moreover, the spectra obtained with TOF are further corrected to take into account the TPC-TOF matching efficiency using a data-driven approach. This correction was evaluated for the analysis of the (anti-)deuteron production in the p–Pb data sample collected in 2013 [9]. In that year not all the modules of the transition radiation detector (TRD), which is located between the TPC and the TOF, were already installed. In this way it was possible to compute the effects of the presence of the TRD, comparing the (anti-)deuteron yields in the regions where the TRD modules were present and in those where they were not yet installed. This correction was also verified with Run 2 data, by comparing the yields extracted with the TPC with those extracted with the TOF in the \(p_{\mathrm {T}}\) region where both the techniques can be used.

4.3 Subtraction of secondary deuterons

Secondary deuterons are produced in the interaction of particles with the detector material and their contribution must be subtracted from the total measured deuteron yield. However, the production of secondary anti-deuterons is extremely rare due to baryon number conservation. Hence, the correction is applied only to the deuteron spectra. The fraction of primary deuterons is evaluated via a fit to the DCA\(_{xy}\) distribution of the data, as described in [1]. The template for primary deuterons is obtained from the measured DCA\(_{xy}\) of anti-deuterons. The template from secondary deuterons is instead obtained from MC simulations. The production of secondary deuterons is more relevant at low \(p_{\mathrm {T}}\) (at \(p_{\mathrm {T}} = \) 0.7 GeV/c the fraction of secondary deuterons is \(\sim \) 40%) and decreases exponentially with the transverse momentum (< 5% for \(p_{\mathrm {T}} = \) 1.4 GeV/c). The only other possible contribution to secondary deuterons that is known is the decay \(^{3}_{\Lambda }\mathrm H \rightarrow \mathrm {d} + \mathrm {p} +\pi \). However, \(^{3}_{\Lambda }\mathrm H\) production has not yet been observed in pp collisions and its production yield is therefore lower than that of \(^3\)He, which is less than a thousandth of the deuteron production rate [6].

4.4 Systematic uncertainties

A list of all the sources of systematic uncertainty is shown in Table 2. The values are reported for the multiplicity classes I and X, for the lowest and highest \(p_{\mathrm {T}}\) values.

Table 2 Summary of the main contributions to the systematic uncertainties for the extreme multiplicity classes I and X. Values in brackets are referred to anti-deuterons. If they are not present, the systematic uncertainty is common for deuterons and anti-deuterons. More details about the sources of the uncertainties can be found in the text

The track selection criteria are a source of systematic uncertainty. In this category we include all the contributions related to the single-track selection: DCA, number of clusters in the TPC and, for the TOF analysis, the width of the \(\mathrm {d} E/\mathrm {d} x\) selection applied in the TPC. These uncertainties are evaluated by varying the relevant selections, as done in [8]. At low \(p_{\mathrm {T}}\) (\(p_{\mathrm {T}} < 1\) GeV/c) the contribution is 2% for deuterons due to the DCA\(_z\) and DCA\(_{xy}\) selections, which influence the estimation of the fraction of primary deuterons, while for anti-deuterons this systematic uncertainty is around 1%. It increases with \(p_{\mathrm {T}}\) and the growth is more pronounced for low multiplicity. The systematic uncertainty on the signal extraction is evaluated by directly counting the (anti-)deuteron candidates. It is obtained by varying the interval in which the direct counting is performed. Its contribution is \(\sim \) 1% at low \(p_{\mathrm {T}}\) and increases with \(p_{\mathrm {T}}\). Another source of systematic uncertainty is given by the incomplete knowledge of the material budget of the detector in the Monte Carlo simulations. The effect is evaluated by comparing different MC simulations in which the material budget was increased and decreased by 4.5%. This value corresponds to the uncertainty on the determination of the material budget by measuring photon conversions. This particular systematic uncertainty is below 1%. The imperfect knowledge of the hadronic interaction cross section of \(\text {(anti-)deuterons}\) with the material contributes to the systematic uncertainty as well. Its effect is evaluated with the same data-driven approach used to investigate the TOF-matching efficiency, as described in Sect. 4.2. Half of the correction, corresponding to the 1\(\sigma \) confidence interval, is taken as its uncertainty contributing 4% to the systematic uncertainty for deuterons and 7.5% for anti-deuterons. Similarly, an uncertainty related to the ITS-TPC matching is considered. It is evaluated from the difference between the ITS-TPC matching efficiencies in data and MC and its contribution is less than 2.5%. Finally, a source of systematic uncertainties results from the signal loss correction. It is assumed to be half of the difference between the signal-loss correction (described in Sect. 4.2) and 1. It is strongly dependent on the event multiplicity: it is negligible at high multiplicity (multiplicity classes from I to VII) and contributes up to 6% in the lowest multiplicity class (class X). Where present, it decreases with \(p_{\mathrm {T}}\).

Fig. 1
figure 1

Transverse-momentum spectra of deuterons (top) and anti-deuterons (bottom) measured in pp collisions at \(\sqrt{s}=13~\hbox {TeV}\) in different multiplicity classes (circles) and in INEL>0 events (squares). The mean charged-particle multiplicity for classes I and X are reported in the figures and all the values for the multiplicity classes can be found in Table 1. For the analyses in multiplicity classes, the multiplicity increases moving from the bottom of the figure upwards. The statistical uncertainties are represented by vertical bars while the systematic uncertainties are represented by boxes. The dashed lines are individual fits with a Lévy–Tsallis function [27]

5 Results and discussion

The transverse momentum spectra of deuterons and anti-deuterons in different multiplicity classes as well as INEL>0 pp collisions are reported in Fig. 2. The spectra normalised to inelastic pp collisions (INEL) are included in the data provided with this paper. The mean charged-particle multiplicity \(\left\langle {\mathrm {d} N_{ch}/\mathrm {d} \eta } \right\rangle \) for each class is reported in Table 1. The spectra exhibit a slight hardening with increasing multiplicity: the slope of the spectra becomes less steep and the mean transverse momentum \(\left\langle p_{{\mathrm {T}}}\right\rangle \) moves towards higher values. This effect is similar to that observed in Pb–Pb collisions, where it is explained with the presence of increasing radial flow with centrality [1, 28]. However, in pp collisions the intensity of the hardening is not as dramatic. The ratio between the spectra of anti-deuterons and deuterons for all the multiplicity classes under study is reported in Fig. 2. The ratio is compatible within uncertainties with unity in all multiplicity classes.

To calculate the integrated yield (\(\mathrm {d} N/\mathrm {d} y\)) and the mean \(p_{\mathrm {T}}\) the spectra have been fitted with the Lévy–Tsallis function [27, 29, 30]:

$$\begin{aligned} \frac{\mathrm {d}^2N}{\mathrm {d}y \, \mathrm {d}p_{\mathrm {T}}} = \frac{\mathrm {d}N}{\mathrm {d}y}\frac{p_{\mathrm {T}}\left( n-1\right) \left( n-2\right) }{nC [nC + m\left( n-2\right) ]}\left( 1+\frac{m_{\mathrm {T}}-m}{nC}\right) ^{-n}, \end{aligned}$$
(2)

where m is the particle rest mass (i.e. the mass of the deuteron), \(m_{\mathrm {T}}=\sqrt{m^2+p_{\mathrm {T}}^2}\) is the transverse mass, while n, \(\mathrm {d} N/\mathrm {d} y\) and C are free fit parameters. The Lévy–Tsallis function is used to extrapolate the spectra in the unmeasured regions of \(p_{\mathrm {T}}\). One contribution to the systematic uncertainty is obtained by shifting the data points to the upper border of their systematic uncertainty and to the corresponding lower border. The difference between these values and the reference one is taken as an uncertainty which amounts to \(\sim \) 11%. Another contribution to the systematic uncertainty is estimated by using alternative fit functions such as simple exponentials depending on \(p_{\m