Investigation of the energy dependence of the pT0 parameter in the Pythia 8 Monte Carlo event generator

By using predictions from the Pythia 8 Monte Carlo event generator, we determine the energy-dependent turn-over values (pT0) of the partonic cross section for the simulation of multiparton interactions. Since the observed energy dependence of the pT0 values is not well described by a power-law function, we introduce an additional energy-dependent term, to better describe the experimental observations in the energy range sqrt(s) = 0.3-13 TeV. We obtain a similar level of agreement for predictions using various parton densities.


I. INTRODUCTION
In order to simulate a hadron-hadron collision event, standard Monte Carlo (MC) event generators, such as pythia 8 [1], are based on a factorized ansatz, for which any hadron-hadron cross section can be written as a product of two non-perturbative processindependent parton density functions (PDF), one for each of the colliding protons, and a perturbative parton-parton cross section. The so-called underlying event (UE) represents the whole additional activity which occurs at lower scales accompanying the hard scattering, and consists of several components, such as initial-and final-state radiation (ISR and FSR, respectively), multiple parton interactions (MPI), and beam-beam remnants (BBR). All the coloured partons produced by these processes are finally rearranged into colourless hadrons, during the hadronization process. Due to all the different contributions, the resulting hadronic collision is a complex multiparticle process. Particularly relevant for the characterization of the UE are the MPI, which consist of numerous additional 2-to-2 partonparton interactions, occurring within the single collision event. Due to the large increase of the parton density at small longitudinal momentum fractions x, the MPI contribution increases with increasing collision energy.
The perturbative partonic cross section of a generic process with two incoming and two outgoing partons (the so-called 2→2 processes), as a function of the exchanged transverse momentum p T , can be expressed as: where α s is the strong coupling. By integrating the cross section over p 2 T , one obtains: which shows that the total 2→2 cross section tends to diverge at small values of exchanged transverse momentum. While the simulation of the hard scattering generally involves relatively large p T values (p T > 5 GeV), generated MPI processes might reach very low p T scales (p T ∼ 1 GeV) where the rapid increase of the partonic cross section becomes relevant and might lead to unphysical results. In order to tame the behaviour of the partonic cross section as a function of p T , the Pythia 8 event generator introduces a regularization, by shifting the value of p T by a quantity p T0 , leading to a formulation of the partonic cross section as follows: dσ dp 2 which after integration becomes:σ Such a cross section does not present any divergence for p T → 0 any longer. In the simulation, p T0 serves as a phenomenological parameter, which can not be obtained from any first principle, but must be determined from data. Many studies have been performed [2,3], in order to determine the values of p T0 which, as a function of the center-of-mass energy √ s, is generally between 1.2-2.5 GeV in the √ s range of 300-13000 GeV. The Pythia 8 MC event generator implements an energy dependence of the p T0 parameter, according to a power law of the form: where p T0 is the regularizator of the partonic cross section, which solves its divergent behaviour for p T → 0, p ref T0 is the p T0 at a reference energy √ s 0 , and the parameter determines the energy dependence. This formulation of the p T0 energy extrapolation follows the same energy dependence as the total hadronic cross section. No other options for the p T0 energy dependence are available in Pythia 8. Previous studies have shown the difficulty of describing measurements sensitive to the UE at various center-of-mass energies [2,3] in the range of 300-7000 GeV, measured at different colliders.
At a given center-of-mass energy, the amount of simulated MPI in Pythia 8 depends on the turn-over p T0 , the PDF, and the overlap of the matter distributions of the two colliding hadrons. The MPI processes produce a lot of coloured partons in the final state, creating a dense net of colour lines which spatially overlap with the fields produced by the partons of the hard scattering and with each other. All the generated colour lines may be connected between each other according to the so-called colour reconnection (CR). The CR mechanism implements the possibility for different colour strings to be reconnected and to exchange colour information.
In MC event generators, the PDF are a crucial ingredient for the simulation of both the hard scattering and the UE, as they parametrize the distributions of the partons inside each hadron which cannot be calculated analytically a priori. However, they can be extracted from fits to the data. These fits use analytical calculations of the hard scattering performed at a certain order of the strong coupling α S . From a fit with a leading-order (LO) calculation in α S , one obtains a LO PDF set, with next-to-leading (NLO) or next-to-next-to-leading (NNLO) order calculations, respectively a NLO or NNLO PDF set is obtained. One of the striking differences between LO PDF sets and NLO or NNLO ones is the gluon distribution at small values of longitudinal momentum fractions x and scales Q 2 , which is rather flat for NLO and NNLO PDF sets while tends to increase quickly for LO PDF sets at small x. Note that all PDF sets have a significant uncertainty in this region, and the "correct" behaviour of the gluon distribution at small x is not yet established. Furthermore, LO PDF can be directly interpreted as parton densities inside the proton and their behaviour can be easily related to measurable quantities, while this is not obvious for NLO or NNLO PDF sets. Because of these reasons, the usage of LO PDF sets for the UE simulation are generally preferred, but nothing prevents to achieve a good description of the measurements, if one uses NLO or NNLO PDF sets.
The goodness of the UE simulation provided by MC event generators and corresponding tunes can be tested by comparing predictions with available data. These data are generally measurements of the number of charged particles and their transverse momentum sum in different regions of the phase space relative to the direction of the hardest objects in the event. In particular, the hard object, which might be a jet, a charged particle or a Z boson, identifies a direction in the transverse plane. The transverse plane is then divided into four regions, according to their azimuthal angle: a "toward" and an "away" region sensitive to the hard scattering and its recoiling object, and two "transverse" regions, more sensitive to UE contributions. In recent measurements, the two transverse regions are further divided into separate measurements. The transverse region with the highest activity is called "transMAX" while the one with the smallest activity is labelled as "transMIN". The charged-particle multiplicity and the transverse momentum sum of the charged particles, measured as a function of the transverse momentum of the leading charged particle, are referred to as "UE observables" in the following.
In this document, by using UE observables at various collision energies, we determine separately the p T0 values, which best fit the measurements. We use predictions of the pythia 8.226 event generator produced with various PDF sets evaluated at different order in α S . After observing that the obtained p T0 values are not properly predicted by Eq. 5, we introduce a modification of the energy extrapolation by introducing an additional term, according to the formula: where the quantity c is a free energy-independent parameter 1 . The new proposed term of the energy dependence is an attempt of achieving better predictions of p T0 values. However, it does not significantly modify the structure of the power law already implemented in Pythia 8 and the additional term is expected to introduce a little correction with respect to Eq. 5.

II. DETERMINATION OF p T0 VALUES AT VARIOUS ENERGIES
In order to determine the best values of p T0 , fits to observables sensitive to contributions of MPI at soft and semi-hard scales are performed independently at various collision energies. The considered observables are the charged particle multiplicity and average p T sum densities as a function of the leading charged-particle transverse momentum, p max T in the transMIN and transMAX regions. Five different sets of measurements of these observables are considered at various collision energies: 300, 900 and 1960 GeV measured by the CDF experiment [4], 7000 GeV measured by the CMS experiment [5] and 13 TeV measured by the ATLAS experiment [6]. The region between 0.5 < p max T < 1 GeV and the region between 0.5 < p max T < 3 GeV are excluded by the fits performed, respectively, at √ s = 7 TeV and √ s = 13 TeV, since they are found to be affected by contributions of diffractive processes, whose free parameters are not considered in the tuning procedure.
Fits are performed for three different PDF sets released by the NNPDF31 collaboration [7]. They refer to the LO, NNPDF31 lo as 0130, the NLO, NNPDF31 nlo as 0118, and NNLO, NNPDF31 nnlo as 0118, sets. All fits use as baseline the hadronization parameters of the Monash tune [2]. Additionally, they use a range for colour reconnection probability equal to 2.17 and an overlap matter distribution modelled by a double gaussian function with radius and fraction matter in the core equal to, respectively, 0.43 and 0.46. 1 In a preliminary study, the quantity c was considered dependent on the collision energy, in a functional form such as c = √ s √ s0 a . After performing the fit to the data, it turned out that c is very weakly dependent on the energy, i.e. the quantity a is very close to 0. In order to eliminate any bias on the choice of the reference energy √ s 0 on c, the energy dependence was not further considered.
These values of parameters were obtained from preliminary tuning attempts using the LO NNPDF31 sets and they were used also for fits with the other PDF fits for consistency. The energy reference used for the extrapolation is set to the collision energy of the considered data points used in the fits. This translates into p T0 = p ref T0 in Eq. 5. This choice is important in order to reduce the number of fitted parameters at each energy and to eliminate any energy dependence in each single fit. The parameters used in the pythia 8 configuration are listed in Table I.  [9] software. About 30 different choices of the p T0 parameter are considered for building the set of anchor points in the one-dimensional parameter space. For each choice of parameters, two million events are generated, so that for each considered bin, the statistical uncertainty of the MC predictions is smaller than the uncertainty of the experimental data. It has been checked that the bin-by-bin envelopes of the different MC predictions encompass well the data points. After running the different predictions, Professor performs an interpolation of the bin values for the considered observables as a function of p T0 , according to a third-order polynomial function. We checked that the degree of the polynomial used for the interpolation does not influence the tune results. The obtained function f b (p) describes the MC response of each bin b as a function of the vector of the parameters p. The final step is the minimization of the χ 2 function given by the formula: where R b is the data value for each bin b and ∆ b expresses the total bin uncertainty of the data. The experimental uncertainties are assumed to be uncorrelated between data points. The minimization procedure gives in return the values of the parameters which are able to best describe the considered data.

III. RESULTS
The p T0 values obtained from the fits to the data at the various considered energies, as well as the value of the goodness of the fit, are shown in the Tab. II.
The different PDF sets show quite different values of p T0 . In particular, the LO PDF set requires more rapidly changing values as a function of energy, than NLO and NNLO PDF sets. This is the impact of the different behaviour of the gluon distribution at small x values, which are more relevant for higher collision energies. In order to reproduce the UE observables, a larger gluon density prefers a smaller amount of MPI contributions (which translates into a larger p T0 value), while larger MPI contributions, i.e. smaller p T0 values, are needed for smaller gluon densities. The p T0 values obtained for the NLO and NNLO PDF sets are very similar to each other. The fact that the gluon distribution at small x is quite flat for the NLO and NNLO PDF sets has the effect that the p T0 values are very weakly dependent on the energy. They range between 1.4 and 1.95 in the energy range of 0.3-13 TeV. The results show that for all measurements and considered PDF sets, we are able to obtain a χ 2 /Ndf value close to 1. The relative uncertainties obtained for the p T0 values are all of the order of 1-2%. The p T0 values constitute the input for the fits according to the functions in Eqs. 5 and 6. The results of the fitting procedure are shown in Figures 1 and 2 for the two functions, respectively. Table III summarizes the parameters of the two functions, as well as the goodness of fit divided by the number of degrees of freedom.
Both functions are able to follow the trend of the p T0 values in the considered energy range. Applying Eq. 5, the low-energy points (300 and 900 GeV), are not well described giving the relatively high χ 2 /Ndf values, which go up to 2.97 for the NNLO PDF set. By including the additional term in the energy dependence, the behaviour at low energy significantly improves and the value of χ 2 /Ndf decreases down to values close to 1.
It has been checked that the modified energy depedence interpolates well, i.e. it gives reliable predictions of p T0 values within the fitted range, for instance, at √ s = 2.76 TeV. By comparing Pythia 8 predictions obtained with the p T0 value at √ s = 2.76 TeV from Eq. 6 and the parameters of Table I, a very good level of agreement is obtained for the UE data measured by CMS at that energy [10]. The modified energy dependence function can also be reliably used for extrapolation of p T0 values at energies smaller than 300 GeV and higher than 13 TeV and constitutes a alternative to the default Pythia 8 energy extrapolation.   The new extracted energy dependence can be used to extrapolate p T0 values at high energies. In this Section, we show results for √ s = 100 TeV. Note that for the phase space relevant for such an energy, i.e. the gluon distribution at small x values, the current PDF sets are not constrained by any data but only extrapolated from measurements at lower energies. In Table IV, the p T0 values for the different PDF sets are listed as predicted by Eq. 5, referred to as "old fit", and by Eq. 6, referred to as "new fit". While a very small difference is observed between old and new fits for NLO and NNLO PDF sets, the two p T0 values for the LO PDF set differ between each other. This is due to the fact that tunes using  a NLO or a NNLO PDF set prefer a very weak p T0 energy dependence, in order to describe measurements at various collision energies. Instead, in tunes using a LO PDF set, one needs a more rapidly increasing p T0 as a function of energy, which is differently predicted by the old and the new fit.  Figure 3 shows predictions using the various PDF sets on the average charged-particle multiplicity and average charged-particle transverse momentum sum in the transMIN and transMAX regions, as a function of the transverse momentum of the leading charged particle (p max T ), at √ s = 100 TeV. While predictions obtained with the tunes based on NLO and NNLO PDF sets are very similar to each other, independently of the considered energy extrapolation, predictions from the LO tunes differ of up to 10% between each other. In particular, the new fit predicts a higher p T0 value, and consequently a lower activity in terms of number of charged particles and of transverse momentum. Predictions obtained with NLO and NNLO PDF sets are significantly lower than predictions obtained with LO PDF sets, of less than 10% if the new fit is used and up to 20% if the energy extrapolation is carried out through the old fit. By performing such measurements at a high collision energy, e.g. 100 TeV, one may be able to validate the performance of the energy extrapolation functions, considered in this document. LO 3. Predictions of the pythia 8 tunes at √ s = 100 TeV, obtained with the various PDF sets and the p T0 obtained from Eq. 5 or Eq. 6, are shown for average charged particle multiplcities (top plots) and average transverse momentum sum (bottom plots) in the transMIN and transMAX regions, as a function of the transverse momentum of the leading charged particle (p max T ). Curves labelled as "Old fit" refer to predictions using p T0 as predicted by Eq. 5, while curves labelled as "New fit" use the p T0 value as predicted by Eq. 6 Below each panel, the ratios of all predictions to the ones obtained with the LO PDF set and p T0 from Eq 5 are displayed.

V. SUMMARY AND CONCLUSIONS
The energy dependence of the p T0 parameter in the pythia 8 Monte Carlo event generator has been investigated. From observables sensitive to multiparton interactions at low and semi-hard scales at various collision energies, we find that the inclusion of an additional term to the simple power-law function implemented in pythia 8 significantly improves the description of p T0 values obtained at collision energies of 300 and 900 GeV, inferred by measurements performed at the CDF experiment. This conclusion holds for Pythia 8 predictions using parton distribution functions determined at leading, next-to-leading, or next-to-next-to-leading order in the strong coupling for the underlying event simulation. The additional term is found to be very similar for all parton densities. The modified energy dependence function can be reliably used for extrapolation of p T0 values at energies smaller than 300 GeV and higher than 13 TeV and constitutes a valuable alternative to the default Pythia 8 energy extrapolation.