1 Introduction

Monte Carlo (MC) simulation codes describe hadron-hadron collisions with models based on several components. The hard scattering component of the event consists of particles from the hadronization of partons whose kinematics are predicted using perturbative matrix elements (MEs), along with partons from initial-state radiation (ISR) and final-state radiation (FSR) that are simulated using a showering algorithm. The underlying event (UE) consists of the beam-beam remnants (BBR) and the particles that arise from multiple-parton interactions (MPI). The BBR are what remains after a parton is scattered out of each of the two initial beam hadrons. The MPI are additional soft or semi-hard parton–parton scatterings that occur within the same hadron–hadron collision. Generally, observables sensitive to the UE also receive contributions from the hard-scattering components. Accurately describing observables that are sensitive to the UE not only requires a good description of BBR and MPI, but also a good modeling of hadronization, ISR, and FSR. Standard MC event generators, such as pythia8 [1], herwig [2, 3], and sherpa [4] have adjustable parameters to control the behavior of their event modeling. A set of these parameters, which has been adjusted to better fit some aspects of the data, is referred to as a tune.

In a previous study [5], we presented several pythia8 and herwig++ UE tunes constructed for a center-of-mass energy \(\sqrt{s}\) lower than 13\(\,\text {TeV}\). The CMS pythia8 tune CUETP8M1 is based on the Monash tune [6], both using the NNPDF2.3LO parton distribution function (PDF) set [7]. The CMS pythia8 tune CUETP8S1-CTEQ6L1 is based instead on the tune 4C [8]. Both tunes CUETP8M1 and CUETP8S1-CTEQ6L1 were constructed by fitting the CDF UE data at \(\sqrt{s}=900\,\text {GeV} \) and 1.96\(\,\text {TeV}\) [9] together with CMS UE data at \(\sqrt{s}=7\,\text {TeV} \) [10]. A similar procedure was used for the determination of the herwig++ tune (CUETHppS1) with the CTEQ6L1 PDF set [11]. A collection of previously published tunes is documented in [6, 8, 12, 12,13,14,15].

In this paper, a new set of tunes for the UE simulation in the pythia8 (version 8.226) event generator is obtained by fitting various measurements sensitive to soft and semi-hard MPI at different hadron collision energies [9, 10], including data from \(\sqrt{s}=13\,\text {TeV} \) [16]. These tunes are constructed with the leading order (LO), next-to-leading order (NLO), and next-to-next-to-leading order (NNLO) versions of the NNPDF3.1 PDF set [17] for the simulation of all UE components. Typically, the values of strong coupling used for the simulation of the hard scattering are chosen consistent with the order of the PDF set used.

The new tunes are obtained by fitting CDF UE data at \(\sqrt{s}=1.96\,\text {TeV} \) [9], together with CMS UE data at \(\sqrt{s}=7\,\text {TeV} \) [10] and at 13\(\,\text {TeV}\) [16, 18]. For the first time, we show that predictions obtained with tunes based on higher-order PDF sets are able to give a reliable description of minimum-bias (MB) and UE measurements with a similar level of agreement to predictions from tunes using LO PDF sets. We also compare the predictions for multijet, Drell–Yan, and top-antiquark (\(\mathrm{t}\overline{\mathrm{t}}\)) processes from pythia8 with new tunes in ME-parton shower (PS) merged configurations.

In Sect. 2 we describe observables that are sensitive to MB and UE: diffractive processes [19], where one or both protons remain intact after the collision; and double-parton scattering (DPS), where two hard scatterings occur within the same collision. In Sect. 3, we compare the tunes that were constructed before the data at \(\sqrt{s}=13\,\text {TeV} \) were available (“Pre-13\(\,\text {TeV}\) ” tunes) with UE data measured at 13\(\,\text {TeV}\). Section 4 is dedicated to a general discussion of the choice of PDF sets and strong coupling values for the UE simulation. In Sect. 5 we describe the new tunes. Section 6 shows the validation of the new CMS pythia8 tunes for multijet, Drell–Yan, \(\mathrm{t}\overline{\mathrm{t}}\), and DPS processes. Section 7 is the summary and conclusions.

2 Observables for characterizing minimum bias, underlying event, and double-parton scattering

Minimum bias is a generic term that refers to inelastic events that are collected with a loose event selection that has the smallest bias possible. The MB observables are constructed from data with little or no additional selection requirements. The majority of MB collisions are soft, with a typical transverse momentum scale \(p_{\mathrm {T}} \lesssim 2\,\text {GeV} \). The UE is defined as the activity that is not associated with the particles originating from the hard scattering of two partons and is generally studied in events that contain a hard scattering with \(p_{\mathrm {T}} \gtrsim 2\,\text {GeV} \). The main contribution to the UE comes from color exchanges between the beam partons and is modeled in terms of MPI, BBR, and color reconnection (CR). The MB and UE observables have quite different kinematic properties because they are affected by different mixtures of hard and soft scattering processes.

As illustrated in Fig. 1, one can use the topological structure of a typical hard hadron-hadron collision to study the UE experimentally. On an event-by-event basis, a leading object is used to define regions of \(\eta \text {-}\phi \) space that are sensitive to the modeling of the UE, where \(\eta \) is the pseudorapidity and \(\phi \) is the azimuthal scattering angle defined in the xy plane. The azimuthal separation between charged particles and the leading object, \(\varDelta \phi \ = \phi - \phi _{\text {max}}\), is used to define the UE-sensitive regions. Here \(\phi _{\text {max}}\) is the azimuth of the leading object and \(\phi \) is the azimuth angle of an outgoing charged particle. The regions are labelled as ‘toward’ (\(|\varDelta \phi |\le 60^{\circ }\)), ‘away’ (\(|\varDelta \phi |>120^{\circ }\)), and ‘transverse’ (\(60^{\circ }<|\varDelta \phi |\le 120^{\circ }\)). The transverse region can further be separated into transMAX and transMIN. On an event-by-event basis, transMAX (transMIN) is defined as the transverse region having the maximum (minimum) of either the number of charged particles, or scalar \(p_{\mathrm {T}}\) sum of charged particles (\(p_{\mathrm {T}} ^\text {sum}\)), depending on the quantity under study.

Fig. 1
figure 1

Illustration of several \(\phi \) regions relative to the leading object that are sensitive to the underlying event. See the text for the details on the definitions of the regions

Published UE studies used the charged-particle jet with the largest \(p_{\mathrm {T}}\) [16], the dilepton system in DY [20, 21], or \(\mathrm{t}\overline{\mathrm{t}}\) [22] events as the leading (i.e., the highest \(p_{\mathrm {T}}\)) objects. The tunes from CDF and CMS data [9, 10] made use of the charged particle with the largest \(p_{\mathrm {T}}\) (\(p_{\mathrm {T}} ^\text {max}\)) as the “leading object”, and use only charged particles with \(p_{\mathrm {T}} > 0.5 \,\text {GeV} \) and \(|\eta | < 0.8\) to characterize the UE. The toward region contains the leading object, and the away region is expected to include the object recoiling against the leading one. Most of the UE contributions, i.e., PS and MPI, are contained in the two transverse regions. For events with multiple ISR or FSR emissions, transMAX often contains a third hard jet, while both transMAX and transMIN receive contributions from the MPI and BBR components. Typically, the transMIN observables are more sensitive to the MPI and BBR components of the UE.

Observables sensitive to UE contributions are the charged-particle multiplicity and the charged-particle scalar-\(p_{\mathrm {T}}\) sum densities in the \(\eta \text {-}\phi \) space, measured in transMIN and transMAX. The tunes that are constructed by fitting such UE-sensitive observables are referred to as “UE tunes”.

The pythia8 MC event generator also simulates single-diffractive (SD) dissociation, double-diffractive (DD) dissociation, central-diffractive (CD), and nondiffractive (ND) processes [23], which contribute to the inelastic cross section in hadron-hadron collisions. In SD, CD, and DD events, one or both of the beam particles are excited into color singlet states, which then decay. The SD and DD processes correspond to color singlet exchanges between the beam hadrons, while CD corresponds to double color singlet exchange with a diffractive system produced centrally. For ND processes, color exchanges occur, the outgoing remnants are no longer color singlets, and a multitude of particles is produced. All processes except SD are defined as nonSD (NSD) processes. An NSD-enhanced sample is required to have an energy deposit in both the backward (\(-5< \eta < -3\)) and the forward (\(3< \eta < 5\)) regions of the detector. The details of the selection for different types of diffractive events can be found in Ref. [24].

Generally, MC models such as pythia8 regularize the contributions of the primary hard-scattering processes and MPI to the differential cross section by using a threshold parameter \(p_{\mathrm {T}} ^0\). The primary hard-scattering processes and the MPI are regularized in the same way with this parameter. This threshold is expected to have a dependence on the center-of-mass energy of the hadron-hadron collision, \(\sqrt{s}\). The threshold at a reference center-of-mass energy \(\sqrt{s}=7\,\text {TeV} \) is called pT0Ref. In pythia8 the energy dependence is parameterized using a power law function with a reference energy parameter \(s_0\) and an exponent \(\epsilon \). At a given center-of-mass energy, the amount of MPI depends on the threshold \(p_{\mathrm {T}} ^0\), the PDF, and the overlap of the matter distributions of the two colliding hadrons. Smaller values of \(p_{\mathrm {T}} ^0\) result in larger MPI contributions because of a higher MPI cross section. Each MPI adds colored partons to the final state, creating a dense net of color lines that spatially overlap with the fields produced by the partons of the hard scattering and with each other. All the generated color lines may connect to each other according to the CR model.

Since pythia8 regularizes both the cross section for MPI and the cross section of collisions with low-\(p_{\mathrm {T}}\) exchange using the \(p_{\mathrm {T}} ^0\) parameter, one can model the overall ND cross section by letting the \(p_{\mathrm {T}}\)  of the primary hard scattering become small. In this simple approach, the UE in a hard-scattering process is related to MB collisions. At the same center-of-mass energy, the activity in the UE of a hard-scattering process is greater than that of an average MB collision. In pythia8, this is caused by the higher MPI activity in hard-scattering processes compared to a typical MB collision. By demanding a hard scattering, one forces the collision to be more central, i.e., with a small impact parameter between the protons, and this increases the probability of MPI. For MB collisions, peripheral collisions, where the impact parameter between the two colliding protons is large, are most common.

Typically MPI interactions contain particles with substantially lower \(p_{\mathrm {T}}\) (“softer”). However, occasionally two hard 2-to-2 parton scatterings can occur within the same hadron-hadron collision. This is referred to as DPS. Tunes that are constructed by fitting DPS-sensitive observables are referred to as “DPS tunes”. Ultimately, one universal tune that simultaneously accurately describes observables in hard scattering events, as well as MB collisions, is desirable.

The goals of this paper are to produce improved 13\(\,\text {TeV}\) pythia8 tunes with well-motivated parameters, and to provide an investigation of the possible choices that can be made in pythia8 which simultaneously describe a wide range of UE and MB measurements and are suitable for merged configurations, where a ME calculation is interfaced to the simulation of UE contributions.

3 Comparisons of predictions for UE observables from previous tunes to measurements at 13\(\,\text {TeV}\)

In this section, comparisons are presented between data collected at \(\sqrt{s}=13\,\text {TeV} \) and predictions from tunes obtained using fits to measurements performed at lower center-of-mass energies. Figure 2 displays comparisons of CMS data at 13\(\,\text {TeV}\) [16] for the transMIN and transMAX charged-particle \(p_{\mathrm {T}} ^\text {sum}\) densities, as functions of the leading charged-particle \(p_{\mathrm {T}} ^\text {max}\). The data are compared with predictions from the pythia8 tunes CUETP8S1-CTEQ6L1 [5], CUETP8M1 [5], and Monash [6].

The CMS Monash-based tune CUETP8M1 does not describe the central values of the data at \(\sqrt{s}=13\,\text {TeV} \) well, nor does the original Monash tune. For example, CUETP8M1 and Monash tunes do not predict enough UE activity in the region with \(p_{\mathrm {T}} ^\text {max}> 5\,\text {GeV} \) (the “plateau” region) of transMIN at 13\(\,\text {TeV}\), with a disagreement of \(\approx \)10% and \(\approx \)5%, respectively. The transMIN observables are very sensitive to MPI, which suggests that tune CUETP8M1 does not produce enough charged particles at 13\(\,\text {TeV}\). In addition, CUETP8M1 does not provide a good fit to the jet multiplicity in \(\mathrm{t}\overline{\mathrm{t}}\) production either at 8\(\,\text {TeV}\) or at 13\(\,\text {TeV}\) [25, 26]. High jet multiplicity \(\mathrm{t}\overline{\mathrm{t}}\) events are sensitive to the modeling of the ISR. Hence, CUETP8M1 may not have the proper mixture of MPI and ISR. The ATLAS collaboration has also observed some discrepancies between the predictions of the A14 tune [12], used as standard tune for analyses of 7 and 8\(\,\text {TeV}\) data, and the data at 13\(\,\text {TeV}\) [27].

The CMS UE tunes were constructed by fitting CDF UE data at \(\sqrt{s}=900\,\text {GeV} \) and 1.96\(\,\text {TeV}\), together with CMS UE data at \(\sqrt{s}=7\,\text {TeV} \). In Fig. 2 the CMS UE tunes provide a fairly good description of the 13\(\,\text {TeV}\) UE data. Because the CMS UE tunes were obtained by fitting UE observables at various collision energies (\(\sqrt{s}=900\), 1960, and 7000\(\,\text {GeV}\)), they underestimate the data at \(\sqrt{s}=13\,\text {TeV} \). This might be an indication of the need to improve the energy extrapolation function implemented in pythia8 [28]. Predictions obtained with the Monash tune, which is the default pythia8 tune, slightly better reproduce the 13\(\,\text {TeV}\) UE data, but is somewhat worse at describing the UE observables at \(\sqrt{s}=900\) and 1960\(\,\text {GeV}\) than the CMS UE tunes.

Predictions from the herwig7.1 tune UE-MMHT [3] are also shown. The H7-UE-MMHT tune was obtained by fitting UE data at \(\sqrt{s}=0.9\) and 7\(\,\text {TeV}\). This tune is based on the MMHT2014 PDF set [29] and is able to describe the plateau region of the UE observables at \(\sqrt{s}=13\,\text {TeV} \). The part of the spectrum at \(p_{\mathrm {T}} ^\text {max}> 5\,\text {GeV} \) is not well reproduced in the range of the leading charged-particle \(p_{\mathrm {T}} ^\text {max}\) between 2 and 7\(\,\text {GeV}\), with differences of up to 30% with respect to the data. The predictions from herwig7 achieve an overall good agreement with measurements at \(\sqrt{s}=7\,\text {TeV} \) [30], while the disagreement observed for measurements at \(\sqrt{s}=13\,\text {TeV} \) might indicate the need for further tuning of the new soft MPI model [30]. Since many parameters related to PS changed between herwig++ and herwig7, the CMS tunes extracted for herwig++ with the CTEQ6L1 PDF set and documented in Ref. [5] are not updated and should not be used with herwig7.

Since no currently available tune is able to optimally reproduce the UE data at \(\sqrt{s}=13\,\text {TeV} \), we aim to produce improved pythia8 UE tunes.

Fig. 2
figure 2

The (left column) transMIN and (right column) transMAX  charged-particle \(p_{\mathrm {T}} ^\text {sum}\) (upper row), and multiplicity (lower row) densities for particles with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta | < 2.0\), as a function of the transverse momentum of the leading charged particle (\(p_{\mathrm {T}} ^\text {max}\)), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [16]. The data are compared with the pythia8 tune Monash, the CMS pythia8 tunes CUETP8S1-CTEQ6L1 and CUETP8M1, and the herwig7 (labelled as “H7”) tune UE-MMHT. The ratios of the simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

4 PDF and strong coupling values for the tunes

Two of the basic input parameters to the predictions are the choice of the order of the PDF sets and values of the strong coupling \(\alpha _S \). These appear in the hard partonic MEs, the PS model, and the MPI model. The \(\alpha _S \) values used in simulations at LO or NLO are typically different. Traditionally, the perturbative order of the PDF is matched to the order of the ME calculation. Merged calculations capture some higher-order corrections with respect to the formal order of the ME calculation. Merging schemes, such as the \(k_{\mathrm {T}}\)–MLM [31] or CKKW [32, 33], allow the combination of predictions of jet production using ME calculations with those from PS emissions for soft and collinear parton radiation at leading-log accuracy without double counting or dead regions. Merging can be applied also for processes generated at NLO. Using the same PDF set and \(\alpha _S \) value in the ME calculations and in the simulation of the various components of the PS is advocated in Ref. [34], and by the herwig7 and sherpa Collaborations, especially when the PS simulation is merged with calculations of higher-order MEs. The PDF used for the hard process is constrained by the accuracy of the ME calculation. If we require the PDF to match between the ME and PS, simulations with a (N)NLO ME will also require a (N)NLO PDF in the PS. Depending on the process, this may not have a significant effect. For PS MC event generators, different strategies are adopted; CMS [5] and ATLAS [12] tunes are traditionally based on LO PDFs, pythia8 [6] tunes are mostly based on LO PDFs, new sherpa [4] tunes are based on NNLO PDFs, and herwig7 [30] provide tunes based on NLO PDFs. The usage of a LO PDF set in the UE simulation is motivated by the fact that MPI processes occur at very low energy scales, where a physical (positive) gluon distribution is required by the parton shower. However, there is no consensus on the choice of the order of the PDF. For example, in the NNPDF3.1 set at NNLO, the gluon distribution remains physical even at very low scales.

In the pythia8 tunes produced prior to this paper, the values used for \(\alpha _S \) were often not the same as those used in the PDFs. For example, in the Monash tune, the FSR \(\alpha _S (m_{\mathrm{Z}{}{}})\), set to 0.1365, is obtained by fitting pythia8 predictions to LEP event-shape measurements [6], the ISR \(\alpha _S (m_{\mathrm{Z}{}{}})\) is assumed to be equal to FSR \(\alpha _S (m_{\mathrm{Z}{}{}})\), and the hard scattering and MPI \(\alpha _S (m_{\mathrm{Z}{}{}})\) is set to 0.13 according to the value used in the LO PDF set. Even though the \(\alpha _S \) values are free parameters in event generators and various possibilities are viable, the usual course is to choose them consistent with the value used by the PDF set.

In this paper, a collection of new tunes is presented for PDF sets that are evaluated at different accuracies and tested against observables of MB, UE, and hard processes. The NNPDF3.1 PDF sets at the LO, NLO, and NNLO accuracy are used [17]. The LO PDF set uses an \(\alpha _S (m_{\mathrm{Z}{}{}})\) value of 0.13, while 0.118 [35] is the \(\alpha _S (m_{\mathrm{Z}{}{}})\) value used for the NLO and NNLO PDF sets. None of the central values of the PDF sets have negative values for any parton flavor in the phase space relevant for comparisons. Special care is required when applying these tunes at high-x regions, where the parton distributions in NNPDF3.1 NLO and NNLO PDF may become negative, which implies an unphysical (negative) value of the calculated cross sections.

The UE simulation is performed by pythia8, together with PS merged with a calculation of a higher-order or a multileg ME provided by external programs, such as powheg [36] or madgraph5_amc@NLO (mg5_amc) [37]. The issue of combining external ME calculations with PS contributions is addressed by the merging procedure. The procedures considered in this paper are the “FxFx” [38] or the “POWHEG” [39] methods for merging higher-order (NLO) MEs to PS and the “MLM” method [31].

During this study, we also investigated the effect of imposing an additional rapidity (y) ordering to ISR in these merging calculations. The pythia8 Monash tune includes a rapidity ordering for both ISR and MPI. The rapidity ordering acts as an extra constraint on the \(p_{\mathrm {T}}\)-ordered emissions, thus reducing the phase space for parton emission.

5 New CMS pythia8 tunes at 13\(\,\text {TeV}\)

In the following, a set of new 13\(\,\text {TeV}\) pythia8.226 tunes is presented with different choices of values of the strong coupling used in the modeling of the ISR, FSR, hard scattering, and MPI, as well as the order of its evolution as a function of the four-momentum squared \(Q^2\). We distinguish the new tunes according to the order of the PDF set used: LO-PDF, NLO-PDF, or NNLO-PDF. The tunes are labeled as CPi, where CP stands for “CMS pythia8 ” and i is a progressive number from 1 to 5. Only five parameters related to the simulation of MPI, to the overlap matter distribution function [40], and to the amount of CR are constrained for the new CMS tunes. In all tunes, we use the MPI-based CR model [41]. The CP tunes are multipurpose tunes, aiming for a consistent description of UE and MB observables at several collision energies and a reliable prediction of the UE simulation in various processes when merged with higher-order ME calculations.

The settings, used in the determination of the new CMS pythia8 UE tunes, are as follows:

  • Tune CP1 uses the NNPDF3.1 PDF set at LO, with \(\alpha _S \) values used for the simulation of MPI, hard scattering, FSR, and ISR equal to, respectively, 0.13, 0.13, 0.1365, and 0.1365, and running according to an LO evolution.

  • Tune CP2 is a slight variation with respect to CP1, uses the NNPDF3.1 PDF set at LO, with \(\alpha _S \) values used for the simulation of MPI, hard scattering, FSR, and ISR contributions equal to 0.13, and running according to an LO evolution.

  • Tune CP3 uses the NNPDF3.1 PDF set at NLO, with \(\alpha _S \) values used for the simulation of MPI, hard scattering, FSR, and ISR contributions equal to 0.118, and running according to an NLO evolution.

  • Tune CP4 uses the NNPDF3.1 PDF set at NNLO, with \(\alpha _S \) values used for the simulation of MPI, hard scattering, FSR, and ISR contributions equal to 0.118, and running according to an NLO evolution.

  • Tune CP5 has the same settings as CP4, but with the ISR emissions ordered according to rapidity.

The parameters related to the simulation of the hadronization and beam remnants are not varied in the fits and are kept fixed to the values of the Monash tune. The overlap distribution between the two colliding protons is modeled according to a double-Gaussian functional form with the parameters coreRadius and coreFraction. This parametrization of the transverse partonic overlap of two protons identifies an inner, denser part, the so-called core, and an outer less dense part. The coreRadius parameter represents the width of the core and the coreFraction, the fraction of quark and gluon content enclosed in the core. A double-Gaussian function is preferred for modeling the proton overlap over the negative exponential used in some previous tunes. Tunes using a double-Gaussian function tend to better reproduce the cross sections measured by the CMS experiment at \(\sqrt{s}=7\,\text {TeV} \) [10], simultaneously as a function of charged-particle multiplicity and transverse momenta.

The parameter that determines the amount of simulated CR in the MPI-based model is varied in the fits. A small (large) value of the final-state CR parameter tends to increase (reduce) the final particle multiplicities.

The new CMS pythia8 tunes are extracted by varying the parameters listed in Table 1 and by fitting UE observables at various collision energies. In the fitting procedure, we use the charged-particle and \(p_{\mathrm {T}} ^\text {sum}\) densities, measured in transMIN and transMAX regions as a function of \(p_{\mathrm {T}} ^\text {max}\), as well as the charged-particle multiplicity as a function of pseudorapidity \(\eta \), measured by CMS at \(\sqrt{s}=13\,\text {TeV} \) [16, 18]. In addition, we also use the charged-particle and \(p_{\mathrm {T}} ^\text {sum}\) densities as a function of the leading charged-particle \(p_{\mathrm {T}}\), measured in transMIN and transMAX by CMS at \(\sqrt{s}=7\,\text {TeV} \) [10] and by CDF at \(\sqrt{s}=1.96\,\text {TeV} \) [9].

Table 1 Parameters in the pythia8 MC event generator together with the PDFs determine the energy dependence of MPI, the overlap matter distribution function, and the amount of simulated color reconnection. The parameter ranges used for the fits are also listed

Tunes are determined by generating sets of predictions using the rivet[42] (version 2.5.2) and the professor[43] (version 1.4.0) frameworks with around 150 different choices of the five parameter values used in the event simulation. The predictions form a grid in the five-dimensional parameter space which is fitted using a third-order polynomial function. The uncertainty introduced in the fitted parameters due to the interpolation procedure is negligible compared with the quoted tune uncertainty. Results are found to be stable if one decreases this number to 100 or increases to 200, or uses a fourth-order polynomial function for the grid interpolation. The generated inelastic events include ND and diffractive (DD\(+\)SD\(+\)CD) contributions. The UE observables used to determine the tunes are sensitive to diffractive contributions only at very small \(p_{\mathrm {T}} ^\text {max}\) values (<3\(\,\text {GeV}\)). The ND component is dominant for \(p_{\mathrm {T}} ^\text {max}\) values greater than \(\approx \)3.0\(\,\text {GeV}\), since the cross section of the diffractive components rapidly decreases as a function of the exchanged \(p_{\mathrm {T}}\). Minimum-bias observables, such as the inclusive charged-particle multiplicity as a function of \(\eta \), are sensitive to all contributions over the whole spectrum.

The fit is performed by minimizing the \(\chi ^2\) function

$$\begin{aligned} \chi ^2(p)=\sum _{\text {O}_j} \sum _{i}\frac{(f_{i,\text {O}_j}(p)-R_{i,\text {O}_j})^2}{\varDelta _{i,\text {O}_j}^2} \end{aligned}$$
(1)

where the sum runs over each bin i of every observable O\(_\mathrm {j}\). The \(f_\mathrm {i}(p)\) functions represent a parametrization of the dependence of the predictions in bin i on the tuning parameters, \(R_\mathrm {i}\) is the value of the measured observable in bin i, and \(\varDelta _\mathrm {i}\) is the total experimental uncertainty of \(R_\mathrm {i}\). The best fit values of the tuned parameters are shown in Table 2 for CP1 and CP2, i.e., the tunes using LO PDF sets, and in Table 3 for CP3, CP4, CP5, i.e., the tunes using NLO or NNLO PDF sets. Uncertainties in the parameters of these tunes are discussed in Appendix A. No correlation across bins is included in the minimized \(\chi ^2\) function.

The value of pT0Ref and its energy dependence is very different between tunes based on LO PDF sets and tunes based on NLO or NNLO PDFs. While pT0Ref is around 2.3–2.4\(\,\text {GeV}\) for CP1 and CP2 tunes with \(\epsilon \approx 0.14\)–0.15, CP3, CP4, and CP5 tunes prefer much lower values for both pT0Ref (\(\approx \)1.4–1.5) and \(\epsilon \) (\(\approx \)0.03–0.04). A value of \(\epsilon \) of \(\approx \)0.03–0.04 corresponds to a very weak energy dependence of the threshold of the MPI cross section. These results can be understood by considering the shapes of the gluon densities at small x for the different PDF sets. In order to describe the UE observables, the rapidly increasing gluon densities at small x in LO PDF sets favor large values of pT0Ref. Meanwhile NLO and NNLO PDF sets, whose gluon densities are more flat at low x, need higher contributions of MPI, i.e., a small value of pT0Ref. Figure 3 shows the number of MPI observed for the various tunes and the gluon distribution at a reference scale of \(\mu = 3\,\text {GeV} \) for various NNPDF versions. The larger number of simulated MPI for NLO and NNLO tunes with respect to LO tunes is apparent.

Fig. 3
figure 3

Distribution of number of MPI simulated by the tunes Monash, CP2, CP3, CP4, and CP5 (left). Gluon distribution function at a reference scale of \(\mu = 3\,\text {GeV} \) (right) for the NNPDF2.3LO PDF set and the different versions of the NNPDF3.1 PDF set: LO, NLO, and NNLO. The ratio of NNPDF3.1 gluon distribution functions to the NNPDF2.3LO gluon distribution function are also shown

We have found that the values of pT0Ref and \(\epsilon \) also depend on the order of the running used for \(\alpha _S \). In particular, fits based on NLO or NNLO PDF sets, i.e., CP3, CP4, or CP5, with an LO \(\alpha _S \) running prefer even smaller values for both pT0Ref and \(\epsilon \) than the ones in the tunes obtained with an NLO \(\alpha _S \) running. This is because \(\alpha _S \) runs faster at NLO than at LO. When \(\alpha _S \) is run from the same value at the same scale (\(m_{\mathrm{Z}{}{}} \)), the effective coupling at low scales is larger for NLO running than for LO running. Therefore, a lower pT0Ref is needed for NLO \(\alpha _S \) running than for LO \(\alpha _S \) running to obtain a similar number of MPI.

For tunes based on NLO and NNLO PDF sets, the value of pT0Ref is as low as the initial scale of the PDF \(Q_\text {min}^2\). For interactions occurring at \(Q^2\) which are lower than \(Q_\text {min}^2\), the value of the PDF is left frozen to the value assumed at the initial scale.

Table 2 CMS pythia8 LO-PDF tunes CP1 and CP2. Both the values at \(Q = m_{\mathrm{Z}{}{}} \) and the order of running with \(Q^2\) of the strong coupling \(\alpha _S (m_{\mathrm{Z}{}{}})\) are listed. In these tunes, we use the Schuler-Sjöstrand diffraction model [44] and also include the simulation of CD processes. The number of degrees of freedom for tunes CP1 and CP2 is 63
Table 3 CMS pythia8 NLO-PDF tune CP3 and NNLO-PDF tunes CP4 and CP5. Both the values at \(Q = m_{\mathrm{Z}{}{}} \) and the order of running with \(Q^2\) of the strong coupling \(\alpha _S \) are listed. In these tunes, we use the Schuler-Sjöstrand diffraction model [44] and also include the simulation of CD processes. The number of degrees of freedom for tunes CP3, CP4, and CP5 is 63
Fig. 4
figure 4

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [16]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 2.0\). The transMIN densities are more sensitive to the MPI, whereas the transMAX densities are more sensitive to ISR and FSR. The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2. The ratios of the simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 5
figure 5

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [16]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 2.0\). The data are compared with the CMS pythia8 (N)NLO-PDF tunes CP3, CP4, and CP5. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 6
figure 6

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CMS \(\sqrt{s}=7\,\text {TeV} \) analysis [10]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 0.8\). The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 7
figure 7

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CMS \(\sqrt{s}=7\,\text {TeV} \) analysis [10]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 0.8\). The data are compared with the CMS pythia8 (N)NLO-PDF tunes CP3, CP4, and CP5. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 8
figure 8

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CDF \(\sqrt{s}=1.96\,\text {TeV} \) analysis [9]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 0.8\). The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 9
figure 9

The transMIN (upper left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (upper right) densities and the transMAX (lower left) charged-particle and charged \(p_{\mathrm {T}} ^\text {sum}\) (lower right) densities, as a function of the transverse momentum of the leading charged particle, \(p_{\mathrm {T}} ^\text {max}\), from the CDF \(\sqrt{s}=1.96\,\text {TeV} \) analysis [9]. Charged hadrons are measured with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 0.8\). The data are compared with the CMS pythia8 (N)NLO-PDF tunes CP3, CP4, and CP5. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 10
figure 10

The pseudorapidity distribution of charged hadrons measured in \(|\eta | < 2\) for an inclusive selection in inelastic proton-proton collisions, with zero magnetic field strength (B = 0 T), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [18]. The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2 (left), and with the CMS pythia8 NLO-PDF tune CP3 and the CMS pythia8 NNLO-PDF tunes CP4 and CP5 (right). The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

The contribution from CR also changes among the different tunes and depends on the choice of PDF and its order. In particular, the amount of CR is also affected by the shape of the PDFs at small fractional momenta x.

Parameters related to the overlap matter distribution function differ between the different tunes. They are strongly correlated with the other UE parameters governing the MPI and CR contributions. In general, for a given value of the matter fraction (coreFraction), MPI contributions increase for decreasing values of the core radius (coreRadius). The inclusion of the rapidity ordering for ISR in tune CP5 impacts the UE observables by reducing the number of charged particles, and needs to be compensated by a larger amount of MPI contributions.

The \(\chi ^2\) per degree of freedom (dof) listed in Tables 2 and 3 refers to the quantity \(\chi ^2(p)\) in Eq. (1), divided by dof in the fit. The eigentunes (Appendix A) correspond to the tunes in which the changes in the \(\chi ^2\) (\(\varDelta \chi ^2\)) of the fit relative to the best fit value equals the \(\chi ^2\) value obtained in the tune, i.e., \(\varDelta \chi ^2_\text {min}=\chi ^2\). Such a variation of the \(\chi ^2\) produces a tune whose uncertainty bands are roughly the same as the uncertainties in the fitted data points. This is the main motivation why this choice of variation was considered. For all tunes in Tables 2 and 3, the fit quality is good, with \(\chi ^2\)/dof values very close to 1.

Figures 4, 5, 6 and 7 show comparisons of the UE observables measured at various collision energies to predictions from the new tunes. Figures 4 and 5 compare the charged-particle and \(p_{\mathrm {T}} ^\text {sum}\) densities measured at \(\sqrt{s}=13\,\text {TeV} \) by the CMS experiment [16] in the transMIN and transMAX regions to predictions from the LO-PDF-based tunes and the higher-order-PDF-based tunes. Figures 6 and 7 compare the charged-particle and \(p_{\mathrm {T}} ^\text {sum}\) densities measured at \(\sqrt{s}=7\,\text {TeV} \) by the CMS experiment [10] in the transMIN and transMAX regions to predictions from the LO-PDF-based tunes and the higher-order-PDF-based tunes. In Figs. 8 and 9 similar comparisons are shown for the observables measured at \(\sqrt{s}=1.96\,\text {TeV} \) by the CDF experiment [9] in the transMIN and transMAX regions. All predictions reproduce well the UE observables at \(\sqrt{s}=1.96\), 7, and 13\(\,\text {TeV}\). Predictions from LO tunes are slightly better than the higher-order tunes in describing the energy dependence of the considered UE measurements.

In the region of small \(p_{\mathrm {T}} ^\text {max}\) values (\(p_{\mathrm {T}} ^\text {max}<3\,\text {GeV} \)), where contributions from diffractive processes are relevant, the predictions do not always reproduce the measurements and exhibit discrepancies up to 20%. Predictions from all of the new tunes cannot reproduce the UE data measured at \(\sqrt{s} = 300\) and 900\(\,\text {GeV}\) [9].

Figure 10 shows the charged-particle multiplicity as a function of pseudorapidity for charged particles in \(|\eta | < 2\) measured by the CMS experiment at \(\sqrt{s}=13\,\text {TeV} \) [18] in MB events. These events were recorded with no magnetic field, so all particles irrespective of their \(p_{\mathrm {T}}\) are measured. Data are compared with the predictions of the new pythia8 tunes. All of them are able to reproduce the measurement at the same level of agreement, independently of the PDF used for the UE simulation. We could not find any MB or UE observable where the level of agreement between data and predictions from the different tunes is significantly different.

6 Validation of the new pythia8 tunes

In this section, comparisons of the predictions obtained with the new tunes to various experimental measurements performed by the CMS experiment are provided. Unless otherwise stated, the comparisons are made at \(\sqrt{s}=13\,\text {TeV} \). We compare the CMS UE tunes with MB and UE data measured at central and forward pseudorapidities that are not used in the fits. We examine how well multijet, Drell–Yan, and top quark observables are predicted by MC simulations using higher-order ME generators merged with pythia8 with the various new tunes.

6.1 Comparisons using event-shape observables

In this subsection, predictions of the new tunes are compared to event-shape observables measured at LEP, in electron-positron collisions. These observables are particularly sensitive to the value of \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\). Given the leptonic initial state, there is no effect coming from the values of the MPI, color reconnection, and ISR parameters.

When predictions with pythia  8 are used, an optimal value of \(\alpha _S ^{\mathrm {FSR}}(m_{\mathrm{Z}{}{}})\sim 0.13\) is found, which best describes these observables, independent of the PDF used for the modeling of the PS evolution.

Figures 11 and 12 display the oblateness (O), sphericity (S), thrust (T), and thrust major (\(T_\text {major}\)), measured in \({\mathrm{{\mathrm{e}{}{}}}^{+}} {\mathrm{{\mathrm{e}{}{}}}^{-}} \rightarrow {\mathrm{Z}{}{}} \upgamma ^{{*}} \rightarrow {\mathrm{q}{}{}} {\overline{\mathrm{{\mathrm{q}{}{}}}}{}{}} \) final states at \(\sqrt{s} = 91.2\) \(\,\text {GeV}\) by the ALEPH experiment [45]. These observables measure the topology of the event. An isotropic event would have a value of T close to 0.5, while values of T close to 1 correspond to 2-jet events.

Predictions obtained with mg5_amc with up to 4 partons in the final state, and interfaced with the UE from the tune CUETP8M1 and the new pythia  8 tunes CP2, CP3, and CP5 are considered (Fig. 11). Predictions using the tune CP2 do not describe the event-shape observables very well, with discrepancies with the data up to 30% in the T and \(T_\text {major}\). In particular, tune CP2 predicts too many isotropical events. A similar description is obtained for predictions of mg5_amc \(+\) pythia  8 with the tune CUETP8M1. A better agreement in the event-shape variables is observed for predictions using tune CP3 and CP5. A correct description of event-shape observables strongly depends on the value of the FSR strong coupling. The observations above indicate that when merged configurations are considered, i.e., mg5_amc \(+\) pythia8, where partons at higher multiplicities in the final state are simulated at the ME level, the description of event-shape observables degrades. A value of \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\sim 0.13\) generally overestimates the number of final-state partons, while a lower \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\sim 0.12\) performs better.

At large values of T, where the hadronization effects become relevant, we observe a large difference between predictions from tunes using a small \(\alpha _S ^\mathrm {FSR}\) (CP3 and CP5) and tunes using a large \(\alpha _S ^\mathrm {FSR}\) (CP2 and CUETP8M1). These differences may be due to the interplay between the value of the strong coupling and the hadronization. Analyses particularly sensitive to hadronization should carefully evaluate the corresponding systematic uncertainties. In some cases retuning hadronization parameters may be desired.

We also compared mg5_amc \(+\) pythia  8 with CP5, and CP5 with CMW rescaling [46] (Fig. 12). Apart from T, for all shape variables considered, CP5 without CMW rescaling describes the data better.

Fig. 11
figure 11

The normalized cross sections as a function of event-shape variables, oblateness (upper left), sphericity (upper right), thrust (lower left), and thrust major (lower right) from the ALEPH \(\sqrt{s} =91.2\,\text {GeV} \) analysis [45], compared with the predictions by mg5_amc \(+\) pythia8 with \(k_{\mathrm {T}}\)–MLM merging, for tunes CP2, CP3, and CP5. The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 12
figure 12

The normalized cross sections as a function of event-shape variables, oblateness (upper left), sphericity (upper right), thrust (lower left), and thrust major (lower right) from the ALEPH \(\sqrt{s} =91.2\,\text {GeV} \) analysis [45], compared with the predictions by mg5_amc \(+\) pythia8 with \(k_{\mathrm {T}}\)–MLM merging, for tune CP5, CP5 with CMW rescaling, CP5 FSR up, and CP5 FSR down. The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

6.2 Comparisons using MB and other UE observables

In this subsection, predictions of the new tunes are compared to the observables measured in MB collisions that are sensitive to contributions from soft emissions and MPI. Figure 13 shows the charged-particle multiplicity as a function of pseudorapidity [24] in NSD-enhanced and SD-enhanced event samples. The details of the selections can be found in Ref. [24]. These observables are sensitive to SD, CD, and DD dissociation. It is observed that predictions from all of the tunes are similar to each other and describe well the measurements for both considered selections. This shows that the number of charged particles produced in diffractive processes and inelastic collisions is simultaneously described by the new CMS tunes. Figure 13 also demonstrates that tunes based on NNPDF3.1 PDF sets at orders higher than LO adequately describe the MB data.

Figure 14 shows the UE observables, i.e., charged-particle multiplicity and \(p_{\mathrm {T}} ^\text {sum}\) densities [16], as a function of the \(p_{\mathrm {T}}\) of the leading jet reconstructed using just charged particles. The observables shown in Fig. 14 are from events selected without requiring any NSD- or SD-enhanced selections. The CMS UE tunes describe well UE-sensitive data measured using the leading charged-particle jet for \(p_{\mathrm {T}}\) \(^{\text {jet}}>10\,\text {GeV} \). Tunes based on NLO or NNLO PDF sets, i.e., CP3, CP4, and CP5, describe the region at lower \(p_{\mathrm {T}}\) \(^{\text {jet}}\) better than CP1 and CP2, which are based on LO PDF sets. Predictions obtained with CP1 and CP2 underestimate the UE observables by about \(\approx \)15–20%. Predictions obtained with CP3, CP4, and CP5 describe the UE in events characterized using the leading charged particle, as well as those characterized by the leading charged-particle jet, quite well.

Predictions for observables measured in the forward region are compared with data and shown in Figs. 15 and 16. The energy flow, defined as the average energy per event [47], as a function of \(\eta \) with the Hadron Forward (HF) calorimeter [48] covering \(3.15<|\eta |< 5.20\) and the CASTOR calorimeter [48] covering \(-6.6<\eta <-5.2\), is well reproduced by all tunes. A different level of agreement is achieved for predictions from the new CMS tunes for the spectrum of the total energy E measured in the CASTOR calorimeter at \(\sqrt{s}=13\,\text {TeV} \) [49], displayed in Fig. 16. In particular, the tunes based on LO PDF sets reproduce the energy spectrum well at large values (\(\text {E}>2000\,\text {GeV} \)), but have differences of up to 40% at low values (\(\text {E}<800\,\text {GeV} \)). The tunes using higher-order PDF sets are closer to the data at low energy values, with differences up to 20%, but tend to overestimate the energy at large values. This dissimilar behaviour is driven by the different pT0Ref values of the tunes. The fiducial inelastic cross sections [50], when two different selections are applied in the forward region, are not well reproduced by any of the new tunes or by CUETP8M1, with differences up to 10%. This might be because of the Schüler–Sjöstrand [44] diffraction model used in the simulation, which might have a suboptimal description of the low-mass diffractive components. A better description might be provided by tunes using the Donnachie–Landshoff [51] or minimum-bias Rockefeller [52] diffractive models.

Fig. 13
figure 13

The pseudorapidity distribution (\(p_{\mathrm {T}} >0.5\,\text {GeV} \), \(|\eta |<2.4\)) for the NSD-enhanced (left) and the SD-enhanced (right) event selection of charged particles in inelastic proton-proton collisions, from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [24]. The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2, the CMS pythia8 NLO-PDF tune CP3, and the CMS pythia8 NNLO-PDF tunes CP4 and CP5. The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 14
figure 14

The transMIN charged-particle multiplicity (left column) and \(p_{\mathrm {T}}\) sum densities (right column) for particles with \(p_{\mathrm {T}} >0.5\,\text {GeV} \) in \(|\eta |< 2.0\) as a function of the transverse momentum of the leading charged-particle jet, \(p_{\mathrm {T}} ^\text {jet}\), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [16]. The upper-row plots show the LO tunes, while the lower-row plots show the higher-order tunes. The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 15
figure 15

The energy flow measured in an inclusive selection as a function of pseudorapidity, from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [47]. The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2 (left), and with the CMS pythia8 NLO-PDF tune CP3 and the CMS pythia8 NNLO-PDF tunes CP4 and CP5 (right). The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Fig. 16
figure 16

The total energy spectrum measured in the pseudorapidity interval \(-6.6<\eta <-5.2\), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [49]. The data are compared with the CMS pythia8 LO-PDF tunes CP1 and CP2 (left), and with the CMS pythia8 NLO-PDF tune CP3 and the CMS pythia8 NNLO-PDF tunes CP4 and CP5 (right). The ratio of the simulations to the data (MC/Data) is also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

6.3 Comparisons using observables in multijet final states

In this subsection, we present comparisons of observables measured in multijet final states. For these studies, the NLO dijet MEs implemented in the powheg event generator merged with the pythia8 simulation of the PS and UE are used. The merging between the powheg ME calculations and the pythia8 UE simulation is performed using the shower-veto procedure, which rejects showers if their transverse momentum is greater than the minimal \(p_{\mathrm {T}}\) of all final-state partons simulated in the ME (parameter \(p_{\mathrm {T}}\) \(^\text {hard}= 2\,\text {GeV} \) [53]). Variables in multijet events, such as jet transverse momenta or azimuthal dijet correlations, are expected to be less affected by MPI contributions, since jets at high \(p_{\mathrm {T}}\) (>100\(\,\text {GeV}\)) mainly originate from the hard scattering or additional hard emissions, which are simulated in the powheg calculation by the ME formalism. However, the MPI contribution still has some impact because it adds an average energy offset to the event, which is then included in the jet reconstruction [54, 55]. The predictions reproduce well inclusive jet cross sections as a function of jet \(p_{\mathrm {T}}\) at both central and forward jet rapidities, irrespective of the cone size (0.4 or 0.7) used for the jet clustering algorithm [56].

Fig. 17
figure 17

The azimuthal difference \(\varDelta \phi _{1,2}\) between the leading two jets with \(|\eta |<2.4\) in dijet events with leading-jet transverse momentum in the range (left) \(200<p_{\mathrm {T}} ^{\text {lead}}< 300\,\text {GeV} \) and (right) \(300<p_{\mathrm {T}} ^{\text {lead}}<400\,\text {GeV} \), from the CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [57]. The jets are reconstructed using the anti-\(k_{\mathrm {T}}\) jet finding algorithm [58, 59] with a distance parameter of 0.4. The data are compared with predictions of the NLO dijet ME calculation from powheg, interfaced to the pythia8 tunes CUETP8M1, CP2, CP4, and CP5. Tunes CP1 and CP3 are not shown in the plot but present a similar behavior as tunes CP2 and CP4. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Figure 17 shows the normalized cross section [57] as a function of the azimuthal difference \(\varDelta \phi _{1,2}\) between the two leading jets for two different selections on the leading jet \(p_{\mathrm {T}}\): \(200<p_{\mathrm {T}} <300\,\text {GeV} \) and \(300<p_{\mathrm {T}} <400\,\text {GeV} \). The results indicate that UE tunes based on an NLO evaluation of \(\alpha _S (m_{\mathrm{Z}{}{}})\) describe the data better than UE tunes based on LO evolution. In particular, the better agreement is driven by the lower value of \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})\). In fact, predictions obtained with powheg merged with pythia8 with the CUETP8M1 or CP2 tune exhibit a strong jet decorrelation, due to a large contribution from emissions simulated from the PS, and they overestimate the cross sections at small and medium \(\varDelta \phi _{1,2}\) values (\(\varDelta \phi _{1,2}< 2.4\)). The PS component is reduced by the lower value of \(\alpha _S (m_{\mathrm{Z}{}{}})\), which increases the degree of correlation between the selected jets, resulting in a better description of the data by predictions of the CP4 and CP5 tunes. A similar outcome was also observed for an analogous measurement performed at the D0 experiment at \(\sqrt{s}=1.96\,\text {TeV} \) [60]. In general, predictions obtained from powheg  \(+\) pythia8 tend to differ from the data at low and intermediate \(\varDelta \phi _{1,2}\) values (\(\varDelta \phi _{1,2}< 2.7\)) by 10–40%.

6.4 Comparisons using observables sensitive to double-parton scattering

In this subsection, we present comparisons of predictions of the new tunes to DPS-sensitive observables measured by the CMS experiment at \(\sqrt{s}=7\,\text {TeV} \) in final states with four jets (4j) [61], and with two jets originating from bottom quarks (b jets) [62] and two other jets (2b2j) [63].

The topology in the transverse plane of the physics objects measured in the final state is sensitive to contributions from DPS. In particular, the 4j analysis performed by the CMS experiment requires two jets at high \(p_{\mathrm {T}}\) (hard jets) and two jets at low \(p_{\mathrm {T}}\) (soft jets); the 2b2j measurement selects two jets originating from b quarks and two other jets (light-flavor jets). Both of them measured the \(\varDelta \)S observable, defined as:

$$\begin{aligned} \varDelta S=\arccos \left( \frac{\vec {p}_{\text {T,1}}\cdot \vec {p}_{\text {T,2}}}{|\vec {p}_{\text {T,1}} | |\vec {p}_{\text {T,2}} |}\right) , \end{aligned}$$
(2)

where \(\vec {p}_\mathrm {T,1}\) refers to the momentum of the hard-jet or bottom jet pair system and \(\vec {p}_\mathrm {T,2}\) to that of the soft-jet or light-flavor jet pair system. This variable relates the production planes of the hard (bottom) jet and soft- (light-flavor) jet pairs. Details of the event selection and of the specific analyses can be found in Refs. [61] and [63].

Assuming that the two hard scatterings occurring within the same collision are completely independent of each other, the DPS cross section for a given process can be expressed through the inclusive partonic cross sections of the two single scatterings and an effective cross section, \(\sigma _{\text {eff}}\). In a geometrical approach, this cross section is related to the transverse size of the proton and to the total inelastic proton-proton (\({\mathrm{p}{}{}} {\mathrm{p}{}{}} \)) cross section [64, 65]. When no correlations among the partons inside the proton are present, \(\sigma _{\text {eff}}\) is similar to the inelastic \({\mathrm{p}{}{}} {\mathrm{p}{}{}} \) cross section. In this simple factorized approach, one expects \(\sigma _{\text {eff}}\) to be independent of the partonic final states of the two hard processes occurring within the same collision. In pythia8, the value of \(\sigma _{\text {eff}}\) is calculated by dividing the ND cross section by the so-called “enhancement factor”, which depends on the parameters of the overlap matter distribution function and on pT0Ref [40]. For central \({\mathrm{p}{}{}} {\mathrm{p}{}{}} \) collisions, the enhancement factor tends to be large, translating to a lower value of \(\sigma _{\text {eff}}\) and a larger DPS contribution. For peripheral interactions, enhancement factors are small, giving large values of \(\sigma _{\text {eff}}\) and a small DPS contribution.

Table 4 shows the values of \(\sigma _{\text {eff}}\) published by the CMS Collaboration for the 4j and the 2b2j measurements. A previous study [5] concluded that observables sensitive to semi-hard MPI and those sensitive to DPS cannot be described by a single set of parameters. Table 5 displays the \(\sigma _{\text {eff}}\) values obtained from the new CMS UE tunes. The central values of \(\sigma _{\text {eff}}\) are consistent among the new tunes and are slightly larger than the values of the DPS-based tunes [5].

Figure 18 shows the comparisons of predictions obtained from pythia8 with tunes CUETP8M1, CP2, CP4, and CP5 to the DPS observables measured in the 4j and 2b2j final states. Predictions from the CP2 tune based on a LO PDF set describe the central values better than the CP4 and CP5 tunes based on an NNLO PDF set or the old tune CUETP8M1. This is due to the different pT0Ref value used by CP2, CP4, and CP5, which determines the amount of simulated MPI. The value of the pT0Ref parameter is driven by the distribution of the gluon distribution function at low x, which is very different in LO and NNLO PDF sets. Additionally, predictions obtained with CP4 describe the DPS-sensitive observables better than CP5. This is due to the different rapidity ordering used for the PS emissions in the two tunes. By removing the rapidity ordering for the PS emissions (CP4), the simulation produces more radiation and decreases the correlation between the selected jet pairs compared to CP5. This reduced jet correlation tends to mimic a DPS event by producing low values of \(\varDelta \)S. We have checked that the observables sensitive to color coherence, which were measured by the CMS experiment at \(\sqrt{s}=7\,\text {TeV} \) [66], are well described by predictions from both CP4 and CP5 tunes, despite the difference in the rapidity ordering of the PS simulation between the two tunes.

Table 4 Values of \(\sigma _{\text {eff}}\) at \(\sqrt{s}=7\,\text {TeV} \) published by the CMS Collaboration for the four-jet final states, obtained by fitting predictions of the pythia8 MC event generator to DPS-sensitive measured observables
Table 5 Values of \(\sigma _{\text {eff}}\) at \(\sqrt{s}=7\) and 13\(\,\text {TeV}\) obtained with the new CMS UE tunes
Fig. 18
figure 18

The correlation observable \(\varDelta \)S measured in 4j (left) and 2b2j (right) production, compared to predictions of pythia8 tunes CUETP8M1, CP2, CP4, and CP5, from the CMS \(\sqrt{s}=7\,\text {TeV} \) analyses [61, 63]. Tunes CP1 and CP3 are not shown in the plot but show a similar behaviour as, respectively, tunes CP2 and CP4. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

6.5 Comparisons using observables in top quark production

In the following, we investigate how the new pythia8 tunes describe the CMS \(\mathrm{t}\overline{\mathrm{t}}\) data when different ME generators, namely powheg and mg5_amc, are employed. Both ME configurations use the NNPDF3.1 NNLO PDF with \(\alpha _S (m_{\mathrm{Z}{}{}})=0.118\) and assume a top quark mass (\(m_{\mathrm{t}{}{}} \)) value of 172.5\(\,\text {GeV}\).

In the powheg configuration, the ME heavy quark production mode [36, 39, 68] is used. In this configuration, powheg simulates inclusive \(\mathrm{t}\overline{\mathrm{t}}\) production at NLO, where the first additional jet is computed at LO, while mg5_amc performs the calculation with up to 2 additional jets at NLO, with a third jet simulated at LO. The powheg generator scales the real emission cross section by a damping function that controls the ME-PS merging and that regulates the high-\(p_{\mathrm {T}}\) radiation. The damping variable used in the powheg simulation is set to 1.379 times \(m_{\mathrm{t}{}{}} \), a value derived from data at \(\sqrt{s}=8\,\text {TeV} \) in the dilepton channel using a similar ME calculation and assuming the CP5 tune. The factorization and renormalization scales are assumed equal to the transverse mass of the top quark, \(m_{\mathrm {T}} ^{\mathrm{t}{}{}} =\sqrt{\smash [b]{m_{\mathrm{t}{}{}} ^2+p_{\mathrm {T}} ^2}}\). The minimum \(p_{\mathrm {T}}\) for the emission of light quarks in powheg is 0.8\(\,\text {GeV}\). The pThard parameter is set to 0 and the powheg hardness criterion, defined by the pTdef option, is set to 1. The merging scale in mg5_amc  is set to 40\(\,\text {GeV}\), and the threshold applied to regulate multijet MEs in the mg5_amc  FxFx merging procedure, is 20\(\,\text {GeV}\).

Distributions [69] in the lepton\(+\)jets channel are compared to predictions from different tunes using various settings, namely, powheg  \(+\) pythia8, and mg5_amc+pythia8 with FxFx merging [38], referred to as mg5_amc [FxFx] hereafter, with the CUETP8M1, CP2, CP4, and CP5 tunes. Figure 19 (upper panel) displays the normalized \(\mathrm{t}\overline{\mathrm{t}}\) cross section in bins of \(p_{\mathrm {T}}\) of the top quark decaying leptonically (\({\mathrm{t}{}{}} _\ell \)), in data and simulation. For all tunes, powheg  + pythia8 predictions have deviations below 10% with respect to the central values of the data. The central values of predictions from mg5_amc [FxFx] and data agree within \(\approx \)10% for \(p_{\mathrm {T}} ({\mathrm{t}{}{}} _\ell )<400\,\text {GeV} \) and within \(\approx \)20% for higher \(p_{\mathrm {T}}\).

Figure 19 (middle panel) shows the normalized \(\mathrm{t}\overline{\mathrm{t}}\) cross section in bins of m(\(\mathrm{t}\overline{\mathrm{t}}\)) in data and simulation. Predictions from powheg and mg5_amc [FxFx] with the new tunes describe the central values of the data reasonably well. Normalized \(\mathrm{t}\overline{\mathrm{t}}\) cross sections in bins of number of additional jets in data and simulation in the lepton\(+\)jets channel at \(\sqrt{s}=13\,\text {TeV} \) are shown in Fig. 19 (lower panel). The cross sections are compared with the predictions of powheg and of mg5_amc [FxFx]. The central values predicted by powheg  \(+\) pythia8 are in good agreement with data when CP5 tune is used. The value of \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})\) in combination with the rapidity ordering for ISR in the pythia8 simulation affects the additional jet distribution in \(\mathrm{t}\overline{\mathrm{t}}\) events. Predictions obtained from powheg  \(+\) pythia8 overestimate the data when a high value of \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})\approx 0.13\) is used (CUETP8M1 and CP2 tunes) irrespective of rapidity ordering for ISR. It is observed that even when \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})=0.118\) is used, predictions from the CP4 tune overshoot the data at high jet multiplicities. A much better agreement of central values is obtained only when rapidity ordering for ISR is switched on in the pythia8 simulation and \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})=0.118\) is used as in the CP5 tune. Predictions from mg5_amc [FxFx]\(+\) pythia8 with CUETP8M1, CP2, CP4, and CP5 tunes describe the central values of the data reasonably well.

Fig. 19
figure 19

The normalized \(\mathrm{t}\overline{\mathrm{t}}\) cross section in the lepton\(+\)jets channel, as a function of the transverse momentum of the top quark for leptonically decaying top quarks (\(t_\ell \)) (upper), the invariant mass of the \(\mathrm{t}\overline{\mathrm{t}}\) system, m(\(\mathrm{t}\overline{\mathrm{t}}\)) (middle), and in bins of number of additional jets (lower) from CMS \(\sqrt{s}=13\,\text {TeV} \) analysis [69]. The data are compared with the predictions of powheg (left) and mg5_amc [FxFx] (right). In both cases, the PS simulation is done with the pythia8 tunes CUETP8M1, CP2, CP4, or CP5. Tunes CP1 and CP3 are not shown in the plot but present a similar behaviour as, respectively, tunes CP2 and CP4. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Comparisons are also made using jet substructure observables in \(\mathrm{t}\overline{\mathrm{t}}\) events in the lepton\(+\)jets channel using measurements by CMS at \(\sqrt{s}=13\,\text {TeV} \) [70]. Figure 20 displays the comparisons using the distribution of the angle between two groomed subjets, \(\varDelta R_g\), which is found to be the most sensitive to \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\) [70]. The data are compared to simulations with the tunes CUETP8M1, CP2, CP4, and CP5, as well as CP5 FSR up (\(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})=0.122\)), CP5 FSR down (\(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})=0.115\)), and CP5 with CMW rescaling. It is observed that tunes with higher \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\) (CUETP8M1, CP2, and CP5 FSR up) describe the data better. Tune CP5 with CMW rescaling resolves the discrepancy of CP5 at high \(\varDelta R_g\), but worsens the description at \(\varDelta R_g\sim 0.27\) compared to CP5. It should be noted that a fit to the \(\varDelta R_g\) distribution using a b-enriched sample yields \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})=0.130^{+0.016}_{-0.020}\) [70] without CMW rescaling, while a fit to the distirubtion of the UE observable \(\overline{p_{\mathrm {T}}}\) measured in \(\mathrm{t}\overline{\mathrm{t}}\) events yields \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})=0.120\pm 0.006\) [22]. Therefore, in \(\mathrm{t}\overline{\mathrm{t}}\) events, UE and jet substructure observables prefer different central \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\) values, but they are compatible within uncertainties.

Fig. 20
figure 20

Comparison with the measurement [70] of the angle between two groomed subjets, \(\varDelta R_g\) in \(\mathrm{t}\overline{\mathrm{t}}\) events predicted by powheg  \(+\) pythia8 for the different tunes. The data are compared to tunes CUETP8M1, CP2, CP4, and CP5 (left). Tunes CP1 and CP3 are not displayed but they present a similar behavior as tunes CP2 and CP4, respectively. The data are also compared to CP5, CP5 FSR up, CP5 FSR down, and CP5 with CMW rescaling (right). The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Horizontal bars indicate the associated bin width

6.6 Comparisons using observables in W and Z boson production

In this subsection, we present a validation of the new CMS UE tunes for observables measured in events with a W or Z boson in the final state at \(\sqrt{s}=13\,\text {TeV} \). For the comparisons, we use predictions obtained with mg5_amc \(+\) pythia8 at LO using the \(k_{\mathrm {T}}\)–MLM merging scheme, and at NLO using the FxFx merging scheme. The \(k_{\mathrm {T}}\)–MLM merging scale is set to 19\(\,\text {GeV}\), while for FxFx the corresponding scale is set to 30\(\,\text {GeV}\). In both cases the MEs include the final states with 0, 1, 2, and 3 partons, and up to 2 partons are calculated at NLO precision in the FxFx case. To ease the comparison of the different tunes, the same PDF, NNPDF3.1 NNLO, and \(\alpha _S (m_{\mathrm{Z}{}{}}) = 0.118\) are used for the ME calculation independently of the tune.

First, UE observables [21] in Drell–Yan events in an invariant mass window of 81–101\(\,\text {GeV}\) around the Z boson peak for muonic decays are studied. The charged-particle density and transverse momentum sum are measured as a function of the Z boson \(p_{\mathrm {T}}\) in the three regions introduced in Sect. 6.2: toward, away, and transverse. The regions are defined with respect to the Z boson direction. The measurements are compared with FxFx predictions obtained with the CUETP8M1, CP2, CP4, and CP5 tunes in Fig. 21. The measurements are, in general, well-described by all tunes.

Fig. 21
figure 21

The charged-particle multiplicity (left) and \(p_{\mathrm {T}} ^\text {sum}\) (right) in the toward (upper), transverse (middle), and away (lower) regions measured as a function of the Z boson \(p_{\mathrm {T}}\) in Drell–Yan events at \(\sqrt{s}=13\,\text {TeV} \) [21], and compared with the predictions obtained by an inclusive NLO ME calculated by mg5_amc, interfaced to the UE simulation of pythia8 with the CUETP8M1, CP2, CP4, and CP5 tunes. Tunes CP1 and CP3 are not shown in the plot but present a similar behaviour as, respectively, tunes CP2 and CP4. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty of the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

The description of the cross section as a function of the jet multiplicity is also investigated in Z \(+\)jets [71] and W \(+\)jets [72] final states. The Z \(+\)jets measurement is restricted to the phase space where the two leptons have \(p_{\mathrm {T}} >20\,\text {GeV} \) and \(|y |<2.4\) and the dilepton mass lies in a \(\pm 20\,\text {GeV} \) window around 91\(\,\text {GeV}\). The momenta of the photons inside a cone of \(\varDelta \text {R}<0.1\) are added to the lepton momentum in order to partly recover the energy lost by FSR. Jets are clustered using the anti-\(k_{\mathrm {T}}\) algorithm with \(R = 0.4\) and must satisfy the criteria \(p_{\mathrm {T}} > 30\,\text {GeV} \) and \(|y | < 2.4\). The distance between the selected leptons and the leading jet \(\varDelta R(\ell ,\mathrm {j})\) must be greater than 0.4. For the W \(+\)jets measurement, the phase space is restricted by a transverse mass requirement, \(m_{\mathrm {T}} > 50\,\text {GeV} \), and by requirements on the muon, \(p_{\mathrm {T}} > 25\,\text {GeV} \) and \(|y |<2.4\). In the Z \(+\)jets measurements the same clustering algorithm, the FSR recovery prescription described above, and the lepton jet separation requirement are applied.

The comparisons of the jet multiplicities to various predictions are shown in Fig. 22. The measurement of the cross section inclusive in the number of jets, N, is not available for the W \(+\)jets analysis and the lower plots start at \(N=1\). The \(k_{\mathrm {T}}\)–MLM predictions of the jet multiplicity have little sensitivity to the UE and PS tunes, so all the tunes provide a good description of this observable, with a slightly better agreement observed for the CP2 tune. In the case of the FxFx sample, the CP5 tune predicts fewer events with a jet multiplicity of more than four with respect to the measurement. The deficit increases for increasing jet multiplicities. The CUETP8M1 tune shows a similar behaviour, though.

Fig. 22
figure 22

Comparison with the measurement [71, 72] of the inclusive jet multiplicity in Z \(+\)jets (upper) and W \(+\)jets (lower) events predicted by mg5_amc \(+\) pythia8 with \(k_{\mathrm {T}}\)–MLM merging (left) and FxFx merging (right) for the different tunes. Tunes CP1 and CP3 are not shown in the plot but present a similar behaviour as, respectively, tunes CP2 and CP4. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Predictions using the new CMS UE tunes are also compared with the \(p_{\mathrm {T}} \) balance between the Z boson and the jets with \(p_{\mathrm {T}} > 30\,\text {GeV} \) and \(|y |<2.4\) using the variable \(p_{\mathrm {T}}\) \(^\text {bal} = |{\vec p}_{\mathrm {T}} ({\mathrm{Z}{}{}}) + \sum _{\text {jets}} {\vec p}_{\mathrm {T}} (\mathrm {j}_i) |\) [71]. This variable is sensitive to PS and UE. The comparison is shown in Fig. 23 for events with at least one jet. Differences between the tunes are significant only in the region below \(\approx \)20\(\,\text {GeV}\). The discrepancy in this region for the FxFx samples indicates that the distribution peaks at lower values for CP4 and CP5 than in data.

Fig. 23
figure 23

Comparison with the measurement [71] of the \(p_{\mathrm {T}}\) balance predicted by mg5_amc \(+\) pythia8 with \(k_{\mathrm {T}}\)–MLM merging (left) and FxFx merging (right) for the different tunes for events with at least one jet. Tunes CP1 and CP3 are not shown in the plot but they present a similar behaviour as tunes CP2 and CP4, respectively. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in the data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

Results of Ref. [71] are also used to validate the description of the transverse momentum of the weak vector boson in \({\mathrm{Z}{}{}} \) \(+\) \(\ge 1\) jet events. The comparison is shown in Fig. 24. The new tunes provide similar descriptions for this distribution. Predictions using \(k_{\mathrm {T}}\)–MLM achieve a poor agreement with the data, independently of the UE tune, with respect to FxFx, which is able to describe the transverse momentum of the Z boson at \(p_{\mathrm {T}} >10\,\text {GeV} \). The region below 10\(\,\text {GeV}\) is poorly described for both FxFx and \(k_{\mathrm {T}}\)–MLM and the new tunes, but is well-described by predictions using the CUETP8M1 tune.

Fig. 24
figure 24

Comparison with the measurement [71] of the \(p_{\mathrm {T}}\) (Z) predicted by mg5_amc \(+\) pythia8 with \(k_{\mathrm {T}}\)–MLM merging (left) and FxFx merging (right) for the different tunes. Tunes CP1 and CP3 are not shown in the plot but they present a similar behaviour as tunes CP2 and CP4, respectively. The ratios of simulations to the data (MC/Data) are also shown, where the shaded band indicates the total experimental uncertainty in data. Vertical lines drawn on the data points refer to the total uncertainty in the data. Vertical lines drawn on the MC points refer to the statistical uncertainty in the predictions. Horizontal bars indicate the associated bin width

To summarize the study of weak vector boson production, the CP2, CP4, and CP5 tunes provide similar descriptions of the UE observables with a reasonable agreement with the data. In general, the CP2 tune performs better in describing variables such as \(p_{\mathrm {T}}\) \(^{\text {bal}}\) and \(p_{\mathrm {T}}\) (Z). For the jet multiplicity, the CP2 and CP4 tunes are equally good in describing the measurement, whereas CP5 tends to undershoot the PS dominated region with at least five jets with a significance of 3.5 standard deviations.

7 Summary and conclusions

A new set of tunes for the underlying-event (UE) simulation in the pythia8 event generator is obtained by fitting various measurements sensitive to soft and semihard multipartonic interactions at different collision energies. To derive these tunes, the leading order (LO), next-to-leading order (NLO), or next-to-next-to-leading order (NNLO) versions of the NNPDF3.1 parton distribution function (PDF) set for the simulation of the underlying-event components are used. In these tunes, the values of the strong coupling, \(\alpha _S (m_{\mathrm{Z}{}{}})\), used for the simulation of hard scattering, initial- and final-state radiation, and multiple-parton interactions are chosen consistent with the order of the PDF used. In the LO NNPDF3.1 set, \(\alpha _S (m_{\mathrm{Z}{}{}}) = 0.130\), whereas for the NLO and NNLO NNPDF3.1 sets, \(\alpha _S (m_{\mathrm{Z}{}{}}) = 0.118\). In general, the combination of contributions from multiple-parton interactions and parton-shower emissions is crucial to give a good description of variables measured in soft-collision events. The infrared threshold is relatively independent of center-of-mass energy when using NLO or NNLO PDF sets. Irrespective of the specific PDF used, predictions from the new tunes reproduce well the UE measurements at center-of-mass energies \(\sqrt{s}=1.96\) and 7\(\,\text {TeV}\). A significant improvement in the description of UE measurements at 13\(\,\text {TeV}\) is observed with respect to predictions from old tunes that were extracted using data at lower collision energies. For the first time, predictions based on higher-order PDF sets are shown to give a reliable description of minimum-bias (MB) and UE measurements, with a similar level of agreement as predictions from tunes using LO PDF sets.

Predictions of the new tunes agree well with the data for MB observables measured at pseudorapidities in the central (\(|\eta |< 2.4\)) and forward (\(3.2<|\eta |< 4.7\)) regions. The new CMS tunes simultaneously describe the number of charged particles produced in diffractive processes and MB collisions. Neither the new CMS tunes nor the CUETP8M1 tune describe the very forward region (\(-6.6<\eta <-5.2\)) well.

Measurements sensitive to double-parton scattering contributions are reproduced better by predictions using the LO PDF set in the UE simulation, without rapidity ordering of the initial-state shower.

The UE simulation provided by the new tunes can be interfaced to higher-order and multileg matrix element generators, such as powheg and mg5_amc, without degrading the good description of UE observables. Such predictions also reproduce well observables measured in multijet final states, Drell–Yan, and top quark production processes. The central values of the normalized \(\mathrm{t}\overline{\mathrm{t}}\) cross section in bins of the number of additional jets predicted by powheg  \(+\) pythia8 overestimate the data when a high value of \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})\gtrsim 0.130\) is used (CMS pythia8 CP1 and CP2 tunes). Even when \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})=0.118\) is used, the CP4 tune overestimates the data at high jet multiplicities. This is cured by the rapidity ordering of the initial-state shower (CP5 tune). Measurements of azimuthal dijet correlations are also better described when a value of \(\alpha _S ^\mathrm {ISR}(m_{\mathrm{Z}{}{}})=0.118\) is used in predictions obtained with powheg merged with pythia8.

Comparisons with LEP event-shape observables and the distribution of the angle between two groomed subjets (\(\varDelta R_g\)) in \(\mathrm{t}\overline{\mathrm{t}}\) events at the LHC show that in ME-PS merged configurations CMW rescaling is disfavored. It is also found that \(\varDelta R_g\) is better described by tunes with \(\alpha _S ^\mathrm {FSR}(m_{\mathrm{Z}{}{}})\) higher than \(\sim \)0.120 while LEP event-shape observables and UE event observables in \(\mathrm{t}\overline{\mathrm{t}}\) events prefer a central value \(\sim \)0.120 [22].

All of the new CMS tunes are supplied with their eigentunes, which can also be used to determine the uncertainties associated with the theoretical predictions. We show that predictions using the new tunes based on PDFs determined at LO, NLO, and NNLO agree reasonably well with the measurements, and that the new tunes can also be applied to LO and NLO calculations merged with parton showers, multiple-parton interactions, and hadronization.