Measurement of distributions sensitive to the underlying event in inclusive Z boson production in pp collisions at √s = 13 TeV with the ATLAS detector

This paper presents measurements of charged-particle distributions sensitive to the properties of the underlying event in events containing a Z boson decaying into a muon pair. The data were obtained using the ATLAS detector at the LHC in proton–proton collisions at a centre-of-mass energy of 13 TeV with an integrated luminosity of 3.2 fb − 1 . Distributions of the charged-particle multiplicity and of the charged-particle transverse momentum are measured in regions of the azimuth deﬁned relative to the Z boson direction. The measured distributions are compared with the predictions of various Monte Carlo generators which implement diﬀerent underlying-event models. The Monte Carlo model predictions qualitatively describe the data well, but with some signiﬁcant discrepancies.


Introduction
A typical proton-proton (pp) collision studied at the LHC consists of a short-distance hard-scattering process and accompanying activity collectively termed the underlying event (UE). The hard scattering processes have a momentum transfer sufficiently large that the strong coupling constant is small and the cross-section may be calculated perturbatively in quantum chromodynamics (QCD). The driving mechanisms for the production of the UE are at a much lower momentum scale. These mechanisms include partons not participating in the hard-scattering process (beam remnants), radiation processes and additional hard and semi-hard scatters in the same pp collision, termed multiple parton interactions (MPI). Phenomenological models are required to describe these processes using several free parameters determined from experiment. In addition to furthering the understanding of the proton's internal structure and the related soft-QCD processes, accurate modelling of the UE is crucial for many data analyses at a hadron collider, either to precisely determine Standard Model quantities or to search for new particles and interactions.
The UE is not distinguishable from the hard scatter on an event-by-event basis. However, there are observables which are sensitive to the UE properties, as first introduced by the CDF Collaboration in proton-antiproton (pp) collisions at a centre-of-mass energy of 1.8 TeV [1]. An example of such an observable can be defined by topological considerations, based on the activity measurement in the direction transverse 1 to a reference object.
The object in the event with the leading transverse momentum relates the UE activity to the scale of the momentum transfer in the hard interaction. In general, processes with leptonic final states like Drell-Yan events are experimentally clean and theoretically well understood, allowing reliable identification of the particles from the UE. The absence of QCD final-state radiation (FSR) permits a study of different kinematic regions with varying transverse momenta of the Z boson due to harder or softer initial-state radiation (ISR).
Previous measurements of distributions sensitive to the properties of the UE in Drell-Yan events were performed in pp collisions at a centre-of-mass energy of 7 TeV by the ATLAS [2] and CMS [3] Collaborations and at a centre-of-mass energy of 13 TeV by the CMS Collaboration [4]. Both measurements at √ s = 7 TeV verified that the dependence of the UE activity on the dimuon invariant mass is qualitatively well described by the P +P 8 and Herwig++ sets of tuned parameters but with some significant discrepancies. Reference [2] provides distributions which are sensitive to the choice of parameters used in the various UE models.
This paper presents distributions of four observables sensitive to the UE in events containing a Z boson produced in pp collisions at a centre-of-mass energy of 13 TeV in the ATLAS detector at the LHC, where the singly produced Z boson decays into µ + µ − . Observables measured as a function of the transverse momentum of the Z boson, p Z T , in various regions of phase space are compared with predictions from several Monte Carlo (MC) event generators.

Underlying-event observables and measurement strategy
Events containing two muons originating from the decay of a singly produced Z boson form a particularly interesting sample for studying the UE. The final-state Z boson is well-identified and colour neutral, so that interaction between the final-state leading particle and the UE is minimal. Gluon radiation from the quarks or gluons initiating the hard scatter are, however, an important consideration as these give the remainder of the event a non-zero transverse momentum and change the kinematics of the final-state. Observables are therefore measured in different regions of the transverse plane, which are defined relative to the direction of the Z boson as illustrated in Figure 1.
A charged particle lies in the away region if its azimuthal angle relative to the Z boson direction |∆φ| is greater than 120 • . This region is heavily dominated by the hadronic recoil against the Z boson from initial state quark/gluon radiation and is therefore not particularly sensitive to the UE. The toward (|∆φ| ≤ 60 • ) and transverse (60 • < |∆φ| ≤ 120 • ) regions contain less contamination from the hard process after subtraction of the two muons from the Z boson. The transverse region is sensitive to the UE because, by construction, it is perpendicular to the direction of the Z boson and hence is expected to have a lower level of activity from the hard-scattering process than the away region. The two transverse regions are differentiated on an event-by-event basis by their scalar sum of charged-particle p T . The one with the larger sum is labelled trans-max and the other trans-min [5,6]. The trans-min region is highly sensitive to the UE activity because it is less likely that activity from recoiling jets leaks into this region.
Four distributions are studied to understand the UE activity. The first is the charged-particle transverse momentum dN ch /dp ch T distribution inclusive over all selected particles. The final spectrum for this variable is accumulated over all events and then normalized. The next three are evaluated on an event-by-event basis: the charged-particle multiplicity dN ev /d(N ch /δηδφ), the scalar sum of the transverse momentum of those particles dN ev /d(Σp T /δηδφ), and the mean transverse momentum dN ev /d(mean p T ), where mean p T is the quotient of Σp T and N ch (provided N ch > 0 in the corresponding region). The distributions of these variables are produced separately for charged particles lying in each of the regions described above. The charged-particle multiplicity and the scalar sum of transverse momenta are normalized relative to the area of the corresponding region in the η-φ space. This simplifies the comparison of the activity in different regions. The distributions are distinguished in different ranges of the Z boson transverse momentum p Z T and for two regions of transverse thrust T ⊥ [7]. Transverse thrust characterizes the topology of the tracks in the event and is The thrust axisn is the unit vector which maximizes T ⊥ . Here the summation is done on an event-by-event basis over the transverse momenta p T of all charged particles except the two muons. Transverse thrust has a maximum value of 1 for a pencil-like dijet topology and a minimum value of 2/π for a circularly symmetric distribution of particles in the transverse plane, as illustrated in Figure 1. As proposed in Ref.
[8], events with lower values of T ⊥ are more sensitive to the MPI component of the UE. The two regions of thrust examined in this paper are T ⊥ ≤ 0.75 and T ⊥ > 0.75, which are optimized to distinguish extra jet activity from the actual UE activity. A measurement of transverse thrust in combination with the UE activity was done at √ s = 7 TeV [9], but it did not distinguish the transverse regions.
In this paper, all measurements are also performed inclusively in T ⊥ . In total, the spectra of the four observables are measured in 96 regions of phase space, i.e. in eight bins of p Z T ; in the away, toward, trans-max, and trans-min regions; and for low, high, and inclusive T ⊥ . The bin boundaries in p Z T are (0, 10,20,40,60,80,120,200,500) GeV. In addition to distributions of the four observables, the arithmetic means N ch , Σp T , and mean p T are evaluated as functions of p Z T in each of the various regions of phase space.

Data and simulated event samples
Data recorded in 2015 with the ATLAS detector at the LHC in proton-proton collisions at a centre-of-mass energy of 13 TeV are used in this analysis. The data set corresponds to an integrated luminosity of 3.2 fb −1 . Only events recorded when the detector was fully operational are considered.
Simulated MC events are used both to estimate the contamination from background processes in data and to correct the measured data for detector inefficiency and resolution effects (Section 6.1).
The Z → µµ signal process was simulated using the next-to-leading-order P [14,15] event generator with the CT10 set of parton distribution functions (PDFs) [16] and interfaced to the P 8.170 event generator [17,18] to simulate the parton shower, hadronization and UE with the CTEQ6L1 PDF set and the AZNLO set of tuned parameters [19]. The latter option tunes the event generator to the p Z T measurement at √ s = 7 TeV [19]. Hence, it retunes the overall UE activity by adjusting the P MPI cut-off parameter to the UE activity of the previous measurement [2] in the lowest p Z T bin (0 to 5 GeV). P [20] was used to simulate final-state electromagnetic radiation. The P generator uses p T -ordered parton showers and a hadronization model based on the fragmentation of colour strings. Its MPI model interleaves the ISR and FSR emissions with MPI scatters. An alternative signal sample used for cross-checks and systematic uncertainty evaluations was simulated using S 2.2.0 [21], which has an independent implementation of the parton shower, hadronization, UE and FSR. The S samples utilize the NNPDF30NNLO PDF set [22] and were generated with the nominal tune set of version 2.2.0. The S generator uses leading-order matrix elements with a model for MPI similar to that of P 8 but without interleaving the FSR. It implements a cluster hadronization model similar to that of Herwig++. S and P impose the infrared cut-off for MPI as a smooth function. In contrast, Herwig++ implements it as a step function. A signal sample produced with the MC generator Herwig++ [23] using the UE-EE-5 tune [24] provided by the generator's authors and the corresponding CTEQ6L1 PDF set is compared with unfolded data in Section 7. This tuning uses energy extrapolation and was developed to describe the UE and double parton interaction effective cross-section. Herwig++ uses, similarly to P , a leading-logarithm parton shower model matched to leading-order matrix element calculations, but it implements a cluster hadronization scheme with parton showering ordered by emission angle.
Three sources of background are estimated using MC samples: Z → ττ, WW → µν µν, and the tt process, each of which was simulated using P [25, 26] interfaced to P 8 or P 6 for tt. The P tune set for Z → ττ and WW → µν µν is the same as was used for the signal process (AZNLO). The Perugia 2012 [27] tune set was used for simulation of the tt process.
Overlaid MC-generated minimum-bias events [28] simulate the effect of multiple interactions in the same bunch crossing (pile-up). These samples were produced with P 8 using the A2 tune set [29] in combination with the MSTW2008LO PDF set. The A2 tune set was matched to the ATLAS minimum-bias measurement at √ s = 7 TeV [30]. The mean number of interactions per bunch crossing µ during the 2015 data-taking with 25 ns bunch spacing was 13.5. The simulated samples are reweighted to reproduce the distribution of the number of interactions per bunch crossing observed in the data.
The G 4 [31] program simulated the passage of particles through the ATLAS detector. Differences in muon reconstruction, trigger, and isolation efficiencies between MC simulation and data are evaluated using a tag-and-probe method [32], and the simulation is corrected accordingly. Additional factors applied to the MC events correct for the description of the muon energy and momentum scales and resolution, which are determined from fits to the observed Z boson line shapes in data and MC simulations [32]. Finally, correction factors adjust the distribution of the longitudinal position of the primary pp collision vertex [33] to the one observed in the data.

Event and track selection
Candidate Z → µµ events are selected by requiring that at least one out of two single-muon triggers be satisfied. A high-threshold trigger requires a muon to have p T > 40 GeV, whilst a low-threshold trigger requires p T > 20 GeV and the muon to be isolated from additional nearby tracks. All events are required to have a primary vertex (PV). The PV is defined as the reconstructed vertex in the event with the highest Σp T of the associated tracks, consistent with the beam-spot position (spatial region inside the detector where collisions take place) and with at least two associated tracks with p T > 400 MeV.
The main selections to define the regions of phase space are summarized in Table 1. The reconstruction procedure for muon candidates combines tracks reconstructed in the inner detector with tracks reconstructed in the MS [32]. The reconstructed muons are required to have p T > 25 GeV and |η| < 2.4. Track quality requirements are imposed to suppress backgrounds, and the muon candidate is required to be isolated using a p T -and η-dependent 'gradient' isolation criterion [32] based on track and calorimeter information. Muon candidates consistent with having originated from the decay of a heavy quark are rejected by requiring the significance of the transverse impact parameter (|d 0 /σ(d 0 )|, with d 0 representing the transverse impact parameter and σ(d 0 ) the related uncertainty) to be below 3. Furthermore, the muon candidates must be associated to the PV, i.e. the longitudinal (|z 0 sin θ|) impact parameter is less than 0.5 mm. The variables d 0 and z 0 are measured relative to the PV.
Events are required to have exactly two opposite-charged muons satisfying the selection criteria above. The invariant mass of the dimuon system must be between 66 GeV and 116 GeV.
Tracks reconstructed in the ID from the passage of charged particles are used to form the UE observables. Each reconstructed track is required to have p T > 0.5 GeV, |η| < 2.5, one hit in the innermost layer is required (if expected) and in total at least one hit in the pixel detector and at least six hits in the SCT. The tracks must have been assigned to the PV, i.e. the transverse and longitudinal impact parameters of the tracks relative to the PV must be smaller than 2 mm and 1.5 mm respectively. An additional requirement on the quality of the fit of the track to the hits in the detector applies to tracks with p T > 10 GeV in order to suppress mismeasured tracks at high p T . This criterion affects mainly the tracks associated with the muon candidates and has little impact on the predominantly low-p T tracks of the UE activity.
The kinematics of the Z boson and of the charged particles in the event define the phase space of the fiducial region (particle level). This closely reflects the selection made on measured detector quantities outlined before. Simulated events are required to have two prompt muons that satisfy p T > 25 GeV and |η| < 2.4 with each muon defined at the 'bare' level (after final-state QED radiation). The measurements are all reported in bins of p Z T , the results presented in this paper are not sensitive to the predicted shape of the p Z T spectrum, even though they are sensitive to jet activity in the event. As a cross-check the observables are constructed as defined before but the muons are unfolded to the 'dressed' level (i.e. collinear QED FSR is added to the 'bare' level muons) similar to the previous UE measurement in Z events [2]. The difference between the results after unfolding to different generator levels is below the percent level and is less than the uncertainty related to the unfolding procedure. Charged particles must be stable, i.e. have a proper lifetime with cτ > 10 mm, with p T > 0.5 GeV. and |η| < 2.5.
The statistical uncertainties of the data and the MC simulations are propagated using the bootstrap method [34]. While the statistical error of the data is the limiting factor for all distributions at high p Z T , it does not limit the measurements in phase-space regions of lower p Z T , which are particularly important for tuning MC simulations. Table 1: A summary of the fiducial volume definition of the measurement, the particle-level definition, and the main observables. The first row lists selection criteria for the signal muons (indicated with an µ as superscript) limited by the detector geometry, while the cut on the dimuon invariant mass m yields a low background contamination. 6 Corrections and systematic uncertainties

Unfolding
An iterative Bayesian unfolding technique is used to correct the data for detector inefficiencies and resolution [35][36][37]. Response matrices connect each observable at the detector and particle levels; these are constructed using the P +P 8 signal MC sample which is overlayed with pile-up events at detector level. Each response matrix corresponds to a bin of p Z T or thrust, with the migration of events between p Z T or thrust bins corrected using a per-bin purity correction factor. In the context of MC simulations, the purity of one bin is defined as the fraction of events that are reconstructed in the same bin as the original particle level quantity. The bin intervals in p Z T and thrust are chosen to yield high purities (> 0.9 for the bins in p Z T and > 0.85 for the two bins in T ⊥ ) enabling the per-bin corrections. For the observable dN ch /dp ch T , two unfolding iterations are sufficient for convergence of the unfolding results, while for all other observables eight iterations are performed. The evaluation of the mean value of each observable in a bin of p Z T and thrust occurs after unfolding. The bin boundaries are the same at both the detector and particle levels.

Background subtraction
The background contributions to the selected data from the Z → ττ, tt, and WW → µν µν processes are estimated using MC simulations. In total, these are about 0.7% of selected data events. This fraction varies from 0.9% for the lowest bin in p Z T to the per mille level for the highest p Z T bin. The background contribution from multijet processes is estimated using a data-driven technique based on the isolation and charge of the two reconstructed muons, similar to previous analyses [2]. The size of the multijet contribution in the data is less than 0.1%. The unfolding of the data is done after the subtraction of all MC and data-driven background estimates.

Systematic uncertainties
Systematic uncertainties can arise due to possible mismodelling of the muon momentum scale or resolution, as well as the reconstruction, identification, and isolation efficiencies. Furthermore, limited knowledge of the ID material distribution [38] dominates the uncertainties in the track reconstruction inefficiencies. Also the effect of falsely reconstructed tracks (when there is no corresponding charged particle) contributes to all observables.
All uncertainties related to imperfect modelling of the detector are assessed using MC simulations. The data are first unfolded using the nominal MC simulation samples. Then the data are unfolded with MC samples where the parameter of the simulation which is affected by the mismodelling is varied by ±1σ of its estimated uncertainty. The average of the up and down shifts is assigned as the corresponding systematic uncertainty.
Since the observables are primarily track-based, the track-related systematic uncertainties dominate the total detector-related uncertainty. These are of the order of 2% regardless of the observable and region. Systematic uncertainties related to the muon reconstruction are a negligible fraction of the overall uncertainty.
Uncertainties due to mismodelling of the background processes are also considered. For the background processes modelled with MC simulations, the electroweak background normalization is varied by ±5% and the tt background normalization by ±15% (approximately within their theoretical uncertainties [39,40]) and the effect on the final measurements is estimated. The full effect of including the multijet background or not is taken as an uncertainty. The combined background-related uncertainties form a negligible fraction of the total systematic uncertainty. The dependence of the background uncertainty on p Z T is negligible for this measurement.
An important consideration for these measurements is the modelling of the pile-up, since the MC simulations must correct for contamination from pile-up tracks through the unfolding procedure. When averaging over all simulated events about 13% of the selected tracks which are compatible with the primary vertex originate from pile-up.
A variation in the pile-up reweighting of the MC simulations is included to cover the uncertainty on the ratio between the predicted and measured inelastic cross-section in the fiducial volume defined by M X > 13 GeV where M X is the mass of the hadronic system [41]. The value of µ assumed in the MC simulations for the unfolding process is varied by ±9% from the nominal value. This uncertainty in the pile-up modelling is one of the largest sources of systematic uncertainty in the tails of the distributions of p T , N ch , Σp T , and mean p T , and for the mean distributions. The uncertainties related to the inaccuracies of the detector and pile-up modelling are combined and referred to as the 'Detector' uncertainty in the following figures.
Two additional cross-checks validate the pile-up modelling and the consistency of removing the pile-up effects via the unfolding technique. First, the unfolding procedure for all observables in all measurement bins is repeated for three intervals of µ , namely [8-10], [11][12][13] and [14][15][16]. A mismodelling of pile-up in MC simulations would manifest itself less in the interval of 8 ≤ µ ≤ 10 and more in the interval of 14 ≤ µ ≤ 16. The unfolded results for the three intervals are found to be fully compatible within their associated statistical uncertainties, confirming the consistency of the handling of pile-up in the unfolding process.
Secondly, a complementary data-driven technique based on the Hit Backspace Once More (HBOM) method [42] is used. The intention is to reproduce pile-up contaminations as realistically as possible. Hence, the track information associated with non-primary vertices in the data is bundled to form a pile-up library. A random sample is drawn from this library and used as an example of pile-up effects in data. If this random sample is added to an individual event, the pile-up effect increases. A sampling of the library is subsequently used to pollute events with additional pile-up. Six iterations of pollution are applied, i.e. up to six random samples from the pile-up library are added to each event. Then the observables are constructed from these additionally contaminated events. Assuming the values of the observables evolve smoothly with each iteration of additional pile-up, an extrapolation in each bin to the value with zero pile-up vertices yields the HBOM estimate of pile-up subtracted data. The data are subsequently unfolded using a version of the P +P signal MC samples without pile-up vertices. The results obtained using this method are consistent with the nominal procedure, and no additional uncertainty is assigned.
The uncertainty associated with the unfolding technique is evaluated using a data-driven method. It accounts for the dependence of the unfolding on the usage of prior knowledge from the MC simulation, i.e. the particle level quantities. The ratio of data to simulation at detector-level is evaluated and smoothed for each observable. The smoothed ratio is then used to reweight the simulations by applying the event-weight according to the particle level quantity. The reweighted detector-level distribution is then unfolded using the regular response matrix. The relative difference between the reweighted particle-level distribution and the reweighted and unfolded detector-level distribution is treated as a systematic uncertainty. This dependence on prior knowledge from the MC simulation is the dominant systematic uncertainty in most distributions at lower values of p Z T . An additional method of estimating the uncertainty related to the unfolding is to unfold the detector-level MC distributions generated with S using the unfolding matrices based on the P +P MC sample. The results are compared with the particle level quantities predicted by S . After taking the uncertainty due to the MC prior into account, a slight discrepancy between the unfolded S sample and the particle-level distributions remains. Therefore, an additional contribution to the MC prior uncertainty is introduced to cover this remaining non-closure of the unfolded result and the S generator level. In general, it does not exceed the 2-4% level and is smoothed over the full range of the observable. In a few cases, this non-closure component dominates the MC prior uncertainty. These two separate unfolding uncertainties are added in quadrature in all figures.
All sources of systematic uncertainty are considered uncorrelated and are combined in quadrature. The MC prior uncertainty is one of the largest contributors to the total systematic uncertainty at all values of p T and in each p Z T region. The statistical uncertainty of the data rises with increasing p Z T , contributing a significant fraction of the overall uncertainty. The breakdown of the individual sources of uncertainties for the four observables, p T , N ch , Σp T , and mean p T is illustrated in Figure 2 for the example of events with 10 < p Z T < 20 GeV in the trans-min region (the region most sensitive to the UE), inclusively in T ⊥ . Figure 3 shows the systematic uncertainties in the arithmetic mean of the N ch and Σp T spectra in the trans-min region as a function of p Z T inclusively in T ⊥ . The largest contributions to the total systematic uncertainties of the mean distributions at all p Z T values come from either the MC prior uncertainty or the track-related uncertainties. The statistical uncertainties of the data become large for p Z T greater than around 200 GeV.

Unfolded observables and comparison with model predictions 7.1 Overview of the results
Distributions of p T , N ch , Σp T , and mean p T are obtained in slices of p Z T for the different regions defined in the transverse plane and different regions of T ⊥ . The results for N ch and Σp T are normalized relative to the area of the region in η and φ. In addition to the measurements in slices of p Z T , the arithmetic means of N ch , Σp T , and mean p T ( N ch , Σp T , and mean p T ) are measured as a function of p Z T . Only a selection of the most relevant results is discussed in this section: the comparison of the unfolded data to the predictions of different MC generators focuses on the trans-min region. While the toward region provides insights of similar importance for tuning MC generators after having removed the two muons, the discussion focuses on the trans-min region to better facilitate comparison with previous measurements. The UE activity in the toward region is higher compared with that in trans-min. This is expected since the trans-min region is defined as the subregion of the transverse region with the lower activity and for Z → µµ events the UE activity is expected to be of similar magnitude in the toward and transverse regions. The trans-min region is statistically less affected by radiation and it is essentially the region where the contribution from ISR is subtracted. Apart from this difference in the amount of activity, the predictive performance of the different MC generators is comparable in the toward and trans-min regions. No significant difference in the predictive power between these regions is observed. Both N ch and Σp T measured in the trans-min are compared with previous measurements of the UE in Z boson events at lower centre-of-mass energies.

Differential distributions
Figures 4 and 5 show the unfolded p T spectrum, N ch , Σp T , and mean p T for the trans-min region inclusively in T ⊥ for events with p Z T between 10 and 20 GeV and between 120 and 200 GeV. The predictions from P +P , S , and Herwig++ are compared with the data. The ratio of prediction to data is shown beneath each plot. None of the tested MC generators describes all aspects of the data well and in some regions the differences exceed the 70% level. Generally, the MC generators predict a higher number of particles with small p T than is observed in data (see top left of Figures 4 and 5). This is consistent with the MC predictions tending to lower values of mean p T , as is shown on the lower right plots of Figures 4 and 5. The largest differences between data and simulation are at low N ch and low Σp T , and arise due to the steeper transverse momentum spectrum of charged particles in MC simulations. P +P and S predict a higher fraction of events with fewer charged particles and a consistently smaller sum of p T . However, Herwig++ slightly overestimates the fraction of particles with p T > 2.5 GeV and is qualitatively closer to the shape of the distributions of N ch and Σp T . With rising p Z T , the data p T spectrum becomes harder, and N ch , Σp T , and mean p T increase. The relative discrepancy remains the same in comparisons with the generator predictions.
The dependence on T ⊥ is illustrated in Figure 6 for the unfolded p T spectrum in the trans-min region for events with 10 < p Z T < 20 GeV and 120 < p Z T < 200 GeV. Similar to the results for the measurement inclusive in T ⊥ , the MC generators predict a higher fraction of particles with low p T than present in data. The predictions of P +P are closer to the measured distributions in the lower p Z T region, but S describes better the full p T range in the higher p Z T bin. The Herwig++ simulations have significant statistical fluctuations at higher p T . The most striking difference between the different regions in T ⊥ is observed for the P +P generator when focusing on the low p Z T bins for N ch as presented in Figure 7. In MPI-sensitive regions (left plot in Figure 7) the distribution of N ch by P +P is   Figure 4: Measured spectra of p T (upper left), the charged-particle multiplicity, N ch (upper right), the scalar sum of the transverse momentum of those particles, Σp T , (lower left) and the mean transverse momentum, mean p T (lower right) in the trans-min region inclusively in T ⊥ for events with 10 < p Z T < 20 GeV. Predictions of P +P , S . and Herwig++ are compared with the data. The ratios shown are predictions over data.
shifted towards higher numbers of charged-particles relative to the data, i.e. overshooting the data in the range 1 ≤ N ch /δηδφ ≤ 2.5. But in the high thrust region (right plot) the MC generator underestimates the data almost over the full range except for the first two bins. In contrast, the performances of S and Herwig++ are consistent when comparing the low and high thrust regions for N ch ; Herwig++ overestimates N ch , and S underestimates it. The same effect is observed for the distributions of Σp T but is less significant and therefore not presented. As pointed out in Ref.
[8], the regions of high values of T ⊥ are dominated by extra jet activity which is not adequately modelled in P +P , as shown in the right plots in Figures 6 and 7. , and Herwig++ are compared with the data. The ratios shown are predictions over data.

Underlying-event activity as a function of p Z
T Figure 8 shows the mean number of charged particles and the mean of the scalar sum of the transverse momenta of those particles per unit η-φ space as a function of p Z T in the transverse, trans-min, and trans-max regions inclusively in T ⊥ . The trans-min region is further separated by T ⊥ in the right plots of Figure 8. In the trans-min region, the UE-sensitive variables N ch and Σp T rise slowly with increasing Z boson transverse momentum. In contrast, the observables in the trans-max region have a strong dependence on p Z T . This is because it is heavily contaminated with the Z boson hadronic recoil leaking into the transverse region. The slope of the UE activity in the trans-min region as a function of p Z T for events of high T ⊥ is similar to the inclusive measurement. The total amount of activity measured in the trans-min region for events with high T ⊥ is lower than the inclusive measurement due to the correlation of activity in the transverse region and T ⊥ . Furthermore, the right-hand plots of Figure 8 demonstrate that the UE activity is higher for events with lower T ⊥ , as expected [8]. Lower values of T ⊥ also increase the dependence on p Z T in the trans-min region.
The MC modelling of individual measurements in all 96 phase-space regions is further investigated by comparing the measured arithmetic means of the N ch , Σp T , and mean p T as functions of p Z T . Figures 9 and  10 show comparisons with the predictions of P +P , S , and Herwig++ for the trans-min and towards regions inclusively in T ⊥ . The predictions fail to describe the data in either of the regimes. For p Z T > 20 GeV, Herwig++ predicts a slower rise in UE activity with rising p Z T than in the measured distributions. On the other hand, P +P and S qualitatively describe the 'turn-on' effect of the UE activity, i.e. a steeper slope at low p Z T which vanishes at higher values of p Z T . For P +P , the rise of the UE activity is underestimated, and hence the discrepancy with data grows with p Z T and stabilizes around p Z T = 100 GeV. Only in the toward region of the mean of the mean p T is S in good agreement with the data.
The p Z T dependence for the two regions of T ⊥ in the trans-min region is summarized in Figures 11 and 12. In the low T ⊥ region, the prediction by S improves, e.g. for N ch the discrepancy shrinks from about 30% to roughly 10%. Referring to the same observable, P +P is in agreement with data for p Z T > 80 GeV in the low T ⊥ regime within the uncertainties. For the selection on high T ⊥ all generators underestimate the UE activity. S provides the best description of the data in mean p T . Apart from the toward region, it tends to a constant underestimation but agrees with the overall shape. The agreement of P +P with data is better for T ⊥ < 0.75 than for the inclusive measurement. The predictions of Herwig++ in the trans-min region improve with higher values of p Z T and also in events of lower T ⊥ . However, the discrepancy between Herwig++ and the data in the lowest bins remains regardless of the selected region. Figure 13 presents a comparison of the measured N ch and Σp T for different centre-of-mass energies. The results for √ s = 7 TeV are taken from the previous ATLAS measurement of the UE activity in Z boson events [2]. The event selection criteria are similar to the analysis presented in this paper, but the previous measurement also includes the Z → e + e − channel. The CDF measurements at

Discussion and conclusion
Measurements of four observables sensitive to the activity of the UE in Z → µµ events are presented using 3.2 fb −1 of √ s = 13 TeV pp collision data collected with the ATLAS detector at the LHC in 2015. Those observables are the p T of charged particles, the number of charged particles per event (N ch ), the sum of charged-particle p T per event (Σp T ), and the mean of charged-particle p T per event (mean p T ). They are measured in intervals of the Z boson p T and in different azimuthal regions of the detector relative to the Z boson direction. The arithmetic means of the distributions are plotted as functions of the Z boson p T , inclusively of and in regions of transverse thrust.
The predictions from three Monte Carlo generators (P +P 8, S and Herwig++) are compared with the data. In general, all tested generators and tunes show significant deviations from the data distributions regardless of the observable. The arithmetic means of the observables deduced from the predictions of P +P 8 and S match the main features of the UE activity in the fiducial region. The turn-on effect, i.e. the rising activity as a function of the hard-scatter scale (here p Z T ), is visible as is a saturation of this effect for higher values of p Z T . In contrast to the other generators, Herwig++ fails to reproduce the turn-on effect at low p Z T as it predicts that the UE activity decreases as a function of p Z T when considered only in the p Z T < 20 GeV region. Otherwise, all generators underestimate the activity of the UE when quantified as the arithmetic mean of the observables for inclusive T ⊥ . The generators predict the mean values better in comparison with the data when focusing on the MPI-sensitive regions. P +P 8 is in agreement with data within the uncertainties for N ch and Σp T , indicating an adequate handling of the MPI activity. However, since the predictive power shrinks for the region with T ⊥ > 0.75 in comparison with the inclusive measurement, the simulation of contributions other than MPI to the UE activity needs to be improved. Reference [8] points out that the region with T ⊥ > 0.75 is dominated by extra jet activity, giving a first indication for a possible improvement of the MC generator prediction. This conclusion is valid when focusing on P +P 8 for different regions of T ⊥ for individual bins of p Z T . In comparison with the measurements at √ s = 7 TeV [2], the performance of Herwig++ is consistent for p Z T > 20 GeV. Both measurements use the energy-extrapolation tunes [24] provided by the Herwig++ authors, i.e. UE-EE-3 for √ s = 7 TeV and in the analysis presented here UE-EE-5. The latter tune was additionally validated against Tevatron and LHC measurements at √ s = 900 GeV and √ s = 7 TeV [44]. The prediction of Herwig++ is slightly better for the distributions of N ch and Σp T at higher values of p Z T . In the previous measurements, the divergence increased with p Z T , which might be related to improper modelling of the impact parameter. Apart from overestimating the mean activity, Herwig++ improved relative to the √ s = 7 TeV measurements in the description of the shape of dN ev /d(Σp T /δηδφ), dN ev /d(mean p T ), and dN ev /d(N ch /δηδφ) in the presented p Z T -bins. Qualitatively it performs better than the other generators. P +P 8 performs as well at √ s = 13 TeV as it does at √ s = 7 TeV, but is tuned with AU2 (only the MPI part was tuned by ATLAS using √ s = 7 TeV UE data) in the previous measurements. Nevertheless, this indicates that the MPI energy extrapolation of P 8 works well, which is in agreement with the better description for distributions at low T ⊥ .
In contrast, while at √ s = 7 TeV S version 1.4.0 with the CT10 PDF set consistently overestimates the UE activity metrics N ch and Σp T by 5% to 15%, the present analysis and S version reveal a continuous underestimation. At    [6] J. Pumplin The ATLAS Collaboration