Measurement of distributions sensitive to the underlying event in inclusive Z boson production in pp collisions at s\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sqrt{s}$$\end{document} = 13 TeV with the ATLAS detector

This paper presents measurements of charged-particle distributions sensitive to the properties of the underlying event in events containing a Z\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Z$$\end{document} boson decaying into a muon pair. The data were obtained using the ATLAS detector at the LHC in proton–proton collisions at a centre-of-mass energy of 13 Te\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\text {Te}\text {V}$$\end{document} with an integrated luminosity of 3.2fb-1\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3.2~\text{ fb }^{-1}$$\end{document}. Distributions of the charged-particle multiplicity and of the charged-particle transverse momentum are measured in regions of the azimuth defined relative to the Z\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Z$$\end{document} boson direction. The measured distributions are compared with the predictions of various Monte Carlo generators which implement different underyling event models. The Monte Carlo model predictions qualitatively describe the data well, but with some significant discrepancies.


Introduction
A typical proton-proton ( pp) collision studied at the LHC consists of a short-distance hard-scattering process and accompanying activity collectively termed the underlying event (UE). The hard-scattering processes have a momentum transfer sufficiently large that the strong coupling constant is small and the cross-section may be calculated perturbatively in quantum chromodynamics (QCD). The driving mechanisms for the production of the UE are at a much lower momentum scale. These mechanisms include partons not participating in the hard-scattering process (beam remnants), radiation processes and additional hard and semi-hard scatters in the same pp collision, termed multiple parton interactions (MPI). Phenomenological models are required to describe these processes using several free parameters determined from experiment. In addition to furthering the understanding of the proton's internal structure and the related soft-QCD processes, accurate modelling of the UE is crucial for many data analyses at a hadron collider, either to precisely determine Standard Model quantities or to search for new particles and interactions.
The UE is not distinguishable from the hard scatter on an event-by-event basis. However, there are observables which are sensitive to the UE properties, as first introduced by the CDF Collaboration in proton-antiproton ( pp) collisions at a centre-of-mass energy of 1.8 TeV [1]. An example of such an observable can be defined by topological considerations, based on the activity measurement in the direction transverse 1 to a reference object. 1 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the zaxis along the beam pipe. The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upwards. Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the z-axis. The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2). Angular distance is measured in units of R ≡ ( η) 2 + ( φ) 2 . The object in the event with the leading transverse momentum relates the UE activity to the scale of the momentum transfer in the hard interaction. In general, processes with leptonic final states like Drell-Yan events are experimentally clean and theoretically well understood, allowing reliable identification of the particles from the UE. The absence of QCD final-state radiation (FSR) permits a study of different kinematic regions with varying transverse momenta of the Z boson due to harder or softer initial-state radiation (ISR).
Previous measurements of distributions sensitive to the properties of the UE in Drell-Yan events were performed in pp collisions at a centre-of-mass energy of 7 TeV by the ATLAS [2] and CMS [3] Collaborations and at a centreof-mass energy of 13 TeV by the CMS Collaboration [4]. Both measurements at √ s = 7 TeV verified that the dependence of the UE activity on the dimuon invariant mass is qualitatively well described by the Powheg+Pythia8 and Herwig++ sets of tuned parameters but with some significant discrepancies. Reference [2] provides distributions which are sensitive to the choice of parameters used in the various UE models.
This paper presents distributions of four observables sensitive to the UE in events containing a Z boson produced in pp collisions at a centre-of-mass energy of 13 TeV in the ATLAS detector at the LHC, where the singly produced Z boson decays into μ + μ − . Observables measured as a function of the transverse momentum of the Z boson, p Z T , in various regions of phase space are compared with predictions from several Monte Carlo (MC) event generators.

Underyling event observables and measurement strategy
Events containing two muons originating from the decay of a singly produced Z boson form a particularly interesting sample for studying the UE. The final-state Z boson is wellidentified and colour neutral, so that interaction between the final-state leading particle and the UE is minimal. Gluon radiation from the quarks or gluons initiating the hard scatter are, however, an important consideration as these give the remainder of the event a non-zero transverse momentum and change the kinematics of the final-state. Observables are therefore measured in different regions of the transverse plane, which are defined relative to the direction of the Z boson as illustrated in Fig. 1. A charged particle lies in the away region if its azimuthal angle relative to the Z boson direction | φ| is greater than 120 • . This region is heavily dominated by the hadronic recoil against the Z boson from initial state quark/gluon radiation and is therefore not particularly sensitive to the UE. The toward (| φ| ≤ 60 • ) and transverse (60 • < | φ| ≤ 120 • ) regions contain less contamination from the hard process after subtraction of the two muons from the Z boson. The transverse region is sensitive to the UE because, by construction, it is perpendicular to the direction of the Z boson and hence is expected to have a lower level of activity from the hard-scattering process than the away region. The two transverse regions are differentiated on an event-by-event basis by their scalar sum of charged-particle p T . The one with the larger sum is labelled trans-max and the other transmin [5,6]. The trans-min region is highly sensitive to the UE activity because it is less likely that activity from recoiling jets leaks into this region.
Four distributions are studied to understand the UE activity. The first is the charged-particle transverse momentum dN ch /d p ch T distribution inclusive over all selected particles. The final spectrum for this variable is accumulated over all events and then normalized. The next three are evaluated on an event-by-event basis: the charged-particle multiplicity dN ev /d(N ch /δηδφ), the scalar sum of the transverse momentum of those particles dN ev /d( p T /δηδφ), and the mean transverse momentum dN ev /d(mean p T ), where mean p T is the quotient of p T and N ch (provided N ch > 0 in the corresponding region). The distributions of these variables are produced separately for charged particles lying in each of the regions described above. The charged-particle multiplicity and the scalar sum of transverse momenta are normalized relative to the area of the corresponding region in the η-φ space. This simplifies the comparison of the activity in different regions. The distributions are distinguished in different ranges of the Z boson transverse momentum p Z T and for two regions of transverse thrust T ⊥ [7]. Transverse thrust characterizes the topology of the tracks in the event and is The thrust axisn is the unit vector which maximizes T ⊥ .
Here the summation is done on an event-by-event basis over the transverse momenta p T of all charged particles except the two muons. Transverse thrust has a maximum value of 1 for a pencil-like dijet topology and a minimum value of 2/π for a circularly symmetric distribution of particles in the transverse plane, as illustrated in Fig. 1. As proposed in Ref. [8], events with lower values of T ⊥ are more sensitive to the MPI component of the UE. The two regions of thrust examined in this paper are T ⊥ < 0.75 and T ⊥ ≥ 0.75, which are optimized to distinguish extra jet activity from the actual UE activity. A measurement of transverse thrust in combination with the UE activity was done at √ s = 7 TeV [9], but it did not distinguish the transverse regions.
In this paper, all measurements are also performed inclusively in T ⊥ . In total, the spectra of the four observables are measured in 96 regions of phase space, i.e. in eight bins of p Z T ; in the away, toward, trans-max, and trans-min regions; and for low, high, and inclusive T ⊥ . The bin boundaries in p Z T are (0, 10, 20, 40, 60, 80, 120, 200, 500) GeV. In addition to distributions of the four observables, the arithmetic means N ch , p T , and mean p T are evaluated as functions of p Z T in each of the various regions of phase space.

The ATLAS detector
The ATLAS detector [10][11][12] at the LHC covers nearly the entire solid angle around the collision point. It consists of an inner tracking detector (ID) surrounded by a thin superconducting solenoid, electromagnetic and hadronic calorimeters, and a muon spectrometer (MS) incorporating three large superconducting toroid magnets. The ID is immersed in a 2 T axial magnetic field and provides charged-particle tracking in the range |η| < 2.5. A high-granularity silicon pixel detector typically provides four measurements per track and is surrounded by a silicon microstrip tracker (SCT), which usually provides four three-dimensional measurement points per track. These silicon detectors are complemented by a transition radiation tracker, which enables radially extended track reconstruction up to |η| = 2.0.
The MS comprises separate trigger and precision tracking chambers which measure the deflection of muons in a magnetic field generated by superconducting air-core toroids. The precision chamber system covers the region |η| < 2.7 with three layers of monitored drift tubes, complemented by cathode-strip chambers in the forward region, where the background is highest. The muon trigger system covers the range |η| < 2.4 with resistive-plate chambers in the barrel and thin-gap chambers in the endcap regions.
A two-level trigger system is used to select interesting events [13]. The level-1 trigger is implemented in hardware and uses a subset of the muon spectrometer and calorimeter information to reduce the event rate to around 100 kHz. This is followed by a software-based trigger which runs offline reconstruction algorithms and reduces the event rate to approximately 1 kHz.

Data and simulated event samples
Data recorded in 2015 with the ATLAS detector at the LHC in proton-proton collisions at a centre-of-mass energy of 13 TeV are used in this analysis. The data set corresponds to an integrated luminosity of 3.2 fb −1 . Only events recorded when the detector was fully operational are considered.
Simulated MC events are used both to estimate the contamination from background processes in data and to correct the measured data for detector inefficiency and resolution effects (Sect. 6.1).
The Z → μμ signal process was simulated using the next-to-leading-order Powheg [14,15] [2] in the lowest p Z T bin (0 to 5 GeV). Photos [20] was used to simulate provided by the generator's authors and the corresponding CTEQ6L1 PDF set is compared with unfolded data in Sect. 7. This tuning uses energy extrapolation and was developed to describe the UE and double parton interaction effective cross-section. Herwig++ uses, similarly to Pythia, a leading-logarithm parton shower model matched to leadingorder matrix element calculations, but it implements a cluster hadronization scheme with parton showering ordered by emission angle.
Three sources of background are estimated using MC samples: Z → τ τ , W W → μνμν, and the tt process, each of which was simulated using Powheg [25,26] interfaced to Pythia8 or Pythia6 for tt. The Pythia tune set for Z → τ τ and W W → μνμν is the same as was used for the signal process (AZNLO). The Perugia 2012 [27] tune set was used for simulation of the tt process.
Overlaid MC-generated minimum-bias events [28] simulate the effect of multiple interactions in the same bunch crossing (pile-up). These samples were produced with Pythia 8 using the A2 tune set [29] in combination with the MSTW2008LO PDF set. The A2 tune set was matched to the ATLAS minimum-bias measurement at √ s = 7 TeV [30]. The mean number of interactions per bunch crossing μ during the 2015 data-taking with 25 ns bunch spacing was 13.5. The simulated samples are reweighted to reproduce the distribution of the number of interactions per bunch crossing observed in the data.
The Geant4 [31] program simulated the passage of particles through the ATLAS detector. Differences in muon reconstruction, trigger, and isolation efficiencies between MC simulation and data are evaluated using a tag-and-probe method [32], and the simulation is corrected accordingly. Additional factors applied to the MC events correct for the description of the muon energy and momentum scales and resolution, which are determined from fits to the observed Z boson line shapes in data and MC simulations [32]. Finally, correction factors adjust the distribution of the longitudinal position of the primary pp collision vertex [33] to the one observed in the data.

Event and track selection
Candidate Z → μμ events are selected by requiring that at least one out of two single-muon triggers be satisfied. A high-threshold trigger requires a muon to have p T > 40 GeV, whilst a low-threshold trigger requires p T > 20 GeV and the muon to be isolated from additional nearby tracks. All events are required to have a primary vertex (PV). The PV is defined as the reconstructed vertex in the event with the highest p T of the associated tracks, consistent with the beamspot position (spatial region inside the detector where collisions take place) and with at least two associated tracks with p T > 400 MeV.
The main selections to define the regions of phase space are summarized in Table 1. The reconstruction procedure for muon candidates combines tracks reconstructed in the inner detector with tracks reconstructed in the MS [32]. The reconstructed muons are required to have p T > 25 GeV and |η| < 2.4. Track quality requirements are imposed to suppress backgrounds, and the muon candidate is required to be isolated using a p T -and η-dependent 'gradient' isolation criterion [32] based on track and calorimeter information. Muon candidates consistent with having originated from the decay of a heavy quark are rejected by requiring the significance of the transverse impact parameter (|d 0 /σ (d 0 )|, with d 0 representing the transverse impact parameter and σ (d 0 ) the related uncertainty) to be below 3. Furthermore, the muon candidates must be associated to the PV, i.e. the longitudi-   Events are required to have exactly two opposite-charged muons satisfying the selection criteria above. The invariant mass of the dimuon system must be between 66 GeV and 116 GeV.
Tracks reconstructed in the ID from the passage of charged particles are used to form the UE observables. Each reconstructed track is required to have p T > 0.5 GeV, |η| < 2.5, one hit in the innermost layer is required (if expected) and in total at least one hit in the pixel detector and at least six hits in the SCT. The tracks must have been assigned to the PV, i.e. the transverse and longitudinal impact parameters of the tracks relative to the PV must be smaller than 2 mm and 1.5 mm respectively. An additional requirement on the qual-ity of the fit of the track to the hits in the detector applies to tracks with p T > 10 GeV in order to suppress mismeasured tracks at high p T . This criterion affects mainly the tracks associated with the muon candidates and has little impact on the predominantly lowp T tracks of the UE activity.
The kinematics of the Z boson and of the charged particles in the event define the phase space of the fiducial region (particle level). This closely reflects the selection made on measured detector quantities outlined before. Simulated events are required to have two prompt muons that satisfy p T > 25 GeV and |η| < 2.4 with each muon defined at the 'bare' level (after final-state QED radiation). The measurements are all reported in bins of p Z T , the results presented in this paper are not sensitive to the predicted shape of the p Z T spectrum, even though they are sensitive to jet activity in   Fig. 3 A summary of the systematic uncertainties in the arithmetic mean of the N ch and p T spectra in the trans-min region as a function of p Z T . Here 'Prior' combines the two approaches to estimate the unfolding-related uncertainties. 'Detector' includes the modelling of the detector and the pile-up conditions the event. As a cross-check the observables are constructed as defined before but the muons are unfolded to the 'dressed' level (i.e. collinear QED FSR is added to the 'bare' level muons) similar to the previous UE measurement in Z events [2]. The difference between the results after unfolding to different generator levels is below the percent level and is less than the uncertainty related to the unfolding procedure. Charged particles must be stable, i.e. have a proper lifetime with cτ > 10 mm, with p T > 0.5 GeV and |η| < 2.5.
The statistical uncertainties of the data and the MC simulations are propagated using the bootstrap method [34]. While the statistical error of the data is the limiting factor for all distributions at high p Z T , it does not limit the measurements in phase-space regions of lower p Z T , which are particularly important for tuning MC simulations.

Unfolding
An iterative Bayesian unfolding technique is used to correct the data for detector inefficiencies and resolution [35][36][37]. Response matrices connect each observable at the detector and particle levels; these are constructed using the Powheg+Pythia8 signal MC sample which is overlayed with pile-up events at detector level. Each response matrix corresponds to a bin of p Z T or thrust, with the migration of events between p Z T or thrust bins corrected using a per-bin purity correction factor. In the context of MC simulations, the purity of one bin is defined as the fraction of events that are reconstructed in the same bin as the original particle level quantity. The bin intervals in p Z T and thrust are chosen to yield high purities (> 0.9 for the bins in p Z T and > 0.85 for the two bins in T ⊥ ) enabling the per-bin corrections. For the observable dN ch /d p ch T , two unfolding iterations are sufficient for convergence of the unfolding results, while for all other observables eight iterations are performed. The evaluation of the mean value of each observable in a bin of p Z T and thrust occurs after unfolding. The bin boundaries are the same at both the detector and particle levels.

Background subtraction
The background contributions to the selected data from the Z → τ τ , tt, and W W → μνμν processes are estimated using MC simulations. In total, these are about 0.7% of selected data events. This fraction varies from 0.9% for the lowest bin in p Z T to the per mille level for the highest p Z T bin. The background contribution from multijet processes is estimated using a data-driven technique based on the isolation and charge of the two reconstructed muons, similar to previous analyses [2]. The size of the multijet contribution in the data is less than 0.1%. The unfolding of the data is done after the subtraction of all MC and data-driven background estimates.

Systematic uncertainties
Systematic uncertainties can arise due to possible mismodelling of the muon momentum scale or resolution, as well as the reconstruction, identification, and isolation efficiencies. Furthermore, limited knowledge of the ID material distribution [38] dominates the uncertainties in the track reconstruction inefficiencies. Also the effect of falsely reconstructed tracks (when there is no corresponding charged particle) contributes to all observables.

Fig. 4
Measured spectra of p T (upper left), the charged-particle multiplicity, N ch (upper right), the scalar sum of the transverse momentum of those particles, p T , (lower left) and the mean transverse momentum, mean p T (lower right) in the trans-min region inclusively in T ⊥ for events with 10 < p Z T < 20 GeV. Predictions of Powheg+Pythia, Sherpa and Herwig++ are compared with the data. The ratios shown are predictions over data All uncertainties related to imperfect modelling of the detector are assessed using MC simulations. The data are first unfolded using the nominal MC simulation samples. Then the data are unfolded with MC samples where the parameter of the simulation which is affected by the mismodelling is varied by ±1σ of its estimated uncertainty. The average of the up and down shifts is assigned as the corresponding systematic uncertainty.
Since the observables are primarily track-based, the trackrelated systematic uncertainties dominate the total detectorrelated uncertainty. These are of the order of 2% regardless of the observable and region. Systematic uncertainties related to the muon reconstruction are a negligible fraction of the overall uncertainty.
Uncertainties due to mismodelling of the background processes are also considered. For the background processes modelled with MC simulations, the electroweak background normalization is varied by ±5% and the tt background normalization by ±15% (approximately within their theoretical uncertainties [39,40]) and the effect on the final measurements is estimated. The full effect of including the multijet background or not is taken as an uncertainty. The combined background-related uncertainties form a negligible fraction of the total systematic uncertainty. The dependence of the

Fig. 5
Measured p T spectra (upper left), the charged-particle multiplicity N ch (upper right), the scalar sum of the transverse momentum of those particles p T (lower left), and the mean transverse momentum, mean p T (lower right) in the trans-min region inclusively in T ⊥ for events with 120 < p Z T < 200 GeV. Predictions of Powheg+Pythia, Sherpa, and Herwig++ are compared with the data. The ratios shown are predictions over data background uncertainty on p Z T is negligible for this measurement.
An important consideration for these measurements is the modelling of the pile-up, since the MC simulations must correct for contamination from pile-up tracks through the unfolding procedure. When averaging over all simulated events about 13% of the selected tracks which are compatible with the primary vertex originate from pile-up.
A variation in the pile-up reweighting of the MC simulations is included to cover the uncertainty on the ratio between the predicted and measured inelastic cross-section in the fiducial volume defined by M X > 13 GeV where M X is the mass of the hadronic system [41]. The value of μ assumed in the MC simulations for the unfolding process is varied by ±9% from the nominal value. This uncertainty in the pile-up modelling is one of the largest sources of systematic uncertainty in the tails of the distributions of p T , N ch , p T , and mean p T , and for the mean distributions. The uncertainties related to the inaccuracies of the detector and pile-up modelling are combined and referred to as the 'Detector' uncertainty in the following figures.
Two additional cross-checks validate the pile-up modelling and the consistency of removing the pile-up effects via the unfolding technique. First, the unfolding procedure for all observables in all measurement bins is repeated for three intervals of μ , namely [8][9][10], [11][12][13] and [14][15][16].  The uncertainty associated with the unfolding technique is evaluated using a data-driven method. It accounts for the dependence of the unfolding on the usage of prior knowledge from the MC simulation, i.e. the particle level quantities. The ratio of data to simulation at detector-level is evaluated and smoothed for each observable. The smoothed ratio is then used to reweight the simulations by applying the event-weight according to the particle level quantity. The reweighted detector-level distribution is then unfolded using the regular response matrix. The relative difference between the reweighted particle-level distribution and the reweighted and unfolded detector-level distribution is treated as a systematic uncertainty. This dependence on prior knowledge from the MC simulation is the dominant systematic uncertainty in most distributions at lower values of p Z T . An additional method of estimating the uncertainty related to the unfolding is to unfold the detector-level MC distributions generated with Sherpa using the unfolding matrices based on the Powheg+Pythia MC sample. The results are compared with the particle level quantities predicted by Sherpa. After taking the uncertainty due to the MC prior into account, a slight discrepancy between the unfolded Sherpa sample and the particle-level distributions remains. Therefore, an additional contribution to the MC prior uncertainty is introduced to cover this remaining non-closure of the unfolded result and the Sherpa generator level. In general, it does not exceed the 2-4% level and is smoothed over the full range of the observable. In a few cases, this non-closure component dominates the MC prior uncertainty. These two separate unfolding uncertainties are added in quadrature in all figures.
All sources of systematic uncertainty are considered uncorrelated and are combined in quadrature. The MC prior uncertainty is one of the largest contributors to the total sys-tematic uncertainty at all values of p T and in each p Z T region. The statistical uncertainty of the data rises with increasing p Z T , contributing a significant fraction of the overall uncertainty. The breakdown of the individual sources of uncertainties for the four observables, p T , N ch , p T , and mean p T is illustrated in Fig. 2 for the example of events with 10 < p Z T < 20 GeV in the trans-min region (the region most sensitive to the UE), inclusively in T ⊥ . Figure 3 shows the systematic uncertainties in the arithmetic mean of the N ch and p T spectra in the trans-min region as a function of p Z T inclusively in T ⊥ . The largest contributions to the total systematic uncertainties of the mean distributions at all p Z T values come from either the MC prior uncertainty or the track-related uncertainties. The statistical uncertainties of the data become large for p Z T greater than around 200 GeV.

Overview of the results
Distributions of p T , N ch , p T , and mean p T are obtained in slices of p Z T for the different regions defined in the transverse plane and different regions of T ⊥ . The results for N ch and p T are normalized relative to the area of the region in η and φ. In addition to the measurements in slices of p Z T , the arithmetic means of N ch , p T , and mean p T ( N ch , p T , and mean p T ) are measured as a function of p Z T . Only a selection of the most relevant results is discussed in this section: the comparison of the unfolded data to the predictions of different MC generators focuses on the trans-min region. While the toward region provides insights of similar importance for tuning MC generators after having removed the two muons, the discussion focuses on the trans-min region to better facilitate comparison with previous measurements. The UE activity in the toward region is higher compared with that in trans-min. This is expected since the trans-min region is defined as the subregion of the transverse region with the lower activity and for Z → μμ events the UE activity is expected to be of similar magnitude in the toward and transverse regions. The trans-min region is statistically less affected by radiation and it is essentially the region where the contribution from ISR is subtracted. Apart from this difference in the amount of activity, the predictive performance of the different MC generators is comparable in the toward and trans-min regions. No significant difference in the predictive power between these regions is observed. Both N ch and p T measured in the trans-min are compared with previous measurements of the UE in Z boson events at lower centre-of-mass energies. Figures 4 and 5 show the unfolded p T spectrum, N ch , p T , and mean p T for the trans-min region inclusively in T ⊥ for events with p Z T between 10 and 20 GeV and between 120 and 200 GeV. The predictions from Powheg+Pythia, Sherpa, and Herwig++ are compared with the data. The ratio of prediction to data is shown beneath each plot. None of the tested MC generators describes all aspects of the data well and in some regions the differences exceed the 70% level. Generally, the MC generators predict a higher number of particles with small p T than is observed in data (see top left of Figs  The ratios shown are predictions over data 5). This is consistent with the MC predictions tending to lower values of mean p T , as is shown on the lower right plots of Figs. 4 and 5. The largest differences between data and simulation are at low N ch and low p T , and arise due to the steeper transverse momentum spectrum of charged particles in MC simulations. Powheg+Pythia and Sherpa predict a higher fraction of events with fewer charged particles and a consistently smaller sum of p T . However, Herwig++ slightly overestimates the fraction of particles with p T > 2.5 GeV and is qualitatively closer to the shape of the distributions of N ch and p T . With rising p Z T , the data p T spectrum becomes harder, and N ch , p T , and mean p T increase. The relative discrepancy remains the same in comparisons with the generator predictions.

Differential distributions
The dependence on T ⊥ is illustrated in Fig. 6 for the unfolded p T spectrum in the trans-min region for events with 10 < p Z T < 20 GeV and 120 < p Z T < 200 GeV. Similar to the results for the measurement inclusive in T ⊥ , the MC generators predict a higher fraction of particles with low p T than present in data. The predictions of Powheg+Pythia are closer to the measured distributions in the lower p Z T region, but Sherpa describes better the full p T range in the higher p Z T bin. The Herwig++ simulations have significant statistical fluctuations at higher p T . The most striking difference between the different regions in T ⊥ is observed for the Powheg+Pythia generator when focusing on the low p Z T bins for N ch as presented in Fig. 7. In MPI-sensitive regions (left plot in Fig. 7) the distribution of and Herwig++ are compared with the data. The ratios shown are predictions over data N ch by Powheg+Pythia is shifted towards higher numbers of charged-particles relative to the data, i.e. overshooting the data in the range 1 ≤ N ch /δηδφ ≤ 2.5. But in the high thrust region (right plot) the MC generator underestimates the data almost over the full range except for the first two bins. In contrast, the performances of Sherpa and Herwig++ are consistent when comparing the low and high thrust regions for N ch ; Herwig++ overestimates N ch , and Sherpa underestimates it. The same effect is observed for the distributions of p T but is less significant and therefore not presented. As pointed out in Ref. [8], the regions of high values of T ⊥ are dominated by extra jet activity which is not adequately modelled in Powheg+Pythia, as shown in the right plots in Figs. 6 and 7. 7.3 Underyling event activity as a function of p Z T Figure 8 shows the mean number of charged particles and the mean of the scalar sum of the transverse momenta of those particles per unit η-φ space as a function of p Z T in the transverse, trans-min, and trans-max regions inclusively in T ⊥ . The trans-min region is further separated by T ⊥ in the right plots of Fig. 8. In the trans-min region, the UEsensitive variables N ch and p T rise slowly with increasing Z boson transverse momentum. In contrast, the observables in the trans-max region have a strong dependence on p Z T . This is because it is heavily contaminated with the Z boson hadronic recoil leaking into the transverse region. The slope of the UE activity in the trans-min region as a function of p Z T for events of high T ⊥ is similar to the inclusive measurement. The total amount of activity measured in the trans-min region for events with high T ⊥ is lower than the inclusive measure-ment due to the correlation of activity in the transverse region and T ⊥ . Furthermore, the right-hand plots of Fig. 8 demonstrate that the UE activity is higher for events with lower T ⊥ , as expected [8]. Lower values of T ⊥ also increase the dependence on p Z T in the trans-min region. The MC modelling of individual measurements in all 96 phase-space regions is further investigated by comparing the measured arithmetic means of the N ch , p T , and mean p T as functions of p Z T . Figures 9 and 10 show comparisons with the predictions of Powheg+Pythia, Sherpa, and Herwig++ for the trans-min and towards regions inclusively in T ⊥ . The predictions fail to describe the data in either of the regimes. For p Z T > 20 GeV, Herwig++ predicts a slower rise in UE activity with rising p Z T than in the measured distributions. On the other hand, Powheg+Pythia and Sherpa qualitatively describe the 'turn-on' effect of the UE activity, i.e. a steeper slope at low p Z T which vanishes at higher values of p Z T . For Powheg+Pythia, the rise of the UE activity is underestimated, and hence the discrepancy with data grows with p Z T and stabilizes around p Z T = 100 GeV. Only in the toward region of the mean of the mean p T is Sherpa in good agreement with the data.
The p Z T dependence for the two regions of T ⊥ in the transmin region is summarized in Figs. 11 and 12. In the low T ⊥ region, the prediction by Sherpa improves, e.g. for N ch the discrepancy shrinks from about 30% to roughly 10%. Referring to the same observable, Powheg+Pythia is in agreement with data for p Z T > 80 GeV in the low T ⊥ regime within the uncertainties. For the selection on high T ⊥ all generators underestimate the UE activity. Sherpa provides the best description of the data in mean p T . Apart from the toward  The ratios shown are predictions over data region, it tends to a constant underestimation but agrees with the overall shape. The agreement of Powheg+Pythia with data is better for T ⊥ < 0.75 than for the inclusive measurement. The predictions of Herwig++ in the trans-min region improve with higher values of p Z T and also in events of lower T ⊥ . However, the discrepancy between Herwig++ and the data in the lowest bins remains regardless of the selected region.
7.4 Comparison with other centre-of-mass energies Figure 13 presents a comparison of the measured N ch and p T for different centre-of-mass energies. The results for √ s = 7 TeV are taken from the previous ATLAS measure-ment of the UE activity in Z boson events [2]. The event selection criteria are similar to the analysis presented in this paper, but the previous measurement also includes the Z → e + e − channel.   The error bars correspond to the full uncertainties of the corresponding measurement UE e.g. MPI. Hence, the rise of the UE activity as a function of √ s is expected.

Discussion and conclusion
Measurements of four observables sensitive to the activity of the UE in Z → μμ events are presented using 3.2 fb −1 of √ s = 13 TeV pp collision data collected with the ATLAS detector at the LHC in 2015. Those observables are the p T of charged particles, the number of charged particles per event (N ch ), the sum of charged-particle p T per event ( p T ), and the mean of charged-particle p T per event (mean p T ). They are measured in intervals of the Z boson p T and in different azimuthal regions of the detector relative to the Z boson direction. The arithmetic means of the distributions are plotted as functions of the Z boson p T , inclusively of and in regions of transverse thrust.
The predictions from three Monte Carlo generators (Powheg+Pythia8, Sherpa and Herwig++) are compared with the data. In general, all tested generators and tunes show significant deviations from the data distributions regardless of the observable. The arithmetic means of the observables deduced from the predictions of Powheg+Pythia8 and Sherpa match the main features of the UE activity in the fiducial region. The turn-on effect, i.e. the rising activity as a function of the hard-scatter scale (here p Z T ), is visible as is a saturation of this effect for higher values of p Z T . In contrast to the other generators, Herwig++ fails to reproduce the turn-on effect at low p Z T as it predicts that the UE activity decreases as a function of p Z T when considered only in the p Z T < 20 GeV region. Otherwise, all generators underestimate the activity of the UE when quantified as the arithmetic mean of the observables for inclusive T ⊥ . The generators predict the mean values better in comparison with the data when focusing on the MPI-sensitive regions. Powheg+Pythia8 is in agreement with data within the uncertainties for N ch and p T , indicating an adequate handling of the MPI activity. However, since the predictive power shrinks for the region with T ⊥ ≥ 0.75 in comparison with the inclusive measurement, the simulation of contributions other than MPI to the UE activity needs to be improved. Reference [8] points out that the region with T ⊥ > 0.75 is dominated by extra jet activity, giving a first indication for a possible improvement of the MC generator prediction. This conclusion is valid when focusing on Powheg+Pythia8 for different regions of T ⊥ for individual bins of p Z T . In comparison with the measurements at √ s = 7 TeV [2], the performance of Herwig++ is consistent for p Z T > 20 GeV. Both measurements use the energy-extrapolation tunes [24] provided by the Herwig++ authors, i.e. UE-EE-3 for √ s = 7 TeV and in the analysis presented here UE-EE-5. The latter tune was additionally validated against Tevatron and LHC measurements at √ s = 900 GeV and √ s = 7 TeV [44]. The prediction of Herwig++ is slightly better for the distributions of N ch and p T at higher values of p Z T . In the previous measurements, the divergence increased with p Z T , which might be related to improper modelling of the impact parameter. Apart from overestimating the mean activity, Herwig++ improved relative to the √ s = 7 TeV measurements in the description of the shape of dN ev /d( p T /δηδφ), dN ev /d(mean p T ), and dN ev /d(N ch /δηδφ) in the presented p Z T -bins. Qualitatively it performs better than the other generators.
Powheg+Pythia8 performs as well at √ s = 13 TeV as it does at √ s = 7 TeV, but is tuned with AU2 (only the MPI part was tuned by ATLAS using √ s = 7 TeV UE data) in the previous measurements. Nevertheless, this indicates that the MPI energy extrapolation of Pythia8 works well, which is in agreement with the better description for distributions at low T ⊥ .
In contrast, while at √ s = 7 TeV Sherpa version 1.4.0 with the CT10 PDF set consistently overestimates the UE activity metrics N ch and p T by 5% to 15%, the present analysis and Sherpa version reveal a continuous underesti-mation. At √ s = 13 TeV, the discrepancy relative to the data decreases with higher values of p Z T . CERN

Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors' comment: All ATLAS scientific output is published in journals, and preliminary results are made available in Conference Notes. All are openly available, without restriction on use by external parties beyond copyright law and the standard conditions agreed by CERN. Data associated with journal publications are also made available: tables and data from plots (e.g. cross section values, likelihood profiles, selection efficiencies, cross section limits, ...) are stored in appropriate repositories such as HEPDATA (http:// hepdata.cedar.ac.uk/). ATLAS also strives to make additional material related to the paper available that allows a reinterpretation of the data in the context of new theoretical models. For example, an extended encapsulation of the analysis is often provided for measurements in the framework of RIVET (http://rivet.hepforge.org/)." This information is taken from the ATLAS Data Access Policy, which is a public document that can be downloaded from http://opendata.cern.ch/record/413 [opendata.cern.ch].] Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License (http://creativecomm ons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. Funded by SCOAP 3