1 Introduction and motivation

The production of Z bosons and their decay to lepton pairs at the Large Hadron Collider (LHC) through the Drell–Yan mechanism [1] has been the topic of very fruitful and detailed studies in the LHC experiments [2,3,4,5,6,7,8]. The precision of the measurements has motivated over the past decade impressive theoretical developments, mostly in the area of higher-accuracy quantum chromodynamic (QCD) predictions, but also in the area of parton distribution functions (PDFs). The sub-percent precision achieved for the absolute and normalised fiducial cross-section measurements of the Z-boson transverse momentum, \(p_{\textrm{T}}\), as a function of its rapidity, \(|y|\), at the Z pole strongly constrains state-of-the-art theoretical calculations. In the best cases, these calculations combine approximate next-to-next-to-next-to-next-to-leading logarithm (N\(^4\)LL) perturbative resummation at low \(p_{\textrm{T}}\) with O(\(\alpha _s^3\)) (N\(^3\)LO) fixed-order perturbative calculations at high \(p_{\textrm{T}}\), where \(\alpha _s\) denotes the strong coupling constant. Similarly, the precision obtained for the fiducial cross-section measurements of \(|y|\) provides strong constraints on the parton distribution functions.

This paper presents for the first time a double-differential measurement in (\(p_{\textrm{T}}\),\(|y|\)) of absolute and normalised cross-sections at the Z pole within the full phase space of the decay leptons. The measurement uses the full coverage of the ATLAS detector to combine 6.2 million electron and 7.8 million muon pairs from Z-boson decays in the central region (\(ee_{CC}\) and \(\mu \mu _{CC}\) channels), complemented by 1.3 million electron pairs with one electron in the forward region of the detector (\(ee_{CF}\) channel). Such a measurement is possible using the methodology already developed and published for the extraction of the \(Z\)boson polarisation [6]. It relies on the decomposition of the lepton angular \(\cos \theta \) and \(\phi \) distributions in the Collins–Soper frame [9] into nine spherical harmonic polynomials, \(P_i\), multiplied by angular coefficients, \(A_i\) [10,11,12,13]. For pure Z-boson production, the full five-dimensional differential cross section describing the kinematics of the two Born-level leptons can be written as:

$$\begin{aligned}&\frac{ \textrm{d}\sigma }{\textrm{d}p_{\textrm{T}}~\textrm{d}y~\textrm{d}m~\textrm{d}\cos \theta ~\textrm{d}\phi }= \frac{3}{ 16 \pi } \frac{\textrm{d}\sigma ^{ U+ L }}{\textrm{d}p_{\textrm{T}}~\textrm{d}y~\textrm{d}m} \nonumber \\&\quad \times \bigg \{(1 + \cos ^2\theta ) + \frac{1}{2}\ A_0 (1 - 3 \cos ^2\theta ) \nonumber \\&\qquad + A_1 \ \sin 2 \theta \ \cos \phi \nonumber \\&\qquad + \frac{1}{2}\ A_2 \ \sin ^2\theta \ \cos 2 \phi \nonumber \\&\qquad + A_3 \ \sin \theta \ \cos \phi + A_4 \ \cos \theta \nonumber \\&\qquad + A_5 \ \sin ^2\theta \ \sin 2 \phi \nonumber \\&\qquad + A_6 \ \sin 2 \theta \ \sin \phi + A_7 \ \sin \theta \ \sin \phi \bigg \}. \end{aligned}$$
(1)

The dependence of the differential cross section on \(\cos \theta \) and \(\phi \) is analytical and is fully contained in the harmonic polynomials. On the other hand, the dependence on \(p_{\textrm{T}}\), \(|y|\) and m is entirely contained in the unpolarised cross section, denoted by \(\sigma ^{U+L}\), and in the \(A_{i}\) angular coefficients. Therefore, all hadronic dynamics from the production mechanism are factorised from the decay kinematics in the \(Z\)-boson rest frame. This allows the measurement precision to be essentially insensitive to all uncertainties in QCD, quantum electrodynamics (QED), and electroweak (EW) effects related to \(Z\)-boson production and decay. In particular, EW corrections that couple the initial-state quarks to the final-state leptons have a negligible impact (below 0.05%) at the \(Z\)-boson pole. This has been shown for the LEP precision measurements [14, 15], when calculating the interference between initial-state and final-state QED radiation and more recently for \(Z\)-boson measurements at the LHC [16]. The small fraction of \(\gamma ^*\) production in the Z-boson pole region and its interference with the Z boson can also be described by Eq. (1), but with different coefficients, so the \(A_i\) coefficients discussed in this paper are effective coefficients, containing the small \(\gamma ^*\) contribution at the Z pole.

The decomposition is based on a simple and model-independent ansatz: the spin-one nature of the intermediate gauge boson, and the spin one-half nature of the decay leptons, and on the assumption of angular momentum conservation and quantisation. It removes the need for predictions to model accurately the polarisation and decay of the Z-boson; only its production properties are of interest for comparison to the measurements. Therefore, any phenomenological interpretation of this measurement avoids the theoretical uncertainties and ambiguities related to spin correlations and to the resummation of fiducial power corrections [17,18,19,20].

One striking example of the advantage of exploiting measurements in the full phase space of the decay leptons is the rapidity dependence of the Z-boson transverse momentum spectrum: whereas the fiducial measurements of Ref. [2] are insensitive to this production property because the fiducial lepton selections essentially remove the rapidity dependence of the spectrum, the measurements reported here probe this dependence very precisely. The differential angular distributions also provide extra constraints on certain experimental systematic uncertainties since they are not used in the calibration procedures of the detectors. Finally, they have even less sensitivity to theoretical systematic uncertainties than the unfolded \(p_{\textrm{T}}\) fiducial measurements.

These measurements pave the way for unambiguous interpretations in terms of QCD. A precise determination of the Z-boson transverse momentum spectrum leads to excellent sensitivity to the strong coupling constant at the  \(m_{Z}\) scale, while the even more precise rapidity-dependent cross-sections, obtained after integrating over \(p_{\textrm{T}}\), can be compared directly to state-of-the-art fixed-order predictions with excellent sensitivity to the parton distribution functions.

This paper is structured as follows. Section 2 first presents an overview of the measurement methodology and of the likelihood fit performed to extract the observables of interest from the experimental distributions. Section 3 describes briefly the data analysis, and Sect. 4 discusses the systematic uncertainties in the measured differential cross-sections. Section 5 compares the \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements to theoretical predictions combining perturbative resummation with fixed-order calculations, and then compares the \(\frac{d\sigma }{dy}\) and the total cross-section measurements to the predictions from QCD N\(^3\)LO calculations using different PDF sets. Finally, Sect. 6 summarises and concludes the paper.

2 Measurement methodology

The angular coefficients are extracted from the data by fitting templates of the \(P_i\) polynomial terms, defined in Eq. (1), to the reconstructed angular distributions using \(8 \times 8\) bins in (\(\cos \theta ,\phi \)) space (see Ref. [6] for a detailed description). Each template is normalised by a free parameter for its corresponding polynomial coefficient \({A_{i}}\) and a common parameter representing the unpolarised cross section. The polynomial \(P_8 = 1+\cos ^2\theta \) in Eq. (1) is only normalised by the parameter for the unpolarised cross-section. All the angular coefficients together with the corresponding unpolarised cross section parameters are measured in each of the analysis bins in (\(p_{\textrm{T}}\)\(|y|\)) space.

In the absence of selections for the final state leptons, the angular distributions in the gauge boson rest frame are defined by its polarisation. In the presence of selection criteria applied to the leptons, the distributions are sculpted by kinematic effects, and can no longer be described by the sum of the nine \(P_i\) polynomials as in Eq. (1). Templates of the \(P_i\) terms are therefore constructed to account for this, which requires fully simulated signal Monte-Carlo (MC) samples to model the acceptance, efficiency, and migration of events. Reference \({A_{i}}\) coefficients are extracted in a simple way from the predicted shapes of the angular distributions in the full phase space of the decay leptons, using the orthogonality of the \(P_i\) polynomials (see Ref. [11] describing the underlying moment method used to extract the coefficients). Together with the reference unpolarised cross-section, they are used in a folding procedure based on the signal MC simulation. The folded polynomial templates (or simply templates) are built in (\(\cos \theta ,\phi \)) space for each of the nine original polynomials and for each of the measurement bins in (\(p_{\textrm{T}}^{\ell \ell }\)\(y^{\ell \ell }\)) space. They are then used to extract the angular coefficients and the unpolarised cross section in the full phase space of the leptons from Z-boson decay. The observables \(m^{\ell \ell }\)\(p_{\textrm{T}}^{\ell \ell }\), and \(y^{\ell \ell }\), which are defined using reconstructed lepton pairs, as described in Sect. 3.3, are to be distinguished from m\(p_{\textrm{T}}\) and \(|y|\), which are defined at generator level using lepton pairs at the Born level.

A likelihood is built from the nominal templates and the varied templates reflecting the systematic uncertainties, which are represented by two categories of nuisance parameters (NP), \(\beta \) and \(\gamma \). The first category, \(\beta \), represents experimental and theoretical uncertainties. Each \(\beta ^m\) in the set \(\beta ~=~\left\{ \beta ^1,\ldots ,\beta ^M\right\} \) is constrained by a unit Gaussian probability density function, \(G(0|\beta ^m, 1)\), and linearly interpolates between the nominal and varied templates. The second category, \(\gamma ^n\), represents systematic uncertainties from the limited size of the MC signal and background samples, which are constrained by Poisson probability density functions, \(P(N_{\textrm{eff}}^{\textrm{n}}|\gamma ^n N_{\textrm{eff}}^{\textrm{n}})\) where \(N_{\textrm{eff}}^{\textrm{n}}\) is the effective number of MC events in bin n, in each of the \(N_{\textrm{bins}}\) = 22528 bins of the measurement. After including these auxiliary parameters, and after all signal and background templates (see Sect. 3 for details of the samples) are summed over (with their respective normalisations), the expected number of events \(N_{\textrm{exp}}^n\) in bin n of the measurement can be written as:

$$\begin{aligned} N_{\textrm{exp}}^{n}(A,\sigma ^{U+L},\beta ,\gamma )= & {} \left\{ \displaystyle \sum _{j}\sigma ^{U+L}_{j}\times L\right. \nonumber \\{} & {} \left. \times \left[ t_{8j}^{n}(\beta ) + \displaystyle \sum _{i=0}^{7}A_{ij}\times t_{ij}^{n}(\beta )\right] \right\} \nonumber \\{} & {} \times \gamma ^n + \displaystyle \sum _{B}^{\textrm{bkgs}}T_{B}^{n}(\beta ), \end{aligned}$$
(2)

where:

  • index i runs over the eight angular coefficients and the corresponding \(P_i\) polynomials, while index j runs over all 352 bins in (\(p_{\textrm{T}}\),\(|y|\)) space.

  • A is the set of all angular coefficients, \(A_{ij}\).

  • \(\sigma ^{U+L}\) is the set of all unpolarised cross sections, \(\sigma _{j}^{U+L}\).

  • \(\beta \) is the set of all Gaussian-constrained nuisance parameters representing the systematic uncertainties.

  • \(\gamma \) is the set of all Poisson-constrained nuisance parameters representing the statistical uncertainties in the simulated samples and in the background estimates.

  • t is the set of all signal \(P_i\) polynomial templates, \(t_{ij}\).

  • \(T_{B}\) is the set of background templates, where the sum runs over all background sources.

  • L is the total integrated luminosity.

The summation over index j accounts for the contributions from all analysis bins at generator level that migrate into other analysis bins at the reconstruction level. The likelihood is then constructed as a product of Poisson probabilities across all \(N_{\textrm{bins}}\) and of auxiliary constraints for each nuisance parameter \(\beta ^m\):

$$\begin{aligned} {\mathcal {L}}(A,\sigma ^{U+L},\theta |N_{\textrm{obs}})= & {} \displaystyle \prod _{n}^{N_{\textrm{bins}}}\left\{ P(N_{\textrm{obs}}^{n}|N_{\textrm{exp}}^{n}(A,\sigma ^{U+L},\theta ))\right. \nonumber \\{} & {} \times \left. P(N_{\textrm{eff}}^{n}|\gamma ^n N_{\textrm{eff}}^{\textrm{n}})\right\} \nonumber \\{} & {} \times \displaystyle \prod _{m}^{M}G(0|\beta ^m,1). \end{aligned}$$
(3)

A profile likelihood ratio method is used to extract the best fit values of the parameters of interest (POIs) and their uncertainties. The POIs include the angular coefficients \(A_{ij}\) and the cross sections \(\sigma _{j}\) \(^{U+L}\) in each measurement bin. This procedure extends that presented in Ref. [6], in which only the angular coefficients were extracted as a function of \(p_{\textrm{T}}\) and \(|y|\) without any fit constraints based on the \(\sigma _{j}\) \(^{U+L}\) POIs. The results from the full fit focus on the differential cross-section results in each channel and each measurement bin. The results of the fit for the angular coefficients involving only central leptons are compatible with those of Ref. [6] within the uncertainties resulting from the slightly different selections and calibrations applied here. As explained in Sect. 3.3, the analysis of the forward electrons is significantly improved here with respect to that documented in Ref. [21]. As a result, the measurements of the \(ee_{CF}\) angular coefficients reported in this paper are published in HEPDATA and supersede those of Ref. [6].

Table 1 MC samples used to estimate the signal and backgrounds in the analysis

3 Analysis

3.1 ATLAS detector

The ATLAS experiment [22] at the LHC is a multipurpose particle detector with a forward–backward symmetric cylindrical geometry and a near \(4\pi \) coverage in solid angle.Footnote 1 It consists of an inner tracking detector, electromagnetic (EM) and hadronic calorimeters, and a muon spectrometer. The inner detector provides precision tracking of charged particles in the pseudorapidity range \(|\eta | < 2.5\). This region is matched to a high-granularity EM sampling calorimeter covering the pseudorapidity range \(|\eta | < 3.2\) and a coarser granularity calorimeter up to \(|\eta | = 4.9\). A hadronic calorimeter system covers the entire pseudorapidity range up to \(|\eta | = 4.9\). The muon spectrometer provides triggering and tracking capabilities in the range \(|\eta | < 2.4\) and \(|\eta | < 2.7\), respectively.

A first-level trigger is implemented in hardware, followed by two software-based trigger levels that together reduce the accepted event rate to \({400}\,\textrm{Hz}\) on average. An extensive software suite [23] is used in the reconstruction and analysis of real and simulated data, in detector operations, and in the trigger and data acquisition systems of the experiment.

3.2 Data and Monte Carlo samples

The data were collected by the ATLAS detector in 2012 at a centre-of-mass energy of \(\sqrt{s} = 8\) TeV, and correspond to an integrated luminosity of 20.2 fb\(^{-1}\). The mean number of additional pp interactions per bunch crossing (pile-up events) in the data set is approximately 20.

The event generators used to produce the \(Z/\gamma ^*\rightarrow \ell \ell \) signal events and the backgrounds estimated from simulation are listed in Table 1. The baseline Powheg Box  (v1/r2129) signal sample [24,25,26], using the CT10 NLO set of PDFs [28], is interfaced to Pythia  8 (v.8.170) [27] with the AU2 set of tuned parameters [36] to simulate the parton shower, hadronisation and underlying event, and to Photos (v2.154) [37] to simulate QED final-state radiation (FSR) in the \(Z\)-boson decay. The simulated line-shape of the Z-boson is reweighted to account for the running width and for mass-dependent NNLO QCD [38] effects, following the recipe described in Ref. [8]. This is a percent-level effect within the mass range used by the measurement, and provides the best possible leading-order EW prediction of the line-shape of the Z boson. The number of events available in the baseline Powheg BoxPythia 8 signal sample corresponds to approximately four times that in the data.

Backgrounds from EW (diboson, \(Z/\gamma ^* \rightarrow \tau \tau \), and \(\gamma \gamma \rightarrow \ell \ell \) production) and top-quark (production of top-quark pairs and of single top quarks) processes are evaluated from the MC samples listed in Table 1. All MC samples are processed through a full ATLAS detector simulation [39], based on Geant4 [40], and reconstructed with the same software as that used for the data. Pile-up events, occurring in the same and neighbouring bunch crossings are simulated and overlaid at the detector hit level on top of the hard-scattering process from the MC simulation.

3.3 Analysis overview and event selection

For this paper, a central lepton (electron or muon) is one found in the region \(|\eta | < 2.4\) (excluding, for electrons, the electromagnetic calorimeter barrel/end-cap transition region \(1.37< |\eta | < 1.52\)), while a forward electron is one found in the region \(2.5< |\eta | < 4.9\) (excluding a transition region \(3.00< |\eta | < 3.35\) between the end-cap and forward calorimeters). The analysis is split into three orthogonal channels that are analysed independently, and then combined for the last stages of the analysis, after verifying their compatibility.

The \(ee_{CC}\) channel consists of candidate events with two central electrons, obtained using a logical OR of a dielectron trigger, requiring two electron candidates, each having \(p_{\text {T}} ~>~12\) GeV, with two high-\(p_{\text {T}}\) single-electron triggers, the main one corresponding to a \(p_{\text {T}}\)  threshold of 24 GeV. Electron candidates are required offline to have \(p_{\text {T}} ~>~20\) GeV and \(|\eta | < 2.4\), and are reconstructed from clusters of energy in the electromagnetic calorimeter matched to inner detector tracks. The electron candidates must satisfy a set of “medium” selection criteria [21], which have been optimised for the level of pile-up present in the 2012 data.

The \(\mu \mu _{CC}\) channel consists of candidate events with two central muons, obtained using a logical OR of a dimuon trigger requiring two muon candidates with \(p_{\text {T}} > 18\) GeV and 8 GeV, respectively, and of two high-\(p_{\text {T}} \) single-muon triggers, the main one corresponding to a \(p_{\text {T}}\)  threshold of 24 GeV. Muon candidates are required offline to have \(p_{\text {T}} > 20\) GeV and \(|\eta | < 2.4\), and are identified as tracks in the inner detector, which are matched and combined with track segments in the muon spectrometer [41]. Track-quality, and longitudinal and transverse impact-parameter requirements are imposed to suppress backgrounds, and to ensure that the muon candidates originate from a common primary pp interaction vertex.

The \(ee_{CF}\) channel consists of candidate events with one central and one forward electron, obtained using the logical OR of the two high-\(p_{\text {T}}\) single-electron triggers used for the \(ee_{CC}\) events, as described above. The central electron candidate is required to have \(p_{\text {T}} > 25\) GeV and \(|\eta | < 2.4\). Because the expected background from multijet events is larger in this channel than in the \(ee_{CC}\) channel, the central electron candidate is required in addition to satisfy a set of “tight” selection criteria [21], including an explicit isolation requirement, which are optimised for the level of pile-up observed in the 2012 data. The forward electron candidate is required to have \(p_{\text {T}} > 20\) GeV and to satisfy a set of “medium” selection criteria, based only on the shower shapes in the calorimeters, since this region is outside the acceptance of the inner detector [42].

In the case of the \(ee_{CF}\) channel, the analysis of the forward electron candidates, in terms of the data quality, alignment, identification criteria, and calibration of the electromagnetic compartments of the calorimeters outside the acceptance of the inner detector, was improved significantly from those documented in Ref. [21]. A few examples illustrating these improvements are:

  • misalignments of a few mm between the inner wheels of the electromagnetic end-cap calorimeters and the inner detector and a rotation of a few mrad between one of the forward calorimeters and the inner detector were found and corrected for when studying the azimuthal angle difference between the central and forward electrons. As a result, the two angular coefficients (\(A_1,A_6\)), for which the fully two-dimensional (\(\cos \theta ,\phi \)) measurement is necessary, can now be measured for the \(ee_{CF}\) channel, and are found to be in agreement with the SM predictions.

  • an azimuthal inter-calibration of the forward electrons was performed to improve the baseline calibration. This corrected for large inhomogeneities of up to 10–15% in the energy response due to material from inner-detector services in front of the calorimeters not accounted for correctly in the simulation, and for high-voltage problems in some cells, which were inadequately corrected for online during data-taking.

  • the largest improvement was obtained by adjusting the simulated lateral shower shapes in the calorimeters to data and then calibrating carefully each region of the calorimeters. This resulted in both improved efficiencies for the forward electrons in certain regions of phase space and improved energy calibration.

As a result, the p-value for the agreement between the combined \(ee_{CC}\) and \(\mu \mu _{CC}\) channels and the \(ee_{CF}\) channel in the \(|y|\) range where they overlap, \(1.6< |y|< 2.4\), was improved from \( < 10^{-4}\) to 3% (see Fig. 6 in Sect. 5.1).

In each channel, the events are required to contain exactly two lepton candidates satisfying the criteria described above. For the central-central channels, where the charges of both leptons are measured, the two lepton candidates are required to be of opposite charge.

As described in Sect. 1, this analysis is focused on the Z-boson pole region, and the lepton pair is required to have an invariant mass, \(m^{\ell \ell }\), within a window around the Z-boson mass, \(80< m^{\ell \ell }< 100\) GeV. The simulated events are required to satisfy the same selection criteria as the data, after applying small corrections to account for the differences between data and simulation in terms of reconstruction, identification and trigger efficiencies and of energy scale and resolution for electrons and muons [41,42,43]. All simulated events are reweighted to match the distributions observed in data for the level of pile-up and for the primary vertex longitudinal position.

The measurements are binned in \(p_{\textrm{T}}\) and \(|y|\), with fine bins in \(p_{\textrm{T}}\) and coarse bins in \(|y|\), as follows:

  1. 1.

    \(ee_{CC}\) and \(\mu \mu _{CC}\) channels:

    • 23 bins in \(p_{\textrm{T}}\) with bin boundaries \(\{\)0, 2.5, 5.0, 8.0, 11.4, 14.9, 18.5, 22.0, 25.5, 29.0, 32.6, 36.4, 40.4, 44.9, 50.2, 56.4, 63.9, 73.4, 85.4, 105.0, 132.0, 173.0, 253.0, 4000\(\}\) GeV,

    • 6 bins in \(|y|\) with bin boundaries \(\{\)0, 0.4, 0.8, 1.2, 1.6, 2.0, 2.4\(\}\);

  2. 2.

    \(ee_{CF}\) channel:

    • 19 bins in \(p_{\textrm{T}}\) with bin boundaries \(\{\)0, 2.5, 5.0, 8.0, 11.4, 14.9, 18.5, 22.0, 25.5, 29.0, 32.6, 36.4, 40.4, 44.9, 50.2, 56.4, 63.9, 73.4, 85.4, 105.0\(\}\) GeV,

    • 4 bins in \(|y|\) with bin boundaries \(\{\)1.6, 2.0, 2.4, 2.8, 3.6\(\}\).

The choice of bin boundaries in \(p_{\textrm{T}}\) is the result of an optimisation with respect to the limited resolution of the measurements at low \(p_{\textrm{T}}\) and the limited statistics at high \(p_{\textrm{T}}\).

Fig. 1
figure 1

Expected Z-boson acceptance times selection efficiencies as a function of \(|y|\) (left) and \(p_{\textrm{T}}\) (right) for the \(ee_{CC}\), \(\mu \mu _{CC}\) and \(ee_{CF}\) analysis channels

Table 2 Data yields integrated over \(p_{\textrm{T}}^{\ell \ell }\) in each \(|y^{\ell \ell }|\) analysis bin along with estimated background fractions in % for the \(ee_{CC}\), \(\mu \mu _{CC}\), and \(ee_{CF}\)  channels

The angular coefficients and unpolarised cross-section results are extracted taking into account the correlations and migrations between the measurement bins. Figure 1 shows the expected products of the acceptance and the selection efficiency, defined as the ratio of the number of selected events to the number of events in the full decay lepton phase space, as functions of both \(|y|\) and \(p_{\textrm{T}}\). The shape of this product of acceptance and efficiency in \(|y|\) results in part from the differences in reconstruction and identification efficiencies between central electrons and muons. The \(ee_{CF}\) channel covers a higher range in \(|y|\) and overlaps with the central-central channels for \(1.6< |y|< 2.4\). In each of the analysis measurement bins, two-dimensional (\(\cos \theta ,\phi \)) angular distributions are computed (eight equal-sized bins in each observable) and they serve as the basis for the simultaneous extraction of all the angular coefficients and of the unpolarised cross-sections, following the methodology described in Sect. 2.

3.4 Background estimates and signal yields

Table 2 shows the event yields and breakdowns of the fraction of events originating from background sources, for the \(ee_{CC}\)\(\mu \mu _{CC}\), and \(ee_{CF}\) channels, respectively. The numbers are shown in each \(|y|\) analysis bin integrated over \(p_{\textrm{T}}\). The backgrounds are divided into three major groups:

  • the backgrounds containing a prompt isolated lepton, from top (\(t{{\bar{t}}}\) and Wt), diboson, \(Z\rightarrow \tau \tau \), and \(\gamma \gamma \rightarrow ll\) processes, estimated from simulation samples (see Sect. 3.2),

  • the multijet and W+jet backgrounds, estimated from data using a method very similar to that described in Ref. [6], and

  • the non-fiducial \(Z\) background, which is estimated from simulation and consists almost entirely of events outside the full mass range considered for the analysis at generator level, but passing the selection cuts at reconstruction level owing to migrations.

The background contamination from other processes for the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels is very small and amounts to about 0.3%. The multijet background in the \(ee_{CF}\) channel is larger, amounting to about 1.0%. The non-fiducial \(Z\) background amounts to about 1% in all channels. Templates of the angular distributions of the sum of all these backgrounds are used in the fit to extract the angular coefficients, as described in Sect. 2.

Fig. 2
figure 2

The cos\(\theta \) (left) and \(\phi \) (right) angular distributions, integrated over \(p_{\textrm{T}}^{\ell \ell }\)and \(y^{\ell \ell }\), for the \(ee_{CC}\) (top), \(\mu \mu _{CC}\) (middle) and \(ee_{CF}\) (bottom) channels. The pre-fit expected distributions are shown separately for the signal and the different small background sources contributing to each channel. The grey band represents the experimental systematic uncertainties in the total expected yields from signal plus background. The small normalisation differences between the data and the MC distributions observed at the level of a few percent are compatible with the combination of uncertainties in integrated luminosity and signal cross section

3.5 Angular distributions

The criteria described above are applied to the data, leading to totals of 6.2 (\(ee_{CC}\)), 7.8 (\(\mu \mu _{CC}\)), and 1.3 (\(ee_{CF}\)) million selected events (see Table 2 for a more detailed breakdown as a function of \(y^{\ell \ell }\)). The reconstructed differential angular distributions, integrated over \(p_{\textrm{T}}^{\ell \ell }\) and \(y^{\ell \ell }\), are shown in Fig. 2 for the three channels. Small normalisation differences between the data and the MC distributions are observed at the level of a few percent, compatible with the combination of uncertainties in integrated luminosity and signal cross section. The measurement of the angular coefficients is, however, independent of the normalisation between data and simulation in each measurement bin. Overall, the agreement in shape between the observed and predicted distributions of \(\cos \theta \) and \(\phi \) is good.

Table 3 Examples of breakdown of relative uncertainties in the measured absolute differential cross-sections, as obtained from the full fit. The values are given in % for the region near the peak of the \(p_{\textrm{T}}\) distribution and are shown for two \(y\) bins in the case of the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels and for one \(y\) bin in the case of the \(ee_{CF}\) channel. Also shown are the total uncertainties. The uncertainty of 1.8% in the integrated luminosity is not included
Fig. 3
figure 3

Breakdown of relative uncertainties in the measured absolute differential cross-sections, \(\frac{d^{2}\sigma }{dp_Tdy}\), as a function of \(p_{\textrm{T}}\). The values are shown, as obtained from the full fit and for each \(|y|\) bin, for the combination of the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels for \(|y|< 1.6\), for the combination of all three channels for \(1.6< |y|< 2.4\), and for the \(ee_{CF}\) channel alone for \(2.4< |y|< 3.6\). The uncertainty of 1.8% in the integrated luminosity is not included

Fig. 4
figure 4

Breakdown of relative uncertainties in the measured normalised differential cross-sections, \(\frac{1}{\sigma }\) \(\frac{d^{2}\sigma }{dp_Tdy}\), as a function of \(p_{\textrm{T}}\). The values are shown, as obtained from the full fit and for each \(|y|\) bin, for the combination of the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels for \(|y|< 1.6\), for the combination of all three channels for \(1.6< |y|< 2.4\), and for the \(ee_{CF}\) channel alone for \(2.4< |y|< 3.6\)

Fig. 5
figure 5

Breakdown of relative uncertainties in the measured absolute differential cross-section, \(\frac{d\sigma }{dy}\), as a function of \(|y|\). The values are shown, as obtained from the full fit, for the combination of the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels for \(|y|< 1.6\), for the combination of all three channels for \(1.6< |y|< 2.4\), and for the \(ee_{CF}\) channel alone for \(2.4< |y|< 3.6\). The uncertainty of 1.8% in the integrated luminosity is not included

Fig. 6
figure 6

Compatibility test between the different channels for the differential \(\frac{d\sigma }{dy}\) cross-section measurements integrated over \(p_{\textrm{T}}\). The top panel shows the measurements with their total uncertainties (except for the uncertainty of 1.8% in the integrated luminosity) for the three channels and their overall combination. The bottom panel shows, for the relevant \(|y|\) bins, the relative \(\frac{d\sigma }{dy}\) differences, \(\Delta /\sigma \), in % between the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels,\(\Delta (ee_{CC}-\mu \mu _{CC})\), and between the \(ee_{CF}\) channel and the combination of the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels, \(\Delta (ee_{CF}-CC)\)

Fig. 7
figure 7

Measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections with their total uncertainties shown as a function of \(p_{\textrm{T}}\) for each \(|y|\) bin. The uncertainty of 1.8% in the integrated luminosity is not included. For each successive \(|y|\) bin, the differential cross section is divided by a factor of ten for plotting purposes

Fig. 8
figure 8

Illustration of the softening of the \(p_{\textrm{T}}\) spectrum at large rapidities. Shown are the ratios of the measured differential \(\frac{d\sigma }{dp_T}\) cross-sections in each \(|y|\) bin to a reference one taken to be the most central one, \(0< |y|< 0.4\). The ratios shown do not include any correlated treatment of the uncertainties between different \(y\) bins

Fig. 9
figure 9

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from DYTurbo [45]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from DYTurbo are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 10
figure 10

Comparison between the measured normalised differential \(\frac{1}{\sigma }\) \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from DYTurbo [45]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties. The predictions from DYTurbo are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 11
figure 11

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from CuTe-MCFM [48]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from CuTe-MCFM are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 12
figure 12

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from Artemide [49]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from Artemide are from pure resummation and are therefore shown only over the \(p_{\textrm{T}}\) range for which the contribution from fixed-order is below a few %

Fig. 13
figure 13

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from NangaParbat [50]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from NangaParbat are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 14
figure 14

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from RadISH [53]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from RadISH are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 15
figure 15

Comparison between the measured absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections in each of the eight \(|y|\) bins and the predictions from SCETlib [54]. Shown are the ratios between the predictions with their uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data with their overall uncertainties (except for the uncertainty of 1.8% in the integrated luminosity). The predictions from SCETlib are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 16
figure 16

Comparison between the measured normalised differential \(\frac{1}{\sigma }\) \(\frac{d\sigma }{dp_T}\) cross-sections, integrated over \(|y|< 1.6\), with their total uncertainties and the predictions from the various resummation calculations. The top left panel shows the data, while the next panels show one by one the ratios between each prediction with its uncertainties as obtained from renormalisation/factorisation/resummation scale variations and the data. Except for Artemide, the predictions are matched to the fixed-order O(\(\alpha _s^3\)) contributions from MCFM [48, 55]

Fig. 17
figure 17

Comparison between the absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections with their total uncertainties in each \(|y|\) bin and the predictions from the two fixed-order O(\(\alpha _s^3\)) calculations from MCFM [48, 55] and NNLOJET [56]

4 Systematic uncertainties

This section describes the systematic uncertainties in the measurements of the absolute and normalised differential cross-sections, \(\frac{d^{2}\sigma }{dp_Tdy}\) and \(\frac{d\sigma }{dy}\), as extracted from the angular observables presented in Fig. 2. These systematic uncertainties are grouped according to their source, and their typical relative values over most of the kinematic range of the measured observables are listed in Table 3 in the case of the absolute cross-section measurements, \(\frac{d^{2}\sigma }{dp_Tdy}\). The overall uncertainty of 1.8% in the integrated luminosity is not shown in this table nor in the figures below, but affects of course all the absolute cross-section measurements presented in this paper. The variations of the uncertainties as a function of \(p_{\textrm{T}}\) and \(|y|\) are shown in Figs. 3 (absolute cross-sections) and  4 (normalised cross-sections). The statistical uncertainty in the data is the dominant uncertainty over most of the measurement bins, followed by the statistical uncertainty in the MC samples. The variations are shown after combining the different channels together, so the uncertainties are correlated through the fit procedure. This explains for example why the forward electron systematic uncertainties do not vanish for \(|y|< 1.6\) and why the muon systematic uncertainties do not vanish for \(|y|> 2.4\). The breakdown between the different groups of uncertainties is obtained after freezing all the nuisance parameters of each group to their best-fit values, and then subtracting in quadrature the total uncertainty obtained by performing the fit in these conditions from the total uncertainty obtained by the full fit. Because of the correlations between the nuisance parameters, each of the resulting contributions is rescaled such that their sum in quadrature remains equal to the total uncertainty from the full fit.

Once these cross-sections are integrated over \(p_{\textrm{T}}\), the statistical uncertainties obtained for \(\frac{d\sigma }{dy}\) are significantly reduced and the experimental systematic uncertainties in the lepton measurements [21, 41] become dominant, especially in the central region, as shown in Fig. 5. In all cases, theoretical uncertainties, arising essentially only from PDFs, are negligible (see Ref. [6] for a more detailed discussion). In particular, as mentioned in Sect. 1, higher-order QED/EW effects, such as initial-final state interference diagrammes, which break the factorisation assumption underlying the expression of the differential cross-section in Eq. (1), have a negligible impact in the Z-boson pole region studied here [16].

5 Results and comparisons to predictions

5.1 Compatibility between measurements

Before considering the measurement results themselves, an important step is to evaluate the compatibility between the different channels. This is first done for \(\frac{d^{2}\sigma }{dp_Tdy}\) for the \(ee_{CC}\) versus \(\mu \mu _{CC}\) channels, which are found to be compatible within their dominant statistical uncertainties in all cases, and are then combined into overall central-central (CC) measurements. In the results presented in this paper, there are two rapidity bins where the central-central and central-forward measurements overlap, namely in the range \(1.6~<~|y|~<~2.4\). The \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements from the \(ee_{CF}\) channel are compared in this overlap region with those from the CC channel. The distributions are found to be compatible within their dominant statistical uncertainties, and all three channels are then finally combined. The overall p-value of the combined fit is 4%, while the overall p-values of fits performed separately for each channel are found to be 11% for the \(ee_{CC}\) channel, 90% for the \(\mu \mu _{CC}\) channel and 2% for the \(ee_{CF}\) channel.

A stringent test of the compatibility between channels can be performed for the \(\frac{d\sigma }{dy}\) measurements after integration over \(p_{\textrm{T}}\), since the statistical uncertainties are strongly reduced as shown in Fig. 5. The results of this test are shown as a function of \(|y|\) in Fig. 6. The top panel of Fig. 6 shows comparisons between separate fits done to each of the three channels. The overall p-value for the compatibility between the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels is found to be 2%, while that between the \(ee_{CF}\) and CC channels is found to be 3%. The bottom panel of Fig. 6 shows the relative cross-section differences after performing the full combination of all channels between the \(ee_{CC}\) and \(\mu \mu _{CC}\) channels for \(|y|< 2.4\) and between the \(ee_{CF}\) and CC channels for \(1.6< |y|< 2.4\). The residual tensions between channels arise mostly from the highest \(|y|\) bin.

5.2 Comparison between \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements and predictions

For the double-differential \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements, the predictions are obtained by different state-of-the-art QCD perturbative calculations based on \(q_{\textrm{T}}\) resummation [44] at approximate approximate N\(^4\)LL accuracy. All these calculations, DYTurbo [45,46,47], CuTe-MCFM [48], Artemide [49], NangaParbat [50], RadISH [51,52,53], and SCETlib [54], are currently being benchmarked in the LHC Standard Model working group. Except for Artemide, they are matched at high values of \(p_{\textrm{T}}\) to a common state-of-the-art O(\(\alpha _s^3\)) fixed-order perturbative calculation from MCFM [48, 55]. Each one of these perturbative resummation calculations has its own recipe for defining a total uncertainty in the prediction based on variations of several QCD scales related to the resummation and fixed-order contributions and the procedure used to match them such that the summed prediction is well behaved. It is beyond the scope of this paper to explain the differences between the various approaches and the theory predictions are therefore shown below with total uncertainty envelopes provided by the authors.

Figure 7 shows all the absolute differential \(\frac{d^{2}\sigma }{dp_Tdy}\) cross-section measurements with their total uncertainties (except for the uniform uncertainty of 1.8% in the integrated luminosity). This complete set of cross-section numbers with the full covariance matrix are published in HEPDATA and contain all the information required for comparisons to theory and for interpretation in terms of PDF fits or of the strong coupling constant. As \(|y|\) increases, the \(p_{\textrm{T}}\) spectrum becomes softer and this is illustrated in Fig. 8, which presents a normalised ratio of the measurements, for which the reference is taken to be the measurement in the most central \(|y|\) bin, namely \(0< |y|< 0.4\). Here the luminosity uncertainty cancels, and the softening of the \(p_{\textrm{T}}\) spectrum is clearly visible for the higher \(|y|\) bins. This feature, measured for the first time, is expected and well reproduced by the predictions, as shown below. The standard fiducial measurements published for example using the same ATLAS data in Ref. [2] do not see this kinematic effect because the lepton fiducial selections in \(p_{\text {T}}\) and pseudorapidity largely compensate for it, resulting in a flat dependence of the differential fiducial measurement versus rapidity of the Z-boson.

Figure 9 shows the ratio of the absolute differential \(\frac{d^{2}\sigma }{dp_Tdy}\) cross-section measurements presented in Fig. 7 to the predictions from DYTurbo. In all \(|y|\) bins, the predictions are a few percent lower than the measurements, a feature that is discussed more in detail in Sect. 5.3. In the region near the peak of the \(p_{\textrm{T}}\) distribution however, the shape of the predictions is in agreement with that of the measurements within the predominantly theoretical uncertainties. To illustrate this agreement in shape, Fig. 10 shows the ratios between the normalised differential measurements and predictions in the range \(0< p_{\textrm{T}}< 100\) GeV. Over the full \(|y|\) range measured, the data and predictions are found to be in agreement within better than 5%. There are rather strong anti-correlation terms between neighbouring bins, inducing for example the large fluctuations from bin to bin seen for the highest \(|y|\) bin at high \(p_{\textrm{T}}\), where the limited statistics induce terms as large as 40%.

Figures 11121314, and 15 show the same ratios as in Fig. 9, successively for the CuTe-MCFM, Artemide, NangaParbat, RadISH, and SCETlib predictions. In all cases, the predictions agree with the measurements within their uncertainties, which range between 2 and 5% at the peak of the \(p_{\textrm{T}}\) distribution. The theory uncertainties in all the calculations are arguably incomplete at this stage, since variations between PDF sets are not included nor are uncertainties at low \(p_{\textrm{T}}\) related to heavy quark and non-perturbative effects.

To better assess the degree of accuracy with which the different resummation calculations agree with the data, the \(\frac{d^{2}\sigma }{dp_Tdy}\) cross-sections are integrated over \(|y|< 1.6\). This improves considerably the statistical uncertainty in the measurements. Figure 16 shows these normalised \(|y|\)-integrated \(\frac{d\sigma }{dp_T}\) spectra for the data and the predictions with their respective uncertainties, together with the ratios between each prediction and the data. The agreement of the predictions with the data is excellent, well within the scale variation uncertainties of 2–3% over most of the \(p_{\textrm{T}}\) range, except for two points in the lowest \(p_{\textrm{T}}\) bins from the RadISH predictions that are somewhat discrepant.

Finally, Fig. 17 shows a comparison between the measured absolute \(\frac{d\sigma }{dp_T}\) differential cross-section in each \(|y|\) bin and two fixed-order O(\(\alpha _s^3\)) predictions, that from MCFM, as used by the different calculations discussed above, and that from NNLOJET [56]. As expected, these fixed-order calculations are in agreement with the data at high \(p_{\textrm{T}}\), and do not describe the data well at the lower values of \(p_{\textrm{T}}\), where resummation is expected to play a major role, as shown for the case of the RadISH predictions matched to each of these fixed-order O(\(\alpha _s^3\)) predictions in Fig. 18. The larger uncertainties seen in the case of MCFM are mostly due to limited statistics.

5.3 Comparison between \(\frac{d\sigma }{dy}\) measurements and predictions

This section focuses on the \(\frac{d\sigma }{dy}\) measurements obtained after integrating the \(\frac{d^{2}\sigma }{dp_Tdy}\) distributions shown above over \(p_{\textrm{T}}\). As shown in Fig. 5, the experimental measurement is significantly more precise with a total uncertainty below 0.2% in the central rapidity region, owing to the large reduction of the uncertainties of a statistical nature. One also expects the theoretical uncertainties in the predictions to be significantly reduced compared to those shown for the \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements. As a consequence, the focus of the comparisons between data and predictions in this section are on the PDFs, which are expected to provide the dominant uncertainty in the predictions.

Given the experimental accuracy achieved for the measurements presented in this section, higher-order effects from QED initial-state radiation (ISR) and from so-called genuine electroweak (EW) virtual corrections are considered at next-to-leading order and their sum is labelled NLO EW. These are directly computed using the code from Ref. [66], and are in agreement with earlier results from calculations benchmarked in the LHC EW working group [16, 67,68,69,70]. At the Z pole, the virtual effects decrease the predicted cross-sections by 0.8%, while the QED ISR effects increase them by 0.4%. These corrections are found to be independent of rapidity. All QCD predictions shown below have therefore been decreased by a \(|y|\)-independent amount of 0.4% labelled as NLO EW corrections.

In this case, the comparisons to predictions do not depend on \(q_{\textrm{T}}\) resummation which, through unitarity constraints, does not affect the \(p_{\textrm{T}}\)-integrated differential cross-sections. The measurements are thus directly compared to the O(\(\alpha _s^3\)) fixed-order perturbative predictions from DYTurbo, supplemented by MCFM [48, 55] for the Z+jet contribution at O(\(\alpha _s^3\)). These predictions are formally N\(^3\)LO in QCD and are obtained using the very recent aN\(^3\)LO PDF set of Ref. [58]. For the first time, such a comparison between experimental measurements and predictions of this formal accuracy is possible and shown in Fig. 19. The theoretical uncertainty bands from QCD renormalisation/factorisation scale variations and from the aN\(^3\)LO PDF set are shown separately. The uncertainty arising from the scale variations rises slowly from 0.4 to 1.0% as \(|y|\) increases, while the MSHT PDF uncertainty is constant at around 1.5%. As shown in the first line of Table 4, the compatibility of the theory with the data is reasonable, with a p value of 11% if one only includes the uncertainties in the PDFs for the predictions, a standard practice for many publications because most PDF sets do not usually provide scale variation uncertainties. If one combines the PDF uncertainties with a combined scale variation uncertainty from DYTurbo, assumed to be uncorrelated with that from the PDFs, the p value obtained is 12%.

To assess how the aN\(^3\)LO MSHT20-specific PDF uncertainty band compares to the presumably non-negligible spread between different PDF sets, the calculation has been performed one order lower, at NNLO in QCD using six of the most recent NNLO PDF sets. These comprise CT18A [59], MSHT20 [60], NNPDF4.0 [61], ABMP16 [62, 63], HERAPDF2.0 [64], and ATLASpdf21 [65].

Figure 20 shows the results of these comparisons as ratios between the predictions and the data. The uncertainties in the theory predictions reflect here only the PDF specific uncertainties (the uncertainties from CT18A have been rescaled from 95 to 68% confidence level). Table 4 quantifies the quality of the agreement between the data and the predictions through the total \(\chi ^2\) and its corresponding p value. Also shown is the pull on the integrated luminosity of the experiment for each PDF set. Only the aN\(^3\)LO MSHT, NNLO CT18A and NNLO MSHT PDF sets show reasonable agreement with the data, with a positive pull close to one standard deviation on the luminosity, corresponding to predictions approximately 1.6% lower than the data. The NNPDF4.0 PDF set with its much smaller uncertainties displays poor agreement with the data. This is due to the shape of the predicted distribution since the pull on the integrated luminosity is small. The ABMP16 PDF set is the one that most strongly pulls the integrated luminosity but its poor agreement with the data is also due to its significant difference in shape with respect to the data. The HERAPDF2.0 set and, to a lesser extent, the ATLASpdf21 set also display poor agreement because of a large discrepancy with the data in the highest \(|y|\) bin due to the limited set of data used in these fits.

Fig. 18
figure 18

Comparison between the absolute differential \(\frac{d\sigma }{dp_T}\) cross-sections with their total uncertainties in each \(|y|\) bin and the predictions from RadISH [53] matched to the two fixed-order O(\(\alpha _s^3\)) calculations from MCFM [48, 55] and NNLOJET [56, 57]

Fig. 19
figure 19

Comparison between the \(\frac{d\sigma }{dy}\) measurements and N\(^3\)LO QCD predictions obtained from DYTurbo using the recent aN\(^3\)LO MSHT PDF set. The N\(^3\)LO QCD theory prediction includes a negative correction of 0.4% from NLO EW effects and the theoretical uncertainty bands from QCD renormalisation/factorisation scale variations and from the aN\(^3\)LO PDF set are shown separately. The bottom panel shows the ratios between predictions and data, where the data uncertainty bands with and without inclusion of the dominant uncertainty of 1.8% in the integrated luminosity are shown separately

Table 4 Compatibility test between \(\frac{d\sigma }{dy}\) measurements and predictions obtained from DYTurbo using different PDF sets. Based on the total uncertainties in the measurements and on the PDF uncertainties in the predictions shown in Figs. 19 and 20, the \(\chi ^2\) per degree of freedom (d.o.f.) and corresponding p-values are shown for each PDF set (the uncertainties from CT18A have been rescaled from 95% to 68% confidence level). Also shown is the pull on the integrated luminosity in units of its total uncertainty of 1.8%

Finally, the tot al cross-section times branching ratio of \(Z \rightarrow \ell \ell \), \(\sigma _Z\), for \(Z/\gamma ^*\) production in the Z-boson pole region, \(80< m < 100\) GeV, and within \(|y|< 3.6\) is extracted from the integration of the measured differential \(\frac{d\sigma }{dy}\) cross-section:

$$\begin{aligned} \sigma _Z = 1055.3 \pm \ 0.7 \,\mathrm{(stat.)} \pm \ 2.2\, \mathrm{(syst.)} \pm \ 19.0\, \mathrm{(lumi.) pb}. \end{aligned}$$

Aside from the dominant uncertainty in the integrated luminosity, the overall systematic uncertainty of 0.2% in this measurement is dominated by experimental lepton efficiency systematic uncertainties and has a negligible contribution from theory uncertainties, which are below 0.1% and arise essentially from PDFs. Table 5 compares this measurement to the predictions obtained from DYTurbo with the same PDF sets as those shown in Table 4 for the rapidity-dependent cross-section \(\frac{d\sigma }{dy}\).

Fig. 20
figure 20

Ratio comparison between the \(\frac{d\sigma }{dy}\) measurements and NNLO QCD predictions obtained from DYTurbo using different NNLO PDF sets. The uncertainty bands in the predictions only show the uncertainties specific to each PDF set (the uncertainties from CT18A have been rescaled from 95% to 68% confidence level). In each \(|y|\) bin, the ratios for each PDF set are gradually displaced to the right for plotting purposes

6 Conclusions

This paper presents for the first time a double-differential measurement in (\(p_{\textrm{T}}\)\(|y|\)) of absolute and normalised cross-sections at the Z pole within the full phase space of the decay leptons. This is in contrast to the many previous precise unfolded measurements performed in the fiducial phase space of the decay leptons. The measurements in this paper are obtained through a four-dimensional measurement of the lepton angular distributions as a function of \(p_{\textrm{T}}^{\ell \ell }\) and \(y^{\ell \ell }\) for a total sample of approximately 15 million Z-boson decays measured within the pole region, \(80< m^{\ell \ell }< 100\) GeV, and within the range \(|y^{\ell \ell }|< 3.6\). Such a measurement is achieved by extending and improving the methodology already developed and published for the extraction of the Z-boson angular coefficients. A profile likelihood fit extracts at the same time these eight angular coefficients and the corresponding unpolarised cross-section as parameters of interest in each measurement bin in (\(p_{\textrm{T}}\)\(|y|\)) space. The uncertainties in these measurements are mostly statistical in nature and the experimental and theoretical systematics are at the few per mille level over most of the range.

The \(\frac{d^{2}\sigma }{dp_Tdy}\) measurements are compared to several state-of-the-art QCD perturbative predictions based on \(q_{\textrm{T}}\)-resummation at approximate N\(^4\)LL accuracy matched to fixed-order O(\(\alpha _s^3\)) calculations at high \(p_{\textrm{T}}\). The agreement between the data and the predictions is within 5% over the whole range of the measurements except in kinematic regions with limited statistics.

Once integrated over \(p_{\textrm{T}}\), the rapidity-dependent cross-sections are measured with an overall accuracy of 0.2–0.5% before accounting for the uncertainty in the integrated luminosity of 1.8%. They are compared to the predictions from different PDF sets, which display a varying degree of agreement with the data.

The total cross-section times branching ratio of \(Z \rightarrow \ell \ell \), \(\sigma _Z\), for \(Z/\gamma ^*\) production in the Z-boson pole region, \(80< m < 100\) GeV, and within \(|y|< 3.6\), is found to be: \(\sigma _Z = 1055.3 \pm \ 0.7\) (stat.) \(\pm \ 2.2\) (syst.) \(\pm \ 19.0\) (lumi.) pb, in agreement with state-of-the-art predictions at N\(^3\)LO in QCD.

Table 5 Comparison between the measured total cross-section times branching ratio of \(Z \rightarrow \ell \ell \), \(\sigma _Z\), for \(Z/\gamma ^*\) production with its uncertainty and predictions obtained from DYTurbo using different PDF sets. The prediction using the MSHTaN\(^3\)LO PDF from Ref. [58] shows separately the scale variation and PDF uncertainties. The uncertainties in the NNLO predictions include only the PDF uncertainties from each specific PDF set (the uncertainties from CT18A have been rescaled from 95% to 68% confidence level)