1 Introduction

The production of a vector boson in association with a hadronic jet is the simplest hadron-collider process that probes both the strong and electroweak interactions at Born level. It has been measured extensively at the Tevatron [1,2,3,4] and the LHC [5,6,7,8,9,10,11,12,13,14,15,16], covering a large range in transverse momentum and rapidity of the final-state particles. When compared to theory predictions, these measurements provide important tests of the dynamics of the Standard Model and help to constrain the momentum distributions of partons in the proton.

The study of the forward-rapidity region for this process is particularly important for our understanding of parton distribution functions (PDFs) at extremal values of Bjorken-x, due to the different kinematic regimes that are probed compared to the inclusive case. Owing to the extended rapidity coverage of the LHC experiments, data is now available for both highly boosted leptons and jets, giving direct access to these regions in phenomenological studies.

In order to make this connection more concrete, it is instructive to relate the event kinematics to the Bjorken-x values that are probed. For a given vector-boson-plus-jet event, one can directly infer the valid range in Bjorken-x values from the event kinematics at the hadronic centre-of-mass energy \(\sqrt{s}\) through

$$\begin{aligned} x_1&\ge \frac{1}{\sqrt{s}}\left( m_\mathrm {T} ^V \cdot \mathrm {e} ^{+y^V} +{p_\mathrm {T} ^{j1}} \cdot \mathrm {e} ^{+y^{j1}} \right) , \nonumber \\ x_2&\ge \frac{1}{\sqrt{s}}\left( m_\mathrm {T} ^V \cdot \mathrm {e} ^{-y^V} +{p_\mathrm {T} ^{j1}} \cdot \mathrm {e} ^{-y^{j1}} \right) , \end{aligned}$$
(1)

with \(m_\mathrm {T} ^V = \sqrt{{({p_\mathrm {T} ^V})}^2+{m_{V}^2}}\) denoting the transverse mass. In this equation, \(x_1\) and \(x_2\) correspond to the momentum fractions of the incoming partons present in the colliding protons, \(p_\mathrm {T} ^V\) and \(p_\mathrm {T} ^{j1}\) are the transverse momenta of the vector boson and the leading-\(p_\mathrm {T} \) jet, \(m_{V}\) is the invariant mass of the combined system of the decay products of the vector boson and \(y_V\) and \(y_{j1}\) are the rapidities of the vector boson and the leading jet. The equality in the above relations holds at Born level.

In general, the smallest x value that can be probed simultaneously (\(x_1 \sim x_2\)) is:

$$\begin{aligned} x_{\min } = \frac{m_{V+j}^{\min }}{\sqrt{s}} \; , \end{aligned}$$
(2)

which is relevant primarily for data where fiducial cuts are symmetric in rapidity. Here \(m_{V+j}\) is the invariant mass of the vector-boson-plus-jet final state at LO. In addition, we have the combined kinematic constraint

$$\begin{aligned} x_1 x_2 \ge \frac{1}{s} \left( m_\mathrm {T} ^{V,\min }+{p_\mathrm {T}} ^{j1,\min }\right) ^{2} , \end{aligned}$$
(3)

where \(m_\mathrm {T} ^{V,\min }\) and \({p_\mathrm {T}} ^{j1,\min }\) are the minimum values of the vector boson transverse mass and leading jet \(p_\mathrm {T} \) admitted by the fiducial cuts. This constraint is particularly relevant in phase-space regions that are asymmetric in rapidity, which in turn probes more asymmetric values in \(x_1\), \(x_2\) and gives rise to a more complex interplay between the kinematics and the event selection cuts.

Precision QCD predictions for the production of a vector boson in association with a jet have advanced considerably in recent years with the completion of fixed-order next-to-next-to-leading (NNLO) QCD calculations [17,18,19,20,21,22,23,24,25,26], which are now being matched to resummation results [27, 28] to extend their validity across a wider kinematic range. These are complemented by NLO electroweak corrections [29,30,31], which are particularly relevant at large transverse momenta. There is a strong experimental motivation for precise predictions for these processes due to the high statistics and clean decay channels observed at the LHC, and their relevance to determinations of Standard Model parameters and as backgrounds for new physics searches [32]. Fitting procedures for PDFs also benefit greatly from improved predictions, due to the increased sensitivity to the gluon and quark content of the proton [33, 34]. Owing to the large gluon luminosity at the LHC, the dominant initial state for vector-boson-plus-jet production is quark–gluon scattering, with different quark flavour combinations probed by the different bosons.

In this paper, we perform a comparison between NNLO QCD predictions for vector-boson-plus-jet (VJ) production and measurements by the LHCb [16] and ATLAS [8] experiments. These measurements are highly complementary, allowing one to probe a much larger kinematic region than if either of them were taken alone due to the different rapidity coverages of the two detectors. The region of the (x, \(Q^2\)) plane which is probed at LO in QCD in \(\mathrm {ZJ}\) production is shown in Fig. 1, where one can see that LHCb covers two distinct sectors corresponding to the x values of the two beams. The corresponding plot for the (\(x_1\), \(x_2\)) plane is shown in Fig. 2, where the asymmetry of the LHCb region preferentially probes large \(x_1\) and small \(x_2\) values in contrast to the symmetric (\(x_1\), \(x_2\)) coverage of the ATLAS fiducial region. The kinematic constraints on the LHCb region are relaxed beyond LO as the presence of radiation permits larger \(Q^2\) and \(x_2\) values, unlike on the ATLAS region where LO kinematics already fully cover the kinematic region accessible at higher orders. The LO kinematics dominates in the contribution to the total cross section however, and gives a good indication of where the sensitivities of the two experiments lie.

The theoretical predictions are obtained using the NNLOjet framework [18, 26], which implements the relevant NNLO VJ matrix elements [35,36,37,38,39,40,41,42] and uses the antenna subtraction method [43,44,45] to extract and combine infrared singularities from partonic subprocesses with different multiplicity.

Fig. 1
figure 1

The regions of the (x, \(Q^2\)) plane accessible for the LHCb [16] and ATLAS [8] \(\mathrm {ZJ}\) selection criteria at LO. Here \(Q^2\) is the invariant mass of the full final state including both charged leptons and QCD radiation and x is the Bjorken-x from either of the incoming beams

Fig. 2
figure 2

The regions of the (\(x_1\), \(x_2\)) plane accessible for the LHCb [16] and ATLAS [8] \(\mathrm {ZJ}\) selection criteria at LO. Here \(x_1\) and \(x_2\) are the Bjorken-x values from beams 1 and 2 respectively

Throughout this work, the theoretical predictions employ a diagonal CKM matrix. The electroweak parameters are set according to the \(G_{\mu }\) scheme with the following input parameters:

$$\begin{aligned} M_{\mathrm {Z}}&= 91.1876~\mathrm {GeV} ,&M_{\mathrm {W}}&= 80.385~\mathrm {GeV} , \nonumber \\ \Gamma _{\mathrm {Z}}&= 2.4952~\mathrm {GeV} ,&\Gamma _{\mathrm {W}}&= 2.085~\mathrm {GeV} , \nonumber \\ G_{\mathrm {F}}&= 1.1663787 \times 10^{-5}~\mathrm {GeV} ^{-2} , \end{aligned}$$
(4)

and the PDF set used at all perturbative orders is the central replica of NNPDF31_nnlo [33] with \(\alpha _\mathrm {s} (M_{\mathrm {Z}})=0.118\).

2 LHCb 8 TeV boosted cuts

At the proton–proton centre-of-mass energy of \(8~\mathrm {TeV} \), the LHCb experiment has measured both \(\mathrm {W}\)- and \(\mathrm {Z}\)-boson production in association with a jet with the vector bosons decaying in the muon channel [16]. The acceptance in the forward region of the LHCb experiment allows it to reliably probe PDFs at both much higher and lower momentum fractions x than the general-purpose detectors at the LHC. This sensitivity arises from kinematic configurations that are asymmetric in \(x_1\) and \(x_2\), which in turn means that the event is boosted into the forward region. PDF uncertainties at large x and \(Q^2\) are generally driven by uncertainties in the d content of the proton, which these measurements have the capacity to constrain due to their flavour sensitivity, particularly in the charged-current channels. This provides a strong motivation to use the state-of-the-art NNLO QCD results to test the quantitative agreement of the predictions with the experimental data.

The fiducial cuts applied to the charged leptons and the jets, which we label as the LHCb cuts for both \(\mathrm {W^\pm J}\) and \(\mathrm {ZJ}\) production are given by:

$$\begin{aligned} {p_\mathrm {T}} ^{\text {j}}&> 20~\mathrm {GeV} ,&2.2&< \eta ^{\text {j}}< 4.2 , \nonumber \\ {p_\mathrm {T}} ^{{\mathrm {\mu }}}&> 20~\mathrm {GeV} ,&2&< y^{{\mathrm {\mu }}} < 4.5 , \nonumber \\ \Delta R_{{\mathrm {\mu }},\text {j}}&> 0.5 , \end{aligned}$$
(5)

where \({p_\mathrm {T}} ^{\text {j}}\) and \({p_\mathrm {T}} ^{\mu }\) are the transverse momenta of the jets and muons respectively, \(\eta ^{\text {j}}\) is the jet pseudorapidity, \(y^{{\mathrm {\mu }}}\) is the muon rapidity and \(\Delta R_{\mathrm {\mu ,j}}\) is the angular separation between the leading jet and the muon. In addition, the requirement \({p_\mathrm {T}} ^{{\mathrm {\mu }} +j} > 20~\mathrm {GeV} \) is applied for \({\mathrm {W^\pm J}} \) production, where \({p_\mathrm {T}} ^{{\mathrm {\mu }} +j}\) is the transverse component of the vector sum of the charged lepton and jet momenta. For \({\mathrm {ZJ}} \) production, the invariant mass of the dimuon system \(m_{\mu \mu }\) is restricted to the window \(60~\mathrm {GeV}< m_{\mu \mu } < 120~\mathrm {GeV} \) around the \(\mathrm {Z}\)-boson resonance. The anti-\(k_\mathrm {T} \) jet algorithm [46] is used throughout, with radius parameter \(R=0.5\). In the LHCb analysis [16], the VJ data were compared to NLO theory predictions, which were observed to overshoot the data throughout, albeit being consistent within the combined theoretical and experimental uncertainties.

For the theoretical predictions presented in this section, we set the central scale as in [16], i.e.,

$$\begin{aligned} \mu _\mathrm {R} = \mu _\mathrm {F} = \sqrt{{m^2_{V}+\sum \nolimits _i(p^i_{\mathrm {T},\text {j}})^2}} \ \equiv \mu _{0} , \end{aligned}$$
(6)

with scale variations performed independently for the factorisation and renormalisation scales \(\mu _\mathrm {F} \), \(\mu _\mathrm {R} \) by factors of \(\frac{1}{2}\) and 2 subject to the constraint \(\frac{1}{2}<\mu _\mathrm {F}/\mu _\mathrm {R} <2\).

Table 1 Fiducial cross sections for fixed order theory predictions and LHCb results from Ref. [16]. The errors quoted for NNLOjet   correspond to the scale uncertainty and the reported LHCb errors are statistical, systematic and luminosity respectively

The predictions for the fiducial cross section are shown in Table 1 for LO, NLO and NNLO QCD and compared to the results reported by the LHCb experiment for the individual VJ channels. We see large corrections when going from LO to NLO as observed in the NLO/LO K-factor of 1.34 for \({\mathrm {W^-}} \), 1.35 for \({\mathrm {W^+}} \) and 1.32 for \({\mathrm {Z}} \). On the other hand, going from NLO to NNLO produces much smaller and more stable corrections, with a NNLO/NLO K-factor of 1.006 for \({\mathrm {W^-}} \), 1.003 for \({\mathrm {W^+}} \) and 0.998 for \({\mathrm {Z}} \). The NNLO corrections lie within the scale bands of the NLO results. We note that the uncertainty bands overlap marginally between theory and data in Table 1 for \({\mathrm {W^-}} \) and \({\mathrm {Z}} \) production, but not for \({\mathrm {W^+}} \) production.

Fig. 3
figure 3

Cross section differential in the \(p_\mathrm {T} \) of the leading jet for \(\mathrm {W^-}\) production. Predictions at LO (green), NLO (orange), and NNLO (red) are compared to LHCb data from Ref. [16], and the ratio to NLO is shown in the lower panel. The bands correspond to scale uncertainties estimated as described in the main text

Fig. 4
figure 4

Cross section differential in the \(p_\mathrm {T} \) of the leading jet for \(\mathrm {W^+}\) production. See Fig. 3 for details

Fig. 5
figure 5

Cross section differential in the \(p_\mathrm {T} \) of the leading jet for \(\mathrm {Z}\) production. See Fig. 3 for details

2.1 Distributions differential in leading jet \(p_\mathrm {T} \)

Figures 3, 4 and 5 show the distributions for transverse momentum of the leading jet in \({\mathrm {W^-}} \), \({\mathrm {W^+}} \) and \({\mathrm {Z}} \) production respectively. Similarly to the fiducial cross section, the scale dependence of the differential distributions is considerably reduced when going from NLO to NNLO. The NNLO corrections are stable with respect to NLO, indicating a good convergence of the perturbative series. In addition, these results exhibit a strong similarity in behaviour between the \({\mathrm {W^-}} \), \({\mathrm {W^+}} \) and \({\mathrm {Z}} \) production channels. We see that the theory overshoots the data by \(\sim 5\)\(10\%\) over the bulk of the distribution, rising to \(30\%\) in the highest \(p_\mathrm {T} \) bin. This closely mirrors the effects seen at NLO as well as in the total cross section. The considerable decrease in theory uncertainty from NLO to NNLO makes the tension between data and theory more pronounced.

For the cuts placed on the \(\mathrm {WJ}\) final state, we are also able to associate the bins in \(p_\mathrm {T} ^j\) to lower limits on the Bjorken-x invariants. The lowest \(p_\mathrm {T} \) bin has the loosest constraint on the forward x, with \(x_1>0.041\), \(x_2>5.4\times 10^{-5}\). However, for the highest \(p_\mathrm {T} \) bins, between 50 and \(100~\mathrm {GeV} \), the restrictions translate to \(x_1>0.075\), \(x_2>0.00011\). Due to the invariant mass cuts applied in the \(\mathrm {ZJ}\) case shown in Fig. 5, the smallest values in Bjorken-x that can be probed only extend down to \(x_1 > 0.11\), \(x_2 > 0.0002\) in the highest \(p_\mathrm {T} \) bin. As a result, one probes larger values of x for \(\mathrm {ZJ}\) production than for \(\mathrm {WJ}\) in general. At large \(p_\mathrm {T} \), we see that the same features are present in the neutral and charged current cases. We observe that the NNLO predictions overshoot the data.

2.2 Distributions differential in pseudorapidity

The leading jet pseudorapidity distributions in Figs. 6, 7, 8 show a similar pattern of deviation between NNLO predictions and data to the previous \(p_\mathrm {T} ^j\) results, with theory predictions exceeding the data at the largest values of \(\eta _{j1}\). The behaviour is similar for \({\mathrm {W^+}} \), \({\mathrm {W^-}} \) and \({\mathrm {Z}} \), which may further indicate that the discrepancy is mainly due to the gluon distribution being overestimated at large x. Changes in individual quark or antiquark distributions would instead give a pattern of discrepancy that is more pronounced in one of the channels than in the others. In the pseudorapidity distributions, we probe simultaneously more extreme regions of \(x_1\) and \(x_2\) than for the \(p_\mathrm {T} ^j\) distributions as the directional dependence on \(y_{j}\) as given in Eq. (1) allows us to more directly discriminate the two Bjorken-x values. This can be seen most explicitly for the \(\mathrm {ZJ}\) case, for which the forward-most bin in pseudorapidity requires implicitly \(x_1>0.16\), \(x_2>1.1\times 10^{-4}\), meaning that the large \(x>\mathcal {O} (0.1)\) regions are probed efficiently.

Fig. 6
figure 6

Cross section differential in the pseudorapidity \(\eta \) of the leading jet for \({\mathrm {W^-}} \) production. See Fig. 3 for details

Fig. 7
figure 7

Cross section differential in the pseudorapidity \(\eta \) of the leading jet for \({\mathrm {W^+}} \) production. See Fig. 3 for details

Fig. 8
figure 8

Cross section differential in the pseudorapidity \(\eta \) of the leading jet for \({\mathrm {Z}} \) production. See Fig. 3 for details

Fig. 9
figure 9

Cross section differential in the pseudorapidity \(\eta \) of the lepton for \({\mathrm {W^-}} \)J production. See Fig. 3 for details

Fig. 10
figure 10

Cross section differential in the pseudorapidity \(\eta \) of the lepton for \({\mathrm {W^+}} \)J production. See Fig. 3 for details

Fig. 11
figure 11

Cross section differential in the rapidity of the dilepton system for \({\mathrm {Z}} \)J production. See Fig. 3 for details

The distributions for the rapidity of the charged lepton \(\eta _\ell \) are shown in Figs. 9 and 10 for \({\mathrm {W^-}} \) and \({\mathrm {W^+}} \) respectively. Here the NNLO predictions lie \(\sim 5\)\(15\%\) above the data across the entire considered range in \(\eta _\ell \). Note that it would be preferable to construct these distributions as a function of the \(\mathrm {W}\) rapidity \(y_{\mathrm {W}} \), which however can not be unambiguously reconstructed experimentally due to the unknown longitudinal component of the neutrino momentum. For the case of neutral-current production, on the other hand, this is possible and is shown in Fig. 11 differentially with respect to the rapidity of the reconstructed \(\mathrm {Z}\) boson.

Fig. 12
figure 12

\({\mathrm {W^\pm }} \) asymmetry in WJ production differential in the pseudorapidity \(\eta \) of the lepton produced from the \(\mathrm {W}\) boson decay. See Fig. 3 for details

From the charged-current data one can further construct the charge asymmetry differentially in the lepton pseudorapidity \(A^{\pm }(\eta _\ell )\),

$$\begin{aligned} A^{\pm }(\eta _\ell ) = \frac{\mathrm {d} \sigma ^{{\mathrm {W^+}} j}/\mathrm {d} \eta _\ell - \mathrm {d} \sigma ^{{\mathrm {W^-}} j}/\mathrm {d} \eta _\ell }{\mathrm {d} \sigma ^{{\mathrm {W^+}} j}/\mathrm {d} \eta _\ell + \mathrm {d} \sigma ^{{\mathrm {W^-}} j}/\mathrm {d} \eta _\ell } \; . \end{aligned}$$
(7)

The charge asymmetry is a valuable input to PDF fits as many systematic experimental errors cancel due to correlations in luminosity and systematic errors between the measurements of \(\mathrm {W^+J}\) and \(\mathrm {W^-J}\), giving a higher level of precision than for the total cross sections alone. This is also true for the theory predictions, where many higher-order contributions cancel between \(\mathrm {W^+J}\) and \(\mathrm {W^-J}\), and the similarity of the two calculations justifies some correlation between scale errors. \(A^{\pm }\) directly provides information on the difference between the u and d quark (as well as between the \(\bar{d}\) and \(\bar{u}\) anti-quark) content of the proton.

The advantage of considering the charge asymmetry for events where a jet is produced in association with the \(\mathrm {W}\) boson, which can be regarded as an exclusive asymmetry, as opposed to the inclusive \(A^{\pm }\) is that the implicit constraint on Bjorken-x is tightened due to the increase in partonic energy required. Before comparing our predictions with LHCb data for the exclusive charge asymmetry, it is instructive to recall the status of measurements of its inclusive analogue. The LHCb measurement of the inclusive charge asymmetry [47] probes larger values of x than at ATLAS or CMS. Currently the main constraints on u and d content at \(x>0.1\) come primarily from fixed-target DIS experiments and the D0 inclusive lepton charge asymmetry data [48]. The inclusion of the latest Tevatron results in PDF fits generally results in a harder u / d behaviour in this high-x region [49].

In Fig. 12, we show a comparison between our theoretical predictions for \(A^{\pm }\) related to WJ production and the LHCb data. Inside the numerator and the denominator expressions, we fully correlate the scales between the \({\mathrm {W^+}} \) and \({\mathrm {W^-}} \) cross sections, which amounts to taking the sum and difference of the cross sections as independent physical quantities \(\left[ \mathrm {d} \sigma ^{{\mathrm {W^+}}}\pm \mathrm {d} \sigma ^{{\mathrm {W^-}}}\right] (\mu _\mathrm {F}, \mu _\mathrm {R})\) instead of the \({\mathrm {W^+}} \) and \({\mathrm {W^-}} \) cross sections. The scale uncertainty shown is then obtained by independently varying the factorisation \((\mu _\mathrm {F})\) and renormalisation \((\mu _\mathrm {R})\) scales of both the numerator and denominator by factors of \(\frac{1}{2}\) and 2 around the central scale, while imposing the restriction \(\frac{1}{2}\le \mu /\mu '\le 2\) between all pairs of scales (\(\mu ,\mu '\)) in Eq. (7).

The shape of \(A^{\pm }\) as a function of \(\eta _\ell \) is generally determined by two competing effects [50]. The first is the (anti-)quark content of the PDF, where the u / d ratio and \(q/\bar{q}\) asymmetry increase with momentum fraction x, and therefore with \(\eta _\ell \). This alone gives an increase in \(A^{\pm }\) with \(\eta _\ell \) since u-initiated production is dominant in \({\mathrm {W^+}} \) production while d-initiated production is dominant for \({\mathrm {W^-}} \).

Fig. 13
figure 13

\({\mathrm {W^\pm }} \) asymmetry in WJ final states differential in the pseudorapidity \(\eta \) of the lepton produced from the \(\mathrm {W}\)  boson decay, evaluated with NNPDF3.1 (red), MMHT14 (yellow), CT14 (green) NNLO parton distribution functions. The NNPDF3.1 curve corresponds to a full NNLO calculation with scale uncertainties as described in the main text, and is used to determine a differential NNLO/NLO K-factor. The other two predictions are calculated at NLO and then rescaled by this K-factor

The second factor is due to the left-handedness of the couplings in the \({\mathrm {W^\pm }} \) production and decay process, which results in opposite preferential directions of the positive and negative decay leptons relative to the \({\mathrm {W^\pm }} \) spin. As a consequence, for the \({\mathrm {W^+}} \) case, the lepton is preferentially produced at lower \(\eta \) than the \({\mathrm {W^+}} \), whereas for the \({\mathrm {W^-}} \) case, the lepton is preferentially produced at higher relative \(\eta \). This effect causes the asymmetry to decrease with \(\eta _\ell \), and dominates over the quark PDF effects at higher x, as can be seen in Fig. 12.

We find that the NNLO predictions for the asymmetry describe the data reasonably well, but in general show a less steep slope with \(\eta _\ell \) than the data. This may be indicative of a PDF overestimate in the u / d ratio for \(x\gtrsim 0.1\) which would lead to the observed overprediction of the charge asymmetry in this region. It is noted that the large u / d ratio is in particular inferred [33, 49] from the Tevatron D0 lepton charge asymmetry data [48]. It will thus be crucial to combine these data with the LHCb results [16] in a global fit to determine whether they are mutually consistent.

The sensitivity of the \({\mathrm {W^\pm }} \) asymmetry in WJ final states on the PDF parametrizations is illustrated in Fig. 13, which shows this asymmetry at NNLO for NNPDF3.1 [33], MMHT14 [51] and CT14 [49] parton distributions. The NNPDF3.1 prediction is obtained from a full NNLO calculation of the individual cross sections entering into the ratio, which are also used to extract NNLO K-factors. Predictions for the other two PDF parametrizations are computed at NLO at cross section level, and then rescaled by these K-factors, before computing the ratio. The large spread of the predictions (noting also the different scale in the ratio compared to Fig. 12) in the last bin reflects the different modelling of the quark distributions at large x in the three parametrizations, and demonstrates the discriminating power of the LHCb asymmetry measurement.

3 ATLAS 7 TeV standard cuts

The second set of experimental data we consider is the \(7~\mathrm {TeV} \) (electron and muon) measurement by the ATLAS experiment [8], which combines data from the \(\mathrm {W}\) and \(\mathrm {Z}\) analyses of [5] and [7] with a small modification to the lepton selection criteria applied in the \(\mathrm {Z}\) analysis when taking ratios. This modification is applied in order to better match the \(\mathrm {W}\) selection criteria.

The ATLAS detector has a large rapidity range, capable of measuring pseudorapidities of up to \(|\eta |=4.9\) in the endcap region for both hadronic and electromagnetic final states. Unlike the LHCb measurement region, the large pseudorapidity reach of ATLAS also allows to probe large rapidity separations between final state particles, which correspond to configurations in which the Bjorken-x of both incoming protons is relatively large. In the following, we perform a comparison of fixed-order NNLO results to the individual \(\mathrm {WJ}\) and \(\mathrm {ZJ}\) distributions of [5] and [7], before constructing the ratios of \({\mathrm {WJ}} \) (\(\equiv {\mathrm {W^+J}} + {\mathrm {W^-J}} \)) and \(\mathrm {ZJ}\) distributions and comparing those to the results of [8]. We consider leading jet \(p_\mathrm {T} \) distributions in inclusive (at least one jet is required) and exclusive (exactly one jet is required) jet production, as well as inclusive leading jet rapidity distributions. The inclusive distributions have previously been compared to NNLO QCD predictions in [20], however exclusive distributions and ratios of distributions were not considered.

The fiducial cuts used in the ATLAS analyses are as follows:

$$\begin{aligned} {p_\mathrm {T}} ^{\text {j}}&> 30~\mathrm {GeV} ,&|y^{\text {j}}|&< 4.4 , \nonumber \\ {p_\mathrm {T}} ^{\ell }&> 25~\mathrm {GeV} ,&|y^{\ell }|&< 2.5 , \nonumber \\ \Delta R_{\ell ,\text {j}}&> 0.5 . \end{aligned}$$
(8)

For \({\mathrm {W^\pm J}} \) production, the restrictions \(E_{\mathrm {T}}^\text {miss} >25~\mathrm {GeV} \), and \(m_{\mathrm {T}}^{\mathrm {W}} >40~\mathrm {GeV} \) on the missing transverse energy and transverse mass of the \(\mathrm {W}\) boson are imposed. For \(\mathrm {ZJ}\) production the requirements \(66~\mathrm {GeV}< m_\mathrm {T} ^{\ell \ell } < 116~\mathrm {GeV} \) and \(\Delta R_{\ell \ell } > 0.2\) are applied to the transverse mass of the dilepton system and angular separation of the leptons. In the \(\mathrm {ZJ}\) distributions, we relax the lepton \(p_\mathrm {T} \) cut from 25 to \(20~\mathrm {GeV} \) in order to compare directly with the results of [7]. However we keep the lepton \(p_\mathrm {T} \) cut at \(25~\mathrm {GeV} \) when constructing ratios of WJ and ZJ distributions.

Jets are reconstructed using the anti-\(k_\mathrm {T} \) algorithm [46] with radius parameter \(R=0.4\), and we choose the central scale of the theory predictions as

$$\begin{aligned} \mu _\mathrm {F} = \mu _\mathrm {R} = \frac{1}{2}H_{\mathrm {T}} = \frac{1}{2}\sum _{i \,\in \, \mathrm {jets,\,\ell ,\,\nu }}{p_\mathrm {T}} ^i \equiv \mu _{0} , \end{aligned}$$
(9)

where \(H_\mathrm {T} \) is the scalar sum of the transverse momenta of all final state jets and leptons/neutrinos as appropriate. We denote the number of jets as N, such that in the selection criteria \(N=1\) corresponds to the exclusive case and \(N\ge 1\) corresponds to the inclusive case.

The scale variation uncertainties for the ratios are obtained in a similar manner as for LHCb \(\mathrm {W^\pm }\) asymmetries, with fully correlated scales between the \(\mathrm {W^+}\) and \(\mathrm {W^-}\) processes in the numerator, but taking the envelope of the scales when taking the ratio to the \(\mathrm {Z}\) distributions, imposing \(\frac{1}{2}\le \mu /\mu '\le 2\) between all pairs of scales.

3.1 Exclusive \(p_\mathrm {T} ^{j1}\) distributions

First we consider the exclusive \((N=1)\) \(p_\mathrm {T} \)  distribution of the leading jet for \(\mathrm {WJ}\)  production using the data from [5] as shown in Fig. 14. Here we observe agreement of the theory with data within errors up to \({p_\mathrm {T} ^{j1}} \sim 80~\mathrm {GeV} \), beyond which the theoretical predictions are systematically below the data. This behaviour is closely replicated in Fig. 15, which shows the equivalent \(\mathrm {ZJ}\)  distribution. However beyond \({p_\mathrm {T} ^{j1}} \sim 80~\mathrm {GeV} \), the agreement with data is noticeably worse than for the \(\mathrm {WJ}\)  distribution. While we neglect electroweak corrections which have a well-known impact on the weak boson \(p_\mathrm {T} \)  distributions [29,30,31] from large Sudakov logarithms, these generally give considerable reductive K-factors at large \(p_\mathrm {T} ^j\) and so would further worsen the agreement with data in both cases (see e.g. [31]). For these exclusive distributions, it is instructive to note that \(p_\mathrm {T} ^{j1}\) is equivalent to the transverse momentum of the vector boson due to the absence of extra jet radiation.

Fig. 14
figure 14

\(\mathrm {WJ}\) cross section differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet for events with exactly one associated jet \((N=1)\) in the ATLAS fiducial region from Eq. 8. Predictions at LO (green), NLO (orange), and NNLO (red) are compared to ATLAS data from Ref. [5], and the ratio to NLO is shown in the lower panel. The bands correspond to scale uncertainties estimated as described in the main text

Fig. 15
figure 15

\(\mathrm {ZJ}\) cross section differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet for events with exactly one associated jet \((N=1)\). Predictions at LO (green), NLO (orange), and NNLO (red) are compared to ATLAS data from Ref. [7], and the ratio to NLO is shown in the lower panel. The bands correspond to scale uncertainties estimated as described in the main text

Fig. 16
figure 16

\(\mathrm {WJ}\) cross section differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet for events with one or more associated jets \((N\ge 1)\). See Fig. 14 for details

Fig. 17
figure 17

\(\mathrm {ZJ}\) cross section differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet for events with one or more associated jets \((N\ge 1)\). See Fig. 15 for details

3.2 Inclusive \(p_\mathrm {T} ^{j1}\) distributions

For the inclusive \((N\ge 1)\) \(p_\mathrm {T} ^{j1}\)  spectrum in \(\mathrm {WJ}\)  production, shown in Fig. 16, we observe marginally improved agreement over a wider range of \(p_\mathrm {T} \), with overlapping uncertainty bands between data and theory up to \({p_\mathrm {T} ^{j1}} \sim 300~\mathrm {GeV} \). Beyond this point, there are substantial, \(\mathcal {O} (15\%)\), shape corrections when moving from NLO to NNLO which improve the agreement with data with respect to the NLO results. In \(\mathrm {ZJ}\)  production, shown in Fig. 17, the pattern of perturbative corrections is very similar. However we do not observe the same level of improved agreement with data when moving from exclusive to inclusive jet production as for the \(\mathrm {WJ}\)  process and we again see that the theory prediction is systematically below the data from \({p_\mathrm {T} ^j} \sim 100~\mathrm {GeV} \) onwards.

Allowing extra QCD radiation, as in the inclusive case, entails also allowing for dijet-type configurations where two hard jets are produced alongside a relatively soft vector boson. In the full NNLO calculation, these \(\mathcal {O} (\alpha _\mathrm {s} )\) contributions are first described at NLO, and give rise to a large QCD K-factors at high \(p_\mathrm {T} ^j\)  [52]. This is the dominant cause of the distinct structure of the perturbative corrections between exclusive and inclusive production; for \(N=1\) we see a decrease in the high-\(p_\mathrm {T} ^{j1}\)  cross-sections with the inclusion of higher orders as opposed to an increase in \(N\ge 1\) production. The difference in theory-to-data agreement between the \(\mathrm {Z}\)  and \(\mathrm {W}\)  distributions persists however, and may be a related to the different quark flavour combinations probed by the different processes. Whilst not as constraining as the \(\mathrm {W^+}\)/\(\mathrm {W^-}\)  ratio, the \(\mathrm {W}\)/\(\mathrm {Z}\)  ratio still retains some sensitivity to the u / d ratio due to different coupling strengths, and some dependence on the strange quark distributions, albeit suppressed compared to the inclusive Drell-Yan cross sections due to the Born-level gluon contribution. The inclusion of higher-order EW terms are unlikely to describe the difference with respect to data at high \(p_\mathrm {T} \), as the EW corrections to the leading \(p_\mathrm {T} ^j\)  distribution in vector-boson-plus-dijet events behave in a very similar manner for \(\mathrm {WJ}\)  and \(\mathrm {ZJ}\)  production as demonstrated in [31].

3.3 Exclusive/inclusive ratios

In order to better understand the description of real emission by the fixed order predictions, one can construct the ratio between the exclusive and inclusive leading jet distributions for both the \(\mathrm {WJ}\)  and the \(\mathrm {ZJ}\) case, shown in Figs. 18 and 19. The experimental measurements [5, 7] do not explicitly quote the data in terms of exclusive/inclusive ratios. We have therefore reconstructed it here using the central values of the relevant distributions with the errors approximated using uncertainties from the \(N=1\) distribution normalised to the \(N\ge 1\) results. For both distributions we observe similar behaviour, with good description of the data across the range of \(p_\mathrm {T} ^{j1}\), albeit with the general trend that the predictions systematically undershoot the central values of the data below \({p_\mathrm {T} ^{j1}} \sim 200~\mathrm {GeV} \), from which we can conclude that the extra jet radiation is well-described by the fixed order predictions.

Fig. 18
figure 18

Ratio of exclusive/inclusive \((N=1/N\ge 1)\) \(\mathrm {WJ}\)  production differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet. Errors on the ATLAS data are approximated using uncertainties from the \(N=1\) distribution normalised to the \(N\ge 1\) results. See Fig. 14 for details

Fig. 19
figure 19

Ratio of exclusive/inclusive \((N=1/N\ge 1)\) \(\mathrm {ZJ}\)  production differential in the transverse momentum \(p_\mathrm {T} \) of the leading jet. Errors on the ATLAS data are approximated using uncertainties from the \(N=1\) distribution normalised to the \(N\ge 1\) results. See Fig. 15 for details

Fig. 20
figure 20

\({\mathrm {WJ}}/{\mathrm {ZJ}} \) ratio differential in the exclusive \(p_\mathrm {T} \) of the leading jet \((N=1)\). Predictions at LO (green), NLO (orange), and NNLO (red) are compared to ATLAS data from Ref. [8], and the ratio to NLO is shown in the lower panel. The bands correspond to scale uncertainties estimated as described in the main text

Fig. 21
figure 21

\({\mathrm {WJ}}/{\mathrm {ZJ}} \) ratio differential in the inclusive \(p_\mathrm {T} \) of the leading jet \((N\ge 1)\). See Fig. 20 for details

3.4 W/Z ratios differential in leading jet \(p_\mathrm {T} \)

Figure 20 shows the \(\mathrm {WJ}\)/\(\mathrm {ZJ}\) ratio as a function of \(p_\mathrm {T} ^{j1}\), for the exclusive \((N=1)\) case. The large scale variation bands visible at NLO are a result of large NLO corrections at high \(p_\mathrm {T} ^j\) that increase the scale uncertainties when propagated through ratios. In particular, as shown in Figs. 14 and 15, we observe large reductive NLO/LO K-factors at high \(p_\mathrm {T} ^{j1}\) for the individual \(\mathrm {ZJ}\) and \(\mathrm {W^\pm J}\) distributions, reaching \(K=0.3\) in the highest \(p_\mathrm {T} ^{j1}\) bin, whereas the absolute size of the scale variation bands does not reduce significantly when going from LO to NLO. This has the effect of making the exclusive \({\mathrm {WJ}}/{\mathrm {ZJ}} \) ratio much more sensitive to scale variation in the constituent distributions at NLO than LO, artificially inflating the scale uncertainties at this order. The inclusive (\(N\ge 1\)) ratio, shown in Fig. 21, has very similar central values at LO, NLO and NNLO, but does not display the inflated NLO scale uncertainty.

When taking the ratio, the impact of the extra jet activity is strongly suppressed, while the PDF sensitivity is enhanced. As mentioned in the case of the individual distributions, the \(\mathrm {W}\)/\(\mathrm {Z}\)  ratio can be used to provide constraints on the ratio of up and down valence quark distributions inside the PDFs, as well as on the strange distribution, due to the different couplings of the vector bosons. Taking only the dominant incoming qg partonic configurations, we can see that naïvely the ratio behaves as

$$\begin{aligned} \frac{\sigma ^{\mathrm {WJ}}}{\sigma ^{\mathrm {ZJ}}}\sim \frac{ug+dg}{0.29ug+0.37dg}, \end{aligned}$$
(10)

where the numerical factors are the appropriate sums of the squares of the vector and axial vector quark-\(\mathrm {Z}\)  couplings. Discarding the common factor of the gluon PDF, this can be used to interpret a theory-to-data excess in the \(\mathrm {W}\)/\(\mathrm {Z}\)  ratio as an overestimate of the u / d ratio. If we look back to the individual distributions, we see that for each of the \(\mathrm {W}\)  and \(\mathrm {Z}\)  cases, the theory falls below the data. From this, it can be deduced that the most probable cause is an underestimate in the d quark content of the PDF.

Fig. 22
figure 22

\(\mathrm {WJ}\) cross section differential in the absolute rapidity \(|y_j|\) of the leading jet. See Fig. 14 for details

3.5 Inclusive leading jet rapidity distributions

The leading jet rapidity distribution \(|y_{j1}|\) for \(\mathrm {WJ}\) events is shown in Fig. 22, and for \(\mathrm {ZJ}\) events in Fig. 23. Here we observe that the higher-order QCD predictions are relatively stable for all orders up to \(|y_{j1}| \sim 3\). Beyond this point, we see a change in shape when transitioning from LO to NLO. The shape is kept unmodified under the inclusion of the NNLO corrections. There is an increase in scale uncertainty at higher rapidities \(|y_{j1}|\gtrsim 3.5\) due to large subleading jet contributions in this region, which are only described at lower orders for inclusive observables in the NNLO VJ calculation. In both cases, we see good agreement for all rapidities, with overlapping scale errors and experimental error bars for the entire distribution. However, the shape corrections induced at NNLO for \(|y_{j1}|\gtrsim 3.5\) modify the central values of the theory predictions such that the tension with data increases compared to NLO.

Fig. 23
figure 23

\(\mathrm {ZJ}\) cross section differential in the absolute rapidity \(|y_j|\) of the leading jet. See Fig. 15 for details

If one associates the higher-energy incoming parton with \(x_1\) and the lower-energy incoming parton with \(x_2\), such that the sum of all final state momenta lies in the same direction as parton 1, the forward-most bin \((3.8<y_{j1}<4.4)\) in rapidity here corresponds to \(x_1>0.19\), \(x_2>0.00012\) for \(\mathrm {WJ}\) production and \(x_1>0.19\), \(x_2>0.00019\) in \(\mathrm {ZJ}\) production. One can then analyse the distributions here in a similar manner to the LHCb predictions in Figs. 6, 7 and 8. As is the case for the LHCb data, we see a theory excess in the jet rapidity bins corresponding to \(x\gtrsim 0.1\). This is again indicative of an overestimate of the gluon contributions to the PDF in this region since this excess is present in both \(\mathrm {W}\) and \(\mathrm {Z}\) distributions. The central rapidity bins allow us to quantify better the PDF description at intermediate Bjorken-x, with the central-most bin in \(y_{j1}\) requiring \(x_1>0.0044\) and \(x_2>0.0036\) for both neutral- and charged-current production. Here we see good agreement with the data, indicating that the behaviour in this region is well under control.

Fig. 24
figure 24

\({\mathrm {WJ}}/{\mathrm {ZJ}} \) ratio differential in the absolute rapidity \(|y_j|\) of the leading jet. See Fig. 20 for details

The ratio of \(\mathrm {WJ}\) to \(\mathrm {ZJ}\) differential in the absolute rapidity \(|y_{j1}|\) of the leading jet is shown in Fig. 24. Due to the cross-cancellation in the ratios, we see that these predictions display a considerably better perturbative stability than the individual distributions at high rapidities. We observe excellent agreement with the ATLAS data across the entire rapidity range. In the ratio, the PDF dependence of the predictions is in general lowered, particularly for gluonic contributions due to their similarity between the \(\mathrm {WJ}\) and \(\mathrm {ZJ}\) cases. The agreement on the ratio demonstrates that the NNLO QCD description of the underlying parton-level process is reliable. It indicates that the discrepancies observed in the individual distributions are of parametric origin and can be remedied by an improved determination of the gluon distribution.

4 Conclusions

The recent calculations [17,18,19,20,21,22,23,24,25,26] of NNLO QCD corrections to all observables related to the production of a massive vector boson in association with a jet open up a new level of precision in the phenomenological interpretation of these observables. In this context, final states at forward rapidity are particularly interesting, since they correspond to initial states with very asymmetric momentum fractions of the incoming partons, thereby probing the parton distributions in regions where they are insufficiently constrained by other data sets.

In this paper, we performed an in-depth comparison of forward vector-boson-plus-jet data from LHCb [16] and ATLAS [8] with precise NNLO QCD predictions, obtained using the NNLOjet code [22, 23, 26]. Inclusion of NNLO QCD corrections leads to a substantial reduction of the theory uncertainty on the predictions, thereby matching the accuracy of the LHC precision data. Deviations between data and theory are observed in various distributions, which are further investigated by constructing ratios between different vector bosons, and between inclusive and exclusive vector-boson-plus-jet cross sections. The pattern of vector boson ratios and related asymmetries points to an overestimate of the PDF parametrisation in the gluon distribution for Bjorken-\(x\gtrsim 0.1\) and equally to an overestimate in the u / d quark ratio in the same region.

Our results highlight the unique sensitivity of forward vector-boson-plus-jet production to the PDF content of the proton. We expect that the results presented here will enable improved determinations of the gluon distribution and of the quark flavour decomposition at large Bjorken-\(x\gtrsim 0.1\), thereby enhancing the accuracy of theory predictions for signal and background processes at the highest invariant masses.