1 Introduction

The forward–backward asymmetry in neutral-current (NC) Drell–Yan (DY) lepton pair production has traditionally been used at hadron-hadron colliders in the context of precision electroweak measurements, in particular for determinations of the weak mixing angle \(\theta _W\) [1,2,3]. However, mostly due to the theoretical systematics coming from the parton distribution functions (PDFs), the accuracy reached in \(\theta _W\) determinations from measurements of the forward–backward asymmetry at the Large Hadron Collider (LHC) is currently an order of magnitude lower than in the case of \(\theta _W\) determinations from lepton colliders [4].

In Ref. [5] we have proposed that measurements of the NC forward–backward asymmetry (henceforth denoted as \(A_\mathrm{FB}\)) at the LHC Run-II (as well as in future high-luminosity runs at the HL-LHC) can usefully be employed in PDF determinations, in a fashion complementary to measurements of the charged-current (CC) asymmetry, which have long been used in PDF fits to access the d / u quark PDF ratio (see [6,7,8,9,10,11] and references therein). While CC and NC doubly differential cross sections in the DY di-lepton mass and rapidity have extensively been used for PDF determinations, Ref. [5] investigates the role of the angular information encoded in the \(A_\mathrm{FB}\), which is related to the single-lepton pseudorapidity and, once combined with di-lepton mass and rapidity, would qualitatively correspond to triply differential cross sections. Precision measurements of such cross sections [12] have not yet been exploited systematically for PDF determinations. It is noted in [5] that, once mapped in the invariant mass of the di-lepton final state, both the NC differential cross section and the \(A_\mathrm{FB}\) display statistical errors which, with the luminosity reachable at Run-II, become competitive with those assigned to quark PDFs in a broad kinematic range around the Z-boson peak.

In this paper we follow the proposal [5] and explore the NC asymmetry focusing on its potential for disentangling the flavour content of PDFs. Specifically, we find that it is possible to probe the quark PDF flavour structure with the \(A_\mathrm{FB}\) through a judicious use of selection cuts in the rapidity of the emerging di-lepton pair, whose four-momentum is completely accessible in NC DY, unlike the case of the final state of CC DY which contains a neutrino. Although access to the d / u quark PDF ratio is more indirect with the NC \(A_\mathrm{FB}\) than with the CC asymmetry, the PDF kinematical region in longitudinal momentum fraction x and evolution scale \(Q^2\) is larger in the NC case, as both on-peak and off-peak cross sections can be used, whilst in the CC case the escaping neutrino prevents the reconstruction of the \(W^\pm \) mass so that the \(W^\pm \) peak region necessarily dominates the cross section. We observe that selecting high di-lepton rapidities enhances the contribution of higher-order radiative QCD effects [13, 14] and subleading production channels [15, 16] so that this region also becomes sensitive to the gluon PDF.

The paper is organised as follows. In Sect. 2 we briefly summarise the set-up of the calculation of Ref. [5] (which was applied to the CT14NNLO [6] and NNPDF3.1 [7] PDF sets) and we extend its results to the case of the ABMP16 [8], HERA2.0 [10] and MMHT2014 [9] PDF sets. In Sect. 3 we address the question of flavour content and investigate parton luminosities in the kinematic regions relevant for the \(A_\mathrm{FB}\) around the Z-boson peak. In Sect. 4 we analyse the \(A_\mathrm{FB}\) in the high-rapidity region and we study the role of rapidity selection cuts to disentangle the PDF flavour structure. We give conclusions in Sect. 5.

2 The differential forward–backward asymmetry

The forward–backward asymmetry \(A_\mathrm{FB}\) in the \(\gamma ^*,Z\) mediated DY production process \(pp\rightarrow \gamma ^*,Z\rightarrow \ell ^+\ell ^-\) (with \(\ell =e,\mu \)) is given by

$$\begin{aligned} A_\mathrm{FB} = \frac{ d \sigma / d M(\ell ^+\ell ^-)[\eta (\ell ^+)>0] - d \sigma / d M(\ell ^+\ell ^-)[\eta (\ell ^+)<0] }{ d \sigma / d M(\ell ^+\ell ^-)[\eta (\ell ^+)>0] + d \sigma / d M(\ell ^+\ell ^-)[\eta (\ell ^+)<0] }, \end{aligned}$$
(2.1)

where \(M(\ell ^+\ell ^-)\) is the DY lepton pair invariant mass, \(\eta (\ell ^+)\) is charged lepton pseudorapidity while the identification of the forward (F) and backward (B) hemispheres via the restrictions \(\eta (\ell ^+)>0\) and \(\eta (\ell ^+)<0\) is obtained by exploiting the di-lepton boost variable as discussed in [17,18,19,20,21]. Hence, this is effectively a ‘di-lepton FB asymmetry’.

This asymmetry is to be contrasted with the ‘lepton charge asymmetry’ in the CC channel, defined as

$$\begin{aligned} A_\mathrm{CC} = \frac{ d \sigma / d \eta (\ell ^+) - d \sigma / d \eta (\ell ^-) }{ d \sigma / d \eta (\ell ^+) + d \sigma / d \eta (\ell ^-) }, \end{aligned}$$
(2.2)

which has traditionally been used to constrain PDFs, particularly the ratio of d-quark to u-quark distributions. From the charged current data, different observables can be actually reconstructed. The two cross sections \(\sigma (u\bar{d}\rightarrow \ell ^+\nu \)) and \(\sigma (d\bar{u}\rightarrow \ell ^-\bar{\nu }\)) give rise to different distributions owing to the different valence u and d-quarks that dominate their production. The distribution in the rapidity of the mediator, \(W^\pm \), is governed by a sea quark and a sea anti-quark at low values. Only a tiny difference is thus expected between the two above-mentioned cross sections. Instead, at large values of \(y(W^\pm )\), the process is mainly initiated by a valence quark and a sea anti-quark. A much larger difference is therefore expected between the two charged current cross sections. This feature should enable one to distinguish between valence/sea u/d quark PDFs. This highlights the potential role of a binning (even a cut) on the rapidity of the di-lepton system for charged current processes and in general. Starting from the kinematic relations

$$\begin{aligned} M_W^2 = sx_1x_2 , \quad y = {1\over 2}\ln \left( {x_1\over x_2}\right) \end{aligned}$$
(2.3)

where \(\sqrt{s} = E_{coll}\) = 13 TeV and \(x_i\) are the quark longitudinal-momentum fractions, the \(W^\pm \) rapidity distributions can be directly related to the quark and anti-quark fractional momenta thus constraining the corresponding PDFs. For \(|y(W^\pm )|\simeq 2.5\), the measurable region at the W-boson peak would correspond to \(x_1\simeq 0.1\) and \(x_2\simeq 3\times 10^{-4}\) (low-x anti-quark and large-x quark as the charged lepton is likely to be emitted in the same direction as the valence quark). This range could be used to constrain the PDFs and determine the parton luminosities.

Fig. 1
figure 1

\(A_\mathrm{FB}^*\) with its statistical (left column) and PDF (right column) error bands obtained with the CT14NNLO, and NNPDF3.1 PDF sets with a rapidity cut of \(|y|>0\) (first row), \(|y|>0.8\) (second row) and \(|y|>1.5\) (third row)

For the CC leptonic channel, the \(W^\pm \)-rapidity can be derived using the reconstruction of the neutrino on the W-boson peak, \(\sqrt{\hat{s}}\simeq M_W\). This reconstruction algorithm however suffers from uncertainties. A better observable is the charged lepton pseudo-rapidity that is directly related to the scattering angle, \(\eta (l^\pm ) = -\ln (\tan \theta /2)\). The difference between the two processes, \(q\bar{q}\rightarrow W^\pm \rightarrow l^\pm \nu _l\), is then enhanced at high values of \(|\eta (l^\pm )|\), still compatible with the detector acceptance.

In order to reduce the systematics, including the theoretical uncertainty due to the higher order QCD and EW corrections, it is a standard practice to construct cross section ratios. That is the reason to define the lepton charge asymmetry in Eq. 2.2. This strategy is also recommended to reduce the theoretical error due to the PDFs determination. The PDFs uncertainty indeed cancels out, to a large extent, when considering cross section ratios. This observation is the kernel of our analysis, which addresses the neutral current DY processes \(q\bar{q}\rightarrow Z, \gamma ^*\rightarrow l^+l^-\).

This process is used to analyse the \(Q^2\) dependence of the PDFs. While the CC process is sensitive to the sea anti-quark and the valence quark, the NC process encodes information about the sum of the sea \(\bar{d}\), \(\bar{u}\) anti-quarks and the valence u, d quarks. In contrast to the CC differential cross sections that appear in ratios, the NC ones are not normalised usually. Recently, LHC NC data have been released [12] in the form of triple differential cross sections in the di-lepton rapidity, di-lepton invariant mass and lepton scattering angle. This powerful information however suffers from experimental and theoretical systematical errors. In particular, the theoretical PDF’s uncertainty can be so large to overcome the potential differences coming from the comparison of the NC cross sections obtained using different PDF’s sets. This would weaken the sensitivity of the LHC measurements to the q and \(\bar{q}\) PDFs. At present, the discrepancy between the predictions from any two different PDF sets is compared to the statistical precision. At the \(W^\pm \) or Z-boson peaks, the cross sections are very high. The signal is clear and the background is rather low. That is why a sensitivity to small differences in the PDF’s parameterisation is claimed to be at hand.

Fig. 2
figure 2

\(A_\mathrm{FB}^*\) with its statistical (left column) and PDF (right column) error bands obtained with the ABMP16, HERA2.0 and MMHT2014 PDF sets with a rapidity cut of \(|y|>0\) (first row), \(|y|>0.8\) (second row) and \(|y|>1.5\) (third row)

Fig. 3
figure 3

Parton luminosities for the a \(u-\bar{u}\) and b \(d-\bar{d}\) interactions as a function of the partonic c.o.m. rapidity, computed at four different c.o.m. energies around the Z-boson peak

Fig. 4
figure 4

Ratio of the \(d-\bar{d}\) and \(u-\bar{u}\) parton luminosities as a function of the c.o.m. rapidity at four fixed values of the c.o.m. energy and for the PDF sets a NNPDF3.1, b CT14NNLO, c ABMP16, d HERA2.0, e MMHT2014. f Same ratio including the PDF error band for all the sets at fixed energy \(\sqrt{\hat{s}} = M_Z\)

Fig. 5
figure 5

Number of events as a function of the di-lepton invariant mass including the a statistical and b PDF errors for a rapidity cut \(|Y_{ll}| > 4.5\). A luminosity \(L = 300\) fb\(^{-1}\) is assumed

Fig. 6
figure 6

\(A_\mathrm{FB}^*\) as a function of the di-lepton invariant mass including the a statistical and b PDF errors with a rapidity cut \(|y|>4.5\). We assume a luminosity \(L = 300\) fb\(^{-1}\)

However, in our view, the PDF’s uncertainty could still represent a real obstacle. This is the main reason for proposing the use of the forward–backward asymmetry, which is a ratio of (differential) cross sections in the NC channel. It was observed in [5], using the method of [19, 20] to define the reconstructed FB asymmetry, \(A^*_\mathrm{FB}\), and to evaluate its PDF uncertainty, that whilst no gain in quark PDF determinations should be expected from the reconstruction of this observable with the data collected at \(\sqrt{s}\) = 8 TeV, the scenario changes at the LHC Run-II and at the planned HL-LHC. For instance, for the projected luminosity \(L =\) 3 ab\(^{-1}\) of the HL LHC [22], and for the CT14NNLO [6] and NNPDF3.1 [7] PDF sets, the statistical error is found [5] to be lower than the PDF error in the low to intermediate mass region \(M_{\ell \ell }~\lesssim ~\) 400 GeV.

Furthermore, as previously mentioned, the reconstructed \(A_\mathrm{FB}^{*}\) experiences a partial cancellation of systematical uncertainties. As the uncorrelated and correlated components of the uncertainty are determined for \(A_\mathrm{FB}^{*}\) and added up in quadrature, the result is that the global error is significantly reduced compared to the case of the pure differential cross sections. From this point of view, the measured \(A_\mathrm{FB}^{*}\) could be highly competitive with the single-differential cross section and even with the recently measured triple-differential cross section [12] from which it can be obtained.

We here consider the differential distribution of the \(A_\mathrm{FB}^{*}\) observable in the di-electron final state, focusing on the invariant mass region around the Z-boson pole, where the high statistics due to its resonant production allows very precise experimental measurements. The results are presented taking into account the resolution and the experimental acceptance and efficiency factors of this channel [23]. When computing the statistical error on the observable, we include the QCD next-to-next-to-leading order (NNLO) corrections to the cross section [24, 25] to estimate the expected number of events. We neglect the additional NLO EW corrections, which are estimated [12] to give rise to a K-factor of about 1.035 in the di-lepton mass range considered.

Following up on the results we already presented in [5], here we further explore the effects of imposing a rapidity cut on the di-lepton system for the observable \(A_\mathrm{FB}^{*}\). As it is well known from the literature, such a kinematical cut improves the efficiency of the reconstruction so that a closer shape to the original \(A_\mathrm{FB}\) is recovered. Conversely, as we mentioned before for the CC channel, the discrepancy in the \(A_\mathrm{FB}^*\) predictions obtained with any two different PDF sets are enhanced by the effect of the rapidity cut. We extend here the results of [5] by including additional PDF sets.

In Figs. 1 and 2, the \(A_\mathrm{FB}^*\) distributions in the di-lepton invariant mass have been obtained with different PDF sets: CT14NNLO [6] and NNPDF3.1 [7] in the former, ABMP16 [8], HERA2.0 [10] and MMHT2014 [9] in the latter. We have imposed a lower rapidity cut of \(|y|>0\) (first row), \(|y|>0.8\) (second row) and \(|y|>1.5\) (third row) on the di-electron final state. The distributions are accompanied by the PDF (plots on the right) and the statistical (plots on the left) error bands. The latter have been computed assuming an integrated luminosity of 300 fb\(^{-1}\), that is, the value which is supposed to be achieved by the end of the LHC Run-II programme.

The increasing discrepancies between the predictions as the rapidity cut gets stronger are a consequence of the different parameterisations in the partonic content of the proton, characteristic of each PDF’s set. Quarks and anti-quarks PDFs suffer from larger uncertainties in the regions of high-x and low-x, respectively: the rapidity cut indeed selects these kinematical regions. Future experimental measurements of \(A_\mathrm{FB}^*\) in these invariant mass and high rapidity regions have the potential of reaching sufficient precision to discriminate between different PDF parameterisations in the corresponding x range. If using \(A_\mathrm{FB}^*\), which is a cross section ratio, this discrimination can be remarkably outside the PDF’s uncertainty bands, as shown by the right-hand side plots. This is true in the invariant mass interval just before the Z-boson peak, 60 GeV \(\le M(ll)\le \) 80 GeV, and soon after it, 100 GeV \(\le M(ll)\le \) 180 GeV, for a high enough y(ll) cut. This condition is not at all guaranteed by the triple differential cross section that contains the same information but is not normalised. This powerful feature of the \(A_\mathrm{FB}^*\) observable is not taken into account, generally, and it should.

From Fig. 2, one can clearly distinguish the HERA2.0 predictions from the ABMP16 and MMHT2014 ones already for \(y(ll)\ge 0.8\). Analogous results are evident in Fig. 1 in the comparison between the NNPDF3.1 and the CT14NNLO predictions and they were already displayed by the authors in Ref. [5].

3 Parton luminosities at high rapidity

In this section we want to further investigate the origin of the behaviour of \(A_\mathrm{FB}^*\) that we have observed. We will show that such a behaviour is closely related to the flavour content of the proton and can give access to the up-type PDFs in a unique way. To this aim, we compare the parton luminosities from the \(u-\bar{u}\) and the \(d-\bar{d}\) interactions at some fixed value of the centre-of-mass energy around the Z-boson peak focusing on their dependence on the partonic rapidity, \(y_{q\bar{q}}\). We follow the formulae of Ref. [26] and in Fig. 3 we show the predictions for the CT14NNLO set.

As visible, the \(d-\bar{d}\) initiated process gets more suppressed at high rapidity in comparison to the \(u-\bar{u}\) one. This effect gets enhanced with increasing the centre-of-mass energy of the hard quark–antiquark scattering. It is indeed particularly evident on the right hand side of the Z-boson peak. It is then interesting to visualise the ratio of the \(u-\bar{u}\) and \(d-\bar{d}\) partonic luminosities as a function of the c.o.m. rapidity for the various PDF sets that we are considering. This is shown in Fig. 4a–e for the four values of the centre-of-mass energy of the hard quark–antiquark scattering that, on purpose, have been selected before and after the Z-boson peak (see legend). While at low rapidity, \(y_{q\bar{q}}\), the two luminosities are comparable, with increasing \(y_{q\bar{q}}\) the \(d-\bar{d}\) component gets depleted by roughly an order of magnitude. The level of suppression of the \(d-\bar{d}\) contribution is different within each PDF set. In order to estimate such differences in the predictions, one needs to include the PDF errors from each set.

In Fig. 4f, the ratio of the \(u-\bar{u}\) and \(d-\bar{d}\) partonic luminosities as a function of the c.o.m. rapidity is displayed with the corresponding PDF error band as calculated within the various PDF sets at fixed \(\sqrt{\hat{s}} = M_Z\).

In order to detail the displayed differences in the ratio predicted by the various collaborations, we take as reference point the rapidity value \(y = 4.5\). Here, the relative contribution from the down-quark initiated process lies in the interval 2–23% for the NNPDF3.1 set, 13–29% for the CT14NNLO set, 10–14% for the ABMP16, 14–23% for the HERA2.0 set and 16–25% for the MMHT2014 set.

This has two consequences. First of all, there are differences expected in the Z-boson production cross sections obtained with the various PDF sets that are well outside the PDF’s error band at large rapidity. A good sensitivity could be reached to small differences in the u, d quark and anti-quark parameterisation. Second, by imposing a sufficiently high rapidity cut on the data sample, all the considered PDF sets agree within errors that the \(d\bar{d}\) component of the luminosity gets reduced to at most 20%. One could therefore have a direct handle on the processes that are initiated by the \(u-\bar{u}\) interactions. This implies that one could have direct access to the valence u-quark and the sea u quark and anti-quark PDFs. Up to some degree of purity, the up-quark flavour could be extracted from the neutral Drell–Yan data and the corresponding PDFs constrained (or tuned) by a direct comparison. Using the valence u-quark PDF already well measured, one could adjust the sea u and \(\bar{u}\) distributions to fit the data. The kinematical region of large rapidity corresponds to selecting a low \(x_{\bar{u}}\) and a high \(x_u\) in the \(u\bar{u}\) scattering process. These are the ranges least known for the quarks PDFs. Therefore, at large rapidity the neutral Drell–Yan data could be used to constrain the u and \(\bar{u}\) PDFs and determine the corresponding \(u\bar{u}\) parton luminosity in these two extreme regions of x.

4 Sensitivity to the u and \(\bar{u}\) PDFs

We now want to exploit this idea as far as the LHC sensitivity would allow. For a meaningful analysis we cannot push the rapidity cut too high due to the reduction of the data sample though. In Fig. 5, we show the expected number of events as a function of the di-lepton invariant mass obtained for different PDF choices when imposing a rapidity cut \(|y| > 4.5\). For this value, the measurable region would correspond to \(x_1\simeq 0.6\) and \(x_2\simeq 7\times 10^{-4}\) (low-x anti-quark and high-x quark). This range could be used to constrain the valence/sea up-quarks and sea up-type antiquarks PDFs, where they are least known, and determine the corresponding parton luminosities. In the plot on the left we can observe the statistical error band evaluated for an integrated luminosity of 300 fb\(^{-1}\). As visible, in the region around the Z-boson peak the statistics is very high despite the stiff kinematical cut, however, the distribution rapidly falls moving away from the resonant production. Up to \(M_{ll}\le 120\) GeV, the various PDF sets could be distinguished if relying only on the statistical error. Conversely, the plot on the right-hand side shows that with the current PDF uncertainties the predictions obtained with the various PDF sets cannot be disentangled. They are indeed all within the errors. The spectrum does not seem to be able to constrain the PDF of the valence/sea up-quark and the sea up-type antiquark.

In Fig. 6 we show the equivalent picture for the case of the \(A_\mathrm{FB}^*\) observable. In the left-hand side plot, we observe that the precision allowed by the available statistics is sufficient for a precise experimental measurement of the observable, reflecting what shown for the predicted di-lepton spectrum. In the plot on the right-hand side, we can appreciate the good features of the \(A_\mathrm{FB}^*\) in terms of reduction of the PDF uncertainties, such that for this observable it will be possible to distinguish the predictions obtained with some of the PDF sets in specific invariant mass regions. In particular, the NNPDF prediction cannot be disentangled from the others just by observing the differential cross section in the di-lepton invariant mass, because of the large PDF error which the spectrum is affected by. In the \(A_\mathrm{FB}^*\) distribution instead the NNPDF central value and error band come out to be incompatible with those of the other PDF sets. A new sensitivity to the PDFs is then gained by associating the measurement of \(A_\mathrm{FB}^*\) to the spectrum.

5 Conclusions

In summary, building upon the recent observation made in [5] that the forward–backward asymmetry of di-lepton pairs produced via NC DY processes at the LHC can be used to constrain the PDFs inside the protons, we emphasise here that its exploitation in conjunction with a cut on the rapidity of the final state gives access also to the flavour content of the proton. In the presence of such a cut, one could indeed separate the valence/sea up-quark and the sea up-type antiquark, as the y-cut enforces a suppression of the \(d\bar{d}\)-quark luminosity. Depending on the size of the data sample, the statistical and PDF errors for the different PDF sets could be not overlapping, thus offering the possibility to constrain or even eventually measure the up-type quark/antiquark PDFs.

This should be intended as a preliminary analysis, quite promising though. Higher order EW and QCD corrections can become quite important in the kinematical region of high rapidity. They should be properly included in order to make solid statements. Precise and reliable Monte Carlo Event Generators are available to this purpose and are regularly used by the LHC experimental collaborations for data interpretation. Having these tools at hand, it would be highly advisable to perform a new PDFs fitting procedure aimed at establishing, in a firm way and by direct comparison, whether the use of the reconstructed forward–backward asymmetry can give better results in terms of PDFs constraints with respect to the triple differential cross sections now in use.