Improving the Drell–Yan probe of small x partons at the LHC via an azimuthal angle cut

Predictions for Drell–Yan lepton pair production at low dilepton mass and small x at the LHC usually have a large scale dependence. This can be decreased by determining an optimal factorization scale. In this paper, we reduce this scale by imposing a cutoff in azimuthal angle between the transverse momentum of the leptons, properly taking into account Sudakov effects. This allows one to probe the parton distributions at smaller scales eliminating most of the current theoretical uncertainty.


Introduction
The Drell-Yan process is one of the standard channels for determining the parton distribution functions (PDFs), specially the sea quark ones. At the CMS experiment, for instance, the production of the pairs of muons is measured with a wide range of dilepton invariant mass, 15 < M < 3000 GeV at √ s = 13 TeV [1]. The results are integrated in dilepton rapidity and show good agreement with next-tonext-to-leading order Drell-Yan predictions. For a similar result from ATLAS, see Ref. [2].
It is possible to calculate the Drell-Yan (DY) cross section through a factorized scheme: as a convolution of the parton distributions (one for each involved proton) with the matrix element using a factorization scale, μ F . Schematically, we have where the matrix element, M(μ F ), is calculated in a perturbative manner. The convolution is in x space, i.e., the longitudinal momentum fraction carried by the partons. At leading a e-mail: yamasakikendi@gmail.com b e-mail: emmanuel.de.oliveira@ufsc.br (corresponding author) order (LO), there is a big scale dependence, whereas, at nextto-leading order (NLO), there is a smaller dependence and so on for higher orders until that, if we consider all perturbative terms, the result would be independent of scale, assuming it is a convergent series. The conventional choice for the factorization scale is M for the DY process. It is known that at small x NLO theoretical predictions there is a large factorization scale dependence, usually quantified by allowing for μ F = M/2, 2M. This is due to the fact that a variation of factorization scale will change the parton distributions. If the whole perturbative series were present, the matrix element would cancel this change. However, when truncated at NLO, the matrix element contains only one parton emission (see, e.g., Fig. 1), while the parton distributions can emit many partons (average of 8 at small x and for the LHC energies, as estimated in Ref. [3]) when they are evolved in μ F . This uncertainty limits the precision in which the parton distributions can be probed by the Drell-Yan process.
However, there is a procedure [3] to set an optimal scale, which reduces the uncertainty due to the factorization scale. The main idea is, in the limit of small x, to include part of the NLO contribution already at the LO by changing the parton distribution factorization scale at LO. It was applied first for the DY process, but it has been also applied to other processes, like cc and bb production [4] and J/ψ production [5].
Given that at large scales the parton distributions are more or less understood, it would be desirable to lower the optimal scale. With this goal, in Ref. [6], a dilepton (or, equivalently, photon) upper transverse momentum (k t ) cutoff was imposed, therefore making the NLO contribution smaller and then requiring a smaller optimal scale. In this way, one has information as regards the PDFs at smaller scales, i.e., smaller than the scales that can currently be measured due to experimental limitations. In this paper, we continue that work by imposing a cut in the azimuthal angle between the transverse momentum of the leptons (instead of a photon k t cut). This will be a complementary approach, which can be tested both theoretically and experimentally.
There are other ways to resum small x parton evolution. For example, an all-order small-x resummation matched to a fixed-order DGLAP anomalous dimension [7] was obtained some time ago. Also, by considering the perturbative coefficient functions at fixed order minus its expansion in α s series, it was possible to resum small x effects in Refs. [8,9] and have a better description of DIS data. In our work, we have the advantage of being able to choose more exclusive observables by having an easier way of introducing cutoffs.
This paper is organized as follows: in Sect. 2, we discuss how to reduce the NLO phase space through the azimuthal angle cut. Then, in Sect. 3, we calculate the optimal scale as a function of the cutoff. In Sect. 4, we show the effect of the cutoff in the cross section and, in Sect. 5, we show the stability of the results with regard to the choice of the remaining factorization scale. Finally, we present our conclusions in Sect. 6.

Imposing an azimuthal angle cut φ 0
Drell-Yan process at NLO is given by a collision between a parton A and parton B, resulting in another parton C and a photon, the latter splitting into leptons D and E. The most important case at small x, where the gluon distribution dominates, is the QCD Compton scattering: a gluon and a quark are the initial partons that result in the quark C and the leptons D and E, as shown in Fig. 1 The leptons D and E with the corresponding transverse momentum p Dt and p Et are separated by an azimuthal angle φ. If we take φ to be the smallest angle, it will vary between 0 < φ < π, with the upper limit corresponding to the back-to-back configuration. With an azimuthal angle cut, we reduce the number of events taken into account by selecting only the ones with φ > φ 0 , i.e., closer to the back-to-back configuration. In Fig. 2, we present only the lepton pair, D and E, and show the cut off region in red.
By introducing the cutoff, we expect to lower optimal scale that will be described in the next section. In this way, we are able to safely probe parton distribution at lower scales by reducing the big uncertainty involved in the choice of this scale as shown in Fig. 1 of Ref. [6].

Determination of the optimal scale
Following the procedure of Ref. [6], we use the parton cross section for the NLO subprocess qg → qγ * → qll differential in M 2 , in t and in the lepton transverse momenta. This is integrated in the two lepton variables, keeping the restriction Fig. 1 The Compton scattering diagram of the NLO Drell-Yan process: gluon A and quark B are the initial particles, resulting in a quark C and a photon, which in turn splits into a pair of leptons D and E. This diagram has a divergence in the t channel and it is the most relevant one at NLO for small x due to the gluon distribution For a given cutoff φ 0 , the green region corresponds to allowed values of angles φ > φ 0 , i.e. the part of phase space which is taken into account in the calculation. The red region is cut off, therefore, the events that are closer to the back-to-back configuration are the relevant (measured) ones in the azimuthal angle of φ > φ 0 : We also use the LO parton cross section convoluted with DGLAP g → qq splitting function [10] that does not have a dependence on the lepton variables: We equate both expressions (NLO vs. LO convoluted with DGLAP) and integrate in t. The infrared divergences cancel (the cut does not touch the divergence). There is a further integration in z = M 2 /ŝ with fixed M, accounting for an incoming gluon flux of 1/z, where the parton c.o.m. energy is √ŝ . Thus, we have an equation that can be used for finding the optimal scale, μ 0 .
In the next step, to calculate the cross section, we will use the factorized scheme: using the optimal scale, μ 0 , in the parton distribution appearing at leading order and also in the next-to-leading order coefficient, C NLO . By using the optimal scale μ 0 we include in the LO term all the NLO contributions which depends on factorization scale and enhanced by a large ln(1/x)that is, we resum inside the LO low-x PDF the terms [α s ln(μ F /M) ln(1/x)] n . Of course now, the first of these terms should not be taken into account at NLO to avoid double counting; this is done by setting μ F = μ 0 in C NLO . However, since there is a cutoff applied, it is necessary to take care of the situation of a parton that emits other partons during the evolution that may spoil the cutoff. In other words, we must take into account possible parton emissions from the optimal scale (μ 0 ) up to the hard scale ( √ŝ ) which give a supplementary transverse momentum to the dilepton. For example, a configuration in which the leptons are exactly back-toback (φ = π ) when the dilepton has no transverse momentum can be changed to another configuration like φ = π/2 if the dilepton is given the appropriate transverse momentum. We will do that at double logarithm accuracy. This situation is addressed by including Sudakov form factors that ensure that there will be no emission between the optimal scale μ 0 and √ŝ . This inclusion is detailed in Ref. [6], here we briefly recall that, in the double log approximation, the quark Sudakov factor is given by with where C F = 4/3 and, at leading order, √ŝ = M. Similarly, there is a Sudakov factor for the gluon. They enter the Eq. (4) as factors that multiply respective the parton distributions. Of course now we have to exclude the first term α s (ln 2 ( √ŝ /μ 0 ) − π 2 /4) from the C NLO expression to avoid the double counting.
One may argue that it is not clear how the Sudakov factors could be used with the angular cut, since they are traditionally used to account for no emission in a range of transverse momentum. First of all, the Sudakov factor depends on virtualities of single particles (as in the original paper [11]), not transverse momentum. This means that we can use them here, provided that we use as their arguments the inclusive scale M orŝ, where all possible dileptons are taken into account, and the optimal scale μ 0 with our cutoff. This is good at double Fig. 3 Optimal scale as a function of the azimuthal angle cutoff with and without Sudakov factors. We observe that for φ 0 > 0.7π the Sudakov factor does not affect much the factorization scale logarithm accuracy and corrections to it will appear only at NNLO. As shown in Ref. [3], the NNLO is rather small after the choice of the optimal scale and that justifies our approach. If we were to make completely sure that the cutoff was not spoiled by the PDF evolution, we would have to calculate this process to all orders or do a Monte Carlo evolution keeping track of all variables of intermediate partons, but we do not pursue this complicated approach.
In Fig. 3 the reduction is shown of the optimal scale with the cutoff for the cases without and with Sudakov form factor, for dilepton masses equal to 6 and 12 GeV. It starts with the case of no cut applied (φ 0 = 0) and ends in the most drastic case of φ 0 = π , where all phase space is cut off. In this range, the optimal scale varies from μ 0 = 1.45 M (no cutoff) to μ 0 = 0. From Fig. 3, we clearly see that, in the region which starts around φ 0 = 0.7π , Sudakov effects are not so important on the determination of the factorization scale. This is the most important region to study smaller scales, since μ 0 /M < 0.7 in this case. Then, we can investigate predictions of Drell-Yan cross section at smaller scales without worrying about a new theoretical uncertainty due to the Sudakov form factors.
After including into the LO term most of the NLO contribution, it would still be possible to use the parton distributions at a different scale μ 1 when computing the NLO contribution. Then the NNLO coefficient would depend on μ 0 and μ 1 and the idea would be to choose μ 1 in a way to make that almost all of the NNLO contribution would be already taken into account at lower orders. This would further reduce the scale uncertainty. We do not pursue such calculation here, but we argue, as first discussed in Ref. [3], that setting μ 1 = μ 0 already is a good choice, since the dominant diagram at small x at NNLO is the one with two gluons in the initial state and most of its contribution will be taken into account by correcting both quark and antiquark legs of the LO diagram with LO (and not NLO) DGLAP. Another possibility is to combine the azimuthal angle cutoff with the transverse momentum cutoff discussed in Ref. [6]. This would lower the optimal scale w.r.t. the application of a single cut, but we expect it will not be much lower. In fact, we expect that both cutoffs will be similar in the sense that a large part of the phase space is cut by the two cuts. For instance, the optimal scale for φ 0 = 0.85π is μ 0 = 0.44M; if we also cut the dilepton transverse momentum at k 0 = M, the optimal scale is still 0.44M within rounding error, if we set k 0 = M/2, we have μ 0 = 0.42M. In conclusion, applying both cuts should be weighted against the possible experimental difficulties when measuring this new cross section, depending on the setup it will be better to apply a single but stricter cut.

Predictions with an azimuthal angle cutoff
As described in Sects. 2 and 3, we are now in a position to lower the scale with an azimuthal angle cut and investigate the effects of the cutoff in cross section. We are interested in applying a cut for which Sudakov factors do not change much our results, φ 0 > 0.7π . A good choice will be φ 0 = 0.85π , for which the optimal scale is reduced to μ 0 = 0.44M. In Fig. 4, we show our predictions for the differential cross section in dilepton rapidity Y for the Drell-Yan process at LHC energy of √ s = 14 TeV. We use MMHT14 NLO PDFs [12] and set the dilepton mass equal to 6 and 12 GeV.
The upper curves in Fig. 4 correspond to the absence of any cutoff; therefore μ F = μ 0 = 1.45M. In this case, the scale at which the partons are probed is still larger than the usual choice μ F = M. The lower curves correspond to the cutoff φ 0 = 0.85π , for which we have a much lower scale (less than a third of 1.45M), but we still have a considerable Drell-Yan differential cross section given at two factorization scales μ F = μ 0 (black) and μ 0 /2 (red). The azimuthal angle cutoff φ 0 = 0.85π is imposed, with optimal scale at LO given by μ 0 = 0.44M. This shows that the remaining factorization scale uncertainty is greatly reduced cross section, as it can be seen that approximately 50% of the dileptons produced are kept.
We also calculate the 1σ error corridors coming from the PDF uncertainty, that, depending on Y , are rather large. The current precision of the measurements at the LHC is better than this PDF uncertainty, leading us to believe that a proper measurement of such observable would add new precise knowledge about the PDFs. In the next section we will see that the remaining factorization scale will be smaller than such bands.

Sensitivity of choice of factorization scale
We should now verify the behaviour of the cross section, Eq. (4) with respect to the remaining factorization scale dependence. Therefore, we set the scale at the LO PDF (μ F = μ 0 ) and in the NLO coefficient C NLO (μ 0 ), while varying the factorization scale, μ F , in the PDF multiplying the NLO contribution. We will investigate the central prediction μ F = μ 0 and also a smaller factorization scale μ F = μ 0 /2. Here we cannot use the larger μ F = 2μ 0 , because it would allow the DGLAP evolution to violate the cutoff. This would happen by the emission of partons with enough transverse momentum to produce a photon with some transverse momentum. Therefore, the dilepton will have to carry this momentum and the net effect will be a reduction of the azimuthal angle φ, putting, in the forbidden region, some events previously understood to be in the allowed region of φ.
In Fig. 5, we show the scale variation described above for the differential cross section in rapidity for M = 6 GeV and 12 GeV, setting the LHC energy to 14 TeV and, as an example, applying the azimuthal angle cutoff φ 0 = 0.85π with μ 0 = 0.44M. The renormalization scale is kept fixed at μ R = M. We can see that changing the factorization scale does not change much the results. Therefore, the role of optimal scale still holds and the uncertainty in the choice of scale is reduced.

Conclusion
In this work, we investigated the production of Drell-Yan dileptons at small x with a cutoff that excluded smaller values of the azimuthal angle φ < φ 0 . Following the prescription established in earlier work, we calculated the leadingorder optimal factorization scale using the dominant diagram at NLO, i.e., the gluon-quark Compton scattering. In doing so, the main theoretical uncertainty (factorization scale) was reduced, as can be seen for φ 0 = 0.85π at Fig. 5.
We provided the optimal scale as a function of the size of the cutoff φ 0 in Fig. 3. By introducing the cutoff, it was possible to lower the scale at which the parton distributions are probed, for example, μ 0 = 0.44M at φ 0 = 0.85π . In order to avoid the DGLAP evolution of the PDFs spoiling the proposed observable by the emission of a parton in the cutoff region, appropriate Sudakov factors were included. They changed the dependence of the optimal scale on φ 0 , but for φ 0 > 0.7π , the change of its absolute value is very small and therefore the optimal scale is quite robust regarding this correction.
Finally, we calculated the cross section of the discussed observable with φ 0 = 0.85π in Fig. 4, showing that indeed we will have a smaller cross section by a factor of about 2 when compared with the case without the cutoff. The uncertainty bands shown indicate that the determination of the parton distributions can be improved, since the uncertainty due to the factorization scale was greatly reduced.