Studies of gluon TMDs and their evolution using quarkonium-pair production at the LHC

J/ψ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J/\psi $$\end{document}- or Υ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Upsilon $$\end{document}-pair production at the LHC are promising processes to study the gluon transverse momentum distributions (TMDs) which remain very poorly known. In this article, we improve on previous results by including the TMD evolution in the computation of the observables such as the pair-transverse-momentum spectrum and asymmetries arising from the linear polarization of gluons inside unpolarized protons. We show that the azimuthal asymmetries generated by the gluon polarization are reduced compared to the tree level case but are still of measurable size (in the 5–10% range). Such asymmetries should be measurable in the available data sets of J/ψ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$J/\psi $$\end{document} pairs and in the future data sets of the high-luminosity LHC for Υ\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Upsilon $$\end{document} pairs.


Introduction
The three-dimensional structure of the composite hadrons has widely been analyzed through the study of transversemomentum dependent parton distribution functions (TMDs) in the framework of TMD factorization. The various TMDs can be accessed in hadronic processes with a small transverse momentum (TM), denoted by q T , of the detected final state [1][2][3]. TMDs need to be extracted from experimental data for such processes as they are intrinsically nonperturbative objects and therefore cannot be computed using perturbative QCD. So far, the majority of data allowing for the extraction of TMDs have been acquired from SIDIS and Drell-Yan measurements, two experimentally accessible processes and for which TMD factorization was proved to hold [4][5][6]. However, since such processes are primarily induced by a e-mail: scarpaflorent@ipno.in2p3.fr quarks/antiquarks, they mostly provide information about the quark TMDs. Currently our knowledge of gluon TMDs is still very limited, due to the lack of data on processes that could potentially be used for extractions. More specifically, gluons inside unpolarized protons can be described at leading twist using two TMDs [7]. The first one describes unpolarized gluons, while the second one describes linearly polarized gluons. The latter correlates the spin of the gluons with their TM, and thus requires non-zero gluon TM. The presence of polarized gluons inside the unpolarized proton has effects on the cross-sections, such as modifications of the TM-spectrum and azimuthal asymmetries.
Several processes have been proposed to extract gluon TMDs, see e.g. . Associated quarkonium production (see [37] for a recent review) has in particular a great potential to probe the gluon TMDs at the LHC, e.g. quarkonium plus photon (Q+γ ) or quarkonium-pair production. They mainly originate from gluon fusion, and can be produced via a colorsinglet transition, avoiding then possible TMD-factorizationbreaking effects [38][39][40]. This is indeed our working hypothesis, which is based on the fact that the produced systems are very small color dipoles, produced directly at the hard scattering and thus dominated by the color-singlet configuration. Moreover, isolation cuts may be used to reduce the risk of factorization breaking problems. In any case, comparing di-J/ψ and di-ϒ with γ * γ * results would provide in the longer run a good way to identify possible factorization breaking effects.
Some quarkonium states, like the J/ψ meson, are easily detected and a large number of events can be recorded. Processes with two particles in the final state offer some interesting advantages compared to those with a single detected particle. Since the TM of the final state needs to be small for the cross-section to be sensitive to TMD effects, oneparticle final states are bound to stay close to the beam axis and therefore difficult to detect, as the background level is high and triggering is complicated. However, two particles that are nearly back-to-back can each have large individual transverse momenta that add up to a small one. Indeed, in general a pair of particles can have a large invariant mass and a small TM. Whereas the hard scale in a one-particle final state is only its mass, and is thus constant, the invariant mass of a two-particle final state can be tuned with their individual momenta. This allows one to study the scale evolution of the TMDs. Finally, a two-particle final state allows one to define the azimuthal angle between these two particles, hence to look for various azimuthal asymmetries. These are in fact associated to specific convolutions of gluon TMDs.
It was thus recently proposed [30] to probe the gluon TMDs using quarkonium-pair production at the LHC, and more specifically J/ψ + J/ψ production. Such a process has already been measured by LHCb, CMS and ATLAS at the LHC, as well as by D0 at the Tevatron [41][42][43][44][45]. Also it has recently been considered within the parton Reggeization approach [46]. The size of some azimuthal asymmetries associated with the linearly polarized gluon distribution are nearly maximum in this process. In [30], the unpolarizedgluon distribution was modelled by a simple Gaussian as a function of the gluon TM. In order to see the maximal effect of the linearly-polarized gluons on the yields, their distribution was taken to saturate its positivity bound [7]. The size of the resulting maximum asymmetries was found to be very large, especially at large pair invariant mass, M QQ . Yet, more realistic estimates of the asymmetries require the inclusion of higher-order corrections in α s through TMD QCD evolution [21,[47][48][49].
Very recently, a TMD-factorization proof has been established for pseudoscalar η c,b hadro-production at low TM [50] (see also [51]). To date, this is the only one for quarkonium hadroproduction. It was pointed out that new hadronic matrix elements are involved for quarkonium production at low TM, in addition to the TMDs. These encode the soft physics of the process. It is not known how much these new hadronic matrix elements impact the phenomenology. In this context, we build on the previous work [30] by adding TMD evolution effects to the gluon TMDs. Such evolution effects are expected to play a significant role (see e.g. [21]) and should in any case be specifically analyzed. We will proceed like in previous studies for H 0 production [9,18,21].
In this article, we first discuss the characteristics of quarkonium-pair production at the LHC within the TMD framework, as well as the associated cross-section and observables sensitive to the gluon TMDs. We then detail the evolution formalism used in our computations and the resulting expressions for the TMD convolutions. Finally, we present our results for the P QQT -spectrum and the azimuthal asymmetries for J/ψ-pair production at the LHC as well as azimuthal asymmetries for ϒ-pair production.

Q-pair production within TMD factorization
2.1 TMD factorization description of the process TMD factorization extends collinear factorization by taking into account the intrinsic TM of the partons, usually denoted by k T . As in collinear factorization, the hard-scattering amplitude, which can be perturbatively computed, is multiplied by parton correlators that can be parametrized in terms of parton distribution functions, but in this case k T dependent. The parametrization of parton correlators is an extension from that used in collinear factorization, not only because of the k T dependence of the distribution functions, but also because there are more distributions. The gluon correlator inside an unpolarized proton with momentum P and mass M p , denoted by μν g (x, k T ) [7,52,53], can be parametrized in terms of two independent TMDs. The first one is the distribution of unpolarized gluons f Here the gluon 4-momentum is written using a Sudakov decomposition: k = x P + k T + k − n (where n is any light-like vector (n 2 = 0) such that n · P = 0), where k 2 T = −k 2 T and the transverse metric is g μν T = g μν − (P μ n ν + P ν n μ )/P·n. For TMD factorization to hold, the hard scale of the process should be much larger than the pair TM, q T .
The process we are interested in is the fusion of two gluons coming from two colliding unpolarized protons, leading to the production of a pair of vector S-wave quarkonia: g(k 1 )+g(k 2 ) → Q(P Q,1 )+Q(P Q,2 ) . The cross-section for this reaction involves the contraction of two gluon correlators [30], μν g (x 1 , k 1T ) and ρσ g (x 2 , k 2T ), with the squared amplitude M μρ (M νσ ) * of the partonic scattering, integrated over the gluon momenta. The expression of the tree-level partonic amplitude M is available in Ref. [54], although the earliest computations date back to 1983 [55,56]. The hadronization process, i.e. the transition from a heavy-quark pair to a quarkonium bound state, is described in our study using the color singlet model (CSM) [57][58][59] or in this case equivalently non-relativistic QCD (NRQCD) [60] at LO in the velocity v of the heavy quarks in the bound-state rest frame. Figure 1 represents the complete reaction with a typical Feynman diagram depicting the partonic subprocess.

Other contributions to quarkonium-pair production
The leading contribution to the hadronization of a Q Q pair into a bound state in NRQCD is the color-singlet (CS) tran- Fig. 1 Representative Feynman graph for p(P 1 )+ p(P 2 ) → Q(P Q,1 )+Q(P Q,2 )+X via gluon fusion at LO in TMD factorisation sition, for which the perturbatively-produced heavy-quark pair has the same quantum numbers as the quarkonium and directly binds without any extra soft interaction. Corrections to this leading contribution involving higher color-octet (CO) fock states are suppressed by powers of v, which is meant to be much smaller than unity for heavy quarkonia.
The CS over CO dominance normally follows from this power suppression in v encoded in the so-called NRQCD long distance matrix elements (LDMEs). More precisely one expects a relative suppression on the order of v 4 [60][61][62] (see [37,[63][64][65] for reviews) per quarkonium. For di-J/ψ production with v 2 c 0.25 and for which both the CO and the CS yields are produced at α 4 s , the CO/CS yield ratio, which thus scales as v 8 c , likely lies below the percent level. Explicit computations [66][67][68][69][70] indeed show corrections from the CO states below the percent level except in some corners of the phase space (e.g. large rapidity separation Δy) where some CO contributions can be kinematically enhanced, but these can safely be avoided with appropriate kinematical cuts. More details can be found in [70].
It is important for the applicability of TMD factorization that the CS contributions dominate. Soft gluon interactions between the hadrons and a colored initial or final state of the hard scattering can be encapsulated within the definition of the TMD through the use of Wilson lines. However, if both initial and final states are subject to soft gluon interactions, the resulting color entanglement may break TMD factorization [38][39][40]. The dominance of the CS contributions should therefore be ensured.
It is also important to take into account α s corrections. In the TMD region, P QQT M QQ , these introduce a renormalization scale (μ) dependence in the TMD correlators and a rapidity scale ζ dependence [4][5][6]. At larger P QQT one has to match onto the collinear factorization expression (see e.g. [71]), which is calculated by taking real-gluon emissions into account [68,[72][73][74]. At finite P QQT , such single realgluon emissions occur at α 5 s and the quarkonium pair effectively recoils against this hard gluon, increasing the pair TM. In this paper we will restrict to P QQT < M QQ /2 in order to stay away from the matching region.
Thus far, we have focused our discussion on the single parton scattering (SPS) case. However, since we look at a twoparticle final state, we should also consider the case where the quarkonia are created in two separate hard scatterings, i.e. double parton scattering (DPS). At LHC energies, the gluon densities are typically high and the likelihood for two hard gluon fusions to take place during the same protonproton scattering cannot be neglected.
In the case of di-J/ψ, it has been already anticipated in 2011 [75] that DPS contributions may be dominant at large rapidity difference Δy (thus large invariant masses with same individual TM). This was corroborated [68] by the CMS data [42] with an excess above the SPS predictions at large Δy. ATLAS further [45] confirmed the DPS relevance in di-J/ψ production with a dedicated DPS study. One expects 1 [45,68] that DPS contributions lie below 10% for Δy ∼ 0 in the CMS and ATLAS samples (characterized by a P QT cut away from the threshold M ψψ 2M ψ ) and that they only matter at large Δy. However, in the LHCb acceptance (where M ψψ 2M ψ ), DPS contributions cannot a priori be neglected, but can be subtracted [44] if one assumes that, for the DPS sample, the kinematics of both J/ψ's is uncorrelated. This yield a precise (yet, unnormalized) prediction of the kinematical distributions.
Another source of quarkonium pairs is the feed-down from excited states. For J/ψ-pair production, the main feed-down sources are the χ c and ψ . The feed-down from χ c is expected to be small, as gg → J/ψ + χ c and gg → ψ + χ c are suppressed [68] at LO by C-parity and the vanishing of the χ c wave function at the origin, and gg → χ c + χ c is suppressed by the squared χ c → J/ψ branching ratio. The production of a ψ with a J/ψ likely contributes [37] 50% of the J/ψ-pair samples, owing to the large branching ratio for ψ → J/ψ (O(60)%) and symmetry factors. Yet, ψ + J/ψ pairs are 1 Theory DPS studies advance [76][77][78][79], but not yet as to provide quantitative inputs to predict DPS cross sections as done for SPS. As such, one usually assumes the DPS contributions to be independent. This justifies factorizing the DPS cross section into individual ones with an (inverse) proportionality factor, referred to as an effective cross-section σ eff . Under this assumption, σ eff should be process independent, encoding the magnitude of the parton interaction. σ eff needs to be experimentally extracted as it is a nonperturbative quantity. This is the standard procedure at LHC energies [80][81][82][83][84][85][86]. Ideally, a single precise extraction of σ eff should suffice to provide predictions for any DPS cross section under this factorized Ansatz. Yet, the current extraction seems to differ [87] with values ranging from 25 mb down to a few mb, which forces us to restrict to qualitative considerations. produced exactly like J/ψ + J/ψ pairs and thus generate the same TMD observables. 2 In the case of ϒ-pair production, the main feed-down is from ϒ(2,3S). According to Ref. [88], less than 30% of the produced pairs would originate from feed-down at √ s = 8 TeV. As in the J/ψ case, C-parity suppresses the ϒ + χ b reaction at leading order, and χ b + χ b is suppressed by the squared branching ratio. Regarding the CO contributions, the relative velocity of the quarks inside the ϒ is smaller than for the J/ψ, meaning the NRQCD expansion used to describe the hadronization has a better convergence. Therefore it is highly unlikely that the CO channels overcome the CS ones in the reachable phase space. The fraction of DPS events is also expected to be less than 5% at low P QQT and central rapidity [37], making it an overall cleaner process.

The TMD differential cross-section
The general structure of the TMD-based differential crosssection describing quarkonium-pair production from gluon fusion reads [30]: with dΩ = dcos θ CS dφ CS , {θ CS , φ CS } being the Collins-Soper (CS) angles [89], and Y QQ is the rapidity of the pair.
Here P QQT (≡ q T ) and Y QQ are defined in the hadron c.m.s. The quarkonia move along (in the opposite direction) e = (sin θ CS cos φ CS , sin θ CS sin φ CS , cos θ CS ) in the CS frame. The kinematical pre-factor is specific to the mass of the quarkonia and the considered differential cross-sections, while the hard-scattering coefficients F i only depend on θ CS and the invariant mass of the system, here M QQ . Their expression for quarkonium-pair production can be found at tree level in Ref. [30]. When P QT M Q , small values of cos θ CS correspond to small values of Δy in the hadron c.m.s. 2 Up to the small kinematical shift due to the decay which we neglect in what follows.
The TMD convolutions appearing in Eq. (1) are defined as follows: where w(k 1T , k 2T ) denotes a TMD weight. The weights in Eq. (1) are common to all gluon-fusion processes originating from unpolarized proton collisions. They can be found in Ref. [25]. Our aim in the present study is to study the impact of QCD evolution effects in the above TMD convolutions.
Having at our disposal the computation of the hard-scattering coefficients, the measurements of differential yields in principle allow one to extract these TMD convolutions evolved up to the natural scale of the process, on the order of M QQ here.
In practice, one looks at specific observables sensitive to these convolutions. First we note that when the cross-section is integrated over the azimuthal angle φ CS , the terms with a cos(2,4φ CS )-dependence drop out from Eq. (1) such that giving direct access to C f . Furthermore, one can define, at fixed {Y, P QQT , θ CS , M QQ }, cos(nφ CS )-weighted differential cross-sections, integrated over φ CS and normalized by their azimuthallyindependent component: Such a variable, computed for n = 2 or 4 in our case, corresponds to (half of) the relative size of the cos(2,4φ CS )modulations present in the TMD cross-section in comparison to its φ CS -independent component: When cos nφ CS is computed within a range of M QQ , Y QQ , P QQT or cos(θ CS ), we define it as the ratio of corresponding integrals. Of course, the range in P QQT should be such that one remains in the TMD region, i.e. P QQT M QQ .
For positive Gaussian h ⊥ g 1 the cos(2φ C S ) asymmetry will be positive (note that in Ref. [30] the cos(2φ C S ) plots miss an overall minus sign).

TMD evolution formalism
TMD evolution has been considered in an increasing number of TMD observables. It is usually implemented by Fourier transforming to b T -space, with b T being the conjugate variable to P QQT . When evolution effects are considered, the TMDs acquire a dependence on two scales: a renormalization scale μ and a rapidity scale ζ (whose evolution is governed by the Collins-Soper equation). Below we present in a simple way the results needed to perform the TMD evolution. For more details, we refer to e.g. [21,[47][48][49].
When TMD evolution is incorporated to the gluon TMDs in the tree-level result in Eq. (1), the convolutions take the form where the two rapidity scales should fulfill the constraint While the renormalization scale μ in the hardscattering coefficients F i should be set here to μ ∼ M QQ in order to avoid large logarithms, the TMDs should be evaluated at their natural scale μ ∼ , in order to minimize both logarithms of μb T and ζ b 2 T , and then evolved up to μ ∼ √ ζ ∼ M QQ . The solution of the evolution equations results in the introduction of the following Sudakov factor S A : and the perturbative Sudakov factor (applicable for sufficiently small b T ) is given by We consider here the resummation at next-to-leadinglogarithmic accuracy, for which the Collins-Soper kernel D and the non-cusp anomalous dimension γ need to be taken at leading-order, while the cusp anomalous dimension at next-to-leading-order (see [90] for a recent detailed analysis of the two-dimensional evolution of TMDs). The perturbative Sudakov factor then takes the form with C A = 3, T f = 1/2 and n f the number of flavors (we will use n f = 4 for di-J/ψ and n f = 5 for di-ϒ production).
The running of α s is implemented at one loop. We note that the Sudakov factor S A is spin independent, and thus the same for all (un)polarized TMDs [21,49]. The perturbative component of the TMDs for small b T can be computed at a given order in α s . At leading order,f g 1 is given by the integrated PDF: As said above, h ⊥ g 1 describes the correlation between the gluon polarization and its TM (k T ) inside the unpolarized proton. It requires a helicity flip and therefore an additional gluon exchange. Consequently, its perturbative expansion starts at O(α s ) [9] (the NLO result was recently obtained in Ref. [91]): The above equations in principle allow one to derive a perturbative expression of these TMDs. However, they are strictly applicable only in a restricted b T range, whereas we need an expression for them from small to large b T in order to perform the corresponding Fourier transform.
For large b T , one indeed leaves the domain of perturbation theory. On the contrary, when b T gets too small, μ b becomes larger than M QQ and the evolution should stop. The above perturbative expression for the Sudakov factor should thus not be used as it is.
One of the common solutions to continue to use the above expressions consists in replacing b T by a function of b T which freezes in both these limits such that one is not sensitive to the physics there. For our numerical studies we use the following b T prescription [92]: where . This prescription is of course not unique, as it entails e.g. some particular assumptions on the transition from the hard to the soft regime. The ambiguity in the choice of this prescription can however be absorbed in the nonperturbative modelling of the TMDs, anyhow needed in the large b T region, which we discuss next.
Schematically each TMD convolution can be written in b T -space as for some integers n and m. HereW is a simple product of Fourier-transformed TMDs. The nonperturbative Sudakov factor S NP is now defined throughW is perturbatively calculable for all b T values. The value of b T max in Eq. (14) (roughly) sets the separation between the perturbative and nonperturbative domains. Its optimal value depends on many factors, such as the functional form chosen for b * T and the parametrization of the nonperturbative Sudakov factor S NP . For our numerical studies we take b T max = 1.5 GeV −1 , inspired by previous fits from Drell-Yan and W, Z production [93][94][95][96][97].
The functional form of S NP has been subject of debate, but is usually chosen to be proportional to b 2 T for all b T . By definition e −S NP (b T ,Q) has to be equal to 1 for b T = 0 and for large b T it has to vanish, at the very least to ensure convergence of the results. It is usually assumed to be a monotonically decreasing function of b T and its change from 1 to 0 is assumed to happen within the confinement distance. Lacking experimental constraints, here we will assume a simple Gaussian form (of varying widths). In order to assess the importance of the nonperturbative Sudakov factor for the size of the asymmetries and to perform a first error estimate, we consider several functions. For this purpose, we take a simple formula for the nonperturbative Sudakov factor that encapsulates the expected M QQ -dependence [98] and the assumed b T -Gaussian behavior: From this nonperturbative Sudakov factors a value b T lim is defined at which e −S NP becomes negligible, to be specific, where it becomes ∼ 10 −3 . From this we furthermore define a corresponding characteristic radius r = 1 2 b T lim (considering b T lim the diameter, since it is conjugate to P QQT = k 1T + k 2T ), which delimits the range over which the interactions occur from the center of the proton. To estimate the uncertainty associated with the largely unknown nonperturbative Sudakov factor, we will consider three cases: b T lim = 2, 4 and 8 GeV −1 . This spans roughly from b T max = 1.5 GeV −1 to the charge radius of the proton, thus giving a generous but sensible estimation of the nonperturbative uncertainty. The corresponding values of the parameter A and r for M QQ = 12 GeV are given in Table 1.
The value M QQ = 12 GeV is considered because the ratio F 3 /F 1 peaks there (for J/ψ pair production), but we will also consider larger values later on. When M QQ increases, the interaction radius r decreases. Figure 2 depicts e −S NP as a function of b T for the three values of A previously mentioned and for M QQ ranging from 12 to 30 GeV.
We point out that the nonperturbative Sudakov factor as fitted by Aybat and Rogers [95] to low-energy SIDIS as well as high-energy Drell-Yan and Z 0 production data, rescaled by a color factor C A /C F to account for the different color representation between quarks and gluons, is very close to the case b T lim = 2 GeV −1 . It is also very close to the Fourier transform of the Gaussian model for f g 1 (x, k 2 T ) with k 2 T = 3.3 ± 0.8 GeV 2 as extracted in Ref. [30] from a LO fit to J/ψ-pair-production data from LHCb [44] from which the DPS contributions was however approximately subtracted. We end this section by providing the expressions for the TMD convolutions in b T -space, which we actually use in the numerical predictions in the next section: 4 The TM spectrum and the azimuthal asymmetries

J/ψ-pair production
As said, after integration over the azimuthal angle φ C S , one gets to a good approximation dσ/dq T In Fig. 3a we compare q T C[ f g 1 f g 1 ] evaluated using the nonevolved Gaussian TMD model of [30] with the evolved TMD computed along the lines described in the previous section (a) (b) Fig. 3 (a) The normalised P QQT -spectrum for J/ψ-pair production at M ψψ = 8 GeV using two gluon TMDs. The first is a Gaussian Ansatz with k 2 T = 3.3 ± 0.8 GeV 2 obtained from the LHCb data [30] (the red curve shows the central value and the gray band the associated uncertainty). The second is the result of our present study with TMD evolution. The green band results from the uncertainty on the b T -width of the nonperturbative Sudakov factor S NP . The estimated DPS contribution has been subtracted from the LHCb data (black crosses) which were also normalized over the interval. (b) The P QQT -spectrum using our evolved gluon TMDs at M QQ = 12, 20 and 30 GeV for the same uncertainty on the b T -width for M QQ = 8 GeV using the range of b T lim between 2 and 8 GeV −1 . The main difference one can observe is the broadening of the P QQT -spectrum when including evolution effects. The curves are given as functions of P QQT in the range from 0 up to M QQ /2, to be in the validity range of TMD factorization.
The momentum fractions of the initial gluons, x 1 and x 2 , are both fixed to 10 −3 . Varying the momentum fractions does not have any significant impact on the shape of the P QQTspectrum or the azimuthal asymmetries. The size of the asym-metries varies by a few percent with x. As such variations do not change the conclusions of our analysis, we will keep the values x 1 = x 2 = 10 −3 throughout this paper. This is also convenient for an experimental study, as a binning of the data in Y QQ is not necessary to be able to compare them with predictions.
In Fig. 3b, we show evolved results for M QQ = 12, 20 and 30 GeV within the same b T lim range as Fig. 3b. The broadening of the P QQT -spectrum for increasing M QQ is then explicit.
The azimuthal asymmetries presented in Eq. (5) depend on a rather complex ratio of TMD convolutions and hardscattering coefficients. In the case of J/ψ-pair production, these expressions simplify for several reasons. The first one, already mentioned previously, is that because F 2 is small, the denominator can be approximated to be Moreover, because of the symmetry of the final state, one finds the coefficients F 3 and F 3 to be equal, simplifying the numerator of cos(2φ C S ) to be . Finally, when one takes the initial-parton-momentum fractions to be equal, i.e. x 1 = x 2 , these two convolutions become equal as well. Since the P QQT -dependence of the cross-section is contained inside the convolutions, the P QQT -dependence of the asymmetries can be studied via the convolution ratios for cos(2φ C S ) and cos(4φ C S ) , respectively.
The difference between both convolutions depends on the kind of TMDs they contain, but also the type of Bessel function generated by the angular integral and the weights. Becauseh ⊥ g 1 is of order α s , it is naturally suppressed in comparison to f g 1 . Moreover, α s (μ b ) is growing with b T (up to its bound α s (b 0 /b T max )) andh ⊥ g 1 is also broader in b T than f g 1 . The presence ofh ⊥ g 1 in a given convolution therefore contributes to reduce the magnitude of the integrand, and to its b T -broadening. These effects contribute to strongly suppress is of order α 2 s and its integrand is significantly broadened in b T , meaning it falls faster than C f g 1 f g 1 with increasing P QQT . Indeed, as a consequence of the b T -broadening, more oscillations of the J 0 Bessel function occur in the integrand of than of C f g 1 f g 1 , before being dampened by the Sudakov factors at large b T . Each additional oscillation in the integrand brings the convolution value closer to zero. More oscillations are packed in a given b T -range when P QQT increases, widening the gap between the two convolutions, and effectively making the ratio fall with P QQT . This additional effect renders the F 2 C w 2 h ⊥ g 1 h ⊥ g 1 term truly negligible in the cross-section for J/ψ-pair production. It also means that in other processes where the hard-scattering coefficient F 2 may be large, the convolution itself would remain relatively small at scales larger than a few GeV. Besides, its influence on the cross-section will be strongest at the smallest TM.
The situation is different for the azimuthal asymmetries, which involve convolutions in the numerator that contain either the J 2 or J 4 Bessel functions. Such functions are 0 at b T =0 and then grow in magnitude. The consequence is that the b T -integrals containing such functions benefit from unsuppressed intermediate b T values. At some point, undampened large-b T oscillations will bring the integral value down toward 0 in a similar way as for C f convolutions first grow with P QQT up to a peak maximum, and then decrease in value like C f g 1 f g 1 does. Another crucial difference is that the envelopes of J 2 and J 4 tend slower toward 0 than the J 0 one with increasing b T . The consequence is that C w 3 f Hence the convolution ratios, and the azimuthal asymmetries, always grow with P QQT , as can be seen in Fig. 4. In addition, as the large b T values are less suppressed than in C f g 1 f g 1 , the azimuthal asymmetries are also more sensitive to the variations of the nonperturbative Sudakov S NP . The effect is more pronounced for C w 4 h ⊥ g twice and a broader Bessel function. Figure 4b displays the cos(2φ C S ) asymmetry as a function of P QQT in the forward single J/ψ rapidity region (larger cos(θ C S )) while 4c displays the cos(4φ C S ) asymmetry in the central rapidity region (small cos(θ C S ) with x 1 x 2 ). Such choices maximize the size of the asymmetries as the associated hard-scattering coefficients are larger in these regions, without modifying the shapes of the asymmetries in P QQT (see [30] for a comparison between the two rapidity regions for each asymmetry). The uncertainty band associated with the width of S NP narrows with increasing M QQ as in Fig. 3; the uncertainty remains larger for The curves for b T lim = 8 GeV −1 (large dashes) are quite close to the ones using b T lim = 4 GeV −1 (solid line). Indeed, when S NP is already significantly wider than S A , an additional increase in its width will not affect the asymmetries anymore. Both convolutions in the ratios are larger with a wide nonperturbative Sudakov factor, yet this benefits the numerator C w 3 f

(c) (d)
We recall that the size of the asymmetries is also influenced by the ratio of the hard-scattering coefficients which are M QQ -dependent. F 3 /F 1 peaks around M QQ = 12 GeV which explains why the cos(2φ C S ) asymmetry is largest near this value. As discussed in Ref. [30], the ratio F 4 /F 1 keeps growing with M QQ , approaching 1 at sufficiently large values. Yet the cos(4φ C S ) asymmetry gets smaller with larger M QQ . This can be better seen in Fig. 5 which depicts the same asymmetries as functions of M QQ = M ψψ .
One first observes that, at large M QQ , the growth of the asymmetries with P QQT is slower. Indeed, in such a situation, the Sudakov factors broaden the P QQT -shapes of the convolutions, hence the ratio varies slower. This slower increase is compensated by the fact that larger values of M QQ allow for an extended growth of the asymmetry over a greater P QQT -range of validity for the TMD formalism. Secondly, the convolution ratios at a fixed value of P QQT also evolve with M QQ . The computable M QQ -dependence is encoded in the perturbative Sudakov factor S A , while S NP is also logarithmically varying with M QQ [98]. Both S A and S NP get narrower in b T with increasing M QQ , leading to a decrease of the value of the convolutions. . However the azimuthal asymmetries also depend on the evolution with M QQ of the hard-scattering coefficients ratios. Since F 4 /F 1 keeps growing while F 3 /F 1 falls after peaking at M QQ 12 GeV, cos(4φ C S ) will actually decrease slower than cos(2φ C S ) .
The large variations of the width of S NP generate moderate uncertainties on the size of the asymmetries. The latter, although consequently smaller than when computed in a bound-saturating model [30], still reach reasonable sizes, up to 5%-10%. We used the same nonperturbative Sudakov factor for all TMD convolutions in these computations, but the M QQ -independent part is actually expected to be nonuniversal. We checked that individually changing the width of S NP within the b T lim -ranges used in this study inside the different types of convolutions, does not bring any significant modification on the observables.
So far, there are still no experimental data allowing for an extraction of the gluon TMDs inside unpolarized protons.

ϒ-pair production
It is also of interest to look at ϒ-pair production. The partonic subprocess is identical to that of di-J/ψ production. In the non-relativistic limit, where M ϒ = 2m b , the main difference comes from the mass of the heavy quark. We note that the value of the non-relativistic wave function at the origin (or equivalently the NRQCD LDME for the CS transition) also differs but cancels in the ratios which we consider. The feed-down pattern is also clearly different. However, as announced, we will neglect the resulting (small) feed-down effects.
Owing to this larger mass, such a process probes the evolution at generally higher scales. The coupling constant α s is also smaller which increases the precision of the perturbative expansion. Higher scales also mean that the process is less sensitive to the (large b T ) nonperturbative behavior of the gluon TMDs. Hence, it is also less affected by the uncertainties associated with this unconstrained component.
On the experimental side, ϒ-pair production is admittedly a rare process. Yet, it starts to be accessible at the LHC. The first analysis by the CMS collaboration at √ s = 8 TeV only comprised a 40-event sample [99] but a second one is forthcoming. During the future high luminosity LHC runs, it will definitely be possible to record a sufficient number of events for a TMD analysis of both the P QQT and azimuthal dependences of the yield. Figure 6 depicts the azimuthal modulations for ϒ-pair production as functions of P QQT up to M QQ /2, for values of M QQ of 30, 40 and 50 GeV.
The uncertainty bands associated with the width of S NP are clearly narrower than in the J/ψ case. The cos(2φ C S ) asymmetry in Fig. 6b reaches 10% at M QQ = 40 GeV, which is the value for which the corresponding hard-scattering coefficient ratio F 3 /F 1 peaks for ϒ-pair production. Moreover, the decrease of the hard-scattering coefficient past the peak is slower, allowing the asymmetry to remain of similar size at M QQ = 40 and 50 GeV. (c) (d) 5 Conclusions In this paper we discussed the potential of double J/ψ and ϒ production for the study of the gluon TMDs inside unpolarized protons at the LHC. We presented the advantages of quarkonia as probes of these TMDs. We improved on previous results [30] by including TMD evolution effects, rendering the results more realistic and effectively taking into account QCD corrections that describe the evolution with the invariant mass M QQ of the quarkonium pair. We used a simple b T -Gaussian of variable width to parametrize the nonperturbative Sudakov factor S NP in order to estimate how important its impact is on the predicted yield and asymmetries, as it currently remains unconstrained in the gluon case. We discussed the broadening of the P QQT -spectrum due to the evolution in the case of double-J/ψ production, as well as the uncertainty associated with a variation of the width of S NP between 2 and 8 GeV −1 . As expected, we found that its influence decreases at large M QQ as the perturbative component of the TMDs becomes dominant. We also computed the cos(2,4φ C S ) asymmetries as functions of P QQT and M QQ . We found a notable suppression of the asymmetries in comparison to [30], caused by the fact that h ⊥ g 1 appears at order α s in the evolution formalism. We nevertheless found that such asymmetries still reach reasonable sizes for larger P QQT values and could be observed in the events already collected and to be recorded in the future. We found that the size of the asymmetries increases with P QQT . Such a behavior is explained by the relative slower fall in P QQT of the TMD convolutions containing h ⊥ g 1 . TMD factorization needs to be matched onto its collinear counterpart when P QQT approaches M QQ . Since the latter generates no asymmetries at leading twist, a Y -term becomes necessary at some point in order to neutralize the growth of the asymmetries and force them toward zero. We also observed that, in spite of the hard-scattering coefficient ratio F 4 /F 1 approaching 1 at large energy, the cos(4φ) asymmetry actually falls with M QQ .
Overall we conclude that J/ψ-pair production is a promising process to measure azimuthal asymmetries related to gluon TMDs as well as the effect of the evolution on the P QQT -spectrum. The energy threshold for this process is relatively low, making it sensitive to the nonperturbative component of the TMDs. The large event sample to be collected by the different collaborations at the LHC should give enough statistics to constrain them. ϒ-pair production presents the interesting opportunity to measure sizeable asymmetries at scales where perturbative contributions dominate, with a reduced necessity to include higher-order corrections. We also presented predictions for the asymmetries as functions of P QQT for ϒ-pair production. With sufficient data to come, it would allow for a complementary extraction of the gluon TMDs, while the expected size of asymmetries remain similar. Although ϒ pairs remain extremely rare at the LHC, the future high-luminosity runs will make it possible to acquire enough statistics.
Accessing information about the gluon TMDs can thus already be done at the LHC using quarkonium production, although more efforts in the direction of Ref. [50] are needed in order to obtain rigorous factorization theorems and expressions beyond tree level. It would give us a preview of what we can expect to find at a future electron-ion collider [100] or fixed-target experiments at the LHC [101][102][103][104][105], where these distributions should be accessible through different reactions. Because of the fundamental differences in these experimental setups, it is of great interest to measure the same TMDs using both of them, in order to be able to check fundamental predictions of the formalism such as the evolution and the universality.
Acknowledgements The work of MS was in part supported within the framework of the TMD Topical Collaboration and that of FS and JPL by the CNRS-IN2P3 project TMD@NLO. This project is also supported by the European Union's Horizon 2020 research and innovation programme under Grant agreement no. 824093. MGE is supported by the Marie Skłodowska-Curie Grant GlueCore (Grant agreement no. 793896).

Data Availability Statement
This manuscript has no associated data or the data will not be deposited. [Authors' comment: All data (numbers and plots) generated in our study have been included in this paper. We do not have additional data to show.] Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecomm ons.org/licenses/by/4.0/. Funded by SCOAP 3 .