Interplay of the CGC and TMD frameworks to all orders in kinematic twist

A framework for an improved TMD (iTMD) factorization scheme at small x, involving off-shell perturbative subamplitudes, was recently developed as an interpolation between the TMD kt ≪ Q regime and the BFKL kt ∼ Q regime. In this article, we study the relation between CGC and iTMD amplitudes. We first show how the dipole-size expansion of CGC amplitudes resembles the twist expansion of a TMD amplitude. Then, by isolating kinematic twists, we prove that iTMD amplitudes are obtained with infinite kinematic twist accuracy by simply getting rid of all genuine twist contributions in a CGC amplitude. Finally we compare the amplitudes obtained via a proper kinematic twist expansion to those obtained via a more standard dilute expansion to show the relation between the iTMD framework and the dilute low x framework. This allows to reinterpret the dilute expansion as a Wandzura-Wilczek approximation in the large kt regime.


JHEP05(2019)156
6 From the generic CGC process to the specific cases 20 6.1 q → qg channel 21 6.2 g → qq channel 23 6.3 g → gg channel 25 6.4 γ → qq channel 28 6.5 q → qγ channel 30 1 Introduction Factorization is one of the most crucial features of QCD: all perturbative QCD studies rely on this separation between a hard partonic subamplitude and long distance matrix elements. This separation is justified in the presence of a sufficiently large scale Q in the JHEP05(2019)156 observable, for which α s (Q) is small enough for perturbation theory to apply. However large logarithms can arise from QCD dynamics and compensate the smallness of α s (Q), which makes the resummation of such logarithms necessary. For most observables, two different factorization schemes can be employed, depending on the center-of-mass energy s of the process. For processes with the center-of-mass energy comparable to the large scale of the process (s ∼ Q), collinear factorization is applied and the large log(Q) terms are resummed via the Dokshitzer-Gribov-Lipatov-Altarelli-Parisi (DGLAP) evolution equations [1][2][3]. On the other hand, processes with the center-of-mass energy much larger than any other scale (s ≫ Q) are treated in the so-called low-x regime. In this case, k t -factorization applies and large log(s) terms are resummed.
Several descriptions of k T -factorization for low-x physics have been developed over the last couple of decades, starting with the well known Balitsky-Fadin-Lipatov-Kuraev (BFKL) framework [4,5]. The most recent low-x frameworks, namely the dipole model [6][7][8] and the shockwave framework [9][10][11] rely on a semi-classical approach, where low x gluon fields are treated as external fields. With such a treatment, all interactions with the external field can be resummed into path-ordered Wilson line operators which then constitute the building blocks of these low-x formalisms. Remarkably, due to this resummation of all interactions, perturbative results from this framework were found to be compatible with previous results for the semi-classical treatment of scattering off dense targets [12][13][14] which include gluon saturation effects from multiple scatterings. All of these recent frameworks are equivalent, and logarithms are resummed via the Balitsky/Jalilian-Marian-Iancu-McLerran-Weigert-Leonidov-Kovner (B-JIMWLK) hierarchy of evolution equations [15][16][17][18][19][20][21][22], or in the mean field approximation by the Balitsky-Kovchegov (BK) equation [9,23]. Nowadays, the weak coupling non-perturbative realization of the saturation in QCD is referred to as the Color Glass Condensate (CGC) [21,22]. Throughout this paper, we refer to the CGC as a unified picture (Balitsky formalism/ Mueller's dipole picture/CGC) of small-x QCD.
The fact that the CGC generalizes the BFKL framework was established early on at Leading-Logarithmic (LL) accuracy [24,25] and made more explicit in [26], then at Next-to-Leading-Logarithmic accuracy (NLL) in [27] and more explicitly in [28][29][30]. This equivalence relies on the expansion of the path-ordered Wilson lines in powers of the gluon field for small values of gA, what is known as the dilute limit.
Although it is not a true all-order factorization scheme, as opposed to collinear factorization for several simple processes [31], the CGC framework applies in principle to any low-x or high-density process regardless of the number of observed scales. In contrast, collinear factorization in its most common form is not valid for processes involving not only a hard scale Q, but also a second, smaller scale. In the present context the most interesting case is when that smaller scale is related to the transverse momentum of a parton inside a hadron. The collinear distributions were generalized for such processes, leading to the Transverse Momentum Dependent (TMD) factorization scheme [31][32][33][34][35][36][37][38].
For a process with center-of-mass energy s, a hard scale Q, and a hard yet softer transverse momentum scale |k| Λ QCD , the respective application ranges of CGC and TMD schemes are s ≫ Q |k| and s ∼ Q ≫ |k|. A matching of these schemes in the overlapping regime where |k| /Q and Q/s are both small was proven in [39,40]. Since

JHEP05(2019)156
then, gluon TMDs in the CGC have been at stake in many recent studies (see for example [41,[46][47][48]). Indeed the measurement of TMD parton distributions offers great insight in the 3D structure of hadrons, yet these distributions are not fully universal and thus they require case-by-case studies. Studying them at low-x allows one to use standard CGC tools like the McLerran-Venugopalan (MV) model [12][13][14], Golec-Biernat-Wüsthoff (GBW) parametrization [49] or numerical solutions to the B-JIMWLK hierarchy of equations [41] for the description of these complicated TMD distributions.
Notable attention has been drawn to polarized TMDs and to their role in angular distributions at low-x [42][43][44][45][46][47], and the relation between process independence breaking in TMD factorization and the Wilson lines which are natural built-in features of the CGC [48]. None of the cited studies addressed how twist corrections to TMD factorization are obtained in the CGC, which is the question we want to address in this article.
On the other hand, the CGC framework in the so-called dilute limit also matches BFKL results, which were built for processes with different kinematics, where s ≫ Q ∼ |k| . A new scheme for TMD factorization at low-x, which is referred to as the improved TMD scheme (iTMD), was built in [50,51] as an attempt to interpolate between both |k| ≪ Q and |k| ∼ Q limits. This framework aims at resumming some powers of |k| /Q by taking into account non-zero k in the hard subamplitude. In practice, as we will show in this article, it resums all kinematic twist corrections to the hard subamplitude which couples to the leading-twist TMD operator, leaving genuine twist corrections aside. For an alternative approach for twist studies in the saturation regime, see [52].
The purpose of this paper is to prove an infinite kinematic twist match between CGC and iTMD amplitudes, with a comparison with dilute BFKL amplitudes as well. It is organized as follows. In section 2, we consider the first corrections to the correlation limit in a CGC amplitude and compare them to the first power corrections in the TMD factorization, and show how both expansions are related to one another. Then in section 3, we start with the most generic form for 1 → 2 processes in the CGC and expand it in powers of the dipole size. We extract the pure kinematic twist corrections and resum them to infinite accuracy. This leads to the main result of this article: a completely generic infinite-twist CGC amplitude in eq. (3.10) in an all-body Wandzura-Wilczek approximation (i.e. where all genuine twist corrections are neglected). In section 4, we start again from the generic CGC amplitude and perform a more standard dilute expansion, leading to a generic dilute CGC amplitude in eq. (4.8). Section 5 is devoted to a short review of the iTMD framework and to recalculating the iTMD cross sections in a form that can be compared with the CGC all-kinematic-twists result. In section 6, we apply the generic kinematic twist resummed CGC result for different processes and compare them to the iTMD predictions. We find a perfect match between the kinematic twist resummed cross sections for each process and the corresponding iTMD results. Moreover, we also compare the dilute limit of the generic CGC cross sections with the kinematic twist ressumed cross sections by simply setting all distributions to the same value and find a perfect matching as well. Finally, in section 7 we summarize and discuss our findings for this study.
In the CGC calculations we use the lightcone gauge A + = 0. Transverse components are denoted with a ⊥ subscript in Minkowski space and by bold characters in Euclidean space. Therefore, for two vectors k and x, we write The CGC part of this paper relies on the separation of the gluon fields in the QCD Lagrangian depending on their + momentum between fast fields (k + > e −Y p + ) and slow fields (k + < e −Y p + ). In the eikonal approximation, the slow fields have the shockwave form where B is a function of x ⊥ only. In the semi-classical approximation for the slow fields, treated as external fields for the projectile, interactions with the target are resummed into path-ordered Wilson lines Throughout this paper, we will be using the small x limit of the studied amplitudes, and we will thus use the eikonal approximation as in eq. (1.2). The CGC computations in this article were done using the effective Feynman rules [9,14] described in appendix A. The interplay between the CGC and TMD frameworks is proven in terms of derivatives of Wilson lines [40]: Eq. (1.6) shows how taking derivatives of CGC Wilson line operators allows to distinguish between the small x limit of the physical TMD gluon field F i− and the soft gluons constituting the TMD gauge links.

JHEP05(2019)156 2 Correlation limit and TMD power expansion
In this work we study processes that describe the production of a pair of particles with a large invariant mass from a single particle in an external shockwave field built from the target gluons. We consider the case when both outgoing particles are tagged and their transverse momenta are fully reconstructed. The produced particles carry longitudinal momenta p + 1 and p + 2 , and transverse momenta p 1 and p 2 . The two important combinations of these momenta are the sum of the two transverse momenta k k ≡ p 1 + p 2 (2.1) and the transverse-boost invariant momentum q which is defined as The hard scale Q of the process is given by the invariant mass of the outgoing pair which is directly related to the transverse boost invariant momentum: As discussed in detail in [40], one can get the gluon TMDs through CGC calculations in certain limit which is usually referred to as "back-to-back correlation limit". In this limit, the two transverse scales |k| and |q| are well separated, i.e. |q| ≫ |k|. In the CGC framework, the transverse boost invariant momentum q is the Fourier conjugate to the transverse size of the produced pair (dipole size) r and the total transverse momentum is conjugate to the impact parameter b. Therefore, the back-to-back correlation limit corresponds to the case |r| ≪ |b| in coordinate space allowing a Taylor expansion of the CGC observables in the dipole size r.
We start by clarifying the power expansion employed here and in the rest of this section we consider a simple process in the back-to-back correlation limit to utilize the small dipole size expansion in the CGC framework and compare it with the power expansion in the TMD factorization framework to clarify the relation between the two procedures.

Power expansion at the amplitude level
The TMD framework involves gauge invariant light ray operators [53], for which the distinction between kinematic twists and genuine twists is convenient. For a set of gauge invariant twist p operators 1 O (i) p associated with the hard part H (i) p , the n-th power of k ⊥ in the cross section is given by the sum over p ∈ {0 · · · n} of the p-th power in H

JHEP05(2019)156
For inclusive observables, power corrections are split between amplitudes and complex conjugate amplitudes. However for the sake of this article, which aims at comparing CGC and iTMD results, it is actually sufficient to study power corrections at the amplitude level. Rather than using full, gauge invariant, inclusive operators, it is also enough for the comparison to use "half"-operators at the amplitude level, knowing how they would get combined into gauge invariant inclusive operators at the cross section level.
In the particular cases studied in this article, O (i) p will be a set of p-body gluon light ray half-operators p as a p-body operator, with O (i) 1 being the set of leading 1-body operators, which would combine into the leading twist (2-body in the standard counting) TMDs at the cross section level. Then the n-th power correction is given by the sum of p-th power in the (n − p)-body hard part, convoluted with the (n − p)-body operator. Corrections from the hard parts are kinematic twists, while higher-body operators lead to genuine twist corrections. In particular, fully kinematic twists, that are the main focus of this study, are given by successive k ⊥ -derivatives of the 1-body hard part.

Dipole size expansion for γ → qq in the CGC
It is informative to start by computing the first few corrections to the correlation limit in the CGC. As a simple example, let us consider the amplitude for the photoproduction of a quark-antiquark dijet which is given in appendix B: where the Wilson lines U b+zr are defined in eq. (1.4) with eq. (1.3). Here, φ µ is the tensor part of the amplitude that encodes the Dirac structure for this process and it is defined as In the correlation limit, it is straightforward to expand this amplitude in powers of the small dipole size r and keep the first two terms in the expansion. After performing a simple integration by parts, the result can be written as O(1) terms in eq. (2.8) give the well-known back-to-back result [40] which reads We would like to emphasize that the next-to-back-to-back term, eq. (2.10), has a very interesting form. Noting the fact that a derivative acting on a CGC Wilson line extracts a gluon field, one can immediately conclude that the first term in the brackets is a 2-body half-operator. On the other hand, one can manipulate the last two terms using the fact that so that the next-to-back-to-back term can be written as At this point we can make a diagram-by-diagram correspondence with TMD factorization. Naturally, ∂ j U b U † b terms correspond to the diagram where the TMD gluon hits the quark, while U b ∂ j U † b terms correspond to the diagram where it hits the antiquark. For such diagrams, it is easy to see that the dependance on k and p q (resp. k and pq) is only in the intermediate quark (resp. antiquark) propagator G k + p q (resp. G k − pq ). Thus for those diagrams we have Hence, the next-to-back-to-back contribution can be cast into the following form: where H jk 2 is a 2-body hard subamplitude, and H j 1 is a 1-body hard subamplitude (given by the sum of the two diagrams discussed above).

TMD power corrections to γ → qq
For the photoproduction of a quark-antiquark dijet, the natural starting point for a twist expansion at tree level is the infinite kinematic twist 1-body amplitude, 2 which has the form: where H i 1 (k 1 ) is the small x limit of a (perhaps unattainable, but it does not matter for any of the considerations we are about to make) gauge invariant hard subamplitude with non-zero TMD gluon transverse momentum. To get eq. (2.17) we used the well known technique for constructing gauge links for TMD processes described in [38] to determine to which infinity the gauge links point. 3 The kinematic twist expansion for the TMD regime |k| ≪ Qis done via the Taylor expansion of this hard part. Up to the first correction, rewriting the TMD operator as the derivative of a Wilson line, it reads: For consistency when including the kinematic twist correction to the 1-body amplitude, genuine twist corrections should be included as well (see the discussion in section 2.2). The first genuine twist correction for the same process is given by the 2-body amplitude, which can be written as the following in the small x limit: Taking the leading term in the Taylor expansion of the hard part yields Using the δ-function of the impact parameters b 1 and b 2 which sets these two transverse coordinates to the same value, one can rewrite the gauge link This allows us to rewrite the operator as derivatives of Wilson lines and the leading term in the Taylor expansion of the 2-body amplitude for the photoproduction of a quark-antiquark dijet reads (2.21) 2 We write the amplitude in an operator form, similarly to what is done in the CGC. The true amplitude is given by the action of this operator on target states. 3 Note that the Wilson lines are defined with −ig factors instead of ig in [38], which amounts to switching ∞ and −∞ from the gauge link rules therein, hence the −∞ in eq. (2.17).

JHEP05(2019)156
The comparison between eqs. (2.18), (2.21) and the CGC result given in eq. (2.16) shows a strong similarity between the small-dipole expansion in the CGC and the power expansion in the TMD framework. A more general matching could be conjectured. In this paper, we only focus on kinematic twist corrections and compare the 1-body contributions from the CGC to those obtained in the TMD framework with infinite power accuracy via the iTMD scheme developed in [50]. Comparisons for higher-body terms are left for further studies.
3 Kinematic twist resummation for a generic 1 → 2 process in the CGC In the previous section, we have calculated the next-to-back-to-back corrections for a specific process (γ → qq) in the CGC framework and showed how one can isolate the 1-body and 2-body terms in this contribution. Our main goal in this section is to generalize this procedure to all orders in the small dipole size expansion. We isolate the 1-body contribution from the higher-body contributions, and then resum the 1-body contributions that appear in higher orders in the small dipole size expansion. We would like to apply our results to several different 1 → 2 processes in the CGC framework. Therefore, we start from a generic CGC amplitude for a 1 → 2 process from which one can easily deduce all these different processes that are computed using effective Feynman rules in a shockwave background field [9][10][11][12][13][14] given in appendix A. For this generic process, as before, we consider the case when the outgoing pair of particle has a large invariant mass, and the incoming particle is on the mass shell. For each (p 0 → p 1 p 2 ) process, we use the same longitudinal momentum fractions (z andz) introduced in eq. (2.4), the total transverse momentum k of the produced particles defined in eq. (2.1) and the transverse boost invariant momentum q that is defined in eq. (2.2). The generic CGC amplitude (see figure 1) in this case reads where φ µ is a Dirac structure which does not depend on coordinates, and (R 1 , R 0 , R 2 ) are color representations. This is a well known form in small-x kinematics: the interaction with the target can be factorized out in the eikonal limit, and it contains all information on color flow. The spin structure factorizes in the massless case due to transverse boost invariance: the mere topology of a diagram is sufficient to predict its momentum structure, or equivalently in coordinate space its dipole-size dependence. One can easily check that the amplitudes listed in appendix B have the form of eq. (3.1). The expression for the generic CGC amplitude for a 1 → 2 process expanded to the n-th power of r is obtained by performing a Taylor series expansion of the Wilson line operators in A 0→12 which can be simply written as Figure 1. Generic (0 → 12) process in the CGC. The partonic lines with a gray blob represent the effective partonic lines in the external shockwave field [9,14], as given in appendix A.
The rest of our discussion relies on a symmetry hypothesis based on our experience of CGC and BFKL amplitudes.
In the CGC, diagrams where only on one partonic line is interacting with the external field give ( In BFKL computations, diagrams with each parton interacting with a gluon give the impact factor ϕ(k 1⊥ , k 2⊥ ) + ϕ(k 2⊥ , k 1⊥ ) while diagrams with only one parton interacting twist give counterterms −ϕ(k 1⊥ + k 2⊥ , 0 ⊥ ) and −ϕ(0 ⊥ , k 1⊥ + k 2⊥ ). The latter insure the cancellation of the full impact factor for k 1⊥ = 0 ⊥ and for k 2⊥ = 0 ⊥ and thus gauge invariance in the BFKL sense.
Both of these remarks point to the fact that the least symmetric diagrams only contribute to small x observables as gauge invariance restoring counterterms.
By analogy, keeping in mind that taking one derivative of a Wilson line amounts to extracting one physical gluon from it, we assume that contributions with no derivative on one Wilson line operator must be a gauge-invariance restoring term for the 1-body contributions, i.e. a kinematic twist. Our hypothesis is thus as follows.
We assume that the n-body contribution to a gauge invariant amplitude, for n > 1, does not contain contributions where one parton in the projectile scatters with several physical gluons while the other does not scatter with any. In other words, our statement is that no Operators with the least symmetric derivative structures need to be integrated by parts using or the other way around, depending on which Wilson line has more derivatives acting on it. By employing this procedure, we make sure that the non-symmetric operators are reduced JHEP05(2019)156 to a more symmetric contribution and a contribution with less derivatives acting on the Wilson line operators. One can then proceed recursively in order to isolate all the 1-body contributions from the higher-body terms. However, we should emphasize that a stronger hypothesis is required in order to study genuine twist corrections, which are left for future studies. Nevertheless, as mentioned earlier in this study we focus on the kinematic twists.
In order to clarify our discussion, let us consider the case for n = 4. The generic CGC amplitude for a 1 → 2 process, when expanded to O(r 4 ), after employing the procedure described above, reads As emphasized multiple times earlier, our aim in thus work is to study the terms and perform an all-order dipole size resummation for them. This amounts to the Wandzura-Wilczek approximation for all twists [54]: this approximation means assuming that kinematic twist corrections are enhanced when compared to genuine twist corrections, hence in present case that k ⊥ /Q corrections to 2-body TMD amplitudes dominate over n-body (n > 2) TMD amplitudes. Here after, we denote all the amplitudes and the cross sections obtained from the CGC calculations by adopting the Wandzura-Wilczek approximation with the superscript W W . With our symmetry argument, it is easy to obtain a generic form for the n-th power in the amplitude, by performing (n − 1) integrations by parts on the least symmetric terms. Summing up such contributions for all n leads to It is now straightforward to perform the resummation explicitly which results in the fol- The integral over the dipole size r is factorized from the rest of the expression and can be performed explicitly by considering the following integral for p =zk or p = −zk. The details of the calculation can be found in appendix C and the result reads Plugging this result into eq. (3.6) and reintroducing the transverse momenta of the produced particles (p 1 , p 2 ) leads to the final expression for the generic CGC amplitude for a 1 → 2 process in the Wandzura-Wilczek approximation: Using the generic CGC amplitude given in eq. (3.9), the generic cross section can be calculated in a straightforward manner and the result reads

JHEP05(2019)156
where the factor 1 2C 0 originates from the spin and color averaging over the incoming state and · · · is defined in eq. (1.5). The color Fierz factor C 0 is N c for a quark, N 2 c − 1 for a gluon and 1 for a photon.
Eq. (3.10) is the main result of this paper. It is the generic CGC cross section for a 1 → 2 process that resums all kinematic twists. By introducing the proper color structure and the proper Dirac structure for a specific 1 → 2 process, one can get the kinematic twist resummed CGC cross section for that specific process. In the following sections, we study several of such specific processes and show that the results match exactly the ones obtained through the iTMD calculations.
4 Dilute limit of a generic 1 → 2 process in the CGC For very high values of the center-of-mass energy s or for dense targets, multiple scatterings are expected to occur. In practice, for values of |k| of the order of the target saturation scale Q 2 s ∼ (A/x) 1/3 , it is expected for the target fields A − to scale like 1/g due to a high gluon occupation number, so that gA − must be resummed into the path-ordered Wilson line operators U R b which are the natural building blocks of the CGC or shockwave formalisms. The regime where |k| ≫ Q s , is referred to as the dilute limit. In this limit gA − is expected to be small and therefore one is allowed to expand Wilson line operators in gluon fields (or in Reggeon fields for more involved analysis, as [26,55]) or equivalently to use a dilute formalism like BFKL.
In this section, we consider the dilute limit of the CGC by expanding the Wilson line operators in the generic CGC amplitude for a 1 → 2 process whose expression is given in eq. (3.1). The generic Wilson line operator, when expanded in powers of the strong coupling constant g, in arbitrary representation R reads with T a R being the SU(N c ) generator in the representation R. Then, in the dilute limit, the generic CGC amplitude given in eq. (3.1) can be written as After introducing the incoming target state P and the target remnant states X, and using the translation invariance of the X |(. . .)| P matrix elements, one can easily integrate over the impact parameter which yields to the following form of the matrix element:

JHEP05(2019)156
In eq. (4.3), the integral over the dipole size r can be performed in a straightforward manner by using the well known integral which finally leads to the following form of the dilute amplitude The cross section in the dilute limit can be easily obtained from eq. (4.5), and the result reads Finally, it is customary to introduce the unintegrated parton distribution function (uPDF) G (k) that is defined as with δ ac G ac (k) = G (k) .
Averaging over the spin and color states of the incoming parton or photon, we arrive to the generic form of the cross section in the dilute limit: with C 0 being the factor that one obtains via color averaging, as introduced previously in section 3. We would like to draw attention to the similarity between the dilute limit of the generic cross section given in eq. (4.8) and the kinematic-twist-resummed cross section given in eq. (3.10). We discuss the implications of this similarity in section 7.

JHEP05(2019)156
5 Small-x improved TMD factorization (iTMD) In the following section we briefly summarize the small-x improved TMD factorization constructed in [50] in a self-contained way. Although the framework is more general, here we focus on dijets in pA and γA collisions. This section is organized as follows. We first list and explain the general form of the formulas for dijets in pA collisions. Next, we shall put the iTMD formulation into the context of the TMD factorization theorems to better clarify the terminology and the status of presented formulas. In the end of this section, in subsections 5.2-5.6, we will give the formulas for the cross section for all channels in a form that can be compared with the CGC framework. These formulas have been recalculated in a suitable kinematics, and, in particular the results of subsection 5.5 and 5.6 for processes with photons are to our knowledge new results, not given in the literature before.

Framework
The iTMD factorization formula for pA collisions has the form of a hybrid generalized k T -factorization. That is: (i) the incoming dilute projectile is described by the collinear PDF as it is probed at large x -so called hybrid approach [56], (ii) the target is probed at small x and is described by a set of process-dependent TMD gluon distributions, (iii) the hard factors are constructed from off-shell gauge invariant matrix elements. Thanks to (i), the formula for the cross section can be written as where f a/H is the collinear PDF for parton a = q, g (we can safely neglect antiquarks in this approximation), ⊗ denotes the convolution in the longitudinal fraction x p of the proton momentum carried by parton a, n f is the number of flavors. The remaining objects are cross sections for scattering a parton a off the target to produce the given final states.
They can be generically written as follows: ag * →bc are off-shell gauge invariant hard factors and Φ (i) ag→bc are unpolarized TMD gluon distributions in the target. The sum over i corresponds to two inequivalent color flows that exist for each channel.
The TMD gluon distributions Φ (i) ag→bc are linear combinations [50] (table 1) of the basic distributions with the following operator definitions [38]: withF (ξ) = t a F a (ξ + , ξ − = 0, ξ). The staple-like Wilson lines appearing above are defined as The Wilson loop is made from two staples glued together: ag * →bc for various channels. We show only planar color-ordered diagrams, i.e. the planar diagrams with fixed ordering of external legs, as they are enough to reconstruct the hard factors contributing to the in-equivalent color flows (see section 6 of [50] on how to reconstruct the hard factors from color-ordered amplitudes and [77] for a general review of color decompositions). The off-shell gluon has momentum k. The double line corresponds to the Wilson line propagator in momentum space, which couples to gluons via the igt a P µ vertex. The double line propagator with a momentum p is −i/ (p · P + iǫ). These diagrams have to be multiplied by k 2 /g -for all the details see [76]. We do not display the diagrams for processes with a photon since they do not require the use of a Wilson line, despite the off-shellnes of the gluon.
The off-shell gauge invariant hard factorsH (i) involve incoming off-shell gluons with momentum k = x A P + k ⊥ , k 2 = −k 2 , coupled eikonally to the target via a TMD correlator. In general, such Feynman diagrams are not gauge invariant when calculated using the standard QCD Feynman rules. There are several ways, to deal with this. First, one could use the Lipatov effective action and resulting vertices in the quasi-multi-Regge kinematics [70]. In [71][72][73][74] other methods have been developed, based on the spinor helicity formalism, especially convenient to deal with multiparticle processes and to guarantee fast computer implementation. The method [72] has been recently extended to loop level [75]. The easiest way to understand the diagrammatic content of the hard factors is probably provided by the method [76] which defines the gauge invariant off-shell amplitudes as partonic matrix elements of straight infinite Wilson line operators. In case of the hard factors involving one off-shell gluon needed here the Wilson line has a direction along P − . The diagrams contributing to each channel for pA collisions are given in figure 2.
The form of the generalized factorization (5.2) appears as follows. First the color structure is separated from the kinematic part of the amplitude by means of the color JHEP05(2019)156 decomposition [77]. The amplitudes with the color structure separated contain only planar diagrams with fixed ordering of the external legs. The TMD gluon distributions Φ (i) ag→bc are derived for the color structures (squared) following the general procedure of resummation of collinear gluons constructed in [38]. The color decomposition of amplitudes guarantees that each Φ (i) ag→bc corresponds to a gauge invariant subset of diagrams. For more details and application to multiparticle processes see [78].
The iTMD formula was constructed to agree with the k T -factorization for dijet production [79] in the limit of k 2 ∼ Q 2 ≫ Q 2 s and also with the leading power limit of the CGC expressions [40] for Q 2 ≫ k 2 ∼ Q 2 s . In the present paper we further compare all the power corrections contained in the framework. To this end, we need the small x limit of the TMD gluon distributions compliant with the CGC theory. They are obtained by neglecting the x dependence in the Fourier transforms and trading the hadronic matrix elements to the averages over the color distributions in the nucleus. In addition, lightcone gauge is used, in which for the shockwave approximation the transverse components of the gauge fields do not contribute due to EOM and transverse pieces of the gauge links are suppressed. This allows to neglect the transverse parts of the staple gauge links (5.11). Within the above approximation we have [40,41]: For completeness, let us now put the iTMD formulation into the context of the formal TMD factorization theorems [31]. First, one should understand that it does not involve an all-order factorization theorem like the ones existing for the Drell-Yan process and semi-inclusive DIS. These theorems are proved to leading power in the hard scale to any logarithmic accuracy, while the iTMD framework resums the power corrections, but its validity is limited to leading logarithms of energy. Next, the mentioned TMD factorization JHEP05(2019)156 theorems involve processes with at most two colored partons in the hard process (plus soft/collinear contributions of course) and two TMD correlators (parton distribution or fragmentation function). Because of the simplicity of the color structure, all Wilson lines appearing due to the resummation of collinear gluons can be disentangled and put into the gauge invariant definitions of the TMD objects. For jet production processes in hadronhadron collision, where formally one has at least two TMD correlators and more than two colored partons, it is not possible. Thus, formally, even the generalized factorization breaks down [80]. However, in the iTMD approach, which targets the collisions of a moderate-x projectile and a low-x target, there is only one TMD correlator, thus, at least formally, this problem does not occur. On the formal ground there is no all-order proof of the hybrid approach so far.
Finally, let us comment on the evolution equations for the TMD gluon distributions. The most adequate treatment would be using the renormalization group equation at small and moderate x developed in [81,82]. It however still requires work to derive the complete set of equations, not to mention solving them. An important feature of such procedure would be that some Sudakov logarithms ln k 2 can be consistently resummed. For existing phenomenological applications using iTMD [51,83] the evolution was based on BK or B-JIMWLK and some Sudakov resummation effects were estimated by means of a phenomenological model.
Below, we explicitly give formulas for the cross sections (5.2) in a form that can be directly compared with the CGC expressions.

qg * → qg channel
We get Note, that the above hard factors H (i) qg→qg are not exactly the ones in (5.2). The expressions are however more compact in the above notation.

JHEP05(2019)156
where H (1) gg→qq , H (2) gg→qq are the reduced off-shell hard factors. They read Above, an additional symmetry factor of 1/2 was included to account for identical final states.

From the generic CGC process to the specific cases
In sections 3 and 4, we have computed both the kinematic-twist-resummed cross section and the dilute limit of the CGC cross section for a generic 1 → 2 process respectively. Our aim in this section is to get both of these cross sections for specific processes and compare these results with the ones that are obtained through iTMD framework in section 5.
To be more accurate, we consider the photoproduction of a dijet, as well as all possible channels for two particle production (dijet or photon-jet) in forward pp and pA collisions. Within the CGC framework, hybrid formalism [56] is the state of the art approach for these processes. It has been very successfully used to calculate the next-to-leading order single inclusive particle production [57]- [66], heavy quark production [67], dijet production [41] and recently dijet+photon [47,68] and trijet production [69] in forward pA collisions. In the hybrid formalism, the final state particles are produced in the forward rapidity region so they can be treated in the collinear framework, i.e. the incoming partons are on-shell collinear partons and the partonic cross section calculated in this set up should be convoluted with the collinear parton distribution functions in order to get the hadronic cross sections. On the other hand, the target is assumed to be dense and the center-of-mass energy is large so it can be treated in the CGC framework. At the parton level, the set up that we have used for the calculation of the kinematic-twist-resummed cross section eq. (3.10) and the dilute cross section eq. (4.8) for a generic process is compatible with the hybrid formalism. Thus, we use those results to study the different channels and compare them with the ones obtained from iTMD framework in the rest of this section.

q → qg channel
Let us start our analysis by considering the q → qg channel (see figure 3). In this channel, the incoming quark splits into a quark-gluon pair at order g s which then scatters off the target via eikonal interaction. The CGC amplitude for this channel is given in eq. (B.1). In order to be able to use the kinematic twist resummed generic cross section eq. (3.10), the first thing we need is the tensor part of the amplitude that encodes the Dirac structure of this channel and it is given by whose square for an unpolarized observable can be calculated in a straightforward manner and the result reads One can read off the color structure in this channel from figure 3 and it is given by setting This color structure leads to the following TMD JHEP05(2019)156 By using the identity that relates the adjoint and fundamental representations of a unitary matrix and the Fierz identity one can easily get the following identities The next step is to compute the color trace of the TMD operators that are listed in eq. (6.3). By using the identities given in eq. (6.6), these traces can easily be computed and the result reads Comparing the structure of the trace of the Wilson lines in eq. (6.7) and the definitions of the first two gluon TMDs in the quark channel F qg and F (2) qg given in eqs. (5.13) and (5.14) respectively, one can conclude that these are the two gluon TMDs which appear in this channel. Moreover, for convenience, we can define the following combinations of the gluon TMDs F (1) qg and F (2) qg : which are exactly the same combinations that one gets from the iTMD calculations given in the table 1.

JHEP05(2019)156
Finally, we can plug the square of the tensor part of the amplitude given in eq. (6.2) and the Wilson line structure given in eq. (6.7) together with the definitions and the combinations of the gluon TMDs eq. (6.8) in the generic kinematic twist resummed cross section eq. (3.10) to get the cross section for q → qg channel as which coincides exactly with eq. (5.21) by using eqs. (5.22) and (5.23). Our next order of business is to consider the dilute limit in the q → qg channel. Inserting the proper color representations in the dilute limit of the generic cross section given in eq. (4.8), we get with G ac (k) being the unintegrated parton distribution function defined in eq. (4.7). Using the definition of the tensor part of the amplitude that encodes the Dirac structure in the q → qg channel given in eq. (6.1) and performing some color algebra, one simply gets the dilute limit of the cross section in this channel: From eq. (6.9) and (6.11), we also get a straightforward matching between the improved TMD scheme and the dilute scheme: The substitution Φ q→qg = G in the iTMD scheme in the dilute limit can be simply justified as follows. For |k| ≫ Q s and large, the Fourier transforms in the operator definitions force the transverse separation between the fields to be small. In that limit the gauge links become identical, while the Wilson loops become trivial. This universal behaviour was tested numerically in [51] and [41].

g → qq channel
The next channel we consider is g → qq. In this channel, the incoming gluon splits into a quark-antiquark pair at order g s , then it scatters through the target (see figure 4). The CGC amplitude for this channel is given in eq. (B.2) and the tensor part of it reads whose square can be computed easily for an unpolarized observable: 14) The color structure of this channel can be read off from figure 4 and it is given by setting This color structure leads to the following gluon TMD operators that appears in the generic kinematic twist resummed cross section given in eq. (3.10): One can easily compute the trace over the color indexes of the operators listed in eq. (6.15) and the result reads A comparison between the Wilson line structure in this channel given in eq. (6.16) and the definitions of the first three gluon TMDs in the gluon channel F gg and F gg given in eqs. (5.15), (5.16) and (5.17) suggests that these are the three gluon TMDs that appear in the g → qq channel. We define the following combinations of the TMDs which are the JHEP05(2019)156 same combinations defined in table 1: Finally, the square of the tensor structure, eq. (6.14), the Wilson line structure, eq. (6.16), and the TMD definitions with the combinations given in eq. (6.17) are plugged in the generic kinematic twist resummed cross section given in eq. (3.10). The result can simply be written as which coincides exactly with eq. (5.24) by using eqs. (5.25) and (5.26). The next step is to consider the dilute limit in the g → qq channel. Introducing the proper color structure in the generic dilute cross section in eq. (4.8), we get which after some color algebra leads to Finally, a comparison between the kinematic twist resummed cross section eq. (6.18) and the dilute limit of the cross section given in eq. (6.20), again leads to a straightforward matching between the iTMD scheme and the dilute scheme for g → qq channel: g→qq =G . (6.21)

g → gg channel
The next channel we consider is g → gg. The CGC amplitude for this channel is given in eq. (B.3). The tensor part for this channel can simply be read off from eq. (B.3) and it is given as Its square can be computed in a straightforward manner with the result being The color structure of this channel is demonstrated in figure 5 and it is given by This leads to the following TMD operators once it is inserted to the Wilson line structure of the generic kinematic twist resummed cross section in eq. (3.10): After a standard but cumbersome color algebra, the trace over the color indexes of the above TMD operators can be written in terms of the fundamental Wilson line operators as

JHEP05(2019)156
Tr O gg and F gg that have already appeared in the g → qq channel, one also gets new gluon TMDs F (4) gg , F (5) gg and F (6) gg in the g → gg channel. Again, for convenience, we define the following combinations of the TMDs which match exactly the combinations one get from iTMD calculations given in table 1. After plugging these results into the generic kinematic twist resummed cross section given in eq. (3.10), we get the result for the g → gg channel as
Let us now consider the dilute limit of the cross section in the g → gg channel. Once the proper color representations of this channel are plugged into the dilute limit of the generic cross section given in eq. (4.8), we get which after some color algebra leads to with the symmetry factor of 1/2. As a last comment for this channel, we would like to emphasize that a comparison between eqs. (6.30), (6.32) and the iTMD results lead to the same matching condition between the iTMD scheme and the dilute limit: g→gg =G . (6.33)

γ → qq channel
We have used this channel as an example to study the corrections to the back-to-back correlation limit in subsection 2.2. In this subsection, we generalize that study by using the generic expressions for the kinemtic twist resummed cross section and the dilute limit of the generic CGC cross section. The amplitude is given by eq. (2.6) from which we can read off the tensor part: The square of the tensor part for an unpolarized observable can be calculated easily and the result reads The color structure for this channel is demonstrated in figure 6 and it is given by setting = U † b and T R 0 = 1. This color structure leads to the following TMD JHEP05(2019)156 Figure 6. γ → qq amplitude in an external shockwave background with the appropriate color representations. operators: The trace over the color indices can be performed in a straightforward manner and one can easily conclude that this channel involves only one TMD F gg which is also referred to as the Weizsäcker-Williams TMD defined in eq. (5.17). Using this result and the square of the tensor part given in eq. (6.35), we can write the kinematic twist resummed cross section for this channel as which coincides exactly with eq. (5.30) by using eq. (5.31). Using the proper color representations for this channel and the dilute limit of the generic CGC cross section given in eq. (4.8), we can simply write the dilute limit of the cross section for the γ → qq channel as which, after a simple color algebra and using the result for the square of the tensor part given in eq. (6.35), leads to dσ gA∼0 γ→qq dy 1 dy 2 d 2 p 1 d 2 p 2 = α em α s p + 0 δ p + 1 + p + 2 − p + 0 zz z 2 +z 2 p 2 1 p 2 2 G (k) . Finally, we would like to mention that a comparison between eq. (6.30) and (6.32) suggests a similar matching between the iTMD scheme and the dilute limit of the CGC calculation: gg =G . (6.40)

q → qγ channel
The last channel we consider is the q → qγ one. The CGC amplitude for this channel is given by eq. (B.4) from which we can read off the tensor part as Its square for an unpolarized observable can be written as As it can be seen from figure 7, the color structure of this channel is quite simple. One gets the proper color structure by setting U R 1 b = U b , U R 2 b = 1 and T R 0 = 1. With this simple color structure, only one TMD operator appears in this channel which reads Performing the trace over color indices leads to F qg TMD which has been introduced in eq. (5.13). Plugging these results into eq. (3.10), we get the cross section for q → qγ channel: dσ W W q→qγ dy 1 dy 2 d 2 p 1 d 2 p 2 = α em α s N c p + 0 δ p + 1 + p + 2 − p + 0 zz 2 1 + z 2 p 2 2 q 2 F (1) qg (k) , (6.44) which coincides exactly with eq. (5.32) by using eq. (5.33). Before we continue with the dilute limit for this channel we would like to mention that eq. (6.44) is exact and it resums not only the kinematic twists but all twists for this process, i.e. no higher-body twist correction is expected for the q → qγ channel. This is JHEP05(2019)156 due to the fact that one of the Wilson line operators is trivial for this process and there is no other TMD operator involved.
Inserting the simple color structure of this process into the dilute limit of the generic CGC cross section given in eq. (4.8), we get the dilute limit of the cross section for the q → qγ channel: which leads to dσ gA∼0 q→qγ dy 1 dy 2 d 2 p 1 d 2 p 2 = α em α s N c p + 0 δ p + 1 + p + 2 − p + 0 zz 2 1 + z 2 p 2 2 q 2 G (k) . (6.46) As in the case of the other channels, comparing eq. (6.44) and (6.46), we also get a straightforward matching between the improved TMD scheme and the dilute scheme:

Discussions
Earlier studies have shown that for certain observables the small-x limit of the TMD framework and the so-called "correlation limit" of the CGC framework overlap. In particular, two particle production (such as dijet or photon+jet) in forward pp and pA collisions, gluon TMDs can be recovered from the CGC calculations in the correlation limit. This specific limit corresponds to the case when the total transverse momentum of the produced particles k is much smaller than the hard scale Q. On the other hand, it is also well known that in the dilute limit of the CGC framework, that is in the limit when the total transverse momentum of the produced particles are of the same order as the hard scale, one recovers the BFKL results. Recently, the small-x improved TMD (iTMD) formalism has been developed to interpolate between these two limits.
In this paper we studied two cases. First, by studying the correlation limit of the CGC amplitude for a generic 1 → 2 process, we identified the kinematic twist contributions from higher order terms in the Taylor expansion of the transverse size of the pair of particles produced in that process, resummed those twist corrections in the Wandzura-Wilczek approximation, i.e. neglecting all genuine twist corrections. The kinematic-twist-resummed cross section for a generic 1 → 2 process, eq. (3.10), is then used to compare the results obtained in the iTMD framework for different channels in forward pp and pA collisions. The perfect matching between these frameworks proves that the iTMD formalism is fully obtained from the CGC formalism by taking the Wandzura-Wilczek approximation.
Second, we considered the dilute limit of the CGC amplitude for a generic 1 → 2 process. The BFKL amplitudes obtained by taking the dilute limit of the CGC amplitudes JHEP05(2019)156 are shown to match iTMD results by simply setting the different TMD distributions to the unintegrated parton distribution function that defines the target.
One of the most striking results of this study is the perfect matching between the hard parts of the kinematic twist resummed cross section and the dilute limit of the CGC one. This can be explained in the following way. The kinematic twist resummation procedure that has been developed in this paper isolates and resums the parts of the higher-body contributions which can be rewritten as gauge invariance fixing counterterms to the 1body hard part. The remaining terms are the genuine twist contributions which vanish in the dilute limit since they account for multiple scatterings. In that sense, we resum the terms which do not vanish in the dilute limit. Thus, the difference between a rigorous twist resummation and the dilute expansion does not lie in the hard parts. Instead, it is linked to the way the hard parts couple to the distributions.
While iTMD distinguishes distributions depending on their gauge link structures, therefore extending its validity range in terms of |k| /Q s when compared to BFKL, both formalisms rely on the Wandzura-Wilczek approximation in the CGC.
With the previous observations, two origins of saturation can be expected. First of all, the difference between BFKL and iTMD is related to the distinction between gauge link structures, which account for multiple scattering from low x gluons. As discussed earlier, all distributions are equal at large |k|/Q s and distinct at low |k|/Q s , were saturation is expected. In that sense, the saturation scale Q s is the parameter which controls the importance of multiple scatterings via gauge links. On the other hand, BFKL and iTMD both rely on the Wandzura-Wilczek approximation when compared to the CGC. It will be very instructive to compare predictions from iTMD and full CGC once genuine twists are extracted from the CGC as well [94]. This would probe Q s as the parameter which controls the importance of multiple scattering via genuine twists.
As a natural extension of this study, we plan to perform a similar analysis for more complex observables where not only the unpolarized TMD distributions but their linearly polarized partners appear. The two immediate observables that we are planning to study are the heavy quark production [46] and three-particle production such as dijet+photon production [47].
Last but not least, we would like to mention that recently there have been several studies devoted to understand the subeikonal corrections in the CGC framework [84][85][86][87][88][89][90][91][92][93]. Comparing those to the future moderate-x corrections to the iTMD scheme would be also a natural extension of our study.