1 Introduction

Properties of hadrons are described by their matrix elements of QCD operators. These matrix elements are nonperturbative. They can be calculated only with nonperturbative methods, or extracted from experimental data. A class of such matrix elements are Generalized Parton Distributions(GPDs) introduced in [1,2,3]. In [4, 5] it has been shown that one can obtain the quark or gluon contributions to the proton spin from quark or gluon twist-2 GPDs, respectively. Therefore, studies of GPDs and relevant processes both in theory and experiment will help to solve the so-called proton spin crisis and provide more information about the hadron’s inner structure.

There is a class of exclusive processes which can be used to extract GPDs. They can be extracted from deeply virtual Compton scattering (DVCS) in the forward limit, as suggested in [4, 5]. Besides DVCS, processes with a light hadron or a quarkonium instead of the real photon in the final state of DVCS can be described with GPDs. With the leading power approximation the amplitudes of the processes are given by convolutions of twist-2 GPDs with perturbative coefficient functions. The properties of twist-2 GPDs have been studied extensively (see the reviews in [6,7,8]). From twist-2 GPDs one can only obtain the sum of the spin part and the orbital-angular momentum part of quarks or gluons. Recently, it has been shown in [9,10,11,12] that one can obtain each orbital-angular momentum part from twist-3 GPDs individually. Therefore, it is important to study twist-3 GPDs and how to extract them from experiment.

In general twist-3 GPDs only appear in power-suppressed contributions. In this work we study the power-suppressed contributions of photoproduction of \(J/\psi \) or \(\varUpsilon \). The amplitude at the leading power of \(J/\psi \)- or \(\varUpsilon \) photoproduction has been given in [15]. At the leading power, only a part of gluonic twist-2 GPDs is involved. The produced quarkonium at twist-2 level is only transversely polarized. At the next-to-leading power, the quarkonium is longitudinally polarized. If one can measure the polarization, then twist-3 GPDs can be accessed directly. It is interesting to note that the production rate of longitudinally polarized \(J/\psi \) or \(\varUpsilon \) can be large because the polarization vector is proportional to the energy in contrast to the transverse polarization vectors which are constant. This can enhance the power-suppressed contributions in the high energy limit. This motivates us to study the process at twist-3 level in this work. After we have completed the power-suppressed contributions of photoproduction of \(J/\varPsi \) in terms of twist-3 GPDs, we should take care of an important property of the GPDs to ensure that these calculations are safe. This property has been shown in a series of papers [16,17,18,19,20,21,22,23], which tell that some twist-3 GPDs may be discontinuous. We will discuss this at the end of our calculation.

2 Definitions and properties of GPDs

We consider the process

$$\begin{aligned} \gamma (q) + h(p) \rightarrow J_Q (k) + h(p'), \end{aligned}$$
(1)

where the quarkonium is denoted as \(J_Q\). We will consider the case where \(J_Q\) is a \(^3 S_1\) state of a \(Q{\bar{Q}}\) state with \(Q=c,b\) or a \(^1P_1\) state. The \(^3S_1\) state corresponds to \(J/\psi \) or \(\varUpsilon \), while the \(^1P_1\) state corresponds to \(h_{c,b}\). In Eq. (1) the initial or final hadron h is a proton or a spin-1/2 hadron. The kinematical variables of the process are given by \(s=(p+q)^2\) and \(t= (p'-p)^2\). This process with \(J/\psi \) at leading twist has been studied extensively in, e.g., [13,14,15] and the references therein.

We will use the light-cone coordinate system, in which a vector \(a^\mu \) is expressed as \(a^\mu = (a^+, a^-, \mathbf {a}_\perp ) = ((a^0+a^3)/\sqrt{2}, (a^0-a^3)/\sqrt{2}, a^1, a^2)\) and \(a_\perp ^2 =(a^1)^2+(a^2)^2\). We define two light-cone vectors \(n^\mu = (0,1,0,0)\) and \(l^\mu =(1,0,0,0)\). The transverse metric is given by \(g_\perp ^{\mu \nu } = g^{\mu \nu } - n^\mu l^\nu - n^\nu l^\mu \). We will also need the transverse antisymmetric tensor which is given by \(\epsilon _\perp ^{\mu \nu } = \epsilon ^{\alpha \beta \mu \nu } l_\alpha n_\beta \) with \(\epsilon _\perp ^{12}=1\). We take a frame in which the momenta in Eq. (1) are given by

$$\begin{aligned} \begin{aligned} p^\mu&= \left( p^+, p^-, -\frac{1}{2} \mathbf {\varDelta }_\perp \right) ,\\ p'^\mu&= \left( p'^+, p'^-, \frac{1}{2} \mathbf {\varDelta }_\perp \right) ,\\ q^\mu&= \left( q^+,q^-,\frac{1}{2} \mathbf {\varDelta }_\perp \right) , \end{aligned} \end{aligned}$$
(2)

with \(\varDelta ^\mu = (p' -p)^\mu \). In the forward region the components of \(\varDelta ^\mu \) except \(\varDelta ^+\) are small, they scale like

$$\begin{aligned} \varDelta _\perp ^\mu \sim Q (\lambda ,\lambda ), \quad \varDelta ^- \sim p^-\sim p'^- \sim q^+ \sim Q\lambda ^2 \end{aligned}$$
(3)

with \(\lambda \ll 1\). Q is a generic large scale of the process. It can be the heavy quark mass or \(\sqrt{s}\). In this kinematical region the momentum of \(J_Q\) is

$$\begin{aligned} \begin{aligned} k_\perp ^\mu&=-\frac{1}{2}\varDelta _\perp ^\mu ,\\ \quad k^+&= -\varDelta ^+ + {{\mathcal {O}}}(Q \lambda ^2),\\ \quad k^-&= q^- + {{\mathcal {O}}}(Q\lambda ^2). \end{aligned} \end{aligned}$$
(4)

The produced quarkonium moves almost in the direction of the initial photon.

For the process QCD factorization is expected in which the nonperturbative effect related to the quarkonium is included in matrix elements of non-relativistic QCD (NRQCD) and that of the initial hadron is included in various GPDs. The produced quarkonium mainly consists of a heavy quark Q and a heavy antiquark \({\bar{Q}}\). The relative velocity v between Q and \({\bar{Q}}\) inside the quarkonium is small. One can make a small velocity expansion and use NRQCD factorization in [24] for the quarkonium. In general, a quarkonium is a bound state of a heavy quark \(Q{\bar{Q}}\) pair combined with light partons, i.e., a quarkonium state is a superposition of a \(Q{\bar{Q}}\) state and states of \(Q{\bar{Q}}\) combined with gluons and light-quark pairs. We take the leading order of v. At this order quarkonium, considered in this work, can be taken only as a bound state of \(Q{\bar{Q}}\). The other components with light partons, like the \(Q{\bar{Q}} g\) component, contribute only at the higher orders in the small velocity expansion according to the power counting in [24].

For our calculation one can use a wave function \(\psi \) to project the relevant state of the produced \(Q{\bar{Q}}\) pair. The projection is standard. For \(J/\psi \) or \(\varUpsilon \) the derived amplitude will be proportional to the wave function at the origin. The amplitude of \(h_c\) is proportional to the derivative of the corresponding wave function at the origin. In this work we use the notation of wave functions. It should be noted that the wave function at the origin or the derivative are in fact the corresponding NRQCD matrix elements defined in [24]. The relations can be found in [24]. It will be important to express our results with these NRQCD matrix elements to perform NRQCD factorization beyond tree level. The Coulomb singularities appearing at higher orders of \(\alpha _s\) are factorized into NRQCD matrix elements as shown in [24]. In terms of wave functions, it is also noted that the effects of the detailed shapes of the wave functions are included in those NRQCD matrix elements at higher orders of the small velocity expansion. In this work we take the leading order of v and \(\alpha _s\).

Before analyzing the process, we introduce twist-3 parton GPDs and discuss their properties. We define the momentum \({\bar{P}}\) and the variable \(\xi \) by

$$\begin{aligned} {\bar{P}} = \frac{p+p'}{2}, \quad \xi =\frac{p^+ -p'^+}{ p^+ + p'^+} = \frac{-\varDelta \times n}{2 {\bar{P}}\times n}. \end{aligned}$$
(5)

The twist-3 gluonic GPDs are given by the following matrix elements:

$$\begin{aligned}&M_D^{\mu \nu \rho } (x_1,x_2, \varDelta ) \nonumber \\&=\int \frac{\mathrm{d}\lambda _1 \mathrm{d}\lambda _2 }{(2\pi )^2} \mathrm{e}^{i{\bar{P}}^+ \lambda _1 (x_1+x_2)/2 + i{\bar{P}}^+ \lambda _2 (x_2-x_1)}\nonumber \\&\quad \left\langle p',\lambda ' \Bigg \vert G^{a, +\mu }\left( -\frac{\lambda _1n}{2}\right) {{\tilde{D}}}_{\perp ab} ^\nu (\lambda _2 n ) {\hat{G}}^{b,+\rho } \left( \frac{\lambda _1 n}{2} \right) \Bigg \vert p,\lambda \right\rangle ,\nonumber \\&M_F^{\mu \nu \rho } (x_1,x_2,\varDelta ) \nonumber \\&= \mathrm{if}^{abc} \frac{g_s}{{\bar{P}}^+} \int \frac{\mathrm{d}\lambda _1 \mathrm{d}\lambda _2 }{(2\pi )^2} \mathrm{e}^{i{\bar{P}}^+ \lambda _1 (x_1+x_2)/2 + i{\bar{P}}^+ \lambda _2 (x_2-x_1)}\nonumber \\&\quad \times \left\langle p' \Bigg \vert G^{a,+\mu } \left( -\frac{\lambda _1n}{2} \right) G^{b, +\nu }(\lambda _2 n ) G^{c,+\rho }\left( \frac{\lambda _1n}{2}\right) \Bigg \vert p \right\rangle ,\nonumber \\&M_\partial ^{\mu \nu \rho } (x,\varDelta )=\frac{1 }{{\bar{P}}^+} \int \frac{\mathrm{d}\lambda }{2\pi } \mathrm{e}^{ i{\bar{P}}^+ \lambda x }\nonumber \\&\quad \times \left\langle p',\lambda ' \Bigg \vert G^{a,+\mu } \left( -\frac{\lambda n}{2} \right) {\tilde{\partial }}^{ \nu } G^{a,+\rho } \left( \frac{\lambda n}{2}\right) \Bigg \vert p,\lambda \right\rangle , \end{aligned}$$
(6)

where the derivative \({\tilde{\partial }}^\mu \) is defined by

$$\begin{aligned} f^\dagger (x) {\tilde{\partial }}^\mu g(x) = \frac{1}{2}\biggr [ f^\dagger (x) \partial ^\mu g(x) - \biggr (\partial ^\mu f(x) \biggr )^\dagger g(x) \biggr ] \end{aligned}$$
(7)

for arbitrary functions f(x) and g(x). The covariant derivative \(D^\mu \) is given by \(D^{\mu } (x) = \partial ^\mu + i g_s G^\mu (x)\). \({\tilde{D}}^\mu \) is defined with \(D^\mu \) in a similar way to \({\tilde{\partial }}^\mu \). All Lorentz indices \(\mu ,\nu \) and \(\rho \) are transverse. There is another twist-3 matrix element of gluonic GPDs, which is obtained by replacing \(f^{abc}\) in \(M_F^{\mu \nu \rho } (x_1,x_2,\varDelta )\) by \(d^{abc}\). This matrix element is irrelevant for the process in Eq. (1). But it will be relevant for the process when the produced quarkonium has the quantum number \(C=+\).

The definitions in Eq. (6) are given in the light-cone gauge \(n\times G=0\). In other gauges one has to insert gauge links between field operators to make the definitions gauge invariant. These gauge links are built with

$$\begin{aligned} {{\mathcal {L}}}(x) = P \exp \biggr \{ -i g_s \int _0^\infty \mathrm{d}\lambda G^+(\lambda n +x) \biggr \}. \end{aligned}$$
(8)

It is noted that in gauges other than the light-cone one, the definition of \(M_\partial ^{\mu \nu \rho }\) is given by replacing \({\tilde{\partial }}^\nu \) with \({{\mathcal {L}}}^\dagger (-\lambda n/2) {\tilde{\partial }}^\nu {{\mathcal {L}}} (\lambda n/2)\) in Eq. (6). Because the derivative also acts on gauge links, the symmetry of time reversal does not give any constraint for \(M_\partial ^{\mu \nu \rho }\). The defined matrix elements are not independent. Using the identity

$$\begin{aligned} \begin{aligned}&{{\mathcal {L}}}(y n) D_\perp ^\mu (y n) {{\mathcal {L}}}^\dagger (yn) F\\&\quad = \partial _\perp ^\mu F - i g_s \int ^{\infty }_0 \mathrm{d}\lambda \biggr ( {{\mathcal {L}}} G^{+\mu } {{\mathcal {L}}}^\dagger \biggr ) ((\lambda +y) n) F, \end{aligned} \end{aligned}$$
(9)

for an arbitrary function F one can derive the relation

$$\begin{aligned} \begin{aligned}&M_D^{\mu \nu \rho } (x_1,x_2,\varDelta ) \\&\quad = \frac{M_F^{\mu \nu \rho }(x_1,x_2,\varDelta )}{x_1-x_2 +i\varepsilon } + \delta (x_2-x_1) M_\partial ^{\mu \nu \rho } (x_1,\varDelta ), \end{aligned} \end{aligned}$$
(10)

where the factor \(i\varepsilon \) in the denominator appears because the integration over \(\lambda \) in the identity in Eq. (9) is from 0 to \(\infty \). Therefore, only two of the three matrix elements in Eq. (6) are independent.

In general, each matrix element with three transverse indices can be parameterized with scalar functions or GPDs. By taking parity symmetry into account the number of such functions of each matrix element cannot be larger than 16 for a spin-1/2 hadron. However, in this work, we will only encounter those matrix elements with two indices contracted with \(g_\perp ^{\mu \nu }\). There are three possible contractions. We consider first the contraction with the first and third Lorentz index. We define the contracted matrix elements and give their parameterization by

$$\begin{aligned} \begin{aligned} G^\mu _{F,D} (x_1,x_2,\varDelta )&= g_{\perp \nu \rho } M_{F,D} ^{\rho \mu \nu } (x_1,x_2,\varDelta ), \\ G^\mu _{\partial } (x,\varDelta )&= g_{\perp \nu \rho } M_{\partial } ^{\rho \mu \nu } (x,\varDelta ),\\ G^\mu _{\partial } (x,\varDelta )&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} H_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} H_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 H_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} H_4 \biggr ] u(p),\\ G^\mu _{F} (x_1,x_2, \varDelta )&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} G_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} G_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 G_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} G_4\biggr ] u(p),\\ G^\mu _{D} (x_1,x_2, \varDelta )&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} F_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} F_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 F_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} F_4\biggr ] u(p), \end{aligned} \end{aligned}$$
(11)

where the scalar functions \(G_i\), \(F_i\) and \(H_i\) depend on momentum fractions, \(\xi \) and t. There is a freedom by choosing different parameterizations. The parametrization in the above has the advantage that all spinor products \({\bar{u}}(p') \varGamma u(p)\) with \(\varGamma = \gamma ^+\), \(\sigma ^{+\mu }\) and \(\gamma ^+\gamma _5\) are at the order of \({{\mathcal {O}}}(\lambda ^0)\). There are constraints for these functions or GPDs from hermiticity, Bose symmetry and time-reversal symmetry. All \(G_i\) and \(F_i\) are real and have the properties

$$\begin{aligned} \begin{aligned}&G_3 (x_1,x_2,\xi ,t) = - G_3 (x_2,x_1,-\xi ,t), \\&G_{1,2,4}(x_1,x_2,\xi ,t) = G_{1,2,4}(x_2,x_1,-\xi ,t),\\&F_3 (x_1,x_2,\xi ,t)= F_3 (x_2,x_1,-\xi ,t), \\&F_{1,2,4} (x_1,x_2,\xi ,t) = -F_{1,2,4}(x_2,x_1,-\xi ,t),\\&G_{i} (x_1,x_2,\xi ,t) = -G_{i} (-x_2,-x_1, \xi ,t),\\&F_i (x_1,x_2,\xi ,t)= -F_i (-x_2, -x_1,\xi ,t), \ i=1,2,3,4. \end{aligned} \end{aligned}$$
(12)

It is noted that time-reversal symmetry does not give any constraint for \(G_\partial ^{\mu }\). The functions \(H_i\) are complex in general. The properties for \(H_i\) are

$$\begin{aligned} \begin{aligned} H_{1,2,4}^* (x,\xi ,t)&= -H_{1,2,4}(x,-\xi ,t),\\ H_3^*(x,\xi ,t)&=H_3 (x,-\xi ,t),\\ H_{1,2,3,4} (x,\xi ,t)&= - H_{1,2,3,4} (-x,\xi ,t). \end{aligned} \end{aligned}$$
(13)

By using the identity

$$\begin{aligned} \frac{1}{x_1 -x_2 +i\varepsilon } = P\frac{1}{x_1-x_2} - i\pi \delta (x_1-x_2) \end{aligned}$$
(14)

with P standing for the principal-value prescription and by noting that \(F_{1,2,3,4}\) and \(G_{1,2,3,4}\) are real and \(H_{1,2,3,4}\) are complex, we obtain the relations among these GPDs from the relation in Eq. (10):

$$\begin{aligned} \begin{aligned} F_{i} (x_1,x_2,\xi ,t)&= P\frac{1}{x_1-x_2} G_i (x_1,x_2,\xi ,t) \\&\quad \,\, + \delta (x_2-x_1) \mathrm{Re} H_i (x_1,\xi ,t),\\ G_i (x,x, \xi , t)&=\frac{1}{\pi } \mathrm{Im} H_i (x,\xi ,t). \end{aligned} \end{aligned}$$
(15)

In the limit of \(\varDelta ^\mu \rightarrow 0\), the GPD \(G_2\), \(F_2\) and \(H_2\) become gluonic ETQS matrix elements relevant for single transverse-spin asymmetries in inclusive processes [25,26,27,28,29]. An interesting observation of the limit is made in [9,10,11,12]: one of the moments of \(F_3\) is the orbital-angular momentum contribution of gluons and that of \(H_3\) can be interpreted as the canonical orbital-angular momentum contribution in the light-cone gauge as discussed in [30]. It is noticed from Eqs. (12, 15) that \(H_3\) becomes real in the limit by assuming that \(G_3\) is a continuous function of \(\xi \).

The matrix elements contracted in Eq. (6) with \(g_{\perp \mu \nu }\) or \(g_{\perp \nu \rho }\) are not independent. We have from Bose symmetry

$$\begin{aligned} \begin{aligned} g_{\perp \nu \rho } M^{\mu \nu \rho }_{D} (x_1,x_2, \varDelta )&= -g_{\perp \nu \rho } M^{\nu \rho \mu }_{D} (-x_2, -x_1, \varDelta ),\\ g_{\perp \nu \rho } M^{\mu \nu \rho }_{\partial } (x, \varDelta )&= -g_{\perp \nu \rho } M^{\nu \rho \mu }_{\partial } (-x, \varDelta ). \end{aligned} \end{aligned}$$
(16)

Since \(M_F^{\mu \nu \rho }\) is defined with three field-strength-tensor operators, all three contractions give essentially the same GPDs,

$$\begin{aligned} \begin{aligned}&g_{\perp \nu \rho } M^{\mu \nu \rho }_{F} (x_1,x_2, \varDelta ) \\ =&-g_{\perp \nu \rho } M^{\nu \rho \mu }_{F} (-x_2, -x_1, \varDelta ) \\ =&- g_{\perp \nu \rho } M^{\nu \mu \rho }_{F} (x_1,x_1-x_2+ \xi ,\varDelta ). \end{aligned} \end{aligned}$$
(17)

We denote the matrix elements and parameterize them as

$$\begin{aligned} \begin{aligned} {\tilde{G}}_D^{\mu } (x_1,x_2,\varDelta )&= g_{\perp \rho \nu } M_D^{\mu \rho \nu }(x_1,x_2,\varDelta ),\\ {\tilde{G}}_\partial ^\mu (x,\varDelta )&= g_{\rho \nu } M_\partial ^{\nu \rho \mu } (x,\varDelta ),\\ {\tilde{G}}_\partial ^\mu (x,\varDelta )&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} {\tilde{H}}_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} {\tilde{H}}_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 {\tilde{H}}_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} {\tilde{H}}_4 \biggr ] u(p),\\ {\tilde{G}}_D^{\mu } (x_1,x_2,\varDelta )&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} {\tilde{F}}_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} {\tilde{F}}_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 {\tilde{F}}_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} {\tilde{F}}_4\biggr ] u(p). \end{aligned} \end{aligned}$$
(18)

From the symmetries, the functions \({\tilde{F}}_i\) are real and the functions \({\tilde{H}}_i\) are complex. They have the properties

$$\begin{aligned} \begin{aligned} {\tilde{F}}_3 (x_1,x_2, \xi ,t)&= - {\tilde{F}}_3 (-x_1,- x_2, -\xi ,t),\\ {\tilde{F}}_{1,2,4} (x_1,x_2, \xi ,t)&= {\tilde{F}}_{1,2,4} (-x_1,- x_2, -\xi ,t),\\ {\tilde{H}}_3^* (x, \xi ,t)&= - {\tilde{H}}_3 (-x, -\xi ,t),\\ {\tilde{H}}_{1,2,4}^* (x, \xi ,t)&= {\tilde{H}}_{1,2,4}(-x, -\xi ,t). \end{aligned} \end{aligned}$$
(19)

Again, these functions are not independent. They satisfy the relations

$$\begin{aligned} \begin{aligned}&{\tilde{F}}_{i} (x_1,x_2,\xi ,t) =- P\frac{1}{x_1\!-\!x_2} G_i (x_1,x_1\!-\!x_2\!+\!\xi ,\xi ,t)\\&\quad + \delta (x_2\!-\!x_1) \mathrm{Re} {\tilde{H}}_i (x,\xi ,t),\\&G_i (x_1,x_1-x_2+\xi ,\xi ,t) = - \frac{1}{\pi } \mathrm{Im} {\tilde{H}}_i (x,\xi ,t), \end{aligned} \end{aligned}$$
(20)

for \(i=1,2,3,4\). These relations can be derived from Eq. (10) similarly to those in Eq. (15).

Twist-3 matrix elements with quarks also appear in the twist-3 amplitudes of the considered processes at tree level. There are three matrix elements:

$$\begin{aligned} \begin{aligned}&N_F^{\nu } (x_1,x_2,\varDelta ) \\&= \int \frac{\mathrm{d}\lambda _1}{2\pi } \frac{\mathrm{d}\lambda _2}{2\pi } e^{i{\bar{P}}^+ \lambda _1 (x_1+x_2)/2 + i{\bar{P}}^+ \lambda _2 (x_2-x_1)}\\&\quad \times \Bigg \langle p' \Bigg \vert {\bar{\psi }} (-\lambda _1 n/2 ) \gamma ^+ g_s G^{+\nu }(\lambda _2 n ) \psi (\lambda _1n/2) \Bigg \vert p \Bigg \rangle ,\\&N_D^{\nu } (x_1,x_2,\varDelta ) \\&= {\bar{P}}^+ \int \frac{\mathrm{d}\lambda _1}{2\pi } \frac{\mathrm{d}\lambda _2}{2\pi } \mathrm{e}^{i{\bar{P}}^+ \lambda _1 (x_1+x_2)/2 + i{\bar{P}}^+ \lambda _2 (x_2-x_1)}\\&\quad \times \Bigg \langle p' \Bigg \vert {\bar{\psi }} (-\lambda _1 n /2) \gamma ^+ {\tilde{D}}_\perp ^\nu (\lambda _2 n ) \psi (\lambda _1n/2) \Bigg \vert p \bigg \rangle ,\\&N_\partial ^{\nu } (x, \varDelta ) \\&= \int \frac{d\lambda }{2\pi } \mathrm{e}^{ i{\bar{P}}^+ \lambda x} \Bigg \langle p' \Bigg \vert {\bar{\psi }} (-\lambda n /2) \gamma ^+ {\tilde{\partial }}_\perp ^\nu \psi (\lambda n/2) \Bigg \vert p \Bigg \rangle . \end{aligned} \end{aligned}$$
(21)

Similarly, one can derive the relation between these three matrix elements by using the identity in Eq. (9):

$$\begin{aligned} \begin{aligned} N_D^{\nu } (x_1,x_2,\varDelta )&= \frac{1}{x_1-x_2 +i\varepsilon } N_F^{\nu } (x_1,x_2,\varDelta )\\&\quad \,\, + \delta (x_2-x_1) N_\partial ^{\nu } (x_1,\varDelta ). \end{aligned} \end{aligned}$$
(22)

A similar relation has also been derived in [11, 12], where \(i\varepsilon \) in the denominator is missing. In our work, only the sum of all quark flavors is relevant. We define the following quark–gluon GPDs:

$$\begin{aligned} \begin{aligned} \sum _q N_F^{\mu } (x_1,x_2)&= {\bar{u}} (p') \biggr [ i \gamma ^+ \frac{\varDelta _\perp ^\mu }{{\bar{P}}^+} N_1 + \frac{\sigma ^{+\mu } m}{{\bar{P}}^+} N_2\\&\quad \,\, + \frac{\varDelta _{\perp \rho }}{{\bar{P}}^+} \epsilon ^{+-\mu \rho } \gamma ^+ \gamma _5 N_3\\&\quad \,\, + \frac{\varDelta _\perp ^\mu \sigma ^{+\rho }\varDelta _{\perp \rho }}{m {\bar{P}}^+} N_4\biggr ] u(p). \end{aligned} \end{aligned}$$
(23)

The sum is over the light flavors of quarks. In the limit of \(\varDelta ^\mu \rightarrow 0\), one of the moments of \(N_3\) is the orbital-angular momentum contribution of quarks as observed in [9,10,11,12].

For the completeness of the gluon GPDs, we also include the definitions of gluonic twist-2 GPDs. These GPDs have been discussed in detail in [7, 8]. They will appear in our amplitudes. We take the notation of [7] for twist-2 gluon GPDs:

$$\begin{aligned} \begin{aligned} {{\mathcal {F}}}_g&= \frac{1}{{\bar{P}}^+} \int \frac{\mathrm{d}\lambda }{2\pi } e^{ i x {\bar{P}}^+ \lambda }\\&\quad \,\, \times \Bigg \langle p' \Bigg \vert g_{\perp \mu \nu } G^{+\mu } \left( -\frac{1}{2} \lambda n\right) G^{+\nu } \left( \frac{1}{2} \lambda n \right) \Bigg \vert p \Bigg \rangle \\&= -\frac{1}{2 {\bar{P}}^+} {\bar{u}}(p') \biggr [ \gamma ^+ H_g (x, \xi , t) \\&\quad \,\, + \frac{i\sigma ^{+\alpha }}{2 m} \varDelta _\alpha E_g (x,\xi ,t) \biggr ] u(p),\\ \tilde{{\mathcal {F}}}_g&= \frac{i}{{\bar{P}}^+} \int \frac{\mathrm{d}\lambda }{2\pi } \mathrm{e}^{ i x {\bar{P}}^+ \lambda }\\&\quad \,\, \times \Bigg \langle p' \Bigg \vert \epsilon _{\perp \mu \nu } G^{+\mu } \left( -\frac{1}{2} \lambda n\right) G^{+\nu } \left( \frac{1}{2} \lambda n \right) \Bigg \vert p \Bigg \rangle \\&= -\frac{1}{2 {\bar{P}}^+} {\bar{u}}(p') \biggr [ \gamma ^+ \gamma _5 {\tilde{H}}_g (x, \xi , t)\\&\quad \,\, + \frac{\gamma _5 \varDelta ^+ }{2 m} {\tilde{E}}_g (x,\xi ,t) \biggr ] u(p),\\ \end{aligned} \end{aligned}$$
(24)
$$\begin{aligned} \begin{aligned} {{\mathcal {F}}}^{\mu \nu }_{Tg}&= \frac{1}{2 {\bar{P}}^+} \int \frac{\mathrm{d}\lambda }{2\pi } e^{ i x {\bar{P}}^+ \lambda }\\&\quad \,\, \times \Bigg \langle p' \Bigg \vert \mathbf{S} \biggr ( G^{+\mu } \left( -\frac{1}{2} \lambda n\right) G^{+\nu } \left( \frac{1}{2} \lambda n \right) \biggr ) \Bigg \vert p \Bigg \rangle \\&= -\frac{1}{2{\bar{P}}^+} \mathbf{S} \biggr \{ \frac{{\bar{P}}^+ \varDelta ^\nu - \varDelta ^+ {{\bar{P}}^\nu }}{2 m {\bar{P}}^+} \\&\quad \,\, \times {\bar{u}}(p')\biggr [i\sigma ^{+\mu } H_{gT} (x,\xi ,t) \\&\quad \,\, +\frac{ {\bar{P}}^+ \varDelta ^\mu -\varDelta ^+ {\bar{P}}^\mu }{m^2} {\tilde{H}}_{gT} (x,\xi ,t)\\&\quad \,\, +\frac{\gamma ^+ \varDelta ^\mu -\varDelta ^+ \gamma ^\mu }{2 m} E_{gT} (x,\xi ,t) \\&\quad \,\, + \frac{\gamma ^+{\bar{P}}^\mu -{\bar{P}}^+ \gamma ^\mu }{m}{\tilde{E}}_{gT} (x,\xi ,t) \biggr ] u(p) \biggr \}, \end{aligned} \end{aligned}$$
(25)

where \(\mu \) and \(\nu \) are transverse. The notation \(\mathbf{S}(\cdots )\) implies that the tensors in \((\cdots )\) are symmetric and traceless. There are in total eight twist-2 gluon GPDs. Their properties can be found in [7, 8].

Fig. 1
figure 1

a One of the six diagrams for two-gluon exchanges. b One diagrams for three-gluon exchanges. Other diagrams are obtained through permutation

3 Calculations for \(J/\varPsi \) Production

Now we turn to our derivation of the twist-2- and twist-3 amplitudes for the process in Eq. (1) at tree level in the case that the quarkonium is a \(^3 S_1\) \(Q{\bar{Q}}\) state. We will perform the calculation mainly in the light-cone gauge \(n\times G =G^+= 0\), but also partly in Feynman gauge for fixing some ambiguities. At tree level, the quarkonium is produced through exchanges of gluons between the heavy quark pair and the initial hadron. The two-gluon- or three-gluon exchanges are illustrated in Fig. 1a or b, respectively. After projecting out the \(^3S_1\) state from the produced \(Q{\bar{Q}}\) pair, the contribution from Fig. 1a can be written as

$$\begin{aligned} \begin{aligned}&i {{\mathcal {T}}} \biggr \vert _{1a} = \frac{1}{2} \int \mathrm{d}^4 k_1 \mathrm{d}^4 k_2 {{\mathcal {M}}}^{\mu \nu } (k_1,k_2,\varDelta ) \\&\quad \,\, \times \int \frac{\mathrm{d}^4 \xi _1}{(2\pi )^4}\frac{\mathrm{d}^4 \xi _2}{(2\pi )^4} e^{i\xi _1 \times k_1+ i\xi _2 k_2} \langle p' \vert G_\mu (\xi _1) G_\nu (\xi _2) \vert p \rangle , \end{aligned} \end{aligned}$$
(26)

where \({{\mathcal {M}}}^{\mu \nu } (k_1,k_2,\varDelta )\) is essentially the amplitude of \(\gamma (q) + g(k_1) + g(k_2) \rightarrow J_Q (k)\) after taking the trace of colors. It is given by six diagrams. To make a power-expansion related to the light hadron, one notes that in the forward limit the gauge fields in the matrix element of Eq. (26) and the momenta carried by these fields scale like

$$\begin{aligned} G^\mu \sim Q (1,\lambda ^2,\lambda ,\lambda ), \quad k_{1,2}^\mu \sim Q (1, \lambda ^2,\lambda ,\lambda ). \end{aligned}$$
(27)

In the limit, the momentum transfer \(\varDelta ^\mu \) also scales like \(\varDelta ^\mu \sim Q (1, \lambda ^2,\lambda ,\lambda )\). The twist-2 or leading power contribution of the amplitude is obtained by taking the leading order of the expansion in \(\lambda \). For this purpose we need to expand \({{\mathcal {M}}}^{\mu \nu } (k_1,k_2,\varDelta )\) and the matrix element in \(\lambda \). In the light-cone gauge, the leading contribution of the matrix element is given when the two-gluon fields carry transverse indices, i.e., \(\mu ,\nu =\perp \). \({{\mathcal {M}}}^{\mu \nu } (k_1,k_2,\varDelta )\) at the leading order for \(\mu ,\nu =\perp \) is given by

$$\begin{aligned} \begin{aligned} {{\mathcal {M}}}^{\mu \nu } (k_1,k_2,\varDelta )&= i g_s^2 A_J\biggr [ \frac{16 m_Q^2}{k^+ q^-} \frac{k_1^+ k_2^+}{ (k_2^+ - i\varepsilon ) (k_1^+ - i\varepsilon )}\\&\quad \,\, \times g_\perp ^{\mu \nu } \epsilon _\perp (q) \times \epsilon ^*(k) \biggr ] ( 1 +{{\mathcal {O}}}(\lambda ^2) )\\ A_J&= Q_Q e \psi ^* (0) \frac{(2m_Q)^{-3/2}}{\sqrt{N_c}}, \end{aligned} \end{aligned}$$
(28)

where \(\psi (0)\) is the non-relativistic wave function at the origin. \(\epsilon (q)\) and \(\epsilon (k)\) are the polarization vectors of the photon and \(J_Q\), respectively. The factor \(i\varepsilon \) in the denominator comes from the quark propagators in Fig. 1a. In the gauge \(n\times G=0\), the twist-2 contribution comes only from two-gluon exchanges as in Fig. 1a. It is straightforward to obtain the twist-2 amplitude denoted as \({{\mathcal {T}}}_T\):

$$\begin{aligned} \begin{aligned} {{\mathcal {T}}}_T&= 8 g_s^2 A_J\epsilon _\perp (q) \times \epsilon ^*(k)\\&\quad \,\, \times \int \mathrm{d} x \frac{1}{( x-\xi + i\varepsilon )( x+\xi -i\varepsilon ) }{{\mathcal {F}}}_g(x,\xi ,t). \end{aligned} \end{aligned}$$
(29)

This result is in agreement with that given in [15]. We note here that at twist-2 the produced \(J/\psi \) or \(\varUpsilon \) is transversely polarized. Using this amplitude one can extract from experimental data only the GPD \(H_g\) and \(E_g\) contained in \({{\mathcal {F}}}_g\).

Now we consider the amplitude with longitudinally polarized \(J/\psi \) or \(\varUpsilon \). This amplitude is power-suppressed. In contrast to the twist-2 amplitude \({{\mathcal {T}}}_T\), there are several sources of contributions. There are contributions which involve twist-2 gluon GPDs and are explicitly power-suppressed. They are called dynamical twist-3 contributions. There are also contributions which involve twist-3 GPDs. One part of dynamical twist-3 contributions can easily be identified from Eq. (29). In the used frame given by Eq. (2), the polarization vector \(\epsilon _L\) of the longitudinally polarized \(J/\psi \) has a small transverse part at the order of \(\lambda \). To specify the polarization of \(J/\psi \), one has to define the spin quantization axis properly. We take the so-called s-channel helicity frame by introducing the 4-vector \(X^\mu = q+p\). The polarization vector \(\epsilon _L^\mu \) of this frame is given by

$$\begin{aligned} \epsilon _L^\mu (k) = \frac{k\times X}{\sqrt{(k\times X)^2 - M^2 s}} \biggr ( \frac{k^\mu }{M} -\frac{M}{k\cdot X}X^\mu \biggr ). \end{aligned}$$
(30)

The vector \(\epsilon _L\) given here is covariant with \(k^2 =M^2\). The physical meaning of the vector \(X^\mu \) is that in the rest frame of \(J_Q\) the spin quantization direction is the direction of the 3-vector \(-\mathbf{X}\). Keeping the leading order of \(\lambda \), we have

$$\begin{aligned} \begin{aligned} \epsilon _L^\mu&= (\epsilon _L^+,\epsilon ^-_L,0,0) +\epsilon _\perp ^\mu + {{\mathcal {O}}}(\lambda ^2),&\epsilon _L^-&=\frac{k^-}{2 m_Q},\\ \epsilon _{L\perp }^\mu&= \frac{\rho _L}{2 m_Q} k_\perp ^\mu ,&\rho _L&=\frac{s+ 4m_Q^2}{s-4 m_Q^2} . \end{aligned}\nonumber \\ \end{aligned}$$
(31)

The component \(\epsilon _L^+\) is not needed to express our result. Given the \(\epsilon _{L\perp }^\mu \), we obtain the part of dynamical twist-3 contribution from the transverse part of \(\epsilon _L^\mu \) which can be read off from Eq. (29):

$$\begin{aligned} \begin{aligned} {{\mathcal {T}}} \biggr \vert _{a,30}&= - g_s^2 A_J\frac{2}{ m_Q } \epsilon _\perp (q) \cdot \varDelta _\perp \rho _L\\&\quad \,\, \times \int d x \frac{1}{( x-\xi + i\varepsilon )( x+\xi -i\varepsilon ) }{{\mathcal {F}}}_g(x,\xi ,t), \end{aligned} \end{aligned}$$
(32)

which is power-suppressed, as indicated by the factor \(\varDelta _\perp \).

In the gauge \(n\cdot G=0\) the remaining twist-3 contributions come from two-gluon and three-gluon exchanges as the diagrams in Fig.1. We first study the contributions from Fig. 1a. It is nontrivial to find the twist-3 contributions from Fig. 1a because Fig. 1a also has the leading or twist-2 contributions. With the power counting given in Eq. (27), one part of the twist-3 contributions is given when one of the two-gluon fields carries the −-index. In this case one should take the leading order of the \(\lambda \)-expansion for \({{\mathcal {M}}}^{\mu +} \). Another part comes from the next-to-leading order of the \(\lambda \)-expansion for \({{\mathcal {M}}}^{\mu \nu }\) with \(\mu =\nu =\perp \). We denote this contribution of \({{\mathcal {M}}}^{\mu +}\) by \({{\mathcal {M}}}_1^{\mu \nu }\). After the expansion of \({{\mathcal {M}}}^{\mu \nu }\) we have

$$\begin{aligned} \begin{aligned} {{\mathcal {M}}}^{\mu +} (k_1,k_2,\varDelta )&= {{\mathcal {M}}}^{+\mu }(k_1,k_2,\varDelta ) \\&= i g_s^2 A_J \epsilon ^\mu \biggr [ \frac{- 16 m^2_Q \epsilon _L^- k_1^+ k_2^+ }{ q^- k^- (k_2^+ - i\varepsilon ) (k_1^+ - i\varepsilon )} \biggr ]\\&\quad \,\, ( 1+{{\mathcal {O}}}(\lambda ) ) ,\\ {{\mathcal {M}}}^{\mu \nu }_1(k_1,k_2,\varDelta )&= i g_s^2 A_J \biggr [ \frac{16 m^2_Q \epsilon _L^- \epsilon _\rho }{ k^+ q^- k^- (k_2^+ - i\varepsilon ) (k_1^+ - i\varepsilon )}\\&\quad \,\, \biggr ( g_\perp ^{\mu \nu } k_{1\perp }^\rho (k_2^+)^2 + g_\perp ^{\mu \nu } k_{2\perp }^\rho (k_1^+)^2\\&\quad \,\, - k_{1\perp }^\nu g_\perp ^{\mu \rho } k_2^+ (k_1^+ + k_2^+) \\&\quad \,\, - k_{2\perp }^\mu g_\perp ^{\nu \rho } k_1^+ (k_1^+ + k_2^+) \biggr ) \biggr ], \end{aligned}\nonumber \\ \end{aligned}$$
(33)

with \(\mu ,\nu =\perp \). From \({{\mathcal {M}}}^{\mu +}\) we have the twist-3 contribution:

$$\begin{aligned} \begin{aligned} i {{\mathcal {T}}} \biggr \vert _{a,31}&= i g_s^2 A_J \int \mathrm{d}k_1^+ \mathrm{d} k_2^+ \biggr [ \frac{16 m^2_Q \epsilon _L^- \epsilon _\mu }{q^- k^- (k_2^+ - i\varepsilon ) (k_1^+ - i\varepsilon )} \biggr ] \\&\quad \,\, \times \int \frac{ \mathrm{d}\xi _1^- \mathrm{d}\xi _2^-}{(2\pi )^2}e^{i\xi _1^- k_1^+ + i\xi _2^- k_2^+}\\&\quad \,\, \times \Bigg \langle p' \Bigg \vert G^{+\mu } (\xi _1^-n) G^{+-} (\xi _2^-n ) \Bigg \vert p \Bigg \rangle . \end{aligned} \end{aligned}$$
(34)

It is noted that the field \(G^-\) is not dynamically independent. From the QCD equation of motion in the gauge \(G^+=0\) we have the equation for \(G^{+-}\):

$$\begin{aligned} \begin{aligned} \partial ^+ G^{a,-+}(x)&= - \biggr ( D_{\perp \mu } (x) G^{\mu +} (x) \biggr )^a + g_s J^{a,+}(x), \\ J^{a,\mu } (x)&= \sum _q {\bar{q}} (x) T^a \gamma ^\mu q(x), \end{aligned} \end{aligned}$$
(35)

and the solution is

$$\begin{aligned} \begin{aligned} G^{a,+-} (x)&= i \int \frac{\mathrm{d}\lambda \mathrm{d} k^+}{2\pi } \frac{e^{-i\lambda k^+} }{k^+ +i\varepsilon } \biggr [ \biggr ( g_s J^{a,+}(x+\lambda n)\\&\quad \,\, + D_{\perp \mu } G^{+\mu }\biggr )^a (x +\lambda n)\biggr ]. \end{aligned} \end{aligned}$$
(36)

Using this result with a little algebra we derive

$$\begin{aligned} \begin{aligned} i {{\mathcal {T}}}\biggr \vert _{a,31}&= g_s^2 A_J\frac{16 m^2_Q \epsilon _L^- \epsilon _\rho }{( k^-)^2} \int \mathrm{d}k_1^+ \mathrm{d} k_2^+ \frac{ 1 }{ (k_2^+ \!-\! i\varepsilon )^2 (k_1^+ \!-\! i\varepsilon )} \\&\quad \,\, \int \frac{ \mathrm{d}\xi _1^- \mathrm{d}\xi _2^-}{(2\pi )^2}e^{i\xi _1^- k_1^+ + i\xi _2^- k_2^+}\\&\quad \,\, \times \biggr \{ \Bigg \langle p' \Bigg \vert g_sG^{+\rho } (\xi _1^-n) J^{+}(\xi _2^- n ) \\&\quad \,\, \!+\!\frac{i}{2} \varDelta _{ \perp \nu } G^{\!+\!\rho } (\xi _1^-n) G^{\!+\!\nu } (\xi _2^- n) \Bigg \vert p \Bigg \rangle \\&\quad \,\, + g_{\mu \nu } \int \mathrm{d} k_3^+ \int \frac{ \mathrm{d}\xi _3^-}{2\pi } \mathrm{e}^{ i k_3^+(\xi _3^--\xi _2^-) } \\&\quad \,\, \times \Bigg \langle p' \Bigg \vert G^{+\rho } (\xi _1^-n) {\tilde{D}}_{\perp }^\mu (\xi _3^- n) G^{+\nu }(\xi _2^- n) \Bigg \vert p \Bigg \rangle \biggr \}. \end{aligned} \end{aligned}$$
(37)

It is noted that the matrix element with \(G^{+-}\) in Eq. (34) contains not only twist-3 matrix element but also twist-2 matrix element.

To the contribution from \({{\mathcal {M}}}^{\mu \nu }_1\) in Eq. (33), we have to add the factors

$$\begin{aligned} \frac{k_1^+}{k_1^+-i\varepsilon } , \quad \frac{k_2^+}{k_2^+ - i\varepsilon } , \end{aligned}$$
(38)

in order to express the result with matrix elements of field-strength-tensor operators in the light-cone gauge \(n\cdot G=0\). These factors can be different from 1 under integrations over \(k_{1,2}^+\). We will discuss these added factors later. The contribution reads

$$\begin{aligned} \begin{aligned} i {{\mathcal {T}}} \biggr \vert _{a,32}&= g_s^2 A_J\frac{16 m^2_Q \epsilon _L^- }{k^+( k^-)^2} \int \mathrm{d}k_1^+ \mathrm{d} k_2^+ \frac{ \epsilon _\rho }{ (k_2^+ - i\varepsilon )^2 (k_1^+ - i\varepsilon )} \\&\quad \,\, \times \int \frac{ \mathrm{d}\xi _1^- \mathrm{d}\xi _2^-}{(2\pi )^2}e^{i\xi _1^- k_1^+ + i\xi _2^- k_2^+} g_{\perp \mu \nu }\\&\quad \,\, \times \Bigg \langle p' \vert \biggr [ k_1^+ G^{+\mu }(\xi _1^- n) {\tilde{\partial }}_{\perp }^\rho G^{+\nu }(\xi _2^-n)\\&\quad \,\, + \frac{i}{2} k_1^+ \varDelta _\perp ^\rho G^{+\mu }(\xi _1^- n)G^{+\nu }(\xi _2^-n)\\&\quad \,\, - k^+ G^{+\mu }(\xi _1^- n) {\tilde{\partial }}_\perp ^\nu G^{+\rho } (\xi _2^- n) \\&\quad \,\, -\frac{i}{2} k^+ \varDelta _{\perp \nu } G^{+\mu }(\xi _1^- n) G^{+\rho } (\xi _2^- n)\biggr ] \vert p \Bigg \rangle . \end{aligned} \end{aligned}$$
(39)

This part contains not only twist-3 matrix elements but also those at twist-2 matrix elements.

The contribution from Fig. 1b is relatively easy to obtain, because its leading contribution is at twist-3 in the light-cone gauge. We obtain

$$\begin{aligned} \begin{aligned} i {{\mathcal {T}}} \biggr \vert _{b,3}&= g_s^3 A_J \frac{16 m^2_Q \epsilon _L^{*-} }{(k^-)^2} \int \mathrm{d} k_1^+ \mathrm{d} k_2^+ \mathrm{d} k_3^+ \frac{ \!-\!i^3 }{(k_2^+ - i\varepsilon )(k_3^+- i\varepsilon )}\\&\quad \,\, \times \frac{ f^{a_1a_2a_3} g_{\perp \mu _1\mu _2} \epsilon _{\mu _3}}{(k_2^+ \!+ \!k_3^+ \!-\!i\varepsilon )(k_1^+ +k_3^+ -i\varepsilon ) }\\&\quad \,\, \times \int \frac{\mathrm{d} \xi _1^-}{(2\pi )}\frac{\mathrm{d}\xi _2^-}{(2\pi )} \frac{\mathrm{d}\xi _3^-}{(2\pi )} e^{i\xi _1^- k_1^+ \!+\! i\xi _2^- k_2^+ \!+\! i\xi _3^- k_3^+ }\\&\quad \,\, \Bigg \langle p' \Bigg \vert G^{a_1,+\mu _1} (\xi _1^-n) G^{a_2,+\mu _2}(\xi _2^-n) G^{a_3,+\mu _3} (\xi _3^-n) \Bigg \vert p \Bigg \rangle . \end{aligned}\nonumber \\ \end{aligned}$$
(40)

Compared to the definitions in Eq. (6), this contribution can be expressed with the twist-3 matrix element \(M_F^{\mu \nu \rho }\). It is noted that this contribution can also be expressed with \(M_D^{\mu \nu \rho }\) by adding the contribution of the first term in the second line of Eq. (39). Since the three twist-3 matrix elements in Eq. (6) are not independent, we have the freedom to choose an independent set of these matrix elements. We will express our final results with \(M_{D,\partial }^{\mu \nu \rho }\).

In deriving our results in the above, we have worked with the light-cone gauge \(G^+ =0\). In this gauge the gauge-field operator in the matrix elements can be converted to the field-strength-tensor operator combined with its momentum:

$$\begin{aligned} \begin{aligned}&\int {rm d}^4 \xi _i \mathrm{e}^{i \xi _i \cdot k_i } k_{i}^+ G^{\mu } (\xi _i) \\&\quad \rightarrow i \int {rm d}^4 \xi _i \mathrm{e}^{i \xi _i\cdot k_i } G^{+ \mu } (\xi _i), \quad i=1,2. \end{aligned} \end{aligned}$$
(41)

In our calculations the needed factors \(k_{1,2}^+\) are from \({{\mathcal {M}}}^{\mu \nu }\) and from the upper part of Fig. 1b, except for the contribution given in Eq. (39), where we have added the factors in Eq. (38). This introduces an ambiguity for the sign in front of \(i\varepsilon \). It can be checked by doing the calculation in Feynman gauge. We have calculated the twist-3 contributions of Fig. 1a in this gauge, and we found that the gauge fields in these contributions always appear in the combined form for \(\mu \ne +\):

$$\begin{aligned} \int \mathrm{d}^4 \xi _i e^{i \xi _i \cdot k_i } \biggr [ G^{\mu } (\xi _i) - G^+(\xi _i) \frac{ k_{\perp i}^\mu }{ k_i^+ -i\varepsilon } \biggr ], \quad i=1,2. \end{aligned}$$
(42)

This combination gives the form of field-strength-tensor operators at the leading order of \(g_s\). This indicates that the sign of \(i\varepsilon \) in Eq. (38) is correct. We have also calculated the contribution of Fig. 1b with one gluon field carrying the \(+\)-index in Feynman gauge. This contribution can be divided into two parts. One part gives the above combination; therefore, it is included in field-strength-tensor operators in Eq. (40) for Fig. 1b. If we take the contribution from Fig. 1a given in Eq. (39) as gauge invariant, which implies that there is the contribution from three-gluon exchanges with one gluon field carrying the \(+\)-index and the other two carrying a transverse index. This contribution comes partly from the non-linear term of the field-strength-tensor operators and partly from gauge links between operators. The remaining part from Fig. 1b with one gluon field carrying the \(+\)-index just gives this contribution. This gives an important check for gauge invariance of our results.

Our result for the amplitude at twist-3, denoted as \({{\mathcal {T}}}_L\), is the sum of the amplitudes given in Eqs. (32, 34, 39, 40). It is

$$\begin{aligned} \begin{aligned} i{{\mathcal {T}}}_L&= 2 g_s^2A_J\frac{\epsilon _\rho }{ m_Q } \biggr \{ \int \mathrm{d} x \frac{1}{ (x+\xi -i\varepsilon )^2 (\xi -x-i\varepsilon )} \\&\quad \,\, \biggr [ i \varDelta _\perp ^\rho ( \xi -x - \rho _L (x +\xi ) ) {{\mathcal {F}}} _g(x,\xi ,t)\\&\quad \,\, + 2 \xi \epsilon _\perp ^{\mu \rho } \varDelta _\mu \tilde{{\mathcal {F}}}_g(x,\xi ,t) \biggr ]\\&\quad \,\, + 4\xi \int \mathrm{d} x_1 \mathrm{d} x_2 \biggr [ \frac{ \delta (x_1-x_2) {\tilde{G}}_\partial ^\rho (- x_1,\varDelta ) }{ ( x_1+\xi - i\varepsilon )^2 (\xi -x_1 - i\varepsilon )}\\&\quad \,\, + \frac{ {\tilde{G}}_D^\rho (x_1,x_2,\varDelta ) }{ ( x_2+\xi - i\varepsilon )^2 (\xi -x_2 - i\varepsilon )}\\&\quad \,\, + \frac{G_D^\rho (x_1,x_2,\varDelta ) }{ (\xi +x_1 - i\varepsilon ) (\xi -x_1 - i\varepsilon )(\xi +x_2 -i\varepsilon ) }\\&\quad \,\, + \frac{ \sum _q N_F^{\rho }(x_1,x_2,\varDelta ) }{ (2\xi - x_2 + x_1- i\varepsilon )^2 (x_2-x_1 - i\varepsilon )} \biggr ) \biggr ] \biggr \}. \end{aligned} \end{aligned}$$
(43)

This result is complicated when we express it with various GPDs. Totally, 20 GPDs or scalar functions are involved with the parameterization of twist-2 and twist-3 matrix elements given before. This makes the extraction of GPDs from experiment complicated. Unless one can use some nonperturbative methods to calculated these GPDs, it seem impossible to make reliable predictions for experiment.

Before giving a discussion about our results, we notice that our amplitudes in Eqs. (29, 43) are \(U_{em}(1)\)-gauge invariant only at the leading order of \(\lambda \). Since in our results only the transverse part of the polarization vector of the initial photon is involved, the amplitudes are not exactly zero by replacing the polarization vector with the photon momentum \(q^\mu \). After replacement the amplitudes are in fact suppressed by one power of \(\lambda \) because \(q_\perp ^\mu \) is at order of \(\lambda \). Hence, the \(U_{em}(1)\)-gauge invariance of our result holds only at the leading order of \(\lambda \).

One may expect that the amplitude can be simplified in the high energy limit, i.e., \(s \gg m_Q^2\). In the process \(\xi \) is fixed as \(\xi =2m_Q^2/(s-2 m_Q^2) (1+ {{\mathcal {O}}}(\lambda )\). Formally, one sets \(\xi =0\) to significantly simplify the twist-3 amplitude in the limit \(s \gg m_Q^2\). But this will result in a divergent contribution. This can be seen from the contribution from the GPD \(H_g(x,\xi ,t)\) contained in \({{\mathcal {F}}}_g\). In the limit \(\varDelta ^\mu \rightarrow 0\) \(H_g\) is related to the gluon parton distribution g(x) as \( H_g (x,0,0)= x g(x)\). For \(x\rightarrow 0\) g(x) behaves as \(x^{-\alpha }\) with \(\alpha >0\). The pole contribution from \(H_g\) in the first line of Eq. (43) is expected for \(\xi \rightarrow 0\) as \(\xi ^{-\alpha }\), as discussed in [31]. The behavior of GPDs in the region \(\vert x\vert \sim \xi \) is crucial for the limit. It is clear that the integration regions with \(\vert x\vert \sim \xi \) and \(\vert x_{1,2}\vert \sim \xi \) are the most important ones. If one assumes that all GPDs have similar behaviors in these regions, one may only neglect the last two terms by power counting of \(\xi \). Thus, the behaviors of the GPDs may have a significant effect on our calculation. One behavior of the GPDs has been studied in a series of papers [16,17,18,19,20,21,22,23], which show that some twist-3 GDPs may be discontinuous in some region. If this is true for the twist-3 GPDs in Eq. (43), we will not be allowed to do some of the integrations there. Thus, the power counting method cannot be applied to each twist-3 term of Eq. (43) individually. However, one has also shown [16] that some particular linear combinations of the twist-3 GPDs are free of discontinuity, and the amplitude may only depend on these linear combinations, although these GPDs may be discontinuous individually. The verification of a similar conclusion as regards the twist-3 GPDs here is another long work which is out of scope of this paper. Before this verification, one should always be careful when making use of the results here directly. Additionally, if a similar conclusion should be drawn here, it may be helpful to explain the poles of the hard factors which are convoluted to the twist-3 GPDs in Eq. (43). In general, it is not clear if the amplitude will be simplified in the limit. But the relative order of importance between twist-2- and twist-3 amplitude can be estimated, if we assume that the contributions from twist-3 GPDs can be transformed into terms that are free of discontinuity and, therefore, are finite.

By simply comparing the contribution from \(H_g\) in twist-2- or twist-3 contributions, one can already conclude that the twist-3 amplitude is suppressed only by a factor \(\varDelta _\perp /m_Q\). For \(J/\psi \) \(m_Q\) is \(m_c\); around 1 GeV. Therefore, the suppression is not strong. The weak suppression has a significant impact on extracting twist-2 gluon GPDs by using the leading amplitude given in Eq. (29), in which \(J/\psi \) is only transversely polarized. If the polarization of \(J/\psi \) is not measured or summed in experiment, then there will be a substantial contribution of longitudinally polarized \(J/\psi \) in the measured differential cross-sections. This can result in the extracted twist-2 GPDs by using the leading amplitude being not accurate and containing significant effects of higher-twists. This is an important observation of our work.

From the production of transversely polarized \(J/\psi \) or \(\varUpsilon \) one can only extract the gluon GPDs \(E_g\) and \(H_g\). It is possible to obtain information of other twist-2 GPDs through production of quarkonium with different quantum numbers. At leading power or twist-2 level, the exchanged two gluons in Fig. 1a can only be in a state with the quantum number \(C=+\). Therefore, \(h_c\) can be produced at twist-2. Other quarkonia, like \(\eta _c\) and \(\chi _{cJ}\), with \(J=0,1,2\), can only be produced through three-gluon exchange given in Fig. 1b, where the production is described with twist-3 GPDs obtained from \(M_F^{\mu \nu \rho }\) by replacing \(i f^{abc}\) in Eq. (6) by \(d^{abc}\).

4 Calculations for \(h_c\) Production

We have derived the twist-2- and twist-3 amplitude of \(h_c\). At twist-2 \(h_c\) is transversely polarized. The amplitude reads

$$\begin{aligned} {{\mathcal {T}}}_T&= 16 g_s^2 A_P \epsilon ^*_\nu (k) \epsilon ^\rho (q) \epsilon _{\perp \mu \rho } \nonumber \\&\quad \,\, \times \int \mathrm{d} x \frac{ \xi }{ (\xi -x-i\varepsilon )^2 (\xi +x -i\varepsilon ) }\nonumber \\&\quad \,\, \times \biggr [ g_\perp ^{\mu \nu } {{\mathcal {F}}}_g - i\epsilon _\perp ^{\mu \nu } \tilde{{\mathcal {F}}}_g + 2 {{\mathcal {F}}}^{\mu \nu }_{Tg} \biggr ] ,\nonumber \\ A_P&= e_Q \sqrt{\frac{3}{4\pi N_c}} R'^* (0) (2 m_Q)^{-5/2}, \end{aligned}$$
(44)

where \(R' (0)\) is the derivative of the radial wave function at the origin. It is interesting to note that all twist-2 gluon GPDs are involved. But the production of \(h_c\) can be difficult to measure. At twist-3 \(h_c\) is longitudinally polarized. The amplitude is

$$\begin{aligned} {{\mathcal {T}}}_L&= \frac{ 4 g_s^2 A_P}{m_Q} \int \mathrm{d} x \biggr \{ {\tilde{\epsilon }} \times \varDelta _\perp {{\mathcal {F}}}_g \biggr [ \frac{-2}{(\xi -x-i\varepsilon )(\xi +x -i\varepsilon )} \nonumber \\&\quad \,\, +\xi \biggr (\frac{ 5 x - 3\xi }{(\xi -x-i\varepsilon )(\xi +x-i\varepsilon )^3 }\nonumber \\&\quad \,\, +\frac{\rho _L}{(\xi -x-i\varepsilon )^2 (\xi +x-i\varepsilon )} \biggr ) \biggr ]\nonumber \\&\quad \,\, + i \xi \epsilon \times \varDelta _\perp \tilde{{\mathcal {F}}} _g \biggr [\frac{2x -6 \xi }{(\xi -x-i\varepsilon )(\xi +x-i\varepsilon )^3 }\nonumber \\&\quad \,\, -\frac{\rho _L}{(\xi -x-i\varepsilon )^2 (\xi +x-i\varepsilon )} \biggr ]\nonumber \\&\quad \,\, - \xi F_T^{\mu \nu } \biggr [\frac{(10\xi -6 x ) {\tilde{\epsilon }}_\mu \varDelta _{\perp \nu }}{(\xi -x-i\varepsilon )(\xi +x-i\varepsilon )^3 } \nonumber \\&\quad \,\, + \frac{\rho _L \ \epsilon _\mu {\tilde{\varDelta }}_{\perp \nu } }{(\xi -x-i\varepsilon )^2 (\xi +x-i\varepsilon )} \biggr ] \biggr \}\nonumber \\&\quad \,\, +\frac{8 g_s^2 A_P \xi }{m_Q } \int \mathrm{d} x_1 \mathrm{d} x_2 \biggr \{-\frac{4 {\tilde{\epsilon }}_\mu {\tilde{G}}_D^\mu (x_1,x_2)}{(\xi +x_2-i\varepsilon )^3}\nonumber \\&\quad \,\, +2 {\tilde{\epsilon }}_\mu G^\mu _\partial (x_1)\delta (x_1-x_2) \frac{\xi -2 x_1 }{(\xi \!-\!x_1\!-\!i\varepsilon )^2 (\xi \!+\! x_2 \!-\!i\varepsilon )^2}\nonumber \\&\quad \,\, \!+ \!\frac{ (x_1\!+\!x_2) {\tilde{\epsilon }}_\mu G_D^\mu (x_1,x_2)}{(\xi \!-\!x_2-i\varepsilon )(\xi \!+\!x_1-i\varepsilon ) (\xi \!-\!x_1\!-\!i\varepsilon )(\xi +x_2 \!-\!i\varepsilon )}\nonumber \\&\quad \,\, \!+\!4 {\tilde{\epsilon }}_\mu {\tilde{G}}_\partial ^\mu (x_1) \delta (x_1-x_2) \frac{ x_1}{(\xi -x_1-i\varepsilon )^2 (\xi +x_1-i\varepsilon )^2} \nonumber \\&\quad \,\, -4 {\tilde{\epsilon }}_\mu \frac{\sum _q N_F^\mu (x_1,x_2) }{(2\xi +x_1-x_2-i\varepsilon )^3} \biggr \}, \end{aligned}$$
(45)

with \({\tilde{\epsilon }}^\mu =\epsilon _\perp ^{\mu \nu } \epsilon _\nu \) and \({\tilde{\varDelta }}^\mu =\epsilon _\perp ^{\mu \nu } \varDelta _\nu \). Again, the amplitude here is suppressed only by the inverse of \(m_Q\) in comparison with the twist-2 amplitude in Eq. (44). But this amplitude is in fact divergent.

Although GPDs cannot be calculated perturbatively, their dependence on the renormalization scale \(\mu \) can be calculated with perturbative theory if \(\mu \) is large enough. The evolutions of twist-2 GPDs have been studied in [36]. The asymptotic behaviors have been derived. In the limit of \(\mu \rightarrow \infty \) the x-dependence of the GPD \(E_g\) and \(H_g\) contained in \({{\mathcal {F}}}_g\) can be found in [8, 36]:

$$\begin{aligned} \begin{aligned} E_g (x,\xi )&\sim \frac{4 C_F}{4 C_F + n_f}\frac{1}{\xi ^5} (\xi -x)^2 (\xi +x)^2 \theta (\xi -\vert x\vert ),\\ H_g (x,\xi )&\sim \frac{4 C_F}{4 C_F + n_f}\frac{1}{\xi ^5} (\xi -x)^2 (\xi +x)^2 \theta (\xi -\vert x\vert ). \end{aligned} \end{aligned}$$
(46)

In the limit these two GPDs are zero for \(\vert x\vert > \xi \). Given such a dependence, the integrals like

$$\begin{aligned} \int \mathrm{d}x \frac{1}{(\xi + x -i\varepsilon )^3} \biggr \{ E_g(x,\xi ),H_g(x,\xi )\biggr \} \end{aligned}$$
(47)

are divergent in the limit \( \mu \rightarrow \infty \). The last term in the first line in Eq. (45) contains such integrals but at finite \(\mu \). One can calculate these integrals at \(\mu \rightarrow \infty \) and obtain the results at finite \(\mu \) from the evolution of the two GPDs. The evolutions cannot eliminate such divergences. Therefore, the results in Eq. (45) are in fact divergent. The divergences come from the momentum region where one of the two exchanged gluons in Fig. 1a is soft. Such divergences are end-point ones, which are well known in collinear factorization of exclusive processes with light-cone amplitudes of hadrons, e.g., in twist-3 contributions of the \(\pi \) form factor and exclusive B-meson decays. For processes factorized with GPDs, the existence of such divergences involving GPDs has been first observed in \(\rho \)-meson production in [31]. These divergences make QCD factorization impossible [32,33,34,35]. To the best of our knowledge, there has so far appeared no rigorous way to deal with the singularities.

It should be emphasized that the existence of the end-point singularity in our twist-3 amplitude of \(h_c\) only means that we cannot evaluate it correctly in the framework of collinear factorization. But the contribution of the amplitude is power-suppressed. The amplitude of \(h_{c,b}\) at leading power or twist-2 has no such a singularity. If the power correction can be neglected, a physical prediction can still be made from Eq. (44) for \(h_{c,b}\) production. It is noted that in the twist-3 amplitude of \(J/\psi \) or \(\varUpsilon \) in Eq. (43) there is no evidence of existing end-point singularities. One can expect that the factorization holds in this case.

5 Summary

To summarize, we have studied at twist-3 level the forward photoproduction of a quarkonium, where the quarkonium can be \(J/\psi \) and \(h_c\). We have classified the relevant twist-3 gluon GPDs and studied their properties. The produced quarkonium at twist-2 is transversely polarized. At twist-3 it is longitudinally polarized and the amplitude is power-suppressed. Our result indicates that the twist-3 amplitude of \(h_c\) contains end-point singularities which spoil the factorization. We find that the twist-3 amplitude is only suppressed by the inverse of the heavy quark mass. This has an important implication for extracting twist-2 gluon GPDs, if the polarization of the produced quarkonium is not observed. In this case, the contribution of the longitudinal polarization can be significant and should be taken into account. Experimentally, the studied processes can be already studied at JLab, and in the future at EIC or After@LHC as proposed in [37] or [38], respectively.