1 Introduction

Prompt production of \(J/\psi \) meson pairs at high energies is a very intriguing subject of studies [1,2,3,4]. It provides a unique laboratory to investigate the quarkonia production mechanisms predicted by the non-relativistic QCD (NRQCD) factorization [5,6,7], which is a rigorous framework for the description of heavy quarkonia production or decays. The NRQCD implies a separation of perturbatively calculated short distance cross sections for the production of a heavy quark pair in an intermediate Fock state \({}^{2S+1}L_J^{(a)}\) with spin S, orbital angular momentum L, total angular momentum J and color representation a from its subsequent non-perturbative transition into a physical quarkonium via soft gluon radiation. The latter is described by the long-distance non-perturbative matrix elements (LDMEs), which obey certain hierarchy in powers of the relative heavy quark velocity v [5,6,7]. At the next-to-leading order (NLO), NRQCD can explain the LHC data on the prompt \(J/\psi \), \(\psi ^\prime \) and \(\chi _c\) transverse momentum distributions (see, for example, [8,9,10,11,12,13,14,15]). However, it has a long-standing challenge in the \(J/\psi \) and \(\psi ^\prime \) polarization and provides inadequate description [16,17,18] of the \(\eta _c\) production dataFootnote 1 (see also discussions [23,24,25]). Studying the \(J/\psi \) meson pair production can shed light on the puzzling aspects above since \(c{{\bar{c}}}\) bound state formation takes place here twice.

In the last few years, significant progress has been made in the NRQCD evaluations of prompt double \(J/\psi \) production. The complete leading-order (LO) calculations, including both the color singlet (CS) and color octet (CO) terms, were done [26]. The relativistic corrections to the \(J/\psi \) pair production are carried out [27]. The NLO contributions to the CS mechanism are known [28] and partial tree-level NLO\(^*\) contributions to the both CS and CO terms were calculated [29]. The latter were found to be essential for both low and large transverse momenta, as compared to the LO results.Footnote 2 However, being comparable with the LHCb measurements [3, 4], all these evaluations have sizeble discrepancies with the latest CMS [1] and ATLAS [2] data, especially at large transverse momentum \(p_T(J/\psi , J/\psi )\), invariant mass \(m(J/\psi , J/\psi )\) and rapidity separation \(\Delta y(J/\psi , J/\psi )\) of the \(J/\psi \) pairs. For example, the CMS data are underestimated by the NRQCD predictions with a factor of about 10 [26, 28]. The difference between the theoretical calculations and more recent ATLAS data at large \(p_T(J/\psi , J/\psi )\) or \(m(J/\psi , J/\psi )\) is typically smaller but still essential. It was argued [26] that new processes or mechanisms are needed to better describe the LHC data.

At large invariant mass \(m(J/\psi , J/\psi )\) the processes with large angular separation between the \(J/\psi \) mesons could play a role. One of such processes are the gluon or quark fragmentation shown in Fig. 1. The gluon fragmentation into \(^3S_{1}^{(8)}\) intermediate state scales as \(1/p_T^4\) and govern the single \(J/\psi \) production at high transverse momenta (see, for example, [8,9,10,11,12] and references therein). In the case of \(J/\psi \) pair production, such terms were found to be negligible since they are suppressed by powers of QCD coupling \(\alpha _s\) [26]. However, at large \(p_T(J/\psi , J/\psi )\) or \(m(J/\psi , J/\psi )\) one can expect a sizeble combinatorial contribution to the fragmentation yield from the multiple gluon radiation originating during the QCD evolution of the initial gluon cascade. The latter determines the perturbative QCD corrections to the production cross sections at high energies, which can be effectively taken into account using the Ciafaloni–Catani–Fiorani–Marchesini (CCFM) evolution equation [30,31,32,33]. Main goal of our study is to clarify this point and investigate the role of combinatorial cascade gluon fragmentation contributions to the double \(J/\psi \) production in different kinematical regimes at the LHC.

Fig. 1
figure 1

Contribution to the \(J/\psi \) pair production from the fragmentation of gluon cascade. The dashed line encloses the hard subprocess \(g^* g^*\rightarrow g^*\), the rest of the diagram describes the initial state radiation cascade

Our other goal is connected with the investigation of additional production mechanism, double parton scattering (DPS), which is widely discussed in the literature at present (see, for example, [34,35,36,37,38,39,40,41] and references therein). Apart from the single parton scattering (SPS), where \(J/\psi \) meson pair is produced in a single gluon–gluon collision, DPS events originate from two independent parton interactions. Studying the DPS mechanism is of great importance since it can help in understanding various backgrounds in searches for new physics at the collider experiments. Despite the relative low total production rate, the DPS mechanism is expected to be important for double \(J/\psi \) production at forward rapidities [42,43,44]. Therefore, the latter can be used to determine the DPS key parameter, the effective cross section \(\sigma _{\mathrm{eff}}\), which is related to the transverse overlap function between partons in the proton and supposed to be universal for all processes with different kinematics and energy scales. Most of the measured values of \(\sigma _{\mathrm{eff}}\) lie between 12 and 20 mb (see, for example, [45, 46]). However, somewhat lower value \(\sigma _{\mathrm{eff}} = 8.8 - 12.5\) mb was extracted from the latest LHCb data on \(J/\psi \) pair production within the NRQCD [4]. Moreover, the values \(\sigma _{\mathrm{eff}} = 8.2 \pm 2.2\) mb [47], \(\sigma _{\mathrm{eff}} = 6.3 \pm 1.9\) mb [2], \(\sigma _{\mathrm{eff}} = 4.8 \pm 2.5\) mb [48] and even \(\sigma _{\mathrm{eff}} = 2.2 \pm 1.1\) mb [49], \(\sigma _{\mathrm{eff}} = 2.2 - 6.6\) mb [50] were obtained from recent Tevatron and LHC experiments. Below we will try to extract the effective cross section \(\sigma _{\mathrm{eff}}\) from combined analysis of the LHCb data [3, 4] on double \(J/\psi \) production taken at \(\sqrt{s} = 7\) and 13 TeV.

To calculate the physical cross sections we use the \(k_T\)-factorization approach [51,52,53,54]. We see certain technical advantages in the fact that, even with the LO amplitudes for hard subprocesses, one can include a large piece of higher-order pQCD corrections (NLO + NNLO + \(\ldots \)) taking them into account in the form of CCFM-evolved Transverse Momentum Dependent (TMD) gluon densities in a protonFootnote 3. In this way we preserve consistency with our previous studies [19,20,21,22] and automatically incorporate the wanted effects of initial state gluon radiation. To reconstruct the CCFM evolution ladder, that is the key point of our consideration, we employ the TMD parton shower routine implemented into the Monte-Carlo event generator cascade [56]. The \(k_T\)-factorization approach can be considered as a convenient alternative to explicit high-order calculations in the collinear DGLAP-based scheme. The situation in \(J/\psi \) pair production is specific since calculating even the LO hard scattering amplitudes is already complicated enough, so that extending to higher orders seems to be a rather cumbersome task. Thus, the \(k_T\)-factorization remains the only way open to potentially important higher-order effects. To evaluate the DPS contributions to the double \(J/\psi \) production we will use the results of our previous analysis [20].

The outline of the paper is the following. In Sect. 2 we briefly describe the basic steps of our calculations. In Sect. 3 we present the numerical results and discussion. Our conclusions are summarised in Sect. 4.

2 The model

The neccessary starting point of our consideration is related with CS contribution to the double \(J/\psi \) production, that refers to \({{{\mathcal {O}}}}(\alpha _s^4)\) gluon–gluon fusion subprocess

$$\begin{aligned} g^*(k_1) + g^*(k_2) \rightarrow c{{\bar{c}}}\left[ ^3S_1^{(1)}\right] (p_1) + c{{\bar{c}}}\left[ ^3S_1^{(1)}\right] (p_2), \end{aligned}$$
(1)

where the four-momenta of all particles are indicated in the parentheses. Some typical Feynman diagrams are depicted in Fig. 2. It is important that both initial gluons are off mass shell. That means that they have non-zero transverse four-momenta \(k_{1T}^2 = - {{\mathbf {k}}}_{1T}^2 \ne 0\) and \(k_{2T}^2 = - {{\mathbf {k}}}_{2T}^2 \ne 0\) and an admixture of longitudinal component in the polarization four-vectors (see [51,52,53,54] for more information). The corresponding off-shell (\(k_T\)-dependent) production amplitude contains widely used projection operators for spin and color [57,58,59,60] which guarantee the proper quantum numbers of final state charmonia. Below we apply the gauge invariant expression obtained earlier [61]. The derivation steps are explained in detail there. The respective cross section can be written as

$$\begin{aligned} \sigma (pp \rightarrow J/\psi J/\psi + X)= & {} \int {1\over 16 \pi (x_1 x_2 s)^2} |{\mathcal {{\bar{A}}}}(g^* g^* \rightarrow J/\psi J/\psi )|^2 \nonumber \\&\times f_g(x_1,{\mathbf {k}}_{1T}^2,\mu ^2) f_g(x_2,{\mathbf {k}}_{2T}^2,\mu ^2) \nonumber \\&\quad d{{\mathbf {k}}}_{1T}^2 d{{\mathbf {k}}}_{2T}^2 d{{\mathbf {p}}}_{1T}^2 dy_1 dy_2 {d\phi _1\over 2\pi } {d\phi _2\over 2\pi } {d\psi _1\over 2\pi },\nonumber \\ \end{aligned}$$
(2)

where \(\psi _1\) is the azimuthal angle of outgoing \(J/\psi \) meson, \(\phi _1\) and \(\phi _2\) are the azimuthal angles of initial gluons having the longitudinal momentum fractions \(x_1\) and \(x_2\), \(y_1\) and \(y_2\) are the center of mass rapidities of produced particles and \(f_g(x,{{\mathbf {k}}}_{T}^2, \mu ^2)\) is the TMD gluon density in a proton taken at the scale \(\mu ^2\).

Fig. 2
figure 2

Examples of the Feynman diagrams, contributing to the \(J/\psi \) pair production via CS mechanism

In addition to the CS terms above, we have considered some of CO contributions using the fragmentation approach. At high transverse momenta, \(p_T \gg m_\psi \), large logaritmic corrections proportional to \(\alpha _s^n \ln ^n p_T/m_\psi \) occur and, therefore, description in terms of fragmentation functions (FFs), evolving with the energy scale \(\mu ^2\), appears to be appropriate. In general, the FF \(D_a^{{{\mathcal {H}}}}(z,\mu ^2)\) describing the transition of parton a into the charmonium state \({{\mathcal {H}}}\) can be expressed as follows (see, for example, [62] and references therein):

$$\begin{aligned} D_a^{{{\mathcal {H}}}}(z,\mu ^2) = \sum _{n}d_a^n(z,\mu ^2) \langle {{{\mathcal {O}}}}^{{{\mathcal {H}}}} [n]\rangle , \end{aligned}$$
(3)

where n labels the intermediate (CS or CO) state of charmed quark pair produced in the hard parton interaction and \(\langle {{{\mathcal {O}}}}^{{{\mathcal {H}}}} [n]\rangle \) are the corresponding LDMEs. In the leading logarithmic approximation, \(g^* \rightarrow c{{\bar{c}}}[^3S_1^{(8)}]\) transition is the only one giving a sizeble contribution to S-wave charmonia production at \(p_T \gg m_\psi \) [62], so that the cross section of inclusive single \(J/\psi \) production in pp collisions could be approximately calculated as

$$\begin{aligned} {d\sigma (pp \rightarrow J/\psi + X) \over dp_T}= & {} \int dz {d\sigma (pp \rightarrow g^*) \over dp_T^{(g^*)}} d_g^{[^3S_1^{(8)}]}\nonumber \\&\quad (z,\mu ^2) \langle {{{\mathcal {O}}}}^{J/\psi } [^3S_1^{(8)}] \rangle , \end{aligned}$$
(4)

where \(p = z p^{(g^*)}\) and \(p^{(g^*)}\) are the outgoing \(J/\psi \) meson and intermediate gluon momenta. One can easily obtain

$$\begin{aligned} \sigma (pp \rightarrow g^*)= & {} \int {\pi \over x_1 x_2 s \lambda ^{1/2}(m_\psi ^2,k_1^2,k_2^2)} |{\mathcal {{\bar{A}}}}(g^* g^* \rightarrow g^*)|^2 \nonumber \\&\times f_g(x_1,{{\mathbf {k}}}^2_{1T},\mu ^2) f_g(x_2,{{\mathbf {k}}}^2_{2T},\mu ^2) d{{\mathbf {k}}}^2_{1T} d{{\mathbf {k}}}^2_{2T} dy {d\phi _1 \over 2\pi } {d\phi _2 \over 2\pi },\nonumber \\ \end{aligned}$$
(5)

where \(p^{(g^*)} = k_1 + k_2\) and \(\lambda (m_\psi ^2, k_1^2, k_2^2)\) is the known kinematical function [63]. Evaluation of the off-shell production amplitude \(|{\mathcal {{\bar{A}}}}(g^* g^* \rightarrow g^*)|^2 = (3/2) \pi \alpha _s(\mu ^2) |{{\mathbf {p}}}_T^{(g^*)}|^2\) is an extremely straightforward and, in our opinion, needs no explanation. We only note that, according to the \(k_T\)-factorization prescription [51,52,53,54], the summation over the polarizations of initial off-shell gluons is carried out with \(\sum \epsilon ^\mu \epsilon ^{*\,\nu } = {{\mathbf {k}}}_T^\mu {{\mathbf {k}}}_T^\nu /{{\mathbf {k}}}_T^2\). In the collinear limit \({{\mathbf {k}}}_T \rightarrow 0\) this expression converges to the ordinary one after averaging on the azimuthal angle.

The key point of our consideration is that the gluon, produced in the hard scattering and fragmented into the \(J/\psi \) meson according to main formula (4), is accompanied by a number of gluons radiated during the non-collinear QCD evolution, which also give rise to final \(J/\psi \) mesons. Thus, taking into account all their possible combinations into the meson pairs, one can calculate corresponding gluon fragmentation contribution to the double \(J/\psi \) production up to all orders in the pQCD expansion. At high energies, the QCD evolution of gluon cascade can be described by the CCFM equation [30,31,32,33], which smoothly interpolates between the small-x BFKL gluon dynamics and high-x DGLAP one, and, therefore, provides us with the suitable tool for our phenomenological study. To reconstruct the CCFM evolution ladder, we generate a Les Houches Event file [64] in the numerical calculations according to (4) and (5) and then process the file with a TMD shower tool implemented into the Monte-Carlo event generator cascade [56]. This approach gives us the possibility to take into account the contributions from initial state gluon emissions in a consistent way (see also [65]).

Of course, the same scenario can be applied to fragmentation of charmed quark pairs into \(J/\psi \) mesons. So, one can first simulate the perturbative production of \(c {{\bar{c}}}\) pair in the off-shell gluon–gluon fusion and then reconstruct the CCFM evolution ladder using the cascade tool. After that, one can easily produce \(J/\psi \) pairs by taking into account all possible combinations of mesons originating from the charmed quarks and/or cascade gluon fragmentation. Unlike the conventional (collinear) QCD factorization, where only fragmentation of both charmed quarks into \(J/\psi \) mesons gives contribution, the model above can lead to increase in the double \(J/\psi \) production cross section due to additional combinatorial contributions from gluons and quarks.

The charm and gluon FFs at the any scale \(\mu ^2\), \(D_c^{J/\psi }(z,\mu ^2)\) and \(D_g^{J/\psi }(z,\mu ^2)\), can be obtained by solving the LO DGLAP evolution equations:

$$\begin{aligned} \frac{d}{d\ln \mu ^2}\left( \begin{array}{lr} D_c \\ D_g \end{array}\right) = \frac{\alpha _s(\mu ^2)}{2\pi }\left( \begin{array}{lr} P_{qq} &{} P_{gq} \\ P_{qg} &{} P_{gg} \end{array}\right) \otimes \left( \begin{array}{lr} D_c \\ D_g \end{array}\right) , \end{aligned}$$
(6)

where \(P_{ab}\) are the standard LO DGLAP splitting functions. The initial conditions for these FFs are calculated with [56]

$$\begin{aligned} d_g^{[^3S_1^{(8)}]}(z,\mu _0^2)= & {} \frac{\alpha _s(\mu _0^2)}{24 m_c^3} \pi \delta (1 - z), \end{aligned}$$
(7)
$$\begin{aligned} d_c^{[^3S_1^{(1)}]}(z,\mu _0^2)= & {} \frac{\alpha _s^2(\mu _0^2)}{m_c^3} {16z(1-z)^2 \over 243(2-z)^6}\nonumber \\&\quad \left( 5z^4-32z^3+72z^2-32z+16\right) , \end{aligned}$$
(8)

where starting scale \(\mu _0^2 = m^2_\psi \). As it was noted above, we keep only the leading contributions to corresponding FFs (see, for example, [62] and references therein). According to the non-relativistic QCD approximation, we set the charmed quark mass to \(m_c = m_\psi /2\) and then solve the DGLAP equations (6) numerically. The obtained charm and gluon FFs, \(D_c^{J/\psi }(z,\mu ^2)\) and \(D_g^{J/\psi }(z,\mu ^2)\), are shown in Fig. 3 as functions of z for several values of scale \(\mu ^2\). Using these FFs, we reproduce well the results of calculations performed with the Monte Carlo event generator pegasus [66] (see Fig. 4).

Fig. 3
figure 3

The charm (left panel) and gluon (right panel) FFs, \(D_c^{J/\psi }(z,\mu ^2)\) and \(D_g^{J/\psi }(z,\mu ^2)\), calculated as functions of z for several values of scale \(\mu ^2\). We have applied one-loop QCD coupling with \(n_f = 4\), \(\Lambda _{\mathrm{QCD}} = 250\) MeV and took \(\langle {{{\mathcal {O}}}}^{J/\psi } [^3S_1^{(1)}] \rangle = 1.16\) GeV\(^3\), \(\langle {{{\mathcal {O}}}}^{J/\psi } [^3S_1^{(8)}] \rangle = 2.5 \cdot 10^{-3}\) GeV\(^3\)

Fig. 4
figure 4

Transverse momentum and rapidity distributions of inclusive \(J/\psi \) production at \(\sqrt{s} = 13\) TeV, calculated using the fragmentation approach and Monte-Carlo event generator pegasus [63]. The contributions from the \(^3S_1^{(8)}\) transition are only taken into account with \(\langle {{{\mathcal {O}}}}^{J/\psi } [^3S_1^{(8)}] \rangle = 2.5 \cdot 10^{-3}\) GeV\(^3\) [20]. The A0 gluon distribution in proton is applied

Finally, we turn to the DPS contribution to the double \(J/\psi \) production. We apply a commonly used factorization formula (for details see reviews [34,35,36,37,38,39,40,41] and references therein):

$$\begin{aligned} \sigma _{\mathrm{DPS}}(pp \rightarrow J/\psi J/\psi + X) = {1\over 2} {\sigma ^2(pp \rightarrow J/\psi + X) \over \sigma _{\mathrm{eff}}}, \end{aligned}$$
(9)

where factor 1/2 accounts for two identical particles in the final state. The effective cross section \(\sigma _{\mathrm{eff}}\) can be considered as a normalization constant which incorporates all ”DPS unknowns” in to a single phenomenological parameter. Derivation of the factorization formula (9) relies on the two approximations: first, the double parton distribution function can be decomposed into longitudinal and transverse components and, second, the longitudinal component reduces to the diagonal product of two independent single parton densities. The latter is generally acceptable for such collider experiments where small-x values are probed. The typical values of the variable x in the considered process are of order \(x \sim (2m_\psi ^2 + p_T^2)^{1/2}/\sqrt{s} \sim 10^{-3}\), that approximately corresponds to the kinematical region of CMS [1], ATLAS [2] and even LHCb [3, 4] measurements (due to relatively small invariant mass of produced \(J/\psi \) pair, see discussion below). Therefore, one can safely omit the kinematical constraint [67, 68] often applied at the edge of phase spaceFootnote 4. Detailed description of evaluation of inclusive cross section \(\sigma (pp \rightarrow J/\psi + X)\) in the \(k_T\)-factorization approach supplemented with the NRQCD formalism can be found [20].

In the numerical calculations below we will use TMD gluon densities in a proton obtained from the numerical solution of CCFM evolution equation, namely, A0 [70] and more recent JH’2013 set 2 [71] gluons. Their input parameters have been fitted to the proton structure function \(F_2(x,Q^2)\). At present, both these gluon distributions are widely used in the phenomenological applicationsFootnote 5 (see, for example, [20,21,22]). The renormalization and factorization scales, \(\mu _R\) and \(\mu _F\), were set to \(\mu _R^2 = \mu _F^2 = {{\hat{s}}} + {{\mathbf {Q}}}^2_T\), where \({{\hat{s}}} = (k_1 + k_2)^2\) and \({{\mathbf {Q}}}_T^2\) is the transverse momentum of initial off-shell gluon pair. This choice is dictated mainly by the CCFM evolution algorithm (see [70, 71] for more information). As it is often done, the fragmentation scale \(\mu _{\mathrm{fr}}\) is choosen to be equal to \(\mu _{\mathrm{fr}} = m_T\), the transverse mass of fragmenting parton. We use the one-loop formula for the QCD coupling \(\alpha _s\) with \(n_f = 4\) active quark flavors at \(\Lambda _{\mathrm{QCD}}^{(4)} = 250\) MeV for A0 gluon and two-loop expression for \(\alpha _s\) with \(n_f = 4\) and \(\Lambda _{\mathrm{QCD}}^{(4)} = 200\) MeV for JH’2013 set 2 distribution. Following [73], we set the \(J/\psi \) meson mass \(m_\psi = 3.097\) GeV. We take corresponding CS LDME from the known \(J/\psi \rightarrow \mu ^+\mu ^-\) decay width: \(\langle {{{\mathcal {O}}}}^{J/\psi } [^3S_1^{(1)}]\rangle = 1.16\) GeV\(^3\) [8,9,10,11,12].

3 Numerical results and discussion

We are now in a position to present the results of our simulations. First we discuss the role of cascade gluon fragmentation in different kinematical regimes, which correspond to the CMS, ATLAS and LHCb experiments.

In Fig. 5 we show the differential cross sections of double \({J/\psi }\) production calculated as a functions of \(J/\psi \) pair invariant mass \(m(J/\psi , J/\psi )\) and difference in rapidity between the \(J/\psi \) mesons \(|\Delta y(J/\psi , J/\psi )|\) at \(\sqrt{s} = 13\) TeV. We have required \(p_T(J/\psi ) > 10\) GeV for both produced mesons, that ensures the validity of the fragmentation approach used. Moreover, this restriction close to the CMS or ATLAS conditions. To illustrate the effect of using different TMD gluon densities we present the results obtained with the A0 and JH’2013 set 2 distributions. One can see that an accurate account of combinatorial contributions, originated from the cascade gluon fragmentation into the \(J/\psi \) mesons (labeled as ”fragm. comb.”), significantly (up to an order of magnitude) increase the cross section compared to the single gluon fragmentation, governed by the LO gluon–gluon fusion subprocessFootnote 6 (labeled as ”fragm. coll.”). For the latter, we reproduce the results [28]. To highlight the importance of the combinatorial gluon fragmentation, we show the results obtained using the simplified selection of \({J/\psi }\) pair in each event, where one of the \(J/\psi \) mesons is originated from the gluon produced in the hard scattering subprocess and another one is produced from the leading cascade gluon (labeled as ”fragm. lead.”). This selection criterion almost corresponds to the collinear limit, as it is clearly demonstrated in Fig. 5. Next, we find that the cascade gluon fragmentation plays a dominant role at large invariant masses \(m(J/\psi , J/\psi ) \ge 25\) GeV for A0 gluon density (or at \(m(J/\psi , J/\psi ) \ge 35\) GeV for JH’2013 set 2 one) and \(|\Delta y(J/\psi , J/\psi )| \ge 1\), where it greatly overestimates the CS contributions. Note that the quantitative difference between the A0 and JH’2013 set 2 predictions observed in Figs. 5 and 6 is connected mainly with the different LDMEs used in the calculationsFootnote 7. Taking into account these combinatorial contributions results in the drastical rise of the double \(J/\psi \) production cross sections at large \(m(J/\psi , J/\psi )\), where the strong discrepancy between the NRQCD estimations (including both the CS and CO terms) and experimental data, taken by the CMS and ATLAS Collaborations, is observed. Contrary, the combinatorial fragmentation effects should be significantly less pronounced at forward rapidities, which are covered by the LHCb measurements. To demonstrate it, we have repeated the calculations under the requirements \(4.5< p_T(J/\psi ) < 10\) GeV and \(2< y(J/\psi ) < 4.5\). The upper limit of \(p_{T}(J/\psi )\) is set to be the same as in LHCb analyses [3, 4] while lower limit corresponds to the region, where the fragmentation approach is valid. Our results for distributions in \(m(J/\psi ,J/\psi )\) and \(|\Delta y(J/\psi ,J/\psi )|\) are shown in Fig. 6. One can see that the cascade gluon fragmentation gives only small contribution to the forward \(J/\psi \) pair production and, in principle, can be safely neglected. It can be easily understood since at large rapidities (or, equivalently, at large momentum fraction x of one of the interacting gluons) the gluon emissions in the initial state are insufficient.

Fig. 5
figure 5

Different contributions to the double \(J/\psi \) production calculated as functions of invariant mass \(m(J/\psi , J/\psi )\) and rapidity separation \(|\Delta y(J/\psi , J/\psi )|\) at \(\sqrt{s} = 13\) TeV. The kinematical cut \(p_T(J/\psi ) > 10\) GeV is applied for both \(J/\psi \) mesons. The A0 (upper panels) and JH’2013 set 2 (lower panels) gluon distributions in proton are used

Fig. 6
figure 6

Different contributions to the double \(J/\psi \) production calculated as functions of invariant mass \(m(J/\psi , J/\psi )\) and rapidity separation \(|\Delta y(J/\psi , J/\psi )|\) at \(\sqrt{s} = 13\) TeV. The kinematical cuts \(4.5< p_T(J/\psi ) < 10\) GeV and \(2< y(J/\psi ) < 4.5\) are applied for both \(J/\psi \) mesons. The A0 (upper panels) and JH’2013 set 2 (lower panels) gluon distributions in proton are used

Concerning the contributions from charm fragmentation, their role (compared to the LO predictions of conventional pQCD) is also a bit enhanced due to the multiple gluon emissions in the initial state. We find that these processes amount several percent of the \({J/\psi }\) pair production cross section (see Figs. 5 and 6) and, of course, can be considered as additional non-leading terms.Footnote 8

Thus, we have shown that taking into account the combinatorial contributions from the cascade gluon fragmentation could fill the gap between the NRQCD predictions and experimental data. However, to perform the quantitative comparison with the available CMS [1] and ATLAS [2] measurements one has to include into the analysis a number of other possible fragmentation channels playing role at low and moderate transverse momenta. Moreover, additional feeddown contributions to the double \(J/\psi \) production from the \(\chi _c\) and \(\psi ^\prime \) decays should be taken into account. An accurate theoretical description requires a rather long-time numerical calculations. So, here we only claim the possible importance of the combinatorial fragmentation terms above and left their further cumbersome analysis for a forthcoming dedicated study.

Now we turn to available LHCb data collected at \(\sqrt{s} = 7\) and 13 TeV [3, 4]. These data refer to \(p_T(J/\psi ) < 10\) GeV, \(m(J/\psi , J/\psi ) < 15\) GeV and forward rapidity region, \(2< y(J/\psi ) < 4.5\). Since the combinatorial contributions from gluon and/or charmed quark fragmentation are almost negligible there, only the CS terms and DPS production mechanism play the role. The latter give us the possibility to easily extract the key parameter of DPS mechanism, the effective cross section \(\sigma _{\mathrm{eff}}\), from the LHCb measurements. The feeddown contributions from radiative \(\chi _{c}\) and \(\psi ^\prime \) decays to the SPS cross section, which is governed by the subprocess (1), are also unimportant at small transverse momenta and invariant mass \(m(J/\psi , J/\psi )\), see discussions [44, 75]. Thus, we neglect below all these terms for simplicity. To evaluate the DPS contribution to the \(J/\psi \) pair production we use the results of our previous studies and strictly follow the approach [20] for the inclusive cross section \(\sigma (pp \rightarrow J/\psi + X)\), entering to the DPS factorization formula (9). So, the determination of \(\sigma _{\mathrm{eff}}\) can be performed in a self-consistent way. Note that only the SPS terms are affected by the choice of TMD gluon density in a proton. The DPS predictions are stable, because when we switch to a different set of TMD gluons we have to accordingly change the LDME values, in order to keep the calculation consistent with the single inclusive \(J/\psi \) production data. It means that the changes in TMD are compensated by the changes in LDMEs, making the net result the same.

Fig. 7
figure 7

Differential cross sections of double \(J/\psi \) production as functions of invariant mass \(m(J/\psi , J/\psi )\) and transverse momentum \(p_T(J/\psi ,J/\psi )\) calculated at \(\sqrt{s} = 7\) TeV (left panel) and \(\sqrt{s} = 13\) TeV (right panel). The kinematical cuts applied are described in the text

Fig. 8
figure 8

Prompt double \(J/\psi \) production cross sections as functions of different kinematical variables calculated at \(\sqrt{s} = 13\) TeV. The kinematical cuts applied are described in the text

Fig. 9
figure 9

Prompt double \(J/\psi \) production cross sections as functions of different kinematical variables calculated at \(p_T(J/\psi ,J/\psi ) > 1\) GeV and \(\sqrt{s} = 13\) TeV. Other kinematical cuts applied are described in the text

Fig. 10
figure 10

Prompt double \(J/\psi \) production cross sections as functions of different kinematical variables calculated at \(p_T(J/\psi ,J/\psi ) > 3\) GeV and \(\sqrt{s} = 13\) TeV. Other kinematical cuts applied are described in the text

The following kinematical variables have been investigated in the LHCb analyses [3, 4]: transverse momentum \(p_T(J/\psi , J/\psi )\), rapidity \(y(J/\psi ,J/\psi )\) and invariant mass of the \(J/\psi \) pair, transverse momentum and rapidity of \(J/\psi \) mesons, differences in the azimuthal angle \(|\Delta \phi (J/\psi , J/\psi )|\) and rapidity \(|\Delta y(J/\psi , J/\psi )|\) between the produced mesons and transverse momentum asymmetry \({{{\mathcal {A}}}}_T\), defined as

$$\begin{aligned} {{{\mathcal {A}}}}_T = \left| {p_T(J/\psi _1) - p_T(J/\psi _2) \over p_T(J/\psi _1) + p_T(J/\psi _2)} \right| . \end{aligned}$$
(10)

The measurements have been performed for \(p_T(J/\psi ,J/\psi ) > 1\) GeV, \(p_T(J/\psi ,J/\psi ) > 3\) GeV and in the whole \(p_T(J/\psi ,J/\psi )\) range. We consider \(\sigma _{\mathrm{eff}}\) as an independent parameter and perform a simultaneous fit to the LHCb data. The fitting procedure was separately done for each of the measured kinematical distributions employing the fitting algorithm as implemented in the commonly used gnuplot package [76].

Not all of the existing data sets are equally informative for the \(\sigma _{\mathrm{eff}}\) extraction. Using the data where the DPS contribution is smaller than the uncertainty of the ”main” contribution would only increase the total error. So, our fit is based on the following distributions (all measured at \(\sqrt{s} = 7\) TeV and 13 TeV): single \(J/\psi \) transverse momentum \(p_T(J/\psi )\); single \(J/\psi \) rapidity \(y(J/\psi )\); invariant mass \(m(J/\psi ,J/\psi )\); transverse momentum of \(J/\psi \) pair; rapidity of \(J/\psi \) pair; transverse momentum asymmetry \({{{\mathcal {A}}}}_T\); rapidity separation between the two \(J/\psi \) mesons \(|\Delta y(J/\psi ,J/\psi )|\). For all observables except \(|\Delta y(J/\psi ,J/\psi )|\) we used the data without cuts on \(p_T(J/\psi ,J/\psi )\) and with \(p_T(J/\psi ,J/\psi ) > 1\) GeV, while for the rapidity separation \(|\Delta y(J/\psi ,J/\psi )|\) we used the sets without cuts on \(p_T(J/\psi ,J/\psi )\), with \(p_T(J/\psi ,J/\psi ) > 1\) GeV, and with \(p_T(J/\psi ,J/\psi ) > 3\) GeV.

The obtained mean-square averages of the fitted values are \(\sigma _{\mathrm{eff}} = 17.5 \pm 4.1\) mb (for A0 gluon) and \(\sigma _{\mathrm{eff}} = 13.8 \pm 0.9\) mb (for JH’2013 set 2 gluon), where corresponding uncertainties are estimated in the conventional way using Student’s t-distribution at the confidence level \(P = 95\)%. These values coincide to each other within the uncertainties. Here we achieve a remarkable agreement with the majority of other \(\sigma _{\mathrm{eff}}\) estimations based on different final states, such as, for example, \(W + 2\) jets [77, 78], \(2\,\gamma + 2\) jets [79], \(\gamma + 3\) jets [80], 4 jets [34, 45], \(J/\psi + D^+\), \(J/\psi + D^0\), \(J/\psi + \Lambda _c^+\) [81], \(\Upsilon (1S) + D^0\) [46]. Thus, our results support the expectation about the universality of this parameter for a wide range of processes with essentially different kinematics, energies and hard scales. The both obtained values of \(\sigma _{\mathrm{eff}}\) significantly exceed previous estimations based on the same final state, \(J/\psi + J/\psi \), which are typically of about \(2 - 5\) mb [48,49,50]. Of course, the results [48,49,50] also contradict to the most of the measured \(\sigma _{\mathrm{eff}}\) values [77,78,79,80,81]. The disparity between the results can be attributed to using different schemes for parton densities (collinear in [2, 46, 47] versus \(k_T\)-factorization in our case) or to a different choice of the renormalization scale \(\mu _R\) (regarding the \(k_T\)-based part of the analysis presented in [4]). The missing feed-down contributions may also play some role.

A comparison of our predictions with the LHCb experimental data is displayed in Figs. 7, 8, 9 and10. The theoretical uncertainty bands are shown for A0 gluon density and include both scale uncertainties and uncertainties coming from the \(\sigma _{\mathrm{eff}}\) fitting procedure. First of them have been estimated in a usual way, by varying the \(\mu _R\) scale around its default value by a factor of 2. This was accompanied with using the A0\(+\) and A0− gluon densities instead of default A0 distribution, in accordance with [70]. As one can see, we achieved a reasonably good agreement between the results of our calculations with both considered TMD gluons and LHCb measurements performed for \(\sqrt{s} = 7\) and 13 TeV. There is only exception in the threshold region, \(m(J/\psi , J/\psi ) \le 9\) GeV, where our predictions systematically overshoot the data. However, at such low \(m(J/\psi , J/\psi )\) an accurate treatment of multiple soft gluon emissions, relativistic corrections and other nonperturbative effects becomes necessary to produce the theoretical estimations. All these issues are out from our present consideration. Next, we find that neither the SPS terms, nor the DPS contributions alone are able to describe the LHCb data, but only their sum. In particular, the DPS contributions are essential to reproduce the measured \(|\Delta y (J/\psi , J/\psi )|\) distributions at \(|\Delta y (J/\psi , J/\psi )| \ge 1\) or 1.5, that confirms the previous expectations [42,43,44]. They are important to describe also the normalization of \(J/\psi \) rapidity distributions and shape of transverse momentum asymmetry \({{{\mathcal {A}}}}_T\) at \({{{\mathcal {A}}}}_T \le 0.4\), see Figs. 8, 9 and 10.

The presented results, being considered altogether with the ones for inclusive single production of charmonia states [20], can give a significant impact on the understanding of charmonia production within the NRQCD framework and, in particular, on the further understanding of DPS mechanism. The most interesting outcome of our study is that the extremely low value of DPS effective cross section, \(\sigma _{\mathrm{eff}} \sim 2-5\) mb, obtained in earlier analyses of double \(J/\psi \) production at the LHC, is not confirmed.

4 Conclusion

We have considered the prompt production of \(J/\psi \) meson pairs in pp collisions at the LHC using the \(k_T\)-factorization approach of QCD. We employ the fragmentation mechanism to evaluate the color octet contributions to the production cross sections and take into account the combinatorial effects of multiple gluon radiation in the initial state using the CCFM evolution equation. The latter could be essential in the kinematical region covered by the CMS and ATLAS measurements. On the other hand, we have demonstrated that the experimental data taken by the LHCb Collaboration at forward rapidities can be described well by the color singlet terms and contributions from the double parton scattering mechanism. We determine the DPS effective cross section \(\sigma _{\mathrm{eff}} = 17.5 \pm 4.1\) mb (for A0 gluon) and \(\sigma _{\mathrm{eff}} = 13.8 \pm 0.9\) mb (for JH’2013 set 2 gluon) from the combined analysis of recent LHCb data collected at \(\sqrt{s} = 7\) and 13 TeV. The extracted values are compatible with many other estimations based on essentially different final states. The extremely low \(\sigma _{\mathrm{eff}} \sim 2 - 5\) mb, obtained earlier from the double \(J/\psi \) production data, is not confirmed.