It is well known that parton distribution functions in a proton (PDFs), \({{f}_{a}}(x,{{\mu }^{2}})\) with \(a = q\) or g, are an essential ingredient of any description of hard scattering at modern colliders energies. If only one scale is present in the process, \(\mu \sim \sqrt s \gg {{\Lambda }_{{{\text{QCD}}}}}\), then the PDFs can be described in Quantum Chromodynamics (QCD) via the Dokshitzer–Gribov–Lipatov–Altarelli–Parisi (DGLAP) equations [14]. However, in case of a two-scale process, \(\sqrt s \gg \mu \gg {{\Lambda }_{{{\text{QCD}}}}}\), the gluon dynamics can be described by the Balitsky–Fadin–Kuraev–Lipatov (BFKL) [57] or Catani–Ciafaloni–Fiorani–Marchesini (CCFM) [811] equations. It leads to Transverse Momentum Dependent (TMD, or unintegrated) gluon densities in a proton and high energy factorization [12, 13], or \({{k}_{T}}\)-factorization [14, 15] approach (see review [16] for more information). The \({{k}_{T}}\)-factorization approach is quite widely used in phenomenological applications and implemented in several Monte Carlo event generators, such as pegasus [17] and cascade [18]. A comprehensive set of TMD gluon distributions in a proton is available in the TMDLIB library [19].

In general, TMD gluon densities can be calculated within some approaches, such as popular Kimber–Martin–Ryskin formalism [2022], Parton Branching approach [23, 24] or obtained from the analytical or numerical solutions of BFKL-like QCD evolution equations. There are also investigations within the nonlinear (BK-JIMWLK) evolution in QCD (see, for example, [2534] and references therein). The CCFM equation, which resumes large logarithmic terms proportional to \(\alpha _{s}^{n}{{\ln }^{n}}1{\text{/}}x\) and \(\alpha _{s}^{n}{{\ln }^{n}}1{\text{/}}(1 - x)\) and therefore valid at both low and large x, has been applied [35]. In these calculations, the empirical expression for initial gluon density at some starting scale \({{\mu }_{0}}\) (which is of the order of the hadron scale) with factorized Gauss smearing in transverse momentum was applied. In contrast, in our previous study [36] a more physically motivated expression for the input distribution was chosen. It is based on the description of the LHC data on soft hadron transverse momenta spectra in the framework of the modified soft quark gluon string model [37, 38] (see also [3944]) with taking into account gluon saturation effects important at small x and scales of about the order of saturation scale \({{Q}_{s}}\) [45]. The nonlinear effects in the subsequent QCD evolution were neglected (see discussion below). The obtained CCFM-evolved gluon density in a proton (LLM) was successfully tested later on a number of collider data, in particular, on latest HERA data on longitudinal structure function of protonFootnote 1\({{F}_{L}}(x,{{Q}^{2}})\) [47] and associated photoproduction of prompt photons and jets [48].

Very recently it was shown [49] that some phenomenological parameters of the starting LLM gluon density need to be corrected in order to provide a good description (within the color dipole approach [4853]) of the low Q2 data on proton structure function \({{F}_{2}}(x,{{Q}^{2}})\) and reduced deep inelastic cross sections taken by H1 and ZEUS Collaborations. These parameters are related mainly with the small x behavior of the TMD gluon density at low scales, as it will demonstrated below. Such corrections have been done already [49]. At the same time, this an adjustment does not spoil the quality of soft hadron spectra data, provided the corresponding parameters of fragmentation of quarks and diquarks to the hadrons are altered moderately (see [49] for more details). Of course, in this case essential parameters, which cannot be determined from these data, have to be fitted from other measurements with taking into account the effects connected with the QCD evolution of gluon density. It is the aim of this short work to perform such a fit using an extended set of experimental data for several processes known to be sensitive to the gluon content of the proton and then complete our study by applying the CCFM evolution to the newly fitted initial density. As a main result, we present a more universal TMD gluon distribution function in a proton, which is already available in the Monte Carlo event generator PEG-ASUS and TMDLIB tool.

So, we ended up with the following form (non-factorized with respect to x and \({\mathbf{k}}_{T}^{2}\)) of the initial LLM gluon density in a proton:

$$\begin{gathered} {{f}_{g}}(x,{\mathbf{k}}_{T}^{2}) \\ = {{c}_{g}}{{(1 - x)}^{{{{b}_{g}}}}}\sum\limits_{n = 1}^3 {{c}_{n}}{{\left[ {{{R}_{0}}(x){\text{|}}{{{\mathbf{k}}}_{T}}{\text{|}}} \right]}^{n}}{\text{exp}}\left( { - {{R}_{0}}(x){\text{|}}{{{\mathbf{k}}}_{T}}{\text{|}}} \right), \\ \end{gathered} $$
(1)

where

$$R_{0}^{2}(x) = \frac{1}{{Q_{0}^{2}}}{{\left( {\frac{x}{{{{x}_{0}}}}} \right)}^{\lambda }},{{b}_{g}} = {{b}_{g}}(0) + \frac{{4{{C}_{A}}}}{{{{\beta }_{0}}}}\ln \frac{{{{\alpha }_{s}}(Q_{0}^{2})}}{{{{\alpha }_{s}}({\mathbf{k}}_{T}^{2})}},$$
(2)

with \({{C}_{A}} = {{N}_{C}}\), \({{\beta }_{0}} = 11 - 2{{N}_{f}}{\text{/}}3\) and \({{Q}_{0}} = 2.2\) GeV. Here \({{b}_{g}}\) parameter is treated to be running at \({\mathbf{k}}_{T}^{2} > Q_{0}^{2}\) only, whereas the fixed value \({{b}_{g}} = {{b}_{g}}(0)\) at \({\mathbf{k}}_{T}^{2} \leqslant Q_{0}^{2}\) is used. Simultaneous best fit to the HERA data on reduced deep inelastic cross section \({{\sigma }_{r}}(x,{{Q}^{2}})\) and proton structure function \({{F}_{2}}(x,{{Q}^{2}})\) measured at low \({{Q}^{2}} < 4.5\) GeV2 and LHC data on charged hadron (pion and kaon) production at small transverse momenta pT in the mid-rapidity region leads to \({{c}_{1}} = 5\), \({{c}_{2}} = 3\), \({{c}_{3}} = 2\), \({{x}_{0}} = 1.3 \times {{10}^{{ - 11}}}\) and \(\lambda = 0.22\) [49]. The experimental data on charged hadron production involved into the fit are compared with our predictions in Fig. 1 (left panel). One can see that good agreement is achieved in a wide range of energies. For a comparison we also show the results obtained within the popular GBW model [25, 53]. It was argued [49] that the latter gives a somewhat worse description of the data. Note that the saturation dynamics given by (1) has been already discussed [49] (see also [36]). In particular, the predicted saturation scaleFootnote 2\({{Q}_{s}}\) is of about \({{Q}_{s}} \sim 0.8\) GeV at very low \(x \sim {{10}^{{ - 11}}}\). This means that the nonlinear effects in the subsequent QCD evolution are practically absent and could be safely neglected, as it is often done in the framework of \({{k}_{T}}\)-factorization (high energy factorization) approach. Thus, we apply here the linear CCFM equationFootnote 3 to describe the QCD evolution, as we did earlier. Our choice is motivated mainly by the fact that the latter smoothly interpolates between the small-x BFKL gluon dynamics and large-x DGLAP one and therefore provides a suitable phenomenological tool.

Fig. 1.
figure 1

(Color online) Left panel: transverse momentum distributions of multiplicities of soft charged hadrons produced in \(pp\) collisions at different LHC energies in the mid-rapidity region. Shaded bands correspond to the uncertainties of our calculations connected with the uncertainties in determination of \({{x}_{0}}\) and parton-to-hadron fragmentation parameters. The experimental data are from ATLAS [54, 55] and CMS [56]. Right panel: \({{\chi }^{2}}{\text{/d}}{\text{.o}}{\text{.f}}.\) dependence of our fit for \({{b}_{g}}(0)\) parameter.

Our procedure to determine remaining parameters, \({{b}_{g}}(0)\) and cg, follows then the investigation [36]. As usual, they are extracted by minimizing

$${{\chi }^{2}} = \sum\limits_i {{\left[ {\frac{{\mathcal{C}_{i}^{{{\text{data}}}} - \mathcal{C}_{i}^{{{\text{theory}}}}}}{{\Delta \mathcal{C}_{i}^{{{\text{data}}}}}}} \right]}^{2}},$$
(3)

where \(i\) runs over all experimental data points for observables \({{\mathcal{C}}_{i}}\) from analyses listed in Table 1. Exploiting our previous results, theoretical predictions for each of \({{\mathcal{C}}_{i}}\) are generated for large number of fixed guessed \({{b}_{g}}(0)\) values in the range \(1 \leqslant {{b}_{g}}(0) \leqslant 9\) after applying the CCFM evolution to input density (1). At this step we used UPDFEVOLV routine [58] to solve numerically the CCFM equation. Note that the data set is extended now compared to the previous analysis [36] by including the ATLAS data [59] on inclusive b‑jet production in pp collisions at \(\sqrt s = 7\) TeV, CMS data on c-jet production in pp interactions at \(\sqrt s = 2.76\) and 5.02 GeV [60] and HERA data [61, 62] on inclusive prompt photon photoproduction in ep collisions at \(\sqrt s = 319\) GeV. Our simultaneous fit to all data sets leads to \({{b}_{g}}(0) = 5.109\) and \({{c}_{g}} = 1.2708\) with goodness \({{\chi }^{2}}{\text{/d}}{\text{.o}}{\text{.f}}. = 1.773\) for 509 data points, see Fig. 1 (right panel). Then, to obtain the TMD gluon density in the whole kinematical range, we solved the CCFM equation numerically using the fitted values of the parameters above. In this way, the gluon distribution function (LLM) is tabulated in a commonly recognized format (namely, grid of 50 × 50 × 50 bins in x, \({\mathbf{k}}_{T}^{2}\), and μ2) which is used in the TMDLIB package. It is already available for public usage from there and implemented also into the Monte-Carlo event generator pegasus.

Table 1. List of experimental data used for the fitting procedure and number of data points for each experiment

Let us now illustrate our fit with the inclusive prompt photon photoproduction in ep collisions at HERA, where sensitivity to the gluon density in a proton can be seen clearly. Here, in the photoproduction regime of DIS, the colliding electron emits a quasi-real photon, which then interacts with a proton. Here we completely follow our previous study [48], where we have investigated the prompt photon photoproduction with associated hadronic jets. The consideration is based on two leading off-shell (depending on the nonzero virtualities of incoming gluons) photon-gluon subprocesses, namely \(\gamma g{\text{*}} \to \gamma q\bar {q}\) and “box” subprocess \(\gamma g{\text{*}} \to \gamma g\), currently implemented into the Monte-Carlo event generator pegasus. The sea quark contribution is then incorporated in the former subprocess, while the small contribution of valence quarks can be taken into account in the conventional QCD factorization (see [48] for more details). Numerically, we use the massless limit for light (\(u\), d, and \(s\)) quarks, set the charm and beauty masses to \({{m}_{c}} = 1.4\) GeV and \({{m}_{b}} = 4.78\) GeV and implement certain kinematical cut applied in the experimental analyses:Footnote 4\(6 < E_{T}^{\gamma } < 15\) GeV, \( - 1.0 < {{\eta }^{\gamma }} < 2.4\), \(0.1 < y\) < 0.7 for H1 measurement [61] and \(6 < E_{T}^{\gamma } < 15\) GeV, \( - 0.7 < {{\eta }^{\gamma }} < 0.9\) and \(0.2 < y < 0.7\) for ZEUS [62] one, where \(y\) is the fraction of the electron energy transferred to the photon (inelasticity). Both these data were obtained with the electron energy \({{E}_{e}} = \) 27.6 GeV and proton energy \({{E}_{p}} = 920\) GeV. Results of our calculations for produced photon transverse energy and pseudorapidity distributions are shown in Fig. 2. One can see that our predictions (represented by the green histograms) are consistent with the latest HERA data within the experimental and theoretical uncertainties. The latter are estimated in a traditional way, by varying the renormalization scaleFootnote 5 around its default value \({{\mu }_{R}} = E_{T}^{\gamma }\) as \({{\mu }_{R}} \to \xi {{\mu }_{R}}\) with \(\xi = 1{\text{/}}2\) or 2. In contrast with newly fitted LLM set, using the JH’2013 set 2 gluon derived previously [35] leads to systematic overestimation of the HERA data in most of the bins, that coincides with observations made earlier [47, 48]. As it was argued [36], better agreement achieved with the LLM distribution is a consequence of using a physically motivated expression (1) for the initial gluon density for subsequent CCFM evolution. Thus, our calculations demonstrate that considered HERA data could help to distinguish the different approaches to evaluate the TMD gluon density in a proton. Exact determination of the latter, of course, is  important for experiments at future electron-proton or electron-ion colliders, such as NICA, LHeC, FC-C‑eh, EiC, and EiCC.

Fig. 2.
figure 2

(Color online) Inclusive prompt photon photoproduction cross section as a function of the photon transverse energy and pseudorapidity. The green (yellow) histograms and shaded bands correspond to the predictions obtained with the LLM (JH’2013 set 2) gluon density and estimated scale uncertainties of these calculations. The experimental data are from H1 [71] and ZEUS [72].

Finally we would like to summarize the search for the phenomenological parameters of the proposed TMD gluon density in a proton, which has been performed in two steps. First, some parameters of the starting gluon distribution, which are not sensitive to the QCD evolution, have been found [49]. We point out that the analytical expression for the input (1) provides a self-consistent simultaneous description of HERA data on the proton structure function \({{F}_{2}}(x,{{Q}^{2}})\), reduced cross section for the electron-proton deep inelastic scattering at low Q2 and soft hadron production in \(pp\) collisions at the LHC conditions. Second, in this work, we continue the determination of phonemenological parameters (namely, \({{b}_{g}}(0)\) and \({{c}_{g}}\)) with taking into account the effects of QCD evolution. Our procedure was based on a fit to a number of LHC and HERA data for processes sensitive to the gluon content of a proton at scale \(\mu > {{Q}_{s}}\). The resulting fit quality (\({{\chi }^{2}}{\text{/d}}{\text{.o}}{\text{.f}}. = 1.773\)) shows that the obtained gluon density does not contradict experimental data. We illustrate it additionally with latest HERA data on inclusive prompt photon photoproduction. Our results together with the ones [49] represent a self-consistent approach for the TMD gluon density in a proton valid in a wide kinematical region. The updated LLM gluon density supersedes previous version and can be used in different phenomenological applications for pp, \(p\bar {p}\), and \(ep\) processes at modern and future colliders. It is available now in the TMDLIB library and Monte Carlo event generator pegasus.