1 Introduction and conclusions

Good understanding of the proton structure has been one of the fundamental goals of particle physics over recent decades. Measurements of the deep inelastic \(e^{\pm }p\) scattering (DIS) performed by H1 and ZEUS collaborations at the HERA collider contributed invaluable experimental input into this task. The combined data of H1 and ZEUS [1] that include all the measurements of the proton structure functions provide the most accurate information on the proton structure over wide range of Bjorken x and the momentum transfer \(Q^2\), in particular at smaller x and \(Q^2\). Hence it is crucial to fully use these data to extract the precise information on the parton density functions (PDFs) in the proton.

The standard description of the proton structure function in QCD relies on the operator product expansion (OPE) in which only the leading – twist 2 operators – are retained. The twist 2 contributions to proton structure functions obey the hard factorisation theorem that allows to isolate the universal twist 2 parton density functions. The PDFs drive the proton scattering cross sections and the accuracy of the PDFs determination is crucial for the precision of measurements at proton colliders. It follows from the OPE however, that the twist 2 description of proton scattering is subject of higher-twist (HT) corrections that enter with suppression of inverse powers of the hard process scale squared, \(Q^2\). Those corrections, although quickly decreasing with \(Q^2\), may affect the determination of the PDFs from the cross sections. In order to avoid determination error of the PDFs it is necessary to include the higher-twist terms in the analysis. Currently not much is known about higher-twist components of the proton structure. The operator content is increasingly complicated with the increasing twist and the available data are not sufficient to perform a clean and straightforward measurement of the higher-twist terms. Fortunately, the model independent characteristics of higher-twist terms given by their \(Q^2\) scaling provides opportunity to obtain some information on the higher-twist corrections from fits to DIS data extended to low \(Q^2\). From the theory side, properties of the leading twist 4 singularity at small x were investigated [2], corresponding to a quasi-partonic [3] four-gluon exchange. It was found [2] that the energy dependence of the leading exchange is up to \(1/N_c^2\) corrections, given by a double gluonic ladder exchange in the t-channel. Hence, although the overall magnitude of the twist 4 contributions is currently undetermined, the \(Q^2\) and x-dependencies of these contributions are known from theory and may be used as the higher-twist signatures. Estimates of the higher-twist corrections to DIS at small x [4, 5] and fits to the DIS data with higher-twist corrections [6,7,8] have been performed since many years. Only recently however, with the most precise set of the combined HERA data these ideas implemented in several fits to diffractive DIS [9] and inclusive DIS [10, 11] have lead to accumulating an evidence for higher-twist corrections in diffractive and inclusive DIS.

The recent studies of higher-twist effects in inclusive DIS [10, 11] are based on DGLAP fits of the leading-twist contribution complemented by a simple model of twist 4 correction. In the central models elaborated in these analyses a multiplicative twist 4 correction was assumed for the longitudinal structure function \(F_{\mathrm {L}} \) of the form of \(A F_{\mathrm {L}}/ Q^2\) and the twist 4 correction to the structure function \(F_2 \) was set to vanish.Footnote 1 The fit quality increased significantly for the combined HERA data for \(Q^2 > Q^2_{\mathrm {min}} = 2\,\mathrm {GeV} ^2\) for both NLO and NNLO DGLAP approximations of the leading-twist evolution. This simple model provides a surprisingly good description of the DIS data, except of the predicted steep rise of \(F_{\mathrm {L}} \) towards the low \(Q^2\) for the NNLO DGLAP fit with the twist 4 correction found in Ref. [11]. Such a rise is not inline with the \(F_{\mathrm {L}} \) data [12,13,14].

In this paper we adopt a more flexible model of the twist 4 contribution motivated by an extraction of the twist 4 corrections to structure functions [15, 16] from the Golec-Biernat–Wüsthoff saturation model [17, 18]. The model is inspired by a resummation of multiple scattering in QCD in the eikonal approximation and it is capable to provide more information on the details of the higher-twist corrections and physics insight into their origin. In this approach, the twist 4 corrections to \(F_2 \) and \(F_{\mathrm {L}} \) structure functions have non-trivial \(Q^2\) and x dependence. In addition, we modify the standard form of the DGLAP input for the gluon and sea density, so that they are consistent with general features of parton saturation in QCD at small x. With this model complemented by the NLO or NNLO DGLAP evolution of PDFs we analyse the combined HERA data on the reduced cross sections using the xFitter package [19] with suitable extensions of the code to incorporate the new features of the model. The sensitivity to the higher-twist corrections is enhanced by performing independent fits of the data sets with the momentum transfer constrained by \(Q^2 > Q^2_{\mathrm {min}} \) and varying the limit \(Q^2_{\mathrm {min}} \). When \(Q^2_{\mathrm {min}} \) is larger than 20 GeV\(^2\), the \(\chi ^2/\mathrm {d.o.f.}\) measures for the DGLAP fits at NLO and NNLO accuracy, with and without the higher-twist terms are close to 1.15 and exhibit a nearly flat \(Q^2_{\mathrm {min}} \) dependence. Below \(Q^2_{\mathrm {min}} = 20\,\mathrm {GeV} ^2\), the \(\chi ^2/\mathrm {d.o.f.}\) of the pure DGLAP fits starts growing with the decreasing values of \(Q^2_{\mathrm {min}} \), reaching the \(\chi ^2/\mathrm {d.o.f.}\) \(\simeq 1.46\) (\(\chi ^2/\mathrm {d.o.f.}\) \(\simeq 1.34\)) for the NNLO (NLO) accuracy at \(Q^2_{\mathrm {min}} = 1\,\mathrm {GeV} ^2\). With the higher-twist corrections included and the saturation-improved input parameterisation of the PDFs, the \(\chi ^2/\mathrm {d.o.f.}\) is only mildly increasing when \(Q^2_{\mathrm {min}} \) decreases, and for \(Q^2_{\mathrm {min}} =1\,\mathrm {GeV} ^2\), the \(\chi ^2/\mathrm {d.o.f.}\) reaches 1.189 (\(\chi ^2/\mathrm {d.o.f.}\) \(\simeq 1.212\)) for the NNLO DGLAP + HT (NLO DGLAP + HT) fit. Hence the improvement of the fit quality by adding the higher-twist corrections for \(Q^2_{\mathrm {min}} = 1\,\mathrm {GeV} ^2\) is large and particularly pronounced for the fits using the NNLO DGLAP leading-twist part. Also a good description of the HERA \(F_{\mathrm {L}} \) data [12,13,14] is obtained down to the lowest measured values of \(Q^2\).

The evidence for sizeable contributions of higher-twist terms is further strengthened by an explicit analysis of the twist composition of the structure functions at small x and moderate and low scales. Consistently we find the growing higher-twist effects when x and \(Q^2\) decrease. The relative importance of the higher-twist corrections is found to be larger in the NNLO fit than in the NLO one. In particular, in the NNLO fit at \(Q^2=1.2\,\mathrm {GeV} ^2\), the twist 4 correction to the reduced cross section is found to be larger than the leading-twist contribution for \(x< 2\cdot 10^{-4}\), and the relative higher-twist correction further grows towards small x, and at the lowest available \(x \simeq 2 \cdot 10^{-5}\) it reaches about 200% of the leading-twist term. The higher-twist effects quickly decrease with increasing \(Q^2\) and reach \(\sim 10\%\) level at \(Q^2=6.5\,\mathrm {GeV} ^2\). The higher-twist effects are found to be much stronger in the longitudinal structure function \(F_{\mathrm {L}} \). In particular, the twist 4 contribution to \(F_{\mathrm {L}} \) is larger than the leading-twist contribution for \(Q^2 < 3\) GeV\(^2\) for both the NLO and NNLO DGLAP fits. Of course, also in \(F_{\mathrm {L}} \) the effects of higher-twist corrections decrease quickly with \(Q^2\), but in the NNLO fit, the higher-twist contribution is still visible at 10% level up to a sizeable scale \(Q^2 \simeq 20\,\mathrm {GeV} ^2\).

The inclusion of higher-twist corrections is found to affect significantly the fitted gluon and sea density functions at small \(x<0.01\) and moderate factorisation scales, \(\mu ^2\), while the sensitivity of the valence quark distribution to the higher-twist effects is minor and may be neglected. The largest difference in PDFs coming from the higher-twist effects is found in the gluon PDF – the difference is large and much larger than the corresponding uncertainty at small scales, 1 GeV\(^2< \mu ^2 < 3.5\) GeV\(^2\), then decreasing with \(\mu ^2\) to a few percent level at \(\mu ^2 = 50\) GeV\(^2\). The sea distribution at small x is also affected but it exhibits lower sensitivity to the presence of higher-twist corrections.

In conclusion, we have found a consistent evidence of the sizeable twist 4 corrections to proton structure functions. The evidence comes primarily from the \(\chi ^2\) quality measure of fits to the combined HERA data on the inclusive DIS with the leading-twist component described by the NLO / NNLO DGLAP evolution. This evidence is further strengthened by the strong effects of the higher-twist corrections in the reduced cross section and the structure function \(F_{\mathrm {L}} \) at small x and moderate / low \(Q^2\). The fitted twist 4 contributions have the x-dependence that is consistent with the double exchange of hard gluonic ladder at small x, as expected from the QCD analysis of the evolution of leading quasi-partonic operators [2, 3].

2 The model of higher-twist corrections

The Golec-Biernat–Wüsthoff (GBW) saturation model [17, 18] offers a simple and effective description of DIS, DDIS structure functions down to very low \(Q^2\) at small x, and also of the exclusive vector meson production [20]. In particular, with this model one is able to describe reasonably well even the transition from DIS at large \(Q^2\) to the photoproduction limit. This transition may be viewed as a transition from the twist 2 regime to the region that all twist contributions are relevant. From the point of view of perturbative QCD, the GBW model corresponds to multiple independent high-energy scatterings of photon hadronic fluctuations, that is to the eikonal iteration of a single gluon ladder exchange. In particular, the leading behaviour of twist 4 contributions of the GBW cross section is \(\sim (x^{\lambda }Q^2)^{-2}\) (modulo logarithms) compared to the leading twist cross section \(\sim (x^{\lambda }Q^2)^{-1}\) (modulo logarithms) [15, 16]. Such behaviour of twist 2 and twist 4 amplitudes is in a qualitative agreement with results of the evolution of twist 4 contributions in the Bukhvostov, Frolov, Lipatov and Kuraev framework [2, 3, 21], where dominant contributions at small x are driven by quasi-partonic operators.

From these studies [15, 16] it follows that the twist 4 contributions to the transverse and longitudinal cross sections take the form

$$\begin{aligned} \sigma ^{(\tau =4)}_{\mathrm T}= & {} A\left[ \frac{Q_{\mathrm {sat}} ^2(x)}{Q^2}\right] ^2,\;\;\; \sigma ^{(\tau =4)}_{\mathrm L} = -\frac{4}{3}A\left[ \frac{Q_{\mathrm {sat}} ^2(x)}{Q^2}\right] ^2\nonumber \\&\times \left[ \log \left( \frac{Q^2}{Q_{\mathrm {sat}} ^2(x)}\right) +B\right] , \end{aligned}$$
(1)

where AB are positive constants. The saturation scale depends on x variable as \(Q_{\mathrm {sat}} ^2= Q_0^2 (x_0/x)^{\lambda }\) where \(Q_0^2 = 1\) GeV\(^2\) and \(x_0 = 3.04\cdot 10^{-4}\) (the GBW fit without charm) are model parameters. It is important to notice the opposite signs of the corrections in (1), positive for the transverse and negative for the longitudinal part. Additionally, the twist 4 contribution to the longitudinal cross section is logarithmically enhanced. This structure of the corrections can be also deduced from the general QCD analysis which was discussed in [15]. Following the above considerations we shape a saturation model inspired ansatz for twist 4 corrections in the following form

$$\begin{aligned} \Delta \sigma _{\mathrm {L/T}} ^{(\tau = 4)} = \left[ \frac{Q_{\mathrm {sat}} ^2(x)}{Q^2}\right] ^2 \left[ \tilde{c}_{\mathrm {L/T}}^{\mathrm {(log)}} \log \left( \frac{Q_{\mathrm {sat}} ^2(x)}{Q^2}\right) +\tilde{c}_{\mathrm {L/T}} ^{(0)}\right] , \end{aligned}$$
(2)

in which coefficients \(\tilde{c}_{\mathrm {L/T}} ^{(0)}\) and \(\tilde{c}_{\mathrm {L/T}} ^{\mathrm {(log)}}\) are left as free and independent parameters. In practical implementation of the fits we rewrite the above parameterisation to the following convenient form of twist 4 corrections to the structure functions:

$$\begin{aligned} \Delta F^{(\tau =4)}_{\mathrm {L/T}} = \frac{Q_0 ^2}{Q^2} x^{-2\lambda } \left[ c_{\mathrm {L/T}} ^{\mathrm {(log)}} \left( \log \frac{Q_0^2}{Q^2} + \lambda \log \frac{1}{x}\right) + c_{\mathrm {L/T}} ^{(0)} \right] , \end{aligned}$$
(3)

where \(F_{\mathrm {L/T}} = Q^2\sigma _{\mathrm {L/T}} / 4\pi ^2\alpha _{\mathrm {em}} \).

Equation (2) contains the \(\tilde{c}^{\mathrm {(log)}} _{\mathrm T}\) parameter which determines the magnitude of the logarithmically enhanced term in the transverse cross section. Due to the properties of the transverse photon impact factor, this parameter vanishes at the leading order in the strong coupling constant expansion [15, 16, 22]. Therefore, one expects that this parameter is much smaller than parameter \(\tilde{c}_{\mathrm T} ^{(0)}\), hence it is neglected in our fits—we assume \(\tilde{c}_{\mathrm T}^{\mathrm {(log)}} \equiv c_{\mathrm T}^{\mathrm {(log)}} = 0\).Footnote 2

When one attempts to extend the proton structure function analysis to the region of small \(Q^2\) and very small x it is necessary to consider possible effects of parton rescattering and/or recombination in the dense parton regime. Those effects may lead not only to the higher-twist corrections but also they are expected to influence the form of the input for QCD evolution of the matrix elements in the OPE. In particular, it is natural to require that the input functions for the DGLAP evolution of parton densities are consistent with unitarity constraints relevant for high-energy scattering at very low x. Hence, a precise analysis of the higher-twist effects which pronounce at low \(Q^2\) and small x requires careful treatment of the gluon and sea input distributions at small x.

Currently, one of the most successful tools for an analysis of \(\gamma ^*\) – nucleon (or \(\gamma ^*\) – nucleus) scattering with the unitarity corrections is the Balitsky–Kovchegov (BK) equation [23,24,25]. In constructing the model for initial conditions of the PDFs we take into account the outcome of this equation analysis. The BK equation, that resums multiple scattering effects in the extended generalised leading logarithmic 1 / x approximation [26,27,28] and the large \(N_c\) limit, may be described in a natural way in terms of the colour dipole language [24, 25, 29,30,31]. In this approach the high-energy scattering is described in terms of the imaginary part of the BK dipole forward scattering amplitude, \(N(x,\mathbf {r})\), where \(\mathbf {r}\) is the dipole extension vector in the transverse plane. On the other hand the same dynamics may be covered by the BK equation represented in terms of the unintegrated gluon density [32,33,34,35]. It can be shown that at the leading logarithmic 1 / x accuracy the imaginary part of the BK dipole forward scattering amplitude, \(N(x,\mathbf {r})\), and the BK unintegrated gluon densityFootnote 3 \(\mathcal{F}(x,\mathbf {k}^2)\) (where \(\mathbf {k}\) is the gluon transverse momentum) are in one-to-one correspondence. Hence the unintegrated gluon density at small x can be recovered from \(N(x,\mathbf {r})\),

$$\begin{aligned} \mathcal{F}(x,\mathbf {k}^2) = \frac{N_c R_p^2}{2\alpha _s\pi }\mathbf {k}^2\nabla _{\mathbf {k}}^2 \tilde{N}(x,\mathbf {k}), \end{aligned}$$
(4)

where \(\tilde{N}(x,\mathbf {k}) = \int d^2\mathbf {r} \exp (-i\mathbf {k}\mathbf {r}) \, N(x,\mathbf {r})\) is the Fourier transform of the dipole scattering amplitude, and \(R_p\) is an effective radius of the proton. After employing the leading logarithmic relation of the collinear gluon density \(f_g(x,\mu ^2)\) and the unintegrated gluon density \(\mathcal{F}(x,\mathbf {k}^2)\) one obtains

$$\begin{aligned} xf_g(x,\mu ^2) = \frac{N_c R_p^2}{2\alpha _s\pi }\int ^{\mu ^2}d\mathbf {k}^2\, \mathbf {k}^2\, \nabla _{\mathbf {k}}^2 \tilde{N}(x,\mathbf {k}). \end{aligned}$$
(5)

Explicit numerical solutions of the BK equation [35, 36] show that at small x and for \(\mathbf {k}\) below the saturation scale, \(Q_{\mathrm {sat}} (x)\), generated by the BK evolution, the solution of the BK equation tends to \(\mathcal{F}(x,\mathbf {k}^2) \sim R_p ^2 \mathbf {k}^2 / Q_{\mathrm {sat}} ^2(x) \sim x^{\lambda }\). Exactly the same asymptotic behaviour of the unintegrated gluon density is found in the GBW model where one approximates the scattering amplitude with the saturation formula \(N(x,\mathbf {r})\sim 1-\exp (-\mathbf {r}^2 Q_{\mathrm {sat}} ^2(x))\). In fact, the small \(\mathbf {k}\) asymptotics of the saturated unintegrated gluon density may be traced back to the unitarity constraint on the dipole cross section in the position space, \(N(x,\mathbf {r}) \le 1\). For such a form of \(\mathcal{F}(x,\mathbf {k}^2)\) at small \(\mathbf {k}\), it is straightforward to show that in the limit \(\mu ^2\ll Q_{\mathrm {sat}} ^2(x)\) the gluon density small x asymptotics is

$$\begin{aligned} xf_g(x,\mu ^2) \sim x^\lambda , \end{aligned}$$
(6)

with \(\lambda > 0\). Therefore the gluon density at a low scale is expected to decrease toward zero with decreasing x. Following this argument our parameterisation of the input for gluon distribution fulfils the condition \(xf_g(x,\mu _0^2) \sim x^{B_g}\), at \(x\rightarrow 0\), where \(B_g\) is a positive fit parameter.

Furthermore in our model construction we consider the input for the sea distribution at a small scale and small x. Assuming that the sea quarks at small x are generated predominantly from the gluon DGLAP splitting to quark/antiquark, one may approximate the sea singlet distribution by the LO DGLAP expression describing the feed-down from gluons,

$$\begin{aligned} f_{\mathrm {sea}}(x,\mu _0^2) \simeq \int ^{\mu _0 ^2} \frac{d\mu ^2}{\mu ^2} \,\frac{\alpha _{\mathrm {s}} (\mu ^2)}{\pi } \int _x ^1 \frac{dz}{z} P_{qg}(z)f_g(x/z,\mu ^2).\nonumber \\ \end{aligned}$$
(7)

Obviously, since the scales probed are low and the impact of non-perturbative effects unknown, the above expression should be treated only as a QCD hint on the actual shape of the sea distribution at low scales. An explicit evaluation of the model expression in Eq. (7) leads to the sea-quark distribution asymptotic behaviour at small x following the gluon asymptotics, \(xf_{\mathrm {sea}}(x,\mu ^2) \sim x^\lambda \). Hence in the fits that include parton saturation effects in the input distributions, we impose the same asymptotic behaviour of the sea and gluon distributions at the initial scale, \(xf_{\mathrm {sea}}(x,\mu _0^2) \sim xf_g(x,\mu _0^2) \sim x^{B_g}\).

3 DGLAP framework

The leading twist 2 contributions to the \(F_2\) and \(F_{\mathrm {L}}\) structure functions are given in terms of PDFs, \(f_k (x,\mu ^2)\), determined within the DGLAP framework. Our approach follows closely the scheme adopted in the HERAPDF2.0  [1] study, in order to clearly see the effects of higher-twist contributions. The PDFs are parameterised at the starting scale \(\mu _{\mathrm {F}0}\) and then determined at all scales \(\mu _{\mathrm {F}} \) by solving the DGLAP evolution equations. The factorisation and renormalisation scales are chosen to be equal and in the following we denote them by \(\mu \), while the evolution starting scale is denoted by \(\mu _0\).

3.1 Scheme description

The light quarks, uds, are taken to be massless. The heavy quarks, cbt, are generated radiatively and appear only at transition scales, taken to be equal to the corresponding quark masses, \(m_h\). The PDFs of heavy quarks start from 0 once \(\mu \) goes above \(m_h\). In other words, there are no intrinsic heavy flavours. For a simple realisation of this scenario we take the starting scale, \(\mu _0\), below the charm mass.

The coefficient functions of heavy quarks are calculated in the Thorne–Roberts general-mass variable-flavour-number scheme called RT OPT [37,38,39]. This scheme is adopted in accordance with the HERAPDF2.0 fit [1].

3.2 The input parameterisation

The distributions parameterised at the starting scale include the gluon g, d and u valence quarks, and up- and down-type sea quarks, \(\bar{U} = \bar{u}, \bar{D} = \bar{d} + \bar{s}\).

The generic form of the input parton k distribution, \(xf_k(x) = xf_k(x,\mu _0^2)\) is assumed to be,

$$\begin{aligned} x f_k(x) = A_k x^{B_k} (1-x)^{C_k} \left( 1 + {D_k} x + {E_k} x^2 \right) \,, \end{aligned}$$
(8)

for \(k = g, u_{\mathrm {val}}, d_{\mathrm {val}}, \bar{U}, \bar{D}\).

The relative \(\bar{s}\) contribution to the down-type sea at the starting scale is assumed to be a fixed (x-independent) fraction \(\beta \) of \(\bar{D}\), i.e. \(f_{\bar{s}} = \beta f_{\bar{D}}\).

Thus, in general, we have 26 fit parameters to start with. Several assumptions are made in order to make this number smaller.

First, the quark-counting and momentum sum rules are used to fix the valence and gluon PDFs normalisation parameters, \(A_{u_{\mathrm {val}}}, A_{d_{\mathrm {val}}}\) and \(A_g\). Next, in order to ensure a uniform sea behaviour at low x, (\(\bar{u} \simeq \bar{d}\)), the following constraints are imposed: \(A_{\bar{U}} = (1-\beta ) A_{\bar{D}}\) and \(B_{\bar{U}} = B_{\bar{D}} \equiv B_\mathrm {sea}\). Finally, the strange sea fraction, \(\beta \), is set to 0.4. Based on the HERAPDF2.0 experience and our numerous fit results we set to zero all \(D_k\) and \(E_k\), except of \(E_{u_{\mathrm {val}}}\) which is left free. With this setup we have 11 free parameters of the input PDFs to be compared with 14 free parameters of the HERAPDF2.0 fit. With this restricted parameterisation at the leading-twist level, we gain on the stability of the fits with the saturation and higher-twist effects included.

The first step towards improved description at low x and low \(\mu ^2\) is a modification of the basic parameterisations (8), aimed at improving description of parton saturation effects in the gluon and sea input distributions in the small x domain.

In the HERAPDF2.0 fits the input gluon parameterisation is augmented by a negative term \(-A'_g x^{B'_g} (1-x)^{25}\) [1, 40]. In the current study we do not include this subtraction. Instead, we consider enhancing the basic parameterisation (8) with saturation-inspired, damping factors for the parton k:

$$\begin{aligned} \left[ 1+\displaystyle \left( \frac{{\hat{x}} _k}{x}\right) ^{{d} _k}\right] ^{-1} , \end{aligned}$$
(9)

applied to the gluon and sea components, with \({d} _k+ B_k > 0\). The application of such factors ensures a smooth decrease to zero of \(xf_g(x,\mu _0 ^2)\) and \(xf_\mathrm {sea}(x,\mu _0 ^2)\) when \(x \rightarrow 0\), consistent with the known results from analyses of parton saturation at small x (see Sect. 2 for a more detailed discussion). The damping factors describing the parton saturation effects turn on for x below a specific scale \({\hat{x}} _k\), which can be therefore interpreted as the saturation x at \(Q_0\): \(Q _{\mathrm {sat}}({\hat{x}} _k) = Q_0\). In general, \({\hat{x}} _k\) and the saturation powers \({d} _g, {d} _\mathrm {sea}\) are arbitrary parameters, with the already mentioned constraint \({d} _k + B_k > 0\).

Hence we consider the following input parameterisations of the gluon and sea PDFs:

$$\begin{aligned} xf_k(x,Q_0^2) = A_k x^{B_k} (1-x)^{C_k} \left[ 1+\displaystyle \left( \frac{\hat{x}_k}{x}\right) ^{d_k}\right] ^{-1}, \end{aligned}$$
(10)

where \(k=g,\bar{U}, \bar{D}\). With these parameterisations in the \(x \rightarrow 0\) limit the input PDFs scale as

$$\begin{aligned} xf_k(x,Q_0^2) \simeq A_k x^{B_k+d_k}. \end{aligned}$$
(11)

After a preliminary analysis of the data we found that for the gluon input distribution the saturation damping factor is irrelevant, as the fits yielded \(B_g \sim 1\) which already guarantees power-like approach to zero of \(xf_g(x,Q_0^2)\) for \(x\rightarrow 0\). Thus we retain the damping factor for the quark sea only, with \(\hat{x}_D = \hat{x}_U \equiv \hat{x}\) being a free fit parameter. For the saturation damping exponent for the quark sea, \(d_{\mathrm {sea}}\), we impose an additional constraint following from the assumption that the sea input distribution at small x follows the power-like behaviour of the gluon input distribution, see Sect. 2. As a result, a relation of the exponents is obtained: \(B_{\mathrm {sea}} + d_{\mathrm {sea}} = B_g\), resulting in \(d_{\mathrm {sea}} = B_g - B_{\mathrm {sea}}\). In fact, we have checked that leaving \(d_{\mathrm {sea}}\) as a free parameter does not improve the fit quality (the difference in the \(\chi ^2\)/d.o.f. is smaller than 0.002). Hence the phenomenological inclusion of the saturation effects in the input of the PDFs is reduced to taking the positive definite gluon input and imposing the sea input damping at small x. This parameterisation has 12 free parameters.

4 Results

In the current analysis we use the combined HERA data on neutral and charged current \(e^+ p\) and \(e^- p\) inclusive cross sections, measured at centre-of-mass energies ranging from 225 to 318 GeV [1].

In the fits we use only data points for which \(Q^2 > 1\,\mathrm {GeV} ^2\). Their kinematic range spans four orders of magnitude in x and \(Q^2\) with lower bounds at \(x = 1.76\cdot 10^{-5}\) and \(Q^2 = 1.2\,\mathrm {GeV} ^2\). The inelasticity y values are between 0.001 and 0.95. The whole data set comprises 1213 data points. A subset of this data set with \(Q^2 \ge 3.5\) GeV\(^2\) was used to extract the HERAPDF2.0 PDFs [1].

The measured cross sections are presentedFootnote 4 in terms of the reduced cross section:

$$\begin{aligned} \sigma _{\mathrm {red}} (x,Q^2,y)= & {} F_2(x,Q^2) - \frac{y^2}{1+(1-y)^2} \,F_{\mathrm {L}} (x,Q^2)\nonumber \\\equiv & {} F_{\mathrm {T}} (x,Q^2) + \frac{2(1-y)}{1+(1-y)^2} \,F_{\mathrm {L}} (x,Q^2) \,,\nonumber \\ \end{aligned}$$
(12)

where \(F_2 (x,Q^2) = F_{\mathrm {T}} (x,Q^2) + F_{\mathrm {L}} (x,Q^2)\).

In order to fit the data down to \(Q^2 = 1.2\,\mathrm {GeV} ^2\), we take the starting scale for the DGLAP evolution \(\mu _0 ^2 = 1\,\mathrm {GeV} ^2\). For the sake of comparison to the HERAPDF2.0 fits we present also some results for the PDFs parameterised at \(\mu _0 ^2 = 1.9\,\mathrm {GeV} ^2\).

The fits are performed using the xFitter package [19] supplemented by us with necessary code extensions including input parameterisation with saturation damping effects, and the higher-twist contributions to \(F_{\mathrm {T}}\) and \(F_{\mathrm {L}}\), as given by Eq. 3. The DGLAP evolution is performed using the QCDNUM program [41].

In the analysis below we use the following fit names:

  • HTS — Higher Twist with Saturation, \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\), \(Q^2_{\mathrm {min}} = 1.2\,\mathrm {GeV} ^2\) — Our main fit. It corresponds to the standard DGLAP evolution of the leading twist terms, in which the saturation damping effects are assumed in the input PDFs, complemented by the additive twist 4 corrections. Note that when performing the \(\chi ^2\) scan (see Sect. 4.1) we use the same name for this model fitted to data with a variable lower cutoff \(Q^2_{\mathrm {min}} \) on \(Q^2\);

  • HT – Higher Twist without Saturation, \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\), \(Q^2_{\mathrm {min}} = 1.2\,\mathrm {GeV} ^2\) – Like HTS but without the saturation damping effects in the initial PDFs;

  • LTS – Leading Twist with Saturation: the pure DGLAP fit with the input form like the HTS fit. The fits are performed with input scales \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\) and \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\);

  • LT – Leading Twist without Saturation: the pure DGLAP fit with the input form like the HT fit. The fits are performed with input scales \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\) and \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\);

  • LT-STD – “Standard” Leading Twist, the fit LTS with \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\), \(Q^2_{\mathrm {min}} = 3.5\,\mathrm {GeV} ^2\) – like LTS but with the initial scale and data selection as in HERAPDF2.0.

Fig. 1
figure 1

The \(\chi ^2/\mathrm {d.o.f.}\) of twist 2, \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\) fits to the data with the \(Q^2 \ge Q^2_{\mathrm {min}} \) condition

Fig. 2
figure 2

The \(\chi ^2/\mathrm {d.o.f.}\) of various \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\) fits to the data with the \(Q^2 \ge Q^2_{\mathrm {min}} \) condition

Fig. 3
figure 3

The \(\chi ^2/\mathrm {d.o.f.}\) for the LT-STD and HTS fits vs. lower \(Q^2\) cut, \(Q_\mathrm {cut}^2\), of the selected data subsamples

The analysis of the data is carried out in the following stages. First we check the quality of the pure DGLAP fits (hence the input parameterisations) LTS and LT with \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\) by performing the scans of the \(\chi ^2\)/d.o.f. for the data set with \(Q^2 > Q^2_{\mathrm {min}}\) as a function of \(Q^2_{\mathrm {min}}\). The results are shown in Fig. 1 in comparison to the corresponding values of \(\chi ^2\)/d.o.f. obtained from the reference fit HERAPDF2.0. This comparison shows that the modification of the input parameterisation that we performed does not lead to a deterioriation of the fit quality of the LTS fit at NNLO for lower values of \(Q^2_{\mathrm {min}}\) which are most relevant for the higher-twist analysis. For the NLO fits a marginal deterioration of the fit quality is found w.r.t. the HERAPDF2.0 fit – the increase of \(\chi ^2\)/d.o.f. is smaller than 0.01. Note however that the saturation sea damping effects in the input parameterisation are necessary to match the quality of the HERAPDF2.0 fit at the NNLO level. At the NLO level the inclusion of sea damping effects does not change the results of the fit (\({\hat{x}} \rightarrow 0\)), so the resuls of the LT and LTS fits are identical at NLO. We conclude that the form of the input of the leading-twist fit leads to results which agree very well with the HERAPDF2.0 fit and the fit LTS may be used as a reference for the analysis of the higher-twist effects.

Next, we perform the scan of the \(\chi ^2\)/d.o.f. for the data set with \(Q^2 > Q^2_{\mathrm {min}}\) as a function of \(Q^2_{\mathrm {min}}\) for all chosen setups with the lower input scale \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\), that is for NLO and NNLO DGLAP evolution of the leading twist, with and without the higher-twist corrections, with and without saturation damping effects in the input for the PDFs. It is found that at lower values of \(Q^2_{\mathrm {min}}\) the saturation damping modification of the input distributions becomes important only upon the inclusion of the twist 4 contributions. The features of our best fits with the higher-twist corrections are then described in detail. Furthermore we explicitly study the effects of the higher twists for \(\sigma _{\mathrm {red}}\) and \(F_{\mathrm {L}} \), as given by the best fits. Finally we show the impact of the inclusion of the higher-twist and saturation effects on the obtained PDFs.

4.1 The \(\chi ^2\) scans

In Fig. 2 we show the \(\chi ^2\)/d.o.f. for the fits to the reduced cross sections as a function of the lower cutoff \(Q^2_{\mathrm {min}}\) imposed on the photon virtuality \(Q^2\) for the data sample taken into account in the fits. The initial scale of the DGLAP evolution is set \(\mu _0 ^2 = 1\) GeV\(^2\). The higher-twist parameterisation provides the best description of the data from \(Q^2_{\mathrm {min}}\simeq 16\,\mathrm {GeV} ^2\) below. The question of the parton saturation effects in the input sea distribution is more subtle. From the \(\chi ^2\) scan it follows that within the NLO approximation this is not an important effect. However, in the fits assuming the NNLO DGLAP evolution of the leading twist contribution, both the higher-twist corrections and the parton saturation effects in the input are key ingredients for the best description of the data. The inclusion of both effects improves the data description significantly for \(Q^2_{\mathrm {min}} < 20\,\mathrm {GeV} ^2\).

In Fig. 3 we show a comparison of the \(\chi ^2/\)d.o.f. between the data and two fits, standard fit LT-STD and the best fit with higher-twist corrections HTS, as a function of the lower cutoff \(Q^2_{\mathrm {cut}}\) imposed on the data. Here \(Q^2_{\mathrm {min}}\) is kept fixed to 3.5 GeV\(^2\) for the LT-STD fit and to 1.2 GeV\(^2\) for the HTS fit. A systematic improvement of the data description is clearly seen for the HTS fit with respect to the LT-STD fit for \(Q^2_{\mathrm {cut}} < 5\) GeV\(^2\) at NLO and for \(Q^2_{\mathrm {cut}} < 20\) GeV\(^2\) at NNLO.

A short comparison of our reference fit LT-STD to the HERAPDF2.0 fit is in order. The resulting values of the \(\chi ^2/\)d.o.f. at \(Q^2_{\mathrm {min}} = 3.5\,\mathrm {GeV} ^2\) for the LT-STD fit are 1.212 and 1.201 at NLO and NNLO, correspondingly, while for the HERAPDF2.0 fit they are 1.200 and 1.205 at NLO and NNLO, correspondingly. Hence at NLO the HERAPDF2.0 fit is slightly better than the LT-STD fit, and at NNLO the LT-STD has marginally lower \(\chi ^2/\)d.o.f., and in summary both the parameterisations may be considered as equally good. Recall also, that the input PDFs parameterisation of the LT-STD fit has 12 free parameters, to be compared with 14 parameters of the HERAPDF2.0 fit hence by choosing LT-STD we gain on efficiency.

4.2 Features of the best fits

The parameters of the model obtained from the best fits (HTS) of all the data with \(Q^2 >1\,\mathrm {GeV} ^2\) with the twist 4 corrections included and the sea input with saturation damping, are given in Table 1. The results are displayed both for NLO and NNLO DGLAP evolution of the leading twist terms.

Table 1 Parameters and the \(\chi ^2/\mathrm {d.o.f.}\) values for the HTS NLO and NNLO fits. Note that \(A_g\), \(A_{u-}\) and \(A_{d-}\) are fixed by the sum rules

It is interesting to analyse the obtained parameters describing the higher-twist corrections and compare the results to the expectations from the GBW model. First of all, the value obtained from the fit of the saturation scale exponent is \(\lambda = 0.350 \pm 0.008\) for the NLO HTS fit and \(\lambda = 0.260 \pm 0.010\) in the NNLO HTS fit. These values are rather close to the saturation exponents of the GBW model \(\lambda = 0.288\) (without charm) and \(\lambda = 0.277\) (with charm). The obtained values of \(\lambda \sim 0.26 - 0.35\) are also consistent with the picture of double hard pomeron exchange as the leading contribution to twist 4 corrections at small x. Also the value of the saturation x parameter at \(Q_0 =1\,\mathrm {GeV} \) obtained from the NNLO fit, \({\hat{x}} = (2.0 \pm 0.4) \cdot 10^{-4}\) compares well to the corresponding GBW saturation x parameters, \(x_0 = 3.04 \cdot 10^{-4}\) (without charm) and \(x_0 = 0.41 \cdot 10^{-4}\) with charm. In the NLO fit, however, the obtained \({\hat{x}} = 0.05 \pm 1.0 \cdot 10^{-5}\) is consistent with zero. Recall that in our approach parameter \({\hat{x}} \) is the characteristic x for emergence of the saturation damping effects in the sea distribution. Hence the conclusion implied by the \(\chi ^2\) scan is confirmed: that the sea saturation damping is important for the NNLO DGLAP fit with twist 4 corrections, while the DGLAP NLO fit with higher twists does not require the saturation input damping in the x range of the fitted data.

Interestingly, the pattern of the twist 4 multiplicative coefficients is found to differ significantly from the predictions of the GBW model. At small x the model yields a sizeable negative twist 4 correction to \(F_{\mathrm {L}}\) and a positive correction to \(F_{\mathrm {T}} \). The performed fits exhibit a different pattern – both at NLO and NNLO we find a small positive twist 4 correction to \(F_{\mathrm {T}} \) and a larger positive correction to \(F_{\mathrm {L}} \). The difference in the sign of the higher twist correction to \(F_{\mathrm {L}} \) at small x between the GBW model prediction and the fit results, occurs both in the leading logarithmic term \(\propto c_{\mathrm L} ^{\mathrm {(log)}}\log (Q^2 / Q_{\mathrm {sat}} ^2 (x))\) for \(Q > Q_{\mathrm {sat}} (x)\) and in the constant term \(c_{\mathrm L}^{(0)}\). This result indicates that the leading twist 4 coefficient function in the longitudinal virtual photon inelastic scattering off the proton is not of the type of an eikonal pomeron exchange.

4.3 Comparison with the data for \(\sigma _{\mathrm {red}}\) and \(F_{\mathrm {L}}\)

In the approach presented here the relative importance of the higher-twist corrections to the proton structure functions may be estimated for different x and \(Q^2\). Such an estimate provides a measure of both the expected accuracy of the leading twist description and the sensitivity to the higher-twist contribution. It also permits to determine the kinematic region in which the higher-twist corrections are most important and, in this way, the evidence of the higher-twist contribution to the structure functions is strengthened. Indeed, we find that the most important effect in the structure function deviations from the DGLAP leading twist description comes from the region of small x and \(Q^2\), where the higher-twist effects are strongest.

Fig. 4
figure 4

The combined low-\(Q^2\) HERA inclusive NC \(e^+p\) reduced cross sections compared to the NLO HTS fits (full red line). Also shown twist 2 (dashed green line) and twist 4 (dotted purple line) contributions

In Fig. 4 the data for the reduced cross sections for \(1.2\;\mathrm {GeV}^2< Q^2 < 15\;\mathrm {GeV}^2\) are compared to the HTS fit results. In order to illustrate the twist content of the proton structure function we also show the twist decomposition of the reduced cross section at the NLO level. The twist 4 contribution makes up to 75\(\%\) of the twist 2 contribution at \(Q^2=1.2\,\mathrm {GeV} ^2\) and \(x=3\cdot 10^{-5}\), where the higher-twist effects are largest. In this kinematic region the twist 4 effects are estimated to provide about 40\(\%\) of the reduced cross section. As expected, the higher-twist contribution is suppressed with increasing x, and at \(x=3\cdot 10^{-4}\) the twist 4 correction is reduced to about 20\(\%\) of the total value. The relative importance of the higher-twist correction decreases also with growing \(Q^2\). Indeed, at \(x=3\cdot 10^{-5}\) the relative twist 4 contribution is about 30% at \(Q^2=2\,\mathrm {GeV} ^2\) and about 20\(\%\) at \(Q^2=3.5\,\mathrm {GeV} ^2\). In the HTS fit at NLO, the relative higher-twist effect is below 10% at \(Q^2=6.5\,\mathrm {GeV} ^2\).

The results of a similar investigation at NNLO are displayed in Fig. 5 for \(Q^2\) up to 15 GeV\(^2\). In the NNLO DGLAP fits the higher-twist effects are found to be significantly stronger than in the NLO fits over the whole probed range of \(Q^2\). In particular, in the NNLO fit at \(Q^2=1.2\,\mathrm {GeV} ^2\), the twist 4 correction is found to be larger than the leading-twist contribution for \(x< 2\cdot 10^{-4}\), and the relative correction further grows towards small x, to reach about 200% of the leading-twist term at the lowest available \(x \simeq 2 \cdot 10^{-5}\). At \(Q^2=2\,\mathrm {GeV} ^2\) and the smallest x, the twist 4 correction reaches about 80\(\%\) of the twist 2 contribution, and at \(Q^2=3.5\,\mathrm {GeV} ^2\) the higher-twist term is still around 25\(\%\) of the leading-twist term. Finally, the higher-twist correction reaches \(\sim 10\%\) level at \(Q^2=6.5\,\mathrm {GeV} ^2\) and quickly decreases for larger \(Q^2\).

The characteristic behaviour of the data at moderate \(Q^2\) is a turn-over at small x. This feature is not reproduced by the DGLAP fits without higher-twist corrections [1] and the inclusion of higher-twist effects is necessary to provide a good description of this behaviour [10, 11]. Hence the turn-over may be considered to be a signature of the higher-twist contributions. The HTS fits reproduce well this shape both at NLO and NNLO.

In the existing analyses of the combined HERA data with higher-twist corrections [11] a satisfactory description of the \(F_{\mathrm {L}} \) data at smaller \(Q^2\) has not been achieved within the NNLO framework [11]. The predictions for \(F_{\mathrm {L}} \) obtained in our approach are shown in Fig. 6 in comparison to the experimental data from H1 [13]. The \(F_{\mathrm {L}} \) data are well described down to \(Q^2 = 1\) GeV\(^2\). Note that the plotted \(F_{\mathrm {L}} \) data were not directly fitted, the \(F_{\mathrm {L}} \) contribution was treated in the fits only as a part of the reduced cross section \(\sigma _{\mathrm {red}}\). The higher-twist contributions are found to be important in \(F_{\mathrm {L}} \) at small and moderate \(Q^2\). In particular, the twist 4 term dominates for \(Q^2 < 5\) GeV\(^2\) (\(Q^2 < 6\) GeV\(^2\)) for the NLO fit (NNLO fit). Remarkably, in the NNLO fit, the higher-twist contribution is still visible at 10% level up to a sizeable scale of \(Q^2 \simeq 20\,\mathrm {GeV} ^2\). This shows that the longitudinal structure function is particularly sensitive to the higher-twist effects and it may be used as their effective probe.

Fig. 5
figure 5

The combined low-\(Q^2\) HERA inclusive NC \(e^+p\) reduced cross sections compared to the NNLO HTS fits (full red line). Also shown twist 2 (dashed green line) and twist 4 (dotted purple line) contributions

Fig. 6
figure 6

The predictions for \(F_{\mathrm {L}}\) from the HTS fit compared to the H1 data [13]. Also shown are twist 2 (dashed green line) and twist 4 (dotted purple line) contributions

4.4 Impact of the higher-twist effects on the PDFs

One of the key goals of the present analysis is to understand the impact of the higher-twist corrections on the parton density functions. It is expected that inclusion of the higher-twist corrections alters the resulting PDFs, hence affecting the predictions in which the PDFs are used. Below we present the PDFs obtained from the HTS fits (including the higher-twist effects) compared to the PDFs obtained within the standard framework, which we denote as the LT-STD fit. The determined PDFs are presented with the corresponding experimental uncertainties. The included higher-twist corrections are larger in the small x region, so they affect mostly the gluon and sea PDFs at small x. Hence, in the figures we show only these parton distributions. We have checked that the effect of the higher-twist corrections on the valence quark distribution is much smaller than for the gluon and sea PDFs.

In Fig. 7 we compare the gluon and sea PDFs at small scales, \(\mu ^2 = 1.2\), 2.0 and 3.5 GeV\(^2\), with (HTS) and without (LT-STD) the higher-twist corrections, at the NLO and NNLO accuracy. The input condition for the HTS fit is assumed at \(\mu _0^2 = 1\) GeV\(^2\) and it includes the saturation damping effect in the gluon and sea PDFs. The standard fit (LT-STD) starting scale is \(\mu ^2_0 = 1.9\,\mathrm {GeV} ^2\), so for \(\mu ^2 = 1.2\) GeV\(^2\) the LT-STD PDFs should be treated as extrapolations only. The dotted vertical lines in the plots mark the minimal kinematically allowed value of x in HERA measurements. Hence the PDFs values to the left of this line are also extrapolations from the region of available data towards smaller x.

Fig. 7
figure 7

Comparison of the HTS and LT-STD PDFs at \(\mu ^2 = 1.2\,\mathrm {GeV} ^2\) (top), \(\mu ^2 = 2.0\,\mathrm {GeV} ^2\) (middle) and \(\mu ^2 = 3.5\,\mathrm {GeV} ^2\) (bottom). For \(\mu ^2 = 1.2\,\mathrm {GeV} ^2\) the LT-STD PDFs are extrapolated below the parameterisation range. The experimental uncertainties are shown

Fig. 8
figure 8

Comparison of the HTS and LT-STD PDFs at \(\mu ^2 = 10\,\mathrm {GeV} ^2\) (top), \(\mu ^2 = 20\,\mathrm {GeV} ^2\) (middle) and \(\mu ^2 = 50\,\mathrm {GeV} ^2\) (bottom). The experimental uncertainties are shown

Fig. 9
figure 9

The PDFs at \(\mu ^2 = 10\,\mathrm {GeV} ^2\) obtained from the HTS fit. The experimental uncertainties are shown

It is clear from Fig. 7 that inclusion of the higher-twist and saturation effect leads to sizeable changes of the gluon and sea PDFs at small \(x <0.01\) and small scales \(\mu ^2 < 3.5\) GeV\(^2\) both at the NLO and NNLO accuracy. The effects found are larger for the NLO fits, where e.g. for \(\mu ^2 = 3.5\) GeV\(^2\) and \(x = 3\cdot 10^{-5}\) the HTS gluon is larger than the LT-STD gluon by about 60% and the HTS sea PDF is smaller by more than 10% than the LT-STD one. In the NNLO fits the differences between the gluon PDFs with and without the higher-twist effects tend to be smaller than in NLO fits, while the effect is larger in the sea quarks at NNLO. In general, at the smaller scales, the inclusion of the higher-twist effects leads to a larger gluon PDF and the reduced sea PDF. The differences found are not only in the shape and values of the PDFs but also in the relative behaviour of the gluon and sea distributions. In the standard approach those two are decoupled at the input scale whereas in the HTS approach the sea distribution at small x follows the gluon distribution already at the input scale.

In a similar way, Fig. 8 displays the impact of higher-twist effects on the PDFs at larger scales, \(\mu ^2 = 10\), 20 and 50 GeV\(^2\). The higher-twist effects are still significant in the gluon distribution at small x. As for the smaller scales, the inclusion of the higher-twist effects leads to a larger gluon PDF. The change of the gluon PDF at the small x values corresponding to the kinematic lower limit at HERA is greater than 20% for \(\mu ^2 = 10\,\mathrm {GeV} ^2\), about 10% for \(\mu ^2 = 20\,\mathrm {GeV} ^2\), and slightly below 10% for \(\mu ^2 = 50\) GeV\(^2\). At the larger scales the sea distribution is not significantly affected by the higher-twist corrections. Note finally that the higher-twist corrections lead to larger changes in the PDFs for the NLO DGLAP framework than for the NNLO DGLAP one.

Figure 9 shows the independent PDFs at \(\mu ^2 = 10\) GeV\(^2\) obtained from our best fits with higher-twist corrections and the parton saturation effects. The valence up and down quark, the sea and the gluon PDFs are shown with their experimental uncertainties at the NLO and NNLO accuracy. We have checked that the obtained leading twist LTS PDFs at \(Q^2 = 10\) GeV\(^2\) agree with standard PDFs in the range of x probed at HERA.

The obtained PDFs were compared in more detail to the widely used PDFs, e.g. to the MMHT PDFs [42] and to the results of the HERAPDF2.0 fit [1]. In the latter case also a direct comparison is possible of the HHT PDFs obtained from a fit with the higher-twist effects included [11]. A natural choice of the scale reference point for which all these PDFs are explicitly available is \(\mu ^2 = 10\) GeV\(^2\), and the results are displayed in Fig. 8 (the top raw) and in Fig. 9. We find that for the pure DGLAP fits: MMHT, HERAPDF2.0 and LTS, all the PDFs are consistent within errors down to \(x = 10^{-3}\). For \(x < 10^{-3}\) the NNLO LTS gluon distribution function is slightly larger than in the NNLO MMHT and NNLO HERAPDF2.0 fits – the maximal effect is found for the smallest \(x \simeq 10^{-4}\), where the central value of the LTS gluon distribution is larger by about 15% than the MMHT and HERAPDF2.0 fits. This difference is, however, almost saturated by the statistical error bands of the distributions. The sea and valence distributions are consistent for these partons sets over the whole range of x.

When comparing the HTS PDFs and the HHT PDFs [11] we find a moderate difference in the impact of the higher-twist effects on the gluon PDF at smaller x, and no significant differences in the valence and sea distributions. At \(\mu ^2 = 10\) GeV\(^2\) the NNLO HTS gluon is significantly larger than the NNLO LTS gluon, and the largest difference of about 20% occurs at the smallest accessible value of \(x \simeq 10^{-4}\). For the HHT gluon distribution function it is not the case, the HHT and HERAPDF2.0 gluons are overlapping over the whole range of x at this scale [11]. This difference between the results of different fits is not very pronounced, and it probably originates mostly from differences in the model of the higher-twist corrections applied in the considered analyses. Also a significant difference in the model of the input in the present analysis – the sea damping correction – affects in part the obtained HTS PDFs. A more detailed and wider discussion of the similarities and differences between the approaches of Refs. [10, 11] and the present study is performed in Sect. 5.

5 Discussion

Let us recall the main results of this paper that substantiate the evidence of the significant higher-twist contributions in the inclusive DIS at HERA at small x and \(Q^2\). The strongest point is the comparison of the fits to the combined HERA data on the reduced cross sections based on the DGLAP evolution with and without twist 4 corrections. For the data sample analysed, with \(Q^2>1\) GeV\(^2\), the inclusion of the twist 4 terms improves the \(\chi ^2/ \)d.o.f. of the NLO fit from about 1.34 (for the pure DGLAP) to about 1.21. With about 1200 degrees of freedom of the fit, the statistical significance of the \(\chi ^2\) change corresponds to an improvement of the p-value by more than seven orders of magnitude. For the NNLO DGLAP fit we find an improvement of the \(\chi ^2/ \)d.o.f. from about 1.46 to less than 1.19, so the statistical significance of the improvement is greater by many orders of magnitude that in the NLO DGLAP fits.

On the other hand, the values of the \(\chi ^2/ \)d.o.f. about 1.19 found in the DGLAP fits with twist 4 corrections are still uncomfortably larger than 1.0. Here however, we may take as the reference level the value \(\chi ^2/ \)d.o.f. = 1.15 that is obtained in all the fits for \(Q^2_{\mathrm {min}} >10\) GeV\(^2\), where the higher-twist corrections should be negligible. Given the crudeness of the higher-twist model applied, we find the change of the \(\chi ^2/ \)d.o.f. from the reference level 1.15 to about 1.19 in the best NNLO fit with higher twists to be sufficiently small to accept the model.

An issue that should be raised is a possible alternative explanation of the data by a pure DGLAP fit with a more flexible input parameterisation. In order to constrain this hypothetic scenario the fit \(\chi ^2/ \)d.o.f. scans may be compared for three signficantly different input parameterisations: the MMHT fit [10], the HERAPDF2.0 fit [11], and the LTS parameterisations proposed in this study. With the same input scale \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\) at lower \(Q^2_{\mathrm {min}}\), the obtained values of \(\chi ^2/ \)d.o.f. from the LTS and HERAPDF2.0 fits differ only marginally: by less than 0.01 both at NLO and at NNLO. The MMHT fit restricted to HERA data with \(Q^2_{\mathrm {min}}\) yields \(\chi ^2/ \)d.o.f. smaller than the LTS fit by up to 0.05 [10] but this is result obtained with a much greater number of free parameters than in the HERAPDF2.0 and LTS fits. The rather close values of \(\chi ^2/ \)d.o.f. resulting from three different input models suggest that the LTS fit is not overconstrained. In particular, when evaluating the model quality it is necessary to take into account also the number of free parameters needed to improve the \(\chi ^2\) of the fit, and here the LTS and HTS fits are particularly efficient. Moreover, the contribution of the higher twists found is characterised by a strong \(Q^2\) dependence which cannot be mimicked by the features of the input parameterisations that are given at a fixed scale \(\mu _0^2\) if the data are available over sufficiently large span of \(Q^2\) in the kinematic range where the higher-twist effects are significant. In order to enhance the sensitivity to the \(Q^2\) dependence we included in the analysis tha data down to \(Q^2 =1\) GeV\(^2\). Furthermore the explicit analysis of the \(Q^2\) and x-dependencies of the higher-twist terms leading to the improvement of the fit quality shows full consistency with the assumption of the leading twist 4 exchange as the contribution necessary to well describe the data. In particular the obtained twist 4 correction is characterised by a leading \(x^{-2\lambda }\) behaviour with \(\lambda = 0.26\) for the NLO fit and \(\lambda = 0.35\) for the NNLO fit – the values strongly supporting the interpretation of the twist 4 terms as coming from the double hard gluon ladder exchange, as expected from the QCD analyses of the twist 4 evolution at small x.

The higher twist interpretation advocated in this paper should be also confronted with recent interesting results obtained for the vector meson production [43], where power corrections to the coefficient functions in high energy scattering were derived, that do not come from the leading quasi-partonic 4-gluon twist 4 exchange.Footnote 5 Although we agree that such corrections should also contribute to the DIS cross sections, we stress that the x-dependence of the twist 4 corrections (i.e. the power corrections in \(1/Q^2\)) found in the present analysis is consistent with the dominance of the exchange of two hard gluonic ladders.

It is interesting to compare the results presented here with results of the two recent papers [10, 11] in which a similar conclusion was reached that the combined HERA data require an inclusion of the twist 4 corrections at small x and \(Q^2\). Hence the main conclusion of all the analyses is the same on the qualitative level. The models applied both for the DGLAP piece and the higher-twist corrections are different, however, and so is the data range used in the best fits, hence on a more detailed level there are certain differences between the results of [10, 11] and this paper. In what follows we compare the models and results.

The analyses of Refs. [10, 11] have been performed using a similar method of the \(\chi ^2\) scan as a function of \(Q^2_{\mathrm {min}} \) for DGLAP fits with and without the higher-twist corrections. In both approaches the data were selected with \(Q^2 >2\) GeV\(^2\) and it was found that the twist-4 corrections to \(F_2 \) are consistent with zero within a moderate determination error. Hence, in the central model of both the approaches,Footnote 6 only \(F_{\mathrm {L}} \) receives the twist 4 correction in the form of \(\Delta ^{(\tau =4)} F_{\mathrm {L}} (x,Q^2)= A F_{\mathrm {L}} (x,Q^2) / Q^2\). The improvements and extensions of the present analysis are the following. The model of higher-twist corrections we propose here is based on general features of multiple scattering in QCD at high energies. In particular it includes the known results on the leading evolution of twist 4 quasi-partonic operators [2, 3]. Therefore the model offers an explicit physics interpretation of the result and allows to obtain a deeper insight into the x-dependence of higher-twist corrections in both \(F_{\mathrm {L}} \) and \(F_{\mathrm {T}} \) (or \(F_2 \)). Furthermore, in order to enhance the sensitivity to the higher-twist effects, we extended the \(Q^2\) range of the fitted data down to \(Q^2=1\) GeV\(^2\). Indeed, with this extension the fits are sensitive to the x-dependence of the twist 4 corrections, which is well constrained by the performed fits. In contrast to results of Refs. [10, 11] we find positive and sizable higher-twist corrections to both \(F_{\mathrm {L}} \) and \(F_2 \). We consider three possible sources of this difference: (i) the extended range of \(Q^2\) used in the present analysis, (ii) the differences in the model of the input parameterisation for the DGLAP evolution, and (iii) the different form of the model of the higher-twist contributions.

By performing an explicit analysis of the fitted higher-twist terms dependence on \(Q^2_{\mathrm {min}}\) with initial scales \(\mu _0 ^2=1.0\,\mathrm {GeV} ^2\) and \(\mu _0 ^2=1.9\,\mathrm {GeV} ^2\), we verified that a stable general pattern of sizable positive higher-twist corrections to both \(F_{\mathrm {L}} \) and \(F_2 \) is obtained in the NLO and NNLO fits. In particular this pattern is found for the value \(Q^2_{\mathrm {min}}=2\) GeV\(^2\) for which the earlier fits were performed. Hence the extended range of \(Q^2\) is not responsible for the differences. The impact of the DGLAP input parameterisation may be partially constrained by an observation that while the pure DGLAP fits the HERAPDF2.0 fit and the LT-STD fit lead to a very similar description of the data (see Fig. 1), these DGLAP fits complemented by different models of the higher-twist terms lead to a significantly different higher-twist corrections to \(F_{\mathrm {L}} \) and \(F_2 \). On the other hand, turning on or off the saturation sea damping factor in the LTS / LT fits does not affect the higher-twist correction pattern. Hence, most likely, the source of the difference in the determined higher-twist corrections to \(F_{\mathrm {L}} \) and \(F_2 \) is the applied model of higher-twist corrections.

An additional conclusion we find w.r.t. the previous studies is the steep \(x^{-2\lambda }\) dependence of the twist 4 corrections, while the twist 4 correction \(\Delta ^{(\tau =4)} F_{\mathrm {L}} (x,Q^2)\) of Refs. [10, 11] is characterised by a single gluonic ladder exchange \(\sim x^{-\lambda }\). It is intriguing why the fits do not distinguish clearly between these two different scenarios. We propose the following interpretation of this apparent discrepancy. As follows from the performed twist decomposition of the reduced cross section in the best fits, the higher-twist corrections become significant in the bins of the lowest \(Q^2\), below \(Q^2 = 5\) GeV\(^2\) and at the smallest available x. Therefore there is only a moderate region of the kinematic space (and a moderate number of the data points in this space) to pin down the x-dependence of the higher-twist corrections with a high precision, see a discussion of this issue performed in Ref. [10]. In order to enhance the sensitivity of the analysis to higher twist effects we extended the data range down to \(Q^2 > 1\) GeV\(^2\) from \(Q^2 > 2\) GeV\(^2\) taken in Refs. [10, 11]. This lowered limit resulted in significantly smaller errors of the fitted higher twist parameters and a much larger impact of the higher twist correction on the fit \(\chi ^2\)/d.o.f. Hence we expect that the enhanced sensitivity of the present analysis may also contribute to the difference between the present results and the results of Refs. [10, 11]. Furthermore in the present approach we introduce additional freedom of the sea input parameterisation at small x motivated by parton saturation damping effects. This feature leads to a reduced sea PDF at small x and small scales w.r.t. the standard sea PDF and a modified physics picture emerges. In [10, 11] the sizeable higher-twist corrections are found in \(F_{\mathrm {L}} \) and they are negligible in \(F_2 \). In the HTS NNLO fit obtained in the present analysis we find the sizeable positive higher-twist corrections in both \(F_{\mathrm {L}} \) and \(F_2 \) at small x and \(Q^2\), and certain suppression of the leading twist contribution to \(F_2 \) due to the saturation damping of the sea. Finally the model of higher-twist correction of Ref. [11] is simple and efficient, however it leads to unphysical behaviour of the predicted \(F_{\mathrm {L}} \) at small \(Q^2\) at NNLO. In summary, we offer an approach to the higher-twist corrections with a clear physical interpretation and an enhanced flexibility. The present analysis confirms the conclusions of Refs. [10, 11] on improvements of the proton structure description due to inclusion of higher-twist corrections, but a different model of the higher-twist terms applied here and a wider data range result in some differences in the fitted higher-twist corrections in \(F_{\mathrm {L}} \) and \(F_2 \). The proposed framework can be successfully extended down to \(Q^2=1\) GeV\(^2\) and gives a reasonably good description of \(F_{\mathrm {L}} \) over the whole \(Q^2\) range.

A comment is in order on the fact that the sign of the obtained twist 4 correction to \(F_{\mathrm {L}} \) is positive and it disagrees with the predictions [15, 16] from the Golec-Biernat–Wüsthoff saturation model [17, 18]. The result is also not intuitive in a simplified picture of the multiple scattering, in which the longitudinal virtual photon–proton scattering total cross section receives a positive correction from double scattering, while one could expect a negative shadowing correction. The sign of the twist 4 correction depends however, on the sign of the corresponding coefficient function. In the GBW model the twist 4 coefficient function is implicitly computed assuming eikonal coupling of two gluonic ladders to the virtual quark loop coming from the virtual photon fluctuation. Such a coupling is not what is predicted by QCD at small x. The \(\gamma ^*\) coupling to four-gluon exchange was analysed in the leading logarithmic 1 / x approximation in the framework of extended generalised leading logarithmic approximation [22, 26,27,28, 44] and it was found that the coupling occurs via triple pomeron vertex, which is distinctly different from the eikonal coupling. A more detailed analysis of the four-gluon coupling to the virtual photon was performed in Refs. [45, 46]. The definite prediction of the sign of the twist 4 corrections to the proton structure functions in this approach is not available yet, so it is interesting to obtain it and confront with the results of the fit.

Finally, let us discuss shortly the concept of the gluon and sea saturation damping effects in the input PDFs at small x. The motivation to propose this damping for the gluon PDF comes from the properties of the solution of the Balitsky–Kovchegov equation [23,24,25] and for the sea PDFs we additionally assume that the input sea PDF at small x is driven by the gluon PDF. The damping may be understood as a result of unitarity corrections due to the multiple scattering effects below the scale of the input parameterisation. For the gluon PDF, the DGLAP fits (with or without higher-twist corrections) with the conventional parameterisation of the input gluon PDF, with the small x asymptotics \(xf_g(x,\mu _0^2) \sim x^{B_g}\), yield a positive \(B_g\) when \(\mu _0^2\) is sufficiently small, inline with the saturation damping property. So, for the gluon PDF the saturation damping is implicit in the standard parameterisation. For the sea PDFs, however, the saturation damping leads to a significant modification of the input shape at \(\mu _0^2 =1\) GeV\(^2\). The saturation damping effect of the sea distribution is found to be supported by the data in the NNLO DGLAP fit with the higher-twist corrections. The inclusion of the saturation damping effects in this fit improves the \(\chi ^2/\)d.o.f. significantly for moderate \(Q^2\). In particular for the complete data sample the \(\chi ^2/\)d.o.f. goes down from 1.25 to 1.19. In this fit, the characteristic value of x for which the saturation damping effects turn on at \(\mu _0^2 =1\) GeV\(^2\) was found to be \(\hat{x} \simeq 2\cdot 10^{-4}\), in consistence with the earlier analyses of the parton saturation at small x [17, 18, 20].