Impact of heavy-flavour production cross sections measured by the LHCb experiment on parton distribution functions at low x

The impact of recent measurements of heavy-flavour production in deep inelastic $ep$ scattering and in $pp$ collisions on parton distribution functions is studied in a QCD analysis in the fixed-flavour number scheme at next-to-leading order. Differential cross sections of charm- and beauty-hadron production measured by LHCb are used together with inclusive and heavy-flavour production cross sections in deep inelastic scattering at HERA. The heavy-flavour data of the LHCb experiment impose additional constraints on the gluon and the sea-quark distributions at low partonic fractions $x$ of the proton momentum, down to $x \sim 5 \times 10^{-6}$. This kinematic range is currently not covered by other experimental data in perturbative QCD fits.


Introduction
Understanding the nucleon structure is one of the fundamental tasks of modern particle physics. In quantum chromodynamics (QCD), the structure of the nucleon is described by parton distribution functions (PDFs), which, in collinear factorisation, represent probability densities to find a parton of longitudinal fraction $x$ of the nucleon momentum at a factorisation scale $\mu_f$. The scale evolution of the PDFs is uniquely predicted by the renormalisation group equations for factorisation [1,2]. The $x$-dependence cannot be derived from first principles and must be constrained by experimental measurements. The precision of the PDFs is of key importance for interpreting the measurements in hadronic collisions. In particular, the uncertainty of the proton PDFs must be significantly reduced in order to improve the accuracy of theory predictions for Standard Model (SM) processes at the LHC.
Deep inelastic lepton-proton scattering (DIS) experiments cover a broad range in $x$ and $\mu_f$. In the perturbative regime, a wide $x$-range of $10^{-4} < x \lesssim 10^{-1}$ is probed by the data of the H1 and ZEUS experiments at the HERA collider [3]. These measurements impose the tightest constraints on the existing PDFs. However, additional measurements are necessary for a better flavour separation and to constrain the kinematic ranges of very small and very high $x$, where the gluon distribution is poorly known. A better constraint on the high-$x$ gluon is needed for an accurate description of the SM backgrounds in searches for new particle production at high masses or momenta. Significant reduction of the uncertainty of the low-$x$ gluon distribution is important for studies of parton dynamics, non-linear and saturation effects. Furthermore, precision of the gluon distribution at low $x$ has implications in the physics of atmospheric showers, being crucial for cross-section predictions of high-energy neutrino DIS interactions [4] and for calculations of prompt lepton fluxes in the atmosphere [5].
Figure 1: Kinematic range in $x$ for the gluon density covered by measurements at HERA and LHCb. For the HERA inclusive DIS data, the $x$ range is indicated, where the gluon PDF uncertainties are less than 10% at $\mu_f^2 = 10$ GeV$^2$. For the LHCb data, the upper (lower) edge of the box refers to the indicated upper (lower) end of the rapidity, $y$, range of the heavy-hadron production.
Heavy-flavour measurements of the LHCb Collaboration [6,7] at the LHC probe the very forward range of the heavy-hadron rapidity $y$ and are sensitive to the gluon PDF at low $x$, as schematically shown in Fig. 1. For this illustration, in the calculation of the kinematics of heavy-quark production at HERA, the leading order (LO) relation $x = x_{Bj}\,(1 + 4m_Q^2/Q^2)$ is used for the typical gluon $x$ in boson-gluon fusion, where $x_{Bj}$ denotes the Bjorken scaling variable, $m_Q$ is the heavy-quark mass, and $Q^2$ is the virtuality of the exchanged electroweak boson. In the case of heavy-quark production at LHCb, the LO formula $x = e^{\pm y}\sqrt{p_T^2 + m_Q^2}/E_p$, assuming $p_z = 0$ in the parton-parton rest frame, is applied. Here, $p_T$ and $p_z$ represent the transverse and longitudinal momenta of the heavy quark, respectively, and $E_p$ is the proton beam energy.
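As a rough numerical illustration of these LO relations, the following sketch evaluates the two parton momentum fractions at the forward edge of the LHCb acceptance. The charm mass of 1.5 GeV is an illustrative assumption (not the fitted value), the beam energy of 3500 GeV follows from $\sqrt{s} = 7$ TeV, and the HERA relation is coded in its standard LO threshold form, which is assumed here.

```python
import math

def x_lhcb(y, pt, m_q, e_beam):
    """LO momentum fractions (x_low, x_high) of the two partons; GeV units."""
    m_t = math.sqrt(pt**2 + m_q**2)  # transverse mass of the heavy quark
    return math.exp(-y) * m_t / e_beam, math.exp(y) * m_t / e_beam

def x_hera(x_bj, q2, m_q):
    """Assumed LO estimate of the gluon x in boson-gluon fusion at threshold."""
    return x_bj * (1.0 + 4.0 * m_q**2 / q2)

M_C = 1.5     # GeV, illustrative charm mass (assumption, not the fitted value)
E_P = 3500.0  # GeV, proton beam energy for sqrt(s) = 7 TeV

x_low, x_high = x_lhcb(y=4.5, pt=0.0, m_q=M_C, e_beam=E_P)
print(f"x_low = {x_low:.2e}, x_high = {x_high:.2e}")
```

With these inputs the low-$x$ parton sits at roughly $5 \times 10^{-6}$, while its partner lies at a few times $10^{-2}$, in line with the two non-overlapping ranges indicated in Fig. 1.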
Heavy-flavour production in proton-proton collisions at the LHC is dominated by the gluon-gluon fusion process. Therefore the LHCb measurements of charm [6] and beauty [7] production in the forward region $2.0 < y < 4.5$ probe the gluon distribution at $5 \times 10^{-6} \lesssim x \lesssim 10^{-4}$, a region which is not accessible with HERA data. Note that the LHCb data are sensitive to the product of gluon densities in two non-overlapping low and medium-to-high $x$ ranges, as illustrated in Fig. 1. Since the medium range is already well constrained by HERA data, which furthermore bridge the gap between the two LHCb ranges, the major impact of the LHCb heavy-flavour measurements is expected at $5 \times 10^{-6} \lesssim x \lesssim 10^{-4}$. The advantage of using heavy-flavour data is that the charm and beauty masses provide hard scales for the perturbative QCD expansion all the way down to their production threshold. To estimate the impact of the LHCb measurement of charm and beauty production on the gluon distribution at low $x$, these data are included in a QCD analysis together with the inclusive DIS [3] and heavy-flavour production [8,9] cross sections measured at HERA.

Experimental data used in the QCD analysis
The main objective of the present QCD analysis is to demonstrate the constraining power of the measurements of heavy-flavour production in DIS and pp collisions for the determination of the PDFs of the proton. The measurements of charm and beauty production at HERA and LHCb, together with the combined HERA inclusive cross-section measurements, are used in a next-to-leading order (NLO) perturbative QCD (pQCD) analysis.
Neutral current (NC) and charged current (CC) inclusive DIS cross sections in ep scattering are directly sensitive to the valence and sea-quark distributions and probe the gluon distribution through scaling violations [1]. HERA measurements of the NC and CC cross sections in DIS at a centre-of-mass energy $\sqrt{s} = 320$ GeV have been combined taking into account systematic correlations [3]. This combined data set contains the complete information on inclusive DIS cross sections published by the H1 and ZEUS Collaborations based on data collected in the years 1994-2000, and has been used for the determination of the PDF set HERAPDF1.0 [3]. The kinematic range of the NC data is $6 \times 10^{-7} \le x_{Bj} \le 0.65$, $0.045 \le Q^2 \le 30000$ GeV$^2$. The CC cross sections span the kinematic range of $1.3 \times 10^{-2} \le x_{Bj} \le 0.40$ and $300 \le Q^2 \le 30000$ GeV$^2$. These combined NC $e^{\pm}p$ and CC $e^{\pm}p$ cross sections represent the basis for all PDF determinations.
In ep scattering, charm and beauty quarks are produced predominantly in the photon-gluon fusion process, which provides a direct probe of the gluon distribution in the proton. Measurements of open-charm production cross sections in DIS at HERA from the H1 and ZEUS Collaborations have been combined [8]. Cross sections for charm production were obtained in the kinematic range of $2.5 \le Q^2 \le 2000$ GeV$^2$ and $3 \times 10^{-5} \le x_{Bj} \le 5 \times 10^{-2}$. The combination method accounts for the correlations of the systematic uncertainties among the different data sets. These combined measurements were used to improve constraints on the gluon distribution and to determine the charm-quark mass [8]. The charm reduced cross sections determined as a function of $Q^2$ and $x_{Bj}$ are used in the present analysis together with all provided details on the systematic correlations. In addition, cross sections for the production of $b$ quarks in ep scattering, as measured by the ZEUS Collaboration [9], are used in the present analysis. These data correspond to an integrated luminosity of 354 pb$^{-1}$ and cover the kinematic range of $5 < Q^2 < 1000$ GeV$^2$. The $b$- and $c$-quark content in the events with at least one jet has been extracted using the invariant mass of charged tracks associated with secondary vertices and lifetime information, and the $b$-quark mass has been measured [9]. In the present analysis, the $b$-quark production data are used mainly to improve constraints on the $b$-quark mass.
For additional constraints on the gluon distribution at low $x$, the differential cross sections of charm and beauty production in pp collisions at $\sqrt{s} = 7$ TeV from the LHCb experiment are used for the first time. The measurement of charm production [6] is based on data corresponding to an integrated luminosity of 15 nb$^{-1}$. Charm production is identified through the full reconstruction of decays of the charmed hadrons$^{1}$ $D^0$, $D^+$, $D^{*+}$, $D_s^+$ and $\Lambda_c^+$. The cross sections are measured as a function of the transverse momentum, $p_T$, and rapidity, $y$, of the reconstructed hadrons. The LHCb data on B-meson production in pp collisions [7] correspond to an integrated luminosity of 0.36 fb$^{-1}$. The $B^+$, $B^0$ and $B_s^0$ mesons are reconstructed in exclusive decays mainly involving $J/\psi$ final states. Correlations between the experimental systematic uncertainties are accounted for as described in the original publications. An uncorrelated systematic uncertainty is obtained for each distribution by subtracting the correlated uncertainties from the total ones. The 3.5% luminosity uncertainty is treated as correlated between the measurements of charm and beauty production. In the present analysis, the normalised cross sections, $\frac{d\sigma}{dy} / \frac{d\sigma}{dy_0}$, for charm and beauty production are calculated from the absolute measurements published by LHCb and are used in the QCD analysis, with $\frac{d\sigma}{dy_0}$ being the cross section in the central bin, $3 < y < 3.5$, of the measured rapidity range in each $p_T$ bin. The uncorrelated experimental uncertainty on $\frac{d\sigma}{dy_0}$ is propagated as a correlated uncertainty to the respective complementary rapidity bins. The QCD analysis is performed twice, using alternatively the absolute or the normalised representation of the LHCb measurements.
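The normalisation step described above can be sketched as follows for a single $p_T$ bin. This is a minimal illustration and not the code used in the analysis: the function name, the input numbers and the linear error propagation are assumptions made here.

```python
def normalise(xsec, unc_uncorr, ref):
    """Normalise one pT bin's rapidity distribution to a reference y bin.

    xsec, unc_uncorr: d(sigma)/dy values and their uncorrelated absolute
    uncertainties, one entry per rapidity bin; ref: index of the reference bin.
    Returns (ratios, uncorrelated uncertainties, correlated uncertainties).
    """
    s0, u0 = xsec[ref], unc_uncorr[ref]
    ratios, unc, corr = [], [], []
    for i, (s, u) in enumerate(zip(xsec, unc_uncorr)):
        if i == ref:
            continue  # the reference bin equals 1 by construction and is dropped
        r = s / s0
        ratios.append(r)
        unc.append(r * u / s)     # uncertainty of the bin itself
        corr.append(r * u0 / s0)  # reference-bin uncertainty, shared by all ratios
    return ratios, unc, corr

# Hypothetical numbers for five rapidity bins between y = 2.0 and y = 4.5
xsec = [100.0, 90.0, 80.0, 60.0, 40.0]
unc_uncorr = [5.0, 4.0, 4.0, 3.0, 3.0]
ratios, unc, corr = normalise(xsec, unc_uncorr, ref=2)  # ref: 3.0 < y < 3.5
```

Dropping the reference bin removes one data point per $p_T$ bin, which is why the number of degrees of freedom is smaller for the normalised data sets.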

Theoretical predictions for heavy-flavour production
In the QCD analysis, the experimental measurements are confronted with corresponding theoretical predictions. The theoretical predictions for charm and beauty production in both ep and pp collisions are obtained at NLO in the fixed-flavour number scheme (FFNS). This scheme and its applicability to HERA measurements are discussed in detail in Ref. [8] and references therein. Predictions for HERA data are obtained by following the approach of the ABM group at NLO using its implementation in OPENQCDRAD [10] in the framework of HERAFitter [11]. The number of active flavours is set to $N_f = 3$, and the renormalisation and factorisation scales (pQCD scales) for heavy-flavour production are chosen as $\mu_r = \mu_f = \sqrt{Q^2 + 4m_Q^2}$, where $m_Q$ denotes the pole mass of $c$ or $b$ quarks$^{2}$. For the light-flavour contributions to the inclusive DIS cross sections, the pQCD scales are set to $\mu_r = \mu_f = Q$.
$^{1}$ Charge conjugation is always implied for charm and beauty hadrons.
Theoretical predictions for heavy-quark production in pp collisions are obtained using the massive NLO calculations [12][13][14] in the FFNS, also available as part of the Mangano-Nason-Ridolfi (MNR) calculations [15]. The pQCD scales are chosen as $\mu_r = A_r^{c,b}\sqrt{m_{c,b}^2 + p_T^2}$ and $\mu_f = A_f^{c,b}\sqrt{m_{c,b}^2 + p_T^2}$, with $A_{r,f}^{c,b}$ being coefficients for $c$ and $b$ quarks, which are discussed in the following. These predictions were used successfully for beauty production in $p\bar{p}$ collisions at the S$p\bar{p}$S [16] and the Tevatron$^{3}$ [17]. They are conceptually very similar to the Frixione-Mangano-Nason-Ridolfi (FMNR) predictions [21] employed for heavy-flavour photoproduction at HERA [22].
The cross-section predictions for heavy-flavoured hadron production not only depend on the kinematics of the heavy-flavour production mechanism, but also on the fragmentation of the heavy quark into a particular final-state hadron. There is no final-state factorisation scale in the FFNS since collinear logarithms of the heavy-quark mass are included in fixed-order perturbation theory. The calculations in [12][13][14][23] describe the production of an on-shell heavy quark. Near the kinematic threshold, the transition of the heavy quark into the observed heavy-flavoured hadron can be taken into account by multiplying the cross section with the appropriate branching fraction. This leads to an excellent description of B- and D-meson production measurements at the Tevatron and the LHC from $p_T = 0$ up to $p_T \sim 4m$ [24,25]. The scope of these calculations can be extended by convoluting the heavy-quark production cross section with a suitable scale-independent fragmentation function describing the hadronisation of the heavy quark. The implementation of the convolution is not unique once the quark and hadron masses are taken into account, and leads to a potentially $p_T$-dependent modelling uncertainty which is, however, small compared to the scale-choice uncertainty at NLO. This fragmentation function is used on a purely phenomenological basis, since it does not strictly appear in the context of a factorisation theorem, and therefore it has to be extracted from data. It depends on the order of the perturbation series but is generally assumed to be otherwise universal. Its main effect is to lower the theoretical predictions at large $p_T$. Typical parametrisations used in the literature are those by Peterson et al. [26], depending on one parameter $\varepsilon$, and by Kartvelishvili et al. [27], depending on one parameter $\alpha_K$.
For the HERA measurements, the fragmentation functions and their uncertainties are considered and accounted for in the original publications [8,9]. The measurements of LHCb are provided as hadron-production cross sections and the fragmentation functions have to be applied explicitly in order to use these data in the QCD analysis. In addition, fragmentation fractions describing the probability of a quark to fragment into a particular hadron have to be applied. The fragmentation fractions for c-flavoured hadrons are taken from [28] and for b-flavoured hadrons from [29].
So far, no fragmentation measurements have been performed in pp collisions. Because of similarities of the $c$-quark production kinematics at HERA and LHCb, the Kartvelishvili fragmentation function [27] with $\alpha_K = 4.4 \pm 1.7$, as obtained from corresponding HERA measurements [30,31] extracted for the NLO FFNS scheme, is applied for predictions of the LHCb measurements of charm-hadron production. The fragmentation is performed in the laboratory frame by rescaling the quark three-momentum, with the energy of the produced hadron being calculated using the hadron mass. This procedure is used for $D^+$ and $D_s^+$ mesons, and for $\Lambda_c^+$ baryons. For $D^0$- and $D^+$-meson production, the contribution from $D^{*+}$ and $D^{*0}$ mesons is treated as described in [32]. For beauty production, the value $\alpha_K = 11 \pm 4$ is used for all $b$-flavoured hadrons, corresponding to measurements at LEP [33].
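To make the fragmentation step concrete, the sketch below uses the usual one-parameter Kartvelishvili form $D(z) \propto z^{\alpha_K}(1-z)$ and applies the momentum rescaling described above. The function names are hypothetical, and the $D^0$ mass of about 1.865 GeV is inserted only for illustration; the actual implementation in the analysis may differ in detail.

```python
import math

def kartvelishvili(z, alpha):
    """Normalised Kartvelishvili function D(z) = N z^alpha (1 - z), 0 <= z <= 1."""
    norm = (alpha + 1.0) * (alpha + 2.0)  # makes the integral over z equal to 1
    return norm * z**alpha * (1.0 - z)

def mean_z(alpha):
    """Analytic mean momentum fraction <z> for the function above."""
    return (alpha + 1.0) / (alpha + 3.0)

def hadron_kinematics(p_quark, z, m_hadron):
    """Rescale the quark three-momentum by z; hadron energy from the hadron mass."""
    p_h = tuple(z * c for c in p_quark)
    e_h = math.sqrt(sum(c * c for c in p_h) + m_hadron**2)
    return p_h, e_h

# Mean z for the charm and beauty parameters quoted in the text
print(mean_z(4.4), mean_z(11.0))

# Hypothetical quark momentum (GeV) fragmenting into a D0 meson
p_h, e_h = hadron_kinematics((3.0, 0.0, 4.0), z=0.5, m_hadron=1.865)
```

The larger $\alpha_K$ used for beauty corresponds to a harder fragmentation, i.e. the $b$ hadron carries on average a larger fraction of the quark momentum than a charm hadron does.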
The fragmentation-fraction uncertainties are assigned to the measurements and are treated as correlated, while the uncertainties arising from the variations of assumptions on the fragmentation functions are treated in the form of variations of the theory predictions in the QCD fit.

Details of the QCD analysis
The open source QCD fit framework for PDF determination HERAFitter [11], version 1.0.0, is used. The partons are evolved by using the QCDNUM program [34]. The analysis-specific modifications to HERAFitter address the heavy-flavour treatment as follows. The massive fixed-flavour number scheme [35] with the number of flavours $N_f = 3$ is used for the treatment of heavy-flavour contributions. The calculation of one-particle inclusive heavy-quark production cross sections in hadron collisions at NLO according to [14] is implemented by using original routines from the MNR code [36]. The results agree with those obtained with the original MNR code to better than 1%.
The 3-flavour strong coupling constant in the NLO $\overline{\mathrm{MS}}$ scheme is set to $\alpha_S(m_Z)^{N_f=3} = 0.1059 \pm 0.0005$, which corresponds to the world average value of $\alpha_S(m_Z)^{N_f=5} = 0.1185 \pm 0.0006$, using two-loop evolution equations [34].
The $Q^2$ range of the inclusive HERA data is restricted to $Q^2 > Q^2_{\min} = 3.5$ GeV$^2$. The procedure for the determination of the PDFs follows the approach used in the HERAPDF1.0 QCD fit [3]. The following independent combinations of parton distributions are chosen in the fit procedure at the initial scale of the QCD evolution $Q_0^2 = 1.4$ GeV$^2$: the valence-quark distributions $xu_v(x)$, $xd_v(x)$, the gluon distribution $xg(x)$ and the u-type and d-type anti-quark distributions (which are identical to the sea-quark distributions), $x\bar{U}(x)$, $x\bar{D}(x)$, where $x\bar{U}(x) = x\bar{u}(x)$ and $x\bar{D}(x) = x\bar{d}(x) + x\bar{s}(x)$. At the scale $Q_0$, the parton distributions are represented by the parametrisations of Eqs. (1)-(5). The normalisation parameters $A_{u_v}$, $A_{d_v}$, $A_g$ are determined by the QCD sum rules, the $B$ parameters are responsible for the small-$x$ behaviour of the PDFs, and the parameters $C$ describe the shape of the distribution as $x \to 1$. A flexible form for the gluon distribution is adopted with the choice of $C'_g = 25$ motivated by the approach of the MSTW group [37,38]. The $s$-quark distribution is defined through an $x$-independent strangeness fraction, $f_s$, of the d-type sea, $x\bar{s} = f_s\, x\bar{D}$ at $Q_0^2$, where $f_s = 0.31^{+0.19}_{-0.08}$ as in the analysis of [38], including the recent complementary measurement [39]. Additional constraints $B_{\bar{U}} = B_{\bar{D}}$ and $A_{\bar{U}} = A_{\bar{D}}(1 - f_s)$ are imposed, with $x\bar{u} \to x\bar{d}$ as $x \to 0$. The analysis is performed by fitting the remaining 13 free parameters in Eqs. (1)-(5).
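The explicit form of Eqs. (1)-(5) is not reproduced in this text. As a hedged sketch only, a HERAPDF-style parametrisation consistent with the description above (normalisations $A$, small-$x$ exponents $B$, large-$x$ exponents $C$, and a second gluon term with fixed $C'_g$) could read as follows. The term $E_{u_v} x^2$ is an assumption introduced here so that the parameter counting gives 13; the original publication should be consulted for the exact expressions.

```latex
\begin{align}
xg(x)       &= A_g\, x^{B_g} (1-x)^{C_g} - A'_g\, x^{B'_g} (1-x)^{C'_g}, \\
xu_v(x)     &= A_{u_v}\, x^{B_{u_v}} (1-x)^{C_{u_v}} \left(1 + E_{u_v} x^2\right), \\
xd_v(x)     &= A_{d_v}\, x^{B_{d_v}} (1-x)^{C_{d_v}}, \\
x\bar{U}(x) &= A_{\bar{U}}\, x^{B_{\bar{U}}} (1-x)^{C_{\bar{U}}}, \\
x\bar{D}(x) &= A_{\bar{D}}\, x^{B_{\bar{D}}} (1-x)^{C_{\bar{D}}}.
\end{align}
```

Under this assumed form, fixing $A_g$, $A_{u_v}$, $A_{d_v}$ by the sum rules, $C'_g$ by hand, and $A_{\bar{U}}$, $B_{\bar{U}}$ by the constraints leaves $B_g$, $C_g$, $A'_g$, $B'_g$, $B_{u_v}$, $C_{u_v}$, $E_{u_v}$, $B_{d_v}$, $C_{d_v}$, $C_{\bar{U}}$, $A_{\bar{D}}$, $B_{\bar{D}}$, $C_{\bar{D}}$, i.e. 13 free parameters.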
The PDF parameters are determined in HERAFitter by minimisation of a $\chi^2$ function taking into account correlated and uncorrelated uncertainties [40] of the measurements. Systematic uncertainties are assumed to be proportional to the central prediction values, whereas statistical uncertainties scale with the square root of the predictions. Correlated uncertainties are treated using the nuisance parameter representation [40]. To minimise biases arising from the likelihood transition to $\chi^2$ when the scaling of the errors is applied, a logarithmic correction is added to the $\chi^2$ function [41].
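The structure of such a $\chi^2$ function can be sketched as below. This is a simplified illustration written for this text, not the HERAFitter implementation: the function signature and the exact form of each term are assumptions, chosen to show the three ingredients named above (scaled uncertainties, nuisance-parameter shifts with a penalty term, and a logarithmic correction).

```python
import math

def chi2(data, theory, stat, unc, gamma, b):
    """Simplified chi2 with nuisance parameters and scaled uncertainties.

    data, theory: measured and predicted values per point
    stat, unc:    relative statistical and uncorrelated systematic uncertainties
    gamma[i][j]:  relative effect of correlated source j on point i
    b[j]:         nuisance parameter (shift in units of sigma) for source j
    """
    total = 0.0
    for mu, m, ds, du, g in zip(data, theory, stat, unc, gamma):
        # prediction shifted by the correlated systematic sources
        shifted = m * (1.0 - sum(gj * bj for gj, bj in zip(g, b)))
        # systematic part proportional to the prediction, statistical part
        # scaling with the square root of the prediction
        var = (du * m) ** 2 + ds**2 * mu * m
        total += (mu - shifted) ** 2 / var
        # logarithmic correction for the rescaled uncertainties
        total += math.log(var / ((du * mu) ** 2 + (ds * mu) ** 2))
    return total + sum(bj * bj for bj in b)  # penalty for the nuisance shifts

# Hypothetical two-point example with one correlated source
value = chi2([10.0, 20.0], [10.0, 20.0], [0.1, 0.1], [0.05, 0.05],
             [[0.02], [0.03]], [1.0])
```

When the prediction equals the data and all nuisance parameters vanish, both the quadratic and the logarithmic terms are zero, so the function returns zero as expected for a perfect description.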
The heavy-quark masses are left free in the fit. They are well constrained by the measurements of charm and beauty production in DIS, and the fitted values (see Table 2 in Appendix A) are consistent with the ones obtained in the corresponding HERA analyses [8,9] within the intrinsic theoretical systematic uncertainty of the pole mass definition [42].
The QCD analysis is performed twice using either absolute or normalised differential cross sections of heavy-flavour production from LHCb measurements, as defined in Section 2. The implementation of the theory calculations [43] as described in Section 3 allows the pQCD scales, i.e. the parameters $A^{c,b}_{r,f}$, and the values for the pole mass of the heavy quarks to be changed at each fit iteration.
In the QCD analysis using the normalised LHCb measurements, the pQCD scales are fixed to $A_r = A_f = 1$ for the central result. The scale dependence is studied by varying the pQCD scales independently such that $0.5 \le A_r, A_f \le 2$. $A^c$ and $A^b$ are always varied simultaneously. The resulting scale dependence is small, since it is largely absorbed by the normalisation, as illustrated in Appendix A.
In the variant of the fit using the absolute LHCb cross sections, the scale dependence of the predicted cross section is the dominant theoretical uncertainty. The same scale choice and variation procedure as applied for the variant of the fit using the normalised LHCb measurements leads to unacceptably high $\chi^2$ values of the respective fits [43]. Therefore, the four scales are technically treated as independent, fully correlated systematic uncertainties for the central result. Since the pQCD scales are not physical parameters, the related uncertainties are not obtained from the fit. Instead, the effect of the scale choice on the other fitted parameters is evaluated by an independent variation of $A_f$ in the range $0.5 < A_f^c = A_f^b < 2$ with $A_r^c$ and $A_r^b$ as free parameters, or of $A_r$ in the range $0.25 < A_r^c = A_r^b < 1$ with $A_f^c$ and $A_f^b$ being free parameters. For the variation $A_f^c = A_f^b = 0.5$, a cut $p_T > 2$ GeV is applied for the charm LHCb data to ensure that the factorisation scale is above 1 GeV$^2$, since this is technically required in QCDNUM. This procedure ensures an acceptable fit quality for all variations [43], as required for a meaningful extraction of the other uncertainties. Because of the unconventional scale treatment, the fit using absolute cross sections is considered to be a cross check.
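The role of the $p_T$ cut can be checked with a few lines of arithmetic. The sketch assumes a factorisation scale of the form $\mu_f = A_f\sqrt{m_Q^2 + p_T^2}$ and an illustrative charm pole mass of 1.3 GeV; both are assumptions made for this example rather than values quoted in the text.

```python
def mu_f_squared(a_f, m_q, pt):
    """Factorisation scale squared, assuming mu_f = A_f * sqrt(m_Q^2 + pT^2)."""
    return a_f**2 * (m_q**2 + pt**2)

M_C = 1.3  # GeV, illustrative charm pole mass (assumption, not the fitted value)

# Lowest scale reached for a given lower pT cut when A_f = 0.5
for pt_min in (0.0, 1.0, 2.0):
    mu2 = mu_f_squared(0.5, M_C, pt_min)
    print(f"pT > {pt_min} GeV: mu_f^2 >= {mu2:.2f} GeV^2, above 1 GeV^2: {mu2 > 1.0}")
```

Under these assumptions only the cut at 2 GeV keeps $\mu_f^2$ above 1 GeV$^2$ for the halved factorisation scale, which is consistent with the cut value chosen in the analysis.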

PDF uncertainties
The PDF uncertainties are estimated following the approach of HERAPDF1.0 [3] in which experimental, model, and parametrisation uncertainties are taken into account. Experimental uncertainties are evaluated using the Hessian method [40]. A tolerance criterion of $\Delta\chi^2 = 1$ is adopted for defining the fit uncertainties that originate from the experimental uncertainties of the measurements included in the analysis.
Model uncertainties arise from the variations in the values assumed for $Q^2_{\min}$ imposed on the HERA data, which is varied in the interval $2.5 \le Q^2_{\min} < 5.0$ GeV$^2$; the fraction of strange quarks, varied in the range $0.23 < f_s < 0.50$; and the value of the strong coupling, varied in the range $0.1054 < \alpha_S(m_Z)^{N_f=3} < 0.1064$. The pQCD scales for heavy-quark production in DIS are varied simultaneously by a factor of 2 up and down for both charm and beauty. For the fits with the LHCb data, the model uncertainties include theoretical uncertainties for the cross-section predictions for heavy-flavoured hadron production, arising from variation of the pQCD scales and of the fragmentation parameters, as described in Section 3. Uncertainties arising from these model variations are referred to as MNR uncertainties in the following.
The parametrisation uncertainty is estimated similarly to the HERAPDF1.0 procedure: for all parton densities, additional parameters are added one by one in the functional form of the parametrisations in Eqs. (1)-(5), in a similar way as described in [3,8,9]. Furthermore, the starting scale is varied to $Q_0^2 = 1.9$ GeV$^2$. The parametrisation uncertainty is constructed as an envelope built from the maximal differences between the PDFs resulting from all the parametrisation variations and the central fit at each $x$ value. The total PDF uncertainty is obtained by adding experimental, model and parametrisation uncertainties in quadrature.
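The combination of the uncertainty components at a single $x$ value can be sketched as follows. The function names and numbers are hypothetical, and the separate treatment of upward and downward deviations in the envelope is an assumption of this illustration.

```python
import math

def envelope(central, variants):
    """Largest upward and downward deviations from the central PDF at one x."""
    up = max([v - central for v in variants if v > central], default=0.0)
    down = max([central - v for v in variants if v < central], default=0.0)
    return up, down

def total_uncertainty(exp, model, param):
    """Add experimental, model and parametrisation uncertainties in quadrature."""
    return math.sqrt(exp**2 + model**2 + param**2)

# Hypothetical gluon values at one x from the central fit and three variants
up, down = envelope(10.0, [10.5, 9.0, 11.0])
total_up = total_uncertainty(exp=0.3, model=0.4, param=up)
```

Because the envelope takes the largest deviation rather than a sum, adding further parametrisation variants can only increase this component if a new variant deviates more than all previous ones.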

Results
In Fig. 2, the absolute cross sections for $D^0$- and $B^+$-meson production in pp collisions are shown for one representative rapidity bin and are compared to the theory predictions as used in the QCD analysis. A significant scale dependence is observed. The normalised cross sections for a representative $p_T$ bin of the same data set are compared to the respective theory predictions in Fig. 3. The advantage of using the normalised cross section is a significant reduction of the scale dependence of the theoretical prediction, while retaining the sensitivity of the cross sections to the gluon distribution. The reduction of the uncertainty due to scale variation is related to the fact that the scale choice affects mostly the normalisation but only to some extent the shape of heavy-quark production kinematics, as demonstrated in Figs. 6 and 7 in Appendix A.
The fit quality, represented by the total and partial values of $\chi^2$ divided by the number of degrees of freedom, $n_{\mathrm{dof}}$, for both variants of the QCD analysis is presented in Table 1. When the normalised LHCb measurements are used in the QCD analysis, $n_{\mathrm{dof}}$ is appropriately reduced for the respective data sets. The fitted parameters are presented in Table 2 in Appendix A.
The resulting gluon, valence-quark and sea-quark distributions with their total uncertainties are presented at $\mu_f^2 = 10$ GeV$^2$ in Fig. 4 and compared to the result of the fit based solely on HERA measurements of inclusive and heavy-flavour DIS. The uncertainties on the gluon and sea-quark distributions at low $x$ are significantly reduced in both cases, using LHCb absolute or normalised heavy-quark production cross sections. In the case of the variant of the fit based on normalised LHCb cross sections, the uncertainties are reduced by more than a factor of three at $x \sim 5 \times 10^{-6}$, which is the edge of the sensitivity of the included measurements (Fig. 1). Consistent results are obtained in the fit using the absolute cross sections, which is considered an important cross check of the self-consistency of the NLO theory description.
The individual contributions of the experimental, model and parametrisation uncertainties for both cases of using the LHCb measurements are shown in Fig. 5 and compared to the result of the fit using only HERA data. The gluon distribution at low $x$ is constrained by the HERA measurements mostly via the sum rules and this results in large parametrisation uncertainties. Once the LHCb measurements are included in the QCD analysis, the gluon distribution is directly probed and the parametrisation dependence of the PDF is significantly reduced.
Figure 2: Data to theory comparison for a representative subset of the LHCb absolute cross sections for production of $D^0$ mesons for $3.5 < y < 4.0$ (left) and of $B^+$ mesons for $3.0 < y < 3.5$ (right). In the bottom panels the ratios theory/data for the nominal variant of the fit and the scale variations are shown. For demonstration purposes, correlated shifts for data points obtained in the fit using nuisance parameters are applied to theoretical predictions. Uncorrelated uncertainties for data points are shown as they are rescaled in the fit, while total uncertainties are shown as not rescaled.
The main differences in the PDF uncertainties between the fits using the absolute and normalised LHCb measurements are caused by the MNR uncertainties. The variation of the pQCD scales in the prediction of the absolute cross section of heavy-flavour production in pp collisions leads to significant changes in the normalisation of the cross section and represents the dominant uncertainty on the PDFs. The variations of the assumption on fragmentation parameters [43] result in a smaller uncertainty, as compared to that due to the scale variations.
In the case of the PDF fit using the normalised LHCb cross sections, the MNR uncertainty is strongly reduced, since variations of pQCD scales and of the fragmentation parameters do not significantly affect the shape of the y distributions for heavy-flavour production. Therefore this is considered to be the primary result of this paper, while the consistency between the absolute and normalised variants is considered to be an important cross check.

Conclusions
The sensitivity of heavy-flavour production in pp collisions to the low-$x$ gluon distribution was studied in a comprehensive QCD analysis at NLO. The measurements of $c$- and $b$-hadron production cross sections at the LHCb experiment are included in a PDF fit together with inclusive and heavy-flavour production measurements in DIS at HERA. Since the bulk of the heavy-flavour data is close to the kinematic threshold, the fixed-flavour number scheme at next-to-leading order is used for the predictions of heavy-flavour production in ep and pp collisions. A significant reduction of the parametrisation uncertainty of the gluon distribution at very low $x$ is observed, as compared to the result of the PDF fit using only HERA DIS data.
Two ways of using the LHCb measurements in the fit are studied. Although the absolute differential cross-section measurements contain more information, the resulting PDFs suffer from large theoretical uncertainty due to uncalculated higher-order corrections, estimated by the variation of the pQCD scales. By using only the rapidity shape information in the normalised cross sections for the final result, this uncertainty is significantly reduced for the PDF extraction.
The present analysis has illustrated the high potential of the LHCb measurements to constrain the gluon distribution at low x, and global PDF fits clearly can profit from the inclusion of such data. Precise measurements of normalised cross sections of heavy-flavour production in the forward kinematic range of the LHC therefore have a great potential to further improve the constraints on