1 Introduction

Measurements of open charm production in deep-inelastic electronFootnote 1-proton scattering (DIS) at HERA provide important input for stringent tests of the theory of strong interactions, quantum chromodynamics (QCD). Previous measurements [118] have demonstrated that charm quarks are predominantly produced by the boson-gluon-fusion process, \(\gamma g\rightarrow c\overline{c}\), which is sensitive to the gluon distribution in the proton.

The mass of the charm quark, m c , provides a sufficiently high scale necessary to apply perturbative QCD (pQCD). However, additional scales are involved in charm production, e.g. the virtuality, Q 2, of the exchanged photon in case of DIS and the transverse momenta, p T , of the outgoing quarks. The presence of several hard scales complicates the QCD calculations for charm production. Depending on the details of the treatment of m c , Q and p T , different approaches in pQCD have been formulated. In this paper, the massive fixed-flavour-number-scheme (FFNS) [1927] and different implementations of the variable-flavour-number-scheme (VFNS) [2840] are considered.

At HERA different techniques have been used to measure open charm production cross sections in DIS. The full reconstruction of D or D mesons [1, 2, 46, 1012, 15, 18], the long lifetime of heavy flavoured hadrons [79, 12, 14] or their semi-leptonic decays [13] are exploited. In general, the best signal-to-background ratio of the charm samples is observed in the analysis of fully reconstructed D mesons. However, the branching ratios are small and the phase space of charm production accessible with D mesons is restricted considerably because all products from the D meson decay have to be measured. The usage of semi-leptonic decays of charmed hadrons for the analysis of charm production can profit from large branching fractions and a better coverage in polar angle at the cost of a worse signal-to-background ratio. Fully inclusive analyses using lifetime information are not hampered by specific branching ratios and are in addition sensitive to low transverse momenta. Among the methods used it has the largest phase space coverage, however it yields the worst signal-to-background ratio.

In this paper the published data of H1 [9, 10, 14, 15, 18] and ZEUS [4, 6, 12, 13] are combined. All publications on data setsFootnote 2 are included for which the necessary information on systematic uncertainties needed for the combination is available and which have not been superseded. For the combination, the published cross sections in the restricted phase space regions of the individual measurements are extrapolated to the full phase space of charm production in a coherent manner by the use of FFNS calculations in next-to-leading order (NLO). This includes the coherent treatment of the related systematic uncertainties.

The combination is based on the procedure described in [4143]. The correlated systematic uncertainties and the normalisation of the different measurements are accounted for such that one consistent data set is obtained. Since different experimental techniques of charm tagging have been employed using different detectors and methods of kinematic reconstruction, this combination leads to a significant reduction of statistical and systematic uncertainties.

The combined charm data are used together with the combined inclusive DIS cross sections [43] to perform a detailed QCD analysis using different models of charm production in DIS. The role of the value for the charm quark mass which enters as a parameter in these models is investigated and the optimal value of the charm quark mass parameter is determined for each of the QCD calculations considered. The impact of this optimisation on predictions of W ± and Z production cross sections at the LHC is discussed. The running mass of the charm quark is determined using the modified minimal subtraction scheme (\({\overline{\text{MS}}}\)) variant [44, 45] of the FFNS.

The paper is organised as follows. In Sect. 2 the different theoretical schemes of charm production are briefly reviewed. The data samples used for the combination and the details of the combination procedure are described in Sect. 3. The results on the combined reduced cross section are presented in Sect. 4. The predictions from different QCD approaches for charm production in DIS are compared to the measurement in Sect. 5. The QCD analysis is presented in Sect. 6. Conclusions are given in Sect. 7.

2 Open charm production in DIS

In this paper, charm production via neutral-current deep-inelastic ep scattering is considered. In the kinematic domain addressed, where the virtuality Q 2 of the exchanged boson is small, \(Q^{2} \ll M_{Z}^{2}\), charm production is dominated by virtual photon exchange. The cross section may then be written in terms of the structure functions \(F^{c\bar{c}}_{2}(x,Q^{2})\) and \(F^{c\bar{c}}_{L}(x,Q^{2})\) as

(1)

Here x=Q 2/2pq is the Bjorken scaling variable and y=pq/pl is the inelasticity with p, q and l denoting the 4-momenta of the proton, photon and electron, respectively, and Q 2=−q 2. The suffix \(c\bar{c}\) indicates the presence of a \(c\bar{c}\) pair in the final state, including all possible QCD production processes. The cross section \({{\rm d}^{2} \sigma^{c\bar{c}}}/{{\rm d}x {\rm d}Q^{2}}\) is given at the Born level without QED and electro-weak radiative corrections, except for the running electromagnetic coupling, α(Q 2).

In this paper, the results are presented in terms of reduced cross sections, defined as follows:

(2)

The contribution \(F_{L}^{c\bar{c}}\), originating from the exchange of longitudinally polarised photons, is small in the kinematic range of this analysis and reaches up to a few per cent only at high y [46].

The above definition of \(F^{c\bar{c}}_{2 (L)} (x,Q^{2})\) (also denoted as \(\tilde{F}_{c}\) [37] or F c,SI [47]) is suited for measurements in which charm is explicitly detected. It differs from what is sometimes used in theoretical calculations in which \(F^{c}_{2 (L)} (x,Q^{2})\) [3537, 48] is defined as the contribution to the inclusive F 2(L)(x,Q 2) in which the virtual photon couples directly to a c or \(\bar{c}\) quark. The latter excludes contributions from final state gluon splitting to a \(c\bar{c}\) pair in events where the photon couples directly to a light quark, and contributions from events in which the photon is replaced by a gluon from a hadron-like resolved photon. As shown in Table 1 of [37], the gluon splitting contribution is expected to be small enough to allow a reasonable comparison of the experimental results to theoretical predictions using this definition. The hadron-like resolved photon contribution is expected to be heavily suppressed at high Q 2, but might not be completely negligible in the low Q 2 region. From the point of view of pQCD it appears at \(O(\alpha_{s}^{3})\) and it is neglected in all theoretical calculations used in this paper.

At photon virtualities not much larger than the charm quark mass, charm production in DIS is described in the framework of pQCD by flavour creation through the virtual photon-gluon-fusion process. Since a \(c\bar{c}\) pair is being produced, there is a natural lower cutoff of 2m c for the mass of the hadronic final state. The non-zero mass influences the kinematics and higher order corrections in essentially all the HERA phase space. Therefore the correct treatment of the mass of charm and beauty quarks is of particular importance in the QCD analysis and determination of parton distribution functions (PDFs) of the proton. In the following, the different approaches used in the treatment of the charm quark mass in pQCD calculations are discussed.

2.1 Zero mass variable flavour number scheme

In the zero-mass variable-flavour-number-scheme (ZM-VFNS) [28] the charm quark mass is set to zero in the computation of the matrix elements and kinematics, and a threshold is introduced at \(Q^{2} \sim m_{c}^{2}\), below which the charm production cross section is assumed to vanish. The charm quark is also excluded from the parton evolution and only three light flavours are left active. Above this threshold, charm is treated as a massless parton in the proton, leading to the introduction of the charm quark distribution function of the proton. The transition from three to four active flavours in the parton evolution follows the BMSN prescription [31]. The lowest order process for charm production in this approach is the quark-parton-model like scattering at order zero in α s . The running of α s is calculated using three flavours (u,d,s) below the scale m c , and using four or five flavours (including charm and beauty) above the respective threshold scales. The main advantage of this scheme is that the Q 2 evolution of the charm density provides a resummation of terms proportional to \(\log(Q^{2}/m_{c}^{2})\) that may be large at large Q 2. It has been shown [15, 18] that this approach does not describe the charm production data at HERA.

2.2 Fixed flavour number scheme

In the fixed-flavour-number-scheme (FFNS) the charm quark is treated as massive at all scales, and is not considered as a parton in the proton. The number of active flavours, n f , is fixed to three, and charm quarks are assumed to be produced only in the hard scattering process. Thus the leading order (LO) process for charm production is the boson-gluon-fusion process at O(α s ). The next-to-leading order (NLO) coefficient functions for charm production at \(O(\alpha_{s}^{2})\) in the FFNS were calculated in [1922] and adopted by many global QCD analysis groups [2327], providing PDFs in the FFNS. In the data analysis presented in this paper, the prediction of open charm production in the FFNS at NLO is used to calculate inclusive [1922] and exclusive [49] quantities. Partial \(O(\alpha_{s}^{3})\) corrections are also available [50, 51].

In the calculations [1922, 49] the pole mass definition [52] is used for the charm quark mass, and gluon splitting contributions are included. In a recent variant of the FFNS scheme (ABM FFNS) [44, 45], the running mass definition in the modified minimal subtraction scheme (\(\overline {{\text{MS}}}\)) is used instead. This scheme has the advantage of reducing the sensitivity of the cross sections to higher order corrections, and improving the theoretical precision of the mass definition.

To O(α s ), which is relevant for the calculation of cross sections to \(O(\alpha_{s}^{2})\), the \(\overline{\text{MS}}\) and pole masses are related by [5355]

$$ m_c(Q) = m_{c,{\mathrm{pole}}} \biggl[1 - \frac{\alpha_s}{\pi} - \frac{3\alpha_s}{4\pi} \ln \biggl(\frac{Q^2}{m_c(m_c)^2} \biggr) \biggr], $$
(3)

i.e. the running mass evaluated at the scale Q=m c is smaller than the pole mass.

2.3 General mass variable flavour number scheme

In the general-mass variable-flavour-number-schemes (GM-VFNS) charm production is treated in the FFNS in the low Q 2 region, where the mass effects are largest, and in the ZM-VFNS approach at high Q 2, where the effect of resummation is most noticeable. At intermediate scales an interpolation is made between the FFNS and the ZM-VFNS, avoiding double counting of common terms. This scheme is expected to combine the advantages of the FFNS and ZM-VFNS, while introducing some level of arbitrariness in the treatment of the interpolation. Different implementations of the GM-VFNS are available [2940] and are used by the global QCD analysis groups.

All GM-VFNS implementations used in this paper use the pole mass scheme for the definition of the massive part of the calculation. However, the freedom introduced by choosing an interpolation approach and the different methods for the truncation of the perturbative series lead to an additional theoretical uncertainty when extracting the mass from the data. Within this uncertainty, different approaches can yield different values. Therefore in the following we choose to refer to the charm mass appearing in the GM-VFNS as a mass parameter, M c , of the individual interpolation models.

3 Combination of H1 and ZEUS measurements

3.1 Data samples

The H1 [5658] and ZEUS [59] detectors were general purpose instruments which consisted of tracking systems surrounded by electromagnetic and hadronic calorimeters and muon detectors, ensuring close to 4π coverage of the ep interaction point. Both detectors were equipped with high-resolution silicon vertex detectors: the Central Silicon Tracker [60] for H1 and the Micro Vertex Detector [61] for ZEUS.

The data sets included in the combination are listed in Table 1 and correspond to 155 different cross section measurements. The combination includes measurements of charm production performed using different tagging techniques: the reconstruction of particular decays of D-mesons [4, 6, 10, 12, 15, 18], the inclusive analysis of tracks exploiting lifetime information [14] or the reconstruction of muons from charm semi-leptonic decays [13].

Table 1 Data sets used in the combination. For each data set the charm tagging method, the Q 2 range, the number of cross section measurements N and the integrated luminosity \(\mathcal{L}\) are given. The data set with the \(D^{0,{\rm no}D^{*+}}\) tagging method is based on an analysis of D 0 mesons not originating from detectable D ∗+ decays. Charge conjugate modes are always implied

The results of the inclusive lifetime analysis [14] are directly taken from the original measurement in the form of \(\sigma_{\rm red}^{c\bar{c}}\). In the case of D-meson and muon measurements, the inputs to the combination are visible cross sections \(\sigma_{\rm vis, bin}\) defined as the D (or μ) production cross section in a particular p T and η range, reported in the corresponding publications,Footnote 3 in bins of Q 2 and y or x. Where necessary, the beauty contribution to the inclusive cross sections of D meson production is subtracted using the estimates of the corresponding papers. The measured cross sections include corrections for radiation of a real photon from the incoming or outgoing lepton and for virtual electroweak effects using the HERACLES program [62]. QED corrections to the incoming and outgoing quarks were neglected. All D-meson cross sections are updated using the most recent branching ratios [52].

3.2 Extraction of \(\sigma_{\rm red}^{c\bar{c}}\) from visible cross sections

In the case of D-meson and muon production, \(\sigma_{\rm red}^{c\bar{c}}\) is obtained from the visible cross sections \(\sigma_{\rm vis, bin}\) measured in a limited phase space using a common theory. The reduced charm cross section at a reference (x,Q 2) point is extracted according to

$$ \sigma_{\rm red}^{c\bar{c}}\bigl(x,Q^2\bigr)= \sigma_{\rm vis, bin} \frac{\sigma_{\rm red}^{c\bar{c}, \mathrm{th}}(x,Q^2)}{\sigma^{\rm th}_{\rm vis, bin}}. $$
(4)

The program from Riemersma et al. [1922] and the program HVQDIS [49] are used to calculate, in NLO FFNS, the reduced cross sections \(\sigma_{\rm red}^{c\bar{c}, \mathrm{th}}(x,Q^{2})\) and the visible cross sections \(\sigma^{\rm th}_{\rm vis, bin}\), respectively. The following parameters are used consistently in both NLO calculations and the corresponding variations are used to estimate the associated uncertainties on the extraction of \(\sigma_{\rm red}^{c\bar{c}}\):

  • pole mass of the charm quark m c =1.5±0.15 GeV;

  • renormalisation and factorisation scales \(\mu_{f}=\mu_{r}=\sqrt{Q^{2}+4m_{c}^{2}}\), varied simultaneously up or down by a factor of two;

  • strong coupling constant \(\alpha_{s}^{n_{f}=3}(M_{Z}) = 0.105 \pm0.002\), corresponding to \(\alpha_{s}^{n_{f}=5}(M_{Z}) = 0.116 \pm0.002\);

  • the proton structure is described by a series of FFNS variants of the HERAPDF1.0 set [43] at NLO, evaluated for m c =1.5±0.15 GeV and for \(\alpha_{s}^{n_{f}=3}(M_{Z}) = 0.105 \pm0.002\). For the light flavour contribution, the renormalisation and factorisation scales are set to μ r =μ f =Q, while for the heavy quark contributions the scales of \(\mu_{f}=\mu_{r}=\sqrt{Q^{2}+4m_{Q}^{2}}\) are used, with m Q being the mass of the charm or beauty quark. Additional PDF sets are evaluated, in which the scales are varied simultaneously by a factor of two up or down. Only the scale variation in the heavy quark contribution has a sizeable effect on the PDFs. The experimental, model and parameterisation uncertainties of the PDFs at 68 % C.L. are also included in the determination of the PDF uncertainties on \(\sigma_{\rm red}^{c\bar{c}}\). For estimating the uncertainties of the NLO calculations [1922, 49] due to the respective choice of the scales, α s and m c , the appropriate PDF set is used. The effects of the PDF uncertainties are calculated according to the HERAPDF1.0 prescription [43].

The cross sections \(\sigma_{\rm vis, bin}^{\rm th}\) depend, in addition to the kinematics of the charm quark production mechanism, also on the fragmentation of the charm quark into particular hadrons. The charm quark fragmentation function has been measured by H1 [63] and ZEUS [11] using the production of D mesons, with and without associated jets, in DIS and photoproduction (Q 2≈0 GeV2). In the calculation of \(\sigma_{\rm vis, bin}^{\rm th}\) the fragmentation is performed in the γ -p centre-of-mass frame, using for the fraction of the charm quark momentum carried by the charmed meson a fragmentation function which is controlled by a single parameter, α K  [64]. The parameter relevant for charm fragmentation into D mesons has been determined [11, 63] for the NLO FFNS calculation for three different kinematic and jet requirements, which correspond approximately to three different regions of the γ -parton centre-of-mass energy squared, \(\hat{s}\). The values of α K , together with the corresponding ranges in \(\hat{s}\), are listed in Table 2. The fragmentation is observed to become softer with increasing \(\hat {s}\), as expected from the evolution of the fragmentation function. The limits on the \(\hat{s}\) ranges are determined with HVQDIS by applying the jet requirements of the individual analysis on parton level. The α K parameters and the \(\hat{s}\) ranges were varied according to their uncertainties to evaluate the corresponding uncertainty on \(\sigma_{\rm vis, bin}^{\rm th}\).

Table 2 The α K parameters used for the longitudinal fragmentation into D mesons and in ground state (g.s.) charmed hadrons. The first column shows the \(\hat{s}\) range in which a particular value of α K is used, with \(\hat{s}_{1}=70\pm40~\mbox{GeV}^{2}\) and \(\hat{s}_{2}=324~\mbox{GeV}^{2}\). The variations of α K are given in the second and third column. The parameter \(\hat{s}_{2}\) is not varied, since the corresponding uncertainty is already covered by the α K variations

Since ground-state D mesons partly originate from decays of D and other excited mesons, the corresponding charm fragmentation function is softer than that measured using D mesons. From kinematic considerations [65], supported by experimental measurements [66], the expectation value for the fragmentation function of charm into \(D^{0, {\rm no}D^{*+}}\), D + and in the mix of charm hadrons decaying into muons, has to be reduced by ≈5 % with respect to that for D mesons. The values of α K for the fragmentation into ground state hadrons, used for the \(D^{0, {\rm no}D^{*+}}\), D + and μ measurements, have been re-evaluated accordingly and are reported in Table 2.

Transverse fragmentation is simulated assigning to charmed hadrons a transverse momentum, k T , with respect to the charm quark direction, according to f(k T )=k T exp(−2k T /〈k T 〉). The average 〈k T 〉 is set to 0.35±0.15 GeV [6772].

The fragmentation fractions of charm quarks into specific D mesons are listed in Table 3. They are obtained from the average of e + e and ep results [73]. The semi-leptonic branching fraction B(cμ) [52] is also given. The decay spectrum of leptons from charm decays is taken from [74].

Table 3 Charm fragmentation fractions to charmed mesons and the charm branching fraction to muons

To evaluate the extrapolation uncertainty on the extracted reduced cross section, \(\sigma_{\rm red}^{c\bar{c}}\), all the above parameters are varied by the quoted uncertainties and each variation is considered as a correlated uncertainty among the measurements to which it applies. The dominant contributions arise from the variation of the fragmentation function (average 3–5 %) and from the variation of the renormalisation and factorisation scales (average 5–6 %, reaching 15 % at lowest Q 2). In a few cases, the symmetric variation of model parameters results in an asymmetric uncertainty on the cross section. In such cases, the larger difference with respect to the default cross section is applied symmetrically as systematic uncertainty.

3.3 Common xQ 2 grid

Except for the H1 lifetime analysis [14], the values of \(\sigma_{\rm red}^{c\bar{c}}\) for individual measurements are determined at the 52 (x,Q 2) points of a common grid. The grid points are chosen such that they are close to the centre-of gravity in x and Q 2 of the corresponding \(\sigma_{\rm vis,bin}\) bins, taking advantage of the fact that the binnings used by the H1 and ZEUS experiments are similar. Prior to the combination, the H1 lifetime analysis measurements are transformed, when needed, to the common grid (x,Q 2) points using the NLO FFNS calculation [1922]. The resulting scaling factors are always smaller than 18 % and the associated uncertainties, obtained by varying the charm mass, the scales and the PDFs, are negligible. For all but five grid points at least two measurements enter into the combination.

3.4 Combination method

The combination of the data sets uses the χ 2 minimisation method developed for the combination of inclusive DIS cross sections [41, 43]. The χ 2 function takes into account the correlated systematic uncertainties for the H1 and ZEUS cross section measurements. For an individual data set, e, the χ 2 function is defined as

$$ \chi^2_{{\rm exp},e} (\boldsymbol{m},\boldsymbol{b} ) = \sum_i \frac{ (m^i - \sum_j \gamma^{i,e}_j m^i b_j - {\mu^{i,e}} )^2}{ (\delta_{i,e,{\rm stat}}\,\mu^{i,e} )^2+ (\delta_{i,e,{\rm uncor}}\,m^i )^2} + \sum_j b^2_j. $$
(5)

Here μ i,e is the measured value of \(\sigma_{\rm red}^{c\bar {c}}(x_{i},Q^{2}_{i})\) at an (x,Q 2) point i and \(\gamma^{i,e}_{j} \), \(\delta_{i,e,{\rm stat}} \) and \(\delta_{i,e,{\rm uncor}}\) are the relative correlated systematic, relative statistical and relative uncorrelated systematic uncertainties, respectively. The vector m of quantities m i expresses the values of the combined cross section for each point i and the vector b of quantities b j expresses the shifts of the correlated systematic uncertainty sources, j, in units of the standard deviation. Several data sets providing a number of measurements are represented by a total χ 2 function, which is built from the sum of the \(\chi^{2}_{{\rm exp},e}\) functions of all data sets

$$ \chi^2_{\rm tot} = \sum_e \chi^2_{{\rm exp},e}. $$
(6)

The combined reduced cross sections are given by the vector m obtained by the minimisation of \(\chi^{2}_{\rm tot}\) with respect to m and b. With the assumption that the statistical uncertainties are constant and that the systematic uncertainties are proportional to m i, this minimisation provides an almost unbiased estimator of m.

The double differential cross section measurements, used as input for the combination, are availableFootnote 4 with their statistical and systematic uncertainties. The statistical uncertainties correspond to \(\delta_{i,e,{\rm stat}}\) in Eq. (5). The systematic uncertainties within each measurement are classified as either point-to-point correlated or point-to-point uncorrelated, corresponding to \(\gamma^{i,e}_{j}\) and \(\delta_{i,e,{\rm uncor}}\), respectively. Asymmetric systematic uncertainties are symmetrised before performing the combination. The result is found to be insensitive to the details of the symmetrisation procedure.

In the present analysis the correlated and uncorrelated systematic uncertainties are predominantly of multiplicative nature, i.e. they change proportionally to the central values. In Eq. (5) the multiplicative nature of these uncertainties is taken into account by multiplying the relative errors \(\gamma^{i,e}_{j}\) and \(\delta_{i,e,{\rm uncor}}\) by the expectation m i.

In charm analyses the statistical uncertainty is mainly background dominated. Therefore it is treated as constant independent of m i. To investigate the sensitivity of the result on the treatment of the uncorrelated and, in particular, statistical uncertainty, the analysis is repeated using an alternative χ 2 definition in which only correlated uncertainties are taken as multiplicative while the uncorrelated uncertainties are treated as constant. In a third approach the statistical uncertainties are assumed to be proportional to the square root of m i. The differences between the results obtained from these variations and the nominal result are taken into account as an asymmetric procedural uncertainty and are added to the total uncertainty of the combined result in quadrature.

Correlations between systematic uncertainties of different measurements are accounted for. Experimental systematic uncertainties are treated as independent between H1 and ZEUS. Extrapolation uncertainties due to the variation of the charm quark mass and the renormalisation and factorisation scales, charm fragmentation as well as branching fractions are treated as correlated. All reduced cross section data from H1 and ZEUS are combined in one simultaneous minimisation, through which the correlated uncertainties are reduced also at (Q 2,x) points where only one measurement exists.

4 Combined charm cross sections

The values of the combined cross section \(\sigma_{\rm red}^{c\bar{c}}\) together with uncorrelated, correlated, procedural and total uncertainties are given in Table 4. In total, 155 measurements are combined to 52 cross-section measurements.

Table 4 The averaged reduced cross section of charm production, \(\sigma_{\rm red}^{c\bar{c}}\), as obtained from the combination of H1 and ZEUS measurements. The values of the cross section are presented together with uncorrelated (δ unc) correlated (δ cor) and procedural (δ proced) uncertainties. The total uncertainty (δ tot) is obtained by adding the correlated, uncorrelated and procedural errors in quadrature

The data show good consistency, with a χ 2-value per degree of freedom, \(n_{\rm dof}\), of \(\chi^{2}/n_{\rm dof}=62/103\), indicating that the uncertainties of the individual measurements have been estimated conservatively. The distributions of pulls (as defined in [43]) is shown in Fig. 1. No significant tensions are observed. For data with no correlated systematic uncertainties the pulls are expected to follow Gaussian distributions with zero mean and unit width. Correlated systematic uncertainties lead to narrowed pull distributions.

Fig. 1
figure 1

Pull distribution for the combined data samples (shaded histogram). RMS gives the root mean square of the distribution. The curve shows the result of a binned log-likelihood Gaussian fit

There are in total 48 sources of correlated systematic uncertainty, including global normalisations, characterising the separate data sets. The shifts and the reduction of the correlated uncertainties are given in Table 5. None of these systematic sources shifts by more than 1.2 σ of the nominal value in the averaging procedure. The influence of several correlated systematic uncertainties is reduced significantly in the result. For example the uncertainties from the vertex analyses due to the light quark background (H1) and due to the tracking (ZEUS) are reduced by almost a factor of two. The reductions can be traced mainly to the different charm tagging methods, and to the requirement that different measurements probe the same cross section at each (x,Q 2) point. In addition, for certain kinematic regions one measurement has superior precision and the less precise ones follow its trend through the fit. The reduction of systematic uncertainties propagates to the other average points, including those which are based solely on the less precise measurements.

Table 5 Sources of bin-to-bin correlated systematic uncertainties considered in the combination. For each source the shifts in units of standard deviations σ and the reduction factor of the uncertainty values are given. The systematic sources corresponding to the extrapolation uncertainties are highlighted in bold font. The second column shows the data sets (see Table 1) affected by each particular source

The cross section tables of the input data sets used in the analysis (see Sect. 3) together with the full information of the correlations among these cross section measurements can be found elsewhere.Footnote 5 The combined reduced cross section is presented in Fig. 2 as a function of x, in bins of Q 2, and compared to the input H1 and ZEUS data in Fig. 3. The combined data are significantly more precise than any of the individual input data sets. This is illustrated in Fig. 4, where the measurements for Q 2=18 GeV2 are shown. The uncertainty of the combined results is 10 % on average and reaches 6 % in the region of small x and medium Q 2. This is an improvement of about a factor of 2 with respect to each of the most precise data sets in the combination.

Fig. 2
figure 2

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature

Fig. 3
figure 3

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. For comparison, the input data are shown: the H1 measurement based on lifetime information of inclusive track production is represented by closed squares; the H1 measurements based on reconstruction of D mesons in HERA-I/HERA-II running periods are denoted by filled up (down) triangles; the ZEUS measurement using semileptonic decays into muons is represented by open circles; the ZEUS measurements based on reconstruction of D mesons are depicted by open squares (open triangles) for data collected in 1998–2000 (1996–1997) years; the ZEUS measurements based on reconstruction of D 0 (D +) mesons are shown by open diamonds (crosses). For presentation purpose each individual measurement is shifted in x

Fig. 4
figure 4

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for Q 2=18 GeV2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. For comparison, the input data are shown. For further details see Fig. 3

5 Comparison to theoretical predictions

Before proceeding to the QCD analysis including these data, it is instructive to compare them to various QCD predictions produced by different theory groups, for which the parameters are listed in Table 6. This comparison tests the interplay between the gluon and/or heavy flavour PDFs as obtained in different schemes and the charm treatment within each scheme (Sect. 2), as well as the related choice of the central value for the respective charm mass parameter. Some of the findings in this section can be cross-related to corresponding more detailed NLO QCD studies in Sect. 6. In addition, the effect of NNLO corrections is studied here. The full calculation of the heavy quark coefficient functions is available at \(O(\alpha_{s}^{2})\) only. The \(O(\alpha_{s}^{3})\) corrections listed in Table 6 correspond to partial resummation corrections applied in some kinematic ranges of charm production. Most predictions already contain some measured charm data from previous publications as input (see Table 6 for details).

Table 6 Calculations from different theory groups as shown in Figs. 58. The table shows the heavy flavour scheme used and the corresponding reference, the respective F 2(L) definition (Sect. 2), the value and type of charm mass used (Eq. (3)), the order in α S of the massive and massless parts of the calculation, the value of α s , the renormalisation and factorisation scale, and which HERA charm data were included in the corresponding PDF fit. The distinction between the two possible F 2(L) definitions is not applicable (n.a.) for \(\mathcal{O}(\alpha_{s})\) calculations

In Fig. 5 the reduced cross section \(\sigma_{\rm red}^{c\bar{c}}\) is compared with predictions of the MSTW group in the GM-VFNS at NLO and NNLO, using the RT standard [35, 36] and the RT optimised [40] interpolation procedure of the cross section at the charm production threshold. At NLO, the optimised prediction tends to describe the data better than the standard one at lower Q 2. The description of the data is improved in NNLO compared to NLO.

Fig. 5
figure 5

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. The data are compared to MSTW predictions at NLO (left panel) and NNLO (right panel). The predictions obtained using the standard (optimised) parametrisation are represented by the shaded bands (solid lines). The uncertainties for the optimised parametrisation are not yet available

In Fig. 6 the data are compared to the NLO predictions based on HERAPDF1.5 [75, 76] extracted in the RT standard scheme using as inputs the published HERA-I [43] and the preliminary HERA-II combined inclusive DIS data. For the central PDF set a charm quark mass parameter M c =1.4 GeV is used. The uncertainty bands of the predictions reflect the full uncertainties on the HERAPDF1.5 set. They are dominated by the uncertainty on M c  which is varied between 1.35 GeV and 1.65 GeV [43]. Within these uncertainties the HERAPDF1.5 predictions describe the data well. The central predictions are very similar to those of the MSTW group for the same scheme.

Fig. 6
figure 6

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. The data are compared to the NLO predictions based on HERAPDF1.5 extracted in the RT standard scheme. The line represents the prediction using M c =1.4 GeV. The uncertainty band shows the full PDF uncertainty which is dominated by the variation of M c

In Fig. 7 the data are compared to the predictions in the GM-VFNS by the NNPDF and CT collaborations. Both the NNPDF FONLL-A [37] and FONLL-B [38, 39] predictions describe the data fairly well at higher Q 2, while they fail to describe the data at lower Q 2. The description of the data at lower Q 2 is improved in the FONLL-C [38, 39] scheme. The CT predictions [26, 77] are based on the S-ACOT-χ heavy quark scheme. The NLO prediction, which is very similar to the FONLL-A scheme, describes the data well for Q 2>5 GeV2 but fails to describe the data at lower Q 2. Similar to the FONNL-C case the description of the data improves significantly at NNLO.

Fig. 7
figure 7

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. The data are compared to predictions by the NNPDF group (left panel) and the CTEQ group (right panel). The predictions from NNPDF2.1 in FONNL-A, -B and -C schemes are shown with their uncertainties (bands with different hatch styles). The CT10 NLO prediction with its uncertainties is shown by the shaded bands. The uncertainties on the CT10 NNLO (prel.) predictions are not yet available

In Fig. 8 the data are compared to the prediction of the ABM group in FFNS at NLO and NNLO, based on the running-mass scheme for both the coefficient functions and the PDFs [44, 45], which is a variant of an earlier analysis using the pole mass [78]. The uncertainties on the prediction include the uncertainties on m c , which dominate at small Q 2. The predictions at NLO and NNLO are very similar and describe the data well in the whole kinematic range of the measurement.

Fig. 8
figure 8

Combined reduced cross sections \(\sigma_{\rm red}^{c\bar{c}}\) (filled circles) as a function of x for fixed values of Q 2. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. The data are compared to predictions of the ABM group at NLO (hashed band) and NNLO (shaded band) in FFNS using the \(\overline{\text{MS}}\) definition for the charm quark mass

In summary, the best description of the data is achieved by predictions including partial \(O(\alpha_{s}^{3})\) corrections (MSTW NNLO and ABM NNLO). The predictions including \(O(\alpha_{s}^{2})\) terms in all parts of the calculation (NNPDF FONLL C, CT NNLO and ABM NLO) as well as the MSTW NLO optimal scheme also agree well with the data. The largest deviations are observed for predictions based on O(α s ) terms only (NNPDF FONLL A and CT NLO). As investigated in the next section, further differences can be partially explained by the different choices for the value of the respective charm quark mass parameter M c .

6 QCD analysis

The combined H1 and ZEUS inclusive ep neutral current and charged current DIS cross sections have been used previously to determine the HERAPDF1.0 parton density functions. In the current paper a combined NLO QCD analysis is performed using the reduced charm cross section together with the combined inclusive DIS cross sections [43]. Since the charm contribution to the inclusive DIS cross section is sizeable and reaches up to ≈30 % at high Q 2, this combined analysis is expected to reduce the uncertainties related to charm production inherent in all PDF extractions. In particular, the role of the charm quark mass m c (m c ) or the charm quark mass parameter M c , depending on the heavy flavour scheme, is investigated within all schemes discussed in Sect. 2.

The analysis is performed with the HERAFITTER [79] program, which is based on the NLO DGLAP evolution scheme [8085] as implemented in QCDNUM [86]. The invariant mass of the hadronic system is restricted to W>15 GeV, and the Bjorken scaling variable x is limited by the data to x≤0.65. In this kinematic range target mass corrections and higher twist contributions are expected to be small. In addition, the analysis is restricted to data with \(Q^{2}>Q^{2}_{\mathrm{min}}=3.5~\mbox{GeV}^{2}\) to assure the applicability of pQCD. The consistency of the input data sets and the good control of the systematic uncertainties enable the determination of the experimental uncertainties on the PDFs using the χ 2 tolerance of Δχ 2=1.

The following independent PDFs are chosen in the fit procedure: xu v (x), xd v (x), xg(x) and \(x\overline{U}(x)\), \(x\overline{D}(x)\), where \(x\overline{U}(x) = x\overline{u}(x)\), and \(x\overline{D}(x) = x\overline{d}(x) +x\overline{s}(x)\). Compared to the HERAPDF1.0 analysis, a more flexible parameterisation with 13 free parameters is used. At the starting scale Q 0 of the QCD evolution, the PDFs are parametrised as follows:

(7)
(8)
(9)
(10)
(11)

The normalisation parameters A g , \(A_{u_{v}}\), \(A_{d_{v}}\) are constrained by the sum rules. The parameter \(B_{\overline{U}}\) is set to \(B_{\overline{D}}\) and the constraint \(A_{\overline{U}}=A_{\overline{D}}(1-f_{s})\), with f s being the strangeness fraction at the starting scale, ensures the same normalisation for the \(\overline{u}\) and \(\overline{d}\) densities for x→0. The strangeness fraction is set to f s =0.31, as obtained from neutrino-induced di-muon production [87]. To ensure a positive gluon density at large x, the parameter \(C'_{g}\) is set to 25, in accordance with [35, 36].

The study involves variations of the charm mass parameter down to M c =1.2 GeV with the exception of the S-ACOT-χ scheme for which the M c  scan starts at M c =1.01 GeV. Since the starting scale Q 0 has to be smaller than M c , the fits are performed with setting \(Q^{2}_{0}=1.4~\mbox{GeV}^{2}\) and \(Q^{2}_{0}=1.0~\mbox{GeV}^{2}\), respectively. In order to keep the variation of M c independent from a Q 0 variation, this value for Q 0 is chosen irrespectively of the actual value of M c used during the variation procedure.

The renormalisation and factorisation scales are set to Q for the VFNS and for the light quark contribution in the FFNS and to \(\sqrt {Q^{2}+4m_{c,b}^{2}}\) for the contribution of a heavy quark in the FFNS.

For the strong coupling constant the values α s (M Z )=0.1176 [52] and \(\alpha_{s}^{n_{f}=3}(M_{Z})=0.105\) with n f =3 active flavours in the proton are used for the VFNS and for the FFNS, respectively. The definition of F 2,L and the α s order of the calculation are the same as those listed for the respective scheme in Table 6 at NLO (ACOT-full and ZM-VFNS see S-ACOT-χ).

For each heavy flavour scheme a number of PDF fits is performed with varying M c  from 1.2 GeV to 1.8 GeV. For each fit the χ 2(M c ) value is calculated and the optimal value, \(M_{c}^{\rm opt}\), of the charm quark mass parameter in a given scheme is subsequently determined from a parabolic fit of the form

$$ \chi^2(M_c) = \chi^2_{\rm min} + \biggl(\frac{M_c- M_c^{\rm opt} }{\sigma(M_c^{\rm opt})} \biggr)^2 $$
(12)

to the χ 2(M c ) values. Here \(\chi^{2}_{\rm min}\) is the χ 2 value at the minimum and \(\sigma(M_{c}^{\rm opt})\) is the fitted experimental uncertainty on \(M_{c}^{\rm opt}\). The procedure of this χ 2-scan is illustrated in Fig. 9 for the standard RT scheme when fitting only the inclusive HERA-I DIS data and when fitting these data together with \(\sigma_{\rm red}^{c\bar{c}}\). The inclusive NC and CC cross sections from HERA-I alone only weakly constrain M c ; the value of χ 2(M c ) varies only slowly with M c . Once the charm data are included, a clear minimum is observed, which then determines \(M_{c}^{\rm opt}\).

Fig. 9
figure 9

The values of χ 2(M c ) for the PDF fit to the combined HERA DIS data in the RT standard scheme. The open symbols indicate the results of the fit to inclusive DIS data only. The results of the fit including the combined charm data are shown by filled symbols

The systematic uncertainties on \(M_{c}^{\rm opt}\) are calculated from the following variations of the model assumptions:

  • the strangeness fraction is varied in the range 0.23<f s <0.38. In a recent publication the ATLAS collaboration [88] has observed f s =0.5. This value of f s is also tested and found to have only a negligible effect on the determination of \(M_{c}^{\rm opt}\).

  • the b -quark mass is varied between 4.3 GeV and 5 GeV with a default value of 4.75 GeV.

  • the minimum Q 2 value for data used in the fit, \(Q^{2}_{\rm min}\), is varied for the inclusive data from \(Q^{2}_{\rm min}=3.5~\mbox{GeV}^{2}\) to 5.0 GeV2. For the charm data this variation is not applied because it would significantly reduce the sensitivity of the analysis on M c . However, the full difference on the fitted value \(M_{c}^{\rm opt}\) obtained by using the cuts \(Q^{2}_{\rm min}=3.5~\mbox{GeV}^{2}\) or \(Q^{2}_{\rm min}=5~\mbox{GeV}^{2}\), is then taken as symmetric uncertainty due to the variation of \(Q_{\rm min}^{2}\).

  • the parameterisation uncertainty is estimated similarly to the HERAPDF1.0 procedure. To all quark density functions an additional parameter is added one-by-one such that the parameterisations are changed in Eq. (8) from Ax B⋅(1−x)C⋅(1+Ex 2) to Ax B⋅(1−x)C⋅(1+Dx+Ex 2) and in Eqs. (9)–(11) from Ax B⋅(1−x)C to either Ax B⋅(1−x)C⋅(1+Dx) or Ax B⋅(1−x)C⋅(1+Ex 2). Furthermore, the starting scale Q 0 is varied to \(Q_{0}^{2}=1.9~\mbox{GeV}^{2}\). The full difference on the fitted value \(M_{c}^{\rm opt}\), obtained by using \(Q_{0}^{2}=1.9~\mbox{GeV}^{2}\) and \(Q_{0}^{2}=1.4~\mbox{GeV}^{2}\) is then taken as symmetric uncertainty due to the variation of the starting scale Q 0. The total parameterisation uncertainty is obtained taking the largest difference in \(M_{c}^{\rm opt}\) of the above variations with respect to \(M_{c}^{\rm opt}\) for the standard parameterisation.

  • the strong coupling constant α s (M Z ) is varied by ±0.002.

For each scheme the assumptions in the fits are varied one by one and the corresponding χ 2 scan as a function of M c  is performed. The difference between \(M_{c}^{\rm opt}\) obtained for the default assumptions and the result of each variation is taken as the corresponding uncertainty. The dominant contribution arises from the variation of \(Q^{2}_{\rm min}\), while the remaining model and parameterisation uncertainties are small compared to the experimental error.

6.1 Extraction of \(M_{c}^{\mathrm{opt}}\) in the VFNS

The following implementations of the GM-VFNS are considered: ACOT full [29, 30] as used for the CTEQHQ releases of PDFs; S-ACOT-χ [3234] as used for the latest CTEQ releases of PDFs, and for the FONLL-A scheme [37] used by NNPDF; the RT standard scheme [35, 36] as used for the MRST and MSTW releases of PDFs, as well as the RT optimised scheme providing a smoother behaviour across thresholds [40]. The ZM-VFNS as implemented by the CTEQ group [29, 30] is also used for comparison. In all schemes, the onset of the heavy quark PDFs is controlled by the parameter M c in addition to the kinematic constraints.

In Fig. 10 the χ 2-values as a function of M c  obtained from PDF fits to the inclusive HERA-I data and the combined charm data are shown for all schemes considered. Similar minimal χ 2-values are observed for the different schemes, albeit at quite different values of \(M_{c}^{\rm opt}\). In Table 7 the resulting values of \(M_{c}^{\rm opt}\) are given together with the uncertainties, the corresponding total χ 2 and the χ 2-contribution from the reduced charm cross section measurements. The ACOT-full scheme yields the best global χ 2, while the best partial χ 2 for the charm data is obtained using the RT optimised scheme. The fits in the S-ACOT-χ scheme result in a very low value of \(M_{c}^{\rm opt}\) as compared to the other schemes.

Fig. 10
figure 10

The values of χ 2(M c ) for the PDF fit to the combined HERA inclusive DIS and charm measurements. Different heavy flavour schemes are used in the fit and presented by lines with different styles. The values of \(M_{c}^{\rm opt}\) for each scheme are indicated by the stars

Table 7 The values of the charm mass parameter \(M_{c}^{\rm opt}\) as determined from the M c scans in different heavy flavour schemes. The uncertainties of the minimisation procedure are denoted as ‘exp’, the model and parameterisation uncertainties are represented by ‘mod’ and ‘param’, respectively. Also the uncertainty due to α s variation is listed. The corresponding global and partial χ 2 are presented per degrees of freedom n dof and per number of data points n dp, respectively

In Fig. 11 the NLO VFNS predictions for \(\sigma_{\rm red}^{c\bar{c}}\) based on the PDFs evaluated using \(M_{c}=M_{c}^{\rm opt}\) of the corresponding scheme are compared to the data. In general the data are better described than when using the default values for M c  and the predictions of the different schemes become very similar for Q 2≥5 GeV2. Even the ZM-VFNS, which includes mass effects only indirectly [29, 30], yields an equally good description of \(\sigma_{\rm red}^{c\bar{c}}\) as the GM-VFNS, although it fails to describe more differential distributions of D ∗± meson production and the lowest Q 2 bin in Fig. 11, for which the ZM-VFNS cross section prediction is zero.

Fig. 11
figure 11

Combined measurements of \(\sigma_{\rm red}^{c\bar{c}}\) as a function of x for given values of Q 2 is shown by filled symbols. The error bars represent the total uncertainty including uncorrelated, correlated and procedural uncertainties added in quadrature. The data are compared to the results of the fit using different variants of the VFNS (represented by lines of different styles) choosing \(M_{C}=M_{c}^{\rm opt}\). The cross section prediction of the ZM-VFNS vanishes for Q 2=2.5 GeV2

6.2 Impact of the charm data on PDFs

In Fig. 12 the PDFs from a 13 parameter fit using the inclusive HERA-I data only are compared with the corresponding PDFs when including the combined charm data in the fit. For both of these fits the RT optimised VFNS is used. The total PDF uncertainties include the parameterisation and model uncertainties as described in Sect. 6 except for the uncertainties due to M c , which is treated as follows: in the fit based solely on the inclusive data a central value of M c =1.4 GeV is used with a variation in the range 1.35<M c <1.65 GeV, consistent with the treatment for HERAPDF1.0. For the fit including the combined charm cross sections this parameter is set to \(M_{c}^{\rm opt}\) with the corresponding uncertainties as obtained by the charm mass scan for the RT optimised VFNS (Table 7).

Fig. 12
figure 12

Parton density functions xf(x,Q 2) with \(f=g,u_{v},d_{v},\overline{u},\overline{d},\overline{s},\overline{c}\) for (a) valence quarks and gluon and for (b) sea anti-quarks obtained from the combined QCD analysis of the inclusive DIS data and \(\sigma_{\rm red}^{c\bar{c}}\) (dark shaded bands) in the RT optimised scheme as a function of x at Q 2=10 GeV2. For comparison the results of the QCD analysis of the inclusive DIS data only are also shown (light shaded bands). The gluon distribution function is scaled by a factor 0.05 and the \(x\overline{d}\) distribution function is scaled by a factor 1.1 for better visibility

By comparing the PDF uncertainties obtained from the analysis of the inclusive data only and from the combined analysis of the inclusive and charm data, the following observations can be made:

  • the inclusion of \(\sigma_{\rm red}^{c\bar{c}}\) in the fit does not alter the central PDFs significantly; the central PDFs obtained with the charm data lie well within the uncertainty bands of the PDFs based on the inclusive data only;

  • the uncertainties of the valence quark distribution functions are almost unaffected;

  • the uncertainty on the gluon distribution function is reduced, mostly due to a reduction in the parameterisation uncertainty coming from the constraints that the charm data put on the gluon through the \(\gamma g \to c\overline{c}\) process;

  • the uncertainty on the \(x\overline{c}\) distribution function is considerably reduced due to the constrained range of M c ;

  • the uncertainty on the \(x\overline{u}\) distribution function is correspondingly reduced because the inclusive data constrains the sum \(x\overline{U}=x\overline{u}+x\overline{c}\);

  • the uncertainty on the \(x\overline{d}\) distribution function is also reduced because it is constrained to be equal to \(x\overline{u}\) at low x;

  • the uncertainty on the \(x\overline{s}\) distribution function is not reduced because it is dominated by the model uncertainty on the strangeness fraction f s .

Similar conclusions hold also for the other schemes discussed in this paper.

6.3 Measurement of the charm quark mass

An NLO QCD analysis is performed in the FFNS of the ABM group [44, 45] to determine the \(\overline {\text{MS}}\) running charm quark mass m c (m c ) based on the inclusive neutral and charged current HERA-I DIS data and the charm cross section. For this purpose the coefficient functions as implemented in OPENQCDRAD [23, 24, 89] are used. The strong coupling constant is evolved with setting the number of active flavours to n f =3, using \(\alpha_{s}^{n_{f}=3}(M_{Z})=0.105\). The same minimisation procedure as for the VFNS analysis is applied and the resulting dependence of the χ 2 values from the QCD fits on the charm quark mass m c is shown in Fig. 13. The fit of the parabolic function, defined in Eq. (12), results in a value of

(13)

for the running charm mass in NLO. The errors correspond to the experimental, the model, parameterisation and α s dependent uncertainties. The same variations of the model and parameterisation assumptions are applied as for the results presented in Sect. 6.1 and discussed in Sect. 6. The data are well described by the FFNS calculations for m c (m c )=1.26 GeV with a total χ 2=627.7 for 626 degrees of freedom. The partial contribution from the charm data is χ 2=51.8 for 47 data points. The measured value of the running charm quark mass is consistent with the world average of m c (m c )=1.275±0.025 GeV [52] defined at two-loop QCD, based on lattice calculations and measurements of time-like processes. It also compares well to recent analyses [44, 45, 90] of DIS and charm data at NLO and NNLO.

Fig. 13
figure 13

The values of χ 2 for the PDF fit to the combined HERA DIS data including charm measurements as a function of the running charm quark mass m c (m c ). The FFNS ABM scheme is used, where the charm quark mass is defined in the \(\overline{\text{MS}}\) scheme

6.4 Impact of charm data on predictions for W ± and Z production at the LHC

The different series of PDFs obtained from fits to the HERA data by the M c scanning procedure in the different VFNSs are used to calculate cross section predictions for W ± and Z production at the LHC at \(\sqrt {s}=7~\mbox{TeV}\). These predictions are calculated for each scheme using the MCFM program [9193] interfaced to APPLGRID [94] for 1.2≤M c ≤1.8 GeV in 0.1 GeV steps, except for S-ACOT-χ for which the range 1.1≤M c ≤1.4 GeV is used.

The predicted W ± and Z production cross sections as a function of M c for the different implementations of the VFNS are shown in Fig. 14 and the values for the optimal choice \(M_{c}^{\rm opt}\) are summarised in Table 8. For all implementations of VFNS a similar monotonic dependence of the W ± and Z production cross sections on M c  is observed. This can be qualitatively understood as follows. A higher charm mass leads to stronger suppression of charm near threshold such that more light sea quarks are required to fit the inclusive data. More gluons are also needed to describe the HERA charm data. The need for more light sea quarks at the initial scale together with the creation of more sea quarks from gluon splitting at higher scales lead to an enhancement of the W ± and Z cross sections at the LHC.

Fig. 14
figure 14

NLO predictions for (a) W +, (b) W and (c) Z production cross sections at the LHC for \(\sqrt{s}=7~\mbox{TeV}\) as a function of M c used in the corresponding PDF fit. The different lines represent predictions for different implementations of the VFNS. The predictions obtained with PDFs evaluated with the \(M_{c}^{\rm opt}\) values for each scheme are indicated by the stars. The horizontal dashed lines show the resulting spread of the predictions when choosing \(M_{c}=M_{c}^{\rm opt}\)

Table 8 NLO VFNS predictions for Z/W ± cross sections at the LHC for \(\sqrt{s}=7~\mbox{TeV}\) using the MCFM program. The calculations are based on the PDF sets extracted in the corresponding schemes from the HERA data using \(M_{c}^{\rm opt}\) for the charm quark mass parameter. The listed cross section uncertainties correspond to the uncertainties on \(M_{c}^{\rm opt}\) only

There is a significant spread of about 6 % between the predictions if they are considered for a fixed value of M c , e.g. at M c =1.4 GeV. Similarly, the prediction typically varies by about 6 % when raising M c from 1.2 to 1.8 GeV. However, when using the \(M_{c}^{\rm opt}\) for each scheme the spread of predictions is reduced to 1.4 % for W , 1.8 % for Z and to 2 % for W + production.

This indicates that a good description of the HERA charm data correlates with a very similar flavour composition of the quark PDFs at LHC scales, almost independent of the chosen scheme. The uncertainty on the W ± and Z cross section predictions due to the choice of the charm mass can thus be considerably reduced. However, the charm mass parameter must be adjusted to a different value for each scheme, consistent with the HERA data, in order to achieve this result.

7 Conclusions

Measurements of open charm production in deep-inelastic ep scattering by the H1 and ZEUS experiments using different charm tagging methods are combined, accounting for the systematic correlations. The measurements are extrapolated to the full phase space using an NLO QCD calculation to obtain the reduced charm quark-pair cross sections in the region of photon virtualities 2.5≤Q 2≤2000 GeV2. The combined data are compared to QCD predictions in the fixed-flavour-number-scheme and in the general-mass variable-flavour-number-scheme. The best description of the data in the whole kinematic range is provided by the NNLO fixed-flavour-number-scheme prediction of the ABM group. Some of the NLO general-mass variable-flavour-number-scheme predictions significantly underestimate the charm production cross section at low Q 2, which is improved at NNLO.

Using the combined charm cross sections together with the combined HERA inclusive DIS data, an NLO QCD analysis is performed based on different implementations of the variable-flavour-number-scheme. For each scheme, an optimal value of the charm mass parameter, \(M_{c}^{\rm opt}\), is determined. These values show a sizeable spread. All schemes are found to describe the data well, as long as the charm mass parameter is taken at the corresponding optimal value. The use of \(M_{c}^{\rm opt}\) and its uncertainties in the QCD analysis significantly reduces the parton density uncertainties, mainly for the sea quark contributions from charm, down and up quarks.

The QCD analysis is also performed in the fixed-flavour-number-scheme at NLO using the \(\overline{\rm MS}\) running mass definition. The running charm quark mass is determined as \(m_{c}(m_{c})=1.26\pm 0.05_{\rm exp.}\pm0.03_{\rm mod}\pm0.02_{\rm param}\pm0.02_{\alpha _{s}}~\mbox{GeV}\). This value agrees well with the world average based on lattice calculations and on measurements of time-like processes.

The PDFs obtained from the corresponding QCD analyses using different M c are used to predict W ± and Z production cross-sections at the LHC. A sizeable spread in the predictions is observed, when the charm mass parameter M c is varied between 1.2 and 1.8 GeV, or when different schemes are considered at fixed value of M c . The spread is significantly reduced when the optimal value of M c is used for each scheme.