1 Introduction

The deep-inelastic-scattering (DIS) experiments traditionally have provided important tests of perturbative QCD (pQCD) and are essential to precisely determine the parton distribution functions (PDFs) of the nucleon. In addition to the numerous dedicated fixed-target DIS experiments that have been performed so far, the HERA accelerator used colliding beams of leptons (electrons and positrons) and protons to investigate the nucleon structure. The broad kinematic coverage of the HERA charge-current (CC) and neutral-current (NC) DIS data in terms of the negative virtuality \(Q^2\) of the exchanged vector boson and the Bjorken variable \(x_{\text {Bj}}\) is such that these data have significant impact on the determinations of the PDFs [1,2,3,4,5].

In the Standard Model (SM), the charm quark plays an important role in the investigation of the nucleon structure [6,7,8,9]. In the NC case, the photon-gluon fusion process for charm production was calculated at \({\mathscr {O}}(\alpha _s^2)\) with the full heavy-quark mass dependence included in the DIS hard cross sections [10, 11]. The heavy-quark mass effects in the CC process have been calculated to \({\mathscr {O}}(\alpha _s)\) in Refs. [12,13,14,15,16], and the recent work of Ref. [17] provides results up to \({\mathscr {O}}(\alpha _s^2)\). The large-\(Q^2\) contributions of heavy flavors to the \(xF_3\) structure function had already been computed in Ref. [18]. In many of the posited models which extend the SM, the coupling to “new physics” is proportional to the particle mass; hence, the heavy quarks will have an enhanced coupling and provide an optimal testing ground for these searches.

Heavy quarks also play a critical role in helping us fully characterize the SM, and the charm quark is especially useful in this respect as it can provide us direct access to the strange-sea quark distribution. The strange sea has been extensively investigated in a number of experiments including the associated production of a W boson with a charm-jet final state, which (at LO) arises from strange–gluon initial states [19,20,21,22,23,24,25,26]. Additionally, charm production in neutrino/antineutrino-nucleon DIS has been studied by a number of experiments including: CCFR [27], NuTeV [28], CHORUS [29], CDHSW [30] and NOMAD [31]. With a sign-selected beam (\(\nu /\bar{\nu }\)), these experiments can separately probe the strange s(x) and anti-strange \(\bar{s}(x)\) distributions. While the neutrino DIS experiments provide detailed information on the shape of the strange distribution, the normalization is a challenge, as that is tied to the beam flux. Separately, the HERMES collaboration used charged-lepton DIS production of charged kaons to provide a complementary extraction of \(s(x)+ \bar{s}(x)\) at LO [32]. Recently, charm production in CC DIS was measured for the first time in \(e^{\pm }p\) collisions by ZEUS [33].

Additionally, charm production mediated by electroweak gauge boson at hadron colliders provides important information on the strange- and charm-quark distributions, and is complementary to the DIS final-state charm-quark experiments [34]. The Tevatron measured the charm-quark cross section in association with a W boson at CDF [19, 35, 36] and D0 [20], but these results were limited by low statistics.

In lieu of significant experimental constraints, many global QCD analyses tie the strange distribution to the light-sea quarks via the relation \(s= \bar{s}=r_s \, \bar{d}\). While in principle \(r_s\) depends on both \(x_{\text {Bj}}\) and \(Q^2\), it is often set to a fixed value [37, 38].

Using inclusive leptonic decays of W and Z bosons, the ATLAS experiment has obtained a value of \(r_s=1.19 \pm 0.16\) at \(x= 0.023\) and \(Q^2_0 = 1.9\) GeV\(^2\) [39]. Additionally, using the cross section ratio for \(W^\pm +c\) final states they also find a comparably large value for \(r_s\) [23]. In contrast, CMS results generally prefer lower \(r_s\) values [22, 24]. However, a recent analysis using both ATLAS and CMS data suggests that the LHC data support unsuppressed strangeness in the proton. While the result is dominated by ATLAS, this is not in contradiction with the CMS data [22, 23, 39, 40].

Looking to the future, it is clearly important to reduce the uncertainty of the strange-quark PDF as we strive to make increasingly precise tests of the SM and search for what might lie beyond. The proposed Large Hadron Electron Collider (LHeC) program has the ability to provide high statistics measurements of electrons on both protons and nuclei across a broad kinematic range to address many of these outstanding questions.

In this investigation, we make use of the xFitter tools [41] (version 2.0.0) to study the present constraints on the strange-quark PDFs, and then use LHeC pseudodata [42] to infer how these might improve. Furthermore, as xFitter implements both fixed-flavor- and variable-flavor-number schemes, we can examine the impact of these different theoretical choices.

This paper is organized as follows. In Sect. 2 we outline the theoretical details of the different heavy-flavor schemes. In Sect. 3 we compare the theoretical predictions of the different schemes across the kinematic range, and examine the individual partonic contributions. In Sect. 4 we study the impact of the LHeC pseudodata on the PDFs using a profiling technique. In Sect. 5 we provide some discussion and summarize the results. Finally, in Appendix A we discuss some of the more subtle theoretical issues that we encounter at higher orders.

2 Theoretical predictions for CC charm production at the LHeC

The proposed Large Hadron Electron Collider (LHeC) [42] would collide a newly built electron beam with the LHC hadron beam at a center of mass energy of \(\sqrt{s} = \sqrt{4 E_e E_p}\); thus the 7 TeV proton beam on a 60 GeV electron beam provide \(\sqrt{s}\sim 1.3\) TeV. Compared to HERA, the LHeC extends the covered kinematic range by an order of magnitude in both \(x_{\text {Bj}}\) and \(Q^2\) with a nominal design luminosity of \(10^{33} \,\mathrm{cm}^{-2}\,\mathrm{s}^{-1}\).

Theoretical predictions are calculated for electroweak charged-current (CC) charm production in ep collisions at the LHeC at centre-of-mass energy \(\sqrt{s} = 1.3\) TeV, using a variety of heavy-flavor schemes. The predictions are provided for unpolarized beams in the kinematic range \(100< Q^2 < 100000\) GeV\(^2\), \(0.0001< x_{\text {Bj}} < 0.25\). They are calculated as reduced cross sections at different \(Q^2\), \(x_{\text {Bj}}\) and inelasticity (y) points. The covered y range is \(0.0024< y < 0.76\).

Experimentally, however, not charm quarks but charmed hadrons (or rather their decay products) are registered in the detectors. Therefore, extrapolation to the inclusive charm-production cross section has to be carried out in a model-dependent way. Furthermore, CC production of charm quarks in the final state can happen via both electroweak and QCD processes. The former leads to an odd number of charm quarks in the final state with the W boson having the same electric charge as the sum of the electric charges of final-state charm quarks, while the latter creates an even number of charm quarks with total electric charge equal to zero. If the electric charge of the tagged charm quark can be accessed experimentally (e.g. when reconstructing D mesons), the QCD contribution can be subtracted by taking the difference of the yields in the events with odd and even numbers of charm quarks, otherwise the QCD contribution can be estimated only in a model-dependent way.

The CC charm process directly depends on the CKM matrix [43]. Here, the CKM matrix elements \(V_{cd}\) and \(V_{cs}\) are particularly relevant and we use the values \(V_{cd} = 0.2252\) and \(V_{cs} = 0.9734\). Three different heavy-flavor schemes are employed, all including a full treatment of charm-mass effects up to NLO, i.e. \({\mathscr {O}}(\alpha _s)\); in the following we describe them in detail for the particular application to CC electron-proton reactions.

2.1 The heavy-flavor schemes

The standard “A” variant of the fixed-flavor number scheme (FFNS), which we identify as FFNS A, uses three light flavors in both PDFs and \(\alpha _s\) evolution for all scales, while heavy flavors (here, charm) are produced exclusively in the matrix-element part of the calculation. This scheme has been used for the PDF determinations and cross section predictions of the ABM(P) group [4, 44,45,46], as well as in the FF3A variant of the HERAPDF analysis [2], and implemented in xFitter through the OPENQCDRAD package [47].

Next, the “B” variant of the FFNS (FFNS B), known as the “mixed” or “hybrid” scheme [6] is also used. In this scheme, the number of active flavors is still fixed to three in the PDFs, relying exclusively on \({\mathscr {O}}(\alpha _s)\) fully massive matrix elements for charm production, while the number of flavors is allowed to vary in the virtual corrections of the \(\alpha _s\) evolution. Corrections to the \(\alpha _s\) evolution involving heavy-flavor loops are thus included and resummed to all orders, while no resummation is applied to other higher order corrections. This procedure will catch a fraction of the “large logs” which might spoil the fixed-flavor scheme convergence at very high scales, and is possible since the masses of the charm and beauty quarks provide natural cutoffs for infrared and collinear divergences. This scheme was used in the HERAPDF FF3B variant [2] and in applications of the HVQDIS program [6]. In general, the transition from the FFNS A to the FFNS B requires a readjustment of the treatment of matrix elements involving heavy-flavor loops. In the specific case of CC production, no such loops occur up to NLO (at NNLO they do), so that the same matrix elements can be used for both schemes; thus the only difference is in the \(\alpha _s\) evolution.

Finally, for the variable-flavor-number scheme (VFNS) we use the “B” variant of the fixed-order-next-to-leading-log scheme (FONLL-B) [48] which combines the NLO \({\mathscr {O}}(\alpha _s)\) massive matrix elements of the FFNS with the \({\mathscr {O}}(\alpha _s)\) massless results of the zero-mass variable-flavor-number scheme (ZM-VFNS), allowing the number of active flavors to vary with scale, and all-order next-to-leading log resummation of (massless) terms beyond NLO. It thus explicitly includes charm and beauty both in the PDFs and in the evolution of the strong coupling constant. Whenever terms would be double-counted in the merging of the two schemes, the massless terms are eliminated in favour of the massive ones. The FONLL scheme is commonly used by the NNPDF group [5] and implemented in xFitter through the APFEL package [49].

In summary, the schemes used are:

  • FFNS A: a NLO FFNS with \(n_f = 3\) at all scales, used with the ABMP16  [46] or HERAPDF2.0 FF3A  [2] NLO PDF sets.

  • FFNS B: a NLO FFNS with \(n_f = 3\) for the PDFs and variable \(n_f\) for \(\alpha _s\), used with the HERAPDF2.0 FF3B  [2] NLO PDF set.

  • FONLL-B : a VFNS used with the NNPDF3.1 NLO PDF set [5].

The PDF sets are available via the lhapdf interface (version 6.1.5) [50]. Note that we use the PDFs directly from the LHAPDF tables without any additional fitting, and we use the default values for \(\alpha _s(M_Z)\) and the quark masses \(m_{c,b}\). We’ve chosen this collection of PDFs as they are consistently extracted in the appropriate VFNS/FFNS schemes. Comparing the PDFs at the inital evolution scale, we find a typical variation of \(\lesssim 10\%\) for the quarks, and a bit larger for the gluon (which enters at NLO). These differences will lead to a small shift in the ratio plots, but will not affect the general features which are the focus of this paper, c.f., Fig. 2.

2.2 The reduced cross section

The reduced CC charm-production cross sections can be expressed as a linear combination of structure functions:

$$\begin{aligned} \sigma ^{\pm }_{\text {charm,CC}} = \frac{1}{2}\left( Y_{+}F_2^{\pm } {\mp } Y_{-}xF_3^{\pm } - y^2F_L^{\pm }\right) \,, \end{aligned}$$
(1)

with

$$\begin{aligned} Y_{\pm } = 1 \pm (1-y)^2 \,. \end{aligned}$$
(2)

In the quark-parton model, when we neglect the gluons, the structure functions become:

$$\begin{aligned} \begin{aligned} F_2^{+}&= xD + x\overline{U}, \\ F_2^{-}&= xU + x\overline{D},\\ F_L&= 0,\\ xF_3^{+}&= xD - x\overline{U}, \\ xF_3^{-}&= xU - x\overline{D}. \end{aligned} \end{aligned}$$
(3)

The terms xU, xD, \(x\overline{U}\) and \(x\overline{D}\) denote the sum of parton distributions for up-type and down-type quarks and anti-quarks, respectively.Footnote 1 The ± superscript on \(\sigma \) and F corresponds to the sign of \(W^\pm \). Below the b-quark mass threshold, these sums are related to the quark distributions as follows:

$$\begin{aligned} \begin{aligned} xU&= xu + xc , \\ x\overline{U}&= x\overline{u} + x\overline{c} , \\ xD&= xd + xs , \\ x\overline{D}&= x\overline{d} + x\overline{s}. \end{aligned} \end{aligned}$$
(4)

In the FFNS the charm-quark densities are zero. In the phase-space corners \(y \rightarrow 0\) and \(y \rightarrow 1\) and using the same quark-parton model approximation, we have the following asymptotic relations:

$$\begin{aligned} \begin{aligned} y \rightarrow 0: \quad \sigma ^{\pm }_{\text {charm,CC}}&= F_2^{\pm } = xD(x\overline{D}) + x\overline{U}(xU), \\ y \rightarrow 1: \quad \sigma ^{\pm }_{\text {charm,CC}}&= \frac{1}{2}(F_2^{\pm } {\mp } xF_3^{\pm }) = x\overline{U} (xU). \end{aligned} \end{aligned}$$
(5)

Thus the contribution from the strange-quark PDF is suppressed at high y.

2.3 xFitter implementation

All calculations are interfaced in xFitter and available with \(\overline{\text{ MS }}\) heavy-quark masses. The reference value of the \(\overline{\text{ MS }}\) charm mass is set to \(m_c(m_c) = 1.27\) GeV [43], and \(\alpha _s\) is set to the value used for the corresponding PDF extraction: \(\alpha _s(M_Z) = 0.1191\) for ABMP16 and \(\alpha _s(M_Z) = 0.118\) for NNPDF3.1. The renormalization and factorization scales are chosen to be \(\mu _\mathrm {r}^2 = \mu _\mathrm {f}^2 = Q^2\).

To estimate theoretical scale uncertainties, \(\mu _\mathrm {r}\) and \(\mu _\mathrm {f}\) are simultaneously varied up and down by a factor of two. In the case of the FONLL-B calculations, also the independent \(\mu _r\) and \(\mu _f\) variations are checked. Furthermore, the PDF uncertainties are propagated to the calculated theoretical predictions, while the uncertainties arising from varying the charm mass \(m_c(m_c) = 1.27 \pm 0.03\) GeV by one standard deviation are smaller than \(1\%\) and therefore neglected. In the FONLL-B scheme, as a cross check, the calculation was performed with the pole charm mass \(m_c^{\text {pole}} = 1.51\) GeV which is consistent with the conditions of the NNPDF3.1 extraction [5]. The obtained theoretical predictions differ from the ones calculated with \(m_c(m_c) = 1.27\) GeV by less than \(1\%\). The total theoretical uncertainties are obtained by adding in quadrature scale and PDF uncertainties.

Fig. 1
figure 1

The theoretical predictions with their total uncertainties for charm CC production at the LHeC as a function of \(x_{\text {Bj}}\) for different values of \(Q^2\) calculated in the FFNS A and FONLL-B schemes. The bottom panels display the theoretical predictions normalized to the nominal values of the FFNS A predictions

3 Comparison of theoretical predictions

We now provide some numerical comparisons of the heavy-flavor schemes using their separate input conditions and associated PDF sets. Caution is necessary in these comparisons as the PDF sets are extracted with different input assumptions, data sets, and tolerance criteria; this is, in part, why we shall separately display the \(\mu _\mathrm {r}\), \(\mu _\mathrm {f}\) and PDF uncertainties in the following.

Fig. 2
figure 2

The theoretical predictions with their total uncertainties for charm CC production at the LHeC as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) calculated in the FFNS A and FONLL-B schemes. The bottom panels display the theoretical predictions normalized to the nominal values of the FFNS A predictions

Fig. 3
figure 3

The theoretical predictions with their total uncertainties for charm CC production at the LHeC as a function of y for different values of \(Q^2\) calculated in the FFNS A and FONLL-B schemes. The bottom panels display the theoretical predictions normalized to the nominal values of the FFNS A predictions

3.1 Comparison of theoretical predictions in the FFNS A and FONLL-B schemes

Figures 1, 2 and 3 show theoretical predictions for the FFNS A and FONLL-B schemes calculated as described in the previous sections with their total uncertainties. The FFNS A and FONLL-B results agree reasonably well within uncertainties in the bulk of the phase space. However, in phase-space corners such as \(Q^2 \gtrsim 10000\) GeV\(^2\) or small y the predictions in the two schemes differ by more than \(50\%\), exceeding the theoretical uncertainties.

To examine these differences further, in Fig. 4 we separately compute PDF and scale uncertainties (setting \(\mu _\mathrm {r}=\mu _\mathrm {f}=\mu \)) of the charm CC cross section as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) calculated in the FFNS A and FONLL-B scheme.

Comparing the two schemes, the larger variation of the FONLL-B scheme reflects the larger PDF uncertainty of the underlying PDF sets used: ABMP16 for FFNS A and NNPDF3.1 for FONLL-B. This difference is most evident in Fig. 4 which specifically separates out the PDF uncertainty, and reflects the independent inputs and assumptions used in the different PDF extractions.

Examining the results of Fig. 4, we also observe some other interesting features. For both of the calculations, the PDF uncertainties are relatively stable across the \(Q^2\) range for fixed \(x_{\text {Bj}}\), but tend to increase at larger \(x_{\text {Bj}}\) values. As is well known, in pQCD calculations the effect of scale variations is indicative of the convergence of the series. We observe that the scale uncertainties for the FONLL-B scheme uniformly decrease with increasing \(Q^2\). For the FFNS A scheme, the scale uncertainties decrease for small \(x_{\text {Bj}}\) values but increase with \(Q^2\) at intermediate values of \(x_{\text {Bj}}\). Additional details are shown in Fig. 5 where we separately vary \(\mu _\mathrm {r}\) and \(\mu _\mathrm {f}\) for the FONLL-B scheme. Here we note that the uncertainty associated to \(\mu _\mathrm {r}\) is very small and the total scale uncertainty is dominated by the variations of \(\mu _\mathrm {f}\) which is tied to the PDFs, \(f_i(x,\mu _\mathrm {f})\). For the FFNS A in xFitter, it is not possible to separately vary \(\mu _\mathrm {r}\) and \(\mu _\mathrm {f}\) in the current implementation, so the separate uncertainties can only be inferred by comparison to the FONLL-B case.

Fig. 4
figure 4

Relative theoretical uncertainties of charm CC predictions for the LHeC as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) calculated in the FFNS A and FONLL-B schemes. The PDF and scale uncertainties are shown separately

Fig. 5
figure 5

The impact of separate scale variations on charm CC predictions for the LHeC as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) calculated in the FFNS A and FONLL-B schemes

Fig. 6
figure 6

The theoretical predictions for CC charm production at the LHeC as a function of \(Q^2\) (\(x_{\text {Bj}} \), y) for different values of \(x_{\text {Bj}} \) (\(x_{\text {Bj}} \), \(Q^2\)) obtained using the HERAPDF2.0 PDF sets in the FFNS A, FFNS B and FONLL-B schemes. The bottom panels display the theoretical predictions normalized to the nominal values of the FFNS A predictions

3.2 Additional comparisons

To further explore whether the differences between the two sets of theoretical predictions are due to the different treatment of heavy quarks or to the different PDF sets, theoretical calculations in FFNS A and FONLL-B are repeated with the HERAPDF2.0 PDF sets extracted from the HERA DIS data [2]. Predictions in the FFNS B scheme are also produced using the HERAPDF2.0 FF3B PDF set and the FFNS B matrix elements, which are equivalent to the FFNS A matrix elements at NLO for CC charm production. The results are displayed in Fig. 6. The differences between FFNS A and FONLL-B are similar to those displayed in Figs. 1, 2 and 3 and demonstrate that these differences arise from the different treatment of the heavy quarks in the two schemes. The FFNS B predictions lie between the FFNS A and FONLL-B predictions, indicating that a large part of the difference is due to the different treatment of heavy quarks in the running of \(\alpha _s\) at high \(x_{\text {Bj}}\) or low y.

Furthermore, to investigate the impact of the NNLO corrections available at \(Q \gg m_c\) for the FFNS calculation, approximate NNLO predictions are obtained using the ABMP16 NNLO PDF set [4]. The results for the cross section as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) are shown in Fig. 7, where they are compared to the NLO FFNS A predictions from Fig. 2. The approximate NNLO corrections do not exceed \(\sim 10\%\) and thus cannot account for the differences between the FFNS A and FONLL-B theoretical predictions. Similar results are observed for the cross sections as functions of other kinematic variables.

Fig. 7
figure 7

The theoretical predictions for CC charm production at the LHeC as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\) calculated in the FFNS A scheme at NLO and approximate NNLO. The bottom panels display the theoretical predictions normalized to the nominal values of the FFNS A NLO predictions

To better understand the differences between the FFNS and VFNS calculations, Fig. 6 is particularly instructive. We see that at low \(Q^2\) the FFNS (FFNS A and FFNS B) and VFNS (FONLL-B) results agree within uncertainties (as demonstrated in Fig. 2). When the scale \(\mu \) is below the charm-threshold scale \(\mu _c\) (typically taken to be equal to \(m_c(m_c)\)) the charm PDFs vanish and the FFNS and VFNS reduce to the same result.Footnote 2 For increasing scales, the VFNS resums the \(\alpha _s\ln (\mu ^2/\mu _c^2)\) contributions via the DGLAP evolution equations and the FFNS and VFNS will slowly diverge logarithmically. This behavior is observed in Fig. 6 and is consistent with the characteristics demonstrated in Ref. [52].

More precisely, Ref. [52] used a matched set of \(n_f=3\) and \(n_f=5\) PDFs to study the impact of the scheme choice at large scales. They found that the resummed contributions in the VFNS yielded a larger cross section than the FFNS (the specific magnitude was x-dependent), and that for \(Q^2\) scales more than a few times the quark mass, the differences due to scheme choice exceeded the differences due to (estimated) higher-order contributions. Thus, we have identified the source of the scheme differences at large \(Q^2\).

The source of the scheme differences at large \(x_{\text {Bj}} \) is a bit more subtle. The VFNS includes a resummation of higher-order logarithms of the form \(\alpha _s \ln (\mu ^2/\mu _c^2 )\). In Fig. 18 of the Appendix we display the separate contributions of the VFNS for a choice of \(\{x_{\text {Bj}},Q^2\}\); the difference between the LO and SUB curves is indicative of the additional contribution of the resummed logarithms. This contribution depends on the particular \(x_{\text {Bj}} \) value as well as \(Q^2\) (c.f., Fig. 11 of Ref. [52]). Thus, it is a balance between the LO and SUB contributions which determines the difference between the VFNS and FFNS; hence, the behavior is not just a simple factor. In Fig. 6b) we observe that for \(Q^2\) values not too large (\(Q^2 \lesssim 4924~\mathrm{GeV}^2\)), the VFNS is above the FFNS A result for \(x_{\text {Bj}} \gtrsim 0.1\). Interestingly, we see the FFNS B scheme is quite similar to the VFNS in this kinematic region. For larger scales the patterns are not so clear, as the large \(Q^2\) effects discussed previously now complicate the situation.

3.3 Contributions from different partonic subprocesses

The fundamental difference between the FFNS and the VFNS is the treatment of the heavy partons, the charm in particular. In the FFNS the charm is not included in the PDFs as an active parton, so charm quarks only arise from gluon splitting, \(g\rightarrow c \bar{c}\). In contrast, the VFNS does include the charm as an active partonic flavor, and thus allows for charm-initiated subprocesses. To better appreciate these differences, we will study the individual partonic contributions to the cross section as functions of the kinematic variables \(x_{\text {Bj}}\), \(Q^2\), and y.

Fig. 8
figure 8

The partonic subprocesses for charm CC production cross sections in the FFNS A (left) and FONLL-B (right) schemes as a function of \(x_{\text {Bj}}\) for different values of \(Q^2\)

Fig. 9
figure 9

The partonic subprocesses for charm CC production cross sections in the FFNS A (left) and FONLL-B (right) schemes as a function of \(Q^2\) for different values of \(x_{\text {Bj}}\)

Fig. 10
figure 10

The partonic subprocesses for charm CC production cross sections in the FFNS A (left) and FONLL-B (right) schemes as a function of y for different values of \(Q^2\)

Figures 8, 9 and 10 show the contributions from separate partonic subprocesses to the CC charm production cross section in the FFNS A and FONLL-B schemes as a function of: \(x_{\text {Bj}}\) for different values of \(Q^2\), \(Q^2\) for different values of \(x_{\text {Bj}}\), and y for different values of \(Q^2\), respectively.

In these figures we observe that the gluon contribution to the FFNS is strikingly similar to the charm contribution to the VFNS. This is explained by the fact that in the FFNS the charm is present only in the final state and produced predominantly in the hard process \(W^+ g \rightarrow c\bar{s}\). In contrast, in the VFNS the charm is present also in the initial state and mainly produced by \(g\rightarrow c\bar{c}\) collinear splitting through DGLAP evolution. The fundamental underlying process is (and has to be) the same in both the FFNS and VFNS, but the factorization boundary between PDFs and hard scattering cross section, \(\hat{\sigma }\otimes f\), (determined by the scale \(\mu \) and the scheme choice) is different.Footnote 3

These figures highlight another interesting feature of the QCD theory; we observe that for the VFNS the gluon contribution (green curves) can become negative in particular kinematic regions.Footnote 4 This is because in the VFNS we combine the gluon-boson fusion process (the NLO terms of Figs. 16 and 17) with the counter-term (the SUB terms), and this combination can be negative. This behavior underscores the fact that the renormalization scale \(\mu \) is simply “shuffling” contributions among the separate sub-pieces, but the total physical cross section remains positive and stable, cf., Fig. 18 and Ref. [54]. This is a triumph of the QCD theory.

Next, turning our attention to the strange PDF contribution, it is notable that the FFNS and VFNS behave qualitatively very similar as functions of \(Q^2\), \(x_{\text {Bj}}\), and y. In particular, we observe that the strange fraction increases for \(x_{\text {Bj}}\) and decreases for \(Q^2\) and y. In particular, at high y the strange PDF contribution drops to zero in favor of the gluon or charm quark PDFs (see Fig. 10 and Eq. (5)). Similar phenomena (although less pronounced) are observed at low \(x_{\text {Bj}}\) and/or high \(Q^2\). In these phase-space regions, the dominant contributions to the cross section are proportional to the gluon PDF in the FFNS or to the charm-quark PDFs in the VFNS.

Finally, we note that in Fig. 9 for the FFNS A, the gluon contribution at high \(x_{\text {Bj}}\) (\(x_{\text {Bj}} \sim 0.25\)) is minimal throughout the \(Q^2\) range. To ensure this is not an artifact of either the FFNS A or the ABMP16 set, we regenerated these curves (not shown) in the FFNS B with the HERAPDF2.0 FF3B set and found the same behaviour; hence, this feature is truly a characteristic of the FFNS in the high-\(x_{\text {Bj}}\) region.

4 PDF constraints from charm CC pseudodata

Now we turn to examine how the LHeC can reduce the PDF uncertainties and thus improve our predictive power.

The impact of charm CC cross section measurements at the LHeC on the PDFs is quantitatively estimated using the profiling technique [55]. This technique is based on minimizing the \(\chi ^2\) between data and theoretical predictions taking into account both experimental and theoretical uncertainties arising from PDF variations. Two NLO PDF sets were chosen for this study: ABMP16  [46] and NNPDF3.1  [5]. All PDF sets are provided with uncertainties in the format of eigenvectors. In the presence of strong constraints (the LHeC data is very precise), it is preferable to use the eigenvector representation as only a few MC replicas would survive the Bayesian reweighting.

4.1 The CC charm pseudodata

For this study, pseudodata for charm CC production cross section differential in \(Q^2\) and \(x_{\text {Bj}}\) and corresponding to an integrated luminosity of \(100\mathrm{~fb}^{-1}\) [42, 56] and polarization \(P=-0.8\) are used. Theoretical predictions are calculated at NLO in pQCD both in the FFNS A with number of active flavors \(n_f = 3\) and in the FONLL-B scheme. The charm-mass reference value in the \(\overline{\text{ MS }}\) scheme is set to \(m_c(m_c) = 1.27\) GeV and \(\alpha _s\) is set to the value used for the corresponding PDF extraction. The renormalization and factorization scales are chosen to be \(\mu _\mathrm {r}^2 = \mu _\mathrm {f}^2 = Q^2\).

The \(\chi ^2\) value is calculated as follows:

$$\begin{aligned} \begin{aligned} \chi ^2&= \mathbf {R}^{T}_{} \mathbf {Cov}^{-1}_{} \mathbf {R}_{} + \sum _{\beta } b_{\beta ,\mathrm{th}}^2 \ , \\ \mathbf {R}&= \mathbf {D} - \mathbf {T} - \sum _{\beta } \varGamma ^{}_{\beta ,\mathrm{th}} b_{\beta ,\mathrm{th}} \ , \end{aligned} \end{aligned}$$
(6)

where \(\mathbf {D}\) and \(\mathbf {T}\) are the column vectors of the measured (data) and predicted (theory) values, respectively. The correlated theoretical PDF uncertainties are included using the nuisance parameters \(b_{\beta , \mathrm{th}}\) with their influence on the theory predictions described by \(\varGamma _{\beta ,\mathrm{th}}\), where the index \(\beta \) runs over all PDF eigenvectors. For each nuisance parameter a penalty term is added to the \(\chi ^2\), representing the prior knowledge of the parameter. No theoretical uncertainties except the PDF uncertainties are considered; the PDF Hessian uncertainties are treated symmetrically. The full covariance matrix \(\mathbf{Cov }\) representing the statistical and systematic uncertainties of the data is used in the fit. The statistical and systematic uncertainties are treated as additive, i.e. they do not change in the fit. The systematic uncertainties are assumed uncorrelated between bins.

The values of the nuisance parameters at the minimum, \(b^\mathrm{min}_{\beta ,\mathrm{th}}\), are interpreted as optimized, or profiled, PDFs, while uncertainties of \(b^\mathrm{min}_{\beta ,\mathrm{th}}\) determined using the tolerance criterion of \(\varDelta \chi ^2 = 1\) correspond to the new PDF uncertainties. The profiling approach assumes that the new data are compatible with the theoretical predictions using the existing PDFs, such that no modification of the PDF fitting procedure is needed. Under this assumption, the central values of the measured cross sections are set to the central values of the theoretical predictions.

4.2 The profiled PDFs

The profiling study is performed using two sets of LHeC charm CC pseudodata:

  • the full set,

  • a restricted set with data points for which the difference between the FFNS A and FONLL-B are smaller than the present PDF uncertainties. The latter is taken for simplicity as the sum of the ABMP16 and NNPDF3.1 uncertainties, but for the most data points it is dominated by the NNPDF3.1 uncertainties (see Fig. 4).

Given the sizable differences observed between the FFNS A and FONLL-B predictions, the study with the restricted data set (also referred to as ‘with cuts’) aims to check whether or not model independent constraints on the strange PDF can be extracted using the charm CC reaction at LHeC. The two sets of data points are shown in Fig. 11 as functions of \(Q^2\) and \(x_{\text {Bj}}\).

The comparison between ABMP16 and NNPDF3.1 is insightful as ABMP16 represents a more restricted parametrization. For the HERAPDF2.0 set, the strange PDF was not fit directly, but computed via the relation \(f_s = \bar{s}/(\bar{s}+\bar{d})=0.4\pm 0.1\) and the uncertainty was approximated using the variation on \(f_s\); hence, these uncertainties are not the same as the Hessian diagonalized eigenvectors, so we will not profile the HERAPDF2.0 PDF set.

The original and profiled ABMP16 and NNPDF3.1 PDF uncertainties are shown in Figs. 12, 13, 14 and 15. The uncertainties of the PDFs are presented at the scales \(\mu _\mathrm {f}^2=100\) GeV\(^2\) and \(\mu _\mathrm {f}^2=100000\) GeV\(^2\). A strong impact of the charm CC pseudodata on the PDFs is observed for both PDF sets. In particular, the uncertainties of the strange PDF are strongly reduced once the pseudodata are included in the fit. Also the gluon PDF uncertainties are decreased. Furthermore, in the case of the NNPDF3.1 set, the charm PDF uncertainties are reduced significantly. For all PDF sets, only small differences can be noticed between the PDF constraints obtained using the full or restricted set because the whole \(x_{\text {Bj}}\) range is covered in both cases (see Fig. 11) despite the fact that the number of data points in the restricted set is roughly half of the total number of data points.

Fig. 11
figure 11

The full (\(\varDelta _{\mathrm{scheme}} < \varDelta _{\mathrm{PDF}}\), \(\varDelta _{\mathrm{scheme}} > \varDelta _{\mathrm{PDF}}\)) and restricted (\(\varDelta _{\mathrm{scheme}} < \varDelta _{\mathrm{PDF}}\)) sets of data points which are used for PDF profiling

Additionally, in the case of the NNPDF3.1 set, it is possible to check the constraints on the strange quark and anti-quark distributions separately, because no assumption \(s=\bar{s}\) is used in NNPDF3.1. The LHeC \(e^{-}p\) pseudodata provide direct constraints only on \(\bar{s}\). Nevertheless due to the apparently strong correlation between s and \(\bar{s}\) in the NNPDF3.1 fit, quite strong constraints are present on both the s and \(\bar{s}\) distributions once the direct constraints on \(\bar{s}\) are provided by the LHeC pseudodata. However, only mild constraints are put on the ratio \(s/\overline{s}\). This indicates that for precise determination of \(s/\overline{s}\) both \(e^{-}p\) and \(e^{+}p\) data will be needed.

Fig. 12
figure 12

The relative strange (top left), gluon (top right), sea quark (middle left), u valence quark (middle right) and d valence quark (bottom) PDF uncertainties at \(\mu _\mathrm {f}^2=100\) GeV\(^2\) of the original and profiled ABMP16 PDF set

Fig. 13
figure 13

The relative strange (top left), gluon (top right), sea quark (middle left), u valence quark (middle right) and d valence quark (bottom) PDF uncertainties at \(\mu _\mathrm {f}^2=100000\) GeV\(^2\) of the original and profiled ABMP16 PDF set

Fig. 14
figure 14

The relative strange quark (a), strange anti-quark (b), and ratio \(s/\overline{s}\) (c), gluon (d), sea quark (e), u valence quark (f), d valence quark (g) and charm quark (h) PDF uncertainties at \(\mu _\mathrm {f}^2=100\) GeV\(^2\) of the original and profiled NNPDF3.1 PDF set

Fig. 15
figure 15

The relative strange quark (a), strange anti-quark (b), and ratio \(s/\overline{s}\) (c), gluon (d), sea quark (e), u valence quark (f), d valence quark (g) and charm quark (h) PDF uncertainties at \(\mu _\mathrm {f}^2=100000\) GeV\(^2\) of the original and profiled NNPDF3.1 PDF set

A recent study [25] has examined potential improvements in the PDFs for both the LHeC and HL-LHC facilities with both CC and NC. Our improvement in the strange PDF is comparable, while additional channels of Ref. [25] yields improved constraints on the gluon.

Comparing the results of profiled PDFs in the FFNS and the VFNS, we find both analyses are able to significantly improve the constraints on the strange quark PDF. This result gives us confidence that the general features we observe here are independent of the details of the heavy flavor scheme.

5 Discussion and summary

The recent performance of the LHC has exceeded expectations and produced an unprecedented number of precision measurements to be analyzed; thus, it is essential to improve the theoretical calculations to match. The uncertainty for many of these precision measurements stems primarily from the PDFs. Hence, our ability to measure fundamental parameters of the Standard Model (SM), such as the W boson mass and \(\sin ^2\theta _W\), ultimately comes down to how accurately we determine the underlying PDFs [57]. Additionally, our ability to characterize and constrain SM processes can indirectly impact beyond-standard-model (BSM) signatures.

We have focused on the strange-quark distribution which, at the LHC, can have a significant impact on the W / Z cross section: one of the “standard candle” measurements. If we can reduce the uncertainty for these predictions, we can set stringent limits on any admixture of physics at higher scales. Unfortunately, at present the strange PDF has a comparably large uncertainty because measurements from the LHC and HERA, as well as older fixed-target experiments, do not seem to provide a definitive result for this flavor component.

This situation has prompted us to examine the CC DIS charm production at the LHeC to determine the impact of this data set on the PDF uncertainty. We considered the LHeC as this high-energy ep / A facility could potentially run in parallel with the LHC and provide insights into these issues at low x and high \(Q^2\) in advance of a FCC program.

This case study of the CC DIS charm production at the LHeC provides a practical illustration of the many features of xFitter. As the xFitter framework is designed to be a versatile open-source software framework for the determination of PDFs and the analysis of QCD physics, we can readily adapt this tool to address the impact and influence of new data sets. Furthermore, as both FFNS and VFNS calculations are implemented, we can use xFitter as a theoretical “laboratory” to study the resummation of large logarithms and multi-scale issues. We have outlined some of these issues in the Appendix. In particular, the CC DIS charm production involves a flavor-changing \(W^\pm \) boson, multiple quark masses enter the calculation, and this introduces some subtle theoretical issues to properly address the disparate mass and energy scales.

Using the xFitter framework, we find that the LHeC can provide strong constraints on the strange-quark PDF, especially in the previously unexplored small-\(x_{\text {Bj}}\) region.Footnote 5 A large reduction of uncertainties is observed also when restricting the input data to the kinematic range where the differences between the FFNS A and FONLL-B schemes are not larger than the present PDF uncertainties, indicating that the obtained PDF constraints are stable and independent of the particular heavy-flavor scheme. As noted above, a reduction of the strange-PDF uncertainties influences the W / Z production, and thus the Higgs production; hence, the LHeC CC DIS charm production data represent a valuable addition for the future global PDF fits.

However, since charm CC production in \(e^{-}p\) collisions mostly probe \(\bar{s}\), only mild constraints are put on the ratio \(s/\overline{s}\) using the NNPDF3.1 PDF set as reference; therefore for a precise determination of this ratio, both \(e^{-}p\) and \(e^{+}p\) data will be needed.

In conclusion, we find that CC DIS charm production at the LHeC can provide strong constraints on the strange PDF which are complementary to the current data sets. As the PDF uncertainty is the dominant factor for many precision analyses, a reduction of these uncertainties will allow for more accurate predictions which can be used to constrain both SM and BSM physics processes.