1 Introduction and motivation

Proton-proton collisions at the LHC produce a high-intensity collimated flux of neutrinos. These neutrinos are characterised by the largest energies ever achieved in laboratory experiments, reaching up to several TeV [1]. Due to the lack of dedicated instrumentation in the far-forward region, until recently these neutrinos avoided detection. The recent observation of LHC neutrinos [2,3,4] by the FASER [5, 6] and SND@LHC [7, 8] far-forward experiments demonstrates that this hitherto discarded beam can now be deployed for physics studies. Beyond the ongoing Run III, a dedicated suite of upgraded far-forward neutrino experiments would be hosted by the proposed Forward Physics Facility (FPF) [9, 10] operating concurrently with the High-Luminosity LHC (HL-LHC) [11, 12]. Current and future LHC neutrino experiments enable unprecedented scientific opportunities for particle and astroparticle physics both within the Standard Model and beyond it, as summarised in [9, 10] and references therein.

Measurements of neutrino structure functions [13,14,15] in deep-inelastic scattering (DIS) are sensitive probes of the parton distributions (PDFs) of nucleons and nuclei [16,17,18], in particular concerning (anti)quark flavour separation and strangeness [19,20,21,22]. Constraints arising from charged-current neutrino scattering provide information on complementary flavour combinations as compared to neutral-current charged-lepton DIS. Several experiments have measured neutrino DIS structure functions over a wide range of energies, and neutrino data from CHORUS [23], NuTeV [24], CCFR [25], NOMAD [26], CDHS [27], and other experiments are routinely included in global determinations of proton [28,29,30] and nuclear PDFs [31,32,33].

As compared to previous neutrino DIS experiments, neutrino scattering at the LHC involves up to 10-fold higher energies. Furthermore, large event rates are expected, with up to one million muon neutrinos interacting at the FPF detectors [9, 10]. Initial estimates [10] indicate that an extension of the coverage of available neutrino DIS data by an order of magnitude both at small-x and at large-\(Q^2\) should be possible. However, quantitative projections for the kinematic reach and experimental accuracy expected at current and future LHC neutrino experiments are not available. The lack of these projections has prevented detailed studies assessing the impact of LHC neutrino data in global analyses of proton and nuclear PDFs, comparable to those performed for the HL-LHC [11, 34], the Electron-Ion Collider (EIC) [35,36,37], and the Large Hadron Electron Collider (LHeC) [38,39,40].

Here we bridge this gap by quantifying the expected impact of LHC neutrino structure functions on proton and nuclear PDFs. To this end, we produce simulations for FASER\(\nu \) and SND@LHC at Run III as well as for the proposed FPF experiments [9, 10, 41, 42], FLArE, AdvSND, and FASER\(\nu \)2. For each experiment, we determine the expected event yields in bins of \((x,Q^2,E_\nu )\) satisfying acceptance and selection cuts, generate pseudo-data for inclusive and charm structure functions, and estimate their dominant systematic uncertainties. Subsequently, we study their impact on the proton and nuclear PDFs by means of Hessian profiling [34, 43,44,45] of both PDF4LHC21 [46] (for protons) and EPPS21 [31] (for tungsten nuclei) within the xFitter [47,48,49,50] open-source quantum chromodynamics (QCD) analysis framework, and with the direct inclusion in the open-source NNPDF4.0 fitting code [51].

Our analysis reveals that LHC neutrino structure functions can provide stringent constraints on the light quark and antiquark PDFs, especially on the up and down valence quarks and on strangeness, as compared to state-of-the-art global analyses. We also find that accounting for the main systematic uncertainties does not significantly degrade the sensitivity achieved in the baseline fits considering only statistical errors. We quantify the impact in our results of charm-tagged structure functions (large), study the relevance of final-state lepton-charge identification capabilities (moderate), and compare the constraints provided by different experiments (finding that the overall sensitivity is dominated by FASER\(\nu \)2). We also study the implications of the resulting improvement in PDF precision on core processes at the HL-LHC, finding an up to twofold reduction in theoretical error in the most optimistic scenario for selected Higgs and gauge boson production cross-sections.

Our results demonstrate that the availability of far-forward neutrino detectors at the LHC effectively provides CERN with a charged-current counterpart of the EIC, with similar kinematic reach and complementary sensitivity on hadronic structure. Therefore, LHC neutrino experiments realise, upon Lorentz-boosting, the analog of a “Neutrino-Ion Collider” at CERN without the need for new accelerator infrastructure or additional energy consumption.

The outline of this paper is as follows. Section 2 describes the procedure adopted to generate projections for neutrino DIS structure functions at the LHC and the methodology used to include these in PDF fits. The impact of such LHC structure function measurements on proton and nuclear PDFs is quantified in Sect. 3, with the associated implications for precision phenomenology at the HL-LHC assessed in Sect. 4. We provide a summary in Sect. 5, where we also consider possible directions for follow-up research. Additional results are collected in two appendices: Appendix A revisits the phenomenological studies of Sect. 4 using the HL-LHC PDF impact projections presented in [34], while Appendix B quantifies the stability of the nPDF impact projections with respect to input variations.

2 Deep-inelastic scattering with LHC neutrinos

Here we describe the procedure adopted to generate projections for the kinematic coverage and uncertainties associated to measurements of neutrino-nucleus scattering at the LHC far-forward experiments. First, we summarise the theoretical description of differential neutrino scattering in terms of DIS structure functions. Then, we present an overview of the operative and proposed LHC far-forward neutrino detectors that are considered in the present study and indicate their acceptance and performance parameters. By convoluting the expected electron and muon neutrino fluxes with the acceptance and scattering rates of each of these detectors, we evaluate the event yields in bins of x, \(Q^2\), and \(E_\nu \) and the associated statistical and systematic uncertainties. Finally, we discuss the procedure adopted to generate pseudo-data for LHC neutrino structure functions and to quantify their impact into proton and nuclear PDF determinations.

2.1 Neutrino DIS revisited

The double-differential cross-section for neutrino-nucleus charged-current scattering (see [15] and references therein) can be expressed in terms of three independent structure functions \(F_2^{\nu A}\), \(xF_3^{\nu A}\) and \(F_L^{\nu A}\):

$$\begin{aligned}&{{d^2\sigma ^{\nu A}(x,Q^2,y)}\over {dxdy}} = {{G_F^2s/4\pi }\over {\left( 1+Q^2/m_W^2\right) ^2}}\nonumber \\&\quad \times \left[ Y_+F^{\nu A}_2(x,Q^2) - y^2F^{\nu A}_L(x,Q^2) +Y_- xF^{\nu A}_3(x,Q^2)\right] , \end{aligned}$$
(2.1)

where \(Y_\pm = 1 \pm (1-y)^2\), and with a counterpart expression for antineutrino scattering,

$$\begin{aligned}&{{d^2\sigma ^{\bar{\nu } A}(x,Q^2,y)}\over {dxdy}} = {{G_F^2s/4\pi }\over {\left( 1+Q^2/m_W^2\right) ^2}} \nonumber \\&\quad \times \left[ Y_+F^{\bar{\nu } A}_2(x,Q^2) - y^2F^{\bar{\nu } A}_L(x,Q^2) -Y_- xF^{\bar{\nu } A}_3(x,Q^2)\right] , \end{aligned}$$
(2.2)

with \(s=2m_N E_\nu \) being the neutrino-nucleon centre-of-mass energy squared, \(m_N\) the nucleon mass, \(E_\nu \) the incoming neutrino energy, and \(y=Q^2/(2x m_n E_{\nu })\) the inelasticity. In the case of tau neutrino scattering, tau-lepton mass effects may be relevant, and Eqs. (2.1) and (2.2) receive additional contributions from the \(F_4\) and \(F_5\) structure functions. We neglect these effects, since here we focus on electron and muon neutrino scattering.

Structure functions depend on both x and \(Q^2\), while the differential cross-section also depends on the neutrino energy \(E_\nu \), or alternatively on the inelasticity y. Further, structure functions \(F^{\nu A}_i(x,Q^2)\) and \(F^{\bar{\nu } A}_i(x,Q^2)\) depend on the nuclear target A entering neutrino scattering through the nuclear modifications of the free-nucleon PDFs. Equations (2.1) and (2.2) are valid provided the hadronic invariant mass W is above the resonance production threshold,

$$\begin{aligned} W^2 = \left( m_N^2 + Q^2 {{(1-x)}\over {x}} \right) \gtrsim \left( 2\,\textrm{GeV} \right) ^2. \end{aligned}$$
(2.3)

In addition, here we restrict ourselves to the DIS region with perturbative momentum transfers \(Q^2 \gtrsim 2~\hbox {GeV}^2\), such that the structure functions can be decomposed as

$$\begin{aligned} F^{\nu A}_i(x,Q^2)&= \sum _{j=q,\bar{q},g} \int _x^1 {{dz}\over {z}}\, C_{i,j}^{\nu N} (z,\alpha _s(Q^2))f^{(A)}_j\left( {{x}\over {z}},Q^2\right) , \nonumber \\&\quad \text { with } \quad i = 2,3,L , \end{aligned}$$
(2.4)

expressed in terms of a convolution of partonic scattering cross-sections \(C_{i,j}^{\nu N}(x,\alpha _s)\) and of process-independent PDFs \(f^{(A)}_j\left( x,Q^2\right) \). A similar expression holds for charm production [21], which necessitates also accounting for charm mass effects [52]. Here the theory pipeline adopted to evaluate neutrino structure functions using Eq. (2.4) is provided by EKO [53] and YADISM [15, 54] interfaced to PineAPPL [55] for the generation of fast interpolation grids.

Different neutrino structure functions provide complementary sensitivity to the partonic flavour decompositions of nucleons.

To illustrate this feature, consider a leading-order calculation for a proton target with \(n_f=4\) active quark flavours, a diagonal CKM matrix, and no heavy quark mass effects.

The resulting \(F_2^{\nu p}\) and \(xF_3^{\nu p}\) structure functions read

$$\begin{aligned} F_2^{\nu p}(x,Q^2)&= 2x\left( f_{\bar{u}} + f_{d} + f_{s} + f_{\bar{c}} \right) (x,Q^2) \, , \nonumber \\ F_2^{\bar{\nu } p}(x,Q^2)&= 2x\left( f_u + f_{\bar{d}} + f_{\bar{s}} + f_c \right) (x,Q^2) \, , \nonumber \\ xF_3^{\nu p}(x,Q^2)&= 2x\left( -f_{\bar{u}} + f_d +f_s - f_{\bar{c}}\right) (x,Q^2) \, , \nonumber \\ xF_3^{\bar{\nu } p}(x,Q^2)&= 2x\left( f_u - f_{\bar{d}} -f_{\bar{s}} + f_{c}\right) (x,Q^2) . \end{aligned}$$
(2.5)

The corresponding expressions for a neutron target are obtained from isospin symmetry

$$\begin{aligned} F_2^{\nu n}(x,Q^2)&= 2x\left( f_{\bar{d}} + f_{u} + f_{s} + f_{\bar{c}} \right) (x,Q^2) , \nonumber \\ F_2^{\bar{\nu } n}(x,Q^2)&= 2x\left( f_d + f_{\bar{u}} + f_{\bar{s}} + f_c \right) (x,Q^2) , \nonumber \\ xF_3^{\nu n}(x,Q^2)&= 2x\left( -f_{\bar{d}} + f_u +f_s - f_{\bar{c}}\right) (x,Q^2) \, , \nonumber \\ xF_3^{\bar{\nu } n}(x,Q^2)&= 2x\left( f_d - f_{\bar{u}} -f_{\bar{s}} + f_{c}\right) (x,Q^2), \end{aligned}$$
(2.6)

while for an isoscalar, free-nucleon target denoted by N, one has

$$\begin{aligned} F_2^{\nu N}(x,Q^2)&= 2x\left( f_{u^+} + f_{d^+} + 2f_s + 2f_{\bar{c}} \right) (x,Q^2), \nonumber \\ F_2^{\bar{\nu } N}(x,Q^2)&= 2x\left( f_{u^+} + f_{d^+} + 2f_{\bar{s}} + 2f_c \right) (x,Q^2), \nonumber \\ xF_3^{\nu N}(x,Q^2)&= 2x\left( f_{u^-} + f_{d^-} +2f_s - 2f_{\bar{c}}\right) (x,Q^2), \nonumber \\ xF_3^{\bar{\nu } N}(x,Q^2)&= 2x\left( f_{u^-} + f_{d^-}-2f_{\bar{s}} +2 f_{c}\right) (x,Q^2), \end{aligned}$$
(2.7)

in terms of the valence and sea PDF combinations defined by the following relations

$$\begin{aligned} f_{q^+} (x,Q^2)&\equiv \left( f_{q}+f_{\bar{q}}\right) (x,Q^2) , \nonumber \\ f_{q^-} (x,Q^2)&\equiv \left( f_{q}- f_{\bar{q}}\right) (x,Q^2) . \end{aligned}$$
(2.8)

We note that, even for isoscalar targets, separate measurements for neutrinos and antineutrinos will not be equivalent, since in general the strange and charm sea asymmetries \(f_{s^-}\) and \(f_{c^-}\) are not expected to vanish [56, 57].

In the projections presented here, when interpreting the LHC neutrino structure functions in terms of proton PDFs, we will assume an isoscalar free-nucleon target and neglect nuclear PDF modifications, along the lines of Eq. (2.7).

Accounting for nuclear modifications in a global proton PDF fit is possible by means of the procedure developed in [58, 59] based on the theoretical covariance matrix approach [60, 61].

On the other hand, when evaluating structure functions for a tungsten (W) target, we take into account both nuclear corrections and that the target is not isoscalar when evaluating the physical observables.

It is also illustrative to compare the PDF dependence of neutrino structure functions at LO with that of their counterparts for neutral-current scattering with a charged lepton projectile.

Within the same assumptions, for energies below the Z-boson mass, \(Q^2 \ll m_Z^2\), the corresponding decomposition is

$$\begin{aligned} F_2^{\ell p}(x,Q^2)&= x\left( {{4}\over {9}}\left[ f_{u^+} + f_{c^+}\right] + {{1}\over {9}}\left[ f_{d^+} + f_{s^+}\right] \right) (x,Q^2) , \nonumber \\ F_2^{\ell n}(x,Q^2)&= x\left( {{4}\over {9}}\left[ f_{d^+} + f_{c^+}\right] + {{1}\over {9}}\left[ f_{u^+} + f_{s^+}\right] \right) (x,Q^2) , \nonumber \\ F_2^{\ell N}(x,Q^2)&= x\left( {{5}\over {18}}\left[ f_{u^+} + f_{d^+}\right] + {{1}\over {9}} f_{s^+} + {{4}\over {9}} f_{c^+} \right) (x,Q^2), \end{aligned}$$
(2.9)

with \(xF_3\) being negligible in this region.

A comparison of Eqs. (2.5)–(2.7) with Eq. (2.9) showcases the complementarity between neutrino and charged-lepton DIS in terms of sensitivity to different flavour PDF combinations.

This implies that the best sensitivity on the quark flavour separation in the nucleon would be provided by combining neutrino DIS from the LHC far-forward experiments with measurements on charged-lepton DIS at the EIC.

2.2 LHC far-forward neutrino experiments

The calculation of differential neutrino scattering event rates at the LHC far-forward detectors involves two main ingredients: the energy and flavour dependence of the incoming neutrino flux crossing the detector fiducial volume, on the one hand, and the scattering rates within the detector acceptance, on the other hand.

Here we summarise the main features of the existing and future far-forward detectors considered, in particular concerning their acceptance and expected performance.

We focus on muon-neutrino scattering, which benefits from the highest rates and is less affected by theoretical uncertainties in the production mechanism, but also provide predictions for the subdominant electron-neutrino structure functions.

The kinematics of a charged-current neutrino DIS event \((x,Q^2, E_\nu )\), or alternatively \((x,Q^2, y)\), are uniquely specified by the measurement of three independent final-state variables, such as \(\left( E_\ell ,\theta _\ell , W^2\right) \) or \(\left( E_\ell ,\theta _\ell , E_h \right) \), with \(E_\ell \) and \(\theta _\ell \) being the energy and polar angle of the outgoing charged lepton and \(E_h\) the total energy of the hadronic final state.

Most neutrino detectors can only access \(E_h\), given that measuring the invariant mass \(W^2\) requires fully reconstructing the final state.

A measurement of the three kinematic variables \(\left( E_\ell ,\theta _\ell , E_h \right) \) then fixes the DIS kinematics as follows:

$$\begin{aligned} E_\nu= & {} E_h + E_\ell \,, \nonumber \\ Q^2= & {} 4 ( E_h + E_\ell ) E_\ell \sin ^2 \left( \theta _\ell /2\right) , \nonumber \\ x= & {} {{4 ( E_h + E_\ell ) E_\ell \sin ^2 \left( \theta _\ell /2\right) }\over {2m_N E_h}} . \end{aligned}$$
(2.10)

These relations also reflect how systematic uncertainties affecting the measurement of \(\left( E_\ell ,\theta _\ell , E_h \right) \) translate into uncertainties in the reconstructed values of \((x,Q^2, E_\nu )\) modifying the expected binned event rates.

Detector overview Table 1 summarises, for each of the far-forward LHC neutrino experiments considered in this work, their pseudorapidity coverage, target material, whether they can identify the sign of the outgoing charged lepton, the acceptance for the charged lepton and hadronic final state, and the expected reconstruction performance.

We consider acceptance and performance separately for electron neutrinos and muon neutrinos.

In these projections we assume that FASER\(\nu \) and SND@LHC acquire data for Run III (\(\mathcal {L}=150\) fb\(^{-1}\)), while FASER\(\nu \)2, AdvSND, and FLArE take data for the complete HL-LHC period (\(\mathcal {L}=3\) ab\(^{-1}\)).

In the case of FLArE, we consider projections for fiducial volumes corresponding to both 10- and 100-tonne detectors.

In the following, we provide details about the information collected in Table 1.

Table 1 For each of the far-forward LHC neutrino experiments considered, we indicate their neutrino pseudorapidity coverage, target material, whether they can identify the sign of the outgoing charged lepton, the acceptance for the charged lepton and hadronic final state, and the expected reconstruction performance. We consider acceptance and performance separately for electron and muon neutrinos. For FLArE, we assume that muons would be measured in the FASER2 spectrometer situated downstream in the FPF cavern. See the description of each experiment in the text for more details. For our projections we assume that FASER\(\nu \) and SND@LHC acquire data for the Run III period (\(\mathcal {L}=150\) fb\(^{-1}\)), while FASER\(\nu \)2, AdvSND, and FLArE take data for the complete HL-LHC period (\(\mathcal {L}=3\) ab\(^{-1}\)). In the case of FLArE, we consider projections for fiducial volumes corresponding to both 10- and 100 tonne detectors, which we denote for the two detectors as FLArE(*)

FASER \(\nu \). The ForwArd Search ExpeRiment (FASER) detector and its companion FASER\(\nu \) [2, 5, 6, 62] are located at the TI12 tunnel of the CERN accelerator complex. Both detectors are aligned with the collision axis line of sight (LOS) and have been acquiring data since the beginning of Run III in 2022. Neutrino scattering takes place in the FASER\(\nu \) detector, composed of interleaved emulsion and tungsten plates and adding up to a mass of 1.1 tonnes with a fiducial volume of \(20~\textrm{cm} \times 25~\textrm{cm} \times 80~\textrm{cm}\). The FASER apparatus is immersed in a magnetic field, providing charged lepton identification thanks to two 1 m-long dipole magnets with \(B=0.57\) T and another 1.5 m-long magnet in front of the spectrometer. Neutrino detection and identification can be carried out either using the emulsion films, which have the key benefit of excellent position and angular resolution, or instead using the electronic detector components of FASER, which enable the tagging of the outgoing downstream energetic muons. FASER\(\nu \) is sensitive to neutrinos with pseudo-rapidity \(\eta _\nu \ge 8.5\) and can also identify charm-tagged events. To identify the charge of the lepton from a neutrino interaction, the lepton is required to pass through the FASER spectrometer. This imposes an angular requirement on the outgoing charged lepton, indicated in Table 1. Also, to identify a DIS interaction, the emulsion detector requires at least five charged tracks to emerge from the interaction vertex. We implement this last constraint in our simulations by requiring a minimum hadronic energy when calculating the event yields, as the charged track multiplicity is expected to grow with W [5, 63].

SND@LHC. In the same manner as FASER, the SND@LHC experiment [8] is located in a service tunnel (TI18) around 500 m from the ATLAS interaction point and has been taking data since the beginning of Run III. SND@LHC is installed off the LOS axis in order to cover the neutrino pseudo-rapidity range of \(7.2 \le \eta _\nu \le 8.4\). With a total fiducial volume corresponding to a 830 kg detector with a length of 35 cm, it is composed of tungsten plates, where neutrino scattering takes place, interleaved with nuclear emulsions and electronic tracker components. Downstream, the scattering volume is followed by a hadronic calorimeter and a muon tracking system. The electromagnetic and hadronic energy deposits can be measured at the electronic detectors, with the emulsion components providing vertex reconstruction.

The lack of magnetic field prevents the charge-sign identification of the outgoing charged leptons.

FASER \(\nu \)2. This is a proposed 20-tonne neutrino experiment located on the LOS of the LHC neutrino beam to be installed in the FPF cavern. It is based on the same technology as FASER\(\nu \), and hence relies on an emulsion-based detector optimised to identify heavy flavour particles, including tau leptons and charm and beauty particles, arising from neutrino interactions. It would be sensitive to neutrinos with pseudorapidity \(\eta _\nu \ge 8.5\). The FASER\(\nu \)2 detector is composed of 3300 emulsion layers interleaved with 2-mm-thick tungsten plates, for a total volume of tungsten of \(40~\textrm{cm} \times 40~\textrm{cm} \times 6.6~\textrm{m}\). The combination of FASER\(\nu \)2 with the nearby FASER2 detector, equipped with a spectrometer, makes measurements of the outgoing muon charge possible. Given that FASER\(\nu \)2 is based on the same detector technology as its predecessor, the same performance in terms of reconstruction of final-state kinematics can be assumed.

AdvSND. This proposed experiment [10] actually consists of two detectors, a far-detector to be installed at the FPF with a coverage in neutrino pseudorapidity of \(7.2 \le \eta _\nu \le 8.4\) (i.e., off-LOS, same as SND@LHC) and a near detector installed somewhere else in the LHC complex and covering the range \(4 \le \eta _\nu \le 5\).

Here we focus on the former.

It would be composed (from upstream to downstream) of a target region for vertex reconstruction and electromagnetic energy measurement, followed by a hadronic calorimeter, a muon identification system, and finally a magnet enabling muon charge and momentum measurements.

The target region of the detector, where the neutrino interactions take place, is made of thin sensitive layers of emulsion interleaved with tungsten plates, for a total mass of 5 tonnes and fiducial length of 50 cm.

This detector configuration will be able to track muons with energy \(E_\nu \gtrsim 20~\hbox {GeV}\) within an acceptance of 100 mrad and provide information on the charge of the outgoing muon thanks to its magnet.

The total energy of the hadronic final state will be measured in the hadronic calorimeter.

No information on the expected performance of the AdvSND-far detector is available and hence no estimate of the systematic errors is carried out.

FLArE. Building upon progress in liquid noble gas neutrino detectors over the last decade (ICARUS, MicroBooNE, SBND, DUNE), this experiment [10, 41] would rely on a modularised liquid argon (LAr) time projection detector.

The use of LAr as a target is beneficial for final-state particle identification, track angle, and kinetic energy measurements with sub-millimeter spatial resolution in all dimensions.

The detector will be equipped with a magnetised hadron/muon calorimeter downstream of the liquid argon volume for muon charge and momentum measurements.

While muon neutrinos with energy \(E_\nu \gtrsim 2~\hbox {GeV}\) would not be fully contained in the FLArE detector, for our projections we assume that outgoing high-energy muons would be measured in the FASER2 spectrometer situated downstream in the FPF cavern.

With an expected fiducial (active) mass of 10 tonnes (30 tonnes) and a length of 7 m, FLArE will detect final-state electrons with energies \(E_{e}\lesssim 2~\textrm{TeV}\) and scattering angles up to 0.5 mrad, while final-state muons with \(\theta _\mu \lesssim 0.025\) will be recorded by FASER2.

Here we present projections for this baseline design of the experiment, FLArE10, as well as for a potential larger variant based on a fiducial mass of 100 tonnes, denoted as FLArE100.

While FLArE100 is not one of the currently proposed FPF experiments, we nevertheless include it for illustration purposes.

Reconstruction of the total energy of the hadronic final state will be possible.

In terms of performance, the targets are \(\delta E_\mu \sim 5\%\) of electron energy resolution, \(\delta \theta _e \sim 15\) mrad of electron angular resolution, and \(\delta E_h \sim 30\)% for the hadronic energy.

Since the muon energy from a charged-current interaction would be measured by FASER2, the performance parameters for \(E_\mu \) are taken to be the same.

2.3 Differential scattering event rates

For each of the LHC far-forward neutrino detectors described in Table 1, we generate projections for the expected DIS structure function measurements as follows. We want to evaluate the number of reconstructed charged-current neutrino interaction events taking place in the fiducial volume of the detector when divided into bins of Bjorken-x, momentum transfer \(Q^2\), and neutrino energy \(E_\nu \), that is,

$$\begin{aligned} N_{\nu _e,\textrm{ev}}^{(i)}\left( \nu _e ; E_\nu ; x; Q^2\right) \, ,\quad i=1,\ldots , N_\textrm{bin}, \end{aligned}$$
(2.11)

with

$$\begin{aligned} E_{\textrm{min}}^{(i)}&\le E_\nu \le E_{\textrm{max}}^{(i)}, \\ x_{\textrm{min}}^{(i)}&\le x \le x_{\textrm{max}}^{(i)} , \\ Q_{\textrm{min}}^{2(i)}&\le Q^2 \le Q_{\textrm{max}}^{2(i)}, \end{aligned}$$

for electron neutrinos and for each of the bins comprising the measurement and with similar expressions applying for muon neutrinos and antineutrinos. These event yields determine the statistical precision associated with a measurement of the double-differential cross-sections Eqs. (2.1) and (2.2). Subsequently, we account for the expected reconstruction performance of the detector in order to estimate the systematic uncertainties associated with these event yields. For simplicity, in this initial study we consider a single bin in energy and choose log-spaced bins in x and \(Q^2\); the eventual optimisation of the binning selection is left for future work.

In this calculation, we adopt the neutrino fluxes evaluated in [1] and used for the FPF simulations presented in [10]. The bin-by-bin integrated event yields in Eq. (2.11) are obtained by convoluting the incoming neutrino fluxes, for a given acceptance of the target detector, with the corresponding neutrino differential cross-sections Eqs. (2.1) and (2.2). Binned event yields are evaluated using

$$\begin{aligned} N_\textrm{ev}^{(i)}&= n_T L_T\int _{Q^{2(i)}_\textrm{min}}^{Q^{2(i)}_\textrm{max}}\int _{x^{(i)}_\textrm{min}}^{x^{(i)}_\textrm{max}}\int _{E_\textrm{min}^{(i)}}^{E_\textrm{max}^{(i)}} {{dN_{\nu }(E_\nu )}\over {dE_{\nu }}} \nonumber \\&\quad \times \left( {{d^2\sigma (x,Q^2,E_{\nu })}\over {dxdQ^2}}\right) \mathcal{A}(x,Q^2,E_{\nu }) dQ^2 dx dE_{\nu } , \end{aligned}$$
(2.12)

where \(n_T\) is the nucleon density of the target detector material, \(L_T\) is its length, and \(\mathcal{A}(x,Q^2,E_{\nu })\) is a global acceptance factor which takes the form of a step function and accounts for the experimental acceptances in \(E_\ell \), \(\theta _\ell \), and \(E_h\) listed in Table 1.

The incoming neutrino fluxes of [1] account for the geometry and neutrino pseudorapidity \(\eta _\nu \) coverage of the considered detector and are encoded in Eq. (2.12) as \(dN_{\nu }(E_\nu )/dE_{\nu }\). This neutrino flux takes into account both the prompt component associated with neutrinos from charmed hadron decay and a displaced component from light hadron decays. The prompt component used in this work was simulated at next to leading order (NLO) using POWHEG [64,65,66] matched to Pythia8 [67, 68] for the parton shower and hadronisation, and the displaced component was simulated with EPOS-LHC [69]. In comparison with the calculation from [1], in addition to the improved charmed hadron production presented in [70], the neutrino flux has been updated to \(\sqrt{s} = 13.6~\textrm{TeV}\).

As pointed out in Ref. [1], there are notable neutrino flux uncertainties, as various event generators do not agree on the forward parent hadron spectra. If the prediction spread of various generators were taken as a means of flux uncertainty, corresponding to a \(\lesssim 50\%\) uncertainty on the interacting muon neutrino spectrum, this would be a significant systematic uncertainty if left unresolved. However, it is worth noting that many existing predictions are yet to be tuned for the purposes of experiments such as those planned for the FPF. Nevertheless, there are projections of FPF measurements which would reduce this uncertainty to the sub-percent level already in the context of the contemporary predictions, based on parametrising their expected correlations [71], as well as efforts to describe the uncertainty in a data-driven way while improving the modelling of forward hadronisation [72]. However, it is important to note that forward neutrino experiments actually constrain the product of flux and cross-section, and one must be assumed to measure the other. In a full analysis, the flux and cross-section would be constrained simultaneously in a joint measurement, utilising their different kinematic dependence on \( x,Q^2,E_{\nu }\) and neutrino rapidity \(\eta _{\nu }\). In our study, we aim to understand the full impact of FPF data on the PDF fit, thus motivating this future joint measurement. To this end, we take the neutrino flux to be known and focus on the irreducible systematics associated with event reconstruction. With this assumption, we will show that Run III measurements will not be sufficient to impact PDF fits. Instead, Run III measurements could be used to calibrate incoming neutrino fluxes, effectively reducing the large uncertainties by the time FPF data are collected in the future. The expected reduction of FPF neutrino flux uncertainties further justifies our choice to take the neutrino flux as known.

Table 2 Integrated event yields for the six detectors considered, separated into electron neutrinos and antineutrinos, muon neutrinos and antineutrinos, and their sum. These event yields are computed from Eq. (2.12) imposing DIS kinematics, \(Q^2 \ge 2~\hbox {GeV}^2\) and \(W^2 \ge 4~\hbox {GeV}^2\). The numbers in parenthesis indicate the event rates corresponding to charm production, Eq. (2.14). In the case of the FLArE detector, we display results for two proposed options with fiducial masses of 10 and 100 tonnes, respectively

The triple integral in Eq. (2.12) is evaluated numerically by means of Monte Carlo sampling, by generating \(N_\textrm{mc}\) sampling points in the \(\left( x,Q^2,E_{\nu }\right) \) space with the constraint that

$$\begin{aligned} 0< y \left( = Q^2/2m_N E_{\nu }x\right) <1 , \end{aligned}$$
(2.13)

and where \(N_\textrm{mc}\) is chosen to be large enough that residual Monte Carlo integration uncertainties are negligible for all the bins considered.

Equation (2.12) can be generalised to charm production events, with the only difference being that now the neutrino scattering cross-section is restricted to those processes leading to final-state charm quarks. Assuming that charm quarks can be directly tagged by the detector, one has

$$\begin{aligned}&N_\textrm{ev,c}^{(i)} = n_T L_T\int _{Q^{2(i)}_\textrm{min}}^{Q^{2(i)}_\textrm{max}}\int _{x^{(i)}_\textrm{min}}^{x^{(i)}_\textrm{max}} \int _{E_\textrm{min}^{(i)}}^{E_\textrm{max}^{(i)}} {{dN_{\nu }(E_\nu )}\over {dE_{\nu }}} \nonumber \\&\quad \times \left( {{d^2\sigma ^{\nu N \rightarrow \ell + c+X}(x,Q^2,E_{\nu })}\over {dxdQ^2}}\right) \mathcal{A}(x,Q^2,E_{\nu }) dQ^2 dx dE_{\nu }, \end{aligned}$$
(2.14)

with the charm production cross-sections discussed in [21] and references therein. Here we neglect efficiency and acceptance effects associated to D-meson tagging, which can only be properly estimated by means of a full detector simulation. The acceptance correction \(\mathcal{A}(x,Q^2,E_{\nu })\) in Eq. (2.14) applies only to the charged leptons and hence is the same as in the inclusive case.

Detectors without charm-tagging capabilities can still be sensitive to charm production via the semileptonic decays of the D-mesons, resulting in the characteristic dimuon topology, where

$$\begin{aligned} N_\mathrm{ev,2\mu }^{(i)} \approx N_\textrm{ev,c}^{(i)} \times \mathcal {B}\left( c \rightarrow D \rightarrow \mu + X\right) , \end{aligned}$$
(2.15)

with \(\mathcal {B}\) a numerical factor that accounts for charm hadronisation and the resulting semileptonic decay to a muon.

Given that \(\mathcal {B}\sim 0.1\), being able to directly tag charm quarks increases the event yields of charm production events by a factor of 10 as compared to reconstructing the dimuon final state.

Table 2 summarises the predicted integrated event yields for the detectors considered, separated into electron neutrinos and antineutrinos and muon neutrinos and antineutrinos. We used the PDF4LHC21 set to compute the differential cross-sections that enter in Eq. 2.12. As mentioned above, for FLArE we assume that muon neutrinos interacting in its fiducial volume will be measured by the FASER\(\nu \)2 spectrometer.

These event yields are computed from Eq. (2.12) with the requirement that the momentum transfer and the final-state invariant mass are restricted to the DIS region,

$$\begin{aligned} Q^2 \ge 2~\textrm{GeV}^2\quad \textrm{and}\quad W^2 \ge 4~\textrm{GeV}^2 . \end{aligned}$$
(2.16)

The numbers in parenthesis indicate the event rates corresponding to charm production, assuming heavy flavour-tagging capabilities. In the case of the FLArE detector, we display results for two proposed options with fiducial masses of 10 and 100 tonnes. As opposed to the number of interacting neutrinos presented in [10], we now account for the detector acceptances listed in Table 1, which reduce the event yields by up to a factor of 2.

Several observations can be derived from Table 2. First, one appreciates the large increase in statistics from the Run III experiments to those from FPF, with for example an increase by a factor of \(\sim \) 250 in the muon neutrino yield between FASER\(\nu \) and FASER\(\nu \)2. Second, the muon-neutrino scattering yield dominates over electron-neutrino scattering by a factor between 2 and 3, though the precise value of this ratio is affected by the large theoretical uncertainties affecting forward electron neutrino production. Third, charm production represents around 15% of the inclusive yields, with both FASER\(\nu \)2 and FLArE100 resulting in around 80k recorded charm production events. Note that this result assumes D-meson tagging capabilities, and if only dimuon events can be recorded, the yields would be reduced by a factor of 10. Fourth, FASER\(\nu \)2 and FLArE100 lead to the largest absolute yields of the FPF experiments, with a total of around 680k and 1.1M (non-tau) neutrino DIS events, respectively, with 74k and 170k events expected instead for AdvSND-far and for FLArE10.

Figure 1 displays the differential event yields per bin, Eq. (2.11), for muon neutrinos detected at the FASER\(\nu \) (Run III) and the AdvSND, FASER\(\nu \)2, and FLArE100 (FPF) experiments, restricted to the DIS region defined by Eq. (2.16) and where only bins with \(\ge 100\) events are retained, except for FASER\(\nu \) in which bins with \(\ge 10\) events are shown.

Adding up the bins in each of the panels results in the inclusive yields listed in Table 2.

The clear improvement going from the current FASER\(\nu \) experiment to the FPF ones is visible both in terms of the number of events per bin and for the kinematic coverage.

The FPF experiments benefit from large event rates for most of the region in \(\left( x,Q^2\right) \) covered, leading to typical statistical uncertainties at the 1% level or smaller, while for FASER\(\nu \), the statistical uncertainties are larger due to the reduced event rates.

Fig. 1
figure 1

The event yields per bin \(N_\textrm{ev}^{(i)}\), Eq. (2.11), for muon-neutrino scattering at the FASER\(\nu \), FASER\(\nu \)2, AdvSND, and FLArE100 experiments. Selected events are restricted to the DIS region Eq. (2.16), and only bins with \(\ge 100\) events are retained, except for FASER\(\nu \) in which bins with \(\ge 10\) events are kept. Adding up the bins in each of the panels results in the numbers listed in Table 2

From Fig. 1 one observes how the kinematic coverage of the FPF far-forward experiments reaches down to \(x_\textrm{min}\sim 3\times 10^{-3}\) at small-x and up to \(Q^2_\textrm{max}\sim 10^4~\hbox {GeV}^2\) at large-\(Q^2\), representing an extension of around one order of magnitude in both directions as compared to available DIS neutrino data. To illustrate this, Fig. 2 compares the kinematic coverage of FASER\(\nu \), FASER\(\nu \)2, FLArE, and AdvSND, same as in Fig. 1, with that of electron-ion collisions at the upcoming EIC [35, 36] at the highest centre-of-mass energies planned, as well as to available fixed-target neutral- and charged-current DIS measurements. The LHC neutrino experiments cover an x region relevant at hadron colliders for Higgs boson analyses, precision electroweak measurements such as the W-boson mass [73], and new physics measurements sensitive to the large-x PDFs [74]. FASER\(\nu \)2 and FLArE100 mostly overlap with the EIC coverage, providing a complementary handle on the quark flavour decomposition in protons and heavy nuclei as compared to the one provided by the EIC measurements.

Fig. 2
figure 2

The kinematic coverage in the \((x,Q^2)\) plane of muon-neutrino scattering at the FASER\(\nu \), FASER\(\nu \)2, FLArE10(100), and AdvSND experiments (see also Fig. 1) compared to that of electron-ion collisions at the EIC as well as to the coverage of existing neutrino fixed-target DIS measurements

The inclusive yields listed in Table 2 differ from the total number of neutrinos interacting within the detector volume [1, 10] due to both the DIS requirements (\(W^2 > 4\,\textrm{GeV}^2\) and \(Q^2 > 2\,\textrm{GeV}^2\)) and the detector fiducial acceptances summarised in Table 1. The former is found to be negligible, with DIS cuts removing only \(\lesssim 1\%\) of the events, a consequence of the high energy of LHC neutrinos. The latter is quantified in Table 3, displaying the total number of electron and muon neutrinos before and after applying acceptance cuts. The acceptance efficiency, defined as the ratio between pre- and post-acceptance integrated event yields, depends on the detector and can be up to 50%. For instance, FASER\(\nu \)2 loses 63% and 52% of the interacting electron and muon neutrinos, respectively due to the acceptance requirements. From Table 1, is it also worth noting how FLArE has virtually perfect acceptance for electron neutrinos. One also observes that SND@LHC and AdvSND exhibit relatively large acceptance efficiencies, which however are not sufficient to compensate for the lower number of initial interacting neutrinos due to their off-axis location as compared to FASER\(\nu \)(2) and FLArE.

Table 3 The number of electron and muon neutrinos interacting within the detector volume, compared with the results after applying the DIS requirements (\(W^2 > 4\,\textrm{GeV}^2\) and \(Q^2 > 2\,\textrm{GeV}^2\)) and the experimental acceptances from Table 1. The DIS requirement removes only \(\lesssim 1\%\) of the events, a consequence of the high energy of LHC neutrinos. The last column displays the acceptance efficiency, defined as the ratio between pre- and post-acceptance integrated event yields. While the specific efficiencies depend on the experiment, up to 50% of the neutrinos interacting in the detector volume may fall outside detector acceptance

2.4 Statistical and systematic uncertainties

The event yields displayed in Fig. 1 determine the associated statistical uncertainty in each bin,

$$\begin{aligned} \delta ^\mathrm{(stat)} N_\textrm{ev}^{(i)} = \sqrt{N_\textrm{ev}^{(i)}}, \end{aligned}$$
(2.17)

such that the fractional statistical uncertainty per bin is \(\delta ^\mathrm{(stat)}_i=1/\sqrt{N_\textrm{ev}^{(i)}}\). Since we discard bins with less than 100 events for FPF experiments (FASER\(\nu \)2, FLArE, AdvSND) and 10 events for FASER\(\nu \) and SND@LHC, the fractional statistical uncertainty ranges between \(\lesssim 1\%\) and \(\sim 30\%\), depending on the values of x and \(Q^2\) associated with each bin.

Fig. 3
figure 3

Same as Fig. 1 for FASER\(\nu \)2 now as a function of x after having integrated the event yields in the range of \(Q^2 \in [10,100]~\hbox {GeV}^2\) for both the neutrino and antineutrinos. In addition to the event yield values, we also show the error bars corresponding to statistical errors only (left), systematic errors only (middle), and the sum in quadrature of the two (right panel). The bars in the horizontal direction indicate the width of the adopted x-bins

The projected statistical uncertainties for muon-neutrino scattering in the case of FASER\(\nu \)2 are displayed in the left panel of Fig. 3, which corresponds to the same event yields as in Fig. 1, now as a function of x after having integrated the event yields in the range of \(Q^2 \in [10,100]~\hbox {GeV}^2\). The error bar in the vertical direction indicates the statistical uncertainties, while that in the horizontal direction corresponds to the width of the x-bins. Except for the bins with the smallest values of x, statistical uncertainties indeed are at the percent level or smaller for this experiment.

In addition to the statistical uncertainties evaluated from Eq. (2.17), one needs to estimate the systematic uncertainties associated to the finite precision in the reconstruction of the final state leptonic and hadronic variables listed in Table 1. For instance, an event which would be classified into a given bin in \((x,Q^2,E_\nu )\) in the case of a perfect detector may end up being mis-classified into a different bin in the presence of systematic shifts associated to the lepton energy \(E_\ell \), lepton scattering angle \(\theta _\ell \), and hadronic energy \(E_h\), as indicated by Eq. (2.10).

For each independent source of systematic uncertainty, which in this analysis consists of \(\delta E_\ell \), \(\delta E_h \), and \(\delta \theta _\ell \), we quantify its impact at the event yield level

$$\begin{aligned} \delta _\textrm{sys}^{(E_\ell )} N_\textrm{ev}^{(i)} \, ,\quad \delta _\textrm{sys}^{(E_h)} N_\textrm{ev}^{(i)} ,\quad \delta _\textrm{sys}^{(\theta _\ell )} N_\textrm{ev}^{(i)} ,\quad i=1,\ldots ,N_\textrm{bin} \, , \end{aligned}$$
(2.18)

by extending the calculation delineated in Sect. 2.3. First, we generate a Monte Carlo set of events, denoted by \(\mathcal {D}_0\), composed of \(N_\textrm{mc} \approx 10^7\) samples and determine the assignment of each event to a point in the \(\left( x,Q^2,E_{\nu }\right) \) space. We then take each event in \(\mathcal {D}_0\) and smear it with Gaussian distributions whose variances are given by Table 1 to produce a set of new samples \(\{\mathcal {D}_k\}\). The smeared events are subjected to the same DIS cuts from Eq. (2.16) and acceptances from Table 1. The bin assignment of the events in the smeared samples \(\mathcal {D}_k\) will in general be different from those of the baseline sample \(\mathcal {D}_0\).

We define the fractional uncertainty associated to a given systematic source, say \(\delta E_\ell \), for bin i to be the mean of the absolute difference between the number of events in this bin for the smeared samples \(\{\mathcal {D}_k\}\) and the number of events in this bin for \({\mathcal {D}_0}\):

$$\begin{aligned} \delta _\textrm{sys}^{(E_\ell )} =\left\langle \left| {{{N}^{(i)}_{E_\ell -\textrm{smeared},k} -N_0^{(i)}}\over {N_0^{(i)}}}\right| \right\rangle . \end{aligned}$$
(2.19)

The absolute systematic uncertainty in event yield caused by \(\delta E_\ell \) is then \(\delta _\textrm{sys}^{(E_\ell )} N_\textrm{ev}^{(i)}\). Individual sources of systematic errors are treated as uncorrelated among them, and hence by producing samples where only one source of error is varied at a time, we can determine the systematic errors, Eq. (2.18), in each bin for each of the considered experiments. This approach has the benefit that rescaling individual sources of systematic uncertainties, say to assess the impact of improved detector performance, becomes straightforward.

Figure 4 displays the projected systematic uncertainties associated to \(E_\ell \), \(\theta _\ell \), and \(E_h\) for the measurements of the double-differential muon-neutrino scattering at FASER\(\nu \)2. We note that there is a further systematic uncertainty associated to the overall neutrino flux, but we do not include this uncertainty in our estimation. The magnitude of each systematic error is plotted as a function of the average momentum fraction per bin \(\left\langle x\right\rangle \) in two different bins of \(Q^2\). We indicate the results separately for neutrino and antineutrino projectiles as well as those associated to inclusive and to charm production measurements. For completeness, we also display in the bottom-right panel the corresponding statistical uncertainties in the same bins.

Fig. 4
figure 4

Estimated systematic uncertainties for the measurements of the double-differential muon-neutrino scattering cross-section at FASER\(\nu \)2. We consider the systematic errors associated to the charged lepton energy \(E_\ell \) and scattering angle \(\theta _\ell \) and to the hadronic energy \(E_h\). The size of each source of systematic error is plotted as a function of the average momentum fraction per bin \(\left\langle x\right\rangle \) in two different bins of \(Q^2\). We indicate the results separately for neutrino and antineutrino projectiles as well as those associated to inclusive and to charm production measurements. For completeness, we display in the bottom-right panel the corresponding statistical uncertainties in the same bins

In the case of FASER\(\nu \)2, the systematic uncertainties associated to the final state energies \(E_\ell \) and \(E_h\) are comparable and are in general at the \(10\%\) level, ranging from \(1\%\) to \(30\%\) across bins. The uncertainties associated with \(\theta _\ell \) are typically at or well below the \(1\%\) level for these experiments, which can be attributed to their good spatial resolution. On the other hand, for the FLArE experiment, uncertainties associated with the scattering angle \(\theta _\ell \) dominate, being at the \( 40\%\) level and ranging from a few percent to \(\mathcal{O}(1)\), whereas the uncertainties associated with final-state energy reconstruction are smaller, around the \(10\%\) level, and range from \(\lesssim 1\%\) to up to around \(80\%\) depending on the bin. For all experiments, statistical uncertainties are subdominant in most of the bins and are below the 10% level, especially at large-x which is the kinematic region benefitting from the largest event rates.

Figure 3 also displays the integrated event yields for FASER\(\nu \)2 as a function of x for only systematic errors and for the sum in quadrature of statistical and systematic errors (central and right panels, respectively). For most of the bins, for the baseline performance assumptions the total systematic uncertainty dominates over the statistical uncertainties, with the possible exception of the small-x region where statistical and systematic errors are of comparable size.

The end result of the procedure is an estimate of the statistical and systematic uncertainties for each bin of the measurement, from which an experimental covariance matrix can be constructed as

$$\begin{aligned} \textrm{cov}_{ij}&=\delta _{ij} \left( \delta ^\mathrm{(stat)} N_\textrm{ev}^{(i)}\right) ^2 + \sum _{k=1}^{n_\textrm{sys}}\left( \delta _\textrm{sys}^{(k)} N_\textrm{ev}^{(i)} \right) \left( \delta _\textrm{sys}^{(k)} N_\textrm{ev}^{(j)} \right) , \nonumber \\&\quad \text { for } i,j=1,\ldots ,N_\textrm{bin} , \end{aligned}$$
(2.20)

and the same for the associated correlation matrix of the measurement

$$\begin{aligned} \rho _{ij} = {{\textrm{cov}_{ij}}\over {\sqrt{ \textrm{cov}_{ii} }\sqrt{ \textrm{cov}_{jj} } }} . \end{aligned}$$
(2.21)

The relative covariance matrix, \( \textrm{cov}_{ij}/( N_\textrm{ev}^{(i)}N_\textrm{ev}^{(j)})\), is independent of the considered observable and would also apply for the double-differential cross-sections Eqs. (2.1) and (2.2) which are related to the event yields by a constant prefactor.

The experimental covariance matrix constructed as per Eq. (2.20) assumes that each source of systematic uncertainty is 100% correlated across all the bins of the measurement. In the real experiment, the actual covariance matrix will be composed of a large number of uncertainty sources, with typical HERA and LHC precision measurements characterised by up to hundreds of different sources of systematic error. In particular, the assumption that a single source of systematic error, say \(\delta E_\ell \), is fully correlated among all the bins in \((x,Q^2)\) is unlikely to be accurate.

For this reason, following the HL-LHC projection strategy of [34], here we neglect bin-by-bin correlations and add in quadrature statistical and systematic errors,

$$\begin{aligned} \textrm{cov}_{ij}&= \delta _{ij} \left( \delta ^\mathrm{(stat)} N_\textrm{ev}^{(i)}\right) ^2 + \delta _{ij}\left( f_\textrm{corr}\right) ^2\sum _{k=1}^{n_\textrm{sys}} \left( f_\textrm{red}^{(k)}\right) ^2 \nonumber \\&\quad \times \left( \delta _\textrm{sys}^{(k)} N_\textrm{ev}^{(i)} \right) ^2 \text { for } i,j=1,\ldots ,N_\textrm{bin} . \end{aligned}$$
(2.22)

In Eq. (2.22) we have introduced a parameter \(f_\textrm{red}^{(k)}\le 1\) that gauges the impact of a possible reduction of the kth systematic error as compared to the default experiment performance (reproduced with \(f_\textrm{red}=1\)). Furthermore, \(f_\textrm{corr}\) represents an effective correction factor that accounts for the fact that data with correlated systematics may be more constraining than the same data where each source of error is simply added in quadrature as we do here. A value of \(f_\textrm{corr} \sim 0.5\), obtained from the inspection of available measurements for which the full information on correlated systematics is available, was estimated in [34] and we adopt the same choice here.

In Sect. 3 we present results for both \(f_\textrm{red}=1\) (conservative) and \(f_\textrm{red}=0.5\) (optimistic scenario), though we note that it would be straightforward to revisit the projections for other assumptions of the detector performance.

Specifically, \(f_\textrm{red}=0.5\) is assumed for the Hessian profiling of PDF4LHC21 and EPPS21, while \(f_\textrm{red}=1\) for the NNPDF4.0-based fits.

2.5 Pseudo-data generation

In order to generate pseudo-data for double-differential LHC neutrino scattering cross-sections, we follow the procedure used for the HL-LHC projections of [34] which was also adopted in [75, 76] for SMEFT impact projections of vector-boson scattering and high-mass Drell-Yan data at the HL-LHC, respectively.

The starting point is predictions for inclusive and charm-tagged differential neutrino scattering cross-section, denoted generically by

$$\begin{aligned} \mathcal {O}_i^{\mathrm{(th)}} \equiv {{d^2\sigma ^{\nu N}(x_i,Q^2_i,y_i)}\over {dxdy}} \, ,\quad i=1,\ldots ,N_\textrm{bin} \, , \end{aligned}$$
(2.23)

with \((x_i,Q^2_i,y_i)\) labelling the corresponding bin centres.

The observables \(\mathcal {O}_i \) in Eq. (2.23) are evaluated with YADISM [15, 54] interfaced to PineAPPL [55, 77] to return a fast interpolation grid admitting a generic PDF input, and with DGLAP evolution effects provided by EKO [53]. DIS structure functions are evaluated at next to next to leading order (NNLO) in the QCD expansion and account for both heavy quark and target mass effects. No higher-twist corrections are included. In particular, charm structure functions are evaluated in the FONLL general mass variable-flavour-number scheme [21, 78, 79] at \(\mathcal {O}\left( \alpha _s\right) \) accuracy. For the proton PDF fits, we assume a free isoscalar target N, while for the nuclear PDF one we allow for deviations from isoscalarity relevant for a tungsten nucleus.

To ensure consistency, the PDF set and other theoretical settings, such as the perturbative order and heavy quark scheme, adopted for the evaluation of Eq. (2.23) should be the same as those used in the fitting framework assessing their impact. For instance, when using the xFitter profiling of PDF4LHC21, one also needs to generate neutrino structure functions using PDF4LHC21 as input. This ensures that the generated pseudo-data are coherent with the prior PDF set used as baseline and avoids introducing artificial inconsistencies compromising the validity of the projection studies.

The central values for the pseudo-data, denoted by \(\mathcal {O}_i^{\mathrm{(exp)}} \), are obtained by fluctuating the reference theory prediction Eq. (2.23) by the corresponding fractional statistical and systematic uncertainties,

$$\begin{aligned} \mathcal {O}_i^{\mathrm{(exp)}} = \mathcal {O}_i^{\mathrm{(th)}} \left( 1+ r_i \delta _i^\textrm{tot} \right) , \quad i=1,\ldots ,N_\textrm{bin} , \end{aligned}$$
(2.24)

where the total experimental uncertainty is the sum in quadrature of statistical and systematic errors, accounting for a possible reduction factor in the latter,

$$\begin{aligned} \delta _{i}^\textrm{tot}&= \left( \left( \delta _i^\textrm{stat}\right) ^2 + \sum _{k=1}^{n_\textrm{sys}} \left( f_\textrm{corr} \times f_\textrm{red}^{(k)} \times \delta _{i,k}^\textrm{sys} \right) ^2\right) ^{1/2} \, , \nonumber \\&\quad \text { for } i=1,\ldots ,N_\textrm{bin}, \end{aligned}$$
(2.25)

and with \(r_{i}\) being univariate Gaussian random numbers. As mentioned above, \(f_\textrm{red}^{(k)}\) is a reduction factor modelling improvements in the experimental performance as compared to the baseline settings summarised in Sect. 2.1. The pseudo-data generated by means of Eq. (2.24), together with the corresponding covariance matrix computed according to Eq. (2.22), then define the inputs of the subsequent proton and nuclear PDF determinations.

2.6 PDF impact assessment

We consider two complementary approaches to assess the impact of the projected LHC neutrino data on the proton and nuclear PDFs: first, the Hessian profiling [34, 43,44,45] of prior proton and nuclear PDF sets, taken to be PDF4LHC21 [46] and EPPS21 [31], respectively; second, the direct inclusion in the NNPDF global analysis framework [28, 51].

The profiling method applied to Hessian PDF fits is based on minimising a goodness-of-fit error function defined as

$$\begin{aligned} \chi ^2&= \sum _{i=1}^{N_{\text {bin}}} {{\left( \mathcal {O}_i^\mathrm{(exp)} + \varGamma _i^{\alpha ,\text {exp}} b_\alpha ^{\text {(exp)}} - \mathcal {O}_i^\mathrm{(th)} - \varGamma _i^{\beta ,\text {th}} b_\beta ^{(\text {th})} \right) ^2 }\over { \left( \delta ^{\mathrm{(stat)}}\mathcal {O}_i^\mathrm{(th)}\right) ^2 }} \nonumber \\&\quad + \sum _\alpha \left( b_\alpha ^{(\text {exp})}\right) ^2 + T^2 \sum _\beta \left( b_\beta ^{\text {(th)}}\right) ^2 , \end{aligned}$$
(2.26)

with the pseudo-data \(\mathcal {O}_i^\mathrm{(exp)}\) defined in Eq. (2.24). The correlated uncertainties for the pseudo-data and for the theoretical prediction are contained in the nuisance parameter vectors \(b^{(\text {exp})}\) and \(b^{(\text {th})}\), respectively, where T is the tolerance factor, and the total uncorrelated uncertainty is \(\delta ^{\mathrm{(stat)}}\mathcal {O}_i^\mathrm{(th)}\).

The effect of the nuisance parameters on the observables \(\mathcal {O}_i^\mathrm{(exp)}\) and \(\mathcal {O}_i^\mathrm{(th)}\) is described by the matrices \(\varGamma _i^{\text {exp}}\) and \(\varGamma _i^{\text {th}}\). The indices \(\alpha \) and \(\beta \) then run over the uncertainty nuisance parameters for the pseudo-data and the theoretical prediction, respectively. The nuisance parameter values \(b_\beta ^{\text {(th,min)}}\) that minimise Eq. (2.26) give the central PDFs \(f'_0\) optimised to the profiled dataset in the form

$$\begin{aligned} f_0' = f_0 + \sum _\beta b_\beta ^{\text {(th,min)}} \left( {{f_\beta ^+ - f_\beta ^- }\over {2}} - b_\beta ^{\text {(th,min)}} {{f_\beta ^+ + f_\beta ^- - 2f_0}\over {2}} \right) , \end{aligned}$$
(2.27)

where \(f_0\) is the original central PDF, and the up and down variation eigenvectors are given by \(f^+, f^-\). The reduction in the uncertainties of the profiled PDFs indicates the impact of the projected data with respect to the assumed prior PDF set.

The profiling studies carried out in this work are performed using version 2.2.1 of the xFitter open-source QCD analysis framework [47,48,49,50]. To this end, a new interface between PineAPPL and xFitter has been developed and is made available in xFitter. All the experimental and theoretical data files used in the analysis, including the PineAPPL grids, are available from the public xFitter repository. For the proton PDF profiling, a tolerance of \(T^2 = 10\) is adopted, which corresponds approximately to the average tolerance used in the CT18 [29] and MSHT20 [30] determinations, the two Hessian sets entering the PDF4LHC21 combination [46], for one-sigma PDF uncertainties. For the Hessian profiling of EPPS21, a value of \(T^2 = 33\) is used, consistent with [31], and the resulting uncertainties are scaled down by a factor of 1.645 to obtain 68% confidence level intervals.

Concerning the inclusion of the LHC neutrino pseudo-data in the NNPDF proton analysis framework, we follow the procedure outlined in [51]. Fast interpolation tables (FK-tables) [80] combining EKO DGLAP evolution with YADISM DIS coefficient functions are computed using PineAPPL, see also [81]. Predictions for the fitted observables Eq. (2.23) are evaluated with NNPDF4.0 NNLO and included in the corresponding PDF determination alongside all other datasets already present in the global fit. We verify that in all cases, \(\chi ^2/n_\textrm{dat}\sim 1\) after the fit, as expected given the built-in consistency between the prior PDF fit and the generated pseudo-data.

3 Constraints on proton and nuclear structure

By following the strategy outlined in Sect. 2, we quantify the impact on the proton and nuclear PDFs of differential DIS cross-section measurements obtained with the LHC neutrino beam. Here we present results first for the Hessian profiling of the PDF4LHC21, then for the Monte Carlo fit NNPDF4.0, and finally for the nuclear PDFs of EPPS21, also by means of profiling.

First of all, we study the constraints on the PDFs provided by FASER\(\nu \) during the LHC Run III data taking period, showing that they are not able to improve the determination of PDFs. We then move to study the impact of the FPF experiments, focusing on FASER\(\nu \)2. We study the stability of the results with respect to the inclusion of systematic uncertainties, charm-tagged data, and lepton-charge separation. We also compare the impact of the different FPF experiments separately and provide results for their combination.

Fig. 5
figure 5

The fractional PDF uncertainties (at the 68% CL) at \(Q^2 = 10^4 \, \text {GeV}^2\) for the up and down valence quarks, gluon, total quark singlet, and total strangeness PDFs in the PDF4LHC21 baseline, compared to the results obtained once the FASER\(\nu \) and FASER\(\nu \)2 structure functions are included in the fit. In both cases we include charm-tagged structure functions and assume final-state lepton-charge separation. The FASER\(\nu \) projections are based on a Run III integrated luminosity of \(\mathcal {L}=150~\hbox {fb}^{-1}\)

3.1 Proton PDFs: impact on PDF4LHC21

We begin with the Hessian profiling of the PDF4LHC21 set. This proton PDF set is a Monte Carlo combination [82, 83] of three global PDF sets, CT18 [29], MSHT20 [30], and NNPDF3.1 [84]. Its Hessian representations are obtained by means of the reduction methodologies developed in [85,86,87]. Because it is based on the combination of three modern global PDF fits, PDF4LHC21 provides a conservative estimate of current uncertainties associated to our understanding of proton PDFs. We profile PDF4LHC21 with pseudo-data from various LHC neutrino experiments, and study the stability of the results with respect to variations in the profiling inputs.

Impact of the FASER\(\nu \) Run III measurements. Figure 5 shows the fractional uncertainties (at the 68% confidence level) at \(Q^2 = 10^4 \, \text {GeV}^2\) for the up and down valence quarks, gluon, total quark singlet, and total strangeness PDFs in the PDF4LHC21 baseline, compared to the results obtained once the FASER\(\nu \) and FASER\(\nu \)2 structure functions are added by means of Hessian profiling. In both cases we include charm-tagged structure functions and assume final-state lepton-charge separation. The FASER\(\nu \) projections are based on a Run III integrated luminosity of \(\mathcal {L}=150\) fb\(^{-1}\). As indicated by Table 2, by the end of Run III, one expects that FASER\(\nu \) will have recorded around 600 and 1800 electron and muon neutrinos, respectively, corresponding to deep-inelastic scattering events, of which around 100 and 300, respectively, are associated to charm-tagged events. We display results for the profiling in which the experimental covariance matrix considers only statistical uncertainties. We restrict the comparisons to the region \(10^{-3}\lesssim x \lesssim 0.7\) covered by the LHC neutrino experiments (see also Fig. 2).

From Fig. 5 we find that neutrino DIS measurements at FASER\(\nu \) are unable to improve PDF uncertainties as compared to the baseline scenario encapsulated by PDF4LHC21. The reason is twofold: the smaller event rates as compared to FASER\(\nu \)2 and the reduced coverage of the \((x,Q^2)\) phase space shown in Fig. 1. The differences in PDF sensitivity between FASER\(\nu \) and FASER\(\nu \)2 in Fig. 5 illustrate the importance of realising the FPF in order to exploit the full physics potential enabled by LHC neutrinos for QCD and hadron structure studies.

It should be emphasised that the lack of PDF sensitivity displayed in Fig. 5 does not imply that measuring DIS structure functions at FASER\(\nu \) will not provide useful information. First of all, our procedure assumes perfect compatibility between current data and the projected DIS neutrino measurements at the LHC, something which remains to be demonstrated experimentally. Second, FASER\(\nu \) still covers a region of neutrino energies unexplored by previous experiments, hence providing a new validation of our QCD calculations and of neutrino interactions at the TeV scale. Third, it would represent a non-trivial proof of concept that LHC neutrino differential measurements can be unfolded to the cross-section level to be used in theoretical interpretations, paving the way for and demonstrating the feasibility of subsequent neutrino DIS measurements at the FPF experiments.

Fig. 6
figure 6

Same as in Fig. 5 comparing the FASER\(\nu \)2 impact projections without and with systematic uncertainties accounted for in the experimental covariance matrix

Constraints from FASER\(\nu \)2. Figure 6 shows the same comparison as in Fig. 5, now for the FASER\(\nu \)2 impact projections without and with systematic uncertainties accounted for in the experimental covariance matrix, following the procedure spelled out in Sect. 2.4. The FASER\(\nu \)2 pseudo-data account for both inclusive and charm-tagged structure functions and assume outgoing lepton charge identification. We display results for the profiling in which the experimental covariance matrix considers only statistical uncertainties, as well as for the scenario where statistical and systematic errors are added in quadrature. In addition to a reduction of PDF uncertainties, the Hessian profiling results in general in a shift in the PDF central values. This shift is arbitrary, however, since it depends on the fluctuations entering the pseudo-data generation, and hence is ignored in the following.

Inspection of Fig. 6 reveals that, as opposed to the FASER\(\nu \) impact projections, measurements of DIS structure functions at FASER\(\nu \)2 would reduce PDF uncertainties on the quark and antiquark PDFs, while leaving the gluon essentially unaffected. As expected for a neutrino scattering experiment, its impact is most marked for those PDF combinations sensitive to quark flavour separation such as the up and down valence PDF and the total strangeness. Indeed, the reduction of PDF uncertainties is particularly significant for the latter, a consequence of the inclusion of charm-tagged structure functions in the fit. Given that all PDF determinations entering PDF4LHC21 already include existing neutrino DIS measurements, the fact that FASER\(\nu \)2 pseudo-data still manage to improve uncertainties highlights the new information provided by the LHC neutrino experiments.

By comparing the impact of the FASER\(\nu \)2 structure functions in the case where only statistical errors are considered with that where systematic uncertainties are also accounted for, one finds that the latter eventually become a limiting factor, but also that they not modify the qualitative findings of the statistics-only scenario. Indeed, while systematic uncertainties somewhat degrade the PDF sensitivity, they do not wash it out for any of the quark PDFs. Furthermore, in the projections presented in this work, we assume the performance parameters of Table 1, which however could be improved in the actual realisation of the experiments, using for instance detector improvements or different kinematic reconstruction techniques. We also note that the availability of complementary experiments accessing the same neutrino beam should allow their mutual cross-calibration, such that the combination of their data brings in more information than just the naive statistics scaling.

Fig. 7
figure 7

Same as Fig. 6 (statistical uncertainties only), now showing results in the scenario where charm-tagged structure function measurements are excluded from the analysis

Relevance of charm-tagged measurements. The analysis of Fig. 6 highlights that LHC neutrino data are particularly constraining for the poorly known strange PDF, which is one of the quark flavour combinations for which proton PDF fits differ the most [21]. To further investigate this point, Fig. 7 compares the impact of the FASER\(\nu \)2 data shown in Fig. 6 for the case in which only statistical uncertainties are considered, with the results of the same profiling once the charm-tagged structure function data are excluded from the fit. While differences are moderate for the up and down quark PDFs, the significant loss of information resulting from this exclusion of charm-tagged data is clearly visible for the strange PDF. Specifically in the region \(x\gtrsim 0.01\), the constraints on strangeness shown in Fig. 7 are mostly washed out in the absence of charm-tagged data. We thus establish that inclusive neutrino DIS measurements predominantly constrain the up and down quark and antiquark PDFs (and thus also the quark singlet), while the charm-tagged structure functions are responsible for most of the constraints provided on the total strangeness. The PDF reach of the LHC neutrino experiments would thus be markedly limited in experiments without charm identification capabilities.

Lepton-charge identification. The ability to identify the charge of the produced final-state lepton in charged-current neutrino scattering requires that an experiment be equipped with a powerful enough magnet to deflect this lepton within the detector fiducial volume. Our baseline results for FASER\(\nu \)2 in Fig. 6 assume that this charge identification is possible, and therefore include separate structure function datasets for neutrino and antineutrino projectiles. In order to ascertain the extent to which the constraints provided by FASER\(\nu \)2 structure functions depend on the availability of such a magnet, Fig. 8 compares the reduction of the PDF uncertainties using the FASER\(\nu \)2 data with and without the assumption of charged-lepton identification capabilities.

Fig. 8
figure 8

Same as Fig. 6 (statistical uncertainties only), now showing results in the scenario where the charge of the final-state charged lepton cannot be identified

This analysis finds that the lack of charged-lepton identification actually does not significantly degrade the PDF sensitivity of FASER\(\nu \)2. Having access to the lepton charge information slightly improves the constraints for the down and (to a lesser extent) the up valence quark PDFs, while there are no differences for the total quark singlet or for strangeness. This improvement can be understood by inspecting the leading-order decomposition of neutrino DIS structure functions in terms of the different PDF flavours for different targets, Eqs. (2.5)–(2.7). For an isoscalar target, as assumed here, structure functions are very similar for neutrino and antineutrino projectiles, with differences restricted to the strangeness asymmetry. Given that this strangeness asymmetry is quite small, this also explains why the impact on strangeness, which is driven by the charm-tagged data, is the same irrespective of whether one identifies the outgoing lepton charge. We conclude that LHC neutrino DIS data exhibits competitive PDF sensitivity even for detectors which cannot separate incoming neutrinos from antineutrinos.

Experiment dependence. Together with the impact projections for FASER\(\nu \)2, we have also produced analogous results for other proposed FPF experiments, specifically for AdvSND and FLArE. In the latter case, we consider the 10-tonne variant, FLArE10. Figure 9 compares the PDF sensitivity of FASER\(\nu \)2, in the scenario where systematic uncertainties are neglected, with the corresponding results for AdvSND and FLArE10, respectively. As summarised in Table 2, each of these experiments has different associated expected numbers of DIS events, namely 510k, 56k, and 110k (670k) inclusive muon-neutrino events for FASER\(\nu \)2, AdvSND, and FLArE10 (100), respectively, with 76k, 7k, and 15k (90k) in the charm-tagged case.

Fig. 9
figure 9

Same as Fig. 6 (statistical uncertainties only), comparing the projected PDF impact of FASER\(\nu \)2 with that of AdvSND (top panels) and FLArE10 (bottom panels)

In agreement with expectations, detectors with the largest event rates are those exhibiting the best PDF sensitivity. From Fig. 9 one can see that the three FPF experiments studied lead to a reduction of the uncertainties in the quark PDFs. By comparing their relative impact, we find that the constraints achieved by FASER\(\nu \)2 are somewhat better than for the other two experiments, consistent with the larger event yields obtained in the former. In the case of AdvSND, the smaller sample of charm-tagged events is responsible for the reduction in the constraints on strangeness. Another consequence of the smaller event rates in AdvSND and FLArE10 as compared to FASER\(\nu \)2 is the milder impact for the \(x\lesssim 10^{-2}\) region, which can only be covered once the integrated statistics become large enough, as indicated by the differential bin-by-bin yields displayed in Fig. 1.

Fig. 10
figure 10

Same as Fig. 6, now quantifying the projected PDF impact of the three FPF experiments added simultaneously to the analysis: FASER\(\nu \)2, AdvSND, and FLArE10

While Fig. 9 compares fits based on FPF pseudo-data with only statistical uncertainties, a more robust comparison between the PDF reach of the various FPF experiments also requires one to account for the systematic uncertainties, which ultimately become one of the limiting factors. Also, as mentioned above, cross-calibration between experiments should provide valuable input to enhance the PDF sensitivity.

Combined impact of the FPF experiments. We have also carried out a further PDF analysis simultaneously including the three FPF experiments considered here: FASER\(\nu \)2, AdvSND, and FLArE10. Figure 10 displays the combined impact of the FPF experiments when added on top of the PDF4LHC21 baseline by means of Hessian profiling, both in the statistics-only case and when systematic and statistical uncertainties are added in quadrature. Potential correlations between individual experiments are neglected in this exercise. The combined impact of the three FPF experiments on PDF4LHC21 shown in Fig. 10 is only moderately improved as compared to the results of the FASER\(\nu \)2-only profiling in Fig. 6. This shows that in this combination of experiments, the one with the largest individual sensitivity dominates, in this case FASER\(\nu \)2. Again, here we assume perfect consistency between theory and data, while within the actual experiments the consistency (or lack thereof) between individual measurements cannot be guaranteed.

The projections displayed in Fig. 10, and in particular the statistics-only case, represents a best-case scenario for the reduction of PDF uncertainties which can be expected from the analysis of neutrino DIS structure function data at the FPF, under the assumption that PDF4LHC21 represents our current knowledge about the quark and gluon structure of the proton.

The effect of neutrino flux uncertainties. The results presented so far assume the incoming far-forward neutrino flux as input, with vanishing uncertainties. One can estimate the impact that uncertainties associated to this neutrino flux input have on the PDF constraints by extending the procedure of [71], based on the predictions provided in [69, 72, 88,89,90,91,92,93,94,95,96,97,98,99,100], by introducing an additional fit parameter to account for the overall normalisation of the neutrino flux (while assuming the shape is unchanged with respect to the baseline predictions). The constraint for this flux normalisation parameter is obtained considering two scenarios: first, by using the expected Run III data from FASER\(\nu \), and second, by using 50% of the data expected at FASER\(\nu \)2 during the HL-LHC run. In the latter case, only the remaining 50% of FASER\(\nu \)2 data are then considered available for PDF determination, to avoid double counting.

The resulting flux normalisation parameter can be constrained approximately to a precision of 6% using FASER\(\nu \) Run III data, and to 0.4% using 50% of FASER\(\nu \)2 data, as illustrated in Fig. 11. These constraints are then incorporated into the profiling studies as additional fully correlated (across bins) uncertainties, also accounting for an increase in statistical uncertainty in the case where only 50% of data are used. The impact on the FASER\(\nu \)2 projections of the neutrino flux uncertainties, in the case of the PDFs resulting from using the Run III constraint, are shown in Fig. 12, and those obtained with the HL-LHC constraints in Fig. 13. In either case, the results indicate only a very small increase in the resulting PDF uncertainties. One may conclude that neutrino flux uncertainties do not significantly degrade the PDF constraining potential of the FPF experiments presented in this work.

Fig. 11
figure 11

The baseline uncertainty associated to the theoretical modelling of the far-forward muon neutrino fluxes [71] at FASER\(\nu \) (left) and FASER\(\nu \)2 (right panel), compared to the results of constraining the overall normalisation of this flux from FASER\(\nu \) data and from 50% of the expected FASER\(\nu \)2 data, respectively

Fig. 12
figure 12

Same as Fig. 6, now quantifying the impact on the FASER\(\nu \)2 results of accounting for the muon neutrino flux uncertainties, assuming its overall normalisation has been constrained with FASER\(\nu \) Run III data. See text for more details

Fig. 13
figure 13

Same as Fig. 6, now quantifying the effect of neutrino flux uncertainties in the analysis of FASER\(\nu \)2 pseudo-data, assuming that the neutrino flux normalisation is constrained using 50% of FASER\(\nu \)2 data to constrain the flux and the remainder for PDF determination

3.2 Proton PDFs: impact on NNPDF4.0

The PDF sensitivity studies based on the xFitter profiling of PDF4LHC21 are complemented by those based on their direct inclusion in the NNPDF4.0 analysis. Here we present results for the impact of the combined FPF neutrino measurements on NNPDF4.0, namely the analogous study of that shown in Fig. 10 in the case of PDF4LHC21. We have also produced results for the different variations studied in Sect. 3.1 with the NNPDF fitting methodology. Given that the findings obtained with the NNPDF fits are compatible with those obtained with xFitter, it is not necessary to duplicate them here, and we show results only for the impact of the combined FPF pseudo-data.

Figure 14 displays the same results as Fig. 10, now in the case of the fits obtained with the NNPDF4.0 fitting methodology. The baseline NNPDF4.0 NNLO analysis is compared with the results of the fits which include the combined FPF dataset, both in the statistics-only scenario and for the case where systematic uncertainties are also accounted for. As in the PDF4LHC21 analysis, the bands indicate the one-sigma PDF uncertainties. Note that we also show results for the charm PDF (bottom right panel), which in the NNPDF4.0 fit is also determined directly from the data entering the fit [101]. In this way, we can determine the possible constraints that neutrino DIS measurements at the LHC may impose on the intrinsic charm content of the proton [102].

Fig. 14
figure 14

Same as Fig. 10 for the results obtained using the NNPDF4.0 fitting methodology. The baseline NNPDF4.0 NNLO analysis is compared with the results of the fits which include the combined FPF dataset both in the statistics-only scenario and for the case where systematic uncertainties are also accounted for. As in the PDF4LHC21 case, the bands indicate the one-sigma PDF uncertainties. We now also show results for the charm PDF (bottom right panel), which in the NNPDF4.0 fit is determined from the data entering the fit

From Fig. 14 one finds that the impact on the PDFs found by direct inclusion of the FPF structure functions on the NNPDF4.0 fit is qualitatively consistent with that found in the the Hessian profiling of PDF4LHC21. In particular, the constraints are more significant for the up and down quark valence PDFs as well as for the total strangeness. As was the case with PDF4LHC21, the uncertainty reduction for the strangeness is particularly striking. Also, consistent with the results of Sect. 3.1, the gluon PDF is unchanged, and the impact on the total quark PDF is restricted to the large-x region. While one can also identify differences related to analysis choices such as the PDF parametrisation and the method used to assess the PDF error reduction, for example, concerning the small-x behaviour of \(xu_V\) and \(xd_V\), we conclude that, by and large, the projected impact of the FPF structure function measurements is independent of the specific fitting methodology adopted.

As indicated by Fig. 14, the FPF data are also expected to reduce the uncertainties associated to the fitted charm PDF in the large-x region dominated by the non-perturbative component. In order to highlight this impact on the large-x region of the PDFs, Fig. 15 presents the same comparison as in Fig. 14, now for the absolute PDFs at the initial parametrisation scale, \(Q=1.65~\hbox {GeV}\), using a linear scale, for the statistics-only scenario. One can observe the improved constraints on the charm PDF in the large-x region, further confirming the results of previous studies [9, 10] indicating the sensitivity of the FPF to the intrinsic charm content of the proton. This comparison also illustrates the excellent constraining power on the proton strangeness enabled by the FPF charm-tagged structure function data.

Table 4 Statistical estimators for the NNPDF4.0 NNLO baseline fit, compared to the variants including the combined FPF dataset shown in Figs. 14 and 15. From top to bottom: total \(\chi ^2/n_\textrm{dat}\), average over replicas of the training and validation figures of merit \(\left\langle E_\textrm{tr}\right\rangle _\textrm{rep}\) and \(\left\langle E_\textrm{val}\right\rangle _\textrm{rep}\), average \(\chi ^2/n_\textrm{dat}\) over replicas \(\left\langle \chi ^2/n_\textrm{dat}\right\rangle _\textrm{rep}\), \(\chi ^2/n_\textrm{dat}\) for datasets grouped by process. The last row displays the \(\chi ^2/n_\textrm{dat}\) obtained for the combined FPF dataset. For the baseline, we indicate the pre-fit values of \(\chi ^2/n_\textrm{dat}\) for the FPF dataset (in italics) computed without and with systematic errors in the covariance matrix

The statistical estimators for the NNPDF4.0 NNLO baseline fit, compared to the variants including the combined FPF dataset shown in Figs. 14 and 15, are reported in Table 4. The last row displays the \(\chi ^2/n_\textrm{dat}\) obtained for the combined FPF dataset, for which one finds \(\chi ^2/n_\textrm{dat} \sim 1\) up to fluctuations, consistent with the fact that the FPF pseudo-data are produced by construction to be compatible with the NNPDF4.0 baseline. For the same reason, the description of the other experiments entering the NNPDF4.0 global fit is not distorted by the inclusion of the FPF data. In particular, the \(\chi ^2/n_\textrm{dat}\) of the available (non-FPF) DIS neutral-current and charged-current datasets is unchanged (1.22 and 0.90, respectively) from the baseline to the fit with FPF pseudo-data. A moderate increase is found to the \(\chi ^2/n_\textrm{dat}\) of Drell–Yan processes, from 1.76 to 1.83, possibly because of the enhanced weight that DIS data carry in this fit variant.

Fig. 15
figure 15

Same as Fig. 14, now for the absolute PDFs at the initial parametrisation scale, \(Q=1.65~\hbox {GeV}\), focusing on the large-x region

Figure 16 displays a comparison of the PDF luminosity at \(\sqrt{s}=14~\hbox {TeV}\) as a function of the final-state invariant mass \(m_X\) between the baseline NNPDF4.0 fit and its variants including the combined FPF dataset, see Fig. 14 for the corresponding PDF comparison. Results are shown normalised to the central value of the NNPDF4.0 baseline. Specifically, we show the gluon-gluon, quark-antiquark, quark-quark, down-antiup, up-antistrange, and strange-antistrange luminosity values. The reduction of PDF uncertainties found in Figs. 14 and 15 is also visible for the partonic luminosities, specifically in the best-case scenario where only statistical uncertainties in the FPF pseudo-data are considered. The improved quark-antiquark luminosities at \(Q\sim 100~\hbox {GeV}\) benefit theoretical predictions for gauge-boson production at the LHC. It is also worth noting the PDF error reduction for the quark-quark luminosity at high invariant masses, which will feed into searches for new heavy particles at the TeV scale driven by quark-initiated scattering. Unsurprisingly, luminosities related to the strange PDF are those which improve the most, such as the \(u\bar{s}\) channel which provides a significant contribution to inclusive \(W^+\) production.

Fig. 16
figure 16

Comparison of the PDF luminosities evaluated at \(\sqrt{s}=14~\hbox {TeV}\) as a function of the final-state invariant mass \(m_X\) between the baseline NNPDF4.0 NNLO determination and its variants which include the full set of FPF pseudo-data, see Fig. 14 for the corresponding PDF comparison. Results are shown normalised to the central value of the NNPDF4.0 baseline. From left to right and top to bottom we show the gluon-gluon, quark-antiquark, quark-quark, down-antiup, up-antistrange, and strange-antistrange luminosity values. The bands in the NNPDF4.0 baseline indicate the one-sigma PDF uncertainties

3.3 Nuclear PDFs: impact on EPPS21

The studies presented in Sects. 3.1 (for PDF4LHC21) and 3.2 (for NNPDF4.0), from the point of view of PDF constraints, treat the neutrino scattering target as composed of isoscalar free nucleons. They hence neglect both nuclear modifications and non-isoscalar effects. We now revisit these analyses but account for the fact that the target material in the FASER\(\nu \)2 and AdvSND experiments is tungsten, with \(A=184\). In this exercise we do not include the FLArE structure function data, corresponding to a different target material (liquid argon, with \(A=40\)), although in an actual nuclear PDF fit, all FPF datasets would be included simultaneously. Nuclear modifications associated to a tungsten target, when compared to a free isoscalar nucleon, are not necessarily small, and will affect the event rate predictions for the FPF experiments. This fact also implies that measurements of differential neutrino cross-sections on heavy nuclear targets provide direct constraints on these nuclear modifications without relying on assumptions on their A dependence.

We quantify the impact of the FPF structure function measurements on the nuclear PDFs of tungsten by applying the same Hessian profiling of Sect. 3.1 to EPPS1, a global determination of nuclear PDFs that accounts for the constraints of existing datasets involving nuclei as target or projectiles, and based on the CT18 set as proton boundary condition. We note that EPPS21 already includes information from neutrino DIS on nuclear targets from the CHORUS and NuTeV experiments. EPPS21 is generally in good agreement with other recent nPDF fits such as nNNPDF3.0 [32] and nCTEQ15HQ [103].

The application of profiling to EPPS21 follows the same strategy as that for PDF4LHC21, with the caveat that its Hessian error sets also include the contribution from the uncertainties associated to their reference proton PDF set, in this case CT18. Given that the measured event rates depend on both the proton PDFs and the nuclear modifications, when profiling EPPS21 we also account for the Hessian sets associated to the proton PDF dependence. Also relevant for the subsequent discussion, EPPS21 adopts a tolerance factor of \(T^2 = 20\) when defining their 68% confidence level uncertainties, to be contrasted with \(T^2 = 10\) used in PDF4LHC21.

Fig. 17
figure 17

Same as Fig. 10, now corresponding to the Hessian profiling of EPPS21 for a tungsten nucleus, and excluding FLArE contributions

Figure 17 displays a similar comparison as that of Fig. 10, now corresponding to the Hessian profiling of EPPS21 for a tungsten nucleus. The FPF dataset is in this case composed of FASER\(\nu \)2 and AdvSND. One finds that the impact of FPF structure function measurements is most important for the strange PDF and to a lesser extent for the up and down valence quark PDFs. For the latter, the projected impact of FPF measurements appears to be somewhat milder as in the case of PDF4LHC21, especially once systematic uncertainties are taken into account.

While the main qualitative features of the nPDF profiling are consistent with the free-proton case, one explanation for the observed differences may be the use of a larger tolerance factor T in EPPS21 as compared to PDF4HC21, which effectively reduces the impact of new data in the fit. Another potential reason is the more restrictive functional form adopted in EPPS21 as compared to the PDF sets entering PDF4LHC21, a consequence of the smaller dataset available for hard scattering on nuclear targets. Given that these functional forms are fixed when applying Hessian profiling, assessing the impact on nuclear PDFs by a direct refit, as was done for NNPDF4.0, may be required to establish the impact of FPF structure function data on nPDFs. Additional results for the profiling of EPPS21 are presented in Appendix B.

4 Implications for Higgs and weak boson production

The reduction of PDF uncertainties made possible by neutrino DIS measurements at the FPF, discussed in Sect. 3, enables more precise theoretical predictions for core processes at the HL-LHC. Here we present an initial study of the phenomenological implications of the PDFs enhanced with LHC neutrino data for hard-scattering processes at proton colliders. Specifically, we present results for single and double gauge boson production, Higgs production in vector-boson fusion, and Higgs production in association with a vector boson. We focus on processes sensitive to the quark-quark and quark-antiquark initial states, given that LHC neutrinos do not provide information on the gluon PDF and hence they cannot inform gluon-initiated processes, such as top quark pair production or Higgs production in gluon fusion.

We adopt the same calculational settings as in the LHC phenomenology analysis considered in the PDF4LHC21 combination study [46] and provide predictions both for inclusive cross-sections, integrated in the fiducial region, and for differential distributions. We evaluate these cross-sections using matrix elements which include NLO corrections in both the QCD and electroweak coupling using mg5_amc@nlo [104] interfaced to PineAPPL [55]. For all processes, realistic selection and acceptance cuts on the final state particles have been applied. No further theoretical uncertainties are considered in this analysis, given that we do not aim to compare with experimental data.

Fig. 18
figure 18

Fiducial cross-sections for representative LHC processes at \(\sqrt{s}=14~\hbox {TeV}\) evaluated with NNPDF4.0 (upper panels) and PDF4LHC21 (bottom panels), compared with the fits including the FPF structure function projections. See [28, 46] for the calculational settings used to evaluate these cross-sections. For the baseline predictions, the dark (light) bands indicate the 68% (95%) CL uncertainties. The fits labelled “_FPF\(^*\)” are the ones considering statistical uncertainties, while those labelled “_FPF” also include systematic errors. In the fits with FPF data, the central values are set to be the same as in the corresponding baseline. We provide predictions for inclusive Drell–Yan production (\(Z, W^+, W^-\)), Higgs production in vector-boson fusion, Higgs associated production, and diboson production (\(W^+W^-\), \(W^+Z\), \(W^-Z\))

Figure 18 displays the fiducial cross-section for representative LHC processes at \(\sqrt{s}=14~\hbox {TeV}\) evaluated with NNPDF4.0 (top panels) and with PDF4LHC21 (bottom panels), compared with the results obtained from the corresponding fits including the FPF structure function projections. For the fits with FPF data, we display the variants where the covariance matrix consists only of statistical errors (“_FPF\(^*\)”) and where also systematic errors are accounted for (“_FPF”). For the baseline predictions, the dark (light) bands indicate the 68% (95%) CL uncertainties. The central values are set to be the same as in the baseline calculation in all cases. From top to bottom, we show inclusive Drell-Yan production in the Z, \(W^+\), and \(W^-\) channels, Higgs production in vector-boson fusion, Higgs associated production, and diboson production in the \(W^+W^-\), \(W^+Z\), and \(W^-Z\) final-states. The corresponding comparisons at the level of differential distributions are shown in Figs. 19 and 20 for NNPDF4.0 and PDF4LHC21, respectively. As done for the fiducial cross-sections of Fig. 18, we only indicate the relative PDF uncertainty in each fit.

Fig. 19
figure 19

Same as Fig. 18 for the corresponding differential distributions in the case of NNPDF4.0

Fig. 20
figure 20

Same as Fig. 19 but with PDF4LHC21

Inspection of Figs. 18, 19, 20 confirms the potential of LHC neutrino structure function measurements to improve theoretical predictions for Higgs, electroweak, and high-mass processes at the HL-LHC. In general, qualitative consistency is found between the results based on NNPDF4.0 and on PDF4LHC21, as was already the case at the PDF level. As compared to the best-case scenario where only statistical uncertainties are taken into account, the reduction of PDF errors in the LHC cross-sections obtained thanks to the FPF structure functions becomes less marked, but still visible for most of the processes under consideration, upon the inclusion of systematic uncertainties in the fit covariance matrix.

Concerning the fiducial integrated cross-sections, a decrease in PDF uncertainties is observed for all processes, including for single gauge boson production relevant for core HL-LHC analyses such as the \(m_W\) and \(\sin ^2\theta _W\) measurements. The specific improvement in precision depends on the underlying scattering reaction as well as on the range of x and \(Q^2\) covered by each process. In the case of Higgs associated production with vector bosons and in vector-boson scattering, driven by the quark-antiquark and quark-quark initial states, respectively, one observes that PDF uncertainties can be reduced by up to a factor of 2 thanks to the FPF measurements, for instance in the case of the \(hW^+\) and \(hW^-\) cross-sections, in the most optimistic scenario in which systematic uncertainties are neglected. This is similarly found for the diboson cross-sections, with in this case the largest improvement observed for the \(ZW^+\) and \(ZW^-\) final states, with a reduction of up to \(\sim 40\%\) in the PDF uncertainties.

In the case of the differential cross-sections displayed in Figs. 19 and 20, one observes how the impact of the FPF structure functions on LHC observables depends on the hard-scattering scale. For instance, searches for heavy resonances in the high-mass tail of the Drell-Yan distributions could be improved by the addition of the FPF data. Similar considerations apply for diboson prediction, and in the case of the \(ZW^{\pm }\) channel, we observe an improvement specifically in the low \(p_{T,\ell \bar{\ell }}\) region, which represents the bulk of the fiducial cross-sections. For the Higgs production processes, the PDF uncertainty in the theoretical predictions is relatively stable as a function of the rapidity. The effects of accounting for systematic uncertainties in the fit covariance matrix are somewhat more visible here as compared to the inclusive cross-sections, indicating that they affect mostly the tails, rather than the bulk, of the distributions, and in particular the large-x behaviour of the PDFs. This observation emphasises the importance of reducing systematic errors in the FPF measurements in order to enhance the cross-talk with HL-LHC analyses.

For completeness, Appendix A revisits the phenomenological studies presented in this section for the fiducial cross-sections in terms of the HL-LHC PDF impact projections of [34]. This comparison highlights the complementarity between the two experiments in terms of PDF constraints, being fully orthogonal and arising from different scattering processes. Furthermore, the FPF constraints benefit from the valuable property that an eventual contamination from new physics on the PDF fits can be neglected, since this process is driven by \(Q^2\) values outside the possible presence of BSM physics, at least concerning new heavy states. Therefore, the ultimate sensitivity on high-precision observables is obtained by integrating the constraints from both the HL-LHC and the FPF in the same global determination of PDFs.

5 Summary and outlook

In this work we have, for the first time, quantified the impact that measurements of high-energy neutrino DIS structure functions at the LHC would have on the quark and gluon structure of the nucleon. Our analysis requires the generation of DIS pseudo-data full differential in x, \(Q^2\), and \(E_\nu \) for the various ongoing and proposed far-forward neutrino LHC experiments, including the estimate of their associated systematic uncertainties. Consistent results are obtained from both the Hessian profiling of PDF4LHC21 and the direct inclusion of LHC neutrino structure functions in the NNPDF4.0 global fit, revealing a reduction of PDF uncertainties in the light quark sector, in particular concerning strangeness, as well as for the large-x charm PDF in the case of NNPDF4.0. We have assessed the robustness of these results upon removing charm-tagged data and final-state lepton-charge identification, as well as upon the combination of all FPF experiments within a single analysis.

We have also demonstrated the rich interplay between far-forward and central measurements at the HL-LHC by providing predictions for a range of Higgs and gauge boson production processes, both for integrated cross-sections in the fiducial region and for single-differential distributions. This phenomenological analysis suggests that a reduction of the PDF uncertainty by up to a factor of 2 may be within reach for some of these cross-sections in the most optimistic scenario. As was the case at the PDF level, for the HL-LHC projections, results based on PDF4LHC21 and NNPDF4.0 are in qualitative agreement.

Several avenues extending the results of this work may be foreseen. First of all, as the design of the proposed FPF experiments advances, it will be possible to derive more accurate estimates of the systematic uncertainties (and of their correlations) which eventually become the limiting factor. This will allow us to study whether improved detection methods, novel reconstruction techniques, i.e. based on deep learning, or combining information from different experiments can reduce the systematic uncertainties affecting the measurements. Second, the modelling of neutrino scattering at the LHC would benefit from the use of Monte Carlo event generators accounting for higher-order QCD corrections. As compared to the currently used LO generators, these would improve the description of the final-state kinematics, which in turn determine the acceptance rate of the reconstructed events. Furthermore, the predictions from such precise Monte Carlo generators should be folded with a full-fledged detector simulation in order to robustly determine selection efficiencies, such as those related to charm and D-meson tagging.

Third, while here we focus on the DIS region, ongoing and future LHC neutrino experiments also provide important information on shallow-inelastic scattering (SIS) at lower values of Q [15, 105], which in turn are relevant for inclusive cross-sections entering atmospheric and oscillation neutrino experiments. By following the approach presented in this work, it should be possible to quantify the improvements that LHC data provide on models of neutrino scattering in this poorly understood SIS region. Finally, here we have taken the incoming neutrino fluxes as an external input and neglected any associated uncertainties. However, measurements of these fluxes provide unique information on light and heavy forward hadron production in QCD, and in particular open a new window to the small-x gluon PDF. For this reason, ultimately one needs to simultaneously constrain the incoming fluxes and the neutrino scattering cross-sections from the measured event rates. Such joint interpretation would require extending the present analysis with a data-driven parametrisation of the neutrino fluxes, which could subsequently be compared with different theoretical predictions.

Our findings highlight how exploiting the LHC neutrino beam for hadron structure studies effectively provides CERN with a “Neutrino-Ion Collider”, a charged current-counterpart of the EIC, without changes in its accelerator infrastructure or additional energy costs. In addition to their intrinsic interest for hadronic science, measurements of neutrino structure function at the LHC provide a novel, and hitherto ignored, handle to inform theoretical predictions of hard-scattering cross-sections at the HL-LHC.

Together with this paper, we make public in Zenodo [106] the corresponding LHAPDF grids [107] for the PDF4LHC21 and NNPDF4.0 fits including FPF data. We also release the generated neutrino structure function pseudo-data and the corresponding theoretical calculations, for the different scenarios considered in this work. These projections for neutrino structure functions should be of relevance for a broad range of applications related to forward neutrino scattering at the LHC, from tests of lepton flavour universality at the TeV scale in the neutrino sector to probes of anomalous neutrino interactions and searches for sterile neutrinos distorting oscillation patterns.