1 Introduction

High precision physics at the LHC, especially in its high-luminosity era [1,2,3], will demand the inclusion of electroweak corrections in the computation of theoretical predictions for hard processes [4]. This requires an extension of the set of proton parton distribution functions (PDFs). In particular a photon PDF has to be provided, and evolution equations need to be supplemented with QED splitting functions. The photon PDF enables the inclusion of photon-initiated processes, which typically are enhanced in the high-mass and large transverse-momentum tails of the distributions. In principle, at high-enough scales, proton PDFs should also include PDFs for leptons [5], gauge bosons [6] and indeed for the full set of 52 standard model fields [7], or even new hypothetical particles such as dark photons [8]. Even at the LHC lepton PDFs are needed in searches for leptoquarks [9] or exotic resonances that couple to leptons [10]. However, in practice, the main requirement of current precision physics at the LHC is the availability of PDF sets that also include a photon PDF, that mixes upon combined QED\(\times \)QCD evolution with the standard quark and gluon PDFs. We will henceforth refer to such PDF sets as “QED PDFs”.

Initial attempts at the construction of QED PDF sets relied on models for the photon PDF at the initial evolution scale [11, 12]. A first data-driven determination of QED PDFs, based on a fit using the NNPDF2.3 methodology [13, 14], resulted in a photon PDF with large uncertainties. The fact that determining the photon PDF from the data yields a result affected by large uncertainties was more recently confirmed in determinations based on fitting to the data PDFs with a fixed functional form within the xFitter methodology [15] applied to high-mass ATLAS Drell–Yan distributions [16].

A breakthrough in the determination of QED PDFs was achieved in 2016 in Refs. [17, 18] (see also related results in Refs. [19, 20]), where it was shown that the photon PDF can be computed perturbatively in QED, given as input the proton structure functions at all scales, from the elastic (\(Q^2\rightarrow 0\)) to the deep-inelastic (\(Q^2\rightarrow \infty \)) regimes – the so-called LuxQED method. Since then, this LuxQED framework has been the basis of all QED PDF sets [21,22,23,24,25].

While the effect of the inclusion of the photon PDF on other PDFs is small, it is not negligible within current uncertainties. For instance, the photon typically carries a fraction of the proton momentum which is about two orders of magnitude smaller than that of the gluon, so the corresponding depletion of the gluon momentum fraction is relevant at the percent level, which is comparable to the size of the current uncertainty on the gluon PDF. Precision calculations of LHC processes with electroweak effects thus require a consistent global QED PDF determination.

With this motivation, we construct a QED PDF set based on the recent NNPDF4.0 PDF determination [26, 27], including QED corrections to parton evolution up to \({\mathcal {O}}\left( \alpha \alpha _s\right) \) and \({\mathcal {O}}\left( \alpha ^2\right) \). We follow closely the methodology developed for the NNPDF3.1QED PDFs in Ref. [21], based on an iterative procedure. Namely, we evaluate the photon PDF at some fixed scale using the LuxQED formula and the structure function computed from an existing PDF set, now NNPDF4.0. We evolve it together with the other PDFs to the initial parametrization scale \(Q_0=1.65\) GeV. We then re-determine the quark and gluon PDFs while also including this photon PDF as a boundary condition to the QCD\(\times \)QED evolution. We compute the photon PDF again using LuxQED, and we iterate until convergence.

All results presented in this paper are obtained using a new theory implementation, now adopted by default in the NNPDF public code [27]. In particular, the APFEL [28] and APFELgrid [29] codes have been replaced by a suite of newly developed tools including the evolution equations solver EKO [30], the DIS structure functions calculator YADISM [31], and the interpolator of hard-scattering cross-sections PineAPPL [32, 33]. Taken together, they provide a pipeline for the efficient automatization of the computation of theory predictions [34]. An integral component of this theory pipeline is the use of interpolation grids in the format provided by PineAPPL [32, 33], which can be interfaced to Monte Carlo generators. Among these is mg5_aMC@NLO [35], which automates NLO QCD+EW calculations for a wide range of LHC processes where knowledge of QED PDFs is crucial.

The outline of this paper is as follows. First, Sect. 2 reviews the theoretical framework underlying the NNPDF4.0QED determination and in particular the implementation of QED evolution in EKO. Then the NNPDF4.0QED PDFs are presented in Sect. 3, where they are compared to the previous NNPDF3.1QED PDF set, and to other recent QED PDF sets. Implications for LHC phenomenology are studied in Sect. 4 by means of the PineAPPL interface to mg5_aMC@NLO. Conclusions and an outline of future developments are finally presented in Sect. 5. Because this paper is, as mentioned, the first to make use of a new theory pipeline, and specifically its implementation of combined QCD\(\times \)QED evolution, a set of benchmarks is collected in two appendices. Specifically, in Appendix A we benchmark NNPDF4.0 PDFs determined using the old and new pipeline and show that they are indistinguishable; in Appendix B the new implementation of joint QCD and QED evolution of PDFs in the EKO code is discussed and benchmarked against the previous implementation in the APFEL code.

2 Evolution and determination of QED PDFs

In this section, we discuss the structure of combined QED\(\times \)QCD evolution and we briefly review the methodology used to determine QED PDFs. This methodology is based on the LuxQED formalism [17, 18], and was developed for the construction of the NNPDF3.1QED PDF set [21].

2.1 QCD\(\times \)QED evolution equations and basis choice

The scale dependence of PDFs is determined by evolution equations of the form

$$\begin{aligned}{} & {} \mu ^2 \frac{d f_i(N,\mu ^2)}{d \mu ^2}\nonumber \\{} & {} = - \sum _{j} \gamma _{ij}\left( N,a_s(\mu ^2),a_{em}(\mu ^2) \right) f_j(N,\mu ^2), \end{aligned}$$
(2.1)

where \(f_i(N,\mu ^2)=\int _0^1 dx\,x^{N-1}f_i(x,\mu ^2)\) is the Mellin transform of the i-th PDF. The anomalous dimensions \(\gamma _{ij}(N,a_s, a_{em})\) are determined as a simultaneous perturbative expansion in the strong coupling \(a_s=\alpha _s/(4 \pi )\) and in the electromagnetic coupling \(a_{em}=\alpha /(4 \pi )\):

$$\begin{aligned}&\gamma _{ij}(N,a_s, a_{em}) = \sum _{\begin{array}{c} n,m=0 \\ (n,m) \ne (0,0) \end{array}}^\infty a_s^n a_{em}^m \gamma _{ij}^{(n,m)}(N)\nonumber \\&\quad = a_s \gamma _{ij}^{(1,0)}(N) + a_s^2 \gamma _{ij}^{(2,0)}(N) + a_s^3 \gamma _{ij}^{(3,0)}(N) \nonumber \\&\qquad + a_{em}\gamma _{ij}^{(0,1)}(N) + a_{em}^2 \gamma _{ij}^{(0,2)}(N) + a_s a_{em}\gamma _{ij}^{(1,1)}(N) + \cdots . \end{aligned}$$
(2.2)

In this work we include pure QCD corrections up to NNLO, \(\gamma ^{(1,0)}\), \(\gamma ^{(2,0)}\), and \(\gamma ^{(3,0)}\) [36, 37]; the pure NLO QED corrections \(\gamma ^{(0,1)}\) and \(\gamma ^{(0,2)}\) [38]; and the leading mixed correction \(\gamma ^{(1,1)}\) [39].

The scale dependence of the strong and electromagnetic couplings is in turn determined by coupled renormalization group equations of the form

$$\begin{aligned}&\mu ^2 \frac{d a_s }{d \mu ^2} =\beta _{_{\textrm{QCD}}} (a_s, a_{em}) \nonumber \\&\quad =-a_s^2\left( \beta _{_{\textrm{QCD}}}^{(2,0)} + a_s \beta _{_{\textrm{QCD}}}^{(3,0)} + a_{em}\beta _{_{\textrm{QCD}}}^{(2,1)} + a_s^2 \beta _{_{\textrm{QCD}}}^{(4,0)}\cdots \right) , \end{aligned}$$
(2.3)
$$\begin{aligned}&\mu ^2 \frac{d a_{em}}{d \mu ^2} =\beta _{_{\textrm{QED}}} (a_{s}, a_{em}) \nonumber \\&\quad = -a_{em}^2 \left( \beta _{_{\textrm{QED}}}^{(0,2)} + a_{em}\beta _{_{\textrm{QED}}}^{(0,3)} + a_s \beta _{_{\textrm{QED}}}^{(1,2)} + \cdots \right) , \end{aligned}$$
(2.4)

in terms of the coefficients of the corresponding QCD and QED beta functions. Consistently with the treatment of evolution equations, we include pure QCD contributions up to NNLO, namely \(\beta _{_\textrm{QCD}}^{(2,0)}\), \(\beta _{_{\textrm{QCD}}}^{(3,0)}\), and \(\beta _{_\textrm{QCD}}^{(4,0)}\); pure QED up to NLO, namely \(\beta _{_\textrm{QED}}^{(0,2)}\) and \(\beta _{_{\textrm{QED}}}^{(0,3)}\); and the leading mixed terms \(\beta _{_{\textrm{QCD}}}^{(2,1)}\) and \(\beta _{_\textrm{QED}}^{(1,2)}\) [40]. We adopt the \(\overline{\textrm{MS}}\) scheme. Schemes in which the electroweak coupling does not run, such as the \(G_\mu \) scheme, are commonly used in the computation of electroweak corrections, but \(\overline{\textrm{MS}}\) is more convenient when considering combined QCD and QED corrections [41, 42]. Equations (2.12.4) must then be simultaneously solved with a common scale \(\mu \).

The solution is most efficiently obtained in a maximally decoupled basis in quark flavor space. This requires adopting a suitable combination of quark and antiquark flavors such that the sum over j in Eq. (2.1) contains the smallest possible number of entries. In the case of QCD-only evolution, this is achieved in the so-called evolution basis, in which one separates off the singlet combination \(\Sigma = \sum _i (q_i + \bar{q}_i)\), which mixes with the gluon, and then one constructs nonsinglet combinations of individual C-even (sea-like) and C-odd (valence-like) \(q^\pm _i=q_i\pm \bar{q}_i\) quark and antiquark flavors, each of which evolves independently.

Because the photon couples differently to up-like \(u_k = \{u,c,t\}\) and down-like \(d_k=\{d,s,b\}\) quarks, a different basis choice is necessary when also including QED. To this end, given \(n_f\) active quark flavors, we split them into \(n_u\) up-like and \(n_d\) down-like flavors, such that \(n_f=n_u + n_d\), and define the four combinations

$$\begin{aligned} \Sigma _u= & {} \sum _{k=1}^{n_u}u_k^+, \quad \Sigma _d=\sum _{k=1}^{n_d}d_k^+, \quad \nonumber \\ V_u= & {} \sum _{k=1}^{n_u}u_k^-, \quad V_d=\sum _{k=1}^{n_d}d_k^-. \end{aligned}$$
(2.5)

Also, we separate off the QCD contributions to the anomalous dimensions, by rewriting the perturbative expansion of the anomalous dimensions, Eq. (2.2), as

$$\begin{aligned} \gamma _{ij}(N,a_s,a_{em}) = \gamma _{ij}(N,a_s)+\tilde{\gamma }_{ij}(N,a_s,a_{em}), \end{aligned}$$
(2.6)

where \(\gamma _{ij}(a_s)\) contains the pure QCD contributions and \(\tilde{\gamma }_{ij}\) contains both the pure QED and the mixed QCD\(\times \)QED corrections.

The maximally decoupled evolution equations are then constructed as follows. The nonsinglet combinations

$$\begin{aligned} T_3^d&= d^+ - s^+, V_3^d = d^- - s^-,\nonumber \\ T_3^u&=u^+ - c^+, V_3^u =u^- - c^-, \end{aligned}$$
(2.7)
$$\begin{aligned} T_8^d&= d^+ + s^+ - 2b^+, V_8^d = d^- + s^- - 2b^-,\nonumber \\ T_8^u&= u^+ + c^+ - 2t^+, V_8^u = u^- + c^- - 2t^-, \end{aligned}$$
(2.8)

evolve independently according to nonsinglet evolution equations of the form

$$\begin{aligned} \mu ^2\frac{d}{d\mu ^2}T^{u/d}_{3/8}&= - (\gamma _{\mathrm{ns,+}} +\tilde{\gamma }^{\mathrm{ns,+}}_{u/d}) T^{u/d}_{3/8}, \end{aligned}$$
(2.9)
$$\begin{aligned} \mu ^2\frac{d}{d\mu ^2}V^{u/d}_{3/8}&= - (\gamma _{\mathrm{ns,-}} +\tilde{\gamma }^{\mathrm{ns,-}}_{u/d} )V^{u/d}_{3/8}. \end{aligned}$$
(2.10)

The valence sum and difference combinations, defined as

$$\begin{aligned} V = V_u + V_d, \quad V_\Delta = \frac{n_d}{n_u}V_u - V_d, \end{aligned}$$
(2.11)

satisfy coupled evolution equations

$$\begin{aligned}{} & {} \mu ^2\frac{d}{d\mu ^2} \begin{pmatrix} V \\ V_\Delta \end{pmatrix}\nonumber \\{} & {} =- \begin{pmatrix} \gamma ^{\textrm{ns},V}+\langle \tilde{\gamma }^{\textrm{ns},-}_{q} \rangle &{} \nu _u\tilde{\gamma }^{\textrm{ns},-}_{\Delta q}\\ \nu _d\tilde{\gamma }^{\textrm{ns},-}_{\Delta q} &{} \gamma ^{\textrm{ns},-}+\{ \tilde{\gamma }^{\textrm{ns},-}_{q} \} \end{pmatrix} \begin{pmatrix} V \\ V_\Delta \end{pmatrix}, \end{aligned}$$
(2.12)

in terms of the linear combinations of anomalous dimensions

$$\begin{aligned} \nu _{u/d}= & {} \frac{n_{u/d}}{n_f}, \quad \langle \tilde{\gamma }^{\textrm{ns},\pm }_{q} \rangle = \nu _u \tilde{\gamma }^{\textrm{ns},\pm }_{u}+\nu _d \tilde{\gamma }^{\textrm{ns},\pm }_{d}, \nonumber \\ \{ \tilde{\gamma }^{\textrm{ns},\pm }_{q} \}= & {} \nu _d \tilde{\gamma }^{\textrm{ns},\pm }_{u}+\nu _u \tilde{\gamma }^{\textrm{ns},\pm }_{d}, \tilde{\gamma }^{\textrm{ns},\pm }_{\Delta q} = \tilde{\gamma }^{\textrm{ns},\pm }_{u} - \tilde{\gamma }^{\textrm{ns},\pm }_{d}. \end{aligned}$$
(2.13)

Finally, the gluon and photon satisfy coupled evolution equations together with the quark singlet sum and difference combinations, defined as

$$\begin{aligned} \Sigma = \Sigma _u + \Sigma _d, \quad \Sigma _\Delta = \frac{n_d}{n_u}\Sigma _u - \Sigma _d. \end{aligned}$$
(2.14)

These coupled evolution equations read

$$\begin{aligned} \mu ^2\frac{d}{d\mu ^2} \begin{pmatrix} g \\ \gamma \\ \Sigma \\ \Sigma _\Delta \end{pmatrix} = -{\mathbf \Gamma } \begin{pmatrix} g \\ \gamma \\ \Sigma \\ \Sigma _\Delta \end{pmatrix}, \end{aligned}$$
(2.15)

with \({\mathbf \Gamma }\) a \(4\times 4\) anomalous dimension matrix of the form

$$\begin{aligned} {\mathbf \Gamma } = \begin{pmatrix} \gamma _{gg}+\tilde{\gamma }_{gg} &{} \tilde{\gamma }_{g\gamma } &{} \gamma _{gq} + \langle \tilde{\gamma }_{gq}\rangle &{} \nu _u\tilde{\gamma }_{g \Delta q} \\ \tilde{\gamma }_{\gamma g} &{} \tilde{\gamma }_{\gamma \gamma } &{} \langle \tilde{\gamma }_{\gamma q} \rangle &{} \nu _u \tilde{\gamma }_{\gamma \Delta q} \\ 2n_f (\gamma _{qg} +\langle \tilde{\gamma }_{qg} \rangle ) &{} 2 n_f \langle \tilde{\gamma }_{q \gamma } \rangle &{} \gamma _{qq} + \langle \tilde{\gamma }^{\textrm{ns},+}_{q} \rangle + \langle e^2_q\rangle ^2\tilde{\gamma }_{\textrm{ps}} &{} \nu _u\tilde{\gamma }^{\textrm{ns},+}_{\Delta q} + \nu _ue^2_{\Delta q}\langle e^2_q\rangle \tilde{\gamma }_{\textrm{ps}}\\ 2n_f \nu _d \tilde{\gamma }_{\Delta qg} &{} 2n_f \nu _d \tilde{\gamma }_{\Delta q\gamma } &{} \nu _d\tilde{\gamma }^{\textrm{ns},+}_{\Delta q} + \nu _d e^2_{\Delta q}\langle e^2_q\rangle \tilde{\gamma }_{\textrm{ps}} &{} \gamma ^{\textrm{ns},+} + \{ \tilde{\gamma }^{\textrm{ns},+}_{q} \} + \nu _u \nu _d (e^2_{\Delta q})^2\tilde{\gamma }_{\textrm{ps}} \end{pmatrix}, \end{aligned}$$

where \(\tilde{\gamma }^{\textrm{ps}}_{qq'}=e^2_q e^2_{q'} \tilde{\gamma }_{\textrm{ps}}\) [38], and the combinations \(\langle \tilde{\gamma }_{gq} \rangle \), \(\langle \tilde{\gamma }_{qg} \rangle \), \(\tilde{\gamma }_{g \Delta q}\), \(\tilde{\gamma }_{\Delta q g}\), \(\langle \tilde{\gamma }_{\gamma q} \rangle \), \(\langle \tilde{\gamma }_{q \gamma } \rangle \), \(\tilde{\gamma }_{\gamma \Delta q}\), and \(\tilde{\gamma }_{\Delta q \gamma }\) are constructed analogously to those listed in Eq. (2.13).

The basis defined by Eqs. (2.7), (2.8), (2.11) and (2.14) is denoted as the unified evolution basis. Compared to the basis used in APFEL [28] to solve the QCD\(\times \)QED evolution equations, our definitions of \(\Sigma _\Delta \) and \(V_\Delta \) differ due to the prefactors \(n_d/n_u\) that make the basis fully orthogonal. In the presence of scale-independent intrinsic heavy quarks (e.g. a charm-quark PDF in a three-flavor scheme [43]), a further decomposition needs to be applied [30].

The unified flavor basis has been implemented in the EKO code [30], which, as discussed in the introduction, is now used to solve evolution equations as part of the new theory pipeline. The basic ingredient of the EKO code is the construction of evolution kernel operators \(E(Q^2 \leftarrow Q_0^2)\) (EKOs) such that

$$\begin{aligned} f(Q^2)= E(Q^2 \leftarrow Q_0^2) f(Q_0^2), \end{aligned}$$
(2.16)

where \(f(Q^2)\) is a vector whose components are all PDF flavors, including all active quark and antiquark flavors in a suitable basis, the photon and the gluon. Formally, the EKOs are given by

$$\begin{aligned} E(Q^2 \leftarrow Q_0^2)= & {} {\mathcal {P}} \exp \nonumber \\{} & {} \times \left( -\int _{Q_0^2}^{Q^2} \frac{\textrm{d}\mu ^2}{\mu ^2}\, \gamma \left( a_s(\mu ^2), a_{em}(\mu ^2)\right) \right) , \end{aligned}$$
(2.17)

where \(\gamma \) is the full matrix of anomalous dimensions, and \({\mathcal {P}}\) denotes path ordering. In the unified evolution basis the evolution kernel is a block diagonal matrix, with individual diagonal entries for the nonsinglet combinations Eqs. (2.7, 2.8) (without path-ordering), a \(2\times 2\) block for the valence combinations Eq. (2.11), and a \(4\times 4\) block in the singlet sector.

A variety of implementations of the solution of the evolution equations, each of those corresponding to a determination of the EKO, and which differ by higher-order terms, are available in the EKO code. These are discussed in Ref. [30]. At NNLO in QCD, the solutions are essentially indistinguishable in the data region. The solution adopted here is the iterated-exact (EXA) of Ref. [30], while the truncated (TRN) solution was adopted for the NNPDF3.1QED [21] and NNPDF4.0 [26] PDF sets, using the APFEL [28] implementation. These different implementations (at NNLO) are benchmarked and shown to be equivalent in Appendix B, where we also discuss the motivations for this choice.

2.2 Construction of the photon PDF

As mentioned in Sect. 1, the photon PDF is determined using the LuxQED [17, 18] formalism, implemented in a PDF fit using the same methodology as the one used for the NNPDF3.1QED PDF set [21], but now with the new theory pipeline. The LuxQED result amounts to proving that the photon PDF is perturbatively determined in QED by knowledge of the proton inclusive structure functions \(F_2\) and \(F_L\):

$$\begin{aligned} \begin{aligned} x \gamma (x, \mu ^2)&= \frac{2}{a_{em}(\mu ^2)} \int \limits _x^1 \frac{dz}{z} \Biggl \{ \int _{m_p^2x^2 \over (1-z)}^{\mu ^2 \over (1-z)} \frac{dQ^2}{Q^2} a_{em}^2(Q^2) \\&\quad \times \Biggl [ -z^2 F_L(x/z, Q^2) \\&\quad + \left( z P_{\gamma q}(z) + \frac{2 x^2 m_p^2}{Q^2} \right) F_2(x/z, Q^2)\Biggr ] \\&\quad - a_{em}^2(\mu ^2) z^2 F_2(x/z, \mu ^2)\Biggr \}, \end{aligned} \end{aligned}$$
(2.18)

where \(m_p\) is the mass of the proton and \(P_{\gamma q}\) is the photon-quark splitting function. Equation (2.18) holds in the \(\overline{\textrm{MS}}\) scheme, including terms of order \(a_{em}\) and \(a_{em}^2\ln \mu ^2/m_p^2\), times the accuracy of the QCD determination of the structure functions.

The integration over the scale \(Q^2\) and Bjorken-x dependence of the structure functions \(F_i(x,Q^2)\) in Eq. (2.18) includes four different regions and corresponding contributions to the structure functions: an elastic contribution at \(x=1\), a resonance contribution when x is large and close to \(x=1\), an inelastic non-perturbative contribution at low \(Q^2\) and intermediate x and an inelastic perturbative region for intermediate x and large \(Q^2\). The first three contributions must be determined by fits to lepton-proton scattering data, but the latter contribution may be determined by expressing the structure functions through perturbative QCD factorization in terms of quark and gluon PDFs. In a global PDF determination, one may choose to determine the photon PDF using the LuxQED formula Eq. (2.18) at a single chosen scale \(\mu ^2=Q_\gamma ^2\), and then evolve jointly the photon and all other PDFs through the QCD\(\times \)QED evolution equations discussed in Sect. 2.1. An alternative option is to determine the photon PDF using Eq. (2.18) at all scales. The two choices are equivalent because the LuxQED photon Eq. (2.18) satisfies the joint QED\(\times \)QCD evolution equations to the accuracy of the LuxQED formula itself, though they differ by higher-order corrections and also due to the fact that the LuxQED photon is partly determined from a parametrization of data. The former choice was made for the construction of our previous NNPDF3.1QED set [21], which we follow here, also in the choice of parametrization of the structure functions in the non-perturbative region. The same choice was made in Refs. [21,22,23,24,25], though in Refs. [24, 25] a PDF set in which the photon PDF is determined using Eq. (2.18) at all scales was also presented.

Fig. 1
figure 1

Breakdown of the photon PDF \(\gamma (x,\mu ^2)\) into the contributions coming from different contributions to the proton structure functions \(F_i(x,Q^2)\) that determine it according to the LuxQED formula Eq. (2.18). The result is shown as a function of x for four different choices of the scale \(\mu =Q_\gamma \): \(Q_\gamma =1.65\) GeV (top left), 10 GeV (top right), 100 GeV (bottom left) and 500 GeV (bottom right)

The relative size of the elastic, inelastic non-perturbative (including resonances) and elastic perturbative contributions to \(\gamma (x,\mu ^2)\) is displayed in Fig. 1 as a function of x for four different choices of the scale \(\mu =Q_\gamma \) at which the LuxQED formula Eq. (2.18) is used to determine the photon (see also Fig. 18 in Ref. [18]). If the LuxQED formula is used at the scale \(Q_\gamma =1.65\) GeV at which the NNPDF4.0 PDFs are parametrized, the photon is entirely determined by the elastic and non-perturbative contributions, but as the scale is increased, an increasingly large contribution comes from the perturbative region. By the time a value \(Q_\gamma > rsim 100\) GeV is reached, the photon at small \(x\lesssim 10^{-3}\) is almost entirely determined by the perturbative contribution, while the elastic and non-perturbative contributions only remain dominant at large \(x > rsim 0.1\) where however the photon PDF is tiny. Choosing a large value of \(Q_\gamma \) has the dual advantage that the LuxQED result is more accurate at high scale, because it includes contributions of order \(a_{em}\) and \(a_{em}^2\ln \mu ^2/m_p^2\), and also that one is then mostly relying on the accurate perturbative determination of the structure functions, that exploits global information on quarks and gluon PDFs and not just the lepton-proton scattering data. As in Ref. [21], we choose \(\mu =Q_\gamma =100\) GeV, as also advocated in Ref. [18].Footnote 1 We will discuss the dependence of our results on the choice of scale \(Q_\gamma \) in Sect. 3.

Equation (2.18) must be viewed as a constraint on the set of photon, quark and gluon PDFs that are simultaneously determined. Because of the small impact of the photon PDF on the other PDFs, it was suggested in Refs. [18, 21] that the constraint can be implemented iteratively. Namely, we first determine the photon PDF using Eq. (2.18) by means of structure functions that are determined from an existing PDF set at a given scale \(\mu =Q_\gamma \). This photon PDF is then evolved, by solving joint QCD\(\times \)QED evolution equations with the unchanged given set, to a chosen PDF parametrization scale where it is taken as fixed. All other PDFs are then re-determined, with the constraint that the momentum sum rule now also includes a contribution from the given (fixed) photon, i.e.

$$\begin{aligned} \int _0^1 dx\, \left( x\Sigma (x,Q^2) + xg(x,Q^2) + x\gamma (x,Q^2) \right) =1.\nonumber \\ \end{aligned}$$
(2.19)

The photon PDF at a scale \(\mu =Q_\gamma \) is then determined again from Eq. (2.18), in which structure functions are obtained from the new fit. The procedure is iterated until convergence, which was achieved in two iterations in the NNPDF3.1QED determination [21].

Here we follow the same procedure, starting with a re-determination of the NNPDF4.0 PDF set with the new pipeline, which is compared and shown to be equivalent to the published NNPDF4.0 in Appendix A.

3 The NNPDF4.0QED parton distributions

We present here the NNPDF4.0QED PDFs: we first summarize our procedure, then discuss the effect on fit quality of the inclusion of the photon PDF, examine the photon PDF itself, also in comparison to other determinations, and finally study the photon momentum fraction.

3.1 Construction of the NNPDF4.0QED parton set

As mentioned in Sect. 2 all the methodological aspects and settings of the PDF determination are the same as used for the underlying pure QCD NNPDF4.0 PDF [26], but now using a new theory pipeline. Even if in principle theory predictions should be independent of implementation details, in practice differences may arise, e.g. due to issues of numerical accuracy. Also, in the process of transitioning to this new pipeline, a few minor bugs in data implementation were uncovered and fixed. Finally, the new pipeline includes a new implementation of heavy-quark mass effect in deep-inelastic structure functions that differs by subleading terms from the previous one. Benchmarks showing the equivalence of the old and new pipeline are briefly presented in Appendix A. Because of this equivalence, NNPDF4.0 PDF replicas produced with the new pipeline should be considered equivalent to the published ones, and indeed for phenomenological applications (specifically those presented in Sect. 4) we will compare the results obtained using NNPDF4.0QED PDFs to pure QCD results obtained using the published NNPDF4.0 replicas. Nevertheless, in this section only, for all comparisons between pure QCD and QCD\(\times \)QED, we will use NNPDF4.0 pure QCD replicas generated using the new pipeline, in order to avoid even small confounding effects. Note however that, as discussed in the end of Sect. 2.1, the QED PDFs are based on the EXA solution of the evolution equations, that differs by subleading terms from the TRN solution used in the pure QCD fit. The effect of this difference is assessed in Appendix B and is very small at NNLO. However, all comparisons between pure QCD and QED PDFs also include the effect of this change.

The NNPDF4.0QED PDFs are determined at NLO and NNLO by supplementing with a photon PDF the pure QCD PDF set, according to the methodology outlined in the previous section. However, all theory predictions are obtained as in the pure QCD determination: hence in particular no photon-induced contributions are included, and thus the only effect of the inclusion of a photon PDF is through its mixing with other PDFs. This is justified because the NNPDF4.0 dataset was constructed including cuts that remove all datapoints for which the effect of electroweak corrections is larger than the experimental uncertainties (see Ref. [26, Sect. 4.1]). The inclusion of the electroweak corrections, which will allow relaxing these cuts, is left for future work. As in Ref. [21], the final PDF set is obtained after two iterations, with a third iteration providing a check of convergence.

Table 1 Statistical estimators for NNPDF4.0QED NLO and NNLO, compared to NNPDF4.0 pure QCD. From top to bottom: total \(\chi ^2\) per number of data points, average over replicas of the training and validation figures of merit \(\left\langle E_{\textrm{tr}}\right\rangle _{\textrm{rep}}\) and \(\left\langle E_\textrm{val}\right\rangle _{\textrm{rep}}\), average \(\chi ^2\) over replicas \(\left\langle \chi ^2\right\rangle _{\textrm{rep}}\), \(\chi ^2\) for datasets grouped by process. The total number of data points is 4424 (4616) at NLO (NNLO)

3.2 Fit quality

Table 1 displays the statistical estimators obtained using a set of 100 NNPDF4.0QED NLO and NNLO PDF replicas, compared to their QCD-only counterparts, generated using the new theory pipeline. Specifically, we show the \(\chi ^2\), for both the full dataset and for datasets grouped by process; for the full dataset we also show the average over replicas of the training and validation figures of merit \(\left\langle E_{\textrm{tr}}\right\rangle _{\textrm{rep}}\) and \(\left\langle E_{\textrm{val}}\right\rangle _{\textrm{rep}}\), and the average \(\chi ^2\) over replicas \(\left\langle \chi ^2\right\rangle _{\textrm{rep}}\), all as defined in Table 9 of Ref. [45]. Note that \(\chi ^2\) and \(\left\langle \chi ^2\right\rangle _{\textrm{rep}}\) are computed using the experimental covariance matrix, while, as in all NNPDF determinations, the figure of merit used for minimization is computed using the \(t_0\) covariance matrix [46].

It is clear from Table 1 that all estimators are essentially unchanged by the inclusion of QED corrections, with slightly larger differences seen at NLO than at NNLO: the impact of QED effects on fit quality is negligible, both globally and for individual processes. Specifically, the training and validation figures of merit in the pure QCD and QCD\(\times \)QED determinations differ by less than one sigma. As mentioned, these differences also include the effect of switching from the TRN solution of evolution equations in the pure QCD fit to the EXA solution. The impact of this change is yet smaller than that of the QED corrections, but it slightly reduces it.

3.3 The photon PDF

In Fig. 2, the NNLO NNPDF4.0QED photon PDF at \(Q=100\) GeV is compared to its counterpart in our previous NNPDF3.1QED [21] set, and in the recent QED PDF sets MSHT20QED [23], and CT18QED [25].Footnote 2 Here and elsewhere in this paper all uncertainties correspond to 1\(\sigma \). Results agree at the percent level, despite the fact that quark and gluon PDFs in these sets can display much larger differences. This is a consequence of the fact that in all these PDF sets the photon PDF is determined with the LuxQED formalism in terms of the proton structure function, and that the latter, in turn, is well constrained by experimental data both at high and low scale. In fact, we have checked that the dominant contribution to the difference between the NNPDF3.1QED and NNPDF4.0QED photon PDFs seen in Fig. 2 is due to the difference between the TRN and EXA solutions of evolution equations (see Sect. 3.1 and Appendix B), i.e. to higher-order corrections, with the residual difference being due to the change in the PDFs used in order to compute the structure function. The fact that the 3.1 and 4.0 PDFs are compatible within uncertainties is thus a consequence of the fact that the uncertainties due to higher-order corrections and to PDFs are correctly accounted for by the LuxQED construction, while the NNPDF4.0 PDFs are backward-compatible with the NNPDF3.1 PDFs [26].

Fig. 2
figure 2

Top left: the photon PDF at \(Q=100\) GeV in NNPDF4.0QED NNLO compared to its NNPDF3.1QED counterpart as a ratio to the central value of the former. Top right: the relative PDF uncertainties on the photon PDF in these two determinations. Bottom: same as top panels now comparing NNPDF4.0QED with MSHT20QED [23] and CT18QED [24] (all NNLO). Bands correspond to 1\(\sigma \) uncertainties

The uncertainty on the photon PDF is completely dominated by theoretical uncertainties on the LuxQED procedure, which include [18] missing higher order corrections, uncertainties on the experimentally measured low-scale structure function and so on. In our uncertainty determination we follow Ref. [18]; MSHT also mostly follows this reference, with an extra higher-twist contribution due to the low choice of scale \(Q_\gamma \), [22] and indeed finds a very similar uncertainty. A somewhat more conservative uncertainty estimate is provided by CT18QED [25], which also adopts a somewhat different determination of the elastic contribution.

Fig. 3
figure 3

Scale dependence of the central photon PDF in the NNPDF4.0QED, NNPDF3.1QED, MSHT20QED, and CT18QED PDF sets. The NNPDF4.0QED result in which the photon is determined at \(Q=10\) GeV (denoted as NNPDF4.0QED*) is also shown

In Fig. 3 the central value of the photon PDF in all these sets is compared at different scales, using the native scale dependence of each PDF set. Uncertainties are not shown in order not to clutter the plot. Note that differences in the scale dependence of the PDFs compared in the figure also arise due to somewhat different treatments of the QCD\(\times \)QED evolution equations. Specifically, the NNPDF3.1QED PDF set adopts a numerical implementation of the TRN solution (see Appendix B). However, one would expect differences to be mostly driven by the mixing with the quark and gluon PDFs. The fact that differences grow at low scales suggests that this is indeed the case. Even so, all photon PDFs agree within about 3% for all \(x > rsim 10^{-3}\), even at the lowest scale \(Q=1.65\) GeV.

In Fig. 3 we also show the central photon PDF which is found by repeating the NNPDF4.0QED determination with the photon PDF determined at a scale \(Q_\gamma =10\) GeV instead of the default \(Q_\gamma =100\) GeV. If a low value of \(Q_\gamma \) is adopted, the upper limit of integration in \(Q^2\), Eq. (2.18), is accordingly lower. As discussed in Sect. 2.2 in such a case a sizable contribution to the LuxQED formula comes from the low-scale region in which the structure function is determined from a fit to the data, and \(\mathcal{O}\left( \frac{m_p}{Q}\right) \) corrections to the LuxQED formula may then become relevant. We see that the shift in central photon PDF which is found by making this choice instead of the default one is at most of the order of the uncertainty on the photon PDF. Again, this shows that the uncertainty on the LuxQED procedure is correctly estimated.

Fig. 4
figure 4

Comparison of PDFs in the NNPDF4.0QED and the NNPDF4.0 (pure QCD) sets, shown as a ratio to the former at \(Q=100\) GeV. Bands correspond to 1\(\sigma \) uncertainties. From left to right and from top to bottom, the up, anti-up, down, anti-down, strange, and gluon PDFs are shown

We finally assess the impact of the inclusion of a photon PDF on the other PDFs, by comparing the NNLO NNPDF4.0QED and NNPDF4.0 (pure QCD) PDFs. In Fig. 4 we present this comparison at \(Q=100\) GeV. It is clear that the impact of the inclusion of the photon is moderate, with the NNPDF4.0 (pure QCD) and NNPDF4.0QED PDFs generally differing by less than one sigma and always in agreement within uncertainties. The largest effects are seen in the gluon PDF, which is suppressed at the percent level due to the momentum fraction transferred from the gluon to the photon.

3.4 The photon momentum fraction

The main impact of the photon on other PDFs is through its contribution to the momentum sum rule, Eq. (2.19). We quantify this contribution by evaluating the photon momentum fraction

$$\begin{aligned} M\left[ \gamma (Q)\right] \equiv \int _0^1 dx\, x\gamma (x,Q). \end{aligned}$$
(3.1)

In Fig. 5 the momentum fractions carried by the photon and by the gluon PDFs in the NNPDF4.0QED set are shown (in percentage) as a function of scale. For the photon the result is compared to that of NNPDF3.1QED, and for the gluon to that of NNPDF4.0 (pure QCD). The photon momentum fraction in the NNPDF3.1QED and NNPDF4.0QED PDF sets is essentially the same: the photon carries around 0.2% of the proton momentum at a low (Q\(\sim \)1 GeV) scale, growing logarithmically with Q up to around 0.6% at the multi-TeV scale. The momentum fraction carried by the gluon is reduced by a comparable amount upon inclusion of the photon.

Fig. 5
figure 5

Left: the percentage momentum fraction carried by the photon PDF \(\gamma (x,Q^2)\) in NNPDF4.0QED and in NNPDF3.1QED as a function of the scale Q, where the bands indicate 1\(\sigma \) uncertainties. Right: same for the momentum fraction carried by the gluon PDF in NNPDF4.0QED and in NNPDF4.0 (pure QCD)

4 Implications for LHC phenomenology

We now study the phenomenological implications of the NNPDF4.0QED PDF set. First we compare parton luminosities to those computed using other QED PDF sets. Then we assess the impact of QED corrections on selected processes, by comparing to the pure QCD case calculations that include photon-induced contributions, and also by directly comparing results obtained using NNPDF4.0 (pure QCD) and NNPDF4.0QED PDFs.

4.1 Luminosities

The phenomenological effect on the parton luminosity of the inclusion of a photon PDF is both direct, through the presence of photon-induced partonic channels, and indirect, through the effect of the photon on other PDFs, mostly through the depletion of the gluon that is necessary in order to preserve the momentum sum rule, as discussed in Sect. 3.33.4. The photon-induced contributions for NNPDF4.0QED NNLO are compared in Fig. 6 to their counterpart in NNPDF3.1QED (top) and in MSHT20QED and CT18QED (bottom). The level of agreement is high and directly follows from that seen between the respective photon PDFs in Fig. 2

Fig. 6
figure 6

The photon-induced contributions to the luminosity at the LHC with \(\sqrt{s}=14\) TeV as a function of the invariant mass \(m_X\) for NNPDF4.0QED NNLO compared to its NNPDF3.1QED counterpart (top) and compared to MSHT20QED and CT18QED, all shown as a ratio to NNPDF4.0QED

Fig. 7
figure 7

Comparison of parton luminosities in the NNPDF4.0QED and the NNPDF4.0 (pure QCD) sets, shown as a ratio to the former for the LHC with \(\sqrt{s}=14\) TeV. From left to right and from top to bottom the gluon-gluon, gluon-quark, quark-antiquark and quark-quark luminosities are shown

The luminosities for all other parton channels are shown in Figs. 7, 8, where we compare NNPDF4.0QED both to the previous set NNPDF3.1QED and to the pure QCD NNPDF4.0 (Fig. 7) and to the MSHT20QED and CT18QED sets (Fig. 8). Because, as shown in Sect. 3, the effect of the inclusion of the photon on the other PDFs is moderate, the comparison between the two QED sets is very similar to the comparison between the pure QCD NNPDF4.0 and NNPDF3.1 shown in Fig. 9.1 of Ref. [26], where it was shown that NNPDF4.0 are backward compatible, i.e. generally agree with NNPDF3.1 within uncertainties. The effect of the inclusion of QED corrections is, as expected, mostly seen in the gluon–gluon channel, where it leads to a suppression of a few percent in the \(m_X\sim 100\) GeV region in order to account for the transfer of a small amount of momentum to the photon. The comparison to other PDF sets is dominated by the differences in the quark and gluon PDFs, which are rather more significant than the difference in the photon PDF, and thus very similar to the corresponding comparison of pure QCD luminosities shown in Fig. 9.3 of Ref. [26].

Fig. 8
figure 8

Same as Fig. 7, now comparing NNPDF4.0QED to MSHT20 [23] and CT18qed [25]

4.2 Physics processes

We now study the impact of the photon PDF on a few representative processes: Drell–Yan production (neutral- and charged-current), Higgs production in gluon-gluon fusion, in vector boson fusion, and in associated production with weak bosons, diboson production, and top-quark pair production, all at the LHC with center-of-mass energy \(\sqrt{s}=14\) TeV. We have computed theory predictions for these processes exploiting the PineAPPL [32, 33] interface to the automated QCD and EW calculations provided by mg5_aMC@NLO [35]. PineAPPL produces interpolation grids, accurate to NLO in both the strong and electroweak couplings, that are independent of PDFs. They therefore make it easy to vary the input PDF set, since the same grid can be used for all PDF sets considered. As we are interested in assessing differences between PDF sets, rather than in doing precision phenomenology, we do not include NNLO QCD corrections, and we only show PDF uncertainties. For a detailed list of parameters and cuts used in the calculation of these grids, see Sect. 9 of Ref. [26].

For all processes, we display predictions in figures below with a standardized format, as follows. We show the absolute distributions (top panels) and the ratio to the central value obtained using NNPDF4.0QED PDFs and including photon-induced channels, which we call NNPDF4.0QED (bottom panels). In the left plots we compare NNPDF4.0QED to: NNPDF4.0QED but with no photon-initiated channels; NNPDF4.0 pure QCD; NNPDF3.1QED. In the right plots we compare NNPDF4.0QED to MSHT20QED and CT18QED. The left plots allow for assessing the overall size of the QED corrections (by comparison of the QED and pure QCD results), and disentangling the size of the photon-initiated contributions (by comparison of predictions with the photon-initiated channels switched on and off) and the impact of the changes in the quark and gluon PDFs due to QED effects (by comparison of NNPDF4.0 with NNPDF4.0QED with the photon-initiated channels switched off).

Fig. 9
figure 9

Predictions for inclusive Drell–Yan production at the LHC with center-of-mass energy \(\sqrt{s}=14\) TeV, computed at NLO accuracy in the QCD and electroweak couplings. All uncertainties shown are PDF uncertainties only. From top to bottom: neutral-current dilepton production as a function of the dilepton invariant mass \(m_{\ell \bar{\ell }}\); \(W^+\) production as a function of the antilepton pseudo-rapidity \(\eta _{\bar{l}}\); \(W^-\) production as a function of the antilepton pseudo-rapidity \(\eta _{l}\). For each process, we display the absolute distributions (top panels) and the ratio to the central value obtained with NNPDF4.0QED PDFs, including photon-initiated channels (bottom panels). In the left panels the full NNPDF4.0 result is compared to NNPDF4.0 (QCD only), NNPDF3.1QED, NNPDF4.0QED (no photon-initiated); in the right panels it is compared to MSHT20QED [23] and CT18QED [25]. Note that the experimentally measurable quantity is shown, so for dilepton production the t- and u-channel photon-induced contribution is also included

In Fig. 9 we show results for inclusive Drell–Yan production both in the neutral-current and charged-current channels. For neutral current we show results for the invariant mass distribution of the dilepton pair, while for charged-current we display the rapidity distribution of the lepton. Predictions for the Higgs rapidity distribution are shown in Fig. 10 for gluon fusion, associated production with a \(W^+\) boson, and vector boson fusion. Finally, in Fig. 11 we show predictions for weak-boson pair production (\(W^+W^-\) and \(W^+Z\)), as a function of the dilepton transverse momentum, and for top-quark pair production as a function of the invariant mass of the top quark pair.

For charged-current Drell–Yan, the QED corrections have essentially no effect. In all the other cases that we consider, the effects of the QED correction fall in one of two categories. Either upon inclusion of QED effects we see an enhancement of the cross-section, which is only present when the photon-induced contribution is included, while the NNPDF4.0QED result without photon-induced contribution is very close to the pure QCD result. Or else we see a suppression of the cross-section, but with the NNPDF4.0QED result with and without the photon-induced contribution very close to each other. Of course, the former case can be explained with the presence of a sizable positive photon-induced contribution, while the latter case is explained by the suppression of the gluon luminosity due to the transfer of momentum fraction from the gluon to the photon seen in Fig. 7.

The first situation – enhancement due to the photon–induced contribution – is observed in neutral-current dilepton production, where the enhancement increases with invariant mass and can reach up to 5% at the TeV scale. Note that the photon-induced contribution to the dilepton final state can proceed also through t- and u-channel leading-order diagrams. The fact that the QED enhancement is absent at the Z peak suggests that this non-resonant contribution provides the dominant part of the photon-induced contribution. A similar situation occurs in \(W^+W^-\) and \(ZW^+\), where the enhancement increases with \(p_T\) and reaches 5% in the former case and 2% in the latter case for transverse momenta in the TeV range. Finally, the enhancement is also observed in associate Higgs production with \(W^+\) and in vector boson fusion. In the former case the enhancement is largest at forward rapidity, where it reaches 4%, and it decreases to 2% for the largest rapidity \(y_H=2.5\). In the latter case the enhancement is very moderate, around 1%, and almost independent of the rapidity (though slightly decreasing as the rapidity increases).

The second situation – suppression, independent of the photon-induced contribution – is found in processes that proceed through gluon fusion. The effect is clearly seen in Higgs production in gluon fusion, where the suppression is weakly dependent on rapidity, varying between 2% at central rapidity and about 1% at the largest rapidity. A similar, but more moderate effect, is also seen in top pair production, where the suppression is of order 1%, essentially independent of the invariant mass of the top pair.

When comparing results obtained using different QED PDF sets, be they NNPDF3.1QED, MSHT20QED or CT18QED, we observe that differences are essentially driven by the difference in quark and gluon PDFs. This is a direct consequence of the similarity of the photon PDF in all sets. Indeed, all comparisons are quite similar to those shown in Sect. 9.3 of Ref. [26], where the same processes were studied in pure QCD.

All in all, we conclude that the inclusion of QED corrections is important for precision phenomenology at the percent level, even in cases in which the photon-induced contribution is negligible, such as Higgs production in gluon fusion. Here neglecting the indirect effect of including the photon PDF results in an overestimation of the peak cross-section (and thus the total cross-section) by about 2%, thus biasing the prediction by an amount that is of the same order as the PDF uncertainty, but not included in it.

Fig. 10
figure 10

Same as Fig. 9 but for the rapidity distribution of the Higgs for production in gluon-gluon fusion (top panel), in association with a \(W^+\) boson (middle panel) and in vector-boson fusion (bottom panel)

Fig. 11
figure 11

Same as Fig. 9 but for the dilepton transverse momentum distribution for weak boson pair production (\(W^+W^-\) and \(W^+Z\)) and for the invariant mass distribution for top-quark pair production. No acceptance cuts on the decay productions of the WZ bosons and top quark have been imposed

5 Summary and outlook

We have presented a new determination of QED PDFs based on the NNPDF4.0 set of parton distributions, using the methodology previously adopted for the construction of the NNPDF3.1QED PDFs [21]. This methodology implements the LuxQED procedure [17, 18] to determine the photon PDF. Results are consistent with previous studies: specifically, we find that QED effects have a small but non-negligible impact mostly on the gluon PDF, and that photon-initiated contributions are most important for high-mass process such as neutral-current Drell–Yan production. This PDF determination is based on a new NNPDF pipeline for producing theory predictions [34]. Thanks to the integration of QED evolution in this pipeline, and in particular thanks to the use of the EKO evolution code [30], the production of QED variants of NNPDF determinations is now essentially automated, and will become the default in future releases. Indeed, there is in general no reason to switch off the photon PDF, even if it is a small correction, nor to neglect its effect on the momentum fraction carried by the gluon.

The NNPDF4.0QED PDF sets are made publicly available via the LHAPDF6 interface,

http://lhapdf.hepforge.org/ .

All sets are delivered as sets of \(N_{\textrm{rep}}=100\) Monte Carlo replicas. Specifically, we provide the NLO and NNLO global fits constructed with the settings defined in Sect. 2 and denoted as

NNPDF40_nlo_as_01180_qed

NNPDF40_nnlo_as_01180_qed These sets are also made available via the NNPDF collaboration website

https://nnpdf.mi.infn.it/nnpdf4-0-qed/ .

They should be considered the QED PDF counterparts of the published NNPDF4.0 QCD-only PDF sets [26].

We also make available the set of 100 NNPDF4.0 (QCD only) replicas used to produce the comparisons shown in Sect. 3. These differ from the published NNPDF4.0 PDFs because they have been produced using the new theory pipeline, and include some minor bug corrections in the implementation of the NNPDF4.0 dataset. These are called:

NNPDF40_nlo_as_01180_qcd

NNPDF40_nnlo_as_01180_qcd.

The equivalence of these replicas to the published NNPDF4.0 replicas is demonstrated in Appendix A. They are made available for completeness, on the NNPDF website only.

The NNPDF4.0QED determination is part of a family of developments based on the NNPDF4.0 PDF set, and aimed at increasing its accuracy. These will also include a determination of the theory uncertainty on NNPDF4.0 PDFs based on the methodology of Refs. [48, 49], and a first PDF determination based on NNPDF methodology at approximate N\(^3\)LO [50]. All of these, as well as their combination, will become part of the default NNPDF methodology in future releases.

A natural development of QED PDFs is the full inclusion of electroweak corrections in theory predictions used for PDF determination, which enables a widening of both the set of processes and the kinematic range that may be used for PDF determination, which are currently constrained by the need of ensuring that electroweak corrections are small. This is especially important as electroweak effects become more relevant in regions of phase space sensitive to the large-x PDFs, which are in turn relevant for new physics searches [51, 52]. This development will be greatly facilitated by the availability, within the new pipeline, of the PineAPPL interface to automated Monte Carlo generators, such as mg5_aMC@NLO. All these developments will help achieve PDF determination with percent or sub-percent accuracy.