1 Introduction

The Large Hadron Collider (LHC) has completed data taking for Run II. While a lot of collected data still awaits to be analyzed, no physics beyond the standard model (BSM) has been announced until now. The lack of direct indications for the presence of new physics (NP) makes indirect searches more interesting. The High Luminosity LHC (HL-LHC) upgrade will eventually collect an integrated luminosity of 3 ab\(^{-1}\) of data in pp collisions at a center-of-mass (c.o.m.) energy of 14 TeV, which should maximize the LHC potential to uncover new phenomena. It may however well be that the NP degrees of freedom are at higher masses making it difficult at the LHC to identify experimentally new particles, or new paradigms. These considerations have been driving, in the last few years, intense activity worldwide to assess the future of collider experiments beyond the HL-LHC. Several proposals and studies have been performed. The prospects of pushing the LHC program further with the LHC tunnel and the whole CERN infrastructure, together with future magnet technology, is an exciting possibility that could push the energy up into an unexplored region with the 27 TeV High Energy LHC (HE-LHC), that could collect an integrated luminosity of 15 ab\(^{-1}\).

Precision measurements provide an important tool to search for heavy BSM dynamics, associated with mass scales beyond the LHC direct energy reach, exploiting the fact that such dynamics can still have an impact on processes at smaller energy, via virtual effects. In this context the well-established framework of effective field theories (EFTs) allows to systematically parameterize BSM effects and elucidate how they modify SM processes. The BSM contributions are effectively parametrized in terms of higher dimension operators \(\mathcal{O}^{(n)}_i\), with some effective couplings \(C^{(n)}_i\) suppressed by appropriate powers of an unknown energy scale \(\Lambda \) at which new physics sets in,

$$\begin{aligned} \mathcal{L}=\mathcal{L}_{SM} +\Sigma _i\frac{C_i^{(6)}}{\Lambda ^2}\mathcal{O}_i^{(6)} +\Sigma _i\frac{C_i^{(8)}}{\Lambda ^4}\mathcal{O}_i^{(8)} +\cdots , \end{aligned}$$
(1.1)

where the superscript \(n=6,8\) indicates the dimensionality of the corresponding operator. Following the usual notation, we introduce a set of Wilson coefficients \(f_i^{(n)}\), defined as

$$\begin{aligned} f_i^{(6)}=\frac{C_i^{(6)}}{\Lambda ^2},\quad f_i^{(8)}=\frac{C_i^{(8)}}{\Lambda ^4},\ldots , \end{aligned}$$
(1.2)

which are free parameters since neither \(C_i^{(n)}\) nor \(\Lambda \) of the full theory are known. Equation (1.1) represents in principle an infinite and model-independent parametrization of possible BSM effects, valid by construction up to the cutoff value \(\Lambda \) in the energy scale of the studied process. The EFT approach is based on the assumption that a limited number of chosen operators contribute to the given process in the studied energy range. In particular, vector boson scattering (VBS) processes are widely recognized as the best laboratory to study the operators which modify only the VVVV quartic couplings. In the SM EFT framework these operators start at dimension-8. The usual ATLAS and CMS procedure in VBS data analyses to date involves testing one such dimension-8 operator at a time. The goal of this paper is to investigate the sensitivity of this approach by identifying for each operator the corresponding reach in the \((f_i,\Lambda )\) plane at the HE-LHC.

In a recent paper [1] the physics potential of the single dimension-8 operator EFT approach has been tested on a hypothetical new physics signal observed in the same-sign WW scattering process at the HL-LHC. The analysis was focused on the “gold-plated” purely leptonic W decay modes:

$$\begin{aligned} pp \rightarrow 2 jets+l^+\nu +l'^+\bar{\nu }^\prime \end{aligned}$$
(1.3)

where l and \(l'\) stand for any combination of electrons and muons. In this process the \(M_{WW}\) invariant mass cannot be reconstructed experimentally on the event-by-event basis, leading to a restricted space in the \((f_i, \Lambda )\) plane (the “EFT triangle”) for which the single-operator EFT description of the data is viable. In this note we extend such investigations to the HE-LHC energy and expected luminosity domain.

The paper is organized as follows. In Sect. 2 we remind the concept of “EFT triangles”. In Sect. 3. we detail our procedure of event simulation and subsequent treatment of generated events. In Sect. 4 we summarize our findings and conclude. Three appendices contain supplementary material. In the first appendix the definitions of dimension-8 operators are recalled. In Appendix B we argue that qualitative features of the full WW scattering process, including off-shell effects, can be inferred from considering the on-shell WW scattering amplitudes and we discuss helicity amplitudes after adding the higher dimension operators. In the third appendix we address the question of what values of BSM couplings can be drawn from the discovery regions.

2 EFT triangles

Since the truncation of the expansion in Eq. (1.1) introduces model dependence, in Ref. [1] the concept of “EFT models” has been introduced where they are defined by the choice of operators \(\mathcal{O}_i^{(n)}\) and the values of Wilson coefficients \(f_i^{(n)}\). The EFT description is valid up to a cutoff energy \(\Lambda \) at which new states are expected to appear; the cutoff value is unknown a priori. However, in the presence of higher dimension operators the scattering amplitudes grow with energy and eventually break the perturbative unitarity limit \(M^U\). The condition \(\Lambda < M^U(f_i)\) defines the upper bound on the range of possible values of \(\Lambda \) as a function of \(f_i\).

A specific feature of the process in Eq. (1.3) is that the scale \(M\equiv M_{WW}\), i.e., the invariant mass of the scattered WW bosons, is not experimentally accessible, making it impossible to properly apply the cutoff \(\Lambda \) on the data. Any BSM signal, S, is defined as the deviation from the SM prediction in the differential distributions \(d\sigma /dx_i\) of some observable \(x_i\).

$$\begin{aligned} \frac{dS}{dx_i}=\left( \frac{d\sigma }{dx_i}\right) ^{BSM} -\left( \frac{d\sigma }{dx_i}\right) ^{SM}. \end{aligned}$$
(2.1)

Any collected data sample will in general be a sum of the contributions from \(M < \Lambda \) and \(M > \Lambda \) (unless \(\Lambda \) happens to be out of kinematic reach). To cope with events with \(M > \Lambda \), different solutions have been advocated in the literature, e.g.: from discarding these events at the level of simulation, to invoking unitarization procedures (usually assuming \(\Lambda = M^U\)) [2, 3], to ignoring the cutoff altogether. Any of the above prescriptions is related to additional arbitrariness of choices and therefore affects the physics interpretation of the results. Genuine data interpretation in the EFT language requires its successful description without any additional assumptions as to the nature of BSM physics at the scale above \(\Lambda \). This is only possible if the bulk of the total observed BSM signal originates indeed from the EFT-controlled range.

The EFT-controlled signal reads:

$$\begin{aligned} \left( \frac{d\sigma }{dx_i}\right) ^{EFT}= & {} \int ^\Lambda _{2M_W} \left( \frac{d^2\sigma }{dx_idM}\right) ^{EFT}dM\nonumber \\&+\int _\Lambda ^{M_{max}}\left( \frac{d^2\sigma }{dx_idM}\right) ^{SM} dM, \end{aligned}$$
(2.2)

Here \(M_{max}\) is the kinematic limit of the WW invariant mass. Eq. (2.2) defines signal coming uniquely from the “EFT model” in its range of validity and assumes only the SM contribution in the region \(M>\Lambda \).

The additional contribution from the region above \(\Lambda \) may enhance the signal, but it may also preclude proper description of the data within the EFT. The total BSM signal can be estimated without detailed knowledge of the UV completion from the expected asymptotic behavior for \(M \rightarrow \infty \), i.e., by assuming that all the helicity amplitudes above \(\Lambda \) remain constant at their respective values they reach at \(\Lambda \) (hence superscript \(A=const\)):

$$\begin{aligned} \left( \frac{d\sigma }{dx_i}\right) ^{BSM}= & {} \int ^\Lambda _{2M_W} \left( \frac{d^2\sigma }{dx_idM}\right) ^{EFT}dM\nonumber \\&+\int _\Lambda ^{M_{max}}\left( \frac{d^2\sigma }{dx_idM}\right) ^{A=const} dM, \end{aligned}$$
(2.3)

For every value of \(\Lambda \), BSM observability imposes some minimum value of \(f_i\) for which the total BSM signal defined by Eq. (2.3) has enough statistical significance, 5\(\sigma \) in our example. Successful description in the EFT framework imposes some maximum value of \(f_i\) such that signal estimates computed from Eqs. (2.2) and (2.3) remain statistically consistent, e.g., within 2\(\sigma \). The “EFT triangle” is the region in the \((f_i, \Lambda )\) plane for which a statistically significant BSM signal can be successfully described with a chosen higher dimension operator \(\mathcal{O}_i\). It is bounded from three sides:

  • from above by the unitarity limit \(M^U(f_i)\),

  • from the left by the signal significance of 5\(\sigma \), computed according to Eq. (2.3),

  • and from the right by the consistency within 2\(\sigma \) with Eq. (2.2).

In the HL-LHC case, for all the individual dimension-8 operators that affect the WWWW quartic coupling such triangles were found to be rather narrow or even entirely empty (for \(\mathcal{O}_{S1}\)) [1]. In this paper we extend this analysis to the HE-LHC case, in an attempt to verify if an increased beam energy and integrated luminosity will translate into larger EFT triangles.

Fig. 1
figure 1

Typical examples of kinematic distributions used for the assessment of BSM signal significances. Shown are the distributions of \(M_{ll}\), \(M_{o1}\) and \(R_{pT}\) (in log scale): in the Standard Model (black), with \(f_{M1}=0.2\) TeV\(^{-4}\) and the high-\(M_{WW}\) tail treatment according to Eq. (2.3) (red), and with \(f_{M1}=0.2\) TeV\(^{-4}\) and the high-\(M_{WW}\) tail treatment according to Eq. (2.2) (green). In addition the lower-right plot shows the distribution in the invariant mass of the WW system, \(M_{WW}\) with \(f_{M1}=0.2\) TeV\(^{-4}\). In all the plots the scale \(\Lambda \) was chosen as: \(\Lambda = M^U = 4.9\) TeV. Assumed is \(\sqrt{s} = 27 \) TeV and an integrated luminosity of 3 ab\(^{-1}\)

Fig. 2
figure 2

Regions in the \(\Lambda \) vs f (positive f values) space for dimension-8 operators in which a \(5\sigma \) BSM signal can be observed and the EFT is applicable. The unitarity limit is shown in blue; the lower limits for a \(5\sigma \) signal significance from Eq. (2.3) (red) and the upper limit on \(2\sigma \) EFT consistency (black). The solid (dotted) lines correspond to \(\sqrt{s} = 27\ (14)\) TeV. Assumed is the integrated luminosity of 3 ab\(^{-1}\)

Fig. 3
figure 3

Regions in the \(\Lambda \) vs f (negative f values) space for dimension-8 operators in which a \(5\sigma \) BSM signal can be observed and the EFT is applicable. The unitarity limit is shown in blue; the lower limits for a \(5\sigma \) signal significance from Eq. (2.3) (red) and the upper limit on \(2\sigma \) EFT consistency (black). The solid (dotted) lines correspond to \(\sqrt{s} = 27\ (14)\) TeV. Assumed is the integrated luminosity of 3 ab\(^{-1}\)

Throughout this work we follow the MadGraph convention for the definition of dimension-8 operators (implemented therein via public UFO files [4]), in which the field strength tensors \(W_{\mu \nu }\equiv \frac{i}{2}g\tau ^i(\partial _\mu W^i_\nu - \partial _\nu W^i_\mu + g\epsilon _{ijk}W^j_\mu W^k_{\nu })\) are replaced with \(\hat{W}_{\mu \nu } \equiv \frac{1}{ig}W_{\mu \nu }\). Such conversion factors are equivalent to absorbing the electroweak coupling constants g, each one explicitly factored out for each occurrence of the field stress tensor, in the effective couplings \(C_i\). For the reader’s convenience, Appendix A lists dimension-8 operators; more details can be found in Ref. [5].

3 Analysis

In this section we present a generator-level study aimed at finding the EFT triangles for the individual \(n=8\) operators at the HE-LHC. Event samples of the process \(pp \rightarrow jj\mu ^+\mu ^+\nu \nu \) at 27 TeV were generated for each \(n=8\) operator \(\mathcal{O}_i\) that modifies the WWWW quartic coupling, \(i = S0, S1\) (so called scalar operators), T0, T1, T2 (transverse), and M0, M1, M6, M7 (mixedFootnote 1 ones). Generation has been done at LO using MadGraph5_aMC@NLO v5.2.6.2 generator [6], with the appropriate UFO files containing additional vertices involving the desired \(n=8\) operators. A scan of \(f_i\) values for each operator was made using the MadGraph reweight command, including \(f_i=0\) to represent the SM case. The Pythia package v6.4.1.9 [7] was used for hadronization as well as initial and final state radiation processes. Unitarity limits were determined using the VBFNLO [8, 9] calculator v1.4.0, after applying appropriate Wilson coefficients conversion factors. Cross sections at the output of MadGraph were multiplied by a factor 4 to account for all the lepton (electron and/or muon) combinations in the final state (in this work only positively charged leptons are taken into account, although the same analysis can be done for the negative charges). Only signal samples were generated and the SM case was treated as irreducible background in the study of possible BSM effects.

The applied analysis chain to the events generated for HE-LHC is a carbon copy of the one described in detail in Ref. [1], here we only briefly outline the main points, with special emphasis on the differences. Standard VBS selection criteria were applied, namely we require at least two reconstructed jets and exactly two leptons (muons or electrons) satisfying the following conditions: \(M_{jj}>\) 500 GeV, \(\Delta \eta _{jj}>\) 2.5, \(p_T^{~j}>\) 30 GeV, \(|\eta _j|<\) 5, \(p_T^{~l}>\)25 GeV and \(|\eta _l|<\) 2.5. Although we have not optimized our selection criteria for 27 TeV, one can safely suppose they will not be much different. A detailed optimization will be done with full knowledge of the future detector geometry. Like before, the total BSM signal is estimated according to Eq. (2.3) by suppressing the high-mass tail above the calculated value of \(\Lambda \). This is achieved by applying an additional weight of the form \((\Lambda /M_{WW})^4\) to each generated event in this region. The EFT-controlled signal is calculated according to Eq. (2.2) by replacing the generated high-mass tail with the one expected in the SM (known as “clipping method”). Signal significances are computed as the square root of a \(\chi ^2\) resulting from a bin-by-bin comparison of the event yields, with statistical errors such as expected from the data at 3 ab\(^{-1}\), in the distribution of the most sensitive kinematic variables. Compared to the 14 TeV analysis, the binning of histograms was changed so that in the highest bin the SM prediction normalized to 3/ab is still between 2 and 3 events. In Fig. 1 as an example shown are distributions of four chosen variables:

  • invariant mass of two leptons \(M_{ll}\),

  • ratio of transverse momenta of leptons and jets \(R_{p_T} \equiv p_T^{~l1}p_T^{~l2}/(p_T^{~j1}p_T^{~j2})\),

  • \(M_{o1} \equiv \sqrt{(|\mathbf {p}_T^{~l1}|{+}|\mathbf {p}_T^{~l2}| {+}|\mathbf {p}_T^{~miss}|)^2 {-} (\mathbf {p}_T^{~l1}{+}\mathbf {p}_T^{~l2}{+}\mathbf {p}_T^{~miss})^2}\)

  • and the (true) invariant mass \(M_{WW}\)

in the SM and in the case \(f_{M1} = 0.2\) TeV\(^{-4}\) and \(\Lambda =\) 4.9 TeV. As it was for 14 TeV, we found \(R_{p_T} \) to be the most sensitive kinematic variable for \(\mathcal{O}_{S0}\) and \(\mathcal{O}_{S1}\), and \(M_{o1} \) for the remaining operators.

Fig. 4
figure 4

Regions in the \(\Lambda \) vs f (negative f values) space for M1 operator in which a \(5\sigma \) BSM signal can be observed and the EFT is applicable. The unitarity limit is shown in blue; the lower limits for a \(5\sigma \) signal significance from Eq. (2.3) (red) and the upper limit on \(2\sigma \) EFT consistency (black). The solid (dotted) lines correspond to 15 ab\(^{-1}\) (3 ab\(^{-1}\) ). Assumed is \(\sqrt{s} = 27\) TeV

As the unitarity limit we take always the lower of the two values between on-shell \(W^+W^+\) and on-shell \(W^+W^-\) scattering, calculated from T-matrix diagonalization in the helicity space. Indeed, both processes probe the same quartic coupling and are governed by the same Wilson coefficients, as further explained in Appendix B.3.

Figure 2 shows the results for the individual operators \(S0,\, S1,\, T0,\, T1,\, T2,\, M0,\, M1\) and M7, in comparison with results at 14 TeV (for positive f values). Not unexpectedly, all the triangles are shifted to lower f values compared to 14 TeV, the shift being as large as almost an order of magnitude. However, the total area of the triangles does not get significantly larger as we increase the energy. This is because the EFT consistency criterion pushes the effective upper limits on f in a similar manner as does the BSM observability criterion for the lower limits. Overall, the shapes and sizes of all the EFT triangles are remarkably similar for 27 TeV as for 14 TeV, only their respective positions differ.

Figure 3 shows the respective results for negative f values of \(S0,\, S1,\, T0,\, T1,\, T2,\, M1\) and M7. Here exactly the same observations can be made again. The negative f values of M0 look virtually identical to their positive counterparts, since for these operators the SM-BSM interference term in the total amplitude calculation is practically negligible (see the Appendix for details), and so we do not show them here. There is no triangle at all for S1, for which the overall lower limit for BSM observability is about 1.2 TeV\(^{-4}\) and the upper limit for EFT consistency is 1.4  TeV\(^{-4}\). Here as well we observe a similar behavior as for 14 TeV.

Both analyses, at 14 and 27 TeV, were done at the generator level. Reducible backgrounds were not simulated, since they are known to be strongly detector dependent (for the different compositions of reducible background as measured at 13 TeV by CMS and ATLAS, see Figs. 2-a and 2-b in Ref. [10] and Fig. 3 in Ref. [11], respectively, where it has been found that ’non-prompt’ leptons (from hadron decay and fakes) is the largest reducible background and WZ is small in CMS, while in ATLAS WZ is the largest background and ’non-prompt’ is small). Such simulations have to be carried by each experiment for its specific detector performances. For a realistic estimate of the sensitivity limits to new physics effects the reader is referred to literature, e.g., Ref. [12]. Full detector simulation will move the EFT triangles to larger Wilson coefficients, but it cannot make them larger, hence all our conclusions still hold.

For the sake of a convenient comparison between the respective results at two different pp beam energies, in the bulk of this study we have always assumed the same integrated luminosity of 3 ab\(^{-1}\) for both cases. This number is appropriate for the HL-LHC stage, but underestimates the expected statistical power of the HE-LHC. However, it is trivial to recalculate all the results to 15/ab in order to get the true expected discovery reach of the HE-LHC, taking into account its actual expected luminosity. An increase of statistics by a factor 5 will lead to a further shift of all the EFT triangles by a factor close to \(\sqrt{5}\), both in the 5\(\sigma \) discovery and the 2\(\sigma \) consistency curves (in fact, somewhat less than that because of non-linear dependence of the BSM signal on the value of the individual Wilson coefficients). It will not significantly change either the shape nor the size of triangles. A comparison of results calculated for the same pp beam energy of 27 TeV and two different integrated luminosities is exemplified for the M1 operator in Fig. 4.

Our simple procedure to suppress the high-mass tails by applying a \((\Lambda /M_{WW})^4\) weight to events generated above the scale of \(\Lambda \) works reasonably well in the vicinity of the unitarity limit. In this region it produces a tail falling approximately like \(1/M_{WW}^2\), which is the expected asymptotic (i.e., for \(M_{WW}>>\Lambda \)) behavior of the total cross section after regularization. It nonetheless becomes too strong as we go to \(\Lambda<< M^U\), where the total cross section is still dominated by the SM contribution which does not require any further suppression. Moreover, for low values of \(\Lambda \) the tail itself becomes large, leading to large uncertainties due to the details of its modeling. We have discontinued the curves in Figs. 2 and 3 below the values at which we find the method lead to the unphysical result of signal being suppressed below the SM level itself. For this reason the EFT triangles for T0, T1 and T2 do not close. For the remaining cases, however, they are completely contained in the region where our simple method is still viable.

According to Ref. [13], full NLO (EW + QCD) corrections to the SM same-sign WW scattering process at 13 TeV lower the total cross section within a fiducial volume defined by standard VBS cuts by 17%; the relative effect increases in the high invariant mass region. Corresponding corrections at 27 TeV were reported in [14]. They are relatively a few percent larger than in the former case. Therefore, accounting for higher-order corrections is well motivated in further more dedicated studies.

Positivity constraints applied to VBS, derived e.g. in Ref. [15], require that certain linear combinations of Wilson coefficients of dimension-8 operators to be positive definite. Since we choose one operator at a time, the bounds would determine what signs of the Wilson coefficients should be considered for each operator. Therefore accounting for these bounds using our results is straightforward.

4 Conclusion and outlook

Although an increase of the LHC energy vastly improves the sensitivity to new physics effects in VBS processes, the question of EFT applicability is a different one and cannot be solved by changing the energy. The same-sign WW process with its purely leptonic W decays is often considered “gold-plated” due to its relatively good signal to background ratio, but the lack of experimental access to the WW invariant mass poses a severe problem in describing the data in terms of the EFT. Despite reasonable sensitivity to BSM effects, such effects if observed will most likely not be possible to interpret using the data from this process alone and applying the usual framework of testing one dimension-8 operator at a time. This conclusion holds regardless of the actual proton beam energy.

The present results reinforce the former conclusion that future VBS data analysis, both at the LHC experiments as well as future proton-proton colliders, should evolve in the direction of multidimensional fits with many higher dimension operators varied at a time. This in turn may require global simultaneous fits to many processes (including \( WZ,\, ZZ\) and semi-leptonic WV, if not other processes) to help disentangle the correlations between signals originating from different operators. Independently of the goal and results of this paper, one always has to remember that the approach of varying one operator at-a-time is not the most appropriate in any case – such a procedure breaks the model independence of the EFT description and therefore EFT requires ultimately a global approach.

Helpful in disentangling the effects of different operators may be also the polarizations of the outgoing W bosons, as different operator subsets, S, T and M, affect different polarizations. There are new theoretical ideas how to project the total VBS cross sections onto individual polarizations without invoking the rather crude W on-shell approximation [16]. WW polarizations can be extracted from the data by fitting simulated templates of the corresponding polarized distributions. Unfortunately, purely leptonic WW decays do not offer the possibility to reconstruct the W decay angle, which is the only strictly model-independent signature of W polarization. While many other distributions exhibit qualitative differences between the different polarizations, they are usually also strongly model-dependent. Consequently, SM templates cannot be used in the BSM case without the risk of losing sensitivity to the BSM signal. If, however, a set of observables is identified for which sufficiently model-independent templates for \(W_LW_L\), \(W_TW_T\) and \(W_TW_L\) can be constructed, it could vastly improve the perspectives of future VBS data analysis in the framework of the EFT.