1 Introduction

Here I present a few personal recollections and observations on what is necessary in order to obtain the most accurate theoretical predictions outside the Higgs-like resonance region, given the present level of theoretical knowledge.

Somebody had an idea, somebody else gave it wings, a third group did the cut-and-count, and a fourth did a shape-based analysis.Footnote 1 Ideas are like rabbits. You get a couple, learn how to handle them, and pretty soon you have a dozen.

In Ref. [1] the off-shell production cross section has been shown to be sizeable at high \({\mathrm{Z}}{\mathrm{Z}}\,\)-invariant mass in the gluon fusion production mode, with a ratio relative to the on-peak cross section of the order of \(8\%\) at a center-of-mass energy of \(8\) TeV. This ratio can be enhanced up to about \(20\%\) when a kinematical selection used to extract the signal in the resonant region is taken into account [2]. This arises from the vicinity of the on-shell \({\mathrm{Z}}\) pair production threshold, and is further enhanced at the on-shell top pair production threshold.

In Ref. [3] the authors demonstrated that, with fewer assumptions and using events with pairs of \({\mathrm{Z}}\) particles, the high invariant mass tail can be used to constrain the Higgs width.

This note provides a more detailed description of the theoretical uncertainty associated with the camel-shaped and square-root–shaped tails of a light Higgs boson.

The outline of the paper is as follows: old and new ideas on measuring the Higgs boson intrinsic width are presented in Sect. 2, off-shell effects are discussed in Sect. 3, inclusion of the interference is analyzed in Sect. 3.4 with the introduction of different options for the corresponding theoretical uncertainty. Historical remarks are given in Sect. 4; in Sect. 4.2 improvements are introduced and critically analyzed.

2 An old idea

The problem of determining resonance parameters in e\(^{+}\)e\(^{-}\) annihilation, including initial state radiative corrections and resolution corrections is an old one, see Ref. [4]. For the interested reader we recommend the original Refs. [4, 5] or the summary in Chap. 2 of Ref. [6].

2.1 Higgs intrinsic width

Is there anything we can say about what the intrinsic width of the light resonance is like? Ideas pass through three periods:

  • It can’t be done.

  • It probably can be done, but it’s not worth doing.

  • I knew it was a good idea all along!

From the depths of my memory \(\ldots \)

Remark

✘ It can’t be done: at LHC we reconstruct the invariant mass of the Higgs decay products, “easy” in case of \(\upgamma \upgamma \) or \(4\) charged lepton final states. The mass resolution has a Gaussian core but non-Gaussian tails (e.g., due to calorimeter segmentation but also pile-up effects etc.). The accuracy in the mean of the mass peak can then approach that \(1.\%\) precision. Thus it could perhaps compare with the \({\mathrm{W}}\,\)-mass extraction at LEP, based on some measured invariant mass distribution. Experimentalists would let the detector event simulation program do the folding of the theoretical invariant mass distribution, hoping that the MC catches most of the Gaussian and non-Gaussian resolution effects with the remainder being put into the systematic uncertainty. However, this would affect the width much more than the mass (mean of the distribution).

Remark

✘ It’s not worth doing. For the width of the Higgs things are thus much more difficult: For \(M_{{\mathrm{H}}}< 180\,GeV\) detector resolution dominates, so experimentally it will be very tough.

Let’s review what we have learned in the meantime, highlighting new steps for Higgs precision physics:

  • complete off-shell treatment of the Higgs signal

  • signal-background interference

  • residual theoretical uncertainty

3 The wrath of the “heavy” Higgs

You didn’t want me to be real, I will contaminate your data, come and see if ye can swerve me

Let’s see how this develops.

3.1 Higgs boson production and decay: the analytic structure

Remark

✔ I knew it was a good idea all along!

Before giving an unbiased description of production and decay of an Higgs boson we underline the general structure of any process containing a Higgs boson intermediate state. The corresponding amplitude is schematically given by

$$\begin{aligned} A(s) = \frac{f(s)}{s - {s_{{\mathrm{H}}}}} + N(s), \end{aligned}$$
(1)

where \(N(s)\) denotes the part of the amplitude which is non-Higgs-resonant. Strictly speaking, signal (\(\mathrm {S}\)) and background (\(\mathrm {B}\)) should be defined as follows:

$$\begin{aligned}&A(s) = S(s) + B(s), \qquad S(s)= \frac{f({s_{{\mathrm{H}}}})}{s - {s_{{\mathrm{H}}}}},\nonumber \\&\quad B(s)= \frac{f(s) - f({s_{{\mathrm{H}}}})}{s - {s_{{\mathrm{H}}}}} + N(s) \end{aligned}$$
(2)

Definition

The Higgs complex pole (describing an unstable particle) is conventionally parametrized as

$$\begin{aligned} {s_{{\mathrm{H}}}}= \mu ^2_{{\mathrm{H}}} - i\,\mu _{{\mathrm{H}}}\,\gamma _{{\mathrm{H}}} \end{aligned}$$
(3)

As a first step we will show how to write \(f(s)\) in a way such that pseudo-observables make their appearance [7, 8]. Consider the process \(i j \rightarrow {\mathrm{H}}\rightarrow {\mathrm{F}}\) where \(i,j\,\in \,\)partons and \({\mathrm{F}}\) is a generic final state; the complete cross-section will be written as follows:

$$\begin{aligned} \sigma _{i j \rightarrow {\mathrm{H}}\rightarrow {\mathrm{F}}}(s)&= \frac{1}{2\,s}\,\int \,d\Phi _{i j \rightarrow {\mathrm{F}}}\, \left[ \, \sum _{s,c} \Bigr |A_{i j \rightarrow {\mathrm{H}}} \Bigr |^2\, \right] \nonumber \\&\times \frac{1}{\Bigr |s - {s_{{\mathrm{H}}}}\Bigr |^2}\left[ \, \sum _{s,c}\Bigr |A_{{\mathrm{H}}\rightarrow {\mathrm{F}}} \Bigr |^2\, \right] \end{aligned}$$
(4)

where \(\sum _{s,c}\) is over spin and colors (averaging on the initial state). Note that the background (e.g. \({\mathrm{g}}{\mathrm{g}}\rightarrow 4\,{\mathrm{f}}\)) has not been included and, strictly speaking and for reasons of gauge invariance, one should consider only the residue of the Higgs-resonant amplitude at the complex pole, as described in Eq. (2). For gauge invariance the rule of thumb can be formulated by looking at Eq. (1): the only gauge invariant quantities are the location of the complex pole, its residue and the non-resonant part of the amplitude [\(B(s)\) of Eq. (2)]. For the moment we will argue that the dominant corrections are the QCD ones where we have no problem of gauge parameter dependence. If we decide to keep the Higgs boson off-shell also in the resonant part of the amplitude (interference signal/background remains unaddressed) then we can write

$$\begin{aligned} \int \,d\Phi _{i j \rightarrow {\mathrm{H}}}\,\sum _{s,c} \Bigr |A_{i j \rightarrow {\mathrm{H}}} \Bigr |^2 = s\,{\overline{A}}_{ij}(s). \end{aligned}$$
(5)

For instance, we have

$$\begin{aligned} {\overline{A}}_{{\mathrm{g}}{\mathrm{g}}}(s) = \frac{\alpha _{\mathrm {s}}^2}{\pi ^2}\,\frac{G_{{\mathrm {F}}}\,s}{288\,\sqrt{2}}\, \Bigr |\sum _q\,f\left( \tau _q\right) \Bigr |^2\,\left( 1 + \delta _{{\mathrm {QCD}}}\right) , \end{aligned}$$
(6)

where \(\tau _q = 4\,m^2_q/s\), \(f(\tau _q)\) is defined in Eq. (3) of Ref. [9] and where \(\delta _{{\mathrm {QCD}}}\) gives the QCD corrections to \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}\) up to next-to-next-to-leading-order (NNLO) + next-to-leading logarithms (NLL) resummation. Furthermore, we define

$$\begin{aligned} \Gamma _{{\mathrm{H}}\rightarrow {\mathrm{F}}}(s) = \frac{1}{2\,\sqrt{s}}\,\int \,d\Phi _{{\mathrm{H}}\rightarrow {\mathrm{F}}}\,\sum _{s,c} \Bigr |A_{{\mathrm{H}}\rightarrow {\mathrm{F}}} \Bigr |^2 \end{aligned}$$
(7)

which gives the partial decay width of a Higgs boson of virtuality \(s\) into a final state \({\mathrm{F}}\).

$$\begin{aligned} \sigma _{i j \rightarrow {\mathrm{H}}}(s) = \frac{{\overline{A}}_{ij}(s)}{s} \end{aligned}$$
(8)

which gives the production cross-section of a Higgs boson of virtuality \(s\). We can write the final result in terms of pseudo-observables

Proposition 3.1

The familiar concept of on-shell production \(\otimes \) branching ratio can be generalized to

$$\begin{aligned} \sigma _{i j \rightarrow {\mathrm{H}}\rightarrow {\mathrm{F}}}(s) = \frac{1}{\pi }\, \sigma _{i j \rightarrow {\mathrm{H}}}(s)\,\frac{s^2}{\vert s - {s_{{\mathrm{H}}}}\vert ^2}\, \frac{\Gamma _{{\mathrm{H}}\rightarrow {\mathrm{F}}}(s)}{\sqrt{s}} \end{aligned}$$
(9)

It is also convenient to rewrite the result as

$$\begin{aligned} \sigma _{i j \rightarrow {\mathrm{H}}\rightarrow {\mathrm{F}}}(s) = \frac{1}{\pi }\, \sigma _{i j \rightarrow {\mathrm{H}}}\,\frac{s^2}{\vert s \!-\! {s_{{\mathrm{H}}}}\vert ^2}\, \frac{\Gamma ^{{\mathrm{tot}}}_{{\mathrm{H}}}}{\sqrt{s}}\,\mathrm {BR}\left( {\mathrm{H}}\rightarrow {\mathrm{F}}\right) \end{aligned}$$
(10)

where we have introduced a sum over all final states,

$$\begin{aligned} \Gamma ^{{\mathrm{tot}}}_{{\mathrm{H}}} = \sum _{{\mathrm{f}}\in {\mathrm{F}}}\,\Gamma _{{\mathrm{H}}\rightarrow {\mathrm{f}}} \end{aligned}$$
(11)

Note that we have written the phase-space integral for \(i(p_1) + j(p_2) \rightarrow {\mathrm{F}}\) as

$$\begin{aligned} \int \,d\Phi _{i j \rightarrow {\mathrm{F}}}&= \int \,d^4k\,\delta ^4(k - p_1 - p_2)\nonumber \\&\times \int \,\prod _f\,d^4p_f\,\delta ^+(p^2_f)\,\delta ^4\left( k - \sum _f p_f\right) \nonumber \\ \end{aligned}$$
(12)

where we assume that all initial and final states (e.g. \(\upgamma \upgamma \), \(4\,{\mathrm{f}}\), etc.) are massless.

Why do we need pseudo-observables? Ideally experimenters (should) extract so-called realistic observables from raw data, e.g. \(\sigma \left( {\mathrm{p}}{\mathrm{p}}\rightarrow \upgamma \upgamma +\, {\mathrm{X}}\right) \) and (should) present results in a form that can be useful for comparing them with theoretical predictions, i.e. the results should be transformed into pseudo-observables; during the deconvolution procedure one should also account for the interference background – signal; theorists (should) compute pseudo-observables using the best available technology and satisfying a list of demands from the self-consistency of the underlying theory.

Definition

We define an off-shell production cross-section (for all channels) as follows:

$$\begin{aligned} \sigma ^{\mathrm{prop}}_{i j \rightarrow {\mathrm{all}}} = \frac{1}{\pi }\,\sigma _{i j \rightarrow {\mathrm{H}}}\,\frac{s^2}{\vert s - {s_{{\mathrm{H}}}}\vert ^2}\, \frac{\Gamma ^{{\mathrm{tot}}}_{{\mathrm{H}}}}{\sqrt{s}} \end{aligned}$$
(13)

When the cross-section \(i j \rightarrow {\mathrm{H}}\) refers to an off-shell Higgs boson the choice of the QCD scales should be made according to the virtuality and not to a fixed value. Therefore, for the PDFs and \(\sigma _{i j \rightarrow {\mathrm{H}}+ {\mathrm{X}}}\) one should select \({\mu ^{2}_{{\mathrm {F}}}}= {\mu ^{2}_{{\mathrm {R}}}}= z\,s/4\) (\(z\,s\) being the invariant mass of the detectable final state). Indeed, beyond lowest order (LO) one must not choose the invariant mass of the incoming partons for the renormalization and factorization scales, with the factor \(1/2\) motivated by an improved convergence of fixed order expansion, but an infrared safe quantity fixed from the detectable final state, see Ref. [10]. The argument is based on minimization of the universal logarithms (DGLAP) and not the process-dependent ones.

3.2 More on production cross-section

We give the complete definition of the production cross-section; let us define \(\upzeta = z\,s, \upkappa = v\,s\), and write

Definition

 \(\sigma ^{{\mathrm{prod}}}\) is defined by the following equation:

$$\begin{aligned} \sigma ^{{\mathrm{prod}}}&= \sum _{i,j}\,\int \hbox {PDF}\,\otimes \,\sigma ^{{\mathrm{prod}}}_{i j \rightarrow {\mathrm{all}}}\nonumber \\&= \sum _{i,j}\,\int \limits _{z_0}^1 dz \int \limits _z^1\frac{dv}{v}\,{\fancyscript{L}}_{ij}(v) \sigma ^{\mathrm{prop}}_{ij \rightarrow {\mathrm{all}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}}) \end{aligned}$$
(14)

where \(z_0\) is a lower bound on the invariant mass of the \({\mathrm{H}}\) decay products, the luminosity is defined by

$$\begin{aligned} {\fancyscript{L}}_{ij}(v) = \int \limits _v^1 \frac{dx}{x}\, f_i\left( x,{\mu _{{\mathrm {F}}}}\right) \,f_j\left( \frac{v}{x},{\mu _{{\mathrm {F}}}}\right) \end{aligned}$$
(15)

where \(f_i\) is a parton distribution function and

$$\begin{aligned} \sigma ^{\mathrm{prop}}_{ij \rightarrow {\mathrm{all}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}})&= \frac{1}{\pi }\, \sigma _{ij \rightarrow {\mathrm{H}}+ {\mathrm{X}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}})\nonumber \\&\times \frac{ \upzeta \,\upkappa }{\vert \upzeta - {s_{{\mathrm{H}}}}\vert ^2}\frac{\Gamma ^{{\mathrm{tot}}}_{{\mathrm{H}}}(\upzeta )}{\sqrt{\upzeta }} \end{aligned}$$
(16)

Therefore, \(\sigma _{ij \rightarrow {\mathrm{H}}+ {\mathrm{X}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}})\) is the cross section for two partons of invariant mass \(\upkappa \) (\(z \le v \le 1\)) to produce a final state containing a \({\mathrm{H}}\) of virtuality \(\upzeta = z\,s\) plus jets (X); it is made of several terms (see Ref. [9] for a definition of \(\Delta \sigma \)),

$$\begin{aligned}&\sum _{ij}\,\sigma _{ij \rightarrow {\mathrm{H}}+ {\mathrm{X}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}}) = \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}}\,\delta \left( 1 - \frac{z}{v}\right) \nonumber \\&\quad + \frac{s}{\upkappa }\,\left( \Delta \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}{\mathrm{g}}} + \Delta \sigma _{{\mathrm{q}}{\mathrm{g}}\rightarrow {\mathrm{H}}{\mathrm{q}}} + \Delta \sigma _{\bar{\hbox {q}}{\mathrm{q}}\rightarrow {\mathrm{H}}{\mathrm{g}}} + \text{ NNLO }\right) \nonumber \\ \end{aligned}$$
(17)

Remark

As a technical remark the complete phase-space integral for the process \({\hat{p}}_i + {\hat{p}}_j \rightarrow p_k + \{f\}\) (\({\hat{p}}_i = x_i\,p_i\) etc.) is written as

$$\begin{aligned}&\int \,d\Phi _{ij \rightarrow f} = \int \,d\Phi _{{\mathrm{prod}}}\,\int \,d\Phi _{\mathrm{dec}} = \int d^4 p_k\,\delta ^+(p^2_k)\nonumber \\&\qquad \times \prod _{l=1,n} d^4 q_l\,\delta ^+(q^2_l)\, \delta ^4\left( {\hat{p}}_i + {\hat{p}}_j - p_k - \sum _l q_l\right) \nonumber \\&\quad = \int d^4k d^4Q\,\delta ^+(p^2_k)\,\delta ^4 \left( {\hat{p}}_i + {\hat{p}}_j - p_k - Q\right) \nonumber \\&\qquad \times \int \prod _{l=1,n} d^4q_l\,\delta ^+(q^2_l)\,\delta ^4 \left( Q - \sum _l q_l\right) \end{aligned}$$
(18)

where \(\int \,d\Phi _{\mathrm{dec}}\) is the phase-space for the process \(Q \rightarrow \{f\}\) and

$$\begin{aligned}&\int \,d\Phi _{{\mathrm{prod}}} \!=\! s\,\int dz\,\int d^4p_k d^4Q\,\delta ^+(p^2_k)\, \delta \left( Q^2 \!-\! \upzeta \right) \theta (Q_0)\nonumber \\&\qquad \,\times \delta ^4 \left( {\hat{p}}_i + {\hat{p}}_j - p_k - Q\right) \nonumber \\&\quad = s^2\,\int dz dv d{\hat{t}}\,\int d^4p_k d^4Q\,\delta ^+(p^2_k)\, \delta \left( Q^2 - \upzeta \right) \theta (Q_0) \nonumber \\&\qquad \times \, \delta ^4 \left( {\hat{p}}_i + {\hat{p}}_j - p_k - Q\right) \nonumber \\&\qquad \times \,\delta \left( ({\hat{p}}_i + {\hat{p}}_j)^2 - \upkappa \right) \, \delta \left( ({\hat{p}}_i + Q)^2 - {\hat{t}}\right) \end{aligned}$$
(19)

Equations (14) and (16) follow after folding with PDFs of argument \(x_i\) and \(x_j\), after using \(x_i = x\), \(x_j = v/x\) and after integration over \({\hat{t}}\). At NNLO there is an additional parton in the final state and five invariants are need to describe the partonic process, plus the \({\mathrm{H}}\) virtuality. However, one should remember that at NNLO use is made of the effective theory approximation where the Higgs-gluon interaction is described by a local operator.

3.3 An idea that is not dangerous is unworthy of being called an idea at all

Let us consider the case of a light Higgs boson; here, the common belief was that the product of on-shell production cross-section (say in gluon-gluon fusion) and branching ratios reproduces the correct result to great accuracy. The expectation is based on the well-known result [11] (\(\Gamma _{{\mathrm{H}}} \ll M_{{\mathrm{H}}}\))

$$\begin{aligned} \Delta _{{\mathrm{H}}}&= \frac{1}{\left( s - M^2_{{\mathrm{H}}}\right) ^2 + \Gamma ^2_{{\mathrm{H}}}\,M^2_{{\mathrm{H}}}} = \frac{\pi }{M_{{\mathrm{H}}}\,\Gamma _{{\mathrm{H}}}}\,\delta \left( s - M^2_{{\mathrm{H}}}\right) \nonumber \\&+\mathrm {PV}\,\left[ \frac{1}{\left( s - M^2_{{\mathrm{H}}}\right) ^2}\right] \end{aligned}$$
(20)

where \(\mathrm {PV}\) denotes the principal value (understood as a distribution). Furthermore \(s\) is the Higgs virtuality and \(M_{{\mathrm{H}}}\) and \(\Gamma _{{\mathrm{H}}}\) should be understood as \(M_{{\mathrm{H}}} = \mu _{{\mathrm{H}}}\) and \(\Gamma _{{\mathrm{H}}} = \gamma _{{\mathrm{H}}}\) and not as the corresponding on-shell values. In more simple terms, the first term in Eq. (20) puts you on-shell and the second one gives you the off-shell tail. More details are given in Appendix A.

Remark

\(\Delta _{{\mathrm{H}}}\) is the Higgs propagator, there is no space for anything else in QFT (e.g. Breit-Wigner distributions). For a comparison of Breit-Wigner and Complex Pole distributed cross sections at \({\mu _{{\mathrm{H}}}}= 125.6\,GeV\) see Figs. 1 and 2.

Fig. 1
figure 1

Ratio of Breit–Wigner and Complex Pole distributed cross sections at \({\mu _{{\mathrm{H}}}}= 125.6\) GeV

Fig. 2
figure 2

Breit–Wigner and Complex Pole distributed lineshapes at \({\mu _{{\mathrm{H}}}}= 125.6\) GeV

A more familiar representation of the propagator can be written as follows:

Definition

with the parametrization of Eq. (3) we perform the well-known transformation

$$\begin{aligned} {{\overline{M}}^{2}_{{\mathrm{H}}}}= {\mu ^{2}_{{\mathrm{H}}}}+ \gamma ^2_{{\mathrm{H}}} \qquad \mu _{\mathrm {H}}\,{{\overline{\Gamma }}}_{{\mathrm{H}}} = {{\overline{M}}_{{\mathrm{H}}}}\,{\gamma _{{\mathrm{H}}}} \end{aligned}$$
(21)

A remarkable identity follows (defining the Bar-scheme):

$$\begin{aligned} \frac{1}{s - {s_{{\mathrm{H}}}}} = \left( 1 + i\,\frac{{{\overline{\Gamma }}}_{{\mathrm{H}}}}{{{\overline{M}}_{{\mathrm{H}}}}}\right) \, \left( s - {{\overline{M}}^{2}_{{\mathrm{H}}}}+ i\,\frac{{{\overline{\Gamma }}}_{{\mathrm{H}}}}{{{\overline{M}}_{{\mathrm{H}}}}}\,s \right) ^{-1} \end{aligned}$$
(22)

showing that the Bar-scheme is equivalent to introducing a running width in the propagator with parameters that are not the on-shell ones. Special attention goes to the numerator in Eq. (22) which is essential in providing the right asymptotic behavior when \(s \rightarrow \infty \), as needed for cancellations with contact terms in \({\mathrm{V}}{\mathrm{V}}\,\)scattering.

The natural question is: to which level of accuracy does the ZWA [delta-term only in Eq. (20)] approximate the full off-shell result given that at \({\mu _{{\mathrm{H}}}}= 125\) GeV the on-shell width is only \(4.03\) MeV? For definiteness we will consider \(ij \rightarrow {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\rightarrow 4{\mathrm{l}}\). When searching the Higgs boson around \(125\) GeV one should not care about the region \(M_{{\mathrm{ZZ}}} > 2\,M_{{\mathrm{Z}}}\) but, due to limited statistics, theory predictions for the normalization in \(\bar{\hbox {q}}-{\mathrm{q}}- {\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) are used over the entire spectrum in the \({\mathrm{Z}}{\mathrm{Z}}\) invariant mass.

Therefore, the question is not to dispute that off-shell effects are depressed by a factor \({\gamma _{{\mathrm{H}}}}/{\mu _{{\mathrm{H}}}}\) but to move away from the peak and look at the behavior of the invariant mass distribution, no matter how small it is compared to the peak; is it really decreasing with \(M_{{\mathrm{ZZ}}}\)? Is there a plateau? For how long? How does that affect the total cross-section if no cut is made?

Let us consider the signal, in the complex-pole scheme:

$$\begin{aligned} \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}}({\mathrm {S}})&= \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}}(M^2_{{\mathrm{ZZ}}})\nonumber \\&= \frac{1}{\pi }\, \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}}\,\frac{M^4_{{\mathrm{ZZ}}}}{\Bigr |M^2_{{\mathrm{ZZ}}} - {s_{{\mathrm{H}}}}\Bigr |^2}\, \frac{\Gamma _{{\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}}\left( M_{{\mathrm {Z}}}\right) }{M_{{\mathrm{ZZ}}}} \end{aligned}$$
(23)

where \({s_{{\mathrm{H}}}}\) is the Higgs complex pole, given in Eq. (3). Away (but not too far away) from the narrow peak the propagator and the off-shell \({\mathrm{H}}\) width behave like

$$\begin{aligned} \Delta _{{\mathrm{H}}} \approx \frac{1}{\left( M^2_{{\mathrm{ZZ}}} - {\mu ^{2}_{{\mathrm{H}}}}\right) ^2}, \qquad \frac{\Gamma _{{\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}}\left( M_{{\mathrm {Z}}}\right) }{M_{{\mathrm{ZZ}}}} \sim G_{{\mathrm {F}}}\,M^2_{{\mathrm{ZZ}}} \end{aligned}$$
(24)

above threshold with a sharp increase just below it (it goes from \(1.62 \times 10^{-2}\) GeV at \(175\) GeV to \(1.25 \times 10^{-1}\) GeV at \(185\) GeV).

Our result for the \({\mathrm{V}}{\mathrm{V}}\) (\({\mathrm{V}}= {\mathrm{W}}/{\mathrm{Z}}\)) invariant mass distribution is shown in Fig. 3: after the peak the distribution is falling down until the effects of the \({\mathrm{V}}{\mathrm{V}}\,\)-thresholds become effective with a visible increase followed by a plateau, by another jump at the \(\bar{{\mathrm{t}}}-{\mathrm{t}}\)-threshold. Finally the signal distribution starts again to decrease, almost linearly.

Fig. 3
figure 3

The NNLO \({\mathrm{V}}{\mathrm{V}}\) invariant mass distribution in \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{V}}{\mathrm{V}}\) for \({\mu _{{\mathrm{H}}}}= 125\) GeV

What is the net effect on the total cross-section? We show it in Table 1 where the contribution above the \({\mathrm{Z}}{\mathrm{Z}}\,\)-threshold amounts to \(7.6\%\). The presence of the effect does not depend on the propagator function used (Breit-Wigner or complex-pole propagator). The size of the effect is related to the distribution function. In Table 2 we present the invariant mass distribution integrated bin-by-bin.

Table 1 Total cross-section in \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) and in \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}\rightarrow \,\) all; the part of the cross-section for \(M_{{\mathrm{ZZ}}} > 2\,M_{{\mathrm{Z}}}\) is explicitly shown. \(R[\%]\) is the ratio between the number of events with \(M_{{\mathrm{ZZ}}} > 2\,M_{{\mathrm{Z}}}\) and the total number of events
Table 2 Bin-by-bin cross-section in \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\). First row gives the bin in GeV, second row gives the cross-section in pb

If we take the ZWA value for the production cross-section at \(8\) TeV and for \({\mu _{{\mathrm{H}}}}= 125\) GeV (\(19.146\) pb) and use the branching ratio into \({\mathrm{Z}}{\mathrm{Z}}\) of \(2.67\,\times \,10^{-2}\) we obtain a ZWA result of \(0.5203\) pb with a \(5\%\) difference w.r.t. the off-shell result, fully compatible with the \(7.6\%\) effect coming form the high-energy side of the resonance.

Always from Table 1 we see that the effect is much less evident if we sum over all final states with a net effect of only \(0.8\%\) (the decay is \(\bar{{\mathrm{b}}}-{\mathrm{b}}\) dominated).

Of course, the signal per se is not a physical observable and one should always include background and interference. In Fig. 4 we show the complete LO result. Numbers are shown with a cut of \(0.25\,M_{{\mathrm{ZZ}}}\) on \(p_\mathrm{T}^\mathrm{Z}\). The large destructive effects of the interference wash out the peculiar structure of the signal distribution. If one includes the region \(M_{{\mathrm{ZZ}}} > 2\,M_{{\mathrm{Z}}}\) in the analysis then the conclusion is: interference effects are relevant also for the low-mass region.

Fig. 4
figure 4

The LO \({\mathrm{Z}}{\mathrm{Z}}\) invariant mass distribution \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) for \({\mu _{{\mathrm{H}}}}= 125\) GeV. The black line is the total, the red line gives the signal while the cyan line gives signal plus background; the blue line includes the \(q\bar{q} \rightarrow {\mathrm{Z}}{\mathrm{Z}}\) contribution

It is worth noting again that the whole effect on the signal has nothing to do with \({\gamma _{{\mathrm{H}}}}/{\mu _{{\mathrm{H}}}}\) effects; above the \({\mathrm{Z}}{\mathrm{Z}}\,\)-threshold the distribution is higher than expected (although tiny w.r.t. the narrow peak) and stays approximately constant till the \(\bar{{\mathrm{t}}}-{\mathrm{t}}\)-threshold after which we observe an almost linear decrease. This is why the total cross-section is affected (in a \({\mathrm{V}}{\mathrm{V}}\) final state) at the \(5\%\) level.

3.4 When the going gets tough, interference gets going

The higher-order correction in gluon-gluon fusion have shown a huge \(\mathrm {K}\,\)-factor

$$\begin{aligned} \mathrm {K}= \frac{\sigma _{{\mathrm{prod}}}^{\mathrm{NNLO}}}{\sigma _{{\mathrm{prod}}}^{\mathrm{LO}}}, \qquad \sigma _{{\mathrm{prod}}} = \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}}. \end{aligned}$$
(25)

3.4.1 The zero-knowledge scenario

A potential worry is: should we simply use the full LO calculation or should we try to effectively include the large (factor two) \(\mathrm {K}\,\)-factor to have effective NNLO observables? There are different opinions since interference effects may be as large or larger than NNLO corrections to the signal. Therefore, it is important to quantify both effects. We examine first the scenario where zero knowledge is assumed on the \(\mathrm {K}\,\)-factor for the background. So far, two options have been introduced to account for the increase in the signal. Let us consider any distribution \({\mathrm {D}}\) (for definiteness we will consider \(ij \rightarrow {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\rightarrow 4{\mathrm{l}}\)), i.e.

$$\begin{aligned} D = \frac{d\sigma }{d M^2_{{\mathrm{ZZ}}}} \quad \text{ or } \quad \frac{d\sigma }{d p_\mathrm{T}^\mathrm{Z}} \quad \text{ etc. } \end{aligned}$$
(26)

where \(M_{{\mathrm{ZZ}}}\) is the invariant mass of the \({\mathrm{Z}}{\mathrm{Z}}\,\)-pair and \(p_\mathrm{T}^\mathrm{Z}\) is the transverse momentum. Two possible options are:

Definition

The additive option is defined by the following relation

$$\begin{aligned} {\mathrm {D}}^{\mathrm{NNLO}}_{\mathrm{eff}} = {\mathrm {D}}^{\mathrm{NNLO}}({\mathrm {S}}) + {\mathrm {D}}^{\mathrm{LO}}({\mathrm {I}}) + {\mathrm {D}}^{\mathrm{LO}}({\mathrm {B}}) \end{aligned}$$
(27)

Definition

The multiplicative [12] (\(\mathrm {M}\)) or completely multiplicative (\(\overline{\mathrm {M}}\)) option is defined by the following relation:

$$\begin{aligned}&{\mathrm {D}}^{\mathrm{NNLO}}_{\mathrm{eff}}(\mathrm {M}) =\mathrm {K}_{{\mathrm {D}}}\,\left[ {\mathrm {D}}^{\mathrm{LO}}({\mathrm {S}}) + {\mathrm {D}}^{\mathrm{LO}}({\mathrm {I}}) \right] + {\mathrm {D}}^{\mathrm{LO}}({\mathrm {B}}),\nonumber \\&\quad {\mathrm {D}}^{\mathrm{NNLO}}_{\mathrm{eff}}(\overline{\mathrm {M}}) = \mathrm {K}_{{\mathrm {D}}}\,{\mathrm {D}}^{\mathrm{LO}}, \quad \mathrm {K}_{{\mathrm {D}}} = \frac{{\mathrm {D}}^{\mathrm{NNLO}}({\mathrm {S}})}{{\mathrm {D}}^{\mathrm{LO}}({\mathrm {S}})} \end{aligned}$$
(28)

where \(\mathrm {K}_{{\mathrm {D}}}\) is the differential \(\mathrm {K}\,\)-factor for the distribution. The \(\overline{\mathrm {M}}\) option is only relevant for background subtraction and it is closer to the central value described in Sect. 4.2.1.

In both cases the NNLO corrections include the NLO electroweak (EW) part, for production [13] and decay. The EW NLO corrections for \({\mathrm{H}}\rightarrow {\mathrm{W}}{\mathrm{W}}/{\mathrm{Z}}{\mathrm{Z}}\rightarrow 4{\mathrm{f}}\) can reach a \(15\%\) in the high part of the tail. It is worth noting that the differential \(\mathrm {K}\,\)-factor for the \({\mathrm{Z}}{\mathrm{Z}}\,\)-invariant mass distribution is a slowly increasing function of \(M_{{\mathrm{ZZ}}}\) after \(M_{{\mathrm{ZZ}}} = 2\,M_{{\mathrm{t}}}\), going (e.g. for \({\mu _{{\mathrm{H}}}}= 125.6\) GeV) from \(1.98\) at \(M_{{\mathrm{ZZ}}} = 2\,M_{{\mathrm{t}}}\) to \(2.11\) at \(M_{{\mathrm{ZZ}}} = 1\) GeV.

The two options, as well as intermediate ones, suffer from an obvious problem: they are spoiling the unitarity cancellation between signal and background for \(M_{{\mathrm{ZZ}}} \rightarrow \infty \). Therefore, our partial conclusion is that any option showing an early onset of unitarity violation should not be used for too high values of the \({\mathrm{Z}}{\mathrm{Z}}\,\)-invariant mass.

Therefore, our first prescription in proposing an effective higher-order interference will be to limit the risk of overestimation of the signal by applying the recipe only in some restricted interval of the \({\mathrm{Z}}{\mathrm{Z}}\,\)-invariant mass. This is especially true for high values of \({\mu _{{\mathrm{H}}}}\) where the off-shell effect is large. Explicit calculations show that the multiplicative option is better suited for regions with destructive interference while the additive option can be used in regions where the effect of the interference is positive, i.e. we still miss higher orders from the background amplitude but do not spoil cancellations between signal and background.

Actually, there is an intermediate options that is based on the following observation: higher-order corrections to the signal are made of several terms, see Eq. (14): the partonic cross-section is defined by

$$\begin{aligned}&\sum _{ij}\,\sigma _{ij \rightarrow {\mathrm{H}}+ {\mathrm{X}}}(\upzeta , \upkappa , {\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}}) = \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}}\,\delta \left( 1 - \frac{z}{v}\right) \nonumber \\&\quad +\frac{s}{\upkappa }\,\left( \Delta \sigma _{{\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{H}}{\mathrm{g}}} + \Delta \sigma _{{\mathrm{q}}{\mathrm{g}}\rightarrow {\mathrm{H}}{\mathrm{q}}} + \Delta \sigma _{\bar{\hbox {q}}{\mathrm{q}}\rightarrow {\mathrm{H}}{\mathrm{g}}} + \text{ NNLO }\right) \nonumber \\ \end{aligned}$$
(29)

From this point of view it seems more convenient to define

$$\begin{aligned} \mathrm {K}_{{\mathrm {D}}} = \mathrm {K}_{{\mathrm {D}}}^{{\mathrm{g}}{\mathrm{g}}} + \mathrm {K}_{{\mathrm {D}}}^{\mathrm{rest}}, \quad \mathrm {K}_{{\mathrm {D}}}^{{\mathrm{g}}{\mathrm{g}}} = \frac{{\mathrm {D}}^{\mathrm{NNLO}}\left( {\mathrm{g}}{\mathrm{g}}\!\rightarrow \! {\mathrm{H}}({\mathrm{g}}) \rightarrow {\mathrm{Z}}{\mathrm{Z}}({\mathrm{g}})\right) }{{\mathrm {D}}^{\mathrm{LO}}\left( {\mathrm{g}}{\mathrm{g}}\!\rightarrow \! {\mathrm{H}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\right) }\nonumber \\ \end{aligned}$$
(30)

and to introduce a third option

Definition

The intermediate option is given by the following relation:

$$\begin{aligned} {\mathrm {D}}^{\mathrm{NNLO}}_{\mathrm{eff}} = \mathrm {K}_{{\mathrm {D}}}\,{\mathrm {D}}^{\mathrm{LO}}({\mathrm {S}}) + \left( \mathrm {K}_{{\mathrm {D}}}^{{\mathrm{g}}{\mathrm{g}}}\right) ^{1/2}\, {\mathrm {D}}^{\mathrm{LO}}({\mathrm {I}}) + {\mathrm {D}}^{\mathrm{LO}}({\mathrm {B}}) \end{aligned}$$
(31)

which, in our opinion, better simulates the inclusion of \(\mathrm {K}\,\)-factors at the level of amplitudes in the zero knowledge scenario (where we are still missing corrections to the continuum amplitude).

4 There is no free lunch

Summary of (Higgs precision physics) milestones without sweeping under the rug the following issues:

  • moving forward, beyond ZWA (see Ref. [1]) don’t try fixing something that is already broken in the first place.

  • Unstable particles require complex-pole-scheme (see Ref. [14]).

  • Off-shell + Interferences + uncertainty in \({\mathrm{V}}{\mathrm{V}}\) production (see Ref. [15]).

  • See also Interference in di-photon channel, see Refs. [16, 17] and Refs. [1820].

The so-called area method [4] is not so useless, even for a light Higgs boson. One can use a measurement of the off-shell region to constrain the couplings of the Higgs boson. Using a simple cut-and-count method and one scaling parameter (see Eq. (32) in Sect. 4.1), existing LHC data should bound the width at the level of \(25{-}45\) times the Standard Model expectation [3, 21].

Remark

Chronology and Historical background

one cannot influence developments beyond telling his side of the story. The judgement about originality, importance, impact etc. is of course up to others

  • Constraining the Higgs boson intrinsic width has been discussed during several LHC HXSWG meetings (G. Passarino, LHC HXSWG epistolar exchange, e.g. \(10/25/10\) with CMS “Are you referring to measuring the width according to the area method you discuss in your book [6]? That would be interesting to apply if possible”).

  • N. Kauer was the first person who created a plot clearly showing the enhanced Higgs tail. It was shown at the \(6\)th LHC HXSWG meeting.Footnote 2

  • N. Kauer and G. Passarino (arXiv:1206.4803 [hep-ph]) confirmed the tail and provided an explanation for it, starting a detailed phenomenological study, see Ref. [1] and also Refs. [2, 22].

  • Higgs interferometry has been discussed at length in the LHC HXSWG (epistolar exchange, e.g. on \(05/17/13\)\(\cdots \) the interference effects could be used to constrain BSM Higgs via indirect Higgs width measurement \(\cdots \) there are large visible effectsFootnote 3). For a comprehensive presentation, see D. de Florian talk at “Higgs Couplings \(2013\)”.Footnote 4

  • Dixon and Li [19], followed by F. Caola and K. Melnikov (arXiv:1307.4935 [hep-ph]) introduced the notion of \(\infty \,\)-degenerate solutions for the Higgs couplings to SM particles, observed that the enhanced tail, discussed and explained in arXiv:1206.4803 [hep-ph], is obviously \({\gamma _{{\mathrm{H}}}}\,\)-independent and that this could be exploited to constrain the Higgs width model-independently if there’s experimental sensitivity to the off-peak Higgs signal [3]. Once you have a model for increasing the width beyond the SM value, Ref. [3] turns the observation of Ref. [1] into a bound on the Higgs width, within the given scenario of degeneracy.

  • J. M. Campbell, R. K. Ellis and C. Williams (arXiv:1311.3589 [hep-ph]) investigated the power of using a matrix element method (MEM) to construct a kinematic discriminant to sharpen the constraint [21] (with foreseeable extensions in MEM\(@\)NLO [23]). MEM-based analysis has been the first to describe a method for suppressing \(\bar{\hbox {q}}-{\mathrm{q}}\) background; the importance of his work cannot be overestimated. Complementary results from \({\mathrm{H}}\rightarrow {\mathrm{W}}{\mathrm{W}}\) in the high transverse mass region are shown in Ref. [24]. The MEM-based analysis for separation of the \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) and \(\bar{\hbox {q}}{\mathrm{q}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) processes, including signal and background interference within the \({\mathrm{g}}{\mathrm{g}}\rightarrow {\mathrm{Z}}{\mathrm{Z}}\) process, has been suggested and implemented within the MELA framework on CMS [25] (http://www.pha.jhu.edu/spin/).

  • This note provides a more detailed description of the theoretical uncertainty associated with the camel-shaped and square-root–shaped tails of a light Higgs boson.

  • A similar analysis, performed for the exclusion of a heavy SM Higgs boson, can be found in Ref. [15] and in Ref. [12] with improvements suggested in Ref. [26].

4.1 How to use an LO MC?

The MCs used in the analysis are based on LO calculations, some of them include \(\mathrm {K}\,\)-factors for the production but all of them have decay and interference implemented at LO. The adopted solution is external “re-weighting” (i.e. re-weighting with results from some analytical code), although rescaling exclusive distributions (e.g. in the final state leptons) with inclusive \(\mathrm {K}\,\)-factors is something that should not be done, it requires (at least) a \(1{-}1\) correspondence between the two lowest orders.

An example of \(\mathrm {K}\,\)-factors that can be used to include interference in the zero-knowledge scenario is given in Fig. 5. For a more general discussion on re-weighting see Ref. [27].

Fig. 5
figure 5

Differential \(K\,\)-factors in Higgs production for \({\mu _{{\mathrm{H}}}}= 125.6\) GeV

Most of the studies performed so far are for the exclusion of a heavy SM Higgs bosonFootnote 5 and, from that experience, we can derive that It Takes A Fool To Remain Sane:

A list of comments and/or problems

  • LO decay is not state-of-art, especially for high values of the final state invariant mass and the effect of missing higher orders is rapidly increasing with the final state invariant mass.

  • When the cross-section \(i j \rightarrow {\mathrm{H}}\) refers to an off-shell Higgs boson the choice of the QCD scales should be made according to the virtuality and not to a fixed value. Indeed, one must choose an infrared safe quantity fixed from the detectable final state, see Ref. [10]. Using the Higgs virtuality or the QCD scales has been advocated in Ref. [14]: the numerical impact is relevant, especially for high values of the invariant mass, the ratio static/dynamic scales being \(1.05\). The authors of Ref. [21] seem to agree on our choice [14].

  • References [3, 21] consider the following scenario (on-shell \(\infty \,\)-degeneracy): allow for a scaling of the Higgs couplings and of the total Higgs width defined by

    $$\begin{aligned}&\sigma _{i \rightarrow {\mathrm{H}}\rightarrow f} = \left( \sigma \cdot \mathrm {BR}\right) = \frac{\sigma ^{{\mathrm{prod}}}_i\,\Gamma _f}{{\gamma _{{\mathrm{H}}}}} \quad \sigma _{i \rightarrow {\mathrm{H}}\rightarrow f}\nonumber \\&\quad \varpropto \centerdot \;\frac{g^2_i g^2_f}{{\gamma _{{\mathrm{H}}}}} \quad g_{i,f} = \xi \,g^{\mathrm{SM}}_{i,f}, \;\; {\gamma _{{\mathrm{H}}}}= \xi ^4\,{\gamma _{{\mathrm{H}}}}^{\mathrm{SM}} \end{aligned}$$
    (32)

    Looking for \(\xi \,\)-dependent effects in the highly off-shell region is an approach that raises sharp questions on the nature of the underlying extension of the SM; furthermore it does not take into account variations in the SM background and the signal strength in \(4{\mathrm{l}}\), relative to the expectation for the SM Higgs boson, is measured by CMS to be \(0.91^{+ 0.30}_{-0.24}\) [28] and by ATLAS to be \(1.43^{+ 0.40}_{-0.35}\) [29]. We adopt the approach of Ref. [30] [in particular Eqs. (118)] which is based on the \(\upkappa \,\)-language, allowing for a consistent “Higgs Effective Field Theory” (HEFT) interpretation, see Ref. [31]. Neglecting loop-induced vertices, we have

    $$\begin{aligned}&\frac{\Gamma _{{\mathrm{g}}{\mathrm{g}}}}{\Gamma _{{\mathrm{g}}{\mathrm{g}}}^{\mathrm{SM}}({\mu _{{\mathrm{H}}}})} =\frac{\upkappa _{{\mathrm{t}}}^2\cdot \Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{t}}{\mathrm{t}}}({\mu _{{\mathrm{H}}}}) \!+\! \upkappa _{{\mathrm{b}}}^2\cdot \Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{b}}{\mathrm{b}}}({\mu _{{\mathrm{H}}}}) \!+\! \upkappa _{{\mathrm{t}}}\upkappa _{{\mathrm{b}}}\cdot \Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{t}}{\mathrm{b}}}({\mu _{{\mathrm{H}}}})}{\Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{t}}{\mathrm{t}}}({\mu _{{\mathrm{H}}}}) \!+\! \Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{b}}{\mathrm{b}}}({\mu _{{\mathrm{H}}}})\!+\!\Gamma _{{\mathrm{g}}{\mathrm{g}}}^{{\mathrm{t}}{\mathrm{b}}}({\mu _{{\mathrm{H}}}})}\nonumber \\&\quad \sigma _{i \rightarrow {\mathrm{H}}\rightarrow f} \;=\; \frac{\upkappa ^2_i \upkappa ^2_f}{\upkappa ^2_{{\mathrm{H}}}}\,\sigma ^{\mathrm{SM}}_{i \rightarrow {\mathrm{H}}\rightarrow f} \end{aligned}$$
    (33)

Remark

The measure of off-shell effects can be interpreted as a constraint on \({\gamma _{{\mathrm{H}}}}\) only when we scale couplings and total width according to Eq. (32) to keep \(\sigma _{{\mathrm{peak}}}\) untouched, although its value is known with 15–20% accuracy.

Proposition 4.1

The generalization of Eq. (32) is an \(\infty ^2\,\)-degeneracy, \(\upkappa _i\,\upkappa _f = \upkappa _{{\mathrm{H}}}\).

On the whole, we have a constraint in the multidimensional \(\upkappa \,\)-space, since \(\upkappa ^2_{{\mathrm{g}}} = \upkappa ^2_{{\mathrm{g}}}(\upkappa _{{\mathrm{t}}},\upkappa _{{\mathrm{b}}})\) and \(\upkappa ^2_{{\mathrm{H}}} = \upkappa ^2_{{\mathrm{H}}}(\upkappa _j,\,\forall \,j)\). Only on the assumption of degeneracy we can prove that off-shell effects “measure” \(\upkappa _{{\mathrm{H}}}\); a combination of on-shell effects (measuring \(\upkappa _i\,\upkappa _f/\upkappa _{{\mathrm{H}}}\)) and off-shell effects [measuring \(\upkappa _i\,\upkappa _f\), see Eq. (9)] gives information on \(\upkappa _{{\mathrm{H}}}\) without prejudices. Denoting by \({\mathrm {S}}\) the signal and by \({\mathrm {I}}\) the interference and assuming that \({\mathrm {I}}_{{\mathrm{peak}}}\) is negligible we have

$$\begin{aligned} \frac{\mathrm {S}_{{\mathrm{off}}}}{\mathrm {S}_{{\mathrm{peak}}}}\,\upkappa ^2_{{\mathrm{H}}} + \frac{\mathrm {I}_{{\mathrm{off}}}}{\mathrm {S}_{{\mathrm{peak}}}}\,\frac{\upkappa _{{\mathrm{H}}}}{x_{if}}, \qquad x_{if} = \frac{\upkappa _i \upkappa _f}{\upkappa _{{\mathrm{H}}}} \end{aligned}$$
(34)

for the normalized \(\mathrm {S} + \mathrm {I}\) off-shell cross section.

The background, e.g. \({\mathrm{g}}{\mathrm{g}}\rightarrow 4{\mathrm{l}}\), is also changed by the inclusion of \(d = 6\) operators and one cannot claim that New Physics is modifying only the signal.Footnote 6

  • The total systematic error is dominated by theoretical uncertainties, therefore one should never accept theoretical predictions that cannot provide uncertainty in a systematic way (i.e. providing an algorithm).

In Fig. 6 we consider the estimated theoretical uncertainty (THU) on the signal lineshape for a mass of \(125.6\,GeV\). Note that PDF\(\,{+}\,\alpha _{\mathrm {s}}\) and QCD scales uncertainties are not included. As expected for a light Higgs boson, the EW THU is sizable only for large values of the off-shell tail, reaching \(\pm 4.7\%\) at \(1\) TeV (the algorithm is explained in Ref. [14]). To summarize the various sources of parametric (PU) and theoretical (THU) uncertainties, we have

Fig. 6
figure 6

Electroweak theoretical uncertainty for the signal lineshape at \({\mu _{{\mathrm{H}}}}= 125.6\) GeV

THU summary

➀:

PDF\(\,+\,\alpha _{\mathrm {s}}\); these have a Gaussian distribution;

➁:

\({\mu _{{\mathrm {R}}}}, {\mu _{{\mathrm {F}}}}\) (renormalization and factorization QCD scales) variations; they are the standard substitute for missing higher order uncertainty (MHOU) [32]; MHOU are better treated in a Bayesian context with a flat prior;

➂:

uncertainty on \({\gamma _{{\mathrm{H}}}}\) [Eq. (3)] due to missing higher orders, negligible for a light Higgs;

➃:

✔uncertainty for \(\Gamma _{{\mathrm{H}}\rightarrow {\mathrm{F}}}({M_{{\mathrm{f}}}})\) due to missing higher orders (mostly EW), especially for high values of the Higgs virtuality \({M_{{\mathrm{f}}}}\) (i.e. the invariant mass in \({\mathrm{p}}{\mathrm{p}}\rightarrow {\mathrm{H}}\rightarrow {\mathrm{f}}+ {\mathrm{X}}\));

➄:

✔uncertainty due to missing higher orders (mostly QCD) for the background

where ✔means discussed in this note. When ➁ is included one should remember the N\({}^3\)LO effect in gluon-gluon fusion (estimated \(+17\%\) in Ref. [33]) and and additional \(+7\%\) for an all-order estimate, see Ref. [32]. These numbers refer to the fully inclusive \(\mathrm {K}\,\)-factors. The effect of varying QCD scales, \({\mu _{{\mathrm {R}}}}= {\mu _{{\mathrm {F}}}}\in [ {M_{{\mathrm{f}}}}/4\,,\,{M_{{\mathrm{f}}}}]\) is shown in Fig. 7, for \(\mathrm {K}\) and \(\sqrt{\mathrm {K}_{{\mathrm{g}}{\mathrm{g}}}}\).

Fig. 7
figure 7

Differential \(K\)-factors in Higgs production for \({\mu _{{\mathrm{H}}}}= 125.6\) GeV. The central values correspond to \({\mu _{{\mathrm {R}}}}= {\mu _{{\mathrm {F}}}}= {M_{{\mathrm{f}}}}/2\), where \({M_{{\mathrm{f}}}}\) is the Higgs virtuality. The bands give the THU simulated by varying QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4,{M_{{\mathrm{f}}}}]\)

Once again, it should be stressed that QCD scale variation is only a conventional simulation of the effect of missing higher orders. Taking Fig. 7 for its face value, we register a substantial reduction in the uncertainty when \(\mathrm {K}\,\)-factors are included. For instance, we find \([ {-}12.1\%\,,\,{+}11.0\% ]\) for the NNLO prediction around the peak, \([ {-}10.9\%\,,\,{+}9.9\% ]\) around \(2\,M_{{\mathrm{Z}}}\) and \([ {-}9.7\%\,,\,{+}6.6\% ]\) at 1 TeV. The corresponding LO prediction is \([ {-}27.3\%\,,\,{+}12.9\% ]\) around the peak, \([ {-}29.5\%\,,\,{+}32.1\% ]\) around \(2\,M_{{\mathrm{Z}}}\) and \([ {-}38\%\,,\,+42\% ]\) at 1 TeV. Note that \({\mu _{{\mathrm {R}}}}\) enters also in the values of \(\alpha _{\mathrm {s}}\).

Admittedly, showing the effect of QCD scale variations on \(\mathrm {K}\,\)-factors is somewhat misleading but we have adopted this choice in view of the fact that, operatively speaking, the experimental analysis will generate bins in \(M_{4{\mathrm{l}}}\) with a LO MC and multiply the number of events in each bin by the corresponding \(\mathrm {K}\,\)-factor. Introducing \({\mathrm {D}}^{\mathrm{LO}}_{+} = {\mathrm {D}}^{\mathrm{LO}}({M_{{\mathrm{f}}}}/4)\) and \({\mathrm {D}}^{\mathrm{LO}}_{-} = {\mathrm {D}}^{\mathrm{LO}}({M_{{\mathrm{f}}}})\), where \({\mathrm {D}}^{\mathrm{LO}}\) is the LO distribution and \(\mathrm {K}_{+} = \mathrm {K}({M_{{\mathrm{f}}}}/4)\) and \(\mathrm {K}_{-} = \mathrm {K}({M_{{\mathrm{f}}}})\), where \(\mathrm {K}= {\mathrm {D}}^{\mathrm{NNLO}}/{\mathrm {D}}^{\mathrm{LO}}\) is the \(\mathrm {K}\,\)-factor, the correct strategy is \(\mathrm {K}_{\pm }\,{\mathrm {D}}^{\mathrm{LO}}_{\pm }\). When looking at Fig. 7 one should remember that the scale variation that increases (decreases) the distributions is the one decreasing (increasing) the \(\mathrm {K}\,\)-factor. The NNLO and LO (camel-shaped) lineshapes, with QCD scale variations, are given in Fig. 8. The THU induced by QCD scale variation can be reduced by considering the (peak) normalized lineshape, as shown in Fig. 9. In other words the constraint on the Higgs intrinsic width should be derived by looking at the ratio

$$\begin{aligned} \mathrm {R}^{4{\mathrm{l}}}_{{\mathrm{off}}} = \frac{\mathrm {N}^{4{\mathrm{l}}}_{{\mathrm{off}}}}{\mathrm {N}^{4{\mathrm{l}}}_{{\mathrm{tot}}}}, \qquad \mathrm {N}^{4{\mathrm{l}}}_{{\mathrm{off}}} = \mathrm {N}^{4{\mathrm{l}}}\left( M_{4{\mathrm{l}}} > M_0 \right) \end{aligned}$$
(35)

as a function of \({\gamma _{{\mathrm{H}}}}/{\gamma _{{\mathrm{H}}}}^{\mathrm{SM}}\), where \(\mathrm {N}^{4{\mathrm{l}}}\) is the number of \(4\,\)-leptons events. Since the \(\mathrm {K}\,\)-factor has a relatively small range of variation with the virtuality, the ratio in Eq. (35) is much less sensitive also to higher order terms.

Fig. 8
figure 8

(Camel) Lineshape for \({\mu _{{\mathrm{H}}}}= 125.6\) GeV. The central values correspond to \({\mu _{{\mathrm {R}}}}= {\mu _{{\mathrm {F}}}}= {M_{{\mathrm{f}}}}/2\), where \({M_{{\mathrm{f}}}}\) is the Higgs virtuality. The bands give the THU simulated by varying QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4, {M_{{\mathrm{f}}}}]\)

Fig. 9
figure 9

Normalized NNLO lineshape for \({\mu _{{\mathrm{H}}}}= 125.6\) GeV. The central values correspond to \({\mu _{{\mathrm {R}}}}= {\mu _{{\mathrm {F}}}}= {M_{{\mathrm{f}}}}/2\), where \({M_{{\mathrm{f}}}}\) is the Higgs virtuality. The bands give the THU simulated by varying QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4, {M_{{\mathrm{f}}}}]\)

An additional comment refers to Eqs. (42, 43) of Ref. [21], where \({\gamma _{{\mathrm{H}}}}= {\gamma _{{\mathrm{H}}}}^{\mathrm{SM}}\) produces a negative number of events, a typical phenomenon that occurs with large and destructive interference effects when only signal + interference is considered. Unless the notion of negative events is introduced (background-subtracted number of events), the SM case cannot be included, as also shown in their Fig. 9, where only the portion \({\gamma _{{\mathrm{H}}}}> 4.58(2.08)\,{\gamma _{{\mathrm{H}}}}^{\mathrm{SM}}\) should be considered for \(M_{4{\mathrm{l}}} > 130(300)\) GeV, roughly a factor of \(10\) smaller than the estimated bounds. This clearly demonstrate the importance of controlling THU on the interference, especially for improved limits on \({\gamma _{{\mathrm{H}}}}\).

4.2 Improving THU for interference?

One could argue that zero knowledge on the background \(\mathrm {K}\,\)-factor is a too conservative approach but it should be kept in mind that it’s better to be with no one than to be with wrong one. Let us consider in details the process \(i j \rightarrow {\mathrm{F}}\); the amplitude can be written as the sum of a resonant (R) and a non-resonant (NR) part,

$$\begin{aligned} A_{ij \rightarrow {\mathrm{F}}} = A_{ij \rightarrow {\mathrm{H}}}\,\frac{1}{s - {s_{{\mathrm{H}}}}}\,A_{{\mathrm{H}}\rightarrow {\mathrm{F}}} + A^{{\mathrm{NR}}}_{ij \rightarrow {\mathrm{F}}} \end{aligned}$$
(36)

We denote by LO the lowest order in perturbation theory where a process starts contributing and introduce \(\mathrm {K}\,\)-factors that include higher orders.

$$\begin{aligned}&A_{ij \rightarrow {\mathrm{H}}} = \left( \mathrm {K}^{\mathrm {p}}_{ij} \right) ^{1/2}\,A^{\mathrm{LO}}_{ij \rightarrow {\mathrm{H}}}, \quad A_{{\mathrm{H}}\rightarrow {\mathrm{F}}} = \left( \mathrm {K}^{\mathrm {d}}_{{\mathrm{F}}} \right) ^{1/2}\,A^{\mathrm{LO}}_{{\mathrm{H}}\rightarrow {\mathrm{F}}},\nonumber \\&A^{{\mathrm{NR}}}_{ij \rightarrow {\mathrm{F}}} = \left( \mathrm {K}^{\mathrm {b}}_{ij{\mathrm{F}}} \right) ^{1/2}\, A^{{\mathrm{NR}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}} \end{aligned}$$
(37)

Furthermore, we introduce

$$\begin{aligned} A^{{\mathrm{R}}}_{ij \rightarrow {\mathrm{F}}} = A_{ij \rightarrow {\mathrm{H}}}\,A_{{\mathrm{H}}\rightarrow {\mathrm{F}}} \end{aligned}$$
(38)

the interference becomes

$$\begin{aligned}&\mathrm {I} = 2\,\left[ \mathrm {K}^{\mathrm {p}}_{ij}\,\mathrm {K}^{\mathrm {d}}_{{\mathrm{F}}}\,\mathrm {K}^{\mathrm {b}}_{ij{\mathrm{F}}} \right] ^{1/2}\nonumber \\&\quad \times \left\{ \begin{array}{l}{\mathrm{Re}}\,\frac{A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}}{s - {s_{{\mathrm{H}}}}}\;{\mathrm{Re}}\,A^{{\mathrm{NR}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}} - {\mathrm{Im}}\,\frac{A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}}{s - {s_{{\mathrm{H}}}}}\;{\mathrm{Im}}\,A^{{\mathrm{NR}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}\end{array}\right\} \nonumber \\&{\mathrm{Re}}\,\frac{A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}}{s - {s_{{\mathrm{H}}}}} = \frac{s - {\mu _{{\mathrm{H}}}}^2}{\Bigr |s - {s_{{\mathrm{H}}}}\Bigr |^2}\,{\mathrm{Re}}\,A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}} + \frac{{\mu _{{\mathrm{H}}}}\,{\gamma _{{\mathrm{H}}}}}{\Bigr |s - {s_{{\mathrm{H}}}}\Bigr |^2}\,{\mathrm{Im}}\,A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}\nonumber \\&{\mathrm{Im}}\,\frac{A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}}{s - {s_{{\mathrm{H}}}}} =\frac{s - {\mu _{{\mathrm{H}}}}^2}{\Bigr |s - {s_{{\mathrm{H}}}}\Bigr |^2}\,{\mathrm{Im}}\,A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}} - \frac{{\mu _{{\mathrm{H}}}}\,{\gamma _{{\mathrm{H}}}}}{\Bigr |s - {s_{{\mathrm{H}}}}\Bigr |^2}\,{\mathrm{Re}}\,A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}\nonumber \\ \end{aligned}$$
(39)

From Eq. (39) we see the main difference in the interference effects of a heavy Higgs boson w.r.t. the off-shell tail of a light Higgs boson. For the latter case \({\gamma _{{\mathrm{H}}}}\) is completely negligible, whereas it gives sizable effects for the heavy Higgs boson case.

4.2.1 The soft-knowledge scenario

Neglecting PDF + \(\alpha _{\mathrm {s}}\) uncertainties and those coming from missing higher orders, the major source of THU is due to the missing NLO interference. In Ref. [26] the effect of QCD corrections to the signal-background interference at the LHC has been studied for a heavy Higgs boson. A soft-collinear approximation to the NLO and NNLO corrections is constructed for the background process, which is exactly known only at LO. Its accuracy is estimated by constructing and comparing the same approximation to the exact result for the signal process, which is known up to NNLO, and the conclusion is that one can describe the signal-background interference to better than ten percent accuracy for large values of the Higgs virtuality. It is also shown that, in practice, a fairly good approximation to higher-order QCD corrections to the interference may be obtained by rescaling the known LO result by a \(\mathrm {K}\,\)-factor computed using the signal process.

The goodness of the approximation, when applied to the signal, remains fairly good down to \(180\) GeV and rapidly deteriorates only below the \(2\,M_{{\mathrm{Z}}}\,\)-threshold; note that both \(M_{4{\mathrm{l}}} > 130\,GeV\) and \(M_{4{\mathrm{l}}} > 300\) GeV have been considered in the study of Ref. [21]. The exact result for the background is missing but the eikonal nature of the approximation should make it equally good, for signal as well as for background.Footnote 7

This line of thought looks very promising, with a reduction of the corresponding THU (zero-knowledge scenario), although its extension from the heavy Higgs scenario to the light Higgs off-shell scenario has not been completely worked out. In a nutshell, one can write

$$\begin{aligned} \sigma = \sigma ^{\mathrm{LO}} \!+\! \sigma ^{\mathrm{LO}}\,\frac{\alpha _{\mathrm {s}}}{2\,\pi }\,\left[ \text{ universal } \!+\! \text{ process } \text{ dependent } \!+\! \text{ reg } \right] \nonumber \\ \end{aligned}$$
(40)

where “universal” (the \(+\) distribution) gives the bulk of the result while “process dependent” (the \(\delta \) function) is known up to two loops for the signal but not for the background and “reg” is the regular part. A possible strategy would be to use for background the same “process dependent” coefficients and allow for their variation within some ad hoc factor. Assuming

$$\begin{aligned} \mathrm {K}^{\mathrm {b},{\mathrm{soft}}}_{ij{\mathrm{F}}} = \mathrm {K}^{\mathrm {p}}_{ij} \pm \Delta \mathrm {K}^{\pm }_{ij} \end{aligned}$$
(41)

we could write

$$\begin{aligned} \mathrm {I}&= 2\,\mathrm {K}^{\mathrm {p}}_{ij}\,\left( \mathrm {K}^{\mathrm {d}}_{{\mathrm{F}}} \right) ^{1/2}\,\left[ 1 \pm \frac{\Delta \mathrm {K}^{\pm }_{ij}}{\mathrm {K}^{\mathrm {p}}_{ij}} \right] ^{1/2}\,{\mathrm{Re}}\,\frac{A^{{\mathrm{R}},\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}}{s-{s_{{\mathrm{H}}}}}\,\left( A^{{\mathrm{NR}}\,\mathrm{LO}}_{ij \rightarrow {\mathrm{F}}}\right) ^{*}\nonumber \\&= 2\,\mathrm {K}^{\mathrm {p}}_{ij}\,\left( \mathrm {K}^{\mathrm {d}}_{{\mathrm{F}}} \right) ^{1/2}\,\left[ 1 \pm \frac{\Delta \mathrm {K}^{\pm }_{ij}}{\mathrm {K}^{\mathrm {p}}_{ij}} \right] ^{1/2}\,\mathrm {I}^{\mathrm{LO}} \end{aligned}$$
(42)

In this scenario the subtraction of the background cannot be performed at LO. It is worth noting that simultaneous inclusion of higher order corrections for Higgs production (NNLO) and Higgs decay (NLO) is a three-loop effect that is not balanced even with the introduction of the eikonal QCD \(\mathrm {K}\,\)factor for the background; three loop mixed EW-QCD corrections are still missing, even at some approximate level. Note that \(\mathrm {K}^{\mathrm {d}}_{4{\mathrm{l}}}\) can be obtained by running Prophecy4f [34] in LO/NLO modes.

4.3 Background-subtracted lineshape

In Fig. 10 we present our results for \(\sigma ^{\mathrm {S} + \mathrm {I}}\) for the \({\mathrm{Z}}{\mathrm{Z}}\rightarrow 4\,{\mathrm{e}}\) final state. The pseudo-observable \(\sigma ^{\mathrm {S} + \mathrm {I}}\) that includes only signal and interference (not constrained to be positive) is now a standard in the experimental analysis.

Fig. 10
figure 10

\(\sigma ^{\mathrm {S} + \mathrm {I}}\) for \(4{\mathrm{e}}\) final state. The blue curve gives the intermediate option and the cyan band the associated THU between additive and multiplicative options. Multiplicative option is the green curve. Red curves give the THU due to QCD scale variation for the intermediate option (QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4\,,\,{M_{{\mathrm{f}}}}]\), where \({M_{{\mathrm{f}}}}= M_{4{\mathrm{e}}}\) is the Higgs virtuality). A cut \(p_\mathrm{T}^\mathrm{Z}> 0.25\,M_{4{\mathrm{e}}}\) has been applied. If one adopts the soft-knowledge recipe, the result is given by the green curve; provisionally, one could assume a \(\pm 10\%\) uncertainty, extrapolating the estimate made for the high-mass study in Ref. [26]

The blue curve in Fig. 10 gives the intermediate option for including the interference and the cyan band the associated THU between additive and multiplicative options. Multiplicative option is the green curve. Red curves give the THU due to QCD scale variation for the intermediate option (QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4\,,\,{M_{{\mathrm{f}}}}]\), where \({M_{{\mathrm{f}}}}= M_{4{\mathrm{e}}}\) is the Higgs virtuality). A cut \(p_\mathrm{T}^\mathrm{Z}> 0.25\,M_{4{\mathrm{e}}}\) has been applied. The figure shows how a \(\mathrm {S}\) (camel-shaped) distributions transforms into a \(\mathrm {S} + \mathrm {I}\) (square-root–shaped) distribution.

Remark

Of course, one could adopt the soft-knowledge recipe, in which case the result is given by the green curve in Fig. 10; provisionally, one could assume a \(\pm 10\%\) uncertainty, extrapolating the estimate made for the high-mass study in Ref. [26]. Background subtraction should be performed accordingly [\(\mathrm {K}^{\mathrm {b}}_{ij{\mathrm{F}}}\) of Eq. (37)].

It is worth introducing few auxiliary quantities [15]: the minimum and the half-minima of \(\sigma ^{\mathrm {S} + \mathrm {I}}\): given

$$\begin{aligned} {\mathrm {D}}\left( M_{4{\mathrm{l}}}\right) = \frac{d}{d M^2_{4{\mathrm{l}}}}\,\sigma ^{\mathrm {S} + \mathrm {I}} \end{aligned}$$
(43)

we define

$$\begin{aligned}&{\mathrm {D}}_1 = {\mathrm {D}}\left( M_1\right) = \min \,{\mathrm {D}}\left( M_{4{\mathrm{l}}}\right) , \nonumber \\&\quad {\mathrm {D}}^{\pm }_{1/2} = {\mathrm {D}}\left( M^{\pm }_{1/2}\right) = \frac{1}{2}\,{\mathrm {D}}\left( M_1 \right) \end{aligned}$$
(44)

As observed in Ref. [15], THU is tiny on \(M_1\) and moderately larger for \(M^{\pm }_{1/2}\).

Remark

Alternatively, and taking into account the indication of Ref. [26] we could proceed as follows:Footnote 8 we can try to turn our three measures of the lineshape into a continuous estimate in each bin; there is a technique, called “vertical morphing” [35], that introduces a “morphing” parameter \(f\) which is nominally zero and has some uncertainty. If we define

$$\begin{aligned}&{\mathrm {D}}^0 = \frac{d \sigma ^{\mathrm {S} + \mathrm {I}}}{d M^2_{4{\mathrm{l}}}}, \quad \text{ option }\;\mathrm {I}, \qquad {\mathrm {D}}^+ = \max \nolimits _{{\mathrm {A}},{\mathrm {M}}}\,{\mathrm {D}}, \nonumber \\&\quad {\mathrm {D}}^- = \min \nolimits _{{\mathrm {A}},{\mathrm {M}}}\,{\mathrm {D}}\end{aligned}$$
(45)

the simplest “vertical morphing” replaces

$$\begin{aligned} {\mathrm {D}}^0 \rightarrow {\mathrm {D}}^0 + \frac{f}{2}\,\left( {\mathrm {D}}^+ - {\mathrm {D}}^-\right) \end{aligned}$$
(46)

Of course, the whole idea depends on the choice of the distribution for \(f\), usually Gaussian which is not necessarily our case; instead, one would prefer to maintain, as much as possible, the indication from the soft-knowledge scenario (in a Bayesian sense). Therefore, we define two curves

$$\begin{aligned} {\mathrm {D}}_{-}\left( \uplambda \,,\,M_{4{\mathrm{l}}}\right) = \uplambda \,{\mathrm {D}}_{{\mathrm {M}}}\left( M_{4{\mathrm{l}}}\right) + \left( 1 - \uplambda \right) \,{\mathrm {D}}_{{\mathrm {I}}}\left( M_{4{\mathrm{l}}}\right) \nonumber \\ {\mathrm {D}}_{+}\left( \uplambda \,,\,M_{4{\mathrm{l}}}\right) = \uplambda \,{\mathrm {D}}_{{\mathrm {I}}}\left( M_{4{\mathrm{l}}}\right) + \left( 1 - \uplambda \right) \,{\mathrm {D}}_{{\mathrm {A}}}\left( M_{4{\mathrm{l}}}\right) \end{aligned}$$
(47)

We assume that the parameter \(\uplambda \), with \(0 \le \uplambda \le 1\), has a flat distribution. We will have \({\mathrm {D}}_{-} < {\mathrm {D}}_{{\mathrm {I}}} < {\mathrm {D}}_{+}\) and a value for \(\uplambda \) close to one (e.g. \(0.9\)) gives less weight to the additive option, highly disfavored by the eikonal approximation. The corresponding THU band will be labelled by \(\mathrm {VM}(\uplambda )\).

Consider \({\mathrm {D}}_1\) of Eq. (44): we have \(M_1 = 233.9\) GeV and the THU band corresponding to the full variation between A-option and M-option is \(0.00171\,fb\), equivalent to a \(\pm 39.9\%\). If we select \(\uplambda = 0.9\) in Eq. (47) the difference \({\mathrm {D}}_- - {\mathrm {D}}_+\) reduces the uncertainty to \(0.00098\,fb\), equivalent to \(\pm 22.8\%\). The destructive effect of the interference shows how challenging will be to put more stringent bounds on \({\gamma _{{\mathrm{H}}}}\) when \({\gamma _{{\mathrm{H}}}}\rightarrow {\gamma _{{\mathrm{H}}}}^{\mathrm{SM}}\). The off-shell effects are an ideal place where to look for “large” deviations from the SM (from \({\gamma _{{\mathrm{H}}}}^{\mathrm{SM}}\)) where, however, large scaling of the Higgs couplings raise severe questions on the structure of underlying BSM theory.

Definition

There is an additional variable that we should consider:

$$\begin{aligned} \mathrm {R}^{\mathrm {S} + \mathrm {I}}\left( M_1,M_2\right) = \frac{\sigma ^{\mathrm {S} + \mathrm {I}}\left( M_{4{\mathrm{l}}} > M_1 \right) }{\sigma ^{\mathrm {S} + \mathrm {I}}\left( M_{4{\mathrm{l}}} > M_2 \right) } \end{aligned}$$
(48)

For instance, integrate \(d\sigma ^{\mathrm {S} + \mathrm {I}}/d M^2_{4{\mathrm{l}}}\) over bins of \(2.25\) GeV for \(M_{4{\mathrm{l}}} > 212\) GeV and obtain \(\sigma ^{\mathrm {S} + \mathrm {I}}(i)\). Next, consider the ratio \(\mathrm {R}^{\mathrm {S} + \mathrm {I}}(i) = \sigma ^{\mathrm {S} + \mathrm {I}}(i)/\sigma ^{\mathrm {S} + \mathrm {I}}(1)\) which is shown in Fig. 11 where the THU band is given by \(\mathrm {VM}(0.9)\). To give an example the THU corresponding to the bin of \(300\) GeV is \(14.9\%\). THU associated with QCD scale variations is given by the two dashed lines.

Fig. 11
figure 11

The ratio \(\mathrm {R}^{\mathrm {S} + \mathrm {I}}(i) = \sigma ^{\mathrm {S} + \mathrm {I}}(i)/\sigma ^{\mathrm {S} + \mathrm {I}}(1)\), Eq. (48), where \(\sigma ^{\mathrm {S} + \mathrm {I}}(i)\) is obtained by integrating \(d\sigma ^{\mathrm {S} + \mathrm {I}}/d M^2_{4{\mathrm{l}}}\) over bins of \(2.25\) GeV for \(M_{4{\mathrm{l}}} > 212\) GeV. The parameter \(\uplambda \) is defined in Eq. (47). Dashed lines give the QCD scale variation (QCD scales \(\in \; [ {M_{{\mathrm{f}}}}/4\,,\,{M_{{\mathrm{f}}}}]\), where \({M_{{\mathrm{f}}}}= M_{4{\mathrm{e}}}\) is the Higgs virtuality). A cut \(p_\mathrm{T}^\mathrm{Z}> 0.25\,M_{4{\mathrm{e}}}\) has been applied

5 Conclusions

The successful search for the on-shell Higgs-like boson has put little emphasis on the potential of the off-shell events; the attitude was “the issue of the Higgs off-shellness is very interesting but it is not relevant for low Higgs masses” and “for SM Higgs below \(200\) GeV, the natural width (mostly for MSSM as well) is much below the experimental resolution. We have therefore never cared about it for light Higgs. Just produce on-shell Higgs and let them decay in MC”; luckily the panorama is changing.

In \(2012\), it was demonstrated that, with few assumptions and using events with pairs of \({\mathrm{Z}}\) particles, the high invariant mass tail can be used to constrain the Higgs width. One can also extract an upper limit on the Higgs width at the price of assuming that its couplings to the known particles are given by the Standard Model, yet allowing new particles to affect the width: the LHC is becoming a precision instrument even in the Higgs sector.

It is clear that one can’t do much without a MC, therefore the analysis should be based on some LO MC, or some other. However, more inclusive NLO (or even NNLO) calculations show that the LO predictions can be far away, which means that re-weighting can be a better approximation, as long as it is accompanied by an algorithmic formulation of the associated theoretical uncertainty. The latter is (almost) dominating the total systematic error and precision Higgs physics requires control of both systematics, not only the experimental one. Very often THU is nothing more than educated guesswork but a workable falsehood is more useful than a complex incomprehensible truth. In other words, closeness to the whole truth is in part a matter of degree of informativeness of a proposition.