Boosted Higgs Shapes

The inclusive Higgs production rate through gluon fusion has been measured to be in agreement with the Standard Model (SM). We show that even if the inclusive Higgs production rate is very SM-like, a precise determination of the boosted Higgs transverse momentum shape offers the opportunity to see effects of natural new physics. These measurements are generically motivated by effective field theory arguments and specifically in extensions of the SM with a natural weak scale, like composite Higgs models and natural supersymmetry. We show in detail how a measurement at high transverse momentum of $H\to 2\ell+\mathbf{p}\!\!/_T$ via $H\to \tau\tau$ and $H\to WW^*$ could be performed and demonstrate that it offers a compelling alternative to the $t\bar t H$ channel. We discuss the sensitivity to new physics in the most challenging scenario of an exactly SM-like inclusive Higgs cross-section.

With no apparent deviation from the SM so far, it is important to closely examine the channels where one has a fighting chance to encounter new physics. One such promising process is Higgs production via gluon fusion: in order to avoid unnatural fine-tuning while still obtaining a light Higgs mass, loops of new particles need to soften the UV sensitivity of the Higgs mass squared induced by the top loop. If these particles are charged under the SU(3) color gauge group (which they are in almost all known cases), gluons will couple to the loop. With two gluons coupled to the new physics loop and one Higgs set to its vacuum expectation value, one gets a contribution to gluon fusion, the dominant Higgs production mechanism; see e.g. [34,35].
At the same time, top partners can lead to a modified top Yukawa coupling. A change in the top Yukawa coupling affects the Higgs production cross-section and can even compensate for new particles in the loop, such that a SM-like inclusive cross-section is obtained even though new physics is present. The reason is that, already for the top quark, the effective gluon-Higgs interaction [36,37] obtained from the low energy theorem is a very good description [38,39], and it works even better for heavier particles. Therefore the inclusive amplitude can be expressed as the sum of two identical Feynman diagrams with the effective interaction (one from the top loop and one from the non-SM loop) which differ only by a coefficient, $c_t$ and $\kappa_g$, respectively. The cross-section is therefore only sensitive to the absolute square of the sum of these coefficients. The effects of this in composite Higgs models were calculated in [40][41][42][43][44], where it is shown that the contributions to the inclusive cross-section indeed cancel in minimal models.
The main idea now is to study boosted Higgs shapes above a certain $p_T$ scale. This scale should be high enough to resolve the top loop beyond the effective description, but low enough to keep the effective description of the loop of the new particle valid; see [45] for a discussion in a concrete model. In this regime the simple relation $\sigma \propto |c_t + \kappa_g|^2$ for the inclusive cross-section no longer applies. It is modified, which allows the two coefficients to be extracted separately when combined with the inclusive measurement. Early studies looking for new physics in the Higgs $p_T$ distribution in the gluon-fusion production mode include [46][47][48], and recent preliminary studies looking at highly boosted Higgs shapes include [45,49,50,51]. An alternative approach to measure the coupling $c_t$ in boosted $pp \to HZ$ was presented in [52,53]. Although there are attempts to measure $c_t$ directly by looking into the difficult $t\bar t H$ channel [54][55][56][57][58][59][60], it is important to explore boosted Higgs production from gluon fusion as a complementary approach.
To simplify the extraction of the small amount of high-$p_T$ signal from the background, we focus on the clean decay of a Higgs to two leptons $\ell = e^\pm, \mu^\pm$ and missing transverse momentum $\mathbf{p}\!\!/_T$. For a 125 GeV Standard Model-like Higgs boson, this occurs almost entirely via $H \to WW^*$ and $H \to \tau\tau$; we analyze these two channels separately, as detailed in Section IV.
The organization of the paper is as follows. In Section II we discuss some examples of beyond the Standard Model physics which motivate this analysis. Section III outlines how we generated our signal and background samples. Section IV contains our signal versus background analyses for boosted $H \to 2\ell + \mathbf{p}\!\!/_T$ in the Standard Model. Section V contains a discussion of the analysis and we conclude in Section VI.

A. Minimal Composite Higgs Model
In the Minimal Composite Higgs Model (MCHM) [61], electroweak symmetry is broken dynamically by a strong interaction based on the coset SO(5)/SO(4). For reviews of the MCHM see [62,63]. In this class of models, the Higgs arises as a pseudo-Nambu-Goldstone boson (pNGB) of the symmetry breaking, which naturally explains its small mass. Fermionic resonances of the strong sector, coming in multiplets of SO(4), will contribute to the gluon fusion loop diagram. These resonances also mix with the SM fermions and thus modify their couplings to the Higgs. Interestingly, the contributions of these resonances to the sum of the coefficients $\kappa_g$ and $c_t$ cancel exactly in a broad class of MCHM models and lead, up to small corrections which are negligible at the LHC [64], to a result that depends only on a function $f_g(\xi)$ [40][41][42][43][44], where $f_g$ satisfies $f_g(\xi \to 0) = 1$, $\xi \equiv v^2/f^2$, and $f$ is the decay constant of the non-linear sigma model. The gluon fusion cross-section is therefore independent of the mass spectrum of the fermionic resonances, and for small $\xi$ is even SM-like. This makes it impossible to find traces of the top partner spectrum in the inclusive gluon fusion process. While the resonances are needed to cut off the UV divergences of the Higgs mass, and thus must not be too heavy if excessive fine-tuning is to be avoided, they should still be heavy enough to allow for an effective description of boosted Higgs production. In [45] it was shown that as long as the mass of the lightest resonance is at least of the order of the Higgs transverse momentum, the result of the calculation in the heavy top limit lies within $O(10\%)$ of the full calculation. Considering that the masses of the resonances have to be heavier than 600-800 GeV, depending on the representation [65][66][67][68][69][70][71][72][73][74], the effective description is well justified within the scope of the paper.

B. Supersymmetry
In the minimal supersymmetric SM (MSSM), an analogous flat direction of the inclusive cross-section exists which can be resolved by looking at boosted Higgs shapes. For certain choices of the stop masses $m_{\tilde t_1}$, $m_{\tilde t_2}$ and $A_t$, the effects of two contributions cancel, yielding a SM-like inclusive signal strength [75][76][77][78][79][80][81][82][83]. Assuming the MSSM is in the decoupling limit, and neglecting small D-term contributions, the inclusive signal strength is given in [84] in terms of a correction term which quantifies the deviation from the SM value and can vanish due to the relative minus sign between its contributions. A 125 GeV Higgs can easily be achieved by extending the MSSM by additional D- or F-terms, which should, of course, not have a major impact on the couplings of the SM-like lightest Higgs.
Since the $A_t$-dependent parts of the production cross-section are less sensitive to the boost of the Higgs than the $A_t$-independent ones, the aforementioned degeneracy is broken in the boosted regime. Therefore the non-SM nature of the Higgs production can be revealed by looking at boosted production. Moreover, this can make accessible light stops [85][86][87][88][89][90][91][92][93][94][95][96] [97,98] which are hidden in the stealth region and challenging to extract given the similarity to the top background [99][100][101][102][103]. An outline showing this sensitivity and taking vacuum stability constraints into account has been presented in [45].

C. Effective description
It is useful to parametrize our ignorance of new physics in terms of an effective Lagrangian. Out of the 59 dimension-six operators one can add to the SM [104,105], only four can affect Higgs production through gluon fusion [106][107][108]. These four operators, as well as the other dimension-six operators involving the Higgs, are already constrained to some extent by LHC data [108][109][110][111][112][113]. We will focus on CP-conserving effects and omit the CP-violating operator containing the dual of the QCD gauge field strength. The remaining three important operators, $O_y$, $O_H$ and $O_g$, are collected in (4). After adding them to the SM Lagrangian and extracting the terms relevant for the gluon fusion process we obtain an effective Lagrangian in which $c_t = 1 - \mathrm{Re}(C_y) - C_H/2$ scales the top Yukawa coupling, which enters the process via the top loop, and $\kappa_g = C_g$ controls the direct gluon-Higgs interaction. The $C_i$ are the coefficients of the corresponding operators in (4), and the normalization is chosen such that for $c_t = 1$ and $\kappa_g = 0$ the SM Lagrangian is obtained. The full matrix element for boosted Higgs production is then given by$^1$

$$\mathcal{M}(c_t, \kappa_g) = c_t\,\mathcal{M}_{IR} + \kappa_g\,\mathcal{M}_{UV}, \qquad (6)$$

where $\mathcal{M}_{IR}$ is the matrix element taking the full top mass dependence into account [115] and $\mathcal{M}_{UV}$ is the one obtained from $\mathcal{M}_{IR}$ in the heavy top limit, or equivalently from the tree-level diagram generated by $O_g$. From Eq. (6) we see that the differential cross-section, normalized by the SM value, depends on $c_t$ and $\kappa_g$ through two $p_T^{\rm cut}$-dependent coefficients, $\delta$ and $\epsilon$.

$^1$ In the SM the effects of the bottom loop are within a few percent if the boost of the Higgs exceeds $O(50~\mathrm{GeV})$ [39,48,114] and are therefore neglected.
For small $p_T^{\rm cut}$, the coefficients $\delta$ and $\epsilon$ are very small, modifying the cross section only by a few percent, which is less than the uncertainty expected in the inclusive Higgs cross section measurements [116][117][118]. This is expected, given the very good description of both the top and the new particle loop by the effective interaction. On the other hand, $\delta$ and $\epsilon$ grow significantly as $p_T^{\rm cut}$ increases, and they become $O(1)$ for $p_T^{\rm cut} > 300$ GeV [45]. This means that measuring the Higgs $p_T$ distribution breaks the degeneracy along the $c_t + \kappa_g = \mathrm{const.}$ direction, which cannot be broken by determining the inclusive cross-section alone.
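The role of the two coefficients can be made explicit with a short derivation. It is a sketch that assumes the matrix element is the linear combination $\mathcal{M}(c_t,\kappa_g) = c_t\mathcal{M}_{IR} + \kappa_g\mathcal{M}_{UV}$ with real $c_t$ and $\kappa_g$, as suggested by the definitions of $\mathcal{M}_{IR}$ and $\mathcal{M}_{UV}$. The integrals defining $\delta$ and $\epsilon$ below are our own shorthand: $\int$ denotes the phase-space and parton-luminosity integration above $p_T^{\rm cut}$, including the sums over helicities and colors.

```latex
\begin{align}
|\mathcal{M}(c_t,\kappa_g)|^2 &= c_t^2\,|\mathcal{M}_{IR}|^2
  + 2\,c_t\kappa_g\,\mathrm{Re}\!\left(\mathcal{M}_{IR}\mathcal{M}_{UV}^*\right)
  + \kappa_g^2\,|\mathcal{M}_{UV}|^2 , \\
\frac{\sigma(p_T > p_T^{\rm cut})}{\sigma_{SM}(p_T > p_T^{\rm cut})}
  &= (c_t+\kappa_g)^2 + \delta\,c_t\kappa_g + \epsilon\,\kappa_g^2 , \\
\delta &= \frac{2\int \mathrm{Re}\!\left(\mathcal{M}_{IR}\mathcal{M}_{UV}^*\right)}
               {\int |\mathcal{M}_{IR}|^2} - 2 ,
\qquad
\epsilon = \frac{\int |\mathcal{M}_{UV}|^2}{\int |\mathcal{M}_{IR}|^2} - 1 .
\end{align}
```

Along the line $c_t + \kappa_g = 1$ the ratio reduces to $1 + \delta\,c_t\kappa_g + \epsilon\,\kappa_g^2$, so any deviation from unity is controlled entirely by $\delta$ and $\epsilon$, both of which vanish when $\mathcal{M}_{IR} \to \mathcal{M}_{UV}$.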

A. Signal sample
In this paper we consider $H$+jet events with subsequent $H$ decays to the $WW^* \to \ell^+\ell^-\nu\bar\nu$ and $\tau^+\tau^-$ modes as the signal. The signal events are generated with MadGraph5, version 1.5.15 [119], and showered with HERWIG++ [120][121][122], where only the $WW^*$ and $\tau^+\tau^-$ decays are specified.
We have used MadGraph5 to generate $H$+jet events using the 'HEFT' model with SM couplings, which makes use of the low energy theorem. The generated cross-section is proportional to $|\mathcal{M}(0,1)|^2$ and does not take into account finite top mass effects, which are crucial to our analysis. To obtain the correct event weights we reweighted the events using our own code, which is based on an implementation of the formulas for the matrix elements given in [115] and also calculated in [123]. At present no finite top mass NLO computation of the SM Higgs $p_T$ spectrum is available. An exact NLO prediction of the SM Higgs $p_T$ spectrum would be very desirable and would help to exploit the full potential of this observable. Recent progress in the precision prediction of $H$+jet can be found in Refs. [124][125][126]. We approximate the NNLO (+NNLL) result of 49.85 pb [127][128][129][130] by multiplying the exact LO result by a K-factor of 1.71. We reweight the events for points along the line $c_t + \kappa_g = 1$ for $\kappa_g \in [-0.5, 0.5]$ in steps of 0.1, as shown in the left panel of Fig. 1. This choice is consistent with the SM inclusive Higgs production cross-section. The size of $c_t$ alone is only weakly constrained by the current $t\bar t H$ measurement. Although we only consider the most difficult points satisfying $c_t + \kappa_g = 1$ (i.e. an exactly SM-like inclusive cross-section), an analysis along different $c_t + \kappa_g = \mathrm{const.}$ lines would be straightforward, as a different choice essentially just corresponds to an overall rescaling of the signal.

The right panel of Fig. 1 shows the $p_{T,H}$ distributions for several model points. In the low-$p_{T,H}$ region the distributions are degenerate, but at high $p_{T,H}$ they start to split. For the model points with $\kappa_g > 0$ we see an enhancement in the high-$p_{T,H}$ region, while we see a suppression for the model points with $\kappa_g < 0$.
Table I shows the Higgs production cross-sections relative to the SM value for several model points $(c_t, \kappa_g)$ and $p_{T,H}$ cuts. As one can see, for $p_{T,H} > 10$ GeV the cross-sections agree with the SM value to within 3%, while for increasing $p_T^{\rm cut}$ significant differences from the SM predictions can be observed. For the model point $(c_t, \kappa_g) = (0.7, 0.3)$, for example, a 6% difference would be observed for $\sigma(p_{T,H} > 200~\mathrm{GeV})$, and a $\sim 20\%$ difference for $\sigma(p_{T,H} > 300~\mathrm{GeV})$. We will see that these effects are comparable to the sensitivity of the boosted Higgs shape measurements; see Section IV. For very hard cuts, $O(1)$ differences can be observed, as can be seen from the cross-section ratios for $p_{T,H} > 500$ GeV and harder.
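The reweighting along the line $c_t + \kappa_g = 1$ can be sketched as follows. This is a minimal illustration and not the code used for this paper: it assumes the amplitude combines as $c_t\mathcal{M}_{IR} + \kappa_g\mathcal{M}_{UV}$, and the function names as well as the per-event amplitude lists `m_ir` and `m_uv` (one complex entry per helicity and color configuration) are hypothetical inputs that would have to come from an implementation of the formulas in [115].

```python
def scan_points(kappa_min=-0.5, kappa_max=0.5, step=0.1):
    """Model points (c_t, kappa_g) along the line c_t + kappa_g = 1."""
    n = int(round((kappa_max - kappa_min) / step))
    points = []
    for i in range(n + 1):
        kappa_g = round(kappa_min + i * step, 10)
        points.append((round(1.0 - kappa_g, 10), kappa_g))
    return points


def event_weight(m_ir, m_uv, c_t, kappa_g):
    """Weight for an event generated in the heavy-top limit, i.e. with |M(0, 1)|^2.

    m_ir: amplitudes with full top-mass dependence (hypothetical input)
    m_uv: amplitudes in the heavy-top limit (hypothetical input)
    Both are sequences with one complex entry per helicity/color configuration.
    """
    target = sum(abs(c_t * a + kappa_g * b) ** 2 for a, b in zip(m_ir, m_uv))
    generated = sum(abs(b) ** 2 for b in m_uv)
    return target / generated
```

In the limit where the two sets of amplitudes coincide, the weight reduces to $(c_t + \kappa_g)^2$, which equals one for every point returned by `scan_points`; this is the degeneracy of the inclusive rate.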

B. Background sample
We include $WW$+jets, $Z$+jets and $t\bar t$+jets as background processes, which we have generated with ALPGEN + PYTHIA [131,132]. Since we consider boosted Higgs reconstruction and will require one hard recoil jet, we apply a pre-selection cut in the generation step, demanding at least one recoil parton with $p_T > 150$ GeV. We merge up to two partons for $WW$+jets and $Z$+jets, and up to one parton for $t\bar t$+jets, using the MLM matching scheme [133,134]. As we only consider the dilepton mode in this paper, we preselect the leptonic $W$ decay modes ($e$, $\mu$ and $\tau$), including for $W$ bosons from top decays. For the $Z$ decay, we consider only $Z \to \tau^+\tau^-$, since for the other leptonic decay modes the $Z$ peak can be reconstructed and the events rejected. We rescale the $t\bar t$ sample to obtain a NLO inclusive cross section of 918 pb [135][136][137]. For the $Z$+jets and $WW$+jets samples we used LO cross-sections.
Our analysis is performed at particle level with a simple detector simulation with a granularity of $\Delta\eta \times \Delta\phi = 0.1 \times 0.1$. After removing the isolated leptons, the energies of the remaining visible particles falling into each cell are summed. Cells with transverse energy above 0.5 GeV are used for the subsequent jet reconstruction.
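The cell-based detector simulation can be illustrated with the following sketch. It is not the code used for the analysis. The function name and the input format are hypothetical, particles are treated as massless so that transverse energies are summed directly, and the $\phi$ cell width is adjusted slightly below 0.1 so that an integer number of cells tiles the full circle; both simplifications are our assumptions.

```python
import math
from collections import defaultdict

CELL = 0.1     # nominal cell size in eta and phi
ET_MIN = 0.5   # GeV, minimum transverse energy for a cell to be kept


def calo_cells(particles, cell=CELL, et_min=ET_MIN):
    """Sum transverse energies on an eta-phi grid.

    particles: iterable of (et, eta, phi) for visible particles with the
               isolated leptons already removed (et in GeV, phi in radians).
    Returns a list of (et, eta_center, phi_center) for cells above et_min.
    """
    n_phi = int(round(2.0 * math.pi / cell))   # number of phi cells on the circle
    dphi = 2.0 * math.pi / n_phi               # actual phi cell width, close to 0.1
    grid = defaultdict(float)
    for et, eta, phi in particles:
        i_eta = math.floor(eta / cell)
        i_phi = int((phi % (2.0 * math.pi)) / dphi) % n_phi
        grid[(i_eta, i_phi)] += et
    cells = []
    for (i_eta, i_phi), et in sorted(grid.items()):
        if et > et_min:
            cells.append((et, (i_eta + 0.5) * cell, (i_phi + 0.5) * dphi))
    return cells
```

Two soft particles that are individually below threshold can thus still produce a cell if they fall into the same grid element, while an isolated soft particle is dropped.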
Jet clustering was performed using FastJet [138], version 3.0.4. We use the Cambridge-Aachen (C/A) algorithm [139,140] with $R = 0.5$ for the normal jet and b-tag jet definitions. We also define 'fat' jets, as explained later, using the C/A algorithm with $R = 1.5$.
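For readers unfamiliar with the C/A algorithm, the following naive sketch illustrates its logic for inclusive clustering. It is not FastJet and is far slower; it is meant only to show how the distance parameter $R$ controls the jet size. The function names are ours, recombination uses simple four-vector addition, and inputs exactly along the beam axis are assumed not to occur.

```python
import math


def _rapidity_phi(p):
    px, py, pz, e = p
    return 0.5 * math.log((e + pz) / (e - pz)), math.atan2(py, px)


def _delta_r2(p, q):
    y1, phi1 = _rapidity_phi(p)
    y2, phi2 = _rapidity_phi(q)
    dphi = abs(phi1 - phi2)
    if dphi > math.pi:
        dphi = 2.0 * math.pi - dphi
    return (y1 - y2) ** 2 + dphi ** 2


def cambridge_aachen(particles, radius=0.5):
    """Inclusive C/A clustering of (px, py, pz, E) four-vectors.

    Repeatedly merges the geometrically closest pair while its separation
    is below `radius`; returns the jets ordered by decreasing pT.
    """
    objs = [tuple(p) for p in particles]
    jets = []
    while objs:
        best = None
        for i in range(len(objs)):
            for j in range(i + 1, len(objs)):
                d = _delta_r2(objs[i], objs[j])
                if best is None or d < best[0]:
                    best = (d, i, j)
        if best is not None and best[0] < radius * radius:
            _, i, j = best
            merged = tuple(a + b for a, b in zip(objs[i], objs[j]))
            objs = [o for k, o in enumerate(objs) if k not in (i, j)] + [merged]
        else:
            # No pair is closer than the radius: all remaining objects are jets
            jets.extend(objs)
            objs = []
    return sorted(jets, key=lambda j: math.hypot(j[0], j[1]), reverse=True)
```

Two deposits separated by $\Delta R = 1$ are returned as two jets for `radius=0.5` but as a single fat jet for `radius=1.5`.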
In this paper, we only consider the events with isolated leptons for simplicity. There is room for improving the analysis with hadronic tau modes with tau tagging for example [141,142], which is, however, beyond the scope of our current study.
In our notation a subscript $\ell$ denotes a leptonic decay: $\tau_\ell$ thus represents $\tau \to \ell + 2\nu$, $W_\ell$ is mostly $W \to \ell\nu$ with some $W \to \tau_\ell\nu_\tau$, and $t_\ell$ is $t \to bW_\ell$. The decay $H \to 2\ell + \mathbf{p}\!\!/_T$ proceeds mostly through $H \to WW^*$ and $H \to \tau\tau$. As noted in [143,144], in the decay $H \to WW^* \to 2\ell + 2\nu$ spin correlations ensure that the two lepton momenta have similar directions, as do the two neutrino momenta. In $H \to \tau\tau$, however, the two $\tau$ leptons are back-to-back in the Higgs rest frame, and each of them gives rise to a highly collimated $\ell + 2\nu$ trio. These two facts imply that for a boosted $H \to 2\ell + \mathbf{p}\!\!/_T$ decay, the $\mathbf{p}\!\!/_T$ is typically outside the lepton pair for the $H \to WW^*$ contribution and inside the lepton pair for $H \to \tau\tau$, as shown in Table II. We use this binary criterion ($\mathbf{p}\!\!/_T$ inside or outside the leptons) to split our analysis into two sub-analyses, which differ in their background compositions as well as signals.

A. Common Cuts for $H \to \tau\tau$ and $H \to WW^*$

In both of our sub-analyses the cuts begin by requiring the following:

• Two opposite-sign isolated leptons each having $p_T > 10$ GeV and $|\eta| < 2.5$. If a third isolated lepton with $p_T > 5$ GeV and $|\eta| < 2.5$ is present, the event is vetoed. Our isolation criterion is based on $E_{T,\rm had}$, the sum of transverse energies over all hadronic activity in the cone $\Delta R < 0.2$ around the lepton. (The signal leptons are typically hard, so our $p_T$ threshold could be raised with minimal loss of efficiency.)

• A dilepton mass $m_{\ell\ell}$ exceeding 20 GeV, which is necessary in practice to suppress Drell-Yan dilepton production (not simulated here).
• At least 200 GeV of transverse momentum for the system obtained by vectorially summing the dilepton and missing transverse momenta: $p^{\rm rec}_{T,H} \equiv |\mathbf{p}_{T,\ell\ell} + \mathbf{p}\!\!/_T| > 200$ GeV. The system thus defined has a transverse (but not longitudinal) momentum coinciding with that of the Higgs in the case of the signal; this is how we restrict our focus to highly energetic/boosted Higgs bosons.
• One 'fat' jet, resulting from clustering using the C/A algorithm with a distance parameter $R_{\rm jet} = 1.5$. This jet should be very hard. The presence of a very hard jet coincides with our parton-level picture of the signal process: a boosted Higgs recoiling against a gluon/quark. Defining geometrically large 'fat' jets allows us to capture the radiation emitted by this gluon/quark (which might otherwise be clustered into a separate jet when clustering with traditional 'skinny' jets). We veto the event if there is a second fat jet with $p_{T,j} > 100$ GeV. Vetoing on additional hadronic activity beyond the first hard fat jet suppresses higher-multiplicity backgrounds, i.e. $t\bar t$+jets. Not vetoing additional fat jets approximately doubles the $t\bar t$ background, while the signal increases by roughly 30%. These numbers hold even if regular jets with a cone size of $R = 0.4$ are vetoed instead. When vetoing jets, large logarithms $\sim \ln^2(\sqrt{\hat s}/p_{T,\rm veto})$ can be induced, which need to be resummed [145,146]. However, due to the high veto scale we do not expect these contributions to spoil the reliability of our analysis. As an alternative to jet vetoes, 2-jet observables can be used to disentangle signal from background in this process [147,148].
• Zero b-tags. This considerably reduces the (until now dominant) $t\bar t$+jets background while having a negligible effect on the signal. We re-cluster the hadronic activity into jets, again using the C/A algorithm but now with $R_{\rm jet} = 0.5$, to use for the b-tagging. We assume a flat 70% (1%) tagging efficiency for b (light quark or gluon) initiated jets, i.e. a 30% (99%) probability for such a jet not to provoke the veto. We only consider b-jets with $p_{T,b} > 30$ GeV and $|\eta_b| < 2.5$.
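The effect of the flat tagging efficiencies on the veto can be made concrete with a one-line estimate. The function below is an illustrative sketch with hypothetical names; it assumes that the jets are tagged independently and that `n_b` and `n_light` count only jets within the $p_T$ and $\eta$ acceptance.

```python
def b_veto_survival(n_b, n_light, eff_b=0.70, eff_light=0.01):
    """Probability that no jet is b-tagged, for flat per-jet tagging efficiencies."""
    return (1.0 - eff_b) ** n_b * (1.0 - eff_light) ** n_light
```

An event with two taggable b-jets then survives the veto with probability $0.3^2 = 0.09$, while an event with a single light jet survives with probability 0.99.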
The efficiencies of these cuts for the signal and the various backgrounds are shown in the first part of Table III. At this stage the backgrounds from $WW$/$Z$/$t\bar t$+jets are seen to contribute at similar levels. The set of cuts described so far is common to both our $H \to \tau\tau$ and $H \to WW^*$ analyses; from this point onwards they diverge.
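The kinematic part of the common selection (the dilepton mass and the reconstructed Higgs transverse momentum) can be sketched as follows. The function names and the tuple conventions are ours, and the lepton isolation, third-lepton veto, fat-jet and b-tag requirements are deliberately left out.

```python
import math


def dilepton_mass(l1, l2):
    """Invariant mass of two leptons given as (px, py, pz, E) in GeV."""
    px, py, pz, e = (a + b for a, b in zip(l1, l2))
    m2 = e * e - px * px - py * py - pz * pz
    return math.sqrt(max(m2, 0.0))


def higgs_pt_rec(l1, l2, met):
    """|pT(ll) + missing pT| in GeV, with met = (px, py)."""
    return math.hypot(l1[0] + l2[0] + met[0], l1[1] + l2[1] + met[1])


def passes_common_kinematics(l1, l2, met, m_ll_min=20.0, pt_h_min=200.0):
    """Dilepton mass above 20 GeV and reconstructed Higgs pT above 200 GeV."""
    return dilepton_mass(l1, l2) > m_ll_min and higgs_pt_rec(l1, l2, met) > pt_h_min
```

Only transverse quantities enter `higgs_pt_rec`, reflecting the fact that the longitudinal component of the missing momentum is not measured.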

B. $H \to \tau\tau$ analysis
The Higgs mass in the decay $H \to \tau\tau$ can be reconstructed using the collinear approximation [123]. The large hierarchy between the Higgs and tau masses ensures a very large boost for the taus, highly collimating their visible and invisible decay products. We can approximate the neutrino momenta by a decomposition of the missing transverse momentum, which assumes that each invisible momentum is parallel to the corresponding visible momentum. (This procedure can be extended to decays of more than one particle; see [149].) As was noted in [123], and further explored in [150], this procedure gains sensitivity with increasing transverse momentum of the Higgs, i.e. when the Higgs recoils against a hard jet. It suffers for a low-$p_T$ Higgs because the two $\tau$ daughters are then nearly back-to-back, providing a poor basis for the $\mathbf{p}\!\!/_T$ decomposition. For our high-$p_T$ Higgs study the mass reconstruction of the signal in this manner is very good and provides a sharp peak.
In more detail, the Higgs mass in $H \to \tau\tau$ is reconstructed via the collinear approximation as follows. We require the missing transverse momentum $\mathbf{p}\!\!/_T$ to be inside the two leptons (more precisely, projecting the two lepton momenta into the transverse plane defines two segments; 'inside' the leptons means inside the smaller segment). We decompose $\mathbf{p}\!\!/_T$ as a linear combination of the two lepton momenta, which also defines a longitudinal component for it. The requirement that $\mathbf{p}\!\!/_T$ be inside the leptons is equivalent to demanding that the two decomposition coefficients are both positive. The two terms of the decomposition, $\mathbf{p}_{\nu_1,\rm col}$ and $\mathbf{p}_{\nu_2,\rm col}$, approximate the neutrino three-momenta. Promoting them to massless four-momenta and adding them to the lepton four-momenta gives an approximate Higgs four-momentum, the mass of which we refer to as the collinear Higgs mass, $M_{\rm col}$.

We apply one more cut before making use of the collinear mass variable: an upper limit on the dilepton mass, $m_{\ell\ell} < 70$ GeV. This cut reduces the $t\bar t$+jets and $WW$+jets backgrounds very efficiently while retaining most of the $H$+jets signal and the $Z \to \tau\tau$ background (see Fig. 2, left panel). At this stage $Z \to \tau\tau$ becomes the dominant background for extracting the $H \to \tau\tau$ signal. The size of the $t\bar t$ and $WW$ backgrounds can be estimated in a data-driven way by removing the $m_{\ell\ell} < 70$ GeV cut. We discuss this in detail in Appendix A.
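The collinear mass reconstruction can be sketched in a few lines. This is an illustration under stated assumptions rather than the analysis code: the function name and the coefficient names `a1`, `a2` are ours, and each neutrino system is taken to be massless and exactly parallel to its lepton.

```python
import math


def collinear_mass(l1, l2, met):
    """Collinear-approximation mass for two leptons and missing pT.

    l1, l2: lepton four-momenta (px, py, pz, E) in GeV
    met:    missing transverse momentum (px, py) in GeV
    Returns (mass, a1, a2), or None if the decomposition is not unique or the
    missing pT is not inside the two leptons (a coefficient is not positive).
    """
    det = l1[0] * l2[1] - l1[1] * l2[0]
    if abs(det) < 1e-12:
        return None  # leptons parallel or back-to-back in the transverse plane
    # Solve met = a1 * pT(l1) + a2 * pT(l2) with Cramer's rule
    a1 = (met[0] * l2[1] - met[1] * l2[0]) / det
    a2 = (l1[0] * met[1] - l1[1] * met[0]) / det
    if a1 <= 0.0 or a2 <= 0.0:
        return None
    total = [0.0, 0.0, 0.0, 0.0]
    for lep, a in ((l1, a1), (l2, a2)):
        p_abs = math.sqrt(lep[0] ** 2 + lep[1] ** 2 + lep[2] ** 2)
        # Massless neutrino system collinear with the lepton three-momentum
        nu = (a * lep[0], a * lep[1], a * lep[2], a * p_abs)
        for k in range(4):
            total[k] += lep[k] + nu[k]
    m2 = total[3] ** 2 - total[0] ** 2 - total[1] ** 2 - total[2] ** 2
    return math.sqrt(max(m2, 0.0)), a1, a2
```

The determinant check makes explicit why the method degrades for a low-$p_T$ Higgs: when the leptons are nearly back-to-back in the transverse plane the linear system becomes ill-conditioned.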
The collinear mass is shown in the central panel of Fig. 2. Note that any particle decaying to $\tau\tau$ with enough boost that the two $\tau$ are not back-to-back will have its mass reconstructed by this procedure; indeed the most striking feature of the collinear mass distribution is the $Z$ mass peak from the large irreducible $Z \to \tau\tau$ background. A peak due to the signal is visible at $M_{\rm col} \sim m_H = 125$ GeV. By selecting events in the window $|M_{\rm col} - m_H| < 10$ GeV we achieve $S/B \sim 0.4$ with $S/\sqrt{B} > 9$ for 300 fb$^{-1}$. The signal is taken to include the $H \to WW^*$ contribution, which makes up about 10% of the $H \to \tau\tau$ selection. We estimate the statistical error of the high-$p_T$ cross-section measurement as $\sqrt{S+B}/S$. We obtain uncertainties of 12% for $\sigma(p_{T,H} > 200~\mathrm{GeV})$, 22% for $\sigma(p_{T,H} > 300~\mathrm{GeV})$, and 41% for $\sigma(p_{T,H} > 400~\mathrm{GeV})$, respectively. Assuming we can achieve the same efficiencies for the high-luminosity run of the LHC (HL-LHC) at 3 ab$^{-1}$, we obtain $\sim 4\%$ for $\sigma(p_{T,H} > 200~\mathrm{GeV})$, $\sim 7\%$ for $\sigma(p_{T,H} > 300~\mathrm{GeV})$, and $\sim 13\%$ for $\sigma(p_{T,H} > 400~\mathrm{GeV})$.
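The extrapolation from 300 fb$^{-1}$ to 3 ab$^{-1}$ follows from the fact that, at fixed efficiencies, both $S$ and $B$ grow linearly with luminosity, so that $\sqrt{S+B}/S$ scales as $1/\sqrt{L}$. The short sketch below, with function names of our choosing, illustrates this scaling for a purely statistical uncertainty.

```python
import math


def relative_uncertainty(s, b):
    """Statistical uncertainty sqrt(S + B) / S on the signal rate."""
    return math.sqrt(s + b) / s


def scale_with_luminosity(rel_unc, lumi_old, lumi_new):
    """Rescale a purely statistical relative uncertainty at fixed efficiencies."""
    return rel_unc * math.sqrt(lumi_old / lumi_new)
```

Applied to the 12%, 22% and 41% uncertainties at 300 fb$^{-1}$, a tenfold increase in luminosity gives about 3.8%, 7.0% and 13.0%, in line with the HL-LHC values quoted in the text.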
As seen in the central panel of Fig. 2, the smooth side-band distribution can be used for estimating the background contribution. We show in Appendix A that these side-bands are available even after hard $p^{\rm rec}_{T,H}$ cuts. We therefore expect that a data-driven strategy for background estimation will be available, and take the statistical errors as a background uncertainty estimate. There will of course be further systematic uncertainties induced by MC background modeling.
In this analysis we mostly use the recoiling fat jet to remove the $t\bar t$+jets background. It could be beneficial to make use of the difference between the jet substructure of gluon and quark jets [152][153][154][155][156], since the dominant background at the last stage is $Z$+jets, which gives a different fraction of gluon and quark jets than the $H$+jets signal. We leave this for future work.

C. $H \to WW^*$ analysis
Our selection criteria for extracting $H \to WW^*$ from the background begin with those described in Section IV A. In Section IV B we required that the $\mathbf{p}\!\!/_T$ vector be inside the two lepton momenta, after which the signal was dominated by $H \to \tau\tau$ and the background by $Z \to \tau\tau$+jets. Here we remove most of the contribution of these processes by requiring that $\mathbf{p}\!\!/_T$ be outside the two lepton momenta. This is equivalent to demanding that the $m_{T2}$ variable [157] be greater than zero, as $m_{T2} = 0$ when this is not satisfied (the 'trivial zero' [158]). In fact we go further and impose

$$m_{T2} > 10~\mathrm{GeV}. \qquad (16)$$

This rejects essentially all of the contributions from $H \to \tau\tau$ and $Z \to \tau\tau$+jets, both of which have an $m_{T2}$ end point close to $m_\tau$. To allow for endpoint smearing we cut somewhat harder, at 10 GeV instead of $m_\tau$.
We are now left with $H \to WW^*$ as our signal process, competing with the $WW$/$t\bar t$+jets backgrounds. Their kinematics unfortunately allow for little discriminating power: all of them contain two leptonic $W$ bosons, with no possibility of mass reconstruction. Luckily, the transverse mass provides some discrimination. As shown in [159], the transverse mass variable satisfying $m_{T,\ell\ell} \le m_H$ that gives the greatest lower bound on the Higgs mass in its decay to $WW^*$ is

$$m_{T,\ell\ell}^2 = \left(E_{T,\ell\ell} + E\!\!\!/_T\right)^2 - \left|\mathbf{p}_{T,\ell\ell} + \mathbf{p}\!\!/_T\right|^2, \qquad (17)$$

where $E_{T,\ell\ell} = (m_{\ell\ell}^2 + p_{T,\ell\ell}^2)^{1/2}$ is the transverse energy of the dilepton system, and $E\!\!\!/_T = |\mathbf{p}\!\!/_T|$ is the missing transverse energy. We adopt this definition of $m_{T,\ell\ell}$, also used by the ATLAS Collaboration [160]. The end point at $m_H$ for the transverse mass of the signal is shown in the left panel of Fig. 3, where all the selection cuts up to step 6 in Table IV have been applied. We therefore impose

$$m_{T,\ell\ell} < m_H = 125~\mathrm{GeV}. \qquad (18)$$

Finally, backgrounds are further suppressed by requiring that the leptons have similar directions, which is typically the case for the signal due to the aforementioned spin correlations.
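The dilepton transverse mass can be computed directly from the lepton four-momenta and the missing transverse momentum. The sketch below assumes the form $m_{T,\ell\ell}^2 = (E_{T,\ell\ell} + E\!\!\!/_T)^2 - |\mathbf{p}_{T,\ell\ell} + \mathbf{p}\!\!/_T|^2$, built from the definitions of $E_{T,\ell\ell}$ and $E\!\!\!/_T$ given in the text; the function name and tuple conventions are ours.

```python
import math


def transverse_mass_ll(l1, l2, met):
    """Dilepton transverse mass in GeV.

    l1, l2: lepton four-momenta (px, py, pz, E); met: missing pT as (px, py).
    """
    px, py, pz, e = (a + b for a, b in zip(l1, l2))
    m_ll2 = max(e * e - px * px - py * py - pz * pz, 0.0)
    et_ll = math.sqrt(m_ll2 + px * px + py * py)   # E_T of the dilepton system
    et_miss = math.hypot(met[0], met[1])           # missing transverse energy
    mt2 = (et_ll + et_miss) ** 2 - (px + met[0]) ** 2 - (py + met[1]) ** 2
    return math.sqrt(max(mt2, 0.0))
```

Expanding the square shows that this is the same as $m_{\ell\ell}^2 + 2\,(E_{T,\ell\ell}E\!\!\!/_T - \mathbf{p}_{T,\ell\ell}\cdot\mathbf{p}\!\!/_T)$, i.e. the invisible system is treated as massless.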
The efficiencies of the cuts aimed at $H \to WW^*$ are shown in Table IV, together with the last common cut (the b veto). We finally find $S/B \sim 0.4$, with $S/\sqrt{B} > 6$ for 300 fb$^{-1}$. The table also shows the event numbers left after increased $p_T$ cuts on the reconstructed Higgs. The resulting reconstructed Higgs $p^{\rm rec}_{T,H}$ distributions are shown in Fig. 3 (right panel), with the signal and background processes stacked. As $p^{\rm rec}_{T,H}$ increases, the signal-to-background ratio drops faster for the $WW$ mode selection than for the $\tau\tau$ selection.

V. DISCUSSION
In this section, we discuss how much of the difference in the $p_{T,H}$ distributions due to the modified couplings can be observed after the realistic reconstruction of the previous section has been performed. The left panel of Fig. 4 shows the signal $M_{\rm col}$ distributions for the model points after applying the analysis described in Sec. IV B up to cut 7. We see the peak in the observable for all points. The central and right panels show the signal $p^{\rm rec}_{T,H}$ distributions after the reconstruction described in Sec. IV for the $H \to \tau\tau$ and $H \to WW$ optimizations, respectively. As expected, the difference in shape seen in the parton-level result of Fig. 1 also manifests itself in the reconstructed $p^{\rm rec}_{T,H}$ distributions. A detailed breakdown after successive selection cuts is shown in Table V for the $H \to \tau\tau$ optimization and in Table VI for the $H \to WW$ optimization, quoting cross-sections relative to the corresponding SM value. Compared with the parton-level numbers in Table I, the dependence on the model point is more enhanced at the reconstructed level. This is because most of the selection cuts are more efficient for the boosted Higgs event topology.

We will now estimate how much integrated luminosity is needed to reach a given significance for the signal. We perform a binned likelihood analysis of signal and background using the CL$_s$ method, as described in [162]. We include systematic errors on the cross-section normalization assuming a Gaussian probability distribution. The analysis tests the expected signal-plus-background against a background-only hypothesis. Three different systematic errors on the cross-section normalization, 0, 5 and 10%, are assumed. While achieving theoretical uncertainties of less than 10% is challenging, in the separation of signal and background we rely predominantly on the lepton momenta, which can be measured very precisely. As one can see from the left panel of Fig. 5, with $L = 20$-$60$ fb$^{-1}$ we are able to see the SM signal at 95% confidence level, depending on the assumed systematic uncertainty.
For $\kappa_g > 0$, the signal is enhanced and the required integrated luminosity decreases: for $\kappa_g = 0.5$ it would be $L = 15$-$30$ fb$^{-1}$ to observe the signal at 95% CL, as shown in the central panel.
The right panel of Fig. 5 shows the p-values for $\kappa_g = 0.5$ using the $H \to WW$ mode. The sensitivity is slightly reduced compared to the $\tau\tau$ mode. However, it is still possible to exploit the $WW$ final state to observe a boosted Higgs boson.

We also perform a binned likelihood analysis to estimate how well we can distinguish these model points from the SM given the presence of backgrounds. The left panel of Fig. 6 shows the expected p-values for the model-point signal plus background, tested against the SM signal plus background hypothesis, as a function of the integrated luminosity $L$ for the model point $\kappa_g = 0.5$ using the $H \to \tau\tau$ analysis. Again, systematic errors of 0, 5 and 10% are assumed. We find that we are able to distinguish the model point $\kappa_g = 0.5$ from the SM with $L = 1000$ fb$^{-1}$, even assuming a 10% systematic uncertainty.
It is more difficult to establish a deviation from the SM for model points with $\kappa_g < 0$ than for points with $\kappa_g > 0$ and the same $|\kappa_g|$, since the former give a deficit rather than a surplus of signal events. The central panel of Fig. 6 shows the p-values for $\kappa_g = -0.5$ using the $H \to \tau\tau$ analysis. As expected we have less sensitivity, and even smaller values of $|\kappa_g|$ require larger integrated luminosities.
The right panel of Fig. 6 shows the p-values as a function of $\kappa_g$ using the $H \to \tau\tau$ analysis for an integrated luminosity of 3000 fb$^{-1}$. If we assume 0% systematic uncertainty we can exclude $\kappa_g < -0.29$ and $\kappa_g > 0.24$ for $L = 3000$ fb$^{-1}$ at 95% CL. For the same integrated luminosity, assuming 10% systematic uncertainty, we can still exclude $\kappa_g < -0.4$ and $\kappa_g > 0.3$ at 95% CL.
We have not combined the τ τ and W W analyses although it could improve our sensitivity by some amount. Combining both channels is a complex task since the systematic uncertainties of both channels have to be evaluated by the experimental collaborations. Furthermore, it is not easy to avoid double-counting of events when combining both decay modes, as the final state reconstructions discussed in Sec. IV are not able to strictly separate them (see Table III).
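The qualitative impact of a normalization systematic on the required luminosity can be illustrated with a much simpler estimate than the binned CL$_s$ analysis used for our results. The sketch below is a single-bin Gaussian counting approximation, not the procedure of [162]. It assumes that the systematic uncertainty applies as a fraction `sys_frac` of the background normalization, and the yields per fb$^{-1}$ are hypothetical inputs; it should not be expected to reproduce the luminosities quoted above.

```python
import math


def approx_significance(s, b, sys_frac=0.0):
    """Gaussian counting significance S / sqrt(B + (sys_frac * B)^2)."""
    return s / math.sqrt(b + (sys_frac * b) ** 2)


def luminosity_for(z_target, s_per_fb, b_per_fb, sys_frac=0.0):
    """Luminosity in fb^-1 at which approx_significance reaches z_target.

    Solves Z^2 (b L + f^2 b^2 L^2) = s^2 L^2 for L. Returns None if the
    target cannot be reached because the systematic term dominates.
    """
    denom = s_per_fb ** 2 - (z_target * sys_frac * b_per_fb) ** 2
    if denom <= 0.0:
        return None
    return z_target ** 2 * b_per_fb / denom
```

In this approximation the significance saturates at $(S/B)/f$ for a fractional systematic $f$, which is why a larger assumed systematic uncertainty increases the required luminosity and can make a given target unreachable.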

VI. CONCLUSIONS
The dominant production mode of the Higgs boson at the LHC, gluon fusion, is an important probe of new physics. Even though the inclusive rate has been measured to be in agreement with the SM, the study of a Higgs boosted by recoil against a hard jet constitutes an interesting, albeit challenging, measurement. It is motivated in the context of supersymmetry and composite Higgs models, and indeed generically in natural new physics: the Higgs coupling to a top-quark loop is both central to the question of natural electroweak symmetry breaking, and the chief source of gluon fusion. Due to the low energy theorem, however, the details of this loop-induced process are entirely obscured unless one can access the boosted Higgs regime.
We have shown how a boosted Higgs signal can be isolated in the dilepton channel via $H \to \tau\tau$ and $H \to WW$. The boost enhances the efficiency of the collinear approximation for mass reconstruction in the $H \to \tau\tau$ mode, giving a peak at $m_H$ visible above the dominant $Z$+jets background. $Z$+jets provides its own peak in this reconstructed mass distribution; using the sidebands around the $m_H$ peak we expect a relatively precise background estimate. In the end we achieve $S/B \sim 0.4$. For the $H \to WW$ mode, we can also achieve $S/B \sim 0.4$ but with fewer events. This is nevertheless a helpful addition to the statistical significance. We expect a 12% error for the cross-section measurement for $p_T > 200$ GeV, 22% for $p_T > 300$ GeV, and 41% for $p_T > 400$ GeV with an integrated luminosity of 300 fb$^{-1}$.
A direct measurement of the top Yukawa coupling in the $t\bar t H$ channel is also instrumental for breaking the degeneracy between the coupling of the Higgs to gluons and to the top quark, and the $H$+jets mode provides a complementary determination. We have shown that we can distinguish several new physics models in an effective field theory approach using the reconstructed Higgs $p_T$ distribution. With an integrated luminosity of 3000 fb$^{-1}$ at the 14 TeV LHC, we can exclude $\kappa_g < -0.4$ and $\kappa_g > 0.3$ along the line $c_t + \kappa_g = 1$ at 95% confidence level, assuming a systematic uncertainty of 10%.

VII. ACKNOWLEDGMENTS
We thank Christophe Grojean and Ennio Salvioni for many helpful discussions. CW gratefully acknowledges additional funding from the IPPP beyond his studentship, and from the French ANR, Project DMAstroLHC, ANR-12-BS05-0006. MS acknowledges the hospitality at the CERN TH division and the funding of his work by the Joachim Herz Stiftung. The work of AW was supported in part by the German Science Foundation (DFG) under the Collaborative Research Center (SFB) 676. This work was supported in part by the STFC.

APPENDIX A

We collect distributions of the collinear mass $M_{\rm col}$ for several minimal values of the reconstructed Higgs $p_T$ and discuss how a data-driven background estimate could be performed. In Fig. 7, we show distributions of $M_{\rm col}$ for $p^{\rm rec}_{T,H} > 200$ GeV, 300 GeV, and 400 GeV. The upper three plots include the selection cuts up to step 7 in Table III, while the lower three plots are up to cut 6 (i.e. without the $m_{\ell\ell}$ cut). The red lines show the fitted curves for the background distributions. We take the fitting function to be the sum of a Breit-Wigner function and a log-normal function. As one can see, the $Z$ peak and the tail distributions are well fitted for a wide $p^{\rm rec}_{T,H}$ range. This means we can estimate the contributions of the background processes using side bands, which reduces the sensitivity to theoretical uncertainties.
Moreover, the lower plots without the $m_{\ell\ell}$ cut have larger $t\bar t$ and $WW$ contributions but are still well fitted with the same fitting function. Thus, we can extract the normalizations of the $t\bar t$ and $WW$ contributions and control part of the Monte Carlo uncertainties using data. We therefore only consider the statistical uncertainty of the total background contributions in the signal region in the main text.
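A background shape of the kind described above can be written down as follows. This is only a sketch of the functional form: the text does not specify the parametrization, so the choice of a non-relativistic Breit-Wigner (Lorentzian), the unit-area normalizations, and all parameter names are our assumptions.

```python
import math


def breit_wigner(m, m0, gamma):
    """Non-relativistic Breit-Wigner (Lorentzian) normalized to unit area."""
    return (gamma / (2.0 * math.pi)) / ((m - m0) ** 2 + (gamma / 2.0) ** 2)


def log_normal(m, mu, sigma):
    """Log-normal density in m, zero for m <= 0."""
    if m <= 0.0:
        return 0.0
    z = (math.log(m) - mu) / sigma
    return math.exp(-0.5 * z * z) / (m * sigma * math.sqrt(2.0 * math.pi))


def background_model(m, n_peak, m0, gamma, n_tail, mu, sigma):
    """Sum of a peak at m0 (the Z peak) and a smooth log-normal tail."""
    return n_peak * breit_wigner(m, m0, gamma) + n_tail * log_normal(m, mu, sigma)
```

In a side-band fit the six parameters would be determined from the $M_{\rm col}$ distribution outside the signal window, and the fitted function would then be integrated over the window to estimate the background yield.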