Reconstruction and identification of boosted di-τ systems in a search for Higgs boson pairs using 13 TeV proton-proton collision data in ATLAS

In this paper, a new technique for reconstructing and identifying hadronically decaying τ+τ− pairs with a large Lorentz boost, referred to as the di-τ tagger, is developed and used for the first time in the ATLAS experiment at the Large Hadron Collider. A benchmark di-τ tagging selection is employed in the search for resonant Higgs boson pair production, where one Higgs boson decays into a boosted bb¯\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ b\overline{b} $$\end{document} pair and the other into a boosted τ+τ− pair, with two hadronically decaying τ-leptons in the final state. Using 139 fb−1 of proton-proton collision data recorded at a centre-of-mass energy of 13 TeV, the efficiency of the di-τ tagger is determined and the background with quark- or gluon-initiated jets misidentified as di-τ objects is estimated. The search for a heavy, narrow, scalar resonance produced via gluon-gluon fusion and decaying into two Higgs bosons is carried out in the mass range 1–3 TeV using the same dataset. No deviations from the Standard Model predictions are observed, and 95% confidence-level exclusion limits are set on this model.


Introduction
The discovery of the Higgs boson (H) by the ATLAS and CMS collaborations at the Large Hadron Collider (LHC) in 2012 [1,2] opens new ways of probing physics beyond the Standard Model (SM), since the Higgs boson may itself appear as one of the intermediate states in the decay of new resonances.Various final states have been used by ATLAS and CMS in searches for both resonant and non-resonant HH production [3][4][5][6][7][8].In the high-mass regime, for resonance masses typically above 1 TeV, the Higgs bosons may be produced with large momenta, causing their decay products to be collimated.The standard reconstruction techniques become inefficient in this regime.Therefore, a novel technique, referred to as the di-τ tagger, is developed to reconstruct and identify boosted hadronically decaying τ + τ − pairs.For the identification, a multivariate algorithm is trained to distinguish between τ + τ − pairs and the multi-jet background from quark-or gluon-initiated jets by exploiting the calorimetric shower shapes and tracking information.
An application of the di-τ tagger is carried out in a search for a narrow spin-0 resonance in the mass range 1-3 TeV, which is produced via gluon-gluon fusion and decays into a pair of Higgs bosons (X → HH), as predicted by models with an extended Higgs sector, such as two-Higgs-doublet models [9,10].This search considers the final state where one Higgs boson decays into a b b pair and the other one into a τ + τ − pair, where both τ-leptons decay hadronically.1A dedicated benchmark of the di-τ tagger with an identification efficiency of 60% is designed for this analysis.Using 139 fb −1 of proton-proton (pp) collision data at a centre-of-mass energy √ s = 13 TeV recorded by the ATLAS experiment in 2015-2018, various orthogonal event categories are defined in order to correct the efficiency of the di-τ tagger for the benchmark selection, to perform and validate the multi-jet background estimate, and to search for resonantly produced Higgs boson pairs.This paper is organised as follows.After a brief description of the ATLAS detector in Section 2, the samples of data and simulated events used in this study are described in Section 3. The procedures used to reconstruct and identify physics objects such as electrons, muons, jets and missing transverse momentum in the detector are described in Section 4. Section 5 presents the reconstruction and identification of boosted hadronically decaying τ + τ − pairs.In Section 6, general event selections and categorisations are summarised, while Section 7 focuses on the data-driven estimation of the multi-jet background with quark-or gluon-initiated jets misidentified as boosted hadronically decaying τ + τ − pairs.The data-driven correction of the di-τ tagger efficiency is discussed in Section 8 and the search for X → HH → b bτ + τ − is presented in Section 9, including the statistical analysis used to set 95% confidence-level (CL) limits on the production cross-section for resonant HH production.Finally, a summary is given in Section 10.

ATLAS detector
The ATLAS detector [11] at the LHC is a multipurpose particle detector with a forward-backward symmetric cylindrical geometry and nearly 4π coverage in solid angle.2It consists of an inner tracking system surrounded by a thin superconducting solenoid providing a 2 T axial magnetic field, electromagnetic and hadronic calorimeters, and a muon spectrometer.
The inner detector covers the pseudorapidity range |η| < 2.5.It consists of silicon pixel, silicon microstrip, and transition radiation tracking detectors.For the √ s = 13 TeV run, a fourth layer of the pixel detector, the insertable B-layer [12,13], was inserted close to the beam pipe at an average radius of 33.2 mm, providing an additional position measurement with 8 µm resolution in the (x, y) plane and 40 µm along z.
The endcap and forward regions are instrumented with LAr calorimeters for both the electromagnetic and hadronic energy measurements up to |η| = 4.9.The granularity of the calorimeter system in terms of ∆η × ∆φ is typically 0.025 × π/128 in the barrel of the electromagnetic calorimeter and 0.1 × π/32 in the hadronic calorimeter, with variations in segmentation with |η| and the layer [14].
The muon spectrometer surrounds the calorimeters and is based on three large air-core toroidal superconducting magnets with eight coils each.The field integral of the toroids ranges between 2.0 and 6.0 T m across most of the detector.The muon spectrometer includes a system of precision tracking chambers and fast detectors for triggering.
A two-level trigger system [15] is used to select events.The first-level trigger is implemented in hardware and uses a subset of the detector information to reduce the accepted rate to at most 100 kHz.This is followed by a software-based trigger that reduces the accepted event rate to 1 kHz on average. 2 ATLAS uses a right-handed coordinate system with its origin at the nominal interaction point (IP) in the centre of the detector and the z-axis along the beam pipe.The x-axis points from the IP to the centre of the LHC ring, and the y-axis points upwards.Cylindrical coordinates (r, φ) are used in the transverse plane, φ being the azimuthal angle around the z-axis.The pseudorapidity is defined in terms of the polar angle θ as η = − ln tan(θ/2).Angular distance is measured in units of ∆R ≡ (∆η) 2 + (∆φ) 2 .

Data and simulated events
The studies presented in this paper are performed using a sample of pp collision data recorded at a centre-of-mass energy √ s = 13 TeV between 2015 and 2018, during stable beam conditions and when all detector components relevant to the analysis were operating nominally [16].This corresponds to an integrated luminosity of 139 fb −1 .Samples of Monte Carlo (MC) simulated events are used to train and calibrate the di-τ tagger, as well as to model the signal and some SM background processes in the search for resonant Higgs boson pair production.
The signal, i.e. the production of a heavy spin-0 resonance via gluon-gluon fusion and its decay into a pair of Higgs bosons, X → HH, was simulated for nine values of the resonance mass, m X , between 1 and 3 TeV, using M G 5_aMC@NLO v2.6.1 [17] at leading-order (LO) accuracy in quantum chromodynamics (QCD) with the NNPDF2.3LO[18] set of parton distribution functions (PDFs).The event generator was interfaced with H v7. 1.3 [19, 20] to model the parton shower, hadronisation and underlying event, using the default set of tuned parameters (tune) and the MMHT2014LO [21] PDF set.In the nine signal samples, a narrow-width approximation was used for the resonance X, i.e. its natural width was set to a value that remains much smaller than the experimental mass resolution.In addition, the Higgs boson mass was set to 125 GeV and the SM branching fractions were used for the decays H → b b and H → τ + τ − .In order to develop the identification algorithm of the boosted hadronically decaying τ + τ − pairs, another set of HH samples was produced, based on a narrow-width spin-2 Kaluza-Klein graviton, as predicted in the Randall-Sundrum model of warped extra dimensions [22], G → HH → (τ + τ − )(τ + τ − ).Such events were generated with M G 5_aMC@NLO v2.3.3 at LO accuracy in QCD with the NNPDF2.3LOset of PDFs, interfaced with P v8.212 [23] using the A14 [24] tune.Five samples, with graviton masses of 1.5, 1.75, 2, 2.25 and 2.5 TeV, were generated.Only hadronically decaying τ-leptons were considered in all HH signal samples.
The production of W and Z bosons in association with jets (V+jets) was simulated with S v2.2.1 [25] using matrix elements at next-to-leading-order (NLO) accuracy in QCD for up to two jets and at LO accuracy for up to four jets, calculated with the Comix [26] and OpenLoops [27] libraries.They were matched with the S parton shower [28] using the MEPS@NLO prescription [29,30].The tune developed by the S authors and the NNPDF3.0NNLOPDF set were used.The V+jets samples were normalised to a next-to-next-to-leading-order (NNLO) prediction [31].In the Z+jets events, jets are labelled according to the generated hadrons with p T > 5 GeV found within a cone of size ∆R = 0.4 around the jet axis.If a b-hadron is found, the jet is labelled as a b-jet.If no b-hadron is found but there is a c-hadron instead, the jet is labelled as a c-jet.If neither a b-hadron nor a c-hadron is found, the jet is labelled as a light (l) jet.Simulated Z+jets events are then categorised according to the labels of the two jets that are used to reconstruct the H → b b candidate.The combination of Z bb, Z bc, Z bl and Zcc events is referred to as Z+hf (denoting heavy-flavour jets) in the following, whereas other events belong to the Z+lf (denoting light-flavour jets) category.This categorisation is not performed for the W+jets process because its contribution is small.The P -B v2 generator [32][33][34] was used to generate the WW, W Z and Z Z (diboson) processes [35] at NLO accuracy in QCD.The effect of singly resonant amplitudes, as well as interference effects due to Z/γ * and identical leptons in the final state, were included where appropriate (interference effects between WW and Z Z for same-flavour charged leptons and neutrinos were ignored).Events were interfaced with P v8.186 [36] for the modelling of the parton shower, hadronisation and underlying event, with parameters set according to the AZNLO [37] tune.The CT10 [38] PDF set was used for the hard-scattering processes, whereas the CTEQ6L1 [39] PDF set was used for the parton shower.
The production of a single 125 GeV Higgs boson in association with a Z boson was simulated up to NLO accuracy in QCD using P -B v2 [40][41][42], with the NNPDF3.0NLOPDF set and subsequently reweighted to the PDF4LHC15NLO [43] PDF set.The simulation was interfaced with P v8.212, using the AZNLO tune and the CTEQ6L1 PDF set.The gg → Z H and qq → Z H samples were normalised to cross-sections calculated at, respectively, NLO accuracy in QCD including soft-gluon resummation up to next-to-leading logarithms [44][45][46] and NNLO accuracy in QCD with NLO electroweak corrections [47][48][49][50][51][52][53][54].Other single-Higgs-boson production modes were found to contribute negligibly.
Single-top-quark processes (split into s-channel, t-channel and tW contributions) and t t events were simulated using P -B v2 [55][56][57] at NLO accuracy in QCD with the NNPDF3.0NLOPDF set.All events were interfaced with P v8.230 using the A14 tune and the NNPDF2.3LOPDF set.For the tW process, the diagram removal scheme [58] was employed in order to handle the interference with t t production.The t t sample was normalised to the cross-section prediction at NNLO accuracy in QCD including the resummation of next-to-next-to-leading logarithmic soft-gluon terms calculated using T ++2.0 [59][60][61][62][63][64][65].For the single-top-quark processes, the cross-sections of the sand t-channels were corrected to the theory prediction at NLO accuracy in QCD calculated with H v2.1 [66,67], while the cross-section used for the tW sample was based on the approximate NNLO calculation [68,69].

Except when using S
, band c-hadron decays were performed with E G v1.2.0 or v1.6.0 [70], while the decays of τ-leptons were handled internally by all event generators.The effect of multiple interactions in the same and neighbouring bunch crossings (pile-up) was modelled by overlaying the original hard-scattering event with simulated inelastic pp events generated with P v8.186 using the NNPDF2.3LOPDF set and the A3 [71] tune.The MC samples were processed with a simulation [72] of the detector response based on G 4 [73] and events were then reconstructed with the same software as the data.

Object reconstruction
The following procedures are used to reconstruct and identify objects, such as electrons, muons, jets and missing transverse momentum, in the ATLAS experiment.
In general, jets refer to the hadronic objects reconstructed using the anti-k t algorithm with a radius parameter R = 0.4 [74,75], starting from topological clusters of energy deposits in the calorimeter.When used in the following, jets are required to have p T > 20 GeV and |η| < 4.5.The reconstruction of electrons is based on matching inner-detector tracks to energy clusters in the electromagnetic calorimeter.Electrons are required to have p T > 7 GeV and |η| < 2.47, excluding the barrel-endcap transition region of the calorimeter (1.37 < |η| < 1.52).They are then identified using the 'loose' operating point provided by a likelihood-based algorithm [76].The reconstruction of muons relies on matching tracks in the inner detector and the muon spectrometer.Muons are required to have p T > 7 GeV and |η| < 2.5, as well as to satisfy the 'loose' identification criteria and 'FixedCutLoose' isolation working point defined in Ref. [77].
In order to avoid double-counting of objects that overlap geometrically, an electron is removed if it shares an inner-detector track with a muon.Then, anti-k t jets with R = 0.4 are discarded if they meet one of the following two conditions: (i) ∆R(jet, e) < 0.2; (ii) the jet has less than three associated tracks and either a muon inner-detector track is associated with the jet or ∆R(jet, µ) < 0.2.Finally, an electron or muon is discarded if found within a distance ∆R = min(0.4,0.04 + 10 GeV/p e/µ T ) from any remaining jet.
The missing transverse momentum, the magnitude of which is denoted by E miss T in the following, is defined as the negative vector sum of the transverse momenta of all fully reconstructed and calibrated objects, after an overlap removal procedure that is distinct from that used for the electron/muon/jet disambiguation above [78,79].The missing transverse momentum also includes a soft term, calculated using the inner-detector tracks that originate from the primary vertex (defined as that having the largest sum of squared track-p T ) but are not associated with reconstructed objects.
In order to capture the decay products of boosted particles, such as Higgs bosons, another type of jets, called large-radius jets, is employed.These are also formed using the anti-k t algorithm, but with a radius parameter R = 1.0 and are built from topological clusters of energy deposits calibrated using the local hadronic cell weighting scheme [14].In the following, the large-radius jet matched to the boosted H → b b candidate is referred to as a 'large-R jet' while that of the boosted hadronically decaying τ + τ − pair is referred to as a 'di-τ object'.
While the reconstruction and identification of boosted H → τ + τ − decays employ novel techniques described in Section 5, a standard procedure is used for boosted H → b b decays.Large-R jets are trimmed [80] to remove the effects of pile-up and the underlying event.Trimming proceeds by reclustering the original constituents of a large-R jet into a collection of R = 0.2 sub-jets using the k t algorithm [81,82] and removing any sub-jets with p sub-jet T /p J 0 T < 0.05, where p sub-jet T is the transverse momentum of the sub-jet under consideration and p J 0 T that of the original (untrimmed) large-R jet.The energy and mass scales of the trimmed jets are then calibrated using p T -and η-dependent calibration factors derived from simulation [83].After trimming, the large-R jet is required to have p T > 300 GeV and its mass is calculated using the combined mass technique with tracking and calorimeter information as input [84].
In order to identify b-hadrons within a large-R jet, variable-radius track-jets [85] are reconstructed using the anti-k t algorithm from inner-detector tracks with a jet-p T dependent radius parameter R(p T ) = ρ/p T , where ρ determines how fast the effective size of the jet decreases with its transverse momentum.The lower (R min ) and upper (R max ) cut-offs prevent the jet from becoming too large at low p T and from shrinking below the detector resolution at high p T , respectively.In this paper, ρ = 30 GeV, R min = 0.02 and R max = 0.4 are used [86].These track-jets are then matched to an untrimmed large-R jet by using the ghost-association method [87,88].Only track-jets with p T > 10 GeV, |η| < 2.5 and at least two tracks [86] are considered in the following.Events with two collinear track-jets a and b that fulfil The flavour of track-jets is determined using a multivariate approach based on the properties and vertex information of the associated tracks [89,90].Various b-jet identification algorithms are used to exploit impact-parameter information, secondary-vertex information, and bto c-hadron decay chain information.The MV2 10 algorithm [91] then combines information from the various upstream algorithms in a boosted decision tree (BDT) that is trained to discriminate b-jets from a background sample made of 93% light-flavour jets and 7% c-jets.In order to label a track-jet as b-tagged, a requirement is placed on the output score of the MV2 10 discriminant.This requirement has an average efficiency of 70% for b-jets in simulated t t events (decreasing to about 60% for a b-jet p T above 500 GeV) with rejection factors of 8.9, 36 and 300 for jets initiated by c-quarks, hadronically decaying τ-leptons and light-flavour quarks, respectively [92].The number of b-tagged track-jets in the large-R jet is used to define various event categories, as described in Section 6.

Reconstruction and identification of boosted hadronically decaying τ + τ − pairs
Hadronically decaying τ-leptons produce a neutrino and visible decay products, typically one or three charged pions and up to several neutral pions, which are reconstructed and identified as τ had-vis objects.In the standard procedure [93], a τ had-vis object is seeded by a jet with p T > 10 GeV and |η| < 2.5, formed using the anti-k t algorithm with R = 0.4.A multivariate identification stage, using calorimetric shower shapes and tracking information as input variables, is employed to discriminate τ had-vis candidates with one or three associated tracks from quark-or gluon-initiated jets.However, in the search for high-mass X → HH → b bτ + τ − presented in this paper, more than 50% of the τ + τ − pairs have a separation ∆R < 0.4 when m X ≥ 2 TeV, hence they would fail the standard reconstruction procedure.For such events, a novel method for reconstructing boosted hadronically decaying τ + τ − pairs, referred to as the di-τ tagger, is employed.
Boosted di-τ objects are seeded by untrimmed large-radius jets that must have p T > 300 GeV.Their constituents are reclustered into anti-k t sub-jets with R = 0.2.The original di-τ seeding jet must include at least two such sub-jets and, after ordering in p T , the two leading sub-jets are used to construct the di-τ system.Tracks are geometrically matched to a sub-jet if they are within a cone of size ∆R = 0.2 around its axis, and they are labelled as 'τ tracks'.Other tracks found in the isolation region (i.e. the area of the di-τ seeding large-radius jet excluding the di-τ sub-jets) are labelled as 'iso-tracks'.A schematic representation of a di-τ object is shown in Figure 1.The track selection criteria, as well as the track-vertex matching, are the same as those used for the standard τ had-vis objects [93].In the following, the two leading sub-jets used to compute the four-momentum of the di-τ system must have p T > 10 GeV and at least one associated track.At this stage, the di-τ reconstruction efficiency is defined as the fraction of events in which a boosted di-τ candidate is reconstructed and each of the two leading sub-jets geometrically matches a generated hadronically decaying τ-lepton for which the p T of the visible products (neutral and charged hadrons) exceeds 10 GeV.Figures 2(a) and 2(b) show the di-τ reconstruction efficiency as a function of, respectively, the distance ∆R(τ 1,vis , τ 2,vis ) between the visible products of the two hadronically decaying τ-leptons and the visible p T of the di-τ system, both computed at generator level.For this purpose, a sample of simulated X → HH → b bτ + τ − events is used, in which the resonance mass is set to 2 TeV and both τ-leptons decay hadronically.In Figure 2, the reconstruction efficiency is also shown for standard (resolved) τ had-vis objects.In this case, the efficiency is defined as the fraction of events with at least two reconstructed τ had-vis candidates, with at least one associated track each, that geometrically match a generated hadronically decaying τ-lepton for which the p T of the visible products exceeds 10 GeV.This comparison shows that the boosted di-τ object reconstruction method is necessary at high transverse momenta and low ∆R(τ 1,vis , τ 2,vis ).In particular, Figure 2(a) shows that the reconstruction efficiencies decrease sharply when the visible products of the two hadronically decaying τ-leptons are so close that they merge into one jet.
With the resolved τ had-vis reconstruction this happens when ∆R(τ 1,vis , τ 2,vis ) < 0.4, while the boosted di-τ reconstruction extends the sensitivity down to ∆R(τ 1,vis , τ 2,vis ) = 0.2 by resolving the smaller sub-jets.In addition, as the distance ∆R(τ 1,vis , τ 2,vis ) increases, the reconstruction efficiency of both boosted di-τ and resolved τ had-vis objects decreases slowly, because the sub-leading generated τ-lepton is found to become softer, and hence less likely to exceed the 10 GeV p T threshold imposed on its visible products.Also, in contrast to the reconstruction efficiency of resolved τ had-vis objects, which is based on at least two candidates, the di-τ reconstruction method loses some efficiency due to the fact that only the two leading sub-jets are considered.Finally, as shown in Figure 2(b), the reconstruction efficiency reaches a plateau when the p T of the di-τ system exceeds 300 GeV, while the location of the turn-on is set by the p T cut on the seeding jet.
Figure 2: Efficiency to reconstruct a di-τ system with (squares) resolved τ had-vis objects and (circles) a boosted di-τ object versus (a) the distance ∆R(τ 1,vis , τ 2,vis ) between the visible products of the two hadronically decaying τ-leptons and (b) the p T of the di-τ system, both at generator level.The reconstruction efficiency is computed in simulated X → HH → b bτ + τ − events, where the resonance mass is set to 2 TeV and both τ-leptons decay hadronically (similar patterns are observed for other masses though).The vertical error bars only account for statistical uncertainties.
As in the case of standard τ had-vis objects, a separate identification stage using multivariate techniques is employed to reduce the background from quark-and gluon-initiated jets.For this purpose, a BDT discriminant is built using information about the clusters in the calorimeter, tracks and vertices.Multi-jet events with quark-or gluon-initiated jets misidentified as di-τ objects are expected to have lower-p T sub-jets, with a larger fraction of energy in the isolation region.Also, multi-jet events typically have fewer collimated tracks and a higher track multiplicity with, accordingly, a smaller fraction of the transverse momentum carried by each track.The variables defined in Table 1 are found to provide good discrimination and are used in the BDT training.
Table 1: Discriminating variables used in the di-τ identification BDT, aimed at rejecting the background from quarkand gluon-initiated jets.Here, LRJ refers to the seeding large-radius jet of the di-τ object, sj 1 and sj 2 stand for the first and second sub-jets ordered in p T , respectively, and tracks refer to those matched to a sub-jet (τ track), unless specified otherwise.
Variable Definition Ratios of the energy deposited in the core to that in the full cone, for the sub-jets sj 1 and sj The training of the BDT is performed using the adaptive boosting (AdaBoost) algorithm [94], with a boosting parameter of 0.5, in the TMVA toolkit [95].The di-τ objects in simulated events with a spin-2 graviton, G → HH → (τ + τ − )(τ + τ − ), form the signal training set.Five samples, with graviton masses of 1.5, 1.75, 2, 2.25 and 2.5 TeV, are combined to form the signal.The spin of the resonance is found to have no impact on the di-τ identification.
The background sample in the BDT training consists of the 3.2 fb −1 of data recorded by the ATLAS experiment in 2015 using combined jet triggers with transverse energy (E T ) thresholds between 100 and 400 GeV.It is dominated by multi-jet events and the contamination from events with a true di-τ object is found to be negligible.The di-τ p T spectra of the signal and background events are reweighted to become flat, so that the p T dependencies of the input variables are eliminated in the BDT training.On the other hand, no reweighting regarding η is performed, since no dependency on this variable is observed.Figure 3(a) shows the background rejection factor (defined as the inverse of the background efficiency) versus the signal efficiency, as obtained after the di-τ reconstruction step.In the following, a requirement is placed on the BDT output score to define a benchmark with a signal efficiency of about 60%, as indicated by the red cross.This working point corresponds to a background rejection factor of 10 4 .Figure 3(b) shows that the identification efficiency has little dependency on the p T of the di-τ system beyond 300 GeV.As illustrated in Figure 3(c), the performance of the BDT is not affected much by the number of pile-up interactions.Hence, although the training is performed using only data collected in 2015, the discrimination between the di-τ signature and the multi-jet background is not expected to change significantly across the data-taking period between 2015 and 2018.Since the majority of hadronically decaying τ-leptons produce one or three charged pions, the number of tracks matched to each di-τ sub-jet, with a distance ∆R < 0.1 from its axis, must be either one or three in the following.The efficiency of this additional requirement, computed after the di-τ reconstruction and identification steps, is shown in Figures 4(a) and 4(b) as a function of the generator-level p T of the di-τ system and the number µ of pile-up interactions, respectively.For this purpose, the same sample of simulated X → HH → b bτ + τ − events is used.The di-τ sub-jet track selection is found to be stable not only with respect to the p T of the di-τ system and the number of pile-up interactions, but also with respect to pseudorapidity and the distance ∆R(τ 1,vis , τ 2,vis ) between the visible decay products of the two hadronically decaying τ-leptons.This additional requirement further improves the rejection of the background with quark-or gluon-initiated jets by a factor of about five.
In simulated X → HH → b bτ + τ − events, the energy of reconstructed di-τ objects is close to that of the visible decay products of the corresponding two hadronically decaying τ-leptons at generator level, as illustrated in Figure 5.In addition, the energy resolution is found to improve with increasing energy.In regions of the data enriched in boosted di-τ candidates, good agreement is found between simulated and measured di-τ energy distributions; hence no additional energy calibration is applied to di-τ objects.

Event selection and categorisation
This section presents the common event selection and further categorisations used for the multi-jet background estimation, the data-driven correction of the efficiency of the di-τ tagger and the search for X → HH → b bτ + τ − .The events are selected using unprescaled triggers requiring the presence of a large-radius jet with an E T threshold between 360 and 460 GeV, depending on the data-taking period.Subsequently, in order to ensure a constant trigger efficiency, an offline p T cut that exceeds the trigger threshold by [40][41][42][43][44][45][46][47][48][49][50] GeV is applied to the leading large-radius jet, independently of whether it is compatible with a boosted b b or hadronically decaying τ + τ − pair.In order to avoid contamination from non-collision backgrounds, such as those originating from calorimeter noise, beam halo and cosmic rays, events are rejected if they contain an anti-k t jet with R = 0.4 and p T > 20 GeV that does not fulfil the loose quality criteria of Ref. [96].In addition, events are required to have a vertex with at least two associated tracks with p T > 500 MeV.Finally, events are vetoed if they contain an electron or muon that meets the requirements described in Section 4.
At preselection, events are required to contain at least one reconstructed di-τ object, which must meet the requirements below, in addition to those listed in Section 5.If multiple di-τ objects are found, the one with the highest p T is selected.
• The number of sub-jets is at most three, and the p T threshold of the two leading sub-jets is raised to 50 GeV in order to reduce the contribution of multi-jet background events.
• The separation ∆R between the leading and sub-leading sub-jets is less than 0.8 to ensure that both are fully contained in the di-τ seeding jet.
• The charge product of the two leading sub-jets is Q = q lead × q sub-lead = ±1.The charge of the sub-jet is defined as the sum of the charges of the associated tracks.
• The transverse momentum of the di-τ object, computed from the two leading sub-jets, must exceed 300 GeV (or fulfil the tighter p T requirement set by the trigger threshold if the seeding large-radius jet fires the trigger), with a pseudorapidity |η| < 1.37 or 1.52 < |η| < 2.0, thereby rejecting the transition region between the barrel and endcap calorimeters.
In addition, E miss T > 10 GeV is required in order to define a direction for the missing transverse momentum.
To select boosted H → b b decays, large-R jets with a p T exceeding 300 GeV (or the tighter trigger-dependent threshold if the seeding large-radius jet fires the trigger), |η| < 2.0, a combined mass larger than 50 GeV and a separation ∆R > 1.0 from the selected di-τ object are required.If multiple large-R jets fulfil these requirements, the one with the highest p T is selected.
Events are then further divided into regions with either misidentified or true di-τ objects.In the latter case, the event categories contain a significant fraction of di-τ objects from either Z → τ + τ − or H → τ + τ − decays.As shown in Table 2, the categorisation is based on the charge product Q of the two leading di-τ sub-jets, the number N b-tags of b-tagged track-jets in the selected large-R jet, the azimuthal angle |∆φ| between the selected di-τ object and the missing transverse momentum, the combined mass m J of the selected large-R jet, and the invariant mass of the selected di-τ object and the large-R jet, m vis H H . Requirements on m J and m vis H H are imposed in regions with one b-tagged track-jet in order to suppress the contamination from X → HH → b bτ + τ − signal events.Finally, in order to ensure orthogonality to searches for X → HH → b bb b [97], events with two or more b-tagged large-R jets are removed.
Table 2: Definition of various categories after the event preselection (see text for details), based on the charge product Q of the two leading di-τ sub-jets (OS for opposite-sign, SS for same-sign), the number N b-tags of b-tagged track-jets in the selected large-R jet, the azimuthal angle |∆φ| between the di-τ object and the missing transverse momentum, the large-R jet mass m J , and the visible mass m vis H H of the reconstructed HH system.Name Usage

Estimation of the multi-jet background with a misidentified di-τ object
Multi-jet events with quark-or gluon-initiated jets misidentified as di-τ objects are a common background for the efficiency measurement of the di-τ tagger and in the search for X → HH → b bτ + τ − .This section describes the data-driven method developed to estimate this background, whereas non-multi-jet background events with a misidentified di-τ object are obtained directly from simulation.
A data sample enriched in misidentified di-τ objects is collected using the same event selection as in a region of interest (RoI), except that the selected di-τ object must fail (instead of pass) the BDT-based identification, but it still fulfils a very loose criterion that corresponds to a cut on the BDT output score with an efficiency above 99%.In fact, this very loose criterion is applied to all di-τ candidates in order to keep the composition of misidentified di-τ objects in terms of quarks and gluons close to that in the RoI.
The contribution from non-multi-jet processes, obtained using simulation, is subtracted from the data.The remaining events, defined as N fail RoI , are then reweighted by a fake factor (FF) to predict the yield of events N mis-ID RoI with a misidentified di-τ object in the RoI as follows: The FF is calculated in the region labelled as 'FF SS' in Table 2, which requires Q = +1 for the di-τ object and is enriched in multi-jet events.It is binned in the p T of the di-τ object and defined as: , where the numerator is the number of events passing the BDT-based di-τ identification requirement, while the denominator is the number of events failing this requirement but passing the very loose BDT requirement.The contributions from non-multi-jet processes in the numerator and the denominator are about 24% and 3% of the total event yields, respectively, and they are subtracted.Figure 6 shows the FF as a function of the p T of the di-τ object.The observed increase of the FF relates to a reduced rejection of the background from quark-or gluon-initiated jets at higher p T values of the di-τ object.On the other hand, the FF is inclusive in η, since no significant change in the multi-jet background modelling was found with an additional binning in the pseudorapidity of the di-τ object.The statistical uncertainty of the measured FF comes from the limited size of the samples of data and simulated events used in the computation.Systematic uncertainties are computed to account for the different event selections used in the analysis.In particular, a p T -binned FF is computed in a region with the same selections as those used in 'FF SS', except that the charge requirement for the di-τ candidate is changed to Q = −1.The deviation (up to 30%) from the nominal FF is taken as a systematic uncertainty in the extrapolation from the SS region to the OS region.In addition, the maximum overall difference (42%) between the FF values evaluated in regions with zero, one or two b-tagged track-jets in the selected large-R jet accounts for the systematic uncertainty in the event categorisation based on b-tagging.
The estimation of the multi-jet background is validated in two regions ('MJ OS 0-tag' and 'MJ OS 1-tag' in Table 2) selected by requiring Q = −1 for the di-τ candidate, either zero or one b-tagged track-jet in the selected large-R jet, and |∆φ| > 1 between the selected di-τ object and the missing transverse momentum to suppress events with correctly identified di-τ objects.As shown in Figure 7, good agreement between the predicted and measured multi-jet backgrounds is found in the 'MJ OS 0-tag' region, while some discrepancy is observed between the predicted and measured event yields in the 'MJ OS 1-tag' region.3Therefore, an additional, conservative, non-closure systematic uncertainty of 50% is assigned for the multi-jet background estimation in all regions with N b-tags ≥ 1.

Data-driven correction of the di-τ tagger efficiency
To account for differences in the efficiency of the di-τ tagger between simulation and data, a correction is derived in a region enriched with properly identified di-τ objects from boosted Z → τ + τ − decays.This region, labelled as 'Zττ 0-tag' in Table 2, is designed by imposing a veto on events with b-tagged track-jets in the selected large-R jet and by demanding that Q = −1 and that the boosted di-τ system and the missing  2. The background referred to as 'Others' contains W+jets, diboson, Z H, t t and single-top-quark events.The lower panels show a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.transverse momentum point in the same direction, i.e. |∆φ| < 1. Figure 8 shows the visible di-τ mass in this region, as well as p T and η distributions after an additional requirement on the visible di-τ mass, 30 GeV < m vis di-τ < 90 GeV, which increases the fraction of Z → τ + τ − events to 87%.A scale factor (SF) is computed in the 'Zττ 0-tag' region as the ratio of measured to predicted event yields with a true boosted hadronically decaying τ + τ − pair.The measured event yield is obtained after subtracting the backgrounds with a misidentified di-τ object from the data: No significant dependence of the ratio on the p T or η of the di-τ object is found within the uncertainties, as illustrated in Figures 8(b) and 8(c).Hence, a single-bin SF is considered in order to minimise the statistical uncertainty.The SF is applied to all simulated (background and signal) events containing a di-τ object matched to a hadronically decaying τ + τ − pair at generator level.
The following uncertainties are considered in the SF computation.The statistical uncertainty accounts for the limited size of the data and simulated background samples, and is respectively 10.3% and 2.7% relative to the SF.The uncertainty labelled as 'Z-modelling' arises from the cross-section (5%) and the acceptance of Z+jets events.The latter comes from the choice of PDF set, as well as from variations of α S and of the renormalisation, factorisation, matrix-element matching and soft-gluon resummation scales in S .The uncertainty related to the cross-section and acceptance of the other simulated processes is found to be negligible.The systematic uncertainties arising from the multi-jet background estimate are discussed in Section 7. In all samples of simulated events, the instrumental systematic uncertainties arise primarily from the mismodelling of large-R jets, in particular the jet energy scale and resolution, as well as the jet mass scale and resolution [98].Uncertainties arising from the mismodelling of the reconstructed energy  2. All simulated events containing a generator-level τ + τ − pair matched to a simulated di-τ object are referred to as 'True di-τ'.The lower panels show a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.
(momentum) measurements for electrons (muons), as well as from the mismodelling of their reconstruction and identification efficiencies [76,77], are found to be negligible.The systematic uncertainties arising from the E miss T scale and resolution [78] uncertainties, as well as from the mismodelling of pile-up, are taken into account.Finally, the uncertainty in the combined 2015-2018 integrated luminosity is 1.7% [99], obtained using the LUCID-2 detector [100] for the primary luminosity measurements.
The modelling of di-τ objects from genuine hadronically decaying τ + τ − pairs is checked in the region 'Zττ 1-tag' with exactly one b-tagged track-jet in the selected large-R jet.In the following, for events containing a large-R jet with b-tagged track-jets, an additional systematic uncertainty arises from differences between the predicted and measured b-tagging efficiency and rates at which c-jets and light-flavour jets are misidentified as b-tagged jets [92, 101,102].As shown in Figure 9, good agreement between predicted and measured yields is observed in the region 'Zττ 1-tag', within the uncertainties.2. All simulated events containing a generator-level τ + τ − pair matched to a simulated di-τ object are referred to as 'True di-τ'.The lower panel shows the event yield ratio of the data and the predicted total background.The hatched bands represent combined statistical and systematic uncertainties.9 Search for resonant Higgs boson pair production in the b bτ + τ − final state The signal region (SR) used in the search for resonant Higgs boson pair production is defined by requiring OS charges for the two leading sub-jets of the di-τ object (Q = −1), |∆φ| < 1, and two b-tagged track-jets in the selected large-R jet, which furthermore must have a mass m J compatible with the mass of the Higgs boson .Finally, a requirement on m vis H H is imposed, which depends on the resonance mass hypothesis: no requirement is applied if m X < 1.6 TeV, whereas the cut-off value is set to 900 (1200) GeV if m X ≥ 1.6 (2.5) TeV.These requirements are summarised in the last row of Table 2.The signal acceptance times efficiency at various stages of the event selection is shown in Figure 10.The efficiency at low resonance masses is limited by the trigger thresholds and kinematic requirements on the large-R jet.In the SR, the dominant backgrounds arise from multi-jet, Z H, Z+hf and Z+lf events.The multi-jet background with a misidentified di-τ object is estimated with the data-driven method of Section 7. The distribution shapes of events from the other background processes, with a true di-τ object, are modelled using simulation, but the data-driven SF computed in Section 8 is employed for their normalisation.In addition, the S event generator was found to underestimate the heavy-flavour contribution in the Z+jets process in previous ATLAS analyses, such as in Ref. [103].Therefore, the normalisation of the Z → τ + τ − +hf background is further corrected by a normalisation factor derived in a dedicated region enriched with Z → + − +hf events, where denotes either an electron or a muon defined as a 'loose' lepton in Ref. [103].This region is selected by requiring two isolated electrons or muons with an invariant mass 81 GeV < m < 101 GeV and a dilepton transverse momentum p T > 300 GeV.In addition, the selected events must contain at least one large-R jet with p J T > 250 GeV, m J > 50 GeV and two associated b-tagged track-jets.After applying these event selections, the fraction of Z+hf events is about 80%. Figure 11 shows a comparison of the predicted and measured p T distributions.After subtracting all other processes from the data and comparing with simulated Z+hf events, the normalisation factor is found to be 1.20 ± 0.10 (stat) ± 0.28 (syst).The statistical uncertainty accounts for the limited size of the data and simulated event samples, and is respectively 8.3% and 1.7% relative to the normalisation factor.Systematic uncertainties arise from the instrumental sources detailed in Section 8 (3.4%) and from neglecting the differences between the p T distributions of the visible decay products of Z → + − and Z → τ + τ − , due to the presence of neutrinos in the latter case (23%).Table 3 summarises the event yields for data and all backgrounds, together with the associated statistical and systematic uncertainties.The systematic uncertainties in the multi-jet background estimation and of instrumental origin are the same as those discussed in Sections 7 and 8.The uncertainty related to the cross-sections and acceptances of the simulated background processes is found to be negligible compared to the statistical uncertainty.None of the simulated t t events pass the event selection of the HH SR, and hence a +1σ uncertainty of 0.12 events in the associated yield is estimated by considering the simulated events that survive the selection criteria prior to applying requirements on m J and m vis H H . T in Z → + − events produced in association with heavy-flavour jets.The background labelled as 'Others' contains W+jets, diboson, t t and single-top-quark events.The lower panel shows a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.
The systematic uncertainties in the modelling of the signal acceptance and efficiency arise from four sources.Those due to missing higher-order corrections are estimated by changing the renormalisation and factorisation scales up and down by a factor of two.Systematic uncertainties arising from the strong coupling parameter are assessed by varying its value.Uncertainties related to the PDF set used for matrix-element generation are computed by comparing the signal acceptance and efficiency across the 100 replicas of the nominal NNPDF2.3LOPDF set, as well as with alternative PDF sets, MMHT2014LO [21] and CT14 [104].Finally, the impact of the parton shower and hadronisation modelling is assessed by comparing the signal acceptance and efficiency between H 7 and P 8 predictions.The latter is the dominant systematic uncertainty in the signal modelling (2-4% for most of the mass hypotheses).
Figure 12 shows the m vis H H distribution in the SR, prior to the final selection using that variable.No significant excess with respect to the background expectation is found and only two events are observed, with m vis H H values of 1012 and 1114 GeV.
In order to extract exclusion limits on the X → HH signal cross-section, a single-bin counting experiment is performed for every mass hypothesis, based on a profile-likelihood fit [105].All sources of systematic uncertainty are accounted for as nuisance parameters that are profiled in the fit.The 95% CL upper limits on the X → HH cross-section are calculated as a function of m X using the CL s method [106].Because both the expected and observed event yields are small, the distribution of the test statistic is sampled by generating pseudo-experiments.For each mass hypothesis, 10 000 background-only and 10 000 signal+background pseudo-datasets are produced, and the exclusion limits are obtained by scanning the signal strength and interpolating to find the value at which CL s = 0.05.The corresponding upper limits on the production cross-section of a heavy, narrow, scalar resonance decaying into a pair of Higgs bosons are shown in Figure 13.H H after applying all the event selection that define the HH SR, except the requirement on m vis H H .The background labelled as 'Others' contains W+jets, diboson, t t and single-top-quark events.The X → HH → b bτ + τ − signal is overlaid for two resonance mass hypotheses with a cross-section set to the expected limit, while all backgrounds are pre-fit.The first and the last bins contain the underflow and overflow bin entries, respectively.The hatched bands represent combined statistical and systematic uncertainties.
Two different requirements are applied to m vis H H for the resonance mass hypotheses of 1.6 TeV and 2.5 TeV, leading to discontinuities in the limits (at 1.6 TeV, the difference between imposing no requirement and m vis H H > 900 GeV is less than 1% though).The more stringent requirement on m vis H H reduces the expected background for resonance mass hypotheses beyond 2.5 TeV, and no data events survive.This leads to an observed exclusion limit slightly below the expected limit.For resonance masses in the range 1.2-3.0TeV, the observed (expected) 95% CL upper limits lie between 94 and 28 fb (74 and 32 fb).For m X = 1.0 TeV, where the analysis is least optimal, mostly due to the p T requirements on the large-R jet at both the trigger and analysis levels, the observed (expected) limit becomes 817 fb (624 fb).These 95% CL upper limits are driven mainly by the statistical uncertainties of the data sample.The relative impact of the systematic uncertainties is estimated by comparing the expected upper limits when taking or not into account the nuisance parameters in the statistical analysis.For resonance mass hypotheses in the range 1.2-2.0TeV, the expected upper limits deteriorate by 20-30%, mostly due to the uncertainties in the di-τ reconstruction.At lower masses, the uncertainties in the large-R jet energy and mass scales further affect the expected upper limits (the largest impact from all nuisance parameters reaches 85% at 1 TeV).On the other hand, systematic uncertainties are found to have a negligible impact (less than 10%) on the expected upper limits for mass hypotheses beyond 2.5 TeV.

Conclusion
In this paper, novel analysis techniques using large-radius jets and exploiting jet substructure are used to reconstruct and identify boosted di-τ systems in the ATLAS experiment.These can arise from the decay H → τ + τ − of a Higgs boson with a large Lorentz boost.A dedicated benchmark of the di-τ tagger is designed to search for resonant Higgs boson pair production in the high-mass regime, X → HH → b bτ + τ − , where only final states with two hadronically decaying τ-leptons are considered.This benchmark corresponds to a 60% di-τ identification efficiency and a rejection factor of 10 4 for the background of quark-or gluon-initiated jets misidentified as boosted di-τ objects.The efficiency of the di-τ tagger is determined using 139 fb −1 of LHC proton-proton collision data at √ s = 13 TeV.
The search for resonant Higgs boson pair production via gluon-gluon fusion is carried out using the same dataset.No significant deviations from the Standard Model predictions are observed, and 95% confidence-level upper limits are set on the production cross-section of a heavy, narrow, scalar resonance decaying into a pair of Higgs bosons.Assuming SM branching fractions for the Higgs boson, the observed (expected) limits on the X → HH production cross-section lie between 94 and 28 fb (74 and 32 fb) for resonance mass hypotheses in the range 1.2-3.0TeV.

Figure 1 :
Figure1: Schematic representation of a di-τ object: the large blue cone is the di-τ seeding jet while the two smaller yellow cones are the two leading sub-jets.Tracks found within a cone of size ∆R = 0.2 around the sub-jet axis are matched to charged pions produced in hadronic decays of τ-leptons, while other tracks found in the isolation region are labelled as 'iso-tracks'.The closest distance d 0 in the transverse plane between the primary vertex and the leading track matched to a sub-jet is also shown for illustration.

Figure 3 :
Figure3: (a) Background rejection factor versus identification efficiency for the BDT used to discriminate between di-τ signatures and quark-or gluon-initiated jets, as obtained after the di-τ reconstruction step (the red cross indicates the benchmark used in the analysis, corresponding to a signal efficiency of about 60% and a background rejection factor of 10 4 ), and identification efficiency of boosted di-τ objects, as a function of (b) their p T at generator level and (c) the number µ of pile-up interactions.The vertical error bars only account for statistical uncertainties.

Figure 4 :
Figure 4: Efficiency of the track selection for di-τ sub-jets versus (a) the p T of the di-τ system at generator level and (b) the number µ of pile-up interactions, computed after the boosted di-τ reconstruction and identification steps in simulated X → HH → b bτ + τ − events, where the resonance mass is set to 2 TeV and both τ-leptons decay hadronically.The vertical error bars only account for statistical uncertainties.

Figure 5 :
Figure 5: Energy (a) scale and (b) resolution of the di-τ reconstruction as a function of the p T of the di-τ system at generator level, computed in simulated X → HH → b bτ + τ − events, where the resonance mass is set to 2 TeV and all τ-leptons decay hadronically.The requirement of p T > 300 GeV on the seeding jet truncates the reconstructed energy distribution and leads to a degradation of the energy scale for the lower values of the p T of the di-τ system.The vertical error bars only account for statistical uncertainties.

Figure 6 :
Figure6: Fake factor as a function of the p T of the di-τ object, computed in the control region labelled as 'FF SS' in Table2, with the requirements of Q = +1 for the di-τ object and of zero b-tagged track-jets for the selected large-R jet.The error bars indicate the statistical uncertainties.

Figure 7 :
Figure 7: Distribution of m vis H H in two regions enriched in multi-jet events, (a) 'MJ OS 0-tag' and (b) 'MJ OS 1-tag' in Table2.The background referred to as 'Others' contains W+jets, diboson, Z H, t t and single-top-quark events.The lower panels show a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.

Figure 8 :
Figure 8: Distributions of (a) the visible mass of the di-τ object, (b) its p T and (c) η in the region labelled as 'Zττ 0-tag' in Table2.All simulated events containing a generator-level τ + τ − pair matched to a simulated di-τ object are referred to as 'True di-τ'.The lower panels show a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.

Figure 9 :
Figure 9: Predicted and measured event yields in the region labelled as 'Zττ 1-tag' in Table2.All simulated events containing a generator-level τ + τ − pair matched to a simulated di-τ object are referred to as 'True di-τ'.The lower panel shows the event yield ratio of the data and the predicted total background.The hatched bands represent combined statistical and systematic uncertainties.

Figure 10 :
Figure 10: Signal acceptance times selection efficiency as a function of the resonance mass, at various stages of the event selection.From top to bottom: an event preselection (red solid-circle: trigger, object definitions and E miss T > 10 GeV) is performed first; the requirements on the di-τ object and large-R jet detailed in the text are then applied (blue solid-box and green open-circle, respectively); finally, the HH SR definition must be satisfied (pink open-box).

Figure 11 :
Figure 11: Distribution of pT in Z → + − events produced in association with heavy-flavour jets.The background labelled as 'Others' contains W+jets, diboson, t t and single-top-quark events.The lower panel shows a bin-by-bin comparison of the data and the predicted total background, in terms of event yield ratios.The hatched bands represent combined statistical and systematic uncertainties.

Figure 12 :
Figure 12: Distributions of m visH H after applying all the event selection that define the HH SR, except the requirement on m vis H H .The background labelled as 'Others' contains W+jets, diboson, t t and single-top-quark events.The X → HH → b bτ + τ − signal is overlaid for two resonance mass hypotheses with a cross-section set to the expected limit, while all backgrounds are pre-fit.The first and the last bins contain the underflow and overflow bin entries, respectively.The hatched bands represent combined statistical and systematic uncertainties.

Figure 13 :
Figure 13:  Expected and observed 95% CL upper limits on the production cross-section of a heavy, narrow, scalar resonance decaying into a pair of Higgs bosons (X → HH).The final state used in the search consists of a boosted b b pair and a boosted hadronically decaying τ + τ − pair, and the SM branching fractions of the Higgs boson are assumed.The ±1σ and ±2σ variations about the expected limit are indicated by the green and yellow bands, respectively.
Logarithm of the ratio of the scalar p T sum of the iso-tracks to the di-τ seeding large-radius jet p T ∆R max (track, sj 1 ) and ∆R max (track, sj 2 ) Largest separation of a track from its associated sub-jet axis, for the sub-jets sj 1 and sj 2 , respectively [p track Logarithms of the closest distance in the transverse plane between the primary vertex and the leading track of sj 1 and sj 2 , respectively n T ∆R(track, sj 2 )]/ p track T p T -weighted ∆R of the tracks matched to sj 2 with respect to its axis [p iso-track T ∆R(iso-track, sj)]/ p iso-track T p T -weighted sum of ∆R between iso-tracks and the nearest sub-jet axis

Table 3 :
Event yields of the various estimated backgrounds and data, computed in the SR of the search for X → HH → b bτ + τ − .The background labelled as 'Others' contains W+jets, diboson, t t and single-top-quark events.Statistical and systematic uncertainties are quoted, in that order.The background yields and uncertainties are pre-fit and are found to be similar to those post-fit.A dash '-' indicates a yield or uncertainty below 0.01.