1 Introduction

Processes involving the production and decay of W and Z bosons provide benchmarks for testing the Standard Model (SM), as well as probes of physics beyond the SM (BSM). Since the cross section for the direct strong production of events with multiple jets (QCD multijets) at the Large Hadron Collider (LHC) is much larger than for W and Z boson production, it is usually the case that the leptonic decays of bosons must be used to reduce the overwhelming background. However, when the momentum \(p^V\) of a boson V is comparable with its mass, \(m_V\), the spatial proximity of the decay products provides a new set of tools that can be used to distinguish between jets from hadronic boson decays and jets originating from QCD multijet backgrounds. In particular, since the angle between the decay products of a boson V scales with \(2m_V/p^V\), for large \(p^V\), jet substructure techniques become powerful tools. This leads to a tradeoff between using relatively pure leptonic decays and high-branching-ratio hadronic decays. In some BSM theories, new particles similar to W / Z bosons do not couple directly to leptons, so searching for hadronic decays of heavy particles is essential.

Jet substructure techniques developed to distinguish hadronically decaying W and Z bosons from QCD multijet background processes have become increasingly sophisticated. A recent review is given in Ref. [1]. Both ATLAS [2] and CMS [3] have performed detailed comparisons of the various tagging variables and jet-grooming techniques with the overall conclusion that large QCD multijet suppression factorsFootnote 1 are possible while maintaining acceptable levels of boson tagging efficiency. Given a W / Z-boson tagger, a natural next step is to distinguish boson types.

There are several important possible applications of a boson-type tagger at the LHC. First, a type tagger could enhance the SM physics program with W and Z bosons in the final state. Measurements of this kind include the determination of the cross sections for V+jets, VV, and \(t\bar{t}+V\). Another important use of a boson-type tagger is in searches for flavour-changing neutral currents (FCNC). Due to the Glashow–Iliopoulos–Maiani (GIM) mechanism [4], FCNC processes in the SM are highly suppressed. Many models of new physics predict large enhancements to such processes. Both ATLAS and CMS have performed searches for FCNC [5, 6] of the form \(t\rightarrow Zq\) in the leptonic channels, but these could be extended by utilizing the hadronic Z decays as well. FCNC processes mediated by a leptophobic \(Z'\) such as \(t\rightarrow Z'q\) may be detected only via hadronic type-tagging methods. A third use of a boson-type tagger is to categorize the properties of new physics, if discovered at the LHC. For instance, if a new boson were discovered as a hadronic resonance, a boson-type tagger could potentially distinguish a \(W'(\rightarrow qq)\) from a \(Z'(\rightarrow qq)\) (where mass alone may not be useful). This is especially relevant for leptophobic new bosons, which could not be distinguished using leptonic decays.

Labelling jets as originating from a W or Z boson is less ambiguous than quark/gluon labelling. A W boson can radiate a Z boson, just like a quark can radiate a gluon, but this is heavily suppressed for the former and not for the latter. The radiation pattern of jets from W- and Z-bosons is less topology dependent because it is largely independent of the other radiation in the event as W and Z bosons are colour singlets. Aside from the production cross section and subtle differences in differential decay distributions, the only features that distinguish between W and Z bosons are their mass, charge, and branching ratios. Experimentally, this means that the only variables that are useful in discriminating between hadronic decays of W and Z bosons are those which are sensitive to these properties. The three variables used in the analysis presented here are jet mass, sensitive to the boson mass, jet charge, sensitive to the boson charge, and a b-tagging discriminant which is sensitive to the heavy-flavour decay branching fractions of the bosons. The application of a boson-type tagger in practice will be accompanied by the prior use of a boson tagger (to reject QCD multijet processes). The type-tagger variables are largely independent of typical boson-tagger discriminants like n-subjettiness [7], which rely on the two-prong hard structure of both the W and Z decaysFootnote 2.

This paper introduces a jet tagging method to distinguish between hadronically decay W and Z bosons at the LHC, and documents its performance with the ATLAS detector at \(\sqrt{s}=8\) TeV. The paper is organized as follows. Section 2 describes the simulated datasets used in constructing and evaluating the boson-type tagger. Following a discussion of the differences between the properties of W and Z bosons in Sect. 3, Sect. 4 defines the three discriminating variables. The construction and performance of the tagger are detailed in Sect. 5 and the sensitivity to systematic uncertainties is described in Sect. 6. The input variables are studied in a dataset enriched in boosted W bosons in Sect. 7. The paper ends with a discussion of possible uses of the tagger in Sect. 8 and conclusions in Sect. 9.

2 Datasets

Two sets of Monte Carlo (MC) simulations are generated, one to study the tagger’s W versus Z performance and the other to compare the tagger inputs for W bosons with the data. Simulations of hypothetical \(W'~\rightarrow ~WZ\) production and decay provide a copious source of boosted W and Z bosons whose \(p_\text {T}\) scale is set by the mass of the \(W'\) boson. Such events are used to construct a tagger to separate hadronically decaying boosted W and Z bosons, as well as to evaluate its performance. It is not possible to measure the performance directly in the data due to the lack of a pure sample of boosted, hadronically decaying Z bosons, but the modelling of the tagger inputs can be studied using hadronically decaying W bosons from \(t\bar{t}\) events in the data.

A simulated sample of \(W'\) bosons is generated with PYTHIA 8.160 [8] using the leading-order parton distribution function set (PDF) MSTW2008 [9, 10] and the AU2 [11] set of tunable parameters (tune) for the underlying event. The baseline samples use PYTHIA for the \(2\rightarrow 2\) matrix element calculation, as well as \(p_\text {T}\)-ordered parton showers [12] and the Lund string model [13] for hadronization. Additional samples are produced with HERWIG++ [14], which uses angular ordering of the parton showers [15], a cluster model for hadronization [16], as well as the EE3 [17] underlying-event tune. The W’ differs from the SM W boson only in its mass and the branching ratio \(W'\rightarrow WZ\) is set to 100 %. The W and Z bosons are produced with a mixture of polarizations, but the longitudinal polarization state dominates because \(m_{W},m_{Z}\ll m_{W'}\). In order to remove artifacts in the \(p_\text {T}\) distributions of the W and Z bosons due to the generation of \(W'\) particles with discrete masses, the \(p_\text {T}^V\) spectra are re-weighted to be uniform in the range 200 GeV\( <p_\text {T}^V<400\) GeV. As is discussed in Sect. 1, for \(p_\text {T}>200\) GeV, a jet with large radius is expected to capture most of the W or Z boson decay products. The range is truncated to \(p_\text {T}<400\) GeV because hadronically decaying W bosons can be probed with data in this \(p_\text {T}\) range; there are too few events in the 8 TeV dataset for \(p_\text {T}>400\) GeV.

Top-quark pair production is simulated using the next-to-leading-order (NLO) generator POWHEG-BOX [1820] with the NLO PDF set CT10 [10] and parton showering from PYTHIA 6 [21]. The single-top (s-, t-, and Wt-channel) backgrounds are modelled with POWHEG-BOX and PYTHIA 6, as for the nominal \(t\bar{t}\) simulation. The PDF set CT10f4 [9] is used for the t-channel and CT10 is used for the s- and Wt-channels. For the \(Wt-\)channel, the ‘inclusive Diagram Removal’ (DR) scheme is used for overlap with \(t\bar{t}\) [22]. The W+jets and Z+jets backgrounds are modelled with ALPGEN 2.1.4 [23], PYTHIA 6 and the CTEQ6L1 PDF set [24]. Dibosons are generated with HERWIG 6.520.2 [25] using the CTEQ6L1 PDF set and the AUET2 tune [26]. Version 6.426 is used everywhere for PYTHIA 6, with the Perugia2011C tune [27].

Events are processed with a full simulation of the ATLAS detector and trigger [28] based on the \(\mathrm \mathtt{Geant4}\) [29] toolkit, and reconstructed using the same software as for the experimental data. The average number of additional pp collisions per bunch crossing (pileup interactions) was 20.7 over the full 2012 run. The effects of pileup are modelled by adding multiple minimum-bias events, which are simulated with PYTHIA 8.160, to the generated hard-scatter events. The distribution of the number of interactions is then weighted to reflect the pileup distribution in the 2012 data. A sample of W bosons is selected from data taken in 2012 at centre-of-mass energy of \(\sqrt{s}=8\,\mathrm{TeV}\) from \(t\bar{t}\) candidates as described in Sect. 7.

3 Distinguishing a Z boson from a W boson

Decays of W or Z bosons are characterized by the boson’s mass and coupling to fermions. The mass difference between the W and Z boson is about 10 GeV and if produced from a hard scatter or the decay of a heavy enough resonance, both bosons are produced nearly on-shell since the width \(\Gamma _V=2.1\) (2.5) GeV is much less than the mass \(m_V=80.4\) (91.2) GeV for W (Z) bosons [30]. The Breit–Wigner resonance curves for W and Z bosons are shown in Fig. 1a. The separation between the curves is a theoretical limit on how well mass-sensitive variables can distinguish between W and Z bosons. For hadronic boson decays, the mass peaks measured with jets are broader. This is because the jet-clustering algorithm for final-state hadrons loses particles at large angles to the jet axis and includes extra particles from the underlying event and pileup.

The generic coupling of a boson V to fermions is given by \(g_\text {V} \gamma _\mu [c_\text {V}-c_\text {A}\gamma _5]\), where \(g_\text {V}\) is a boson-dependent overall coupling strength, and \(c_\text {V}\) and \(c_\text {A}\) are the vector and axial-vector couplings, respectively. The W boson couples only to left-handed fermions so \(c_\text {V}=c_\text {A}=1\) with \(g_W\propto kN_\text {C}G_\text {F}m_W^3|V_{ij}|^2\), where \(G_\text {F}\) is the Fermi coupling constant, \(V_{ij}\) is a Cabibbo–Kobayashi–Maskawa (CKM) matrix element [31, 32], k represents higher-order corrections, and \(N_\text {C}=3\) for the three colours of quarks and \(N_\text {C}=1\) for leptons. The CKM matrix is nearly diagonal so \(W^+\rightarrow u\bar{d}\) and \(W^+\rightarrow c\bar{s}\) are the dominant decay modes. Small off-diagonal elements contribute to the other possible decay modes, and the overall hadronic branching ratios are approximately \(50\,\%\) for \(W\rightarrow cX\) and \(50\,\%\) for \(W\rightarrow \text {light-quark pairs}\). The W boson has electric charge \(\pm 1\) in units of the electron charge, so by conservation of charge, its decay products have the same net charge. The scalar sum of the charge of all the final-state hadrons originating from a W boson decay is not infrared safe (directly sensitive to the non-zero detection threshold), so there are limits to the performance of charge tagging dictated by the energy threshold placed on charged particles in the event reconstruction.

In contrast to W boson decays, Z bosons decay to both the left- and right-handed fermions. The partial width for \(Z\rightarrow f\bar{f}\) is proportional to \(kN_\text {C}G_\text {F}m_Z^3[c_\text {V}^2+c_\text {A}^2]\). The factors \(c_\text {V}\) and \(c_\text {A}\) are slightly different for up- and down-type fermions. The \(b\bar{b}\) branching ratio is 22 %, the \(c\bar{c}\) branching ratio is \(17\,\%\) and the sum of the remaining branching ratios is \(61\,\%\). W boson decays to b-quarks are highly suppressed by the small CKM matrix elements \(V_{cb}\) and \(V_{ub}\), so that identifying b-hadron decays associated with a hadronically decaying boson is a powerful discriminating tool. Branching ratios are plotted in Fig. 1d for Z decays to light quarks, c-quarks, and b- quarks, and in Fig. 1c for the W boson decays to light quarks and c-quarks.

Since the coupling structure is not identical for W and Z bosons, the total decay rates differ, and the angular distributions of the decay products also differ slightly. However, even at parton level without any combinatoric noise, the differences in the angular distributions are subtle. There is no difference for the two bosons with longitudinal polarization because the distributions for right- and left-handed fermions are the same. The distributions are different for right- and left-handed fermions for transversely polarized W and Z bosons, as shown in Fig. 1b. The relative contribution of left- and right-handed components for the Z decays depends on the quark flavour; for up-type quarks the relative contribution from right-handed fermions is \(15\,\%\) while it is only \(3\,\%\) for down-type quarks. In \(t\bar{t}\) decays, the fraction of longitudinally polarized W bosons (ignoring the b-quark mass) is \(m_t^2/(m_t^2+2m_W^2)\sim 0.7\). In contrast, the boson is mostly transversely polarized in inclusive V+jets events. Any discrimination shown in Fig. 1b is diluted by the longitudinal polarization, combinatorics, non-perturbative effects, and detector reconstruction, so angular distributions are not considered further in this paperFootnote 3.

Fig. 1
figure 1

a Breit–Wigner resonances for the W (red) and Z (blue) bosons, b angular distribution of the decay products of transversely polarized W / Z bosons with respect to the spin direction in the boson rest frame, c hadronic branching fractions of the \(W^+\) boson, and d of the Z boson. In c, d, light stands for decay modes not involving b, c quarks

4 Definitions of reconstructed objects

ATLAS is a multi-purpose particle detector [33] with nearly \(4\pi \) coverage in solid angle.Footnote 4 The energy of the hadronic decay products of boosted bosons is measured by a system of calorimeters. The electromagnetic calorimeter consists of a Pb/liquid-argon sampling calorimeter split into barrel (\(|\eta |\) \(<\) 1.5) and endcap ( \(1.5< |\eta | < 3.2\)) sections. The hadronic calorimetry is provided by a barrel steel/scintillating-tile calorimeter (\(|\eta | < 1.7\)) and two endcap Cu/liquid-argon sections (\(1.5 < |\eta | < 3.2\)). Finally, the forward region (\(3.1 < |\eta | < 4.9\)) is covered by a liquid-argon calorimeter with Cu (W) absorber in the electromagnetic (hadronic) section. Energy depositions are grouped into topological calorimeter-cell clusters [34] and then calibrated using the local cluster weighting algorithm [35, 36]. Jets are formed from clusters using two different jet algorithms. Small-radius jets are built with the anti-\(k_t\) algorithm with jet radius parameter \(R=0.4\) [37]. Large-radius jets are formed using the anti-\(k_t\) algorithm with \(R=1.0\) and then trimmed [38] by re-clustering the jet constituents with the \(k_t\) algorithm using \(R=0.3\) and removing the constituents with \(p_\text {T}\) less than 5 % of the original jet \(p_\text {T}\). Both the small- and large-radius jets are further calibrated to account for the residual detector response effects. For small-radius jets, this is a \(p_\text {T}\)- and \(\eta \)-dependent energy calibration, plus a correction to mitigate the contribution from additional pp collisions and to suppress jets from these additional collisions [39]. In addition to \(p_\text {T}\)- and \(\eta \)-dependent energy corrections, large-radius jets J have a calibrated jet mass:

$$\begin{aligned} m_J^2=\Bigg (\sum _{j\in J} E_j\Bigg )^2-\Bigg (\sum _{j\in J} \mathbf {p}_j\Bigg )^2, \end{aligned}$$
(1)

where \(E_j\) is the energy of cluster j and \(\mathbf {p}_j\) is a vector with magnitude \(E_j\) and direction \((\phi _j,\eta _j)\). The jet mass calibration depends on the calibrated jet energy and on the jet \(\eta \) [45]. When a W or Z boson is produced with large enough momentum, its decay products are collimated. When \(2m_V/p^V\sim 1\), an \(R=1.0\) trimmed jet captures a large fraction of the decay products and the jet-mass scale is set by \(m_V\). Since the W and Z boson masses differ by about 10 GeV, the jet mass can be used to discriminate between these two particles. The distributions of the boson masses and jet masses for hadronically decaying W and Z bosons are shown in Fig. 2. The particle-level (‘truth’) jet mass is constructed from stable particles in the MC simulation (\(c\tau > 10\) mm), excluding neutrinos and muons, clustered with the same jet algorithm as for calorimeter-cell clusters. The QCD processes that govern the formation of stable particles from the W and Z decay products create a broad distribution of jet masses even without taking into account detector resolution. Constructing the jet mass from calorimeter-cell clusters further broadens the distribution. The jet-mass resolution (physical \(\oplus \) detector) is large compared to the natural width of the W and Z bosons and comparable to the difference in their masses. For example, the standard deviation of the detector resolution \(\sigma (m^\text {reco jet}/m^\text {truth jet})\) is approximately 10 %. The jet-mass variable nevertheless has some discriminating power.

Fig. 2
figure 2

a The boson mass at generator level, b ‘truth’ jet mass (at particle level) after parton fragmentation, and c reconstructed jet mass distributions. The left plot has a different vertical scale than the right two plots and also has no \(p_\text {T}\) requirement

The momentum and electric charge of particles traversing the detector contain information about the charge of their parent boson. The tracks of charged particles are measured in a 2 T axial field generated by a solenoid magnet which surrounds the inner detector (ID) consisting of silicon pixels, silicon micro-strips, and a transition radiation tracking detector. Charged-particle tracks are reconstructed from all three ID technologies with a full coverage in \(\phi \), \(|\eta |<2.5\) and \(p_\text {T}>400\) MeV. The charge q of a track is determined as part of the reconstruction procedure, which uses a fit with five parameters: the transverse and longitudinal impact parameters, \(\phi \), \(\theta ,\) and q / p, where p is the track momentum. To suppress the impact of pileup, tracks are required to originate from the primary collision vertex, which is defined as the vertex with the largest \(\sum p_\text {T}^2\) computed from associated tracks. Additionally, tracks must satisfy a very loose quality criterion for the track fit \(\chi ^2\) per degree of freedom, which must be less than three. Tracks are associated with jets using ghost association [40]. The charge of tracks associated with a jet is sensitive to the charge of the initiating parton. In order to minimize the fluctuations due to low-\(p_\text {T}\) particles, the jet charge is calculated using a \(p_\text {T}\)-weighting scheme [41]:

$$\begin{aligned} Q_J = \frac{1}{({p_\text {T,J}})^\kappa }\sum _{i\in \mathbf Tracks } q_i\times (p_\text {T}^i)^\kappa , \end{aligned}$$
(2)

where Tracks is the set of tracks with \(p_\text {T}>500\) MeV associated with jet J, \(q_i\) is the charge (in units of the electron charge) determined from the curvature of track i with associated \(p_\text {T}^i\), \(\kappa \) is a free parameter, and \({p_\text {T,J}}\) is the transverse momentum of the jet measured in the calorimeter. The calorimeter energy is used in the denominator to determine \(p_\text {T}\) instead of the sum of track momenta to account for the contribution from neutral particles. Dedicated studies have shown that \(\kappa =0.5\) is generally best for determining the charge of partons from the jets they produce [42]. The distributions of the jet charge for jets initiated by \(W^+,W^-\) and Z bosons are shown in Fig. 3. There is an observable separation between positive and negative W bosons. The expected charge composition of a W sample is process dependent. For example, there are more \(W^+\) than \(W^-\) bosons in inclusive \(W'\) production because of the initial charge asymmetry of quarks in the proton resulting in more \(W'{}^{+}(\rightarrow W^+Z)\) than \(W'{}^{-}(\rightarrow W^-Z)\). The discrimination between Z bosons and a near even mixture of \(W^\pm \) is greatly diminished with respect to e.g. Z versus \(W^+\). In that case charge sensitive variables are not very useful for the tagger and so all results are shown also without such variables. In a variety of physics processes, the charge of the hadronically decaying W boson is known from other information in the event. For example, in searches for FCNC effects in \(t\bar{t}\) events with one leptonically decaying W boson, the charge of the lepton is opposite to the charge of the hadronically decaying W boson. Henceforth, only \(W^+\) bosons are used for constructing the boson-type tagger; the results are the same for \(W^-\) bosons.

Fig. 3
figure 3

The jet charge distribution for jets originating from \(W^\pm \) and Z bosons in simulated \(W'\) decays. Each distribution is normalized to unity. The parameter \(\kappa \) controls the \(p_\text {T}\)-weighting of the tracks in the jet charge sum

The tracks from charged particles can be used further to identify the decays of certain heavy-flavour quarks inside jets due to the long b-hadron lifetime. This is useful for boson-type tagging because the Z boson couples to \(b\bar{b}\) while decays of the W boson to b-quarks are highly suppressed and can be neglected. ATLAS has commissioned a b-tagging algorithm called MV1 (defined in Refs. [43, 44]) which combines information about track impact-parameter significance with the explicit reconstruction of displaced b- and c-hadron decay vertices. The boson-type tagger presented here uses multiple bins of the MV1 distribution simultaneously. Five bins of MV1 are defined by b-tag efficiencies (probability to tag a b-quark jet as such) of 0–50, 50–60, 60–70, 70–80, and 80–100 % as determined in simulated \(t\bar{t}\) events. A lower b-tag efficiency leads to higher light-quark jet rejection. The five b-tagging efficiency bins are exclusive and MV1 is constructed as a likelihood with values mostly between zero and one (one means more like a b-jet). For example, a \(100\,\%\) b-tagging efficiency corresponds to a threshold of MV1 \(>0\) and an \(80\,\%\) b-tagging efficiency corresponds to a threshold value of MV1 \(>z\) for \(z\ll 1\). The 80–100 % b-tag efficiency bin then corresponds to jets with an MV1 value between 0 and z. Constructed in this way, the fraction of true b-jets inside an efficiency bin x %–y % should be \((y-x)\,\%\).

Small-radius jets are matched to a large-radius jet by geometric matchingFootnote 5 (\(\Delta {R}<1.0\)). Of all such small-radius jets, the two leading ones are considered. There are thus 30 possible bins of combined MV1 when considering the leading and sub-leading matched small-radius jet. The number of bins is 25 from the \(5\times 5\) efficiency-binned MV1 distributions in addition to five more for the case in which there is no second small-radius jet matched to the large-radius jet. The distribution for the efficiency-binned MV1 variable for the leading and sub-leading matched small-radius jets is shown for W and Z bosons in Fig. 4. The flavour of a small-radius jet is defined as the type of the highest energy parton from the parton shower record within \(\Delta R < 0.4\). As expected, a clear factorization is seen in Fig. 4 – the MV1 value depends on the flavour of the small-radius jet and not the process that created it. This means that c-jets from W decays have the same MV1 distribution as c-jets from Z decays; the same is true for light jets. Small-radius jets originating from b-hadron decays tend to have a larger value of MV1, which means they fall in a lower efficiency bin. Small-radius jets not originating from b- or c-decays are called light jets and are strongly peaked in the most efficient bin of MV1. There is always one small-radius jet matched to the large-radius jet, but about \(20\,\%\) of the time there is no sub-leading small-radius jet with \(p_\text {T}>25\) GeV matched to the large-radius jets. These cases are all predicted to originate from light-quark decays of the W and Z bosons.

Fig. 4
figure 4

The efficiency-binned MV1 distribution for small-radius jets associated with large-radius jets resulting from W and Z boson decays. The left (right) plot shows the leading (sub-leading) small-radius jet MV1 distribution. The bins correspond to exclusive regions of b-jet efficiency. As such, the bin content of the black line (b-tagging for b-jets) should be proportional to the size of the efficiency window: about 50 % for the rightmost bin, 10 % for the three middle bins and 20 % for the second bin

5 Tagger performance

The optimal multivariate tagger combining jet mass, jet charge, and the MV1 of matched small-radius jets is constructed from a three-dimensional (3D) likelihood ratio. For N bins each of jet mass and jet charge, as well as 30 combined MV1 bins, the 3D likelihood ratio would have \(30\times N^2\) total bins. Populating all of these bins with sufficient MC events to produce templates for the likelihood ratio requires an unreasonable amount of computing resources, especially for the high-efficiency bins of combined MV1. Estimating the 3D likelihood as the product of the 1D marginal distributions, where all variables but the one under consideration are integrated out, is a poor approximation for jet mass and combined MV1 due to the correlation induced by the presence of semileptonic b-decays, which shift the jet mass to lower values due to the presence of unmeasured neutrinos.Footnote 6 It is still possible to use a simple product by noting that all three tagger inputs are independent when the flavour of the decaying boson has been determined. Thus, for each possible boson decay channel, templates are built for the jet mass, the jet charge, and the efficiency-binned MV1 distributions. For a particular decay flavour, the joint distribution is then the product of the individual distributions. Summing over all hadronic decay channels then gives the full distribution. To ease notation, the efficiency-binned MV1 is denoted \(B=(B_\text {lead},B_\text {sub-lead})\). The distribution for \(B_\text {lead}\) (\(B_\text {sub-lead}\)) is shown in the left (right) plot in Fig. 4. Symbolically, for decay flavour channel \(\mathcal {F}\), mass M, charge Q, and efficiency-binned MV1 B, the likelihood is given by:

$$\begin{aligned} p(M,Q,B|V)=\sum _\mathcal {F} \Pr (\mathcal {F}|V)p(M|\mathcal {F},V)p(Q|\mathcal {F},V)\Pr (B|\mathcal {F},V), \end{aligned}$$
(3)

whereFootnote 7 \(V\in \{W,Z\}\) and the sum is over \(\mathcal {F}=bb,cc,cs,cd\) and light-quark pairs. The distribution of B is well approximated as the product of the distributions for \(B_\text {lead}\) and \(B_\text {sub-lead}\) when the flavours of the leading and sub-leading jets are known. This is exploited for hadronically decaying W bosons and for the light-quark flavour decays of Z bosons to construct templates for B that have a sufficient number of simulated events for large values of B, i.e. \(\Pr (B|\mathcal {F},V)=\Pr (B_\text {lead}|\mathcal {F},V)\Pr (B_\text {sub-lead}|\mathcal {F},V)\). The unit-normalized templates for B are shown in Fig. 4 and the unit-normalized templates \(p(M|\mathcal {F},V)\) and \(p(Q|\mathcal {F},V)\) are shown in Fig. 5. For a given boson type, the jet-charge template is nearly independent of the flavour. However, there is a dependence of the jet mass on the (heavy) flavour of the boson decay products.

Fig. 5
figure 5

a The jet mass \(p(M|\mathcal {F},V)\) and b jet charge \(p(Q|\mathcal {F},V)\) templates conditioned on the flavour \(\mathcal {F}\) of the boson V decay for jets with 200 GeV \(<p_\text {T}<400\) GeV. The solid lines are for Z boson decays and the dashed lines are for W boson decays

The likelihood function is constructed by taking the ratio of the probability distribution functions p(MQB|V), for \(V\in \{W,Z\}\), determined from the templates in Eq. (3). Every bin i of the 3D histogram that approximates p(MQB|V) is assigned a pair of numbers \((i,s_i/b_i)\) where \(s_i\) is the overall fraction of the signal (Z or W) in bin i and \(b_i\) is the fraction of the overall background (the other boson flavour) in bin i. Bins are then sorted from largest to smallest \(s_i/b_i\), with f(i) defining a map from the old bin index to the new, sorted one. There are then two 1D histograms: for the signal, bin j has bin content \(s_{f^{-1}(j)}\) and for the background, bin j has bin content \(b_{f^{-1}(j)}\). The optimal tagging procedure is then to set a threshold on the new 1D histograms. The full likelihood ratio of the combined tagger is shown in Fig. 6 where the thresholds required for 90, 50, and \(10\,\%\) Z-boson tagging efficiency are marked with shaded regions.

Curves displaying the tagging performance for all possible subsets of \(\{M,Q,B\}\) are shown in Fig. 7. There are 30 possible values for B, which are therefore represented by discrete points. The jet mass is the best performing single variable for medium to high Z-boson efficiencies, with visible improvement for M+B and M+Q. There is a significant gain from combining all three variables for Z-boson tagging efficiency above about \(20\,\%\). Below \(20\,\%\), the combined tagger is dominated by B where the \(Z\rightarrow b\bar{b}\) branching fraction no longer limits Z-boson tagging efficiency. For Z-boson efficiencies of about \(50\,\%\), one can achieve \(W^+\) rejection factors (\(1/\epsilon _{W^+}\)) of 3.3 by using Q or B alone and about 5.0 using mass alone. For Z efficiencies of \(\epsilon _Z=90\), 50, and 10 %, \(W^+\) rejection factors of 1.7, 8.3, and 1000, respectively, can be achieved with the combined tagger. Although most applications of boson-type tagging will target Z bosons as the signal while rejecting W bosons as background, the likelihood constructed in Fig. 6 can also be used to optimally distinguish \(W^{+}\) bosons from Z bosons. The corresponding performance curves are shown in Fig. 8. The locations of the b-tagging points are all now shifted to high efficiency with respect to Fig. 7 because, for \(W^+\) tagging, one wants to operate in the high-efficiency b-tagging bins (whereas the opposite is optimal for Z tagging). At an efficiency of \(\epsilon _{W^+}=50\,\%\), a Z-boson rejection factor of \(1/\epsilon _Z \approx 6.7\) can be achieved.

Fig. 6
figure 6

The full likelihood ratio for the tagger formed from jet mass, jet charge, and a small-radius jet b-tagging discriminant. The black histogram shows the likelihood ratio for Z bosons and the red histogram is the likelihood ratio for \(W^+\) bosons. The shaded areas show the region of the likelihood ratio corresponding to 90, 50,  and \(10\,\%\) working points of the Z-boson tagging efficiency

Fig. 7
figure 7

The tradeoff between Z efficiency and a \(1-\) (\(W^+\) efficiency) b or 1 / (\(W^+\) efficiency) on a a linear scale and b a logarithmic scale. Each curve is constructed by placing thresholds on the likelihood constructed from the inputs indicated in the legend. Since the b-tagging discriminant is binned in efficiency, there are only discrete operating points for the tagger built only from B

Fig. 8
figure 8

The tradeoff between \(W^+\) efficiency and a \(1-\) (Z efficiency) or b 1 / (Z efficiency) on a a linear scale and b a logarithmic scale. Each curve is constructed by placing thresholds on the likelihood constructed from the inputs indicated in the legend. Since the b-tagging discriminant is binned in efficiency, there are only discrete operating points for the tagger built only from B

6 Systematic uncertainties

The performance curves in Fig. 7 are based on the nominal modelling parameters of the ATLAS simulation. Additional studies show how the curves change due to the systematic uncertainties on the inputs to the likelihood function. Sources of experimental uncertainty include the calibrations of the large- and small-radius jet four-momenta, the b-tagging (which incorporates e.g. impact parameter modelling), and the modelling of track reconstruction.

The uncertainty on the scale of the large-radius jet mass calibration is estimated using the double ratio in data and MC simulation of calorimeter jet mass to track jet mass [45]. Tracks associated with a jet are well measured and provide an independent observable correlated with the jet energy. Uncertainties on the jet-mass resolution can have a non-negligible impact on the performance of the tagger. The jet-mass resolution uncertainty is determined from the difference in the widths of the boosted W boson jet-mass peak in semileptonic \(t\bar{t}\) simulated and measured data events [45] and also from varying the simulation according to its systematic uncertainties [46]. The resolution is about 5 GeV in the Gaussian core of the mass spectrum and its uncertainty is about 20 %. The impact of the jet-mass scale and resolution uncertainties on the boson-type tagger built using only the jet mass is shown in Fig. 9 for two nominal working points of 50 and \(90\,\%\) Z-boson tagging efficiency. Both the likelihood map f from Sect. 5 and the threshold value are fixed. Inputs to the tagger are shifted by their uncertainties and the 1D histograms described above are re-populated. The efficiencies for W and Z bosons are recomputed and shown as markers in Fig. 9a. Coherent shifts of the jet masses (JMS) for W and Z bosons result in movement along the nominal performance curve corresponding to \(\pm 10\,\%\) changes in the efficiency. However, there are also shifts away from the nominal curve because the optimal jet-mass cut is not a simple threshold. Variation of the jet-mass resolution (JMR) preserves the scale and so the movement is nearly perpendicular to the original performance curve, at the \(\lesssim 5\,\%\) level, because of the increased overlap in the Z and W mass distributions.Footnote 8 Shifts along the nominal curve optimally use the input variables (albeit at different efficiencies), while shifts away from the nominal curve are a degradation in the performance. The impact of the fragmentation is estimated by using input variables from HERWIG but with the likelihood map from PYTHIA. PYTHIA and HERWIG have similar W / Z efficiencies at both the 50 and \(90\,\%\) benchmark points.

The systematic uncertainty on the efficiency of the tracking reconstruction is estimated by removing tracks associated with jets using an \(\eta \)-dependent probability [47]. The probability in the region \(2.3<|\eta |<2.5\) is \(7\,\%\); it is 4 % for \(1.9<|\eta |<2.3\), 3 % for \(1.3<|\eta |<1.9\), and 2 % for \(0<|\eta |<1.3\). These probabilities are known to be conservative in the most central \(\eta \) bins. There is also an uncertainty on the modelling of track merging for high-\(p_\text {T}\) jets, but the loss is expected to be negligible for jets with \(p_\text {T}<400\) GeV. Differences in the modelling of fragmentation can affect the expected performance for all the input variables, especially for the track-dependent observables. The impact of various uncertainties on the boson-type tagger built using only the jet charge is shown in Fig. 9b. Since W and Z boson decays produce on average many tracks (see Sect. 7), removing a small number of them does not have a big impact on the jet-charge tagger as a result of the \(p_\text {T}\)-weighting in the jet charge sum.

The efficiency to b-tag jets of various flavours (b, c, and light) is measured in data using \(t\bar{t}\) events [43], jets with identified charm hadrons, and multijet events [44]. The differences between data and MC simulation are typically a few percent and are applied as independent correction factors on a per-jet basis. The uncertainties on these scale factor measurements are used as estimates of the systematic uncertainty on the b-tagging. The sources of uncertainty are decomposed into many uncorrelated components (24 for b-jets, 16 for c-jets, and 48 for light-flavour jets) and the impact on the rejection is added in quadrature for a fixed value of \(\epsilon _\text {signal}\). The b-tagging of matched small-radius jets is also affected by uncertainties on the jet-energy scale and resolution. These quantities are varied within their uncertainties and if the shifted jet has \(p_\text {T}<25\) GeV, its MV1 value is not considered. The impact of various uncertainties on the boson-type tagger built using only the b-tagging discriminant for a 10 % nominal Z efficiency is shown in Fig. 10. At this efficiency, the full boson-type tagger is dominated by the b-tagging inputs, as seen in Fig. 7. The scale factor uncertainty for b-jets has no impact on the W efficiency (no real b-jets), but there is approximately a 10 % uncertainty on the Z efficiency. The uncertainties on the jet-energy scale for small-radius jets are relevant only because of the 25 GeV \(p_\text {T}\) threshold. Since all of the large-radius jets are required to have \(p_\text {T}>200\) GeV, the threshold is relevant only in the rare case that one of the W daughters is nearly anti-parallel in the W rest frame to the direction of the W boost vector.

The impact of the uncertainties on the jet-mass scale and resolution on the boson-type tagger built using all of the inputs (jet mass, jet charge, and b-tagging) is shown in Fig. 11a. At very low Z-boson tagging efficiency, the tagger is dominated by b-tagging, so Fig. 10 is a good representation of the uncertainty on the full tagger’s performance. For higher efficiencies, the tagger is dominated by the jet mass, although the jet charge and b-tagging discriminant significantly improve the performance. The uncertainty on the full tagger’s performance at the 50 and \(90\,\%\) Z-boson tagging efficiency benchmark points is due mostly to the uncertainty on the jet mass, which is why these uncertainties are shown in Fig. 11.

Fig. 9
figure 9

The impact of selected systematic uncertainties on benchmark working points of the boson-type tagger. a A jet-mass-only tagger, for 50 % (left) and 90 % Z efficiency benchmarks. b A jet-charge-only tagger, for 50 % (left) and 90 % Z efficiency benchmarks. The point marked HERWIG uses the alternative shower and hadronization model for the simulation, with the likelihood template from PYTHIA. See the text for an explanation of the notation in the legend

Fig. 10
figure 10

The impact of selected systematic uncertainties on benchmark working points of a b-tagging-only tagger at a \(10\,\%\) Z efficiency benchmark. The b-tagging discriminant is binned, so there are only discrete operating points. The point marked HERWIG uses the alternative shower and hadronization model for the simulation, with the likelihood template from PYTHIA. The b-tagging scale factor uncertainties are determined separately for b-, c-, and light-quark jets. Variations are added in quadrature for each ‘truth’ jet flavour. There is no contribution from the b-jet scale factor uncertainties on the W rejection because there are no ‘truth’ b-jets. Conversely, the c- and light-jet scale factor uncertainties do not impact the Z bosons because at this low efficiency, all the selected Z bosons decay into \(b\bar{b}\)

Fig. 11
figure 11

The impact of uncertainties on the jet-mass scale and resolution for 50 % (a) and 90 % (b) Z efficiency working points of the full boson-type tagger. The point marked HERWIG uses the alternative shower and hadronization model for the simulation, with the likelihood template from PYTHIA

7 Validation of tagging variables using data

The tagger cannot be fully tested with data because it is not possible to isolate a pure sample of hadronically decaying Z bosons in pp collisions. However, the modelling of the variables used to design the tagger can be studied with a relatively pure and copious sample of hadronically decaying W bosons in \(t\bar{t}\) events which can be tagged by the leptonic decay of the other W boson in the event (semileptonic \(t\bar{t}\) events). Single-lepton triggers are used to reject most of the events from QCD multijet background processes. Candidate reconstructed \(t\bar{t}\) events are chosen by requiring an electron or a muon with \(p_\text {T} > 25\) GeV and \(|\eta | < 2.5\), as well as a missing transverse momentum \(E_{\text {T}}^{\text {miss}} > 20 \mathrm GeV\). The electrons and muons are required to satisfy a series of quality criteria, including isolation.Footnote 9 Events are rejected if there is not exactly one electron or muon. In addition, the sum of the \(E_{\text {T}}^{\text {miss}} \) and the transverse massFootnote 10 of the W boson, reconstructed from the lepton and \(E_{\text {T}}^{\text {miss}} \), is required to be greater than 60 GeV. Events must have at least one b-tagged jet (at the 70 % efficiency working point) and have at least one large-radius trimmed jet with \(p_\text {T}>200\) GeV and \(|\eta |<2\). Furthermore, there must be a small-radius jet with \(p_\text {T}>25\) GeV, and \(\Delta R<1.5\) to the selected lepton (targeting the decay chain \(t\rightarrow bW(\rightarrow \ell \nu )\)). The other W boson candidate is selected as the leading large-radius trimmed jet with \(\Delta R>1.5\) from the small-radius jet that is matched to the lepton. The W+jets and multijet backgrounds are estimated from the data using the charge asymmetry and matrix methods, respectively [48]. The other backgrounds are estimated directly from MC simulation. Although the resulting event selection is expected to have a high \(t\bar{t}\) purity (about 75 %), the events cannot be compared directly to the isolated W bosons from the simulated \(W'\) boson decays. This is because there are several effects that make the typical large-radius jet in semileptonic \(t\bar{t}\) events different from isolated W and Z boson jets in \(W'\) boson eventsFootnote 11:

  1. 1.

    The event selection is based on the reconstructed jet \(p_\text {T}\) (earlier sections used \(p_\text {T}^V\)), so even if \(p_\text {T}^\text {jet}\gtrsim 200\) GeV for an \(R=1.0\) jet, the true hadronically decaying W boson in the event may have \(p_\text {T}^W<200\) GeV and thus the W boson decay products might not be collimated within \(\Delta R <1\).

  2. 2.

    There are more (close-by) jets in semileptonic \(t\bar{t}\) events than in \(W'\) boson events. Jets not originating from the W boson can form the leading large-radius jet, or the b-jet from the same top-quark as the hadronically decaying W bosons can merge with the W boson decay products to form a large-radius jet.

The variables \(p_\text {T}^\text {jet}/p_\text {T}^W\) and \(\Delta R(\text {jet},W)\), for the W boson from the MC ‘truth’ record and the selected large-radius jet, are used to classify the various \(t\bar{t}\) event sub-topologies. Events are labelled as having a Boosted W if \(|p_\text {T}^\text {jet}/p_\text {T}^W-1|<0.1\) and \(\Delta R(\text {jet},W)<0.1\). If the b-quark from the top-quark decay has an angular distance \(\Delta R<1.0\) from the selected large-radius jet, this jet is labelled as b -contaminated. All other \(t\bar{t}\) events, including events where both W bosons decay into leptons, are labelled as Other. The \(p_\text {T}\) spectrum of the jets from the classified events is shown in Fig. 12. In Fig. 12 and subsequent figures, systematic uncertainties on the simulation include the jet \(p_\text {T}\) and jet mass uncertainties described in Sect. 6, but exclude tracking uncertainties, which are sub-dominant. Events are vetoed if the selected large-radius jet has \(p_\text {T}>400\) GeV or if the \(\Delta R\) between the selected large-radius jet and a tagged b-jet is less than 1.0. This suppresses the b-contaminated \(t\bar{t}\) events. The effectiveness of the \(t\bar{t}\) event classification is most easily seen from the jet mass distribution, shown in Fig. 13a. The mass of the boosted W bosons from \(t\bar{t}\) events is peaked around \(m_W\), as is a small contribution from the hadronically decaying W bosons in single-top events in the Wt channel. There is no peak at \(m_t\) in the b-contaminated spectrum because of the b-jet veto, but there is a small non-resonant contribution below the top-quark mass, due to events in which one W daughter is matched with the b-jet. This is akin to the b-jet+lepton invariant mass used in other circumstances to measure top-quark properties and naturally has a scale around 150 GeV [49]. The low-mass peak in W+jets and the ‘other’ \(t\bar{t}\) events is due to the Sudakov peak from QCD jets, the location of which scales with \(R \times p_\text {T}\). The dependence on \(p_\text {T}\) of the W-peak position in Fig. 13a is shown in Fig. 13b. Events with the leading jet in a window around the W mass, 50 GeV \(<m^\text {jet}<120\) GeV are selected and the median of the mass distribution is plotted in Fig. 13b as a function of the jet \(p_\text {T}\). The similar trend for the simulation and the data shows that the combination of the reconstructed jet-mass scale and ‘truth’ jet-mass scale is well modelled. To quantify the spread in the jet mass peak, various inter-quantile ranges are shown as a function of \(p_\text {T}\) in Fig. 13c. The inter-quantile range of size \(0\,\%<X<50\,\%\) is defined as the difference between the \(50\,\%+X\,\%\) quantile and the \(50\,\%-X\,\%\) quantile, and is a measure of the spread in the distribution. The width of the boosted-W mass peak is well modelled within the statistical precision of the 2012 data sample.

The modelling of boosted W bosons can also be studied using the jet-mass scale measured from tracks. Defining the variable \(r_{\mathrm {track}}\) as the ratio of the jet mass determined from tracks to the jet mass determined from the calorimeter, the jet mass scale uncertainty is related to the difference from unity of the ratio of \(\langle r_\text {track}\rangle \) in data to \(\langle r_\text {track}\rangle \) in MC simulation. The mass scale uncertainty is calculated using the procedure described above, but with \(r_{\mathrm {track}}^{-1}\). If the jet consists only of pions, the natural scale for \(r_\text {track}\) is 2/3, although there are significant physics and detector effects that introduce a large spread of values. The distribution of \(r_\text {track}\) in the \(t\bar{t}\)–enriched event sample with the same \(p_\text {T}\) and b-jet veto requirements as in Fig. 13 is shown in Fig. 14a. Unlike the raw jet-mass distribution, the \(r_\text {track}\) distribution is similar for all of the sub-processes, as expected. The scale and spread of the \(r_\text {track}\) distribution are quantified in Fig. 14b, c using the \(p_\text {T}\) dependence of the median and inter-quantile ranges. Previous studies have indicated that the track multiplicity, \(n_\text {track}\), in quark and gluon jets is not well modelled, especially for gluon jets, where \(n_\text {track}\) is lower in the data with respect to PYTHIA [50]. The distribution of the track multiplicity for large-R jets in the \(t\bar{t}\)-enriched event sample is shown in Fig. 15. The boosted W events are peaked at slightly lower values of the number of associated tracks compared to the quark/gluon jets from the other processes. The (charged) particle multiplicity increases for generic quark and gluon jets as a function of jet energy. However, the mass-scale of the jets produced from W boson decays is set by \(m_W\) so that in the absence of detector reconstruction effects, the track multiplicity distribution should be largely \(p_\text {T}\) independent. The \(p_\text {T}\) dependence of the track multiplicity is shown in Fig. 15b, c in the form of the median and the inter-quantile ranges. The median does increase because of the large non-W component as well as the finite detector acceptance for charged particles from the boosted W boson decay. The width is well modelled within the statistical precision of the data. However, there is disagreement for the median. Previous studies (including Rev. [50]) suggest that this is due to fragmentation modelling and not the modelling of the detector response.

The \(p_\text {T}\)-weighted distribution of the track charges defines the jet charge, which is shown in Fig. 16a. The charge of the lepton from the leptonic W boson decay determines the expected charge of the hadronically decaying W boson candidate, allowing for a tag-and-probe study of the capability of charge tagging in hadronic W boson decays [42]. The jet charge for boosted W bosons for positively (negatively) charged leptons is clearly shifted to the left (right) of zero. There is also some separation between positive and negative W boson decays when the selected large-radius jet does not satisfy the criteria for being a boosted W boson. This is because the jet still contains some of the W boson decay products, and the jet charge is correlated with the charge of the W boson. The difference between the inclusive and boosted W-boson jets is clearer in the \(p_\text {T}\) dependence plot of the median jet charge shown in Fig. 16b. The medians of the distributions for boosted W jets are nearly twice as far apart as the medians for inclusive jets. However, in both cases the spread is less than the width of the distribution, shown as the inter-quantile range (inter-quantile range with \(X=25\,\%\)) in Fig. 16c. Even though there is some small disagreement for the median number of tracks, the \(p_\text {T}\)-weighted sum defining the jet charge is reasonably well modelled.

The remaining input to the boson tagger is the b-tagging discriminant for the matched small-radius jets. The efficiency-binned MV1 distributions are shown in Fig. 17a, b with the same selection criteria as for the previous figures, except that the b-jet veto is removed. The contamination due to the b-jet from the top-quark decay complicates a direct study of the MV1 distribution for boosted W jets; contamination from the b-quark decay products is seen clearly in the MV1 distribution at lower values of the efficiency. Most of the boosted W jets are in the highest efficiency bin because they have no real b-hadron decay.

Overall, the simulation models all three input variables well.

Fig. 12
figure 12

The \(p_\text {T}\) distribution of the selected large-radius jets. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass described in Sect. 6

Fig. 13
figure 13

a The jet-mass distribution of the selected jets in semi-leptonic \(t\bar{t}\) events. b The median of the mass distribution as a function of the jet \(p_\text {T}\) for events with the selected jet in the range 50 GeV \(<m^\text {jet}<\) 120 GeV. This includes the contributions from events which are not classified as Boosted W. c For the same events as in b, the inter-quantile range as a measure of spread. The quantiles are centred at the median. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass described in Sect. 6. The inter-quantile range of size \(0\,\%<X<50\,\%\) is defined as the difference between the \(50\,\%+X\,\%\) quantile and the \(50\,\%-X\,\%\) quantile. Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins

Fig. 14
figure 14

a The distribution of \(r_\text {track}\) in the data for semi-leptonic \(t\bar{t}\) events with the selected jet in the range 50 GeV \(<m^\text {jet}<\) 120 GeV. b The median of the \(r_\text {track}\) distribution as a function of the jet \(p_\text {T}\). This includes the contributions from events that are not classified as Boosted W. c The inter-quantile range as a measure of the width. The quantiles are centred at the median. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass described in Sect. 6. The inter-quantile range of size \(0\,\%<X<50\,\%\) is defined as the difference between the \(50\,\%+X\,\%\) quantile and the \(50\,\%-X\,\%\) quantile. Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins

Fig. 15
figure 15

a The distribution of the number of tracks associated with the selected large-radius jet in the semi-leptonic \(t\bar{t}\) data for events with the selected jet in the range 50 GeV \(<m^\text {jet}<\) 120 GeV. b The median of the distribution of the number of tracks as a function of the jet \(p_\text {T}\). This includes the contributions from events that are not classified as Boosted W. c The inter-quantile range as a measure of the width. The quantiles are centred at the median. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass described in Sect. 6. The inter-quantile range of size \(0\,\%<X<50\,\%\) is defined as the difference between the \(50\,\%+X\,\%\) quantile and the \(50\,\%-X\,\%\) quantile. Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins

Fig. 16
figure 16

a The distribution of the jet charge in the data for semi-leptonic \(t\bar{t}\) events with the selected jet in the range 50 GeV \(<m^\text {jet}<\) 120 GeV. The ratio uses the positive lepton charge. b The median of the jet charge distribution as a function of the jet \(p_\text {T}\). This includes the contributions from events that are not classified as Boosted W (except for the blue triangles, for which only the Boosted W is included). c The inter-quartile range as a measure of the width. The quantiles are centred at the median. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass described in Sect. 6. The inter-quantile range is defined as the difference between the \(75\,\%\) quantile and the \(25\,\%\) quantile. Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins

Fig. 17
figure 17

The efficiency-binned MV1 distribution for the a leading and b sub-leading matched small-radius in semi-leptonic \(t\bar{t}\) events. If there is no second small-radius jet with \(p_\text {T}>25\) GeV and \(\Delta R<1\) to the selected large-radius jet axis, the event is put in the ‘No jet’ category in b. The uncertainty band includes all the experimental uncertainties on the jet \(p_\text {T}\) and jet mass and those related to the b-tagging described in Sect. 6. Statistical uncertainty bars are included on the data points but are smaller than the markers in many bins

8 Outlook

The simulation studies of the boson-type tagger presented in Sect. 5 show that for bosons with 200 GeV \(<p_\text {T}<\) 400 GeV, it is possible to achieve Z-boson efficiencies of \(\epsilon _Z=90\), 50, and \(10\,\%\) with \(W^+\) boson rejections of 1.7, 8.3 and 1000, respectively. Putting this into context, with \(R(\epsilon _Z)\) defined as the lowest possible W-boson tagging efficiency at a fixed Z-boson tagging efficiency:

  • The WZ/WW cross-section ratio is \(\sim 20\,\%\) [51]. At the 50 % type-tagger working point, one can change the ratio of events to

    $$\begin{aligned} \frac{50\,\%}{R(50\,\%)}\times \frac{ \sigma (WZ)}{ \sigma (WW)} = \frac{50\,\%}{12\,\%}\times \frac{ \sigma (WZ)}{ \sigma (WW)} = \frac{50}{12}\times 20\,\% \approx 83\,\%, \end{aligned}$$
    (4)

    with the possibility for a high-purity extraction of the WZ cross section in the semileptonic channel (\(\ell \nu q\bar{q}\)).

  • Diboson resonances are predicted by many models of physics beyond the Standard Model. The all-hadronic channel provides a significantly higher yield than the leptonic channels. At the \(90\,\%\) type-tagger working point, one can distinguish ZZ from WZ with a likelihood ratio of \(0.9^2/(0.9\times 0.6)\sim 1.5\). new resonance is discovered with \(\sim 20\) events, this means that the difference between ZZ and WZ is distinguishable within a \(2\sigma \) statistical uncertainty of the data.

  • At the 10 % type-tagger working point, a leptophobic flavour-changing neutral current in \(t\bar{t}\) production (with decays like in the SM) with a branching ratio of 1 % would have the same number of events as the \(t\rightarrow bW\) decay:Footnote 12

    $$\begin{aligned} \frac{10\,\%}{R(10\,\%)}\times \frac{\Gamma (t\rightarrow Zc)}{ \Gamma (t\rightarrow Wb)}=\frac{10\,\%}{0.1\,\%}\times \frac{ \Gamma (t\rightarrow Zc)}{\Gamma (t\rightarrow Wb)} = 100 \times 1\,\% = 100\,\%. \end{aligned}$$
    (5)

Only the range 200 GeV \(<p_\text {T}<400\) GeV was studied thus far due to the availability of W bosons in the data. Figure 18 shows how the average and standard deviation of the jet mass, jet charge and multiplicity of the matched small-radius b-tagged jets distributions depend on jet \(p_\text {T}\) in simulation up to 1 TeV. As long as the jet \(p_\text {T}\) is high enough so that a single jet captures all of the boson decay products, the jet mass and jet charge distributions are predicted to be largely independent of \(p_\text {T}\). The information from b-tagging degrades around 400 GeV as the two decay products from the boson become too close to resolve two separate jets.

Fig. 18
figure 18

The boson \(p_\text {T}\) dependence of the a jet mass, b jet charge, and c number of small-radius b-tagged jets matched to the large-radius jet

9 Conclusions

A tagger for distinguishing hadronically decaying boosted Z bosons from W bosons using the ATLAS detector has been presented. It will most likely be used after a boson tagger has rejected most QCD multijet events. Three discriminating variables are chosen which are sensitive to the differences in boson mass, charge, and branching ratios to specific quark flavours: large-radius jet mass, large-radius jet charge, and an associated small-radius jet b-tagging discriminant. For moderate and high Z-boson tagging efficiencies, the jet mass is the most discriminating of the three variables, but there is significant improvement in discrimination when combining all three inputs into a single tagger. At low Z-boson efficiencies, smaller than the \(Z\rightarrow b\bar{b}\) branching ratio, the b-tagging discriminant is the most useful for rejecting W bosons. The full tagger is largely unaffected by many systematic uncertainties on the inputs, with the exception of the uncertainties on the jet-mass scale and resolution. While it is not possible to measure the tagger efficiencies directly in data due to the lack of a pure sample of boosted, hadronically decaying Z bosons, modelling of the likelihood function using hadronically decaying W bosons has been studied in the data. Overall, the simulation agrees well with the 20.3 fb\({}^{-1}\) of \(\sqrt{s}=8\) TeV pp data collected at the LHC.