Stealth multiboson signals

We introduce the `stealth bosons' $S$, light boosted particles with a decay $S \to AA \to q \bar q q \bar q$ into two daughter bosons $A$, which subsequently decay into four quarks that are reconstructed as a single fat jet. Variables that measure the two-pronged structure of fat jets, which are used for diboson resonance searches in hadronic or semi-leptonic final states, classify the jets produced in stealth boson decays as QCD-like - actually, for these variables they may seem more background-like than the QCD background itself. The number of tracks in those jets can also be, on average, much higher than for the fat jets arising from the hadronic decay of boosted $W$ and $Z$ bosons. Therefore, these elusive particles are hard to spot in standard searches. Heavy resonances decaying into two such stealth bosons, or one plus a $W/Z$ boson, could offer an explanation for the recurrent small excesses found in hadronic diboson resonance searches near an invariant mass of 2 TeV.


Introduction
Small excesses around an invariant mass of 2 TeV appear in no less than five different searches for diboson resonances decaying hadronically, performed by the ATLAS and CMS Collaborations at the Large Hadron Collider (LHC), and with energies of 8 and 13 TeV. Although none of these excesses is statistically significant on its own, it is difficult to regard them as a mere coincidence. Yet, their interpretation as a background shaping or a new physics signal is difficult too.
The largest excess, of 3.4σ, was found by the ATLAS Collaboration in a search using the full Run 1 datset at 8 TeV [1], and sparked a great interest. The excess appeared in the (non-independent) samples tagged as W W , W Z and ZZ, but it was largest for the W Z event selection. Before, a mild excess had been found by the CMS Collaboration [2], though the maximum significance was below 2σ, and located at slightly smaller invariant masses. The first ATLAS analysis of Run 2 data with 3.2 fb −1 at 13 TeV [3], did not show any hint of an excess with the nominal W Z event selection but relaxing the boson tagging criteria for fat jets a third bump appeared, again around 2 TeV, in one of the control distributions (the dijet invariant mass distribution without an upper cut on the 1 number of tracks per jet). The significance of this bump was later estimated to be of 2.4σ [4]. When the analysis was updated with a luminosity of 15.5 fb −1 [5], a bump at 2 TeV appeared also with the nominal W Z selection, with a significance around 2σ. The fifth 2 TeV bump has appeared recently in the CMS search with the full 2016 dataset, using a luminosity of 35.9 fb −1 [6]. Its significance is around 2σ. No excess at this mass was observed in earlier analyses with 2.6 fb −1 [7] and 12.9 fb −1 [8]. A very recent analysis [9] by the ATLAS Collaboration with 36.7 fb −1 shows no excess at this invariant mass.
Before addressing other explanations, it is worthwhile discussing in detail the possibility that the 2 TeV bumps are caused by some shaping of the dijet invariant mass distribution of the SM background. It is known (see for example Ref. [10]) that the various methods of jet grooming, such as mass-drop filtering [11], trimming [12], pruning [13] and soft drop [14] alter the shape of the mass distribution of jets resulting from quarks and gluons. For example, the jet trimming algorithm used by the ATLAS Collaboration in Run 2 searches [5] gives rise to a kink in the distribution at masses m J ≃ 80 GeV for jets of radius R = 1.0 and transverse momenta p T = 1 TeV. In this situation, it is conceivable that a jet mass selection around the W/Z masses could induce some feature in the background distribution around p T = 1 TeV, which could be reflected at dijet invariant masses m JJ ∼ 2 TeV. On the other hand, the ATLAS Run 1 search uses mass-drop filtering, for which this kink is not present, and the CMS Collaboration uses jet pruning and soft drop in their Run 1 and Run 2 analyses, respectively. It is then questionable that the same background shaping would appear at 2 TeV in all these analyses, using not only different jet grooming methods but also different jet substructure tagging. An explicit calculation of the QCD dijet distribution with the ATLAS Run 1 and Run 2 jet tagging criteria [4] showed that a kink around m JJ = 1.7 TeV appears when the jet mass and tagging criteria are applied (see also Ref. [15]), as it can be seen in Fig. 1. But it shows no hint of a bump around 2 TeV. As it will be shown in the following, the jet tagging criteria used by the CMS Collaboration in Run 2 searches do not produce a 2 TeV bump either. Consequently, in this work we will discard the possibility of a significant background shaping, although further studies on this issue are welcome.
Regarding new physics interpretations, it was early pointed out [16] that the 3.4σ ATLAS excess at 8 TeV was unlikely to result from a V V diboson resonance, V = W, Z, as the searches in the semi-leptonic final states [17,18] did not exhibit any deviation from the standard model (SM) prediction. The proposal in Ref. [16] was of a V V X triboson resonance, that is, a resonance R undergoing a cascade decay R → V Y → V V X yielding two SM bosons plus an additional particle X, with Y being an intermediate resonance.
The presence of an extra particle X would dramatically decrease the efficiency of such a signal in the searches in semileptonic modes, as it was later confirmed in Ref. [4] with a for the ATLAS Run 1 (left) and Run 2 (right) hadronic diboson searches, from Ref. [4]. The black lines correspond to the full W Z boson tagging criteria described in Refs. [1,3]: jet mass cuts, √ y (at Run 1) or D 2 (at Run 2) substructure tagging, and number of tracks per jet. For the blue, green and pink lines two out of these three tagging criteria are applied. The red lines correspond to the prediction without any boson tagging. The error bars in the points represent the Monte Carlo uncertainty.
more detailed analysis. The reason is that the mentioned Run 1 searches in semileptonic modes, as well as the more recent ones with Run 2 data [19-23], are highly optimised for the kinematics of diboson resonances produced back-to-back in the transverse plane. Moreover, the event selection criteria often veto the presence of extra particles near the decay products of the boson with leptonic decay, which obviously dampens the efficiency for such a signal. As an alternative explanation for the lack of a positive signal in the semileptonic decay channels, it was proposed [24] that the two hadronically decaying 'bosons' identified as two fat jets are not actually massive SM gauge bosons but smuons of similar mass, decaying into two quarks via R-parity violating interactions, and the heavy 2 TeV resonance is a sneutrino. That is, the observed 'bosons' simply do not have leptonic decay modes.
In Ref. [25] it has been shown that a wide bump on a smoothly-falling distribution that cannot be predicted from simulations is quite difficult to detect in narrow resonance searches: the bump can easily be absorbed in the background normalisation. This is indeed the case for diboson resonance searches in hadronic final states, where the background is obtained from a fit to data in the signal region, assuming some smooth functional form. This feature may partially explain why excesses are not seen in the searches that have a smaller dataset. Still, current new physics interpretations are undermined by an apparent inconsistency among the size of the excesses. For example, in Ref. [4] a variety of triboson signals that accommodated the ATLAS Run 1 excess was examined in order to give predictions for other analyses, in particular for the hadronic Run 2 CMS search with 2.6 fb −1 . The predictions were compatible with the null experimental result but in the latest dataset [6], with a luminosity 13 times higher, a larger excess should have appeared. The same reckoning is expected to apply to a simple signal such as the one proposed in Ref. [24], with two particles from a resonance decay, each decaying into a quark pair. A possible solution to this puzzle is suggested by a comparison of the different hadronic searches, focusing not on the number of observed events, but rather on the expected QCD background at dijet invariant masses around 2 TeV, and its dependence on the boson tagging criteria [26]: • CMS analyses use a subjettiness ratio τ 21 [27] to quantify the likeliness that the jets have a two-pronged structure. For the so-defined 'high purity' (HP) jets, the criterion has strenghthened from τ 21 ≤ 0.5 in the Run 1 analysis [2], to τ 21 ≤ 0.45 [7], τ 21 ≤ 0. 4 [8], and τ 21 ≤ 0.35 [6], as the dataset has increased. To give an example, the expected background near 2 TeV for the high purity dijet sample is similar in the latest analysis, with 35.9 fb −1 , and the previous one with 12.9 fb −1 . This clearly shows that the requirements on jets are much tighter.
• For the ATLAS analyses the comparison between Run 1 and Run 2 is not easy, because in Run 1 a cut √ y ≥ 0.45 on the y variable [11] measuring the subjet momentum balance is used, whereas in Run 2 it is replaced by a momentum-dependent cut on the so-called D 2 function [28]. Nevertheless, by a naive parton luminosity scaling of the expected background around 2 TeV, it is seen that the boson tagging criteria for the Run 2 nominal selection [3] are around one order of magnitude stronger. 1 The latest ATLAS search [9] implements a new jet mass definition and a reoptimisation of the jet mass windows and D 2 cut, whose details are not publicly available. We will not cover that analysis here.
Getting together the above arguments, namely (i) the persistence of the 2 TeV bumps, (ii) the unlikely possibility of a background shaping, (iii) the apparent inconsistency of the size of the bumps, an obvious question arises: May it be possible to have new physics signals giving fat jets whose substructure is background-like? If this were the case, those signals would be more and more suppressed as the jet tagging requirements are tightened, and we might have an explanation for that apparent inconsistency. To answer this question, 1 Scaling the expected background of two events per 100 GeV in the bin around 2 TeV in the Run 1 analysis by a factor of 15 to account for the increased gg parton luminosity, and multiplying by 3.2 fb −1 / 20.3 fb −1 , one would expect a background of 7 events in the Run 2 analysis with 3.2 fb −1 , whereas the expected background is around 0.7 events. 4 which arises in the context of the 2 TeV anomaly but whose consequences go beyond the interpretation of potential LHC excesses, is the motivation of the present work. And, as we show in this paper, the answer is affirmative. The cascade decay of a very boosted particle S with a mass M S ∼ M V into two lighter particles A that subsequently decay each into two quarks, gives a single fat jet with a mass consistent with the W and Z masses. (The particles S and A can be for instance a new scalar H 0 1 and pseudo-scalar A 0 , respectively, in models with an extended scalar sector.) And, depending on the A mass M A , the fat jet originated in the decay of S may seem more background-like than the QCD background itself, when one considers jet subjettiness measures such as τ 21 or D 2 . Therefore, it is appropriate to denote these S bosons as 'stealth bosons'. An alternative is to have a heavier S and with one of its decay products being a vector boson V = W, Z. One theoretically wellmotivated possibility is with M S M Z + M A and the four quarks merging into a single jet. (Again, S and A can be the new scalar H 0 1 and pseudo-scalar A 0 in models with an extended Higgs sector.) Often the jet grooming algorithm will completely eliminate the decay products of A, so that the groomed jet mass may be close to the Z mass even if S is heavier.
Stealth bosons with a mass M S ∼ M V and decaying as in Eq. (1) are the main focus of our analysis. The results obtained apply, at least qualitatively, to the decay chain of Eq. (2) as well. In section 2 we consider the diboson-like decay of a heavy resonance R into two such stealth bosons, R → SS, and compare the resulting variables with those for a true diboson resonance Z ′ → W W . There, our statement that those signals can be more background-like than the QCD background will be apparent, with the explicit example of CMS and ATLAS diboson resonance searches. In section 3 we study a diboson-like decay R → V S of a resonance into a SM boson and a stealth boson, and in section 4 a tribosonlike cascade decay R → V Y → V V S, with Y an intermediate resonance. As an example of these cascade decays, as well as for the Monte Carlo simulations, we use the multiboson signals that arise in left-right models [29], but our results will not be limited to such specific examples. We discuss our results in section 5. Although the novel 'stealth boson' signatures introduced in this work are motivated by the 2 TeV excesses, we point out that they are interesting on their own. Such signatures are relatively hard to spot over the QCD background with the standard diboson resonance searches, and therefore they are so far quite unexplored. Related details are discussed in three appendices. The possible shaping of the QCD background with the CMS event selection criteria is investigated in appendix A. A side effect of the multi-pronged structure of stealth bosons is the fact that standard grooming algorithms often fail to recover the true stealth boson mass from the resulting fat jet. This issue is discussed in appendix B, with a comparison of the results of different grooming algorithms. Appendix C is devoted to a brief exploration of the decay chain in Eq. (2).

Diboson-like R → SS decays
The R → SS decay depicted in Fig. 2 actually yields a quadriboson final state, but will resemble a diboson resonance if S is much lighter than R. Such a signal can arise in left-right models if CP is violated in the scalar sector, so that the CP-odd and CP-even states mix, or in simpler Z ′ extensions of the SM with additional scalars. We take in our simulations R = Z ′ with M R = 2 TeV, S = H 0 1 with M S = 100 GeV and consider A = A 0 → bb decays in 'high mass' and 'low mass' benchmarks, with M A = 40 and 20 GeV, respectively. (As these two scalars do not couple to W W or ZZ, they can be quite light and yet evade current searches; moreover, their couplings to quarks can be small, so that they are mainly produced from the decay of heavier particles.) The relevant Lagrangian is implemented in Feynrules [30] in order to generate events with MadGraph5 [31] using the universal Feynrules output [32]. Event generation is followed by hadronisation and parton showering with Pythia 8 [33]. The detector response is simulated with Delphes 3.4 [34], and for the jet reconstruction and analysis FastJet 3.2 [35] is used. The QCD dijet background is generated by slicing the phase space in dijet invariant mass intervals of 100 GeV between 700 and 3.5 TeV, generating 1.5 × 10 5 events in each m JJ interval and recombining these samples with a weight proportional to their cross section.  The CMS hadronic searches use jets with a large radius R = 0.8 reconstructed with the anti-k T algorithm [36], referred to as AK8 jets, as vector boson candidates. A soft drop grooming algorithm [14] is performed on the AK8 jets to eliminate contamination from initial state radiation, multiple interactions and pile-up. The most recent analysis selects  GeV the efficiency is essentially zero already for τ 21 ≤ 0.45.) This is precisely the 'anomalous' behaviour anticipated for this type of signals in the introduction. Note that for the QCD background the absolute efficiency values are not meaningful because they depend on the parton-level sample. In particular the sample generated here has dijet masses m jj ≥ 700 GeV, while a cut m JJ ≥ 1.05 TeV is applied in the topological selection. Instead, the relevant issue here is that while the signal-to-background ratios S/B and S/ √ B increase for a potential W W signal when strengthening the cut, they do not for a stealth boson signal. The jet mass distributions are also of interest. They are shown, after topological selection but before the boson tagging, in the top panels of Fig. 5, for the leading and subleading jets. While for the lower mass benchmark the boson masses are adequately reconstructed after the grooming, this is not the case for the higher mass, though a sizeable fraction of events still fall within the W or Z tagging mass windows. (A comparison of the jet masses with different grooming algorithms is presented in appendix B.) The dijet invariant mass distribution before boson tagging is wider than for a true diboson resonance, and peaks at lower invariant masses, as shown in the bottom left panel of Fig. 5. After boson tagging (right panel), the distribution for M A = 20 GeV is still wide while for M A = 40 GeV the simulated signal does not pass the boson tagging. For this plot, we consider HP jets as those with τ 21 ≤ 0.45. We point out that, as shown in Ref. [25], for a wide bump the location of the maximum deviation with respect to the background-only hypothesis is not located at the maximum of the 'signal' distribution, but may be shifted 100 − 200 GeV to higher dijet masses. Thus, these distributions would still give an apparent excess near 2 TeV. We now study the features of the R → SS signals under the ATLAS Run 2 analyses. The ATLAS Collaboration uses wider jets with radius R = 1.0, reconstructed with the anti-k T algorithm, and trimmed [12] to eliminate contamination. As topological selection, events must contain two fat jets, the leading one with p T ≥ 450 GeV and the subleading one with p T ≥ 200 GeV, both within |η| ≤ 2.0 and with a small rapidity separation |∆y 12 | ≤ 1.2. The dijet invariant mass m JJ must be larger than 1 TeV. At variance with the CMS analyses, a transverse momentum balance cut is here introduced, variable, abbreviated throughout this paper as D 2 , is used to characterise the two-pronged substructure, and a p T -dependent upper cut is imposed [37], approximately D 2 ≤ 1 + 0.8 (p T − 300)/1200 with p T in GeV. An upper cut N trk < 30 is also placed on the number of tracks with p T ≥ 0.5 GeV in the plain jets, pointing to the primary vertex. Jets satisfying both criteria are tagged as W or Z candidates if their mass is within an interval of ±15 GeV around the expected resonance peak. Notice that a jet can simultaneously be tagged as W and Z, therefore the W W , W Z and ZZ samples are not disjoint.
We present the D 2 distribution for the different signals and the background after topology and mass cuts in Fig. 6 (left). For Z ′ → W W and the background the distributions are similar to the ones obtained by the ATLAS Collaboration [38], slightly shifted to smaller D 2 values. For R → SS the distributions are background-like, especially for M A = 40 GeV. This is expected from what we have observed for the analogous τ 21 variable used by the CMS Collaboration. The application of the D 2 requirement on both jets has a 50% efficiency for the Z ′ → W W diboson signal, 24% for R → SS in the low-mass benchmark and only 6% in the high-mass benchmark.
The distribution of the number of tracks N trk for the plain jets is shown in the right panel of Fig. 6. For a true diboson signal Z ′ → W W , our results are close to the ones obtained by the ATLAS Collaboration, with a slightly smaller number of tracks predicted by our simulation. For R → SS with M A = 20 GeV, the distribution has a maximum at the 25 − 26 bin, close to the maximum of the QCD background. For M A = 40 GeV the signals have, on average, more tracks than the background itself. For the W Z selection, the requirement N trk < 30 reduces the efficiency of the signals and background as Again, the absolute value of the background efficiency is not relevant because it depends on the parton-level cuts in the sample; rather, the relative efficiency with and without the N trk < 30 requirement is the quantity of interest. We see that a requirement of a small number of tracks leads to an additional suppression of this type of signals, besides the one from the D 2 tagging, which may even decrease the signal-to-background ratio.
The trimming algorithm used by the ATLAS Collaboration allows for a slightly better jet mass determination for stealth bosons than in the CMS analyses. We show in Fig. 7 (top) the masses of the leading and subleading jet after the topological selection. The shapes are less peaked than for W bosons. The dijet mass distributions on the bottom panels are very wide, and similar after topology cuts only (left) and with the full W Z tagging (right). In the latter case the corresponding plot for M A = 40 GeV is not shown because the signal is tiny.
To conclude this section, we compare in   for a true diboson signal the efficiency of the ATLAS Run 2 event selection is larger than in Run 1, for the R → SS signals it is noticeably smaller -which might have motivated the disappearance of the excess in the first Run 2 result with the nominal selection, if it were due to a signal of this type. Of course, the precise numbers vary with M A , but the trend is the correct one. The comparison of these figures with the efficiency for the CMS selection is not meaningful because for CMS analyses the jet mass windows are narrower, and the events in the W W , W Z and W W categories, which are disjoint, are combined to obtain the limits on a potential signal. Besides, we point out that, as argued in the introduction from a naive parton luminosity scaling, the efficiency for the background is reduced by around an order of magnitude in the Run 2 selection with respect to Run 1.

Diboson-like R → V S decays
We repeat the same procedure of the previous section for a R → V S decay such as the one depicted in Fig. 8, which yields a triboson final state but with a diboson-like topology. Such a signal can originate for example in left-right models from the decay W ′ → W H 0 1 , with subsequent H 0 1 → A 0 A 0 decay [29], or also from Z ′ → ZH 0 1 . We consider the former process, with S = H 0 1 , A = A 0 and the same parameters as in section 2. The results, for what concerns the hadronic diboson searches, would be analogous for a signal of the latter type with a Z boson in the final state.
For the CMS analysis the τ 21 distributions for the leading and subleading jets are separately shown in Fig. 9. These distributions are a combination of the ones for the W and S bosons in Fig. 3, but they are obviously correlated: always one of the two jets, the one corresponding to S, has a large τ 21 . For this reason, it is illustrative to quantify the efficiency drop when one changes the τ 21 cut from 0.45 to 0. The masses of the leading and subleading jets are also shown in Fig. 9. The distributions are narrower than for a R → SS decay, as it may be expected because one of the decay products is a massive SM boson. The dijet invariant mass distributions are wide, peaking at invariant masses slightly lower than the resonance mass. The distributions after boson tagging are slightly sharper, comparable to a true diboson resonance.
For the ATLAS Run 2 analyses the D 2 and N trk boson tagging variables are shown in Fig. 10. For the leading jet the distributions are similar for the three signals, indicating that the leading jet in R → V S is the SM vector boson in most cases. The D 2 tagging requirement on both jets has an efficiency of 0.35 and 0.18 for M A = 20 and 40 GeV, respectively, larger than for the R → SS signal. Here it is also interesting to see the effect of the N trk < 30 cut, which reduces the signal efficiencies for the W Z selection as follows, For this type of signals an upper cut on the number of tracks may be counterproductive too, especially for heavier A.  Besides the jet substructure differences found, the topology and jet mass of the R → V S signal are not very different from a true diboson resonance, as it can be seen by comparing the rest of distributions presented in Fig. 10, namely the masses of the leading and subleading jet after the topological selection, the dijet invariant mass after topological selection and also with the final W Z tagging. The distributions are less peaked than for a true diboson but still they are alike. Finally, we collect in table 2 the efficiencies for

Triboson-like R → V V S decays
As our third example we consider a cascade decay R → V Y → V V S, with subsequent decay S → AA, as depicted in Fig. 11. This is a quadriboson signal but with a tribosonlike topology. There are two crucial differences with the previously seen resonance decays.  Figure 11: First, there are already two SM bosons in the final state, which produce fat jets with a higher boson tagging efficiency. Second, the invariant mass of the two selected jets does not concentrate at a maximum near or below the resonance mass, but instead the dijet invariant mass distributions are broader [16]. An example of this cascade decay chain is W ′ → ZH ± → ZW ± H 0 1 in left-right models [29]. This is the signal used in our simulations, taking Y = H ± with M Y = 500 GeV, S = H 0 1 , A = A 0 → bb. For this signal we collect the relevant distributions for the CMS analyses in Fig. 12. The τ 21 distribution for the leading jet is quite close to the one for a Z ′ → W W diboson signal, while there are some differences for the subleading jet. (This indicates that the leading jet is the Z boson from the heavy resonance decay in most cases.) The efficiency that is, it decreases by nearly the same amount as for a Z ′ → W W signal. The jet mass distributions are not very different from a diboson resonance either. Note that here the leading jet mass concentrates around M Z , as corresponds to the simulated signal. Sharp differences are found in the dijet invariant mass distributions in the bottom panels of Fig. 12. The m JJ distribution does not display any resonance-like structure before the boson tagging (left) and a wide bump, similar to the ones found for triboson signals in Ref. [4], appears after boson tagging.
The distributions for the ATLAS Run 2 analyses are given in Fig. 13. The D 2 variable for the leading and subleading jets in R → V V S in the top panels are similar to the ones for the benchmark Z ′ → W W signal, and the effiency for the D 2 cut in both jets is 0. That is, the signal suppresion is higher than for a true diboson signal but not dramatic. The jet mass distributions are relatively narrow, and the dijet invariant mass distributions do not display a peak, even after the application of the full W Z selection. This contrasts with the behaviour of true triboson signals, for which the invariant mass distribution of the two leading jets is shaped to a peak by the application of topological and boson tagging cuts [4,16]. The comparison of the efficiencies for ATLAS Run 1 and Run 2 analyses is given in table 3. They are slightly smaller in Run 2. Overall, we find that except for some efficiency decrease when the boson tagging requirements are tightened, the V V S signals behave in much the same way as the triboson signals studied before [4].

Discussion
New physics interpretations of the ATLAS Run 1 diboson excess [1] can explain the absence of signals in the semileptonic channels, either because the signal is due to a V V X triboson resonance [16] or by assuming a particle with a mass close to M W , M Z and decaying into qq only [24]. (Interpretations of this excess as a diboson resonance were early disfavoured, and have been already excluded with the first Run 2 results.) Nevertheless, the main difficulty for these proposals is posed by the most recent ATLAS and CMS searches in the hadronic channels, some of which again exhibit some mild excesses at the same mass, but with sizes that are apparently inconsistent. Even taking into account that, as shown in Ref. [25], part of the excess may be absorbed by the normalisation of the background if statistics are small, one would expect more significant excesses in the large samples collected in Run 2.
The 'stealth bosons' S introduced in this paper, which decay into four quarks that merge into a single fat jet, can succesfully address apparent inconsistencies among mild excesses in different searches like those. For stealth bosons decaying S → AA we have found that: 1. For the R → SS and R → V S signals the boson tagging efficiency for CMS analyses drops when the cut on the τ 21 variable, used to measure the two-pronged structure of the jets, is tightened. This may explain why the latest CMS analysis with 35.9 fb −1 [6] does not observe a much larger excess, and the previous one with 12.9 fb −1 [8] had a small deviation at the 1σ level.
2. For R → SS especially, and to some extent for R → V S and R → V V S too, the efficiency for the ATLAS Run 2 event selection is smaller than at Run 1, explaining why the Run 2 excess with 15.5 fb −1 [5] was of only 2σ, and with 3.2 fb −1 the deviation was at the 1σ level [3].
3. The upper cut on the number of tracks N trk applied by the ATLAS Collaboration washes out a R → SS signal. The suppresion is milder for R → V S and R → V V S. This might explain why the dijet mass distribution without this cut exhibits a bump already with 3.2 fb −1 [3], as already pointed out [4,29].
These results are quite independent of the heavy resonance mass, as long as the decay products are sufficiently boosted. This already happens for M R ≥ 1.5 TeV. And the results are applicable, at least qualitatively, to heavier stealth bosons and the decay S → ZA, as seen in appendix C.
As we have stressed, the hadronic signals of stealth bosons are hard to spot over the QCD background by using the standard discriminators specifically built for tagging W and Z vector bosons. Still, there are some differences in the jet substructure that can be exploited. As an example, we show in Fig. 14  We have not addressed the visibility of these signals in the semileptonic diboson searches. Still, a few comments are in order. For R → SS signals the leptonic decays are absent, and for R → V V S we expect a similar efficiency suppression for the semileptonic searches as it was found for V V X triboson signals in Ref. [4]. The only potentially visible leptonic signatures may arise for R → V S signals, which deserve a detailed study. For these final states the efficiency is generally expected to be smaller than for true diboson resonances, and with the particularity that the signal may pollute the control regions where the SM backgrounds from tt, W/Z plus jets, etc. are normalised, causing an unpredictable effect in the signal regions. This is quite a delicate study and falls out of the scope of this paper. Other signatures from the production of the light states S and A depend on their coupling to SM fermions -note that they do not couple to SM gauge bosons -and it is likely that with adequate model building the potential constraints can be evaded.
Our results provide guidance for new hadronic diboson searches. First, a better investigation of the nature of the recurrent excesses found is required. Second, and more importantly, a wider scope of new physics searches beyond the SM is compulsory, as the existing searches may nearly miss these more complex signatures. In particular, (i) generic anti-QCD jet taggers, which are sensitive not only to SM boosted particles but also non-SM ones, should be used; (ii) relaxing the mass window for at least one of the bosons would allow to investigate signals with non-SM bosons, in particular cascade decays such as in Eq. (2); (iii) in current searches, additional signal regions or analyses with looser requirements on τ 21 and D 2 should be also considered; (iv) triboson resonance searches should also be performed. In diboson resonance searches, a sufficiently large sample would allow to investigate the profile and characteristics of the deviations -and this implies that a further tightening of the boson tagging should be avoided.
Finally, let us stress that, beyond diboson resonance searches and the interpretation of anomalies, the stealth boson signals presented here provide a simple new physics case that highlights the limitations of current LHC searches. In order to broaden the sensitivity to new physics signatures, new tools and strategies are needed. First, generic taggers [40] that are sensitive to these non-standard boosted signals should be used, at least as an alternative to the dedicated ones. And, in parallel, new grooming algorithms (or variations of existing ones) that correctly recover the mass of multi-pronged objects should also be used.

A Test of the background shaping
In this appendix we investigate a possible shaping of the QCD background due to the CMS event selection criteria, using a high-statistics sample. We plot in Fig. 15 the dijet invariant mass distribution of the QCD background, after topological selection and also after W Z tagging, either defining HP jets as those with τ 21 ≤ 0.45 or τ 21 ≤ 0.35. From the 4.2×10 7 QCD events simulated, 2587283 events survive the topological selection. The number of events in the HP W Z selection is 7955 (1045) when τ 21 ≤ 0.45 (τ 21 ≤ 0.35) is used to define HP jets. By eye it can be observed that, within the available Monte Carlo statistics, no background shaping is produced. In particular, some tiny bumps in the distribution with the looser selection (blue line), e.g. in the 1.9 − 2.0 TeV bin, are not present in the distribution with the harsher selection (black line). A further check can be done by taking the dijet mass distributions predicted by simulation as pseudo-data, to perform a likelihood test for the presence of bumps over a smooth function parameterised as [2] This allows to test whether this functional form adequately parameterises the background, in particular. The precise details of the procedure applied (which is the standard one to obtain upper limits on a possible narrow resonance signal) are described in Ref. [25]. We  Fig. 15 after selection as pseudo-data. normalise these pseudo-data samples to have the same number of events in the 1.9 − 2.0 TeV bin as the best-fit background prediction in Ref. [6] with 35.9 fb −1 (7.9 events).
The expected and observed limits on a possible signal, taking the distributions after selection in Fig. 15 (blue and black lines) as pseudo-data, are presented in Fig. 16. As usual, the dashed lines and green, yellow bands correspond to the expected limits and the 1σ and 2σ variation, respectively. Together with the observed limits (solid black lines) we include a simple estimation of their uncertainty arising from the Monte Carlo finite statistics (dotted lines). 3 For the looser selection (left panel) we see that the dijet mass distribution after simulation is well described by the functional form (3). For the harsher selection (right panel) we observe some distortions caused by the finite Monte Carlo statistics. For example, in the 1.4 − 1.5 TeV bin the MC prediction is of 244 ± 32 events, with a Monte Carlo uncertainty that is twice larger than the statistical uncertainty. Taking into account the Monte Carlo uncertainty, estimated by the dotted lines, we can see that the variations in the observed limit with respect to the expected one are precisely due to the Monte Carlo statistics, and significant deviations with respect to the assumed functional form (3) are not observed either.
Finally, let us note that our simulation of the background only includes QCD-mediated dijet production, and not electroweak production of V +jets. The inclusion of the latter 3 The uncertainty is obtained by varying the data in a single bin within 1σ of the estimated Monte Carlo uncertainty, keeping the data in the rest of the bins fixed at their central values, and performing the maximum likelihood fit. This uncertainty does not capture the situation where adjacent bins fluctuate in the same direction, in which case the deviation in the observed limits is larger. Thus, the uncertainty is actually underestimated, though for our purposes this approximate assessment suffices.
will likely not change our conclusions, because the V j invariant mass distribution is also smoothly decreasing at TeV-scale masses, and amounts to a small fraction of the overwhelming QCD background. A shaping of this (smaller) background caused by the jet grooming and event selection, on a final state with a vector boson plus one or more jets, seems unlikely but cannot be discarded with the simulations performed here on a final state with two jets.

B Grooming of multi-pronged jets
Grooming algorithms are designed to remove from hadronic jets the contamination that arises from pile-up, initial state interactions and multiple interactions, trying to recover the mass of the particle originating the jet from the measured jet mass. Although the different algorithms are very useful for W and Z bosons, they are not adequate in general for multi-pronged boosted particles such as stealth bosons. As examples of grooming algorithms, we consider here 1. The soft-drop algorithm [14], which starts from all original constituents of the jet reclustered with the Cambridge-Aachen (CA) algorithm [41] and iteratively breaks the jet into two subjets. If the subjets pass the soft-drop condition [14] with z cut = 0.1, β = 0, then the jet is considered as the groomed jet, otherwise the procedure is applied again on the hardest of the two subjets.
2. Jet trimming [12] which reclusters the large-R jet constituents using the anti-k T algorithm with R = 0.2 and dropping any of the sub-jets with p T less than f cut = 0.05 of the original jet p T .
3. Jet pruning [13] which starts from all original constituents of the jet and discards soft recombinations after each step of the CA algorithm. Given two subjets, if their recombination is considered as soft, taking the parameters z cut = 0.1, R cut = 0.5, the softer subjet is discarded.
We present in Fig. 17 the jet masses for the R → SS signals in section 2, for AK8 jets and with the topology cuts of the CMS analyses. We also show the plain jet mass for comparison. As we can see, for M A = 40 GeV (top panels) the jet mass distribution is considerably spoiled by the grooming: the mass peak in the plain mass distribution is not sharpened but, on the contrary, it is transformed into a wide bump. For M A = 20 GeV the substructure of the jets is more two-prong-like (as it was also seen in section 2 by considering the τ 21 variable) and the groomed mass is closer to the plain mass. It is interesting to note that a small mass peak appears at 20 GeV, when the grooming algorithms completely remove one of the S → AA decay products -see also appendix C for the analogous case of heavier stealth bosons. For wider jets with R = 1.0 the results are alike, see Fig. 18. The wider jet radius makes the jets catch a larger amount of contamination, and this is reflected in a larger high-mass tail of the plain mass distribution. This contamination is removed by the grooming but, at the same time, the peaks are distorted, especially for M A = 40 GeV, for which the structure of the jets departs more from a two-pronged decay. These results highlight the need to tune a grooming algorithm that correctly removes contamination from multi-pronged non-SM boosted objects, which is beyond the scope of the present work.

C Heavier stealth bosons
Throughout this work we have studied stealth bosons with a mass M S = 100 GeV, which we have selected close to the W and Z masses, in order to investigate how the signature of these elusive particles would appear in diboson resonance searches. From our results, it is easy to realise that heavier boosted particles also giving rise to a four-pronged jet will share the same behaviour in which regards the jet substructure. (See also Ref. [40].) Here we only address, for completeness, the decay chain in Eq. (2) where the jet grooming may yield a jet mass close to the Z mass. Heavier stealth bosons could be produced in pairs, or associated to one or two weak bosons, in much the same way as the lighter ones studied in sections 2, 3 and 4, respectively. For brevity we will not repeat all the analyses in those sections for this case, but will restrict ourselves to the CMS event selection and show that the decay chain of Eq. (2) can produce jets with a groomed mass m J ∼ M V and whose substructure is not seen as two-pronged by the τ 21 discriminator.
We consider Z ′ → SZ, in which we select the decay Z → νν in order to clearly identify the fat jet from the stealth boson decay, and S → ZA. As before, we set M Z ′ = 2 TeV and for the heavier stealth boson we select a mass M S = 200 GeV. The analysis is done applying the event selection criteria of CMS diboson searches detailed in section 2.
The leading jet mass, without grooming and after application of various grooming algorithms, is presented in Fig. 19 for M A = 40 GeV (top, left) and M A = 20 GeV (top, right). We first notice that the plain jet mass has a low-mass tail, originated because sometimes the AK8 jets do not contain all the S decay products. With a wider jet radius R = 1.0 (bottom panels) this effect is softened. Of course, for a larger M Z ′ the fat jets would be more collimated, but for better comparison with the results in the rest of the paper we have kept a 2 TeV heavy resonance mass. From these plots we again observe that, as seen for M S = 100 GeV in the previous appendix, the grooming procedure significantly modifies the jet mass distribution. For soft drop and pruning, the appearance of a peak at the Z mass, nearly as high as the peak at M S , is remarkable. This happens when the grooming removes all the decay products of A from the jet. Smaller peaks at M A are also visible. Therefore, we see that a heavier particle of twice the Z boson mass can often yield a jet with groomed mass close to the weak boson masses.
The τ 21 variable of events passing the jet mass cut m J ∈ [65, 105] GeV is presented in Fig. 20. The distribution for fat jets from S → ZA with M A = 40 GeV is quite close to that of QCD events, while for M A = 20 GeV it is slightly shifted towards smaller τ 21 values. Therefore, we see that these particles behave as stealth bosons, when one considers variables measuring the two-pronged structure. Without the jet mass cut, the distributions are found to be comparable.
In summary, the decay chain in Eq. (2) would lead to the very conspicuous signal of a heavier new particle of a few hundreds of GeV, with hadronic four-pronged decay -then giving a jet with a substructure quite different from that of W , Z bosons -but with a groomed jet mass that often is close to the weak boson masses. Obviously, such a signal would be penalised in diboson resonance searches, and removing the jet mass cut would lead to an enhancement of signal, and also of the background. Semi-leptonic signals of these particles, produced when the Z boson decays leptonically, are likely to be highly suppressed by the isolation requirement on charged leptons, as these are very close to the hadronic decay products of A. This type of signal constitutes another new physics case that calls for novel tools that can correctly identify non-SM boosted jets.