1 Introduction

Small excesses around an invariant mass of 2 TeV appear in no less than five different searches for diboson resonances decaying hadronically, performed by the ATLAS and CMS Collaborations at the Large Hadron Collider (LHC), and with energies of 8 and 13 TeV. Although none of these excesses is statistically significant on its own, it is difficult to regard them as a mere coincidence. Yet, their interpretation as a background shaping or a new physics signal is difficult too.

The largest excess, of \(3.4\sigma \), was found by the ATLAS Collaboration in a search using the full Run 1 datset at 8 TeV [1], and sparked a great interest. The excess appeared in the (non-independent) samples tagged as WW, WZ and ZZ, but it was largest for the WZ event selection. Before, a mild excess had been found by the CMS Collaboration [2], though the maximum significance was below \(2\sigma \), and located at slightly smaller invariant masses. The first ATLAS analysis of Run 2 data with 3.2 fb\(^{-1}\) at 13 TeV [3], did not show any hint of an excess with the nominal WZ event selection but relaxing the boson tagging criteria for fat jets a third bump appeared, again around 2 TeV, in one of the control distributions (the dijet invariant mass distribution without an upper cut on the number of tracks per jet). The significance of this bump was later estimated to be of \(2.4\sigma \) [4]. When the analysis was updated with a luminosity of 15.5 fb\(^{-1}\) [5], a bump at 2 TeV appeared also with the nominal WZ selection, with a significance around \(2\sigma \). The fifth 2 TeV bump has appeared recently in the CMS search with the full 2016 dataset, using a luminosity of 35.9 fb\(^{-1}\) [6]. Its significance is around \(2\sigma \). No excess at this mass was observed in earlier analyses with 2.6 fb\(^{-1}\) [7] and 12.9 fb\(^{-1}\) [8]. A very recent analysis [9] by the ATLAS Collaboration with 36.7 fb\(^{-1}\) shows no excess at this invariant mass.

Fig. 1
figure 1

Monte Carlo predictions for the QCD dijet background, with the event selections for the ATLAS Run 1 (left) and Run 2 (right) hadronic diboson searches, from Ref. [4]. The black lines correspond to the full WZ boson tagging criteria described in Refs. [1, 3]: jet mass cuts, \(\sqrt{y}\) (at Run 1) or \(D_2\) (at Run 2) substructure tagging, and number of tracks per jet. For the blue, green and pink lines two out of these three tagging criteria are applied. The red lines correspond to the prediction without any boson tagging. The error bars in the points represent the Monte Carlo uncertainty

Before addressing other explanations, it is worthwhile discussing in detail the possibility that the 2 TeV bumps are caused by some shaping of the dijet invariant mass distribution of the SM background. It is well known (see for example Ref. [10]) that the various methods of jet grooming, such as mass-drop filtering [11], trimming [12], pruning [13] and soft drop [14] alter the shape of the mass distribution of jets resulting from quarks and gluons. For example, the jet trimming algorithm used by the ATLAS Collaboration in Run 2 searches [5] gives rise to a kink in the distribution at masses \(m_J \simeq 80\) GeV for jets of radius \(R=1.0\) and transverse momenta \(p_T = 1\) TeV. In this situation, it is conceivable that a jet mass selection around the W / Z masses could induce some feature in the background distribution around \(p_T = 1\) TeV, which could be reflected at dijet invariant masses \(m_{JJ} \sim 2\) TeV. On the other hand, the ATLAS Run 1 search uses mass-drop filtering, for which this kink is not present, and the CMS Collaboration uses jet pruning and soft drop in their Run 1 and Run 2 analyses, respectively. It is then questionable that the same background shaping would appear at 2 TeV in all these analyses, using not only different jet grooming methods but also different jet substructure tagging. An explicit calculation of the QCD dijet distribution with the ATLAS Run 1 and Run 2 jet tagging criteria [4] showed that a kink around \(m_{JJ} = 1.7\) TeV appears when the jet mass and tagging criteria are applied (see also Ref. [15]), as can be seen in Fig. 1. But it shows no hint of a bump around 2 TeV. As it will be shown in the following, the jet tagging criteria used by the CMS Collaboration in Run 2 searches do not produce a 2 TeV bump either. Consequently, in this work we will discard the possibility of a significant background shaping, although further studies on this issue are welcome.

Regarding new physics interpretations, it was early pointed out [16] that the \(3.4\sigma \) ATLAS excess at 8 TeV was unlikely to result from a VV diboson resonance, \(V=W,Z\), as the searches in the semi-leptonic final states [17, 18] did not exhibit any deviation from the standard model (SM) prediction. The proposal in Ref. [16] was of a VVX triboson resonance, that is, a resonance R undergoing a cascade decay \(R \rightarrow V Y \rightarrow VVX\) yielding two SM bosons plus an additional particle X, with Y being an intermediate resonance. The presence of an extra particle X would dramatically decrease the efficiency of such a signal in the searches in semi-leptonic modes, as it was later confirmed in Ref. [4] with a more detailed analysis. The reason is that the mentioned Run 1 searches in semi-leptonic modes, as well as the more recent ones with Run 2 data [19,20,21,22,23], are highly optimised for the kinematics of diboson resonances produced back-to-back in the transverse plane. Moreover, the event selection criteria often veto the presence of extra particles near the decay products of the boson with leptonic decay, which obviously dampens the efficiency for such a signal. As an alternative explanation for the lack of a positive signal in the semi-leptonic decay channels, it was proposed [24] that the two hadronically decaying ‘bosons’ identified as two fat jets are not actually massive SM gauge bosons but smuons of similar mass, decaying into two quarks via R-parity violating interactions, and the heavy 2 TeV resonance is a sneutrino. That is, the observed ‘bosons’ simply do not have leptonic decay modes.

In Ref. [25] it has been shown that a wide bump on a smoothly falling distribution that cannot be predicted from simulations is quite difficult to detect in narrow resonance searches: the bump can easily be absorbed in the background normalisation. This is indeed the case for diboson resonance searches in hadronic final states, where the background is obtained from a fit to data in the signal region, assuming some smooth functional form. This feature may partially explain why excesses are not seen in the searches that have a smaller dataset. Still, current new physics interpretations are undermined by an apparent inconsistency among the size of the excesses. For example, in Ref. [4] a variety of triboson signals that accommodated the ATLAS Run 1 excess was examined in order to give predictions for other analyses, in particular for the hadronic Run 2 CMS search with 2.6 fb\(^{-1}\). The predictions were compatible with the null experimental result but in the latest dataset [6], with a luminosity 13 times higher, a larger excess should have appeared. The same reckoning is expected to apply to a simple signal such as the one proposed in Ref. [24], with two particles from a resonance decay, each decaying into a quark pair. A possible solution to this puzzle is suggested by a comparison of the different hadronic searches, focusing not on the number of observed events, but rather on the expected QCD background at dijet invariant masses around 2 TeV, and its dependence on the boson tagging criteria [26]:

  • CMS analyses use a subjettiness ratio \(\tau _{21}\) [27] to quantify the likeliness that the jets have a two-pronged structure. For the so-defined ‘high purity’ (HP) jets, the criterion has strenghthened from \(\tau _{21} \le 0.5\) in the Run 1 analysis [2], to \(\tau _{21} \le 0.45\) [7], \(\tau _{21} \le 0.4\) [8], and \(\tau _{21} \le 0.35\) [6], as the dataset has increased. To give an example, the expected background near 2 TeV for the high purity dijet sample is similar in the latest analysis, with 35.9 fb\(^{-1}\), and the previous one with 12.9 fb\(^{-1}\). This clearly shows that the requirements on jets are much tighter.

  • For the ATLAS analyses the comparison between Run 1 and Run 2 is not easy, because in Run 1 a cut \(\sqrt{y} \ge 0.45\) on the y variable [11] measuring the subjet momentum balance is used, whereas in Run 2 it is replaced by a momentum-dependent cut on the so-called \(D_2\) function [28]. Nevertheless, by a naive parton luminosity scaling of the expected background around 2 TeV, it is seen that the boson tagging criteria for the Run 2 nominal selection [3] are around one order of magnitude stronger.Footnote 1 The latest ATLAS search [9] implements a new jet mass definition and a reoptimisation of the jet mass windows and \(D_2\) cut, whose details are not publicly available. We will not cover that analysis here.

Getting together the above arguments, namely (1) the persistence of the 2 TeV bumps, (2) the unlikely possibility of a background shaping, (3) the apparent inconsistency of the size of the bumps, an obvious question arises: May it be possible to have new physics signals giving fat jets whose substructure is background-like? If this were the case, those signals would be more and more suppressed as the jet tagging requirements are tightened, and we might have an explanation for that apparent inconsistency. To answer this question, which arises in the context of the 2 TeV anomaly but whose consequences go beyond the interpretation of potential LHC excesses, is the motivation of the present work. And, as we show in this paper, the answer is affirmative. The cascade decay of a very boosted particle S with a mass \(M_S \sim M_V\) into two lighter particles A that subsequently decay each into two quarks,

$$\begin{aligned} S \rightarrow A \, A \rightarrow q \bar{q} q \bar{q}, \end{aligned}$$
(1)

gives a single fat jet with a mass consistent with the W and Z masses. (The particles S and A can be for instance a new scalar \(H_1^0\) and pseudo-scalar \(A^0\), respectively, in models with an extended scalar sector.) And, depending on the A mass \(M_A\), the fat jet originated in the decay of S may seem more background-like than the QCD background itself, when one considers jet subjettiness measures such as \(\tau _{21}\) or \(D_2\). Therefore, it is appropriate to denote these S bosons as ‘stealth bosons’. An alternative is to have a heavier S and with one of its decay products being a vector boson \(V = W,Z\). One theoretically well-motivated possibility is

$$\begin{aligned} S \rightarrow Z \, A \rightarrow q \bar{q} q \bar{q}, \end{aligned}$$
(2)

with \(M_S \gtrsim M_Z + M_A\) and the four quarks merging into a single jet. (Again, S and A can be the new scalar \(H_1^0\) and pseudo-scalar \(A^0\) in models with an extended Higgs sector.) Often the jet grooming algorithm will completely eliminate the decay products of A, so that the groomed jet mass may be close to the Z mass even if S is heavier.

Stealth bosons with a mass \(M_S \sim M_V\) and decaying as in Eq. (1) are the main focus of our analysis. The results obtained apply, at least qualitatively, to the decay chain of Eq. (2) as well. In Sect. 2 we consider the diboson-like decay of a heavy resonance R into two such stealth bosons, \(R \rightarrow SS\), and compare the resulting variables with those for a true diboson resonance \(Z' \rightarrow W W\). There, our statement that those signals can be more background-like than the QCD background will be apparent, with the explicit example of CMS and ATLAS diboson resonance searches. In Sect. 3 we study a diboson-like decay \(R \rightarrow V S\) of a resonance into a SM boson and a stealth boson, and in Sect. 4 a triboson-like cascade decay \(R \rightarrow V Y \rightarrow V V S\), with Y an intermediate resonance. As an example of these cascade decays, as well as for the Monte Carlo simulations, we use the multiboson signals that arise in left–right models [29], but our results will not be limited to such specific examples. We discuss our results in Sect. 5. Although the novel ‘stealth boson’ signatures introduced in this work are motivated by the 2 TeV excesses, we point out that they are interesting on their own. Such signatures are relatively hard to spot over the QCD background with the standard diboson resonance searches, and therefore they are so far quite unexplored. Related details are discussed in three appendices. The possible shaping of the QCD background with the CMS event selection criteria is investigated in Appendix A. A side effect of the multi-pronged structure of stealth bosons is the fact that standard grooming algorithms often fail to recover the true stealth boson mass from the resulting fat jet. This issue is discussed in Appendix B, with a comparison of the results of different grooming algorithms. Appendix C is devoted to a brief exploration of the decay chain in Eq. (2).

2 Diboson-like \(R \rightarrow SS\) decays

The \(R \rightarrow SS\) decay depicted in Fig. 2 actually yields a quadriboson final state, but will resemble a diboson resonance if S is much lighter than R. Such a signal can arise in left–right models if CP is violated in the scalar sector, so that the CP-odd and CP-even states mix, or in simpler \(Z'\) extensions of the SM with additional scalars. We take in our simulations \(R = Z'\) with \(M_R = 2\) TeV, \(S = H_1^0\) with \(M_S = 100\) GeV and consider \(A = A^0 \rightarrow b \bar{b}\) decays in ‘high mass’ and ‘low mass’ benchmarks, with \(M_A = 40\) and 20 GeV, respectively. (As these two scalars do not couple to WW or ZZ, they can be quite light and yet evade current searches; moreover, their couplings to quarks can be small, so that they are mainly produced from the decay of heavier particles.) The relevant Lagrangian is implemented in Feynrules [30] in order to generate events with MadGraph5 [31] using the universal Feynrules output [32]. Event generation is followed by hadronisation and parton showering with Pythia 8 [33]. The detector response is simulated with Delphes 3.4 [34], and for the jet reconstruction and analysis FastJet 3.2 [35] is used. The QCD dijet background is generated by slicing the phase space in dijet invariant mass intervals of 100 GeV between 700 and 3.5 TeV, generating \(1.5 \times 10^5\) events in each \(m_{JJ}\) interval and recombining these samples with a weight proportional to their cross section.

The CMS hadronic searches use jets with a large radius \(R=0.8\) reconstructed with the anti-\(k_T\) algorithm [36], referred to as AK8 jets, as vector boson candidates. A soft-drop grooming algorithm [14] is performed on the AK8 jets to eliminate contamination from initial state radiation, multiple interactions and pile-up. The most recent analysis selects events with two jets \(J_1\), \(J_2\) (ordered by transverse momentum) having pseudorapidities \(|\eta | \le 2.5\), pseudorapidity difference \(|\Delta \eta | \le 1.3\), transverse momentum \(p_T \ge 200\) GeV and invariant mass \(m_{JJ} \ge 1.05\) TeV. These kinematical criteria, which are usually referred to as ‘topological selection’, are quite similar for previous CMS analyses. Jets are considered as W-tagged if they satisfy a condition on \(\tau _{21}\) specified below and their mass is in the range 65–85 GeV, and Z-tagged if their mass is in the range 85–105 GeV.

The N-subjettiness ratio variable \(\tau _{21} = \tau _2^{(1)} / \tau _1^{(1)}\) of the plain (ungroomed) jets is used to enrich the sample with diboson candidates. A low-purity (LP) category is defined for jets with \(\tau _{21} \le 0.75\), and a HP category with \(\tau _{21} \le 0.45 , 0.4, 0.35\), depending on the particular analysis considered. Events are classified as HP if they have two HP jets, and LP if they have one HP and one LP jet. The \(\tau _{21}\) distributions for the leading and subleading jets for a \(Z' \rightarrow W W\) sample with \(M_{Z'} = 2\) TeV and the QCD background are shown in Fig. 3, for jets within the mass window \(65-105\) GeV and fulfiling the above acceptance criteria. (For the background, dijet invariant masses in the interval [1.7–2.3] TeV after simulation are considered, in order to compare with the signal on a similar kinematical range.) The distributions have a shape quite similar to the one obtained by the CMS Collaboration [6], also with a maximum in the 0.25–0.3 bin for the \(Z' \rightarrow WW\) signal and in the 0.5–0.55 bin for the QCD background. This, in particular, shows that the \(\tau _{21}\) variable is adequately modelled by the fast simulation. In contrast, the distributions for \(R \rightarrow SS\) lean towards higher values of \(\tau _{21}\) and are background-like, especially for \(M_A = 40\) GeV. These distributions are quite independent of the heavy resonance mass, which determines how boosted the stealth bosons are. We have generated \(R \rightarrow SS\) signals for additional masses \(M_R = 1.5,2.5\) TeV and show the results in Fig. 4, for \(M_A = 20\) GeV (left) and \(M_A = 40\) GeV (right). The normalised distributions for \(M_R = 1.5\), 2, 2.5 TeV are almost identical. On the other hand, the distributions do depend on the stealth boson mass itself, though this dependence is less interesting for the present study because diboson resonance searches fix jet mass windows around \(M_W\) or \(M_Z\).

Fig. 2
figure 2

Sample diagram for a \(R \rightarrow SS\) signal

Fig. 3
figure 3

N-subjettiness ratio \(\tau _{21}\) for \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background. Solid (dashed) lines correspond to the leading (subleading) jet

Fig. 4
figure 4

N-subjettiness ratio \(\tau _{21}\) for \(R \rightarrow SS\) signals with different heavy resonance masses. Solid (dashed) lines correspond to the leading (subleading) jet

The background-like behaviour of stealth bosons is reflected in a significant efficiency drop as the \(\tau _{21}\) cut is tightened. For example, for the HP WZ selection (two HP jets, one W-tagged and the other Z-tagged) the efficiencies for the WW diboson, QCD background and SS stealth diboson signals are reduced as follows:Footnote 2

$$\begin{aligned}&W W:&0.053 \rightarrow 0.024&( \times \; 0.46), \\&\text {QCD:}&4.6 \times 10^{-5} \rightarrow 5.9 \times 10^{-6}&( \times \; 0.13), \\&M_A = 20~\text {GeV}:&0.039 \rightarrow 0.0031&( \times \; 0.08), \end{aligned}$$

when strengthening the cut from \(\tau _{21} \le 0.45\) to \(\tau _{21} \le 0.35\). (For \(M_A = 40\) GeV the efficiency is essentially zero already for \(\tau _{21} \le 0.45\).) This is precisely the ‘anomalous’ behaviour anticipated for this type of signals in the introduction. Note that for the QCD background the absolute efficiency values are not meaningful because they depend on the parton-level sample. In particular the sample generated here has dijet masses \(m_{jj} \ge 700~\text {GeV}\), while a cut \(m_{JJ} \ge 1.05~\text {TeV}\) is applied in the topological selection. Instead, the relevant issue here is that while the signal-to-background ratios S / B and \(S/\sqrt{B}\) increase for a potential WW signal when strengthening the cut, they do not for a stealth boson signal.

Fig. 5
figure 5

Jet and dijet mass distributions for \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background for the CMS analysis

The jet mass distributions are also of interest. They are shown, after topological selection but before the boson tagging, in the top panels of Fig. 5, for the leading and subleading jets. While for the lower mass benchmark the boson masses are adequately reconstructed after the grooming, this is not the case for the higher mass, though a sizeable fraction of events still fall within the W or Z tagging mass windows (a comparison of the jet masses with different grooming algorithms is presented in Appendix B). The dijet invariant mass distribution before boson tagging is wider than for a true diboson resonance, and peaks at lower invariant masses, as shown in the bottom left panel of Fig. 5. After boson tagging (right panel), the distribution for \(M_A = 20\) GeV is still wide while for \(M_A = 40\) GeV the simulated signal does not pass the boson tagging. For this plot, we consider HP jets as those with \(\tau _{21} \le 0.45\). We point out that, as shown in Ref. [25], for a wide bump the location of the maximum deviation with respect to the background-only hypothesis is not located at the maximum of the ‘signal’ distribution, but may be shifted 100–200 GeV to higher dijet masses. Thus, these distributions would still give an apparent excess near 2 TeV.

We now study the features of the \(R \rightarrow SS\) signals under the ATLAS Run 2 analyses. The ATLAS Collaboration uses wider jets with radius \(R = 1.0\), reconstructed with the anti-\(k_T\) algorithm, and trimmed [12] to eliminate contamination. As topological selection, events must contain two fat jets, the leading one with \(p_T \ge 450\) GeV and the subleading one with \(p_T \ge 200\) GeV, both within \(|\eta | \le 2.0\) and with a small rapidity separation \(|\Delta y_{12}| \le 1.2\). The dijet invariant mass \(m_{JJ}\) must be larger than 1 TeV. At variance with the CMS analyses, a transverse momentum balance cut is here introduced, \((p_{T1}-p_{T2})/(p_{T1}+p_{T2}) < 0.15\).

Fig. 6
figure 6

\(D_2\) variable (left) and number of tracks (right) for \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background. Solid (dashed) lines correspond to the leading (subleading) jet

The \(D_2^{(\beta =1)}\) variable, abbreviated throughout this paper as \(D_2\), is used to characterise the two-pronged substructure, and a \(p_T\)-dependent upper cut is imposed [37], approximately \(D_2 \le 1 + 0.8 \, (p_T - 300) /1200\) with \(p_T\) in GeV. An upper cut \(N_\text {trk}< 30\) is also placed on the number of tracks with \(p_T \ge 0.5\) GeV in the plain jets, pointing to the primary vertex. Jets satisfying both criteria are tagged as W or Z candidates if their mass is within an interval of \(\pm 15\) GeV around the expected resonance peak. Notice that a jet can simultaneously be tagged as W and Z, therefore the WW, WZ and ZZ samples are not disjoint.

Fig. 7
figure 7

Jet and dijet mass distributions for \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background for the ATLAS analysis

We present the \(D_2\) distribution for the different signals and the background after topology and mass cuts in Fig. 6 (left). For \(Z' \rightarrow W W\) and the background the distributions are similar to the ones obtained by the ATLAS Collaboration [38], slightly shifted to smaller \(D_2\) values. For \(R \rightarrow SS\) the distributions are background-like, especially for \(M_A = 40\) GeV. This is expected from what we have observed for the analogous \(\tau _{21}\) variable used by the CMS Collaboration. The application of the \(D_2\) requirement on both jets has a 50% efficiency for the \(Z' \rightarrow W W\) diboson signal, 24% for \(R \rightarrow SS\) in the low-mass benchmark and only 6% in the high-mass benchmark.

The distribution of the number of tracks \(N_\text {trk}\) for the plain jets is shown in the right panel of Fig. 6. For a true diboson signal \(Z' \rightarrow W W\), our results are close to the ones obtained by the ATLAS Collaboration, with a slightly smaller number of tracks predicted by our simulation. For \(R \rightarrow SS\) with \(M_A = 20\) GeV, the distribution has a maximum at the \(25-26\) bin, close to the maximum of the QCD background. For \(M_A = 40\) GeV the signals have, on average, more tracks than the background itself. For the WZ selection, the requirement \(N_\text {trk}< 30\) reduces the efficiency of the signals and background as

$$\begin{aligned}&W W:&0.18 \rightarrow 0.16&( \times \; 0.89), \\&\text {QCD:}&6.2 \times 10^{-5} \rightarrow 2.0 \times 10^{-5}&( \times \; 0.32), \\&M_A = 20~\text {GeV}:&0.10 \rightarrow 0.065&( \times \; 0.65), \\&M_A = 40~\text {GeV}:&0.017 \rightarrow 0.0024&( \times \; 0.14). \end{aligned}$$

Again, the absolute value of the background efficiency is not relevant because it depends on the parton-level cuts in the sample; rather, the relative efficiency with and without the \(N_\text {trk}< 30\) requirement is the quantity of interest. We see that a requirement of a small number of tracks leads to an additional suppression of this type of signals, besides the one from the \(D_2\) tagging, which may even decrease the signal-to-background ratio.

The trimming algorithm used by the ATLAS Collaboration allows for a slightly better jet mass determination for stealth bosons than in the CMS analyses. We show in Fig. 7 (top) the masses of the leading and subleading jet after the topological selection. The shapes are less peaked than for W bosons. The dijet mass distributions on the bottom panels are very wide, and similar after topology cuts only (left) and with the full WZ tagging (right). In the latter case the corresponding plot for \(M_A = 40\) GeV is not shown because the signal is tiny.

Table 1 Efficiencies of \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background for the WZ selection in ATLAS hadronic diboson searches

To conclude this section, we compare in Table 1 the efficiencies for the WZ selection in the ATLAS Run 1 [1] and Run 2 [5] searches. The results are indeed eloquent. While for a true diboson signal the efficiency of the ATLAS Run 2 event selection is larger than in Run 1, for the \(R \rightarrow SS\) signals it is noticeably smaller – which might have motivated the disappearance of the excess in the first Run 2 result with the nominal selection, if it were due to a signal of this type. Of course, the precise numbers vary with \(M_A\), but the trend is the correct one. The comparison of these figures with the efficiency for the CMS selection is not meaningful because for CMS analyses the jet mass windows are narrower, and the events in the WW, WZ and WW categories, which are disjoint, are combined to obtain the limits on a potential signal. Besides, we point out that, as argued in the introduction from a naive parton luminosity scaling, the efficiency for the background is reduced by around an order of magnitude in the Run 2 selection with respect to Run 1.

3 Diboson-like \(R \rightarrow VS\) decays

We repeat the same procedure of the previous section for a \(R \rightarrow VS\) decay such as the one depicted in Fig. 8, which yields a triboson final state but with a diboson-like topology.

Such a signal can originate for example in left–right models from the decay \(W' \rightarrow W H_1^0\), with subsequent \(H_1^0 \rightarrow A^0 A^0\) decay [29], or also from \(Z' \rightarrow Z H_1^0\). We consider the former process, with \(S = H_1^0\), \(A=A^0\) and the same parameters as in Sect. 2. The results, for what concerns the hadronic diboson searches, would be analogous for a signal of the latter type with a Z boson in the final state.

For the CMS analysis the \(\tau _{21}\) distributions for the leading and subleading jets are separately shown in Fig. 9. These distributions are a combination of the ones for the W and S bosons in Fig. 3, but they are obviously correlated: always one of the two jets, the one corresponding to S, has a large \(\tau _{21}\). For this reason, it is illustrative to quantify the efficiency drop when one changes the \(\tau _{21}\) cut from 0.45 to 0.35,

$$\begin{aligned}&M_A = 20~\text {GeV}:&0.068 \rightarrow 0.014&( \times \; 0.2), \\&M_A = 40~\text {GeV}:&3.9 \times 10^{-3} \rightarrow 3.5 \times 10^{-4}&( \times \; 0.09). \end{aligned}$$
Fig. 8
figure 8

Sample diagram for a \(R \rightarrow VS \) signal

Fig. 9
figure 9

Distributions for the \(R \rightarrow VS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background in the CMS analysis. Top: N-subjettiness ratio \(\tau _{21}\); middle: jet mass distributions; bottom: dijet mass distributions

The masses of the leading and subleading jets are also shown in Fig. 9. The distributions are narrower than for a \(R \rightarrow SS\) decay, as may be expected because one of the decay products is a massive SM boson. The dijet invariant mass distributions are wide, peaking at invariant masses slightly lower than the resonance mass. The distributions after boson tagging are slightly sharper, comparable to a true diboson resonance.

For the ATLAS Run 2 analyses the \(D_2\) and \(N_\text {trk}\) boson tagging variables are shown in Fig. 10. For the leading jet the distributions are similar for the three signals, indicating that the leading jet in \(R \rightarrow VS\) is the SM vector boson in most cases. The \(D_2\) tagging requirement on both jets has an efficiency of 0.35 and 0.18 for \(M_A = 20\) and 40 GeV, respectively, larger than for the \(R \rightarrow SS\) signal. Here it is also interesting to see the effect of the \(N_\text {trk}< 30\) cut, which reduces the signal efficiencies for the WZ selection as follows:

$$\begin{aligned}&M_A = 20~\text {GeV}:&0.13 \rightarrow 0.096&( \times \; 0.74), \\&M_A = 40~\text {GeV}:&0.055 \rightarrow 0.020&( \times \; 0.36). \end{aligned}$$
Fig. 10
figure 10

Distributions for the \(R \rightarrow VS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background in the ATLAS analysis. Top: \(D_2\) variable; second row: number of tracks; third row: jet mass; bottom: dijet invariant mass

For this type of signals an upper cut on the number of tracks may be counterproductive too, especially for heavier A.

Besides the jet substructure differences found, the topology and jet mass of the \(R \rightarrow VS\) signal are not very different from a true diboson resonance, as can be seen by comparing the rest of distributions presented in Fig. 10, namely the masses of the leading and subleading jet after the topological selection, the dijet invariant mass after topological selection and also with the final WZ tagging. The distributions are less peaked than for a true diboson but still they are alike. Finally, we collect in Table 2 the efficiencies for the full WZ tagging in the ATLAS Run 1/Run 2 analyses. At variance with the results shown in the previous section, the efficiencies for both ATLAS analyses are comparable, but slightly smaller in Run 2. Such a signal might also accommodate the size of the excesses observed in the different searches.

4 Triboson-like \(R \rightarrow VVS\) decays

As our third example we consider a cascade decay \(R \rightarrow VY \rightarrow VVS\), with subsequent decay \(S \rightarrow AA\), as depicted in Fig. 11. This is a quadriboson signal but with a triboson-like topology.

There are two crucial differences with the previously seen resonance decays. First, there are already two SM bosons in the final state, which produce fat jets with a higher boson tagging efficiency. Second, the invariant mass of the two selected jets does not concentrate at a maximum near or below the resonance mass, but instead the dijet invariant mass distributions are broader [16]. An example of this cascade decay chain is \(W' \rightarrow Z H^\pm \rightarrow Z W^\pm H_1^0\) in left–right models [29]. This is the signal used in our simulations, taking \(Y = H^\pm \) with \(M_Y = 500\) GeV, \(S = H_1^0\), \(A = A^0 \rightarrow b \bar{b}\).

For this signal we collect the relevant distributions for the CMS analyses in Fig. 12. The \(\tau _{21}\) distribution for the leading jet is quite close to the one for a \(Z' \rightarrow W W\) diboson signal, while there are some differences for the subleading jet. (This indicates that the leading jet is the Z boson from the heavy resonance decay in most cases.) The efficiency for \(\tau _{21}\) and mass tagging does not change as dramatically as for the previous \(R \rightarrow SS\) and \(R \rightarrow VS\) examples,

$$\begin{aligned}&M_A = 20~\text {GeV}:&0.13 \rightarrow 0.053&( \times \; 0.42), \\&M_A = 40~\text {GeV}:&0.082 \rightarrow 0.039&( \times \; 0.47), \end{aligned}$$

that is, it decreases by nearly the same amount as for a \(Z' \rightarrow WW\) signal. The jet mass distributions are not very different from a diboson resonance either. Note that here the leading jet mass concentrates around \(M_Z\), as corresponds to the simulated signal. Sharp differences are found in the dijet invariant mass distributions in the bottom panels of Fig. 12. The \(m_{JJ}\) distribution does not display any resonance-like structure before the boson tagging (left) and a wide bump, similar to the ones found for triboson signals in Ref. [4], appears after boson tagging.

The distributions for the ATLAS Run 2 analyses are given in Fig. 13. The \(D_2\) variable for the leading and subleading jets in \(R \rightarrow VVS\) in the top panels are similar to the ones for the benchmark \(Z' \rightarrow WW\) signal, and the efficiency for the \(D_2\) cut in both jets is 0.4 for \(M_A = 20\) GeV and 0.33 for \(M_A = 40\) GeV, close to the value of 0.5 for a WW signal. The number of tracks per jet is slightly higher, as can be seen from the plots in the second row. For the WZ selection, the \(N_\text {trk}< 30\) cut reduces the signal efficiencies as follows:

$$\begin{aligned}&M_A = 20~\text {GeV}:&0.082 \rightarrow 0.063&( \times \; 0.77), \\&M_A = 40~\text {GeV}:&0.058 \rightarrow 0.042&( \times \; 0.72). \end{aligned}$$
Table 2 Efficiencies of \(R \rightarrow VS\) signals for the WZ selection in ATLAS hadronic diboson searches
Fig. 11
figure 11

Sample diagram for a \(R \rightarrow V_1 Y \rightarrow V_1 V_2 S \) signal

Fig. 12
figure 12

Distributions for the \(R \rightarrow VVS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background in the CMS analysis. Top: N-subjettiness ratio \(\tau _{21}\); middle: jet mass distributions; bottom: dijet mass distributions

Fig. 13
figure 13

Distributions for the \(R \rightarrow VVS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background in the ATLAS analysis. Top: \(D_2\) variable; second row: number of tracks; third row: jet mass; bottom: dijet invariant mass

That is, the signal suppression is higher than for a true diboson signal but not dramatic. The jet mass distributions are relatively narrow, and the dijet invariant mass distributions do not display a peak, even after the application of the full WZ selection. This contrasts with the behaviour of true triboson signals, for which the invariant mass distribution of the two leading jets is shaped to a peak by the application of topological and boson tagging cuts [4, 16]. The comparison of the efficiencies for ATLAS Run 1 and Run 2 analyses is given in Table 3. They are slightly smaller in Run 2. Overall, we find that except for some efficiency decrease when the boson tagging requirements are tightened, the VVS signals behave in much the same way as the triboson signals studied before [4].

5 Discussion

New physics interpretations of the ATLAS Run 1 diboson excess [1] can explain the absence of signals in the semi-leptonic channels, either because the signal is due to a VVX triboson resonance [16] or by assuming a particle with a mass close to \(M_W,M_Z\) and decaying into \(q \bar{q}\) only [24]. (Interpretations of this excess as a diboson resonance were early disfavoured and have already been excluded with the first Run 2 results.) Nevertheless, the main difficulty for these proposals is posed by the most recent ATLAS and CMS searches in the hadronic channels, some of which again exhibit some mild excesses at the same mass, but with sizes that are apparently inconsistent. Even taking into account that, as shown in Ref. [25], part of the excess may be absorbed by the normalisation of the background if statistics are small, one would expect more significant excesses in the large samples collected in Run 2.

The ‘stealth bosons’ S introduced in this paper, which decay into four quarks that merge into a single fat jet, can successfully address apparent inconsistencies among mild excesses in different searches like those. For stealth bosons decaying \(S \rightarrow AA\) we have found that:

  1. 1.

    For the \(R \rightarrow SS\) and \(R \rightarrow VS\) signals the boson tagging efficiency for CMS analyses drops when the cut on the \(\tau _{21}\) variable, used to measure the two-pronged structure of the jets, is tightened. This may explain why the latest CMS analysis with 35.9 fb\(^{-1}\) [6] does not observe a much larger excess, and the previous one with 12.9 fb\(^{-1}\) [8] had a small deviation at the \(1\sigma \) level.

  2. 2.

    For \(R \rightarrow SS\) especially, and to some extent for \(R \rightarrow VS\) and \(R \rightarrow VVS\) too, the efficiency for the ATLAS Run 2 event selection is smaller than at Run 1, explaining why the Run 2 excess with 15.5 fb\(^{-1}\) [5] was of only \(2\sigma \), and with 3.2 fb\(^{-1}\) the deviation was at the \(1\sigma \) level [3].

  3. 3.

    The upper cut on the number of tracks \(N_\text {trk}\) applied by the ATLAS Collaboration washes out a \(R \rightarrow SS\) signal. The suppresion is milder for \(R \rightarrow VS\) and \(R \rightarrow VVS\). This might explain why the dijet mass distribution without this cut exhibits a bump already with 3.2 fb\(^{-1}\) [3], as has already been pointed out [4, 29].

These results are quite independent of the heavy resonance mass, as long as the decay products are sufficiently boosted. This already happens for \(M_R \ge 1.5\) TeV. And the results are applicable, at least qualitatively, to heavier stealth bosons and the decay \(S \rightarrow ZA\), as seen in Appendix C.

Table 3 Efficiencies of \(R \rightarrow VS\) signals for the WZ selection in ATLAS hadronic diboson searches
Fig. 14
figure 14

Higher-order N-subjettiness ratios \(\tau _{nm} = \tau _n / \tau _m\) for \(R \rightarrow SS\) and \(Z' \rightarrow W W\) signals and the QCD dijet background. Solid (dashed) lines correspond to the leading (subleading) jet

As we have stressed, the hadronic signals of stealth bosons are hard to spot over the QCD background by using the standard discriminators specifically built for tagging W and Z vector bosons. Still, there are some differences in the jet substructure that can be exploited. As an example, we show in Fig. 14 several ratios \(\tau _{nm} = \tau _n / \tau _m\) for \(Z' \rightarrow WW\) and stealth boson signals and the QCD background. We observe that the distributions of \(\tau _{32}\) and \(\tau _{42}\) are different for \(M_A = 40\) GeV, which is the most difficult case, and the background. On the other hand, \(\tau _{31}\) and \(\tau _{41}\) are different for \(M_A = 20\) GeV and the background. With the use of a complete set of N-subjettiness observables [39] and a neural network multivariate analysis one can efficiently discriminate these and other multi-pronged signals against the QCD background [40].

We have not addressed the visibility of these signals in the semi-leptonic diboson searches. Still, a few comments are in order. For \(R \rightarrow SS\) signals the leptonic decays are absent, and for \(R \rightarrow VVS\) we expect a similar efficiency suppression for the semi-leptonic searches as it was found for VVX triboson signals in Ref. [4]. The only potentially visible leptonic signatures may arise for \(R \rightarrow VS\) signals, which deserve a detailed study. For these final states the efficiency is generally expected to be smaller than for true diboson resonances, and with the particularity that the signal may pollute the control regions where the SM backgrounds from \(t \bar{t}\), W / Z plus jets, etc. are normalised, causing an unpredictable effect in the signal regions. This is quite a delicate study and falls out of the scope of this paper. Other signatures from the production of the light states S and A depend on their coupling to SM fermions – note that they do not couple to SM gauge bosons – and it is likely that with adequate model building the potential constraints can be evaded.

Our results provide guidance for new hadronic diboson searches. First, a better investigation of the nature of the recurrent excesses found is required. Second, and more importantly, a wider scope of new physics searches beyond the SM is compulsory, as the existing searches may nearly miss these more complex signatures. In particular, (1) generic anti-QCD jet taggers, which are sensitive not only to SM boosted particles but also non-SM ones, should be used; (2) relaxing the mass window for at least one of the bosons would allow one to investigate signals with non-SM bosons, in particular cascade decays such as in Eq. (2); (3) in current searches, additional signal regions or analyses with looser requirements on \(\tau _{21}\) and \(D_2\) should also be considered; (4) triboson resonance searches should also be performed. In diboson resonance searches, a sufficiently large sample would allow to investigate the profile and characteristics of the deviations – and this implies that a further tightening of the boson tagging should be avoided.

Finally, let us stress that, beyond diboson resonance searches and the interpretation of anomalies, the stealth boson signals presented here provide a simple new physics case that highlights the limitations of current LHC searches. In order to broaden the sensitivity to new physics signatures, new tools and strategies are needed. First, generic taggers [40] that are sensitive to these non-standard boosted signals should be used, at least as an alternative to the dedicated ones. And, in parallel, new grooming algorithms (or variations of existing ones) that correctly recover the mass of multi-pronged objects should also be used.