1 Introduction

Naturalness of the Higgs sector as well as the weakly interacting massive particle (WIMP) paradigm for dark matter provide strong motivations for new physics at the TeV scale. The most commonly studied extensions of the standard model (SM) that attempt to solve the hierarchy problem do so by positing the existence of partners to the SM particles that cancel divergent contributions to the Higgs mass. Many of these scenarios also provide a dark matter candidate since they incorporate a parity symmetry under which the partner particles are odd, making the lightest partner particle stable. Arguably the best known example for such scenarios is the minimal supersymmetric extension of the SM, the MSSM.

The collider phenomenology of these scenarios has been studied extensively in the literature. The most promising discovery channels include production of colored partners, which then decay, often in multiple stages, until the lightest partner is reached. Since the lightest partner is assumed to constitute dark matter, it leaves the detector without interacting. Thus no resonances can be constructed from the visible decay products and discovery as well as mass measurement prospects often rely on endpoints of one-dimensional distributions of Lorentz-invariant (e.g. edges, endpoints) [1,2,3,4,5,6,7,8,9,10,11,12] (for a comprehensive review, see [13]) or boost-invariant (e.g. \(m_{T}\), \(m_{T2}\)) variables [10, 14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38].

The Large Hadron Collider (LHC) has completed its 7 and 8 TeV runs and is currently running with a center of mass energy of 13 TeV. The LHC experiments currently do not have significant indications of physics beyond the SM. Considering that the center of mass energy is already near the design value, one needs to take seriously the possibility that if new physics is discovered by the LHC experiments, the signal statistics will remain low, or moderate at best. Therefore, it will be of paramount importance to optimize the methods by which the signal will be studied for low statistics.

Let us consider mass measurement techniques in particular. For cascade decay chains with sufficiently many intermediate on-shell stages, polynomial methods [39,40,41,42,43,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59] can be applied to algebraically solve for all unknown masses based on a small number of events. However, there exist decay chains which do not have sufficiently many on-shell stages for these methods to be applicable. For such decay chains, the one-dimensional variables mentioned above are commonly accepted as the tool to be used for mass measurements. It was argued in Ref. [60], however, that when there are more than two visible particles in the final state, the kinematically accessible region in phase space is multidimensional and the commonly used one-dimensional variables are inefficient at low statistics. It was demonstrated specifically for final states with three visible particles and one invisible particle that the density of events near the boundary of the kinematically accessible phase space is enhanced, and that a determination of this boundary in the multidimensional phase space could yield significantly higher precision and accuracy for mass measurements at low statistics.

In this paper we will extend the conclusions of Ref. [60] to the remaining cascade decay topologies where polynomial methods are not applicable. If all on-shell decay stages are 2 or 3-body decays with one invisible particle emitted from the last stage of the cascade, then it is straightforward to show that any cascade decay with more than five final state particles can be analyzed using polynomial methods, therefore we will restrict ourselves to final states with at most five final state particles. We will show that the enhancement in the density of events near the boundary is in fact even stronger for five-body decays compared to four-body decays, and in a number of representative cases for decay topologies we will demonstrate the improvement for mass measurements compared to the more traditional methods based on kinematic edges or endpoints.

Various techniques involving kinematic variables have also been proposed for the purpose of determining decay topologies [61,62,63,64,65,66,67]. We will provide a preliminary assessment of the sensitivity of the full phase space boundary method to the topology, and suggest an algorithm by which the topology underlying a signal sample should be determined.

Our goal will be to provide a proof of principle that these improvements can be obtained, and therefore as in Ref. [60] we will compare our methods to those based on kinematic endpoints under ideal circumstances, without SM or combinatorial backgrounds, spin correlations or realistic detector effects. While these certainly pose additional challenges in the construction of a fully realistic analysis, they will deteriorate the results of both our methods and any method based on kinematic endpoints, with no obvious reason why one should be more negatively affected than the other. Also, as in [60] we will restrict our study to “one-sided” events, where the cascade decay takes place on one side of the event, and the other side is assumed to include only the lightest partner. This corresponds to scenarios such as gluino-LSP associated production in the MSSM. The reason for this choice is that our methods use only Lorentz-invariant observables and are therefore used on one decay chain at a time, with no obvious way to combine the two sides of the event using the missing transverse energy (MET) for example. Therefore, for reasons of simplicity, we demonstrate the applicability of our methods in the simplest possible case of one-sided events. The same methods can of course simply be used twice in a symmetric event, but that comes at the cost of combinatoric issues such as identifying which side of the event any final state particle belongs to. We will leave a more realistic study including all these complications to future work. In fact, in parallel to this work, methods are already being developed to address some of these complications, and for one decay topology featured in Ref. [60] it has already been demonstrated [68,69,70] that the improvement for mass measurements based on the determination of the full phase space boundary over one-dimensional variables can be maintained in the presence of SM and combinatorial backgrounds, by using Voronoi tessellations.

The layout of the paper is as follows. In Sect. 2 we review the mathematical description of many-body phase space and we quantify the enhancement near the boundary for five-body final states. In Sect. 3 we focus on mass measurements and we set up an analysis to compare the results of mass measurement based on our methods to those obtained from kinematic endpoints. In Sect. 4 we comment on the potential use of our methods for determining the underlying decay topology. We conclude in Sect. 5. Certain details of our methods are more fully described in appendices A through C.

2 Mathematical description of many-body phase space

The standard form of the phase space volume element of n final state particles with 4-momenta \(p_{i}^{\mu }\) and total 4-momentum \(P_{\mu }\)

$$\begin{aligned} \mathrm{d}PS_n= & {} \left( \prod _{i=1}^n \frac{\mathrm{d}^4 p_i}{(2\pi )^3} \delta (p_i^2 - m_i^2) \right) (2\pi )^4 \delta ^4\left( \sum _{i=1}^n p_i^\mu - P^\mu \right) \nonumber \\= & {} \left( \prod _{i=1}^n \frac{\mathrm{d}^3 \mathbf p_i}{(2\pi )^3 2E_i} \right) (2\pi )^4 \delta ^4\left( \sum _{i=1}^n p_i^\mu - P^\mu \right) \end{aligned}$$
(1)

is expressed as a function of individual components of 4-momenta which are not manifestly Lorentz invariant. There also exists a less well-known formulation which is expressed purely in terms of Lorentz scalars [71, 72]. As argued in [60] this form contains important clues to optimizing the sensitivity of mass measurements, therefore we will review it below.

We start by defining \(M_n\) as the \(n\times n\) matrix with elements \(p_{i}\cdot p_{j}\), and define \(\Delta _{i}\) as the coefficients of the characteristic polynomial of \(M_{n}\) as follows:

$$\begin{aligned} \mathrm{Det}\left[ \lambda 1_{n\times n}-M_{n}\right] \equiv \lambda ^{n}-\left( \sum _{i=1}^{n}\Delta _{i}\lambda ^{n-i}\right) . \end{aligned}$$
(2)

The kinematically accessible region of phase space corresponds to \(\Delta _{1,2,3}>0\), \(\Delta _{4}\ge 0\) and \(\Delta _{5,\ldots ,n}=0\), with \(\Delta _{4}=0\) defining the boundary of this region [72]. For the specific case of \(n=4\), the volume element is given by

$$\begin{aligned} \mathrm{d}PS_4&=\, (\mathrm{const.})\times M_X^{-2} \left( \prod _{i<j}dm^2_{ij}\right) \Delta _4^{-1/2}\Theta (\Delta _4) \nonumber \\&\quad \quad \quad \quad \quad \quad \quad \times \delta \left( \sum _{i<j} m^2_{ij} - \mathrm{const.}\right) , \end{aligned}$$
(3)

where \(M_{X}^{2}=P_{\mu }P^{\mu }\) and where the \(\delta \)-function at the end enforces energy conservation. Note that the volume element scales as \(\Delta _4^{-1/2}\), diverging near the boundary in an integrable way. This can be understood as follows: \(\Delta _4\), which for \(n=4\) is equal to \((-\det M_4)\) can be rewritten as \(-\det (V^T g V) = \det ^2 V\), where V is the \(4\times 4\) matrix whose columns are the \(p_{i}^{\mu }\) and \(g=\text {diag}(1,-1,-1,-1)\) is the metric. This makes it clear that the boundary of the kinematically accessible region corresponds to the final state momenta becoming linearly dependent. When this happens, the coordinate change from Cartesian coordinates to the Lorentz-invariant coordinates \(m_{ij}^{2}\) becomes singular and the Jacobian diverges. Note also that the presence of intermediate on-shell particles in the cascade does not change this conclusion, since in the narrow width approximation, these contribute \(\delta \)-functions to the amplitude squared \(|{\mathcal M}|^{2}\), the arguments of which are linear in the \(m_{ij}^{2}\). Therefore, using these \(\delta \)-functions to eliminate some of the integrals over \(m_{ij}^{2}\) never produces nontrivial Jacobian factors.

Going beyond \(n=4\), the phase space volume element has the form [72]

$$\begin{aligned} \mathrm{d}PS_n&= (\mathrm{const.}) \times M_X^{-2}\,\left( \prod _{i<j}\mathrm{d}m^2_{ij}\right) \Delta _4^{(n-5)/2}\Theta (\Delta _4)\nonumber \\&\quad \times \delta (\Delta _5)\cdots \delta (\Delta _n) \delta \left( \sum _{i<j} m_{ij}^{2} - \mathrm{const.}\right) . \end{aligned}$$
(4)

Naively, this expression seems to imply that the enhancement in the volume element near the boundary is absent for \(n > 4\). However, a more careful examination reveals that the arguments of the \(\delta (\Delta _n)\) factors are non-linear in the \(m_{ij}^2\), and therefore nontrivial Jacobians arise as those \(\delta \)-functions are integrated over.

Fig. 1
figure 1

The density of Monte Carlo events arising from a 5-body decay along a direction perpendicular to the boundary of the kinematically accessible region. The red line visually illustrates a scaling law of \(\Delta _{4}^{-1}\)

In order to isolate the scaling of the volume element near the boundary, an alternative expression can be used [72], which locally takes the form

$$\begin{aligned} \mathrm{d}PS_n&= (\mathrm{const.}) \times M_X^{-2}\,\left( \prod _{i<j}\mathrm{d}m^2_{ij}\right) \Delta _4^{-(n-3)/2} \nonumber \\&\quad \times \left( \prod _{\alpha \le \beta } \delta (e_{\alpha \beta })\right) \delta \left( \sum _{i<j} m_{ij}^{2} - \mathrm{const.}\right) . \end{aligned}$$
(5)

Here the \(e_{\alpha \beta }\), where \(1\le \alpha \le \beta \le n-4\), are a set of \((n-4)(n-3)/2\) constraints that are linear in all \(m_{ij}^{2}\) to first order. The form of the \(e_{\alpha \beta }\) are complicated, which makes this expression less useful for practical purposes. However, since no nontrivial Jacobians arise, the correct scaling \(\Delta _4^{-(n-3)/2}\) is revealed, which results in a stronger and stronger enhancement near the boundary with increasing n. In particular, for \(n=5\) the volume element scales as \(\Delta _4^{-1}\). This can be understood in a similar way to our argument above for \(n=4\): as we approach the three-dimensional boundary of phase space, a larger number of 4-vectors must become linearly dependent, and the coordinate change from Cartesian coordinates to the \(m_{ij}^{2}\) becomes more singular.

We have also verified the scaling for \(n=5\) numerically by generating Monte Carlo data for 5-body decays. Specifically, after restricting ourselves to the physical hypersurface specified by the \(\Delta _{5}=0\) constraint, we have sampled the density of Monte Carlo events in a narrow tube perpendicular to the boundary near randomly chosen points on the boundary. Our results are shown in Fig. 1 and they confirm the \(\Delta _{4}^{-1}\) scaling, demonstrated visually by the red line.

3 Mass determination

In this section we will compare the results of mass measurements obtained by using the multidimensional nature of the kinematically accessible region in phase space to those obtained from the traditional kinematic edges and endpoints. In order to perform this comparison, we introduce quality-of-fit functions, to be described below, for the two methods, and we search for the spectrum that results in the best fit, using Monte Carlo samples of 100 events each for several decay topologies. By finding the best-fit spectrum over many samples, and studying the distribution of the best-fit masses, we can evaluate the precision and accuracy of the two techniques. We do this by studying representative benchmark spectra in the decay topologies of interest. Our setup is similar to the analysis performed in Ref. [60], where final states with three visible particles and one invisible particle were considered. In this paper we extend this to final states with four visible particles and one invisible particle. We use a shorthand notation to classify the topologies of interest by using the multiplicity of final states in each stage of the cascade: for instance “2 + 2 + 3” denotes a decay topology where the initial state decays through a 2-body decay, the resulting intermediate particle decays through another 2-body decay, and the intermediate particle resulting from the second stage decays through a 3-body decay, where the final state of the last decay stage includes the lightest partner particle.

We do not consider the 2 + 2 + 2 + 2 topology since it has sufficiently many on-shell intermediate particles to be analyzed by polynomial methods. The four on-shell constraints for the intermediate particles together with the 5-body constraint \(\Delta _5=0\) are sufficient to restrict the likelihood function to have support on a set of measure zero in the space of mass spectra. Therefore, the true spectrum can be determined with a finite number of events. The 2 + 2 + 3, 2 + 3 + 2, and 3 + 2 + 2 topologies are very similar, and therefore we will study the 2 + 2 + 3 topology as a representative case. We also study the 3 + 3 topology which is inequivalent to those. We do not consider decay topologies involving a direct 4-body decay. The endpoint formulas in certain topologies have different analytical forms in different regions of the space of spectra (see Appendix A). Some forms are functions of mass differences only and cannot contribute to a determination of the overall mass scale, while others do contain some absolute scale dependence. In order to gauge the performance of the endpoint method in a more representative way, we pick two benchmark mass spectra for the 2 + 2 + 3 topology. The benchmark mass spectra are listed in Table 1. In particular, benchmark spectrum 2 is expected to be less sensitive to the overall mass scale, as both the \(m_{1234}^{2}\) and the \(m_{234}^{2}\) endpoints depend only on mass differences (see the last lines of Eqs. 24 and 25), while the endpoint formulas for benchmark spectrum 1 do have some dependence on the overall mass scale (line 2 of Eqs. 24 and the first line of Eq. 25).

Table 1 Benchmark mass spectra used in our analysis. For the labeling of the masses in the spectra, see Figs. 7 and 8 in Appendix A

The Monte Carlo events are generated using the phase space routines in ROOT [73]. We also use the optimization routines in ROOT to find the best-fit spectrum. We assume that the underlying decay topology is known; we will comment on the question of determining the decay topology in the next section. We start the optimization procedure within a rectangular box in the space of spectra where each mass is varied by \(\pm 25\%\) of its correct value (for the multidimensional phase space method) or up to several TeV (for the endpoint method). We perform a random scan inside this box to find the best-fit spectrum. We then refine the best-fit spectrum using the simulated annealing algorithm.

As mentioned in the introduction, an important caveat in our methods is that all spin correlations are ignored, in other words we use isotropic decays in our Monte Carlo events, and in the quality-of-fit variables described below. Therefore, in the presence of spin correlations, the specific quality-of-fit variable described below for the multidimensional phase space method may develop biases. However, we reiterate that the main purpose of our study in this paper is to provide a proof of principle that multidimensional phase space methods can provide an improvement over kinematic endpoints for mass measurements. Fundamentally, all the information about the spectrum is encoded in the shape of the boundary of the kinematically accessible region in phase space, not in the distribution of events, which will have additional dependence on the matrix element. The ideal mass measurement analysis would therefore be based on finding the boundary alone, for example by using our methods combined with Voronoi tessellations as has already been done in Refs. [68,69,70]. Once the masses have been measured by using the boundary, more sophisticated methods such as matrix element matching can then be utilized for determining the spins of the particles in the decay chain. We proceed with the quality-of-fit variables below mainly to keep the comparison between the two methods as simple as possible for this initial study of the five-body decay chains. We leave a more realistic work incorporating tools such as Voronoi tessellations to future work.

3.1 Quality-of-fit variable for the kinematic endpoint method

We define the measured position of a kinematic endpoint as the highest value obtained for the observable in question within the data sample. We construct the quality-of-fit function to quantify the agreement between the measured endpoints and those predicted by the spectrum hypothesis:

$$\begin{aligned} Q = \Xi \sum _{i=\mathrm {endpoints}}\left( \frac{ \mathcal O_{i}^{\mathrm {predicted}} - \mathcal O_i^{\mathrm {measured}}}{\mathcal O_i^{\mathrm {measured}}}\right) ^2 \end{aligned}$$
(6)

where \(\Xi = 1\) if all measured endpoints occur at smaller (or equal) values than the predicted ones. If any one of the measured endpoints exceeds the predicted value, the mass hypothesis is rejected (\(\Xi \) is taken to be \(\infty \)). We consider all possible Lorentz-invariant endpoints, with pairs, triplets, etc. of visible final state particles. All endpoints used in our analysis and their predicted values are listed in appendix A. The best-fit mass hypothesis is the one that minimizes Q.

3.2 Quality-of-fit variable for the multidimensional phase space method

To quantify the quality-of-fit using the multidimensional phase space method, we introduce a likelihood function. In particular, let \({\mathcal L}(\{M_{i}\}|\mathrm{data})\) denote the likelihood for a hypothesis spectrum \(\{M_{i}\}\) given the data. By Bayes’ theorem, using a flat prior over spectra, this is proportional to \({\mathcal L}(\mathrm{data}|\{M_{i}\})\), the probability of obtaining the data from the underlying spectrum \(\{M_{i}\}\). This probability can be factored over the events in the data sample as

$$\begin{aligned} {\mathcal L}(\mathrm{data}|\{M_{i}\}) = \prod _\mathrm{events} {\mathcal L}_\mathrm{event}(\{m_{ij}^{2}\}|\{M_{i}\}), \end{aligned}$$
(7)

where \(\{m_{ij}^{2}\}\) denote all Lorentz-invariant observables in the event. The form of the \({\mathcal L}_\mathrm{event}\) factors and the details of their calculation are described in Appendix B. Ultimately, we bring the likelihood functions for each topology into a standard form,

$$\begin{aligned} \mathcal {L}_\mathrm{event} = \Theta [\mathcal {D}_1] \cdots \Theta [\mathcal {D}_m] \times \mathcal {N} \times \mathcal {F} \end{aligned}$$
(8)

where for any given decay topology, the \(\Theta [\mathcal {D}]\) factors encode the kinematically accessible region in phase space, \(\mathcal {F}\) contains all dependence on the hypothesis spectrum \(\{M_{i}\}\), and \(\mathcal {N}\) includes all remaining dependence on the observables in the events. Note that as in the setup for the kinematic endpoint method, spectra for which there exist events that fall outside the (hypothetical) kinematically accessible region are considered excluded. Since the phase space density becomes large near the boundary of the kinematically accessible region, the likelihood function favors spectra where as many events as possible lie near the boundary, with no events lying outside the boundary. The best-fit mass hypothesis is the one that maximizes \({\mathcal L}\) (to be more precise, we use the logarithm of \({\mathcal L}\)).

3.3 Analysis and results

As mentioned above, kinematic endpoint methods are generically much more sensitive to mass differences in the spectrum than to the overall mass scale, parameterized e.g. by the mass of the lightest partner. Therefore, when the statistical distribution of best-fit values for the mass of any particle in the spectrum is considered, the spread in the distribution is dominated by the uncertainty in the overall scale. In order to better compare the performance of the two methods to the overall mass scale and to the mass gaps in the spectrum separately, it is preferable to find an alternative parametrization for spectra rather than using the masses of the individual particles. In particular, we parameterize the spectrum in terms of one parameter that sets the overall mass scale, and three other parameters that only depend on mass gaps.

Fig. 2
figure 2

Distribution of the best-fit values of \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \) (defined in Eq. 9) for the kinematic endpoint method (blue) and the multidimensional phase space method (yellow), using the first benchmark spectrum for the 2 + 2 + 3 topology and data samples of 100 events. The true masses correspond to \(\alpha \), \(\beta \), \(\gamma \), and \(\delta \) all being zero

For the 2 + 2 + 3 topology, we define the dimensionless parameters \(\{\alpha , \beta , \gamma ,\delta \}\) as

$$\begin{aligned} M_{i} =M_{i}^\mathrm{true}+ \left( \alpha \, V_\alpha + \beta \, V_\beta + \gamma \, V_\gamma + \delta \, V_\delta \right) _{i}\times (100~\mathrm{GeV}), \end{aligned}$$
(9)

where \(M_{1}\) parameterizes the (hypothetical) mass of the initial decaying particle, \(M_{4}\) parameterizes the (hypothetical) mass of the lightest partner, \(M_{i}^\mathrm{true}\) denote the benchmark mass values that were used to generate the Monte Carlo events, and the vectors V are defined as

$$\begin{aligned}&V_\alpha = \{1,1,1,1\},\end{aligned}$$
(10)
$$\begin{aligned}&V_\beta = \{1,-1,0,0\},\end{aligned}$$
(11)
$$\begin{aligned}&V_\gamma = \{1,1,-1,-1 \},\end{aligned}$$
(12)
$$\begin{aligned}&V_\delta = \{0,0,1,-1 \}. \end{aligned}$$
(13)

Thus the coordinate \(\alpha \) parametrizes the overall mass scale. The allowed ranges of \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \) are chosen such that the hierarchy of masses is preserved, and all masses remain positive.

Similarly, for the 3 + 3 topology we define \(\{\alpha , \beta , \gamma \}\) as

$$\begin{aligned} M_{i} = M_{i}^\mathrm{true} + \left( \alpha \, V_\alpha + \beta \, V_\beta + \gamma \, V_\gamma \right) _{i}\times (100~\mathrm{GeV}), \end{aligned}$$
(14)

where

$$\begin{aligned}&V_\alpha = \{1,1,1\} ,\end{aligned}$$
(15)
$$\begin{aligned}&V_\beta = \{0,1,-1\},\end{aligned}$$
(16)
$$\begin{aligned}&V_\gamma = \{2,-1,-1 \}. \end{aligned}$$
(17)

Again, \(\alpha \) parameterizes the overall mass scale, and similar consideration as above apply in choosing the allowed range for these parameters.

Fig. 3
figure 3

Distribution of the best-fit values of \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \) (defined in Eq. 9) for the kinematic endpoint method (blue) and the multidimensional phase space method (yellow), using the second benchmark spectrum for the 2 + 2 + 3 topology and data samples of 100 events. The true masses correspond to \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \) all being zero

Fig. 4
figure 4

Distribution of the best-fit values of \(\alpha \), \(\beta \) and \(\gamma \) (defined in Eq. 14) for the kinematic endpoint method (blue) and the multidimensional phase space method (yellow), using the benchmark spectrum for the 3 + 3 topology and data samples of 100 events. The true masses correspond to \(\alpha \), \(\beta \) and \(\gamma \) all being zero

Our results for the 2 + 2 + 3 topology are shown in Fig. 2 for benchmark spectrum 1, in Fig. 3 for benchmark spectrum 2. The results for the 3 + 3 topology is shown in Fig. 4. The mean value and standard deviation of the distributions for \(\alpha \), \(\beta \), \(\gamma \) and \(\delta \) are listed in Table 2. It is easy to see that the conclusions obtained for the four-body decay topologies [60] continue to hold, namely that the multidimensional phase space method yields both more precise and more accurate results for the overall mass scale as well as for the mass gaps. The reasons for the mean values of the distributions obtained by the kinematic endpoint method to be biased away from the correct masses is similar to those discussed in Appendix C of Ref. [60] for the four-body final states. Note also that, for the 2 + 2 + 3 topology, although the \(\alpha \) distribution for the kinematic endpoint method of benchmark spectrum 1, which was chosen to have lesser sensitivity on the overall scale, seems to be broader compared to benchmark spectrum 2, this is somewhat misleading. The lower end of the \(\alpha \) distribution for benchmark spectrum 2 is cut off by the constraint that all masses in the spectrum be positive numbers, which obscures the true spread in the distribution.

Table 2 The mean value and standard deviation of the distributions masses in the spectrum as well as of the parameters \(\alpha \), \(\beta \) etc. for the two benchmark spectra of the 2 + 2 + 3 topology, and for the benchmark spectrum of the 3 + 3 topology, for data samples of 100 events

4 Topology determination

In this section we will consider how different event topologies may be distinguished from one another by using the distribution of events in phase space. We will consider both 4-body and 5-body final states, since Ref. [60] did not consider the question of topology determination. In particular, let \(\{T_{i}\}\) be the set of event topologies that are compatible with the number of observed particles, with one invisible particle assumed to be produced in the last stage of the cascade. We will now write the likelihood function as \({\mathcal L}(T_{i},\{M_{i}\}|\mathrm{data})\), making the dependence on the topology explicit. As before, with a flat prior over topologies and spectra, the likelihood can be related to the probability of obtaining a given distribution of events from an underlying topology

$$\begin{aligned} {\mathcal L}(T_{i},\{M_{i}\}|\mathrm{data})\propto {\mathcal L}(\mathrm{data}|T_{i},\{M_{i}\}). \end{aligned}$$
(18)

We can now use the likelihood functions listed in Appendix B, in the standard form

$$\begin{aligned} {\mathcal L}(\mathrm{data}|T_{i},\{M_{i}\})=\prod _\mathrm{events}&\Theta [\mathcal {D}_1] \cdots \Theta [\mathcal {D}_m] \times \mathcal {N}(\{m_{ij}^{2}\}) \nonumber \\&\times \mathcal {F}(\{m_{ij}^{2}\},\{M_{i}\}). \end{aligned}$$
(19)

As for the analysis for mass measurements, we adopt \(\log {\mathcal L}\) as the quality of fit variable. Maximizing over spectra as before, statistical statements (such as exclusion with a given confidence level) can then be made about a number of possible topology hypotheses based on the data. We will not attempt to perform a detailed analysis of this type in this work, since the idealizations we work with, such as perfect energy resolution and the absence of backgrounds and combinatoric effects, would render the conclusions unreliable.

Nevertheless, one general conclusion can be drawn immediately: When a topology hypothesis \(\tilde{T}\) contains more on-shell particles than the “true” topology T, it can be ruled out (for any spectrum) with a very small number of events. Indeed, for the hypothesis \(\tilde{T}\), the optimization over mass spectra will be trying to enforce an on-shell constraint among the visible particles where no such constraint is actually obeyed by the data. In general, there is no reason for a constraint that appears to be satisfied by one event to also be satisfied by any other. Conversely, a choice for \(\tilde{T}\) that contains fewer on-shell particles than T, while it cannot be ruled out completely, will generally result in a significantly lower likelihood than when the correct topology hypothesis is used, since \(\tilde{T}\) will not provide a very good fit to the distribution of events in the data.

Let us demonstrate this on a specific example. The \(2+2+2\) topology with the spectrum (500, 350, 200, 100) GeV was used to generate Monte Carlo samples of 100 events each, and for the topology hypotheses \(2+2+2\), \(2+3\), \(3+2\) and 4, all possible spectra were scanned until the spectrum with the highest likelihood was found for each sample. Note that unlike in the analysis in Sect. 3, for an incorrect topology hypothesis there is no “correct” mass point to center the scan region on, therefore we scan the spectra over a larger region where each mass is varied between zero and several TeV. The distribution of the best-fit log-likelihood over samples for each topology hypothesis is shown in Fig. 5. In accordance with our expectations, the \(2+3\) and \(3+2\) topologies with fewer on-shell particles result in a poor fit, and the correct topology results in the highest likelihoods.

It should be noted that, for certain incorrect hypotheses, there exists a runaway direction in the space of spectra \(\{M_{i}\}\), namely the likelihood increases as all masses go to infinity with fixed mass gaps. This happens for instance when a direct 4-body decay topology hypothesis is used in the example above. In addition to being completely unphysical (which is why they are not plotted in Fig. 5), the likelihood values for this topology hypothesis in any case turn out to be smaller than for the other topology hypotheses. Runaway directions do not exist for the correct topology hypothesis, and therefore the presence of a runaway direction can be used to rule out a topology hypothesis.

Fig. 5
figure 5

For data samples of 100 events each generated with the spectrum (500, 350, 200, 100) GeV, the distribution of log-likelihood values for the \(2+2+2\), \(2+3\) and \(3+2\) topology hypotheses where the likelihood is maximized over spectra for each sample and each topology hypothesis

Based on the above considerations, a rather general conclusion can be reached that when analyzing a given data sample, the correct topology is among those hypotheses that have the highest number of on-shell particles and that are not immediately ruled out. If there is only one such hypothesis (\(2+2+2\) in the above example), then it must be the correct one. Things are more interesting when there are competing hypotheses with the same number of on-shell particles.

For the final states with three visible particles and one invisible particle, the following outcomes are therefore possible:

  • The data does not rule out the \(2+2+2\) topology hypothesis, which is then established as the correct one.

  • The data is not compatible with the \(2+2+2\) topology hypothesis but it is compatible with the \(2+3\) and \(3+2\) topology hypotheses. This is the only nontrivial case that can arise with this number of final state particles, and a statistical analysis would be needed to find the best-fit topology hypothesis. An example of this is demonstrated in Fig. 6, where the log-likelihood distributions are plotted for the two competing hypotheses for data samples of 100 events each, generated with the \(2+3\) topology and the spectrum (500, 350, 100) GeV. The log-likelihood distribution clearly favors the correct topology.

  • The data is only compatible with a direct 4-body decay hypothesis, which is then established as the correct topology.

Fig. 6
figure 6

For data samples of 100 events each generated with the spectrum (500, 350, 100) GeV and the \(2+3\) hypothesis, the distribution of log-likelihood values for the \(2+3\) and \(3+2\) topology hypotheses where the likelihood is maximized over spectra for each sample and each topology hypothesis

Similarly, for the final states with four visible particles and one invisible particle, the possible outcomes are:

  • The data is compatible with the \(2+2+2+2\) topology hypothesis, which consequently must be the correct one.

  • The data rules out the \(2+2+2+2\) topology hypothesis but it is compatible with the \(2+2+3\), \(2+3+2\), and \(3+2+2\) topology hypotheses. Since these have the same number of on-shell particles, a statistical analysis would need to be performed to determine the correct topology hypothesis. We have performed a numerical study of this scenario with samples of 100 events each, generated with the \(2+2+3\) topology and the spectrum (500, 350, 200, 100) GeV. The log-likelihood distribution not only favors the correct topology but in fact the incorrect topology hypotheses are ruled out completely since no spectrum can be found that is consistent with the data.

  • If the data is not compatible with any of the above, then the \(3+3\) topology hypothesis is the most likely fit, though technically \(4+2\) or \(2+4\) are also potential topology hypotheses since they have the same number of on-shell intermediate particles. It is rare for particles in beyond the SM scenarios to not have any 2-body or 3-body decay channels such that the dominant decay mode is a direct 4-body decay, but from a purely model-independent point of view this should not be discarded off-hand and a likelihood analysis should be performed as in the above examples.

  • While extremely unlikely from a theoretical point of view, there is also a possibility that none of the above topology hypotheses provide a good fit such that a direct 5-body decay topology hypothesis may need to be considered.

5 Conclusions

With the LHC already operating close to its design energy, it is not unreasonable to expect that even if new physics is discovered, the signal will not have high statistics. Earlier work [60] demonstrated that, for limited signal statistics, kinematic endpoints are inefficient for mass measurements in cascade decays with three visible particles and one invisible particle, and that a determination of the phase space boundary in its full dimensionality can lead to significant improvement. This conclusion was borne out further with a subsequent study with a more realistic analysis [70], using the method of Voronoi tessellations [68, 69] to find the boundary of the signal region in the presence of background. In this paper we explored additional decay topologies, including those with four visible particles and one invisible particle, and we have shown that the enhancement in the density of events near the boundary of the kinematically allowed region not only persists, but is even stronger. We have also demonstrated the improvement in mass measurements that can be obtained with these methods on several benchmark decay topologies, for which polynomial methods are not applicable. We have performed this comparison in a very idealized setup, mainly as a proof of principle; however, there is no reason to expect that in a more realistic analysis the results obtained by the methods presented in this paper should degrade more than traditional methods based on kinematic endpoints. As has already been done in the case of 4-body final states [70], it should be possible to verify whether our conclusions continue to hold using a more realistic analysis. Finally, we have explored the possibility of determining the underlying decay topology using our methods, and we concluded that at least in principle topology determination is achievable. The construction of a more realistic analysis both for mass measurements and for topology determination will be performed in future work.