The Full Event Interpretation

The full event interpretation is presented: a new exclusive tagging algorithm used by the high-energy physics experiment Belle II. The experimental setup of Belle II allows the precise measurement of otherwise inaccessible B meson decay modes. The Full Event Interpretation algorithm enables many of these measurements. The algorithm relies on machine learning to automatically identify plausible B meson decay chains based on the data recorded by the detector. Compared to similar algorithms employed by previous experiments, the Full Event Interpretation provides a greater efficiency, yielding a larger effective sample size usable in the measurement.


Introduction
The Belle II experiment is located at the SuperKEKB electron-positron collider in Tsukuba, Japan, and was commissioned in 2018. The experiment is designed to perform a wide range of high-precision measurements in all fields of heavy flavour physics; in particular, it will investigate the decay of B mesons [1]. For this purpose, the experiment is expected to record about 40 billion collision events, each containing an Υ(4S) resonance, which at least 96% of the time decays into exactly two B mesons (a BB̄ pair). Each B meson decays via various intermediate states into a set of final-state particles, which are considered stable in the Belle II detector. Broadly speaking, charged final-state particles are reconstructed as tracks in the central drift chamber and in the inner silicon-based vertex detectors, whereas neutral final-state particles are reconstructed as clusters in the electromagnetic calorimeter. The entire experimental setup of the detector and the collider is described in more detail in Doležal and Uno [1].
The measurement of the branching fraction of rare decays like B → τ ν, B → Kνν or B → νγ, with undetectable neutrinos in their final states, is challenging. However, the second B meson in each event can be used to constrain the allowed decay chains. This general idea is known as tagging. Conceptually, each Υ(4S) event is divided into two sides: the signal-side, containing the tracks and clusters compatible with the assumed decay of the signal meson B_sig the physicist is interested in, e.g. a rare decay like B → τ ν; and the tag-side, containing the remaining tracks and clusters compatible with an arbitrary B_tag meson decay. Figure 1 depicts this situation.
The initial four-momentum of the produced Υ(4S) resonance is precisely known and no additional particles are produced in this primary interaction. Therefore, because the relevant quantum numbers are conserved, knowledge about the properties of the tag-side B_tag meson allows one to recover information about the signal-side B_sig meson which would otherwise be inaccessible. Most importantly, all reconstructed tracks and clusters which are not assigned to the B_tag meson must be compatible with the signal decay of interest.
Ideally, a full reconstruction of the entire event has to take all reconstructed tracks and clusters into account to attain a correct interpretation of the measured data. The Full Event Interpretation (FEI) algorithm presented in this article is a new exclusive tagging algorithm developed for the Belle II experiment, embedded in the Belle II Analysis Software Framework (basf2) [2]. The FEI automatically constructs plausible B_tag meson decay chains compatible with the observed tracks and clusters, and calculates for each decay chain the probability of it correctly describing the true process. "Exclusive" refers to the reconstruction of a particle (here the B_tag) assuming an explicit decay channel.
Consequently, exclusive tagging reconstructs the B_tag independently of the B_sig using either hadronic or semileptonic B meson decay channels. The decay chain of the B_tag is explicitly reconstructed and therefore the assignment of tracks and clusters to the tag-side and signal-side is known.
In the case of a measurement of an exclusive branching fraction like B_sig → τ ν_τ, the entire decay chain of the Υ(4S) is known. Consequently, all tracks and clusters measured by the detector should be accounted for. In particular, the requirement of no additional tracks, besides the ones used for the reconstruction of the Υ(4S), is an extremely powerful and efficient way to remove most reducible background. This requirement is called the completeness constraint throughout this text.
In the case of a measurement of an inclusive branching fraction like B_sig → X_u ℓ ν, all remaining tracks and clusters, besides the ones used for the lepton and the B_tag meson, are identified with the X_u system. Hence, the branching fraction can be determined without explicitly assuming a decay chain for the X_u system.
The performance of an exclusive tagging algorithm depends on the tagging efficiency (i.e. the fraction of Υ(4S) events which can be tagged), the tag-side efficiency (i.e. the fraction of Υ(4S) events with a correct tag) and on the quality of the recovered information, which determines the tag-side purity (i.e. the fraction of the tagged Υ(4S) events with a correct tag) of the tagged events.
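The three figures of merit defined above are related: the tag-side efficiency is the product of the tagging efficiency and the tag-side purity. The following minimal sketch makes this explicit. The class name, function names and event counts are hypothetical and chosen purely for illustration; they are not part of the FEI software.

```python
from dataclasses import dataclass


@dataclass
class TagCounts:
    n_events: int   # all Y(4S) events considered
    n_tagged: int   # events in which a B_tag candidate was found
    n_correct: int  # tagged events in which the B_tag is correct


def tagging_efficiency(c: TagCounts) -> float:
    """Fraction of Y(4S) events which can be tagged."""
    return c.n_tagged / c.n_events


def tag_side_efficiency(c: TagCounts) -> float:
    """Fraction of Y(4S) events with a correct tag."""
    return c.n_correct / c.n_events


def tag_side_purity(c: TagCounts) -> float:
    """Fraction of tagged Y(4S) events with a correct tag."""
    return c.n_correct / c.n_tagged if c.n_tagged else 0.0


# Hypothetical counts, for illustration only.
counts = TagCounts(n_events=1_000_000, n_tagged=20_000, n_correct=5_000)
eff_tagging = tagging_efficiency(counts)
eff_tag_side = tag_side_efficiency(counts)
purity = tag_side_purity(counts)
print(eff_tagging, eff_tag_side, purity)
```

With these definitions, `eff_tag_side` equals `eff_tagging * purity` by construction.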
The exclusive tag typically provides a pure sample (i.e. purities up to 90% are possible). However, this approach suffers from a low tag-side efficiency of a few percent, since only a tiny fraction of the B decays can be explicitly reconstructed due to the large number of possible decay channels and their high multiplicity. The imperfect reconstruction efficiency of tracks and clusters further degrades the efficiency.
Both the quality of the recovered information and the systematic uncertainties depend on the decay channel of the B_tag; we therefore distinguish further between hadronic and semileptonic exclusive tagging.
Hadronic tagging considers only hadronic B decay chains for the tag-side [3, Section 7.4.1]. Hence, the four-momentum of the B_tag is well known and the tagged sample is very pure. A typical hadronic B decay has a branching fraction of O(10⁻³). As a consequence, hadronic tagging suffers from a low tag-side efficiency and can only be applied to a tiny fraction of the recorded events. Large combinatorics of high-multiplicity decay channels further complicate the reconstruction and require tight selection criteria.
Semileptonic tagging considers only semileptonic B → D ℓ ν and B → D* ℓ ν decay channels [3, Section 7.4.2]. Due to the presence of a high-momentum lepton, these decay channels can be easily identified, and semileptonic tagging usually yields a higher tag-side efficiency than hadronic tagging because of the large semileptonic branching fractions. On the other hand, the semileptonic tag inevitably lacks kinematic information because of the neutrino in the final state of the decay. Hence, the sample is not as pure as in the hadronic case.
To conclude, the FEI provides a hadronic and a semileptonic tag for B± and B⁰ mesons. This enables the measurement of exclusive decays with several neutrinos and of inclusive decays. In both cases the FEI provides an explicit tag-side decay chain with an associated probability.

Previous work
Previous experiments have already developed and successfully employed tagging algorithms. In order to compare the algorithms, the maximum achievable tag-side efficiency is of particular interest, because the tag-side efficiency is directly related to the signal selection efficiency of the measurement. On the other hand, the achievable tag-side purity is only of limited use, because the purity of the final selection used for the measurement is dominated by the completeness constraint. Hence, most of the incorrect tags can be easily discarded and the final purity depends strongly on the considered signal decay channel. Moreover, signal-side-independent receiver operating characteristics (ROC) are not available for most of the previously employed algorithms.
The BaBar experiment [4] used the Semi-Exclusive B reconstruction (SER) algorithm for hadronic tagging [3, Section 7.4.1.1]. The algorithm used exclusive D and D* meson candidates as a seed, and combined those with up to 5 charmless hadrons to a B_tag without assuming an exclusive B decay mode. The tag-side efficiency and tag-side purity of each B decay chain were extracted by fitting the beam-constrained mass [3, Section 7.1.1.2] spectrum of the constructed B_tag meson candidates. The beam-constrained mass is defined as $M_\mathrm{bc} = \sqrt{E_\mathrm{beam}^2/c^4 - p_B^2/c^2}$, where $p_B$ denotes the three-momentum of the reconstructed B meson candidate and $E_\mathrm{beam}$ denotes half of the centre-of-mass energy of the colliding electron-positron pair. The maximum hadronic tag-side efficiency achieved by this algorithm was 0.2% for B⁰B̄⁰ and 0.4% for B⁺B⁻, with a tag-side purity around 30%. The tag-side purity could be further increased by rejecting B meson candidates from low-purity decay chains. The semileptonic tag was usually constructed by combining an exclusive D or D* meson with a lepton. The maximum semileptonic tag-side efficiency was typically 0.3% for B⁰B̄⁰ and 0.6% for B⁺B⁻, with an unknown tag-side purity.
The Belle experiment [5] used the so-called Full Reconstruction (FR) algorithm [6] for hadronic tagging [3, Section 7.4.1.2]. The FR introduced a hierarchical approach, which is still used by its successor and is presented in this article (see Section 3). The tag-side efficiency and tag-side purity were extracted by fitting the beam-constrained mass spectrum of the constructed B_tag meson candidates. The maximum hadronic tag-side efficiency achieved by this algorithm was 0.18% for B⁰B̄⁰ and 0.28% for B⁺B⁻, with a tag-side purity around 10%. Multivariate classifiers [7] were used to estimate the signal probability of each candidate. The tag-side purity could be further increased by requiring a minimal signal probability. Variants of the FR were used for semileptonic tagging (see [8] and [9]). The maximum semileptonic tag-side efficiency was 0.31% for B⁰B̄⁰ and 0.34% for B⁺B⁻, with a typical tag-side purity of 5%.
Compared to the previously employed algorithms, the FEI provides a greater tagging and tag-side efficiency, with an equal or better tag-side purity. The improvements with respect to the FR can be attributed equally to the additional decay channels and the new candidate selection criteria. The reported maximum tag-side efficiencies for the previously used exclusive tagging algorithms are summarized in Table 1. The stated efficiencies are not directly comparable due to different selection criteria, such as a threshold on the beam-constrained mass or on the deviation of the nominal energy from the reconstructed energy, ∆E = E_beam − E_B, with E_B denoting the energy of the B candidate; best-candidate selections; or cuts on the event shape used to suppress background from non-Υ(4S) events.
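The two kinematic quantities used in these selections, the beam-constrained mass and ∆E, can be computed as in the following sketch. It assumes natural units (c = 1, energies and momenta in GeV) and that all quantities are given in the centre-of-mass frame; the function names and the numerical values are illustrative assumptions. The sign of ∆E follows the definition quoted in this article.

```python
import math


def beam_constrained_mass(e_beam, p_b):
    """M_bc = sqrt(E_beam^2 - |p_B|^2) in natural units (c = 1).

    e_beam: half of the centre-of-mass energy in GeV.
    p_b:    (px, py, pz) of the B candidate in GeV.
    """
    px, py, pz = p_b
    m2 = e_beam ** 2 - (px * px + py * py + pz * pz)
    # Resolution effects can push m2 slightly below zero; clamp it.
    return math.sqrt(max(m2, 0.0))


def delta_e(e_beam, e_b):
    """Delta E = E_beam - E_B, the sign convention used in the text."""
    return e_beam - e_b


# Illustrative numbers: a centre-of-mass energy of 10.58 GeV and a
# B candidate with |p| = 0.33 GeV and E = 5.28 GeV.
E_BEAM = 10.58 / 2
m_bc = beam_constrained_mass(E_BEAM, (0.0, 0.0, 0.33))
d_e = delta_e(E_BEAM, 5.28)
print(f"M_bc = {m_bc:.4f} GeV, Delta E = {d_e:.3f} GeV")
```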

Method
The FEI algorithm follows a hierarchical approach with six stages, visualized in Figure 2. Using the reconstructed tracks and clusters, final-state particle candidates are constructed and combined to intermediate particles until the final B candidates are formed. The probability of each candidate to be correct is estimated by a multivariate classifier. A multivariate classifier maps a set of input features (e.g. the four-momentum or the vertex position) to a real-valued output, which can be interpreted as a probability estimate. Using machine learning, the multivariate classifiers can be constructed by optimizing a loss function (e.g. the misclassification rate) on Monte Carlo simulated Υ(4S) events.
Fig. 2: Schematic overview of the FEI. The algorithm operates on objects identified by the reconstruction software of the Belle II detectors: charged tracks, neutral clusters and displaced vertices. In six distinct stages, these basic objects are interpreted as final-state particles (e⁺, µ⁺, K⁺, π⁺, K⁰_L, γ), combined to form intermediate particles (J/ψ, π⁰, K⁰_S, D, D*) and finally form the tag-side B mesons.
All steps in the algorithm are configurable: the decay channels used, the cuts employed, the choice of the input features, and the hyper-parameters of the multivariate classifiers all depend on the configuration. A more detailed description of the algorithm and the default configuration can be found in Keck [10]; in the following we give a brief overview of the key aspects of the algorithm.

Combination of Candidates
Charged final-state particle candidates are created from tracks assuming different particle hypotheses. Neutral final-state particle candidates are created from clusters and from displaced vertices constructed from oppositely charged tracks. Each candidate can be correct (signal) or wrong (background). For instance, a track used to create a π⁺ candidate can originate from a pion traversing the detector (signal), from a kaon traversing the detector (background), or from a random combination of hits from beam background (also background).
All candidates available at this stage are combined to intermediate particle candidates in the subsequent stages, until candidates for the desired B mesons are created. Each intermediate particle has multiple possible decay channels, which can be used to create valid candidates. For instance, a B⁻ candidate can be created by combining a D⁰ and a π⁻ candidate, or by combining a D⁰, a π⁻ and a π⁰ candidate. The D⁰ candidate could be created from a K⁻ and a π⁺, or from a K⁰_S and a π⁰.
The FEI reconstructs more than 100 explicit decay channels, leading to O(10000) distinct decay chains.
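The hierarchical combination described above can be illustrated with a small sketch. It is not the basf2 implementation: candidates are represented here simply as a name plus the set of detector objects (track or cluster identifiers) they use, and the helper `combine` and all identifiers are hypothetical. The only rule enforced is that no track or cluster may be used twice within one candidate.

```python
from itertools import product


def combine(parent, daughter_lists):
    """Build parent candidates from one candidate per daughter list.

    A combination is kept only if no detector object is shared between
    its daughters. Note: decay channels with identical daughter types
    would need extra care to avoid double counting; this is omitted.
    """
    combined = []
    for daughters in product(*daughter_lists):
        used = [obj for _, objects in daughters for obj in objects]
        if len(used) == len(set(used)):
            combined.append((parent, frozenset(used)))
    return combined


# Hypothetical event: tracks 1 and 2 are negative, 3 and 4 positive.
# Each track is tried under several particle hypotheses.
k_minus = [("K-", frozenset({1})), ("K-", frozenset({2}))]
pi_minus = [("pi-", frozenset({1})), ("pi-", frozenset({2}))]
pi_plus = [("pi+", frozenset({3})), ("pi+", frozenset({4}))]

# Stage 1: D0 -> K- pi+, stage 2: B- -> D0 pi-
d0 = combine("D0", [k_minus, pi_plus])
b_minus = combine("B-", [d0, pi_minus])
print(len(d0), len(b_minus))
```

Each additional decay channel of a particle contributes its own list of candidates, so the number of candidates to be handled grows quickly with every stage.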

Multivariate Classification
The FEI employs multivariate classifiers to estimate the probability of each candidate to be correct, which can be used to discriminate correctly identified candidates from background. For each final-state particle and for each decay channel of an intermediate particle, a multivariate classifier is trained which estimates the signal probability σ that the candidate is correct. In order to use all available information at each stage, a network of multivariate classifiers is built, following the hierarchical structure.
For instance, the classifier built for the decay B⁻ → D⁰ π⁻ would use the σ of the D⁰ and π⁻ candidates to estimate the σ of the B⁻ candidate created by combining the aforementioned D⁰ and π⁻ candidates.
Additional input features of the classifiers are the kinematic and vertex fit information of the candidate and its daughters. The multivariate classifiers used by the FEI are trained on Monte Carlo simulated events. The training is fully automated and distributed using a map-reduce approach.
As can be seen in Figure 2, the available information flows from the data provided by the detector through the intermediate candidates into the final B meson candidates, yielding a single number which can be used to distinguish correctly from incorrectly identified B_tag mesons. This allows one to tune the trade-off between tag-side efficiency and tag-side purity of the algorithm by requiring a minimal σ. By contrast, most exclusive measurements by Belle, which used the previous FR algorithm, chose a working point near the maximum tag-side efficiency as described in Section 2.
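The trade-off controlled by the minimal σ can be made concrete with a toy example. The sketch below assumes that each tagged event contributes one B_tag candidate with a signal probability σ and a truth flag (available on simulated events); the function name, the candidate list and the event count are hypothetical.

```python
def working_point(candidates, n_events, sigma_min):
    """Tag-side efficiency and purity after requiring sigma >= sigma_min.

    candidates: list of (sigma, is_correct), one best candidate per
                tagged event.
    n_events:   total number of Y(4S) events, tagged or not.
    """
    selected = [is_correct for sigma, is_correct in candidates
                if sigma >= sigma_min]
    n_correct = sum(selected)
    efficiency = n_correct / n_events
    purity = n_correct / len(selected) if selected else 0.0
    return efficiency, purity


# Toy sample: 10 events, 6 of which have a B_tag candidate.
toy = [(0.9, True), (0.8, True), (0.6, False),
       (0.4, True), (0.2, False), (0.1, False)]

roc = {cut: working_point(toy, n_events=10, sigma_min=cut)
       for cut in (0.0, 0.5, 0.85)}
for cut, (eff, pur) in roc.items():
    print(f"sigma >= {cut}: efficiency = {eff:.2f}, purity = {pur:.2f}")
```

Tightening the requirement on σ lowers the efficiency and, in this toy sample, raises the purity; scanning the threshold traces out a receiver operating characteristic.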

Combinatorics
It is not feasible to consider all possible B meson candidates created by all possible combinations. The number of possible combinations scales factorially with the number of tracks and clusters. This problem is known as combinatorics in high-energy physics. Furthermore, it is not worthwhile to consider all possible B meson candidates, because in the best-case scenario all of them are wrong except for two.
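A rough count illustrates the scale of the problem. As a simplifying assumption, the sketch below counts the ordered assignments of n detector objects to the k distinct final-state slots of a decay chain, n!/(n−k)!, ignoring charge, particle-identification and kinematic constraints. It is therefore an upper bound for illustration, not the number of candidates the FEI actually builds.

```python
import math


def ordered_assignments(n_objects, multiplicity):
    """Ways to assign n_objects to `multiplicity` distinct slots."""
    return math.perm(n_objects, multiplicity)  # n! / (n - k)!


# The count explodes with both the event and the channel multiplicity.
for n_objects in (10, 12):
    for multiplicity in (3, 5, 7):
        count = ordered_assignments(n_objects, multiplicity)
        print(f"n = {n_objects}, k = {multiplicity}: {count}")
```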
The FEI uses two sets of so-called cuts. A cut is a criterion that a candidate has to fulfill to be considered further. For instance, one could demand that the beam-constrained mass of the B meson candidate is near the nominal B meson mass of 5.28 GeV, or that a µ⁺ candidate has a high µ likelihood calculated from the measurements in the particle-identification sub-detectors.
Directly after the creation of the candidate (either from a track or cluster, or by combining other candidates), but before the application of the multivariate classifier, the FEI uses loose and fast pre-cuts to remove wrongly identified candidates (background) without losing signal. The main purpose of these cuts is to save computing time and to reduce the memory consumption. These pre-cuts are applied separately for each decay channel.
First, a very loose fixed cut is applied on a quantity which is fast to calculate, e.g. the energy for photons, the invariant mass for D mesons, the energy released in the decay for D* mesons, or the beam-constrained mass for hadronic B mesons. Second, the remaining candidates are ranked according to a quantity which is fast to calculate (usually the same quantity as above). Only the n best candidates (n is usually between 10 and 20) in each decay channel are considered further; the others are discarded. This best-candidate selection ensures that each decay channel and each event receives roughly the same amount of computing time.
Next, the computationally expensive parts of the reconstruction are performed on each candidate: the matching of the reconstructed candidates to the generated particles (in case of simulated events), the vertex fitting, and the multivariate classification.
After the multivariate classifiers have estimated the σ of each candidate, the candidates of different decay channels can be compared. Here the FEI uses tighter post-cuts to aggressively remove incorrectly reconstructed candidates using all available information. The main purpose of these cuts is to restrict the number of candidates per particle to a manageable number.
First, there is a loose fixed cut on σ to remove unreasonable candidates. Second, the remaining candidates are ranked according to their σ. Only the m best candidates (m is usually between 10 and 20) of the particle, i.e. over all decay channels, are considered further; the others are discarded. This best-candidate selection ensures that the number of candidates produced in the next stage is tractable by the computing system.
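Both the pre-cuts and the post-cuts follow the same pattern: a loose fixed cut followed by a best-candidate selection. A minimal sketch of this pattern is given below. The helper `select_best`, the cut values, the candidate lists and the approximate D⁰ mass are assumptions made for illustration and do not reflect the default FEI configuration.

```python
def select_best(candidates, passes_cut, rank_key, n_best):
    """Apply a fixed cut, rank the survivors, keep the n_best of them.

    rank_key must return smaller values for better candidates.
    """
    survivors = [c for c in candidates if passes_cut(c)]
    survivors.sort(key=rank_key)
    return survivors[:n_best]


D0_MASS = 1.865  # GeV, approximate nominal D0 mass (illustrative)

# Pre-cut: applied per decay channel on a cheap quantity, here the
# invariant mass of D0 -> K- pi+ candidates.
d0_kpi = [{"channel": "K- pi+", "mass": m}
          for m in (1.70, 1.86, 1.90, 2.20, 1.84)]
pre_selected = select_best(
    d0_kpi,
    passes_cut=lambda c: abs(c["mass"] - D0_MASS) < 0.15,
    rank_key=lambda c: abs(c["mass"] - D0_MASS),
    n_best=2,
)

# Post-cut: applied per particle, over all decay channels, on sigma.
d0_all = [
    {"channel": "K- pi+", "sigma": 0.80},
    {"channel": "K- pi+ pi0", "sigma": 0.30},
    {"channel": "KS0 pi0", "sigma": 0.005},
    {"channel": "K- pi+", "sigma": 0.50},
]
post_selected = select_best(
    d0_all,
    passes_cut=lambda c: c["sigma"] > 0.01,
    rank_key=lambda c: -c["sigma"],
    n_best=2,
)
print(pre_selected, post_selected)
```

Because at most `n_best` candidates survive each selection, the work done in the subsequent expensive steps is bounded regardless of how many candidates an event produces initially.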

Performance
Applying the FEI to O(1 billion) events is a CPU-intensive task. An optimized runtime and a small memory footprint are key for a practical application and save computing resources. The FEI spends most CPU time on vertex fitting (38%), particle combination (27%), and classifier inference (15%). All three tasks have been carefully optimized.
The FEI uses only a fast and simple unconstrained vertex fit during the reconstruction, and feeds the calculated information into its multivariate classifiers. The user can refit the whole decay chain of the final B candidates, including mass and/or interaction point profile constraints if desired. A dedicated fitter (called FastFit) based on a Kalman filter [11] was implemented for the FEI, which outperforms the default implementation used by Belle II and yields very similar results. An overall speedup of the FEI by a factor of 2.74 was observed. The FastFit code is licensed under GPLv3 and available on GitHub [12].
As explained in Section 3.3, the number of candidates which have to be processed scales as the factorial of the multiplicity of the channel. In previous approaches the runtime and the maximum memory consumption were dominated by a few high-multiplicity events, and tight cuts had to be applied to high-multiplicity channels. By contrast, the FEI addresses the combinatorics problem by performing best-candidate selections during the reconstruction of the decay chain instead of fixed cuts. As a consequence, for each event and each decay channel, the FEI processes the same number of candidates in vertex fitting and classifier inference, i.e. it consumes similar amounts of CPU time. Moreover, the maximum memory consumption is limited due to the fixed number of best candidates per event, which is a key requirement for using the computing infrastructure.
Finally, the FEI uses FastBDT [13], a gradient-boosted decision tree (BDT) implementation, as its default multivariate classification algorithm. The algorithm was originally designed for the FEI to speed up the training and application phases. Compared to other popular BDT implementations such as those provided by TMVA [14], SKLearn [15] and XGBoost [16], it originally improved the execution time by more than one order of magnitude, both in training and application. Using FastBDT, most of the time is spent during the extraction of the necessary features; therefore no further significant speedups can be achieved by employing a different method.

Automatic Reporting
The FEI includes an automatic reporting system called Full Event Interpretation Report (FEIR).
The FEIR contains efficiencies and purities for all particles and decay channels at different points during the reconstruction. Individual reports containing control plots for each multivariate classifier and its input variables are also automatically created. This built-in monitoring capability upgrades the FEI from a black-box to a white-box algorithm, which the user can understand and inspect on all levels of reconstruction.

Results
The FEI algorithm was developed for the Belle II experiment. In order to quantify the improvements with respect to the previously used FR algorithm, the FEI is applied to data recorded by the Belle experiment. Simulated events and recorded data from the Belle experiment are converted into the new Belle II data format [10, Chapter 2]. This conversion tool was used to validate the entire Belle II analysis software and will be described in a separate publication [17]. The remainder of this article focuses on the results obtained for the hadronic tag on data recorded by the Belle experiment. The results for the semileptonic tag and for Belle II are based on simulated events and are only summarized briefly. A detailed validation of the entire algorithm can be found in Keck [10, Chapter 4].

Hadronic Tag
The performance of the hadronic tag provided by the FEI using simulated and recorded Belle events is studied and compared to the previously used FR algorithm.
First, the considered decay channels of the FEI are restricted to the set of hadronic decay channels used by the FR. The performance of the FEI and the FR is compared using the same hardware and the same simulated charged (neutral) BB̄ Belle events. The FEI required 33% less computing time and achieved a maximum tag-side efficiency of 0.53% (0.33%) on simulated events, which is significantly higher than the previously reported tag-side efficiencies (see Section 2). The increase in the maximum tag-side efficiency is due to the improved candidate selection criteria, in particular the best-candidate selections.
Second, all decay channels of the FEI are used, including the 38 new hadronic decay channels. The performance of the FEI and the FR is then compared using the same hardware and the same simulated charged (neutral) Belle events. The FEI required 48% more computing time and achieved a maximum tag-side efficiency of 0.76% (0.46%) on simulated events. The further increase in the maximum tag-side efficiency is due to the additional decay channels.
As mentioned before, the maximum tag-side efficiency is an important performance indicator for exclusive measurements, which can employ the completeness constraint to achieve a high final purity. The achieved maximum tag-side efficiencies are summarized in Table 2.
In order to validate the results for the hadronic tag obtained from the simulation study, we conducted exclusive measurements of ten different semileptonic B decay channels using the full Υ(4S) dataset recorded by Belle. The branching fractions of the considered semileptonic decay channels are well known from independent untagged measurements. The branching fractions of those well-known decay channels are measured using the hadronic tag, taking into account all known disagreements between simulation and data, e.g. in the particle identification performance and the track reconstruction efficiency. We assume that the remaining disagreement between simulation and data is caused by the tag-side. Therefore, the ratio ε of the measured and the expected branching fraction is proportional to the ratio of the tag-side efficiency on recorded data and simulated events. Our assumption is supported by the compatibility of the extracted ratios within their uncertainties. Figure 3 summarizes the results for the ten decay channels. The ratios averaged over all control channels for the charged and neutral B_tag mesons are [...], where the first uncertainty is statistical and the second systematic.
A detailed description of the control measurements, including results for each tag and control channel, can be found in Schwab [18]. A similar study was conducted in the past for the FR by Sibidanov et al. [19], yielding a similar overall ratio of ε = 0.75 ± 0.03. The rather large discrepancy between simulated events and recorded data is caused by the uncertainty on the branching fractions and decay models of the simulated B decay channels used for the tag-side and by the large number of multivariate classifiers involved in the process.
Fig. 3: The ratios calculated by measuring 10 semileptonic decay channels on converted Belle data using the hadronic tag. The procedure is described in Schwab [18].
The uncertainty on the tag-side efficiency of the FEI is one of the most important systematic uncertainties in the measurement of branching fractions of rare decays. The tag-side efficiency can be corrected using the extracted ratios. It is possible to apply these corrections as a function of the tag-side decay channel and signal probability. A measurement which uses the ratios to correct the tag-side efficiency is performed relative to the considered calibration decay channels. The systematic uncertainty of the correction is given by the uncertainty of the ratios.
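As a minimal sketch of such a correction, the efficiency obtained from simulation is multiplied by the calibration ratio ε, and the uncertainty of the ratio is propagated as the systematic uncertainty of the corrected efficiency. The function name and all numbers below are hypothetical placeholders, not the calibration factors measured for the FEI; in practice the ratios could be looked up per tag-side decay channel and signal-probability range in the same way as they are looked up per B_tag charge here.

```python
def correct_efficiency(eff_mc, ratio, ratio_unc):
    """Scale a simulated efficiency by the data/simulation ratio.

    Returns the corrected efficiency and its systematic uncertainty,
    taken to be the propagated uncertainty of the ratio.
    """
    return eff_mc * ratio, eff_mc * ratio_unc


# Hypothetical calibration ratios (value, uncertainty) per B_tag type.
calibration = {"B+": (0.80, 0.05), "B0": (0.85, 0.06)}
# Hypothetical tag-side efficiencies obtained from simulated events.
eff_mc = {"B+": 0.0050, "B0": 0.0030}

corrected = {tag: correct_efficiency(eff_mc[tag], *calibration[tag])
             for tag in eff_mc}
for tag, (eff, unc) in corrected.items():
    print(f"{tag}: {eff:.5f} +- {unc:.5f} (syst.)")
```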
In order to compare the hadronic tag provided by the FEI and the FR in a well-defined manner and independent of the signal-side, both algorithms are applied to the same set of 10 million events recorded by the Belle experiment. The events in the data sample are randomly sampled from the so-called Υ(4S) dataset. The same cuts on the beam-constrained mass, M_bc > 5.24 GeV, and on the deviation of the reconstructed energy from the nominal energy, −0.15 GeV < ∆E < 0.1 GeV, were applied, and only the best (i.e. the highest-σ) B meson candidate in each event was used. From this dataset, we extracted the ROC of the FEI and the FR, i.e. we extracted the tag-side efficiency and tag-side purity for different cuts on the signal probability σ. We followed the procedure established in previous publications [3, Chapter 7.1]. For different cuts on σ, an extended unbinned maximum likelihood fit of the beam-constrained mass spectrum was performed. The signal peak consisting of correct B_tag mesons was modelled with a Crystal Ball function, whereas the background was described using an ARGUS function. The Gaussian mean of the Crystal Ball function was fixed to the B meson mass and its power-law exponent was fixed to m = 4, based on the expected shape obtained from Monte Carlo simulations. The location and the width of the ARGUS function were fixed using the known kinematic endpoint of the spectrum. All other parameters (the normalization of both functions, the width of the Crystal Ball function, and the remaining shape parameters of both functions) were adjusted by the fit. Afterwards we extracted the tag-side efficiency and tag-side purity in a window of 5.27 GeV < M_bc < 5.29 GeV using the fitted yields of the signal and background components.
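The ingredients of this procedure can be sketched as follows. This is an illustration only and not the fitting code used to obtain the results: it defines unnormalised Crystal Ball and ARGUS shapes, integrates them numerically, and computes the purity in the signal window from given yields. The ARGUS parametrisation with a negative shape parameter is an assumed convention, and the yields and all shape parameters other than the exponent m = 4, the fit range and the signal window are hypothetical placeholders.

```python
import math


def crystal_ball(x, mean, sigma, alpha, power):
    """Unnormalised Crystal Ball shape with a low-side power-law tail."""
    t = (x - mean) / sigma
    if t > -alpha:
        return math.exp(-0.5 * t * t)
    a = (power / alpha) ** power * math.exp(-0.5 * alpha * alpha)
    b = power / alpha - alpha
    return a * (b - t) ** (-power)


def argus(x, endpoint, shape):
    """Unnormalised ARGUS shape; vanishes at and above the endpoint."""
    if x <= 0.0 or x >= endpoint:
        return 0.0
    u = 1.0 - (x / endpoint) ** 2
    return x * math.sqrt(u) * math.exp(shape * u)


def integrate(f, lo, hi, steps=2000):
    """Midpoint-rule integral of f over [lo, hi]."""
    h = (hi - lo) / steps
    return h * sum(f(lo + (i + 0.5) * h) for i in range(steps))


def window_purity(n_sig, n_bkg, sig_pdf, bkg_pdf, fit_range, window):
    """Purity in `window`, given yields fitted over `fit_range`."""
    f_sig = integrate(sig_pdf, *window) / integrate(sig_pdf, *fit_range)
    f_bkg = integrate(bkg_pdf, *window) / integrate(bkg_pdf, *fit_range)
    s, b = n_sig * f_sig, n_bkg * f_bkg
    return s / (s + b)


# Placeholder shapes: exponent m = 4 as in the text, rest hypothetical.
def signal(x):
    return crystal_ball(x, mean=5.279, sigma=0.003, alpha=1.0, power=4.0)


def background(x):
    return argus(x, endpoint=5.29, shape=-30.0)


purity = window_purity(n_sig=1000, n_bkg=9000,
                       sig_pdf=signal, bkg_pdf=background,
                       fit_range=(5.24, 5.29), window=(5.27, 5.29))
print(f"purity in signal window: {purity:.3f}")
```

The corresponding tag-side efficiency would be obtained by dividing the signal yield in the window by the number of events in the sample.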
Moreover, we checked for a potential peaking combinatorial background component, which would bias the results. This test was done using 10 million events recorded 60 MeV below the Υ(4S) resonance. This dataset does not contain B mesons, hence no signal is expected. The fitted signal yields were compatible with zero.
The resulting ROC curves are shown in Figure 4 and Figure 5 for charged and neutral B_tag mesons, respectively. The FEI exhibits a larger overall tag-side efficiency compared to the FR. We observe a slightly better performance for the FR than reported in Feindt et al. [6]. Both algorithms perform equally well when requiring a high tag-side purity. We suspect this is because there is only a finite number of cleanly identifiable B_tag meson candidates and both algorithms identify them with similar performance. The results for tag-side purities above 70% cannot be extracted reliably and depend strongly on the chosen signal or background fit model. For practical applications, the low tag-side purity region is of interest for exclusive measurements. The beam-constrained mass distributions corresponding to the low-purity region with about 15% tag-side purity and the high-purity region with approximately 80% tag-side purity are shown in Figure 6 and Figure 7, respectively, for the charged B_tag.
The maximum tag-side efficiency on recorded data cannot be determined by this method, as the fits are restricted to the best B_tag candidates. However, a significant contribution to the improvement of the FEI compared to the FR is the increased number of provided candidates per event. A physics measurement will benefit from these additional tag-side candidates by first [...]

Semileptonic Tag
The performance of the semileptonic tag provided by the FEI is studied using simulated Belle events. The maximum tag-side efficiencies are summarized in Table 2. Receiver operating characteristics extracted from simulated events can be found in Keck [10]. The results obtained from simulated events, and the fact that the hadronic and semileptonic tag only share five out of six reconstruction stages, indicate a significant increase in the maximum tag-side efficiency. The semileptonic tag was successfully used by Keck [10] to determine the branching fraction of B → τ ν_τ on the full Υ(4S) dataset recorded by the Belle experiment, with a smaller relative statistical uncertainty than obtained previously. However, no studies with well-known calibration channels as described in Kronenbitter [21], and no signal-side-independent determination of the ROCs as described in Kirchgessner [8], are available yet.
[...] [10], which indicate a worse overall performance. This is likely due to the increased beam background caused by the higher luminosity of the collider, which leads to additional tracks and neutral energy depositions. This additional detector activity is not yet fully rejected by the Belle II reconstruction algorithms [10] and future improvements are likely possible.

Discussion
The multivariate classifiers used by the FEI are trained on Monte Carlo simulated events. Depending on the training procedure and the type of events provided to the training, the multivariate classifiers of the FEI are optimized for different objectives.
In this article, we presented a so-called generic adaptation of the FEI. "Generic" means that the FEI was trained independently of any specific signal-side, using 180 million simulated Υ(4S) events. This setup optimizes the tag-side efficiency for a "generic" Υ(4S) event.
Other versions of the FEI exist which optimize the tag-side efficiency for specific signal events like B → τ ν. The so-called specific FEI is trained on the remaining tracks and clusters after a potential signal B meson has already been identified. The training uses simulated Υ(4S) events and simulated signal events. As a consequence, the classifiers can be specifically trained to identify correctly reconstructed B_tag mesons for signal events and can focus on reducing non-trivial background which is not discarded by the completeness constraint. The specific FEI was first introduced as a proof of concept by Keck [22] and used in Metzner [20].
Roughly half of the improvements with respect to the previous algorithm can be attributed to the additionally considered decay channels. Future extensions which use semileptonic D meson decays, baryonic decays and decays including K⁰_L particles are currently being investigated.
It should also be noted that the FEI algorithm can be applied, with little modification, to the Υ(5S) resonance. This resonance decays into a pair of B(*) B̄(*) mesons. The powerful completeness constraint can still be applied in this situation.

Conclusion
The Full Event Interpretation is a new exclusive tagging algorithm developed for the Belle II experiment and will be used to measure a wide range of decays with a minimum of detectable information. The algorithm exploits the unique setup of B factories and significantly improves the tag-side efficiency compared to its predecessor algorithms.
The tag-side efficiency for hadronically tagged B mesons was validated and calibrated using Belle data. Furthermore, the hadronic and the semileptonic tag provided by the FEI have already been used in several validation measurements [10, 18, 23] using the full Υ(4S) dataset recorded by the Belle experiment. Similar studies and measurements for Belle II are anticipated as soon as the experiment records a sufficient number of collision events.
There are several ways in which the FEI algorithm could be further refined and applied to so far unexplored applications. These will provide an exciting and fruitful area of future research.

Fig. 6: Beam-constrained mass distribution of charged B_tag mesons in the low tag-side purity region on converted Belle data.

Table 1: Summary of the maximum tag-side efficiency reported for the previously used exclusive tagging algorithms. The numbers are not directly comparable due to different selection criteria, like best-candidate selections and selections to suppress non-Υ(4S) events.
Fig. 4: Receiver operating characteristic of charged B_tag mesons extracted from a fit of the beam-constrained mass on converted Belle data. The FEI outperforms the FR algorithm at low and high purity.

Table 2: Summary of the maximum tag-side efficiency of different setups on simulated data from the last official Monte Carlo campaign of the Belle experiment. The maximum tag-side efficiency on recorded data is lower (see Section 4.1).