Kinematic strangeness production in cluster hadronization

We present a modification to the non-perturbative strangeness production mechanisms in the Monte-Carlo event generator Herwig in order to make the processes more dynamic and collective. We compare the model to a series of observables for soft physics at both LEP and LHC.


Introduction
The non-perturbative elements of simulating LHC events remain an active area of research in light of recent ALICE and CMS data [1,2]. Signs of strangeness enhancement and of collective effects, respectively, in high-multiplicity events have inspired several phenomenological models, ranging from interacting strings [3,4], to relativistic hydrodynamics [5], to tweaks to the existing multiple parton interaction [6] and colour reconnection [7,8] models. Monte Carlo event generators [5,9-11] provide a useful testing ground for these models.
Arguably the most successful models of hadronization that try to reproduce strangeness enhancement in high-multiplicity events are rooted in the physics of collectivity, where the dense environment of high-multiplicity events leads to more complicated systems which interact with one another. Heavy ion event generators typically prefer a hydrodynamic viewpoint, where the quark-gluon plasma acts as a perfect fluid, changing the dynamics of hadronization. High-energy pp event generators tend to use sophisticated iterations of the more conventional proton collision techniques, such as the DIPSY rope model, where several overlapping Lund strings [12] combine into a higher-representation colour field, which then may enhance strangeness production and may also shove each other transversely outwards, mimicking the fluid behaviour of quark-gluon plasma. Another model [13] has attempted to use a thermodynamics-inspired route to string fragmentation and was able to explain a harder transverse momentum spectrum for heavier particles.
Herwig [9] has recently developed a new model for colour reconnection, where baryonic clusters were allowed to be produced in a geometric fashion [8], in an attempt to explain the results of [1]. The model was able to create heavier hadrons, and in particular more baryons, but in order to better describe the data, the non-perturbative gluon splitting mechanism was allowed to produce ss̄ pairs as well as the default lighter species. However, the production weight was simply set to a flat number, tuned to Minimum Bias events at the LHC. In this paper, we will mainly focus on the fundamental mechanisms of strangeness production in cluster hadronization, namely the production rate of ss̄ pairs during non-perturbative gluon splitting, cluster fission, and cluster decay. In doing so, we are taking the first steps towards a rework of strangeness production in the Herwig hadronization phase. A full model would also need to consider colour reconnection, since this rearranges the colour topology and thus the mass distribution inside an event, affecting the scaling that we are interested in studying.
In this study, we aim to introduce a simple dynamic model of strangeness production in Herwig, in which each non-perturbative production stage uses the kinematic information of the relevant surrounding colour-singlet system. After reviewing the current mechanisms of hadronization in Sec. 2, we perform two separate tunes to a number of light strange meson observables for LEP and LHC Minimum Bias events in Sec. 3. We show that the tuned current strangeness production parameters are drastically different between the two collider types, and propose a mass-based scaling for the relevant production weights in Sec. 4, comparing two different mass-like measures to scale the probability. In Sec. 5, we tune our new model and compare the results with the old model in Herwig, as well as perform a comparison to the default Lund string model in Pythia [10] with the Monash tune [14]. We briefly summarize the work and possible future avenues for research in Sec. 6.

The Herwig Hadronization Model
To accurately describe a full QCD event, one must be able to model the non-perturbative physics contributions, e.g. the hadronization of individual quarks and gluons from the parton shower and the multiple parton interactions to form colour-singlet hadrons. Fig. 1 sketches a schematic event, focusing on the final state. After generating a hard matrix element for the event, Herwig performs a parton shower, producing a number of soft and collinear partons. Once the parton shower reaches a scale of O(1) GeV, the hadronization phase of the simulation begins. In Herwig, the hadronization model is the cluster model [15], based on the colour preconfinement [16] property of the angular-ordered parton shower. A cluster can be considered to be a highly primordial, excited colour-singlet qq̄ pair. The hadronization model in Herwig consists of several parts, applied in the following algorithmic order:
• Non-perturbative gluon splitting,
• Colour reconnection,
• Cluster fission,
• Cluster decay to hadron pairs,
• Unstable hadron decays.
In Fig. 1, we have omitted colour reconnection since this step simply changes the colour topology of the event, not the content of the clusters. Modifying the colour reconnection algorithm would have a non-trivial impact on the later stages of hadronization, namely cluster fission and decay, but it is outside the scope of this paper; these correlations will be studied and addressed in future work. Since this project is mainly focused on light strange hadron production, we tune predominantly to pion and kaon observables. We will also ignore unstable hadron decays for the purposes of this paper.
The three other listed stages in hadronization are each allowed to contribute to the overall strangeness in the event, since they each produce new qq̄ pairs. We briefly recall the details of each step, which are presented in depth in [9].

Non-perturbative gluon splitting
Once the parton shower ends, all gluons undergo a non-perturbative splitting into qq̄ pairs. The species of the pair is determined by a given weight, e.g. in the tune from [8] the weights of up, down, and strange are in the ratio 2:2:1. The default version of Herwig does not allow for strangeness production at this step, only uū and dd̄ pairs. The only constraint on the gluon splitting is that the gluon mass is at least twice the constituent mass of the species in question. The gluons are split isotropically.
After all the gluons in an event have been split, clusters are formed from the momentum-space neighbouring qq̄ pairs, since nearest neighbours in momentum space are most likely to be nearest neighbours in colour space [16]. The resulting cluster mass distribution is decoupled from the hard scattering process that created them.
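The flavour choice described above can be sketched in a few lines of Python. This is only an illustration, not Herwig's implementation: the function name `pick_split_flavour` and the constituent masses are assumed for the example, and only the two ingredients named in the text are modelled, namely the relative flavour weights and the requirement that the gluon mass is at least twice the constituent mass.

```python
import random

# Illustrative constituent masses in GeV (assumed values, not Herwig defaults).
CONSTITUENT_MASS = {"u": 0.325, "d": 0.325, "s": 0.45}


def pick_split_flavour(gluon_mass, weights, rng=random):
    """Pick the flavour of the quark-antiquark pair for one gluon splitting.

    `weights` maps flavour -> relative weight, e.g. {"u": 2, "d": 2, "s": 1}.
    """
    # Keep only flavours that are kinematically accessible: m_g >= 2 m_q.
    allowed = {
        flavour: weight
        for flavour, weight in weights.items()
        if weight > 0.0 and gluon_mass >= 2.0 * CONSTITUENT_MASS[flavour]
    }
    if not allowed:
        raise ValueError("gluon too light to split into any flavour")
    flavours = list(allowed)
    # Draw one flavour with probability proportional to its weight.
    return rng.choices(flavours, weights=[allowed[f] for f in flavours], k=1)[0]
```

With weights 2:2:1 and a sufficiently heavy gluon, roughly one splitting in five yields a strange pair in this sketch; a gluon lighter than twice the assumed strange mass never does.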

Cluster fission
Exceptionally heavy clusters are allowed to fission into two lighter, less excited clusters if the mass M of the original cluster satisfies the condition

$$M^{p} \geq q^{p} + (m_1 + m_2)^{p}, \qquad (1)$$

where p and q are parameters that control the fission criterion, and $m_{1,2}$ are the masses of the constituent partons of the heavy cluster. In Herwig, p is given separate values for light quarks (u, d, s), charm, and bottom. The light quark weights are further subdivided, and strangeness is suppressed by a flat weight. q has a similar divide between the quark species.
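As a quick numerical illustration of the fission criterion, the sketch below assumes it takes the form M^p ≥ q^p + (m_1 + m_2)^p, which is our reading of the Herwig cluster model. The function name and the parameter values in the usage note are chosen for the example only.

```python
def should_fission(M, m1, m2, p, q):
    """Return True if a cluster of mass M is heavy enough to fission.

    Assumed criterion: M**p >= q**p + (m1 + m2)**p, with m1 and m2 the
    masses of the cluster constituents and p, q the fission parameters.
    """
    return M**p >= q**p + (m1 + m2) ** p
```

For p = 2 and q = 3 GeV, a 5 GeV cluster with light constituents (0.3 GeV each) passes the test, whereas a 5 GeV cluster whose constituents weigh 2.5 GeV each does not, since the constituent term alone already saturates the cluster mass.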
After selecting clusters to fission, the cluster fissioner produces a qq̄ pair from the light quarks with a fixed weight, which takes distinct values for each quark flavour (except top) and for diquarks. Each parton from the pair goes into a separate cluster, giving the new pair of clusters a mass distribution of

$$M_{1,2} = m_{1,2} + (M - m_{1,2} - m_q)\,\mathcal{R}_{1,2}^{1/w}, \qquad (2)$$

where $m_q$ is the mass of the produced quark, $\mathcal{R}_{1,2}$ are random numbers drawn uniformly from [0,1], and w is the splitting parameter that controls the rate of splitting for clusters containing different species of quarks.
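The daughter-mass sampling can be sketched as follows, assuming the form M_i = m_i + (M - m_i - m_q) R_i^(1/w) with R_i uniform in [0,1], which is how we understand the Herwig cluster model. The function name, the rejection conditions and the retry limit are assumptions made for this illustration and are not taken from the Herwig source.

```python
import random


def fission_masses(M, m1, m2, m_q, w, rng=random, max_tries=10000):
    """Sample the masses (M1, M2) of the two daughter clusters.

    M      : mass of the parent cluster
    m1, m2 : masses of the parent's constituents
    m_q    : mass of the quark produced in the fission
    w      : splitting parameter (assumed to enter as the exponent 1/w)
    """
    # Each daughter must be able to hold one constituent plus one new quark.
    if M <= m1 + m2 + 2.0 * m_q:
        raise ValueError("cluster too light to fission into this flavour")
    for _ in range(max_tries):
        M1 = m1 + (M - m1 - m_q) * rng.random() ** (1.0 / w)
        M2 = m2 + (M - m2 - m_q) * rng.random() ** (1.0 / w)
        # Assumed acceptance: daughters above threshold and M1 + M2 <= M.
        if M1 >= m1 + m_q and M2 >= m2 + m_q and M1 + M2 <= M:
            return M1, M2
    raise RuntimeError("no kinematically allowed pair of masses found")
```

In this form, a larger w pushes R^(1/w) towards one and therefore favours heavier daughter clusters before the rejection step.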

Cluster decay
The last stage of cluster-based physics is the cluster decay level, in which clusters decay into excited hadrons. Given a cluster with constituents $q_1, \bar{q}_2$, the weight for producing hadrons $h_a = q_1\bar{q}$, $h_b = q\bar{q}_2$, where q denotes a quark or diquark species, is given by

$$W(h_a, h_b \,|\, q_1, \bar{q}_2) = P_q\, w_a s_a\, w_b s_b\, p^{*}_{a,b}, \qquad (3)$$

where $P_q$ is the production weight for the given quark or diquark species, $w_i$ are the weights for the relevant hadron production, and $s_i$ are the suppression factors for the corresponding hadrons. The final factor, $p^{*}_{a,b}$, is the two-body phase space factor that controls how readily the cluster can decay into the two chosen hadrons.
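The structure of the decay weight can be made concrete with a short sketch. It assumes the weight is the product P_q w_a s_a w_b s_b p*, with p* taken to be the standard two-body decay momentum in the cluster rest frame; the function names and the numbers in the usage note are illustrative only.

```python
import math


def two_body_momentum(M, m_a, m_b):
    """Momentum of either decay product for M -> a + b in the rest frame of M."""
    if M < m_a + m_b:
        return 0.0  # decay is kinematically closed
    return math.sqrt((M * M - (m_a + m_b) ** 2) * (M * M - (m_a - m_b) ** 2)) / (2.0 * M)


def decay_weight(P_q, w_a, s_a, w_b, s_b, M, m_a, m_b):
    """Assumed cluster decay weight: flavour weight x hadron factors x phase space."""
    return P_q * w_a * s_a * w_b * s_b * two_body_momentum(M, m_a, m_b)
```

For a 2 GeV cluster decaying into two 0.6 GeV hadrons the phase space factor is 0.8 GeV, so with P_q = 0.5 and all hadron factors set to one the weight is 0.4. A cluster below the two-hadron threshold gets zero weight for that channel.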

Herwig strangeness parameters
The Herwig parameters that control non-perturbative strangeness production are the gluon splitting weight, SplitPwtSquark, and the cluster fission and decay weight, PwtSquark. In the original model, cluster fission and cluster decay are controlled by the same parameter. The first step in understanding the different contributions is to disentangle cluster fission from cluster decay by introducing one additional parameter, FissionPwtSquark, which controls the production of an ss̄ pair during cluster fission. The decay parameter remains the same.

Tuning of the existing model
In this section we tune the parameters for strangeness production of the existing model first to LEP and then to LHC data. Hadronization models are typically tuned to LEP data if they do not rely on pp-specific event topology, e.g. multiple parton interactions and their effects on colour reconnection, since LEP provides a clean QCD final state environment which imposes relatively strict constraints on what one's hadronization model is allowed to do. The tuning is achieved by using the Rivet and Professor frameworks for Monte Carlo event generators [17,18]. In order to understand the overall effects of strangeness production on different stages of the event generation, we keep all other hadronization parameters that were previously tuned to LEP data at their default values [9,19]. In the first tune (TUNE1), we only consider the effects of the parameters that are directly responsible for strangeness production as explained in Sec. 2.
In a second tuning attempt (TUNE2), we introduce the new parameter for the cluster fission stage. Tuning these 3 different parameters will allow us to study the phases of strangeness production during event generation and will shed light on the differences between LEP and LHC.
We note that this section is an extended part of the introduction to visualize and highlight the effects of the aforementioned different parameters and to see at which stage non-perturbative strangeness production is preferred.

LEP Tuning
For the tuning to LEP data, the following observables from ALEPH [20,21], DELPHI [22], SLD [23] and PDG hadron multiplicities [24], which represent a good description of event shapes and π, K multiplicities, were used with equal weights:
• Mean charged multiplicities for rapidities |y| < 1.0, |y| < 1.5 and |y| < 2.0
The resulting parameter values for the two different tunes are listed in Tab. 1.
While describing the LEP data as a whole on an equally good footing as the default, the new tunes improve the simulation of the observables that were considered in the tuning procedure. TUNE2 gives better agreement with the data, at least with respect to the K± multiplicity, highlighting the necessity to disentangle the cluster fission and cluster decay parameters. The corresponding plots are shown in Fig. 2, where we compare the default version with our two new tunes.

LHC Tuning
For the tuning to LHC data, we focus solely on identified particle distributions measured by ALICE [25] and CMS [2]. We limit the tuning to a centre-of-mass energy of √s = 7 TeV due to the lack of suitable Rivet analyses at higher energies. The following observables were considered in the tuning procedure with equal weights:

The resulting parameter values are shown in Tab. 2. The outcome of the tuning procedure is shown for the p_T distribution of K⁺ + K⁻ yields and the K/π ratio in Fig. 3. Again, retuning the default model with an additional independent parameter at the cluster fission stage significantly improves the description of the considered observables.

Summary
The general approach in tuning a hadronization model is to tune the parameters to LEP data and then assume it is able to describe LHC observables as well since hadronization is assumed to factorize and should not depend on the process involved.
The main difference between LEP and the LHC is the denser hadronic environment at the latter, due to multiple parton interactions, and therefore also the enhanced effect of colour reconnection on the distribution of final state particles. Be that as it may, we believe that the probability to produce strangeness, e.g. at the stage of non-perturbative gluon splitting, should be a universal parameter, independent of the process in question.
Since the data clearly prefer different parameter values at the LHC and at LEP, a single-valued probability is not suited to describing both LHC and LEP observables. It may capture the average effect, but it does not allow for fluctuations on an event-by-event basis. We tackle this problem by assuming that the rate at which strangeness is produced depends on the hadronic density of the immediate environment, which will be discussed in the next section.

Kinematic strangeness production
As mentioned above, the various splitting probabilities and weights are flat numbers tuned to data, without any considerations for the topology of a given event. In order to have a more dynamic picture, where the splitting probabilities depend on the environment, we choose to scale the weights with respect to colour-singlet masses. The mass of a colour-singlet system at a given phase of hadronization scales the probability for strangeness production up or down, depending on a characteristic mass scale for each step.
As a simple starting point for mass-based scaling, we replace the flat weights in each of the steps mentioned in Sec. 2 with the following functional form:

$$w_s(m^2) = \exp\left(-\frac{m_0^2}{m^2}\right), \qquad (4)$$

where $m_0^2$ is the characteristic mass scale for each phase, and $m^2$ is the total invariant mass of the relevant colour-singlet system. In this work, we will also introduce another mass-based measure which replaces $m^2$ in the denominator of Eq. 4: the threshold production measure, λ. We discuss the difference between the two approaches in Sec. 4.3. For now, we will continue to use the total invariant mass as an example in the following sections.
The weights in Eq. 4 are only for strangeness production, and they are relative to the production weights of up and down quarks. In the limit of a very heavy colour-singlet, the rate of producing strangeness will be the same as that of the lighter quarks, while in the low-mass limit, only the lighter quarks will be allowed to be produced. The appeal of an exponential scaling is that this model only introduces one extra parameter to the default model of hadronization in Herwig, and indeed, it does not introduce any extra parameters if one splits the fission and decay parameters. Thus we avoid a proliferation of parameters in our model, and we still have a natural mechanism to allow for event-by-event fluctuations in strangeness production.
The scaling of the production rate in Eq. 4 only applies to ss̄ pairs, and not to any diquarks containing strange quarks. Default Herwig does not allow gluons to non-perturbatively split into diquark-antidiquark pairs, nor does it allow these pairs to be produced during cluster fission and decay. Diquarks may only be produced as remnants of the incoming baryons, or from baryon-number violating processes [9]. Since diquark species would fundamentally affect the baryon yields, which we are not studying in this work, we leave diquark production considerations to a future rework of baryon production in Herwig.
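The two limits described above are easy to check numerically. The sketch below assumes the exponential scaling has the form exp(-m_0^2 / m^2), which is consistent with a characteristic scale in the numerator, the colour-singlet mass in the denominator, and the stated heavy and light limits. The function name `strange_weight` is hypothetical.

```python
import math


def strange_weight(mass_sq, m0_sq):
    """Relative strange weight, assumed to be exp(-m0^2 / m^2).

    mass_sq : squared mass measure of the colour-singlet system (GeV^2)
    m0_sq   : squared characteristic mass scale of the stage (GeV^2)
    """
    if mass_sq <= 0.0:
        return 0.0  # no phase space: strangeness fully suppressed
    return math.exp(-m0_sq / mass_sq)
```

The weight rises monotonically with the colour-singlet mass: it tends to zero for m^2 much smaller than m_0^2, equals exp(-1) ≈ 0.37 at m^2 = m_0^2, and approaches one (equal to the light-quark weight) for very heavy systems.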

Non-perturbative gluon splitting
At the end of the shower, instead of immediately splitting the gluons into qq̄ pairs with the species determined by their given weights, we first collect the various colour-singlet systems in the event, which we call pre-clusters.
While colour preconfinement dictates that the mass distribution of clusters is independent of the hard energy scale, there are no such constraints on the masses of the colour-singlet pre-clusters. As shown schematically in Fig. 6, a parton shower can produce gluons and quark-antiquark pairs at a perturbative level, separating the event into a number of different pre-clusters with a variety of masses. Every gluon in the same pre-cluster will get the same weight, since they belong to the same colour-singlet system and thus have the same mass measure for strangeness production. Since the species is picked probabilistically, this does not mean that all the gluons will produce strange quark-antiquark pairs. The constraint from default Herwig still applies: even in a very heavy pre-cluster, if a gluon cannot access the phase space necessary to split into an ss̄ pair, then it will undergo the usual splitting to up or down quarks.
The characteristic mass scale for pre-clusters will unfortunately depend on the type of collider one uses. As shown in Fig. 4, there is a very broad tail for the proton colliders due to the number of pre-clusters that one can produce. This is a by-product of the dense and complicated final state environment of high energy hadron colliders. At LEP, the pre-cluster mass distribution has two peaks. One lies close to 91.2 GeV, corresponding to events where there are only gluon emissions from the outgoing qq̄ legs of the hard scattering process. Very few colour-singlets fall between the two peaks, because perturbative gluon splitting is suppressed compared to perturbative gluon emission.
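The per-pre-cluster logic described above can be sketched as follows. The example assumes an exponential weight of the form exp(-m_0^2 / m^2), equal unit weights for up and down quarks, and an illustrative strange constituent mass; `split_precluster` and all numerical values are hypothetical and do not correspond to Herwig code.

```python
import math
import random

M_STRANGE = 0.45  # illustrative strange constituent mass in GeV (assumed)


def split_precluster(precluster_mass_sq, gluon_masses, m0_sq, rng=random):
    """Choose a flavour for every gluon in one pre-cluster.

    The strange weight is computed once from the pre-cluster mass and shared
    by all gluons; each gluon then picks its flavour independently.
    """
    if precluster_mass_sq > 0.0:
        w_s = math.exp(-m0_sq / precluster_mass_sq)
    else:
        w_s = 0.0
    flavours = []
    for m_g in gluon_masses:
        # The default threshold still applies gluon by gluon.
        w = w_s if m_g >= 2.0 * M_STRANGE else 0.0
        flavours.append(rng.choices(["u", "d", "s"], weights=[1.0, 1.0, w], k=1)[0])
    return w_s, flavours
```

Even for a very heavy pre-cluster, where the shared weight is close to one, only about a third of the sufficiently heavy gluons split into strange pairs in this sketch, and gluons below the strange threshold never do.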

Cluster fission & decay
At the cluster fission and cluster decay level, the colour-singlet is the cluster itself. We allow the characteristic mass scale and characteristic production probability to be different for the two phases. As shown in Fig. 5, the typical cluster masses at the cluster fission and cluster decay stages are roughly similar for both LEP and LHC, which we hope to reflect in the characteristic mass scales for the two tunes. We note that Figs. 4 and 5 are plotted without turning on the exponential scaling, which would change the mass distribution slightly, but the figures are benchmarks of the typical colour-singlet total invariant masses.

Colour-singlet masses
In the previous sections we have used the total invariant mass of the colour-singlet systems as the mass measure in Eq. 4, but there are issues with this approach. In using the total invariant mass of a given colour-singlet to scale the strangeness weight, we have neglected to take into account the massive nature of the partons in the pre-clusters and clusters. We argue that given two colour-singlets of the same total invariant mass, if one has much heavier endpoints or constituents than the other, then the one with lighter endpoints or constituents should more readily produce ss̄ pairs from the vacuum.
To remove the biasing effects of massive constituents, we have implemented another mass measure:

$$\lambda = m_{cs}^2 - \Big(\sum_i m_i\Big)^2, \qquad (5)$$

where $m_{cs}^2$ is the total invariant mass of the colour-singlet system, and $m_i$ are the invariant masses of the endpoints for pre-clusters or the constituent partons in a cluster.
Gluons are massive in Herwig, but because their masses are used to produce the ss̄ pair, we do not include them in the subtraction term. The λ measure would replace the mass-based denominator in Eq. 4. We have presented the distributions of the λ measure for each of the stages in Fig. 7, and a comparison between the distributions of the two mass measures in Figs. 8 and 9. The λ measure has the appealing feature that if one produced an ss̄ pair at the gluon splitting level, this extra mass would not propagate extra strangeness enhancement further into the hadronization process.
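To illustrate the argument about heavy endpoints, the sketch below assumes the threshold measure subtracts the squared sum of the endpoint or constituent masses from the squared colour-singlet mass, λ = m_cs^2 - (Σ m_i)^2. That exact form is our assumption, as are the function name and the masses used in the usage note. Gluon masses are simply left out of the list passed in, following the text.

```python
def lambda_measure(m_cs_sq, endpoint_masses):
    """Threshold production measure, assumed form: m_cs^2 - (sum of m_i)^2.

    m_cs_sq         : squared invariant mass of the colour-singlet (GeV^2)
    endpoint_masses : masses of the quark endpoints or cluster constituents
                      (GeV); gluon masses are not included
    """
    return m_cs_sq - sum(endpoint_masses) ** 2
```

For two colour-singlets with the same squared invariant mass of 25 GeV^2, light endpoints of 0.325 GeV each give λ ≈ 24.58 GeV^2, while endpoints of 1.5 GeV each give λ = 16 GeV^2. Inserted into an exponential weight, the second system is therefore less likely to produce a strange pair, as argued above.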

Analysis
We first tune the 3 parameters of our mass-based scaling model to the same identified strange particle yields at LEP and LHC as in Sec. 3.

Table 4: Results for the tuned characteristic mass scales m_0, in units of GeV, of our new model using our λ measure (defined in Eq. 5) of a colour-singlet object for LEP and LHC tunes respectively.

Fig. 6: Schematic topology of colour-singlets that can occur from perturbative gluon and quark shower splitting, before the gluons undergo non-perturbative splitting.

With the three new characteristic mass scales, we are able to improve the description of all observables considered in the tuning, especially the LHC observables, as shown in Fig. 10, where we compare the two different mass measures after tuning, as well as the Monash tune [14] for Pythia.
Although the simple tuning recommends different values for LHC and LEP, it is also feasible to use the set of parameters obtained from the tuning to LHC data and still get improved results for LEP observables, as shown in Fig. 11. This was not possible with a simple flat number as the probability to produce strange quarks.

Discussion
The default version of Herwig did not allow for strangeness production during the gluon splitting stage. By allowing this process, improvements can be seen in all the considered observables. With our new model, there is a more physically motivated dynamic strangeness production mechanism at all stages of the hadronization.
The multiple parton interaction model in Herwig involves two types of subprocesses, hard and soft. Hard processes are allowed to shower and emit quarks and gluons, while soft ones produce only gluons which may not shower. These soft gluons are all colour-connected to each other and the beam remnants, resulting in a single pre-cluster when undergoing non-perturbative gluon splitting. This type of pre-cluster typically has a large invariant mass due to the large number of soft gluons and the isotropic nature of their momentum distribution, resulting in a high strangeness production weight for this subsystem. The resulting produced strange particles coming from these soft interactions are distributed uniformly in rapidity.
There are three key differences between the LEP and LHC environments during hadronization. Firstly, LEP has a much lower energy scale than the LHC, naturally limiting the possible distribution of colour-singlet masses at the stage of non-perturbative gluon splittings. As a result, a direct comparison between LEP and LHC in our model is not straightforward.
Secondly, while LEP and LHC simulations may have very similar cluster mass distributions, the number of clusters is far higher for the latter. Similarly, at the pre-cluster level, LEP prefers colour-singlets that span the entire final state, as shown in Fig. 4, i.e. no perturbative gluon splittings during the parton shower. This results in the majority of events either having enhanced strangeness production or none at all, at the gluon splitting level, meaning that a flat weight at this level in hadronization can be justified for LEP runs.
Finally, and related to the previous two, LEP is a much cleaner environment. For lepton collisions, there are no multiple parton interactions, nor much effect from colour reconnection. However, in proton collisions, these are both vital phases of the simulation that drastically change the mass topology of the event.
Taking the characteristic mass scales from Tabs. 3 and 4, we have translated these into an effective expected value for the weights for the two mass measures. For LEP events, as shown in Tab. 5, the total invariant mass approach prefers cluster fission, while for the λ measure, non-perturbative gluon splitting and cluster fission are approximately the same. It should be noted that aside from the gluon splitting weights, there is no direct translation between the kinematic picture and the old model of strangeness production, but these expected values give an idea of the average weights. For gluon splitting at LEP, the weight simply varies between 0 and the maximal value, since pre-clusters are predominantly situated around two peaks, as shown in Fig. 4, and the value shown in Tab. 5 is simply half the maximal value of 0.192 in the invariant mass case, and 0.328 for the λ measure.
For LHC Minimum Bias events, the expected values for the weights are shown in Tab. 6. There is very little difference between using the two mass measures at the gluon splitting and cluster fission stages, while cluster decay is significantly suppressed when using the λ measure. The enormous suppression of strangeness production during the later stages of hadronization compared to the gluon splitting is almost certainly a hint that colour reconnection plays a non-trivial role in producing strange hadrons. Our new kinematic model uses a mass-based scaling, but colour reconnection aims to lower the cluster masses to some local minimum, meaning that it is in direct conflict with our considerations. For LEP simulations, colour reconnection has a small effect, while in LHC simulations, colour reconnection is a vital phenomenon. Future work will study the correlations between the role colour reconnection plays and our model, in particular by varying the amount of colour reconnection that takes place in an event, and by allowing baryonic clusters to form.
Our studies showed that there is virtually no quantitative difference between using the tuned invariant mass parameters and the tuned λ measure parameters. However, the results in Tabs. 5 and 6 suggest that the λ measure bridges the divide between the two types of collision better.
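The notion of an effective expected weight can be illustrated with a toy calculation: average the mass-dependent weight over a sample of colour-singlet masses. The sketch assumes the exponential form exp(-m_0^2 / m^2); the function name, the toy mass sample and the choice of m_0 are made up for the example and are not the tuned values of Tabs. 3 and 4.

```python
import math


def expected_strange_weight(mass_sq_samples, m0_sq):
    """Average of the assumed weight exp(-m0^2 / m^2) over a sample of masses."""
    weights = [
        math.exp(-m0_sq / m_sq) if m_sq > 0.0 else 0.0
        for m_sq in mass_sq_samples
    ]
    return sum(weights) / len(weights)


# Toy two-peak sample: half the systems at a very low mass, half at 91.2 GeV.
toy_sample = [1.0, 91.2**2] * 500
# Toy choice m0 = 91.2 GeV, so the maximal weight is exp(-1).
toy_expected = expected_strange_weight(toy_sample, 91.2**2)
```

With equal populations in the two peaks, the low-mass peak contributes essentially nothing and the average comes out at half the maximal weight, mirroring the statement that the LEP gluon splitting value is half the maximal value.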
We have also compared the results of our new model with Pythia and the Monash tune in Figs. 10 and 11. While the Monash tune aims to describe a number of observables other than the strangeness production rate in Pythia, it is tuned to both LEP and LHC data [14], making it an apt benchmark for this discussion.
We can see that our model performs marginally better than Pythia, and significantly better than default Herwig, when describing the K± yields, and drastically better than both for the K/π ratio, as shown in Fig. 10. However, in the low-p_T region, both Pythia and our model overestimate the data. When using LHC Minimum Bias tuned parameters for LEP simulations, our model outperforms the default Herwig model, but Pythia describes the data better, as shown in Fig. 11.
We expect that changing the non-perturbative strangeness production scaling should not change the overall event-shape observables, such as the sphericity and the total jet broadening. We have included several of these observables from ALEPH data [20,21] in Fig. 12, to confirm that there are only minor statistical differences between default Herwig 7 and our new scaling when one is concerned with non-species-specific observables.
While we have not fully solved the discrepancy between the weights for LEP and LHC strangeness production, we have achieved two results: firstly, we have narrowed the gap between the weights of the two types of collision, and in particular, our model can be used with LHC Minimum Bias tuned parameters to better describe LEP data. Secondly, we have made the first steps to a more sophisticated treatment of hadronization and pair production at the low-energy scale in Herwig.

Conclusion and Outlook
We have introduced a three-part model that scales the probability for strangeness production during the hadronization phase of event generation in Herwig. The scaling is directly controlled by the mass of the corresponding event colour-singlet subsystem at each step. With this mechanism, we allow for greater fluctuations in the production of strange pairs on an event-by-event basis.
We have studied the mechanism for non-perturbative strangeness production in detail and found that the current flat probability model cannot be reconciled with LEP and LHC data simultaneously. A hadronization model should be able to have minimal effects on LEP simulations, but produce significant effects for LHC simulations.
After allowing a mass-based scaling, and tuning the parameters to LEP and LHC data, we find that we are able to narrow the gap between the two collider types, and are able to describe some observables better than the Lund string model in Pythia with the Monash tune. We also provide expected values for non-perturbative strangeness production, which capture the average values for event-by-event fluctuations.
It should be noted that we have not considered heavier hyperons, the production of which has been shown to be increased by creating baryonic clusters at the colour reconnection stage [8]. Baryonic clusters, which are heavier by nature, would modify our model's strangeness production rates. Understanding the interplay between our new model and colour reconnection will be left for future work.
There is still much left to understand in soft physics, but understanding the correlations created between the various models in hadronization is imperative to having more precise and useful Monte Carlo event generators.

Fig. 12: Event-shape observables from ALEPH data [20,21], comparing the results of default Herwig to our new LEP tuned non-perturbative strangeness production scaling, for both mass and λ measures. The new scaling does not impact on event-shape observables.