Colour reconnections in Herwig++

Gieseke, Stefan; Röhr, Christian; Siódmok, Andrzej

doi:10.1140/epjc/s10052-012-2225-5

Colour reconnections in Herwig++

Special Article - Tools for Experiment and Theory
Open access
Published: 13 November 2012

Volume 72, article number 2225, (2012)
Cite this article

Download PDF

You have full access to this open access article

The European Physical Journal C Aims and scope Submit manuscript

Colour reconnections in Herwig++

Download PDF

Stefan Gieseke¹,
Christian Röhr¹ &
Andrzej Siódmok^1,2

2618 Accesses
155 Citations
Explore all metrics

Abstract

We describe the implementation details of the colour reconnection model in the event generator Herwig++. We study the impact on final-state observables in detail and confirm the model idea from colour preconfinement on the basis of studies within the cluster hadronization model. Moreover, we show that the description of minimum bias and underlying event data at the LHC is improved with this model and present results of a tune to available data.

CMS pythia 8 colour reconnection tunes based on underlying-event data

Article Open access 10 July 2023

Measurements of observables sensitive to colour reconnection in $$t{\bar{t}}$$ events with the ATLAS detector at $$\sqrt{s} =$$ 13 TeV

Article Open access 20 June 2023

Insights from the ALICE quark-gluon coloured world at the LHC

Article 13 October 2016

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

1 Introduction

High-energy hadronic collisions at the Large Hadron Collider (LHC) require a sound understanding of soft aspects of the collisions. All hard collisions are accompanied by the underlying event (UE) which adds hadronic activity in all phase space regions. The physics of the underlying event is similar to the physics in minimum bias (MB) interactions and very important to understand to quantify the impact of pile-up in high-luminosity runs at the LHC. A wide range of measurements at the Tevatron and the LHC gives us a good picture of MB interactions and the UE [1–13]. Data has also shown that a good part of the underlying event is due to hard multiple partonic interactions (MPI). By now, the three major Monte Carlo event generators Herwig [14], Pythia [15, 16] and Sherpa [17] have an MPI model implemented to simulate the underlying event.

Such a model of independent multiple partonic interactions was first implemented in Pythia [18] where its relevance for a description of hadron collider data was immediately shown. On a similar physics basis, but with some differences in the detailed modelling the jimmy add-on to the old Herwig program, was introduced [19]. In these models, the average number of additional hard scatters is calculated from a few input parameters and then for each hard event the additional number of hard scatters is sampled. The individual scatters in turn are modelled similarly to the primary hard scatters from QCD 2→2 interactions at leading order, with parton shower and hadronization applied as usual. The current underlying event model in Sherpa [17] is similar but will be replaced by a new approach [20]. The current model in Pythia differs from the original development in some details and follows the idea of interleaved partonic interactions and showering [21, 22].

In the recent releases of Herwig an MPI model is also included [23]. It comes with two main parameters, the minimum transverse momentum $p_{\perp }^{\min }$ of the additional hard scatters and the parameter μ ², that can be understood as the typical inverse proton radius squared and appears in the spatial transverse overlap of the incoming hadrons. Good agreement with Tevatron data was found with this model. Soft interactions were added to this model in order to improve consistency with more general theoretical input as the total cross section and the elastic slope parameter in high-energy hadronic collisions [24]. The distribution of transverse momenta in the non-perturbative region below $p_{\perp }^{\min }$ was modelled similarly to the proposal in [25]. Furthermore, it is assumed that the soft partons are distributed differently from the hard partons inside the hadron. The additional parameters introduced here are fixed by requiring a description of the total cross section and the slope parameter, so we are still left with only two parameters. Once again, a good description of Tevatron data on the UE was found, now also where softer interactions play a role. The model for soft interactions smoothly extrapolates from the perturbative into the non-perturbative region, similar to a model for intrinsic transverse momentum in initial-state radiation [26].

With the advent of new data from the LHC at 900 GeV [3] we also considered new observables and found distinct disagreement with data, e.g. in the pseudorapidity of charged particles. It was clear that our implementation was incomplete as we have not at all tried to modify the relative colour structure of the multiple hard scatters. In Fig. 1 we show the sensitivity to the parameter p _disrupt, which controls the colour structure of soft scatters and see a partial refill of the central rapidity plateau. This notable dependence on p _disrupt of soft scatters hints at the importance of colour correlations in a more complete model. Furthermore, we studied the dependence on other possible sources, e.g. on the parton distribution functions (PDF), which are used to extract the additional partons from the hadrons. In Fig. 2 we show the pseudorapidity of charged particles and the average transverse momentum as a function of particle multiplicity, 〈p _⊥〉(N _ch), at that stage. The lines represent different settings of the parameter of soft colour disruption and two different PDF sets: CTEQ6L1 [27] and MRST LO** [28]. We stress that all settings gave a good description of the Tevatron UE data. As discussed in more detail in [29–31], even a dedicated tuning of the MPI model parameters did not improve this description, which lead us to include a colour reconnection (CR) model in order to improve the colour structure between various hard scatters in the MPI model. The starting point is the idea of colour preconfinement [32]. While in a single hard interaction the colour structure is given by (the leading part of) the colour matrices that appear in the Feynman diagrams and also by the parton shower evolution, there is no such firm prescription for the assignment of colour lines or colour connections between individual hard scatters. Colour preconfinement leads us to the assumption that hard jets emerging from separate hard scatters should end up colour-connected when they are produced nearby in momentum space. As there is no such correlation in the non-perturbative modelling of the multiple hard interactions, we have to impose a model on it. Studies of such a model were carried out earlier in [33–35]. In this paper we describe the details of such a colour reconnection model and confirm this physical picture with various analyses of the modelled hadronic final state. Finally, we present results of tuning this model to the currently available data on MB interactions and the UE.

2 Modelling colour reconnections

The cluster hadronization model [36] is based on planar diagram theory [37]: The dominant colour structure of QCD diagrams in the perturbation expansion in 1/N _c can be represented in a planar form using colour lines, which is commonly known as the N _c→∞ limit. The resulting colour topology in Monte Carlo events with partons in the final state features open colour lines after the parton showers. Following a non-perturbative isotropic decay of any left gluons in the parton jets to light quark-antiquark pairs, the event finally consists of colour-connected partons in colour triplet or anti-triplet states. These parton pairs form colour-singlet clusters.

In dijet production via e ⁺ e ⁻ annihilation the invariant mass spectrum of these clusters is independent of the scale of the hard process [36, 38]. The mass distribution peaks at small values, $\mathcal{O}(1~\mbox{GeV})$, and quickly falls off at higher masses. Descriptively speaking, the cluster constituents tend to be close in momentum space. This property of perturbative QCD is referred to as colour preconfinement, as already stated above. The invariant cluster mass largely consists of the constituent rest masses, which gives rise to a pronounced peak at the parton rest mass threshold. Hence, clusters are interpreted as highly excited pre-hadronic states. In the cluster hadronization model hadrons normally arise from non-perturbative, isotropic cluster decays. The Herwig implementation of this hadronization model is described in more detail in Ref. [14].

The situation in hadron collisions is necessarily more complicated. In a typical QCD 2→2 scatter, there is QCD radiation from the initial-state parton shower accompanied by jets emerging from outgoing partons. Due to colour charge conservation, there are colour connections between the partonic subprocess and the two hadron remnants. As sketched in Fig. 3, the primary hard subprocess is modelled in Herwig as an interaction of two valence (anti)quarks [14]. Hence, in pp ($p\bar{p}$) collisions the hadron remnants are colour anti-triplets (triplets). The typical length scale of the valence parton extraction is the hadron size, $\mathcal{O}(1\,\mathrm{fm})$, corresponding to energies where perturbation theory is not applicable. Thus, perturbative QCD cannot be used to calculate or assess the colour correlation between the partonic subprocess and the beam remnants.

We face a similar situation if we consider multiple parton interactions in single hadron collisions. The MPI model in Herwig equips the event with a number of further QCD parton scatters, in addition to the primary partonic subprocess. For each of these subprocesses a pair of gluons, initiating the scatter, is extracted from the colliding hadrons. The chosen colour topology for this extraction corresponds to the N _c→∞ limit. As stated above, this limit is justified in perturbative branchings. In non-perturbative regimes, however, it is rather a QCD-motivated model than an assessable approximation.

As can be seen in the sketch in Fig. 7 below, the parton extraction model for the first and possible additional partonic subprocesses introduces colour lines, which connect subprocesses to each other and to the hadron remnants. As a result, clusters emerge in hadronic collisions which link different parts of the hadron collision. Clearly, these clusters cannot be expected to feature the same invariant-mass distribution as the clusters in e ⁺ e ⁻ dijet events do. Yet the cluster hadronization model for hadronic collisions is adopted unchanged. Colour reconnection intervenes at the stage right before hadrons are generated from the clusters. It provides the possibility to create clusters in a way which does not strictly follow the actual colour topology: The ends of the colour lines are reconnected, resulting in a different cluster configuration. This rearrangement of colour charges is pictorially shown in Fig. 4. Based on the successful role of preconfinement in e ⁺ e ⁻ collisions, we designed two colour reconnection models to work out colour singlets with invariant masses smaller than a priori given. The colour reconnection models studied in this paper differ in the underlying algorithm to find alternative cluster configurations.

2.1 Plain colour reconnection

A first model for colour reconnection has been implemented in Herwig as of version 2.5 [39]. We refer to it as the plain colour reconnection model (PCR) in this paper. The following steps describe the full procedure:

1.
Create a list of all quarks in the event, in random order. Perform the subsequent steps exactly once for every quark in this list.
2.
The current quark is part of a cluster. Label this cluster A.
3.
Consider a colour reconnection with all other clusters that exist at that time. Label the potential reconnection partner B. For the possible new clusters C and D, which would emerge when A and B are reconnected (cf. Fig. 4), the following conditions must be satisfied:
- The new clusters are lighter,
  $$ m_C+m_D < m_A+m_B , $$
  (1)
  where m _i denotes the invariant mass of cluster i.
- C and D are no colour octets.
4.
If at least one reconnection possibility could be found in step 3, select the one which results in the smallest sum of cluster masses, m _C+m _D. Accept this colour reconnection with an adjustable probability p _reco. In this case replace the clusters A and B by the newly formed clusters C and D.
5.
Continue with the next quark in step 2.

The parameter p _reco steers the amount of colour reconnection in the PCR model. Because of the selection rule in step 4, the PCR model tends to replace the heaviest clusters by lighter ones. A priori the model is not guaranteed to be generally valid because of the following reasons: The random ordering in the first step makes this algorithm non-deterministic since a different order of the initial clusters, generally speaking, leads to different reconnection possibilities being tested. Moreover, apparently quarks and antiquarks are treated differently in the algorithm described above.

2.2 Statistical colour reconnection

The other colour reconnection implementation studied in this paper overcomes the conceptual drawbacks of the PCR model. We refer to this model as statistical colour reconnection (SCR) throughout this work. In the first place, the algorithm aims at finding a cluster configuration with a preferably small colour length, defined as

$$ \lambda\equiv\sum_{i=1}^{ N_{\mathrm {cl}}} m_{i}^2 , $$

(2)

where N _cl is the number of clusters in the event and m _i is the invariant mass of cluster i. In the definition of the colour length we opt for squared masses to give cluster configurations with similarly heavy clusters precedence over configurations with less equally distributed cluster masses.

Clearly, it is impossible to locate the global minimum of λ, in general, since an event with 100 parton pairs, for instance, implies about 100!≈10¹⁵⁸ possible cluster configurations to be tested. The Simulated Annealing algorithm from Ref. [40], however, has proven useful in solving optimisation problems like this approximately. The SCR model is an application of this algorithm with λ as the objective function to be minimised.

The SCR algorithm selects random pairs of clusters and suggests them for colour reconnection. Just like in the PCR model, clusters consisting of splitting products of a colour-octet state are vetoed. A reconnection step which reduces λ is always accepted. If the reconnection raises the colour length, it is accepted with probability

$$ p = \exp\biggl( -\frac{\lambda_2-\lambda_1}{T} \biggr) , $$

(3)

where λ ₁ and λ ₂ denote the colour lengths before and after the reconnection, respectively. This gives the system the possibility to escape local minima in the colour length. The “temperature” T is a control parameter, which is gradually reduced during the procedure. At high temperatures, $T \geq\mathcal{O}(\lambda_{2} - \lambda_{1})$, the algorithm is likely to accept steps which raise λ. By contrast, lower temperatures imply a small probability for colour-length-increasing reconnection steps.

The transition from high to low temperatures is determined by the annealing schedule, which flexibly adapts to the number of clusters, N _cl, and to the colour length in the event. First, a starting temperature is determined from the typical change in the colour length, Δλ=λ ₂−λ ₁. To this end, a few random dry-run colour reconnections S are performed, all starting with the default cluster configuration. The initial temperature is set to

$$ T_{\mathrm{init}} \equiv c\cdot\underset{i \in S}{\mathrm{median}} \bigl\{ |\Delta\lambda|_i\bigr\} , $$

(4)

where c is a free parameter of the model. Using the median makes this definition less prone to outliers compared to the mean. The algorithm proceeds in steps with fixed temperature. At the end of each temperature step T decreases by a factor f, which is another free model parameter, with f∈(0,1). Each value of T is held constant for αN _cl reconnection attempts with another free parameter α. The algorithm stops as soon as no successful colour reconnections happen in a temperature step, but at most N _steps temperature steps are tested. We use the parameters c, α, f and N _steps, which are all related to the annealing schedule, to tune the SCR model to data.

We would like to stress that the annealing model is used only as a numerical tool to minimize the colour length introduced above and hence give no physical interpretation to the model parameters themselves. We argue later, that merely the idea of minimizing the colour length is indeed meaningful and physical.

3 Characteristics of colour reconnection

In this section we want to study hadronization-related quantities which allow us to understand colour reconnection from an event generator–internal point of view. Here, a set of typical values for c, α, f and N _steps in the SCR model, as well as for p _reco in the PCR model, was used, which was obtained from tunes to experimental data, as described below in Sect. 4.

3.1 Colour length drop

To quantify the effect of colour reconnection at generator level, we define the colour length drop

$$ \Delta _{\mathrm {if}}\equiv1 - \frac{ \lambda _{\mathrm {final}}}{ \lambda _{\mathrm {init}}} , $$

(5)

where λ _init and λ _final denote the colour length in an event before and after colour reconnection, respectively. Δ_if approximately vanishes in events with λ _init≈λ _final, i.e. with no or only minor changes in the colour length λ due to colour reconnection. The other extreme, Δ_if≈1, indicates a notable drop in λ.

The distribution of Δ_if for soft inclusive LHC events at 7 TeV is shown in Fig. 5(a). The plain and the statistical colour reconnection models result in similar distributions with pronounced peaks at 0 and 1. Note that Fig. 5 shows logarithmic plots, so the plateau in between the peaks is really low. There is also a small fraction of events with negative Δ_if, though. The colour reconnection procedure actually raises λ in these events. In the SCR algorithm, this can happen since λ-raising steps are explicitly allowed with a certain probability, cf. Eq. (3). However, also the PCR algorithm might potentially raise λ since the reconnection condition, Eq. (1), is formulated in terms of the first power of cluster masses, whereas λ is defined as the sum of squared cluster masses. As these events are rare, we expect no impact on physical observables.

With soft inclusive hadron-hadron generator settings there are, generally speaking, two important classes of events. One of the two are events where there is no notable change in the sum of squared cluster masses, λ. In another large fraction of events, however, colour reconnection causes an extreme drop in λ. An obvious interpretation for this drop is that the colour reconnection procedure replaces disproportionally heavy clusters by way lighter ones.

This shift in the cluster mass spectrum, which both models aim at by construction, can also be observed directly. Figure 6 shows the cluster mass distribution before and after colour reconnection. As expected and also intended, both CR procedures cause the distribution to be enhanced in the low-mass peak region and suppressed in its, potentially unphysical, high-mass tail.

In Fig. 5(b) we show the colour length drop in hard dijet events in pp collisions. We observe a notable decrease of large colour length drops, Δ_if=1, with increasing cut on the jet transverse momentum at parton level. The reason for this decrease is that higher momentum fractions are required for the hard dijet subprocess, whereas in soft events the remaining momentum fraction of the proton remnants is higher. Hence clusters containing a proton remnant are less massive in hard events, which implies less need for colour reconnection.

The distribution of the colour length drop in e ⁺ e ⁻ annihilation events looks completely different, as shown in Fig. 5(c). We find that colour reconnection has no impact on the colour length in the bulk of dijet events. We show only the Δ_if distribution from the SCR model here, but the PCR model yields similar results. These results confirm that due to colour preconfinement partons nearby in momentum space in most cases are combined to colour singlets already. In events with hadronic W pair decays, however, hadrons emerge from two separate colour singlets. If there is a phase space overlap of the two parton jet pairs, the production of hadrons is expected to be sensitive to colour reconnection. We address this question later on in Sect. 4.1. Here we want to remark that the fraction of WW events with non-vanishing colour length drop is slightly higher than for the dijet case. Nevertheless, the vast majority of WW events is not affected by colour reconnection, too.

3.2 Classification of clusters

These results generically raise the question which mechanism in the hadron event generation is responsible for these overly heavy clusters. To gain access to this issue, we classify all clusters by their ancestors in the event history. A sketch of the three types of clusters in shown in Fig. 7.

The first class are the clusters consisting of partons emitted perturbatively in the same partonic subprocess. We call them h-type (hard) clusters.
The second class of clusters are the subprocesses-interconnecting clusters, which combine partons generated perturbatively in different partonic subprocesses. They are labelled as i-type (interconnecting) clusters.
The remaining clusters, which can occur in hadron collision events, are composed of at least one parton created non-perturbatively, i.e. during the extraction of partons from the hadrons or in soft scatters. In what follows, these clusters are called n-type (non-perturbative) clusters.

First we use this classification to analyse hadron collision events as they are immediately before colour rearrangement. For that purpose, we define the cluster fraction functions

$$ f_a( m_{\mathrm {cut}}) \equiv N_a( m_{\mathrm {cut}}) \Big/ \sum _{b=h,i,n} N_b( m_{\mathrm {cut}}) = \frac{N_a( m_{\mathrm {cut}})}{ N_{\mathrm {cl}}} , $$

(6)

where N _a(m _cut) is the number of a-type clusters (a=h,i,n) with m≥m _cut, counted in a sufficiently large number of events.^{Footnote 1} For instance, f _i(100 GeV)=0.15 says 15 % of all clusters with a mass larger than 100 GeV are subprocess-interconnecting clusters. By construction, f _a(m _cut) is a number between 0 and 1 for every class a. Moreover, the cluster fraction functions satisfy

$$ \sum_{a=h,i,n} f_a( m_{\mathrm {cut}}) = 1 . $$

Figure 8 shows the cluster fraction functions for LHC dijet events at $\sqrt{s} = 7~\mbox{TeV}$. The fraction of non-perturbative clusters increases with m _cut and exceeds 0.5 at m _cut≈70 GeV. So for an increasing threshold m _cut up to values well beyond physically reasonable cluster masses of a few GeV, the contribution of n-type clusters becomes more and more dominant.

A bin-by-bin breakdown to the contributions of the various cluster types to the total cluster mass distribution is shown in Fig. 9. There are several things to learn from those plots. First, non-perturbative n-type clusters do not contribute as much to the peak region, say below 6 GeV, as perturbative h-type and i-type clusters do. In the high-mass tail, however, n-type clusters clearly dominate, as already indicated by the cluster fraction functions discussed above. Both their minor contribution at low masses and their large contribution at high masses do not change after colour reconnection. In total, however, the mass distribution is more peaked after colour reconnection and the high-mass tail is suppressed by a factor larger than 10.

3.3 Resulting physics implications

The characteristics of clusters that have been studied in this section clearly confirm the physical picture we have started out with. The colour reconnection model in fact reduces the invariant masses of clusters that are mostly of non-perturbative origin. These arise as an artefact of the way we colour-connect additional hard scatters in the MPI model with the rest of the event.

At this non-perturbative level we have no handle on the colour information from theory, hence we have modelled it. First in a very naïve way when we extract the ‘first’ parton from the proton, but only to account for a more physical picture later, where we use colour preconfinement as a guiding principle. We therefore conclude that our ansatz to model colour reconnections in the way we have done it reproduces a meaningful physical picture.

4 Tuning and comparison of the model results with data

In this section we address the question of whether the MPI model in Herwig, equipped with the new CR model, can improve the description of the ATLAS MB and UE data, see Fig. 2. To that end we need to find values of free parameters (tune parameters) of the MPI model with CR that allow to get the best possible description of the experimental data. Since both CR models can be regarded as an extension of the cluster model [36], which is used for hadronization in Herwig, the tune of Herwig with CR models may require a simultaneous re-tuning of the hadronization model parameters to a wide range of experimental data, primarily from LEP (see Appendix D from Ref. [14]). Therefore, we start this section by examining whether the description of LEP data is sensitive to CR parameters.

4.1 Validation against e ⁺ e ⁻ LEP data

Already in Sect. 3 we have seen that the colour structure of LEP final states is well-defined by the perturbative parton shower evolution. Moreover, the CR model does not change this structure significantly. Therefore, although CR is an extension of hadronization, we can expect that the default hadronization parameters are still valid in combination with CR. This was confirmed by comparing Herwig results with and without CR against a wide range of experimental data from LEP [41–49]. As an example we show a comparison of Herwig without and with CR (using the main tunes for both CR methods presented in this paper) to two LEP observables in Fig. 10. The full set of plots, showing that the LEP data description in Herwig with and without CR is of the same quality, can be found on the Herwig and MCplots web pages [50, 51]. These results allow us to factorize the tuning procedure: The well-tested default Herwig tune for parton shower and hadronization parameters is retained, and only the parameters from the CR and MPI models are tuned to hadron collider data. However, we have checked each tune presented in this paper against LEP results.

In addition to the analyses used for the hadronization tuning, there are LEP analyses dedicated to colour reconnection in $W^{+}W^{-} \to(q\bar {q})(q\bar{q})$ events [52–55], originally proposed in Ref. [56]. In those analyses the W bosons are reconstructed via kinematic cuts on all possible jet pairs in four-jet events. The particle flow between jets originating from different bosons was expected to be enhanced in Monte Carlo models including colour reconnection. However, only moderate sensitivity to the tested CR models could be found at the time. We have confirmed this with our colour reconnection implementations. In Fig. 11 we show the sensitivity of the particle flow between the identified jets to the reconnection strength in the PCR model, compared to DELPHI data from Ref. [52]. We observe a slight improvement in the description of the data. A number of apparent outliers in the experimental data, however, indicate possibly too optimistic systematic errors in the experimental analysis. For that reason, no clear constraints on the model can be deduced from the data.

As the W bosons are produced on shell and significantly boosted at $\sqrt{s}=189~\mbox{GeV}$, the finite W width can cause the two W bosons to travel long distances before decaying. In the limit of a very small W width, large reconnection effects between the two W systems should thus be suppressed in the model. The moderate sensitivity of the particle flow to colour reconnections implies, however, that colour reconnection effects are small in WW events. Note that also the largely vanishing colour length drop in WW events, cf. Fig. 5(c) and the discussion in Sect. 3.1, supports this conclusion. Hence we retain the described generic reconnection models also for WW events and do not introduce an extra suppression mechanism.

4.2 Tuning to data from hadron colliders

Now that we have validated the CR models by comparison against LEP data, we are ready to tune their parameters to data provided by hadron colliders. Before LHC data was available, the MPI model in Herwig [24] was tuned by subdividing the two-dimensional parameter space, spanned by the model’s main parameters, the inverse proton radius squared μ ² and the minimum transverse momentum $p_{\perp }^{\min }$, into a grid. For each of the parameter points on this grid, the total χ ² against the Tevatron underlying-event data [1, 57] was calculated. A region in the parameter plane was found, where similarly good values for the overall χ ² could be obtained.

While tuning the MPI models including colour reconnection we are dealing with a larger number N of tunable parameters p _i, where N=4 in case of the PCR (p _disrupt, p _reco, $p_{\perp }^{\min }$ and μ ²) and N=7 in case of the SCR model (p _disrupt, $p_{\perp }^{\min }$, μ ², α, c, f and N _steps). Hence the simple tuning strategy from above is ineffective. A comprehensive scan of 7 parameters, with 10 divisions in each parameter would require too much CPU time.

Instead, we use a parametrization-based tune method which is much more efficient for our case. The starting point for this tuning procedure is the selection of a range $[p_{i}^{\rm min},\, p_{i}^{\rm max}]$ for each of the N tuning parameters p _i. Event samples are generated for random points of this N-dimensional hypercube in the parameter space. The number of different points depends on the number of input parameters to ensure a well converging behaviour of the final tune. Each generated event is directly handed over to the Rivet package [58] to analyse the generated events. This allows the computation of observables for each parameter point, which construct the input for the tuning process. The obtained distributions of observables for each parameter variation are the starting point for the main part of the tune, which is achieved using the Professor framework [59]. Professor parametrizes the generator response to the probed parameter points. In that way it finds the set of parameters, which fits the selected observables best. The user is able to affect the tuning by applying a weight for each observable, which specifies the impact of the variable for the tuning process.

4.2.1 Tuning to minimum-bias data

As we initially were primarily aiming at an improved description of MB data, we started by tuning the PCR model to ATLAS MB data. Since currently there is no model for soft diffractive physics in Herwig, we use the diffraction-reduced ATLAS MB measurement with an additional cut on the number of charged particles, N _ch≥6. The observables we used for the tune are the pseudorapidity distribution of the charged particles, the charged multiplicity, the charged-particle transverse momentum spectrum and the average transverse momentum measured as a function of the number of charged particles. All four available MB observables entered the tune with equal weights. The results of this tune are shown by the blue lines in Fig. 12. The bottom right figure shows that colour reconnection helps to achieve a better description of 〈p _T〉(N _ch). Also the other three distributions are now well described. We conclude that the CR model was the missing piece of the MPI model in Herwig++. We clearly improve the description of the pseudorapidity distribution.

4.2.2 Tuning to underlying-event data

The next important question was whether the new model is able to describe the UE data collected by ATLAS at 7 TeV [4]. The measurements are made relative to a leading object (the hardest charged track in this case). Then, the transverse plane is subdivided in azimuthal angle ϕ relative to this leading object at ϕ=0. The region around the leading object, |ϕ|<π/3, is called the “towards” region. The opposite region, where we usually find a recoiling hard jet, |ϕ|>2π/3, is called “away” region, while the remaining region, transverse to the leading object and its recoil, where the underlying event is expected to be least ‘contaminated’ by activity from the hard subprocess, is called “transverse” region. Again, we only focus on the tuning of the PCR model here. For the underlying-event tune two observables were used: The mean number of stable charged particles per unit of η–ϕ, 〈d² N _ch/dη dϕ〉, and the mean scalar p _⊥ sum of stable particles per unit of η–ϕ, 〈d²∑p _t/dη dϕ〉, both as a function of $p_{\perp }^{\mathrm {lead}}$, with charged particles in the kinematic range p _⊥>500 MeV and |η|<2.5.

The resulting tune, named ue7-2, gives very satisfactory results not only for the tuned observables but also for all other observables provided by ATLAS in Ref. [4]. In Figs. 13(c), 14(c) and 15(c), we show 〈d² N _ch/dη dϕ〉 and 〈d²∑p _t/dη dϕ〉 as a function of $p_{\perp }^{\mathrm {lead}}$ for p _⊥>500 MeV in the “transverse”, “away” and “toward” regions, compared to the Herwig++ ue7-2 results.

We repeated the tuning process for the UE data collected by ATLAS at 900 GeV and CDF at 1800 GeV, and obtained as good results as for 7 TeV (not shown in Figs. 13, 14 and 15 for the sake of simplicity). It is worth mentioning that the ATLAS UE observables with the lower p _⊥ cut on the charged particles, p _⊥>100 MeV, were not available during the preparation of the ue7-2 tune but are also well described by the tune, see Fig. 16(c). These results can therefore be considered as a prediction of the model.

Figure 17 shows the angular distributions of the charged-particle multiplicity and ∑p _⊥, with respect to the leading charged particle (at ϕ=0). The data sets are shown for four different cut values in the transverse momentum of the leading charged particle, $p_{\perp }^{\mathrm {lead}}$. With increasing cut on $p_{\perp }^{\mathrm {lead}}$, the development of a jet-like structure can be observed. The overall description of the data is satisfactory but we can also see that the description improves as the lower cut value in $p_{\perp }^{\mathrm {lead}}$ increases as then the description is more driven by perturbation theory. The full comparison with all ATLAS UE and MB data sets is available on the Herwig tune page [50]. At this stage different UE tunes were mandatory for different hadronic centre-of-mass energies $\sqrt{s}$. In the next section we address the question of whether an energy-independent UE tune can be obtained using the present model.

4.2.3 Centre-of-mass energy dependence of UE tunes

To study the energy dependence of the parameters properly, we examine a set of observables at different collider energies, whose description is sensitive to the MPI model parameters. The experimental data should be measured at all energies in similar phase-space regions and under not too different trigger conditions. These conditions were met by two UE observables: 〈d² N _ch/dη dϕ〉 and 〈d²∑p _t/dη dϕ〉, both measured as a function of $p_{\perp }^{\mathrm {lead}}$ (with $p_{\perp }^{\mathrm {lead}}< 20~\mbox{GeV}$) by ATLAS at 900 and 7000 GeV (with p _⊥>500 MeV) and by CDF at 1800 GeV. Let us first focus on the PCR model. In this case we have four free model parameters, p _disrupt, p _reco, $p_{\perp }^{\min }$ and μ ². For each hadronic centre-of-mass energy we performed independent four-dimensional tunings. Note that $p_{\perp }^{\mathrm {lead}}$ denotes the transverse momentum of the hardest track in the case of ATLAS, whereas the CDF underlying-event analysis uses the p _⊥ of the leading jet, which we call $p_{\perp }^{\mathrm {lead}}$ here, as well.

Figure 18 shows the spread of the tuning results for each parameter against Professor’s heuristic χ ². In the first row we present results for 900 GeV and in the second row for 7 TeV. Each point is from a separate tune, made using various combinations of generator runs at different points in the parameter space. We see that the parameters are not well constrained and are sensitive to the input Monte Carlo (MC) runs. This is due to what we have already seen during the tuning of the MPI model without CR [23, 24, 60] to Tevatron data, namely the strong and constant correlation between $p_{\perp }^{\min }$ and μ ². This correlation reflects the fact that a smaller hadron radius always balances against a larger p _⊥ cutoff, as far as the underlying-event activity is concerned. With one of these two parameters fixed, the remaining parameters are much less sensitive to the input MC runs.

The most important information we can see on these figures is that the experimental data for the two different c.m. energies (900 GeV and 7 TeV) cannot be described by the same set of model parameters. More precisely, the experimental data prefers different $p_{\perp }^{\min }$ values for different hadronic centre-of-mass energies, while the rest of the parameters may perhaps remain independent of the energy. This observation led us to the creation of energy-extrapolated UE tunes, named ue-ee-3, in which all parameters are fixed except for $p_{\perp }^{\min }$, which varies with energy. We summarize the tune values for $p_{\perp }^{\min }$ at different energies in Table 1. The other model parameters, which do not depend on the c.m. energy, are given in Table 2.

Table 1 Tune values for $p_{\perp }^{\min }$. All other model parameters, which do not depend on the c.m. energy, are summarized in Table 2

Full size table

Table 2 Parameters of the energy-extrapolating underlying-event tunes. The last two parameters describe the running of $p_{\perp }^{\min }$ according to Eq. (7)

Full size table

Since by construction the MPI model depends on the PDF set, we created two separate energy-extrapolated tunes for the CTEQ6L1 and MRST LO** PDFs. In general, both tunes yield similar and satisfactory descriptions of experimental data.^{Footnote 2} As an example see Fig. 16, in which we compare the ue-ee-3 and ue-ee-3-cteq6l1 tunes to ATLAS UE observables, measured in all three regions (toward, transverse and away).

We repeated this procedure also for the SCR model. However, since in this case the tuning procedure was more complicated, as explained below, we concentrated on one PDF set only, namely CTEQ6L1. The first obvious complication was the larger number of parameters to tune. The second complication was associated with the fact that one of the tuning parameters, N _steps, is an integer number. The current version of Professor, however, does not provide such an option, instead it treats all parameters as real numbers. Therefore, we decided to carry out fifty separate tunes for different fixed values of N _steps, starting from 1 to 50. The last problem that we encountered, which is probably associated with the two previously mentioned problems, was that for some parameter values the predictions from Professor were significantly different from the results we received directly from Herwig++ runs. Initially, we increased the order of the interpolating polynomials from second to fourth, which should improve Professor’s predictions, but this did not improve the situation. Therefore, we first identified regions of the parameter space where this problem appeared most frequently and then excluded these from the tuning procedure. As a result, we obtained an energy-extrapolated underlying-event tune for the SCR model, which we call ue-ee-scr-cteq6l1.

In Figs. 13, 14 and 15 we show a comparison of the PCR and SCR energy-extrapolated (CTEQ6L1) tunes and the ue7-2 tune against 〈d² N _ch/dη dϕ〉 and 〈d²∑p _t/dη dϕ〉 as a function of $p_{\perp }^{\mathrm {lead}}$ for p _⊥>500 MeV in all three regions (toward, transverse and away) and at three different collider energies. We can see that the quality of the data description is high and at the same level for all tunes. Nevertheless, we favour the SCR model as here we have a clearer physics picture and a more flexible model.

In the last step, we parametrized the $p_{\perp }^{\min }$ dependence. In a first attempt we have chosen a logarithmic function to extrapolate $p_{\perp }^{\min }$ to energies different from the tune energies. Therefore we fitted a function of the form $p_{\perp }^{\min }(s) = A\,\log(\sqrt{s}/B)$, where A and B are free fit parameters, to the three $p_{\perp }^{\min }$ values obtained in the ue-ee-3 tune. The fit is shown in Fig. 19. Based on this, we provide UE tunes for c.m. energies the LHC was or will be operating at. Since the logarithmic form is not very stable for lower energies, we have replaced this ansatz with a power law, see also e.g. [61],

$$ p_{\perp }^{\min }(s) = p_{\perp ,0}^{\min }\biggl(\frac{\sqrt{s}}{E_0} \biggr)^b\ . $$

(7)

This is the default parametrization of the energy dependence from Herwig++ release 2.6 [62]. The default value of E ₀ is 7 TeV. For the collider energies at consideration in our tunes there are no significant differences in all observables due to this change. The values for b and $p_{\perp ,0}^{\min }$, which we find by fitting Eq. (7) to the $p_{\perp }^{\min }$ values from Table 1, are summarized in the last two rows of Table 2.

For the preparation of the energy-extrapolated tunes we did not use any MB observables. Nevertheless, we show a comparison of the ue-ee-3-cteq6l1 and ue-ee-scr-cteq6l1 tunes to the diffraction-reduced ATLAS MB data at 7 TeV (with N _ch≥6) in Fig. 20. We see that the data is described slightly better by the SCR than by the PCR tune. Moreover, although these data sets were not taken into account in both tunes, the results are close to the experimental data.

In the future, we plan to study the energy scaling of the model parameters using diffraction-reduced minimum-bias data, and then, in more detail, the possibility of achieving a common description of the UE and MB data, cf. [63]. As can be seen in Fig. 21, the UE tunes fail to reproduce the ATLAS MB data at 7 TeV with a less tight cut on the number of charged particles, N _ch≥2, and where all charged particles with p _⊥>100 MeV are taken into account. This is not surprising, however, since Herwig lacks a model for soft diffractive physics so far. That explains the poor description of both the charged multiplicity and the average transverse momentum in the low-multiplicity bins. On the other hand, the unsatisfactory description of the shown observables in the high multiplicity tail may indicate missing physics in the model. It might, however, as well be resolved by a dedicated MB tune. Both possibilities are left for future work. In particular, we point out the lack of an explicit model for diffractive events. A more complete description of the MB data should also include a modelling of these.

5 Conclusions

We have introduced two different models for non-perturbative colour reconnections in Herwig. The models are of slightly different computational complexity but give very similar results. The tuning results have shown that the SCR is preferred to have parameters that force a quick ‘cooling’ of the system and therefore results in a very similar model evolution as in the simpler PCR model. We therefore consider the PCR as a special case of the SCR model for quick cooling and keep the SCR as the more flexible model for future versions of Herwig++. As a consequence, we understand that the data demands a final state that does not obey a perfectly minimized colour length. We interpret this as a model limitation. At some point the picture of colour lines breaks down. Colour lines themselves are only a valid prescription up to leading order in the N _C→∞ limit. Furthermore, the mechanism addresses the non-perturbative regime where the picture of the colour triplet charges themselves is already a model by itself and possibly completely washed out.

We have studied the mechanism of colour reconnection in detail and found that in fact the non-perturbative parts of the simulation demand the colour reconnection mechanism in order to repair the lack of information on the colour flow. The intuitive picture we have based our model on could be verified. The idea of colour preconfinement is meaningful in the context of the hadronization model and has to be rectified when a model of multiple partonic interactions is applied without further information on the colour structure in between the multiple scatters.

Furthermore, we have shown that by tuning the MPI model with CR we can obtain a proper description of non-diffractive MB ATLAS observables. We present the energy-extrapolated tune ue-ee-3, which is an important step towards the understanding of the energy dependence of the model. Finally, we have unified the different tunes of the MPI model in Herwig++ into a simple parametrization of the $p_{\perp }^{\min }$ dependence in a way that allows us to describe data at different energies with only one set of parameters. News concerning Herwig tunes are available on the tune wiki page [50].

Notes

Apparently, f _a(m _cut) is only well-defined for m _cut less than the maximum cluster mass. On this interval, the series (f _a,n), with n the number of events taken into account, converges pointwise to the function f _a. This is a more formal definition of the cluster fraction functions.
The only difference is that the CTEQ6L1 gives more flexibility in the choice of the model parameters.

References

A.A. Affolder et al. (CDF Collaboration), Phys. Rev. D 65, 092002 (2002)
Article ADS Google Scholar
T. Aaltonen et al. (CDF Collaboration), Phys. Rev. D 82, 034001 (2010)
Article ADS Google Scholar
G. Aad et al. (ATLAS Collaboration), Phys. Lett. B 688, 21 (2010)
Article ADS Google Scholar
G. Aad et al. (ATLAS Collaboration), Phys. Rev. D 83, 112001 (2011)
Article ADS Google Scholar
G. Aad et al. (ATLAS Collaboration), New J. Phys. 13, 053033 (2011)
Article ADS Google Scholar
G. Aad et al. (ATLAS Collaboration), Eur. Phys. J. C 71, 1636 (2011)
Article ADS Google Scholar
V. Khachatryan et al. (CMS Collaboration), J. High Energy Phys. 1002, 041 (2010)
Article ADS Google Scholar
V. Khachatryan et al. (CMS Collaboration), Phys. Rev. Lett. 105, 022002 (2010)
Article ADS Google Scholar
V. Khachatryan et al. (CMS Collaboration), Phys. Lett. B 699, 48 (2011)
Article ADS Google Scholar
S. Chatrchyan et al. (CMS Collaboration), J. High Energy Phys. 1109, 109 (2011)
Article ADS Google Scholar
K. Aamodt et al. (ALICE Collaboration), Eur. Phys. J. C 68, 89 (2010)
Article ADS Google Scholar
K. Aamodt et al. (ALICE Collaboration), Eur. Phys. J. C 68, 345 (2010)
Article ADS Google Scholar
K. Aamodt et al. (ALICE Collaboration), Phys. Lett. B 693, 53 (2010)
Article ADS Google Scholar
M. Bähr et al., Eur. Phys. J. C 58, 639 (2008)
Article ADS Google Scholar
T. Sjöstrand, S. Mrenna, P. Skands, J. High Energy Phys. 05, 026 (2006)
Article ADS Google Scholar
T. Sjöstrand, S. Mrenna, P. Skands, Comput. Phys. Commun. 178, 852 (2008)
Article ADS Google Scholar
T. Gleisberg et al., J. High Energy Phys. 02, 007 (2009)
Article ADS Google Scholar
T. Sjöstrand, M. van Zijl, Phys. Rev. D 36, 2019 (1987)
Article ADS Google Scholar
J.M. Butterworth, J.R. Forshaw, M.H. Seymour, Z. Phys. C 72, 637 (1996)
ADS Google Scholar
V. Khoze, F. Krauss, A. Martin, M. Ryskin, K. Zapp, Eur. Phys. J. C 69, 85 (2010)
Article ADS Google Scholar
T. Sjöstrand, P.Z. Skands, J. High Energy Phys. 03, 053 (2004)
Article ADS Google Scholar
T. Sjöstrand, P.Z. Skands, Eur. Phys. J. C 39, 129 (2005)
Article ADS Google Scholar
M. Bähr, S. Gieseke, M.H. Seymour, J. High Energy Phys. 07, 076 (2008)
Article ADS Google Scholar
M. Bähr, J.M. Butterworth, S. Gieseke, M.H. Seymour, 0905.4671 (2009)
I. Borozan, M.H. Seymour, J. High Energy Phys. 09, 015 (2002)
Article ADS Google Scholar
S. Gieseke, M.H. Seymour, A. Siódmok, J. High Energy Phys. 06, 001 (2008)
Article ADS Google Scholar
J. Pumplin et al., J. High Energy Phys. 07, 012 (2002)
Article ADS Google Scholar
A. Sherstnev, R.S. Thorne, Eur. Phys. J. C 55, 553 (2008)
Article ADS Google Scholar
S. Gieseke, S. Plätzer, C. Röhr, A. Siódmok, DESY-PROC-2010-01 (2010). Available from http://plhc2010.desy.de/proceedings
S. Gieseke, C. Röhr, A. Siódmok, 1110.2675 (2011)
P. Bartalini et al., 1111.0469 (2011)
D. Amati, G. Veneziano, Phys. Lett. B 83, 87 (1979)
Article ADS Google Scholar
M. Sandhoff, P.Z. Skands, hep-ph/0604120 (2005)
P.Z. Skands, D. Wicke, Eur. Phys. J. C 52, 133 (2007)
Article ADS Google Scholar
D. Wicke, P.Z. Skands, Nuovo Cimento B 123, S1 (2008)
ADS Google Scholar
B.R. Webber, Nucl. Phys. B 238, 492 (1984)
Article ADS Google Scholar
G. ’t Hooft, Nucl. Phys. B 72, 461 (1974)
Article ADS Google Scholar
S. Gieseke, A. Ribon, M.H. Seymour, P. Stephens, B. Webber, J. High Energy Phys. 02, 005 (2004)
Article ADS Google Scholar
S. Gieseke et al., 1102.1672 (2011)
S. Kirkpatrick, C.D. Gelatt, M.P. Vecchi, Science 220, 671 (1983)
Article ADS MathSciNet Google Scholar
G. Abbiendi et al. (OPAL Collaboration), Eur. Phys. J. C 40, 287 (2005)
Article ADS Google Scholar
G. Abbiendi et al. (OPAL Collaboration), Eur. Phys. J. C 20, 601 (2001)
Article ADS Google Scholar
K. Ackerstaff et al. (OPAL Collaboration), Eur. Phys. J. C 7, 369 (1999)
Article ADS Google Scholar
P. Pfeifenschneider et al. (JADE Collaboration), Eur. Phys. J. C 17, 19 (2000)
Article ADS Google Scholar
P. Abreu et al. (DELPHI Collaboration), Z. Phys. C 67, 543 (1995)
ADS Google Scholar
P. Abreu et al. (DELPHI Collaboration), Z. Phys. C 73, 11 (1996)
Google Scholar
R. Barate et al. (ALEPH Collaboration), Phys. Rep. 294, 1 (1998)
Article ADS Google Scholar
D. Decamp et al. (ALEPH Collaboration), Phys. Lett. B 273, 181 (1991)
Article ADS Google Scholar
A. Heister et al. (ALEPH Collaboration), Eur. Phys. J. C 35, 457 (2004)
Article ADS Google Scholar
Herwig++ Collaboration, http://herwig.hepforge.org/, http://herwig.hepforge.org/trac/wiki/MB_UE_tunes
P. Skands, A. Karneyeu, D. Konstantinov, M. Mangano, L. Mijovic, W. Pokorski, S. Prestel, A. Pytel, http://mcplots.cern.ch/
J. Abdallah et al. (DELPHI Collaboration), Eur. Phys. J. C 51, 249 (2007)
Article ADS Google Scholar
P. Achard et al. (L3 Collaboration), Phys. Lett. B 561, 202 (2003)
Article ADS Google Scholar
T. Ziegler (ALEPH Collaboration), ALEPH-2001-047
G. Abbiendi et al. (OPAL Collaboration), Eur. Phys. J. C 45, 291 (2006)
Article ADS Google Scholar
D. Duchesneau, New method based on energy and particle flow in e ⁺ e ⁻→W ⁺ W ⁻→ hadron events for color reconnection studies. LAPP-EXP-2000-02 (2000)
D. Acosta et al. (CDF Collaboration), Phys. Rev. D 70, 072002 (2004)
Article ADS Google Scholar
A. Buckley et al., 1003.0694 (2010)
A. Buckley, H. Hoeth, H. Lacker, H. Schulz, J.E. von Seggern, Eur. Phys. J. C 65, 331 (2010)
Article ADS Google Scholar
M. Bähr, J.M. Butterworth, M.H. Seymour, J. High Energy Phys. 01, 065 (2009)
Article ADS Google Scholar
M. Ryskin, A. Martin, V. Khoze, Eur. Phys. J. C 71, 1617 (2011)
Article ADS Google Scholar
K. Arnold et al., 1205.4902 (2012)
H. Schulz, P. Skands, Eur. Phys. J. C 71, 1644 (2011)
Article ADS Google Scholar

Download references

Acknowledgements

We are grateful to the other members of the Herwig collaboration for critical discussions and support. We acknowledge financial support from the Helmholtz Alliance “Physics at the Terascale”. This work was funded in part (AS) by the Lancaster-Manchester-Sheffield Consortium for Fundamental Physics under STFC grant ST/J000418/1.

Open Access

This article is distributed under the terms of the Creative Commons Attribution License which permits any use, distribution, and reproduction in any medium, provided the original author(s) and the source are credited.

Author information

Authors and Affiliations

Institut für Theoretische Physik, Karlsruhe Institute of Technology (KIT), Karlsruhe, Germany
Stefan Gieseke, Christian Röhr & Andrzej Siódmok
Consortium for Fundamental Physics, School of Physics and Astronomy, The University of Manchester, Manchester, UK
Andrzej Siódmok

Authors

Stefan Gieseke
View author publications
You can also search for this author in PubMed Google Scholar
Christian Röhr
View author publications
You can also search for this author in PubMed Google Scholar
Andrzej Siódmok
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Christian Röhr.

Rights and permissions

Open Access This article is distributed under the terms of the Creative Commons Attribution 2.0 International License (https://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

Reprints and permissions

About this article

Cite this article

Gieseke, S., Röhr, C. & Siódmok, A. Colour reconnections in Herwig++. Eur. Phys. J. C 72, 2225 (2012). https://doi.org/10.1140/epjc/s10052-012-2225-5

Download citation

Received: 16 October 2012
Revised: 24 October 2012
Published: 13 November 2012
DOI: https://doi.org/10.1140/epjc/s10052-012-2225-5

Keywords

Use our pre-submission checklist

Avoid common mistakes on your manuscript.

Colour reconnections in Herwig++

Abstract

Similar content being viewed by others

CMS pythia 8 colour reconnection tunes based on underlying-event data

Measurements of observables sensitive to colour reconnection in $$t{\bar{t}}$$ events with the ATLAS detector at $$\sqrt{s} =$$ 13 TeV

Insights from the ALICE quark-gluon coloured world at the LHC

1 Introduction