1 Introduction

Monte Carlo generators are essential tools, both for the design of future experiments and the analysis of data from the LHC, and previous collider experiments. Modern event generators [1,2,3] provide a simulation of exclusive eventsbased on the combination of fixed-order perturbative results, resummation of large logarithms of scales using the parton-shower approach and non-perturbative models of hadronisation and multiple-parton scattering.Footnote 1

These simulations rely on universality and factorisation in order to construct a simulation of the complex final states observed in hadronic collisions. This allows the simulation of final-state radiation in the parton shower and the non-perturbative hadronisation models to be first developed, and the parameters of the model tuned, using the simpler and cleaner environment of \(e^+e^-\) collisions, and then applied to more complicated hadronic collisions. These models are then combined with the parton-shower simulation of initial-state radiation, a multiple scattering model of the underlying event and a non-perturbative colour reconnection model in order to describe hadronic collisions. In principle, universality requires that the colour reconnection model is also used to describe leptonic collisions. In practice, however, colour reconnection has little effect on the distributions which so far have been used to develop and tune the models. These models are therefore usually either not included at all for the simulation of leptonic collisions, or, if they are, the parameters are determined by tuning to hadronic data sensitive to multiple partonic scattering.

As the LHC accumulates data at an unprecedented rate there are a number of observables which are not well described by current Monte Carlo event generators, and where the limitations of this approach have started to become obvious, for example:

  • the difference in the properties of jets initiated by quarks and gluons is not well described with generators predicting either a larger or smaller difference between the jets than is observed by the LHC experiments [5];

  • the transverse momentum spectra of identified baryons and strange hadrons which are not well described by current generators [6];

  • long-range correlations in high multiplicity events [7, 8].

In this paper we will focus on improvements to the perturbative and non-perturbative modelling to give a better description of both quark- and gluon-initiated jets, as well as the differences between them in Herwig 7. Beyond leading order there is no clear distinction between quark and gluon jets and the definition will depend on the analysis.Footnote 2 As \(e^+e^-\) annihilation to hadrons starts with an initial partonic quark–antiquark configuration the data used to develop the final-state parton-shower algorithm, tune its parameters and those of the hadronisation model, are dominated by quark-initiated jets. However, at the LHC jets initiated by gluons can often dominate, depending on the production process, rapidity and transverse momentum of the jets. Regrettably while there is great interest in the differences between quark and gluon jets at the LHC most of the experimental studies have concentrated on differentiating between quark and gluon jets using neural network, or similar, techniques which makes a direct comparison with simulated hadron-level events impossible. We will therefore use some recent data from the ATLAS experiment [10] which is sensitive to both quark and gluon jet properties, together with data on gluon jets in \(e^+e^-\) collisions from the OPAL experiment [11, 12] which has not previously been used in the development and tuning of the current generation of Monte Carlo event generators to study the properties of gluon jets.

In the next section we will first recap the default parton-shower algorithm used in Herwig 7 focussing on recent changes we have made to improve the simulation of both quark and gluon jets. In Sect. 3 we will briefly review the important parameters in the cluster hadronisation model used in Herwig 7 and identify the issues which may lead to different treatments of quark and gluon jets. We will then discuss the tuning strategy used to produce the tunes presented in this paper. We present our results in Sect. 5 Footnote 3 followed by our conclusions.

2 Herwig 7 parton-shower algorithm

The default Herwig 7 parton-shower algorithm [13] is an improved angular-ordered parton shower. In this approach the momenta of the partons produced in the parton shower are decomposed in terms of the four-momentum of the parton initiating the jet, p (\(p^2=m^2\), the on-shell parton mass-squared), a light-like reference vector, n, in the direction of the colour partner of the parton initiating the jet and the momentum transverse to the direction of p and n. The four-momentum of any parton produced in the evolution of the jet can be decomposed as

$$\begin{aligned} q_i = \alpha _i p + \beta _i n + q_{\perp i}, \end{aligned}$$
(1)

where \(\alpha _i\) and \(\beta _i\) are coefficients and \(q_{\perp i}\) is the transverse four-momentum of the parton (\(q_{\perp i}\cdot p = q_{\perp i}\cdot n =0\)).

Fig. 1
figure 1

Branching of parton i to produce partons jk, which then undergo subsequent branching

If we consider the branching of a final-state parton i to two partons j and k, i.e. \(i\rightarrow j k\) as shown in Fig. 1, the branching is described by the evolution variable

$$\begin{aligned} \tilde{q}^2_i = \frac{q^2_i-m^2_i}{z_i(1-z_i)}, \end{aligned}$$
(2)

where \(q^2_i\) is the square of the virtual mass developed by the parton i in the branching, \(m_i\) is the physical mass of parton i, and \(z_i\) is the momentum fraction of the parton j defined such that

$$\begin{aligned} \alpha _j = z_i \alpha _i, \quad \alpha _k = (1-z_i)\alpha _i. \end{aligned}$$
(3)

The transverse momenta of the partons produced in the branching are

$$\begin{aligned} q_{\perp j} = z_i q_{\perp i} + k_{\perp i}\quad q_{\perp k} = (1-z_i)q_{\perp i} - k_{\perp i}, \end{aligned}$$
(4)

where \(k_{\perp i}\) is the transverse momentum generated in the branching. In this case the virtuality of parton i is

$$\begin{aligned} q_i^2 = \frac{p^2_{Ti}}{z(1-z)} + \frac{m_j^2}{z}+\frac{m_k^2}{1-z}, \end{aligned}$$
(5)

where \(p_T\) is the magnitude of the transverse momentum produced in the branching defined such that \(k_{\perp i}^2=-p^2_{Ti}\).

In this case the probability for a single branching to happen is

$$\begin{aligned} \mathrm{d} \mathcal {P} = \frac{\mathrm{d}\tilde{q}^2_i}{\tilde{q}^2_i}\frac{\alpha _S}{2\pi } \frac{\mathrm{d} \phi _i}{2\pi } \mathrm{d} z_i P_{i\rightarrow jk}(z,\tilde{q}), \end{aligned}$$
(6)

where \(P_{i\rightarrow jk}(z,\tilde{q})\) is the quasi-collinear splitting function, and \(\phi _i\) is the azimuthal angle of the transverse momentum \(k_{\perp i}\) generated in the splitting.

As the branching probability is singular for massless partons an infrared cut-off is required to regularise the singularity. In HERWIG 6 [14] and early versions of Herwig++ [15] the cut-off was implemented by giving the partons an infrared mass. However, while this remains an option in later versions of Herwig++ and Herwig 7 [1] the default cut-off is now on the minimum transverse momentum of the branching [16].

In order to resum the dominant subleadinglogarithms [17] the transverse momentum of the branching is used as the scale for the strong coupling constant. This also means that the strong coupling used in the parton shower is that defined in the Catani-Marchesini-Webber (CMW) scheme which includes the subleadingterms via a redefinition of QCD scale, \(\Lambda _\mathrm{QCD}\).

While this specifies both the branching probability and the kinematics of the partons for a single emission in the case of subsequent emission from the daughter partons j and/or k we must decide which properties of the originally generated kinematics to preserve once the masses of j and/or k in Eq. 5 are no longer the infrared cut-off masses but the virtualities generated by any subsequent emissions. While this choice is formally subleading it can have a large effect on physical observables.

In Herwig++ the transverse momentum of the branching was calculated using Eq. 5 and the infrared cut-offmasses when the emission was generated and then preserved during the subsequent evolution of the daughter partons. In Herwig 7.0 the default option was to instead preserve the virtuality of the branching and calculate the transverse momentum of the branching using the virtual masses the daughter partons develop due to subsequent emissions. This means that if the daughter partons develop large virtual masses the transverse momentum of the branching is reduced, and in some cases the branching has to be vetoed if there is no solution of Eq. 5. However, this choice inhibits further soft emission and significantly changes the evolution by vetoing emissions and leads phenomenological to incorrect evolution of observables, see for example Fig. 11 (red dash-dot line). We therefore consider a further choice in which, if it is possible to preserve the virtuality and still have a solution for \(p_T^2>0\), we do so however, if this is not possible instead of vetoing the emission we set \(p_T=0\) and allow the virtuality to increase.

The most important parameters which affect thebehaviour of the parton shower and which we will tune in this paper are:

  • the choice of whether to preserve \(p_T\) or \(q^2\) during the subsequent evolution;

  • the value of the strong coupling constant AlphaMZ,taken to be \(\alpha ^\mathrm{CMW}_S(M_Z)\), the value of the coupling constant in the CMW scheme at the mass of the Z boson, \(M_Z\);

  • the cut-off in the parton shower.Footnote 4 For a cut-off in \(p_T\) this is the minimum transverse momentum allowed for the branchings in the shower, \(p_T^{\min }\). For a virtuality cut-off we parameterize the threshold for different flavours by

    $$\begin{aligned} Q_{g}=\max \left( \frac{\delta -am_{q}}{b},c\right) , \end{aligned}$$
    (7)

    where a and b are parameters chosen to give a threshold which is slightly reduced for heavier quarks. The parameter \(c=0.3\) GeV is chosen to prevent the cut-off becoming too small, we also keep the default value of \(b=2.3\). Only the parameters \(\delta \) (cutoffKinScale) and a (aParameter) are tuned to the data.

There is one other major feature of the angular-ordered parton shower which we need to consider. The angular ordering of the parton shower, which is used to implement the phenomenon of colour coherence, leads to regions of phase space in which there is no gluon emission. Consider for example the process \(e^+e^-\rightarrow q \bar{q} g\). In this case there is a dead-zone which is not filled by one emission from the parton shower, as shown in Fig. 2. Given this deficit of hard, wide-angle emission it is necessary to combine the parton-shower with the fixed-order calculation of \(e^+e^-\rightarrow q\bar{q} g\). There are now a range of techniques which can achieve this including both the next-to-leading order normalisation of the total cross section or including the fixed-order results for multiple emissions. However, for our purposes it is sufficient to consider the simplest matrix-element correction approach where the dead-zone is filled using the leading-order matrix element for \(e^+e^-\rightarrow q\bar{q} g\), as shown in Fig. 3, together with the reweighting of emission probability, Eq. 6, to the exact leading-order result, for any emission which could have the highest transverse momentum in the parton shower.Footnote 5

Fig. 2
figure 2

Dalitz plot for \(e^+e^-\rightarrow q \bar{q} g\) showing the region of phase space filled by one emission from the quark and antiquark in the angular-ordered parton shower. The line shows the limits for the parton-shower emission. \(x_i = 2 E_i/Q\) where \(E_i\) is the energy of parton i and Q is the centre-of-mass energy of the collision

Fig. 3
figure 3

Dalitz plot for \(e^+e^-\rightarrow q \bar{q} g\) showing the emission from the hard matrix-element correction into the dead-zone which is not populated by parton-shower emission. The line shows the limits for the parton-shower emission. \(x_i = 2 E_i/Q\) where \(E_i\) is the energy of parton i and Q is the centre-of-mass energy of the collision

Fig. 4
figure 4

Dalitz plot for \(e^+e^-\rightarrow q \bar{q}\) showing the region of phase space filled after multiple emission from the quark and antiquark in the angular-ordered parton shower. The transverse momentum of the branchings was preserved in the case of multiple emission. The line shows the limits for the parton-shower emission for a single emission. \(x_i = 2 E_i/Q\) where \(E_i\) is the energy of parton i and Q is the centre-of-mass energy of the collision

Fig. 5
figure 5

Dalitz plot for \(e^+e^-\rightarrow q \bar{q}\) showing the region of phase space filled after multiple emission from the quark and antiquark in the angular-ordered parton shower. The virtuality of the branchings was preserved in the case of multiple emission. The line shows the limits for the parton-shower emission for a single emission. \(x_i = 2 E_i/Q\) where \(E_i\) is the energy of parton i and Q is the centre-of-mass energy of the collision

The choice of whether to preserve the transverse momentum or virtuality of the branching affects the phase-space region which is filled by the shower in the case of multiple emission. In this case we cluster the partons using the Durham jet algorithm [19], using the p-scheme as implemented in FastJet [20], keeping track of the partons emitted by the quark and antiquark and then take the hardest additional jet to be the gluon.Footnote 6 The resulting Dalitz plots of \(e^+e^-\rightarrow q \bar{q}\) show that while the choice to preserve the transverse momentum of the branching leads to a significant number of events in the dead-zone, Fig. 4, if the virtuality of the branching is preserved, Fig. 5, there is little emission outside the original angular-ordered region.

3 Hadronisation and colour reconnection

All the Herwig family of event generator generators use the cluster hadronisation model [21]. This model is based on the phenomenon of colour pre-confinement, i.e. if we non-perturbatively split the gluons left at the end of the parton shower into quark–antiquark pairs and cluster quarks and antiquarks into colour-singlet clusters the mass spectrum of these clusters is peaked at masses close to the cut-off in the parton shower, falls rapidly as the cluster mass increases, and is universal, i.e. the mass distribution of these clusters is independent of the hard scattering process and its centre-of-mass energy. The cluster model assumes that these clusters are a superposition of heavy hadronic states and uses a simple phase-space model for their decay into two hadrons. The main parameters of the model are therefore:

  • the non-perturbative gluon mass, which is not very sensitive and which we do not tune;

  • the parameters which control the probability of producing baryons and strange quarks during cluster decay;

  • the parameter which controls the Gaussian smearing of the direction of the hadrons produced which contain a parton from the perturbative evolution about the direction of that parton, with separate values for light, charm and bottom quarks.

There are, however, a small fraction of large mass clusters for which the two hadron decay ansatz is not reasonable and these must first be fissioned into lighter clusters. While only a small fraction of clusters undergo fission due to the larger masses of these clusters they produce a significant fraction of the hadrons.

A cluster is split into two clusters if the mass, M, is such that

$$\begin{aligned} M^\mathbf{Cl_{pow}} \ge \mathbf{Cl_{max}}^\mathbf{Cl_{pow}}+(m_1+m_2)^\mathbf{Cl_{pow}}, \end{aligned}$$
(8)

where \(\mathbf{Cl_{max}}\) and \(\mathbf{Cl_{pow}}\) are parameters of the model, and \(m_{1,2}\) are the masses of the constituent partons of the cluster.

Fig. 6
figure 6

Example of colour-singlet gluon pair production followed by the branching of all the colours via \(g\rightarrow gg\). The Feynman diagram is shown in a whereas the colour flows, including the non-perturbative splitting of the gluons into quark–antiquark pairs, are shown in b

For clusters that need to be split, a \(q\bar{q}\) pair is selected to be popped from the vacuum. The mass distribution of the new clusters is given by

$$\begin{aligned} M_1= & {} m_1+(M-m_1-m_q)\mathcal {R}_1^{1/P_\mathrm{split}},\end{aligned}$$
(9a)
$$\begin{aligned} M_2= & {} m_2+(M-m_2-m_q)\mathcal {R}_2^{1/P_\mathrm{split}}, \end{aligned}$$
(9b)

where \(m_q\) is the mass of the parton popped from the vacuum, \(M_{1,2}\) are the masses of the clusters formed by the splitting and \(\mathcal {R}_{1,2}\) are pseudo-random numbers uniformly distributed between 0 and 1. The distribution of the masses of the clusters is controlled by the parameter \(P_{\mathrm{split}}\).

In order to improve the description of charm and bottom hadron production these parameters for cluster fission all depend on the flavour of the partons in the cluster, so that there are separate parameters for light, charm and bottom quarks.

In practice there is always a small fraction of clusters that are too light to decay into two hadrons. Before Herwig 7.1 these clusters were decayed to the lightest hadron, with the appropriate flavours. However, in some cases, for example for clusters containing a charm or bottom quark–antiquark pair, or a bottom quark and a light antiquark, there can be a number of hadrons of the appropriate flavour below the threshold. In these cases the lightest meson with the appropriate flavours is the pseudoscalar \(\phantom {1}^1S_0\) state and the vector \(\phantom {1}^3S_1\) state is also below the thresholdFootnote 7 which leads to a lower production rate for the vector state with respect to the pseudoscalar state than expected. For the mesons composed of a bottom quark and a light quark the rate is significantly less than that expected from the counting of spin states, or indeed observed experimentally [22,23,24,25]. For charmonium and bottomonium states as this mechanism is the only way the vector states can be produced via hadronisation it leads to a complete absence of direct \(J/\psi \) and \(\Upsilon \) production. In Herwig 7.1 we therefore include the possibility that instead of just producing the lightest state all states below the threshold are produced with a probability proportional to \(2S+1\), where S is the spin of the particle.

In order to improve the behaviour at the threshold for charm and bottom clusters, the option exists of allowing clusters above the threshold mass, \(M_\mathrm{threshold}\), for the production of two hadrons to decay into a single hadron such that a single hadron can be formed for masses

$$\begin{aligned} M<M_\mathrm{limit} = (1+\mathbf{SingleHadronLimit})M_\mathrm{threshold}, \end{aligned}$$
(10)

where SingleHadronLimit is a free parameter of the model. The probability of such a single-meson cluster decay is assumed to decrease linearly for \(M_\mathrm{threshold}<M<M_\mathrm{limit}\) and there are separate parameters for charm and bottom clusters.

In order to explain the rising trend of \(\langle p_t\rangle \mathrm {\; vs\; } N_{ch}\) (average transverse momentum as a function of the number of charged particles in the event) observed already by UA1 [26] and describe Underlying Event [27,28,29,30] and the Minimum Bias data [31,32,33,34], the hadronisation model is supplemented with a model of colour reconnections (CR) [35]. The default version of the model implemented in Herwig 7.0 is not very sophisticated. The aim of the CR model is to reduce the colour length, defined as \(\lambda \equiv \sum _{i=1}^{N_{cl}}m_i^2\) ,Footnote 8 where \(N_{cl}\) is the number of clusters in an event and \(m_i\) is the invariant mass of cluster i. The colour reconnection of the clusters leading to a reduction of \(\lambda \) is accepted with a given probability which is a parameter of the model. Although the default model is quite simple it should be stressed that its results resemble the more sophisticated statistical colour reconection model [35] which implements the minimisation of \(\lambda \) as Metropolis-like algorithm and requires a quick “cooling“ of the random walk.

In this model the only possible reconnections which are not allowed are connecting the quark and antiquark produced in the non-perturbative splitting of the gluon. It is therefore possible that the colour lines of a gluon produced at any other stage of the shower can be reconnected leading to the production of a colour-singlet object. While this is physically possible we would expect that it occurs at a rate which is suppressed in the number of colours, \(N_C\), as \(\sim \frac{1}{N_C^2}=\frac{1}{9}\), not the much higher reconnection rate \(\sim 2/3\) Footnote 9 which is necessary to describe the underlying event data. This can lead to the production of a colour-singlet gluon jet at a much higher rate than expected. This is particularly problematic in the theoretically clean, but experimentally inaccessible, colour-singlet gluon pair production processes often used to study gluon jets [9].

Consider, for example, the simple process of colour-singlet gluon pair production followed by the branching of all the gluons via \(g\rightarrow gg\), shown in Fig. 6a. After the non-perturbative splitting of the gluons into quark–antiquark pairs, as shown in Fig. 6b, without colour reconnection the quarks and antiquarks will be formed into colour-singlet clusters as \((q_1,\bar{q}_3)\), \((q_3,\bar{q}_4)\), \((q_4,\bar{q}_2)\) and \((q_2,\bar{q}_1)\). Given the configuration it is likely that the clusters containing partons from the parton shower of each of the original gluons, i.e. \((q_1,\bar{q}_3)\) and \((q_4,\bar{q}_2)\), will have large masses and the rearrangement to give the clusters \((q_1,\bar{q}_2)\) and \((q_4,\bar{q}_3)\) will be kinematically favoured, although it means the original gluons will effectively become colour singlets rather than octets.

In Herwig 7.1 we have therefore included the possibility to forbid the colour reconnection model making any reconnection which would lead to a gluon produced in any stage of the parton-shower evolution becoming a colour-singlet after hadronisation. We will investigate the effect of this change on the simulation of quark and gluon jets.

4 Tuning

The Rivet [38] program was used to analyse the simulated events and compare the results with the experimental measurements. The Professor program [39] was then used to interpolate the shower response and tune the parameters by minimising the chi-squared.Footnote 10

In general we use a heuristic chi-squared function

$$\begin{aligned} \chi ^{\prime 2}({\varvec{p}}) = \sum _{\mathcal {O}} w_{\mathcal {O}} \sum _{b\exists \mathcal {O}} \frac{\left( f_b(\varvec{p})-\mathcal {R}_b\right) ^2}{\Delta _b^2} \end{aligned}$$
(11)

where \(\varvec{p}\) is the set of parameters being tuned, \(\mathcal {O}\) are the observables used each with weight \(w_{\mathcal {O}}\), b are the different bins in each observable distribution with associated experimental measurement \(\mathcal {R}_b\) , error \(\Delta _b\) and Monte Carlo prediction \(f_b(\varvec{p})\). Weighting of those observables for which a good description of the experimental result is important is used in most cases. The parameterisation of the event generator response, \(f(\varvec{p})\), is used to minimize \(\chi ^{\prime 2}\) and find the optimum parameter values. We take \(w_{\mathcal {O}}=1\) in most cases except for the particle multiplicities where we use \(w_{\mathcal {O}}=10\) and total charged particle multiplicities where we use \(w_{\mathcal {O}}=50\). This ensures that particle multiplicities influence the result of the fit and are required due to the much higher quantity of event shape and spectrum data used in the tuning. Given the aim of this paper is to improve the description of gluon jets this data was also included with \(w_{\mathcal {O}}=10\) in order to avoid the fit being dominated by the large quantity of data sensitive to quark jets. In addition as we do not except a Monte Carlo event generator to give a perfect description of all the data and in order to avoid the fit being dominated by a few observables with very small experimental errors we use

$$\begin{aligned} \Delta ^\mathrm{eff}_b = \max (0.05\times \mathcal {R}_b,\Delta _b), \end{aligned}$$
(12)

rather than the true experimental error, \(\Delta _b\), in the fit.

Table 1 The Monte Carlo parameters obtained for different choices of the cut-off option, the preserved quantity in the shower and weight of the charged particle multiplicity data

The standard procedure which was adopted to tune the shower and hadronisation parameters of the Herwig++ and Herwig 7 event generators to data is

  • first the shower and those hadronisation parameterswhich are primarily sensitive to light quark-initiated processes are tuned to LEP1 and SLD measurements of event shapes, the average charged multiplicity andcharged multiplicity distribution, and identified particle spectra and rates which only involve light quark mesons and baryons;

  • the hadronisation parameters for bottom quarks aretuned to the bottom quark fragmentation function measured by LEP1 and SLD together with LEP1 and SLD measurements of event shapes and identified particlespectra from bottom events;

  • the hadronisation parameters involving charm quarks are then tuned to identified particle spectra, from both the B-factories and the LEP1, and LEP1 and SLD measurements of event shapes, and identified particle spectra from charm events;

  • the light quark parameters are then retuned using the new values of the bottom and charm parameters together with different weights for the charged multiplicity distributions in \(e^+e^-\) collisions at energies between 12 GeV and 209 GeV due to the difficulty in fitting the charged multiplicity.

Only \(e^+e^-\) annihilation data from the continuum region near the \(\Upsilon (4s)\) meson, for charm meson spectra, and at the Z-pole from LEP1 and SLD were used in the tuning.

Table 2 The values of \(\chi ^2\) per degree of freedom obtained in the fit for different choices of the cut-off option, the preserved quantity in the shower and weight of the charged particle multiplicity data. The values are \(\chi ^{\prime 2}\) as described in the text for the tuning observables, normalised to the sum of the weights for the different bins, and the true \(\chi ^2\) using the experimental error for the charged particle multiplicities. The number of degrees of freedom for each set of observables is given together with the sum including weights in brackets, where this is different
Fig. 7
figure 7

Multiplicity distribution of charged particles in gluons jets for two different gluon energies compared with data from OPAL [11]

In this paper we have extended this approach in order to better constrain the energy evolution to include data from a wider range of centre-of-mass energies both below the Z-pole, from the JADE and TASSO experiments, and above the Z-pole, from LEP2.

In order to tune the shower and light quark hadronisation parameters we used data on jet rates and event shapes for centre-of-mass energies between 14 and 44 GeV [40,41,42], at LEP1 and SLD [42,43,44,45,46] and LEP2 [42, 45, 46], particle multiplicities [43, 44] and spectra [43, 44, 47,48,49,50,51,52,53,54,55,56,57] at LEP 1, identified particle spectra below the \(\Upsilon (4S)\) from Babar [58], the charged particle multiplicity [59, 60] and particle spectra [59, 61, 62] in light quark events at LEP1 and SLD, the charged particle multiplicity in light quark events atLEP2 [63, 64], the charged particle multiplicity distribution at LEP1 [65], and hadron multiplicities at the Z-pole [66]. We also implemented in Rivet and made use of the data on the properties of gluon jets [11, 12] for the first time.

The hadronisation parameters for charm quarks weretuned using the charged multiplicity in charm events atSLD [60] and LEP2 [63, 64], the light hadron spectra in charm events at LEP1 and SLD [59, 61, 62], the multiplicities of charm hadrons at the Z-pole [43, 66], and charm hadron spectra below the \(\Upsilon (4S)\) [67, 68] and at LEP1 [69].

The hadronisation parameters for bottom quarks were tuned using the charged multiplicity in bottom events atSLD [60] and LEP2 [63, 64], the light hadron spectra in bottom events at LEP1 and SLD [59, 61, 62], the multiplicities of charm and bottom hadrons at the Z-pole [43, 66], charm hadron spectra at LEP1 [69] and the bottom fragmentation function measured at LEP1 and SLD [70,71,72].

In order to tune the evolution of the total charged particle multiplicity in \(e^+e^-\) collisions as a function of energy the results of Refs. [43, 46, 60, 63, 64, 73,74,75,76,77,78,79] spanning energies from 12 to 209 GeV were used.

In order to study the various effects we have discussed we have produced tunes for the shower and hadronisation parameters in the case that either the transverse momentum or virtuality in the shower is preserved. In each case we first tuned the shower and light quark parameters without the data on charged particle multiplicities as centre-of-mass energies below the mass of the \(Z^0\) boson. In the final stage of the process where we retune these parameters three tunes were produced for each choice of cut-off and preserved quantity, one (labelled A) without the low-energy charged multiplicity data, one (labelled B) where all thecharged multiplicity data was included with in the tune with weight \(w_{\mathcal {O}}=100\) and a final tune (labelled C) where this data had weight \(w_{\mathcal {O}}=1000\).

Unfortunately due to the CPU time required it is impossible to include the ATLAS data [10] directly in the tune, therefore we compare the results of the different tunes to this data.

Fig. 8
figure 8

Difference between the charged multiplicity in bottom and light quark events in \(e^+e^-\) collisions as a function of centre-of-mass energy. The data is from [60, 62,63,64, 74, 80,81,82,83,84,85,86,87,88,89] as compiled in [90]

Fig. 9
figure 9

The evolution of the number of charged particles in \(e^+e^-\rightarrow \mathrm{hadrons}\) as a function of the centre-of-mass energy

Fig. 10
figure 10

The thrust at the Z-pole compared with data from the DELPHI [43] experiment

Fig. 11
figure 11

The evolution of the number of charged particles in gluon jets as a function of twice the energy of the gluon jet

Fig. 12
figure 12

The average number of charged particles in jets as a function of the jet transverse momentum compared with data from the ATLAS experiment [10]

Fig. 13
figure 13

The difference between the average number of particles in central and forward jets compared with data from the ATLAS experiment [10]

5 Results

We have produced 12 tunes for different choices of the cut-off variable in the shower, the choice of which quantity to preserve in the parton shower, and different weightings of the charged particle multiplicities. The parameters obtained in the fits are given in Table 1 while the \(\chi ^2\) values are given in Table 2.

The effects of changing the colour reconnection model can be seen in Fig. 7. In the results of Herwig++ 2.7.1 or Herwig 7.0 there is an unphysical tendency of the gluon jets to contain an even number of charged particles due to the production of colour-singlet gluons by the reconnection model, this feature is not present in any of the new tunes which provide a much better description of the distribution of charged particles in the gluon jets, see also the appendix.

The choice of which tune and choice of cut-off variable and preserved quantity has to be a balance between how well we wish to describe the various different data sets, as unfortunately no choice provides a good description of all the data sets.

If we first consider the choice of cut-off it is clear that using a virtual mass provides a larger \(\chi ^{\prime 2}\) for all sets of observables used in the tuning apart from those sensitive to bottom quarks. In addition it displays an unphysical energy dependence in the difference in charged particle multiplicities between bottom (or charm) quark and light quark events, as shown in Fig. 8, where the results, which use a cut-off on the virtual mass, Herwig++ 2.7.1 and the new tune\(q^2\)\(q^2\)–B, show a strong dependence on the centre-of-mass energy while those which use a \(p_\perp \) cut-off, Herwig 7.0 and the new tune \(p_\perp \)\(q^2\)-B, are relatively independent of energy. We therefore prefer a cut-off on the minimum transverse momentum of the branching.

In order to obtain a reasonable evolution of the number of charged particles with centre-of-mass energy in \(e^+e^-\) collisions, see Fig. 9, without ruining the description of particle spectra and event shape observables we choose to use the B tune as our default.

The choice of whether to preserve the \(p_\perp \) or \(q^2\) of the branching is more complicated. While the data on light quark jets, in particular event shapes measured at LEP (for example the thrust Fig. 10), favour preserving \(q^2\) the data on the charged particle multiplicity in gluon jets at LEP Fig. 11, and in jets at the LHC Figs. 12 and 13 favours preserving the \(p_\perp \) of the branching.

Our preferred choice, in particular in the presence of higher-order matching, is to preserve the \(q^2\) of the branching in order to ensure that the parton shower does not overpopulate the dead-zone. This also ensures a more reasonable value of strong coupling, \(\alpha ^\mathrm{CMW}_S(M_Z)=0.126\), which gives \(\alpha ^\mathrm{\overline{MS}}_S(M_Z)=0.118\). However, given the better description of gluon jets it is reasonable to also consider the alternative of preserving the \(p_\perp \); see for example Fig. 16 in the appendix.

6 Conclusions

We have performed a tuning the Herwig 7 event generator using data on gluon jets from LEP for the first time. Together with changes to the non-perturbative modelling this gives a significantly better description of gluon jets, in particular their charge particle multiplicity. It is, however, impossible to get a good description of the LEP particle spectra and the charged particle multiplicities, particularly in gluon jets, at the same time. We therefore choose the tune \(p_\perp \)\(q^2\)–B as the default for Herwig 7.1. However, for jets at the LHC the tune \(p_\perp \)\(p_\perp \)–B gives a better description of jet properties.

While the tunes presented in this paper are an improvement on their predecessors there is a tension between the data on charged particle multiplicities, for both quark- and gluon-initiated jets, and the data on event shapes and particle spectra from LEP. The cluster hadronisation model also continues to have problems describing final states in events with bottom quarks. Any further improvement in the description of this data will require improvements to the non-perturbative modelling.