1 Introduction

The production of quarkonia, flavourless mesons comprised of a heavy quark and its corresponding antiquark bound by the strong interaction, is still not fully understood in the context of hadron–hadron collisions, despite decades of study [1,2,3,4,5,6,7,8,9,10,11,12,13,14,15,16,17,18,19,20,21]. For a complete review, see Ref. [22]. The reason for this is that our theory of the strong interaction, quantum chromodynamics (QCD), is best understood at large energy scales, whereas the dynamics of quarkonia production spans both large and small scales.

One of the first models of quarkonium production to make quantitative predictions was the colour-singlet model (CSM) [23,24,25,26,27], which factorises these perturbative and non-perturbative regimes. In the CSM, the production of the heavy quark/antiquark pair (\(Q \overline{Q} \)) is calculated using perturbative QCD, and these quarks are then bound non-perturbatively. In this second step, the constituents are assumed to be at rest in the quarkonium frame, and the colour and spin state of the \(Q \overline{Q} \) pair does not change during the binding. Comparisons of CSM predictions to Tevatron data [7, 8] demonstrated that the CSM, when calculated at leading fixed order, significantly underestimated the prompt production of \(\psi (1S)\) mesons, a quarkonium state.Footnote 1

The initial fixed-order CSM predictions [27] for the Tevatron data lacked two important sources of \(\psi (1S)\) production: one, partonic fragmentation; and, two, feed-down from heavier quarkonium states such as the \(\chi _{c}\) mesons. This first contribution can be significant in hadronic collisions because the rate for inclusive charm production is much larger than that calculated only from fixed-order processes such as \(q \overline{q} \rightarrow c \overline{c} \) and \(g g \rightarrow c \overline{c} \). Fragmentation functions [28,29,30,31,32] were derived for , considering the simpler \(Z \rightarrow c \overline{c} \) process, and convolved with Tevatron jet cross-sections. This additional fragmentation contribution brought the CSM prediction into better agreement with data at high \(p_\text {T}\) of the \(\psi (1S)\), but the overall agreement with data was still poor. For feed-down contributions from the \(\chi _{c}\), predictions were made at fixed-order with the CSM [33,34,35]. These calculations contain a kinematic cut-off to regulate an infrared divergence, typically taken as the mass of the bound state. The quantitative CSM results are very sensitive to the exact choice of this cut-off. Even after accounting for feed-down, the CSM prediction still significantly disagrees with the Tevatron data.

With the realization that the CSM was inadequate to describe the data, non-relativistic QCD (NRQCD) theory [36,37,38,39] was developed. While NRQCD is similar to the CSM in that it factorises into a perturbative and non-perturbative component, it does not require the initial partonic state to have the same colour or quantum numbers as the final state, and allows, for example, the production of colour-octet states which then evolve into physical colour-singlet states.

Production rates for quarkonia at both the Tevatron and LHC are, in general, well described by NRQCD. However, NRQCD predicts \(\psi (1S)\) polarisation [40,41,42,43,44,45] that is not observed in LHC data [16,17,18]. Additionally, fixed-order NRQCD predicts relatively isolated \(\psi (1S)\) production, while LHC measurements demonstrate that \(\psi (1S)\) mesons are oftentimes produced in association with nearby particles. Recent developments in calculating quarkonium fragmentation functions promise to resolve these issues [46, 47]. However, a comprehensive approach for calculating these fragmentation functions is lacking. Here, we introduce an alternative approach of using the Pythia 48, [49,50,51] parton shower to calculate the production of quarkonia from fragmentation. The idea is inspired by an earlier study using the Ariadne program [52,53,54], but is implemented here with a complete and extensible set of quarkonia processes in the full Pythia 8 framework.

The remainder of this paper is organised as follows. In Sect. 2 we briefly review NRQCD, while in Sect. 3 we describe parton shower formalism. We then include NRQCD quarkonium fragmentation processes into the Pythia 8 parton-shower in Sect. 4. In Sect. 5, we present quantitative predictions using our approach, and finally, we provide conclusions in Sect. 6.

2 Non-relativistic quantum chromodynamics

Non-relativistic QCD is an effective field theory where a perturbative expansion is performed not only in the QCD coupling \(\alpha _s\), but also in the relative velocity between the two heavy quarks of the bound state. Including this relative velocity scale results in the expansion of the physical quarkonium state into non-physical Fock states. For a heavy quark state \(Q \overline{Q} \), the general Fock-state expansion is,

(1)

where v is the relative velocity between the constituent quark/antiquark, (1) in the superscript notates a colour-singlet state, and (8) in the superscript notates a colour-octet state. For a \(\psi (1S)\), this can be explicitly written as,

(2)

and for the \(\chi _{c}\) states as,

(3)

where J corresponds to 0, 1, and 2.

The heavy quarkonium cross-sections can then be factorised into short- and long-distance scales, which for initial partons \(a_1\) and \(a_2\) can be written as

(4)

where the are perturbatively calculated partonic short-distance matrix elements (SDME) and are empirically determined long-distance matrix elements (LDME). The sums are over the states of the Fock expansion of Eq. (1), where the indices correspond to the spin and colour quantum numbers, and an SDME and LDME must be provided for each term. For hadronic beams \(h_1\) and \(h_2\), the partonic cross-section is then convolved with the parton density functions (PDF) for the hadrons

(5)

where the first two sums are over all possible partonic combinations \(a_i\) and \(a_j\). The PDFs for the two hadrons are given by \(f_1\) and \(f_2\), evaluated at the partonic energy scale \(\mu \) with corresponding momentum fractions \(x_1\) and \(x_2\).

For production of the \(\psi (1S)\) at the Large Hadron Collider (LHC), the fixed-order NRQCD partonic cross-section at leading order in \(\alpha _s\) is given by,

(6)

where the expansion in the relative quark/antiquark velocity is up to \(v^2\). Feed-down from \(\chi _{c}\) states contribute significantly to \(\psi (1S)\) production, with the partonic cross-sections given by

(7)

at leading order in \(\alpha _s\) and expanded up to v. In Pythia 8, a complete set of these \(2 \rightarrow 2\) unpolarised partonic cross-sections is available at fixed order in \(\alpha _s\), including for states, where \(b \overline{b} \) configurations and arbitrary heavy radial excitations are also included, e.g. the \(\psi (2S)\) or \(\Upsilon (1S)\), but may need to be configured by the user. All of these partonic cross-sections are t-channel exchanges and diverge for low \(p_\text {T}\) of the quarkonium. These divergences are regularised either with a hard \(p_\text {T}\) cut-off or, in the case of multiple parton interactions in Pythia 8, a smooth damping factor.

The LDMEs of the \({\mathcal {O}}(1)\) leading colour-singlet Fock state, i.e. the Fock state with the same quantum numbers as the physical state, can be determined from the wave-function of the physical state at the origin. The LDMEs for the additional Fock states must be determined from fits to data, although velocity scaling rules can be used to relate these LDMEs. Typically, these fits are made to the differential quarkonium production cross-section with respect to the \(p_\text {T}\) of the quarkonium, as the colour-singlet contributions dominate at low \(p_\text {T}\), while the colour-octet contributions dominate at high \(p_\text {T}\). Using LDME fits from CDF, the differential cross-section in \(p_\text {T}\) of the \(\psi (1S)\) is described well at both the Tevatron [37, 55] and the LHC [56,57,58]. However, these NRQCD predictions are dominated by colour-octet contributions at high \(p_\text {T}\), which are typically transversely polarised, unlike colour-singlet contributions which are typically longitudinally polarised. This is in direct conflict with experimental results, which are primarily unpolarised across all \(p_\text {T}\) of the \(\psi (1S)\) [15,16,17,18, 59,60,61,62].

The relative isolation of quarkonia can also be measured experimentally, where a jet clustering algorithm such as anti-\(k_\text {T}\) can be used to group the quarkonium with surrounding activity, and observables such as \({p_\text {T}} (\text {quarkonium})/{p_\text {T}} (\text {jet})\) can be constructed. For fixed-order NRQCD predictions, quarkonia are expected to be relatively isolated for colour-singlet contributions. Even for fixed-order colour-octet contributions with a parton shower applied similar to that of Pythia 8, the quarkonia is still expected to be primarily isolated. However, measurements from both LHCb [19] and CMS [20, 21] do not show this isolation, and instead match distributions that are more consistent with production of the quarkonia from fragmentation rather than fixed-order calculations. In Refs. [47, 63], fragmenting jet functions (FJF) calculated via resummation, and similar to the fragmentation probabilities of Eq. (8) below, were used to model the LHCb data. However, these calculations are incomplete and do not include all sources of \(\psi (1S)\) production such as feed-down.

Until this work, Pythia 8 did not include NRQCD-based fragmentation contributions to quarkonium production. This contribution can be written, formally, as,

(8)

which is analogous to Eq. (5), but now uses the fixed-order partonic cross-section \(\,\text {d}{\hat{\sigma }}(a_i a_j \rightarrow a_k X)\) for parton \(a_i\) and \(a_j\) scattering to produce parton \(a_k\) inclusively, multiplied by the fragmentation probability , which can be for either the sum of all Fock states or a specific Fock state. The precise definition of z varies within the literature, but generally all definitions reduce to the relative energy fraction carried by the emitted quarkonium in the high-energy limit.

While this formalism can calculate quarkonium production from fragmentation, it cannot produce realistic event topologies that emulate events observed in experiments. Additionally, this type of calculation typically does not include competition between quarkonium states, i.e. the fragmentation probabilities do not simply factorize. In this work, we provide an alternative method that overcomes both of these shortcomings, by introducing quarkonium splittings into the Pythia 8 parton shower. This technique is formally equivalent to analytic resummation for inclusive observables, but provides more comprehensive results via a flexible Monte Carlo generator interface. To avoid double counting, this NRQCD parton shower should not be used together with fixed-order quarkonium processes without considering appropriate matching.Footnote 2

3 Parton showers

A complete description of the Pythia 8 parton shower is available in Ref. [64]. Here, a brief review is given, with a focus on the algorithmic procedure. Parton showers evolve partons produced from fixed-order calculations at high virtuality \(\mu _\text {max} \) to low virtuality \(\mu _\text {min} \) through a series of probabilistic parton emissions. While various parton showers differ in the choice of the evolution variable \(\mu \), it is illustrative to consider it as an energy-like scale. The probability for parton \(a_i\) to not radiate, i.e. \(a_i \rightarrow a_j a_k\), between virtualities \(\mu _1\) and \(\mu _2\) is given by,

$$\begin{aligned} \Delta _{i} (\mu _1, \mu _2)= & {} \exp \left( -\int _{\mu _2}^{\mu _1} \text {d}\mu ^2\, \frac{1}{\mu ^2} \frac{\alpha _s (\mu )}{2\pi } \int _{z_\text {min}}^{z_\text {max}} \text {d}z\, \right. \nonumber \\{} & {} \quad \times \left. \sum _{j,k} P_{i \rightarrow j k} (z, \mu ) \right) \,, \end{aligned}$$
(9)

where z is defined as \(E_j/E_i\), the fractional energy parton \(a_j\) takes from \(a_i\), and \(P_{i \rightarrow j k}\) is the splitting kernel for the process \(a_i \rightarrow a_j a_k\). A summation is performed over all available splitting kernels, e.g. for a gluon both the splittings \(g \rightarrow g g \) and \(g \rightarrow q \overline{q} \) would be summed, where the second splitting is over all kinematically available quark flavours.

Table 1 Summary of the implemented quarkonium splitting kernels, with their corresponding reference from the literature when relevant. The numbers in parentheses indicate the pertinent equations from that reference. These spitting kernels can be used for multiple physical states, e.g. the splitting is used for the \(\eta _c\), \({\psi }\), and \(\chi _{c}\) states

The parton shower then proceeds through an iterative algorithm. For a given parton dipole with virtuality \(\mu _1\), a no-emission probability is uniformly sampled, \(\Delta _{i} (\mu _1,\mu _2) \in {\mathcal {U}}(0, 1)\), yielding an equation that can be solved for the emission scale \(\mu _2\). In practice, the integrands in Eq. (9) are complicated, and so the emission scale is calculated using an overestimate and rejection method called the veto algorithm. In this case, the summed integrands \(P_{i \rightarrow j k}\) of Eq. (9) are replaced by \(O_i\),

$$\begin{aligned} O_{i} = \sum _{j,k} O_{i \rightarrow j k} \,, \text { such that } O_{i \rightarrow j k} \ge P_{i \rightarrow j k} (z, \mu ) \,, \end{aligned}$$
(10)

is valid for all \(\mu \) in the range \(\mu _2\) to \(\mu _1\), and all z in the range \(z_\text {min} \) to \(z_\text {max} \). If \(\alpha _s\) is evolved at first order, the new virtuality \(\mu _2\) can be calculated as

$$\begin{aligned} \mu ^2_2 = \Lambda ^2 \left( \frac{\mu ^2_1}{\Lambda ^2}\right) ^ {\Delta _{i} ^{b_0/O_{i}}} \,, \end{aligned}$$
(11)

where \(b_0\) is the one-loop \(\beta \)-function coefficient [65], \((33 - 2n_f)/6\), and \(\Lambda \) is the scale from which \(\alpha _s\) is being evolved. If, instead, \(\alpha _s\) is fixed at one value, the new virtuality becomes,

$$\begin{aligned} \mu ^2_2 = \mu ^2_1 \Delta _{i} ^{2\pi /(\alpha _s O_{i})} \,. \end{aligned}$$
(12)

After the virtuality \(\mu _2\) is determined according to Eq. (11) or Eq. (12), one of the quarkonium splittings \(a_i \rightarrow a_j a_k\) is chosen randomly with probability \(O_{i \rightarrow jk}/O_{i} \). A z is then randomly sampledFootnote 3 between \(z_\text {min} \) and \(z_\text {max} \). The splitting is accepted if,

$$\begin{aligned} \frac{P_{i \rightarrow j k} (z, \mu _2)}{O_{i \rightarrow j k}} \ge {\mathcal {U}}(0, 1) \end{aligned}$$
(13)

where the right-hand side is a uniformly sampled random variable. If the splitting is rejected, then: the algorithm returns to Eq. (11) or Eq. (12); a new \(\mu _1\) is set as the old \(\mu _2\), \(\mu _2 \rightarrow \mu _1\), and a new \(\mu _2\) is selected; and the procedure continues as before. The parton shower continues until \(\mu _2\) drops below the minimum virtuality, \(\mu _\text {min} \). The virtuality variable \(\mu \) in Pythia 64 is defined as,

$$\begin{aligned} \mu ^2 \equiv {p_\text {T,evol}^{2}} = (s - m_r^2)\, z\, (1 - z) \end{aligned}$$
(14)

where \(\sqrt{s}\) is the centre-of-mass energy for the dipole, \(m_r\) is the mass of the radiating parton in the dipole, and z is a light-cone momentum fraction carried by the emitted parton.Footnote 4

4 Implementation

A full set of splitting kernels to complement the \(\psi (1S)\) and \(\chi _{c}\) fixed-order production implemented in Pythia 8, described in Sect. 2, were derived from fragmentation functions in the literature similar to those used in Eq. (8). The implemented splitting kernels, as well as their references, are given in Table 1. The full forms for all the implemented splitting kernels can be found in the references. We note that in comparison to Eq. (6) a number of splittings are missing, namely the gluon and heavy-quark initiated splittings to colour-octet states , , , and . These gluon initiated splittings are omitted as they only enter at \(\alpha _s ^2\), while the splitting enters at order \(\alpha _s \). Similarly, the heavy-quark initiated splittings are omitted since they are suppressed by a factor of \(1/(N_c^2 - 1)^2\) with respect to their corresponding colour-singlet splittings [66].

To illustrate our procedure for including the NRQCD-based fragmentation functions in the Pythia 8 shower, we give the details for the splitting. The splitting kernel is given as,

(15)

where the LDME depends upon the final physical state that this splitting kernel is being used to produce, e.g. the \(\psi (1S)\) as a state.

This expression differs from that given in Ref. [30] for a number of reasons. First, the z convention used there is equivalent to \(1 - z\) in Pythia 8, and so a change of variables is made here. An additional factor of \(2\pi \) is also included to account for the \(1/(2\pi )\) in Eq. (9). There is already a factor of \(\alpha _s (\mu )\) in Eq. (9), so only one factor of \(\alpha _s\) is included, evaluated at the scale \(p_\text {T,evol}^{}\); in Ref. [30], both factors of \(\alpha _s\) are evaluated at a scale of \(3m_{Q}\). The choice for this scale is discussed in Sect. 5. Finally, the integration variable of Eq. (9) is \(\mu ^2 \equiv {p_\text {T,evol}^{2}} \), rather than s as provided in Ref. [30], so a Jacobian transforming from s to \(p_\text {T,evol}^{2}\) has been introduced:

(16)

Because there is already a factor of \(1/{p_\text {T,evol}^{2}} \) in Eq. (9), only the additional factor of \((s - m_{Q}^2)\) needs to be included, corresponding to the final line of Eq. (15). A similar conditioning must be done for all the splitting kernels of Table 1.

An overestimate must also be determined for Eq. (15). This can be determined analytically or numerically. For this splitting, we can factorise Eq. (15) into a z-independent component, and a z-dependent component f(z), given by,

(17)

A factor of \(1/(s - m_{Q}^2)^2\) is kept in the z-dependent component f(z) to ensure it remains dimensionless. The overestimate can then be written as,

(18)

where we have used the numerical knowledge that \(f(z) < 3/2\), and we use the parton shower cut-off to evaluate \(\alpha _s\). Since \((s - m_{Q}^2)\) must always be larger than , we replace \((s - m_{Q}^2)\) with \(8m_{Q}^2\) in the numerator of the second term. Again, a similar procedure can be performed for the remaining splitting kernels of Table 1. For all the splitting kernels, we use the default recoil scheme in Pythia 8, which is based on dipole-style recoils.

Some of the splitting kernels above deviate significantly enough from the example to warrant special mention. The kernel is a \(1 \rightarrow 3\) splitting, rather than the normal \(1 \rightarrow 2\), and so the splitting kernel depends upon another kinematic variable, the digluon mass \(m_{g g}^{2}\), which must be sampled when generating the final-state kinematics. To handle these specific kinematics, a \(1 \rightarrow 2\) splitting is performed first, where the digluon is treated as a single massive parton. Later, this digluon object is decayed isotropically in its rest frame into the actual two final-state massless gluons. Another factor of \(\alpha _s\) enters into this splitting kernel, which is naturally evaluated at a scale of \(p_\text {T,evol}^{}\) with no choice given to the user. The splitting kernel diverges rapidly as \({p_\text {T,evol}^{2}} \rightarrow 0\), and so a \(p_\text {T,evol}^{2}\)-dependent overestimate has been introduced to increase the sampling efficiency for this kernel. Finally, it should be noted that in a parton-shower context the splitting for the Fock state must be treated specially as described in the following.

There is one splitting, given by . This is unique because the splitting is \(1 \rightarrow 1\), rather than \(1 \rightarrow 2\), and arises from a delta function in the splitting kernel. Because the gluon has the same quantum numbers as the , this splitting can be thought of as an off-shell gluon converting directly to a state when the virtuality of the gluon matches the corresponding quarkonium mass. The splitting kernel is then given by,

(19)

where \(\delta m\) is a small mass splitting, which for this work is set as \(0.01~\mathrm {\,Ge\hspace{-1.00006pt}V} \). The results of the parton shower are not sensitive to the choice of this mass splitting.

5 Results

The differential fragmentation probability from which the splitting kernel of Eq. (15) was derived can be integrated over s to determine the total fragmentation probability. From Ref. [30], this is

(20)

for \(\psi (1S)\) production from the splitting at the lowest allowed scale of \(3m_{c}\). Here, z does not follow the same convention as Sects. 3 and 4, but rather is defined as where \(\sqrt{s}\) is the centre-of-mass energy for the initial dipole. A comparison can be made directly between the parton-shower implementation of this work, and Eq. (20). This is done by producing a \(c \overline{c} \) dipole with a sufficiently high \(\sqrt{s}\) that mass effects are negligible. The Pythia 8 parton shower, with the inclusion of the individual quarkonium splitting kernels from Sect. 4 with no competition between the quarkonium splittings, is then run on this dipole. All QCD splittings are also switched off. The result can be seen in the left plot of Fig. 1, where the Pythia 8 result given by the blue histogram matches well with the analytic prediction of Eq. (20) corresponding to the dashed line. Similar agreement is also seen for the \(\eta _c\), shown in red.

Fig. 1
figure 1

Production of colour-singlet S-wave states from charm splittings compared between (solid) Pythia 8 and (dashes) analytic expressions at the energy scales of (left) \(3m_{c}\) and (right) \(m_Z/2\)

Fig. 2
figure 2

Production of colour-singlet (left) S-wave and (right) P-wave states from gluon splittings compared between (solid) Pythia 8 and (dashes) analytic expressions at the energy scale of \(2m_{c}\)

Fig. 3
figure 3

Production of (left) colour-singlet and (right) colour-octet P-wave states from charm splittings compared between (solid) Pythia 8 and (dashes) analytic expressions at the energy scale of \(3m_{c}\)

Fig. 4
figure 4

Production of (left) colour-singlet and (right) colour-octet states from (upper) charm and (lower) gluon splittings with Pythia 8 at the energy scale of \(m_Z/2\)

The DGLAP equations can be applied to Eq. (20) to numerically evolve the fragmentation probability from the scale of \(3m_{c}\) up to \(m_Z/2\). In Ref. [30] this is provided numericallyFootnote 5 for the \(\psi (1S)\) and \(\eta _c\), as given by the dashed blue and red lines, respectively, in the right plot of Fig. 1. The implementation of this work can be compared by showering a \(c \overline{c} \) dipole with \(\sqrt{s} = m_Z \), and keeping all QCD splittings switched on. The results are also given in the right plot of Fig. 1. It should be noted that for high z, where mass effects become negligible, the results match well. However, at low z the results differ significantly, because in Ref. [30] the masses of the \(\psi (1S)\) and \(\eta _c\) were not considered. Accounting for this mass can be seen clearly in the Pythia 8 result, where there are no quarkonia produced with \(z < 2m_{\psi (1S)}/m_Z \).

Analytic expressions similar to Eq. (20) are available in the literature for all the splittings of Table 1, evaluated at the lowest kinematically allowed scale. For heavy-quark radiators \(Q\), this scale is \(3m_{Q}\), while for gluon radiators \(g\), the scale is \(2m_{Q}\). In the left plot of Fig. 2 the fragmentation probabilities \(D_{g \rightarrow \psi (1S) ^{(1)}} (z, 2m_{c})\) and \(D_{g \rightarrow \eta _c ^{(1)}} (z, 2m_{c})\) for the colour singlet splittings and are given, respectively. There is good agreement between the parton-shower results and the analytic predictions. In the right plot of Fig. 2, comparisons between the parton-shower results and analytic predictions are given for \(D_{g \rightarrow \chi _{c 0} ^{(1)}} (z, 2m_{c})\), \(D_{g \rightarrow \chi _{c 1} ^{(1)}} (z, 2m_{c})\), and \(D_{g \rightarrow \chi _{c 2} ^{(1)}} (z, 2m_{c})\), corresponding to the splittings , , and , respectively. All three parton-shower results agree well with their corresponding analytic predictions.

The fragmentation probabilities, evaluated at the energy scale of \(3m_{c}\), for the production of the \(\chi _{c}\) from \(c\)-initiated splittings, are given in Fig. 3. On the left, the fragmentation probabilities \(D_{c \rightarrow \chi _{c J} ^{(1)}} (z, 3m_{c})\) are given for the colour-singlet splittings . On the right, the fragmentation probabilities \(D_{c \rightarrow \chi _{c J} ^{(8)}} (z, 3m_{c})\) are given for the colour-octet splittings . All six parton-shower results are in good agreement with the analytic predictions.

Analytic predictions evolved up to the scale of \(m_Z \) are not available in the literature for any of the processes other than and for the \(\psi (1S)\) and \(\eta _c\), respectively. Consequently, we are not able to make comparisons similar to the right plot of Fig. 1. We instead provide results at the energy scale of \(m_Z/2\) in Fig. 4 for all the remaining splittings implemented in Pythia 8. The fragmentation probabilities for all the colour-singlet splittings are given in the upper-left plot, while the equivalent fragmentation probabilities for the colour-octet splittings of the \(\chi _{c}\) states are given in the upper-right plot. The fragmentation probabilities for all the colour-singlet splittings are given in the lower left, and the fragmentation probabilities for the colour-octet splittings of Eq. (19) are given in the lower right.

Fig. 5
figure 5

Comparison of (left) splitting kernel choices for the colour-octet splitting with Pythia 8 at the energy scale of \(m_Z/2\), and (right) scale choices for the colour-singlet splitting

There is some ambiguity in how colour-octet states should be treated in the parton shower after being produced, e.g. the states in the lower-right plot of Fig. 4. Because colour-octet states carry colour charge, they must undergo further QCD radiation before transitioning to their physical state, requiring the emission of one or more soft gluons. The default behaviour in Pythia 8, prior to this work, was to treat these states with twice the \(q \rightarrow q g \) splitting kernel,

$$\begin{aligned} P_{q \rightarrow q g} \propto \frac{(1 + z^2)}{1 - z} - \frac{2m_{q}^2}{s} \,, \end{aligned}$$
(21)

i.e. the two heavy quarks of the quarkonium state would both be treated as radiators. However, in the case of the state, which is the only colour-octet state implemented here, it may be more appropriate to consider this colour-octet state as a massive gluon. The corresponding splitting kernel, derived from Ref. [69], is

$$\begin{aligned} P_{g \rightarrow g g} \propto \frac{2z}{1 - z} - \frac{2m_{g}^2}{s} + \frac{4}{3}\bigl ((1 - z)/z + z(1 - z)\bigr ) \,, \end{aligned}$$
(22)

where the radiating gluon has mass \(m_g \) and the emitted gluon is massless. These splittings do not guarantee the gluon emission necessary to transition from an octet to singlet state, so the colour-octet states are required to have slightly larger masses than their corresponding physical states. They are then forced to transition to their physical states through the isotropic emission of a single soft gluon.

The difference between the two splittings of Eqs. (21) and (22) depends on when the gluon splitting producing the colour-octet state, , occurs in the parton-shower evolution. Prior to this splitting, the gluon can continue to radiate, \(g \rightarrow g g \). Consequently, the second of the two splittings results in a transition much later in the parton-shower evolution, driven primarily by \(g \rightarrow g g \) splittings. The effects of these two different splitting kernels, as well as the pre-factor used for Eq. (21), can be explored by creating an initial dipole with a state at an \(\sqrt{s}\) of \(m_Z \), and evolving this downward to the parton shower cut-off scale. The results for this configuration are shown on the left of Fig. 5. The difference in the energy radiated away by the colour-octet state depending on the splitting kernel is sizeable. For the previous Pythia 8 default splitting of \(q \rightarrow q g \) with a pre-factor of 2, the colour-octet state remains relatively isolated. If the pre-factor is unphysically doubled again to a pre-factor of 4, the colour-octet state radiates significantly more, as expected. However, for the splitting \(g \rightarrow g g \), the colour-octet state radiates even more. In this work, this gluon splitting kernel has been used throughout, i.e. to produce the bottom-right plot of Fig. 4. It is also possible for colour-octet states to transition to other colour-octet states. For example, the splitting can occur at leading order. However, we expect that these transitions should provide a second-order effect to the gluon radiation of the initial state, although future investigation is warranted.

As mentioned in the discussion of Eq. (15), some choice needs to be made for the scale at which \(\alpha _s\) is evaluated. Given the diagrams for most quarkonium production, it is natural to choose the scale of the \(\alpha _s\) from Eq. (9) as \(p_\text {T,evol}^{}\), or whatever the virtuality of the parton shower is. However, the choice of scale for the remaining \(\alpha _s\) factors, e.g. the \(\alpha _s\) of Eq. (15), is not as clear. Here, we consider three different options for \(\mu \): the mass of the quarkonium m, the parton-shower virtuality \(p_\text {T,evol}^{}\), and the centre-of-mass of the dipole \(\sqrt{s}\). In the right plot of Fig. 5, the impact of this choice is shown for \(\psi (1S)\) production from a \(c \overline{c} \) dipole with \(\sqrt{s} = m_Z \), using only the splitting kernel. The default choice of \(p_\text {T,evol}^{}\) is given in blue, and is identical to the corresponding blue curve in the right plot of Fig. 1. Switching to m provides a nearly identical result, while using \(\sqrt{s}\) significantly reduces the fragmentation probability, although the shape remains similar, as expected. It should be noted for the splitting, only one of the three factors of \(\alpha _s\) is effected by this scale choice, the other two are always evaluated at \(p_\text {T,evol}^{}\).

Table 2 Branching fractions for Higgs decays into quarkonia calculated with Pythia 8. Here, feed-down is from quarkonium decays produced by the parton shower, while hadrons is production from all non-onia hadron decays, e.g. from \(b\) hadrons. The second set of columns is production from the shower separated by splitting type

Up to this point, we have only considered parton-shower configurations where no competition is included between the various quarkonium splittings. This competition is critical when generating comprehensive quarkonium predictions. Because of this competition, the different fragmentation contributions do not simply factorize into independent pieces. To demonstrate the full flexibility of our implementation, we calculate the inclusive branching fractions for standard-model Higgs boson decays into quarkonia. When performing this calculation, all possible quarkonium splitting kernels are switched on with full competition, as well as all other possible splittings, i.e. QCD and electroweak. The results are given in Table 2, using the default LDMEs of Pythia 8 from Refs. [70,71,72]. Here, the branching fractions in the first set of columns are separated into quarkonium production from parton-shower feed-down and from non-quarkonium hadron decays, e.g. \(b\) hadrons. The second set of columns is production from the parton shower, separated by the splitting type. Here, \(X^{(8)}\) is any colour-octet production, \(g \rightarrow X^{(1)}\) is colour-singlet production from a gluon splitting, and \(Q \rightarrow X^{(1)}\) is colour-singlet production from heavy-quark splitting. The \(X^{(8)}\) column includes both production directly from a colour-octet state, and from feed-down that was produced by a colour-octet state. Conversely, the feed-down column includes production only from feed-down, but the feed-down may be produced by any mechanism.

For \(\psi (1S)\) states, production from non-quarkonium hadron decays dominates the branching fraction, while the parton-shower contribution is nearly two orders of magnitude smaller. This is similar for all the charmonium branching fractions. For bottomonium production there is no production from non-quarkonium hadron decays, as there are not sufficiently high-mass states. For the \(\eta _{b}\) and \(\Upsilon \), the primary production mechanism is colour-singlet production from \(b\) splittings, whereas for the \(\chi _{b}\) states, the primary production mechanism is from colour-octet splittings.

6 Conclusions

We have implemented a full-featured quarkonium parton shower in the Pythia 8 Monte Carlo event generator, available from version 8.310 onward. This parton shower can be used to model the LHC quarkonium isolation results, as well as explore the production of quarkonia in general. Our implementation includes all the relevant splitting kernels needed for the production of , , and states with arbitrary radial excitations and for \(Q \in c, b \). These splitting kernels have been validated against fragmentation probabilities available in the literature, whenever possible, and found to be in good agreement. We provide first predictions for a number of fragmentation probabilities evolved up to the energy scale of \(m_Z/2\). The framework is flexible, so that an arbitrary number of competing splitting kernels can be included. Various choices are available for the radiation of the colour-octet states, as well as a choice of scale for evaluating \(\alpha _s\). Finally, we provide comprehensive first predictions for the production of , , and states from decays of the standard-model Higgs boson. Future work includes developing a method to match these results from the parton shower with fixed-order predictions, including matrix-element corrections, and introducing additional splitting kernels, such as those needed for double-heavy meson and baryon production.