1 Motivation

With the LHC in operation, one can access a number of ‘rare’ processes which could have never been systematically studied at previous accelerators. In this article we revisit the associated production of Z bosons and heavy quark jets. This process involves both weak and strong interactions and therefore serves as a complex test of the Standard Model, perturbative QCD and our knowledge of parton densities. On the experimental side, we have at our disposal the data collected by ATLAS and CMS collaborations [1,2,3,4,5].

Earlier, we have demonstrated [6, 7] a quite reasonable agreement between the theoretical and experimental results with respect to many observables, such as the differential cross sections and particle correlations. Here we wish to go one step deeper in our understanding and draw attention to two new observables which can be used as clean probes of b-jet formation dynamics. We aim at a difference between ‘prompt’ and ‘non-prompt’ production cases. The former class refers to the situation when the b-quark is produced in the hard scattering subprocess; it further radiates lighter partons and after all evolves into a jet containing b-hadrons. The latter class refers to jets initiated by a light parton (gluon or quark), and then b-quarks (or rather b-hadrons) appear from a parton evolution cascade. We discuss kinematic criteria that can be helpful to discriminate these cases. Namely, we find that in the jets originating from b-quarks (i.e., ‘prompt’), b-hadrons carry larger momentum fraction \(z_b\) than in other (‘non-prompt’) jets. Second, in the prompt jets, b-hadrons move closer to the jet axis. The goal of the study is to give quantitative estimates and to see to what extent our expectations are supported by real data.

2 Theoretical framework

There are two commonly used approaches in perturbative QCD (pQCD) calculations for cross sections of processes containing heavy quarks. One of these approaches is the so-called four-flavour number scheme (4FNS), where only gluon distributions and first two quark generations are involved in the QCD evolution equations for parton (quark and gluon) densities in a proton, so that b-quarks appear in a massive final state as a result of gluon splitting \(g\rightarrow b\bar{b}\) at leading order. The second approach is the five-flavour number scheme (5FNS), which allows a b-quark density in the initial state where the b-quark is typically treated massless above the flavour threshold. Up to all orders, the 4FNS and 5FNS schemes should give exactly the same results (up to the power corrections \(\sim m_b^2/\mu ^2\)), while at a given order differences can occur (see Ref. [8] for an review).Footnote 1 In the calculations below, we apply the 5FNS schemes for MadGraph5_aMC@NLO with Cascade3.

In another framework, based on the CCFM evolution equation, only the gluon density but no heavy quark (and no light sea-quarks) parton densities exist. In the approach used with PEGASUS we apply a mixed scheme, using the gluon density from CCFM together with partons obtained in a DGLAP scheme with 5 active flavors (5FNS).

2.1 Calculations with Monte-Carlo generator PEGASUS

To calculate the \(Z+b\)-jet total and differential cross sections at the LHC we employ two different schemes based on the transverse momentum dependent (TMD) quark and gluon distributions in a proton.Footnote 2 The first method was proposed in [6] and relies mainly on the \(\mathcal{O}(\alpha \alpha _s^2)\) off-shell gluon-gluon fusion subprocess:

$$\begin{aligned} g^* + g^* \rightarrow Z + Q + \bar{Q}, \end{aligned}$$
(1)

which gives the leading contribution to the production cross section in the small x region, where the gluon density dominates over the quark densities. An essential point here is using the CCFM evolution equation to describe the QCD evolution of the TMD gluon density in a proton. The gauge-invariant off-shell amplitude for the gluon-gluon fusion subprocess has been calculated in Ref. [13, 14], where all the relevant technical details are explained.

In addition, we take into account two subleading subprocesses involving quarks in the initial state. These are the flavor excitation processes

$$\begin{aligned} q + Q \rightarrow Z + Q + q, \end{aligned}$$
(2)

and the quark–antiquark annihilation processes

$$\begin{aligned} q + \bar{q} \rightarrow Z + Q +\bar{Q}, \end{aligned}$$
(3)

which could play a role essentially at large transverse momenta (or, respectively, at large x which is needed to produce large \(p_T\) events) where the quarks are less suppressed or can even dominate over the gluon density. The contributions from the quark-induced subprocesses (2) and (3) are calculated within a conventional DGLAP-based (collinear) factorization scheme, which provides better theoretical grounds in the region of large x. The evaluation of the production amplitudes is straightforward and needs no explanations.

Our scheme [6, 7] represents a combination of two techniques with each of them being used at the kinematic conditions where it is best suitable. This scheme is implemented in the Monte-Carlo event generator pegasus [15], which has been used for numerical calculations. Taking all the three subprocesses (1), (2) and (3) into account we extend the predictions to the whole kinematic range. Note that at least one heavy quark Q is always present in the final state already at the amplitude level.

The parton-level calculation returned by pegasus has further been improved by including the effects of the initial and final state parton showering. For the collinear part of the calculation, that has been done using the conventional pythia8 [16] algorithm.Footnote 3 The off-shell part of the calculations includes this kind of correction in the form of TMD gluon densities. The subsequent decay \(Z \rightarrow l^+l^-\) (including the \(Z/\gamma ^*\) interference effects) is incorporated already at the production step at the amplitude level in order to fully reproduce the experimental setup.

For the TMD gluon density in a proton, we used a numerical solution of the CCFM equation [18,19,20,21]. We find it to be a suitable option since it smoothly interpolates between the small-x Balitsky–Fadin–Kuraev–Lipatov (BFKL) [22,23,24] gluon dynamics and large-x DGLAP one. We adopted the latest JH-2013 parametrization, namely, we choose the JH-2013 set 2 [25]. The corresponding TMD gluon density was fitted to high-precision DIS data on the proton structure functions \(F_2(x, Q^2)\) and \(F_{2}^c(x,Q^2)\). The fit was based on TMD matrix elements and involves the two-loop strong coupling constant, the kinematic consistency constraint [26, 27] and non-singular terms in the CCFM gluon splitting function [28]. For the conventional quark and gluon densities, we used the NNPDF 4.0 (LO) [29] set.

2.2 Calculations with Monte-Carlo generator Cascade3

This method represents a more rigorous scheme based on the parton branching (PB) approach, which was introduced [9, 10] to treat the DGLAP evolution [30,31,32,33]. The method provides a solution of these equations and coincides with the standard methods to solve the DGLAP equations for inclusive distributions at NLO and NNLO. It allows one to simultaneously take into account soft-gluon emission at \(z\rightarrow 1\) and the transverse momentum \(q_T\) recoils in the parton branchings along the QCD cascade. The latter leads to a natural determination of the TMD quark and gluon densities in a proton. One of the advantages of this approach is that the PB TMDs can be combined with standard (on-shell) production amplitudes, which can be calculated at higher orders with. Here we use matrix elements calculated with next-to-leading (NLO) order with MadGraph5_aMC@NLO [34] using the HERWIG6 subtraction terms, which are suitable for combination with PB-TMDs.

The tool to be used to calculate the observables within the PB approach is the Monte-Carlo event generator Cascade3 [17]. A special procedure is adopted for the initial partons’ transverse momenta: a transverse momentum is assigned according to the TMD density, and then the parton-parton system is boosted to its center-of-mass frame and rotated in such a way that only the longitudinal and energy components are nonzero. The energy and longitudinal component of the initial momenta are recalculated taking into account the virtual masses [17, 35]. This method keeps the parton-parton invariant mass exactly conserved, while the rapidity of the partonic system is approximately restored, depending on the transverse momenta.

The PB TMD parton densities were obtained from HERA I+II DIS data (details are described in [36]). In the numerical calculations below we use the PB-NLO-HERAI+II-2018 set 2 available in [37].

3 Numerical results

Before we turn to the discussion on the ’prompt’ and ’non-prompt’ b-jets we have to justify our approach by comparing the results of our simulations with the newest CMS and ATLAS data. We start by listing the parameters of our calculations. Following [38], we set the charm and beauty quark masses to \(m_c = 1.4\) GeV and \(m_b = 4.75\) GeV, the mass of Z boson \(m_Z = 91.1876\) GeV, its total decay width \(\Gamma _Z = 2.4952\) GeV, and \(\sin ^2\theta _W = 0.23122\). The electromagnetic coupling is taken as \(\alpha (m_Z)=1/128.74\).

In the calculations performed with pegasus we set \(\Lambda _\mathrm{QCD}^{(4)} = 200\) MeV and use two-loop QCD coupling according to [25]. The default renormalization scale was taken as \(\mu _R^2 = m_Z^2\), while the default factorization scale for the off-shell gluon-gluon fusion subprocess was \(\mu _F^2 = \hat{s} + {\mathbf{Q}}_T^2\), where \({\mathbf{Q}}_T\) is the net transverse momentum of the initial off-shell gluon pair. This choice is dictated mainly by the CCFM evolution algorithm (see [25] for more information). For the quark-induced subprocesses (2) and (3) we keep it equal to the renormalization scale, \(\mu _F=\mu _R\). To estimate the theoretical uncertainties of our pegasus calculations for off-shell gluon-gluon fusion subprocess (1) we use auxiliary ’+’ and ’−’ TMD gluon densities instead of the default TMD gluon distribution functions. These two sets refer to the varied hard scales in the strong coupling constant \(\alpha _s\) in the off-shell amplitude: ’\(+\)’ stands for \(2\mu _R\), while ’−’ refers to \(\mu _R/2\) (see [25]). For the quark-induced subprocesses (2) and (3) we just vary the hard scale around its default value between halved and doubled magnitude, as usual.

For the PB calculation with MG5aMC+CA3(Z+1)NLO, we set \(m_c = 1.47\) GeV, \(m_b = 4.75\) GeV, \(\alpha _s (m_Z^2) = 0.118\) and \(\mu _R = \mu _F = \frac{1}{2} \sum _i \sqrt{m^2_i +p^2 _{t,i}}\), where the sum runs over all particles and parton in the matrix element. The hard process calculations are performed at NLO with MadGraph5_aMC@NLO  [34] with herwig6 subtraction terms and the TMD parton shower is simulated with Cascade3 [17]. The theoretical uncertainties are obtained by varying the scale of the hard process by a factor 2 up and down, provided by MadGraph5_aMC@NLO.

3.1 Z + heavy quark jet production

Now we are in position to present our numerical results. First we discuss \(Z + b\) production cross sections measured by the ATLAS Collaboration at \(\sqrt{s} = 13\) TeV [5]. The experimental setup was as following: the transverse momenta of the leptons from the Z decay are required to be \(p_T^\text {lead}>27\), with pseudorapidities of \(|\eta ^l|<2.5\) for muons or \(|\eta ^l|<2.47\) for electrons (excluding \(1.37<|\eta ^l|<1.52\)). The leptons are isolated from the jets by \(\Delta R<0.4\). The invariant mass of the reconstructed Z-boson has to be within \(76<m^{ll}<106\). The jets are reconstructed in the anti-\(k_T\) algorithm with radius \(R_\text {jet}=0.4\) and should have \(p_T^\text {jet}>20\) GeV and rapidity \(|y|<2.5\).

Our numerical predictions are shown in Figs. 1 and 2 in comparison with the latest ATLAS data [5]. The shaded bands represent the theoretical uncertainties estimated as described above. In all figures we show predictions (as described above) obtained from MG5aMC+CA3(Z+1)NLO based on a NLO calculation of \(Z+1\) jets with MadGraph5_aMC@NLO and TMD parton showers from Cascade3 with and without hadronization, together with predictions obtained from pegasus with and without parton shower.

Fig. 1
figure 1

\(Z+b\) production differential cross sections as functions of b-jet and Z-boson transverse momenta and rapidities at \(\sqrt{s}= 13\) TeV. ATLAS experimental data were taken from [5]

Fig. 2
figure 2

\(Z+b\) production differential cross sections as functions of the azimuthal angle, angular difference and rapidity differences between b-jet and Z-boson at \(\sqrt{s}= 13\) TeV. ATLAS experimental data were taken from [5]

Fig. 3
figure 3

The relative production rate \(\sigma (Z + c)/\sigma (Z + b)\) as functions of heavy jet (left panel) and Z-boson (right panel) transverse momentum at \(\sqrt{s}= 13\) TeV. The experimental data are from CMS [3]

The MG5aMC+CA3(Z+1)NLO calculations describe well the ATLAS measurements of the Z boson and the b jet rapidity and transverse momenta spectra at low and moderate \(p_T\) within the theoretical and experimental uncertainties. From the comparison of MG5aMC+CA3(Z+1)NLO with and without fragmentation, we estimate the fragmentation correction of \(\sim 10 \% \) in the larger transverse momentum regions, while at small \(p_T(Z)\) and small \(p_T(bjet)\) the corrections are significantly larger. These corrections are coming from b-hadrons which are outside the jet with \(R=0.4\). Due to missing higher order contributions in the calculation (\(Z+1\) at NLO) there is a notable underestimate of the data at large transverse momenta, namely \(p_T \gtrsim 200\) GeV. These missing higher order contributions lead also to deviations at small \(\Delta \phi (Z,b)\), \(\Delta R(Z,b)\) and at large \(\Delta y(Z,b)\).

Fig. 4
figure 4

\(Z+b\) production differential cross sections as functions of the energy fraction carried by the decay muon with respect to the total b-jet energy and the muon transverse momentum with respect to the b-jet axis at \(\sqrt{s}= 13\) TeV for different cuts on the minimal \(p_T\) of the jet

The pegasus predictions describe the data quite well within the estimated uncertainties, failing though at large \(p_T\). The scenario implemented into pegasus, where the heavy quark is produced in the hard partonic scattering at the amplitude level, is able to reproduce well the measured distributions in Z boson and b jet rapidity and transverse momenta (at low and moderate \(p_T\)). It is interesting to observe, that the distribution of \(\Delta \phi (Z,b)\) is well described, even at low \(\Delta \phi (Z,b)\), which is in contrast with the PB result. One should however keep in mind that the pegasus calculations do not include fragmentation (unlike the PB ones).Footnote 4 Taking into account the fragmentation effect may result in a \(\sim 10\)% drop of cross section [39]. The inclusion of the corresponding fragmentation correction factor could simultaneously result in better agreement by the two approaches. In the \(\Delta y(Z,b)\) we observe a similar behavior as for the PB predictions. The final state parton shower effects does not significantly affect the pegasus predictions. It can be easily understood since the main contributions here comes from initial state gluon radiation, which is already taken into account in the CCFM-evolved gluon density.

Now we turn to associated \(Z + c\) production at the LHC and discuss the relative production rate \(\sigma (Z + c)/\sigma (Z + b)\) as measured by the CMS collaboration at \(\sqrt{s}=13\) TeV [3]. The experimental cuts are: the leading lepton from the Z decay is required to have a transverse momentum \(p_T^\text {lead}>26\) GeV, while the subleading lepton must have \(p_T^\text {sublead}>10\) GeV; with pseudorapidities within \(|\eta ^l|<2.4\) and the dilepton invariant mass should be \(71<m^{ll}<111\) GeV. The leptons are required to be isolated from the jets with \(\Delta R<0.4\). The jets are required to have \(p_T^\text {jet}>30\) GeV and \(|\eta ^{\text{ jet }}|<2.4\), reconstructed with the anti-\(k_T\) algorithm with radius \(R_\text {jet}=0.4\).

In Fig. 3 we show the results of our calculations for the ratio \(\sigma (Z + c)/\sigma (Z + b)\) as a function of Z boson or jet transverse momenta in comparison with the CMS data [3]. The shaded bands represent the estimated uncertainties of our calculations. A good description of the CMS data can be obtained with MG5aMC+CA3(Z+1)NLO The predictions from pegasus lie below the data, though being compatible with the data at \(\sim 2\sigma \). Thus, despite the fact that both considered scenario provide a more or less reasonable description of inclusive production data (see Figs. 1 and 2), they are very distinct for \(\sigma (Z+c)/\sigma (Z+b)\) observables and some correlation variables. The same conclusion was achieved earlier [7] when comparing the pegasus and sherpa (NLO pQCD) predictions, where the same heavy jet production scenario, as in the MG5aMC+CA3(Z+1)NLO  is employed. Below we will discuss the kinematic criteria that can be helpful in discriminating these two cases.

3.2 Prompt and non-prompt b-jets

In 5 FNS calculations often a b-quark is produced during the jet fragmentation process, in gluon splitting \(g\rightarrow b\bar{b}\), while this contribution is small in the CCFM or 4FNS approaches, since a b-quark is already present in the hard process. It is therefore of great importance to find observables which could differentiate between a heavy-quark produced during the jet fragmentation fand the one present at the hard process.

Having demonstrated that the latest LHC data on \(Z + b\) production can be well described by cascade and pegasus at least up to moderate transverse momenta, in the following we consider kinematic properties of b-jet production by investigating B-hadrons tagged via the semileptonic decay \(B\rightarrow \mu +X\): we consider the fractional energy \(z_b\) carried by the decay muon with respect to the total b-jet energy and the muon transverse momentum \(p_T^\mathrm{rel}\) with respect to the b-jet axis. With both variables we aim to distinguish between prompt b-production, where the b-quark exists already at the matrix element level, and non-prompt b-production, where the b-quark is generated during the jet evolution. We expect large \(z_b\) and small \(p_T^\mathrm{rel}\) (relative to the jet \(p_T\)) for prompt b production, while non-prompt b-production would lead to significantly smaller values for \(z_b\) and a larger tail for \(p_T^\mathrm{rel}\).

In Fig. 4 the distribution of \(z_b\) and \(p_T^\mathrm{rel}\) are shown for different thresholds of the jet transverse momentum: \(p_T^\mathrm{jets} > 30,\, 50,\, 100,\, 200,\, 300\) GeV. Jets with larger transverse momenta provide larger phase space for parton radiation cascades. Accordingly, they show a larger fraction of ‘non-prompt’ B-hadrons resulting in a larger fraction of low-energy muons (muons with low \(z_b\)) and in larger fraction of muons with high \(p_T^\mathrm{rel}\) (muons with large deviation from the jet direction).

We find, that the intuitive variables \(z_b\) and \(p_T^\mathrm{rel}\) are very powerful to distinguish prompt and non-prompt b production.

The observables \(z_b\) and \(p_T^\mathrm{rel}\) are very well suited to discriminate between the heavy jet production mechanisms. An experimental measurement of these variables would be important to test the model predictions, and to confirm the importance of gluon splitting in the production of high \(p_T\) b-jets.

4 Conclusion

We have considered the production of Z bosons associated with charm and beauty jets at LHC conditions. We investigated two different schemes.

We find that the combination of three basic subprocesses (1)–(3) involving heavy quarks in final states provides a consistent description of Z boson and/or b-jet transverse momenta and rapidity distributions as well as different correlation observables in \(Z + b\) events at low and moderate \(p_T\). This can be seen from a direct comparison between the model predictions obtained using the Monte-Carlo generator pegasus and recent LHC data.

In another approach we consider \(Z+jet\) production at NLO, where heavy quarks can be produced directly in the matrix element, or during the showering process. We perform the calculations based on the Parton Branching TMDs together with TMD shower for the initial state cascade. Such calculations were performed using MG5aMC+CA3(Z+1)NLO. We find very good description of the measurement at not too large transverse momentum of the Z boson.

We classify different b-jet production mechanisms as prompt and non-prompt, depending whether the heavy quark is present at matrix element level or generated during the jet evolution.

Events of the prompt and non-prompt types show rather different kinematic properties, that can be seen, in particular, in relative production rate \(\sigma (Z + c)/\sigma (Z + b)\) measured very recently by the CMS Collaboration for the first time. The ratio of c over b jet production is also reasonably well described by the PB prediction of MG5aMC+CA3(Z+1)NLO.

Considering the \(Z + b\) events as a representative example, we see that jets with larger transverse momenta contain larger fraction of non-prompt b-hadrons, that results in larger fraction of low-energy muons and in larger fraction of muons with large deviation from the jet direction. We recommend that the relevant observables, \(z_b\) and \(p_T^\mathrm{rel}\), be used in the forthcoming experimental analyses.