1 Introduction and motivation

Beauty (b) and charm (c) quarks are often considered “heavy flavours” because their masses are above the proton mass, \(m_b=4.2\) GeV and \(m_c=1.3\) GeV, respectively. However, their mass is not so large, compared to the typical scale of hadron formation, \(\varLambda \simeq 1\) GeV, so hadronisation occurs. This does not happen for the top quark, which has a mass 170 times bigger than the proton mass. This value is so large that its lifetime is smaller than the hadronisation scale.

Heavy flavours constitute a window on two mechanisms that provide ordinary matter with mass: electroweak symmetry breaking and the binding energy of strong interactions. On the one hand, they play a crucial role in studies of the Higgs boson and, on the other hand, they constitute a bridge between perturbative and non-perturbative Quantum Chromo Dynamics (QCD). For these reasons, they have been (and still are) the subject of detailed theoretical and experimental studies.

In this work, we focus on processes in which heavy quarks are produced. From a theoretical point of view, calculations for identified heavy flavours can be performed essentially because the quark mass sets a perturbative scale for the running coupling and, at the same time, removes collinear singularities. From an experimental viewpoint, the lifetime of B (or D) hadrons is long enough so that their decay happens away from the interaction point. Dedicated b- and c-tagging techniques that exploit this property to identify B and D hadrons, or b and c jets, are widely used in collider experiments.

Flavour physics has been studied for decades, both in the quark and lepton sectors. However, the recent development of Infra-Red and Collinear (IRC) safe flavour-jet algorithms [1,2,3,4,5] opens up the possibility to set up a yet-unexplored flavour physics program that exploits jets and their substructure at the Large Hadron Collider (LHC).

Resummed calculations for jets initiated by heavy quarks have been first performed in the context of studies focussing on B decays [6,7,8,9] and top jets [10,11,12,13,14,15], exploiting effective field theories. However, to the best of our knowledge, there exists only a handful of studies that exploit modern jet substructure techniques to study heavy flavours [16,17,18,19,20,21,22,23]. The main goal of these studies is to investigate the so-called dead-cone effect [24, 25], i.e. the fact that colour radiation around heavy quarks is suppressed. Remarkably, this effect has been recently measured by the ALICE collaboration at the LHC [26].

In contrast, there exists a rather extensive literature dedicated to studying the properties of a reconstructed B (or D) hadron, such as its energy or its transverse momentum. The theoretical description of these observables is usually based on heavy-quark fragmentation functions, which can be computed in perturbative QCD. State-of-the-art predictions include the resummation of different classes of logarithmic corrections, e.g. mass logarithms and end-point logarithms, see e.g. [27,28,29,30,31,32,33,34]. In this context, two of us have recently developed a theoretical framework to consistently resum both mass and soft logarithms [35, 36]. In this study, we will heavily rely on these results and we will apply them to jet-based observables. This way, we will be able to obtain theoretical predictions that resum both the logarithms of the observable we want to study and the logarithms of the ratio of the heavy-quark mass to the jet transverse momentum.

In this study, we investigate the possibility of exploiting a widely used jet substructure technique, namely Soft Drop [37], to study heavy-flavour jets. The Soft Drop algorithm has been extensively studied and Soft Drop jets are routinely used in experimental analyses, both in the context of measurements and searches for new physics. In particular, we consider heavy-flavour-tagged jets and focus on the angular separation \((\theta _g)\) and momentum fraction \((z_g)\) of the first splitting that passes Soft Drop. The former directly measures the angular resolution of the groomed jet and, therefore, it gives us direct access to the dead cone. The latter, instead, allows us to probe the heavy-quark splitting function [38, 39].

These observables have been recently measured on c-jets by the ALICE collaboration [40] and our study constitutes the first step towards a first-principle description of the data. A direct comparison to the ALICE data, however, goes beyond the scope of this paper, as it would require matching the resummed expressions to fixed-order matrix elements, as well as accounting for the fact that ALICE reconstructs c-jets from exclusive D-meson decays.

The paper is organised as follows. We start by reviewing the Soft Drop algorithm in Sect. 2. In Sect. 3 we perform the next-to-leading logarithmic (NLL) resummation of the \(\theta _g\) distribution, including heavy-quark mass effects and compare it Monte Carlo parton-shower simulations. In Sect. 4 we study the \(z_g\) distribution, both from first-principle and in Monte Carlo simulations and study its correlation to a standard momentum fraction variable used in the context of fragmentation functions. Finally, we draw our conclusions in Sect. 5. Details of the calculations are collected in the Appendices.

2 Soft Drop

Soft Drop [37] is a grooming algorithm that recursively removes soft-wide angle constituents from a jet. The Soft Drop procedure starts by re-clustering a given jet (typically an anti-\(k_t\) [41] jet) with radius \(R_0\) and transverse momentum \(p_t\) with the Cambridge–Aachen (C/A) algorithm [42, 43]. Soft Drop then parses the resulting angular-ordered branching history, grooming away the softer branch, until a branch that satisfies the condition

$$\begin{aligned}&\frac{\min \left( p_{t1}, p_{t2}\right) }{p_{t1} + p_{t2}} > z_c\left( \frac{\varDelta _{12}}{R_0}\right) ^\beta , \nonumber \\&\quad \varDelta _{12}=\sqrt{(y_1-y_2)^2+(\phi _1-\phi _2)^2}, \end{aligned}$$
(1)

is found. In the expression above, 1 and 2 denote the branches at a given step in the clustering, \(p_{ti}\) are the corresponding transverse momenta, and \(\varDelta _{12}\) is their rapidity-azimuth separation. If the condition above is never satisfied, we can either remove the jet from consideration (“tagging mode”) or leave it as the final Soft Drop jet (“grooming mode”).

The kinematics of the first branch that satisfies (1) defines the groomed jet radius \(\theta _g\) and the groomed momentum sharing \(z_g\):

$$\begin{aligned} \theta _g = \frac{\varDelta _{12}}{R_0}, \quad z_g = \frac{\min \left( p_{t1}, p_{t2}\right) }{p_{t1} + p_{t2}}. \end{aligned}$$
(2)

The \(\theta _g\) distribution is IRC safe and it was first studied, for light jets, in [37] and then resummed to next-to-leading logarithmic (NLL) accuracy in [44]. The momentum sharing \(z_g\) is IRC safe for \(\beta <0\) but only Sudakov safe for \(\beta \ge 0\) [38]. The NLL calculation of the \(z_g\) distribution was performed in [45].

3 The \(\theta _g\) distribution

In this section, we describe the resummation of the \(\theta _g\) distribution for a jet originated by a massive quark. We follow the general strategy described in Ref. [37] and we, therefore, consider the resummation of the cumulative distribution, normalised to the Born cross section,

$$\begin{aligned} \varSigma (\theta _g^2)=\frac{1}{\sigma _0}\int _0^{\theta _g^2}\frac{{{\,\textrm{d}\,}}\!\sigma }{{{\,\textrm{d}\,}}\!{\theta _g'}^2} {{\,\textrm{d}\,}}\!{\theta _g'}^2, \end{aligned}$$
(3)

exploiting Lund diagrams [46].

3.1 Lund plane geography

Fig. 1
figure 1

Lund plane representation of the soft and quasi-collinear phase space for emissions off a b quark, on the left, and off a c quark, on the right. The dead-cone region is indicated in grey. Soft Drop is applied and the groomed-away region appears in light purple. The vertical dashed line in black indicates a measurement of the groomed radius \(\theta _g\) and the corresponding area in pink is the vetoed region, which gives rise to the Sudakov form factor. The horizontal red lines indicate the boundaries between different flavour numbers for the running coupling, while the green one marks the boundary of the non-perturbative region

Lund diagrams are a useful way to represent the available phase space for the emission of soft and collinear gluons off a hard dipole. The Lund plane is usually represented in terms of pairs of the logarithmic variables, so that, in the soft and collinear limit, equal areas correspond to equal emission probabilities.

We choose to draw Lund planes as depicted in Fig. 1. On the horizontal axis, we show the logarithm of the emission angle with respect to the jet axis, as measured in the rapidity-azimuth plane, normalised to the jet radius, i.e. \(\theta =\frac{\varDelta }{R_0}.\) On the vertical axis, we show the emission’s transverse momentum with respect to the jet axis, normalised to the hard scale of the process, \(\kappa =\frac{k_t}{p_t R_0}.\) This way, lines of constant \(\theta _g,\) in the soft and collinear limits, are represented by straight vertical lines, as shown in Fig. 1, in dashed black.

The Soft Drop condition \(z=z_c \theta ^\beta \) sets the dotted line in purple. In this study, we consider \(\beta \ge 0.\) The region of phase space below this line (shaded in purple) is groomed away.

Recently, two of us have achieved a generalisation of the Lund-plane resummation formalism that includes quark-mass effects [36]. The Lund plane on the left-hand panel of Fig. 1 is for a b-quark line, while the one on the right is for a c-quark. When considering heavy-quark Lund planes, two important differences with respect to the massless case arise. First, the presence of the vertical short-dashed line, in black, that represents the beginning of the dead cone at \(\theta =\frac{m_b}{p_t R_0},\) i.e. \(\varDelta =\frac{m_b}{p_t},\) for the left-hand figure, and at \(\theta =\frac{m_c}{p_t R_0},\) for the right-hand one. The shaded area in grey is the dead-cone region. Here, collinear emissions do not give rise to any logarithmic enhancement, leading to a suppression of QCD activity in this region. Second, the presence of heavy-quark thresholds, which are relevant when considering the running of the strong coupling. In Fig. 1, they are represented by horizontal dotted lines in red. The lines at \(\kappa =\frac{m_{b,c}}{p_t R_0}\) correspond to \(k_t=m_{b,c}\) and therefore mark the boundaries between two regions with different numbers of active flavours: \(n_f=5\) above and \(n_f=4\) below, and \(n_f=4\) above and \(n_f=3\) below, for \(k_t=m_b\) and \(k_t=m_c,\) respectively. We also show (in green) the line \(\kappa =\frac{\varLambda }{p_t R_0}\) that marks the \(k_t=\varLambda \simeq 1\) GeV boundary between the perturbative and the non-perturbative regions. For the region below this line, we need a prescription to regulate the Landau pole of the strong coupling. Details are given in Appendix A. Finally, we show dotted lines in blue, at \(\kappa =z_c\) and at \(\kappa =z_c \left( \frac{m_{b,c}}{p_t R_0}\right) ^{1+\beta },\) which originate from the Soft Drop condition and the dead-cone boundary.

When performing actual calculations, one needs to establish the hierarchy between these scales. Let us consider, as an example the case of b-jets, with relatively high-\(p_t.\) Because Soft Drop is usually employed with \(z_c=0.1,\) we have that \(z_c> \frac{m_b}{p_t R_0}\) if \(p_t R_0> 42\) GeV. Ideally, we would like to have \(z_c \left( \frac{m_{b}}{p_tR_0} \right) ^{1+\beta }>\frac{\varLambda }{p_t R_0}.\) This results in an upper bound for the hard scale \((p_t R_0)^\beta <z_c \frac{m_{b}^{1+\beta }}{\varLambda }.\) However, this inequality is hardly satisfied if we consider commonly used values of the Soft Drop parameters, e.g. \(z_c=0.1\) and \(\beta =0.\) Thus, we do expect non-perturbative contributions to affect the dead-cone region and we will assess their size using Monte Carlo simulations. In Fig. 1, we show the Lund plane for this hierarchy of scales, on the left for b-jets and on the right for c-jets. A discussion of all the other possible hierarchies and regions can be found in Appendix B.

3.2 Recap of the massless calculation

Before computing the \(\theta _g\) resummed distribution for a jet initiated by a massive quark, let us briefly recall the structure of the resummation in the case of massless partons. The all-order cumulative distribution can be written as the product of two contributions

$$\begin{aligned} \varSigma ^\text {NLL}(\theta _g^2)=S(\theta _g^2)\, e^{-R(\theta _g^2)}. \end{aligned}$$
(4)

The function R is the Sudakov exponent, or radiator, which accounts for angular-ordered collinear emissions. It was first computed in [37] and, for the case of a quark jet, reads

$$\begin{aligned} R(\theta _g^2) = \int _{\theta _g^2}^1 \frac{{{\,\textrm{d}\,}}\!\theta ^2}{\theta ^2}\int _0^1 {{\,\textrm{d}\,}}\!z \,P_{gq}(z)\frac{\alpha _{\text {S}}^\text {CMW}(k_t^2)}{2\pi }\varTheta (z-z_c \theta ^\beta ), \end{aligned}$$
(5)

where \(\theta \) is the emission angle, normalised to the jet radius, \(k_t= z \theta p_t R_0\) and the massless \(q\rightarrow q g\) (timelike) splitting function is

$$\begin{aligned} P_{gq}(z)=C_{\text{ F }}\frac{1+(1-z)^2}{z}. \end{aligned}$$
(6)

In order to achieve NLL accuracy, the running coupling must be evaluated at two loops in the so-called CMW scheme [47]. Henceforth, we will also work in the small-\(z_c\) limit. Consequently, even in the case \(\beta =0,\) we will ignore flavour-changing contributions [48, 49]. The Sudakov form factor represents the no-emission probability and corresponds to the area shaded in pink on the Lund planes of Fig. 1.

This rather simple picture essentially arises from two facts. First, for the \(\theta _g\) distribution, independent emissions, i.e. \(\alpha _{\text {S}}^n C_{\text{ F }}^n\) contributions, exponentiate with no corrections due to multiple emissions [37]. Second, all non-Abelian contributions are unresolved and therefore captured at NLL by the running coupling in the CMW scheme. However, as it was pointed out in [44], the above description is not complete at NLL and the resummed expression must be supplemented with a correction factor, S in Eq. (4). Indeed simple exponentiation is broken by single logarithmic contributions that arise from two competing mechanisms. Let us start with the correction to the independent emission contribution. The C/A algorithm, which is used as the first step of the Soft Drop procedure, can cluster two soft gluons together first, if they are the closest pair in angle, instead of clustering each of them with the hard quark. This happens when \(\theta _{12}<\min (\theta _1,\theta _2).\) This effect introduces single-logarithmic corrections that are usually referred to as clustering logarithms [50,51,52,53]. On the other hand, the clustering condition also plays a role in the correlated emission contribution. The C/A algorithm can resolve a soft gluon splitting, giving rise to a non-global contribution [54, 55] that spoils the CMW picture. This is also a single logarithmic contribution that can happen if the two soft gluons are not the closest pair in angle, i.e. \(\theta _{12}>\min (\theta _1,\theta _2).\)

Both contributions start at \({\mathscr {O}}(\alpha _{\text {S}}^2)\) and can be computed using the expression of the squared matrix element for the emission of two strongly-ordered soft gluons with momenta \(k_1,\) \(k_2\) off a hard fermionic dipole with momenta \(p_a,\) \(p_b\) [56, 57]

$$\begin{aligned} W&= 2 C_{\text{ F }}w_{ab,1}\left[ C_{\text{ A }}w_{a1,2}+C_{\text{ A }}w_{b1,2}+\left( 2 C_{\text{ F }}-C_{\text{ A }}\right) w_{ab,2}\right] \nonumber \\&= 4C_{\text{ F }}^2 \, w_{ab,1} w_{ab,2} + 2 C_{\text{ F }}C_{\text{ A }}\, w_{ab,1} \left( w_{a1,2}+ w_{b1,2}-w_{ab,2}\right) , \end{aligned}$$
(7)

where the \(C_{\text{ F }}^2\) term describes the independent emission contribution, while the \(C_{\text{ F }}C_{\text{ A }}\) one, the correlated one. We have introduced the eikonal factor

$$\begin{aligned} w_{ij,l}= \frac{p_i\cdot p_j}{p_i\cdot k_l \, p_j\cdot k_l}. \end{aligned}$$
(8)

Following [44], we consider the case in which the softer gluon \((k_2)\) is emitted at a large angle and passes the Soft Drop condition, while the harder gluon \((k_1)\) is emitted at an angle smaller than \(\theta _g\) and so it is not subject to the Soft Drop condition. If we work in the small-angle limit, considering real and virtual contributions, we find

$$\begin{aligned} S&=1+\left( \frac{\alpha _{\text {S}}}{\pi }\right) ^2\int _0^1\frac{{{\,\textrm{d}\,}}\!z_1}{z_1}\int _0^1 \frac{{{\,\textrm{d}\,}}\!z_2}{z_2}\int _0^{2 \pi }\frac{{{\,\textrm{d}\,}}\!\phi }{2 \pi }\int _0^{\theta _g^2} \frac{{{\,\textrm{d}\,}}\!\theta _1^2}{\theta _1^2}\int _{\theta _g^2}^1\frac{{{\,\textrm{d}\,}}\!\theta _2^2}{\theta _2^2} \nonumber \\&\quad \times \varTheta \left( z_2- z_c \theta _2^\beta \right) \varTheta \left( z_1-z_2\right) \Bigg [C_{\text{ F }}^2\, \varTheta \left( \theta _1-\theta _{12}\right) \nonumber \\&\quad -C_{\text{ F }}C_{\text{ A }}\frac{ \theta _1 \theta _2\; \cos \phi }{\theta _1^2+\theta _2^2-2 \theta _1 \theta _2 \cos \phi }\varTheta \left( \theta _{12}-\theta _1\right) \Bigg ]+{\mathscr {O}}(\alpha _{\text {S}}^3), \end{aligned}$$
(9)

with \(\theta _{12}^2=\theta _1^2+\theta _2^2-2 \theta _1 \theta _2 \cos \phi .\) The integration above can be simplified by noting that, in order to capture the highest power of the logarithm of \(\theta _g,\) we can evaluate the momentum fraction integrals with lower limit \(z_2= z_c \theta _2^\beta \simeq z_c \theta _g^\beta .\) This way, the momentum fraction integrals decouple from the angular ones and we obtain

$$\begin{aligned} S&= 1+\frac{1}{2}\left( \frac{\alpha _{\text {S}}}{\pi }\right) ^2 \log ^2 \left( z_c \theta _g^\beta \right) \int _0^{2 \pi }\frac{{{\,\textrm{d}\,}}\!\phi }{2 \pi }\int _0^{\theta _g^2} \frac{{{\,\textrm{d}\,}}\!\theta _1^2}{\theta _1^2}\int _{\theta _g^2}^1\frac{{{\,\textrm{d}\,}}\!\theta _2^2}{\theta _2^2}\nonumber \\&\quad \times \left[ C_{\text{ F }}^2\, \varTheta \left( 2\theta _1\cos \phi -\theta _{2}\right) -C_{\text{ F }}C_{\text{ A }}\frac{ \theta _1 \theta _2\; \cos \phi }{\theta _1^2+\theta _2^2-2 \theta _1 \theta _2 \cos \phi }\right. \nonumber \\&\quad \left. \times \varTheta \left( \theta _{2}- 2\theta _1\cos \phi \right) \right] +{\mathscr {O}}\left( \alpha _{\text {S}}^2 \log \theta _g\right) \nonumber \\&=1+\left( \frac{\alpha _{\text {S}}}{\pi }\right) ^2\log ^2 \left( z_c \theta _g^\beta \right) \frac{\pi ^2}{108}\left( C_{\text{ F }}^2 - 4 C_{\text{ F }}C_{\text{ A }}\right) . \end{aligned}$$
(10)

The all-order resummation of these contributions has been performed numerically, in the large-\(N_c\) limit [44]. In this study, we have decided to limit ourselves to studying the impact of these corrections. Thus, we approximate the clustering factor S by simply considering the exponentiation of the two-loop result:

$$\begin{aligned} S(\theta _g^2)= \exp { \left[ \frac{C_{\text{ F }}^2 - 4 C_{\text{ F }}C_{\text{ A }}}{108}\left( \int _{z_c \theta _g^\beta }^1 \frac{{{\,\textrm{d}\,}}\!z}{z}\, \alpha _{\text {S}}(z^2 p_t^2 R_0^2)\right) ^2\right] }, \end{aligned}$$
(11)

where the running of the strong coupling is taken at one loop.

3.3 The \(\theta _g\) distribution for a heavy-flavour jet

We now perform the resummation of the \(\theta _g\) distribution for jets initiated by a heavy quark, considering both the case of a b-quark and a c-quark. Both the Sudakov contribution R and the clustering correction function S in Eq. (4) need to be reconsidered. We start with the calculation of the resummed Sudakov exponent.

It was realised long ago that squared QCD matrix elements with massive partons factorise in the so-called quasi-collinear limit [58, 59]. In this approximation, both the transverse momentum \(k_t\) of the emission with respect to the massive emitter, and the mass m are small compared to the hard scale but they are considered of the same order. In this limit, the squared invariant amplitude for one-gluon emission takes the form

$$\begin{aligned} |{\mathscr {M}}|^2\simeq 8\pi \alpha _{\text {S}}\frac{z(1-z)}{k_{t}^2+z^2 m_i^2} P_{gi}\left( z,k_{t}^2\right) |{\mathscr {M}}_0|^2, \quad i=b,c,\nonumber \\ \end{aligned}$$
(12)

where the massive splitting function for \(i \rightarrow i g\) is

$$\begin{aligned} P_{gi}(z,k_t^2)= C_{\text{ F }}\left( \frac{1+(1-z)^2}{z}-\frac{2 z(1-z)m_i^2}{k_{t}^2+z^2 m_i^2}\right) . \end{aligned}$$
(13)

Following the same steps as in [36], we write

$$\begin{aligned} R_i(\theta _g^2,m_i^2,m_b^2,m_c^2)&= \int _0^{p_t^2 R_0^2} \frac{{{\,\textrm{d}\,}}\!k_t^2}{k_t^2+z^2 m_i^2}\int _0^1 {{\,\textrm{d}\,}}\!z \,P_{gi}(z,k_t^2)\nonumber \\&\quad \times \frac{\alpha _{\text {S}}^\text {CMW}(k_t^2)}{2\pi } \varTheta (z-z_c \theta ^\beta ) \varTheta (\theta -\theta _g), \end{aligned}$$
(14)

where \(\theta =\frac{k_t}{z p_t R_0}.\) Note that the mass dependence enters in two ways. First, for a jet initiated by a quark flavour \(i=b,c,\) the quasi-collinear phase-space and the massive splitting function depend on \(m_i.\) Second, regardless of the jet flavour, the running of the strong coupling may cross the b and the c thresholds, thus inducing a logarithmic dependence on the quark masses. In order to proceed further, it is convenient to change the integration variable in Eq. (14) from \(k_t^2\) to \(\theta ^2\):

$$\begin{aligned} R_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)&= \int _{\theta _g^2}^{1} \frac{{{\,\textrm{d}\,}}\!\theta ^2}{\theta ^2+ \theta ^2_i}\int _0^1 {{\,\textrm{d}\,}}\!z \,P_{gi}\left( z, k_t^2\right) \frac{\alpha _{\text {S}}^\text {CMW}(k_t^2)}{2\pi } \nonumber \\&\quad \times \varTheta (z-z_c \theta ^\beta ), \end{aligned}$$
(15)

where now \(k_t^2=z^2 \theta ^2 p_t^2 R_0^2,\) and we have introduced the following dimensionless variables

$$\begin{aligned} \xi _i=\frac{m_i^2}{p_t^2 R_0^2}, \quad \theta _{i}^2 =\xi _i, \quad \text {with} \ i=b,c. \end{aligned}$$
(16)

We stress again that the Sudakov exponent for either a high-\(p_t\) b-jet or c-jet depends on both \(\xi _b\) and \(\xi _c\) because the running coupling crosses both quark thresholds. However, it depends only on the dimensional ratio \(\theta _i^2\) of the corresponding flavour \(i=b,c.\)

The mass-dependent shift in the denominator of Eq. (15) acts as an effective lower bound of a logarithmic angular integration. This is the well-known dead-cone effect, [24, 25] i.e. the fact that radiation off massive partons at angles below \(m/p_t\) is not logarithmically enhanced. Therefore, we can further simplify our expression by shifting the angular integration variable to \(\bar{\theta }^2=\theta ^2+\theta ^2_i.\) Neglecting power corrections in the mass, we have

$$\begin{aligned}&R_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)= \int _{\vartheta _{g,i}^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2}\int _0^1 {{\,\textrm{d}\,}}\!z \,P_{gi}\left( z, z^2 (\bar{\theta }^2-\theta _i^2)p_t^2 R_0^2\right) \nonumber \\&\quad \times \frac{\alpha _{\text {S}}^\text {CMW}(z^2 (\bar{\theta }^2-\theta _i^2)p_t^2 R_0^2)}{2\pi } \varTheta (z-z_c (\bar{\theta }^2-\theta _i^2)^\frac{\beta }{2}), \end{aligned}$$
(17)

with \(\vartheta _{g,i}^2=\theta _g^2 +\theta ^2_i.\) Let us first note that

$$\begin{aligned} P_{gi}\left( z, z^2 (\bar{\theta }^2-\theta _i^2)p_t^2 R_0^2\right)&= C_{\text{ F }}\left( \frac{1+(1-z)^2}{z}-\frac{2 \theta _i^2 (1-z)}{z \bar{\theta }^2}\right) \nonumber \\&\equiv {\mathscr {P}}_{gi}(z,\bar{\theta }^2). \end{aligned}$$
(18)

Furthermore, as discussed in detail in [36], the mass-dependent shift in the argument of the running coupling contributes at most to NNLL corrections and, therefore, it can be dropped at the accuracy we are working at. A similar argument applies to the shift in the Soft Drop condition. This can be easily checked at fixed coupling. For instance, in the soft and collinear limits, we have

$$\begin{aligned} R_i^{(\text {f.c.})}&= \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi } \int _{\vartheta _{g,i}^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2}\int _0^1 \frac{{{\,\textrm{d}\,}}\!z}{z} \varTheta (z-z_c (\bar{\theta }^2-\theta _i^2)^\frac{\beta }{2})\nonumber \\&=\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\left\{ \log \vartheta _{g,i}^2\left( \log z_c+\frac{\beta }{4} \log \vartheta _{g,i}^2\right) \right. \nonumber \\&\quad \left. -\frac{\beta }{2}\left[ \text {Li}_2\left( -\frac{1}{\theta _i^2}\right) -\text {Li}_2\left( \frac{\theta _i^2}{\vartheta _{g,i}^2}\right) +\frac{1}{2}\log ^2\theta _i^2+\frac{\pi ^2}{6}\right] \right\} \nonumber \\&= \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi } \int _{\vartheta _{g,i}^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2}\int _0^1 \frac{{{\,\textrm{d}\,}}\!z}{z} \varTheta (z-z_c \bar{\theta }^\beta )+ \text {NNLL}, \end{aligned}$$
(19)

where in the last step we have dropped constant terms in the \(\theta _i \rightarrow 0\) or \(\theta _g \rightarrow 0\) limits, which are NNLL. Thus, the massive Sudakov exponent at NLL becomes

$$\begin{aligned} R_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)&= \int _{\vartheta _{g,i}^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2}\int _{z_c \bar{\theta }^\beta }^1 {{\,\textrm{d}\,}}\!z \,{\mathscr {P}}_{gi}(z, \bar{\theta }^2) \nonumber \\&\quad \times \frac{\alpha _{\text {S}}^\text {CMW}(z^2 \bar{\theta }^2 p_t^2 R_0^2)}{2\pi }. \end{aligned}$$
(20)

Remarkably, with the sole exception of the mass-dependent term in the splitting function, this result, when seen as a function of \(\vartheta _{g,i}^2,\) has the same form of the massless case, Eq. (5). Consequently, we can represent the massive Sudakov form factor in Fig. 1 using the same Lund planes as in the massless case, provided that we interpret vertical lines as lines of constant \(\vartheta _{g,i}^2.\) This way, the dead-cone line acts as a phase-space boundary because \(\vartheta _{g,i}^2\rightarrow \theta _i^2\) as \(\theta _g \rightarrow 0.\) The integrations in Eq. (20) are all straightforward but the presence of the Soft Drop condition, of the dead cone, and of mass thresholds force us to consider many cases. Details are given in the Appendices.

Next, we have to consider the clustering correction factor S. We adopt the same, approximate, strategy of the massless case. Namely, we consider the running-coupling exponentiation of the two-loop result. The square amplitude for the emission of two soft gluons off a massive quark–antiquark dipole was computed in [60, 61]. For our purposes, we are actually interested in a less general result, namely the case of the emission of two soft gluons that are strongly ordered in energies, e.g. \(z_2\ll z_1,\) off a single dipole. In this limit, the massive square matrix element takes the same form as Eq. (7) but with the massless eikonal factor \(w_{ij,k}\) substituted by the massive one:

$$\begin{aligned} w^m_{ij,l}= \frac{p_i\cdot p_j}{p_i\cdot k_l \, p_j\cdot k_l}-\frac{m_i^2}{2 p_i \cdot k_l}-\frac{m_j^2}{2 p_j \cdot k_l}. \end{aligned}$$
(21)

We choose \(p_a,\) see Eq. (7), to be the momentum of the heavy quark and we work in the quasi-collinear limit with respect to its momentum direction. Therefore, we can set \(p_b^2=0.\) We then integrate the \(C_{\text{ F }}^2\) and \(C_{\text{ F }}C_{\text{ A }}\) contributions with the same constraints as in Eq. (9):

$$\begin{aligned} S_i&=1+\left( \frac{\alpha _{\text {S}}}{\pi }\right) ^2\int _0^1\frac{{{\,\textrm{d}\,}}\!z_1}{z_1}\int _0^1 \frac{{{\,\textrm{d}\,}}\!z_2}{z_2}\int _0^{2 \pi }\frac{{{\,\textrm{d}\,}}\!\phi }{2 \pi }\int _0^{\theta _g^2} \frac{{{\,\textrm{d}\,}}\!\theta _1^2}{\bar{\theta }_1^2}\int _{\theta _g^2}^1\frac{{{\,\textrm{d}\,}}\!\theta _2^2}{\bar{\theta }_2^2} \nonumber \\&\quad \times \varTheta \left( z_2- z_c \theta _2^\beta \right) \varTheta \left( z_1-z_2\right) \left[ C_{\text{ F }}^2\, \varTheta \left( \theta _1-\theta _{12}\right) \right. \nonumber \\&\quad \times \left( 1 -\frac{\theta _i^2}{\bar{\theta }_1^2} -\frac{\theta _i^2}{\bar{\theta }_2^2} +\frac{\theta _i^4}{\bar{\theta }_1^2 \bar{\theta }_2^2}\right) -C_{\text{ F }}C_{\text{ A }}\frac{ \theta _1 \theta _2\; \cos \phi +\theta _i^2 }{\theta _1^2+\theta _2^2-2 \theta _1 \theta _2 \cos \phi } \nonumber \\&\quad \left. \times \left( 1-\frac{\theta _i^2}{\bar{\theta }_1^2} \right) \varTheta \left( \theta _{12}-\theta _1\right) \right] +{\mathscr {O}}(\alpha _{\text {S}}^3), \quad i=b,c, \end{aligned}$$
(22)

where, as before, \(\bar{\theta }_{1}^2=\theta _{1}^2+\theta _i^2\) and \(\bar{\theta }_{2}^2=\theta _{2}^2+\theta _i^2.\) As in the massless case, in order to obtain the leading contribution we can substitute \(\theta _2\simeq \theta _g\) in the Soft Drop condition. Therefore, the momentum fraction integrals decouple from the angular ones and we obtain

$$\begin{aligned} S_i= 1+\left( \frac{\alpha _{\text {S}}}{\pi }\right) ^2\log ^2 z_c \theta _g^\beta \Big [C_{\text{ F }}^2{\mathscr {F}}_1(\theta _g^2,\theta _i^2)+ C_{\text{ F }}C_{\text{ A }}{\mathscr {F}}_2(\theta _g^2,\theta _i^2)\Big ]. \end{aligned}$$
(23)

The Abelian contribution reads

$$\begin{aligned} {\mathscr {F}}_1(\theta _g^2,\theta _i^2)&= \int ^{\frac{\pi }{3}}_{0}\frac{{{\,\textrm{d}\,}}\!\phi }{2\pi }\int ^{4\cos ^2{\phi }}_{1}\frac{x {{\,\textrm{d}\,}}\!x}{(x+\kappa )^2}\Bigg [\log {(g(\phi ))}\nonumber \\&\quad -\frac{\kappa }{1+\kappa }\left( g(\phi )-1\right) \Bigg ], \end{aligned}$$
(24)

with

$$\begin{aligned} g(\phi )=\frac{4(1+\kappa )\cos ^2{\phi }}{x+4\kappa \cos ^2{\phi }}, \quad \kappa = \frac{\theta _i^2}{\theta _g^2}. \end{aligned}$$
(25)

For the non-Abelian case, instead, we find

$$\begin{aligned}&{\mathscr {F}}_{2}(\theta _g^2,\theta _i^2)=-\int ^{2\pi }_0 \frac{{{\,\textrm{d}\,}}\!\phi }{2\pi }\int ^{1}_0 \frac{{{\,\textrm{d}\,}}\!y}{y(1+y^2 \kappa )} \int ^{y}_0 {{\,\textrm{d}\,}}\!t \frac{2t}{t^2+y^2 \kappa }\nonumber \\&\quad \times \frac{t \cos {\phi }+y^2\kappa }{t^2-2t\cos {\phi }+1} \left[ 1-\frac{\kappa y^2}{t^2+\kappa y^2}\right] \varTheta (1-2 t \cos {\phi }). \end{aligned}$$
(26)

We evaluate \({\mathscr {F}}_1\) and \({\mathscr {F}}_2\) numerically. However, in the limit \(\theta _g^2\gg \theta _i^2\) we are able to perform the integrals analytically and we obtain

$$\begin{aligned} \lim _{\theta _i^2/\theta _g^2 \rightarrow 0}{\mathscr {F}}_1(\theta _g^2,\theta _i^2)&=\frac{\pi ^2}{108},\nonumber \\ \lim _{\theta _i^2/\theta _g^2 \rightarrow 0}{\mathscr {F}}_2(\theta _g^2,\theta _i^2)&=-\frac{\pi ^2}{27}, \end{aligned}$$
(27)

so that the massless result Eq. (10) is recovered. Remarkably, in the opposite limit, we find

$$\begin{aligned} \lim _{\theta _g^2/\theta _i^2 \rightarrow 0}{\mathscr {F}}_1(\theta _g^2,\theta _i^2)= \lim _{\theta _g^2/\theta _i^2 \rightarrow 0}{\mathscr {F}}_2(\theta _g^2,\theta _i^2)=0, \end{aligned}$$
(28)

so that these contributions disappear asymptotically. We note that this result is not related to the clustering conditions but rather to the requirement that one of the two gluons \((k_1)\) be within the groomed jet radius, i.e. \(0<\theta _1^2<\theta _g^2.\) Changing the integration variable to \(\bar{\theta }_1^2,\) we have \(\theta _i^2<\bar{\theta }_1^2<\theta _g^2+\theta _i^2.\) Thus, we have to evaluate the integral of a (regular or integrable) function over a domain that has a vanishing measure as \(\theta _g^2 \rightarrow 0.\)

As for the massless case, we do not perform the full NLL resummation of these effects, but we limit ourselves to exponentiate the \({\mathscr {O}}(\alpha _{\text {S}}^2)\) with running coupling corrections, as done in Eq. (11):

$$\begin{aligned} S_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)&= \exp \left[ \frac{C_{\text{ F }}^2 {\mathscr {F}}_1(\theta _g^2,\theta _i^2) + C_{\text{ F }}C_{\text{ A }}{\mathscr {F}}_2(\theta _g^2,\theta _i^2)}{\pi ^2}\right. \nonumber \\&\quad \left. \times \left( \int _{z_c \theta _g^\beta }^1 \frac{{{\,\textrm{d}\,}}\!z}{z}\, \alpha _{\text {S}}(z^2 p_t^2 R_0^2)\right) ^2\right] . \end{aligned}$$
(29)

In summary, the resummed \(\theta _g\) distribution for a heavy-flavour jet reads

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i}{{{\,\textrm{d}\,}}\!\theta _g}=\frac{{{\,\textrm{d}\,}}\!}{{{\,\textrm{d}\,}}\!\theta _g} \left[ S_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)\, e^{- R_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)}\right] , \end{aligned}$$
(30)

for \(i=b,c.\) The Sudakov exponent \(R_i\) is given in Eq. (20) and the clustering contribution \(S_i\) in Eq. (29). Note that our NLL distribution is no longer normalised, as a consequence of the fact that, in the massive case, the resummed cumulative behaves as a constant for \(\theta _g\rightarrow 0,\) while it vanishes in the massless case, see Eq. (4). Indeed, while in the massless case, Soft Drop with \(\beta \ge 0\) behaves perturbatively as a groomer, i.e. within resummed perturbation theory, it always returns a jet with \(\theta _g>0,\) the quark mass provides an effective cutoff so that, there is a non-vanishing probability, given by

$$\begin{aligned} S_i(0,\theta _i^2,\xi _b,\xi _c)\, e^{-R_i(0,\theta _i^2,\xi _b,\xi _c)}\sim e^{-\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }\left( \log ^2(z_c\theta _i^\beta )-\log ^2(z_c)\right) },\nonumber \\ \end{aligned}$$
(31)

to find Soft Drop jets with \(\theta _g=0,\) even if we expect fixed-order corrections, as well as non-perturbative effects, to smear this effect out. Thus, we could either use Soft Drop in grooming mode and supplement Eq. (30) with an endpoint, \(\delta (\theta _g),\) contribution that ensures normalisation or consider the algorithm in tagging mode and discard jets that return \(\theta _g=0.\) Henceforth, we choose this second option.

Finally, we note that, in the calculations of the resummed exponent presented so far, we have always included virtual corrections, as well as the contributions from jets that fail Soft Drop. In tagging mode, one should discard those and, in principle, repeat the calculations. However, we observe that in resummed perturbation theory

$$\begin{aligned} \varSigma (\theta _g)\Big |_\text {tagging}=\varSigma (\theta _g)\Big |_{\text {grooming}} -\varSigma (0)\Big |_{\text {grooming}}. \end{aligned}$$
(32)

Thus, in practice, we can just use the results obtained so far, provided that we remove the \(\theta _g=0\) contribution. Numerical results and their comparison to Monte Carlo simulations will be presented in the next section.

3.4 Numerical results and comparison to Monte Carlo simulations

We now provide numerical results for the \(\theta _g\) distribution according to Eq. (30), and compare them to Monte Carlo (MC) simulations. To this purpose, we generate events using , version 7.2.2 [62]. To test the all-order behaviour of the observables we are interested in, we consider LO matrix elements for the hard process, dressed with the parton shower.Footnote 1 Hadronisation effects are included using the cluster hadronisation model, when explicitly declared, and the CT14 [66] set of PDFs has been used throughout the paper.

We simulate the inclusive production of a pair of oppositely charged muons in association with a jet, in proton–proton collisions at 13 TeV centre-of-mass energy. The muon pair is required to have an invariant mass between 70 and 110 GeV. Jets are clustered using the anti-\(k_t\) algorithm [41] with \(R_0=0.4\) and then ordered in transverse momentum. The hardest jet containing a b (c) quark, or a B (D) hadron, is considered. To match the heavy quark/hadron with a jet we look at the closest (with respect to the jet axis) flavoured particle, starting from the highest in transverse momentum, with \(p_t > 5\) GeV. Properties of the flavoured particle and the jet are extracted and analysed using [67] and [68]. The fiducial phase-space for the muons is defined by the following cuts: \(p_{t,\mu } > 26\) GeV, \(|\eta _{\mu }|<2.4,\) while the jets are selected in the region defined by \(|\eta _J|<2.4.\) We consider three different transverse-momentum regions, i.e. \(p_{t} \ge 50, 150\) and 300 GeV. However, only \(p_{t}\ge 150\) GeV is shown in the main text, and the other cases can be found in Appendix C.

The \(\theta _g\) distribution is shown in Fig. 2. Each plot includes parton level, i.e. with parton shower effect only, and hadron level, i.e. with hadronisation and the Underlying Event (UE) included. Together with the MC prediction, we show the corresponding NLL result of Eq. (30). The latter also exhibits a theoretical uncertainty given by the variation of the renormalisation scale, for which the central value is set at the hard scale \(p_t R_0,\) by a factor of two, as customary. Plots in different rows correspond to results for the identified leading jet in \(Z+b\) (B), \(Z+c\) (D) and \(Z\,+\) light quark/hadron production, respectively. We always consider Soft Drop with \(z_c=0.1,\) and \(\beta =0\) and 1,  for left and right plots, respectively. For b and c jets, we also indicate, in green, the expected dead-cone region \(\theta _g< \theta _i=\frac{m_i}{p_t R_0}.\) All curves are normalised to have unit area. For \(\beta =0,\) i.e. when Soft Drop is more aggressive, we find good agreement between the NLL prediction and the MC. They are fairly different, instead, for \(\beta =1\) at high \(\theta _g.\) However, in this region, we expect fixed-order corrections, not considered here, to be important.

In order to highlight possible dead-cone effects, it is useful to consider the ratio between the heavy-flavour jet \(\theta _g\) distribution and the corresponding one for light-quark jets. We do this in Fig. 3. We concentrate on b-jets, but we show the results for three different transverse momentum cuts, namely 50, 150 and 300 GeV (from top to bottom). We show results for both \(\beta =0\) (left) and \(\beta =1\) (right). In every plot, we show the ratio obtained with our NLL resummed prediction and with the simulation performed with , both at parton-level and hadron-level.

As expected, because \(\theta _i=\frac{m_i}{p_t R_0},\) the dead cone is more visible at lower values of \(p_t.\) Interestingly, deviations between the b and the light quark distribution start at angular scales bigger than \(\theta _i.\) However, as \(p_t\) is increased, this transition is pushed to a region that is likely beyond experimental resolution. We also note that mass effects are more pronounced for the \(\beta =0\) case than \(\beta =1.\) Considering that we have already noted that the former is under better theoretical control than the latter, the Soft Drop jets with \(\beta =0\) appear to be an interesting choice to study the dead cone.

Fig. 2
figure 2

The groomed jet radius \((\theta _g)\) distribution for b-jets (top), c-jets (middle) and light-quark jets (bottom). In all plots, we show our NLL resummation, as well as the results obtained with the Monte Carlo event generator , both at parton and hadron level. Jets are selected with the anti-\(k_t\) algorithm with \(R_0=0.4,\) with \(p_t\ge 150\) GeV, and groomed with Soft Drop with \(z_c=0.1,\) and two values of the angular exponent: \(\beta =0\) (on the left) and \(\beta =1\) (on the right). The uncertainty bands for the analytic predictions are obtained by varying the resummation scale by a factor of two above and below the hard scale \(p_t R_0,\) i.e. \(\mu _R \in \left[ \frac{p_t R_0}{2}, 2 p_t R_0\right] \)

Fig. 3
figure 3

Ratios of groomed jet radius \((\theta _g)\) distribution of b jets to light-quark jets for three values of the jet transverse momentum: 50 GeV (top), 150 GeV (middle) and 300 GeV (bottom). The plots on the left are for \(\beta =0,\) and the ones on the right are for \(\beta =1.\) In each plot, we show our NLL resummation and the results obtained with , both a parton-level and hadron-level

4 The \(z_g\) distribution

We now discuss the second variable that characterises Soft Drop jets, namely the momentum fraction \(z_g.\) In this discussion, we want to provide a simple theoretical prediction for \(z_g\) for jets initiated by massive quarks. We also compare, both analytically and in Monte Carlo simulation, the variable \(z_g\) to a widely-used fragmentation variable that measures the transverse momentum fraction of the B(D) hadrons (or b(c) quark at parton level) with respect to the jet \(p_t.\)

4.1 Recap of the massless calculation

We start by briefly reviewing the calculation of the \(z_g\) distribution for light jets. The value of \(z_g\) is fixed by the first de-clustering of the jet that passes the Soft Drop condition. Because we are completely inclusive over the splitting angle, we must integrate over all possible values of \(\theta _g,\) including configurations where the two emissions become collinear, for which the integral diverges. If \(\beta \ge 0,\) collinear splittings always pass the Soft Drop condition and these divergent configurations are not cancelled by the corresponding virtual corrections, for which \(z_g\) is undefined, and heralds the fact that the observable is not IRC safe.

However, \(z_g\) belongs to a wider class, i.e. Sudakov safe observables [37, 38, 69, 70], that despite being IRC unsafe, can be computed in perturbation theory, provided that we use resummation. For this purpose, we need to introduce a safe companion observable. The Soft Drop procedure itself suggests using the groomed angle \(\theta _g,\) which we have discussed in the previous section. Therefore, we imagine to measure a value of \(z_g,\) given a finite angular separation \(\theta _g.\) Using the language of conditional probabilities, we have [38]:

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma }{{{\,\textrm{d}\,}}\!z_g} = \int _0^1 d \theta _g \, p(\theta _g) \, p(z_g|\theta _g), \end{aligned}$$
(33)

where \(p(\theta _g)=\frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma }{{{\,\textrm{d}\,}}\!\theta _g}\) and \(p(z_g|\theta _g)=\frac{p(z_g,\theta _g)}{p(\theta _g)}\) is the conditional probability for measuring \(z_g\) given a value of \(\theta _g.\) If \(\beta <0,\) \(z_g\) is IRC safe and the integral in Eq. (33) can be computed order by order in \(\alpha _{\text {S}}.\) This is no longer true when \(\beta \ge 0,\) which is the standard configuration in which the Soft Drop algorithm is used and, therefore, our case of interest. In this situation, the integral (33) diverges order by order in the strong coupling because of the \(1/\theta _g\) behaviour of the integrand. However, if we take \(p(\theta _g)\) to be the resummed distribution, i.e. the derivative of Eq. (4), then the Sudakov form factor regulates the \(\theta _g=0\) singularity, providing us with a finite answer for Eq. (33).

Because in this work we are not concerned with logarithms of \(z_g\) and \(z_c,\) we evaluate the conditional probability at fixed-order, focusing on the case of a quark-initiated jetFootnote 2:

$$\begin{aligned} p(z_g|\theta _g ) = \frac{P_{gq}(z_g)\alpha _{\text {S}}(z_g^2 \theta _g^2 p_t^2 R_0^2)}{\int _{z_c\theta _g^\beta }^{1} {{\,\textrm{d}\,}}\!z \, P_{gq}(z) \alpha _{\text {S}}(z^2 \, \theta _g^2 p_t^2 R_0^2)} \varTheta (z_g-z_c\theta _g^\beta ). \end{aligned}$$
(34)

The Soft Drop condition (1) dictates \(z_g<\frac{1}{2}\) and, if \(\beta =0,\) \(z_g>z_c.\)

The integral in Eq. (33) with running coupling must be performed numerically. However, it is interesting to study its fixed-coupling and lowest-order approximation, see Eq. (19). Thus, we consider

$$\begin{aligned} R^{(\text {f.c.})}(\theta _g^2)&= \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }\Big (\log ^2(z_c\theta _g^\beta )-\log ^2(z_c)\Big ) \nonumber \\&= \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\Big (\frac{\beta }{4} \log ^2 \theta _g^2+ \log \theta _g^2 \, \log z_c\Big ) , \nonumber \\ S^{(\text {f.c.})}(\theta _g^2)&=1. \end{aligned}$$
(35)

For \(\beta \ge 0,\) we find [38]

$$\begin{aligned}&\frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma ^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g} = P_{gq}(z_g)\frac{\alpha _{\text {S}}}{2\pi } \int _0^1\frac{{{\,\textrm{d}\,}}\!\theta _g^2}{\theta _g^2} e^{-R(\theta _g^2)} \varTheta (z_g-z_c\theta _g^\beta )\nonumber \\&\ = \sqrt{\frac{\alpha _{\text {S}}}{4\beta C_{\text{ F }}}} P_{gq}(z_g) e^{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }\log ^2 z_c} \Bigg [\text {erf}\left( \sqrt{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }} \log a \right) +1 \Bigg ], \end{aligned}$$
(36)

where \(\text {erf}(x)\) is the error function and \(a=\min [z_g,z_c].\) If \(\beta >0,\) the above result is non-analytic in \(\alpha _{\text {S}}\):

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma ^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g} = \frac{1}{2}\sqrt{\frac{\alpha _{\text {S}}}{\beta C_{\text{ F }}}}\, P_{gq}(z_g)+\mathcal{O}\left( \alpha _s\right) . \end{aligned}$$
(37)

Even more interesting is the \(\beta =0\) case. At fixed coupling, the conditional probability, Eq. (34), becomes independent of \(\theta _g\) and factors out of the integration to give

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma ^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g} = \frac{P_{gq}(z_g)}{\int _{z_c}^{1} {{\,\textrm{d}\,}}\!z \, P_{gq}(z)}\varTheta (z_g>z_c)+\mathcal{O}\left( \alpha _s\right) . \end{aligned}$$
(38)

It can be shown that the \(\beta = 0\) case does have a valid perturbative expansion in \(\alpha _{\text {S}},\) despite being \(\alpha _{\text {S}}\)-independent at lowest order. Eq. (38) shows that the \(\beta =0\) \(z_g\)-distribution is essentially driven by the QCD splitting function [38]. This observation has initiated numerous theoretical [73,74,75,76] and experimental [77,78,79,80,81,82,83] studies (see also [71, 72]) that aim to use \(z_g\) as a probe of QCD dynamics, both in pp and heavy-ion collisions.

4.2 The \(z_g\) distribution for a heavy-flavour jet

We now move to the case of b and c jets and we start by considering the simple \({\mathscr {O}}(\alpha _{\text {S}})\) calculation of the \(z_g\) distribution, in the quasi-collinear limit:

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.o.})}}{{{\,\textrm{d}\,}}\!z_g}&= \frac{\alpha _{\text {S}}}{2\pi } \int _{0}^{1} \frac{{{\,\textrm{d}\,}}\!\theta ^2}{\theta ^2+ \theta ^2_i}\int _{z_c \theta ^\beta }^1 {{\,\textrm{d}\,}}\!z \,P_{gi}\left( z, (z\theta p_t R_0)^2\right) \nonumber \\&\quad \times \delta (z-z_g) \nonumber \\&=\frac{\alpha _{\text {S}}}{2\pi } \int _{\theta _i^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2 } \,{\mathscr {P}}_{gi}\left( z_g, \bar{\theta }^2 \right) \varTheta (z_g-z_c \bar{\theta }^\beta )\nonumber \\&\quad + \text {NNLL}, \end{aligned}$$
(39)

where, as before, we have dropped the mass-dependent shift in the Soft Drop condition. Even before performing the integral, it is clear that the mass of the heavy quark, as one might have expected, regulates the collinear singularity and so the \(z_g\) distribution is IRC safe, for every value of \(\beta .\) The computation of the integral is straightforward but the presence of the \(\bar{\theta }^2\) dependent contribution in the splitting function complicates the result. However, to illustrate our point, it is sufficient to work at LL. Therefore, we approximate \({\mathscr {P}}_{gi}=2 C_{\text{ F }}/z_g,\) and we find

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.o.})}}{{{\,\textrm{d}\,}}\!z_g}&=- \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi } \frac{1}{z_g} {\left\{ \begin{array}{ll} \log \theta _i^2 &{} z_g>z_c, \\ \frac{2}{\beta }\log \frac{z_c\theta _i^\beta }{z_g}\, &{} z_c \theta _i^\beta<z_g< z_c. \end{array}\right. } \end{aligned}$$
(40)

Note that in the case \(\beta =0,\) we have \(z_g>z_c\) and only the first term survives. Albeit finite, this expression contains logarithms of \(\theta _i^2,\) which become large in the boosted regime. The all-order resummation of logarithms of \(z_g\) partially addresses this problem. Indeed, keeping our focus on the LL fixed-coupling approximation, we find the following result for the normalised cumulative distribution:

$$\begin{aligned}&\log \varSigma _i^{(\text {f.c.})}(z_g) =\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\Bigg [-\log \theta _i^2\, \log z_g\,\varTheta (z_g-z_c)\nonumber \\&\quad + \left( \frac{1}{\beta }\log ^2 \frac{z_g}{z_c}-\log \theta _i^2\, \log z_g\right) \varTheta (z_c-z_g)\varTheta (z_g-z_c \theta _i^\beta ) \nonumber \\&\quad +\left( \frac{1}{\beta }\log ^2 \theta _i^\beta - \log \theta _i^2\, \log z_c\theta _i^\beta \right) \varTheta (z_c \theta _i^\beta -z_g)\Bigg ]. \end{aligned}$$
(41)

The \(\beta =0\) case is rather simple

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g}=-\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\frac{\log \theta _i^2}{z_g}e^{-\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\log \theta _i^2\, \log z_g }, \quad z_g>z_c. \end{aligned}$$
(42)

We note that these expressions indeed resum those logarithms of \(\theta _i\) that are associated with logarithms of \(z_g.\) However, the \(\theta _i \rightarrow 0\) limit is not smooth and we do not recover the massless result of Eq. (36). This is related to the non-commutativity of the soft and massless limits, discussed at length in [35, 36].

Another way of resumming logarithms \(\theta _i^2\) is to resort to the conditional probability procedure described above for the massless case. To illustrate the procedure, we repeat, for the massive case, the calculation that led to the fixed-coupling result in Eq. (36). At LL, the splitting function can be approximated by its soft contribution. Moreover, at this accuracy \(R_i(\theta _g^2,\theta _i^2,\xi _b,\xi _c)=R(\vartheta _{g,i}^2),\) with R given by Eq. (35) and \(\vartheta _{g,i}^2\in [\theta _i^2,1].\) Therefore, for \(\beta \ge 0,\) we find

$$\begin{aligned}&\frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g}= \frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\frac{1}{z_g} \int _{\theta _i^2}^1\frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2} e^{-R(\bar{\theta }^2)} \varTheta \left( z_g-z_c {\bar{\theta }}^\beta \right) \nonumber \\&\quad = \sqrt{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\beta }} \frac{1}{z_g} e^{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }\log ^2 z_c} \nonumber \\&\qquad \times \left[ \text {erf}\left( \sqrt{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }} \log a \right) -\text {erf}\left( \sqrt{\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi \beta }} \log \left( z_c \theta _i^\beta \right) \right) \right] , \end{aligned}$$
(43)

with \(a=\min [z_g,z_c].\) In particular, for \(\beta =0,\) we find

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.c.})}}{{{\,\textrm{d}\,}}\!z_g}= \frac{1}{z_g \log z_c}\left( e^{-\frac{\alpha _{\text {S}}C_{\text{ F }}}{\pi }\log \theta _i^2\log z_c} -1\right) , \quad z_g>z_c. \end{aligned}$$
(44)

The first-order expansions of the above expressions agree with Eq. (40) and large logarithmic corrections in \(\theta _i^2\) are resummed. Furthermore, if we take \(\theta _i\rightarrow 0,\) we recover the massless distribution of Eq. (36). We further note that Eq. (44) is the same as its massless counterpart, but for the normalisation factor in brackets. This makes sense because in the LL fixed-coupling approximation, both distributions are driven by the most singular part of the splitting function, which is the same for \(P_{gq},\) \(P_{gb},\) and \(P_{gc}.\)

Because it has the correct \(\theta _i \rightarrow 0\) limit, we decide to use the conditional-probability approach to compute the \(z_g\) distribution of a heavy-flavour jet:

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i}{{{\,\textrm{d}\,}}\!z_g}&=\frac{1}{\sigma _0} \int _{\theta _i^2}^1 {{\,\textrm{d}\,}}\!\bar{\theta }^2 \frac{{\mathscr {P}}_{gi}(z_g,\bar{\theta }^2)\alpha _{\text {S}}(z_g^2 \bar{\theta }^2 p_t^2 R_0^2)}{\int _{z_c\bar{\theta }^\beta }^{1} d z \, {\mathscr {P}}_{gi}(z,\bar{\theta }^2) \alpha _{\text {S}}(z^2 \, \bar{\theta }^2 p_t^2 R_0^2)}\nonumber \\&\quad \times \varTheta (z_g-z_c\bar{\theta }^\beta ) \; \frac{{{\,\textrm{d}\,}}\!\sigma _i}{{{\,\textrm{d}\,}}\!\theta _g^2}\Bigg |_{\theta _g^2=\bar{\theta }^2-\theta _i^2}, \quad i=b,c, \end{aligned}$$
(45)

where the resummed \(\theta _g\) distribution is given in Eq. (45). We can make some further simplifications. The derivative in Eq. (45) gives a factor that simplifies, within our accuracy, the denominator of the conditional probability. Thus, we obtain

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i}{{{\,\textrm{d}\,}}\!z_g}&=\frac{1}{\sigma _0} \int _{\theta _i^2}^1 \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2} {\mathscr {P}}_{gi}(z_g,\bar{\theta }^2)\alpha _{\text {S}}(z_g^2 \bar{\theta }^2 p_t^2 R_0^2) \nonumber \\&\quad \times S_i(\bar{\theta }^2,\theta _i^2,\xi _b,\xi _c)\, e^{- R_i(\bar{\theta }^2,\theta _i^2,\xi _b,\xi _c)} \varTheta (z_g-z_c\bar{\theta }^\beta ). \end{aligned}$$
(46)

Numerical results and their comparison to Monte Carlo simulations will be presented in the next section.

We close this discussion by noting that Eq. (45) does not systematically resum logarithms of \(z_g\) and \(z_c.\) This is acceptable for our purposes because we are mostly interested in the \(\beta =0\) case, for which \(z_g>z_c =0.1.\) However, it would be interesting to extend the full NLL resummation for \(z_g,\) performed for light jets in [45] to the massive case. Because the calculation in [45] is based on the resummation of the double differential \((\theta _g,z_g),\) it may overcome the difficulties about the \(\theta _i \rightarrow 0\) limit of Eqs. (41) and (42), previously discussed.

4.3 Numerical results and comparison to Monte Carlo simulations

Fig. 4
figure 4

The groomed momentum fraction \((z_g)\) distribution for b-jets (top), c-jets (middle) and light-quark jets (bottom). In all plots, we show our resummed result, as well as the one obtained with the Monte Carlo event generator , both at parton and hadron level. Jets are selected with the anti-\(k_t\) algorithm with \(R_0=0.4,\) and groomed with Soft Drop with \(z_c=0.1\) and \(\beta =0.\) The plots on the left are for \(p_t \ge 50\) GeV, while the ones on the right are for \(p_t\ge 150\) GeV. The uncertainty bands for the analytic predictions are obtained by varying the resummation scale by a factor of two above and below the hard scale \(p_t R_0,\) i.e. \(\mu _R \in \left[ \frac{p_t R_0}{2}, 2 p_t R_0\right] \)

In this section, we compare our resummed results for the \(z_g\) distribution to the ones obtained with Monte Carlo event generators, both at parton- and hadron-level. We limit ourselves to the case \(\beta =0\) and we simulate events using , with the same settings as the ones described in Sect. 3.4.

In Fig. 4, we compare resummed results for two different values of the jet transverse momentum cut, namely \(p_t\ge 50\) GeV, on the left, and \(p_t\ge 150\) GeV, on the right. The plots at the top are for b-jets, the ones in the middle for c-jets, and the ones at the bottom for light-quark jets. All curves are normalised to have unit area. As already pointed out, the results all look very similar, because the shape of the distribution is mostly driven by the singular part of the splitting function for the emission of a gluon off a (massive) quark. However, we also note that differences between \(b,\) \(c,\) and light flavours are larger in the Monte Carlo results than in the analytic ones. This could be due to quark masses influencing the kinematics, thus causing mass-dependent recoil effects. These are, to a certain extent, accounted for in the parton shower, but neglected in the analytic resummation.

Overall, we find good agreement between our calculation and the Monte Carlo results, although our results are strangely closer to the full simulation than to the parton-level one. We note that our predictions undershoot the Monte Carlo at large \(z_g.\) We have traced this back to the fact that our calculation does not take into account symmetrised splitting functions, as discussed in the footnote of Sect. 4.1. Indeed by rescaling the \(z_g\) distribution by the symmetrised splitting function, the tail of the analytic calculation moves closer to the Monte Carlo results.

4.4 Comparison to fragmentation functions

Fig. 5
figure 5

Monte Carlo studies of the fragmentation function variable x,  obtained with . The plot on the left shows the \(\zeta =1-x\) distribution for b-jets, at parton- and hadron-level. The plot on the right shows the correlation between \(z_g\) (with \(z_c=0.1,\) and \(\beta =0)\) and \(\zeta ,\) at hadron level. In both cases, we have \(p_t\ge 150\) GeV, and \(R_0=0.4\)

The study of heavy-flavour production at high energies is a multi-scale problem and, as already pointed out, logarithms of the ratio of the quark mass to the hard scale of the process can spoil the convergence of the perturbative expansion. Because these logarithmic corrections are related to collinear dynamics, heavy-flavour production cross-sections obey a factorisation theorem and they can be written as the convolution of process-dependent partonic (massless) coefficient functions with universal heavy-flavour fragmentation functions. Fragmentation functions obey DGLAP evolution equations with timelike splitting functions, which allow one to resum these large logarithmic corrections to all perturbative orders.

The initial condition for heavy quark fragmentation functions can be computed in perturbation theory, as originally pointed out in Refs. [27, 28], where the next-to-leading order (NLO) computation in QCD was presented. The NNLO corrections were computed later in Refs. [29, 30]. Furthermore, the initial condition of the evolution is affected by soft logarithms, that can be resummed to all orders too [31,32,33,34, 36]. Fragmentation functions for b (or c) quarks are usually then supplemented with non-perturbative corrections before being compared to experimental data. In the context of heavy-flavour production in \(e^+ e^-\) collision, one of the most widely studied observables is perhaps the energy fraction x of the heavy quark (or hadron) with respect to the energy of the incoming beam, in the centre-of-mass frame, see [84] (and references therein) for a recent review.

Collinear factorisation ensures that fragmentation functions are universal objects and so they can be used, in principle, to describe heavy flavours in hadronic collisions. However, a few changes in their definition are usually required from a practical point of view. For instance, although is possible to study directly the properties of identified B or D decays, the LHC experiments typically measure properties of the B and D hadrons within jets [85,86,87,88]. Theoretical studies of heavy quarks fragmenting in jets have been performed using effective field theories [89,90,91,92].

In hadronic collisions, one usually measures projections of the hadron momentum with respect to the jet one (see e.g. [85]) or the transverse momentum fraction of the heavy flavour with respect to the jet transverse momentum (see e.g. [86]):

$$\begin{aligned} x= \frac{p_{ti}}{p_t}, \quad i=b,c. \end{aligned}$$
(47)

Because \(x=1\) at the Born level, if we want to study soft and collinear emissions, it is useful to define \(\zeta =1-x.\)

In what follows, we would like to study the correlation, if any, between the variable \(\zeta ,\) which measures the departure from Born kinematics of the identified heavy flavour within a jet, to \(z_g\) (with \(\beta =0)\), which probes the kinematics of the splitting that passes Soft Drop, in a heavy-flavour tagged jet.

Let us start with the \({\mathscr {O}}(\alpha _{\text {S}})\) calculation, in the quasi-collinear limit. The result for \(z_g\) can be immediately read off Eq. (39) by setting \(\beta =0\):

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.o.})}}{{{\,\textrm{d}\,}}\!z_g}= \frac{\alpha _{\text {S}}}{2\pi } \int _{\theta _i^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2 } \,{\mathscr {P}}_{gi}\left( z_g, \bar{\theta }^2 \right) \varTheta (z_g-z_c). \end{aligned}$$
(48)

The computation of the momentum fraction \(\zeta ,\) in the same approximation, is rather straightforward. We consider the emission of a gluon with momentum fraction z in the quasi-collinear limit. If the gluon is emitted within the jet, then \(z=\zeta ,\) i.e. the b(c) quark has momentum fraction \(1-z=x.\) If the gluon is emitted outside the jet, it does not contribute to the observable and x remains equal to unity.Footnote 3 We have

$$\begin{aligned} \frac{1}{\sigma _0}\frac{{{\,\textrm{d}\,}}\!\sigma _i^{(\text {f.o.})}}{{{\,\textrm{d}\,}}\!\zeta }&= \frac{\alpha _{\text {S}}}{2\pi } \int _{0}^{1} \frac{{{\,\textrm{d}\,}}\!\theta ^2}{\theta ^2+ \theta ^2_i}\int _0^1 {{\,\textrm{d}\,}}\!z \,P_{gi}\left( z, (z\theta p_t R_0)^2\right) \nonumber \\ {}&\delta (z-\zeta ) \nonumber \\&=\frac{\alpha _{\text {S}}}{2\pi } \int _{\theta _i^2}^{1} \frac{{{\,\textrm{d}\,}}\!\bar{\theta }^2}{\bar{\theta }^2 } \,{\mathscr {P}}_{gi}\left( \zeta , \bar{\theta }^2 \right) , \end{aligned}$$
(49)

where the second equation holds up to power corrections in the mass. Eq. (49) coincides with Eq. (48), for \(\zeta >z_c,\) i.e. \(x<1-z_c.\) We conclude that \(1-x\) and \(z_g\) are fully correlated at this perturbative order. It is interesting to investigate whether this remains true beyond that. We study this problem by performing a Monte Carlo study.

Before doing so, we study the \(\zeta =1-x\) spectrum using the same simulation setup described for our previous analyses. The results are shown in Fig. 5, on the left, where we plot the \(\zeta \) distribution at parton- and hadron-level, for b-jets with \(p_t \ge 150\) GeV. We note that non-perturbative effects are large. This is interesting because, while fragmentation functions are under better perturbative control than Soft Drop observables, the latter ones seem to be more robust against non-perturbative corrections.

Finally, the right-hand plot of Fig. 5 shows the two dimensional distribution for \(z_g\) (with \(z_c=0.1\) and \(\beta =0)\) and \(\zeta ,\) again for \(p_t \ge 150\) GeV, at hadron-level. We note that the \({\mathscr {O}}(\alpha _{\text {S}})\) correlation between these two observables is not maintained when higher-order corrections and non-perturbative effects are included. Therefore, we conclude that \(z_g\) and \(\zeta =1-x\) can offer different handles to study heavy-flavour dynamics at the LHC.

5 Conclusions and outlook

In this work, we have considered heavy-flavour jets, namely b- and c-jets. In particular, we have studied heavy-flavour jets in conjunction with the Soft Drop grooming algorithm. Exploiting resummed perturbation theory, we have computed the NLL distribution for the groomed jet radius \(\theta _g,\) for both b- and c-jets. Our calculation accounts for the dead-cone effect due to the heavy-quark mass, as well as \(m_b\) and \(m_c\) thresholds in the running coupling, which is frozen at a non-perturbative scale \(\varLambda =1\) GeV. We also discuss the role of clustering logarithms, first computed in [44]. The presence of many scales originates many cases and regions to be considered. We have listed all of them and shown numerical results for representative cases. We have compared our findings for the \(\theta _g\) distribution with simulated data obtained with Monte Carlo event generators, assessing the role of non-perturbative corrections, such as hadronisation and the Underlying Event. We have found good agreement between resummed perturbation theory and Monte Carlo simulation, for the \(\beta =0\) case, even for relatively low values of the hard scale, i.e. \(p_t=50\) GeV and \(R_0=0.4.\) The agreement worsens if positive values of the angular exponent \(\beta \) are considered.

We have also considered the momentum fraction \(z_g\) of the first emission that passes Soft Drop. Our calculation generalises to the massive case the conditional-probability approach of [38] and it allows for a resummation of mass effects. While our results hold for any \(\beta \ge 0,\) we have focussed our numerical investigations on the \(\beta =0\) case. This choice is motivated by a few reasons. First, this is the value for which, at least to lowest order, one has a clear factorisation of the conditional probability expression, leading to a distribution proportional to the splitting function. For \(\beta >0,\) the dependence on the splitting function is smeared out. Second, it is for the \(\beta =0\) case that one expects to find more similarities to the fragmentation variable x,  and third, from the study of the \(\theta _g\) distribution, we have found that the \(\beta =0\) is under better theoretical control and more sensitive to dead cone effects. We have compared our numerical results to Monte Carlo simulations. As for the massless case, the result is driven by the QCD splitting function and it is largely insensitive to non-perturbative effects.

Finally, we have compared the \(z_g\) distribution to the fragmentation variable x,  which measures the ratio of the heavy quark (hadron) transverse momentum to the jet \(p_t.\) We performed the analytic calculation of both distributions at \({\mathscr {O}}(\alpha _{\text {S}}),\) showing that they lead to the same results for \(z_g=1-x> z_c.\) We have investigated the all-order behaviour of these observables using Monte Carlo simulations and discovered that the parton-shower significantly dilutes this correlation. However, we have found that, in contrast to \(z_g,\) the x receives sizeable non-perturbative corrections. In the future, it would be interesting to repeat these studies with the modified version of the declustering procedure [93], whereby one follows the flavoured branch.

In this work, we have performed a detailed theoretical study of the effect of Soft Drop grooming on heavy-flavour jets. Before being able to compare to experimental data, such as the ones collected by the ALICE collaboration [40], a few steps need to be taken. We are going to implement our calculation in the resummation plugin to  [94, 95] in order to match our results to NLO theoretical predictions, with fiducial cuts, supplemented with non-perturbative corrections, as done for instance in [96,97,98,99,100,101]. In this context, it would be also interesting to lift the small-\(z_c\) limit and investigate, in the \(\beta =0\) case, flavour-changing contributions that may induce radiation into the dead-cone region. Finally, it would be important to improve the accuracy of the \(z_g\) calculation by including the resummation of \(z_g\) and \(z_c\) logarithms, as done in [45].