1 Introduction

Analytic studies of Quantum-Chromo-Dynamics (QCD) in the physical case, i.e. in four space-time dimensions, are substantially restricted to perturbation theory. The systematic application of perturbation theory to QCD gives rise to the well-known perturbative Quantum-Chromo-Dynamics (pQCD), a mature branch of theoretical physics, deeply involved in the phenomenology of the Standard Model. While exact analytic solutions of QCD correlation functions could be intrinsically beyond human ability [1], perturbative calculations of cross sections and decay rates often reveal a rich structure and describe a variety of physical effects.

There are basically two different approaches in pQCD. The first one involves an exact evaluation of the QCD matrix elements of the process under investigation, up to a given order n in the coupling \(\alpha _S\) (n is taken, of course, as large as possible):

$$\begin{aligned} \sigma \, \simeq \, \sum _{k=0}^n \alpha _S^k \, \sigma ^{(k)}, \end{aligned}$$
(1)

where \(\sigma ^{(k)}\) in the contribution to the physical cross section \(\sigma =\sigma (\alpha _S)\) of order k. That way, all physical effects which show up in the matrix elements up to the truncation order n are trivially taken into account. In this approach, one has to assume that the higher order \((k>n)\) contributions to the cross section can be safely neglected. In practice, one has to assume that all the terms in \(\sigma ^{(k)}\) are of order unity.

In the second approach, one concentrates instead, from the very beginning, on a specific physical effect. Usually, such effect manifests itself in peculiar perturbative terms, which depend on a kinematic parameter, and which become large in some region of the space of this parameter. Actually, at a generic order k, such enhanced terms \(s^{(k)}\), contained in \(\sigma ^{(k)}\), can become so large as to cancel the smallness of \(\alpha _S^k\), the k-th power of the QCD coupling. We face the situation

$$\begin{aligned} \alpha _S \, \ll \, 1, \qquad \left| s^{(k)} \right| \, \gg \, 1, \end{aligned}$$
(2)

with:

(3)

For concreteness sake, we have assumed that the smallest order at which the physical effect under consideration manifests itself is one, i.e.

$$\begin{aligned} k \, \ge \, k_{\min } \, = \, 1, \end{aligned}$$
(4)

but higher values of \(k_{\min }=2,3,\ldots \), are possible; Usually, \(k_{\min }\) is a small integer. In this physical situation, it is necessary to resum the enhanced terms \(s^{(k)}\) to all orders of perturbation theory, i.e. for all k. Schematically, we can write:

$$\begin{aligned} \sigma \, \simeq \, \sigma ^{(0)} \,+\, \sum _{k=1}^\infty \alpha _S^k \, s^{(k)}. \end{aligned}$$
(5)

Therefore, in the second approach, we realize an approximate resummation of the perturbative series of the given cross section to all orders. Note that, at each perturbative order k, we do not compute the exact cross section contribution \(\sigma ^{(k)}\), but only the leading term \(s^{(k)}\) contained in it,

$$\begin{aligned} s^{(k)} \, \simeq \, \sigma ^{(k)}, \end{aligned}$$
(6)

as far as the physical effect under consideration is taken into account.

From the above considerations, it should be clear that fixed-order calculations and resummed ones rely on quite different philosophies. In order to obtain an optimal perturbative description of the process, one should combine in some way the two approaches. That involves the so-called matching operation – or simply matching – in which one requires consistency between the two approaches. Roughly speaking, one would like to have an improved perturbative formula for the cross section which, at low orders, contains the exact matrix elements while, at higher orders, contains the approximate matrix elements of the resummation.

In this work we consider the general process

$$\begin{aligned} Q_1 \, \rightarrow \, Q_2 \, + \, \mathrm {(non \, QCD \, partons)}, \end{aligned}$$
(7)

where \(Q_1\) and \(Q_2\) are two different heavy quarks of mass \(m_1\) and \(m_2\) respectively and the non-QCD, i.e. non colored, partons, can be a photon, a lepton pair, an intermediate vector boson, etc. For the decay to occur, one has to assume:

$$\begin{aligned} m_2 \, < \, m_1. \end{aligned}$$
(8)

Let us describe in qualitative terms the physics of the simplest process above as far as soft-gluon dynamics is concerned, namely the rare decay

$$\begin{aligned} B \, \rightarrow \, X_s \, + \, \gamma , \end{aligned}$$
(9)

where \(X_s\) is the final hadronic state containing the strange quark s, coming from the fragmentation of the beauty quark b inside the B meson. In order to construct a general theory, let us assume that the strange quark mass \(m_s\) is a parameter that we can change at will. Let us consider first the massless limit of the final quark,

$$\begin{aligned} m_s \, = \, 0. \end{aligned}$$
(10)

We assume to be in the so-called threshold (or large-x) region,

$$\begin{aligned} m_{X_s}^2 \, \ll \, m_b^2, \end{aligned}$$
(11)

in which the invariant mass of the final hadronic (partonic in pQCD) state \(X_s\) is restricted to be much smaller than the hard scale of the process, provided by the initial beauty quark mass, \(Q=m_b\). In terms of the normalized invariant mass squared

$$\begin{aligned} y \, \equiv \, \frac{m_{X_s}^2}{m_b^2} \qquad (m_s=0), \end{aligned}$$
(12)

the threshold region is simply written:

$$\begin{aligned} y \, \ll \, 1 \qquad (y \in [0,1]). \end{aligned}$$
(13)

Roughly speaking, in the threshold region, not much radiation can be emitted, so the related rate is expected to be suppressed. Note that we only fix the invariant mass \(m_{X_s}\) of the final hadronic state, and not other quantities, such as for example the strange quark energy or its transverse momentum with respect to the photon 3-momentum. The final hadronic state \(X_s\) is treated as a single pseudo-particle, with a continuous invariant mass distribution (rather than a fixed mass, like an ordinary particle). We may say that, by means of the condition (13), we observe QCD radiation indirectly, in a semi-inclusive way.

The final strange quark, assumed to be emitted with a large energy compared to the QCD scale \(\Lambda _{QCD} \approx 300 \, \textrm{MeV}\) for pQCD to be relevant,

$$\begin{aligned} E_s \, \approx \, \frac{m_b}{2} \, \gg \, \Lambda _{QCD}, \end{aligned}$$
(14)

evolves, because of collinear emissions, into a hadronic jet.

Let us now consider the rare decay (9) in the general massive case \(m_s \ne 0\). The situation becomes substantially more complicated because of the presence of a new mass scale. The definition of the threshold region (11) can be generalized by means of the condition

$$\begin{aligned} m_{X_s}^2 \, - \, m_s^2 \, \ll \, m_b^2 \qquad (m_s \ne 0). \end{aligned}$$
(15)

The invariant mass \(m_{X_s}\) of the final hadronic state \(X_s\) is restricted to not become much larger than \(m_s\), compared to the hard scale \(m_b\). As in the massless case, that is again a constraint on QCD radiation: the latter cannot increase too much \(m_{X_s}\) with respect to \(m_s\). Note that the condition above trivially reduces to (11) in the massless case, so it is a sensible generalization. Let us also remark that the condition (15) does not imply neither \(m_{X_s}^2 \ll m_b^2\) nor \(m_s^2 \ll m_b^2\).

The unitary adimensional variable y defined in Eq. (12) is naturally generalized, in the massive case, as

$$\begin{aligned} y \, \equiv \, \frac{m_{X_s}^2 \, - \, m_s^2}{m_b^2 \, - \, m_s^2} \qquad \qquad (m_s \ne 0), \end{aligned}$$
(16)

in terms of which the threshold region is written

$$\begin{aligned} y \, \ll \, 1, \end{aligned}$$
(17)

just like in the massless case. We have divided the squared mass increase, \(m_{X_s}^2 \, - \, m_s^2\), by \(m_b^2 \, - \, m_s^2\), instead of simply \(m_b^2\), in order to have a unitary variable \((y \in [0,1])\), again as in the massless case.

If the final strange quark is relativistic (in the beauty rest frame), the non-vanishing of \(m_s\) produces the well-known dead-cone (dc) effect, i.e. the fact that gluon radiation is mostly emitted outside a cone centered around the strange quark motion direction, namely

(18)

\(\gamma _s\) is the Lorentz factor of the strange quark,

$$\begin{aligned} \gamma _s \, \equiv \, \frac{1}{ \sqrt{1 \, - \, \left| \textbf{v}_s \right| ^{\,2} / c^2} } \, \gg \, 1, \end{aligned}$$
(19)

with \(\textbf{v}_s \equiv d\textbf{r}_s/dt\) the ordinary strange quark 3-velocity. The dead-cone effect is well known from classical electrodynamics [2]; In general, a non-vanishing final quark mass softens the collinear singularity according to a general mechanism [3]. Since the source of kinetic energy of the strange quark is the beauty mass, the strange quark can be relativistic only if it is much lighter than the beauty, i.e. if

$$\begin{aligned} m_s \, \ll \, m_b. \end{aligned}$$
(20)

Since the usual kinematic condition of y larger than a given positive value \(y_{\textrm{cut}}\), namely

$$\begin{aligned} y \,> \, y_{\textrm{cut}} \, > \, 0, \end{aligned}$$
(21)

also gives a constraint on gluon emission angles (the related observable is an infrared safe quantity), one has to find which one of the two limitations (18) and (21) is stronger and then effective.

In general, we can identify three different subregions in the threshold region (15).

  1. 1.

    The effectively-massless region, in which the strange mass is much smaller than the final jet mass,

    $$\begin{aligned} m_s^2 \, \ll \, m_{X_s}^2 \, \ll \, m_b^2. \end{aligned}$$
    (22)

    In this case, the strange mass only gives power corrections to the massless distribution previously considered \((m_s=0)\), of the form

    $$\begin{aligned} \left( \frac{m_s^2}{m_{X_s}^2}\right) ^n, \qquad n=1,2,3,\ldots , \end{aligned}$$
    (23)

    possibly multiplied by logarithms of the same quantity. As already remarked, since the invariant mass distribution is an infrared (i.e. soft and collinear) safe quantity, no strange quark mass singularities can arise (i.e. terms of the form \(\log m_s^2/m_{X_s}^2\) without a power-suppressed coefficient). In this case, the strange mass is so small that the dead-cone effect (18) gives a small correction to the massless distribution. Note that relation (22) is basically equivalent to the relation

    $$\begin{aligned} m_s^2 \, \ll \, m_{X_s}^2 \, - \, m_s^2 \, \ll \, m_b^2 \, - \, m_s^2 \approx m_b^2. \end{aligned}$$
    (24)

    If we define the mass-correction parameter

    $$\begin{aligned} \rho \, \equiv \, \frac{m_s^2}{m_b^2 \, - \, m_s^2} \qquad (0< \rho < \infty ), \end{aligned}$$
    (25)

    the region (22) is simply written

    $$\begin{aligned} \rho \, \ll \, y \, \ll \, 1. \end{aligned}$$
    (26)
  2. 2.

    The quasi-collinear slice, in which the increase of the jet invariant mass produced by soft-gluon radiation is comparable to the strange quark mass,

    $$\begin{aligned} m_{X_s}^2 \, - \, m_s^2 \, \approx \, m_s^2 \, \ll \, m_b^2, \end{aligned}$$
    (27)

    or, equivalently,

    $$\begin{aligned} \rho \, \approx \, y \, \ll \, 1. \end{aligned}$$
    (28)

    Formally, one can consider the (correlated) limit:

    $$\begin{aligned} y \, \rightarrow \, 0^+, \qquad \rho \, \rightarrow \, 0^+, \qquad \frac{\rho }{y} \, \rightarrow \, \textrm{const}, \end{aligned}$$
    (29)

    where \(\textrm{const} \ne 0, \infty \). This is a double-logarithmic region, as the previous one or the massless case.

  3. 3.

    The soft region, in which the increase in the final jet mass due to gluon radiation is much smaller than the strange mass,

    $$\begin{aligned} m_{X_s}^2 \, - \, m_s^2 \, \ll \, m_s^2. \end{aligned}$$
    (30)

    In terms of the adimensional variables we have introduced, the above condition is written:

    $$\begin{aligned} y \, \ll \, \rho . \end{aligned}$$
    (31)

    Since we always assume \(y \ll 1\), the above relation basically implies one of the two following possibilities:

    $$\begin{aligned} \rho \, \approx \, 1 \quad \textrm{or} \quad \rho \, \gg \, 1. \end{aligned}$$
    (32)

    Since the final strange quark is not relativistic in any of the two above cases, there is not any collinear enhancement in this region. At any order of perturbation theory, one finds at most, in the invariant mass distribution, one large infrared logarithm of soft origin for each power of \(\alpha _S\). In other words, this region is a single-logarithmic, rather than a double-logarithmic, one.Footnote 1 If \(\rho \gg 1\), the final quark is very slow (in the initial quark rest frame) and soft-gluon radiation is suppressed by color coherence. Soft gluons indeed “see” a static color charge which, at the fragmentation time, begins to move with a very small velocity, without any color-spin flip. In the limit of vanishing final velocity, soft gluons just see a static color charge at any time.

The paper is organized as follows. In Sect. 2 we discuss the main phenomenological applications of our work. Nature provides to us heavy quark decays with quite different mass ratios, so we conclude that our work is not academic.

In Sect. 3 we consider threshold resummation in the massless limit of the final quark, \(m_2=0\). As already anticipated, this case is considerably simpler than the massive case \(m_2 \ne 0\), which is our primary concern. This is a preliminary section written in order to present the main ideas in a simple case and to fix the notation. This section also has a pedagogical character and can be omitted by an expert on threshold resummation.

In Sect. 4 we describe the exact calculation to first-order in \(\alpha _S\), of the photon energy spectrum in the rare \(B \rightarrow X_s \gamma \) decay, which is assumed as a model process. As already remarked, the above process is selected because of its simplicity, but we believe that the main consequences which we derive can be generalized to more complicated processes in the class (7), and perhaps even more.

In Sect. 5 we consider threshold resummation, in the usual rare decay, in the soft limit. As already remarked, that means, that the (massive) final quark is produced, in the fragmentation of the initial heavy quark (at rest), with a non-relativistic velocity. We may say that the latter is a complementary situation with respect to the massless limit of the final quark.

In Sect. 6 we construct a general factorization scheme in the massive case. We introduce, as usual, a universal, i.e. process independent, long-distance dominated QCD Form factor, resumming the infrared logarithms to all orders in \(\alpha _S\), together with a Coefficient and a Remainder functions. Unlike the form factor, the latter are process-dependent, short-distance quantities, having an ordinary (i.e. truncated) perturbative expansion.

Section 7 is the central one, the core of the paper. In this section we consider the problems of the general massive factorization scheme, constructed in the previous section, concerning the massless limit of the final quark, \(m_2 \rightarrow 0\), which turns out not to be correctly reproduced. An improved factorization scheme is then constructed which reproduces, in the massless limit, the factorization of the massless process discussed in Sect. 3. The main point is that, as we are going to show, it is necessary to introduce a dependence, inside the Coefficient function, on the final hadron invariant mass, i.e. on the variable y.

Finally, Sect. 8 contains the conclusions of our analysis, together with a discussion about future developments. In general, a lot of work along the lines of this paper remains to be made. The main developments which we can foresee, involve the application of the improved factorization scheme to other processes than \(B\rightarrow X_s\gamma \) decays, as well as the generalization of the scheme to higher orders.

2 Phenomenological relevance

As far as soft-gluon effects are concerned, the hard scale Q of the process (7), in the rest frame of the initial heavy quark \(Q_1\) (\(\textbf{p}_1=0\)), is given by [5,6,7,8,9,10]:

$$\begin{aligned} Q \, = \, E_X \, + \, \left| \textbf{p}_X \right| , \end{aligned}$$
(33)

where, as already defined, X denotes the final hadronic state into which the quark \(Q_2\) evolves (basically, a hadronic jet). If we denote by \(q^\mu \) the total 4-momentum of the non QCD partons, the hard scale can be written:

$$\begin{aligned} Q= & {} m_1 \, - \, \sqrt{q^2 \, + \, \left| \textbf{q} \, \right| ^2} \, + \, \left| \textbf{q} \, \right| \,\nonumber \\= & {} m_1 \, - \, q_0 \, + \, \sqrt{q_0^2 \, - \, q^2}, \end{aligned}$$
(34)

where

$$\begin{aligned} q^2 \, \equiv \, q^\mu q_\mu \, = \, q_0^2 \, - \, \left| \textbf{q} \, \right| ^2 \end{aligned}$$
(35)

is the invariant mass squared of the non QCD partons. Note that, for large values of \(q^2\), the non colored particles can take away a substantial fraction of the available energy from the QCD subprocess, reducing to a large extent the hard scale Q from the “natural” or upper value \(m_1\):

(36)

In the real world, one may cite the following cases of heavy-to-heavy decays (7):

  1. 1.

    The rare (one-loop mediated) beauty quark decays

    $$\begin{aligned} b \, \rightarrow \, X_s \, + \, \gamma , \end{aligned}$$
    (37)

    which we have already considered in the Introduction. In the real world, if we take a constituent (i.e. large) strange quark mass \(m_s \approx 500 \, \textrm{MeV}\) (let’s say, one half of the \(\Phi \) mass), the quark mass ratio

    $$\begin{aligned} \frac{m_s}{m_b} \, \approx \, \frac{1}{10}. \end{aligned}$$
    (38)

    In this process, since the photon is real, the 4-momentum \(q^\mu \) of the probe is light-like, \(q^2=0\), so the hard scale Q, according to Eq. (34), exactly coincides with the beauty mass:

    $$\begin{aligned} Q \, = \, m_b; \end{aligned}$$
    (39)
  2. 2.

    The CKM-favored semileptonic b decays

    $$\begin{aligned} b \, \rightarrow \, X_c \, + \, l \, + \, \nu , \end{aligned}$$
    (40)

    where \(X_c\) is the final hadronic state containing the charm quark, coming from beauty fragmentation, with the (rather large) quark mass ratio

    $$\begin{aligned} \frac{m_c}{m_b} \, \approx \, \frac{1}{3}. \end{aligned}$$
    (41)

    As already noted, the lepton pair can take away a considerable energy from the QCD subprocess. Unlike previous case 1, according to Eq. (34), the true hard scale

    $$\begin{aligned} Q \, = \, E_{X_c} \, + \, \left| \textbf{p}_{X_c} \right| , \end{aligned}$$
    (42)

    is substantially smaller than the beauty mass \(m_b\) for a large dilepton invariant mass, ;

  3. 3.

    As a final example of (7), let us mention the CKM-favored top quark decays

    $$\begin{aligned} t \, \rightarrow \, b \, + \, W, \end{aligned}$$
    (43)

    where the heavy quark mass ratio is very small:

    $$\begin{aligned} \frac{m_b}{m_t} \, \approx \, \frac{1}{35}. \end{aligned}$$
    (44)

We may conclude that phenomenology offers a rather wide class of processes (7), with quite spread values of the quark mass ratio.

3 Massless case

For simplicity’s sake, let us begin our analysis considering the rare decay (37) in the massless limit of the final strange quark,

$$\begin{aligned} m_s \, = \, 0. \end{aligned}$$
(45)

In this heavy-to-light decay, both fixed-order and resummed calculations greatly simplify. Again for simplicity’s sake, let us approximate the effective weak nonleptonic Hamiltonian governing the decay (37) by keeping only the local operator

$$\begin{aligned} O_7(x)\equiv & {} \frac{e}{16 \, \pi ^2} \, {\bar{s}}_i(x) \, \sigma ^{\mu \nu } \left( m_b \, R \, + \, m_s \, L \right) b^i(x) \, F_{\mu \nu }(x) \nonumber \\= & {} \frac{e \, m_b}{16 \, \pi ^2} \, {\bar{s}}_i(x) \, \sigma ^{\mu \nu } R \,b^i(x) \, F_{\mu \nu }(x)\quad (m_s = 0), \end{aligned}$$
(46)

where e is the proton charge and we have defined the standard Right (R) and Left (L) projectors:

$$\begin{aligned} R \, \equiv \, \frac{1 \, + \, \gamma _5}{2}; \qquad L \, \equiv \, \frac{1 \, - \, \gamma _5}{2}. \end{aligned}$$
(47)

The generalization to all the operators in the effective Hamiltonian will be discussed in Sect. 7.3.

3.1 Total decay rate

The tree-level width of the rare decay (37) reads:

$$\begin{aligned} \Gamma ^{(0)}_0 \, = \, \frac{G_F^2 m_b^5}{32 \, \pi ^3} \, C_7^2 \, \left| \lambda _t \right| ^2 \, \frac{\alpha _{em}}{\pi }, \end{aligned}$$
(48)

where \(G_F\) is the Fermi constant, \(m_b\) is the on-shell beauty mass, \(\alpha _{em} \approx 1/137\) is the fine-structure constant and the constant \(C_7\) is the Wilson (short-distance) Coefficient function of the operator \(O_7\), resumming large logarithms of \(m_w/m_b\) to all orders, as well as collecting finite corrections. Finally \(\lambda _t\) is a product of CKM matrix elements:

$$\begin{aligned} \lambda _t \, \equiv \, V_{tb} V_{ts}^*, \end{aligned}$$
(49)

Through the paper, we use the following conventions: the lower index zero, on the l.h.s. of Eq. (48) for example, refers to the massless limit. In general, we will denote quantities calculated in the \(m_s=0\) limit with a zero subscript. The upper index, between round brackets, denotes instead the order in perturbation theory.

3.2 Photon spectrum or invariant hadron squared mass distribution

We consider the invariant hadron squared-mass spectrum, paying particular attention to the low-mass or threshold region

$$\begin{aligned} m_{X_s}^2 \, \ll \, m_b^2. \end{aligned}$$
(50)

As already noted, the hard scale Q is given by the beauty mass \(m_b\), as:

$$\begin{aligned} Q \, = \, E_{X_s} \, + \, |\textbf{p}_{X_s}| \, = \, m_b \, - \, E_\gamma \, + \, |\textbf{p}_\gamma | \, = \, m_b \qquad (c=1).\nonumber \\ \end{aligned}$$
(51)

Since, at lowest order in the QCD coupling \(\alpha _S\), there is no gluon radiation, the final hadronic state only contains the strange quark,

$$\begin{aligned} \alpha _S \, = \, 0: \quad X_s \, = \, s, \end{aligned}$$
(52)

so that:

$$\begin{aligned} m_{X_s}^ 2 \, = \, m_s^2 \, = \, 0. \end{aligned}$$
(53)

By defining the unitary variable

$$\begin{aligned} y \, \equiv \, \frac{m_{X_s}^2 \, - \, m_s^2}{m_b^2 \, - \, m_s^2} \, = \, \frac{m_{X_s}^2}{m_b^2} \qquad (m_s = 0), \end{aligned}$$
(54)

it follows that the tree-level (i.e. lowest-order) spectrum is a spike at vanishing y:

$$\begin{aligned} \frac{d\Gamma _0^{(0)}}{dy} \, = \, \Gamma ^{(0)}_0 \, \delta (y). \end{aligned}$$
(55)

In general, the differential spectrum in y has a perturbative expansion in powers of \(\alpha _S\) of the form:

$$\begin{aligned} \frac{d\Gamma _0}{dy}\left( y; \alpha _S \right)= & {} \Gamma _0^{(0)} \, \delta (y) \,\nonumber \\{} & {} + \, \sum _{n=1}^\infty C_F \left( \frac{\alpha _S}{\pi } \right) ^n \frac{d\Gamma _0^{(n)}}{dy}(y) \quad (0 \le y \le 1),\nonumber \\ \end{aligned}$$
(56)

where \(C_F = (N^2-1)/(2N)=4/3\) for \(N=3\) colors in QCD. In the beauty rest frame (\(p_b^\mu = (m_b;0,0,0)\)), by elementary kinematics:

$$\begin{aligned} E_\gamma \, = \, \frac{m_b^2 \, - \, m_{X_s}^2}{2 m_b} \, = \, \frac{m_b}{2} \left( 1 \, - \, \frac{m_{X_s}^2}{m_b^2} \right) . \end{aligned}$$
(57)

By defining the unitary variable

$$\begin{aligned} x \, \equiv \, \frac{E_\gamma }{\, E_\gamma ^{\max }} \, = \, 1 \, - \, \frac{m_{X_s}^2}{m_b^2} \qquad (0\le x \le 1), \end{aligned}$$
(58)

we find the relation

$$\begin{aligned} x \, = \, 1 \, - \, y \qquad (m_s=0). \end{aligned}$$
(59)

Therefore, to evaluate the hadron squared mass distribution is equivalent to compute the photon energy spectrum. In the high-mass region, the emitted photon is soft, while in the low-mass region (50), the photon is hard, i.e. its energy \(E_\gamma \) is close to its upper endpoint \(m_b/2\), so that:

$$\begin{aligned} E_\gamma ^{\min } \, = \, 0; \qquad E_\gamma ^{\max } \, = \, \frac{m_b}{2} \qquad (m_s=0). \end{aligned}$$
(60)

For technical reasons (to avoid distributions), it is simpler to consider the normalized, partially-integrated spectrum, the so-called event fraction:

$$\begin{aligned} E_0(y;\alpha _S) \, \equiv \, \frac{1}{\Gamma _0} \int \limits _0^y \frac{d\Gamma _0}{dy'}(y';\alpha _S) \, dy' \qquad (0 \le y \le 1).\nonumber \\ \end{aligned}$$
(61)

The differential spectrum is simply obtained by differentiation of the event fraction:

$$\begin{aligned} \frac{1}{\Gamma _0} \frac{d\Gamma _0}{dy} \, = \, \frac{dE_0}{dy}. \end{aligned}$$
(62)

It follows directly from the definition of event fraction given in Eq. (61) that:

$$\begin{aligned} \lim _{y \rightarrow 0^+} E_0(y;\alpha _S) \, = \, 0; \qquad \lim _{y \rightarrow 1^-} E_0(y;\alpha _S) \, = \, 1. \end{aligned}$$
(63)

By integrating both sides of Eq. (55) with respect to y, it is immediately found that the tree-level event fraction is identically equal to one for any \(y>0\):

$$\begin{aligned} E_0^{(0)}(y) \, \equiv \, \frac{1}{\Gamma _0^{(0)}} \int \limits _0^y \frac{d \Gamma _0^{(0)}}{dy'} \, dy' \, = \, \theta (y) \, \equiv \, 1 \qquad (y>0);\nonumber \\ \end{aligned}$$
(64)

where \(\theta (y) \equiv 1\) for \(y>0\) and zero otherwise is the standard Heaviside unit-step function.

The event fraction at first order in \(\alpha _S\) only depends on diagrams involving single real gluon emission (bremmstrahlung), as:

$$\begin{aligned}{} & {} E_0(y;a) \, = \, \frac{\Gamma _0^{(0)} \, + \, a \, \int \limits _0^y dy' \, d\Gamma _0^{(1)}/dy' \, + \, {\mathcal {O}}\left( a^2 \right) }{ \Gamma _0^{(0)} \, + \, a \, \int \limits _0^1 dy \, d\Gamma _0^{(1)}/dy \, + \, {\mathcal {O}}\left( a^2 \right) } \,\nonumber \\{} & {} = \, 1 \, - \, \frac{a}{\Gamma _0^{(0)}} \, \int \limits _y^1 \frac{d\Gamma _0^{(1)}}{dy'} \, dy' \, + \, {\mathcal {O}}\left( a^2 \right) \qquad (y>0); \end{aligned}$$
(65)

where we have defined the effective, first-order coupling of the (heavy) quarks to gluons

$$\begin{aligned} a \, \equiv \, \frac{C_F \alpha _S }{\pi }. \end{aligned}$$
(66)

We may say that the event fraction and the total rate give complementary information about the decay process. Note that the second equation in (63) is trivially satisfied by Eq. (65). To verify the first equation is instead less trivial. To accomplish this task, one has to resum soft-gluon effects to all orders in \(\alpha _S\), as we are going to show.

An exact first-order calculation in \(\alpha _S\) or, equivalently, in a, of the event fraction gives [11, 12]:

$$\begin{aligned} E_0\left( y; a \right) \,= & {} \, 1 - \frac{a}{2} \, \log ^2(y) \, - \, \frac{7}{4} \, a \, \log (y) \, - \, a \, \frac{31}{12} \, \nonumber \\{} & {} - a \bigg [ \, y \left( 1 \, - \, \frac{y}{4} \right) \log (y) \, + \, \frac{y}{12} \left( 2 \, y^2 \, - \, 3 \, y \, - \, 30 \right) \bigg ] \,\nonumber \\{} & {} + \, {\mathcal {O}}\left( a^2 \right) . \end{aligned}$$
(67)

The spectrum above contains three different kind of terms, as far as the Born-kinematics limit \(y \rightarrow 0^+\) is concerned:

  1. 1.

    Double and single logarithmic terms of y, namely the terms

    $$\begin{aligned} - \, \frac{a}{2} \, \log ^2(y), \qquad \, - \, \frac{7}{4} \, a \, \log (y), \end{aligned}$$
    (68)

    which formally diverge in the limit \(y \rightarrow 0^+\), and which are therefore very large in the lower end-point region \(y \ll 1\) – namely the threshold region. These are clearly the dominant terms in the small-y region;

  2. 2.

    Constant terms with respect to y, namely the term

    $$\begin{aligned} \, - \, a \, \frac{31}{12}. \end{aligned}$$
    (69)

    In units of the ubiquitous factor a, the constant

    $$\begin{aligned} C^{(1)}_0 \, = \, - \, \frac{31}{12} \, \cong \, - \, 2.6 \end{aligned}$$
    (70)

    is of order one, as expected. For \(\alpha _S(m_b)=0.21\), the first-order correction turns out to be

    $$\begin{aligned} a \, C^{(1)}_0 \, \simeq \, - \, 0.23; \end{aligned}$$
    (71)
  3. 3.

    Infinitesimal terms in y, i.e. terms which vanish in the limit \(y \rightarrow 0^+\), namely

    $$\begin{aligned} a \, H^{(1)}_0(y), \end{aligned}$$
    (72)

    where:

    $$\begin{aligned} H^{(1)}_0(y)\equiv & {} - \, y \left( 1 \, - \, \frac{y}{4} \right) \log (y) \,\nonumber \\{} & {} - \, \frac{y}{12} \left( 2 \, y^2 \, - \, 3 \, y \, - \, 30 \right) . \end{aligned}$$
    (73)

    These latter terms are the least important ones in the small-y region, but give a substantial contribution in the bulk of the spectrum, i.e. for \(y = {\mathcal {O}}(1)\). These terms can be neglected, to a first approximation, in the small-y region, but cannot be neglected anymore for generic y values, where they are not smaller than the logarithmic or the constant terms.

As far as the small-y behavior is concerned, in the general process (7), the event fraction is naturally written to first order in the form

$$\begin{aligned} E_0(y; a)= & {} 1 \, - \, \frac{a}{2} \, A^{(1)} \log ^2(y) \, + \, a \, S^{(1)} \log (y) \,\nonumber \\{} & {} + \, a \, C^{(1)}_0 \, + \, a \, \textrm{Rem}^{(1)}_0(y) \, + \, {\mathcal {O}}\left( a^2 \right) ; \end{aligned}$$
(74)

where we have introduced the coefficients:

$$\begin{aligned} A^{(1)} \, = \, 1; \qquad S^{(1)} \, = \, - \, \frac{7}{4}. \end{aligned}$$
(75)

The value of the constant \(C^{(1)}_0\) has already been given in Eq. (70) and the first-order contribution to the Remainder function

$$\begin{aligned} \textrm{Rem}^{(1)}_0(y) \, \equiv \, H^{(1)}_0(y) \, \rightarrow \, 0 \,\,\, \textrm{for} \,\, y \, \rightarrow \, 0^+. \end{aligned}$$
(76)

The complete Remainder function at first order,

$$\begin{aligned} \textrm{Rem}_0(y;a) \, = \, a \, \textrm{Rem}^{(1)}_0(y), \end{aligned}$$
(77)

is plotted in Fig. 1. Note that, since \(E_0(y=1,a) \equiv 1\), it follows that:

$$\begin{aligned} \textrm{Rem}^{(1)}_0(y=1) \, = \, - \, C^{(1)}_0 \, = \, \frac{31}{12}. \end{aligned}$$
(78)

Actually, the Remainder function is a strictly-monotonically-increasing function of y, and therefore positive in all its range, \(0 \le y \le 1\).

Fig. 1
figure 1

Plot of massless Remainder function \(\textrm{Rem}_0(y;a)\) in first approximation, Eq. (77), as a function of y in all its kinematic range, \(0 \le y \le 1\). We have defined \(a \equiv C_F \alpha _S(m_b) / \pi \cong 0.089\) for \(\alpha _S(m_b)=0.21\)

The constants \(A^{(1)}\) and \(S^{(1)}\) are process-independent, i.e. they are the same for all the heavy-to-light decays in the class (7). That is a consequence of the general properties of QCD radiation in the infrared (i.e. soft and/or collinear) limit. On the contrary, the Coefficient \(C^{(1)}_0\) and the Remainder function \(\textrm{Rem}^{(1)}_0(y)\) are short-distance dominated and are therefore process dependent.

In the simple case of the radiative decay (37) – a two-body decay at tree level – \(C^{(1)}_0\) is truly a constant and \(\textrm{Rem}^{(1)}_0(y)\) only depends on y. In more complicated heavy-to-light decays, such as for example the semileptonic \(b \rightarrow u l \nu \) decays – 3-body decays at tree-level – \(C^{(1)}\) still does not depend on y, but it does depend on other kinematical variables. Similarly, the Remainder function \(\textrm{Rem}^{(1)}(y)\) also depends on additional kinematical variables.

3.3 Factorization

The basic idea of factorization is simply to separate from each other perturbative terms having different physical origin. In particular, we factorize the large infrared logarithms into a universal, i.e. process independent, form factor. To order a (i.e. to order \(\alpha _S\)), one can write indeed:

$$\begin{aligned} E_0(y;a) \, = \, C_0(a) \, \Sigma _0(y; a) \, + \, \textrm{Rem}_0(y; a) \, + \, {\mathcal {O}}\left( a^2 \right) ;\nonumber \\ \end{aligned}$$
(79)

where we have defined the long-distance dominated QCD form factor

$$\begin{aligned} \Sigma _0(y;a) \, \equiv \, 1 \, - \, \frac{a}{2} \, A^{(1)} \log ^2(y) \, + \, a \, S^{(1)} \log (y); \end{aligned}$$
(80)

the short-distance Coefficient function:

$$\begin{aligned} C_0(a) \, \equiv \, 1 \, + \, a \, C^{(1)}_0 \end{aligned}$$
(81)

and the short-distance Remainder function

$$\begin{aligned} \textrm{Rem}_0(y;a) \, \equiv \, a \, H^{(1)}_0(y). \end{aligned}$$
(82)

Note that the perturbative expansion of the Remainder function begins at order a, i.e. it vanishes in the free limit \(a \rightarrow 0\), while the Form factor and the Coefficient function equal unity in the same limit.

By expanding the product on the r.h.s. of Eq. (79) in powers of a, one finds:

$$\begin{aligned} E_0(y;a)= & {} 1 \, + \, a \, \Sigma _0^{(1)}(y) \, + \, a \, C^{(1)}_0 \,\nonumber \\{} & {} + \, a \, \textrm{Rem}^{(1)}_0(y) \, + \, {\mathcal {O}}\left( a^2\right) . \end{aligned}$$
(83)

On the r.h.s. of the above equation, one finds exactly the same first-order terms which are on the r.h.s. of the fixed-order expansion, Eq. (74). Therefore factorization, which we have explicitly constructed at first order in a, involves a shift of terms of second (and in general also higher) order.

Let us note that we have constructed a minimal scheme for the QCD form factor \(\Sigma _0(y;a)\), inside which only logarithmic terms of y are included. In other words, constants and infinitesimal terms for \(y \rightarrow 0^+\) are not included in our \(\Sigma _0\). Let us also remark that the factorization scheme given by Eq. (79) can be consistently pushed to higher orders in \(\alpha _S\).

3.4 Threshold resummation in the heavy-to-light case

The QCD form factor, resumming to all orders in \(\alpha _S\) the infrared logarithmically-enhanced terms, has the standard expression in moment space or N-space [5, 13,14,15]:

$$\begin{aligned} \sigma _{0,N}\left( \alpha _S\right)= & {} \exp \int \limits _0^1 \frac{dy}{y} \Big [ (1-y)^{N-1} \, - \, 1 \Big ]\nonumber \\{} & {} \Bigg \{ \,\, \int \limits _{Q^2 y^2}^{Q^2 y} \frac{dk_\perp ^2}{k_\perp ^2} \, A\left[ \alpha _S\left( k_\perp ^2 \right) \right] \, \nonumber \\{} & {} \, + \, B\left[ \alpha _S\left( Q^2 y \right) \right] \,+ \, D\left[ \alpha _S \left( Q^2y^2 \right) \right] \Bigg \}. \end{aligned}$$
(84)

The function \(A(\alpha _S)\), having an ordinary perturbative expansion in powers of \(\alpha _S\),

$$\begin{aligned} A(\alpha _S) \, = \, \sum _{n=1}^\infty C_F \left( \frac{\alpha _S}{\pi } \right) ^n \, A^{(n)}, \end{aligned}$$
(85)

describes soft-gluon emission at small angle with respect to the parent strange quark [16] (i.e. both soft and collinear enhanced). The first-order coefficient \(A^{(1)}\) – the only one we are directly interested to – has been given in the first of Eq. (75), in units of \(a \equiv C_F \alpha _S/\pi \), according to our current conventions.

The function \(B(\alpha _S)\), also having an ordinary perturbative expansion in powers of \(\alpha _S\),

$$\begin{aligned} B(\alpha _S) \, = \, \sum _{n=1}^\infty C_F \left( \frac{\alpha _S}{\pi } \right) ^n \, B^{(n)} \, = \, a \, B^{(1)} \, + \, {\mathcal {O}}\left( a^2\right) , \end{aligned}$$
(86)

describes hard-gluon emission at small angle (collinear enhanced but not soft enhanced radiation). The first-order coefficient explicitly reads:

$$\begin{aligned} B^{(1)} \, = \, - \, \frac{3}{4}. \end{aligned}$$
(87)

Finally, the function \(D(\alpha _S)\), also having an ordinary perturbative expansion in powers of \(\alpha _S\),

$$\begin{aligned} D(\alpha _S) \, = \, \sum _{n=1}^\infty C_F \left( \frac{\alpha _S}{\pi } \right) ^n \, D^{(n)} \, = \, a \, D^{(1)} \, + \, {\mathcal {O}}\left( a^2\right) , \end{aligned}$$
(88)

describes soft-gluon emission at large angle (soft enhanced but not collinear enhanced radiation). In heavy-to-light transitions, the first-order coefficient takes the value:

$$\begin{aligned} D^{(1)} \, = \, - \, 1. \end{aligned}$$
(89)

It is natural to resum infrared logarithms in N-space, where, unlike physical space, factorization of kinematic constraints for multiple soft-gluon emission holds true [17]. In N-space, the logarithm-enhanced terms have the form

$$\begin{aligned} \alpha _S^k \, \log ^l(N), \qquad k \, = \, 1, \, 2, \, 3, \, \ldots , \quad 1 \, \le \, l \, \le \, 2 \, k. \end{aligned}$$
(90)

As well known, in order to obtain the form factor in physical space (y-space), one has to make first analytic continuation of \(\sigma _N\) in the N variable, from integer to complex values:

$$\begin{aligned} N \in {{\mathbb {N}}}\quad \mapsto \quad {\mathcal {N}} \in {{\mathbb {C}}}. \end{aligned}$$
(91)

The form factor in physical space is then obtained by means of an inverse Mellin transform:

$$\begin{aligned} \sigma _0(y;\alpha _S) \, = \, \int \limits _{c-i\infty }^{c+i\infty } \frac{d{\mathcal {N}}}{2\pi i} \, (1-y)^{-{\mathcal {N}}} \sigma _{0,{\mathcal {N}}}(\alpha _S), \end{aligned}$$
(92)

where the (real) constant c is chosen in such a way that all the singularities of \(\sigma _{\mathcal {N}}\) lie to the left of the integration contour (a vertical line in the complex \({\mathcal {N}}\)-plane).

The partially-integrated form factor \(\Sigma _0(y)\) is finally obtained by integrating over y:

$$\begin{aligned} \Sigma _0(y;\alpha _S) \, = \, \int \limits _0^y \sigma _0(y';\alpha _S) \, dy'. \end{aligned}$$
(93)

3.4.1 Form-factor expansion

In order to determine the first-order Coefficient and Remainder functions, one has to subtract, from the event fraction, the expansion of the form factor up to first order in \(\alpha _S\).

The first step involves expanding the exponential on the r.h.s. of Eq. (84) to first order,

$$\begin{aligned} \sigma _{0,N} \, = \, \exp (X) \, = \, 1 \, + \, X \, + \, {\mathcal {O}}\left( X^2\right) , \end{aligned}$$
(94)

followed by a truncation of the resummation functions \(A(\alpha _S),B(\alpha _S)\) and \(D(\alpha _S)\) to first order in \(\alpha _S\):

$$\begin{aligned} \sigma _{0,N}(\alpha _S)= & {} 1 \, + \, \int \limits _0^1 \frac{dy}{y} \big [ (1-y)^{N-1} \, - \, 1 \big ] \left\{ \,\, \int \limits _{Q^2 y^2}^{Q^2 y} \frac{dk_\perp ^2}{k_\perp ^2} \, A\left[ \alpha _S\left( k_\perp ^2 \right) \right] \, \right. \nonumber \\{} & {} \left. + \, B\left[ \alpha _S\left( Q^2 y \right) \right] + \, D\left[ \alpha _S \left( Q^2 y^2 \right) \right] \right\} \, + \, {\mathcal {O}}\left( \alpha _S^2 \right) \, \nonumber \\= & {} 1 \, + \, a \, \int \limits _0^1 dy \big [ (1-y)^{N-1} \, - \, 1 \big ] \left\{ \,\, \frac{A^{(1)}}{y} \int \limits _{Q^2 y^2}^{Q^2 y} \frac{dk_\perp ^2}{k_\perp ^2} \, \right. \nonumber \\{} & {} \left. + \, \frac{ B^{(1)} \, + \, D^{(1)} }{y} \right\} \, + \, {\mathcal {O}}\left( a^2 \right) ; \end{aligned}$$
(95)

where:

$$\begin{aligned} \alpha _S \, \equiv \, \alpha _S\left( Q^2\right) ; \qquad a \, \equiv \, \frac{C_F \, \alpha _S\left( Q^2\right) }{\pi }. \end{aligned}$$
(96)

Since \(X={\mathcal {O}}\left( \alpha _S\right) \), the higher-order terms \(X^2,X^3,\ldots \) in the expansion of \(\exp (X)\) are of second or higher order in \(\alpha _S\). The evaluation of the inverse Mellin transform simply gives back, at first order, the curly bracket in the last member of Eq. (95), so that:

$$\begin{aligned} \sigma _0(y;\alpha _S) \,= & {} \, \delta (y) \, - \, a \, A^{(1)} \left[ \frac{\log (y)}{y} \right] _+ \,\nonumber \\{} & {} + \, a \, \left( B^{(1)} \, + \, D^{(1)} \right) \left[ \frac{1}{y} \right] _+ \, + \, {\mathcal {O}}(a^2); \end{aligned}$$
(97)

where the plus regularization of a generic function f(y) is defined as the following (weak) limit:

$$\begin{aligned} \left[ f(y) \right] _+ \, \equiv \, \lim _{\epsilon \rightarrow 0^+} \left[ \theta (y - \epsilon ) \, f(y) \, - \, \delta (y-\epsilon ) \int \limits _\epsilon ^1 f(y') \, dy' \right] .\nonumber \\ \end{aligned}$$
(98)

The plus regularization comes from virtual diagrams, related to the term \(-1\) in the function \((1-y)^{N-1}-1\). Finally, the partially-integrated form factor \(\Sigma _0(y,\alpha _S)\), entering the factorization formula in Eq. (79), is obtained by integrating over y the differential form factor \(\sigma _0(y,\alpha _S)\), according to Eq. (93):

$$\begin{aligned} \Sigma _0(y,a) \,\equiv & {} \, \int \limits _0^y \sigma _0(y',a) \, dy' \, = \, 1 \, - \, \frac{ A^{(1)} }{2} \, a \, \log ^2(y) \,\nonumber \\{} & {} + \, a \, \left( B^{(1)} \, + \, D^{(1)} \right) \log (y) \, + \, {\mathcal {O}}\left( a^2\right) . \end{aligned}$$
(99)

The form factor above is in complete agreement with that one in Eq. (80) if we make the identification

$$\begin{aligned} S^{(1)} \, = \, B^{(1)} \, + \, D^{(1)}. \end{aligned}$$
(100)

That is to say that \(S^{(1)}\) – the coefficient of the single infrared logarithm at \({\mathcal {O}}(\alpha _S)\) – is the sum of the first-order collinear \(B^{(1)}\) and soft \(D^{(1)}\) coefficients. The latter “separate” from each other at higher orders in \(\alpha _S\), from second order on, because of the different argument of the coupling, namely the collinear scale \(Q^2 y\) for the function \(B(\alpha _S)\) and the (typically much smaller) soft scale \(Q^2 y^2\) for the function \(D(\alpha _S)\), see Eq. (84).

4 First-order calculation in the massive case

In this section we consider an exact first-order calculation \(({\mathcal {O}}(\alpha _S))\) of the photon spectrum in the rare decay (37) in the massive case \(m_s \ne 0\).

4.1 Total rate

By taking into account strange quark mass effects, the tree-level width reads:

$$\begin{aligned} \Gamma ^{(0)}(r) \, = \, \Gamma _0^{(0)} \, (1-r)^3 (1+r) \, = \, \Gamma _0^{(0)} \, \frac{1+2\rho }{(1+\rho )^4}; \end{aligned}$$
(101)

where we have defined the final-quark mass correction parameters

$$\begin{aligned} r \, \equiv \, \frac{m_s^2}{m_b^2} \qquad (0 \le r \le 1) \end{aligned}$$
(102)

and

$$\begin{aligned} \rho \, \equiv \, \frac{m_s^2}{m_b^2 \, - \, m_s^2} \, = \, \frac{r}{1 \, - \, r} \qquad (0 \le \rho < \infty ). \end{aligned}$$
(103)

The inverse formula of the above one reads:

$$\begin{aligned} r \, = \, \frac{\rho }{1 \, + \, \rho }. \end{aligned}$$
(104)

The two parameters are basically the same for small values of the strange quark mass, \(m_s \ll m_b\):

$$\begin{aligned} \rho \, = \, r \, + \, {\mathcal {O}}\left( r^2\right) . \end{aligned}$$
(105)

The lowest-order width in the massless limit, \(\Gamma _0^{(0)} = \Gamma ^{(0)}(r=0)\), has been given in Eq. (48). The correction to the inclusive width at one loop in the massive case is given by:

$$\begin{aligned} \Gamma \, = \, \Gamma ^{(0)} \left[ 1 \, + \, a \, K(\rho ) \, + \, {\mathcal {O}}\left( \alpha _S^2 \right) \right] , \end{aligned}$$
(106)

where:

$$\begin{aligned} K(\rho )= & {} - \, \big (1 \, + \, 2 \, \rho \big ) \left[ \log (\rho ) \log (1 + \rho ) \, + \, 2 \, \text {Li}_2(-\rho ) \, + \, \frac{\pi ^2}{3} \right] \, \nonumber \\{} & {} + \, \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{7}{2} \right) \Big [ \log (1+\rho ) \, - \, \log (\rho ) \Big ] \, \nonumber \\{} & {} + \, 2 \, \log (1 + \rho ) \, - \, 2 \, \rho \, (1 \, + \, \rho ) \, + \, \frac{4}{3}. \end{aligned}$$
(107)

The function \(\textrm{Li}_2\) is the standard dilogarithm or Spence function:

$$\begin{aligned} \textrm{Li}_2(s) \, \equiv \, - \, \int \limits _0^s \frac{dt}{t} \log (1-t) \, = \, \sum _{n=1}^\infty \frac{s^n}{n^2} \qquad (|s| \, < \, 1).\nonumber \\ \end{aligned}$$
(108)

As well known, the inclusive width is an infrared safe quantity, so that its massless limit is finite:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} K(\rho ) \, = \, \frac{4 - \pi ^2}{3}. \end{aligned}$$
(109)

Note that the most singular terms in \(K(\rho )\) for \(\rho \rightarrow 0^+\) are of the form \(\rho \log (\rho )\).

4.2 Photon spectrum

While, in the massive case, the lowest photon energy is still zero, the maximal photon energy is reduced by a factor \((1-r)\) with respect to the massless case \(r=0\):

$$\begin{aligned} E_\gamma ^{\max } \, = \, \frac{m_b}{2}(1 \, - \, r). \end{aligned}$$
(110)

We consider the Event fraction (E) in the kinematic variable

$$\begin{aligned} y \, \equiv \, \frac{m_{X_s}^2 \, - \, m_s^2}{m_b^2 \, - \, m_s^2}. \end{aligned}$$
(111)

In terms of the normalized photon energy

$$\begin{aligned} x \, = \, \frac{E_\gamma }{E_\gamma ^{\max }} \, = \, \frac{m_b^2-m_{X_s}^2}{m_b^2-m_s^2}, \end{aligned}$$
(112)

we find the same relation of the massless case:

$$\begin{aligned} x \, = \, 1 \, - \, y \qquad (m_s \ne 0). \end{aligned}$$
(113)

The event fraction is naturally written:

$$\begin{aligned} E(y; \rho ; a)\equiv & {} \frac{1}{\Gamma } \int \limits _0^y \frac{d \Gamma }{dy'}\left( y'; \rho ; a\right) \, dy' \, \nonumber \\= & {} 1 \, + \, a \, E^{(1)}(y;\rho ) \, + \, {\mathcal {O}}\left( a^2 \right) . \end{aligned}$$
(114)

The first-order term, from the computation in [11], reads in our notation:

$$\begin{aligned} E^{(1)}(y;\rho )= & {} + \, \big ( 1 \, + \, 2 \, \rho \big ) \bigg [ - \, \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \log (y) \,\nonumber \\{} & {} + \, \frac{1}{2} \log ^2\left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \, \nonumber \\{} & {} \, + \, \log (1 \, + \, \rho ) \, \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \, + \, \textrm{Li}_2\left( \frac{\rho }{y \, + \, \rho } \right) \,\nonumber \\{} & {} - \, \textrm{Li}_2\left( \frac{\rho }{1 \, + \, \rho } \right) \bigg ] \, \nonumber \\{} & {} - \, 2 \, \log (y) \, + \, \frac{1}{4} \, \log \left( \frac{y \, +\, \rho }{1 \, + \, \rho } \right) \, \nonumber \\{} & {} + \left[ \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{1}{2} \right) \, - \, y \left( 1 \, - \, \frac{y}{4} \right) \right] \nonumber \\{} & {} \times \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \, + \, \frac{1 \, - \, y}{y \, + \, \rho } \, \Bigg \{ \rho \left[ 2 \, \rho ^2 \, + \, \rho \, (y \, + \, 2) \,\right. \nonumber \\{} & {} \left. - \, \frac{1}{6} \left( 2 \, y^2 \, - \, 7 \, y \, + \, 14 \right) \right] \, \nonumber \\{} & {} + \, \frac{y}{12} \left( 2 \, y^2 \, - \, y \,- \, 31 \right) \Bigg \}. \end{aligned}$$
(115)

One may notice the frequent occurrence on the r.h.s. of Eq. (115) of the “collinear” variable

$$\begin{aligned} z \, \equiv \, \frac{m_{X_s}^2}{m_b^2} \, = \, \frac{y \, + \, \rho }{1 \, + \, \rho } \qquad \left( \frac{\rho }{1 + \rho } \le z \le 1 \right) . \end{aligned}$$
(116)

Note that this dependent variable \(z=z(y;\rho )\) does not vanish on Born kinematics:

$$\begin{aligned} \lim _{y \rightarrow 0^+} z(y;\rho ) \, = \, \frac{\rho }{1 \, + \, \rho }, \end{aligned}$$
(117)

but it becomes small for small \(\rho \). Note also that \(z=z(y;\rho )\) exactly reduces to y in the massless limit:

$$\begin{aligned} z(y;\rho =0) \, = \, y. \end{aligned}$$
(118)

5 Factorization in the soft limit

In this section we consider the event fraction E(y) for the rare decay (37) in the threshold region \(y \ll 1\), with a mass correction parameter \(\rho \) of order one:

$$\begin{aligned} 0 \, < \, y \, \ll \, \rho \, = \, {\mathcal {O}}(1). \end{aligned}$$
(119)

Formally, that is equivalent to taking the limit

$$\begin{aligned} \frac{y}{\rho } \, \rightarrow \, 0^+. \end{aligned}$$
(120)

Since the final strange quark is not relativistic (in beauty rest frame), there are no collinearly-enhanced terms, so the threshold region is dominated by soft-gluon emission only \((E_g \ll m_b)\). As already remarked, the soft region is a single-logarithmic one, i.e. the perturbative expansion of the event fraction \(E=E(y;\rho ,\alpha _S)\) contains at most one logarithm of y for each power of the coupling \(\alpha _S\).

5.1 Soft QCD form factor

The QCD form factor resumming, to all orders in \(\alpha _S\), the soft logarithmically-enhanced terms in the perturbative series of the event fraction, i.e. in the soft limit, reads in moment space or N-space [18]:Footnote 2

$$\begin{aligned} \sigma _N^{(S)}\left( \rho ,\alpha _S\right)= & {} \exp \int \limits _0^1 \frac{dy}{y} \Big [ (1-y)^{N-1} \, - \, 1 \Big ] \nonumber \\{} & {} \times \left\{ \,\, \int \limits _{Q^2 y^2/(1 + \rho )}^{Q^2 y^2/\rho } \frac{dk_\perp ^2}{k_\perp ^2} \, A \left[ \rho ; \alpha _S\left( k_\perp ^2\right) \right] \, \right. \nonumber \\{} & {} \left. + \, D\bigg [ \alpha _S \Big ( \frac{Q^2y^2}{1+\rho } \Big ) \bigg ] \,+ \, \Delta \bigg [\alpha _S\Big ( \frac{Q^2y^2}{1 + \rho } \Big ) \bigg ] \right\} .\nonumber \\ \end{aligned}$$
(121)

The function \(A(\rho ;\alpha _S)\) is a “massive”, i.e. \(\rho \)-dependent, generalization of the usual massless function \(A(\alpha _S)\), reducing to the latter in the massless limit:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} A(\rho ;\alpha _S) \, = \, A(\alpha _S). \end{aligned}$$
(122)

The first-order term reads:

$$\begin{aligned} A^{(1)}(\rho ) \, = \, 1 \, + \, 2 \, \rho . \end{aligned}$$
(123)

The function

$$\begin{aligned} \Delta (\alpha _S) \, = \, \sum _{n=1}^\infty C_F \left( \frac{\alpha _S}{\pi } \right) ^n \Delta ^{(n)} \, = \, a \, \Delta ^{(1)} \, + \, {\mathcal {O}}\left( a^2\right) \end{aligned}$$
(124)

describes soft parton emission off the (massive) strange quark line. The first-order term takes the value:

$$\begin{aligned} \Delta ^{(1)} \, = \, - \, 1. \end{aligned}$$
(125)

5.1.1 Form factor expansion

By expanding the resummed form factor in Eq. (121) to first order in \(\alpha _S\), as described in Sect. 3.4, one obtains for the partially-integrated form factor in physical space:

$$\begin{aligned} \Sigma _S\left( y; \rho ; a \right)\equiv & {} \int \limits _0^y \sigma _S\left( y'; \rho ; a \right) \, dy' \, \nonumber \\= & {} 1 \, + \, a \, \Sigma _S^{(1)}(y;\rho ) \, + \,{\mathcal {O}}\left( a^2 \right) ; \end{aligned}$$
(126)

where:

$$\begin{aligned} \Sigma _S^{(1)}(y;\rho ) \, = \, - \, \left[ \left( 1 \, + \, 2 \, \rho \right) \, \log \left( \frac{\rho }{1 \, + \, \rho } \right) \, + \, 2 \right] \, \log (y).\nonumber \\ \end{aligned}$$
(127)

The following remarks are in order:

  1. 1.

    In the massless limit \(\rho \rightarrow 0^+\), the form factor contains the singular term

    $$\begin{aligned} - \, a \, \log (\rho ) \, \log (y), \end{aligned}$$
    (128)

    with \(\log (\rho )\) a mass singularity of collinear origin;

  2. 2.

    In the no-recoil limit \(\rho \rightarrow + \infty \) (equivalent to the limit \(r \rightarrow 1^-\)), the form factor exactly vanishes,

    $$\begin{aligned} \lim _{\rho \rightarrow + \infty } \Sigma _S^{(1)}(y;\rho ) \, = \, 0, \end{aligned}$$
    (129)

    as could be expected on physical ground.

5.2 Soft Coefficient function

Let’s now follow, in the present massive case, the standard factorization procedure described in Sect. 3 for the simpler massless case. According to Eq. (83), at first order in a (equivalently, in \(\alpha _S\)), the sum of the first-order Coefficient function \(C_S^{(1)}(\rho )\) and Remainder function \(\textrm{Rem}_S^{(1)}(y;\rho )\), is obtained by subtracting, from the first-order rate \(E^{(1)}\), the first-order form factor \(\Sigma _S^{(1)}\):

$$\begin{aligned}{} & {} C_S^{(1)}(\rho ) \, + \, \textrm{Rem}_S^{(1)}(y;\, \rho ) \, = \, E^{(1)}(y; \, \rho ) \, - \, \Sigma _S^{(1)}(y; \, \rho ) \, \nonumber \\{} & {} \quad = \big ( 1 \, + \, 2 \, \rho \big ) \bigg [ \, \frac{1}{2} \, \log ^2\left( y \, + \, \rho \right) \, - \, \, \frac{1}{2} \, \log ^2\left( 1 \, + \, \rho \right) \,\nonumber \\{} & {} \qquad - \, \log \left( 1 +\frac{y}{\rho } \right) \log (y) \, + \textrm{Li}_2\left( \frac{\rho }{y \, + \, \rho } \right) \nonumber \\{} & {} \qquad \, - \, \textrm{Li}_2\left( \frac{\rho }{1 \, + \, \rho } \right) \bigg ] \, + \frac{1}{4} \, \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \nonumber \\{} & {} \qquad + \left[ \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{1}{2} \right) \, - \, y \left( 1 \, - \, \frac{y}{4} \right) \right] \nonumber \\{} & {} \qquad \times \log \left( \frac{y+\rho }{1+\rho } \right) \, +\, \frac{1 \, - \, y}{y \, + \, \rho } \nonumber \\{} & {} \qquad \left\{ \rho \left[ 2 \, \rho ^2 \, + \, \rho \, (y \, + \, 2) \, - \, \frac{1}{6} \left( 2 \, y^2 \, - \, 7 \, y \, + \, 14 \right) \right] \,\right. \nonumber \\{} & {} \qquad \left. + \, \frac{y}{12} \left( 2 \, y^2 \, - \, y \, - \, 31 \right) \right\} . \end{aligned}$$
(130)

Note that the last member of the above expression, unlike the physical spectrum, does not diverge for \(y \rightarrow 0^+\), because of the subtraction of all the soft logs contained in the soft form factor; the most singular terms for \(y \rightarrow 0^+\) are of the form \(y \log (y).\)

The soft Coefficient function has the usual perturbative expansion beginning with one:

$$\begin{aligned} C_S(\rho ;a) \, = \, 1 \, + \, a \, C_S^{(1)}(\rho ) \, + \, {\mathcal {O}}\left( a^2 \right) . \end{aligned}$$
(131)

The first-order soft Coefficient function is obtained by taking the limit \(y\rightarrow 0^+\) of all members of Eq. (130) and taking into account that the Soft Remainder function vanishes in this limit:

$$\begin{aligned} C_S^{(1)}(\rho )= & {} \lim _{y \rightarrow 0^+} \left[ E^{(1)}(y;\rho ) \, - \, \Sigma _S^{(1)}(y;\rho ) \, \right] \, \nonumber \\= & {} \big ( 1 \, + \, 2 \rho \big ) \left\{ \frac{1}{2} \, \Big [\log ^2(\rho ) \, - \, \log ^2(1 \, + \, \rho ) \Big ] \,\right. \nonumber \\{} & {} \left. - \,\textrm{Li}_2\left( \frac{\rho }{1 \, + \, \rho } \right) \, + \,\frac{\pi ^2}{6} \right\} \, \nonumber \\{} & {} + \left( 2 \, \rho ^3 \, + \, 3 \, \rho ^2 \, + \, \frac{\rho }{2} \,+ \, \frac{1}{4} \right) \Big [ \log (\rho ) \, - \, \log (1 \, + \,\rho ) \Big ] \,\nonumber \\{} & {} + \, 2 \, \rho (1 \, + \, \rho ) \, - \, \frac{7}{3}. \end{aligned}$$
(132)

Note that the first-order soft Coefficient function \(C_S^{(1)}(\rho )\) contains the double-logarithmic term of the strange mass

$$\begin{aligned} \frac{1}{2} \, \log ^2(\rho ), \end{aligned}$$
(133)

as well as the single-logarithmic term

$$\begin{aligned} \frac{1}{4} \, \log (\rho ), \end{aligned}$$
(134)

both diverging in the massless limit for the final quark, \(\rho \rightarrow 0^+\). The complete soft Coefficient function up to first order,

$$\begin{aligned} C_S(\rho ,a) \, = \, 1 \, + \, a \, C_S^{(1)}(\rho ), \end{aligned}$$
(135)

is plotted in Fig. 2.

Fig. 2
figure 2

Plot of soft Coefficient function \(C_S(\rho ;a)\) in first-order approximation (see Eq. (135)), as a function of \(\rho \), in the wide interval \(0.025< \rho < 5\). The (horizontal) \(\rho \)- scale is logarithmic. We have defined \(a \equiv C_F \alpha _S(m_b) / \pi \cong 0.089\) for \(\alpha _S(m_b)=0.21\). A quite strong dependence of \(C_S(\rho ;a)\) for small \(\rho \) is observed, as expected; at \(\rho =0.025\), the first-order correction is already about 50% of the tree level value (which is one)

5.3 Soft Remainder function

The Soft Remainder function has the usual perturbative expansion beginning at first order:

$$\begin{aligned} \textrm{Rem}_S(y;\rho ;a) \, = \, a \, \textrm{Rem}_S^{(1)}(y;\rho ) \, + \, {\mathcal {O}}\left( a^2 \right) . \end{aligned}$$
(136)

According to Eq. (130), the Soft Remainder function has the first-order (leading) term given by:

$$\begin{aligned} \textrm{Rem}_S^{(1)}(y; \rho )= & {} E^{(1)}(y;\rho ) \, - \, \Sigma _S^{(1)}(y;\rho ) \, - \, C_S^{(1)}(\rho ) \, \nonumber \\= & {} \big ( 1 \, + \, 2 \, \rho \big ) \bigg \{ \frac{1}{2} \Big [\log ^2(y \, + \, \rho ) \, - \, \log ^2(\rho ) \Big ] \,\nonumber \\{} & {} + \,\text {Li}_2\left( \frac{\rho }{y \, + \, \rho } \right) \, - \,\frac{\pi ^2}{6} \, + - \, \log (y)\nonumber \\{} & {} \times \Big [\log (y \, + \, \rho )\, - \, \log (\rho ) \Big ] \bigg \} \, \nonumber \\{} & {} + \left( 2 \, \rho ^3 \, + \, 3 \, \rho ^2 \, + \, \frac{\rho }{2} \, + \, \frac{1}{4} \right) \nonumber \\{} & {} \times \Big [ \log (y \, + \, \rho ) \, - \, \log (\rho ) \Big ] \nonumber \\{} & {} - \frac{y}{y \, + \, \rho } \Bigg \{ \rho \left[ 2 \, \rho ^2 \, + \, \rho \, (y \, + \, 3) \,\right. \nonumber \\{} & {} \left. - \, \frac{1}{6} \left( 2 \, y^2 \, - \, 9 \, y \, + \, 9 \right) \right] \, \nonumber \\{} & {} + \, \frac{y}{12} \left( 2 \, y^2 \, - \, 3 \, y \, - \, 30 \right) \Bigg \} \, \nonumber \\{} & {} + y \left( 1 \, - \, \frac{y}{4} \right) \Big [ \log (1 \, + \, \rho ) \, - \, \log (y \, + \, \rho ) \Big ] \,\nonumber \\{} & {} - \, \frac{y}{4 \, (y\, + \, \rho )}. \end{aligned}$$
(137)

By taking into account that:

$$\begin{aligned} \text {Li}_2\left( 1\right) \, = \, \sum _{n=1}^\infty \frac{1}{n^2} \, = \, \frac{\pi ^2}{6}, \end{aligned}$$
(138)

it is immediate to check the vanishing of the soft Remainder function in the Born kinematics, i.e. in the limit \(y \rightarrow 0^+\) (in taking this limit, \(\rho \) is kept constant and not zero: \(\rho =\rho _0>0\)). The Soft Remainder function is plotted in Fig. 3 for different values of \(\rho \).

Fig. 3
figure 3

Plot of soft Remainder function \(Rem_S(y;\rho ;a)\) in first-order approximation, Eq. (137), as a function y in all its kinematic range, \(0 \le y \le 1\), for four different values of \(\rho \). As shown in the figure: \(\rho =1\): black continuous line; \(\rho =0.5\): red dashed line; \(\rho =0.2\): blue dotted line; \(\rho =0.1\): green dot-dashed line. By reducing \(\rho \) towards zero, the Remainder function becomes progressively bigger in size

Since

$$\begin{aligned} \textrm{Rem}_S^{(1)}(y=1;\,\rho ) \, = \, - \, C_S^{(1)}(\rho ), \end{aligned}$$
(139)

by reducing \(\rho \) towards zero (from above, let’s say, from \(\rho =1\)), the corresponding Soft Remainder functions, which are all strictly monotonically decreasing functions, and therefore all negative, become progressively bigger in size.

We can conclude that the soft factorization scheme which we have constructed in this section, is very simple, but it only works in the region it is aimed at, namely \(y \ll 1\) and \(\rho = {\mathcal {O}}(1)\): there is no bonus. To consistently describe the small y, small-mass region \(\rho \ll 1\), we have to construct a more general factorization scheme, based on a QCD form factor which also resumes small-\(\rho \) effects, i.e. the collinearly enhanced terms.

6 General factorization

In this section we construct a general factorization scheme, which correctly describes the soft region

$$\begin{aligned} 0 \, < \, y \, \ll \, \rho \, = \, {\mathcal {O}}(1) \qquad (\mathrm {soft\,\,region}) \end{aligned}$$
(140)

and the (effectively) massless region

$$\begin{aligned} 0 \, < \, \rho \, \ll \, y \, \ll \, 1 \qquad (\mathrm {effectively\,\,massless\,\,region}), \end{aligned}$$
(141)

as well as the “transition region”

$$\begin{aligned} 0 \, < \, y \, \approx \, \rho \, \ll \, 1 \qquad (\mathrm {transition\,\,region}). \end{aligned}$$
(142)

As in the previous massless or soft factorization schemes, the factorized event fraction is written as:

$$\begin{aligned} E\left( y;\rho ;\alpha _S\right)= & {} C\left( \rho ;\alpha _S\right) \, \Sigma \left( y;\rho ;\alpha _S\right) \nonumber \\ {}{} & {} + \, \textrm{Rem}\left( y;\rho ;\alpha _S\right) . \end{aligned}$$
(143)

In order to determine the Coefficient function, \(C(\rho ;\alpha _S)\), as well as the Remainder function, \(\textrm{Rem}(y;\rho ;\alpha _S)\), to \({\mathcal {O}}\left( \alpha _S\right) \), we need to know the general QCD form factor \(\Sigma \left( y;\rho ;\alpha _S\right) \), also at first order in the coupling. The latter cannot be evaluated directly, but it can be obtained by expanding in powers of \(\alpha _S\) the general resummed form factor, as described in the next section.

6.1 General QCD form factor

The general QCD form factor, resumming to all orders in \(\alpha _S\) the infrared (soft and/or collinear) logarithms occurring in the perturbative expansion of the event fraction, has the following expression [18]:

$$\begin{aligned}{} & {} \sigma _N\left( \rho ,\alpha _S\right) \, = \, \exp \int \limits _0^1 dy \, \Big [ (1-y)^{N-1} \, - \, 1 \Big ]\nonumber \\{} & {} \quad \times \left\{ \frac{1}{y} \int \limits _{Q^2 y^2/(1+\rho )}^{Q^2 y^2/(y+\rho )} \frac{dk_\perp ^2}{k_\perp ^2} \, A\left[ \rho ; \alpha _S\left( k_\perp ^2\right) \right] \, \right. \nonumber \\{} & {} \quad + \, B\left[ \alpha _S \left( \frac{Q^2y^2}{y + \rho } \right) \right] \frac{1}{y + \rho } \, + \, D\left[ \alpha _S \left( \frac{Q^2y^2}{1 + \rho } \right) \right] \frac{1}{y} \,\nonumber \\{} & {} \quad \left. + \,\Delta \left[ \alpha _S\left( \frac{Q^2y^2}{1 + \rho } \right) \right] \left( \frac{1}{y} \, - \, \frac{1}{y + \rho } \right) \right\} . \end{aligned}$$
(144)

Note that, in , the term proportional to \(\Delta (\alpha _S)\) describes soft-gluon emission off the strange quark line for . As a consequence, this term identically vanishes in the massless limit \(\rho \rightarrow 0^+\). We may say that the B and \(\Delta \) terms are somehow “complementary”, in the sense that one acts in the kinematic region where the other does not.

Let us remark that the general form factor in Eq. (144) reduces to:

  1. 1.

    the Heavy-to-Light form factor, Eq. (84), in the massless limit \(\rho \rightarrow 0^+\);

  2. 2.

    the Soft form factor, Eq. (121), in the soft limit \(y \rightarrow 0^+\), \(\rho =\)const\(\ne 0\) or, more simply, \(y/\rho \rightarrow 0^+\).

6.1.1 Form-factor expansion

To compute the partially-integrated form factor \(\Sigma = \Sigma (y; \rho ; \alpha _S)\), expanded up to first order in \(\alpha _S\), the only non-trivial integration involved is that one of the term proportional to \(a \, A^{(1)}(\rho )\), namely

$$\begin{aligned} \int \limits _y^1 \frac{d y'}{y'} \, \log \left( \frac{y' \, + \, \rho }{1 \, + \, \rho } \right) . \end{aligned}$$
(145)

In the above expression, the integration of the soft-gluon transverse momentum squared \(k_\perp ^2\) has already been made. In order to isolate the large infrared logarithms, the expansion of the resummed form factor is conveniently written – out of the many possible forms – as:

$$\begin{aligned} \Sigma (y;\rho ;a)= & {} 1 \, + \, a \, A^{(1)}(\rho ) \left[ - \, \log \left( \frac{y+\rho }{1+\rho } \right) \log (y) \,\right. \nonumber \\{} & {} \left. + \, \frac{1}{2}\, \log ^2\left( \frac{y+\rho }{1+\rho } \right) \right. \nonumber \\{} & {} \, + \, \log (1 \, + \,\rho ) \, \log \left( \frac{y+\rho }{1+\rho } \right) \, + \, \textrm{Li}_2\left( \frac{\rho }{y+\rho } \right) \,\nonumber \\{} & {} \left. - \, \textrm{Li}_2\left( \frac{\rho }{1+\rho } \right) \right] + a \, D^{(1)} \, \log (y)\nonumber \\{} & {} \, + \, a \, \Delta ^{(1)} \, \left[ \log (y) \, - \, \log \left( \frac{y+\rho }{1+\rho } \right) \right] \,\nonumber \\{} & {} + \, a \, B^{(1)} \, \log \left( \frac{y+\rho }{1+\rho } \right) \, +\,{\mathcal {O}}\left( a^2\right) . \end{aligned}$$
(146)

The following remarks about the above equation are in order:

  1. 1.

    Unlike the massless form factor, Eqs. (80) or (99), a \(\log ^2(y)\) term is absent in Eq. (146), because of the regulating effect on collinear emissions of a non-vanishing strange mass, i.e. \(\rho \ne 0\), as discussed in the Introduction. Apart from this term, actually all the possible quadratic and linear terms containing \(\log (y)\) and \(\log (y+\rho )/(1+\rho )\) do appear in Eq. (146);

  2. 2.

    The arguments of the dilogarithms are always smaller than, or equal to, one, so these term are uniformly bounded by \(\textrm{Li}_2(1) = \pi ^2/6 \simeq 1.64493\) – namely a constant of order one.

Checking that the first square bracket on the r.h.s. of Eq. (146) is equal to the integral in (145) is quite standard:

  1. 1.

    One takes the derivatives with respect to y of both expressions and checks that they are equal;

  2. 2.

    One checks that both expressions are equal for a particular value of y, such as for example the point \(y=1\), where the integral in (145) vanishes.

By replacing the explicit values of the first-order coefficients, one obtains for the first-order form factor,

$$\begin{aligned} \Sigma (y;\rho ;a) \, = \, 1 \, + \, a \, \Sigma ^{(1)}(y;\rho ) \, + \, {\mathcal {O}}\left( a^2\right) , \end{aligned}$$
(147)

the explicit expression:

$$\begin{aligned} \Sigma ^{(1)}(y;\rho )= & {} \big ( 1 \, + \, 2 \, \rho \big ) \Bigg [ - \, \log \left( \frac{y+\rho }{1+\rho } \right) \, \log (y) \,\nonumber \\{} & {} + \, \frac{1}{2} \, \log ^2\left( \frac{y+\rho }{1+\rho } \right) \, \nonumber \\{} & {} \, + \, \log (1 \, + \,\rho ) \, \log \left( \frac{y+\rho }{1+\rho } \right) + \, \textrm{Li}_2\left( \frac{\rho }{y \, + \, \rho } \right) \nonumber \\{} & {} - \, \textrm{Li}_2\left( \frac{\rho }{1 \, + \, \rho } \right) \Bigg ] \, + \, - \, 2 \log (y) \, \nonumber \\{} & {} + \, \frac{1}{4} \log \left( \frac{y+\rho }{1+\rho } \right) . \end{aligned}$$
(148)

The following remarks about the form factor above, Eq. (148), are in order:

  1. 1.

    It reduces to the massless form factor, Eq. (80), in the massless limit \(\rho \rightarrow 0^+\), and to the soft form factor, Eq. (127), for \(y/\rho \rightarrow 0^+ \), ;

  2. 2.

    It vanishes in the limit of vanishing photon energy:

    $$\begin{aligned} \lim _{y \rightarrow 1^-} \Sigma ^{(1)}(y;\rho ) \, = \, 0, \end{aligned}$$
    (149)

    where one has simply to take into account that

    $$\begin{aligned} \lim _{y \rightarrow 1^-} z(y;\rho ) \, = \, 1. \end{aligned}$$
    (150)

6.2 General Coefficient function

In order to evaluate the Coefficient and Remainder functions, the first step is to subtract, from the first-order spectrum, Eq. (115), the first-order QCD form factor, Eq. (148). That way, one obtains:

$$\begin{aligned}{} & {} C^{(1)}(\rho ) \, + \, \textrm{Rem}^{(1)}(y;\rho ) = E^{(1)}(y;\rho ) \, - \, \Sigma ^{(1)}(y;\rho ) \, \nonumber \\{} & {} \quad = \bigg [ \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{1}{2} \right) \, - \, y \left( 1 \, - \, \frac{y}{4} \right) \bigg ] \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \, \nonumber \\{} & {} \qquad + \, \frac{1 \, - \, y}{y \, + \, \rho } \left[ 2 \, \rho ^3 \, + \, \rho ^2 (y \, + \, 2) \, - \, \frac{\rho }{6} \left( 2 \, y^2 \, - \, 7 \, y \, + \, 14 \right) \right. \nonumber \\{} & {} \qquad \left. + \, \frac{y}{12}\left( 2 \, y^2 \, - \, y \, - \, 31 \right) \right] . \end{aligned}$$
(151)

The following remarks are in order:

  1. 1.

    All soft logarithms – namely all \(\log (y)\) terms – exactly canceled by subtracting from the spectrum the form factor, as it should, and as already happened in the (simpler) soft factorization scheme. Actually, in the present case, unlike the soft one, the complete cancellation of the first three rows on the r.h.s. of Eq. (115) occurred – a large number of terms canceled;

  2. 2.

    On the last member of Eq. (151), the coefficient of the collinear logarithm – namely the term \(\log \left[ (y + \rho )/(1+\rho )\right] \) – is suppressed by positive powers of \(\rho \) or y, again as it should. Note that this did not happen in the soft factorization scheme;

  3. 3.

    It is immediately checked that the event fraction minus the form factor exactly vanishes at the upper endpoint \(y=1\).

The first-order Coefficient function is given by:

$$\begin{aligned} C^{(1)}(\rho ) \, = \, \lim _{y \rightarrow 0^+} \Big [ E^{(1)}(y;\rho ) \, - \, \Sigma ^{(1)}(y;\rho ) \Big ]. \end{aligned}$$
(152)

By taking the above limit, one easily finds:

$$\begin{aligned}{} & {} C^{(1)}(\rho ) \, = \, - \, \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{1}{2} \right) \nonumber \\{} & {} \quad \Big [ \log (1 \, + \, \rho ) \, - \, \log (\rho ) \Big ] \, + \, 2 \, \rho \, (1 \, + \, \rho ) \, - \, \frac{7}{3}. \end{aligned}$$
(153)

Unlike the soft Coefficient function, Eq. (132), which, as we have shown, diverges like \(\log ^2(\rho )\) for \(\rho \rightarrow 0^+\), the general Coefficient function above has a finite value, of order one, in the massless limit:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} C^{(1)}(\rho ) \, = \, - \, \frac{7}{3}. \end{aligned}$$
(154)

Note that the most singular terms for \(\rho \rightarrow 0^+\) in \(C^{(1)}(\rho )\) are of the form \(\rho \log (\rho )\) (just like the \({\mathcal {O}}(\alpha _S)\) correction factor \(K(\rho )\) to the inclusive width, see Eq. (107)).

The no-recoil limit (\(\rho \rightarrow +\infty \)) of the general Coefficient function does not vanish, but is finite:

$$\begin{aligned} \lim _{\rho \rightarrow +\infty } C^{(1)}(\rho ) \, = \, - \, 2. \end{aligned}$$
(155)

Therefore, by increasing \(\rho \) from zero to infinity, the first-order Coefficient function increases by 1/3. A plot of the complete Coefficient function at first order,

$$\begin{aligned} C(\rho ;a) \, = \, 1 \, + \, a \, C^{(1)}(\rho ), \end{aligned}$$
(156)

as a function of the mass parameter \(\rho \), is given in Fig. 4. We observe that \(C(\rho ;a)\) is basically a monotonically-increasing function of \(\rho \), with a rather mild dependence on this variable. By comparing Fig. 4 with Fig. 2, we notice a substantial stabilization of the Coefficient function as far as the dependence on \(\rho \) is concerned; That is, generically speaking, a “good new”. Furthermore, while the soft Coefficient function is always greater than one, the general Coefficient function is always smaller than one.

Fig. 4
figure 4

Plot of the general Coefficient function \(C(\rho ;a)\) at first order, Eq. (156), as a function of the mass parameter \(\rho \), on a logarithmic scale. We have defined \(a \equiv C_F\alpha _S(m_b)/\pi \simeq 0.089\) for \(\alpha _S(m_b)=0.21\). The dotted red and blue lines represent the asymptotic values of \(C(\rho ,a)\) for \(\rho \rightarrow 0^+\) and \(\rho \rightarrow + \infty \) respectively. A rather mild dependence of \(C(\rho ;a)\) on \(\rho \) is observed

6.3 General Remainder function

The first-order Remainder function collects, by definition, all the \({\mathcal {O}}(\alpha _S)\) terms which are not included neither in the Form Factor nor in the Coefficient function:

$$\begin{aligned} \textrm{Rem}^{(1)}(y;\rho ) \, \equiv \, E^{(1)}(y;\rho ) \, - \, \Sigma ^{(1)}(y;\rho ) \, - \, C^{(1)}(\rho ).\nonumber \\ \end{aligned}$$
(157)

By construction (see Eq. (152)), it vanishes on Born kinematics:

$$\begin{aligned} \lim _{y \rightarrow 0^+} \textrm{Rem}^{(1)}(y;\rho ) \, = \, 0 \qquad (\rho =\textrm{const} > 0). \end{aligned}$$
(158)

The first-order Remainder function (the lowest non-vanishing order) explicitly reads:

$$\begin{aligned} \textrm{Rem}^{(1)}(y;\rho )= & {} \rho \left( 2 \, \rho ^2 \, + \, 3 \, \rho \, + \, \frac{1}{2} \right) \log \left( 1 \, + \, \frac{y}{\rho } \right) \,\nonumber \\{} & {} - \, y \left( 1 \, - \, \frac{y}{4} \right) \log \left( \frac{y \, + \, \rho }{1 \, + \, \rho } \right) \, \nonumber \\{} & {} - 2 \, \rho ^2 \, y \, - \, \rho \, y \, (3 \, - \, y) \, - \, \frac{y}{6} \left( 4 \, y^2 \, - \, 9 \, y \, - \, 9 \right) \, \nonumber \\{} & {} +\frac{ y^2 }{4 \, (y \, + \, \rho )} \left( 2 \, y^2 \, - \, 5 \, y \, + \, 4 \right) \, - \, \frac{y}{4 \, ( y \, + \, \rho )}.\nonumber \\ \end{aligned}$$
(159)
Fig. 5
figure 5

The continuous red line is the plot of the general Remainder function \(\textrm{Rem}(y;\rho ;a)\) in first order approximation, Eq. (159), as a function of y, in the small-y range \(0 \le y \le 0.2\), for \(\rho =0.002\). For comparison (see text), we have also plotted the subtracted Remainder function, Eq. (167) – the blue dotted line

It is immediate to check that the r.h.s. of the above equation vanishes for \(y \rightarrow 0^+\) (\(\rho =\textrm{const}>0\)), as all terms are explicitly proportional to y or to higher powers of y, or are proportional to \(\log (1 + y/\rho )\), which is also \({\mathcal {O}}(y)\).

Note that the second and the third row on the r.h.s. of Eq. (159) have been rewritten by means of a partial fractioning with respect to \(\rho \). The general Remainder function is plotted in Fig. 5. Similarly to the general Coefficient function, also the general Remainder function has a mild dependence on the mass parameter \(\rho \).

7 Improved factorization scheme

In the previous section we have constructed a factorization scheme, given by Eqs. (143), (144), (153) and (159), which correctly works in the threshold region \((y \ll 1)\) in the massive case \((\rho \ne 0)\) so long as, in taking the limit \(y\rightarrow 0^+\), the mass parameter \(\rho \) is kept constant and not zero: \(\rho = \rho _0 > 0\). However, it is also natural to ask ourselves what happens if we take the massless limit \(\rho \rightarrow 0^+\) in our massive factorization formula. We have seen that both the massive Coefficient and Remainder functions have a finite limit for \(\rho \rightarrow 0^+\), while the soft Coefficient and Remainder functions diverge like \(\log ^2(\rho )\) in the same limit. Therefore, as expected, a substantial improvement is obtained by going from the soft factorization scheme to the general one, again as far as the massless limit \(\rho \rightarrow 0^+\) is concerned. The problem then is: by taking the massless limit of our general factorization formula, do we obtain the same Form Factor, the same Coefficient and Remainder functions of the massless factorization scheme, i.e. of the standard factorization scheme applied to the massless event fraction \(E_0\), described in Sect. 3, or we do not?

Let us begin our analysis by studying the simpler object occurring in the factorization process, namely the Coefficient function. The massless limit in Eq. (154),

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} C^{(1)}(\rho ) \, = \, - \, \frac{7}{3}, \end{aligned}$$
(160)

does not coincide with the massless Coefficient function, i.e. evaluated in the usual factorization of the massless spectrum (\(\rho =0\) from the very beginning: see Sect. 3):

$$\begin{aligned} C_0^{(1)} \, = \, \, - \, \frac{31}{12}. \end{aligned}$$
(161)

To obtain the limiting Coefficient function in Eq. (160), one has to add to \(C_0^{(1)}\) the constant 1/4:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} C^{(1)}(\rho ) \, = \, C_0^{(1)} \, + \, \frac{1}{4}. \end{aligned}$$
(162)

We can say that the following two operations do not commute with each other:Footnote 3

  1. 1.

    Taking the massless limit \(\rho \rightarrow 0^+\) of the event fraction;

  2. 2.

    Factorizing the spectrum into a form factor, a Coefficient and a Remainder functions.

A similar problem, in particular, a “specular mismatch”, also occurs with the general Remainder function. In the massless limit \(\rho \rightarrow 0^+\), the general Remainder function becomes:

$$\begin{aligned}{} & {} \lim _{\rho \rightarrow 0^+} \textrm{Rem}^{(1)}(y;\rho ) \, = \, \, - \, y \left( 1 \, - \, \frac{y}{4} \right) \log (y) \, \nonumber \\{} & {} \quad - \, \frac{y}{12} \left( 2 \, y^2 \, - \, 3 \, y \, - \, 30 \right) \, - \, \frac{1}{4} \qquad (y=\textrm{const}>0).\nonumber \\ \end{aligned}$$
(163)

The massless limit above does not vanish in the Born-kinematic limit \(y \rightarrow 0^+\):

$$\begin{aligned} \lim _{y \rightarrow 0^+} \left[ \lim _{\rho \rightarrow 0^+} \textrm{Rem}^{(1)}(y;\rho ) \right] \, = \, - \, \frac{1}{4}. \end{aligned}$$
(164)

Actually, “symmetrically” with respect to the case of the general Coefficient function, the limiting general Remainder function is equal to the massless one minus the constant 1/4:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} \textrm{Rem}^{(1)}(y;\rho ) \, = \, \textrm{Rem}_0^{(1)}(y) \, - \, \frac{1}{4}. \end{aligned}$$
(165)

By comparing Eq. (162) with Eq. (165), we find that, by taking the massless limit of the general massive factorization formula, the constant 1/4 is moved from the Remainder function to the Coefficient function. As already noted, the conclusion is that it makes a difference to take the massless limit \(\rho \rightarrow 0^+\) before Factorization or after Factorization of the spectrum. The problem does not originate from the QCD form factor which, as we have shown, has a smooth behavior in the massless limit \(\rho \rightarrow 0^+\), so it necessarily originates from the splitting of the non-logarithmic terms between the Coefficient function and the Remainder function.

By looking at the plot of the Remainder function in Fig. 5 for \(\rho \ll 1\), one finds a small dip for very small y. The latter is produced by the last term on the r.h.s. of Eq. (159), namely the term

$$\begin{aligned} - \, \frac{y}{4 \, (y \, + \, \rho )}. \end{aligned}$$
(166)

Indeed the dip completely disappears in the subtracted Remainder function,

$$\begin{aligned} \overline{\textrm{Re}}^{(1)}(y;\rho ) \, \equiv \, \textrm{Re}^{(1)}(y;\rho ) \, + \, \frac{y}{4 \, (y \, + \, \rho )}, \end{aligned}$$
(167)

in which the above term has been removed by hand (see Fig. 5). Therefore we can conclude that the finite mismatch which we have found, originates from this term only, on which we then focus our attention from now on.

Formally, for any strictly positive (and fixed) value of \(\rho \), whatever small, the ratio (166) vanishes in the threshold limit \(y\rightarrow 0^+\), so this term is naturally relegated into the massive Remainder function. However for very small values of the mass parameter \(\rho \),

$$\begin{aligned} 0 \, < \, \rho \, \ll \, 1, \end{aligned}$$
(168)

the term (166) is small only in a tiny region of the kinematic variable y, namely the region

$$\begin{aligned} 0 \, < \, y \, \ll \, \rho \qquad (\textrm{small} \,\, y), \end{aligned}$$
(169)

where:

$$\begin{aligned} - \, \frac{y}{4 \, (y \, + \, \rho )} \, \approx \, - \, \frac{y}{4 \, \rho } \, \ll \, 1. \end{aligned}$$
(170)

Therefore only in the small y region (169) (small compared to \(\rho \)), the term in (166) is reasonably inserted into the Remainder function. In the complementary, large y region (again in a strong inequality sense and again compared to \(\rho \)),

$$\begin{aligned} 0 \, < \, \rho \, \ll \, y \qquad (\textrm{large} \,\, y), \end{aligned}$$
(171)

this term is approximately equal to a constant of order one,

$$\begin{aligned} - \, \frac{y}{4 \, (y \, + \, \rho )} \, \approx \, - \, \frac{1}{4}, \end{aligned}$$
(172)

so it would be reasonable to relegate it inside the Coefficient function, and no more inside the Remainder function.

Mathematically, the problem originates from the fact that:

$$\begin{aligned} \lim _{y \rightarrow 0^+} \, - \, \frac{y}{4 \, (y \, + \, \rho )} \, = \, 0, \end{aligned}$$
(173)

while:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} \, - \, \frac{y}{4 \, (y \, + \, \rho )} \, = \, - \, \frac{1}{4}. \end{aligned}$$
(174)

The ratio (166) is the only term in the Remainder function, Eq. (159), for which the limits \(y\rightarrow 0^+\) and \(\rho \rightarrow 0^+\) do not commute with each other, as:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} \left[ \lim _{y \rightarrow 0^+} \, - \, \frac{y}{4 \, (y \, + \, \rho )} \right] \, = \, \lim _{\rho \rightarrow 0^+} 0 \, = \, 0; \end{aligned}$$
(175)

while:

$$\begin{aligned} \lim _{y \rightarrow 0^+} \left[ \lim _{\rho \rightarrow 0^+} \, - \, \frac{y}{4 \, (y \, + \, \rho )} \right] \, = \, \lim _{y \rightarrow 0^+} \left( - \, \frac{1}{4} \right) = - \frac{1}{4} \, \ne \, 0.\nonumber \\ \end{aligned}$$
(176)

On the contrary, for the terms of the form \(y^n/(y+\rho )\) with \(n>1\), appearing on the r.h.s. of Eq. (159), the two limits \(y \rightarrow 0^+\) and \(\rho \rightarrow 0^+\) do commute with each other. Note also that more complicated non-commuting terms, such as for example \(y^2/(y + \rho )^2\) or \(y^2/(y^2 + \rho ^2)\), do not appear on the r.h.s. of Eq. (159). A crude solution to this problem is to define:

  1. 1.

    An improved (I) Coefficient function, by adding to the massive Coefficient function, \(C^{(1)}(\rho )\) in Eq. (153), the term (166) when the latter is large, i.e. for \(y>\rho \):

    $$\begin{aligned} C_I^{(1)}(y;\rho ) \, \equiv \, C^{(1)}(\rho ) \, - \, \frac{y}{4 \, (y \, + \, \rho )} \, \theta (y \, - \, \rho ); \end{aligned}$$
    (177)

    where \(\theta (x) \equiv 1\) for \(x>0\) and zero otherwise is the standard Heaviside unit step function;

  2. 2.

    An improved Remainder function, by adding to the subtracted Remainder function, \(\overline{\textrm{Re}}^{(1)}(y;\rho )\) in Eq. (167), the term (166) when the latter is small, i.e. in the complementary case \(y < \rho \):

    $$\begin{aligned} \textrm{Re}_I^{(1)}(y;\rho ) \, \equiv \, \overline{\textrm{Re}}^{(1)}(y;\rho ) \, - \, \frac{y}{4 \, (y \, + \, \rho )} \, \theta (\rho \, - \, y).\nonumber \\ \end{aligned}$$
    (178)

Note that:

$$\begin{aligned} C_I^{(1)}(\rho ;y) \, + \, \textrm{Re}_I^{(1)}(y;\rho ) \, \equiv \, C^{(1)}(\rho ) \, + \, \textrm{Re}^{(1)}(y;\rho ),\nonumber \\ \end{aligned}$$
(179)

i.e., with the present improvement, we have simply made a rearrangement of terms among the Coefficient function and the Remainder function. Since the QCD form factor \(\Sigma \) equals unity in the free limit,

$$\begin{aligned} \Sigma (y;\rho ;\alpha _S) \, = \, 1 \, + \, {\mathcal {O}}\left( \alpha _S \right) , \end{aligned}$$
(180)

it follows that the improved resummed formula, i.e. Eq. (143) with the improved Coefficient and Remainder functions, coincides with the standard resummed formula or the fixed-order event fraction to \({\mathcal {O}}\left( \alpha _S \right) \).

An important point is that our improvement necessarily introduces a dependence on y in the massive Coefficient function. In order to have a Coefficient function with a dependence on y which is as simple as possible, one can split the ratio (166) as:

$$\begin{aligned} - \, \frac{y}{4 \, (y \, + \, \rho )} \, = \, - \, \frac{1}{4} \, + \, \frac{\rho }{4 \,(y \, + \, \rho )}. \end{aligned}$$
(181)

When \(y>\rho \), i.e. when the ratio on the l.h.s. above is large, one inserts the constant \(-1/4\) inside the improved Coefficient function, while the (small) fraction \(\rho / ( 4 (y \, + \, \rho ) )\) is inserted in the improved Remainder function. Therefore one can also define the simpler improved Coefficient function

$$\begin{aligned} C_I^{(1)}(y;\rho ) \, \equiv \, C^{(1)}(\rho ) \, - \, \frac{1}{4} \, \theta (y-\rho ); \end{aligned}$$
(182)

together with the improved Remainder function

$$\begin{aligned}{} & {} \textrm{Re}^{(1)}_I(y;\rho ) \, \equiv \, \overline{\textrm{Re}}^{(1)}(y;\rho ) \,\nonumber \\{} & {} \quad - \, \frac{y}{4 \, (y \, + \, \rho )} \, \theta (\rho -y) \, + \, \frac{\rho }{4 \, (y \, + \, \rho )} \, \theta (y-\rho ). \end{aligned}$$
(183)

Let us remark that, even though the non-commuting term (166) is numerically rather small in size, it has its own relevance on the theoretical side. We also expect the presence of similar non-commuting terms to be generic in heavy-to-heavy decays, i.e. not to be restricted to the rare \(B \rightarrow X_s \gamma \) decays. Furthermore, in different processes the size of non-commuting terms can be numerically larger.

7.1 Smoothing the transition

The improved factorization scheme, which we have constructed in the previous section, can be further improved by eliminating the discontinuities in the Coefficient and Remainder functions produced by the \(\theta \)-functions. We can regularize the discontinuities by means of smooth functions with a similar step behavior to the \(\theta \)-functions, such as for example the sigmoids (see Figs. 6 and 7):

$$\begin{aligned} \theta (x) \quad \mapsto \quad S_\Delta (x) \, \equiv \, \frac{1}{\sqrt{2\pi } \, \Delta } \int \limits _{- \infty }^x \exp \left( - \, \frac{t^2}{2 \, \Delta ^2} \right) \, dt;\nonumber \\ \end{aligned}$$
(184)

where \(\Delta >0\) is a parameter specifying the size of the x-interval, centered around \(x=0\), where most of the variation of the function \(S_\Delta (x)\) occurs. It holds indeed:

$$\begin{aligned} S_\Delta (x=+\Delta ) \, - \, S_\Delta (x=-\Delta ) \, \simeq \, 0.68. \end{aligned}$$
(185)

Furthermore, in a weak sense:

$$\begin{aligned} \lim _{\Delta \rightarrow 0^+} \sigma _\Delta (x) \, = \, \theta (x), \end{aligned}$$
(186)

so we recover the previous case by sending the auxiliary parameter \(\Delta \) to zero. As well known from statistics, \(\Delta \) is the standard deviation \(\sigma \) of the Gaussian distribution function inside the integral on the r.h.s. of Eq. (184).

It is immediate to check the following basic properties of the function \(S_\Delta (x)\):

$$\begin{aligned}{} & {} \lim _{x\rightarrow -\infty } S_\Delta (x) = 0; \qquad \lim _{x\rightarrow +\infty } S_\Delta (x) \, = \, 1; \quad S_\Delta (0) = \frac{1}{2}; \nonumber \\{} & {} S_\Delta (x) \, + \, S_\Delta (-x) \, \equiv \, 1. \end{aligned}$$
(187)

The smoothed improved Coefficient and Remainder functions are simply obtained from the previous improved ones, Eqs. (182) and (183), by replacing the \(\theta \)-functions with \(S_\Delta \)-functions with the same arguments:

$$\begin{aligned} C_I^{(1)}(y;\rho )\equiv & {} C^{(1)}(\rho ) \, - \, \frac{1}{4} \, S_\Delta (y-\rho ); \nonumber \\ \textrm{Re}_I^{(1)}(y;\rho )\equiv & {} \overline{\textrm{Re}}^{(1)}(y;\rho ) \, + \, \frac{\rho }{4 \, (y+\rho )} \, S_\Delta (y-\rho ) \, \nonumber \\{} & {} - \, \frac{y}{4 \, (y+\rho )} \, S_\Delta (\rho -y). \end{aligned}$$
(188)

It is natural to assume \(\Delta \) to be a function of \(\rho \):

$$\begin{aligned} \Delta \, = \, \Delta (\rho ), \end{aligned}$$
(189)

together with \(\Delta < \rho \). In practice, for the numerical value of \(\Delta \), one can take a fraction of \(\rho \), such as for example:

$$\begin{aligned} \Delta \, = \, \frac{\rho }{3} \quad \textrm{or} \quad \frac{\rho }{2}. \end{aligned}$$
(190)

7.2 Partition of unity

One may wish to have an improved Coefficient function which is as similar as possible to the standard one. Actually, it is possible to construct an improved Coefficient function which is:

  1. 1.

    Exactly equal to the massive Coefficient function \(C^{(1)}(\rho )\), Eq. (153), in the small-y region

    $$\begin{aligned} y \, < \, \rho \, - \, \Delta ; \end{aligned}$$
    (191)
  2. 2.

    Independent of y and with the correct massless limit \(\rho \rightarrow 0^+\), namely \(C^{(1)}_0\) in Eq. (70), in the large-y region

    $$\begin{aligned} y \, > \, \rho \, + \, \Delta . \end{aligned}$$
    (192)

This problem can be solved by means of the so-called Partition of Unity, a general analytic method in real geometry [20]. Let us consider the function

$$\begin{aligned} \varphi _\Delta (x) \, \equiv \, \exp \left( - \, \frac{1}{\Delta + x} \, - \, \frac{1}{\Delta - x} \right) \qquad \textrm{for} \,\,\, |x| \, < \, \Delta \nonumber \\ \end{aligned}$$
(193)

and zero otherwise. In Fig. 6 we plot this function for \(\Delta =1\) (the red continuous line). Note that the function \(\varphi _1(x)\) is not qualitatively very different from a Gaussian with standard deviation \(\sigma =0.365 \approx \Delta /3\) (blue dotted line), even though the latter formally has support on the entire real line.

Fig. 6
figure 6

Plot of the function \(\varphi _\Delta (x)\) for \(\Delta =1\) in the range \(-1.3 \le x \le 1.3\) (red continuous line). As discussed in the main text, and as can also be seen from the figure, this function is smooth and identically equal to zero for \(|x| \ge 1\). For comparison, we have also plotted a Gaussian (blue dotted line) with the same normalization (integral over the real line) and with standard deviation \(\sigma =0.365\)

It is immediate to check that \(\varphi _\Delta (x)\) is an even function of x,

$$\begin{aligned} \varphi _\Delta (-x) \, = \, \varphi _\Delta (x). \end{aligned}$$
(194)

By explicitly computing the derivatives of \(\varphi _\Delta (x)\) of all orders at \(x=\pm \Delta \), it is straightforward to check that this function is infinitely smooth on the real line,

$$\begin{aligned} \varphi _\Delta \, \in \, C^\infty ({{\mathbb {R}}}). \end{aligned}$$
(195)

Given the normalization constant

$$\begin{aligned} K \, \equiv \, \int \limits _{-\Delta }^{+\Delta } \varphi _\Delta (x) \, dx \, > \, 0, \end{aligned}$$
(196)

let us define the function

$$\begin{aligned} \Phi _\Delta (x) \, \equiv \, \frac{1}{K} \int \limits _{-\infty }^x \varphi _\Delta (x') \, dx'. \end{aligned}$$
(197)

Being the primitive of an infinitely smooth function, also \(\Phi _\Delta \) is a \(C^\infty ({{\mathbb {R}}})\) function. This function also satisfies the following properties:

  1. 1.

    It is identically equal to zero for small x, precisely:

    $$\begin{aligned} \Phi _\Delta (x) \, \equiv \, 0 \quad \textrm{for} \,\, x \, < \, - \, \Delta ; \end{aligned}$$
    (198)
  2. 2.

    It is identically equal to one for large x,

    $$\begin{aligned} \Phi _\Delta (x) \, \equiv \, 1 \quad \textrm{for} \,\, x \, > \, + \, \Delta ; \end{aligned}$$
    (199)
  3. 3.

    It is strictly monotonically increasing in the interval

    $$\begin{aligned} - \, \Delta \,< \, x \, < \, + \, \Delta . \end{aligned}$$
    (200)

The function \(\Phi _\Delta (x)\) is plotted in Fig. 7 for \(\Delta =1\).

Note that the function

$$\begin{aligned} \Psi _\Delta (x) \, \equiv \, \Phi _\Delta (x) \, - \, \frac{1}{2} \end{aligned}$$
(201)

is odd in x,

$$\begin{aligned} \Psi _\Delta (-x) \, = \, - \, \Psi _\Delta (x). \end{aligned}$$
(202)

It vanishes indeed at \(x=0\).

Fig. 7
figure 7

The red continuous line is the plot of the function \(\Phi _\Delta (x)\) for \(\Delta =1\) in the range \(-1.5< x < 1.5\). As discussed in the main text and as can also be seen from the figure, this function is smooth, identically equal to zero for \(x \le -1\), identically equal to one for \(x \ge 1\), and strictly monotonically increasing in the “transition region” \(-1 \le x \le 1\). The green dotted line represents the upper horizontal asymptote \(y=1\). The blue dotted line is the plot of the sigmoid, Eq. (184), with standard deviation \(\Delta =0.365\). Note that the two curves are barely distinguishable

From Eq. (202) the fundamental relation follows:

$$\begin{aligned} \Phi _\Delta (x) \, + \, \Phi _\Delta (-x) \, \equiv \, 1, \quad x \, \in \, {{\mathbb {R}}}. \end{aligned}$$
(203)

The required improved Coefficient and Remainder functions are simply obtained by replacing, inside Eqs. (188), the \(S_\Delta \)-functions with \(\Phi _\Delta \)-functions with the same arguments:

$$\begin{aligned} C_I^{(1)}(y;\rho )\equiv & {} C^{(1)}(\rho ) \, - \, \frac{1}{4} \, \Phi _\Delta (y \, - \, \rho ); \nonumber \\ \textrm{Re}^{(1)}_I(y;\rho )\equiv & {} \overline{\textrm{Re}}^{(1)}(y;\rho ) \, + \, \frac{\rho }{4 \, (y \, + \, \rho )} \, \Phi _\Delta (y \, - \, \rho ) \, \nonumber \\{} & {} - \, \frac{y}{4 \, (y \, + \, \rho )} \, \Phi _\Delta (\rho \, - \, y). \end{aligned}$$
(204)

The improved Coefficient function at first order in a (or in \(\alpha _S\)),

$$\begin{aligned} C_I(y;\rho ;a) \, = \, 1 \, + \, a \, C_I^{(1)}(y;\rho ), \end{aligned}$$
(205)

is plotted in Fig. 8.

Fig. 8
figure 8

Plot of the improved Coefficient function \(C_I(y;\rho ;a)\) at first order in a, Eq. (205), for \(\rho =0.1\) and \(\Delta = 2/3 \, \rho \simeq 0.067\), as a function of y, in the small-y region \(0 \le y \le 0.2\). The step behavior, as well as the smooth transition around \(y=0.1\) of total width \(2\Delta \), both induced by the partition of unity, are clearly visible

The improved Remainder function at first order,

$$\begin{aligned} \textrm{Rem}_I(y;\rho ;a) \, = \, a \, \textrm{Rem}_I^{(1)}(y;\rho ), \end{aligned}$$
(206)

is plotted in Fig. 9.

Fig. 9
figure 9

Plot of the improved Remainder function, \(\textrm{Rem}_I(y;\rho ;a)\), at first order in a, Eq. (206), as a function of y, in the small-y region \(0 \le y \le 0.2\), for \(\rho =0.1\) and \(\Delta =2/3 \rho \simeq 0.067\). The steeper rise around \(y=0.1\), induced by the partition of unity, is clearly visible

Let’s now prove that the above improved Coefficient function possesses all the properties we were looking for. When \(y<\rho -\Delta \), the function \(\Phi _\Delta (y-\rho )\) identically vanishes, so that:

$$\begin{aligned} C_I^{(1)}(y;\rho ) \, \equiv \, C^{(1)}(\rho ) \qquad (y \, < \, \rho \, - \, \Delta ). \end{aligned}$$
(207)

When \(y>\rho +\Delta \), the function \(\Phi _\Delta (y-\rho )\) is identically equal to one, so that:

$$\begin{aligned} C_I^{(1)}(y;\rho ) \, \equiv \, C^{(1)}(\rho ) \, - \, \frac{1}{4} \qquad (y \, > \, \rho \, + \, \Delta ). \end{aligned}$$
(208)

According to Eq. (162),

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} C^{(1)}(\rho ) \, - \, \frac{1}{4} \, = \, C_0^{(1)}, \end{aligned}$$
(209)

so that, in the massless limit \(\rho \rightarrow 0^+\), we recover the Coefficient function \(C_0^{(1)}\) of the standard factorization of the massless spectrum (i.e. the case where the limit \(\rho \rightarrow 0^+\) is taken before the factorization of the perturbative spectrum into a form factor, a Coefficient and a Remainder functions).

7.3 Generalization

Up to now we have explicitly considered only the contribution of the (leading) operator \(O_7\) to the radiative \(B \rightarrow X_s \gamma \) decay, contained in the effective non-leptonic weak Hamiltonian (see [12] and references therein)

$$\begin{aligned} {\mathcal {H}}_{eff} \, = \, - \, \frac{4G_F}{\sqrt{2}} \lambda _t \sum _{j=1}^8 C_j(\mu ) O_j(\mu ). \end{aligned}$$
(210)

\(C_j(\mu )\) is the Wilson (short-distance) Coefficient function of the operator \(O_j(\mu )\) and \(\mu = {\mathcal {O}}(m_b)\) is a factorization scale. The operator basis read:

$$\begin{aligned} O_1= & {} {\bar{c}}_{Li} \gamma ^\mu b_{Lj} {\bar{s}}_{Lj} \gamma _\mu c_{Li}; \nonumber \\ O_2= & {} {\bar{c}}_{Li} \gamma ^\mu b_{Li} {\bar{s}}_{Lj} \gamma _\mu c_{Lj}; \nonumber \\ O_3= & {} {\bar{s}}_{Li} \gamma ^\mu b_{Li} \left[ {\bar{u}}_{Lj} \gamma _\mu u_{Lj} + \cdots + {\bar{b}}_{Lj} \gamma _\mu b_{Lj} \right] ; \nonumber \\ O_4= & {} {\bar{s}}_{Li} \gamma ^\mu b_{Lj} \left[ {\bar{u}}_{Lj} \gamma _\mu u_{Li} + \cdots + {\bar{b}}_{Lj} \gamma _\mu b_{Li} \right] ; \nonumber \\ O_5= & {} {\bar{s}}_{Li} \gamma ^\mu b_{Li} \left[ {\bar{u}}_{Rj} \gamma _\mu u_{Rj} + \cdots + {\bar{b}}_{Rj} \gamma _\mu b_{Rj} \right] ; \nonumber \\ O_6= & {} {\bar{s}}_{Li} \gamma ^\mu b_{Lj} \left[ {\bar{u}}_{Rj} \gamma _\mu u_{Ri} + \cdots + {\bar{b}}_{Rj} \gamma _\mu b_{Ri} \right] ; \nonumber \\ O_7\equiv & {} \frac{e}{16 \, \pi ^2} \, {\bar{s}}_i \, \sigma ^{\mu \nu } \left( m_b \, R \, + \, m_s \, L \right) b_i \, F_{\mu \nu }; \nonumber \\ O_8\equiv & {} \frac{g_s}{16 \, \pi ^2} \, {\bar{s}}_i (t^a)_{ij} \sigma ^{\mu \nu } \left( m_b \, R \, + \, m_s \, L \right) b_j \, G_{\mu \nu }^a; \end{aligned}$$
(211)

where \(i,j=1,2,3\) are color indices (fundamental \(SU(3)_c\) representation). The operators \(O_1,\ldots ,O_6\) are 6-dimensional 4-fermion operators, which induce \(b \rightarrow s\) transitions after contracting the repeated quark fields, such as for example \({\bar{c}}_{Li}\) and \(c_{Li}\) in \(O_1\). These contractions therefore generate \(b \rightarrow s\) effective non-local operators, with photons and/or gluons in the final states attached to the quark loop. Finally \(O_7/O_8\) is a 5-dimensional local magnetic/chromo-magnetic operator.

In this section we generalize the evaluation of the photon spectrum by including all the operators in \({\mathcal {H}}_{eff}\). Let us first omit from our computations the operator \(O_8\), which has rather peculiar properties to be discussed later.

7.3.1 Massless case

Let us first summarize the results in the simpler massless case ([5] and references therein), in a notation which easily generalizes to the massive case.

Let’s first consider the rare b decay into the simplest final state, namely:

$$\begin{aligned} b \, \rightarrow \, s \, + \, \gamma . \end{aligned}$$
(212)

At lowest order, \({\mathcal {O}}(\alpha _{em})\), only the magnetic operator \(O_7\) and some 4-fermion operators give a non-zero contribution. An important point is that the effect of these 4-fermion operators can be absorbed into a redefinition of the Wilson Coefficient function of \(O_7\) ([12] and references therein):

$$\begin{aligned} C_7 \quad \mapsto \quad C_7^{\textrm{eff}} \, \equiv \, C_7 \, + \, e_d \, C_5 \, + \, 3 \, e_d \, C_6; \end{aligned}$$
(213)

where \(e_d=-1/3\) is the electric charge of a down-type quark (in units of the proton charge). The lowest-order rate reads:

$$\begin{aligned} \Gamma ^{(0)}_{77} \, = \, \frac{G_F^2 m_b^5}{32 \, \pi ^3} \, \big (C_7^{\textrm{eff}}\big )^2 \, \left| \lambda _t \right| ^2 \, \frac{\alpha _{em}}{\pi }. \end{aligned}$$
(214)

Let’s now consider the rare b decay with an additional real gluon in the final state:

$$\begin{aligned} b \, \rightarrow \, s \, + \, g \, + \, \gamma . \end{aligned}$$
(215)

The second important point is that only the contribution to the rate from \(O_7\) (schematically, the term \(\langle O_7 \rangle \langle O_7 \rangle \)), contains non-integrable infrared singularities. All the other contributions to the partial rate, let’s call them

$$\begin{aligned} \Gamma _{ij}^{(1)}(y) \equiv \int \limits _0^y \frac{d\Gamma _{ij}^{(1)}}{dy'} \, dy', \end{aligned}$$
(216)

such as for example \(\Gamma _{27}^{(1)}(y)\) or \(\Gamma _{22}^{(1)}(y)\), contain integrable infrared singularities or are finite. In other words, \(\Gamma _{ij}^{(1)}(y)\) for \((i,j) \ne (7,7)\) does not contain any \(\log ^2 y\) and \(\log y\) terms without power-suppressed coefficients. Therefore these terms only contribute to the \({\mathcal {O}}(\alpha _S)\) Coefficient and Remainder functions in the resummed formulae. The partially integrated rate is then of the form:

$$\begin{aligned}{} & {} \int \limits _0^y \frac{d\Gamma }{dy'}(y',a) \, dy' = \Gamma ^{(0)}_{77} \left\{ 1 \, + \, a \, K_{77}^{(1)} \, - \, \frac{a}{2} \, \log ^2(y)\right. \,\nonumber \\{} & {} - \, a \, \frac{7}{4} \log (y) \, + \, a \, \textrm{Rem}_{77}^{(1)}(y) \nonumber \\{} & {} \left. + \, \sum _{(i,j) \ne (7,7),(8,8)} a \, F_{ij}^{(1)}(y) \right\} \, + \, {\mathcal {O}}(a^2); \end{aligned}$$
(217)

where \(K_{77}^{(1)}\) is a constant whose actual value, as we are going to show in a moment, is immaterial, the \(O_7O_7\)-Remainder function \(\textrm{Rem}_{77}^{(1)}(y)\) is the function \(H_0^{(1)}(y)\) in Eq. (73), and

$$\begin{aligned} F_{ij}^{(1)}(y) \, \equiv \, \frac{\Gamma _{ij}^{(1)}(y)}{\Gamma ^{(0)}_{77}}, \qquad (i,j) \, \ne \, (7,7), \, (8,8). \end{aligned}$$
(218)

By defining:

$$\begin{aligned} K_{ij}^{(1)} \, \equiv \, F_{ij}^{(1)}(y=0), \qquad (i,j) \, \ne \, (7,7), \, (8,8), \end{aligned}$$
(219)

and

$$\begin{aligned} \textrm{Rem}_{ij}^{(1)}(y) \, \equiv \, F_{ij}^{(1)}(y) \, - \, F_{ij}^{(1)}(0), (i,j) \, \ne \, (7,7), \, (8,8),\nonumber \\ \end{aligned}$$
(220)

the partial rate above is rewritten:

$$\begin{aligned} \int \limits _0^y \frac{d\Gamma }{dy'}(y';a) \, dy' \,= & {} \, 1 + a \, \sum _{(i,j) \ne (8,8)} K_{ij}^{(1)} \, \nonumber \\{} & {} - \, \frac{a}{2} \, \log ^2(y) \, - \, a \, \frac{7}{4} \log (y) \, \nonumber \\{} & {} +\, a \, \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y). \end{aligned}$$
(221)

By dividing by the total rate,

$$\begin{aligned} \int \limits _0^1 \frac{d\Gamma }{dy}(y;a) \, dy= & {} 1 \, + \, a \, \sum _{(i,j) \ne (8,8)} K_{ij}^{(1)} \, \nonumber \\{} & {} + \, a \, \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y=1),\nonumber \\ \end{aligned}$$
(222)

we obtain the event fraction \(E_0 \equiv E_{m_s=0}\) to \({\mathcal {O}}(\alpha _S)\), which is naturally factorized as (compare with Eq. (79)):

$$\begin{aligned} E_0(y;a) \, = \, C_0(a) \, \Sigma _0(y,a) \, + \, \textrm{Rem}_0(y,a), \end{aligned}$$
(223)

where:

$$\begin{aligned} C_0(a) \, \equiv \, 1 \, - \, a \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y=1) \, + \, {\mathcal {O}}(a^2), \end{aligned}$$
(224)

\(\Sigma _0(y;a)\) is given in Eq. (80) and

$$\begin{aligned} \textrm{Rem}_0(y,a) \, \equiv \, a \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y) \, + \, {\mathcal {O}}(a^2). \end{aligned}$$
(225)

The following remarks are in order:

  1. 1.

    All the constants \(K_{ij}^{(1)}\) cancel by dividing the partial rate by the total rate;

  2. 2.

    As far as the dependence on the Wilson Coefficient functions is concerned:

    $$\begin{aligned} \textrm{Rem}_{ij}^{(1)}(y) \, \propto \, \frac{C_i \, C_j}{ \big ( C_7^{\textrm{eff}} \big )^2 }. \end{aligned}$$
    (226)

7.3.2 Massive case

Let us now consider the factorization of the photon spectrum in the massive case, \(m_s \ne 0\), which is our primary concern. In the massive case, the lowest-order rate gets a correction factor \((1+r)(1-r)^3=(1+2\rho )/(1+\rho )^4\), so that:

$$\begin{aligned} \Gamma ^{(0)}_{77}(\rho ) \, = \, \frac{G_F^2 m_b^5}{32 \, \pi ^3} \, \big (C_7^{\textrm{eff}}\big )^2 \, \left| \lambda _t \right| ^2 \, \frac{\alpha _{em}}{\pi } \frac{1+2\rho }{(1+\rho )^4}. \end{aligned}$$
(227)

Since, as far as we know, no analytic expressions of the \(\Gamma _{ij}^{(1)}(y,\rho )\) contributions of the photon spectrum are available at present in the massive case, except for \(i=j=7\),Footnote 4 let us present a general discussion along the lines of the massless factorization. Similarly to the massless case, the partially integrated rate can be written in the form:

$$\begin{aligned}{} & {} \int \limits _0^y \frac{d\Gamma }{dy'}(y',\rho ,a) \, dy' = \Gamma ^{(0)}_{77}(\rho )\nonumber \\{} & {} \left\{ 1 \, + \, a \, K_{77}^{(1)}(\rho ) \, + \, a \, \Sigma ^{(1)}(y,\rho ) \, + \, a \, \textrm{Rem}_{77}^{(1)}(y;\rho ) \, \right. \nonumber \\{} & {} \left. + \, \sum _{(i,j) \ne (7,7),(8,8)} a \, F_{ij}^{(1)}(y;\rho ) \right\} \, + \, {\mathcal {O}}(a^2); \end{aligned}$$
(228)

where \(K_{77}^{(1)}(\rho )\) is a \(\rho \)-dependent constant, \(\Sigma ^{(1)}(y,\rho )\) is given in Eq. (148), \(\textrm{Rem}_{77}^{(1)}(y;\rho )\) is given in Eq. (159) and

$$\begin{aligned} F_{ij}^{(1)}(y;\rho ) \, \equiv \, \frac{\Gamma _{ij}^{(1)}(y,\rho )}{\Gamma ^{(0)}_{77}(\rho )}, \qquad (i,j) \, \ne \, (7,7), \, (8,8).\nonumber \\ \end{aligned}$$
(229)

These quantities have the corresponding massless limits above:

$$\begin{aligned}{} & {} \lim _{\rho \rightarrow 0^+} K_{77}^{(1)}(\rho ) \, = \, K_{77}^{(1)};\nonumber \\{} & {} \lim _{\rho \rightarrow 0^+} F_{ij}^{(1)}(y;\rho ) \, = \, F_{ij}^{(1)}(y). \end{aligned}$$
(230)

By defining:

$$\begin{aligned} K_{ij}^{(1)}(\rho ) \, \equiv \, F_{ij}^{(1)}(y=0;\rho ) \end{aligned}$$
(231)

and

$$\begin{aligned}{} & {} \textrm{Rem}_{ij}^{(1)}(y;\rho ) \, \equiv \, F_{ij}^{(1)}(y;\rho ) \, - \, F_{ij}^{(1)}(y=0;\rho )\nonumber \\{} & {} \qquad (i,j) \, \ne \, (7,7), \, (8,8), \end{aligned}$$
(232)

the partial rate is written:

$$\begin{aligned}{} & {} \int \limits _0^y \frac{d\Gamma }{dy'}(y',\rho ,a) \, dy' \nonumber \\{} & {} = \, 1 + a \, \sum _{(i,j) \ne (8,8)} K_{ij}^{(1)}(\rho ) \, + \, a \, \Sigma ^{(1)}(y,\rho ) \, \nonumber \\{} & {} +\, a \, \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y;\rho ) \, + \, {\mathcal {O}}(a^2). \end{aligned}$$
(233)

7.3.3 General factorization scheme

By dividing the partial rate by the total rate, we obtain the event fraction E to \({\mathcal {O}}(\alpha _S)\), which has a general factorization of the form:

$$\begin{aligned} E(y;\rho ,a) \, = \, C(\rho ,a) \, \Sigma (y,\rho ,a) \, + \, \textrm{Rem}(y,\rho ,a), \end{aligned}$$
(234)

where:

$$\begin{aligned} C(\rho ,a) \, \equiv \, 1 \, - \, a \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y=1;\rho ) \, + \, {\mathcal {O}}(a^2),\nonumber \\ \end{aligned}$$
(235)

\(\Sigma (y,\rho ,a)\) has been given to \({\mathcal {O}}(a)\) in Eq. (148) and

$$\begin{aligned} \textrm{Rem}(y,\rho ,a) \, \equiv \, a \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y;\rho ) \, + \, {\mathcal {O}}(a^2).\nonumber \\ \end{aligned}$$
(236)

7.3.4 Improved factorization scheme

Let us now describe the improved factorization scheme in the general case. The terms in the massive Remainder function are naturally decomposed into a commuting piece and a non-commuting one, namely:

$$\begin{aligned} \textrm{Rem}_{ij}^{(1)}(y;\rho ) \, = \, L_{ij}^{(1)}(y;\rho ) \, + \, N_{ij}^{(1)}(y;\rho ), \end{aligned}$$
(237)

where:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} \left[ \lim _{y \rightarrow 0^+} L_{ij}^{(1)}(y;\rho ) \right] \, = \, \lim _{y \rightarrow 0^+} \left[ \lim _{\rho \rightarrow 0^+} L_{ij}^{(1)}(y;\rho ) \right] ,\nonumber \\ \end{aligned}$$
(238)

while:

$$\begin{aligned} \lim _{\rho \rightarrow 0^+} \left[ \lim _{y \rightarrow 0^+} N_{ij}^{(1)}(y;\rho ) \right] \, \ne \, \lim _{y \rightarrow 0^+} \left[ \lim _{\rho \rightarrow 0^+} N_{ij}^{(1)}(y;\rho ) \right] .\nonumber \\ \end{aligned}$$
(239)

The splitting in Eq. (237) can be made on a term-by-term analysis of the Remainder function contributions, either analytically or numerically. The commuting terms do not pose any problem and are treated as in the general factorization scheme above. If \(L_{ij}(y=0;\rho ) \ne 0\), this term gives a contribution \(L_{ij}(y=0;\rho )\) to the improved Coefficient function and a contribution \(L_{ij}(y;\rho )-L_{ij}(y=0;\rho )\) to the improved Remainder function.

Let us now consider the (more complicated) non-commuting terms. We can assume that:

$$\begin{aligned} \lim _{y \rightarrow 0^+} N_{ij}^{(1)}(y;\rho ) \, = \, 0. \end{aligned}$$
(240)

If that is not the case, we impose the above limit by simply subtracting from \(N_{ij}^{(1)}(y;\rho )\) its value at \(y=0\):

$$\begin{aligned} N_{ij}^{(1)}(y;\rho ) \quad \mapsto \quad N_{ij}^{(1)}(y;\rho ) \, - \, N_{ij}^{(1)}(0;\rho ). \end{aligned}$$
(241)

The constant \(N_{ij}^{(1)}(0;\rho )\) is then added back to the commuting contributions (being independent on y, it is trivially a commuting term).

Now the idea is simply to treat the non-commuting terms just like the term \(y/(y+\rho )\) above, by means of the Partition of Unity, so that:

  1. 1.

    The \({\mathcal {O}}(\alpha _S)\) improved Coefficient function is obtained by adding to the general Coefficient function \(C^{(1)}(\rho )\) above, Eq. (235), the terms \(\Phi _\Delta (y-\rho ) \, N_{ij}^{(1)}(y;\rho )\), giving:

    $$\begin{aligned}{} & {} C_I^{(1)}(y;\rho ) \, = \, - \, \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y=1;\rho ) \, \nonumber \\{} & {} \quad + \, \Phi _\Delta (y-\rho )\sum _{(i,j) \ne (8,8)} N_{ij}^{(1)}(y;\rho ); \end{aligned}$$
    (242)
  2. 2.

    The \({\mathcal {O}}(\alpha _S)\) improved Remainder function is obtained from the general one, \(\textrm{Rem}^{(1)}(y;\rho )\) in Eq. (236), by replacing the non-commuting terms \(N_{ij}^{(1)}(y;\rho )\) with the terms \(\Phi _\Delta (\rho -y) \, N_{ij}^{(1)}(y;\rho )\) respectively, giving:

    $$\begin{aligned}{} & {} \textrm{Rem}_I^{(1)}(y;\rho ) \, \nonumber \\{} & {} \quad = \, \sum _{(i,j) \ne (8,8)} \left[ L_{ij}^{(1)}(y;\rho ) \, + \, \Phi _\Delta (\rho -y) \, N_{ij}^{(1)}(y;\rho ) \right] .\nonumber \\ \end{aligned}$$
    (243)

By using Eq. (203), it is immediate to check that

$$\begin{aligned}{} & {} \Phi _\Delta (y-\rho ) \, N_{ij}^{(1)}(y;\rho ) \, + \, \Phi _\Delta (\rho -y) \, N_{ij}^{(1)}(y;\rho ) \, \nonumber \\{} & {} \quad = \, N_{ij}^{(1)}(y;\rho ), \end{aligned}$$
(244)

so that the \({\mathcal {O}}(\alpha _S)\) event fraction is correctly reproduced.

7.3.5 Alternative formulation

As in the previous explicit computation, the y-dependent contribution to the improved Coefficient function can be simplified by taking the massless limit \(\rho \rightarrow 0^+\) in \(N_{ij}^{(1)}(y;\rho )\), giving:

$$\begin{aligned}{} & {} C_I^{(1)}(y;\rho ) \, \nonumber \\{} & {} \quad = \, - \, \sum _{(i,j) \ne (8,8)} \textrm{Rem}_{ij}^{(1)}(y=1;\rho ) \, + \, \Phi _\Delta (y-\rho )\nonumber \\{} & {} \qquad \sum _{(i,j) \ne (8,8)} N_{ij}^{(1)}(y;\rho =0). \end{aligned}$$
(245)

That generalizes the partial fractioning made in the \(O_7\,O_7\)-case

$$\begin{aligned} \frac{y}{y + \rho } \, = \, 1 \, - \, \frac{\rho }{y + \rho }. \end{aligned}$$
(246)

The improved Remainder function corresponding to the improved Coefficient function given in Eq. (245) then reads:

$$\begin{aligned}{} & {} \textrm{Rem}_I^{(1)}(y;\rho ) \, = \, \sum _{(i,j) \ne (8,8)}\nonumber \\{} & {} \left[ L_{ij}^{(1)}(y;\rho ) \, + \, N_{ij}^{(1)}(y;\rho ) \, - \, \Phi _\Delta (y-\rho ) \, N_{ij}^{(1)}(y;\rho =0) \right] .\nonumber \\ \end{aligned}$$
(247)

It is easy to check that also the function on the r.h.s. above vanishes for \(y \rightarrow 0^+\), as it should. Just remember Eq. (240) and the definition of the function \(\Phi _\Delta (x)\), Eq. (197), together with the fact that \(\Delta < \rho \) (note that \(N_{ij}^{(1)}(y;\rho =0)\) does not vanish for \(y \rightarrow 0^+\) because of Eq. (239)).

7.3.6 Double insertion of \(O_8\)

Let us now consider the effects of the operator \(O_8\) in the radiative decay \(B \rightarrow X_s \gamma \). This operator is obtained from \(O_7\) simply by replacing the electromagnetic field strength \(F_{\mu \nu }\) by the QCD one, \(G_{\mu \nu }^a t^a\), as well as by replacing the electric charge e by the color charge \(g_s\) (\(\alpha _{em} \equiv e^2/(4\pi ), \,\, \alpha _S \equiv g_s^2/(4\pi )\)). As a consequence, the lowest-order matrix element of \(O_8\) induces the decay

$$\begin{aligned} b \, \rightarrow \, s \, + \, g. \end{aligned}$$
(248)

Note that the gluon and the strange quark are emitted locally by \(O_8\). There are no photons in the final state, which experimentally consists, in the beauty rest frame, of two back-to-back hadronic jets. The topology of the final states in the decay (248) is quite different from that of the tree-level decay mediated by \(O_7\),

$$\begin{aligned} b \, \rightarrow \, s \, + \, \gamma , \end{aligned}$$
(249)

which consists of one hadronic jet recoiling against a (high-energy) photon.

The lowest-order contribution of \(O_8\) to the differential photon spectrum is then a spike at vanishing photon energy:

$$\begin{aligned} \frac{d \Gamma ^{(0)}_{88}}{dx}(x,\rho ) \, = \, \Gamma ^{(0)}_{88}(\rho ) \, \delta (x), \end{aligned}$$
(250)

where:

$$\begin{aligned} \Gamma ^{(0)}_{88}(\rho ) \, = \, \frac{G_F^2 m_b^5}{32 \pi ^3} \left| C_8^{\textrm{eff}} \lambda _t \right| ^2 \frac{C_F \alpha _S}{\pi } \, \frac{1+2\rho }{(1+\rho )^4}. \end{aligned}$$
(251)

\(C_8^{\textrm{eff}}\) is an effective (or improved) Wilson Coefficient function of \(O_8\), including the (non-vanishing) contributions from the 4-fermion operators [12]:

$$\begin{aligned} C_8^{\textrm{eff}} \, = \, C_8 \, + \, C_5. \end{aligned}$$
(252)

Note that \(\Gamma ^{(0)}_{88}(\rho )\) is obtained from \(\Gamma ^{(0)}_{77}(\rho )\) above simply replacing \(C_7^{\textrm{eff}}\) with \(C_8^{\textrm{eff}}\) and \(\alpha _{em}\) with \(C_F \alpha _S\).

Let us now consider the \({\mathcal {O}}(\alpha _{em})\) corrections to the decay (248) involving a real photon, contributing to the process

$$\begin{aligned} b \, \rightarrow \, s \, + \, g \, + \, \gamma . \end{aligned}$$
(253)

The gluon is again emitted locally by \(O_8 \), while the photon is emitted by the beauty and the strange quark lines. The contribution to the differential photon spectrum reads [12]:

$$\begin{aligned} \frac{1}{\Gamma ^{(0)}_{88}} \frac{d \Gamma ^{(1R)}_{88}}{dx} \, = \, \frac{\alpha _{em} }{\pi } \, e_d^2 \, X^{(1R)}, \end{aligned}$$
(254)

where \(e_d=-1/3\) and:

$$\begin{aligned} X^{(1R)}= & {} \frac{1}{2 \, (1-r)} \left[ \frac{1 + (1-x)^2}{x} \, + \, r \, \frac{2 + x(2-x)}{x} \right] \nonumber \\{} & {} \times \left[ \log \frac{1}{r} \, + \, \log (1-x + r \, x) \right] \, + \qquad \nonumber \\{} & {} - \frac{2 \, (1-x)}{x} \, + \, \frac{1-r}{4} \left( \frac{1-x}{1-x+rx} \right) ^2 (1-2x) \,\nonumber \\{} & {} - \, \frac{1-r}{4} \frac{1-x}{1-x+rx} \left( 1-x+2x^2 \right) . \end{aligned}$$
(255)

The above spectrum contains a mass singularity of collinear origin for \(m_s\rightarrow 0^+\), as well as a soft singularity for vanishing photon energy (\(x \rightarrow 0^+\)):

$$\begin{aligned} \frac{1}{ \Gamma ^{(0)}_{88} } \frac{ d \Gamma _{88}^{(R)} }{dx} \,\simeq & {} \, \delta (x) \, + \, e_d^2 \, \frac{\alpha _{em} }{2 \pi } \, {\hat{P}}^{(0)}_{\gamma e}(x) \log \left( \frac{m_b^2}{m_s^2} \right) \, \nonumber \\{} & {} - \, e_d^2 \,\frac{\alpha _{em} }{\pi } \, \frac{2}{x}, \qquad 0 < x, r \ll 1, \end{aligned}$$
(256)

where \({\hat{P}}^{(0)}_{\gamma e}(x)\) is the leading-order unregularized QED splitting function of an electron (or a positron) into a photon:

$$\begin{aligned} {\hat{P}}^{(0)}_{\gamma e}(x) \, = \, {\hat{P}}^{(0)}_{ee}(1-x) \, = \, \frac{1 + (1-x)^2}{x}. \end{aligned}$$
(257)

By coupling the quarks to the electromagnetic field, the strange quark produces a QED jet, as the leading contribution on the r.h.s. of Eq. (256) consists of a soft photon emitted at a small angle with respect to the strange quark motion direction. Note that the topology of the \(b \rightarrow s + g + \gamma \) final states mediated by \(O_8\) involves two back-to-back jets, initiated by the strange quark and by the gluon. The jet initiated by the strange quark also contains the detected photon. Experimentally, a large hadronic activity around the final photon is then expected. The topology of the \(b \rightarrow s + g + \gamma \) final states mediated by \(O_7\) is quite different, as it involves one hadronic jet containing, to \({\mathcal {O}}(\alpha _S)\), the strange quark and the gluon, recoiling against the (hard) photon. In the latter, \(O_7O_7\)-case, the photon is then expected to be isolated.

The r.h.s. of Eq. (256) also contains a single-logarithmic term \(\propto 1/x\) (upon integration over x), coming from soft, not collinearly enhanced, radiation off the beauty and the strange quarks (the factor two comes indeed from having two massive quarks in the process). This soft radiation is roughly isotropic in space (in beauty rest frame) and is then not naturally associated to any jet; it represents a kinematic violation of independent jet fragmentation – the latter coming from angular orderingFootnote 5 – at the next-to-leading level.Footnote 6 As well known, the main effect of the \({\mathcal {O}}(\alpha _{em})\) virtual corrections to the \(O_8O_8\) tree-level decay, is to introduce a plus regularization in the splitting function and in the soft-singular function, so that:

$$\begin{aligned} \frac{1}{ \Gamma ^{(0)}_{88} } \, \frac{ d \Gamma ^{(R+V)} }{dx} \,\simeq & {} \, \delta (x) \, + \, e_d^2 \, \frac{\alpha _{em} }{2 \pi } \, P^{(0)}_{\gamma e}(x) \log \left( \frac{m_b^2}{m_s^2} \right) \,\nonumber \\{} & {} - \,2 \, e_d^2 \, \frac{\alpha _{em} }{\pi } \, \left( \frac{1}{x} \right) _+, \end{aligned}$$
(258)

where:

$$\begin{aligned} P^{(0)}_{\gamma e}(x) \,= & {} \, \left[ \frac{1+(1-x)^2}{x} \right] _+ \equiv \frac{1+(1-x)^2}{x} \,\nonumber \\{} & {} - \, \delta (x) \int \limits _0^1\frac{1+(1-y)^2}{y} \, dy \end{aligned}$$
(259)

and

$$\begin{aligned} \left( \frac{1}{x} \right) _+ \, \equiv \, \frac{1}{x} \, - \, \delta (x) \int \limits _0^1 \frac{dy}{y}. \end{aligned}$$
(260)

By adding virtual photon corrections, soft singularities cancel (in a distribution sense in the differential distribution), while collinear singularities, for \(m_s \rightarrow 0^+\), do not. Therefore one has to factorize the QED collinear logarithm above by means of an ad-hoc fragmentation function, \(D_{\gamma s}(x,Q^2)\). The latter is a universal, i.e. process-independent, function, which can be interpreted as the probability of finding a photon inside a jet initiated by a strange quark, with a fraction x of the initial strange energy, in a process with hard scale Q (\(Q=m_b\) in our case). Since the strange quark mass is of the order of the QCD scale,

$$\begin{aligned} m_s \, \approx \, \Lambda _{QCD}, \end{aligned}$$
(261)

substantial non-perturbative corrections are expected. In order to avoid the introduction of a non-perturbative function – leading in real life to a loss of predictivity –, one can modify the definition of the observed final states, by requiring, for example, the photon to be angularly isolated, in some way, from the final partons/hadrons in the event.

Finally, let us remark that the soft-photon region, \(x \ll 1\), where the operator \(O_8\) dominates, is experimentally not interesting due to the huge background.

8 Conclusions

We have considered various factorization schemes for threshold resummation in processes involving the decay of a heavy quark into a different heavy (massive) quark, accompanied by non-colored partons, i.e. in practice photons, leptons or vector bosons.

By taking the radiative \(B \rightarrow X_s \gamma \) decay as a model process and restricting ourselves to the leading operator \(O_7\) in the effective \(b \rightarrow s \gamma \) weak Hamiltonian, we have first considered soft gluons only and we have constructed a simple soft factorization scheme. The latter can be consistently applied to the heavy quark decays so long as the ordinary velocity of the final quark, in the initial quark rest frame, is not too large (compared to light velocity c), so that collinear effects (collinear logarithms) are not large. The soft scheme can be probably applied to the CKM-favored semileptonic B decays,

$$\begin{aligned} B \, \rightarrow \, X_c \, + \, l \, + \, \nu _l, \end{aligned}$$
(262)

as the charm ordinary 3-velocity \(v_c\) never becomes too large in this case:

(263)

We have then constructed a general massive factorization scheme, which correctly works for a non-zero (and not too small) final quark mass \(m_s\). However, we have found that this scheme does not behave well in the massless limit \(m_s \rightarrow 0^+\), because its Coefficient function and its Remainder function do not approach, in this limit, the corresponding functions of the standard factorization formula constructed after taking the massless limit of the photon spectrum. We have shown that this mismatch is generated by a simple term in the photon spectrum (equivalent to the distribution in the final hadron invariant mass squared \(m_{X_s}^2\)), namely the term, in units of \(C_F \alpha _S/(4\pi )\),

$$\begin{aligned} \frac{m_s^2 \, - \, m_{X_s}^2}{ m_{X_s}^2 }. \end{aligned}$$
(264)

Indeed, for the above term, the massless limit \(m_s \rightarrow 0^+\) and the threshold limit \(m_{X_s} \rightarrow m_s^+\) do not commute with each other. In terms of the mass-correction parameter \(\rho \equiv m_s^2/(m_b^2-m_s^2)\) and the threshold variable \(y \equiv (m_{X_s}^2-m_s^2)/(m_b^2-m_s^2)\), the above term has been written in the main body of the paper as

$$\begin{aligned} - \, \frac{y}{y \, + \, \rho }. \end{aligned}$$
(265)

In the new variables, it is immediate to check that the massless limit \(\rho \rightarrow 0^+\) and the threshold limit \(y \rightarrow 0^+\) do not commute with each other. It is natural to expect the appearance of such terms in the photon spectrum (for which the limits \(y \rightarrow 0^+\) and \(\rho \rightarrow 0^+\) do not commute with each other) to be generic and not restricted to the \(O_7\) operator. A general discussion of the effects, in the \(B \rightarrow X_s \gamma \) photon spectrum, of all the subleading operators in the effective weak Hamiltonian has also been presented.

Since in semileptonic \(b \rightarrow c\) decays, Eq. (262), the heavy quark mass ratio is rather large, \(m_c/m_b \approx 1/3\), we expect the general massive scheme to be consistently applied to describe them. We also expect the massive scheme and the soft scheme to give quantitatively similar results for these decays.

Finally, we have constructed an improved factorization scheme for the massive case, \(m_s \ne 0\), which has the correct massless limit \(m_s \rightarrow 0^+\). That is the main result of our work. A main point is that, to that aim, we have been forced to introduce in the improved Coefficient function \(C_I\), in addition to the standard dependence on the mass-correction parameter \(\rho \), also a dependence on the threshold variable y:

$$\begin{aligned} C_I \, = \, C_I(\rho ,y). \end{aligned}$$
(266)

Actually, we constructed an improved Coefficient function which has a smooth dependence both on y and \(\rho \), and which is close to the massless Coefficient function in the quasi-massless region \(y \gg \rho \). The mathematical tool we needed is the so-called Partition of Unity.

By means of the improved factorization scheme, it is possible to describe, with a unique formalism and in a smooth way, both the quasi-massless kinematic region \(\rho \ll y\) and the pure soft region \(y \ll \rho \), which are dynamically quite different, as well as the transition region \(y \approx \rho \). As is often the case, the interpolation between the asymptotic regions above, to the slice \(y \approx \rho \) is, to some extent, arbitrary, as many different functions can be used to that aim. In other words, there is an ambiguity, in the interpolation from the region \(\rho \ll y\) to the region \(y \ll \rho \), which is never solved in perturbation theory, but only shifted formally to higher orders.

The improved factorization scheme can be applied, for example, to CKM-favored top quark decays \(t \rightarrow X_b + W\), in the study of the final hadron invariant squared mass distribution in the intermediate or transition region

$$\begin{aligned} m_{X_b}^2 \, - \, m_b^2 \, \approx \, m_b^2 \, \ll \, m_t^2, \end{aligned}$$
(267)

as well as in the soft region

$$\begin{aligned} m_{X_b}^2 \, - \, m_b^2 \, \ll \, m_b^2 \, \ll \, m_t^2 \end{aligned}$$
(268)

and in the effectively-massless region

$$\begin{aligned} m_b^2 \, \ll \, m_{X_b}^2 \, \ll \, m_t^2. \end{aligned}$$
(269)

The improved scheme can also be applied to more complicated decays, such as for example the semileptonic \(b \rightarrow c\) decay, Eq. (262) – a three-body decay at tree level.

We believe that our scheme can be generalized to all the hard processes in which one observes the hadron invariant mass \(m_X\) of a jet X initiated by a heavy quark Q in all the possible kinematic regions, namely \(m_X - m_Q \ll m_Q\), \(m_X - m_Q \approx m_Q\) and \(m_X \gg m_Q\).

We have explicitly constructed the improved factorization scheme at order \(\alpha _S\), i.e. at Next-to-Leading Logarithmic (NLL) accuracy. It would be interesting to explicitly extend the scheme to the next order, i.e. with \({\mathcal {O}}(\alpha _S^2)\) Coefficient and Remainder functions. The idea of using the Partition of Unity could also be generalized to higher orders.

An extension of our scheme in a different direction could be the construction of an improved factorization formula for resummed transverse momentum distributions involving heavy quarks.