1 Introduction

Over the last few decades, our understanding of CP violation has made great progress, with many new constraints from BaBar, Belle and LHCb experiments among others [1, 2]. These constraints were shown to be in remarkable agreement with each other and to support the Kobayashi–Maskawa mechanism of CP violation at work within the Standard Model (SM) with three generations [3, 4]. This has led to an accurate determination of the Cabibbo–Kobayashi–Maskawa matrix (CKM) encoding the pattern of CP violation as well as the strength of the weak transitions among quarks of different generations [5,6,7,8,9,10]. These constraints prove also essential in assessing the viability of New Physics models with well-motivated flavour structures [11,12,13,14].

As the CKM matrix is related to quark-flavour transitions, most of these constraints are significantly affected by hadronic uncertainties due to QCD binding quarks into the observed hadrons. However, some of these constraints have the very interesting feature of being almost free from such uncertainties. This is in particular the case for the constraints on the CKM angle \(\alpha \) that are derived from the isospin analysis of the charmless decay modes \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \). Indeed, assuming the isospin symmetry and neglecting the electroweak penguin contributions, the amplitudes of the \({SU}(2)\)-conjugated modes are related. The measured branching fractions and asymmetries in the \(\mathcal{B} ^{\pm ,0}\rightarrow (\pi \pi )^{\pm ,0}\) and \(B^{\pm ,0}\rightarrow (\rho \rho )^{\pm ,0}\) modes and the bilinear form factors in the Dalitz analysis of the \(B^{0}\rightarrow (\rho \pi )^0\) decays provide enough observables to simultaneously extract the weak phase \(\beta +\gamma =\pi -\alpha \) together with the hadronic tree and penguin contributions to each mode [15,16,17]. Therefore, these modes probe two different corners of the SM: on one side, they yield information on \(\alpha \) that is a powerful constraint on the Kobayashi–Maskawa mechanism (and the CKM matrix) involved in weak interactions, and on the other side, they provide a glimpse on the strong interaction and especially the hadronic dynamics of charmless two-body B-decays.

In the following, we will provide a thorough analysis of these decays, based on the data accumulated up to the conferences of Winter 2017. Combining the experimental data for the three decay modes above, we obtain the world-average value at 68% Confidence Level (CL):

$$\begin{aligned} \alpha _\mathrm{dir} = {\left( {86.2}_{-4.0}^{+4.4} ~\cup ~ {178.4}_{-5.1}^{+3.9}\right) }^{\circ }. \end{aligned}$$
(1)

The solution near \(90^\circ \) is in good agreement with the indirect determination obtained by the global fit of the flavour data performed by the CKMfitter group in Summer 2016 [18]:

$$\begin{aligned} \alpha _\mathrm{ind} = {\left( {92.5}_{-1.1}^{+1.5}\right) }^{\circ }. \end{aligned}$$
(2)

Considered separately, the \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) decays yield direct determinations of \(\alpha \) in very good agreement with the indirect determination Eq. (2), whereas the \(B\rightarrow \rho \pi \) decay exhibits a 3\(\sigma \) discrepancy. This discrepancy affects only marginally the combination Eq. (1), which is dominated by the results from \(\rho \rho \) decays, and to a lesser extent \(\pi \pi \) decays, whereas \(\rho \pi \) modes play only a limited role. At this level of accuracy, it proves interesting to assess uncertainties neglected up to now, namely the sources of violation of the assumptions underlying these determinations (\(\varDelta I={3} / {2}\) electroweak penguins, \(\pi ^0\)\(\eta \)\(\eta '\) mixing, \(\rho \) width) and the role played by the statistical framework used to extract the confidence intervals. These effects may shift the central value of \(\alpha _\mathrm{dir}\) by around \(2^\circ \), while keeping the uncertainty around \(4^\circ \) to \(5^\circ \), thus remaining within the statistical uncertainty quoted in Eq. (1).

Besides the CKM angle \(\alpha \), the isospin analysis of the charmless decays data provides a determination of the hadronic tree and penguin parameters for each mode. The penguin-to-tree and colour-suppression ratios in the \(B\!\rightarrow \pi \pi \) and \(B\!\rightarrow \rho \rho \) decay modes can be determined precisely: they show overall good agreement with theoretical expectations for \(\rho \rho \) modes, whereas the ratio between colour-allowed and -suppressed tree contributions for \(\pi \pi \) modes do not agree well with theoretical expectations (both for the modulus and the phase). We also perform indirect predictions of the experimental observables, using all the other available measurements to predict the value of a given observable, and we can compare our predictions to the existing measurements: a very good compatibility is observed for the \(\pi \pi \) and \(\rho \rho \) channels, whereas discrepancies occur in some of the observables describing the Dalitz plot for \(B\rightarrow \rho \pi \) decays. Among many other quantities, the yet-to-be-measured mixing-induced CP asymmetry in the \(B^0\!\rightarrow \pi ^0\pi ^0\) decay is predicted at 68% CL:

$$\begin{aligned} \mathcal{S} ^{00}_{\pi \pi } = {0.65}\pm {0.13}, \end{aligned}$$
(3)

using as an input the indirect value of \(\alpha \); see Eq. (2). Finally, we study how the determination of \(\alpha \) would be affected if the accuracy of specific subsets of observables is improved through new measurements. We find that an improved accuracy for the time-dependent asymmetries in \(B^0\rightarrow \rho ^0\rho ^0\) and the measurement of \(\mathcal{S} ^{00}_{\pi \pi }\) would reduce the uncertainty on \(\alpha \) in a noticeable way.

The rest of this article goes as follows. In Sect. 2, we discuss the basics of isospin analysis for charmless B-meson decays. In Sect. 3, we provide details on the extraction of the \(\alpha \) angle, focusing on the \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) modes in turn, before combining these extractions in a world average. In Sect. 4, we consider the uncertainties attached to the extraction of \(\alpha \). First, we discuss the \({SU}(2)\) isospin framework underlying these analyses, considering three sources of corrections: the presence of \(\varDelta I={3} / {2}\) electroweak penguins (isospin breaking due to different charges for the u and d quarks), \(\pi ^0\)\(\eta \)\(\eta '\) mixing (isospin breaking due to different masses for the u and d quarks), \(\rho \) width (additional amplitude to include in the isospin relations for \(B\rightarrow \rho \rho \)). In addition, we discuss the statistical issues related to our frequentist framework. In Sect. 5, we extract the hadronic (tree and penguin) amplitudes associated with each decay, comparing them briefly with theoretical expectations. We use our framework to perform indirect predictions of observables in each channel and discuss the compatibility with the available measurements in Sect. 6. We perform a prospective study to determine the impact of reducing the experimental uncertainties on specific observables in Sect. 7, before drawing our conclusions. Dedicated appendices gather additional numerical results for observables in three modes, separate analyses using either the BaBar or Belle inputs only, and a brief discussion of the quasi-two-body analysis of the charmless \(B^0\rightarrow a_1^\pm \pi ^{\mp }\) that may provide some further information on \(\alpha \).

2 Isospin decomposition of charmless two-body B decays

The decay of neutral \(B_d\) and charged \(B_u\) mesons into a pair of light unflavoured isovector mesons \(B\rightarrow h_1^ih_2^j\) (h=\(\pi ,\rho \) and \(i,j=-,0,+\)) is described by the weak transition \(\bar{b}\rightarrow \bar{u} u\bar{d}\), followed by the hadronisation of the \((\bar{u} u\bar{d},q)\) system, where \(q=d(u)\) is the spectator quark in the neutral (charged) component of the mesons isodoublet. The weak process receives dominant contributions from both the tree-level \(\bar{b}\rightarrow \bar{u}(u\bar{d})\) charged transition and the flavour-changing neutral current penguin transition, \(\bar{b}\rightarrow \bar{d} (u\bar{u})\), whose topologies are shown in Fig. 1.

Fig. 1
figure 1

Tree (left) and penguin (right) topology diagrams for the transition \(\bar{b}\rightarrow u\bar{u} \bar{d}\)

2.1 Penguin pollution

We can write the charmless \(B_q \rightarrow (\bar{u} u\bar{d},q)\rightarrow h_1^ih_2^j\) transition amplitude, accounting for the CKM factors for the tree and penguin diagrams with three different up-type quark flavours (uct) occurring in the W loop:

$$\begin{aligned} {\mathcal {A}}^{ij}= & {} \langle h_1^ih_2^j|\mathcal{H}_\mathrm{eff}|B_d\rangle \nonumber \\= & {} V_{ud}V^*_{ub}(\mathcal {T}^{ij}_u+{\mathcal {P}}^{ij}_u) + V_{cd}V^*_{cb}{\mathcal {P}}^{ij}_c + V_{td}V^*_{tb}{\mathcal {P}}^{ij}_t \end{aligned}$$
(4)

where \(\mathcal{H}_\mathrm{eff}\) is the effective Hamiltonian describing the transition. The amplitudes \(\mathcal{T}^{ij}_u\) and \(\mathcal{P}^{ij}_{u,c,t}\) represents the tree-level and (uct)-loop mediated topologies, respectively.

The unitarity of the CKM matrix can be used to eliminate one of the three terms in Eq. (4), resulting in three conventions \(\mathcal {U}\), \(\mathcal {C}\) and \(\mathcal {T}\) defined as:

$$\begin{aligned} \mathcal{U}: \mathcal{A}^{ij}= & {} V_{cd}V^*_{cb} (\mathcal{P}^{ij}_c-\mathcal{T}^{ij}_u-\mathcal{P}^{ij}_u)\nonumber \\&+\, V_{td}V^*_{tb}(\mathcal{P}^{ij}_t-\mathcal{T}^{ij}_u-\mathcal{P}^{ij}_u),\nonumber \\ \mathcal{C}: \mathcal{A}^{ij}= & {} V_{ud}V^*_{ub} (\mathcal{T}^{ij}_u+\mathcal{P}^{ij}_u-\mathcal{P}^{ij}_c) + V_{td}V^*_{tb}(\mathcal{P}^{ij}_t-\mathcal{P}^{ij}_c),\nonumber \\ \\ \mathcal{T}: \mathcal{A}^{ij}= & {} V_{ud}V^*_{ub} (\mathcal{T}^{ij}_u+\mathcal{P}^{ij}_u-\mathcal{P}^{ij}_t) + V_{cd}V^*_{cb}(\mathcal{P}^{ij}_c-\mathcal{P}^{ij}_t).\nonumber \end{aligned}$$
(5)

The \(\mathcal {C}\)-convention is adopted in the following, so that the amplitudes can be rewritten as

$$\begin{aligned} {\mathcal {A}}^{ij}=V_{ud}V^*_{ub}\mathcal {\tilde{T}}^{ij} + V_{td}V^*_{tb}\mathcal {\tilde{P}}^{ij} \end{aligned}$$
(6)

where the terms \(\mathcal {\tilde{T}}^{ij}=\mathcal {T}^{ij}_u+{\mathcal {P}}^{ij}_u-{\mathcal {P}}^{ij}_c\) and \(\mathcal {\tilde{P}}^{ij}={\mathcal {P}}^{ij}_t-{\mathcal {P}}^{ij}_c\) associated with specific CKM factors will be referred to as ‘tree’ and ‘penguin’ amplitudes, respectively, following a conventional abuse of language (these amplitudes are not physical, in the sense that both mix contributions of distinct topologies and are not renormalisation invariant). The choice of convention is arbitrary and it has no physical implication on the determination of the weak phase affecting the transitions. However, the particular choice sets the dynamical content of the hadronic tree and penguin amplitudes: for instance, it fixes the definition of the penguin-to-tree ratio and the other hadronic quantities discussed in Sect. 5.

Pulling out the weak phases \(\gamma =\arg \left( -\frac{V_{ud}V^*_{ub}}{V_{cd}V^*_{cb}}\right) \) and \(\beta =\arg \left( -\frac{V_{cd}V^*_{cb}}{V_{td}V^*_{tb}}\right) \), the amplitude can be rewritten as

$$\begin{aligned} {\mathcal {A}}^{ij} = -e^{\mathsf {i}\gamma } T^{ij} + e^{-\mathsf {i}\beta } P^{ij}, \end{aligned}$$
(7)

where the magnitude of the CKM products \(R_u=|V_{ud}V^*_{ub}|\) and \(R_t=|V_{td}V^*_{tb}|\) is included in the redefined amplitudes \(T^{ij}=R_u\mathcal{\tilde{T}}^{ij}\) and \(P^{ij}=-R_t\mathcal{\tilde{P}}^{ij}\).

Similarly, the decay amplitudes of the CP-conjugate isodoublet (\(\bar{B}^0\), \(B^-\)) can be expressed as

$$\begin{aligned} \frac{p}{q}\mathcal {\bar{A}}^{ij} =-e^{-\mathsf {i}\gamma } T^{ij} + e^{\mathsf {i}\beta } P^{ij}, \end{aligned}$$
(8)

where the factor \(p/q\sim e^{2\mathsf {i}\beta }\) is included to take into account the \(B^0-\bar{B}^0\) mixing phase of neutral B-meson that arises naturally in physical observables. For consistency between all the \({SU}(2)\)-related decay modes considered in the isospin analysis, the same phase convention has been applied to define the amplitudes for the charged B meson. The CP invariance of the strong interaction means that the same hadronic amplitudes \(T^{ij}\) and \(P^{ij}\) are involved in the CP-conjugate processes, whereas a complex conjugation is applied to the weak phases.

Rotating all amplitudes by the weak phase \(\beta \) through the redefinition \(A^{ij}=e^{\mathsf {i}\beta }{\mathcal {A}}^{ij}\) and \({\bar{A}}^{ij}=e^{\mathsf {i}\beta }\mathcal {\bar{A}}^{ij}\), the parameter \(\alpha =\pi -\beta -\gamma \) appears as half of the phase difference between the tree contributions to the CP-conjugate amplitudes:

$$\begin{aligned} e^{2\mathsf {i}\alpha } = \frac{e^{\mathsf {i}\alpha }T^{ij}}{e^{-\mathsf {i}\alpha }T^{ij}}=\frac{\bar{A}^{ij}-P^{ij}}{A^{ij} - P^{ij}}. \end{aligned}$$
(9)

In the absence of the penguin contributions, \(\alpha \) can be related to the relative phase of CP-conjugate amplitudes describing the \(B^0\) and \(\bar{B}^0\) mesons decaying into the same final state \(h_1^ih_2^j\). In particular, the time-dependent analysis of the \(B^0/{\bar{B}^0}\rightarrow h_1^+h_2^-\) decay yields the CP asymmetry:

$$\begin{aligned} a_{CP}(t)= & {} \frac{\varGamma (\bar{B}^0(t)\rightarrow h_1^+h_2^-)-\varGamma (B^0(t)\rightarrow h_1^+h_2^-)}{\varGamma (\bar{B}^0(t)\rightarrow h_1^+h_2^-)+\varGamma (B^0(t)\rightarrow h_1^+h_2^-)}\nonumber \\= & {} \mathcal{S}^{+-} \sin (\varDelta m_d\ t) - \mathcal{C}^{+-} \cos (\varDelta m_d\ t), \end{aligned}$$
(10)

where \(\varDelta m_d\) is the \(B^0\)\(\bar{B}^0\) oscillation frequency, and t is either the decay time of the meson, or (in the case of B-factories) the time difference between the CP- and tag-side decays. The coefficients can be expressed as

$$\begin{aligned}&\lambda =\frac{\bar{A}^{+-}}{A^{+-}},\nonumber \\&\mathcal{S}^{+-}=\frac{2 \mathrm{Im}\lambda }{1+|\lambda |^2}\, \quad \mathcal{C}^{+-}=\frac{1-|\lambda |^2}{1+|\lambda |^2}. \end{aligned}$$
(11)

Therefore, in the absence of the penguin contributions, the measurement of \(\mathcal{S} ^{+-}\) would yield \(\sin (2\alpha )\). The a priori non-negligible penguin contributions to the B charmless decays modify this picture: if we introduce the effective angle corresponding to the phase of \(\lambda \), we have [19]

$$\begin{aligned} \lambda =|\lambda |e^{2\mathsf {i}\alpha _\mathrm{eff}}, \quad \mathcal{S}^{+-}=\sqrt{1-(\mathcal{C}^{+-})^2} \sin (2\alpha _\mathrm{eff}). \end{aligned}$$
(12)

The non-negligible penguin contributions prevent us from identifying \(\alpha \) and \(\alpha _\mathrm{eff}\) obtained from the sole consideration of \(B^0(t)\rightarrow h_1^+h_2^-\). However, since isospin is conserved during the hadronisation process, we can write useful relations among the hadronic amplitudes in \({SU}(2)\)-related modes. These relationships will help us to determine the amount of penguin pollution from the data and thus to extract \(\alpha \) in spite of hadronic effects [15, 16].

Table 1 General decomposition of the amplitudes \(A^{ij}=\langle h_1^ih_2^j|\mathcal{H}_s|u\bar{u}\bar{d},q\rangle \) (\(q=u,d\); \(i,j=-,0,+\)) in terms of the reduced matrix elements \(A_{\varDelta I,I_f}\) for a pair of distinguishable isovector mesons \(h_1^i\) and \(h_2^j\)

2.2 General isospin decomposition and application to the \(\rho \pi \) final state

One can factorise the decay amplitudes in two parts. First, the weak decay \(\bar{b}\rightarrow u\bar{u} \bar{d}\) (common to all \(B_q\rightarrow h_1^ih_2^j\) decay processes) corresponds to a shift of isospin \(\varDelta I\). The hadronisation into two light mesons can then be described as \(\langle h_1^ih_2^j|\mathcal{H}_s|u\bar{u}\bar{d},q\rangle \) where \(\mathcal{H}_s\) represents the isospin-conserving strong interaction Hamiltonian. The Wigner–Eckart theorem can be used to express these amplitudes in terms of reduced matrix elements, \(A_{\varDelta I,I_f}\), identified by the shift \(\varDelta I\) and the final-state isospin \(I_f\) (\(I_f=0,1,2\)).Footnote 1 Table 1 yields the decomposition of the \(A^{ij}\) amplitudes in the general case of two distinguishable isovector mesons (\(h_1\ne h_2\)).

This general decomposition applies for instance to the \(B\rightarrow \rho \pi \) system. Neglecting the \(\varDelta I ={5} / {2}\) transition, the four remaining isospin amplitudes constrain the five decays amplitudes to follow the pentagonal relation:

$$\begin{aligned} A^{+-}+A^{-+}+2A^{00}=\sqrt{2}(A^{+0}+A^{0+}). \end{aligned}$$
(13)

The same identity applies to the CP-conjugate amplitudes. Moreover, the sum of the decay amplitudes of the charged modes \((A^{+0}+A^{0+})\) is a pure \(A_{\frac{3}{2},2}\) isospin amplitude.

A usual simplifying assumption consists in neglecting the \(\varDelta I={3} / {2}\) contribution from electroweak penguins, as they are expected to be small: in this limit, all penguins (gluonic and electroweak) are mediated only by the \(\varDelta I={1} / {2}\) transition \(\bar{b}\rightarrow \bar{d}(u\bar{u})_{I=0}\). The isospin relation for the amplitudes Eq. (13) may then be projected onto the penguin amplitudes, leading to the triangular relations:

$$\begin{aligned} P^{+-} + P^{-+} + 2P^{00} = \sqrt{2}(P^{+0} + P^{0+}) =0. \end{aligned}$$
(14)

The two combinations are identical and vanish under our assumptions, as they involve only \(\varDelta I={3} / {2}\) amplitudes. Under this assumption, some combinations of the decay amplitudes are free from penguin contributions. The parameter \(\alpha \) can thus be identified as half of the phase difference between the sum of the CP-conjugate amplitudes of the charged modes, or equivalently as a function of the amplitudes of the neutral modes only:

$$\begin{aligned} e^{2\mathsf {i}\alpha } = \frac{\bar{A}^{+0}+\bar{A}^{0+}}{A^{+0} + A^{0+}}=\frac{\bar{A}^{+-}+\bar{A}^{-+}+2\bar{A}^{00}}{A^{+-}+A^{-+}+2A^{00}}. \end{aligned}$$
(15)

Similarly, the combination of amplitudes \((A^{+0}-A^{0+})-\sqrt{2}(A^{+-}-A^{-+})\) receives a pure \(A_{\frac{3}{2},1}\) contribution and is thus free of penguin contributions, so that

$$\begin{aligned} P^{+0}-P^{0+}=\sqrt{2}(P^{+-}-P^{-+}), \end{aligned}$$
(16)

and the phase \(\alpha \) can also be obtained from the pure tree combination:

$$\begin{aligned} e^{2\mathsf {i}\alpha } = \frac{ (\bar{A}^{+0}-\bar{A}^{0+})-\sqrt{2}(\bar{A}^{+-}-\bar{A}^{-+}) }{( A^{+0}- A^{0+})-\sqrt{2}( A^{+-}- A^{-+})}. \end{aligned}$$
(17)

Combining Eqs. (14) and (16), one can see that there are actually only two independent penguin amplitudes involved:

$$\begin{aligned} P^{+0}= & {} - P^{0+}=(P^{+-}-P^{-+})/\sqrt{2},\nonumber \\ P^{00}= & {} -(P^{+-}+P^{-+})/2, \end{aligned}$$
(18)

illustrating the power of the \({SU}(2)\) isospin approach to reduce the number of independent hadronic quantities to be determined from the data in order to extract a constraint on \(\alpha \).

2.3 Application to the \(\pi \pi \) and \(\rho \rho \) cases

In the cases where the two isovector mesons in the final state are indistinguishable from the point of view of isospin symmetry (\(h_1=h_2=h\)), as for the \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) systems, only the even amplitudes \(I_f=0,2\) are allowed due to the Bose–Einstein statistics. Defining the symmetrised amplitudes,

$$\begin{aligned} A^{+-}_{h_1=h_2}=\frac{A^{+-}+A^{-+}}{\sqrt{2}},\quad A^{+0}_{h_1=h_2}=\frac{A^{+0}+A^{0+}}{\sqrt{2}}, \end{aligned}$$
(19)

the isospin decomposition gets simplified as reported in Table 2, leading to the triangular identity:

$$\begin{aligned} A^{+0}=\frac{A^{+-}}{\sqrt{2}}+A^{00}, \end{aligned}$$
(20)

with a similar identity for the CP-conjugate amplitudes.Footnote 2 In the case of the \(\rho \rho \) channel, one should consider a different set of independent amplitudes for each of the three possible polarisations (which is identical for the two \(\rho \) mesons).

Under the assumption of negligible contributions from \(\varDelta I={3} / {2}\) electroweak penguins, the penguin relations Eq. (18) reduce down to \(P^{+0}=0\) and \({{P^{+-}}}/{{\sqrt{2}}}=-P^{00}\). The total amplitude of the charged modes \(A^{+0}=e^{-\mathsf {i}\alpha }T^{+0}\) is free from penguin contributions, and the angle \(\alpha \) can be derived from

$$\begin{aligned} e^{-2\mathsf {i}\alpha } = \frac{\bar{A}^{+0}}{A^{+0}}. \end{aligned}$$
(21)

This complex ratio cannot be determined from a single measurement, but it is possible to reconstruct the two isospin triangles corresponding to CP-conjugate amplitudes using branching ratios and CP asymmetries for all the modes. Figure 2 illustrates this construction, which translates the measurement of \(\alpha _\mathrm{eff}\) into a determination of the CKM angle \(\alpha \). This procedure is affected by discrete ambiguities, since there are several manners of reconstructing the two isospin triangles. This leads to a fourfold ambiguity for \(\sin (2\alpha )\), i.e. an eightfold ambiguity on the solutions of \(\alpha \) in \([0,180]^\circ \), in general. These additional solutions are called “mirror solutions”. If one or both triangles are flat, several mirror solutions become degenerate, decreasing the number of distinct solutions for \(\alpha \).

Table 2 Decomposition of the amplitude \(\langle h^ih^j|\mathcal{H}_s|u\bar{u}\bar{d},q\rangle \) (\(q=u,d\); \(i,j=-,0,+\)) in terms of the isospin amplitudes \(A_{\varDelta I,I_f}\) for indistinguishable mesons in the final state (\(h_1=h_2=h\)). A global factor \(\sqrt{3}\) is applied to all the coefficients with respect to the general coefficients given in Table 1
Fig. 2
figure 2

Geometrical representation of the isospin triangular relation \(A^{+0}={A^{+-}}/{\sqrt{2}}+A^{00}\) and its CP-conjugate equivalent in the complex plane of \(B\rightarrow hh\) amplitudes (red and blue shaded areas). The angle between the CP-conjugate charged amplitudes \(A^{+-}\) and \(\bar{A}^{+-}\) corresponds to twice the weak phase \(\alpha _\mathrm{eff}\) (yellow solid arrows), whereas the angle between the CP-conjugate charged amplitudes \(A^{+0}\) and \(\bar{A}^{+0}\) corresponds to twice the weak phase \(\alpha \) (green solid arrows). The other triangles with a lighter shade represents the mirror solutions allowed by the discrete ambiguities in the observables, with the corresponding values for \(\alpha \) represented with green dashed arrows

3 Determining the weak angle \(\alpha \)

3.1 Procedure

If we consider that each of the five amplitudes \(A^{ij}\) \((ij=+-,-+,+0,0+,00)\) receives two complex contributions, tree and penguin, the hadronic contributions to the generic decay system \(B^{i+j}\rightarrow h_1^ih_2^j\) can be parametrised with 20 real parameters, in addition to the weak phase \(\alpha \) (one overall phase being irrelevant). In the case of a Bose-symmetric final state (\(h_1=h_2\)), the dimension of the parameter space reduces down to 13. Assuming the isospin relations between the amplitudes discussed in Sect. 2, the three decay systems \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) provide enough experimental measurements to fully constrain the parameter space, hereafter denoted \({\varvec{{p}}}=(\alpha ,\) \({\varvec{{\mu }}}\)), where \({{\varvec{\mu }}}\) represents the set of independent hadronic parameters (tree and penguin amplitudes). The actual dimension of the parameter space depends on the decay system and will be discussed in the following subsections.

The constraints on the parameter \(\alpha \) are determined by an exploration of the N-dimensional parameters space through the frequentist statistical approach discussed in detail in Refs. [5, 6, 10, 22] and in Sect. 4.2, but we find it useful to briefly summarise its main features here. The set of experimental observables, denoted \({\varvec{\mathcal {O}}}_{\mathrm{exp}}\), is measured in terms of likelihoods that can be used to build a \(\chi ^2\)-like test statistic:

$$\begin{aligned} \chi ^2({{\varvec{p}}})=-2\log {\mathcal {L}}({\varvec{\mathcal {O}}}_{\mathrm{exp}}-{\varvec{\mathcal {O}}}({\varvec{p}})), \end{aligned}$$
(22)

where \({\varvec{\mathcal {O}}}({\varvec{p}})\) represent the theoretical value of the observables in SM for fixed parameters \(\varvec{p}\). The test statistic \(\chi ^2\) is first minimised over the whole parameter space, letting all N parameters \(\varvec{p}\) free to vary. The absolute minimum value of the test statistic, \(\chi ^2_\mathrm{min}\), quantifies the agreement of the data with the theoretical model (assuming the validity of the SM and \({SU}(2)\) isospin symmetry in the present case). Converting \(\chi ^2_\mathrm{min}\) into a p value is, however, not trivial a priori, as one has to interpret \(\chi ^2(\varvec{p})\) as a random variable distributed according to a \(\chi ^2\) law with a certain number of degrees of freedom. The actual number of degrees of freedom of the system can be ill defined in the case where the experimental observables are interdependent (see for instance the related discussion in Sect. 3.4). In the case of M independent observables, the number of degrees of freedom of the system is defined as \(N_{\mathrm{dof}}=M-N\). This occurs in the Gaussian case, but it also can apply in non-Gaussian cases in the limit of large samples, under the conditions of Wilks’ theorem [23].

It is also possible to perform the metrology of specific parameters of the model, and in particular \(\alpha \) [5, 6, 10, 22]. Indeed, if we consider the \(\varvec{\mu }\) hadronic parameters as “nuisance parameters”, we can define the test statistic from the \(\chi ^2\) difference:

$$\begin{aligned} \Delta \chi ^2(\alpha ) = \min _{\varvec{\mu }}[\chi ^2(\alpha )] - \chi ^2_\mathrm{min} \end{aligned}$$
(23)

where \(\min _{\varvec{\mu }}[\chi ^2(\alpha )]\) is the value of \(\chi ^2\), minimised with respect to the nuisance parameters for a fixed \(\alpha \) value. This test statistic assesses how a given hypothesis on the true value of \(\alpha \) agrees with the data, irrespective of the value of the nuisance parameters. Confidence intervals on \(\alpha \) can be derived from the resulting p value, which is computed assuming that \(\varDelta \chi ^2(\alpha )\) is \(\chi ^2\)-distributed with one degree of freedom:

$$\begin{aligned} p(\alpha ) = \mathrm{Prob}(\varDelta \chi ^2(\alpha ),N_{\mathrm{dof}}=1), \qquad \end{aligned}$$
(24)

with

$$\begin{aligned} \mathrm{Prob}(\varDelta \chi ^2,N_{\mathrm{dof}})=\frac{\varGamma (N_{\mathrm{dof}}/2,\varDelta \chi ^2/2)}{\varGamma (N_{\mathrm{dof}}/2)}. \end{aligned}$$
(25)

One can express \(\mathrm{Prob}\) for \(N_{\mathrm{dof}}=1\) in terms of the complementary error function:

$$\begin{aligned} \mathrm{Prob}(\varDelta \chi ^2(\alpha ),N_{\mathrm{dof}}=1)=\mathrm{Erfc}\left( \sqrt{\varDelta \chi ^2(\alpha )/2}\right) . \end{aligned}$$
(26)

Confidence intervals at a given confidence level (CL) are obtained by selecting the values of \(\alpha \) with a p value larger than \(1-\)CL. The derivation, robustness and coverage of this definition for the p value will be further discussed in Sect. 4.2.

Although the relevant information on the \(\alpha \) constraint is fully contained in the p value function \(p(\alpha )\), confidence intervals will be derived in the following subsections. It is worth noticing that the p value for \(\alpha \) usually presents a highly non-Gaussian profile and that the \({SU}(2)\) isospin analysis suffers from (pseudo-)mirror ambiguities. Therefore, the confidence intervals provided must be interpreted with particular care.

We consider here all the experimental inputs available up to Winter 2017 conferences (for each channel, we provide the corresponding references). We compare the results of the isospin-based direct determination of \(\alpha \) from these inputs with the indirect result from the global CKM fit obtained in Summer 2016. This indirect determination includes all the quark-flavour constraints described in Refs. [10, 18], apart from the inputs from the \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) decays used for the direct determination of \(\alpha \). The indirect determination, hereafter denoted \(\alpha _\mathrm{ind}\), is illustrated in Fig. 3 and the corresponding 68% CL interval is given in Eq. (2).

Fig. 3
figure 3

On the top p value as a function of the assumed value for \(\alpha \), extracted from the global CKM fit excluding direct information on \(\alpha \) from charmless B decays. On the bottom 95% confidence-level constraint (light blue area) in the \((\bar{\rho },\bar{\eta })\) plane defining the apex of the B-meson unitarity triangle. The corresponding 68% confidence-level interval, indicated by the dark blue area on the two figures, is given by Eq. (2)

3.2 Isospin analysis of the \(B\rightarrow \pi \pi \) system

The three decays \(B^{0,+}\rightarrow (\pi \pi )^{0,+}\) depend on 12 hadronic parameters and the weak phase \(\alpha \). One can set one irrelevant global phase to zero, and one can eliminate further parameters using the complex isospin relation Eq. (20) and its CP conjugate (four real constraints) as well as the absence of the penguin contribution to the charged mode amplitude (two real constraints). The remaining system of amplitudes features six degrees of freedom. The isospin-related \(B\rightarrow \pi \pi \) decays (and similarly each of the helicity states of the \(B\rightarrow \rho \rho \) mode) can thus be described with 6 real independent parameters, including the common weak phase \(\alpha \).

The relevant available experimental observables and their current world-average value for the \(B\rightarrow \pi \pi \) modes are summarisedFootnote 3 in Table 3. Six independent observables are available, allowing us to constrain the six-dimensional parameter space of the \({SU}(2)\) isospin analysis. These branching fractions and CP asymmetries are related to the decay amplitudes as

$$\begin{aligned} \frac{1}{\tau _{B^{i+j}}} \mathcal{B}^{ij}= & {} \frac{ |A^{ij}|^2 + |{\bar{A}}^{ij}|^2}{2}, \nonumber \\ \mathcal{C}^{ij}= & {} \frac{ |A^{ij}|^2 - |{\bar{A}}^{ij}|^2}{ |A^{ij}|^2 + |{\bar{A}}^{ij}|^2}, \\ \mathcal{S}^{ij}= & {} \frac{ 2 \mathcal{I}m({\bar{A}}^{ij}A^{ij}{}^{*})}{ |A^{ij}|^2 + |{\bar{A}}^{ij}|^2},\nonumber \end{aligned}$$
(27)

where \(\tau _{B^{i+j}}\) is the measured lifetime of the charged (\(i+j=1\)) or neutral (\(i+j=0\)) B meson.

Table 3 World averages for the relevant experimental observables in the \(B\rightarrow \pi ^i\pi ^j\) modes: branching fraction \(\mathcal{B}^{ij}_{\pi \pi }\), time-integrated CP asymmetry \(\mathcal{C}^{ij}_{\pi \pi }\), time-dependent asymmetry \(\mathcal{S}^{ij}_{\pi \pi }\) and correlation (\(\rho \))

Taking into account the isospin relation Eq. (20), the amplitudes can be parametrised in terms of their penguin and tree contributions as

$$\begin{aligned}&{A}^{+-}= T^{+-} e^{-\mathsf {i}\alpha } + P,\quad {\bar{A}}^{+-}= T^{+-} e^{+\mathsf {i}\alpha } + P, \nonumber \\&{A}^{00}= \frac{T^{00} e^{-\mathsf {i}\alpha } - P}{\sqrt{2}},\quad {\bar{A}}^{00}= \frac{T^{00} e^{+\mathsf {i}\alpha } - P}{\sqrt{2}}, \\&{A}^{+0}= \frac{( T^{+-} + T^{00} ) e^{-\mathsf {i}\alpha }}{\sqrt{2}}, \quad {\bar{A}}^{+0}= \frac{( T^{+-} + T^{00}) e^{+\mathsf {i}\alpha }}{\sqrt{2}},\nonumber \end{aligned}$$
(28)

where \(T^{+-}\), \(T^{00}\) and P are three complex parameters, among which one can be taken as real to set the global phase convention.

The actual choice of the representation for the amplitudes is irrelevant from the mathematical point of view in the frequentist approach adopted here [36]. We use the following alternative representation of the amplitude system, which proves more convenient for our purposes [37]:

$$\begin{aligned} {A}^{+0}= & {} \mu e^{\mathsf {i}(\Delta -\alpha )},\quad {\bar{A}}^{+0}= \mu e^{\mathsf {i}(\Delta +\alpha )} , \nonumber \\ {A}^{+-}= & {} \mu a,\quad {\bar{A}}^{+-}= \mu {\bar{a}} e^{2\mathsf {i}{\bar{\alpha }}},\\ {A}^{00}= & {} {A}^{+0}-\frac{{A}^{+-}}{\sqrt{2}},\quad {\bar{A}}^{00}= {\bar{A}}^{+0}-\frac{{\bar{A}}^{+-}}{\sqrt{2}}\nonumber \end{aligned}$$
(29)

where the parameters a, \({\bar{a}}\) and \(\mu \) are real positive parameters related to the modulus of the decay amplitudes and \({\bar{\alpha }}\) and \(\varDelta \) are relative phases. \(A^{+-}\) is chosen to be a real positive quantity, which sets the phase convention. The weak phase \(\alpha \) clearly appears as \(2\alpha =\arg ({\bar{A}}^{+0}/A^{+0})\). The parameter \(\bar{\alpha }\), satisfying \(\bar{\alpha }=2\arg ({\bar{A}}^{+-}/A^{+-})\), would coincide with \(\alpha \) in the limit of a vanishing penguin contribution.Footnote 4

This alternative representation provides a convenient and compact parametrisation enabling a fast and stable exploration of the six-dimensional parameter space. Besides its technical convenience, this representation has a pedagogical benefit, as it exhibits the discrete ambiguities affecting the determination of the weak phase \(\alpha \) in the \(B\rightarrow hh\) modes in a clear way. Indeed, the above experimental \(B\rightarrow \pi \pi \) observables can be written as a function of the chosen set of parameters as

$$\begin{aligned} \mathcal{B}^{+0}= & {} \tau _{B^{+}}\mu ^2,\quad \mathcal{B}^{+-} = \tau _{B^{0}}\mu ^2\frac{ a^2 + {\bar{a}}^2}{2},\nonumber \\ \mathcal{B}^{00}= & {} \tau _{B^{0}}\frac{\mu ^2}{4}(4+a^2+{\bar{a}}^2-2\sqrt{2}(ac + {\bar{a}}.{\bar{c}}),\nonumber \\ \mathcal{C}^{+-}= & {} \frac{ a^2 - {\bar{a}}^2 }{ a^2 + {\bar{a}}^2},\quad \mathcal{S}^{+-} = \frac{ 2a{\bar{a}}}{ a^2 + {\bar{a}}^2}\sin (2\bar{\alpha }),\nonumber \\ \mathcal{C}^{00}= & {} \frac{a^2-{\bar{a}}^2-2\sqrt{2}(ac + {\bar{a}}{\bar{c}})}{4+a^2+{\bar{a}}^2-2\sqrt{2}(ac + {\bar{a}}{\bar{c}})}. \end{aligned}$$
(30)

where we define \(c=\cos (\alpha -\varDelta )\) and \({\bar{c}}=\cos (\alpha +\varDelta -2{\bar{\alpha }})\). The system exhibits a fourfold trigonometric ambiguity under the phase redefinitions:

$$\begin{aligned} (\alpha ,\varDelta )\rightarrow (\varDelta ,\alpha ),(2{\bar{\alpha }}- \alpha ,2{\bar{\alpha }}-\varDelta ),(2{\bar{\alpha }}-\varDelta ,2{\bar{\alpha }}-\alpha ).\nonumber \\ \end{aligned}$$
(31)

There is also an additional discrete symmetry involving not only \(\alpha \) and \(\Delta \), but also \(\bar{\alpha }\): indeed, c, \(\bar{c}\) and \(\mathcal{S}^{+-}\) are left invariant by reflection,

$$\begin{aligned} ({\bar{\alpha }},\alpha ,\varDelta )\rightarrow \frac{\pi }{2}-({\bar{\alpha }},\alpha ,\varDelta ). \end{aligned}$$
(32)

The combination of these two discrete symmetries yields an eightfold ambiguity in the determination of the \(\alpha \) angle in the \([0,180]^\circ \) range. Geometrically speaking, the fourfold ambiguity results from the choice left concerning the position of the apex with respect to the \(A^{+0}\) base for each of the two isospin triangles, and a twofold ambiguity arises in relation with the relative direction of the two \(A^{+0}\) and \(\bar{A}^{+0}\) bases.

In the absence of any additional input, the available \(B\rightarrow \pi \pi \) observables lead thus to eight strictly equivalent mirror solutions for \(\alpha \). The degeneracy could be partially lifted with the measurement of the time-dependent CP asymmetry in the \(B^0\rightarrow \pi ^0\pi ^0\) decay, \(S^{00}\). This observable can be written as:

$$\begin{aligned} \mathcal{S}^{00}= & {} \frac{4\sin (2\alpha )+2a{\bar{a}}\sin (2{\bar{\alpha }})-2\sqrt{2} ( as+{\bar{a}}{\bar{s}})}{4+a^2+{\bar{a}}^2-2\sqrt{2}(ac + {\bar{a}}{\bar{c}})}, \end{aligned}$$
(33)

where \(s=\sin (\alpha +\varDelta )\) and \({\bar{s}}=\sin (\alpha -\varDelta +2{\bar{\alpha }})\) are not invariant under the phase redefinition of \(\alpha \) and \(\Delta \) Eq. (31), leaving only a twofold ambiguity on \(\alpha \). The \(B^0\rightarrow \pi ^0\pi ^0\) decay is so far observed only in the 4-photon final state preventing the measurement of the time-dependent decay rate. Future high-luminosity facilities have investigated the feasibility of the \(\mathcal{S} ^{00}_{\pi \pi }\) asymmetry measurement, either by considering the rare Dalitz decays of the neutral pions or by exploiting the conversion of photons in the detector material [38, 39].

Fig. 4
figure 4

Constraint on the reduced amplitude \(a^{+-}=A^{+-}/A^{+0}\) in the complex plane for the \(B\rightarrow \pi \pi \) (top) and \({\bar{B}}\rightarrow \pi \pi \) systems (bottom). The individual constraint from the \(B^0(\bar{B}^0)\rightarrow \pi ^+\pi ^-\) observables and from the \(B^0(\bar{B}^0)\rightarrow \pi ^0\pi ^0\) observables are indicated by the yellow and green circular areas, respectively. The corresponding isospin triangular relations \(a^{00}+{a^{+-}}/{\sqrt{2}}=1\) (and CP conjugate) are represented by the black triangles

The consistency of the experimental \(B\rightarrow \pi \pi \) data with the triangular isospin relation of Eq. (20) can be assessed by the means of the two-side sum:

$$\begin{aligned} t=\frac{|a^{+-}|}{\sqrt{2}}+|a^{00}| \end{aligned}$$
(34)

where \(a^{ij}\) denotes the normalised amplitude \(a^{ij}={A^{ij}}/{A^{+0}}\). The sum of the length of two sides of a triangle being greater than the third one [40], we have \(t\ge 1\), the equality occurring for a flat triangle. The values

$$\begin{aligned} t=1.05\pm 0.09 \quad \text {and}\quad {\bar{t}}=1.45\pm 0.08 \end{aligned}$$

are obtained for the \(B\rightarrow \pi \pi \) and \({\bar{B}}\rightarrow \pi \pi \) systems and are consistent with an almost-flat and an open triangle, respectively, as illustrated in Fig. 4. Both isospin triangles display the expected mirror symmetry, therefore the \(\alpha \) constraint exhibits eight non-degenerate solutions in \([0,180]^\circ \), as illustrated in Fig. 5. The minima of the \(\chi ^2\) test statistic over the parameter space (\(\chi ^2_\mathrm{min}=0\)) are found at \(\alpha =5.9^\circ \), \(\alpha =84.1^\circ \), \(\alpha =100.1^\circ \), \(\alpha =124.4^\circ \), \(\alpha =129.6^\circ \), \(\alpha =140.4^\circ \), \(\alpha =125.6^\circ \) and \(\alpha =169.9^\circ \). This system of six observables for six independent parameters is not over-constrained and the vanishing value of \(\chi ^2_\mathrm{min}\) reflects the closure of both the \(\bar{B}\rightarrow \pi \pi \) and the \( B\rightarrow \pi \pi \) isospin triangles. The corresponding (symmetrised) \(\alpha \) intervals are

$$\begin{aligned} \alpha _{\pi \pi }: ({93.0}\pm {14.0} \cup {135.0}\pm {17.7} \cup {177.1}\pm {14.2})^\circ \end{aligned}$$

at 68% CL, and \(({135.3}\pm {60.3})^\circ \) at 95% CL. The interval \(\alpha = [20.0,71.0]^\circ \) is excluded at more than 3 standard deviations by the \({SU}(2)\) isospin analysis of the \(B\rightarrow \pi \pi \) system. The solution around \(90^\circ \) is in excellent agreement with the indirect determination of \(\alpha \), Eq. (2), from the other constraints of the global CKM fit [18]. If we include \(\alpha _\mathrm{ind}\) derived from Fig. 3 as an additional and independent constraint to the \({SU}(2)\) isospin fit, the minimal \(\chi ^2\) increases by 0.3, indicating a consistency at the level of 0.6 standard deviation.

It is interesting to note the isospin constraints present a discontinuity at \(\alpha =0\) (modulo \(\pi \)) as illustrated on the bottom panel of Fig. 5. The \(\alpha =0\) hypothesis is rejected with a high significance (\(\chi ^2(\alpha =0)=296\)) as expected by the direct CP violation observed in the \(B^0\rightarrow \pi ^+\pi ^-\) decay [31, 32]. However, for any finite \(\alpha \) in the vicinity of zero, the isospin relation can be satisfied if arbitrary large penguin amplitudes are allowed (the limit case \(\alpha \rightarrow 0\) happens if \(|T^{+-}|\), \(|T^{00}|\), |P| are sent to infinity; see Eq. (28) and Refs. [36, 42]). The limit \(\alpha \rightarrow 0\) was also discussed in the context of the Bayesian statistical approach in Ref. [41]. External data, e.g., based on \({SU}(3)\) consideration, could be used to set bounds on the penguin over tree ratio and thus further constrain \(\alpha \) around 0. These bounds stand beyond the pure \({SU}(2)\) approach adopted here and we will not attempt at including such additional information for the following reasons. On one hand, other charmless modes (such as \(\rho \pi \)) will provide further information on the small \(\alpha \) region, and on the other hand, we will ultimately use the determination of \(\alpha \) in a global fit together with other well-controlled constraints on the CKM parameters [18]: both types of constraints will rule out the region around \(\alpha =0\), so that additional theoretical hypotheses on the size of the \(B\rightarrow \pi \pi \) amplitudes are not necessary for our purposes.

Fig. 5
figure 5

One-dimensional scan of \(\alpha \) in the \([0,180]^\circ \) range (top) from the \({SU}(2)\) isospin analysis of the \(B\rightarrow \pi \pi \) system. The interval with a dot indicates the indirect \(\alpha \) determination introduced in Eq. (2). The zoom in the vicinity of \(\alpha =0\) (bottom figure) displays the punctual discontinuity, whose width due to the scan binning has no physical meaning

3.3 Isospin analysis of the \(B\rightarrow \rho \rho \) system

A similar \({SU}(2)\) isospin analysis can be performed on each polarisation state of the \(B\rightarrow \rho \rho \) system. Experimentally, the decays are completely dominated by the longitudinal polarisation of the \(\rho \) mesons, and in the following, we will always imply this polarisation for the \(\rho \rho \) final state (which will be denoted explicitly as \(\rho _L\rho _L\) when we quote experimental measurements). This final state presents several advantages with respect to \(B\rightarrow \pi \pi \). The measured branching fractions \(\mathcal{B}^{+-}\) and \(\mathcal{B}^{+0}\) are approximately five times larger than for \(B\rightarrow \pi \pi \), while \(\mathcal{B}^{00}\) is of the same order of magnitude, indicating a smaller penguin contamination in the \(\rho \rho \) case. Moreover, the time-dependent CP asymmetry in the \(B^0\rightarrow \rho ^0\rho ^0\) decay, \(\mathcal{S}^{00}\), is experimentally accessible for the final state with four charged pions, potentially lifting some of the discrete ambiguities affecting the determination of \(\alpha \). However, the current measurement \(\mathcal{S}^{00}=0.3\pm 0.7\) suffers from large uncertainties, leaving pseudo-mirror solutions in \(\alpha \). The available experimental observablesFootnote 5 and their current world averages are summarised in Table 4. Under the \({SU}(2)\) isospin hypothesis, the direct CP asymmetry in \(B^+\rightarrow \rho ^+\rho ^0\) vanishes and we will not take into account the experimental measurement of this quantity (which is consistent with our hypothesis and will be used to test this assumption in Sect. 4.1). Seven independent observables are available for the longitudinal helicity state of the \(\rho \) mesons, allowing us to over-constrain the six-dimensional parameter space of the \({SU}(2)\) isospin analysis.

Table 4 World averages for the relevant experimental observables in the \(B\rightarrow \rho ^i\rho ^j\) modes: branching fraction \(\mathcal{B}^{ij}_{\rho \rho }\), fraction of longitudinal polarisation \(f_L^{ij}\), time-integrated CP asymmetry \(\mathcal{C}^{ij}_{\rho \rho }\), time-dependent asymmetry \(\mathcal{S}^{ij}_{\rho \rho }\) and correlation (\(\rho \))
Fig. 6
figure 6

Constraint on the reduced amplitude \(a^{+-}=A^{+-}/A^{+0}\) in the complex plane for the \(B\rightarrow \rho \rho \) (top) and \({\bar{B}}\rightarrow \rho \rho \) systems (bottom). The individual constraint from the \(B^0(\bar{B}^0)\rightarrow \rho ^+\rho ^-\) observables and from the \(B^0(\bar{B}^0)\rightarrow \rho ^0\rho ^0\) observables are indicated by the yellow and green circular areas, respectively. The corresponding isospin triangular relations \(a^{00}+{a^{+-}}/{\sqrt{2}}=1\) (and CP conjugate) are represented by the black line. Both \(B\rightarrow \rho \rho \) and \({\bar{B}}\rightarrow \rho \rho \) isospin triangles are flat

Fig. 7
figure 7

One-dimensional scan of \(\alpha \) in the \([0,180]^\circ \) range (top) from the \({SU}(2)\) isospin analysis of the \(B\rightarrow \rho \rho \) system. The interval with a dot indicates the indirect \(\alpha \) determination introduced in Eq. (2). The zoom in the vicinity of \(\alpha =0\) (bottom) displays the punctual discontinuity, whose width, due to the scan binning, has no physical meaning

The situation of the \(B\rightarrow \rho \rho \) system is illustrated in Fig. 6. The following two-side sums are consistent with flat triangles for both \(B\rightarrow \rho \rho \) and \({\bar{B}}\rightarrow \rho \rho \) systems:

$$\begin{aligned} t=1.00\pm 0.10 \quad \mathrm{and}\quad {\bar{t}}=0.97\pm 0.11. \end{aligned}$$

This further reduces the degeneracy of the (pseudo) mirror solutions in \(\alpha \) leaving a twofold ambiguity; see Fig. 7. The minima of the \(\chi ^2\) test statistic over the parameter space (\(\chi ^2_\mathrm{min}=0.14\)) are found at \(\alpha =92.1^\circ \) and \(178.0^\circ \). The corresponding 68 and 95% CL intervals on \(\alpha \) are

$$\begin{aligned} \alpha _{\rho \rho } :&{\left( {92.0}_{-4.8}^{+4.7} \cup {177.9}_{-4.6}^{+4.9}\right) }^{\circ } ~~(68\%\ \mathrm{CL})\quad \hbox {and}\\&{\left( {92.0}_{-11.0}^{+10.0}\cup {177.9}_{-10.0}^{+10.7}\right) }^{\circ } ~~(95\%\ \mathrm{CL}). \end{aligned}$$

The solution around \(90^\circ \) is in excellent agreement (less than 0.1 standard deviation) with the indirect determination of \(\alpha \), Eq. (2), from the other constraints of the global CKM fit [18], and is more tightly constrained than in the \(B\rightarrow \pi \pi \) case, in relation with the smaller penguin contamination.

The region \(\alpha = [14.0,76.0]^\circ \cup [112.0,158.0]^\circ \) is excluded at more than 3 standard deviations by the \({SU}(2)\) isospin analysis of the \(B\rightarrow \rho \rho \) system. On the bottom panel of Fig. 7, a zoom around \(\alpha =0\) exhibits a small discontinuity, corresponding to \(\chi ^2(\alpha =0)=1.61\) and indicating that this hypothesis is mildly disfavoured by the data, consistently with the absence of large direct CP asymmetries in \(B\rightarrow \rho \rho \) decays.

3.4 Isospin analysis of the \(B\rightarrow \rho \pi \) system

The three neutral and two charged \(B^{0,+}\rightarrow (\rho \pi )^{0,+}\) decays can be described with 10 complex (tree and penguin) amplitudes and one weak phase \(\alpha \), i.e., 21 real parameters. Assuming the pentagonal isospin relation Eq. (13) that leaves only two independent complex penguin contributions, the number of degrees of freedom of the system is reduced to 12 after setting an irrelevant global phase to zero. The dimension of the system gets down to 10 if we consider the three neutral decays only. The time-dependent Dalitz analysis of the neutral \(B^{0}\rightarrow (\rho ^\pm \pi ^{\mp },\rho ^0\pi ^0) \rightarrow \pi ^+\pi ^-\pi ^{0}\) transitions has been shown to carry enough information to fully constrain the isospin-related \(B\rightarrow \rho \pi \) system, thanks to the finite width of the intermediate \(\rho \)-mesons that yields a richer interference pattern of the three-body decay [15, 16]. For instance, 11 independent phenomenological observables can be defined from the flavour-tagged time-dependent Dalitz distribution of the three-pion decay, e.g., the branching fractions of the three intermediate \(\rho ^i\pi ^j\) transitions, the corresponding direct and time-dependent CP asymmetries and two relative phases between their amplitudes.

The extraction of the parameter \(\alpha \) through the \({SU}(2)\) analysis of the neutral modes will be referred to as the “Dalitz analysis” and will be considered first in Sect. 3.4.1. An extended analysis (referred to as the “pentagonal analysis”), including the information coming from the charged decays \(B^+\rightarrow \rho ^+\pi ^0\) and \(B^+\rightarrow \rho ^0\pi ^+\), will be discussed in Sect. 3.4.2.

3.4.1 Dalitz analysis of the neutral \(B^0\rightarrow (\rho \pi )^0\) modes

Considering the neutral modes only, the time-independent amplitude of the \(\pi ^+\pi ^-\pi ^0\) final state can be written as

$$\begin{aligned}&A_{3\pi }= f_+ A^{+} + f_- A^{-} + f_0 A^{0},\\&{\bar{A}}_{3\pi }= f_+ {\bar{A}}^{+} + f_- {\bar{A}}^{-} + f_0 {\bar{A}}^{0},\nonumber \end{aligned}$$
(35)

where \(A^{i}\) (\({\bar{A}}^{i}\)) is the amplitude of the \(B^0({\bar{B}}^0)\rightarrow \rho ^i\pi ^j\) transitionFootnote 6 and \(f_i\) (\(i=-,+,0\)) is the form factor accounting for the \(\rho ^i\) line-shape. Neglecting the tiny \(B^0\) width difference \(\varDelta \varGamma _d\), the time-dependent decay rate can be written as a function of three combinations of these amplitudes:

$$\begin{aligned}&\varGamma (t)_{{B^0}/{\bar{B}^0}\rightarrow \pi ^+\pi ^-\pi ^0} \propto e^{-t/\tau _{B^0}}\{ \varGamma _{0} {\mp } \varGamma _\mathcal{C}\cos (i\Delta m_d t )\nonumber \\&\qquad \pm 2\varGamma _\mathcal{S}\sin (\varDelta m_d t )\}, \end{aligned}$$
(36)

with \(\varGamma _0=(|A_{3\pi }|^2 + |{\bar{A}}_{3\pi }|^2)\), \(\varGamma _\mathcal{C}=(|A_{3\pi }|^2 - |{\bar{A}}_{3\pi }|^2)\) and \(\varGamma _\mathcal{S}=\mathcal{I}m(\frac{q}{p} {\bar{A}}_{3\pi }A_{3\pi }^*)\).

These amplitudes are described phenomenologically in terms of form factors \(f_i\) describing the decay of the \(\rho \) meson into two pions, multiplied by coefficients, denoted \(\mathcal U\) and \(\mathcal I\), related to the \(B\rightarrow \rho \pi \) dynamics:

$$\begin{aligned} |A_{3\pi }|^2\pm |{\bar{A}}_{3\pi }|^2= & {} \sum \limits _{i=+,-0} \mathcal{U}^\pm _i |f_i|^2\nonumber \\&+\,2\sum \limits _{j<i} \mathcal{U}^{\pm ,\mathcal{R}e}_{ij} \mathcal{R}e( f_i f_j^*) \nonumber \\&-\,2\sum \limits _{j<i} \mathcal{U}^{\pm ,\mathcal{I}m}_{ij} \mathcal{I}m( f_i f_j^*) ,\nonumber \\ \mathcal{I}m\left( \frac{q}{p} {\bar{A}}_{3\pi }A_{3\pi }^*\right)= & {} \sum \limits _{i=+,-0} \mathcal{I}_i |f_i|^2 +\sum \limits _{j<i} \mathcal{I}^{\mathcal{R}e}_{ij} \mathcal{R}e( f_i f_j^*) \nonumber \\&+\sum \limits _{j<i} \mathcal{I}^{\mathcal{I}m}_{ij} \mathcal{I}m( f_i f_j^*), \end{aligned}$$
(37)

with

$$\begin{aligned} \mathcal{U}^\pm _{i}= & {} |A^{i}|^2 \pm |{\bar{A}}^{i}|^2,\nonumber \\ \mathcal{U}^{\pm ,\mathcal{R}e}_{ij}= & {} \mathcal{R}e(A^{i} A^{j^{*}}\pm {\bar{A}}^{i}{\bar{A}}^{j^*}),\nonumber \\ \mathcal{U}^{\pm ,\mathcal{I}m}_{ij}= & {} \mathcal{I}m(A^{i} A^{j^*}\pm {\bar{A}}^{i}{\bar{A}}^{j^*}), \nonumber \\ \mathcal{I}_{i}= & {} \mathcal{I}m( {\bar{A}}^{i}A^{i^*}),\\ \mathcal{I}^{\mathcal{R}e}_{ij}= & {} \mathcal{R}e( {\bar{A}}^{i}A^{j^*} - {\bar{A}}^{i} A^{j^*}), \nonumber \\ \mathcal{I}^{\mathcal{I}m}_{ij}= & {} \mathcal{I}m( {\bar{A}}^{i}A^{j^*} + {\bar{A}}^{i} A^{j^*}).\nonumber \end{aligned}$$
(38)

For given form factors \(f_i\), these 27 coefficients \(\mathcal U\) and \(\mathcal I\) fully parametrise the time-dependent Dalitz distribution of the \(B\rightarrow \pi ^+\pi ^-\pi ^0\) decay. The \(\mathcal {U}\)parameters are related to the branching fractions and the direct CP asymmetries in the \(B\rightarrow \rho ^i\pi ^j\) intermediate states while the \(\mathcal {I}\)coefficients parametrise the mixing-induced CP asymmetries.

Table 5 Relative value of the quasi-two-body related \(\mathcal U\) and \(\mathcal I\) coefficients extracted from the time-dependent Dalitz analysis of the \(B^0\rightarrow (\rho \pi )^0\) decay [48, 49]. The corresponding form-factor bilinear is indicated for each coefficient. The red vertical line indicates the overall normalisation, defined by \(U^+_+=1\)
Table 6 Relative value of the interference-related \(\mathcal U\) and \(\mathcal I\) coefficients extracted from the time-dependent Dalitz analysis of the \(B^0\rightarrow (\rho \pi )^0\) decay [48, 49]. The corresponding form-factor bilinear is indicated for each coefficient. The red vertical line indicates the overall normalisation, defined by \(U^+_+=1\)
Table 7 Correlation matrix of correlations, in percent, for the \(\mathcal U\) and \(\mathcal I\) coefficients extracted from the time-dependent Dalitz analysis of the \(B^0\rightarrow (\rho \pi )^0\) decay [48, 49]

As long as only the neutral modes are considered, it is worth noticing that the absolute normalisation of these coefficients is irrelevant for the \({SU}(2)\) isospin analysis. The measured values and correlations [48, 49] of the 26 remaining \(\mathcal U\) and \(\mathcal I\) coefficients (normalised with respect to \(\mathcal{U^+_+}\), which is set to unity) are collected in Tables 5, 6, and 7. The following compact representation of the amplitude system for the neutral transitions can be chosen:

$$\begin{aligned} {A}^{+}= & {} \cos (\varTheta ^{+}),\quad {\bar{A}}^{+}= \sin (\varTheta ^{+})e^{\mathsf {i}\varPsi ^{+}}, \nonumber \\ {A}^{-}= & {} \mu ^{-}e^{\mathsf {i}\phi ^{-}},\quad {\bar{A}}^{-}= \nu ^{-}e^{\mathsf {i}\varPsi ^{-}} , \nonumber \\ {A}^{0}= & {} \mu ^{0}e^{\mathsf {i}\phi ^{0}},\nonumber \\ 2{\bar{A}}^{0}= & {} e^{2\mathsf {i}\alpha }(2A^{0}+A^{+}+A^{-}) - {\bar{A}}^{+}-{\bar{A}}^{-}. \end{aligned}$$
(39)

This modelisation explicitly embeds the isospin relations Eq. (13) as well as the normalisation choice \(\mathcal{U^+_+} = |A^{+}|^2 + |{\bar{A}}^{+}|^2 =1\). We thus obtain a system depending on 9 real parameters, consisting in four relative amplitudes (\(\cos (\varvec{\varTheta }^{+})\), \(\mu ^{-}\), \(\mu ^{0}\), \(\nu ^{-}\)) and 5 phases (\(\alpha \), \(\phi ^{-}\), \(\phi ^{0}\), \(\varvec{\varPsi }^{+}\), \(\varvec{\varPsi }^{-}\)) (including the weak phase \(\alpha \)). It is worth stressing again that the actual parametrisation of the amplitudes has no impact on the results obtained in our frequentist framework [36]. The parametrisation Eq. (39) is mostly based on technical considerations: this minimal representation ensures a fast and stable exploration of the nine-dimensional parameter space constrained by the 26 correlated observables. The isospin relations for both CP-conjugate modes are illustrated on Fig. 8. The resulting constraint on \(\alpha \) is shown on the top panel of Fig. 9.

Fig. 8
figure 8

Constraint on the reduced amplitude \((a^{+-}+a^{-+})=(A^{+-}+A^{-+})/(A^{+0}+A^{0+})\) in the complex plane for the \(B^0\rightarrow (\rho \pi )^0\) (top) and \({\bar{B}}^0\rightarrow (\rho \pi )^0\) systems (bottom). The individual 95% CL constraint from the \(B^0(\bar{B}^0)\rightarrow \rho ^\pm \pi ^{\mp }\) observables and from the \(B^0(\bar{B}^0)\rightarrow \rho ^0\pi ^0\) observables are indicated by the yellow and green areas, respectively, whose non-trivial shapes are due to the correlations between the two modes. The corresponding isospin triangular relations \(a^{00}+{a^{+-}}/{\sqrt{2}}=1\) (and CP conjugate) are represented by the black line

Using the 26 correlated \(\mathcal U\) and \(\mathcal I\) observables, the minimum of the \(\chi ^2\) test statistic over the parameter spaces (\(\chi ^2_\mathrm{min}=8.6\)) is found at \(\alpha =54.1^\circ \). The corresponding 68 and 95% CL intervals are

$$\begin{aligned} \alpha _{(\rho \pi )^0} ~ (\mathrm{Dalitz}){:}&{\left( {54.1}_{-10.3}^{+7.7} ~\cup ~ {141.8}_{-5.4}^{+4.8}\right) }^{\circ } ~(68\%\ \mathrm{CL})\quad \text {and} \\&{\left( {54.1}_{-27.0}^{+15.0} ~\cup ~ {141.8}_{-23.8}^{+11.2}\right) }^{\circ } ~(95\%\ \mathrm{CL}). \end{aligned}$$

The interval \(\alpha = [92.0,112.0]^\circ \) is excluded at more than 3 standard deviations by the \({SU}(2)\) isospin analysis of the \(B^0\rightarrow (\rho \pi )^0\) system. These results do not show a good agreement with the indirect determination of \(\alpha \), Eq. (2), based on the other constraints of the global CKM fit [18]. Adding the indirect determination as an additional constraint to the \({SU}(2)\) isospin fit, the minimal \(\chi ^2\) increases by 9.1 units, corresponding in the Gaussian limit to a 3.0\(\sigma \) discrepancy. In front of this uncomfortable discrepancy, we should remember that this result will be combined with constraints from \(\rho \rho \) and \(\pi \pi \) systems. Without a dedicated statistical study, it is difficult to determine the effective number of degrees of freedom associated with the set of 26 interdependent observables for the \(\rho \pi \) system, but it remains interesting to notice that the \(\chi ^2_\mathrm{min}\) for \(\rho \pi \) appears less significant (statistically) than in the case of \(\rho \rho \) (which dominates the combination) and \(\pi \pi \) (which contributes significantly). The latter constraints will dominate, leading to a combined determination remaining in good agreement with the indirect value of \(\alpha \) obtained from the other constraints of the global fit. A detailed study of the actual significance of this \(\chi ^2\) offset in the \(\rho \pi \) case is discussed in Sect. 4.2. In addition, the \(\rho \pi \) Dalitz observables have been scrutinized to understand more precisely the origin of the tension, as reported in Sect. 6.3.

Fig. 9
figure 9

One-dimensional scan of \(\alpha \) in the \([0,180]^\circ \) range from the “Dalitz” \({SU}(2)\) isospin analysis of the \(B^0\rightarrow (\rho \pi )^0\) system (top) and the “pentagonal” analysis (bottom). The interval with a dot indicates the indirect \(\alpha \) determination introduced in Eq. (2)

3.4.2 “Pentagonal” analysis of the \(B\rightarrow \rho \pi \) system

The amplitudes of the two charged decay modes \(B^+\rightarrow \rho ^+\pi ^0\) and \(B^+\rightarrow \rho ^0\pi ^+\) are related to the amplitudes of the three neutral decays \(B^0\rightarrow (\rho \pi )^0\) through the pentagonal relation Eq. (13). The measurement of the branching ratios and CP-asymmetries of the charged modes may provide additional constraints to the isospin system. Considering simultaneously the charged and the neutral modes, the \(\mathcal U\) and \(\mathcal I\) observables that describe the relative amplitudes of the neutral decays must be supplemented with an absolute normalisation. This is achieved by identifying the sum of branching ratios, \(\mathcal{B}^{+-}_{\rho {\pi }}+\mathcal{B}^{-+}_{\rho {\pi }}\), with the scaled amplitude \(\mu ^{+}(\mathcal{U}^+_+ + \mathcal{U}^+_-){\tau _{B^0}}/{2}\), where \(\mu ^{+}\) is the absolute normalisation of the relative \(\mathcal U\) and \(\mathcal I\) coefficients. The experimental measurements of the branching fractions for the charged and neutral \(B\rightarrow \rho \pi \) modes and the CP asymmetries for the charged modes are listed in Table 8.

At this stage, we would like to comment on some of these inputs. The measurement of the branching fractions for the neutral intermediate states \(B^0\rightarrow \rho ^i\pi ^j\) is experimentally tricky. Indeed, the three \(\rho \) mesons, charged and neutral, contribute to the neutral modes, whereas only one \(\rho \) state contributes to each of the charged modes. A quasi-two-body analysis was performed by the BaBar and CLEO experiments [50,51,52] assuming that the interferences are negligible in the Dalitz region where the measurements were performed. The Belle measurement [53] extracts the partial branching ratio \(\mathcal{B} ^{\pm {\mp }}_{\rho {\pi }}\) and \(\mathcal{B} ^{00}_{\rho {\pi }}\) simultaneously from a global \(B^0\rightarrow \pi ^+\pi ^-\pi ^0\) Dalitz analysis, rescaling the \(\mathcal {U}\)coefficients to the overall three-body branching fractions. The reliability of the measurements on the quasi-two-body intermediate states, \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) and \(B^0\rightarrow \rho ^0\pi ^0\), is thus questionable and this may bias the pentagonal constraint. Since the Dalitz-based \({SU}(2)\) isospin analysis already provides a complete and self-consistent description of the \(B^0\rightarrow (\rho \pi )^0\) amplitude system, we will use the Dalitz analysis rather than the pentagonal one in the combined direct determination of the \(\alpha \) angle. However, for completeness, we briefly discuss the results of the pentagonal analysis, interpreting the \(\mathcal{B} ^{\pm {\mp }}_{\rho {\pi }}\) measurement as an independent constraint on \(\mathcal{B}^{+-}_{\rho {\pi }}+\mathcal{B}^{-+}_{\rho {\pi }}\). Further discussions can be found in Sect. 6.3.3, where predictions for these observables can be found.

Table 8 World averages for the branching fractions \(\mathcal{B}^{ij}_{\rho {\pi }}\) and time-integrated CP asymmetries \(\mathcal{C}^{ij}_{\rho {\pi }}\) for the charged (top) and neutral (bottom) \(B\rightarrow \rho \pi \) decay modes

Including the charged decays, the amplitude model for the neutral modes discussed in Sect. 3.4.1 can be completed with two additional real parameters (\(\mu ^{+0}\), \(\phi ^{+0}\)):

$$\begin{aligned} {A}^{+0}= & {} \mu ^{+0}e^{\mathsf {i}\phi ^{+0}},\qquad \nonumber \\ {A}^{0+}= & {} -{A}^{+0} + \frac{2A^{0}+A^{+}+A^{-}}{\sqrt{2}}, \end{aligned}$$
(40)

where the second identity uses the pentagonal relation Eq. (13) explicitly. Using Eq. (17), the CP-conjugate amplitudes are defined as

$$\begin{aligned} {\bar{A}}^{+0}= & {} \left( A^{+0}-\frac{A^{+}-A^{-}}{\sqrt{2}}\right) e^{2\mathsf {i}\alpha }+\frac{{\bar{A}}^{+}-{\bar{A}}^{-}}{\sqrt{2}},\nonumber \\ {\bar{A}}^{0+}= & {} (\sqrt{2}(A^{+}+A^{0})-A^{0+})e^{2\mathsf {i}\alpha }-\frac{{\bar{A}}^{+}-{\bar{A}}^{-}}{\sqrt{2}}. \end{aligned}$$
(41)

Adding the normalisation factor \(\mu ^{+0}\), the whole \(B\rightarrow \rho \pi \) dynamics is described through a 12-parameter system that is over-constrained by the set of observables for neutral and charged modes given in Tables 5, 6, 7 and 8.

The minimum of the \(\chi ^2\) test statistic over the parameter space (\(\chi ^2_\mathrm{min}=12.1\)) is found at \(\alpha =49.8^\circ \). The following 68% and 95% CL intervals on \(\alpha \) are obtained:

$$\begin{aligned} \alpha _{\rho {\pi }}\ (\mathrm{pentagonal}):&\left( {49.8}_{+4.1}^{-4.3}\right) {^\circ }~~(68\%\ \mathrm{CL})~~\mathrm{and}\\&\left( {{49.8}_{-10.4}^{+8.0}}_{^\circ }^{~}\right) ~(95\%\ \mathrm{CL}). \end{aligned}$$

The compatibility with the indirect \(\alpha \) determination is estimated at 3.2 standard deviations, slightly larger than for the Dalitz analysis of the neutral modes. The constraint on \(\alpha \) is displayed on the bottom panel of Fig. 9. The same comments as in Sect. 3.4.1 apply also here concerning the relative statistical significance of this constraint compared to \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) channels.

3.5 Combined result

The \({SU}(2)\) isospin analyses of \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) provide three independent constraints on \(\alpha \). A combined analysis can be performed by summing the individual \(\chi ^2(\alpha )\) curves. Using the Dalitz analysis for the \(B^0\rightarrow (\rho \pi )^0\) mode together with \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \), the minimum of the \(\chi ^2\) test statistic (\(\chi ^2_\mathrm{min}=17.1\)) is obtained for \(\alpha =86.2^\circ \), as illustrated on Fig. 10. A slightly disfavoured second solution peaks at \(\alpha =178.4^\circ \) with \(\Delta \chi ^2=\chi ^2(\alpha =178.4^\circ )-\chi ^2_\mathrm{min}=0.4\). The corresponding 68% and 95% CL intervals are

$$\begin{aligned} \alpha _\mathrm{dir} :&\left( {86.2}_{-4.0}^{+4.4} \cup {178.4}_{-5.1}^{+3.9}\right) ^{\circ }~~(68\%\ \mathrm{CL})\quad \text {and}\\&\left( {86.2}_{-7.5}^{+12.5}\cup {{178.4}_{-9.9}^{+10.5}}\right) ^{\circ } ~~(95\%\ \mathrm{CL}). \end{aligned}$$

The preferred solution near \(90^\circ \) is consistent with the indirect determination \(\alpha _\mathrm{ind.}\) given in Eq. (2) at 1.3 standard deviations. As discussed earlier, the combined constraint is dominated by \(B\rightarrow \rho \rho \) and to a lesser extent, by \(B\rightarrow \pi \pi \), whereas \(B\rightarrow \rho \pi \) plays only a limited role. The constraint resulting of the partial combination based on \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) only is reported in Table 11.

The one-dimensional constraint on \(\alpha \) can be recast in a constraint on the \((\bar{\rho },\bar{\eta })\) Wolfenstein parameters of the CKM matrix representing the apex of the B-meson unitarity triangle [10, 18]. The following relation can be derived:

$$\begin{aligned} \left( {\bar{\eta }}-\frac{\mathrm{cotan}(\alpha )}{2}\right) ^2+\left( {\bar{\rho }}-\frac{1}{2}\right) ^2 = \frac{1}{4\sin ^2(\alpha )}, \end{aligned}$$
(42)

so that the curves at fixed \(\alpha \) consist in circles centred on the point \((\bar{\rho },\bar{\eta })=(1/2,\mathrm{cotan}(\alpha )/2)\). In particular, the curve at \(\alpha \)=\(90^\circ \) is a circle of radius 1 / 2 centered on \((\bar{\rho },\bar{\eta })=(1/2,0)\), while \(\alpha \)=\(0^\circ \) amounts to a circle of an infinite radius tangent to the axis \(\bar{\eta }=0\). All the curves of constant \(\alpha \) meet at the points (0, 0) and (1, 0). Figure 11 displays the constraint resulting from the direct determination of \(\alpha \) in the \((\bar{\rho },\bar{\eta })\) plane.

In this section, we have used world averages for all the observables, combining information from the BaBar, Belle and LHCb experiments. Since both B-factory experiments, BaBar and Belle, have measured all these observables independently, it is also possible to perform separate \({SU}(2)\) isospin analyses for each of the three decay channels and each experiment. The corresponding constraints are discussed in Appendix A.

Fig. 10
figure 10

\({SU}(2)\) constraint on \(\alpha \) (green shaded area) from the combination of \(B\rightarrow \pi \pi \) (dashed line), \(B\rightarrow \rho \rho \) (dashed-dotted line) and \(B^0\rightarrow \pi ^+\pi ^-\pi ^0\) Dalitz analyses (dotted line) compared to the indirect value (dot) obtained from the global CKM fit [18]

Fig. 11
figure 11

95% CL (dark green) and 68% (light green) constraints on \(\alpha \) in the (\(\bar{\rho },\bar{\eta })\) plane from the combined \({SU}(2)\) isospin analysis of \(B\rightarrow \pi \pi \), \(\mathcal{B} \rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) (Dalitz) decays. The small yellow area indicates the 95% CL region for the apex of the B-meson unitarity triangle (solid black lines) from the global CKM fit excluding the charmless \(B\rightarrow hh\) data used in the direct \(\alpha \) determination [18]

4 Additional uncertainties on the \(\alpha \) determination

In this section, we are going to test the limits of the assumptions made to extract \(\alpha \) from the data: the breakdown of isospin symmetry and the statistical approach used to build confidence intervals.

4.1 Testing the \({SU}(2)\) isospin framework

As discussed in Sect. 2, the extraction of the weak phase \(\alpha \) relies on isospin symmetry, which is used at different levels [58, 59]:

  • The charges of the u and d quarks are taken as identical. The \(\varDelta I={3} / {2}\) contribution induced by the electroweak penguins topology to the \(B^+\rightarrow h^+h^0\) decay is considered negligible. As the gluonic penguins only yield a \(\varDelta I={1} / {2}\) isospin contribution, the pure \(\varDelta I={3} / {2}\) \(B^+\rightarrow h^+h^0\) decay only receives tree contributions in the absence of electroweak penguins. In this limit, the weak phase \(\alpha \) can be identified as half the phase difference between the amplitude of the charged mode and its CP conjugate.

  • The masses of the u and d quarks are taken as identical. Isospin symmetry is assumed to be exact in the strong hadronisation process following the weak transition \(\bar{b}(q)\rightarrow \bar{u} u\bar{d}(q)\) where q represents the light spectator quark u or d. This assumption allows one to relate the decay amplitudes of the charged (\(\bar{b}u\)) meson to the decay amplitudes of its isospin-related neutral state \(\bar{b}d\) according to the pentagonal (triangular) identity given in Eq. (13) (respectively Eq. (20)).

  • The \(\rho \rho \) final state is supposed to obey Bose–Einstein statistics, and thus the analysis for the \(\pi \pi \) and the \(\rho \rho \) systems are supposed to follow the same isospin decomposition (for a given \(\rho \) polarisation). However, the \(\rho \) mesons cannot be distinguished only in the limit of a vanishing width. Once the finite \(\rho \) width is taken into account, additional terms (forbidden by Bose symmetry fin the limit \(\varGamma _\rho =0\)) must be taken into account in the isospin decomposition of the amplitudes.

All these hypotheses are valid a priori to a very good approximation, but the accuracy reached in the determination of \(\alpha \) in Sect. 3.5 is an incentive to investigate their limitations in more detail.

4.1.1 \(\varDelta I={3} / {2}\) electroweak penguins

While preserving the isospin relations between charged and neutral decay amplitudes, the electroweak penguin topology induces a \(\varDelta I={3} / {2}\) contribution to the charged modes \(B^+\rightarrow h^+h^0\). With such a contribution, the system of amplitudes in the \(\pi \pi \) and \(\rho \rho \) cases still obeys the triangular relation Eq. (20), but Eq. (28) has to be rewritten as

$$\begin{aligned} {A}^{+-}= & {} T^{+-} e^{-\mathsf {i}\alpha } + P^{+-} , \nonumber \\ \sqrt{2}{A}^{00}= & {} T^{00} e^{-\mathsf {i}\alpha } - P^{+-} + P^{+0}_\textsc {ew}, \nonumber \\ \sqrt{2}{A}^{+0}= & {} ( T^{+-} + T^{00} ) e^{-\mathsf {i}\alpha } + P^{+0}_\textsc {ew}, \end{aligned}$$
(43)

where the amplitude \(P^{+0}_\textsc {ew}\) accounts for the \(\varDelta I={3} / {2}\) contribution from electroweak penguins to the charged mode while \(P^{+-}\) is redefined to absorb the contributions from both gluonic and electroweak penguins to the \(B^0\rightarrow h^+h^-\) neutral decay.

The electroweak penguin contribution can be related to the tree amplitude in a model-independent way by performing the Fierz transformation of the relevant current-current operators in the effective Hamiltonian for \(B\rightarrow \pi \pi \) decays [60,61,62]. This leads to an estimate of the relative contribution of the electroweak penguin, \(P^{+0}_\textsc {ew}\), compared to the tree amplitude, \(T^{+0}=(T^{00}+T^{+-})\), for the charged decay. Neglecting the penguins with internal light quarks u and c as well as the electroweak operators \(\mathcal{O}_7\) and \(\mathcal{O}_8\) (suppressed by tiny Wilson coefficients), the amplitude ratio

$$\begin{aligned} r_{\textsc {ewp}}=\frac{P_\textsc {ew}^{+0}}{T^{+0}} \simeq -\frac{3}{2}\left( \frac{\mathcal{C}_9+\mathcal{C}_{10}}{\mathcal{C}_1+\mathcal{C}_2}\right) \frac{V_{td}V^*_{tb}}{V_{ud}V^*_{ub}}e^{-\mathsf {i}\alpha } \end{aligned}$$
(44)

can be computed in terms of the short-distance Wilson coefficients \(\mathcal{C}_i\) associated to the effective operators \(\mathcal{O}_i\) describing the dominant electroweak penguin processes (\(i=9,10\)) and the current-current tree processes (\(i=1,2\)). Following this estimate, the electroweak penguin amplitude does exhibit no strong phase difference compared to the tree amplitude, preserving the charge symmetry \(|A^{+0}|=|\bar{A}^{+0}|\) in the charged B decay. Consequently, the impact of the electroweak penguin can be accounted for introducing the single real parameter \(r_{\textsc {ewp}}={P_\textsc {ew}^{+0}}/{T^{+0}}\) in the isospin analysis of the \(B\rightarrow hh\) systems.

Fig. 12
figure 12

\({SU}(2)\) constraint on \(\alpha \) including an electroweak penguin contribution given by Eq. (47) (red dashed line) and neglecting this penguin contribution (green shaded area) for \(B\rightarrow \pi \pi \) (top) and \(B\rightarrow \rho \rho \) (bottom)

The above modification of the amplitude \(A^{+0}\) affects the determination of \(\alpha \), which is the argument of \(A^{+0}\) in the absence of \(\varDelta I={3} / {2}\) penguin contributions. The CKM phase measured in \(B\rightarrow hh\) systems under this hypothesis is an effective phase \(\alpha _0\), which can be related to the true phase \(\alpha \) through the relation:

$$\begin{aligned} \tan \alpha _0 = \frac{\sin \alpha }{r_{\textsc {ewp}}+\cos \alpha }, \end{aligned}$$
(45)

and the relative electroweak contribution \(r_{\textsc {ewp}}\) induces the shift:

$$\begin{aligned} \delta \alpha = \alpha -\alpha _0=\mathrm{arcsin}[r_{\textsc {ewp}} \sin \alpha _0]. \end{aligned}$$
(46)

on the determination of \(\alpha \). A numerical evaluation yields [6, 60,61,62]:

$$\begin{aligned} r_{\textsc {ewp}}=\frac{P_\textsc {ew}^{+0}}{T^{+0}}= & {} -(1.35\pm 0.12)\times 10^{-2}\times \left| \frac{V_{td}V^*_{tb}}{V_{ud}V^*_{ub}}\right| \nonumber \\= & {} -(3.23\pm 0.30)\times 10^{-2}, \end{aligned}$$
(47)

using the CKM matrix parameters constrained from the global analysis of the flavour constraints [18]. A maximal shift \(\delta \alpha =-1.9^\circ \) occurs at \(\alpha _0=90^\circ \), in agreement with the numerical results shown in Fig. 12. The resulting 68% CL intervals are

$$\begin{aligned} \alpha _{\pi \pi }^{\textsc {ewp}} {:}&({91.2}{\pm }{14.2} ~\cup ~ {133.8}\pm {18.1} ~\cup ~ {177.1}\pm {14.8})^\circ ,\\ \alpha _{\rho \rho }^{\textsc {ewp}} {:}&{\left( {90.1}_{-4.8}^{+4.7} ~\cup ~ {177.9}_{-4.8}^{+4.9}\right) }^{\circ }. \end{aligned}$$
Fig. 13
figure 13

Constraint on the relative contamination from \(\varDelta I={3} / {2}\) electroweak penguin \(r_{\textsc {ewp}}=P_\textsc {ew}^{+0}/T^{+0}\) from the \({SU}(2)\) analysis of \(B\rightarrow \pi \pi \) (top) and \(B\rightarrow \rho \rho \) (bottom), obtained by constraining the weak phase \(\alpha \) with the indirect determination provided by the global CKM fit [18]. An additional constraint on the strong and electroweak penguin hierarchy has been implemented (\(|P_\textsc {ew}^{+0}| < |P^{+-}|\)) in order to single out the solution for \(\alpha \) compatible with the indirect determination

Conversely, neglecting all other isospin-breaking effects, we can set a limit on the electroweak penguin contamination in the \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) systems. For this purpose, we require the modified isospin analysis to agree with the indirect prediction of \(\alpha \), Eq. (2), determined from all the other constraints used in the global CKM fit [18]. Figure 13 shows the resulting constraints on the amplitude ratio \(r_{\textsc {ewp}}\) for \(B\rightarrow \pi \pi \) (top panel) and \(B\rightarrow \rho \rho \) (bottom panel) systems. We have rejected the mirror solutions that would yield very large, unphysical, differences \(\delta \alpha \) and thus exceedingly large electroweak penguin contributions (see the discussion at the end of Sect. 3.2). This is achieved by applying an additional constraint on the strong and electroweak penguin hierarchy \(|P_\textsc {ew}^{+0}| < |P^{+-}|\) for this specific study.Footnote 7 The following 68% CL intervals are obtained:

$$\begin{aligned} r_{\textsc {ewp}}= & {} ({-2}\pm {26}) \times 10^{-2}\quad \text {for the } B\rightarrow \pi \pi \, \text {system, and}\\ r_{\textsc {ewp}}= & {} \left( {0.1}_{+8.4}^{-7.8}\right) \times 10^{-2}\quad \text {for the } B\rightarrow \rho \rho \text { system}, \end{aligned}$$

showing results consistent with an electroweak penguin contribution of only a few percents, in agreement with our theoretical expectations.

The contribution of the electroweak penguin can be added in a similar way to the \(B\rightarrow \rho \pi \) system [21]:

$$\begin{aligned} A^{+-}+A^{-+}+2A^{00}= & {} \sqrt{2}(A^{+0}+A^{0+})=T^{+}e^{-\mathsf {i}\alpha }+P_\textsc {ew}^{+},\nonumber \\ {\bar{A}}^{+-}+{\bar{A}}^{-+}+2{\bar{A}}^{00}= & {} \sqrt{2}({\bar{A}}^{+0}+{\bar{A}}^{0+})=T^{+}e^{+\mathsf {i}\alpha }+P_\textsc {ew}^{+}, \end{aligned}$$

where \(T^{+}\) and \(P_\textsc {ew}^{+}\) denote the penguin and tree contributions to the sum of the charged amplitudes. Considering only the Dalitz analysis of the neutral modes, the penguin triangular relation Eq. (14) is violated:

$$\begin{aligned} P^{+-} + P^{-+} + 2P^{00} = P_\textsc {ew}^{+}. \end{aligned}$$
(48)

The weak phase \(\alpha \), derived from

$$\begin{aligned} e^{2\mathsf {i}\alpha }=\frac{\bar{A}^{+-}+\bar{A}^{-+}+2\bar{A}^{00}-P_\textsc {ew}^{+}}{A^{+-}+A^{-+}+2A^{00}-P_\textsc {ew}^{+}}, \end{aligned}$$
(49)

can still be extracted by constraining the electroweak penguin-to-tree ratio:

$$\begin{aligned} r_{\textsc {ewp}}(\rho \pi )=\frac{P_\textsc {ew}^{+}}{A^{+-}+A^{-+}+2A^{00}-P_\textsc {ew}^{+}}=\frac{P_\textsc {ew}^{+}}{T^{+}}. \end{aligned}$$
(50)

As discussed in Refs. [63, 64], the numerical expectation given in Eq. (47) does not apply to the \(\rho \pi \) system: there are additional contributions in this case compared to \(\pi \pi \) and \(\rho \rho \), coming from matrix elements of tree operators that do not cancel anymore since the final state does not have a symmetric wave function. Following Refs. [63, 64], we parametrise the penguin-to-tree ratio for the \(B\rightarrow \rho \pi \) transition as

$$\begin{aligned} r_{\textsc {ewp}}(\rho \pi )= & {} r_{\textsc {ewp}}\frac{1-|r_a|e^{\mathsf {i}\delta _a}}{1+|r_a|e^{\mathsf {i}\delta _a}}, \quad \text {with} \nonumber \\ \quad r_a= & {} \frac{\mathcal{C}_1-\mathcal{C}_2}{\mathcal{C}_1+\mathcal{C}_2}\times \frac{\langle (\rho \pi )^+|\mathcal{O}_1^u-\mathcal{O}_2^u|B^+\rangle }{\langle (\rho \pi )^+|\mathcal{O}_1^u+\mathcal{O}_2^u|B^+\rangle }, \end{aligned}$$
(51)

\(r_{\textsc {ewp}}\) is the ratio given by Eq. (47) for the symmetrical \(B\rightarrow hh\) transition where the Bose–Einstein symmetry applies. On the other hand, \(r_a=|r_a|e^{\mathsf {i}\delta _a}\) is a correction parameter accounting for the non-vanishing \(I_f=1\) contribution, with the notation \(\langle (\rho \pi )^+|=\langle \rho ^+\pi ^0|+\langle \rho ^0\pi ^+|\). We let the correction parameter free to vary up to \(|r_a|<0.3\): this rather conventional rule of thumb seems fairly conservative compared to estimates based on naive factorisation [63, 64], but data-based extractions in the case of \(B\rightarrow K^*\pi \) decays suggest that it is the appropriate order of magnitude [65]. The unknown phase \(\delta _a\), which can generate a charge asymmetry in \(B^+\rightarrow (\rho \pi )^+\) decays, is unconstrained in the \(B^0\rightarrow (\rho \pi )^0\) Dalitz analysis considered here.

Fig. 14
figure 14

\({SU}(2)\) constraint on \(\alpha \) including an electroweak penguin contribution given by Eq. (47) (red dashed line) and neglecting this penguin contribution (green shaded area) for the \(B\rightarrow \rho \pi \) system alone (top) and the combination of the three charmless systems (bottom)

As illustrated on the top panel of Fig. 14, including the estimate of \(\varDelta I={3} / {2}\) electroweak penguin in the \(B^0\rightarrow (\rho \pi )^0\) isospin analysis induces a small shift of \(\delta \alpha = -1.2^\circ \) for the preferred value for \(\alpha \):

$$\begin{aligned} \alpha _{(\rho \pi )^0}^{\textsc {ewp}}{:} \left( {52.9}_{-11.1}^{+8.7} ~\cup ~ {142.9}_{-6.3}^{+5.3}\right) ^{\circ }, \end{aligned}$$

indicating that even a larger bound on \(|r_a|\) would have only a limited impact.

The bottom panel of Fig. 14 shows the combination of the three channels in the presence of the \(\varDelta I={3} / {2}\) electroweak penguin that produces an overall shift of \(\delta \alpha = -0.7^\circ \) on the solution near \(90^\circ \). The corresponding 68% CL interval is

$$\begin{aligned} \alpha _{{SU}(2)}^{ \textsc {ewp}}{:} {\left( {85.6}_{-4.2}^{+4.1}\right) }{^\circ }, \end{aligned}$$

in agreement with the indirect value \(\alpha _\mathrm{ind}\) given in Eq. (2) at 1.5 standard deviations.

Assuming the same amplitudes ratio, \(r_{\textsc {ewp}}\), hold for the three decay systems (modulo the correction term included for the asymmetrical \(B\rightarrow \rho \pi \) decay), the combined 68% CL interval:

$$\begin{aligned} r_{\textsc {ewp}}= \left( {8.1}_{-4.8}^{+4.3}\right) \times 10^{-2} \end{aligned}$$

is obtained when \(\alpha \) is constrained to the indirect determination from Eq. (2) and any other \({SU}(2)\)-breaking effect is neglected, in good agreement with the theoretical expectation Eq. (47).

4.1.2 Isospin-breaking effects due to mixing in the \(\pi \pi \) system

We have worked up to now under the assumption that isospin symmetry was exact for the hadronic part of the B-meson decay. Even though isospin-breaking effects are known to be tiny, due to the very small mass difference between the u and d quarks, it is interesting to assess more precisely how it could affect our analysis.

In the \(B\rightarrow \pi \pi \) system, the breaking of isospin symmetry due to the difference of quark masses triggers a mixing between light pseudoscalar mesons. This affects in particular the \(\pi ^0\) meson. Following Refs. [58, 59], at leading order of isospin breaking, the \(\pi ^0\) state can be written as

$$\begin{aligned} |\pi ^0\rangle =|\pi _3\rangle +\epsilon _{\eta }|\eta \rangle +\epsilon _{\eta '}|\eta '\rangle \quad (\epsilon _{\eta },\epsilon _{\eta '} \ll 1), \end{aligned}$$
(52)

where the flavour states are defined as follows: \(|\pi _3\rangle \) represents the \(I=0\) component of the \({SU}(2)\) triplet and \(|\eta \rangle \), \(|\eta '\rangle \) are the states resulting from the mixing of the \({SU}(3)\) pseudoscalar meson octet and singlet components (according to the flavour decomposition \(3\otimes 3=8\oplus 1\)). The triangular isospin relation that applies to the isospin amplitudes:

$$\begin{aligned} \frac{A^{+-}}{\sqrt{2}}+A^{33}=A^{+3} \end{aligned}$$
(53)

can be translated into mass-eigenstate amplitudes upon introducing the shifts \(\delta A^{i0}\):

$$\begin{aligned} A^{33}= & {} A^{00} - \delta A^{00}, \qquad \nonumber \\ A^{+3}= & {} A^{+0} - \delta A^{+0}. \end{aligned}$$
(54)

The mixing among light pseudoscalar mesons thus generates a slight modification of the triangular relation in the \(B\rightarrow \pi \pi \) amplitudes:

$$\begin{aligned} \frac{A^{+-}}{\sqrt{2}} + A^{00} =A^{+0}(1-e),\quad e = \frac{\delta A^{+0} - \delta A^{00}}{ A^{+0}}, \end{aligned}$$
(55)

with a similar equation for the CP-conjugate amplitudes.

Table 9 World averages for the branching ratios and direct CP asymmetries of the \(B^{0,+}\rightarrow \pi ^{0,+}\eta ^{(')}\) modes [66]

At leading order in \(\epsilon _{\eta ^{(')}}\), the amplitude shifts \(\delta A^{i0}\) can be written in terms of the \(B\rightarrow \pi \eta ^{(')}\) amplitudes \(A^{\pi \eta ^{(')}}\) as

$$\begin{aligned} \delta A^{00}= & {} \sqrt{2}\left( \epsilon _{\eta } A^{\pi ^0\eta }+ \epsilon _{\eta '} A^{\pi ^0\eta '}\right) ,\nonumber \\ \delta A^{+0}= & {} \epsilon _{\eta } A^{\pi ^+\eta } + \epsilon _{\eta '} A^{\pi ^+\eta '}, \end{aligned}$$
(56)

where the factor \(\sqrt{2}\) in the first equation accounts for the Bose–Einstein symmetry in the symmetric \(A^{00}-A^{33}\) amplitude. Both neutral and charged \(B\rightarrow \pi \eta ^{(')}\) decay amplitudes can be constrained using the experimental branching ratios and \(\mathcal{CP}\) asymmetries reported in Table 9. Contrary to the \(\varDelta I={3} / {2}\) contribution discussed in Sect. 4.1.1, the a priori unknown strong phase affecting \(\delta A^{+0}\) may generate a charge asymmetry \(|A^{+0}|\ne |\bar{A}^{+0}|\) in the charged B meson decay. Consequently, the amplitude system is further constrained by introducing the measured CP asymmetry in the \(B^+\rightarrow \pi ^+\pi ^0\) mode, see Table 10, which was not considered in the isospin-symmetric analysis.

Table 10 World averages for the \(C^{+0}\) direct CP asymmetries for the \(B\rightarrow \pi ^+\pi ^0\) decay and for the longitudinally polarised state in \(B^+\rightarrow \rho ^+\rho ^0\) decay

We must add eight complex amplitudes, \(A^{\pi ^+\eta }\), \(A^{\pi ^+\eta '}\), \(A^{\pi ^0\eta }\), \(A^{\pi ^0\eta '}\) and their CP conjugates, to the \(B\rightarrow \pi \pi \) system. We use Ref. [67] for the numerical estimates of the mixing parameters:

$$\begin{aligned} \epsilon _{\eta }= & {} (1.7\pm 0.3)\times 10^{-2}\\ \epsilon _{\eta '}= & {} (0.4\pm 0.1)\times 10^{-2} \end{aligned}$$

leading to a slightly modified constraint on the weak phase \(\alpha \), shown in Fig. 15:

$$\begin{aligned} \alpha _{\pi \pi }^{\mathrm{mixing}}:({92.5}\pm {15.5} ~\cup ~ {135.0}\pm {19.0} ~\cup ~ {172.2}\pm {10.4})^\circ , \end{aligned}$$

at 68% CL. We obtain thus a global shift for \(\alpha \) of \(\delta \alpha =-0.5^\circ \). The linear increase of the 68% interval, namely \(\pm 1.5^\circ \), is mostly due to the limited experimental resolution on the \(B\rightarrow \pi \eta ^{(')}\) data and it must be considered as an upper limit of the mixing-induced breaking effect on \(\alpha \).

Fig. 15
figure 15

Determination of \(\alpha \) from the \(B\rightarrow \pi \pi \) system including \(\pi ^0\)\(\eta \)\(\eta '\) mixing (red dashed line) and neglecting this mixing (green shaded area)

It is possible to go one step further using the admittedly naive simplification that there is no gluonic component to the \(\eta ^{(')}\) states. Additional relations between the \(B\rightarrow \pi \pi \) and the \(B\rightarrow \pi \eta ^{(')}\) amplitudes can then be derived, unaffected by first-order \({SU}(3)\) breaking [68]:

$$\begin{aligned} A^{+\eta }= & {} \cos (\phi _P) A^{+0}+ \sqrt{2} A^{0\eta },\nonumber \\ A^{+\eta '}= & {} \sin (\phi _P) A^{+0} + \sqrt{2} A^{0\eta '}, \end{aligned}$$
(57)

where \(\phi _P\) is the mixing angle for the isospin singlets \(|\eta _q\rangle \) = \({(|u{\bar{u}}\rangle +|d{\bar{d}}\rangle )}/{\sqrt{2}}\) and \(|\eta _s\rangle \)=\(|s{\bar{s}}\rangle \) forming the \(\eta ^{(')}\) states. With these additional constraints, only four additional complex parameters are needed to account for the \(B\rightarrow \pi \eta ^{(')}\) amplitudes. In that case, the deviation for the triangular isospin relation Eq. (55) is given by

$$\begin{aligned} e=\cos (\phi _P)\ \epsilon _{\eta } + \sin (\phi _P)\ \epsilon _{\eta '}. \end{aligned}$$
(58)

Using this relation and without further assumption on the \(\eta \)\(\eta '\) mixing angle, the isospin-breaking effect on the \(B\rightarrow \pi \pi \) determination of \(\alpha \) is found to be \(\delta \alpha =-1^\circ \) with a slightly reduced resolution \(\pm 1.0 ^\circ \). Assuming the physical \(\eta ^{(')}\) states are well described by the flavour combinations:

$$\begin{aligned} |\eta \rangle =\frac{\sqrt{2}|\eta _q\rangle -|\eta _s\rangle }{\sqrt{3}},\nonumber \\ |\eta '\rangle = \frac{|\eta _q\rangle +\sqrt{2}|\eta _s\rangle }{\sqrt{3}}, \end{aligned}$$
(59)

corresponding to an \(\eta \)\(\eta '\) mixing angle \(\tan (\phi _P)=1/\sqrt{2}\), a deviation \(\delta \alpha =(-0.3\pm 1.4)^\circ \) is obtained for the \(B\rightarrow \pi \pi \) system, in the same ball park as our previous estimate.

4.1.3 Additional isospin-breaking effects for the \(\rho \rho \) and \(\rho \pi \) systems

In the limit of a vanishing \(\rho \) meson width, the isospin analysis of each \(B\rightarrow \rho \rho \) polarisation state is identical to the \(B\rightarrow \pi \pi \) analysis, with a dominance of the longitudinal polarisation. In parallel with the previous section, isospin breaking manifests itself though the \(\rho \)\(\omega \)\(\phi \) mixing:

$$\begin{aligned} |\rho ^0\rangle \sim |\rho _3\rangle +\epsilon _\omega |\omega \rangle +\epsilon _\phi |\phi \rangle , \end{aligned}$$
(60)

where the mixing term \(\epsilon _\omega \) is of \(\mathcal{O}(1\%)\) and \(\epsilon _\phi \) is negligible as there is an almost ideal mixing in the case of the \(\phi \) meson. Additional sources of isospin breaking can occur in the \(\rho \rightarrow \pi \pi \) decay (see Ref. [21] for a detailed review):

  • The isospin breaking due to the \(\pi ^0\)\(\eta \)\(\eta '\) mixing may affect the \(\rho ^+\rightarrow \pi ^+\pi ^0\) decay but turns out negligible, as the leading term in \(\epsilon _\eta \) is suppressed by the small \(\rho ^+\rightarrow \pi ^+\eta \) decay rate: \({\varGamma (\rho ^+\rightarrow \pi ^+\eta )}/{\varGamma (\rho ^+\rightarrow \pi ^+\pi ^0)}<0.6\%\) [69].

  • Differences in the di-pion couplings for the neutral and charged \(\rho \) mesons are experimentally limited to less than 1%: indeed, \(1-({\varGamma _{\rho ^+}}/{\varGamma _{\rho ^0}})=(0.2\pm 0.9)\%\) [69].

  • Isospin breaking affecting the \(\rho ^0,\omega \rightarrow \pi ^-\pi ^+\) interference is restricted to a small window in the \(\pi \pi \) mass spectrum: this effect, integrated over the whole \(\pi \pi \) range, is estimated at the order of 2% [21].

  • The dominant source of isospin breaking is actually due to the large decay width \(\varGamma _\rho \), which makes the two final-state mesons distinguishable in the \(B\rightarrow \rho _1(\pi \pi )\rho _2(\pi \pi )\) decay. This results in a residual \(I_f=1\) amplitude contribution, forbidden by Bose–Einstein symmetry in the limit \(\varGamma _\rho =0\), but potentially as large as \(({\varGamma _\rho }/{m_\rho })^2\sim 4\%\). It is a slight abuse of language to call this effect an isospin-breaking contribution, as it does not vanish in the limit \(m_u=m_d\) (but it does in the limit of a vanishing decay width \(\varGamma _\rho \sim (m_{(\pi \pi )_{1}}-m_{(\pi \pi )_{2}})=0\)).

Fig. 16
figure 16

Determination of \(\alpha \) from the \(B\rightarrow \rho \rho \) system including \({SU}(2)\)-breaking amplitudes limited to 4% (red dashed line), limited to 10% (blue dot-dashed line) and neglecting this contributions (green shaded area)

The above isospin-breaking effects can be accounted for by modifying the isospin triangular relation in the same way as in Eq. (55):

$$\begin{aligned} \frac{A^{+-}}{\sqrt{2}}+A^{00}=A^{+0} - \varDelta ^{+0}. \end{aligned}$$
(61)

The unknown amplitude \(\varDelta ^{+0}\) is parametrised in a generic way:

$$\begin{aligned} \varDelta ^{+0}= r_T T^{+-} + r_P P^{+-}, \end{aligned}$$
(62)

where \(r_T\) and \(r_P\) are two independent complex parameters accounting for isospin breaking in the tree and penguin contributions, respectively. The arbitrary strong phase affecting these parameters can generate CP violation in the charged decays. The measured CP asymmetry in the \(B^+\rightarrow \rho ^+\rho ^0\) mode given in Table 10 is thus included as an additional constraint in the presence of these isospin-breaking terms. Figure 16 illustrates the determination of the weak phase \(\alpha \) in the case of isospin-breaking contributions smaller than \(|r_T|,|r_P|<4\%\) and the very conservative bound \(|r_T|,|r_P|<10\%\). A small shift on the preferred \(\alpha \) value near \(90^\circ \), \(\delta \alpha =-0.6^\circ \) \((-1.2^\circ )\), results from isospin-breaking contributions limited to 4% (10%), respectively. The linear increase of the 68% interval, \(({}^{+0.2}_{-1.7})^\circ \) for \(|r_T|,|r_P|<4\%\), has to be understood as an upper limit of the impact of these contributions on the determination of \(\alpha \).

Isospin breaking arises similarly in the \(B\rightarrow \rho \pi \) system through mixing in the light pseudoscalar and vector sectors. However, as long as the weak phase determination is limited to the neutral \(B^0\rightarrow (\rho \pi )^0\) Dalitz analysis, the isospin constraints reduces to the triangular penguin relation Eq. (14). Any isospin breaking in the tree amplitudes can be absorbed by a redefinition of the unconstrained charged amplitudes \(T^{+0}\), \(T^{0+}\). Corrections to the penguin amplitude relation Eq. (14) in the \(B\rightarrow \rho \pi \) system will be suppressed by the small penguin-to-tree ratios. For instance, the impact of the \(\pi ^0\)\(\eta \)\(\eta '\) mixing on the weak phase determination in the \(B^0\rightarrow (\rho \pi )^0\) analysis is suppressed by factors like \(\epsilon _{\eta ^{(')}}\left| {P_{\rho \eta ^{(')}}}/{T^+}\right| \), where \(P_{\rho \eta ^{(')}}\) represents the penguin contribution to the \(B^+\rightarrow \rho ^+\eta ^{(')}\) transition and \(T^+\) is defined in Eq. (48). The corresponding deviation is found to be negligible, \(\delta \alpha <0.1^\circ \) [21]. Therefore, apart for the electroweak penguin contribution discussed in Sect. 4.1.1, no further isospin breaking will be considered for the \(B^0\rightarrow (\rho \pi )^0\) analysis.

4.2 Impact of the statistical treatment

4.2.1 p values based on the bootstrap approach and Wilks’ theorem

In Sect. 3.1, we have outlined our framework to determine confidence intervals on \(\alpha \), starting from a test statistic \(\varDelta \chi ^2 (\alpha )\), which is converted into a p value following Eq. (24). This procedure can be proven to be exact in the simple case where the observables obey Gaussian probability distribution functions and they are linearly related to the parameters of interest. Following Wilks’ theorem, this can be extended to more general cases, at least asymptotically, if the data sample is large and the observables resolutions are small enough to consider the problem as locally linear [23]. In such case, the construction in Sect. 3.1 ensures exact coverage: if one repeated the determination of \(\alpha \) using independent data sets from many identical experiments, the 68% CL interval for \(\alpha \) would encompass the true value of \(\alpha \) in 68% of the cases.

However, many effects can alter this picture. Even in the exact Gaussian case, the p value can be distorted when the observables have a nonlinear dependence on the fundamental parameters of interest. An example consists in Eq. (12), where the reexpression of the CP-asymmetries in terms of \(\alpha _\mathrm{eff}\) is nonlinear and implements the trigonometric boundary on CP-asymmetries \(\sqrt{\mathcal{C}^2+\mathcal{S}^2}<1\) [6]. More generally, one may wonder if we stand close to the hypotheses of Wilks’ theorem with the current set of data. Otherwise, the construction given in Sect. 3.1 might suffer from under- or over-coverage. In this section, we will assess the finite-size errors associated with our statistical framework by considering a different construction of the p value that takes into account some of the effects deemed subleading in the Wilks-based approach.

We start by recalling some elements related to the construction of p values in our context, as discussed in Refs. [22, 70,71,72]. We want to assess how much the data is compatible with the hypothesis that the true value of the weak phase \(\alpha \), denoted \(\alpha _t\), is equal to some fixed value \(\alpha \), i.e., \({\mathcal H}_{\alpha }: \alpha _t = \alpha \). This hypothesis is composite, as it sets the value of some of the theoretical parameters, but not all of them. Indeed, the hadronic parameters (tree and penguin amplitudes) are also theoretical parameters required in our isospin analysis but are not set in \({\mathcal H}_{\alpha }\). These hadronic parameters are nuisance parameters, denoted collectively as \(\varvec{\mu }\).

The test statistic \(\varDelta \chi ^2 (\alpha )\) defined in Eq. (23) can be seen as the maximal likelihood ratio comparing the most plausible configuration under \({\mathcal H}_\alpha \) with the most plausible one in general. It is a definite positive function chosen in a way that large values indicate that the data present evidence against \({\mathcal H}_\alpha \): in the following, we will state explicitly its dependence on the data X by using the notation \(\varDelta \chi ^2 (X;\alpha )\). A p value is built by calculating the probability to obtain a value for the test statistic at least as large as the one that was actually observed, assuming that the hypothesis \({\mathcal H}_{\alpha }\) is true:

$$\begin{aligned} 1-p(X_0;\alpha ,\varvec{\mu })= & {} \int ^{\varDelta \chi ^2 (X_0;\alpha )}_{0} \mathrm{d}\varDelta \chi ^2 \, h(\varDelta \chi ^2 |\alpha ,\varvec{\mu })\nonumber \\= & {} {\mathcal P}[\varDelta \chi ^2 <\varDelta \chi ^2 (X_0;\alpha )], \end{aligned}$$
(63)

where the probability distribution function (PDF) h of the test statistic is obtained from the PDF g of the data as

$$\begin{aligned} h(\varDelta \chi ^2 |\alpha ,{\varvec{\mu }}) = \int \mathrm{d}X\, \delta [\varDelta \chi ^2-\varDelta \chi ^2 (X;\alpha )] g(X;\alpha ,{\varvec{\mu }}). \end{aligned}$$

A small value of the p value thus provides evidence against the hypothesis \({\mathcal H}_{\alpha }\). We notice that in general the p value Eq. (63) exhibits a dependence on the nuisance parameters through the PDF h, even though the test statistic \(\varDelta \chi ^2 \) itself is independent of \(\varvec{\mu }\).

For linear models, in which the observables X depend linearly on the parameter \(\alpha _t\), \(\varDelta \chi ^2 (X;\alpha )\) is a sum of standard normal random variables, and is distributed as a \(\chi ^2 \) with \(N_{\mathrm{dof}}=1\). Under the conditions of Wilks’ theorem [23], this property can be extended to non-Gaussian cases, the distribution of \(\varDelta \chi ^2 (X;\alpha )\) will converge to a \(\chi ^2 \) law depending only on the number of parameters tested. The p value becomes independent of the nuisance parameters \(\varvec{\mu }\) and can be still interpreted as coming from a \(\chi ^2 \) law with \(N_{\mathrm{dof}}=1\). This is the rationale for the statistical framework presented in Sect. 3.1.

We compare this Wilks-based approach with the bootstrap one based on the plug-in principle [73]. It consists here in defining

$$\begin{aligned} p^\mathrm{bootstrap}(X_0;\alpha )= & {} p(X_0;\alpha ,\hat{\varvec{\mu }}(\alpha ))\, \nonumber \\ \chi ^2 (\alpha ,\hat{\varvec{\mu }}(\alpha ))= & {} \min _{\varvec{\mu }}\chi ^2 (\alpha ,\varvec{\mu }). \end{aligned}$$
(64)

We perform the evaluation of the p value for a given value of \(\alpha \) by setting the nuisance parameters to the value minimising \(\chi ^2 (\alpha ,\varvec{\mu })\). This approach assesses the role played by nuisance parameters by replacing them with an estimator \(\hat{\varvec{\mu }}(\alpha )\) that depends on \(\alpha \). Other constructions could have been considered, with more conservative statistical properties (supremum, constrained supremum, etc.) [74], but this would go beyond the scope of our study. We focus here on the bootstrap approach: it is relatively simple to implement, it exhibits good coverage properties in the examples considered here and it provides a first glimpse of the role played by nuisance parameters in coverage, which is neglected in the Wilks-based approach used up to now.

4.2.2 Comparison of p values for the extraction of \(\alpha \)

The comparison between the two approaches may be performed by examining their coverage properties. Once computed, a p value can be used to determine confidence-level intervals for the parameter of interest \(\alpha \). These intervals have a correct frequentist interpretation if the p value exhibits exact coverage, i.e., for any \(\beta \) between 0 and 1, \(P(p\le \beta )=\beta \). Over-coverage \(P>\beta \) corresponding to a conservative p value and too wide a CL interval, under-coverage \(P<\beta \) corresponding to a liberal p value and too narrow a CL interval. In general, an exact p value, or if not possible, a reasonably conservative p value, is desirable, at least for the confidence levels of interest. This conservative approach is generally adopted in high-energy physics in order to avoid rejecting a hypothesis (such as “the SM is true”) too hastily.

We have already discussed the extraction of the p value using the Wilks-based approach. In the case of the bootstrap approach, the statistical coverage of the \(\alpha \) intervals has been studied through a full frequentist exploration of the space of nuisance parameters. A complete analysis with toy Monte Carlo simulations was carried out in order to compute the PDF h and thus the p value for the angle \(\alpha \). The individual constraints on \(\alpha \) from the \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B^0\rightarrow \pi ^+\pi ^-\pi ^0\) systems, are displayed in Fig. 17, comparing the bootstrap and Wilks-based approaches. Considering the 68% CL interval on \(\alpha \), the Wilks-based approach is slightly more conservative than the bootstrap one for both \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) systems, whereas the situation is reversed for the \(B^0\rightarrow (\rho \pi )^0\) analysis. The 68% CL intervals obtained with the bootstrap method for the three systems are

$$\begin{aligned}&\alpha _{\pi \pi }^{\mathrm{bootstrap}} {:} \left( {84.1}_{-5.3}^{+6.0}\right) ^{\circ } ~\cup ~ {\left( {100.1}_{-6.9}^{+6.1}\right) }{^\circ } ~\cup ~ {({135.0}\pm {17.0})}^{\circ } \\&\qquad \cup ~ \left( {169.9}_{-6.1}^{+6.9}\right) {^\circ } ~\cup ~ \left( {5.9}_{-6.0}^{+5.3}\right) {^\circ }, \\&\alpha _{\rho \rho }^\mathrm{bootstrap} {:}\left( {92.0}_{-4.1}^{+4.2}\right) {^\circ } ~\cup ~ \left( {177.9}_{-4.1}^{+4.2}\right) {^\circ }, \\&\alpha _{\rho {\pi }}^\mathrm{bootstrap} {:}\left( {54.0}_{-17.0}^{+10.0}\right) {^\circ } ~\cup ~ \left( {142.0}_{-8.7}^{+6.8}\right) {^\circ }. \end{aligned}$$

For the latter system, the bootstrap approach slightly reduces the tension with the indirect determination \(\alpha _\mathrm{ind}\) to 2.7 standard deviations.

Fig. 17
figure 17

p value for \(\alpha \) comparing the Wilks-based approach (denoted \(\mathrm{Erfc}\)) and the full frequentist approach based on the bootstrap method for \(B\rightarrow \pi \pi \) (top), \(B\rightarrow \rho \rho \) (middle), \(B\rightarrow \rho \pi \) (bottom)

As shown on Fig. 18, when combining the three analyses for the 68% CL interval, the over-coverage for \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) almost compensates the under-coverage for \(B\rightarrow \rho \pi \). The 68% CL interval for the preferred solution close to \(90^\circ \) increases by about 1\(^\circ \), which can be considered as an estimate of the finite-size errors associated to the statistical framework. The second solution near \(180^\circ \) is slightly less disfavoured, increasing the corresponding 68% CL interval by \(\pm 1.4^\circ \) with respect to the Wilks-based approach. The overall combination results in

$$\begin{aligned} \alpha ^\mathrm{bootstrap}_\mathrm{dir} {:}&\left( {86.1}_{-5.0}^{+5.3}\right) {^\circ } ~\cup ~ \left( {178.5}_{-6.4}^{+5.5}\right) {^\circ } ~~(68\% \text { CL})\quad \mathrm{and}\\&\left( {86.1}_{-9.3}^{+14.3}\right) {^\circ } ~\cup ~ \left( {178.5}_{-12.5}^{+13.5}\right) {^\circ } ~~(95\% \text { CL}). \end{aligned}$$

This direct \(\alpha \) measurement within the bootstrap approach is consistent with the indirect CKM determination \(\alpha _\mathrm{ind}\) at the level of 1.1 standard deviation.

Fig. 18
figure 18

p value for \(\alpha \) comparing the Wilks-based approach (denoted \(\mathrm{Erfc}\)) and the full frequentist approach based on the bootstrap method, for the combination of the three \(B\rightarrow hh\) decay modes (top) and for the partial combination based on \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) only (bottom)

Going one step further, we can compare the coverage of the two methods. This can be done by studying the PDF of the p values to check whether they are exact, conservative or liberal. Even though \(p^{\mathrm {bootstrap}}\) does not depend on the true value of the nuisance parameters explicitly, its distribution does in general. For illustrative purposes, we choose here to compute the distribution of both tests assuming as true values \(\alpha = 92.5^\circ \) and \(\varvec{\mu } = \hat{\varvec{\mu }}(\alpha = 92.5^\circ )\) (we do not attempt to investigate other values of \(\alpha \) or the nuisance parameters). The computation of the PDF of \(p^{\mathrm {bootstrap}}\) requires a twofold recursion of the bootstrap procedure (double bootstrap).Footnote 8 The p value for the \(\varDelta \chi ^2\) test is obtained assuming that it obeys Wilks’ theorem while the bootstrap test is assumed to be a true p value, i.e., uniformly distributed over [0, 1].

Fig. 19
figure 19

Coverage test displaying the true significance as a function of the test significance using the bootstrap and Wilks-based approaches (the latter being denoted \(\mathrm{Erfc}\)) for the combined extraction of \(\alpha \) (top) and for the partial combination based on \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) only (bottom)

The results are shown in Fig. 19. Rather than showing the PDF of the two p values as a function of p, we have expressed p in units of \(\sigma \) (assumed test significance) and expressed the cumulative distribution function (integral of the PDF from 0 to p) once again in units of \(\sigma \) (true test significance). A p value with exact coverage corresponds to a diagonal straight line, over-coverage (under-coverage) happens if the curve is above (below) this diagonal. In our particular case, one can clearly see the improved coverage from the bootstrap test with respect to the \(\varDelta \chi ^2\) case, assuming Wilks’ theorem. The bootstrap distribution has a flat distribution at least up to the \(2\sigma \) level, as shown by the curve in Fig. 19, which is close to a diagonal straight line (this is expected since we set the true values of the nuisance parameters to their plug-in values). On the other hand, the \(\varDelta \chi ^2\) test combined with Wilks’ theorem is too aggressive (under-covers) for confidence levels above \(0.6\sigma \) for the global combination of the three \(B\rightarrow hh\) decays modes and above \(1.2\sigma \) when considering \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) only. Due to the highly CPU-consuming nature of this computation, we have not tried to perform the same computation for other true values of \(\alpha \) and the nuisance parameters, but one may expect that the results of these coverage tests should hold for values of \(\alpha \) in the vicinity of the solution compatible with its indirect determination.

Table 11 68% CL intervals for the weak phase \(\alpha \) for different theoretical hypotheses, statistical approaches and channels. In the “Iso” column, Y indicates that the analysis is performed assuming isospin symmetry, whereas N denotes the inclusion of isospin breaking, namely, the effect of \(\pi ^0\)\(\eta \)\(\eta '\) mixing in \(B\rightarrow \pi \pi \) and the breakdown of isospin triangle relations up to \(|r_T|,|r_P|<4\%\) in \(B\rightarrow \rho \rho \), as discussed in Sect. 4.1.2 (isospin-breaking effects are neglected in the \(B^0\rightarrow (\rho \pi )^0\) Dalitz analysis). In the “\(\textsc {ewp}\)” column, Y indicates that a contamination from \(\varDelta I=3/2\) electroweak penguins is included as discussed in Sect. 4.1.1, whereas N corresponds to setting these penguin contributions to zero. In the “Stat” column, B corresponds to the bootstrap approach and W to the Wilks-based one. For each channel or combination of channels, the deviation with respect to the indirect determination \(\alpha _\mathrm{ind}\) is indicated within brackets

The difference in the 68% CL intervals between the bootstrap and Wilks-based approaches could be taken as a linear correction to the direct extraction of the parameter \(\alpha \), i.e. focusing only on the solution around \(90^\circ \):

$$\begin{aligned} \alpha _\mathrm{dir}= \left( 86.2 ~~^{{+4.4}}_{{-4.0}}~\mathrm{\small (stat.)} ~\oplus ~ ^{{+0.9}}_{{-1.0}}~\mathrm{\small (bootstrap)}\right) ^\circ . \end{aligned}$$
(65)

It is important to notice that the size of the bootstrap correction is an indication of the non-asymptotic regime only for the direct determination of the weak phase based on the current data on charmless decays. This estimate is likely to change when including additional or improved observables. In particular, the observed under-coverage for the Wilks-based approach is likely to be reduced in a global fit of the CKM matrix, where the direct \(\alpha \) measurement Eq. (65) is combined with the more precise indirect determination Eq. (2). The bootstrap interval correction has been computed here for the direct measurement of \(\alpha \) alone, for a particular set of observables. Therefore this correction cannot be considered as an absolute uncertainty to be included automatically when combined with another independent measurement.

4.3 Summary for the direct determination of \(\alpha \)

Our results for the direct determination of the weak phase \(\alpha \) are summarized in Table 11 for the different model hypotheses and the different statistical approaches considered up to now. The 68% CL interval (only for the solution near \(90^\circ \) Footnote 9) is reported as well as the compatibility with the indirect \(\alpha _\mathrm{ind}\) determination Eq. (2). We see that depending on the approach, the central value for the combination shifts by \(2^\circ \) or less, remaining thus within the error quoted in Eq. (1). The uncertainty remains between \(4^\circ \) and \(5^\circ \) when isospin-breaking effects are allowed. On the other hand, as discussed in Sect. 4.2.2, the comparison between the bootstrap and Wilks-based approaches suggests a sizeable uncertainty attached to the statistical framework, but this uncertainty is attached to the direct extraction of \(\alpha \) from the three \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) modes and cannot be used as such when it is combined with other constraints, e.g., within global fits. It is likely that one would get a smaller uncertainty if a similar analysis was performed with an extended set of observables leading to a more accurate determination of \(\alpha \).

These various arguments lead us to keep Eq. (1) as our final answer for the direct extraction of \(\alpha \) from charmless B decays to be used in latter analyses.

5 Hadronic amplitudes

In addition to the CKM angle \(\alpha \), our study of the \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \pi \) and \(B\rightarrow \rho \rho \) systems provides constraints on the hadronic amplitudes that cannot be computed in QCD directly. We can thus determine some features of hadronisation from the data, to be compared with the theoretical approaches proposed to describe these decays.

In order to sharpen the constraints on these parameters, we will not only consider the experimental measurements discussed previously, but we will also take as an additional constraint the indirect prediction \(\alpha _\mathrm{ind}\) derived from Fig. 3 and Eq. (2). This constraint is obtained excluding \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) data from the global CKM fit, and therefore provides an independent constraint on the isospin analysis.

Fig. 20
figure 20

68% (dark area) and 95% CL (light area) constraints on the modulus and phase for the penguin-to-tree ratio \(P^{ij}/ T^{ij}=R^{ij} e^{\mathsf {i}\delta ^{ij}}\) for \(B^0\rightarrow \pi ^+\pi ^-\) (top) and \(B^0\rightarrow \pi ^0\pi ^0\) (bottom)

5.1 Penguin-to-tree ratios

Following the \(\mathcal{C}\)-convention defined in Sect. 2.1, the penguin-to-tree ratio is given by

$$\begin{aligned} \tilde{R}^{ij}=\frac{P^{ij}}{T^{ij}} = \frac{R_t}{R_u}\times \frac{\mathcal{P}^{ij}}{\mathcal{T}^{ij}},\quad \frac{R_t}{R_u} = \left| \frac{V_{td}V^*_{tb}}{V_{ud}V^*_{ub}}\right| . \end{aligned}$$
(66)

The isospin analysis assumes that there are no penguin contributions to the charged decays in the \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B^{0}\rightarrow (\rho \pi )^0\) systems. This hypothesis allows us to extract \(\alpha \) and the tree and penguin contributions simultaneously in each decay mode. We start with the (complex) penguin-to-tree ratio:

$$\begin{aligned} \tilde{R}^{ij}=R^{ij}e^{\mathsf {i}\delta ^{ij}}=\frac{P^{ij}}{T^{ij}}, \end{aligned}$$
(67)

in the \(B^0\rightarrow (h_1^{i}h_2^{j})^0\) neutral modes. The isospin analysis of the \(B\rightarrow \pi \pi \) system constrains the ratio of moduli \( R^{ij}_{\pi \pi }\) for both colour-allowed (\(B\rightarrow \pi ^+\pi ^-\)) and colour-suppressed (\(B^0\rightarrow \pi ^0\pi ^0\)) neutral modes at a precision level of around \(20\%\). The two-dimensional constraint in the (complex) \(\tilde{R}^{ij}\) plane for these two modes is shown in Fig. 20. A large penguin contamination is observed in both modes. The one-dimensional extraction of the modulus and the phase gives the following 68% (respectively 95%) CL intervals:

$$\begin{aligned}&R^{+-}_{\pi \pi }={0.40}\pm {0.04} ~\left( ^{{+0.08}}_{{-0.10}}\right) ,\\&\delta ^{+-}_{\pi \pi } = \left[ {154.7}\pm {3.6} ~ \left( ^{{+6.9}}_{{-9.2}}\right) \right] ^\circ ,\\&R^{00}_{\pi \pi }={0.52}_{-0.07}^{+0.08} ~\left( ^{{+0.18}}_{{-0.16}}\right) , \quad \delta ^{00}_{\pi \pi } = \left[ {32.7}_{-10.3}^{+9.2} ~\left( ^{{+17.2}}_{{-22.3}}\right) \right] ^\circ . \end{aligned}$$

We observe that values of the phases close to 0 or \(\pi \) are favoured, in agreement with QCD factorisation expectations [75, 76], which also predicts a value for \(R^{+-}_{\pi \pi }\) around 0.5 [77, 78]. Recent fits of two-body non-leptonic B decays into light pseudoscalar mesons based on \({SU}(3)\) flavour symmetry yield smaller values for these ratios [79].

The penguin contamination to the colour-allowed \(B^0\rightarrow \rho ^+\rho ^-\) decay is significantly smaller than for its \(B\rightarrow \pi \pi \) counterpart, as expected from the flatness of the \(\rho \rho \) isospin triangles discussed in Sect. 3.3. The two-dimensional constraint in the (complex) \(\tilde{R}^{ij}_{\rho \rho }\) plane is shown in Fig. 21. A tight constraint on the penguin contamination to the \(B^0\rightarrow \rho ^+\rho ^-\) decay is obtained with a 68% (respectively 95%) CL upper limit:

$$\begin{aligned} R^{+-}_{\rho \rho } < 0.08~~ (0.13), \end{aligned}$$

on the edge of the expectations from QCD factorisation, which predicts this ratio to be around 0.12 [78]. Due to the small overall amplitudes contributing to the \(B^0\rightarrow \rho ^0\rho ^0\) mode, a looser constraint is found:

$$\begin{aligned} R^{00}_{\rho \rho }<0.40~~ (0.72). \end{aligned}$$

No constraint is obtained on the phase of the ratio for either \(B^0\rightarrow (\rho \rho )^0\) decays (Fig. 21).

Fig. 21
figure 21

68% (dark area) and 95% CL (light area) constraints on the modulus and phase of the penguin-to-tree ratio \(P^{ij}/T^{ij}=R^{ij} e^{\mathsf {i}\delta ^{ij}}\) for \(B^0\rightarrow \rho ^+\rho ^-\) (top) and \(B^0\rightarrow \rho ^0\rho ^0\) (bottom)

Fig. 22
figure 22

68% CL constraint on the phase shift \(\varDelta \alpha =(\alpha -\alpha _\mathrm{eff})\) for \(B\rightarrow \pi \pi \) (top) and \(B\rightarrow \rho \rho \) (bottom)

Another representation of the penguin hierarchy between the \(B^0\rightarrow \pi ^+\pi ^-\) and \(B^0\rightarrow \rho ^+\rho ^-\) decays is provided by the phase shift \(\varDelta \alpha =(\alpha -\alpha _\mathrm{eff})\) where the effective weak phase \(\alpha _\mathrm{eff}\), introduced in Eq. (12), describes the \(B^0\rightarrow h^+h^-\) transition. In the limit of a vanishing penguin contribution, the mixing phase \(\alpha _\mathrm{eff}\) coincides with the weak phase \(\alpha \). The following 68% CL intervals on \(\varDelta \alpha \), derived from Fig. 22, are obtained:

$$\begin{aligned} \varDelta \alpha _{\pi \pi }= & {} \pm ({23.8}\pm {13.9})^\circ ,\\ \varDelta \alpha _{\rho \rho }= & {} \left( {2.1}_{+3.7}^{-3.6}\right) ^\circ . \end{aligned}$$

The phase shift is consistent with the absence of the penguin in the \(B^0\rightarrow \rho ^+\rho ^-\) decay while the sizeable shift obtained in \(B^0\rightarrow \pi \pi \) excludes the \(\varDelta \alpha =0\) hypothesis at 1.9 \(\sigma \) in this mode, confirming the difference of size of penguin contributions for the two decay channels. A similar quantity can be defined for the asymmetric \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) decay and is further discussed in Sect. 6.3.2.

Due to the limited experimental statistics currently available, the Dalitz isospin analysis of the three-pion decay does not lead to a significant constraint on the penguin contribution in any of the three \(B^0\rightarrow (\rho \pi )^0\) decays. The pentagonal approach, involving further constraints from the charged modes, only provides a weak constraint on \(R^{+-}\) and \(R^{-+}\); see Fig. 23. Phases close to 0 or \(\pi \) are mildly favoured, in agreement with the expectations from QCD factorisation. No constraint can be derived for the \(B^0\rightarrow \rho ^0\pi ^0\) mode. For comparison, the constraints on the modulus of the penguin contribution to the various decays modes can be found in Fig. 24.

Fig. 23
figure 23

68% (dark area) and 95% CL (light area) two-dimensional constraints on the modulus and phase of the penguin-to-tree ratio \(P^{ij}/ T^{ij}=R^{ij} e^{\mathsf {i}\delta ^{ij}}\) for \(B^0\rightarrow \rho ^+\pi ^-\) (left), \(B^0\rightarrow \rho ^-\pi ^+\) (middle) and \(B^0\rightarrow \rho ^0\pi ^0\) (right). These results are obtained using the pentagonal isospin analysis, including the experimental inputs from the charged \(B^+\rightarrow (\rho \pi )^+\) decays

Fig. 24
figure 24

68% (dark area) and 95% CL (light area) 1D intervals on the \(|P^{ij}/ T^{ij}|\) ratios for the different \(B^{0}\rightarrow h^{i}h^{j}\) charmless decays

5.2 Testing colour suppression

The hadronisation of the \({\bar{b}}d\rightarrow {\bar{u}}d(u \bar{d})\) final state into a pair of charged isovector mesons, \(h^+_1h^-_2\), is colour allowed, while the hadronisation of this tree transition into a neutral pair, \(h^0_1h^0_2\), is colour suppressed. This colour suppression can be probed through the ratio of the corresponding tree amplitudesFootnote 10:

$$\begin{aligned} \tilde{R}_\mathcal{C}=R_\mathcal{C}e^{\mathsf {i}\delta _\mathcal{C}}=\frac{T^{00}}{T^{+-}}, \end{aligned}$$
(68)

where \(R_\mathcal{C}\) and \(\delta _\mathcal{C}\) represent the modulus and the phase of the complex ratio, respectively.

Fig. 25
figure 25

Two-dimensional constraints on the modulus and phase of the colour-suppressed ratio \(T^{00}/ T^{+-}=R_{c} e^{\mathsf {i}\delta _{c}}\) for \(B\rightarrow \pi \pi \) (left), \(B\rightarrow \rho \pi \) (middle) and \(B^{0}\rightarrow \rho \pi \) (right)

Fig. 26
figure 26

Constraint on the \(B\!\rightarrow \pi \pi \) branching fractions from the \({SU}(2)\) isospin analysis (from left to right: \(\pi ^+\pi ^-\), \(\pi ^+\pi ^0\) and \(\pi ^0\pi ^0\)). The direct measurement, not included in the fit, is indicated for each mode with an interval with a dot

The two-dimensional constraint on the phase and modulus of the colour-suppression ratio is shown in Fig. 25 for \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \) and \(B\rightarrow \rho \pi \) channels. As expected, no constraint is obtained for that latter due to the limited statistics. The modulus of the colour-suppression ratio is determined with a relative resolution of about \(10\%\) for the \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) modes. The corresponding one-dimensional 68% (95%) CL intervals on the modulus and the phase are

$$\begin{aligned} R_\mathcal{C}({\pi \pi })= & {} {0.54}\pm {0.04}\quad \left( ^{{+0.08}}_{{-0.09}}\right) ,\\ \delta _\mathcal{C}({\pi \pi })= & {} \left[ {-58.4}_{-8.6}^{+10.3}\quad \left( ^{{+22.9}}_{{-16.6}}\right) \right] ^\circ ,\\ R_\mathcal{C}({\rho \rho })= & {} {0.16}\pm {0.02}\quad \left( ^{{+0.03}}_{{-0.04}}\right) ,\\ \delta _\mathcal{C}({\rho \rho })= & {} \left[ {2.9}_{-41.8}^{+38.4}\quad (\pm {63.0})\right] ^\circ , \end{aligned}$$

indicating that the colour suppression is much more effective for \(B\rightarrow \rho \rho \) than for \(B\rightarrow \pi \pi \). In studies based on QCD factorisation [76,77,78], similar values for these moduli are found, apart from the modulus of the ratio between colour-allowed and colour-suppressed tree amplitudes for \(\pi \pi \) modes, \(R_\mathcal{C}({\pi \pi })\), smaller and around 0.2. Similarly, the relative phase for the latter ratio is significantly different from 0 and \(\pi \), contrary to general expectations from leading-order QCD factorisation. This may explain the difficulties experienced with this approach to reproduce the \(B\rightarrow \pi ^0\pi ^0\) branching ratio [80] and may point towards significant power-suppressed contributions for some of these modes. Recent fits of two-body non-leptonic B decays into light pseudoscalar mesons based on \({SU}(3)\) yield similar values for \(R_\mathcal{C}({\pi \pi })\), indicating that large phase and modulus of the colour-suppressed tree contribution \(T^{00}\) are required not only in the \(\pi \pi \) system, but also in other \(B\rightarrow PP\) decays [79].

6 Prediction of observables

If we use the indirect determination of the weak phase \(\alpha _\mathrm{ind}\), Eq. (2), as an additional input, the system of isospin-related amplitudes becomes over-constrained for the three charmless decay modes of interest. The isospin analysis can then be used to perform an indirect determination for each of the experimental observables introduced in Sect. 3 and to make predictions for yet unmeasured quantities. For each observable \(\mathcal{O} _i\), this indirect determination is obtained by performing a fit to the experimental data excluding the measurement of the observable considered. The compatibility of the indirect determination with the measurement is quantified by comparing the minimal \(\chi ^2\) values reached by the fit when excluding and including the direct measurement. The difference between these two quantities is distributed as a \(\chi ^2\) law with \(N_{\mathrm{dof}}=1\), whose interpretation in terms of standard deviations yields the compatibility pull defined as

$$\begin{aligned} \mathrm{Pull} (\mathcal{O} _i) = \sqrt{\chi ^2 _\mathrm{min}(\alpha _\mathrm{ind},\varvec{\mathcal{O}}) - \chi ^2 _\mathrm{min}(\alpha _\mathrm{ind},\varvec{\mathcal{O}}_{! i})}, \end{aligned}$$
(69)

where \(\varvec{\mathcal{O}}=\{\varvec{\mathcal{O}}_{! i},\mathcal{O} _i\}\) represents the full data set of experimental measurements, and \(\varvec{\mathcal{O}}_{! i}\) the subset out of which the specific observable \(\mathcal{O} _i\) has been excluded. The pull quantifies the compatibility of the measurement of the observable \(\mathcal{O} _i\) with all the other experimental observables under the isospin hypothesis. Let us add that the individual pulls are not in general independent from each other, so that one should refrain from giving a statistical interpretation of their distribution in each of the \(B\!\rightarrow \pi \pi \), \(B\!\rightarrow \rho {\pi }\), \(B\!\rightarrow \rho \rho \) systems.

6.1 \(B\!\rightarrow \pi \pi \) observables

As already indicated in Sect. 3.2, the number of independent amplitudes in the \(B\!\rightarrow \pi \pi \) system matches the number of experimental measurements available. Including the indirect determination of \(\alpha \) as an additional constraint, the minimal \(\chi ^2\) increases only slightly, reflecting the excellent agreement between \(\alpha _\mathrm{ind}\) and \(B\!\rightarrow \pi \pi \) data, at the level of 0.6 standard deviation.

Fig. 27
figure 27

Two-dimensional \({SU}(2)\) isospin constraint on the CP asymmetries \(\mathcal{C} _{\pi \pi }\) and \(\mathcal{S} _{\pi \pi }\) for the \(B^0\!\rightarrow \pi ^+\pi ^-\) decay (left) and the \(B^0\!\rightarrow \pi ^0\pi ^0\) decay (right). The direct measurement, not included in the fit, is indicated by the shaded area. Only the direct CP asymmetry \(C^{00}\) is measured in the \(B^0\rightarrow \pi ^0\pi ^0\) decay and we represent the direct constraint in the \((\mathcal{C} ^{00},\mathcal{S} ^{00})\) plane taking the theoretical inequality \(\mathcal{C} ^2+\mathcal{S} ^2 < 1\) into account

The indirect constraints on the three branching fractions \(B^{ij}_{\pi \pi }\) are shown in Fig. 26 while Fig. 27 displays the two-dimensional constraint on the CP asymmetries (\(\mathcal{C} _{\pi \pi },\mathcal{S} _{\pi \pi }\)) for the neutral modes \(B^0\!\rightarrow \pi ^+\pi ^-\) and \(B^0\!\rightarrow \pi ^0\pi ^0\). When excluding any of the experimental measurements, either for the branching ratios \(\mathcal{B} ^{ij}_{\pi \pi }\) or for the direct CP asymmetries \(\mathcal{C} ^{ij}_{\pi \pi }\), the \(B\!\rightarrow \pi \pi \) amplitudes system is no longer over-constrained. As a consequence, the corresponding \(\chi ^2 _\mathrm{min}(\alpha _\mathrm{ind},\varvec{\mathcal{O}}_{! i})\) vanishes and the pull associated to these observables saturates its maximal value: \(\mathrm{Pull} (B^{ij}_{\pi \pi },C^{ij}_{\pi \pi })=\sqrt{\chi ^2 _\mathrm{min}(\alpha _\mathrm{ind},\varvec{\mathcal{O}})}\), which turns out to vanish thanks to the closure of both isospin triangles with the current world-average data. The yet unmeasured time-dependent asymmetry, \(\mathcal{S} ^{00}_{\pi \pi }\), is predicted to be

$$\begin{aligned} \mathcal{S} ^{00}_{\pi \pi } = {0.65}\pm {0.13}, \end{aligned}$$
(70)

with a 68% confidence level, and larger than 0.33 at 95% confidence level. The corresponding one-dimensional projection of the constraint is displayed in Fig. 28.

Fig. 28
figure 28

\({SU}(2)\) isospin constraint on the unmeasured time-dependent CP asymmetries in the \(B^0\!\rightarrow \pi ^0\pi ^0\) decay

The experimental values, confidence intervals and pulls for the measured observables are reported in Table 14 in Appendix C.

6.2 \(B\!\rightarrow \rho \rho \) observables

As shown in Sect. 3.3, the \(B\!\rightarrow \rho \rho \) data is in very good agreement with the indirect determination of the weak phase \(\alpha _\mathrm{ind}\). The \({SU}(2)\) isospin constraint on the \(\mathcal{B} ^{ij}_{\rho \rho }\) branching ratios and the corresponding fractions of longitudinal polarisation are displayed in Fig. 29. As expected theoretically [78], the isospin fit to the \(B\!\rightarrow \rho \rho \) data favours large fractions of longitudinal polarisation in each of the three decay modes. The most precise constraint is obtained for the neutral \(B^0\!\rightarrow \rho ^+\rho ^-\) decay with an indirect determination \(f_{\rho _L\rho _L;\mathrm{ind}}^{+-}>0.91\) (0.79) at 68% (95%) confidence level, in very good agreement with the experimental measurement \(f_{\rho _L\rho _L;\mathrm{dir}}^{+-}=0.990\pm 0.020\).

Fig. 29
figure 29

\({SU}(2)\) isospin constraint on the \(B\!\rightarrow \rho \rho \) branching fractions (top) and the fractions of longitudinal polarisation (bottom). For each observable, the direct measurement, not included in the fit, is indicated by an interval with a dot

Weaker constraints are obtained on the CP asymmetries (\(\mathcal{C} _{\rho \rho },\mathcal{S} _{\rho \rho }\)) for both neutral modes \(B^0\!\rightarrow \rho ^+\rho ^-\) and \(B^0\!\rightarrow \rho ^0\rho ^0\) as shown in Fig. 30. Numerical values, confidence intervals and pulls for the measured observables are reported on Table 15 in Appendix C.

Fig. 30
figure 30

Two-dimensional constraint on the \(B^0\!\rightarrow (\rho \rho )^0\) CP asymmetries in the (\(\mathcal{C}\),\(\mathcal{S}\)) plane. The direct measurement, not included in the fit, is indicated for each mode as a shaded area

6.3 \(B\!\rightarrow \rho {\pi }\) observables

6.3.1 \(B^0\!\rightarrow \pi ^{+}\pi ^{-}\pi ^{0}\) Dalitz analysis

The normalised \(\mathcal {U}\)and \(\mathcal {I}\)observables provide a complete description of the relative \(B^0\!\rightarrow \pi ^+\pi ^-\pi ^0\) decay amplitudes. As discussed in Sect. 3.4, the data from \(B^0\!\rightarrow (\rho \pi )^0\rightarrow \pi ^{+}\pi ^{-}\pi ^{0}\) Dalitz analyses disagrees with the indirect determination of \(\alpha \) at the level of 3.0 standard deviations. This disagreement is reflected through the indirect determination of several of the form-factor coefficients \(\mathcal {U}\)and \(\mathcal {I}\)as listed in Table 17 in Appendix C, in particular \({\mathcal U}_{+}^{-} \), \({\mathcal U}_{-}^{+} \), \(\mathcal {I}_{-}\), \({\mathcal U}_{+0}^{+\mathcal {R}e} \), \({\mathcal U}_{-0}^{+\mathcal {R}e} \) with pulls above \(2\sigma \), and \({\mathcal U}_{+-}^{+\mathcal {R}e} \), \({\mathcal U}_{+-}^{+\mathcal {I}m} \) with pulls near or above \(3\sigma \).

We can separate the quasi-two-body terms (or Q2B) coefficients related to the dynamics of the \(\rho \pi \) intermediate state,

$$\begin{aligned} {\varvec{\mathcal {O}}}_{\rho {\pi }}=(\mathcal {U}^{+}_{i}, \mathcal {U}^{-}_{i}, \mathcal {I}^i)_{(i=+,0,-)}, \end{aligned}$$

from the parameters describing the interference pattern,

$$\begin{aligned} {\varvec{\mathcal {O}}}_\mathrm{interf} = (\mathcal {U}^{\pm ,\mathcal {R}e (\mathcal {I}m)}_i,\mathcal {I}^{\mathcal {R}e (\mathcal {I}m)}_{ij}), \end{aligned}$$

as can be seen in Eq. (39). The indirect determination of most of the Q2B coefficients, \(\mathcal {U}^i_j\) and \(\mathcal {I}_j\), related to the \(B^0\!\rightarrow \rho ^\pm \pi ^{\mp }\) intermediate states, deviates from the direct measurement by more than \(2\sigma \). This is also the case for the corresponding interference terms \(\mathcal {U}^{+\mathcal {R}e/\mathcal {I}m}_{ij}\), which represent the real and imaginary parts of the combination of \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) amplitudes, \((A^+A^{-^*}+ {\bar{A}}^+{\bar{A}}^{-^*})\). The largest deviation is observed for the \(U^{+\mathcal {R}e}_{+-}\) coefficient which exceeds \(3\sigma \). Figures 31 and 32 display the indirect determinations of the form-factor bilinear coefficients for quasi-two-body and interference coefficients, respectively.

Fig. 31
figure 31

\({SU}(2)\) isospin constraint on the Q2B-related coefficients \(\mathcal {U}\)and \(\mathcal {I}\)describing the \(B^0\!\rightarrow (\rho \pi )^{0}\) intermediate states. The upper left figure corresponds to the overall normalisation, \({\mathcal U}_{+}^{+}\)  \(=\) 1. For each observable, the direct measurement, not included in the fit, is indicated by an interval with a dot

Fig. 32
figure 32

Two-dimensional \({SU}(2)\) isospin constraint on the real and imaginary parts of interference-related coefficients \(\mathcal {U}\) and \(\mathcal {I}\) describing the \(B^0\!\rightarrow \pi ^{+}\pi ^{-}\pi ^{0}\) Dalitz decay. For each observable, the direct measurement, not included in the fit, is indicated by the shaded area. The largest tension is observed in the uppermost left figure related to the \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) amplitude combination \((A^+A^{-^*}+ {\bar{A}}^+{\bar{A}}^{-^*})\)

In order to understand these discrepancies, we can rely on two parametrisations focusing on the dynamics of the \(\rho \pi \) intermediate state through Q2B observables. A first, “theoretical”, parametrisation was proposed in Ref. [81]. The time-dependent CP asymmetry given by Eq. (10) is written independently for the three neutral \(B^0\!\rightarrow \rho ^i\pi ^j\) intermediate states:

$$\begin{aligned} a^{+}_{CP}(t)= & {} \frac{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^+\pi ^-) - \varGamma ({B}^0(t)\rightarrow \rho ^+\pi ^-) }{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^+\pi ^-) + \varGamma ({B}^0(t)\rightarrow \rho ^+\pi ^-)}\nonumber \\= & {} \mathcal{S} ^{+}\sin (\varDelta m_d t)-\mathcal{C} ^{+}\cos (\varDelta m_d t),\nonumber \\ a^{-}_{CP}(t)= & {} \frac{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^-\pi ^+) - \varGamma ({B}^0(t)\rightarrow \rho ^-\pi ^+) }{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^-\pi ^+) + \varGamma ({B}^0(t)\rightarrow \rho ^-\pi ^+)}\nonumber \\= & {} \mathcal{S} ^{-}\sin (\varDelta m_d t)-\mathcal{C} ^{-}\cos (\varDelta m_d t),\nonumber \\ a^{0}_{CP}(t)= & {} \frac{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^0\pi ^0) - \varGamma ({B}^0(t)\rightarrow \rho ^0\pi ^0) }{\varGamma ({\bar{B}}^0(t)\rightarrow \rho ^0\pi ^0) + \varGamma ({B}^0(t)\rightarrow \rho ^0\pi ^0)}\nonumber \\= & {} \mathcal{S} ^{0}\sin (\varDelta m_d t)-\mathcal{C} ^{0}\cos (\varDelta m_d t), \end{aligned}$$
(71)

where \(\mathcal{C} ^{i}\) and \(\mathcal{S} ^{i}\) are direct and mixing-induced CP asymmetries, respectively, and the subscript i refers to the charge of the emitted \(\rho ^i\) meson.

The \(\rho ^\pm \pi ^{\mp }\) final state being a CP admixture, the parameters \(\mathcal{C} ^{\pm }\) are not the only measurements of direct CP violation. Considering either the decay where the \(\rho ^\pm \) meson is emitted by the spectator interaction or by the W exchange, we can use one of the following time-integrated flavour-independent asymmetries:

$$\begin{aligned} A^{-}= & {} \frac{|\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^+\pi ^-)|^2 - |\mathcal{A} ({B}^0\rightarrow \rho ^-\pi ^+)|^2 }{|\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^+\pi ^-)|^2 + |\mathcal{A} ({B}^0\rightarrow \rho ^-\pi ^+)|^2},\nonumber \\ A^{+}= & {} \frac{|\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^-\pi ^+)|^2 - |\mathcal{A} ({B}^0\rightarrow \rho ^+\pi ^-)|^2 }{|\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^-\pi ^+)|^2 + |\mathcal{A} ({B}^0\rightarrow \rho ^+\pi ^-)|^2}, \end{aligned}$$
(72)

to constrain the \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) amplitude system completely.

Due to the overlapping final states in the Dalitz analysis, the separation of the three intermediate states is experimentally challenging and prevents a clean direct measurement of the above Q2B asymmetries. The previous “theoretical” set of observables can be traded for a second, “experimental”, parametrisation in order to avoid this problem. Focusing only on the region of the charged \(\rho \) meson in the Dalitz plane and neglecting the interferences and the residual contribution from the colour-suppressed \(B^0\!\rightarrow \rho ^0\pi ^0\) mode, a specific time-dependent measurement of the CP-mixed \(B^0\!\rightarrow \rho ^\pm \pi ^{\mp }\) intermediate state was first performed at B-factories in 2005 [50, 82]. Introducing the flavour-integrated charge asymmetry:

$$\begin{aligned} A_{\rho {\pi }}=\frac{|\mathcal{A} _{\rho ^+\pi ^-}|^2 - |\mathcal{A} _{\rho ^-\pi ^+}|^2}{|\mathcal{A} _{\rho ^+\pi ^-}|^2 + |\mathcal{A} _{\rho ^-\pi ^+}|^2}, \end{aligned}$$
(73)

where

$$\begin{aligned} |\mathcal{A} _{\rho ^+\pi ^-}|^2= & {} |\mathcal{A} ({B}^0\rightarrow \rho ^+\pi ^-)|^2 + |\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^+\pi ^-)|^2,\\ |\mathcal{A} _{\rho ^-\pi ^+}|^2= & {} |\mathcal{A} ({B}^0\rightarrow \rho ^-\pi ^+)|^2 + |\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^-\pi ^+)|^2, \end{aligned}$$

the \(B^0(t)\rightarrow \rho ^\pm \pi ^{\mp }\) decay rate is written as

$$\begin{aligned} \varGamma (B^0(t)\rightarrow \rho ^\pm \pi ^{\mp })= & {} (1+q_{\rho } A_{\rho {\pi }})e^{-\frac{t}{\tau _{B^0}}}\nonumber \\&\times \,\{1-q_{B}(\mathcal{S} +q_{\rho }\varDelta \mathcal{S})\sin (\varDelta m_d t)\nonumber \\&+\,q_{B}(\mathcal{C} +q_{\rho }\varDelta \mathcal{C})\cos (\Delta m_d t)\},\nonumber \\ \end{aligned}$$
(74)

where \(q_B=+1 (-1)\) for \(B^0({\bar{B}}^0)\) and \(q_{\rho }\) is the electric charge of the emitted \(\rho \)-meson. The CP-violating (CP-conserving) parameters \(\mathcal{C} \) and \(\mathcal{S} \) (\(\Delta \mathcal{C} \) and \(\Delta \mathcal{S} \)) generate the flavour-dependent asymmetries:

$$\begin{aligned} \mathcal{C} ^{\pm }= \mathcal{C} \pm \Delta \mathcal{C},\quad \mathcal{S} ^{\pm }= \mathcal{S} \pm \Delta \mathcal{S}, \end{aligned}$$
(75)

while the flavour-independent CP asymmetries defined in Eq. (72) can be rewritten in terms of the experimental parameters as

$$\begin{aligned} A^{+}= & {} -\frac{A_{\rho {\pi }}+C+A_{\rho {\pi }}\varDelta C}{1+\varDelta C+A_{\rho {\pi }}C},\nonumber \\ A^{-}= & {} -\frac{A_{\rho {\pi }}-C-A_{\rho {\pi }}\varDelta C}{A_{\rho {\pi }}-\varDelta C-A_{\rho {\pi }}C}. \end{aligned}$$
(76)

In order to remove experimental biases due to the imperfect separation of the intermediate states, one can derive the Q2B parameters from the subset \({\varvec{\mathcal {O}}}_{\rho {\pi }}\) of the form-factor coefficients:

$$\begin{aligned} \mathcal{C} ^i= & {} \frac{\mathcal {U}^-_i}{\mathcal {U}^+_i}, \quad \mathcal{S} ^i=\frac{2\mathcal {I}^i}{\mathcal {U}^+_i},\nonumber \\ A^\pm= & {} \frac{ \pm ({\mathcal U}_{-}^{+}-{\mathcal U}_{+}^{+}) - ({\mathcal U}_{-}^{-} +{\mathcal U}_{+}^{-}) }{\pm ({\mathcal U}_{+}^{-}-{\mathcal U}_{-}^{-})+( {\mathcal U}_{-}^{+} +{\mathcal U}_{+}^{+}) },\nonumber \\ A_{\rho {\pi }}= & {} \frac{ {\mathcal U}_{+}^{+} -{\mathcal U}_{-}^{+}}{{\mathcal U}_{+}^{+} +{\mathcal U}_{-}^{+}}. \end{aligned}$$
(77)

Finally, we can introduce the relative contribution of the colour-suppressed \(B^0\!\rightarrow \rho ^0\pi ^0\) mode to the overall \(B^0\!\rightarrow (\rho \pi )^0\) branching fraction:

$$\begin{aligned} f^{00}=\frac{\mathcal {U}^+_0}{\mathcal {U}^+_0 + \mathcal {U}^+_- + \mathcal {U}^+_+}. \end{aligned}$$
(78)

We can then reparametrise the \(B^0\rightarrow \pi ^+\pi ^-\pi ^0\) amplitude system in terms of a selected set of \(\mathcal {U}\), \(\mathcal {I}\)and Q2B observables: on hand, \({\varvec{\mathcal {O}}}_\mathrm{interf}\), \({\varvec{\mathcal {O}}}_{\rho ^0\pi ^0}=\{\mathcal{C} ^0,S^0,f^{00}\}\) and on the other hand, either the theoretically motivated set \({\varvec{\mathcal {O}}}_{\rho ^\pm \pi ^{\mp }}=\{\mathcal{C} ^\pm ,S^\pm ,A\}\) or the experimentally convenient choice \({\varvec{\mathcal {O}}}_{\rho ^\pm \pi ^{\mp }}=\{C,S,\varDelta C,\varDelta S,A\}\) (where A stands for \(A^+\), \(A^-\) or \(A_{\rho {\pi }}\)). The world averages of the Q2B parameters derived from the measured \(\mathcal {U}\)and \(\mathcal {I}\)coefficients are listed in Table 18 in Appendix C, together with their corresponding prediction in the \({SU}(2)\) isospin framework. These results are also shown in Figs. 33 and 34.

Fig. 33
figure 33

Two-dimensional constraints on the flavour-dependent, \((\mathcal{C} ^i\), \(\mathcal{S} ^i)_{i=+,0,-}\), and flavour-independent (\(A^{+}\), \(A^{-}\)) asymmetries. In each case, the experimental value derived from the related \(\mathcal {U}\) or \(\mathcal {I}\) coefficients and not included in the fit is indicated as a shaded area

Fig. 34
figure 34

Two-dimensional constraints on the \(B^0\!\rightarrow \rho ^\pm \pi ^{\mp }\) CP-violating parameters (\(\mathcal{C} _{\rho {\pi }}\), \(\mathcal{S} _{\rho {\pi }}\)) and CP-conserving parameters (\(\Delta \mathcal{C} _{\rho {\pi }}\), \(\Delta \mathcal{S} _{\rho {\pi }}\)) (top) and one-dimensional constraints on the flavour-integrated charge asymmetry \(\mathcal{A} _{\rho {\pi }}\) and the relative \(B^0\rightarrow \rho ^0\pi ^0\) branching fraction \(f^{00}\) (bottom). In each case, the corresponding experimental value derived from the related \({\mathcal U}\) or \(\mathcal {I}\) coefficients, not included in each individual fit, is indicated either by a shaded area or an interval with a dot

A reasonable agreement is observed for the parameters related to the \(B^0\rightarrow \rho ^0\pi ^0\) mode. The \({SU}(2)\) isospin fit favours large values for both direct and mixing-induced CP asymmetry parameters in the \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) decays, in contrast with experimental data. The overall \(3\sigma \) discrepancy is mostly reflected by the correlated flavour-independent CP asymmetries, \(A^\pm \), or equivalently the charge asymmetry, \(A_{\rho {\pi }}\). While the predicted CP-violating asymmetry averages, \(\mathcal{C}\) and \(\mathcal{S}\), are in a reasonable agreement with the experimental data, the CP-conserving terms, \(\varDelta \mathcal{C} \) and \(\varDelta \mathcal{S} \), deviate by more than \(2.5\sigma \).

6.3.2 Role of the strong phases in the \(B^0\rightarrow (\rho \pi )^0\) analysis

In order to understand this discrepancy in more detail, the effective weak angles associated with the \(B^0\rightarrow \rho ^+\pi ^-\) and \(B^0\rightarrow \rho ^-\pi ^+\) decays,

$$\begin{aligned} 2\alpha ^+= & {} \mathrm{Arg}\left[ \frac{q}{p}\frac{\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^-\pi ^+)}{\mathcal{A} ({B}^0\rightarrow \rho ^+\pi ^-)}\right] \quad \mathrm{and}\nonumber \\ 2\alpha ^-= & {} \mathrm{Arg}\left[ \frac{q}{p}\frac{\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^+\pi ^-)}{\mathcal{A} ({B}^0\rightarrow \rho ^-\pi ^+)}\right] , \end{aligned}$$
(79)

constitute further interesting quantities as they both reduce to \(\alpha \) in the limit of a vanishing penguin contribution. The time-independent Q2B observables in the \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) decays do not lead to a determination of each effective mixing angle: only the average \(\alpha _\mathrm{eff}=(\alpha ^+ + \alpha ^-)/2\) can be measured up to a fourfold ambiguity in the \([0,90]^\circ \) range [59]. Indeed, the Q2B observables can be combined to measure the two shifted phases:

$$\begin{aligned} 2\alpha ^\pm \pm 2\delta= & {} \mathrm{Arg}\left[ \frac{q}{p}\frac{\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^\pm \pi ^{\mp })}{\mathcal{A} ({B}^0\rightarrow \rho ^\pm \pi ^{\mp })}\right] \nonumber \\= & {} \mathrm{arcsin}\left[ \frac{\mathcal{S} {\mp }\varDelta \mathcal{S}}{\sqrt{1-(\mathcal{C} {\mp }\varDelta \mathcal{C})^2}}\right] , \end{aligned}$$
(80)

where the phase shift,

$$\begin{aligned} 2\delta = \mathrm{Arg}\left[ \frac{\mathcal{A} ({B}^0\rightarrow \rho ^+\pi ^-)}{\mathcal{A} ({B}^0\rightarrow \rho ^-\pi ^+)}\right] , \end{aligned}$$
(81)

cannot be determined from the Q2B data alone but cancels in the average \(\alpha _\mathrm{eff}\). A bound on the deviation \(\varDelta \alpha \) between the effective angle \(\alpha _\mathrm{eff}\) and the weak phase \(\alpha \) can be derived [6, 83, 84] from the penguin relation Eq. (14):

$$\begin{aligned} |\varDelta \alpha |=|\alpha -\alpha _\mathrm{eff}|\le \frac{1}{2}\mathrm{arccos}\left[ \frac{1}{\sqrt{1-A_{\rho {\pi }}^2}}\left( 1-4\frac{\mathcal{B} ^{00}}{\mathcal{B} ^{\pm {\mp }}}\right) \right] ,\nonumber \\ \end{aligned}$$
(82)

where \(\mathcal{B} ^{00}\) and \(\mathcal{B} ^{\pm {\mp }}\) denote the branching fractions of the \(B^0\rightarrow \rho ^0\pi ^0\) and \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) decays, respectively. This relation originates with the fact that the sum of the penguin amplitudes \(({P}^{+-}+{ P}^{-+})=-{ P}^{00}\) is bounded by the branching fraction of the colour-suppressed mode while the difference \(({ P}^{+-}-{ P}^{-+})\) vanishes in relation with the charge-integrated CP asymmetry \(A_{\rho {\pi }}\) (see also the discussion of the Q2B analysis of the charmless \(B^0\rightarrow a_1^\pm \pi ^{\mp }\) decay in Appendix B).

Fig. 35
figure 35

One-dimensional scan of \(\varDelta \alpha =(\alpha -\alpha _\mathrm{eff})\) from the isospin analysis of \(B^0\rightarrow (\rho \pi )^0\) Dalitz data. The dashed curve includes the indirect determination \(\alpha _\mathrm{ind}\) as additional constraint

The constraint on \(\varDelta \alpha \) given in Fig. 35 features a solution consistent with 0:

$$\begin{aligned} |\varDelta \alpha |_\mathrm{min}=\left( {8.8}^{+7.4}_{-9.9}\right) ^\circ , \end{aligned}$$

and a mirror solution around \(100^\circ \). Including the indirect determination \(\alpha _\mathrm{ind}\), Eq. (2), as an additional input, a tighter constraint is obtained, \(|\varDelta \alpha |_\mathrm{min}=({3.5}^{+2.4}_{-2.6}){^\circ }\), and a new mirror solution, disfavoured by the \(B^0\rightarrow (\rho \pi )^0\) data, emerges around \(45^\circ \). As shown in Fig. 36, when using only the Q2B data set \(\{\mathcal {U}^i_j,\mathcal {I}_j\}_{i,j=(+,-)}\), four solutions are found for \(\alpha _\mathrm{eff}\) around \(0^\circ \), \(45^\circ \) and \(90^\circ \) (the peak near \(45^\circ \) combines two quasi-degenerate solutions). The solution close to \(90^\circ \) is consistent with the indirect determination of \(\alpha \) Eq. (2). However, only the solution for \(\alpha _\mathrm{eff}\) near 45\({}^\circ \) remains once the Q2B observables and the interference-related constraints for \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) are considered altogether. The solutions close to \(0^\circ \) and 90\({}^\circ \) are disfavoured at \(2.5\sigma \). The rejection of these solutions is slightly reinforced with the addition of the observables related to \(B^0\rightarrow \rho ^0\pi ^0\).

Fig. 36
figure 36

On the top One-dimensional scan of the effective angle average \(\alpha _\mathrm{eff}\) using \(B^0\rightarrow (\rho \pi )^0\) Dalitz data. The dot-dashed curve is obtained when using only the \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) quasi-two-body observables, \(\{\mathcal {U}^+_-,\mathcal {U}^-_-,\mathcal {U}^+_+,\mathcal {U}^-_+,\mathcal {I}_+,\mathcal {I}_-\}\). The dashed curve includes also the interference-related observables, \(\{\mathcal {U}^{\pm \mathcal {R}e (\mathcal {I}m)}_{+-},\mathcal {I}_{+-}^{\mathcal {R}e (\mathcal {I}m)}\}\). On the bottom Two-dimensional constraint in the (\(\alpha _\mathrm{eff},\varDelta \alpha \)) plane obtained using the whole \(\mathcal {U}\) and \(\mathcal {I}\) observables. Contrary to other similar figures in this article (where the excluded region is above \(2\sigma \) or 95% CL), the excluded region corresponds here to a confidence level above \(3\sigma \) (99% CL) in order to show the suppressed solution around \((90^\circ , 0^\circ )\)

Fig. 37
figure 37

Two-dimensional constraint in the (\(\alpha _\mathrm{eff}\), \(\delta _\mathrm{eff}\)) plane. The coloured area is obtained from the interference-related coefficients \(\mathcal {U}^{\pm \mathcal {I}m (\mathcal {R}e)}_{+-}\) only. The black ellipses come from the Q2B coefficients \(\{\mathcal {U}^i_j,\mathcal {I}_j\}_{i,j=(+,-)}\). The shaded ellipses result from considering the whole \(\mathcal {U}\) and \(\mathcal {I}\) data set for \(B^0\rightarrow (\rho \pi )^0\)

As can be expected, the interference observables provide further constraints on the relative phases of the decay amplitudes. In particular, one can easily show from Eq. (38) that the phase shift \(2\delta \), which vanishes in the absence of the penguin contribution, can be determined from the interference-related coefficients \(\mathcal {U}^{\pm \mathcal {I}m (\mathcal {R}e)}_{+-}\):

$$\begin{aligned} \tan [2\delta ] = \frac{ {\mathcal U}_{+-}^{+\mathcal {I}m} + {\mathcal U}_{+-}^{-\mathcal {I}m} }{{\mathcal U}_{+-}^{+\mathcal {R}e} + {\mathcal U}_{+-}^{-\mathcal {R}e}}. \end{aligned}$$
(83)

Simultaneously, the related phase between the conjugate modes,

$$\begin{aligned} 2{\bar{\delta }} = 2\delta +(\alpha ^+-\alpha ^-) = \mathrm{Arg}\left[ \frac{\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^-\pi ^+)}{\mathcal{A} ({\bar{B}}^0\rightarrow \rho ^+\pi ^-)}\right] , \end{aligned}$$
(84)

can be extracted from the same coefficients:

$$\begin{aligned} \tan [{2\bar{\delta }}] = -\frac{ {\mathcal U}_{+-}^{+\mathcal {I}m} - {\mathcal U}_{+-}^{-\mathcal {I}m} }{{\mathcal U}_{+-}^{+\mathcal {R}e} - {\mathcal U}_{+-}^{-\mathcal {R}e}}, \end{aligned}$$
(85)

meaning that the phase average \(\delta _\mathrm{eff}=({\bar{\delta }}+\delta )/2\) can be extracted independently using either the Q2B coefficients or the interference-related coefficients for \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\):

$$\begin{aligned} 4\delta _\mathrm{eff}= & {} \mathrm{arctan}\left[ \frac{ {\mathcal U}_{+-}^{+\mathcal {I}m} + {\mathcal U}_{+-}^{-\mathcal {I}m} }{{\mathcal U}_{+-}^{+\mathcal {R}e} + {\mathcal U}_{+-}^{-\mathcal {R}e}}\right] \\&-\mathrm{arctan}\left[ \frac{ {\mathcal U}_{+-}^{+\mathcal {I}m} - {\mathcal U}_{+-}^{-\mathcal {I}m} }{{\mathcal U}_{+-}^{+\mathcal {R}e} - {\mathcal U}_{+-}^{-\mathcal {R}e}}\right] \\= & {} \mathrm{arcsin}\left[ \frac{2\mathcal {I}_{-}}{\sqrt{{\mathcal U}_{-}^{+} {}^2-{\mathcal U}_{-}^{-} {}^2}}\right] -\mathrm{arcsin}\left[ \frac{2\mathcal {I}_{+}}{\sqrt{{\mathcal U}_{+}^{+} {}^2-{\mathcal U}_{+}^{-} {}^2}}\right] . \end{aligned}$$

The Q2B coefficients provide also a constraint on the average mixing angle \(\alpha _\mathrm{eff}\):

$$\begin{aligned} 4\alpha _\mathrm{eff}=\mathrm{arcsin}\left[ \frac{2\mathcal {I}_{-}}{\sqrt{{\mathcal U}_{-}^{+} {}^2-{\mathcal U}_{-}^{-} {}^2}}\right] +\mathrm{arcsin}\left[ \frac{2\mathcal {I}_{+}}{\sqrt{{\mathcal U}_{+}^{+} {}^2-{\mathcal U}_{+}^{-} {}^2}}\right] . \end{aligned}$$

Figure 37 compares the two-dimensional constraints in the (\(\alpha _\mathrm{eff}\), \(\delta _\mathrm{eff}\)) plane using either subset of observables. Clearly, only the solutions for \(\alpha _\mathrm{eff}\) near \(45^\circ \) are favoured by both subsets.

As discussed previously, the two observables \(\mathcal {U}^{+\mathcal {R}e (\mathcal {I}m)}_{+-}\) play an important role in the discrepancy between the \(B^0\rightarrow (\rho \pi )^0\) Dalitz data and the isospin hypothesis. It is interesting to show how removing these two observables impacts the analysis: the \(\alpha _\mathrm{eff}\) solutions near \(0^\circ \) and \(90^\circ \) get favoured, and the direct determination of \(\alpha \) agrees much better with the indirect value, as illustrated in Fig. 38. From the experimental side, the observables \(\mathcal {U}\) \(_{+-}^{\pm \mathcal {R}e (\mathcal {I}m)}\) and \(\mathcal {I}\) \(_{\pm }^{\mathcal {R}e (\mathcal {I}m)}\) are the coefficients of the mixed form-factors bilinear (\(f^+f^{-^*}\)). Therefore, they are the only observables sensitive to a possible phase between the functionals describing the line-shape of the \(\rho \)-mesons of opposite charge. It would be very interesting to study the role played by the isobar approximation and to determine the possible re-interpretations of the experimental data, but this lies clearly beyond the scope of the isospin analysis presented here, and we will refrain from drawing further conclusions on this discrepancy.

Fig. 38
figure 38

Top One-dimensional scan of the angle \(\alpha \) using the \(B^0\rightarrow (\rho \pi )^0\) Dalitz data except the two coefficients \(\mathcal {U}^{+\mathcal {R}e}_{+-}\) and \(\mathcal {U}^{+\mathcal {I}m}_{+-}\), which measure the real and imaginary parts of \((A^+A^{-^*}+ {\bar{A}}^+{\bar{A}}^{-^*})\). The interval with a dot indicates the indirect determination of \(\alpha \) introduced in Eq. (2). Bottom Two-dimensional constraint in the (\(\alpha _\mathrm{eff}\), \(\Delta \alpha \)) plane. As for other similar figures in this article, the excluded region corresponds to CLs above \(2\sigma \). The standard solution around \((90^\circ , 0^\circ )\) is favoured with this set of observables

6.3.3 \(B\!\rightarrow \rho {\pi }\) pentagonal analysis

The absolute scale of the neutral \(B^0\rightarrow \pi ^-\pi ^+\pi ^0\) amplitudes can be determined by adding the measured branching fractions for the two charged modes \(B^+\rightarrow \rho ^0\pi ^+\) and \(B^+\rightarrow \rho ^+\pi ^0\), related to the neutral modes via the pentagonal relation Eq. (13). Once the data listed in Table 8 is added to the fit, individual \(B\rightarrow \rho \pi \) branching fractions and direct CP asymmetries in the charged modes can be determined. The corresponding pulls are given in Table 16 in Appendix C.

Fig. 39
figure 39

Constraint on the branching fraction of the neutral \(B^0\!\rightarrow \rho ^0\pi ^0\) mode (top, left) and the mixed \(B^0\!\rightarrow \rho ^\pm \pi ^{\mp }\) mode (top, right). The direct measurement, not included in the fit, is indicated by an interval with a dot. The bottom figure displays the prediction for the branching fractions for \(B^0\!\rightarrow \rho ^\pm \pi ^{\mp }\) in the (\(B^0\!\rightarrow \rho ^+\pi ^-\), \(B^0\!\rightarrow \rho ^-\pi ^+\)) plane

While the predicted branching fraction for the neutral \(B^0\rightarrow \rho ^0\pi ^0\) mode is in very good agreement and competitive with the experimental measurement, a discrepancy is observed in the balance of branching fractions for the charged modes and the mixed \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) decays. The branching fraction of both charged modes are predicted to be larger than their measured value, which is compensated by a lower \(B^0\rightarrow \rho ^\pm \pi ^{\mp }\) branching ratio. Figures 39 and 40 display the \({SU}(2)\) isospin constraints for the neutral and charged modes, respectively.

Fig. 40
figure 40

Constraint on the branching fraction and the direct CP asymmetry of the charged \(B^+\!\rightarrow \rho ^+\pi ^0\) mode (top, left) and \(B^+\!\rightarrow \rho ^0\pi ^+\) mode (top, right). For each observable, the direct measurement, not included in the fit, is indicated by an interval with a dot. The bottom figure displays the prediction for the branching fractions for \(B^+\!\rightarrow (\rho \pi )^+\) in the (\(B^+\!\rightarrow \rho ^0\pi ^+\), \(B^+\!\rightarrow \rho ^+\pi ^0\)) plane compared to the experimental measurement indicated by the shaded area

7 Prospective study

In this section, we discuss how improved measurements of some \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \rho \), \(B\rightarrow \rho \pi \) observables can affect the accuracy on the CKM angle \(\alpha \). So far, the experimental data is statistically limited for all the \(B\rightarrow hh\) charmless modes considered here. Our prospective study aims at identifying the specific decay channels worth measuring more accurately to improve the resolution on \(\alpha \) significantly.

For simplicity, we adopt a systematic approach rather than relying on the expected performance of current or forthcoming flavour experiments such as LHCb or Belle II. We consider the subset of observables related to a specific \(B\rightarrow h^ih^j\) decay, and we determine how much the accuracy on \(\alpha \) is improved if we reduce the uncertainties for this subset of observables by the (arbitrary) factor \(\sqrt{2}\), or if we take the radical limit of setting these uncertainties to zero. The central value of the current world-average measurements and correlation coefficients are kept unchanged. For the neutral modes \(B^0\rightarrow h^+h^-\) and \(B^0\rightarrow h^0h^0\), we distinguish the case of a counting analysis (C) that gives access to the branching ratio only (e.g., the LHCb measurement of \(B^0\rightarrow \rho ^0\rho ^0\) [47]), and the case of flavour-tagged analyses: indeed, a time-integrated (TI) analysis extracts only the direct CP asymmetry whereas a time-dependent (TD) analysis yields both the direct and the mixing-induced CP-asymmetry parameters (e.g., the LHCb contribution to the study of \(B^0\rightarrow \pi ^+\pi ^-\) [33]).

The global combination of the decay-specific determinations of \(\alpha \) is so far dominated by the \(B\rightarrow \rho \rho \) data that provides a constraint on \(\alpha \) with a relative uncertainty at the level of 5%, i.e., \(\alpha _{\rho \rho }=({92.1}_{-4.9}^{+4.6}){^\circ }\) for the solution near \(90^\circ \) given in Eq. (35). As shown in Table 12, if all other observables remain unchanged, improving the accuracy of the branching ratio of the charged mode \(B^+\rightarrow \rho ^+\rho ^0\) would improve the resolution on \(\alpha \) only marginally, even in the case of a vanishing resolution (indicating that this observable has only a limited impact on the accuracy for \(\alpha \)). Improving the measurements for the neutral modes, in particular the colour-suppressed \(B^0\rightarrow \rho ^0\rho ^0\) decay, has a larger impact, essentially driven by the CP-asymmetries parameters. Improving the time-dependent asymmetries in the \(B^0\rightarrow \rho ^0\rho ^0\) is also worth investigating, e.g., in the second run of LHCb data taking. Reducing by \(\sqrt{2}\) all the \(B\rightarrow \rho \rho \) uncertainties would reduce the 68% CL interval for \(\alpha \) by more than 1\(^{\circ }\).

The subleading contribution to the combined \(\alpha \) determination is provided by the \(B\rightarrow \pi \pi \) system, \(\alpha _{\pi \pi }=({93.0}\pm {14.0})^\circ \); see Eq. (35). As shown on Table 13, any sizeable improvement on \(\alpha \) is driven by the increased accuracy in the measurement of the direct CP asymmetry in the colour-suppressed decay \(B^0\rightarrow \pi ^0\pi ^0\). A reduction by \(\sqrt{2}\) of the uncertainty of this single observable would reduce the 68% CL range for \(\alpha \) by \(1^\circ \). A similar improvement of all the measured \(B\rightarrow \pi \pi \) observables would reduce by about \(1.5^\circ \) the uncertainty on \(\alpha \).

Table 12 68% CL interval for \(\alpha \) from the \({SU}(2)\) isospin analysis of the \(B\rightarrow \rho \rho \) data in the case that the uncertainty of some observables is reduced by a factor \(\sqrt{2}\) (third column) or set to zero (fourth column). The relative gain in resolution with respect to the current measurement is indicated within brackets. Only the preferred solution for \(\alpha \) close to \(90^\circ \) is reported
Table 13 68% CL interval for \(\alpha \) from the \({SU}(2)\) isospin analysis of the \(B\rightarrow \pi \pi \) data in the case that the uncertainty of some observables is reduced by a factor \(\sqrt{2}\) (third column) or set to zero (fourth column). The relative gain in resolution with respect to the current measurement is indicated within brackets. Only the preferred solution for \(\alpha \) close to \(90^\circ \) is reported

No time-dependent analysis has become available so far for the colour-suppressed mode due to the dominant di-photon decay of the neutral pions that hinders the measurement of the decay time, so that \(\mathcal{S} ^{00}_{\pi \pi }\) is not measured yet. However, the low branching fraction of the Dalitz decay \(\pi ^0\rightarrow \gamma e^+e^-\) or the photon conversion in the detector material may allow one to measure the \(B^0\) decay time in the future high-statistics flavour experiments (LHCb upgrade, Belle II). Figure 41 shows the p value for \(\alpha \) when adding the \(S^{00}_{\pi \pi }\) observable under different hypotheses concerning the experimental resolution, and setting the central value as predicted in Eq. (70). In addition to a significant improvement on \(\alpha \), the measurement of \(\mathcal{S} ^{00}_{\pi \pi }\) would reduce the number of mirror solutions in the \(B\rightarrow \pi \pi \) isospin analysis (see the discussion around Eq. (33)).

Fig. 41
figure 41

Expected impact of the measurement of the time-dependent asymmetry \(\mathcal{S} ^{00}\) on the \(\alpha \) determination from the isospin analysis of the \(B\rightarrow \pi \pi \) system. The central value for \(\mathcal{S} ^{00}\) corresponds to the best prediction from the \({SU}(2)\) isospin analysis

The same prospective exercise is more delicate for the \(B^0\rightarrow \pi ^+\pi ^-\pi ^0\) system due to the discrepancy between the direct measurement \(\alpha _{\rho {\pi }}=({54.1}^{+7.7}_{-10.3}){^\circ }\cup ({141.8}^{+4.8}_{-5.4}){^\circ }\) (Eq. (40)) and the indirect determination from the global CKM fit (Eq. (2)). New measurements of this decay would certainly aim first at a better understanding of this discrepancy, rather than improving the accuracy on \(\alpha \) extracted from this channel. Assuming the central values given by the individual predictions for each \(\mathcal {U}\)and \(\mathcal {I}\)coefficient, as listed in Table 17, and keeping unchanged the current experimental resolution and correlations, we obtain a mild constraint on \(\alpha \), consistent with \(\alpha _\mathrm{ind}\), namely \(\alpha _{\rho {\pi }}^{\mathrm{fit}~\mathcal {U},\mathcal {I}}=({82}^{-33}_{-48}){^\circ }\). In this case, the reduction of the observable uncertainty by \(\sqrt{2}\) results in a reduction by approximately \(9^\circ \) for the 68% CL interval for \(\alpha \). On the other hand, keeping the current world-average central values and reducing the uncertainty on all the \(\mathcal {U}\)and \(\mathcal {I}\)observables by a factor \(\sqrt{2}\) leads to a more stringent 68% CL interval \(\alpha _{\rho {\pi }}^{\sigma \scriptscriptstyle /\sqrt{2}}=({54.1}^{+5.6}_{-6.9}){^\circ }\cup ({141.0}^{+1.9}_{-1.9}){^\circ }\), which remains difficult to interpret due to the discrepancies already discussed between the current data and our theoretical framework based on isospin symmetry. Additional measurements should hopefully provide a clearer and more consistent picture for the \(B\rightarrow \rho \pi \) sector before discussing any improvement in the extraction of \(\alpha \) from these modes.

8 Conclusion

Quark-flavour transitions provide particularly stringent tests of the Standard Model, both through rare decays and \(\mathcal{CP}\)-violating processes. An accurate knowledge of the Cabibbo–Kobayashi–Maskawa matrix is essential for these studies and it requires the combination of many precise constraints. We have focussed on the determination of the \(\alpha \) angle, which can be extracted with a high accuracy from two-body charmless B-meson decays extensively studied at B-factories and LHCb.

We recalled that this extraction can be done from \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \pi \) and \(B\rightarrow \rho \rho \) decays but it is affected by the presence of penguin contributions. We explained how \({SU}(2)\) isospin symmetry can be used to constrain the structure of hadronic penguin and tree amplitudes, enabling the extraction of \(\alpha \) from branching ratios and \(\mathcal{CP}\) asymmetries. We gave details on the analyses of \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \pi \) and \(B\rightarrow \rho \rho \) systems separately, before combining these results to reach an accuracy around 4\(^\circ \) on the direct determination of \(\alpha \). The \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) systems dominate the combination and they favour solutions in good agreement with the indirect determination of \(\alpha \) from a global CKM fit analysis, Eq. (2), it is not the case for the \(B\rightarrow \rho \pi \) system which favours different ranges of values for \(\alpha \) with a discrepancy at the level of \(3\sigma \) compared to the indirect determination. The combination of the three channels is dominated by \(B\rightarrow \rho \rho \), and to a lesser extent \(B\rightarrow \pi \pi \), resulting in the 68% CL confidence intervals given in Eq. (1).

We have then studied several uncertainties that may affect this extraction. We have tested the hypotheses underlying the \({SU}(2)\) isospin analysis: setting the \(\Delta I={3} / {2}\) electroweak penguins to zero, neglecting the difference of light-quark masses generating \(\pi ^0\)\(\eta \)\(\eta '\) mixing, setting the \(\rho \) width to zero to cancel \(\Delta I=1\) contributions thanks to Bose–Einstein symmetry. These effects may shift the central value of \(\alpha _\mathrm{dir}\) by around \(2^\circ \), while keeping the uncertainty around \(4^\circ \) to \(5^\circ \), thus remaining within the statistical uncertainty quoted in Eq. (1). In addition, we have discussed a few aspects concerning the statistical treatment used in order to extract the p value, comparing two statistical approaches to test the impact of hadronic nuisance parameters on coverage. The approach based on Wilks’ theorem and mainly used here proves to be more conservative than the bootstrap method for \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \), but less conservative in the case of \(B\rightarrow \rho \pi \). The comparison of the coverage properties of the two approaches leads to a further uncertainty of around \(1^\circ \). We stress that this uncertainty is only attached to the combination of the three direct determinations of \(\alpha \) with the current data: it is likely to be reduced if one combines the direct determination of \(\alpha \) with other observables, leading to a more accurate determination of \(\alpha \) (which is in particular the typical case of global CKM fits).

Assuming the validity of the Standard Model and taking as an input the indirect determination of \(\alpha \) from the global CKM fit, we used the observables in \(B\rightarrow \pi \pi \), \(B\rightarrow \rho \pi \) and \(B\rightarrow \rho \rho \) decays to extract information on ratios of hadronic amplitudes (penguin-to-tree and colour-suppressed), finding results in broad agreement with expectations from QCD factorisation, apart from the ratio of the colour-suppressed to colour-allowed tree \(\pi \pi \) contributions: indeed both the phase and the modulus of this ratio do not agree well with theoretical expectations. It would be interesting to widen this discussion and see how various theoretical approaches to non-leptonic two-body decays can reproduce the patterns of hadronic amplitudes that we have extracted from the data.

Under the same hypotheses, we have also performed the indirect determinations of observables of interest (using all the other measurements available), comparing the pulls for those already observed and predicting the values of the remaining ones. The compatibility between direct measurements and indirect determinations is very good for the observables in the \(B\rightarrow \pi \pi \) and \(B\rightarrow \rho \rho \) systems, whereas we could identify a subset of \(B\rightarrow \rho \pi \) observables likely to be responsible for the discrepancies observed with respect to the Standard Model expectations in these modes. Among many other quantities, we have predicted the yet-to-be-measured mixing-induced \(\mathcal{CP}\) asymmetry in the \(B^0\!\rightarrow \pi ^0\pi ^0\) decay; see Eq. (33).

Finally, we have performed a prospective study to analyse how improved measurements for some subsets of observables can improve the uncertainty of \(\alpha \). In particular, we have noticed that an improved accuracy for the time-dependent asymmetries in \(B^0\rightarrow \rho ^0\rho ^0\) and the measurement of \(\mathcal{S} ^{00}_{\pi \pi }\) would reduce the uncertainty on \(\alpha \) in a noticeable way.

We have seen that the extraction of \(\alpha \) is now possible to a high accuracy, using many different channels and experimental sources. It would be very interesting to measure the remaining observables that we can accurately predict using the data already available. The current accuracy reached by \(\alpha \) makes it a particularly useful constraint both for the Standard Model and for searches of New Physics, although the theoretical computations for these transitions remain challenging. Some of these channels will be improved by the LHCb experiment. In addition, the advent of the Belle-II experiment will certainly lead to new and improved measurements for a large set of branching ratios and \(\mathcal{CP}\) asymmetries, providing an opportunity to understand better the results obtained for the \(B\rightarrow \rho \pi \) system and allowing us to extract hadronic parameters with a higher accuracy. Both avenues should be highly beneficial for the upcoming studies of flavour physics in the quark sector.