Anatomy of B → DD̄ decays

The decays Bd0 → Dd−Dd+ and Bs0 → Ds−Ds+ probe the CP-violating mixing phases ϕd and ϕs, respectively. The theoretical uncertainty of the corresponding determinations is limited by contributions from penguin topologies, which can be included with the help of the U-spin symmetry of the strong interaction. We analyse the currently available data for Bd,s0 → Dd,s−Dd,s+ decays and those with similar dynamics to constrain the involved non-perturbative parameters. Using further information from semileptonic Bd0 → Dd−ℓ + νℓ decays, we perform a test of the factorisation approximation and take non-factorisable SU(3)-breaking corrections into account. The branching ratios of the Bd0 → Dd−Dd+, Bs0 → Ds−Dd+ and Bs0 → Ds−Ds+, Bd0 → Dd−Ds+ decays show an interesting pattern which can be accommodated through significantly enhanced exchange and penguin annihilation topologies. This feature is also supported by data for the Bs0 → Dd−Dd+ channel. Moreover, there are indications of potentially enhanced penguin contributions in the Bd0 → Dd−Dd+ and Bs0 → Ds−Ds+ decays, which would make it mandatory to control these effects in the future measurements of ϕd and ϕs. We discuss scenarios for high-precision measurements in the era of Belle II and the LHCb upgrade.

1 Introduction

CP-violating effects offer important tools to search for new physics (NP) beyond the Standard Model (SM). In this endeavour, $B^0_q$–$\bar B^0_q$ mixing ($q = d, s$) is a key player. This phenomenon does not arise at the tree level in the SM and may induce interference effects between oscillation and decay processes, resulting in "mixing-induced" CP violation. The BaBar and Belle experiments at the $e^+e^-$ B-factories and the LHCb experiment at the Large Hadron Collider (LHC) have already performed high-precision measurements of the $B^0_d$–$\bar B^0_d$ and $B^0_s$–$\bar B^0_s$ mixing phases $\phi_d$ and $\phi_s$, respectively [1,2]. In the era of Belle II [4] and the LHCb upgrade [3], the experimental analysis will be pushed towards new frontiers of precision.

JHEP07(2015)108
In this paper, we present an analysis of the decays $B^0_d\to D^-_dD^+_d$ and $B^0_s\to D^-_sD^+_s$, which are related to each other through the U-spin symmetry of strong interactions [5,6]. With the help of this flavour symmetry, penguin effects can be included in the determination of $\phi_d$ and $\phi_s$ from the mixing-induced CP asymmetries of these decays. The theoretical precision is limited by non-factorisable U-spin-breaking effects. The impact of these contributions can be probed in a clean way by comparing branching ratio measurements of the non-leptonic decays with data from the semileptonic $B^0_d\to D^-_d\ell^+\nu_\ell$ and $B^0_s\to D^-_s\ell^+\nu_\ell$ decays. The use of the latter two modes is a new element in this strategy. We also explore the role of exchange and penguin annihilation topologies, which govern the decays $B^0_d\to D^-_sD^+_s$ and $B^0_s\to D^-_dD^+_d$ [7]. These modes are also related to each other by the U-spin symmetry of strong interactions.
The analysis of the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ system complements the determination of $\phi_d$ and $\phi_s$ from the decays $B^0_d\to J/\psi K^0_{\rm S}$ and $B^0_s\to J/\psi\phi$, respectively, where penguin effects have to be included as well [5,8–18]. The dynamics of the $B\to D\bar D$ modes differ from those of the $B\to J/\psi X$ channels. In the latter case, the QCD penguins require a colour-singlet exchange and are suppressed by the Okubo-Zweig-Iizuka (OZI) rule [19–21], while this feature does not apply to the electroweak (EW) penguins, which are colour-allowed and hence contribute significantly to the decay amplitudes [22]. On the other hand, the QCD penguins are not OZI suppressed in the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ system, whereas the EW penguins contribute only in colour-suppressed form. The EW penguin sector offers an interesting avenue for NP to enter weak meson decays [23–26], such as in models with extra $Z'$ bosons [27,28]. Should the $Z'$ bosons have CP-violating flavour-changing couplings to quarks, this would lead to discrepancies in the determined values of $\phi_d$ and $\phi_s$.
The outline of this paper is as follows: in section 2, we discuss the decay amplitude structure of the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ decays and their observables, while we turn to the picture emerging from the current data in section 3. There, we include additional decay modes, which have dynamics similar to the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ system, to address the importance of exchange and penguin annihilation topologies, and probe non-factorisable effects by means of the differential $B^0_d\to D^-_d\ell^+\nu_\ell$ rate. We perform a global analysis of the penguin parameters $a$ and $\theta$, which allows us to extract $\phi_d$ and $\phi_s$ from measurements of CP violation in the $B^0_d\to D^-_dD^+_d$ and $B^0_s\to D^-_sD^+_s$ modes, respectively. The current uncertainties of these measurements are unfortunately still very large. In section 4, we focus on the era of Belle II and the LHCb upgrade, and explore the prospects by discussing different scenarios. Finally, we summarise our conclusions in section 5. In an appendix, we give a summary of the various parameters and observables used in our analysis.
2 Decay amplitudes and observables

In the SM, the $B^0_d\to D^-_dD^+_d$ decay amplitude takes the following form [5]:

$$A(B^0_d\to D^-_dD^+_d) = -\lambda\,\mathcal{A}\left[1 - a\,e^{i\theta}e^{i\gamma}\right], \qquad (2.1)$$

where $\gamma$ serves as a CP-violating weak phase and is the usual angle of the unitarity triangle (UT) of the Cabibbo-Kobayashi-Maskawa (CKM) matrix [29,30], while

$$\mathcal{A} \equiv \lambda^2 A\left[T + E + \{P^{(c)} + PA^{(c)}\} - \{P^{(t)} + PA^{(t)}\}\right] \qquad (2.2)$$

and

$$a\,e^{i\theta} \equiv R_b\left[\frac{\{P^{(u)} + PA^{(u)}\} - \{P^{(t)} + PA^{(t)}\}}{T + E + \{P^{(c)} + PA^{(c)}\} - \{P^{(t)} + PA^{(t)}\}}\right] \qquad (2.3)$$

are CP-conserving hadronic parameters. Here $T$ and $P^{(q)}$ denote the strong amplitudes of the (colour-allowed) tree and penguin topologies (with internal $q$-quark exchanges), respectively, which can be expressed in terms of hadronic matrix elements of the corresponding low-energy effective Hamiltonian. We have also included the amplitudes describing exchange $E$ and penguin annihilation $PA^{(q)}$ topologies, which are naively expected to play a minor role [31]. However, we find that the current data imply sizeable contributions for $E + PA^{(c)} - PA^{(t)}$ with respect to $T + P^{(c)} - P^{(t)}$. The parameter

$$R_b \equiv \left(1 - \frac{\lambda^2}{2}\right)\frac{1}{\lambda}\left|\frac{V_{ub}}{V_{cb}}\right| \qquad (2.4)$$

measures the side of the UT that starts at the origin of the complex plane and makes the angle $\gamma$ with the real axis, while $\lambda \equiv |V_{us}| = 0.22548 \pm 0.00068$ is the Wolfenstein parameter of the CKM matrix [32], and $A \equiv |V_{cb}|/\lambda^2 = 0.806 \pm 0.017$; the numerical values refer to the analysis of ref. [33].
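As a quick numerical cross-check of the CKM inputs quoted above, the following minimal sketch evaluates $|V_{cb}| = A\lambda^2$ from the central values in the text. It also evaluates the CKM suppression factor $\epsilon$, assuming the standard definition $\epsilon \equiv \lambda^2/(1-\lambda^2)$; the function names are illustrative only, and uncertainties are not propagated.

```python
# Central values of the Wolfenstein parameters quoted in the text.
LAMBDA = 0.22548  # lambda = |V_us|
A_WOLF = 0.806    # A = |V_cb| / lambda^2


def v_cb(lam: float = LAMBDA, a_wolf: float = A_WOLF) -> float:
    """|V_cb| obtained by inverting the definition A = |V_cb| / lambda^2."""
    return a_wolf * lam**2


def epsilon(lam: float = LAMBDA) -> float:
    """CKM suppression factor; the definition lambda^2 / (1 - lambda^2)
    is an assumption of this sketch (standard convention)."""
    return lam**2 / (1.0 - lam**2)


if __name__ == "__main__":
    print(f"|V_cb|  = {v_cb():.5f}")
    print(f"epsilon = {epsilon():.5f}")
```

The smallness of $\epsilon$ (about 5%) is what suppresses the penguin term in the $B^0_s\to D^-_sD^+_s$ amplitude relative to its $B^0_d\to D^-_dD^+_d$ counterpart.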
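The decay $B^0_s\to D^-_sD^+_s$ originates from $\bar b\to\bar c c\bar s$ quark-level processes. Its SM transition amplitude can be written as

$$A(B^0_s\to D^-_sD^+_s) = \left(1 - \frac{\lambda^2}{2}\right)\mathcal{A}'\left[1 + \epsilon\,a'e^{i\theta'}e^{i\gamma}\right], \qquad (2.5)$$

where the hadronic parameters $\mathcal{A}'$ and $a'e^{i\theta'}$ are given by expressions which are analogous to those in eqs. (2.2) and (2.3), respectively. The key difference in the structure of the $B^0_s\to D^-_sD^+_s$ decay amplitude with respect to eq. (2.1) is the suppression of the $a'e^{i\theta'}e^{i\gamma}$ term by the tiny CKM parameter

$$\epsilon \equiv \frac{\lambda^2}{1-\lambda^2}. \qquad (2.6)$$

Moreover, the overall factor of $\lambda$ is absent, thereby enhancing the decay rate with respect to $B^0_d\to D^-_dD^+_d$. The U-spin symmetry of strong interactions implies the following relations between the hadronic parameters:

$$a'e^{i\theta'} = a\,e^{i\theta}, \qquad (2.7)$$

$$\mathcal{A}' = \mathcal{A}. \qquad (2.8)$$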
It is important to emphasise that hadronic form factors and decay constants cancel within factorisation in $a\,e^{i\theta}$ and $a'e^{i\theta'}$, since these quantities are defined as ratios of hadronic amplitudes, as can be seen in eq. (2.3). Consequently, factorisable U-spin-breaking corrections to the relation in eq. (2.7) vanish [5,6]. On the other hand, the U-spin relation in eq. (2.8) is affected by SU(3)-breaking effects (see footnote 1) in $B_q\to D_q$ form factors and $D_q$ decay constants ($q = d, s$). We discuss these effects in more detail later.

CP-violating asymmetries
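Due to $B^0_q$–$\bar B^0_q$ oscillations ($q = d, s$), an initially present $B^0_q$-meson state evolves in time into a linear combination of $B^0_q$ and $\bar B^0_q$ states. CP violation in the $B^0_q\to D^-_qD^+_q$ decays, which are characterised by CP-even final states, is probed through the following time-dependent rate asymmetries [34]:

$$\frac{\Gamma(B^0_q(t)\to f) - \Gamma(\bar B^0_q(t)\to f)}{\Gamma(B^0_q(t)\to f) + \Gamma(\bar B^0_q(t)\to f)} = \frac{A^{\rm dir}_{\rm CP}\cos(\Delta M_q t) + A^{\rm mix}_{\rm CP}\sin(\Delta M_q t)}{\cosh(\Delta\Gamma_q t/2) + A_{\Delta\Gamma}\sinh(\Delta\Gamma_q t/2)}, \qquad (2.9)$$

where $\Delta M_q \equiv M^{(q)}_{\rm H} - M^{(q)}_{\rm L}$ and $\Delta\Gamma_q \equiv \Gamma^{(q)}_{\rm L} - \Gamma^{(q)}_{\rm H}$ denote the mass and decay width difference between the two $B_q$ mass eigenstates, respectively. The three CP observables (see footnote 2) are given by

$$A^{\rm dir}_{\rm CP}(B_q\to f) = \frac{2\,b\sin\rho\sin\gamma}{1 - 2b\cos\rho\cos\gamma + b^2}, \qquad (2.10)$$

$$A^{\rm mix}_{\rm CP}(B_q\to f) = \eta_q\left[\frac{\sin\phi_q - 2b\cos\rho\sin(\phi_q+\gamma) + b^2\sin(\phi_q+2\gamma)}{1 - 2b\cos\rho\cos\gamma + b^2}\right], \qquad (2.11)$$

$$A_{\Delta\Gamma}(B_q\to f) = -\eta_q\left[\frac{\cos\phi_q - 2b\cos\rho\cos(\phi_q+\gamma) + b^2\cos(\phi_q+2\gamma)}{1 - 2b\cos\rho\cos\gamma + b^2}\right], \qquad (2.12)$$

where we have to make the following replacements for the decays at hand [5]:

$$B^0_d\to D^-_dD^+_d:\ \ b\,e^{i\rho}\to a\,e^{i\theta}, \qquad B^0_s\to D^-_sD^+_s:\ \ b\,e^{i\rho}\to -\epsilon\,a'e^{i\theta'}. \qquad (2.13)$$

The parameter $\eta_q$ denotes the CP eigenvalue of the final state and is given by $+1$. While the direct CP asymmetries $A^{\rm dir}_{\rm CP}(B_q\to D^-_qD^+_q)$ are caused by interference between tree and penguin contributions, the mixing-induced CP asymmetries $A^{\rm mix}_{\rm CP}(B_q\to D^-_qD^+_q)$ originate from interference between $B^0_q$–$\bar B^0_q$ mixing and decay processes, and depend on the mixing phases $\phi_d$ and $\phi_s$. These quantities take the general forms

$$\phi_d = 2\beta + \phi^{\rm NP}_d, \qquad \phi_s = -2\beta_s + \phi^{\rm NP}_s,$$

where $\beta$ is the usual angle of the UT. The SM value of $\phi_s$, which is given by $-2\beta_s = -2\lambda^2\eta$ and hence doubly Cabibbo-suppressed, can be determined with high precision from SM fits of the UT [33]. The CP-violating phases $\phi^{\rm NP}_q$ vanish in the SM and allow us to take NP contributions to $B^0_q$–$\bar B^0_q$ mixing into account. It is useful to introduce "effective mixing phases" through the following expression [13,17]:

$$\sin\phi^{\rm eff}_q \equiv \frac{\eta_q\,A^{\rm mix}_{\rm CP}(B_q\to D^-_qD^+_q)}{\sqrt{1 - \left(A^{\rm dir}_{\rm CP}(B_q\to D^-_qD^+_q)\right)^2}} = \sin(\phi_q + \Delta\phi_q), \qquad (2.17)$$

where the hadronic "penguin phase shifts" $\Delta\phi_q$ depend on the penguin parameters and on $\gamma$. In the limit $a = a' = 0$, we simply have $\Delta\phi_q = 0$, i.e. $\phi^{\rm eff}_q = \phi_q$.

The penguin parameter $a\,e^{i\theta}$ cannot be calculated reliably within QCD. Since this quantity is governed by the ratio of a penguin amplitude to a colour-allowed tree amplitude, it is plausible to expect $a\sim 0.1$–$0.2$. Applying the Bander-Silverman-Soni mechanism [35] and the formalism developed in refs. [36,37] yields an estimate of this size [6] (eq. (2.22)). In the corresponding calculation, form factors and decay constants cancel as the parameter $a\,e^{i\theta}$ is actually defined as a ratio of hadronic amplitudes, as we emphasised after eq. (2.8). However, incalculable long-distance contributions, which can be considered as long-distance penguins with up- and charm-quark exchanges [38], as illustrated in figure 2, may have an impact on $a\,e^{i\theta}$.

Footnote 1: SU(3) symmetry refers to the symmetry group interchanging $u$, $d$ and $s$ quarks. The isospin, U-spin and V-spin subgroups refer to interchanging $u\leftrightarrow d$, $d\leftrightarrow s$, and $s\leftrightarrow u$, respectively. Throughout the paper the mention of SU(3) refers to the U-spin subgroup, unless specified otherwise.

Footnote 2: Whenever information from both $B^0_q\to f$ and $\bar B^0_q\to f$ decays is needed to determine an observable, as is the case for CP asymmetries or untagged branching ratios, we use the notation $B_d$ and $B_s$.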
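The relation between the penguin parameters, the CP asymmetries and the effective mixing phase can be illustrated numerically. The sketch below assumes the conventional amplitude structure $A \propto 1 - b\,e^{i\rho}e^{i\gamma}$ and the observable $\xi = -\eta\,e^{-i\phi}\,\bar A/A$, from which the three CP observables follow; the function names and the input values in the example are illustrative assumptions, not results of the analysis.

```python
import cmath
import math


def cp_observables(b, rho, gamma, phi, eta=1.0):
    """Return (A_dir, A_mix, A_dGamma) for an amplitude proportional to
    1 - b e^{i rho} e^{i gamma} (assumed convention of this sketch)."""
    amp = 1.0 - b * cmath.exp(1j * (rho + gamma))      # B^0 -> f
    amp_bar = 1.0 - b * cmath.exp(1j * (rho - gamma))  # CP-conjugate process
    xi = -eta * cmath.exp(-1j * phi) * amp_bar / amp
    norm = 1.0 + abs(xi) ** 2
    a_dir = (1.0 - abs(xi) ** 2) / norm
    a_mix = 2.0 * xi.imag / norm
    a_dgamma = 2.0 * xi.real / norm
    return a_dir, a_mix, a_dgamma


def effective_phase(a_dir, a_mix, eta=1.0):
    """Effective mixing phase from sin(phi_eff) = eta A_mix / sqrt(1 - A_dir^2).
    Only the principal value of the arcsine is returned."""
    return math.asin(eta * a_mix / math.sqrt(1.0 - a_dir**2))


if __name__ == "__main__":
    # Hypothetical inputs, chosen for illustration only.
    b, rho = 0.15, math.radians(200.0)
    gamma, phi = math.radians(70.0), math.radians(43.0)
    a_dir, a_mix, a_dg = cp_observables(b, rho, gamma, phi)
    shift = effective_phase(a_dir, a_mix) - phi
    print(f"A_dir = {a_dir:+.4f}, A_mix = {a_mix:+.4f}, A_dGamma = {a_dg:+.4f}")
    print(f"penguin phase shift = {math.degrees(shift):+.2f} deg")
```

By construction the three observables satisfy $(A^{\rm dir}_{\rm CP})^2 + (A^{\rm mix}_{\rm CP})^2 + (A_{\Delta\Gamma})^2 = 1$, and the phase shift vanishes for $b = 0$. The arcsine leaves a discrete ambiguity in $\phi^{\rm eff}_q$, which this sketch does not resolve.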

In this paper, we discuss strategies to control these effects by means of experimental data. In the $B^0_d\to D^-_dD^+_d$ case (eq. (2.20)), the penguin effects have to be taken into account for the determination of $\phi_d$. In the $B^0_s\to D^-_sD^+_s$ case (eq. (2.21)), the parameter $a'$ is associated with the tiny factor $\epsilon$ and is hence doubly Cabibbo-suppressed. However, in view of the experimental precision in the LHCb upgrade era, these effects also have to be controlled.

Untagged decay rate information
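For the analysis of the experimental data later on, it is useful to introduce another observable, containing the untagged rate information. It is defined as [5,6]:

$$H \equiv \frac{1}{\epsilon}\left|\frac{\mathcal{A}'}{\mathcal{A}}\right|^2\left[\frac{m_{B_d}}{m_{B_s}}\,\frac{\Phi(m_{D_s}/m_{B_s}, m_{D_s}/m_{B_s})}{\Phi(m_{D_d}/m_{B_d}, m_{D_d}/m_{B_d})}\,\frac{\tau_{B_s}}{\tau_{B_d}}\right]\frac{\mathcal{B}(B_d\to D^-_dD^+_d)_{\rm theo}}{\mathcal{B}(B_s\to D^-_sD^+_s)_{\rm theo}}, \qquad (2.25)$$

where

$$\Phi(x, y) = \sqrt{\left[1 - (x+y)^2\right]\left[1 - (x-y)^2\right]}$$

is the well-known $B\to PP$ phase-space function. Due to the sizeable lifetime difference in the $B_s$-meson system, $y_s \equiv \Delta\Gamma_s/2\Gamma_s = 0.0608 \pm 0.0045$ [2], a difference arises between the "theoretical" branching ratio defined through the untagged decay rate at time $t = 0$ [5] and the "experimental" branching ratio which is extracted from the time-integrated untagged rate [39]. They can be related as [40]

$$\mathcal{B}(B_s\to D^-_sD^+_s)_{\rm theo} = \left[\frac{1 - y_s^2}{1 + A_{\Delta\Gamma}(B_s\to D^-_sD^+_s)\,y_s}\right]\mathcal{B}(B_s\to D^-_sD^+_s)_{\rm exp},$$

where the numerical evaluation uses the value of $A_{\Delta\Gamma}(B_s\to D^-_sD^+_s)$ extracted from the measurement of the effective $B^0_s\to D^-_sD^+_s$ lifetime [6,41]. The observable $H$ takes the following form in terms of the penguin parameters [5]:

$$H = \frac{1 - 2a\cos\theta\cos\gamma + a^2}{1 + 2\epsilon a'\cos\theta'\cos\gamma + \epsilon^2a'^2}. \qquad (2.29)$$

Moreover, the U-spin relation in eq. (2.7) implies

$$H = -\frac{1}{\epsilon}\left[\frac{A^{\rm dir}_{\rm CP}(B_s\to D^-_sD^+_s)}{A^{\rm dir}_{\rm CP}(B_d\to D^-_dD^+_d)}\right]. \qquad (2.30)$$

Using eq. (2.7) and keeping $a$ and $\theta$ as free parameters, the expression in eq. (2.29) results in a lower bound on $H$ [42,43]. Moreover, $H$ allows us to put a lower bound on the penguin parameter $a$ (eq. (2.32)). The signs in this bound have been chosen in such a way that the expression applies to the current experimental situation discussed in section 3.4.

Figure 3. Flow chart illustrating the new strategy to determine $H$ using data from semileptonic decays.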
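The statements above can be made concrete with a short numerical sketch. It evaluates $H$ from eq. (2.29) under the U-spin relation $a' = a$, $\theta' = \theta$, converts an "experimental" into a "theoretical" branching ratio, and finds the smallest $a$ compatible with a given $H$ by scanning $\theta$ and solving the quadratic equation in $a$ that follows from eq. (2.29). The value of $\epsilon$, the inputs in the example, and all function names are assumptions made for illustration; the scan is a numerical stand-in for the analytic bound, not the paper's formula.

```python
import math

EPS = 0.0536  # assumed value of epsilon = lambda^2 / (1 - lambda^2)


def h_observable(a, theta, gamma, eps=EPS):
    """H of eq. (2.29), assuming the U-spin relation a' = a, theta' = theta."""
    u = a * math.cos(theta) * math.cos(gamma)
    return (1.0 - 2.0 * u + a * a) / (1.0 + 2.0 * eps * u + eps * eps * a * a)


def br_theo(br_exp, a_dgamma, y_s=0.0608):
    """Convert a time-integrated branching ratio into the t = 0 definition."""
    return (1.0 - y_s**2) / (1.0 + a_dgamma * y_s) * br_exp


def min_a_from_h(h, gamma, eps=EPS, n_theta=3601):
    """Smallest a >= 0 (and its theta) reproducing a given H, found by a
    scan over theta. For fixed theta, eq. (2.29) gives the quadratic
    (1 - eps^2 h) a^2 - 2 (1 + eps h) cos(theta) cos(gamma) a + (1 - h) = 0."""
    best = None
    for k in range(n_theta):
        theta = 2.0 * math.pi * k / (n_theta - 1)
        u = math.cos(theta) * math.cos(gamma)
        qa = 1.0 - eps * eps * h
        qb = -2.0 * (1.0 + eps * h) * u
        qc = 1.0 - h
        disc = qb * qb - 4.0 * qa * qc
        if disc < 0.0:
            continue  # no real solution for this theta
        for sign in (1.0, -1.0):
            a = (-qb + sign * math.sqrt(disc)) / (2.0 * qa)
            if a >= 0.0 and (best is None or a < best[0]):
                best = (a, theta)
    return best


if __name__ == "__main__":
    gamma = math.radians(70.0)  # hypothetical input
    a_min, theta_min = min_a_from_h(1.2, gamma)  # H = 1.2 is hypothetical
    print(f"a >= {a_min:.3f} at theta = {math.degrees(theta_min):.1f} deg")
```

For $H > 1$ and $\cos\gamma > 0$ the minimum is reached at $\theta = 180^\circ$, where the interference term is maximally constructive in the numerator of eq. (2.29).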

Information from semileptonic decays
The experimental determination of $H$ through eq. (2.25) requires information on the amplitude ratio $|\mathcal{A}'/\mathcal{A}|$, which is affected by U-spin-breaking corrections to the relation in eq. (2.8). To avoid the limitations this brings, we propose a new method to determine $H$ using data from semileptonic $B^0_q\to D^-_q\ell^+\nu_\ell$ decays, which is illustrated by the flow chart in figure 3. To this end, we introduce the ratio $R_{D_q}$ of the non-leptonic decay rate to the differential semileptonic rate, where the parameters $b_q$ and $\rho_q$ are given in eq. (2.13); $V_{cq}$ is the relevant CKM matrix element, $f_{D_q}$ denotes the $D_q$-meson decay constant (eq. (2.35)), and $X_{D_q}$ is a kinematical factor involving the hadronic form factors.
The parameter $a^{(q)}_{\rm NF}$ measures non-factorisable effects in the amplitudes defined through eq. (2.2), which we may write in terms of the amplitude of the colour-allowed tree topology in factorisation (eq. (2.39)), with $G_{\rm F}$ denoting Fermi's constant. In naive factorisation, $a^{(q)}_{\rm NF} = a_1$, where $a_1$ represents the appropriate combination of Wilson coefficient functions of the current-current operators. We have $\mathcal{A}\equiv\mathcal{A}_d$ and $\mathcal{A}'\equiv\mathcal{A}_s$, and shall suppress the label $q$ in the following discussion for simplicity. Introducing the abbreviations

$$P^{(ct)} \equiv P^{(c)} - P^{(t)}, \qquad PA^{(ct)} \equiv PA^{(c)} - PA^{(t)}, \qquad (2.40)$$

we obtain

$$a_{\rm NF} = a^{T}_{\rm NF}\,(1 + r_P)(1 + x), \qquad (2.41)$$

where

$$r_P \equiv \frac{P^{(ct)}}{T}$$

measures the importance of the penguin topologies with respect to the colour-allowed tree amplitude. On the other hand,

$$x \equiv |x|e^{i\sigma} \equiv \frac{E + PA^{(ct)}}{T + P^{(ct)}} \qquad (2.43)$$

probes the importance of the exchange and penguin annihilation topologies. We will return to $x$ in subsection 3.3.2, where we determine $|x|$ and the CP-conserving strong phase $\sigma$ from experimental data. The parameter $a^{T}_{\rm NF}$ describes the non-factorisable corrections to the "tree" diagram (eq. (2.39)), parametrised through $\Delta^{T}_{\rm NF}$ (eq. (2.44)), with $\Delta^{T}_{\rm NF} = 0$ for exact factorisation. Finally, the observable $H$ can be expressed in terms of the ratios $R_{D_q}$ (eq. (2.45)).

In comparison with eq. (2.25), the advantage is that the theoretical precision is now only limited by non-factorisable U-spin-breaking effects. Moreover, as the $R_{D_q}$ are ratios of $B_q$ rates, the dependence on the ratio of fragmentation functions $f_s/f_d$, which is needed for normalisation purposes [51], drops out in this expression. It is instructive to have a closer look at the non-factorisable U-spin-breaking effects entering eq. (2.45), although we will constrain them through experimental data. We obtain the following expression:

$$\left|\frac{a^{(s)}_{\rm NF}}{a^{(d)}_{\rm NF}}\right| = \left|\frac{1 + r^{(s)}_P}{1 + r^{(d)}_P}\right|\left|\frac{1 + x^{(s)}}{1 + x^{(d)}}\right|\left|\frac{a^{T(s)}_{\rm NF}}{a^{T(d)}_{\rm NF}}\right|. \qquad (2.46)$$

Using heavy-meson chiral perturbation theory and the $1/N_C$ expansion, non-factorisable SU(3)-breaking corrections to the colour-allowed tree amplitudes of $B^0_q\to D_qD_p$ decays were found at the level of a few percent in ref. [52], suggesting small corrections from the last factor. The U-spin relation between the $B^0_s\to D^-_sD^+_s$ and $B^0_d\to D^-_dD^+_d$ decays is reflected by the one-to-one correspondence of the $r^{(q)}_P$ and $x^{(q)}$ terms. These contributions enter eq. (2.46) only in ratios of terms with structures $1 + \Lambda^{(q)}$, where we expect the $\Lambda^{(q)}$ to be at most $O(0.2)$. Assuming SU(3)-breaking at the 30% level for the $\Lambda^{(q)}$ terms yields a correction of only $O(5\%)$ for the ratios, i.e. a robust situation.
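The size of this correction follows from simple arithmetic, sketched below for real $\Lambda^{(q)}$. The function name is hypothetical, and the treatment of the breaking as a relative shift $\Lambda^{(s)} = \Lambda^{(d)}(1 + \delta)$ is an assumption of this illustration.

```python
def ratio_shift(lam: float, breaking: float) -> float:
    """Relative deviation of (1 + lam') / (1 + lam) from unity,
    with lam' = lam * (1 + breaking); real parameters assumed."""
    lam_prime = lam * (1.0 + breaking)
    return (1.0 + lam_prime) / (1.0 + lam) - 1.0


if __name__ == "__main__":
    # Lambda = 0.2 with 30% SU(3) breaking in either direction.
    for delta in (+0.3, -0.3):
        print(f"breaking {delta:+.0%}: ratio shifts by {ratio_shift(0.2, delta):+.1%}")
```

With $\Lambda = 0.2$ and 30% breaking, the ratio $(1 + \Lambda^{(s)})/(1 + \Lambda^{(d)})$ deviates from one by $0.06/1.2 = 5\%$, in line with the estimate in the text.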
Let us now exploit experimental data to probe these effects. Using the ratio $R_{D_s}$, we may actually determine $|a^{(s)}_{\rm NF}|$, which is by definition a real parameter; the corrections due to the $a'$ term are at most at the level of a few percent. Consequently, the information from the semileptonic differential rate allows us to quantify the non-factorisable U-spin-breaking corrections to the determination of $H$ (eq. (2.45)).
Let us now return to the remaining quantities entering eq. (2.45). The $D_q$-meson decay constants ($q = d, s$) can be extracted from leptonic decays. The current experimental status, summarised in ref. [53], is given in eq. (2.51).

A detailed overview of the status of lattice QCD calculations has been given by the FLAG Working Group in ref. [54].
In the infinite quark-mass limit, a consistency relation arises [55] (eq. (2.52)). For a discussion of QCD and $\Lambda_{\rm QCD}/m_Q$ corrections to this relation we refer the reader to refs. [55–57]. There has recently been impressive progress in the calculation of hadronic form factors within lattice QCD, where the first unquenched calculations of the $\bar B\to D\ell\bar\nu_\ell$ form factors at non-zero recoil are now available [58,59]. Using the results of ref. [59], we obtain the form-factor ratio in eq. (2.53), while the expression in eq. (2.52) gives 0.924, thereby indicating small corrections. In the numerical analysis in this paper, we will use the result in eq. (2.53).

Experimental data for the semileptonic decay $B^0_s\to D^-_s\ell^+\nu_\ell$ are not yet available. It might be feasible to distinguish the relevant semileptonic channels through the shifted invariant mass spectrum of the $D^+_s\mu^-$ combinations and the difference in the missing reconstructed mass, which is correlated to the "corrected mass" as illustrated in ref. [60]. Combined with a fit to the angular distributions, this gives information on the different form factors. We encourage the LHCb and Belle II collaborations to add this channel to their experimental agenda and to perform detailed studies for the upgrade era. On the other hand, the differential rate of the $B^0_d\to D^-_d\ell^+\nu_\ell$ mode has already been measured and will be used in the next section to estimate the non-factorisable effects in $B^0_d$ decays. The lack of experimental data on semileptonic $B^0_s$ decays can be circumvented by studying the ratio of other $B^0_d$ and $B^0_s$ decays, discussed in the next section, and applying the SU(3) flavour symmetry.

3 Picture emerging from the current data

Overview
The main objective of this analysis is the determination of the $B^0_q$–$\bar B^0_q$ mixing phases $\phi_d$ and $\phi_s$. High-precision determinations of these phases require us to control not only the contributions from penguin topologies, but also the impact of additional decay topologies and non-factorisable effects. The latter two aspects cannot be quantified using information from the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ system alone; additional decay modes therefore need to be studied. An overview of the different decay modes discussed in this section and their applications is given in table 1.

Table 1. Overview of the various topologies contributing to the $B\to D\bar D$ decays. The naming convention is indicated in the second column. (Columns: Decay; amplitude $\mathcal{A}$; Topologies; Used for.)

Preliminaries
The direct and mixing-induced CP asymmetries of the $B^0_d\to D^-_dD^+_d$ decay and the $H$ observable, when using the U-spin relation in eq. (2.7), depend on the four parameters $a$, $\theta$, $\phi_d$ and $\gamma$. In 1999, when this decay was originally suggested by one of us, the determination of the UT angle $\gamma$ was the main goal. The proposed strategy therefore assumed input on $\phi_d$ and $H$ [5]. However, at present it is possible to extract $\gamma$ in a powerful way through pure $B\to D^{(*)}K^{(*)}$ tree decays. Using current data for these channels, the CKMfitter and UTfit collaborations have obtained averages for $\gamma$. For the numerical analysis in this paper, we shall use the CKMfitter result. By the time of the Belle II and LHCb upgrade era, much more precise measurements of $\gamma$ from pure tree decays will be available (see section 4). Using $\gamma$ as an input, we may instead determine $\phi_d$ and the penguin parameters from $H$ and the CP asymmetries of the $B^0_d\to D^-_dD^+_d$ decay. The penguin parameters thus determined allow us to take their effects into account in the determination of $\phi_s$ from the mixing-induced CP asymmetry $A^{\rm mix}_{\rm CP}(B_s\to D^-_sD^+_s)$.

Comparing B → DD branching fractions
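To quantify the contributions from additional decay topologies and the impact of non-factorisable effects in the $B^0_d\to D^-_dD^+_d$, $B^0_s\to D^-_sD^+_s$ system, we need to extend the decay basis to modes with similar dynamics. If we replace the spectator quarks correspondingly, we obtain the $B^0_s\to D^-_sD^+_d$ and $B^0_d\to D^-_dD^+_s$ decays, which are characterised by the following decay amplitudes:

$$A(B^0_s\to D^-_sD^+_d) = -\lambda\,\tilde{\mathcal{A}}\left[1 - \tilde a\,e^{i\tilde\theta}e^{i\gamma}\right],$$

$$A(B^0_d\to D^-_dD^+_s) = \left(1 - \frac{\lambda^2}{2}\right)\tilde{\mathcal{A}}'\left[1 + \epsilon\,\tilde a'e^{i\tilde\theta'}e^{i\gamma}\right],$$

where $\tilde{\mathcal{A}}$ and $\tilde a\,e^{i\tilde\theta}$ take expressions analogous to those in eqs. (2.2) and (2.3). If we use the U-spin flavour symmetry, we obtain the following relations:

$$\tilde a'e^{i\tilde\theta'} = \tilde a\,e^{i\tilde\theta}, \qquad \tilde{\mathcal{A}}' = \tilde{\mathcal{A}}.$$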
Moreover, there are the charged decays B + → D 0 D + d and B + → D 0 D + s , which are again related to each other through the U -spin symmetry. These modes also do not receive contributions from exchange and penguin annihilation topologies. However, there are additional contributions from annihilation topologies, as illustrated in figure 4, which enter with the same CKM factor as the penguin contributions with up-quark exchanges.

Probing annihilation topologies with charged B decays
The decay amplitudes of the charged modes include an annihilation contribution (eq. (3.11)), which is parametrised by $r_A$ (eq. (3.12)). It is useful to introduce two ratios of decay amplitudes (see eq. (3.14)), which follow from the expressions for the decay amplitudes given above (eqs. (3.15) and (3.16)). If we apply the SU(3) flavour symmetry (actually the V-spin subgroup), we obtain the relation in eq. (3.17), while the isospin symmetry of strong interactions implies a corresponding relation for the other pair of modes. If we neglect the annihilation contribution in eq. (3.11) and assume the same penguin contributions in eq. (3.15), i.e. $\tilde a = \tilde a_c$, the corresponding ratio equals one. A deviation from unity of this ratio would therefore imply either the presence of non-zero annihilation contributions or large SU(3)-breaking effects through eq. (3.17). In the case of eq. (3.16), the penguin parameters are suppressed by the tiny factor $\epsilon$ and hence play a negligible role. Consequently, this ratio essentially relies on the strong isospin symmetry.

The current experimental results compiled by the Particle Data Group (PDG) [1] determine both ratios. For the last decay combination, we may also employ the direct measurement of the ratio of the relevant branching fractions [62], which has a significantly smaller uncertainty with respect to eq. (3.26) thanks to a cancellation of uncertainties in the directly measured ratio of branching fractions. We note the deviation from one at the 2.4σ level, which is unexpected.

Probing exchange and penguin annihilation topologies
The current PDG results for the CP-averaged branching ratios of the B 0 s → D − s D + s decays are given as follows [1]:  respectively (see table 1), it is possible that the puzzling pattern of the data is actually due to the presence of these exchange and penguin annihilation contributions. Let us first have a closer look at the ratio of the amplitudes of the B 0 measures, in analogy to the parameter x introduced in eq. (2.43), the importance of the exchange and penguin annihilation topologies with respect to the dominant tree topology; we use abbreviations as in eq. (2.40). If we neglect the terms with the penguin parameters, which enter with the tiny , and introduce the SU(3)-breaking parameter we obtain the relation The parameter is only affected by SU (3)-breaking effects entering at the spectator-quark level. Applying factorisation, where P (ct) /T =P (ct) /T (see the comment after eq. (2.22)), we obtain . (3.36) Here we have taken into account the restrictions following for the corresponding B q → D q form factor from the heavy-quark effective theory [47]: where ξ q w q (q 2 ) is the Isgur-Wise function with Studies of the light-quark dependence of the Isgur-Wise function were performed within heavy-meson chiral perturbation theory, indicating an enhancement of ξ s /ξ d at the level of 5% [63]. Applying the same formalism to f Ds /f D d leads to estimates for the value of this ratio of about 1.2 [64], which are in agreement with the experimental results in eq. (2.51).
Since 1992, when these calculations were pioneered, there has been a lot of progress in lattice QCD (for an overview of the state-of-the-art analyses, see ref. [65]). The most recent result for the SU(3)-breaking effects in the form factors reads as follows [66]:

which is in excellent agreement with the picture from heavy-meson chiral perturbation theory. Using this result as an input yields The error quantifies only the uncertainties related to the form factors. We cannot quantify the non-factorisable effects. However, as they enter only at the level of different spectator quarks and as already the leading SU(3)-breaking effects are small, we expect a minor impact. The ratio 5 then takes the following form: thereby fixing a circle forx in the complex plane. The numerical value in eq. (3.41) refers to a direct measurement of the corresponding ratio of branching ratios [62].
For the other decay combination, we obtain In analogy to eq. (3.33), we introduce a parameter which is given in factorisation by neglecting the tiny difference between the form-factor ratios for q 2 = m 2 D d and m 2 Ds . As in eq. (3.40), the error quantifies only the form factor uncertainties.
The penguin parameters do not enter eq. (3.43) with the tiny factor $\epsilon$. However, if we use the SU(3) relation

where decay constants and form factors cancel in factorisation, we get the parameter x was introduced in eq. (2.43). Consequently, we have where the second-order terms are expected to give small corrections at the few-percent level. Introducing the ratio we obtain in analogy to eq. (3.42).
It is interesting to consider the double ratio where we have neglected the |x ( ) | terms and have used eq. (3.46). The experimental results in eqs. (3.41) and (3.51) give which is in agreement with the expectation in eq. (3.53). The current uncertainties are unfortunately too large to draw any further conclusions.

Probing exchange and penguin annihilation topologies directly
The exchange and penguin annihilation topologies can be probed in a direct way by means of the decays $B^0_d\to D^-_sD^+_s$ and $B^0_s\to D^-_dD^+_d$. These modes receive only contributions from exchange and penguin annihilation topologies [7,31], as illustrated in figure 5, and are related to each other through the U-spin symmetry. The current experimental information on the corresponding CP-averaged branching ratios is compiled in ref. [1]. The experimental signal for the $B^0_s\to D^-_dD^+_d$ decay is in accordance with the picture emerging from the discussion given above.

Figure 5. Illustration of exchange and penguin annihilation topologies contributing to the $B^0_d\to D^-_sD^+_s$ and $B^0_s\to D^-_dD^+_d$ decays.
Let us now have a closer look at these decays. Their amplitudes can be written as

where the primed parameters are defined in an analogous way. We obtain then where we have neglected the terms proportional to the tiny factor and introduced the parameter takes then the simple form which fixes a circle with radius |ς x | around the origin in the complex plane.
where we have neglected the penguin annihilation contributions on the right-hand side, and have introduced As in eq. (3.62), we expect that the numerical value describes the leading SU(3)-breaking effect (the uncertainty corresponds only to the decay constants and masses). Non-factorisable SU(3)-breaking contributions to this quantity cannot be estimated at present. For the comparison with the experimental data we introduce which takes the simple form Also in this case it is interesting to consider the double ratio which allows us to test the relatioñ with the corresponding confidence-level contours shown in figure 6. The exchange and penguin annihilation topologies play hence a surprisingly prominent role in the decays at hand, pointing towards large long-distance strong interaction effects. An example of such a contribution to the exchange topology is given by as illustrated in the right panel of figure 2. A similar analysis can be performed for the observables in eqs. (3.51) and (3.67), which allow the determination of |x| andσ. In contrast to the determination of |x | andσ , the penguin effects in the amplitude ratios do not enter with the tiny and lead to additional uncertainties. In figure 7, we show the constraints from the current data, which are still pretty weak. Here we may have long-distance rescattering contributions from processes of the kind In the future, following these lines, the comparison between the values ofx andx will offer yet another test of the relation in eq. (3.70), going beyond eq. (3.69) through information on the strong phases.

Information from branching ratios and non-factorisable effects
Let us now use the currently available data to constrain the penguin parameters. Unfortunately, a measurement of the differential semileptonic $B^0_s\to D^-_s\ell^+\nu_\ell$ rate is not available. Consequently, we may not yet apply eq. (2.45) and have to follow a different avenue, involving larger theoretical uncertainties. In analogy to the $H$ observable defined in eq. (2.25), we introduce the quantities $\tilde H$ and $H_c$, where

$$\tilde H = \frac{1 - 2\tilde a\cos\tilde\theta\cos\gamma + \tilde a^2}{1 + 2\epsilon\tilde a'\cos\tilde\theta'\cos\gamma + \epsilon^2\tilde a'^2}, \qquad (3.76)$$

and $H_c$ takes an analogous form. In combination with the corresponding direct CP asymmetries, they allow the determination of $(\tilde a, \tilde\theta)$ and $(\tilde a_c, \tilde\theta_c)$, respectively. These determinations are analogous to the determination of $(a, \theta)$ from $H$ and the direct CP asymmetry in $B^0_d\to D^-_dD^+_d$ (and the suppressed CP asymmetry in $B^0_s\to D^-_sD^+_s$). The parameters thus determined offer insights into the $r_A$ and $r_{PA}$ parameters introduced in eqs. (3.12) and (3.49), respectively, and allow for a comparison of the relative non-factorisable contributions.
In contrast to the measurements of the CP asymmetries, the extraction of the H,H and H c observables from the data requires knowledge of the following amplitude ratios: These quantities are governed by U -spin-breaking effects in the ratio of the colour-allowed tree contributions, which we may write as where the parameters a T NF describe non-factorisable contributions affecting the colourallowed tree amplitude (see eq. (2.44)). If we assume that all the a T NF parameters are equal to one another due to the SU(3) flavour symmetry, the following relation can be derived: T T where the decay constants and form factors cancel. In terms of branching ratios, using eqs. (3.79)-(3.81), this relation implies The current data give which is consistent with eq. (3.86) within the uncertainties.

Using data for the semileptonic B 0 d → D − d + ν decay, the non-factorisable effects can be probed through corresponding to a non-factorisable contribution where eq. (2.53) has been used for the ratio of form factors. In this result, the penguin effects suppressed by in eq. (3.88) were neglected. It is plausible to interpret the deviation from one at the 2.9 σ level as footprints of sizeable penguin effects. We shall discuss this parameter below. The ratios in eqs. (3.79)-(3.81) can be written in the following forms: which is also graphically illustrated in figure 8. In the above equation, a NF differs from a NF introduced in eq. (2.41) through the (1 + x) term, to account for the contributions from exchange and penguin annihilation topologies, which are absent in the decays of the other two ratios. We thus have Figure 8. Flow chart illustrating the classic strategy to determine H using data from B → DD branching ratio measurements.
As in the discussion in subsection 2.4, it is convenient to write In these relations, SU(3)-breaking effects enter only through different spectator quarks and are expected to be small. In order to determine the SU(3)-breaking effects in the ratio of the colour-allowed tree amplitudes |T /T |, we use again eq. (3.37) to derive the following expression [5]: For the calculation of the numerical value, we have used eq. (3.39) and the values of the decay constants in eq. (2.51). In analogy, we get

In the case of the charged decays, a particularly simple situation arises in factorisation as the form factors cancel: as is also evident from eq. (3.84).
Using the amplitude ratio in eq. (3.94), we can now determine the observable H (see eq. (2.25)). For this, we also have to quantify the uncertainty from the SU(3)-breaking corrections to the term involving the x and x parameters. The analysis discussed in subsection 3.3.2 allows us to accomplish this task. Using the results forx andσ in eq. (3.72) and the relations In figure 9 we show the corresponding uncertainty budgets. Should the relations in eq. (3.100) actually receive corrections, they would affect this error budget. In the future, implementing the strategy proposed in section 2.4, this approximation/assumption is no longer needed. The result on H allows us to put first constraints on the penguin parameters a with the help of the lower bound in eq. (2.32): where we have used the lower value of H at one standard deviation.

Figure 9. Pie charts illustrating the uncertainty budget of the $H$ observables.

Information from CP asymmetries
Let us now add experimental information on CP violation to our analysis. Concerning the decay $B^0_d \to D^-_d D^+_d$, the current status of the measurement of the direct and mixing-induced CP asymmetries is given as follows: The measurements by the BaBar and Belle collaborations are not in good agreement with one another, in particular for the mixing-induced CP asymmetry. HFAG gives the following averages [2]: which have to be taken with great care. It is nevertheless interesting to use these results as input for the strategy discussed above. A $\chi^2$ fit to eq.

Assuming that $|\tilde{a}^T_{\rm NF}| = 1$, i.e. that the colour-allowed tree topologies have negligible non-factorisable contributions, and that all parameters are real, this results in

$$ \tilde{r}_P = -0.250 \pm 0.055 . \tag{3.121} $$

Note that the uncertainty on $\tilde{r}_P$ only reflects the uncertainty on the input quantity (3.93), but does not take into account further theoretical uncertainties associated with the approximation made. Further assuming $\tilde{P}^{(ut)} = \tilde{P}^{(ct)}$, which leads to the approximation $a e^{i\theta} \approx R_b \tilde{r}_P/(1 + \tilde{r}_P)$, we obtain which can be converted into with the assumed U-spin-breaking parameters $\xi = 1.00 \pm 0.20$ and $\delta = (0 \pm 20)^\circ$ (which are of similar size as the corresponding parameters in ref. [17]) and use the expression in eq. (2.19), the penguin parameters in eq. (3.117) determined from the fit can be converted into Finally, we can extract $\phi_s$ from the effective mixing phase in eq. (3.123), which yields Despite the suppression through the parameter $\epsilon$, penguins may have a significant impact on the extraction of $\phi_s$ and have to be taken into account. This will be particularly relevant for the LHCb upgrade era. In this new round of precision, we will also get valuable insights into the validity of the U-spin symmetry, parameterised through eq. (3.126).
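As a rough numerical illustration of the approximation $a e^{i\theta} \approx R_b \tilde{r}_P/(1 + \tilde{r}_P)$ quoted above, the following Python sketch propagates the uncertainty on $\tilde{r}_P$ from eq. (3.121) linearly through the ratio. The function name and the value used for $R_b$ are assumptions made for this illustration (no value of $R_b$ is given in this section), and the uncertainty on $R_b$ is ignored.

```python
def penguin_from_r_p(r_p, sigma_r_p, r_b):
    """Return (value, uncertainty) of r_b * r_p / (1 + r_p).

    Only the uncertainty on r_p is propagated, to first order:
    d/dr [r / (1 + r)] = 1 / (1 + r)^2.
    """
    value = r_b * r_p / (1.0 + r_p)
    sigma = abs(r_b) * sigma_r_p / (1.0 + r_p) ** 2
    return value, sigma


# Assumption: illustrative value of R_b, not taken from this section.
R_B_ASSUMED = 0.39

# Input from eq. (3.121): r_P = -0.250 +/- 0.055.
a_value, a_sigma = penguin_from_r_p(-0.250, 0.055, R_B_ASSUMED)
print(f"a e^(i theta) ~ {a_value:.3f} +/- {a_sigma:.3f}")
```

Since all parameters are taken to be real in this approximation, a negative result corresponds to $\theta = 180^\circ$ with $a$ given by the absolute value.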
Unfortunately, there is no measurement of CP violation in $B^0_s \to D^-_s D^+_d$ available, which would be very interesting, in particular in view of the situation for $B^0_d \to D^-_d D^+_d$. Consequently, we may not yet determine $\tilde{a}$ and $\tilde{\theta}$ from the data.
However, for the charged $B^+ \to \overline{D}^0 D^+_d$ decay, the PDG gives In figure 11, we illustrate the corresponding situation, which complements figure 10.
It is interesting to compare the penguin parameter $a e^{i\theta}$ with its charged decay counterpart $\tilde{a}_c e^{i\tilde{\theta}_c}$. We obtain where we have used eqs. (3.11) and (3.48). The precision that can be obtained with the current data does not yet allow us to draw any conclusions regarding r A P A . However, in the future it will be interesting to monitor this quantity as the experimental precision improves. Moreover, a measurement of the direct CP violation in the $B^0_s \to D^-_s D^+_d$ channel will allow us to determine $\tilde{a} e^{i\tilde{\theta}}$ from the information from $\tilde{H}$. The comparison with $\tilde{a}_c e^{i\tilde{\theta}_c}$ will yield the $r_A$ parameter from eq. (3.11), so that eq. (3.132) will then allow the determination of $r_{PA}$. Consequently, following these lines, we may reveal the impact of the annihilation and penguin annihilation topologies in the decays at hand.

Prospects for the LHCb upgrade and Belle II era
Let us conclude the discussion on the $B \to D\overline{D}$ decays by exploring the potential of these decay modes in the Belle II era and at the LHCb upgrade. We do this using several scenarios, examined in section 4.3, that reflect the different possibilities still allowed by the current data. The inputs used in these scenarios are discussed first. Section 4.1 gives the experimental prospects for the relevant CP and branching ratio information of the $B \to D\overline{D}$ decays, while section 4.2 deals with the future constraints on the additional decay topologies.

Extrapolating from current results
The B-factories have pioneered the study of $B \to D\overline{D}$ decays, including the discoveries of numerous $B \to D\overline{D}$ decay modes [72], the measurements of branching fractions [73,74], and the analyses of CP asymmetries [68,69,73]. The LHCb collaboration subsequently continued the study of $B \to D\overline{D}$ decays, notably focusing on the analysis of $B^0_s$ decays [41,62,70], which are abundantly produced at the LHC. Based on the successful performance of LHCb during run I of the LHC, an estimate can be made of its performance with the data samples that are expected to be collected after the upgrade of the LHCb detector. For these extrapolations, an integrated luminosity of 5 fb$^{-1}$ in run II, from 2015 until 2018, is assumed. In addition, the $B$ production cross section at a centre-of-mass energy of 13 TeV is about 60% larger than at 8 TeV. For the upgrade scenario, an integrated luminosity of 50 fb$^{-1}$ is assumed with increased trigger efficiency, leading to a data sample per fb$^{-1}$ about three times larger than the $B$ yield per fb$^{-1}$ in run I. Similarly, a prognosis can be made for measurements at Belle II, which is expected to start taking data in 2018. Here we assume that 50 times more data will be collected than is currently available (1 ab$^{-1}$). The expectations for the CP asymmetry parameters are listed in table 2. The extrapolations are done for the currently available measurements only; no attempt is made to forecast the precision on yet-to-be-performed analyses. For example, the LHCb collaboration will also determine the CP asymmetries of the $B^0_d \to D^-_d D^+_d$ decay, but it remains to be seen what the accuracy will be in comparison with possible Belle II results.

Observable Current measurement Upgrade Experiment $1.06^{+0.14}_{-0.21} \pm 0.08$ [69] $\pm 0.08$ Belle $A^{\rm dir}_{CP}(B^\pm \to D_u D^\pm_{(s)})$ $0.00 \pm 0.08 \pm 0.02$ [73] $\pm 0.02$ Belle $0.050 \pm 0.008 \pm 0.004$ [62] 18% 7%

Table 3. Experimental prospects for ratios of branching fractions. The second and fourth ratios are obtained from direct determinations of the ratios of branching fractions, whereas the others are calculated from individual branching fractions. The value in brackets indicates the possible uncertainty if this ratio were determined directly. Note that for the calculation of the $H$ observables, additional uncertainties due to $|A'/A|$ arise.
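The luminosity assumptions stated above can be turned into a rough estimate of how purely statistical uncertainties would shrink. The Python sketch below expresses each data-taking period in units of run I equivalent fb$^{-1}$ and applies a $1/\sqrt{N}$ scaling. Several inputs are assumptions that are not stated in the text: the run I integrated luminosity of 3 fb$^{-1}$, the choice to add the run I, run II and upgrade samples together, and the neglect of systematic uncertainties. All names are illustrative.

```python
import math


def stat_scale(current_sample, future_sample):
    """Factor by which a purely statistical uncertainty shrinks (1/sqrt(N) scaling)."""
    return math.sqrt(current_sample / future_sample)


# Assumption: run I integrated luminosity in fb^-1 (not quoted in the text).
RUN1_LUMI = 3.0

# Run II: 5 fb^-1 with a B production cross section about 60% larger.
run2_equiv = 5.0 * 1.6

# Upgrade: 50 fb^-1 with about three times the run I yield per fb^-1.
upgrade_equiv = 50.0 * 3.0

# Assumption: all samples are combined in the final analysis.
lhcb_total = RUN1_LUMI + run2_equiv + upgrade_equiv
lhcb_scale = stat_scale(RUN1_LUMI, lhcb_total)

# Belle II: 50 times the currently available data set.
belle2_scale = stat_scale(1.0, 50.0)

print(f"LHCb: {lhcb_total:.0f} run I equivalent fb^-1, stat. scale {lhcb_scale:.3f}")
print(f"Belle II: stat. scale {belle2_scale:.3f}")
```

Under these assumptions, statistical uncertainties shrink by a factor of roughly seven at both experiments, which is why systematic uncertainties become the limiting factor in several of the extrapolations.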
The expectations for the branching fractions are listed in table 3. The "current measurement" column reflects the best available knowledge at this moment, which in some cases could have been more precise if the ratio of branching fractions had been determined directly, rather than by dividing the individually measured branching fractions. In the extrapolations it is assumed that the ratios of branching fractions are determined directly. Moreover, it is assumed that the systematic uncertainties due to $f_s/f_d$ (4.7%), due to the D-meson branching fractions (3.9%, 2.1% and 1.3% for $D^+_s$, $D^+_d$ and $D^0$ mesons, respectively) and due to the different $B^0_s$ lifetimes (2.9% (1.5%) for a CP (flavour) eigenstate) remain the same. We assume that the total experimental systematic uncertainty will decrease from 5.0% to 4.0%. In some ratios, the uncertainty on the D branching fractions cancels with their contribution to the $f_s/f_d$ uncertainty. This is taken into account where appropriate. In our upgrade era scenario, systematic uncertainties will be the limiting factor on the ratio of branching fractions. Therefore, we would like to encourage research into $f_s/f_d$, D branching fractions and B lifetime differences. If these three factors could be reduced to a level of about 2%, then that would lead to a systematic uncertainty of (5-6)% for all of these decays, assuming an experimental uncertainty of 4%. Finally, the prospects for improvements on external input parameters are listed in table 4.
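The (5-6)% figure quoted above can be checked with a short quadrature sum. The sketch below assumes that the three external contributions and the experimental systematic uncertainty are uncorrelated and can simply be added in quadrature; this is an assumption of the illustration, and the function name is hypothetical.

```python
import math


def quadrature(*rel_uncertainties_percent):
    """Combine uncorrelated relative uncertainties (in percent) in quadrature."""
    return math.sqrt(sum(u ** 2 for u in rel_uncertainties_percent))


# Three external factors reduced to about 2% each, plus a 4% experimental
# systematic uncertainty, as in the scenario described in the text.
total_systematic = quadrature(2.0, 2.0, 2.0, 4.0)
print(f"total systematic uncertainty: {total_systematic:.1f}%")
```

The result, $\sqrt{28}\,\% \approx 5.3\%$, lies within the quoted range.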

Exchange and penguin annihilation contributions
For the construction of the $H$ observable based on the ratio of hadronic amplitudes in eq. (3.94), the contributions from exchange and penguin annihilation topologies, represented by the parameters $\tilde{x}$ and $\tilde{\sigma}$, need to be quantified. With future, improved measurements of the $B \to D\overline{D}$ branching ratios it is expected that the picture emerging from the current data, discussed in section 3.3.2, can be sharpened further. We therefore explore the precision that can be achieved towards the end of the Belle II era and of the LHCb upgrade. Based on the best fit solution obtained from the current data, i.e. eq. (3.72), and the prospects in table 3, we start from the following input measurements: The associated confidence-level contours are shown in figure 12. Note that at first sight these uncertainties do not seem to have improved significantly with respect to the present experimental situation. However, this is merely caused by the shape of the confidence contour in figure 12.

Future scenarios
To achieve the smallest theoretical uncertainty on the $H$ observable, it should be constructed from the semileptonic decay information, see eq. (2.45), as explained in section 2.4. This method is preferred over the direct ratio of hadronic branching fractions in eq. (2.25), as it does not rely on form factor information, and is not experimentally limited by $f_s/f_d$. However, as the necessary information on $d\Gamma/dq^2(B^0_s \to D^-_s \ell^+ \nu_\ell)$ is not yet available, we also do not have any estimates for the precision that can be achieved at LHCb or Belle II. For the following discussion, we will thus, as for the fits to the current data, rely on the original definition using the ratio of hadronic branching fractions, eq. (2.25).
Using the currently available data, we have illustrated in section 3.4 that it is possible to simultaneously determine the penguin contributions and the $B^0_d$-$\overline{B}^0_d$ mixing phase $\phi_d$ using the CP and branching ratio information of the $B^0_d \to D^-_d D^+_d$ decay. However, as the result for $\phi_d$ in eq. (3.118) shows, the precision on $\phi_d$ is very limited. Also for the Belle II and LHCb upgrade era it will be challenging to reach precisions below the (10-20)° level. A high-precision determination of $\phi_d$ using the $B^0_d \to D^-_d D^+_d$ decay is only possible if the direct CP asymmetry and the $H$ observable together are sufficient to unambiguously pin down the penguin parameters $a$ and $\theta$. In such a situation, the phase $\phi_d$ can then be determined from the mixing-induced CP asymmetry. As figure 10 illustrates, $a$ and $\theta$ cannot be determined precisely. The situation would arise either for very large values of $a$, which looks unrealistic, or if the $H$ observable can be determined with a precision of well below the 5% level, which in view of the prospects in the $B \to D\overline{D}$ decays. This extra information breaks the ambiguity that is still present in the confidence-level contours shown in figure 10, and can therefore improve the precision on $a$ and $\theta$, and thus also on $\Delta\phi_s$. For the future benchmark scenarios we therefore only focus on the high-precision determination of $\phi_s$. Using external input for $\phi_d$ in principle makes one of the three observables used by the fit ($A^{\rm dir}_{CP}$, $A^{\rm mix}_{CP}$ or $H$) superfluous. As the $H$ observable receives corrections from possible U-spin-breaking effects through $|A'/A|$, which result in large theoretical uncertainties, it is preferred to determine the penguin parameters using information on the CP asymmetries only, omitting $H$. Such a determination is theoretically clean. This will ultimately lead to the highest precision on $a$, $\theta$ and $\Delta\phi_s$.
In this situation, the branching ratio information can instead be used to gain insight into the hadronic physics of the $B^0$ system. We can then follow the opposite path where the fit results for $a$ and $\theta$ can be used to determine the $H$ observable with the help of eq. (2.29), labelled $H_{(a,\theta)}$ below. Since $a$ enters there with the tiny $\epsilon$, the U-spin-breaking corrections affecting the U-spin relation in eq. (3.126) have a very minor impact. Since we now know the value of $H$, the relation (2.25) can be inverted to instead determine the ratio of hadronic amplitudes from the measured ratio of branching ratios. This experimental measurement of the ratio of hadronic amplitudes can be compared with the theoretical result in eq. (3.106). This favourable strategy is illustrated by the flow chart in figure 13. However, the ideal scenario described above cannot always be realised. When the value of the mixing-induced CP asymmetry is compatible with 1 (at the 1σ level), its power to constrain the penguin parameters $a$ and $\theta$ is limited. This can best be illustrated using the contour plots, like figure 10 or figure 17 below. In this situation, the annular constraint originating from the mixing-induced CP asymmetry becomes a closed disk, leading to a large overlap region with the direct CP asymmetry constraint. Consequently, it is not possible to conclusively pin down $a$ and $\theta$ in such a situation. Additional information is thus needed to improve the picture, and reach our target of matching the foreseen experimental precision on $\phi_s$ with an equally precise determination of $\Delta\phi_s$. In this situation, the $H$ observable forms an essential ingredient in the fit, and it can therefore not be used to experimentally constrain the ratio of hadronic amplitudes. This less favourable strategy is illustrated by the flow chart in figure 14.
Given the current experimental situation, either of the two situations sketched above can still be realised, depending on the future world average for $A^{\rm mix}_{CP}$. To demonstrate the variety of situations in which we may ultimately end up, we made an overview of six different scenarios, covering both situations. Scenarios 1-3 represent the favourable situation in which the $H$ observable can be omitted from the fit, in which we can determine $a$ and $\theta$ in a theoretically clean way, and get experimental access to the ratio of hadronic amplitudes. Scenarios 4-6, on the other hand, fall in the second category and do require information on $H$ to conclusively pin down $a$ and $\theta$. All six scenarios are chosen to be compatible with the current experimental situation, with scenario 5 representing the current best fit point, and have $a < R_b$, which is suggested by eq. (2.3). Although it is mathematically possible for $a$ to be larger than $R_b$, it would imply that the penguin topologies are larger than the tree contribution, which seems very unlikely. The condition $a = R_b$ thus serves as a natural upper limit for the size of the penguin contributions. The different scenarios we consider, and the resulting input values of the three observables ($A^{\rm dir}_{CP}$, $A^{\rm mix}_{CP}$ and $H$) are listed in table 5. The choice of input points can be compared with the current fit solution for $a$ and $\theta$ in figure 15, and with the current measurements of the $B^0_d \to D^-_d D^+_d$ CP asymmetry parameters in figure 16. For each of the six scenarios, the individual constraints coming from $A^{\rm dir}_{CP}$, $A^{\rm mix}_{CP}$ and $H$ are illustrated in figure 17.

Figure 13. Flow chart illustrating the favourable strategy to control $\Delta\phi_s$, which only requires information on the $B^0$

Table 5. Penguin parameters and observables corresponding to the six different scenarios.
The three left-most plots represent the favourable situation, while the three right-most plots fall in the second category. For the three right-most plots the constraint from the mixing-induced CP asymmetry is more disk-like as the central value of $A^{\rm mix}_{CP}$ is closer to one. As a consequence, the overlap with the direct CP asymmetry, which forms a narrow band in all cases, is too large to pin down $a$ and $\theta$. In this respect, scenario 4 should be seen as a limiting case; information on $H$ is not strictly necessary if one is only interested in the 1σ results.

Figure 16.
For each of the six scenarios we also performed a $\chi^2$ fit, similar to the one described in section 3.4, but including $\phi_d$ as a Gaussian constraint. The fit results for $a$ and $\theta$, and the associated values for the shifts $\Delta\phi_d$ and $\Delta\phi_s$ are listed in table 6. U-spin-breaking effects, parametrised by eq. (3.126), have been included in the results for $\Delta\phi_s$. The associated confidence-level contours are shown in figure 17. In all cases we succeed in our goal of matching the foreseen experimental precision on $\phi_s$, see table 2, with an equally precise determination of $\Delta\phi_s$. This is also the case for $\phi^{\rm eff}_d$ and $\Delta\phi_d$, as illustrated in table 7. For the first three scenarios, which do not include the $H$ observable in the fit, the resulting solution for $H_{(a,\theta)}$ and the values for the ratio of hadronic amplitudes are listed in table 8. The resulting uncertainties are about a factor of two smaller than the current theoretical uncertainties derived within the factorisation framework, and of comparable size to the experimental precision that can be obtained on the ratio of hadronic amplitudes describing the $B^0_d \to J/\psi K^0_S$ and $B^0_s \to J/\psi K^0_S$ decays [17]. Consequently, the experimental determination of $|A'/A|$ is yet another interesting topic for Belle II and the LHCb upgrade. It will provide valuable insights into possible non-factorisable U-spin-breaking effects and the hadronisation dynamics of the $B \to D\overline{D}$ decays.

Figure 17. Illustration of the determination of the penguin parameters $a$ and $\theta$ for the scenarios introduced above. (Panel titles: Scenarios 1-3; Scenarios 4-6.)

Table 8. Constraints on the ratio of hadronic amplitudes for those scenarios where the $H$ observable is not needed.

Conclusions
In this paper, we have presented a detailed study of the system of $B \to D\overline{D}$ decays, exploring the picture emerging from the currently available data and proposing new strategies for the era of Belle II and the LHCb upgrade. We find that patterns in the current branching ratio measurements can be accommodated through sizeable contributions from exchange and penguin annihilation topologies, which play a more prominent role than naively expected. This feature suggests that long-distance effects of strong interactions are at work, which cannot be understood within perturbation theory. Using data for the differential semileptonic

In view of our new insights into the prominent role of the exchange and penguin annihilation topologies, the penguin contributions may also be more important than naively expected. The current experimental situation for CP violation in the $B^0_d \to D^-_d D^+_d$ channel is not satisfactory, with measurements of the CP-violating asymmetries by the BaBar and Belle collaborations that are not in good agreement with one another. Measurements of CP violation in the $B^0_s \to D^-_s D^+_d$ channel might help to clarify the situation. In the case of the $B^0_s \to D^-_s D^+_s$ mode, the LHCb collaboration has presented a first analysis of CP violation, with large experimental uncertainties. In the future, the experimental errors can be significantly reduced. Using only information from branching ratios, we find the lower bound $a \geq 0.052$ for the penguin effects from the ° from a $\chi^2$ fit to the data. These results indicate potentially sizeable penguin effects, although the large uncertainties do not allow us to draw further conclusions.
Since the determination of $\phi_d$ from $B^0_d \to D^-_d D^+_d$ will not be competitive with the $B^0_d \to J/\psi K^0_S$ analysis and the control of penguins through $B^0_s \to J/\psi K^0_S$, we advocate using $\phi_d$ as an input from the latter analysis for the determination of the penguin parameters from $B^0_d \to D^-_d D^+_d$ and relating them to their counterparts in $B^0_s \to D^-_s D^+_s$ with the help of the U-spin symmetry of strong interactions. Following these lines, it will be possible to control the penguin effects in the determination of $\phi_s$ from the CP violation in the $B^0_s \to D^-_s D^+_s$ channel. We find that the implementation of this strategy depends strongly on the values of the measured CP asymmetries, as we illustrated through a variety of future scenarios which are consistent with the current experimental situation, giving us a guideline for the LHCb upgrade era. We distinguish between two kinds of scenarios: in the first, the direct and mixing-induced CP asymmetries of the $B^0_d \to D^-_d D^+_d$ channel are sufficient to determine $a$ and $\theta$ in a theoretically clean way, allowing us to determine the ratio $|A'/A|$ from the observable $H$, providing valuable insights into non-factorisable U-spin-breaking effects. In the second, less favourable, class of scenarios, information both from $H$ and the CP asymmetries is needed to determine the penguin parameters. We have demonstrated that the resulting theoretical uncertainty for the penguin shift $\Delta\phi_s$ of the $B^0_s \to D^-_s D^+_s$ channel will be smaller than the experimental uncertainty for $\phi^{\rm eff}_s(B_s \to D^-_s D^+_s)$ in both classes of scenarios. Similar analyses can be performed for $B^0_d \to D^{*-}_d D^{*+}_d$ and $B^0_s \to D^{*-}_s D^{*+}_s$ decays, where time-dependent measurements of the angular distribution of the decay products of the two vector mesons are required [78,79].
Analyses of $B \to D\overline{D}$ decays in the era of Belle II and the LHCb upgrade will offer interesting new insights both into the physics of strong interactions and into CP violation. We look forward to confronting the strategies discussed in this paper with future data!

Acknowledgments
We would like to thank Greg Ciezarek for very interesting discussions on the experimental prospects of measuring form factors with semileptonic B 0 s decays, and Patrick Koppenburg for carefully reading the manuscript.

A Notation
In this appendix, we give an overview of the notation, parameters and observables used in our analysis of the $B \to D\overline{D}$ system.

Decay Amplitude Topologies Variables

Table 11. List of decay ratios and the corresponding observables and variables.
Open Access. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.