1 Introduction

Apart from confirming some of the prevailing anomalies in the B-sector, the currently running LHC has provided several new measurements which hint towards physics beyond standard model (SM). Some of these measurements are indicating towards lepton universality violation. The most striking measurements hinting towards lepton flavor non universality are the \(R_{D^{(*)}}\equiv \varGamma (B \rightarrow D^{(*)}\, \tau \bar{\nu })/\varGamma (B \rightarrow D^{(*)}\, l \bar{\nu })\) \((l=e,\,\mu )\) [1,2,3,4,5,6,7,8,9] which disagree with the SM at \(\sim \) 3.8\(\sigma \) level [10]. In Moriond 2019, the Belle collaboration has announced new preliminary measurements of \(R_{D^{(*)}}\) using semileponic tag [11]. These measurements are consistent with the SM at the level of 1.2\(\sigma \). With the addition of these new measurements, the tension between \(R_{D^{(*)}}\) experimental world average and the SM prediction reduces to 3.1\(\sigma \).

Lepton flavor universality violation was further corroborated by the measurement of \(R_K \equiv \varGamma (B^+ \rightarrow K^+ \,\mu ^+\,\mu ^-)/\varGamma (B^+ \rightarrow K^+\,e^+\,e^-) = 0.745^{+0.090}_{-0.074}~\mathrm{(stat)} \pm 0.036~\mathrm{(syst)}\) [12]. This measurement was performed in the low dilepton invariant mass-squared \(q^2\) range (\(1.0 \le q^2 \le 6.0 \, \mathrm{GeV}^2\)) and it deviates from the SM prediction, which is \(\simeq 1\) [13, 14], by 2.6\(\sigma \). This measurement has been recently updated in Moriond 2019. The updated value of \(R_K\) is \(0.846^{+0.060+0.016}_{-0.054-0.014}\) [15] which is \(\sim 2.5\sigma \) away from the SM. In April 2018, the LHCb collaboration announced the measurement of \(R_{K^*} \equiv \varGamma (B^0 \rightarrow K^{*0} \mu ^+\mu ^-)/\varGamma (B^0 \rightarrow K^{*0} e^+ e^-)\) [16]:

$$\begin{aligned}&R_{K^*}^{[0.045,1.1]} = 0.660^{+0.110}_{-0.070}~\mathrm{(stat)} \pm 0.024~\mathrm{(syst)}, \end{aligned}$$
(1)
$$\begin{aligned}&R_{K^*}^{[1.1,6.0]} = 0.685^{+0.113}_{-0.069}~\mathrm{(stat)} \pm 0.047~\mathrm{(syst)}, \end{aligned}$$
(2)

where the superscript denotes the dilepton invariant mass-squared \(q^2\) range. These measurements differ from the SM prediction, which is \(\simeq 1\) [13, 14], by 2.2–2.4\(\sigma \) in the low-\(q^2\) region and by 2.4–2.5\(\sigma \) in the central-\(q^2\) region. In Moriond 2019, the Belle collaboration has presented their first measurements of \(R_{K^*}\) in \(B^0\) decays along with the first ever measurement of \(R_{K^*}\) in \(B^+\) decays [17]. However, these measurements have large uncertainties due to which the \(R_{K^*}\) disagreement with the SM is still at the level of \(2.4\sigma \). Apart from \(R_{K^{(*)}}\), there are other measurements, all in the \(b \rightarrow s \, \mu ^+ \, \mu ^-\) sector, which show discrepancies with the SM. The measurement of some of the angular observables [18,19,20], in particular \(P'_5\), disagrees with the SM predictions [21] at the level of 4\(\sigma \) in the (4.3–8.68) \(q^2\)-bin. This disagreement is further supported by the measurements of ATLAS [22] and CMS [23] collaborations. Also, there is tension in the branching ratio of \(B_s \rightarrow \phi \mu ^+ \mu ^-\) [24, 25]. Therefore \(b \rightarrow s \mu ^+ \mu ^-\) and \(b \rightarrow c\, \tau \, \bar{\nu }\) decays serve as a fruitful hunting ground to probe beyond SM physics.

In order to identify the Lorentz structure of new physics responsible for various anomalies in the \(b \rightarrow s \mu ^+ \mu ^-\) sector, in a model-independent way, there have been a plethora of works in recent times [26,27,28,29,30,31,32,33,34,35,36,37,38,39]. For \(b \rightarrow c\, \tau \, \bar{\nu }\), Refs. [40,41,42,43,44] identified new physics operators which can account for \(R_{D^{(*)}}\) anomaly. However, simultaneous explanation of anomalies in \(b \rightarrow s \mu ^+ \mu ^-\) and \(b \rightarrow c \tau \bar{\nu }\) sector in specific new physics models is bit tricky. This is because the \(b \rightarrow c \tau \bar{\nu }\) transition occurs at the tree level within the SM, whereas \(b \rightarrow s \mu ^+ \mu ^-\) decay can only occur at the loop level. One needs a relatively large new physics contributions in order to explain the \(R_{D^{*}}\) anomaly. However such a large new physics contributions must also be consistent with the measurement of other observables which are in agreement with their SM predictions. Therefore there are only a limited set of new physics models which can simultaneously explain the \(R_{D^{(*)}}\) and \(b \rightarrow s \mu ^+ \mu ^-\) anomalies, see e.g. [45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62].

In this work we revisit four models: (1) SM like vector bosons (VB), (2) \(SU(2)_L\)-singlet vector leptoquark (\(U_1\)), (3) \(SU(2)_L\)-triplet scalar leptoquark (\(S_3\)) and (4) \(SU(2)_L\)-triplet vector leptoquark models (\(U_3\)). We assume a coupling to only third generation in the gauge basis. These models were studied in [48, 58]. In Ref. [58], it was shown that VB model is a viable model, but the \(Br(\tau \rightarrow 3\mu )\) were shown to be \(O(10^{-8})\) which is close to its present upper bound of \(2.1 \times 10^{-8}\) [63]. Further \(U_1\) model was also considered to be a potential model in Refs. [48, 58] to explain \(R_K\) and \(R_{D^{*}}\) anomalies. In Refs. [64, 65], it was shown that the \(U_1\) model is disfavored due to the constraints coming from processes such as \(\tau \rightarrow e \nu \bar{\nu }\), \(\tau \rightarrow \mu \nu \bar{\nu }\) which are generated solely due to renormalization group running (RGE) effects. However, it should be noted that Refs. [64, 65] considered a different kind of transformation from gauge to mass basis as used in this work. We choose the one used in the Ref. [58].Footnote 1 Further, in Ref. [66], it was shown that the high-\(p_T\) searches also disfavor the \(U_1\) model.

For the first time we perform a global analyses of the four NP models by taking into account all relevant data from B-sector as well as the leptonic decays generated through the RGE running. Our main findings are summarized below:

  1. 1.

    In Ref. [58], it was found that the VB is a viable model for the combined explanation of the B-anomalies. However, on the basis of global fit, we found that for the VB model the \(\mathscr {B}(\tau \rightarrow 3\mu )\) is an order magnitude above the current experimental upper bound at the best fit. Therefore, we show that the VB model is simply excluded by \(\tau \rightarrow 3\mu \) process.

  2. 2.

    Reference [48] motivated \(U_1\) LQ model as potential candidate for the combined explanation of the B-anomalies. Following this, Ref. [58] showed it to be a viable model. However, our analyses shows that \(R_{D^{*}}\) value is only marginally improved over its SM value.

  3. 3.

    In Refs. [64, 65] it were shown that this minimal framework for the simultaneous explanation of the anomalies is challenged by the \(\tau \) lepton flavor violating decays and Z-pole observables which arise due to RGE running. However, we find that using a more flexible transformation matrices to go from gauge to mass basis, the constraints coming from these processes can be avoided.

The paper is arranged as follows. After the introduction, in Sect. 2, we discuss the methodology used in our anlysis. In Sect. 3, we describe the four new physics models in the framework of third generation coupling in the weak basis. In Sect. 4, we present our results. Finally, we conclude in Sect. 5.

2 Methodology

In our analyses we take into account following constraints

  1. 1.

    the measurement of \(R_{D^{(*)}}\) [10, 11],

  2. 2.

    the branching ratio of \(B^0_s \rightarrow \mu ^+ \mu ^-\) [67,68,69],

  3. 3.

    the differential branching ratio of \(B^{0} \rightarrow K^{*0} \mu ^+ \mu ^-\) and \(B^{+} \rightarrow K^{*+} \mu ^+ \mu ^-\) measured by LHCb [70, 71],

  4. 4.

    the CP-averaged differential angular distribution for \(B^0 \rightarrow K^{*0} (\rightarrow K^+ \pi ^-) \mu ^+ \mu ^-\) [19],

  5. 5.

    the differential branching ratio of \(B^{0}\rightarrow K^{0} \mu ^+\mu ^-\) and \(B^{+}\rightarrow K^{+} \mu ^+\mu ^-\) measured by LHCb [70] and CDF [72],

  6. 6.

    the differential branching ratio of \(B^0_s \rightarrow \phi \mu ^+ \mu ^-\) by LHCb [25] and CDF [72] and the angular observables measured by LHCb [25],

  7. 7.

    the differential branching ratio of \(B \rightarrow X_s \mu ^+ \mu ^-\) measured by BaBar [73],

  8. 8.

    the recent data by ATLAS [74] and CMS [75] for the angular observables in \(B^{0} \rightarrow K^{*0}\mu ^+ \mu ^-\) decay,

  9. 9.

    the measurement of \(R_{K^{(*)}}\) [15,16,17],

  10. 10.

    mass difference \(\varDelta M_s\) in \(B_s\)-mixing [76],

  11. 11.

    branching ratio of \(B \rightarrow K^{(*)} \nu \bar{\nu }\) [77],

  12. 12.

    branching ratio of \(\tau \rightarrow 3 \mu \) [63],

  13. 13.

    branching ratio of \(B \rightarrow K \mu \tau \) [78],

  14. 14.

    branching ratio of \(\tau \rightarrow \phi \mu \) [79],

  15. 15.

    branching ratio of \(\tau \rightarrow e \nu \bar{\nu }\) [76].

Note that out of the four models, not all of these contribute to all the observables. For example, the leptoquark models do not contribute to the four fermion operator at the tree level, i.e. the processes like branching ratio of \(\tau \rightarrow 3 \mu \), \(\tau \rightarrow e \nu \bar{\nu }\) and \(B^0_s\)-\(\bar{B^0_s}\) mixing do not occur at the tree level in these models. Additionally, \(U_1\) model does not give any new physics contribution to the processes related to \(b \rightarrow s \nu \bar{\nu }\) transition at the tree level.

In order to check this viability of the new physics models, we perform three kinds of fit,

  1. 1.

    Fit 1: Global fit

  2. 2.

    Fit 2: Fit with excluding \(b\rightarrow c \tau \bar{\nu }\) data

  3. 3.

    Fit 3: Fit with only clean observables

In fit 1, we perform the global fit by taking all the relevant data from 1 to 10. Using the fit results, we check the consistency of the other observables from 11 to 15. We then remove the \(R_{D^{(*)}}\) data from the fit 1 to perform second kind of fit. Fit 2 would enable us to know that how well these models can explain the anomalies in \(b \rightarrow s \) sector and on the other hand would shed light on the what \(b \rightarrow s \, \mu ^+ \, \mu ^-\) data imply in \(b \rightarrow c\, \tau \, \bar{\nu }\) sector in the context of these models. In the third kind of fit, we only consider clean observables such as \(R_{K^{(*)}}\), \(R_{D^{(*)}}\). Along with this the model specific constraints such as \(B^0_s\)-\(\bar{B^0_s}\) mixing is also taken into account. This fit is performed to check to what extent our conclusions are dependent on hadronic uncertainties.

We do a \(\chi ^2\) fit using CERN minimization code MINUIT [80]. The \(\chi ^2\) function is defined as

$$\begin{aligned} \chi ^2(C_i) = (\mathscr {O}_{th}(C_i) -\mathscr {O}_{exp})^T \, \mathscr {C}^{-1} \, (\mathscr {O}_{th}(C_i) -\mathscr {O}_{exp}). \end{aligned}$$
(3)

The theoretical predictions, \(\mathscr {O}_{th}(C_i)\) are calculated using flavio [81]. The \(\mathscr {O}_{exp}\) are the experimental measurements of the observables used in the fit. The total covariance matrix \(\mathscr {C}\) is obtained by adding the individual theoretical and experimental covariance matrices. We closely follow the methodology for global fits discussed in Refs. [82, 83].

3 New physics models

In this section we describe the general framework and discuss the new physics models which we have studied. In order to explain the \(R_K\) anomaly, Ref. [84] considered following operator

$$\begin{aligned} \frac{G}{\varLambda ^2}(\bar{b'_L}\gamma _{\mu }b'_L)(\bar{\tau '_L}\gamma ^{\mu }\tau '_L). \end{aligned}$$
(4)

Here \(\varLambda \) is the scale of new physics. It was assumed that, in the gauge basis, NP couple to the third generation only. The transformation from gauge basis to the mass basis will then generate operator \((\bar{s}_L\gamma _{\mu }b_L)(\bar{\mu }_L\gamma ^{\mu }\mu _L)\) which contributes to \(b \rightarrow s \mu ^+ \mu ^-\).

Then in Ref. [45] it was pointed out that full \(SU(3)_C \times SU(2)_L \times U(1)_Y\) invariant operator can be written in the following way

$$\begin{aligned} {\mathscr {L}}_{eff}= & {} \frac{G_1^{ijkl}}{\varLambda ^2}(\bar{Q'^{i}_L}\gamma _{\mu }Q'^{j}_L)(\bar{L'^{k}_L }\gamma ^{\mu }L'^{l}_L)\nonumber \\&+ \frac{G_2^{ijkl}}{\varLambda ^2}(\bar{Q'^{i}_L}\gamma _{\mu }\sigma ^I Q'^{j}_L)(\bar{L'^{k}_L}\gamma ^{\mu }\sigma ^I L'^{l}_L), \end{aligned}$$
(5)

the ijk and l are the generation indices and \(Q'\) and \(L'\) are the quark and lepton doublets in the gauge basis. The four fermion operator in the first term of Eq. 5 contains neutral current interactions only while the second four fermion operator contains both charged and neutral current interactions and hence can simultaneously generate NP effects in both \(R_{K^{(*)}}\) and \(R_{D^{(*)}}\).

We assume that the only non-zero Wilson coefficients are \(G_1^{3333}\) and \(G_2^{3333}\), ensuring that the NP couples to only the third generation in the gauge basis. The transformation used in going from gauge basis to mass basis is given by

$$\begin{aligned} u'_L = Uu_L ,~~ ~d'_L = D d_L,~~~ l'_L = L l_L, ~~~\nu '_L = L\nu _L, \end{aligned}$$
(6)

here the primed spinors (gauge basis) has all three generation of fermions and U, D, and L are \(3 \times 3\) unitary matrices. Note that the transformation for both the charged and neutral leptons are assumed to be the same because the neutrino masses are neglected here.

Based on the effective operators, the global analyses indicate that a non-zero new physics contribution to operator \(({\bar{c}}_L \gamma ^\mu b)(\bar{\nu }_L \gamma ^\mu \tau _L)\) is required to explain the \(R_{D^{(*)}}\) anomalies. On the other hand, for explaining the anomalies in \(R_{K^{(*)}}\) and \(b\rightarrow s \mu ^+ \mu ^-\) data, we need a new physics in \(({\bar{s}}_L \gamma ^\mu b)( \bar{\mu }\gamma _\mu \mu )\) operator. Since both the operators involve only second and third generations, we assume mixing only between second and third generation so that the matrices D and L can be defined using the two rotation angles \(\theta _{bs}\) and \(\theta _{\mu \tau }\), respectively.

Therefore, we define

$$\begin{aligned} D= \left( {\begin{array}{lll} 1 &{} 0 &{} 0\\ 0 &{} \cos {\theta _{bs}} &{}\sin {\theta _{bs}}\\ 0 &{} -\sin {\theta _{bs}} &{} \cos {\theta _{bs}}\\ \end{array} } \right) , L= \left( {\begin{array}{lll} 1 &{} 0 &{} 0\\ 0 &{} \cos {\theta _{\mu \tau }} &{}\sin {\theta _{\mu \tau }}\\ 0 &{} -\sin {\theta _{\mu \tau }} &{} \cos {\theta _{\mu \tau }}\\ \end{array} } \right) . \end{aligned}$$

The transition \(b \rightarrow c \tau \bar{\nu }\) occurs at the tree level in the SM so the explanation of anomalies in \(R_{D^{(*)}}\) requires a large NP coupling with third generation. However, as the transition \(b \rightarrow s \mu \mu \) occurs at loop level in SM, a small NP coupling with second generation is required for the explanation of \(R_{K^{(*)}}\) anomalies. The above transformation gives small couplings through rotation to the second generation down quarks and charged leptons in the mass basis.

Therefore, in the mass basis, the new physics couplings can be written as,

$$\begin{aligned} G_{(1,2)}^{ijkl} = g_{(1,2)}X^{ij}Y^{kl}, \end{aligned}$$
(7)

where X and Y are the matrices which are function of the rotation angles. The form of these matrices for \(b \rightarrow s l^+ l^-\) decay is

$$\begin{aligned} X= & {} D^\dagger \left[ {\begin{array}{lll} 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 1 \\ \end{array} } \right] \nonumber \\ D= & {} \left[ {\begin{array}{lll} 0 &{} 0 &{} 0\\ 0 &{} \sin ^2{\theta _{bs}} &{}- \sin {\theta _{bs}} \cos {\theta _{bs}}\\ 0 &{} -\sin {\theta _{bs}} \cos {\theta _{bs}} &{} \cos ^2{\theta _{bs}}\\ \end{array} } \right] , \end{aligned}$$
(8)
$$\begin{aligned} Y= & {} L^\dagger \left[ {\begin{array}{lll} 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 0\\ 0 &{} 0 &{} 1 \\ \end{array} } \right] \nonumber \\ L= & {} \left[ {\begin{array}{lll} 0 &{} 0 &{} 0\\ 0 &{} \sin ^2{\theta _{\mu \tau }} &{} -\sin {\theta _{\mu \tau }} \cos {\theta _{\mu \tau }}\\ 0 &{} -\sin {\theta _{\mu \tau }} \cos {\theta _{\mu \tau }} &{} \cos ^2{\theta _{\mu \tau }}\\ \end{array} } \right] . \end{aligned}$$
(9)

In case of up-type quarks involved in the process, the matrix U is used instead of D. The couplings \(g_1\) and \(g_2\) take specific values depending upon the new physics models. In the more recent study [85] generic couplings in the mass basis are considered.

The effective Hamiltonian relevant for the \(b\rightarrow s\ell ^+_i\ell ^-_j \), \(b \rightarrow c \ell _i \bar{\nu }_j\) and \(b \rightarrow s \nu _i \bar{\nu }_j\) processes can be written as

$$\begin{aligned} H_{eff}(b\rightarrow s\ell ^+_i\ell ^-_j )= & {} - \frac{\alpha G_F}{\sqrt{2}\pi }V_{tb}V_{ts}^*\Big [C_9^{ij}(\bar{s}_L\gamma ^{\mu }b_L)(\bar{l_i}\gamma _{\mu }l_j)\nonumber \\&+~C_{10}^{ij}(\bar{s}_L\gamma ^{\mu }b_L)(\bar{l_i}\gamma _{\mu }\gamma ^5 l_j)\Big ], \end{aligned}$$
(10)
$$\begin{aligned} H_{eff}(b \rightarrow c \ell _i \bar{\nu }_j)= & {} \frac{4G_F}{\sqrt{2}}V_{cb}C_{V}^{ij}(\bar{c}_L\gamma ^{\mu }b_L)(\bar{l}_{iL}\gamma _{\mu }\nu _{jL}), \end{aligned}$$
(11)
$$\begin{aligned} H_{eff}(b \rightarrow s \nu _i \bar{\nu }_j)= & {} -\frac{\alpha G_F}{\sqrt{2} \pi } V_{tb} V_{ts}^* C_L^{ij} ({\bar{s}}_L \gamma ^\mu b_L)\nonumber \\&\times (\bar{\nu }_i \gamma _\mu (1-\gamma ^5) \nu _j). \end{aligned}$$
(12)

The new physics contributions to the Wilson coefficients read

$$\begin{aligned} C_9^{\mu \mu }= & {} -C_{10}^{\mu \mu } =- \frac{\pi }{\sqrt{2}\alpha G_F V_{tb}V_{ts}^*}\frac{(g_1 + g_2)}{\varLambda ^2}\nonumber \\&\times (\sin \theta _{bs}\cos \theta _{bs}\sin ^2\theta _{\mu \tau }), \end{aligned}$$
(13)
$$\begin{aligned} C_V^{ij}= & {} -\frac{1}{2\sqrt{2}G_F V_{cb}}\frac{2g_2}{\varLambda ^2}\nonumber \\&\times (-V_{cs}\sin \theta _{bs}\cos \theta _{bs} + V_{cb} \cos ^2\theta _{bs})Y^{ij}, \end{aligned}$$
(14)
$$\begin{aligned} C_L^{ij}= & {} -\frac{\pi }{ \sqrt{2} \alpha G_F V_{tb} V_{ts}^*} \frac{(g_1-g_2)}{\varLambda ^2}\left( \sin \theta _{bs} \cos \theta _{bs} \right) Y^{ij}. \nonumber \\ \end{aligned}$$
(15)

Having discussed our general framework and assumptions, next we discuss the four new physics models studied in this paper.

Table 1 Best fit values of the mixing angles in vector boson model. For Fit 1, dof=123, \(\chi ^2/dof = 1.00\)
Table 2 Best fit values of the mixing angles in \(U_1\) leptoquark model. For Fit 1, dof=122, \(\chi ^2/dof = 1.00\)
Fig. 1
figure 1

The plots in the top row depict allowed new physics parameter space, at 2\(\sigma \), due to various observables in the \(U_1\) model. The left and right panels in the top row represent the allowed regions for \(g_{U_1}^{33} = 1\) and \(g_{U_1}^{33} = 2\) , respectively. The left and right panels in the bottom row depicts the allowed regions in the \(S_3\) and \(U_3\) models, respectively. The plots for \(S_3\) and \(U_3\) models correspond to \(g_{S_3(U_3)}^{33} = 1\). The grey, magenta, brown, yellow and green regions in these figures show allowed parameter space for \(R_{K^{(*)}}\), \(R_{D^{(*)}}\), branching ratios of LFV decays (\(\tau \rightarrow \phi \mu \) & \(B\rightarrow K\tau ^{\pm }\mu ^{\mp }\)), branching ratio of \(\tau \rightarrow e \nu \bar{\nu }\) and branching ratios of \(B \rightarrow K^{(*)} \nu \bar{\nu }\) transitions, respectively

3.1 SM-like vector bosons (VB)

First, we consider an additional heavy vector bosons which transforms as \((\mathbf{1},\mathbf{3},\mathbf{1})\) under the SM gauge group \(SU(3)_C \times SU(2)_L \times U(1)_Y\). In the gauge basis, its interaction with the fermions is given by

$$\begin{aligned} \varDelta {\mathscr {L}}^{}_{V}= & {} g^{33}_{qV}\ ({{\overline{Q}}'_{L3}} \gamma ^{\mu }\sigma ^I Q'{_{L3}})V_{\mu }^I \nonumber \\&+ g^{33}_{lV}\ ({\overline{L}'_{L3}}\gamma ^{\mu }\sigma ^I L^{'}_{L3})V_{\mu }^I. \end{aligned}$$
(16)

On integrating out this heavy vector boson, this generates one of the two operators shown in Eq. 5 with

$$\begin{aligned} g_1 = 0, \quad g_2 = - g^{33}_{qV} g^{33}_{\ell V}. \end{aligned}$$
(17)

For simplicity, we set these couplings to a fixed value as \(g_{qV}^{33} = g_{lV}^{33} = \sqrt{0.5}\). The \(Z'\) (i.e the neutral component) contributes \(b \rightarrow s \mu ^+ \mu ^-\) and \(b \rightarrow s \nu {\bar{\nu }}\) decays, whereas the \(W'\)(the charged component) contributes to \(b \rightarrow c \tau ^- {\bar{\nu }}\) decay the at the tree level. In addition to the semileptonic operators, this model also generates four fermion operators at the tree level. These put additional constraints on the VB model. For example, the \(\varDelta F=2\) process, the \(B_s- \bar{B_s}\) mixing can be described by the effective Hamiltonian,

$$\begin{aligned} H_{eff}(B_s-{\overline{B}}_s)= & {} \frac{G_F^2 m_W^2}{16\pi ^2}(V_{tb} V_{ts}^*)^2 C_{VLL} \nonumber \\&\quad \times ({\bar{s}}_L \gamma ^\mu b_L) ({\bar{s}}_L \gamma ^\mu b_L). \end{aligned}$$
(18)

Here the Wilson coefficient is given by

$$\begin{aligned} C_{VLL} = C_{VLL}^{SM} + \frac{(g_{qV}^{33})^2}{2m_V^2}\frac{16\pi ^2}{G_F^2 m_W^2(V_{tb}V_{ts}^*)^2} \sin ^2\theta _{bs}\cos ^2\theta _{bs}. \end{aligned}$$

the SM contribution to the Wilson coefficient \(C_{VLL}^{SM}\) reads,

$$\begin{aligned} C_{VLL}^{SM} = \eta _{B_s}x_t \left[ 1+\frac{9}{1-x_t} - \frac{6}{(1-x_t)^2} -\frac{6x_t^2\log {x_t}}{(1-x_t)^3}\right] .\nonumber \\ \end{aligned}$$
(19)

The QCD constant \(\eta _{B_s}\) is equal to 0.551 and the ratio \(x_t =m_t^2/m_W^2\). The mass difference is given by

$$\begin{aligned} \varDelta M_s = \frac{2}{3} ~m_{B_s} f_{B_s}^2 {\hat{B}}_{B_s}~ \frac{G_F^2 m_W^2}{16\pi ^2}(V_{tb} V_{ts}^*)^2 C_{VLL} \end{aligned}$$
(20)

As mentioned above, the VB also gives tree level contribution to the four lepton operators such as \((\bar{\mu }_L \gamma ^\mu \tau _L) (\bar{\mu }_L \gamma _\mu \mu _L)\), which induces LFV tau-decay \(\tau \rightarrow 3 \mu \). The branching ratio of \(\tau \rightarrow 3 \mu \) is

$$\begin{aligned} B(\tau \rightarrow 3 \mu ) = 0.94 \frac{0.5^2}{16 m_V^4} \frac{m_\tau ^5 \tau _\tau }{192 \pi ^3} \sin ^6\theta _{\mu \tau } \cos ^2 \theta _{\mu \tau }. \end{aligned}$$
(21)

On the experimental side, there is an upper bound on the branching ratio \(B(\tau \rightarrow 3 \mu )\), which is \(2.8 \times 10^{-8}\) at \(90\%\) CL [63]. In addition the VB model can contribute to more processes such as \(\tau \rightarrow \mu \nu \bar{\nu }\), \(\tau \rightarrow \mu \rho \) etc. But, we will show that the model is excluded if we take into account just the constraints from \(R_{K^{(*)}}\), \(R_{D^{(*)}}\), \(\varDelta M_s\) and \(Br(\tau \rightarrow 3\mu )\).

3.2 Leptoquark(LQ) models

We consider the three leptoquark models, a scalar \(SU(2)_L\) singlet LQ \(S_1\) \((3,1,-2/3)\), a scalar triplet LQ \(S_3(3,3,-2/3)\) and vector singlet LQ \(U_1(3,1,4/3)\). In the gauge basis, the interaction Lagrangian for these LQ models is given by [86]

$$\begin{aligned} \varDelta {\mathscr {L}}^{}_{U_1}= & {} g_{U_1}^{33}({\overline{Q}'}_{L3}\gamma ^{\mu }L'_{L3})U_{1\mu } +\mathrm{h.c.}, \end{aligned}$$
(22)
$$\begin{aligned} \varDelta {\mathscr {L}}^{}_{S_3}= & {} g_{S_3}^{33}( {{\overline{Q}}'_{L3}} \sigma ^I i \sigma ^2 L_{L3}^{'c}) S_{3}^I +\mathrm{h.c.}, \end{aligned}$$
(23)
$$\begin{aligned} \varDelta {\mathscr {L}}^{}_{U_3}= & {} g_{U_3}^{33}({\overline{Q}'}_{L3}\gamma ^{\mu } \sigma ^I L'_{L3})U_{3\mu } +\mathrm{h.c.}. \end{aligned}$$
(24)

On integrating out a heavy LQ, the operators of Eq. 5 are generated at the tree level but with different weights of the two operators depending on the representation. We can identify the couplings \(g_1\) and \(g_2\) for various LQ models as

$$\begin{aligned} U_1:~ g_1= & {} g_2 = - \frac{1}{2} |g_{U_1}^{33} |^2 < 0, \end{aligned}$$
(25)
$$\begin{aligned} S_3: g_1= & {} 3g_2 = \frac{3}{4} |g_{S_3}^{33} |^2 > 0, \end{aligned}$$
(26)
$$\begin{aligned} U_3: g_1= & {} -3g_2 = - \frac{3}{2} |g_{U_3}^{33} |^2 < 0, \end{aligned}$$
(27)

the couplings \(g_{U_1}^{33}\), \(g_{U_3}^{33}\) and \(g_{S_3}^{33}\) are set to one. Clearly, all these LQ models can potentially contribute to \(b \rightarrow s \mu ^+ \mu ^-\), \(b \rightarrow s \nu {\bar{\nu }}\) and \(b \rightarrow c \tau ^- {\bar{\nu }}\) transitions at the tree level. However, for the \(U_1\) model since \(g_1=g_2\), there is no tree level NP contribution to \(b \rightarrow s \nu {\bar{\nu }}\) (see Eq. 15). Note that the LQ models do not give contributions to the four fermion operators at the tree level. But these operators are still generated at low scale due to RGE running of the operators of Eq. 5  [64, 65]. Some of the processes which are generated due to RGE effects in the LQ models are \(B \rightarrow K^{(*)} \nu {\bar{\nu }}\), \(\tau \rightarrow 3 \mu \), \(\tau \rightarrow \mu \nu \bar{\nu }\), \(\tau \rightarrow e \nu \bar{\nu }\), \(\tau \rightarrow \mu \rho \). In addition to this the Z-boson axial and vector couplings are also affected. It was shown in Refs. [64, 65] that the \(Br(\tau \rightarrow e \bar{\nu }\nu )\) constraints can be very stringent for the models with \(g_1 = g_2\). However, note that in the present work we use a different transformation to rotate from gauge to mass basis.

Table 3 Best fit values of the mixing angles in \(S_3\) leptoquark model. For Fit 1, dof=122, \(\chi ^2/dof = 1.02\)

4 Results and discussions

In this section, we present the result of the fits for the SM like vector boson, and \(U_1\), \(S_3\), \(U_3\) LQ models. Their mass set to 1 TeV. As discussed in Sect. 2, we perform three kind of fits. For the first one, which we call a Global fit, all relevant data is included. We then perform a fit, fit 2, by removing only the \(R_{D^{(*)}}\) data from the \(\chi ^2\). Finally, a fit including only the clean observables is performed. The fit results for the VB model are presented in Table 1, and we observe the following: At the best fit point the VB model evades the current upper bound on the \(Br(\tau \rightarrow 3\mu )\) and this holds even on removing the \(b \rightarrow c \tau \bar{\nu }\) data from the fit. Therefore, we can conclude that the VB model, coupling only to the third generation in the weak basis is inconsistent with the present data.

The fit results for the \(U_1\) model are presented in Table 2. At the best point the central values of \(R_{K^{(*)}}\) in the central \(q^2\) region lies within 1\(\sigma \) of the experimental range, whereas this is not true for the \(R_{K^{*}}\) in low \(q^2\) bin. The central value of \(R_{D^{*}}\) falls in 2\(\sigma \) of the experimental range. Further, the angular observable \(P'_5\) can be accommodated within 2\(\sigma \). We get the similar results for the fits performed only with the clean observables. On removing \(R_{D^{(*)}}\) from the fit, we find that the tension in the same gets even worse as compared to the SM.

The new physics coupling are found to be highly correlated with large errors. Hence, one should also consider the allowed range of various observables. In Fig. 1 we present the \(2\sigma \) contours of various observables in the space of couplings.

It is evident from the left panel of top row of Fig. 1 that the \(R_{K^{(*)}}\) (grey) and \(R_{D^{(*)}}\) (magenta) regions do not overlap even at \(2\sigma \). The brown region depicts constraints on this model due to LFV decays, \(Br(\tau \rightarrow \phi \mu )\) and \(Br(B\rightarrow K\tau ^{\pm }\mu ^{\mp })\). Evidently, these puts a tight constraints on this model. On the other hand, the yellow region shows that the \(Br(\tau \rightarrow e \nu \bar{\nu })\) does not put additional constraint on this model. Note that for a different mixing pattern, the Refs. [64, 65] found this to be extremely constraining for the \(U_1\) model which otherwise was found to be viable in their setup.

Table 4 Best fit values of the mixing angles in \(U_3\) leptoquark model, For Fit 1, dof=122, \(\chi ^2/dof = 0.97\)
Table 5 Fit results for \((U_1 + U_3)\), \((U_1 + S_3)\) and \((U_3 + S_3)\) combinations of two LQ particles

The main reason is that our parametrization is more flexible. In particular, we obtain the transformation matrices Eq. 18 of [64, 65] only in the limit \(\theta _{bs}\rightarrow 0\) and \(\theta _{\mu \tau } \rightarrow 0\). However, in our analysis we allow the parameters \(\theta _{bs}\) and \(\theta _{\mu \tau }\) to be free and fit them from the real data. Our parametrization is more suitable because it provides more flexibility to satisfy the additional constraints such as \(\mathscr {B}(\tau \rightarrow e \nu \bar{\nu })\). To understand this, we note that according to Eq. 97 of Ref. [64, 65] this branching ratio is driven by the Wilson coefficient \((C_{L}^{\tau \ell })_{31} \propto g_2 \lambda ^{u}_{33} \lambda _{33}^e\) which according to [64, 65] is \( \simeq -0.5\), simply because they have set \(\lambda ^u_{33}=\lambda ^e_{33}=1\) and \(g_2=-0.5\) for U1 leptoquark. In contrast, according to our parametrization we get \((C_{L}^{\tau \ell })_{31} \sim g_2 \cos ^2{\theta _{bs}} \cos ^2{\theta _{\mu \tau }}\), which is a function of the mixing angles. Furthermore, in our analyses the parameter \(g_2\) is fixed by model but in a general EFT analysis [64, 65], \(C_3\) (\(=g_2\)) is a free parameter and varying \(C_3\) it as done in [64, 65], is not equivalent to varying \(\theta _{bs}\) and \(\theta _{\mu \tau }\) because the later parameters also appear in all other observables such as \(R_K\), \(R_D\) with a different functional dependence.

The upper right panel in Fig. 1 shows the same plot for \(g_{U_1}^{33} = 2\), clearly the higher values of the coupling also does not help. Therefore, we conclude that the \(U_1\) LQ, in this present setup is not a viable model for the combined explanation of the charge and neutral current anomalies.

The fit results for \(S_3\) and \(U_3\) models are presented in the Tables 3 and 4 respectively. From these one can infer that: these models are able to explain \(R_{K^{(*)}}\) within 1\(\sigma \), and reduces the tension in the \(R_{D^{(*)}}\). The central value of \(R_{K^{*}}\) for the low \(q^2\) does not come within 1\(\sigma \). The angular observable \(P'_5\) can be accommodated within \(2 \sigma \). The tension in \(R_{D^{(*)}}\) becomes worse on removing \(b \rightarrow c \tau \bar{\nu }\) data from the fit. In the fit with only clean observables, the results of the global fit are almost unchanged. The left and right panels of in the bottom row of Fig. 1 represents the \(2\sigma \) contours of the relevant observables in the couplings plane for \(S_3\) and \(U_3\) models, respectively. In \(U_3\) model, the \(R_{K^{(*)}}\) and \(R_{D^{(*)}}\) regions do overlap within \(2\sigma \), but this is challenged by the constraint coming from the upper bound on \(b \rightarrow s \nu \bar{\nu }\) transitions. Again, we find that the constraints coming from the LFV decays are quite stringent. Further, we find that the \(Br(\tau \rightarrow e \nu \bar{\nu })\) is not important for these models.

Finally, we try a combination of two LQ particles taking the LQ couplings to be a free parameter. These results are shown in Table 5. Clearly, \(b\rightarrow s\nu \bar{\nu }\) is a big challenge for all combinations.

5 Conclusions

The measurements of \(R_{K^{(*)}}\) by the LHCb collaboration has reinforced the earlier hints of lepton universality violation observed in \(R_{D^{(*)}}\). In this work we look for simultaneous explanations of these measurements in VB, \(U_1\), \(S_3\) and \(U_3\) models. Here we assume a coupling only to the third generation in the gauge basis. Performing ‘a global fit’ to all relevant data, we find that the vector boson model violates the upper bound on the branching ratio of \(\tau \rightarrow 3\mu \) and hence is inconsistent with the present data. The \(U_1\) LQ model can not accommodate the \(R_{K^{(*)}}\) and \(R_{D^{(*)}}\) anomalies. This is evident from the fit as well as from the allowed regions which do not overlap even at 2\(\sigma \). We also find that, with considered structure of the mixing in this work, the \(Br(\tau \rightarrow e \nu \bar{\nu })\) which arises due to RGE effects, does not put constraint on this model. Further, we find that the \(S_3\) and \(U_3\) LQ models are highly constrained by the \(b \rightarrow s \nu \bar{\nu }\) data.