1 Introduction

In the B semi-leptonic decays, there is a long-lived puzzle, which is the ‘1/2 vs 3/2’ puzzle [1,2,3,4,5,6]. That is, theoretical calculations predict that semi-leptonic B decays should have a substantially smaller rate to the \({\frac{1}{2}}^+\) than to the \({\frac{3}{2}}^+\) doublet, \(Br(B\rightarrow {D}^{{1}/{2}}_{0,1}\ell \nu _{\ell })\ll Br(B\rightarrow {D}^{{3}/{2}}_{1,2}\ell \nu _{\ell })\), which conflicts with experimental results, \(Br(B\rightarrow {D}^{{1}/{2}}_{0,1}\ell \nu _{\ell })\approx Br(B\rightarrow {D}^{{3}/{2}}_{1,2}\ell \nu _{\ell })\) and inspires a lot of research interests [7,8,9,10,11,12]. Many efforts have been made to overcome this difficulty [13,14,15,16,17], and part of the theoretical-experimental differences, for example, the \(B\rightarrow {D}_{0}\ell \nu _{\ell }\) decay could be improved by adding relativistic corrections [18, 19], while the puzzle regarding the two \(1^+\) states \(D_1\) and \(D_1^{\prime }\) remains to this day.

Since there is a light quark in B and P wave \(D_J\) (\(J=0,1,2\)) mesons, the relativistic corrections are expected large in the transition \(B\rightarrow D_J\). Actually, we have shown that even in a double heavy \(0^- \rightarrow 0^+\) process \(B_c\rightarrow {\chi }_{c0}\ell \nu _{\ell }\), the relativistic corrections are very large [20, 21]. Our previous study confirms that the \(Br(B\rightarrow {D}_{0}\ell \nu _{\ell })\) can be enhanced substantially [19] and consist with data. For the process \(B\rightarrow {D}_{2}\ell \nu _{\ell }\), the relativistic corrections almost cancel each other [20]. So in a relativistic study, the \(Br(B\rightarrow {D}_{2}\ell \nu _{\ell })\) agrees with data [19]. While for the two \(1^+\) final states, choosing our usually used parameters, we find large cancellation between \(^1P_1\) and \(^3P_1\) partial waves in \(D^{\prime }_1\) state, which leads to an one order smaller \(Br(B\rightarrow D^{\prime }_1 \ell \nu _{\ell })\) than \(Br(B\rightarrow {D}_{1} \ell \nu _{\ell })\), see the Fig. 3. Other relativistic studies also obtained the similar results [18, 22]. So relativistic corrections, or similarly, 1/m corrections in Heavy Quark Effective Theory (HQET) [14] can not overcome the ‘1/2 vs 3/2’ puzzle.

In addition, when calculate the strong decays, we find the \(D_1^{\prime }(2430)\) strong decay width is within the range of data, while the \(D_1(2420)\) width is about half of the experimental data, see Fig. 2. So both the strong decay widths and the weak production rates in theory do not agree with data. We know that \(1^+\) states \(D_1\) and \(D^{\prime }_1\) are mixture of \(^1P_1\) and \(^3P_1\), and Ref. [16] pointed out that the mixing may soften the ‘1/2 vs 3/2’ puzzle, therefore, we wonder whether this puzzle could be solved by using a complete relativistic method with the help of such mixing property.

In previous paper [23], we solved the complete Salpeter equation for a \(1^+\) state, and obtained almost complete relativistic wave function (except the instantaneous approximation), which provides us a new way to calculate the mixing angle. We find this almost complete relativistic method brings new phenomena, namely the obtained meson masses inverse and the the mixing angle flips in some parameter spaces. At the same time, it also leads to great changes of some physical quantities in this range. So we wonder whether this new phenomenon could help to explain the ‘1/2 vs 3/2’ puzzle.

In Sect. 2, we will give the relativistic wave functions used and our method to calculate the mixing angle for \(1^+\) states. The methods to calculate the transition matrix elements for weak and strong decays are shown in Sect. 3. Our results are given in Sect. 4. Finally, we give a short summary in Sect. 5.

2 Relativistic wave functions

In this section, we give the relativistic wave functions of the bound states used in this paper. The quantum number \(J^P\) of each term in the wave function is the same as that of the meson. The relativistic wave function of a meson is a function of momentum P, mass M, possible polarization \(\varepsilon \) of the meson, as well as the relative momentum q between quark 1 and anti-quark 2 with \({q}_{_{\bot }}=q-\frac{q\cdot P}{M^2}P\) (\(=(0,\vec {q})\) in the center-of-mass frame of the meson), the quark mass \(m_1\), anti-quark mass \(m_2\), quark energy \(\omega _1\), and anti-quark energy \(\omega _2\) with \(\omega _i=\sqrt{m_i^2+\vec q\,^2}\) (\(i=1,2\)).

2.1 Wave function of a \(1^+\) state and its mixing angle

In a previous paper [23], by solving the complete instantaneous Bethe–Salpeter equation [24, 25] for a \(1^+\) state, we obtain the relativistic wave function for \(D_1\) or \(D^{\prime }_1\),

(1)

where we have defined the shorthands

$$\begin{aligned} x_{+}=\frac{\omega _{1}+\omega _{2}}{m_1\omega _{2}+m_2\omega _{1}}, \quad x_{-}=\frac{\omega _{1}-\omega _{2}}{m_1\omega _{2}+m_2\omega _{1}}. \end{aligned}$$

The normalization condition is

$$\begin{aligned} 1&=\int \frac{{d}^3\vec q}{(2\pi )^3} \frac{8\omega _1\omega _2{\vec {q}}^2}{3M(m_1\omega _2+m_2\omega _1)}\nonumber \\&\quad \times (g_1g_2+2h_1h_2)\equiv \cos ^2\theta +\sin ^2\theta . \end{aligned}$$
(2)

In contrast to the non-relativistic wave function which only contains one independent wave function, our relativistic wave function contains four independent radial wave functions \(g_1({\vec q}^2)\), \(g_2({\vec q}^2)\), \(h_1({\vec q}^2)\) and \(h_2({\vec q}^2)\), whose numerical values are obtained by solving the instantaneous BS equation which contains four coupled equations [23]. Since this method is almost complete relativistic one except the only approximation of instantaneous approach which is suitable for a heavy meson, our solutions can perfectly describe the real physical word, that is the solutions of \(1^+\) states appear in pairs. For example, the first two solutions are all 1P states, they are \(D_1\) and \(D_1^{\prime }\) states with close masses, and the second two are all 2P states, etc, see for examples Refs. [23, 27].

Using this relativistic wave function, we provide a new way to calculate the mixing angle between \(^1P_1\) and \(^3P_1\) states [23, 27], see the second equation of Eq. (2) in this paper, where the mixing angle \(\theta \) is defined. Different from the usual method by using the interaction potential, the mixing angle is calculated by the relativistic wave function, which is more accurate since the wave functions is relativistic. In Eq. (1), \(g_1({\vec q}^2)\) and \(g_2({\vec q}^2)\) are \(^1P_1\) waves, while \(h_1({\vec q}^2)\) and \(h_2({\vec q}^2)\) are \(^3P_1\) waves, so the solutions of Eq. (1) are all mixed states of \(^1P_1\) and \(^3P_1\). But the right hand side of Eq. (2) cannot give an unique angle with definite value and sign since it is an equation with multiple solutions. While we have given the wave functions of \(^1P_1\) and \(^3P_1\) states, separately in Ref. [26]. Comparing the numerical values of complete wave function in Eq. (1) for \(1^+\) state with those of the \(^3P_1\) wave function in Eq. (17) and \(^1P_1\) wave function in Eq. (23) in Ref. [26], we can uniquely determine the value and the sign of the mixing angle, and conclude that the definition of the mixing in Eq. (2) is equivalent to the following commonly used formula

$$\begin{aligned} |D_1(2420)\rangle&=\left| \frac{3}{2}\right\rangle =\cos \theta |^1P_1\rangle +\sin \theta |^3P_1\rangle ,\nonumber \\ |D_1^{\prime }(2430)\rangle&=\left| \frac{1}{2}\right\rangle =-\sin \theta |^1P_1\rangle +\cos \theta |^3P_1\rangle . \end{aligned}$$
(3)

It should be noted that the angle here can be equivalent to \(90^{\circ }\), but it is different from the case of the equal mass system, where the meson has charge conjugate parity, the wave functions of \(|^1P_1\rangle \) and \(|^3P_1\rangle \) cannot exist at the same time. The equivalent \(90^{circ}\) angle here is mainly derived from Eq. (2), for example, when \(\theta \rightarrow -90^\circ \), in fact, it means we have the relation

$$\begin{aligned} \int \frac{{d}^3\vec q}{(2\pi )^3} \frac{8\omega _1\omega _2{\vec {q}}^2}{3M(m_1\omega _2+m_2\omega _1)} (g_1g_2)\rightarrow \cos ^2(-90^\circ )=0. \end{aligned}$$
(4)

Therefore, it is not that the wave function is zero, but that the integral is zero, which is equivalent to an angle of \(-\,90^{\circ }\).

2.2 Wave function of a \(0^-\) state

The relativistic wave function for the B meson, which is a \(0^{-}\) state, is written as [28],

(5)

where the numerical values of radial wave functions \(f^{}_1=f^{}_1({\vec q}^2)\) and \(f^{}_2=f^{}_2({\vec q}^2)\) are obtained by solving the instantaneous BS equation [28].

2.3 Wave function of a \(1^-\) state

The relativistic wave function for the \(D^*\) meson, which is a \(1^{-}\) state, is written as [29],

(6)

where the four independent radial wave functions \(b_1\), \(b_2\), \(b_3\) and \(b_4\) are function of \({\vec q}^2\), and their numerical values are obtained by solving the instantaneous BS equation [29].

3 Transition amplitude

In this section, we give the method of applying the relativistic wave functions to calculate the transition processes, which include the semi-leptonic decays \(B\rightarrow D^{\prime }_1 \ell \nu _{\ell }\), \(B\rightarrow {D}_{1} \ell \nu _{\ell }\), and strong decays \({D}^{\prime }_1 \rightarrow D^{*}\pi \), \({D}_1 \rightarrow D^{*}\pi \).

3.1 Weak decay

Taking the decay \(B^-\rightarrow D^0_1 \ell ^{-}\nu _{\ell }\) as an example, the transition amplitude can be written as

$$\begin{aligned} {\mathcal {M}} = \frac{G_F}{\sqrt{2}} V_{bc} \overline{u}_{\ell } \gamma _\mu (1-\gamma _5) v_{\nu _{\ell }} \left\langle D^0_1|J^{\mu }|B^- \right\rangle , \end{aligned}$$
(7)

where \(G_F\) is the Fermi constant, \(V_{bc}\) is the CKM matrix element, \(J^{\mu }=\bar{b}\gamma ^{\mu }(1 - \gamma _5)c\) is the charged weak current, respectively. The hadronic matrix element in our model is expressed as the overlapping integral over the positive wave functions of the initial and final mesons [30]:

(8)

where \(q_{_{f\bot }}={q}_{_{\bot }}+\frac{m_u}{m_c+m_u} \left( P_{f}-\frac{P_f\cdot P}{M^2}P\right) \), \(\bar{\varphi }^{++}_{_{P_f}}(q_{_{f\bot }}) =\gamma _0\left( {{\varphi }^{++}_{_{P_f}} (q_{_{f\bot }})}\right) ^{\dag }\gamma _0\). \(\varphi ^{\pm \pm }_{_P}({q}_{_{\bot }})\) is the positive wave function, and it is defined as , where the project operator , \(i=1\) and 2 for quark and anti-quark, respectively.

3.2 Strong decay

Taking the OZI allowed strong decay \({D}^0_1 \rightarrow D^{*+}\pi ^-\) as an example. To avoid using the wave function of light meson \(\pi ^-\) which may bring us large errors because the instantaneous approximation is not good for a light meson, we abandon choosing the widely used \({}^3P_0\) model, but choose the method in Ref. [31], then the transition amplitude is written as

$$\begin{aligned} {\mathcal {M}}=\frac{iP_{\pi }^{\mu }}{f_{\pi }}\langle D^{*+}| \bar{d}\gamma _{\mu }\gamma _{5}u|D^{0}_{1}\rangle , \end{aligned}$$
(9)

where \(P_{\pi }\) and \(f_{\pi }\) are the momentum and the decay constant of \(\pi ^{-}\), respectively. The hadronic matrix element can be written as

(10)

where \(q_{_{f\bot }}={q}_{_{\bot }}-\frac{m_c}{m_c+m_d}\left( P_{f} -\frac{P_f\cdot P}{M^2}P\right) \).

4 Results

When solving the Salpeter equation, because the wave function is relativistic, to avoid double counting, we only need a non-relativistic interaction. So we choose the Cornell potential, a linear scalar potential plus a single gluon exchange vector potential [28], \(I(r)=\lambda r-\frac{4\alpha _s}{3r}+V_0\). In our calculation, we choose the following fixed parameters, \(m_u=m_d=0.28\) GeV, \(m_c=1.48\) GeV, \(m_b=5.15\) GeV, and vary the free parameter \(V_0\) to fit ground state masses. In previous paper [23], we fixed others and varied the light quark mass \(m_q\) to predict mixing angle, here we will vary \(\lambda \) to obtain different results. Comparing with \(m_q\), as a parameter of the non-perturbative linear potential, the \(\lambda \) has a wide range in literature.

4.1 Mixing angle

When choosing our usually used value \(\lambda =0.21\) GeV\(^2\) [20], the branching ratio \(Br(B \rightarrow D_1 \ell \nu _{\ell })\) is a little larger than data, while the \(Br(B\rightarrow D^{\prime }_1 \ell \nu _{\ell })\) is much smaller than data. At the same time, we find that there is a strong cancellation between \(^1P_1\) and \(^3P_1\) partial waves of \(D^{\prime }_1\) in the \(B\rightarrow D^{\prime }_1\) transition. To see if this cancellation is sensitive to the mixing angle \(\theta \), we vary the input parameter \(\lambda \). A surprising phenomenon happens, the decay rate is indeed very sensitive to the mixing angle \(\theta \), but not sensitive to the direct variation of parameter \(\lambda \) except in some special range (note that, quark masses and \(\lambda \) are our input parameters, while mixing angle is calculated by using Eq. (2)).

Fig. 1
figure 1

The calculated mixing angle \(\theta \) along with the parameter \(\lambda \). Where \(\theta >0\) with \(\lambda < 0.1788~ \textrm{GeV}^{2}\), while \(\theta <0\) when \(\lambda > 0.1788~ \textrm{GeV}^{2}\). Between the blue and orange dotted lines is the range of our best results

In Fig. 1, for the \(c\bar{q}\) (\(q=u,~d\)) \(1^+\) states \(D_1(2420)\) and \(D_1^{\prime }(2430)\), we show the relation between the mixing angle \(\theta \) and \(\lambda \). When \(\lambda > 0.1788~ \textrm{GeV}^{2}\), the obtained \(\theta \) is negative; when \(\lambda < 0.1788~ \textrm{GeV}^{2}\), \(\theta \) is positive; \(\lambda = 0.1788~ \textrm{GeV}^{2}\) is the coincident point of \(\theta =90^{\circ }\) and \(-\,90^{\circ }\). So \(\theta =90^{\circ }\) and \(-\,90^{\circ }\) are the same point in Fig. 1, at this point equivalent masses \(M(D_1)=M(D_1^{\prime })\) are obtained. On the two sides of this point, the mass splitting and the mixing angle flip the sign, and the former is also called the mass inversion phenomenon [23, 32, 33]. So instead of varying the quark mass \(m_q\) [23], here by changing the parameter \(\lambda \), we also obtain the flip phenomenon which confirms the previous results [23]. Therefore we conclude that this flip phenomenon is not caused by a certain parameter, it should be the case of physics itself.

From Fig. 1, we can see that the values of \(\theta \) are stable in a large parameter range, that is, if we vary the \(\lambda \) in a large range, the \(\theta \) remains almost unchanged (same phenomenon occurs in Ref. [23], where the varying parameter is \(m_q\)). For example, when \(0.2~ \textrm{GeV}^{2}\le \lambda \le 0.29~ \textrm{GeV}^{2}\), \(\theta \) is around \(-\,35.3^\circ \sim -32.3^\circ \); when \(0.02 ~ \textrm{GeV}^{2} \le \lambda \le 0.16~ \textrm{GeV}^{2}\), we have \(59.6^\circ \le \theta \le 63.0^\circ \). While in some special range, \(\theta \) is very sensitive to \(\lambda \). See Fig. 1, in the small range of \(0.16~ \textrm{GeV}^{2}\le \lambda \le 0.20~ \textrm{GeV}^{2}\), \(\theta \) varies from \(63.0^\circ \) to \(90.0^\circ \) (\(-\,90.0^\circ \)), and further from \(-\,90.0^\circ \) to \(-\,35.3^\circ \). The total variation \(\bigtriangleup \theta =81.7^\circ \) is very huge.

The results indicate that, when using the mixing formula Eq. (3) where \(\theta \) is an input parameter, (a) not any \(\theta \) is reasonable; for example, the angle ranges \(0\sim 59.6^\circ \) and \(-\,32.3^\circ \sim -0\) are prohibited in Fig. 1, where we use the red dashed and green dotted line to show the angles of \(-\,35.3^\circ \) and \(54.7^\circ \) in the heavy quark limit from HQET as comparison; (b) large errors may exist. The mixing angle is parameter dependent, while in Eq. (3), the parameters in a certain range are used to estimate some values in another range, which may lead to an uncertain error. For example, if we use the wave functions obtained at \(\theta =-35.3^{\circ }\) to calculate strong decays at the angle \(\theta =60.0^{\circ }\) with Eq. (3), the results are \(\Gamma (D_1\rightarrow D^{*+}\pi ^-)=185\) MeV and \(\Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)=11.3\) MeV. But our direct results at \(\theta =60.0^{\circ }\) are \(\Gamma (D_1\rightarrow D^{*+}\pi ^-)=151\) MeV and \(\Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)=17.3\) MeV.

4.2 Strong decay widths and weak production rates

We show the strong decay widths \(\Gamma (D_1\rightarrow D^{*+}\pi ^-)\) and \(\Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)\) in Fig. 2, and the branching ratios \(Br(B\rightarrow {D}_1\ell \nu _{\ell })\) and \(Br(B\rightarrow {D}_1^{\prime }\ell \nu _{\ell })\) in Fig. 3, where the black dashed line is for \(D_1(2420)\) and green line is for \(D^{\prime }_1(2430)\), respectively. In the left diagram in Fig. 2, the horizontal axis is \(\varphi \), which is defined as \(\varphi =\theta \) when \(\varphi \le 90^{\circ }\) and \(\varphi =180^{\circ }+\theta \) when \(\varphi \ge 90^{\circ }\), where \(\theta \) is the mixing angle. So as an example, in Fig. 2, when \(\varphi =120^{\circ }\), it is actually \(\theta =-60^{\circ }\).

Fig. 2
figure 2

The strong widths of the \(D_1(2420)\) (black dashed line) and \(D^{\prime }_1(2430)\) (green line) decay to \(D^{*+}\pi ^-\). Between the blue and orange dotted lines is the range of our best results

Fig. 3
figure 3

The branching ratios \(Br(B\rightarrow {D}_1\ell \nu _{\ell })\) (black dashed line) and \(Br(B\rightarrow {D}_1^{\prime }\ell \nu _{\ell })\) (green line). Between the blue and orange dotted lines is the range of our best results

Fig. 4
figure 4

The masses of \(D_1(2420)\) (black dashed line) and \(D^{\prime }_1(2430)\) (green line). Between the blue and orange dotted lines is the range of our best results

The strong widths and weak production rates are sensitive to the mixing angle. In most angular ranges, the curves are almost linear, see Figs. 2 and 3. But similar to the case of mixing angle, the widths and branching ratios both remain stable (or change slightly) in a large range of parameter \(\lambda \). While in a special range, they change rapidly. For example, when \(0.20~ \textrm{GeV}^{2}\le \lambda \le 0.25~ \textrm{GeV}^{2}\), we have \(9.70~\textrm{MeV} \ge \Gamma (D_1\rightarrow D^{*+}\pi ^-)\ge 9.42~\textrm{MeV}\), \(188~\textrm{MeV} \le \Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)\le 213~\textrm{MeV}\), but when \(\lambda \) changes from \(0.2~ \textrm{GeV}^{2}\) to \(0.165~ \textrm{GeV}^{2}\), \(\Gamma (D_1\rightarrow D^{*+}\pi ^-)\) varies from 9.70 MeV to 167 MeV, \(\Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)\) from 188 MeV to 15.2 MeV.

4.3 Masses

The masses of \(D_1(2420)\) (black dashed line) and \(D^{\prime }_1(2430)\) (green line) are shown in Fig. 4, where the lower mass 2.42 GeV is our input, so the mass splitting \(\bigtriangleup M\) between them is our prediction. The relation between the \(\bigtriangleup M\) and the \(\lambda \) is much different from those of strong decay width or weak production rate. The \(\bigtriangleup M\) is sensitive to the \(\lambda \), and the curve is perfect linear. On the contrary, the relation between the \(\bigtriangleup M\) and \(\theta \) is special, when the \(\theta \) closes to \(\pm \, 90^{\circ }\), the mass splitting remains small and almost unchanged. When the angle is far away from \(\pm \, 90^{\circ }\), the \(\bigtriangleup M\) changes rapidly along with the variation of \(\theta \). Currently, the detected masses are [34]

$$\begin{aligned}&M(D_1(2420))=2422.1\pm 0.8~ \textrm{MeV},\nonumber \\&M(D^{\prime }_1(2430))=2412\pm 9~ \textrm{MeV}. \end{aligned}$$
(11)

So the experiment gives a very small mass splitting, which can be only realized with a mixing angle around the \(\pm \, 90^{\circ }\), not the heavy quark limit \(-\,35.3^{\circ }\) or \(+54.7^{\circ }\) with a little larger splitting which conflicts with data.

We note that, the angle of mass inversion, or the condition \(M({D_1})=M({D_1^{\prime }})\), happens at \(\theta _1=\pm \, 90^{\circ }\), the equivalent strong decay width \(\Gamma (D_1)=\Gamma (D_1^{\prime })\) happens around \(\theta _2=-80.1^{\circ }\), while \(Br(B\rightarrow {D}_1\ell \nu _{\ell })=Br(B\rightarrow {D}_1^{\prime }\ell \nu _{\ell })\) occurs around \(\theta _3=-70^{\circ }\). These three points are not coincident, \(\theta _1\ne \theta _2\ne \theta _3\), especially the last two, which enables the theoretical results and experimental data to agree with each other. Because when \(Br(B\rightarrow {D}_1\ell \nu _{\ell })\) is close to \(Br(B\rightarrow {D}_1^{\prime }\ell \nu _{\ell })\) at \(\theta _3=-70^{\circ }\), \(\Gamma (D_1)\) is still much smaller than \(\Gamma (D_1^{\prime })\), this is consistent with the experimental data.

4.4 Best fitting results

Comprehensively considering the masses, the strong decay widths, and the weak production rates of \(D_1\) and \(D_1^{\prime }\), we conclude that only the mixing angle is far away from the \(-\,35.3^{\circ }\) in HQET and moves to the direction of \(-\,90^{\circ }\), the theory can provide consistent results with the experimental data. We find if we choose the parameter

$$\begin{aligned} 0.1815~ \textrm{GeV}^{2}\le \lambda \le 0.185~ \textrm{GeV}^{2}, \end{aligned}$$
(12)

see the range between the blue dotted line and orange dotted line in Figs. 1, 2, 3 and 4, all our predictions agree with data.

  1. (1)

    The mixing angle

    $$\begin{aligned} -67.3^\circ \le \theta \le -49.3^\circ , \end{aligned}$$
    (13)

    which is much different from \(-\,35.3^\circ \). This result indirectly explains why relativistic corrections or the 1/m corrections in HQET as well as the Lattice result do not overcome the ‘1/2 vs 3/2’ puzzle, since the corrections, whether being large or small, do not significantly change the value of mixing angle which is around \(-\,35.3^\circ \). And the large value of mixing angle is crucial to provide consistent strong decay widths and weak production rates as well as small mass splitting of \(D_1\) and \(D_1^{\prime }\) with data.

  2. (2)

    The strong decay widths

    $$\begin{aligned} 58.1~\textrm{MeV}&\ge \Gamma (D_1\rightarrow D^{*+}\pi ^-)\ge 20.5 ~\textrm{MeV},\nonumber \\ 128~\textrm{MeV}&\le \Gamma (D_1^{\prime }\rightarrow D^{*+}\pi ^-)\le 168~\textrm{MeV}. \end{aligned}$$
    (14)

    When \(\lambda =0.185\) GeV\(^2\), our results \(\Gamma (D_1)=30.8\) MeV and \(\Gamma (D_1^{\prime })=252\) MeV are consistent and close to the current data [34]

    $$\begin{aligned} \Gamma (D_1)&=31.3\pm 1.9~ \textrm{MeV},\nonumber \\ \Gamma (D^{\prime }_1)&=314\pm 29~ \textrm{MeV}, \end{aligned}$$
    (15)

    respectively. Where the supposition \(\Gamma (D_1)=\Gamma (D_1\rightarrow D^{*}\pi )=\frac{3}{2}\times \Gamma (D_1\rightarrow D^{*+}\pi ^-)\) has been used.

  3. (3)

    The branching ratios

    $$\begin{aligned} 3.95\times 10^{-3}&\le Br(B \rightarrow D_1\ell \nu _{\ell })\le 5.83\times 10^{-3},\nonumber \\ 3.34\times 10^{-3}&\ge Br(B \rightarrow D_1^{\prime }\ell \nu _{\ell })\ge 1.44\times 10^{-3}. \end{aligned}$$
    (16)

    With \(\lambda =0.1815\) GeV\(^2\), the results \(Br(B \rightarrow D_1\ell \nu _{\ell })\times \frac{2}{3}=2.63\times 10^{-3}\) and \(Br(B \rightarrow D^{\prime }_1\ell \nu _{\ell })\times \frac{2}{3}=2.23\times 10^{-3}\) are consistent with the data

    $$\begin{aligned}&Br(B\rightarrow {D}_1\ell \nu _{\ell })\times Br({D}_1 \rightarrow D^{*+}\pi ^-)\nonumber \\&\quad =(3.03\pm 0.20)\times 10^{-3},\nonumber \\&Br(B\rightarrow {D}^{\prime }_1\ell \nu _{\ell })\times Br({D}^{\prime }_1 \rightarrow D^{*+}\pi ^-)\nonumber \\&\quad =(2.7\pm 0.6)\times 10^{-3}, \end{aligned}$$
    (17)

    in PDG [34] and the average experimental results [35]

    $$\begin{aligned}&Br(B\rightarrow {D}_1\ell {\nu }_{\ell })\times Br({D}_1 \rightarrow D^{*+}\pi ^-)\nonumber \\&\quad =(2.81\pm 0.25)\times 10^{-3},\nonumber \\&Br(B\rightarrow {D}^{\prime }_1\ell {\nu }_{\ell })\times Br({D}^{\prime }_1 \rightarrow D^{*+}\pi ^-)\nonumber \\&\quad =(1.9\pm 0.7)\times 10^{-3}. \end{aligned}$$
    (18)
  4. (4)

    The small mass splitting

    $$\begin{aligned} 1.5~\textrm{MeV}\le M(D^{\prime }_1(2430))-M(D_1(2420))\le 2.5~ \textrm{MeV}, \end{aligned}$$
    (19)

    is also confirmed by data. But our result favor the mass of \(D^{\prime }_1(2430)\) being a little larger than that of \(D_1(2420)\).

5 Discussion and conclusion

We have shown the relation of mixing angle \(\theta \) with a varying light quark mass \(m_q\) in previous paper [23], which is similar to the one of ‘\(\theta ~\textrm{vs}~ \lambda \)’ in Fig. 1, see the Fig. 2 in Ref. [23] for detail. That is, in a large \(m_q\) range, the \(\theta \) remains unchanged; while in some special short range of \(m_q\), \(\theta \) closes to \(\pm \, 90^\circ \) (where \(M({D_1})=M({D_1^{\prime }})\)) and changes rapidly. In Fig. 1, we fix \(m_u=0.28\) GeV, \(m_c=1.48\) GeV, and show the relation of ‘\(\theta ~\textrm{vs}~ \lambda \)’, if we change the fixed parameters, for example, set \(m_u=0.35\) GeV and \(m_c=1.62\) GeV, the corresponding new relation of ‘\(\theta ~\textrm{vs}~ \lambda \)’ is shown in Fig. 5. We can see that, the large mixing angles still exists in a small special range. The obvious difference is that this special range changes and moves to the small \(\lambda \) range. The curve on the left is slightly (about 2\(^{\circ }\)) close to \(+54.7^\circ \), while the right is slightly (about 1\(^{\circ }\)) far away from \(-\,35.3^\circ \). So we conclude that although its value depends on parameters, the existence of large angles is not caused by parameters, the physical phenomenon may be like this. Further, we point out that the phenomenon, that the mixing angle, strong decay widths and weak production rates remains almost unchanged over a wide range of parameters, but changes dramatically in some particular parameter range (around \(\theta =\pm \, 90^\circ \)), is not unique to our model, but should exist in any method when the relativistic corrections are considered completely.

Fig. 5
figure 5

The calculated mixing angle \(\theta \) along with the parameter \(\lambda \), with new setting of quark mass parameters

In conclusion, using an almost complete relativistic method, we study the mixing angle, the masses and strong decays widths of \(D_1\) and \(D_1^{\prime }\) as well as their weak production rates in B semileptonic decays. We find in a small special range of parameter \(0.1815~ \textrm{GeV}^{2}\le \lambda \le 0.185~ \textrm{GeV}^{2}\), theoretical results are all consistent with data and solve the puzzle of ‘1/2 vs 3/2’.

The small mass splitting between \(D_1\) and \(D_1^{\prime }\), which is confirmed by experimental data, is crucial to solve the puzzle, since it means that the physics happens to be close to the mass inversion point \(M({D_1})=M({D_1^{\prime }})\) (\(\theta =\pm \, 90^\circ \)), where the mixing angle is large and far away from the \(-\,35.3^{\circ }\) in heavy quark limit. And the large mixing angle leads to the solution of the ‘1/2 vs 3/2’ puzzle.