Since the b-quark mass is much larger than the QCD asymptotic scale, \(m_b>>\Lambda _{\mathrm{QCD}}\), the leptonic decay of the heavy quarkonium \(\Upsilon (1S)\) is one of the important channel for testing the non-relativistic QCD theories. At present, the decay width \(\Gamma (\Upsilon (1S)\rightarrow e^+ e^-)\) has been calculated up to next-to-next-to-next-to-leading order (\(\hbox {N}^3\hbox {LO}\)) level [1,2,3,4,5,6,7,8,9,10,11,12]. At the \(\hbox {N}^3\hbox {LO}\) level, the conventional renormalization scale uncertainty is still very large, which is usually estimated by varying the renormalization scale (\(\mu _r\)) within the assumed range of [3, 10] GeV. However, at this perturbative order, the predicted decay width is still lower than the PDG averaged experimental value [5], i.e. \(\Gamma _{\Upsilon (1S)\rightarrow e^+ e^-}|_{\mathrm{Exp.}} = 1.340(18)\) keV [13]. It has been pointed out that the conventional scale-setting approach, in which the renormalization scale is guessed and usually chosen as the one to eliminate the large logs, will meet serious theoretical problems due to the mismatching of \(\alpha _s\) and the coefficients at each perturbative order, and its accuracy depends heavily on the how many terms of the pQCD series are known and the convergence of the pQCD series [14]. It is thus important to adopt a proper scale-setting approach so as to achieve a more accurate fixed-order pQCD prediction.

In year 2015, the authors of Ref. [15] used the principle of maximum conformality (PMC) [16,17,18,19] to eliminate such scale ambiguity and predicted, \(\Gamma _{\Upsilon (1S) \rightarrow e^+ e^-} \sim 1.27\) keV. This value agrees with experimental value within errors by further considering the factorization scale uncertainty. However, the analysis there was done by using the PMC multi-scale approach (PMC-m) [18, 19], in which the PMC scales at each order are different and are of perturbative nature whose values for higher-order terms are of less accuracy due to more of its perturbative terms are unknown, leading to a somewhat larger residual scale dependence. More explicitly, for a \(\hbox {N}^3\hbox {LO}\)-level pQCD series of \(\Gamma _{\Upsilon (1S) \rightarrow e^+ e^-}\), the PMC-m approach shows that there are three PMC scales for its LO, NLO and NNLO terms accordingly [15]: the LO PMC scale is at the \(\hbox {N}^2\)LL accuracy, the NLO PMC scale is at the NLL accuracy and the NNLO PMC scale is at the LL accuracy, respectively.

Recently, a single-scale PMC scale-setting approach (PMC-s) has been suggested by Ref. [20], which fixes the scale by using all the \(\beta \)-terms of the process as a whole and can achieve a scale independent and scheme independent prediction at any fixed order, satisfying the renormalization group invariance [21]. Since such scale is determined by using the renormalization group equation, it determines an effective value of the strong coupling constant \(\alpha _s(Q_*)\), whose argument \(Q_*\) corresponds to an overall effective momentum flow of the process. In this paper, as an attempt, we adopt the PMC-s approach with the purpose of achieving a more accurate pQCD prediction free of renormalization scale error on the \(\Upsilon (1S)\) leptonic decay width.

Up to \(\hbox {N}^3\hbox {LO}\)-level, the decay width \(\Gamma _{\Upsilon (1S)\rightarrow \ell ^+\ell ^-}\) can be written in the following form by using the degeneracy relations among different orders [18, 19]

$$\begin{aligned} \Gamma _3= & {} r_{1,0}a_s^3(\mu _{r}) + (r_{2,0}+3\beta _{0}r_{2,1})a_{s}^{4}(\mu _{r}) \nonumber \\&+(r_{3,0}+3\beta _{1}r_{2,1}+ 4\beta _{0}r_{3,1}+ 6\beta _{0}^{2}r_{3,2})a_{s}^{5}(\mu _{r})\nonumber \\&+(r_{4,0}+3\beta _{2}r_{2,1}+ 4\beta _{1}r_{3,1} +5\beta _{0}r_{4,1} + \nonumber \\&\frac{27}{2}\beta _{1}\beta _{0}r_{3,2} +10\beta _{0}^{2}r_{4,2}+10\beta _{0}^{3}r_{4,3}) a_{s}^{6}(\mu _{r}), ~ \end{aligned}$$
(1)

where \(a_s=\alpha _s/4\pi \), and the coefficients \(r_{i,j}\) can be derived from Refs. [6,7,8,9,10, 12], whose explicit expressions have been given in the Appendix of Ref. [15]. The conformal coefficients \(r_{i,0}={\hat{r}}_{i,0}\) are independent of the initial choice of renormalization scale \(\mu _r\), and the non-conformal coefficients \(r_{i,j}\) \((j\ne 0)\) are functions of \(\mu _r\), which can be written as

$$\begin{aligned} r_{i,j}=\sum ^j_{k=0}C^k_j{{{\hat{r}}}}_{i-k,j-k}\mathrm{ln}^k(\mu _r^2/m_b^2),~ \end{aligned}$$
(2)

where \({{{\hat{r}}}}_{i,j}=r_{i,j}|_{\mu _r=m_b}\), \(C^k_j\) is defined as \(j!/(k!(j-k)!)\), and ijk are the polynomial coefficients. By substituting Eq. (2) into Eq. (1), the decay width \(\Gamma _{\Upsilon (1S)\rightarrow \ell ^+\ell ^-}\) can be written as

$$\begin{aligned} \Gamma _3= & {} {{{\hat{r}}}}_{1,0}a_s^3(\mu _{r}) + [{{{\hat{r}}}}_{2,0}+3\beta _{0}({{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0} \ln \frac{\mu _r^2}{m_b^2})]a_{s}^{4}(\mu _{r}) \nonumber \\&+[{{{\hat{r}}}}_{3,0}+3\beta _{1}({{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0} \ln \frac{\mu _r^2}{m_b^2})+ 4\beta _{0}({{{\hat{r}}}}_{3,1}\nonumber \\&+{{{\hat{r}}}}_{2,0} \ln \frac{\mu _r^2}{m_b^2})+ 6\beta _{0}^{2}({{{\hat{r}}}}_{3,2}+2{{{\hat{r}}}}_{2,1} \nonumber \\&\ln \frac{\mu _r^2}{m_b^2}+{{{\hat{r}}}}_{1,0} \ln ^2\frac{\mu _r^2}{m_b^2})]a_{s}^{5}(\mu _{r})\nonumber \\&+[{{{\hat{r}}}}_{4,0}+3\beta _{2}({{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0} \ln \frac{\mu _r^2}{m_b^2})\nonumber \\&+ 4\beta _{1}({{{\hat{r}}}}_{3,1}+{{{\hat{r}}}}_{2,0} \ln \frac{\mu _r^2}{m_b^2}) +5\beta _{0}({{{\hat{r}}}}_{4,1}+{{{\hat{r}}}}_{3,0} \ln \frac{\mu _r^2}{m_b^2}) \nonumber \\&+\frac{27}{2}\beta _{1}\beta _{0}({{{\hat{r}}}}_{3,2}+2{{{\hat{r}}}}_{2,1} \ln \frac{\mu _r^2}{m_b^2}+{{{\hat{r}}}}_{1,0} \ln ^2\frac{\mu _r^2}{m_b^2}) \nonumber \\&+10\beta _{0}^{2}({{{\hat{r}}}}_{4,2}+2{{{\hat{r}}}}_{3,1} \ln \frac{\mu _r^2}{m_b^2}+{{{\hat{r}}}}_{2,0} \ln ^2\frac{\mu _r^2}{m_b^2})\nonumber \\&+10\beta _{0}^{3}({{{\hat{r}}}}_{4,3}+3{{{\hat{r}}}}_{3,2} \ln \frac{\mu _r^2}{m_b^2}+3{{{\hat{r}}}}_{2,1} \ln ^2\frac{\mu _r^2}{m_b^2}\nonumber \\&+{{{\hat{r}}}}_{1,0} \ln ^3\frac{\mu _r^2}{m_b^2})] a_{s}^{6}(\mu _{r}). \end{aligned}$$
(3)

Applying the standard PMC-s procedures [20], all the non-conformal terms should be resummed into the running coupling. The \(\hbox {N}^3\hbox {LO}\)-level leptonic decay width \(\Gamma _3\) changes to the following conformal series,

$$\begin{aligned} \Gamma _3|_{\mathrm{PMC-s}}= & {} {{{\hat{r}}}}_{1,0}a_s^3(Q_*) + {{{\hat{r}}}}_{2,0}a_s^4(Q_*)+{{{\hat{r}}}}_{3,0}a_s^5(Q_*)\nonumber \\&+ {{{\hat{r}}}}_{4,0}a_s^6(Q_*),~ \end{aligned}$$
(4)

where \(Q_*\) is the PMC scale that determines the effective momentum flow and hence the effective running coupling \(\alpha _s(Q_*)\) of the process. More explicitly, the PMC scale \(Q_*\) is obtained by first shifting the scale \(\mu _r\) in \(a_s\) to \(Q_*\) in Eq. (3) by using scale displacement relation of the strong coupling constant, i.e.

$$\begin{aligned} a^k_s(\mu _r) = a^k_s(Q_*) - k \beta _0 \ln \frac{\mu _r^2}{Q_*^2} a^{k+1}_s(Q_*)+{\mathcal {O}}[a^{k+2}_s(Q_*)]. \end{aligned}$$

Then the PMC scale \(Q_*\) is obtained by requiring all the non-conformal terms vanish, e.g.

$$\begin{aligned} 0= & {} 3\beta _{0}[{{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0}(\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})]a_{s}^{4}(Q_*) \nonumber \\&+\{3\beta _{1}[{{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})]\nonumber \\&+ 4\beta _{0}[{{{\hat{r}}}}_{3,1}+{{{\hat{r}}}}_{2,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})]\nonumber \\&+ 6\beta _{0}^{2}[{{{\hat{r}}}}_{3,2}+2{{{\hat{r}}}}_{2,1} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})\nonumber \\&+{{{\hat{r}}}}_{1,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})^2]\}a_{s}^{5}(Q_*)\nonumber \\&+\{3\beta _{2}[{{{\hat{r}}}}_{2,1}+{{{\hat{r}}}}_{1,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})] \nonumber \\&+ 4\beta _{1}[{{{\hat{r}}}}_{3,1}+{{{\hat{r}}}}_{2,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})]\nonumber \\&+5\beta _{0}[{{{\hat{r}}}}_{4,1}+{{{\hat{r}}}}_{3,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})] \nonumber \\&+\frac{27}{2}\beta _{1}\beta _{0}[{{{\hat{r}}}}_{3,2}+2{{{\hat{r}}}}_{2,1} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})\nonumber \\&+{{{\hat{r}}}}_{1,0} (\ln \frac{\mu _r^2}{m_b^2}+\ln \frac{Q_*^2}{\mu _r^2})^2]+10\beta _{0}^{2}[{{{\hat{r}}}}_{4,2}+2{{{\hat{r}}}}_{3,1} \nonumber \\&(\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})\nonumber \\&+{{{\hat{r}}}}_{2,0} (\ln \frac{\mu _r^2}{m_b^2}+\ln \frac{Q_*^2}{\mu _r^2})^2]+10\beta _{0}^{3}[{{{\hat{r}}}}_{4,3}\nonumber \\&+3{{{\hat{r}}}}_{3,2} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})+3{{{\hat{r}}}}_{2,1} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})^2\nonumber \\&+{{{\hat{r}}}}_{1,0} (\ln \frac{\mu _r^2}{m_b^2}+ \ln \frac{Q_*^2}{\mu _r^2})^3]\} a_{s}^{6}(Q_*)+{\mathcal {O}} [a_{s}^{7}(Q_*)]. ~ \end{aligned}$$
(5)

Due to its perturbative nature, we expand the solution of \(\ln \frac{Q^2_*}{m^2_b}\) as a power series over \(a_s(Q_*)\), i.e.

$$\begin{aligned} \ln \frac{Q^2_*}{m^2_b}=\sum _{i=0}^{2} S_{i}a^i_s(Q_*), \end{aligned}$$
(6)

where \(S_i\) are perturbative coefficients that can be determined up to next-to-next-to-leading-log (\(\mathrm{N^2LL}\)) accuracy by using the known \(\hbox {N}^3\hbox {LO}\)-level series \(\Gamma _3\). By further using the scale displacement relation between the coupling \(a_s(Q_*)\) at the \(k_{th}\)-order and \(a_s(m_b)\) as

$$\begin{aligned} a^k_s(Q_*) = a^k_s(m_b) - k \beta _0 \ln \frac{Q_*^2}{m_b^2} a^{k+1}_s(m_b)+{\mathcal {O}}[a^{k+2}_s(m_b)], \end{aligned}$$

we finally obtain

$$\begin{aligned} \ln \frac{Q^2_*}{m^2_b}=T_0+T_1 a_s(m_b)+T_2 a^2_s(m_b)+{\mathcal {O}} [a^3_s(m_b)], \end{aligned}$$
(7)

where the coefficients \(T_i~(i=0, 1, 2)\) are

$$\begin{aligned} T_0= & {} -\frac{{{{\hat{r}}}}_{2,1}}{{{{\hat{r}}}}_{1,0}}, \end{aligned}$$
(8)
$$\begin{aligned} T_1= & {} \frac{2 \beta _0 ({{{\hat{r}}}}_{2,1}^2-{{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{3,2})}{{{{\hat{r}}}}_{1,0}^2}+\frac{4 ({{{\hat{r}}}}_{2,0} {{{\hat{r}}}}_{2,1}-{{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{3,1})}{3{{{\hat{r}}}}_{1,0}^2}, \end{aligned}$$
(9)

and

$$\begin{aligned} T_2= & {} \frac{5 \beta _1 ({{{\hat{r}}}}_{2,1}^2-{{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{3,2})}{2 {{{\hat{r}}}}_{1,0}^2} \nonumber \\&+\frac{16 ({{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{2,0} {{{\hat{r}}}}_{3,1}-{{{\hat{r}}}}_{2,0}^2 {{{\hat{r}}}}_{2,1})+15 ({{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{2,1} {{{\hat{r}}}}_{3,0}-{{{\hat{r}}}}_{1,0}^2 {{{\hat{r}}}}_{4,1})}{9 {{{\hat{r}}}}_{1,0}^3} \nonumber \\&-\frac{2 \beta _0 (8 {{{\hat{r}}}}_{2,1} {{{\hat{r}}}}_{3,1} {{{\hat{r}}}}_{1,0}-5 {{{\hat{r}}}}_{4,2} {{{\hat{r}}}}_{1,0}^2+4 {{{\hat{r}}}}_{2,0} {{{\hat{r}}}}_{3,2} {{{\hat{r}}}}_{1,0}-7 {{{\hat{r}}}}_{2,0} {{{\hat{r}}}}_{2,1}^2)}{3 {{{\hat{r}}}}_{1,0}^3}\nonumber \\&+\frac{2 \beta _0^2 (12 {{{\hat{r}}}}_{1,0} {{{\hat{r}}}}_{3,2} {{{\hat{r}}}}_{2,1}-7 {{{\hat{r}}}}_{2,1}^3-5 {{{\hat{r}}}}_{1,0}^2 {{{\hat{r}}}}_{4,3})}{3 {{{\hat{r}}}}_{1,0}^3}. \end{aligned}$$
(10)

It is found that \(Q_*\) is exactly free of \(\mu _r\) at any fixed-order, indicating that the conventional ambiguity of setting \(\mu _r\) is eliminated. Such exactly cancellation of \(\mu _r\)-dependence is due to the fact that, as shown by Eq. (5), the coefficients of \(\ln {\mu _r^2}/{m_b^2}\) are exactly the same as those of \(\ln {Q_*^2}/{\mu _r^2}\). This shows that one can choose any perturbative value as the renormalization scale to finish the perturbative calculations, and the resultant scale \(Q_*\) shall be independent to such choice. Thus, together with the \(\mu _r\)-independent conformal coefficients, the PMC decay width \(\Gamma _3|_{\mathrm{PMC-s}}\) shall be independent to the initial choice of the renormalization scale.

As a subtle point, because the \(\hbox {N}^3\)LL-order and higher-order terms of the perturbative series (7), e.g. \(\mathcal{O}(a_s^3)\)-terms, are unknown, the scale \(Q_*\) shall have a residual scale dependence. Such residual scale dependence is different from the arbitrary conventional \(\mu _r\)-dependence, since it is generally negligible due to a faster pQCD convergence [24]. As shall be shown below, the residual scale dependence for a \(\hbox {N}^3\hbox {LO}\) decay width \(\Gamma _{\Upsilon (1S)\rightarrow e^+ e^-}\) is negligible due to both \(\alpha _s\)-suppression and exponential suppression.

To do the numerical calculation, we take the four-loop \(\alpha _s\)-running behavior, and use \(\alpha _s(M_{Z})= 0.1181\pm 0.0011\) [13] to fix the QCD asymptotic scale \(\Lambda _{\mathrm{QCD}}\). We adopt the fine structure constant \(\alpha (2m_b)=1/132.3\) [25]. The b-quark \(\overline{\mathrm{MS}}\)-mass \({\bar{m}}_b({\bar{m}}_b)= 4.18\pm 0.03\) GeV [13], and by using the four-loop relation between the \(\overline{\mathrm{MS}}\) quark mass and the pole quark mass [26], we obtain the b-quark pole mass \(m_b=4.93\pm 0.03\) GeV.

Using Eq. (7), we obtain

$$\begin{aligned} \mathrm{ln}\frac{Q_*^2}{m_b^2}= & {} -2.61-72.52 a_s(m_b) + 6089.58 a^2_s(m_b)\nonumber \\&\pm |6089.58 a^3_s(\mu )|^{\mathrm{MAX}}_{\mu \in [m_b/2, 2m_b]}\nonumber \\= & {} -2.61-1.24+1.77 \pm 0.065, \end{aligned}$$
(11)

which leads to \(Q_*=1.75\pm 0.06\) GeV. Here as an estimation of those contributions from unknown higher-order terms, as suggested by Refs. [22, 23], we take the maximum value of \(|T_2 a^3_s(\mu )|\) with \(\mu \in [m_b/2, 2m_b]\) as a conservative prediction of the magnitude of the uncalculated \(a_s^3\)-terms, which causes a scale shift \(\Delta Q_*=\pm 0.06\) GeV. Such a small scale shift (\(\sim \pm 3\%\)) is reasonable, since the value of \(Q_*\) suffers from both \(\alpha _s\)-suppression and exponential-suppression. Thus, Eq. (11) indicates that the typical momentum flow of the decay, \(\Upsilon \rightarrow e^+ e^-\), is about 1.75 GeV, which is only half of the usually guessed choice of 3.50 GeV. Thus by using the present known \(\hbox {N}^3\hbox {LO}\) pQCD series, the accurate typical momentum flow for \(\Upsilon \rightarrow e^+ e^-\) can be achieved.

Fig. 1
figure 1

The \(\Upsilon (1S)\) leptonic decay width \(\Gamma _n\) up to \(n_{\mathrm{th}}\)-order QCD corrections as a function of the renormalization scale \(\mu _r\) under the conventional sale-setting, where \(n=(0,1,2,3)\)

We present the decay width \(\Gamma _{n}\) up to \(n_{\mathrm{th}}\)-order QCD corrections under conventional scale-setting in Fig. 1. As expected, if the renormalization scale \(\mu _r\) is large enough, e.g. \(\mu _r>3\) GeV, the renormalization scale dependence becomes smaller with the increment of loop corrections. On the other hand, it is found that the PMC prediction on \(\Gamma _{n}\) under the PMC-s approach is independent to the choice of \(\mu _r\) at any fixed \(n_{\mathrm{th}}\)-order.

Fig. 2
figure 2

The \(\hbox {N}^3\hbox {LO}\) decay width for the \(\Upsilon (1S)\) leptonic decay versus the renormalization scale \(\mu _r\) under the PMC-s, PMC-m and conventional scale-setting approaches

To show the scale dependence more explicitly, we present the \(\hbox {N}^3\hbox {LO}\) decay width \(\Gamma _3\) in Fig. 2, where the results under conventional and PMC-m scale-setting approaches are presented as a comparison. Firstly, the conventional scale-setting approach leads to the largest renormalization scale dependence, e.g. \(\Gamma _{3}|_{\mathrm{Conv.}}=[0.665, 0.824]\) keV for \(\mu _r\in [3,10]\) GeV, which are only about \(50\%-60\%\) of the experimental value \(\Gamma ^{\mathrm{exp.}}\simeq 1.340\) keV. Secondly, such conventional renormalization scale dependence is suppressed by using the PMC-m approach, and a more larger decay width can be achieved. But as has been observed in Ref. [15], there is still large residual scale dependence due to a somewhat larger \(\mu _r\)-dependence for its NLO and NNLO PMC scales, e.g. \(\Gamma _{3}|_{\mathrm{PMC-m}}=[1.049, 1.353]\) keV for \(\mu _r\in [3,10]\) GeV. Such large residual scale dependence for PMC-m approach is reasonable, since the \(\Gamma _3\) perturbative series starts at \(\alpha _s^3\)-order, slight change of its arguments shall result in large scale uncertainty for the decay width. This fact make the process inversely provides a good platform for testing the correct running behavior of the strong coupling constant. Finally, Fig. 2 shows that, by using the PMC-s approach, the \(\Upsilon (1S)\) leptonic decay width is unchanged for any choice of \(\mu _r\), e.g. \(\Gamma _3|_{\mathrm{PMC-s}}\equiv 1.262\) keV.

Table 1 Contribution from each order for the \(\hbox {N}^3\hbox {LO}\) decay width \(\Gamma _3\) (in unit: keV) under the PMC-s and conventional (\(\mu _r=3.5\) GeV) scale-setting approaches

We present the contributions from each order for \(\Gamma _3\) in Table 1. Under conventional scale-setting, the magnitude of the NLO, \(\hbox {N}^2\)LO and \(\hbox {N}^3\hbox {LO}\) term is about \(5\%\), \(95\%\), and \(50\%\) of the LO term, respectively. It shows that even at the present known \(\hbox {N}^3\hbox {LO}\) level, the conventional pQCD convergence is not as good as required. After applying the PMC, the pQCD convergence is improved, the magnitude of \(\hbox {N}^3\hbox {LO}\) term is only \(\sim 8\%\) of the LO term. More over, as a conservative estimation of the magnitude of the unknown \(\hbox {N}^4\)LO-terms of the PMC series, we set its value as \(\Delta _4=\pm |r_{4,0} a_s^{7}(Q_*)|\). It is negligibly small, e.g. \(\Delta _4 \sim \pm 0.002 ~\mathrm{keV}\).

Moreover, after eliminating the renormalization scale uncertainties via using PMC-s approach, there are still uncertainty sources, such as the \(\alpha _s\) fixed-point error \(\Delta \alpha _s(M_Z)\), the choices of b-quark pole mass \(m_b\), the choices of the factorization scale, and etc.

As for the \(\alpha _s\) fixed-point error, by using \(\Delta \alpha _s(M_Z)=0.0011\) [13] to fix the \(\alpha _s\) value at the required scales, we have \(\Lambda _{\mathrm{QCD},n_f=4}=296\pm 16\) MeV, which lead to

$$\begin{aligned} \Gamma _{\Upsilon (\mathrm{1S})\rightarrow \ell ^+\ell ^-}|_{\mathrm{PMC-s}} =1.262^{+0.161}_{-0.138} ~\mathrm{keV} \end{aligned}$$
(12)

and

$$\begin{aligned} \Gamma _{\Upsilon (\mathrm{1S})\rightarrow \ell ^+\ell ^-}|_{\mathrm{Conv.}} =0.779^{+0.054}_{-0.050} ~\mathrm{keV}. \end{aligned}$$
(13)

As shall show below, such fixed-point error \(\Delta \alpha _s(M_Z)\) dominates the error for \(\Upsilon (1S)\) leptonic decay width. This indicates that after applying the PMC-s approach, even if we have achieved a renormalization scale-independent conformal coefficients for each perturbative order and have determined the correct momentum flow of the process (being the argument of \(\alpha _s\)), we still need an accurate referenced fixed-point value \(\alpha _s(M_Z)\) so as to a determine an accurate \(\alpha _s\) at any scales and hence to achieve a more accurate pQCD prediction. Here, the conventional error of \(\Delta \Gamma |_{\mathrm{Conv.}}=\left( ^{+0.054}_{-0.050}\right) ~\mathrm{keV}\) is predicted by fixing \(\mu _r=3.5\) GeV. Equations (12) and (13) shows that the conventional error is smaller than the PMC-s one, this is because that the determined effective scale \(Q_*=1.75\;\mathrm{GeV}\) is smaller than 3.50 GeV, and then the value of \(\alpha _s(Q_*)\) is more sensitive to the variation of \(\Lambda _{\mathrm{QCD}}\).

Fig. 3
figure 3

The \(\hbox {N}^3\hbox {LO}\) decay width for \(\Upsilon (1S)\) leptonic decay versus \(m_b\) under the PMC-s and conventional approaches. The solid line is for PMC-s, which is independent to the choice of \(\mu _r\). The error band is for conventional result for \(\mu _r\in [3,10]\) GeV, where the dashed line is for \(\mu _r=3.50\) GeV, the lower edge is for \(\mu _r=10\) GeV and the upper edge is for \(\mu _r=4.45\) GeV, respectively

The \(\hbox {N}^3\hbox {LO}\) leptonic decay width \(\Gamma _{\Upsilon (\mathrm{1S})\rightarrow \ell ^+\ell ^-}\) versus the choice of b-quark pole mass \(m_b\) is presented in Fig. 3. If using \(m_b=4.93\pm 0.03\) GeV, the error \(\Delta \Gamma \) shall be negligibly small for both the PMC-s and conventional results under a fixed choice of \(\mu _r\).

Table 2 The factorization scale errors \(\Delta \Gamma _3\) which are calculated by separately varying \(\mu _h\), \(\mu _s\) and \(\mu _{us}\) by \(\pm 10\%\) of their center values

At present, we have no strict way to set the factorization scale of the process, which is usually chosen as the renormalization scale. For the \(\Upsilon (1S)\) leptonic decay, the question is much more involved, since it involves three typical factorization scales, i.e. the hard one \(\mu _h\sim m_b\), the soft one \(\mu _s\sim m_b v_b\), and the ultra-soft one \(\mu _{us} \sim m_b v_b^2\), where \(v_b \sim \alpha _s(m_b v_b)\) [27] represents the relative velocity between the constituent b and \({\bar{b}}\) quarks in \(\Upsilon \). For definiteness of discussing the factorization scale dependence, we vary the scales \(\mu _h\), \(\mu _s\) and \(\mu _{us}\) within the range of \(\pm 10\%\) of their center values, and the results are presented in Table 2. Table 2 indicates that there is still factorization scale uncertainties after applying the PMC-s approach. The conventional factorization scale uncertainties sound relatively smaller, which are due to accidentally cancellation among different terms involving different scales. In fact, if the process involves only one single energy scale, its factorization scale dependence shall be greatly suppressed if we can set the correct momentum flow of the process by applying the PMC, such kind of examples have been found in Top-pair and Higgs boson production processes [28, 29].

Using the known \(\hbox {N}^3\hbox {LO}\) terms together with the PMC-s approach, we obtain a more accurate renormalization scale independent prediction

$$\begin{aligned} \Gamma _{\Upsilon (\mathrm{1S})\rightarrow \ell ^+\ell ^-}|_\mathrm{PMC-s}=1.262^{+0.195}_{-0.175} ~\mathrm{keV}, \end{aligned}$$
(14)

where the errors are squared average of those from \(\Delta \alpha _s(M_Z)\), \(m_b\), and the choices of the factorization scales. This decay width agrees with the experimental measurement, \(\Gamma _{\Upsilon (1S)\rightarrow e^+ e^-}|_{\mathrm{Exp.}} = 1.340(18)\) keV [13].

Fig. 4
figure 4

The \(\hbox {N}^3\hbox {LO}\)-level \(\Upsilon \) leptonic decay width (\(\Gamma _3\)) under the PMC-s and conventional scale-setting approaches, respectively. The errors are squared averages of the mentioned theoretical uncertainties. The experimental value [13] is also given as a comparison

More explicitly, we present our result of the \(\Upsilon (1S)\) leptonic decay width up to \(\hbox {N}^3\hbox {LO}\) level in Fig. 4, where both the results for PMC-s and conventional scale-setting approaches are presented. Figure 4 shows that the PMC-s prediction agrees with the experimental measurement within errors, while the conventional prediction is well below the data, e.g.

$$\begin{aligned} \Gamma _{\Upsilon (\mathrm{1S})\rightarrow \ell ^+\ell ^-}|_{\mathrm{Conv.}} =0.779^{+0.097}_{-0.137} ~\mathrm{keV}, \end{aligned}$$
(15)

where the errors are squared average of those from \(\Delta \alpha _s(M_Z)\), \(m_b\), and the choices of the factorization scales and by varying the renormalization scale within the range of [3, 10] GeV.

As a summary, in the paper, we have studied the \(\Upsilon (1S)\) leptonic decay width \(\Gamma (\Upsilon (1S)\rightarrow \ell ^+\ell ^-)\) by using the PMC-s scale-setting approach. By using the PMC-s approach, we have found that the overall typical momentum flow of the \(\Upsilon (1S)\) leptonic decay is \(\sim 1.75\) GeV, and then a more accurate fixed-order pQCD prediction for \(\Gamma (\Upsilon (1S)\rightarrow \ell ^+\ell ^-)\) can be achieved, which agrees with the data and is independent to any choice of renormalization scale. Our present analysis provides another good example for emphasizing the importance of a proper scale-setting approach. Before we draw conclusion of whether there is new physics beyond the standard model for a high-energy process, we need first to get the pQCD prediction as accuracy as possible, especially, we need to set a proper scale, corresponding to the correct momentum flow of the process, for perturbative predictions.