1 Introduction

The polarized Bjorken sum rule (BSR) \(\Gamma _1^{p-n}(Q^2)\) [1, 2] is an important spacelike QCD observable for various reasons. It is a difference of the first moment of the spin-dependent structure functions of proton and neutron, therefore its isovector nature makes it easier to describe it theoretically, in pQCD in terms of OPE, than the separate integrals of the two nucleons. Further, high quality experimental results for this quantity, obtained in polarized deep inelastic scattering (DIS), are now available in a large range of spacelike squared momenta \(-q^2 \equiv Q^2\): \(0.054 \ \mathrm{GeV}^2 \le Q^2 < 5 \ \mathrm{GeV}^2\) [3,4,5,6,7,8,9,10,11]. In particular, the newer experimental results [5] with highly reduced statistical uncertainties, extracted mainly from the Jefferson Lab CLAS EG1-DVCS experiment [12] taken on polarized targets made of protons and deuterons, render BSR an attractive quantity to test on it various extensions of pQCD to low \(Q^2 \lesssim 1 \ \mathrm{GeV}^2\).

Theoretically, pQCD (with OPE), in \(\overline{\mathrm{MS}}\) scheme, has been the usual approach to describe such quantities, cf. [3,4,5]. This approach, however, has the theoretical disadvantage that the running coupling \(a(Q^2)\) [\(\equiv \alpha _s(Q^2)/\pi \)] possesses Landau singularities at low positive \(Q^2 \lesssim 0.1 \ \mathrm{GeV}^2\), and this makes it inconvenient for evaluation of spacelike observables \(\mathcal{D}(Q^2)\) at low \(Q^2\), such as BSR. In recent years, an extension of pQCD couplings to low \(Q^2\), without Landau singularities, called (Fractional) Analytic Perturbation Theory [(F)APT)] [13,14,15,16,17,18,19,20,21,22,23,24,25,26,27] has been applied in the fitting of the theoretical OPE expression to the experimental inelastic contributions to BSR [28,29,30,31], with good results.

In this work we fit the theoretical OPE expressions to the experimental BSR results in pQCD, in (F)APT, and two additional extensions of QCD to low \(Q^2\), namely the 2\(\delta \) [32, 33] and 3\(\delta \) [34, 35] \({\mathcal {A}}\)QCD. The latter two extensions have the coupling \({\mathcal {A}}(Q^2)\) [the analog of the pQCD coupling \(a(Q^2)\)] which is free of Landau singularities and physically motivated in the entire relevant regime of \(Q^2\) in the complex plane, \(Q^2 \in \mathbb {C} \backslash (-\infty , -M_\mathrm{thr}^2]\), where \(M_\mathrm{thr}^2 \lesssim 1 \ \mathrm{GeV}^2\) is a positive threshold scale. The present work is an extension of our previous work on BSR [36, 37]; in comparison with the latter work, we now vary the \(Q^2\)-range of the fit, include the consideration of the elastic contribution, and estimate the uncertainties of the values of the extracted fit parameters due to systematic (in addition to statistical) uncertainties of the experimental BSR data.

The mentioned QCD variants (F)APT, 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD were constructed directly by imposing certain physically motivated conditions on the QCD coupling. In this context, we point out that there exist several other approaches to the construction of the QCD coupling, among them those using Dyson–Schwinger equations which involve various versions of the dynamically generated gluon mass (the mentioned 2\(\delta \) \({\mathcal {A}}\)QCD gives a coupling with similar properties). For a recent review of various approaches, we refer to [38].

In Sect. 2 we present the theoretical leading-twist and higher-twist OPE contributions for the considered quantity BSR, as well as the parametrization of the elastic contribution to BSR. In Sect. 3 we present the results of various fits of theoretical expressions to the experimental results. Finally, in Sect. 4 we summarize our conclusions.

Most of the formal aspects of the calculations are relegated to Appendices: in Appendix A we present the form of the leading-twist perturbation coefficients for a general renormalization scale and scheme; in Appendix B we present construction of \({\mathcal {A}}_n\), the analogs of pQCD powers \(a^n\), in extensions of pQCD without Landau singularities; in Appendix C we summarize such extensions [(F)APT, 2\(\delta \) and 3\(\delta \)]; in Appendix D we explain how the statistical and systematic uncertainties of the experimental data are reflected in the corresponding uncertainties of the parameters extracted in the fits; and in Appendix E we estimate the effects of the finiteness of the charm quark mass in our evaluations.

2 Bjorken sum rule: theoretical expressions

The polarized Bjorken sum rule (BSR), \(\Gamma _1^{p-n}\), is defined as the difference between proton and neutron polarized structure functions \(g_1\) integrated over the whole x-Bjorken interval

$$\begin{aligned} \Gamma _1^{p-n}(Q^2)=\int _0^1 dx \left[ g_1^p(x,Q^2)-g_1^n(x,Q^2) \right] . \end{aligned}$$
(1)

Based on the various measurements of these and the related structure functions, the inelastic part of the above quantity, \(\Gamma _1^{p-n}(Q^2)_\mathrm{inel.}\), has been extracted at various values of squared momenta \(Q_j^2\) (\(0.054 \ \mathrm{GeV}^2 \le Q_j^2 < 5 \ \mathrm{GeV}^2\)): [3,4,5] (Jefferson Lab), [8, 9] (SLAC), [6, 7] (COMPASS at CERN).Footnote 1

Theoretically, this quantity can be written in the Operator Product Expansion (OPE) form [1, 2]

$$\begin{aligned} \Gamma _1^{p-n, \mathrm {OPE}}(Q^2)= {\Big |}\frac{g_A}{g_V} {\Big |} \frac{1}{6}(1 - \mathcal{D}_\mathrm{BS}(Q^2))+\sum _{i=2}^\infty \frac{\mu _{2i}(Q^2)}{Q^{2i-2}}.\nonumber \\ \end{aligned}$$
(2)

Here, \(|g_A/g_V|\) is the ratio of the nucleon axial charge, \((1-\mathcal{D}_\mathrm{BS})\) is the perturbation expansion for the leading-twist contribution, and \(\mu _{2i}/Q^{2i-2}\) are the higher-twist contributions. The value obtained from neutron \(\beta \) decay is known to a high accuracy, \(|g_A/g_V|=1.2723 \pm 0.0023\) [39], and we will use the central value. In the higher-twist terms, we will take only the terms \(\sim 1/Q^2\) and \(1/(Q^2)^2\).

2.1 Perturbation expansion of the leading-twist

The leading-twist term has the canonical part \(\mathcal{D}_\mathrm{BS}(Q^2)\) whose perturbation expansion in \(a \equiv \alpha _s/\pi \) is known up to \(\hbox {N}^3 \hbox {LO}\) (\(\sim a^4\))

$$\begin{aligned} \mathcal{D}_\mathrm{BS}(Q^2)_\mathrm{pt}= & {} {\bar{a}} + {\bar{d}}_1 {\bar{a}}^2 + {\bar{d}}_2 {\bar{a}}^3 + {\bar{d}}_3 {\bar{a}}^4 + \mathcal{O}({\bar{a}}^5), \end{aligned}$$
(3)

where the bar indicates that the expansion is in the \(\overline{\mathrm{MS}}\) scheme, and the renormalization scale \(\mu ^2\) is implicitly understood to be equal to the physical scale \(Q^2\). The NLO, N\(^2\)LO and N\(^3\)LO coefficients \({\bar{d}}_j\) (\(j=1,2,3\)) were obtained in [40,41,42], respectively. In the considered range of momentum transfer \(0< Q^2 < 5 \ \mathrm{GeV}^2\), we will assume that the effective number of active quark flavors is \(N_f=3\), and therefore only the nonsinglet (NS) contributions appear.Footnote 2

When the renormalization scale is changed from \(\mu ^2=Q^2\) to a general value \(\mu ^2 = k Q^2\) (\(0 < k \sim 1\)), and when the renormalization scheme parameters are changed from the \(\overline{\mathrm{MS}}\) values \({\bar{c}}_j \equiv {\bar{\beta }}_j/\beta _0\) to general scheme parameter values \(c_j\) (\(j \ge 2\)), the perturbation expansion changes accordingly

$$\begin{aligned} \mathcal{D}_\mathrm{BS}(Q^2)_\mathrm{pt}= & {} a(k Q^2) + d_1(k) a(k Q^2)^2 + d_2(k;c_2) a(k Q^2)^3 \nonumber \\&+\, d_3(k; c_2, c_3) a(k Q^2)^4 + \mathcal{O}(a^5). \end{aligned}$$
(4)

The expressions for the new coefficients \(d_1(k)\), \(d_2(k; c_2)\) and \(d_3(k; c_2, c_3)\) are obtained on the basis of independence of the observable \(\mathcal{D}_\mathrm{BS}(Q^2)_\mathrm{pt}\) from k and \(c_j\) (\(j \ge 2\)), and are given in Appendix A.

In those versions of \({\mathcal {A}}\)QCD where the coupling is a holomorphic function \({\mathcal {A}}(Q^2)\) [instead of the nonholomorphic \(a(Q^2)\)] with nonperturbative contributions, the power expansion (4) becomes a nonpower expansion where \(a^n\) get replaced by \({\mathcal {A}}_n\) (\(\ne {\mathcal {A}}^n\))

$$\begin{aligned} \mathcal{D}_\mathrm{BS}(Q^2)_\mathrm{{{\mathcal {A}}}QCD}= & {} {\mathcal {A}}(k Q^2) + d_1(k) {\mathcal {A}}_2(k Q^2) \nonumber \\&+\, d_2(k;c_2) {\mathcal {A}}_3(k Q^2) \nonumber \\&+\, d_3(k; c_2, c_3) {\mathcal {A}}_4(k Q^2) + \mathcal{O}({\mathcal {A}}_5). \end{aligned}$$
(5)

The construction of the power analogs \({\mathcal {A}}_n\) of \(a^n\) were obtained in Ref. [45, 46] for integer n and in Ref. [47] for general real \(n > -1\). A brief description for the construction of \({\mathcal {A}}_n\) is given in Appendix B. These expressions are based on the renormalization group equation (RGE), in close analogy with the RGE in the perturbation theory. The couplings \({\mathcal {A}}_n(Q^2)\) can be obtained once the coupling \({\mathcal {A}}(Q^2)\) is known. The construction of \({\mathcal {A}}(Q^2)\) coupling is summarized in Appendix C for various variants of QCD with holomorphic coupling: (F)APT, 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD, and we refer to that Appendix for more details.

2.2 Higher-twist

In the theoretical OPE expression (2), the term with the dimension \(D=2\) (i.e., \(\propto 1/Q^2\)) has the coefficient

$$\begin{aligned} \mu _4 = \frac{M_N^2}{9} (a_2^{p-n} + 4 d_2^{p-n} + 4 f_2^{p-n}(Q^2)), \end{aligned}$$
(6)

where \(M_N \approx 0.94\) GeV is the nucleon mass, \(a_2^{p-n}\) is the (twist-2) target mass correction, and \(d_2^{p-n}\) is a twist-3 matrix element

$$\begin{aligned} d_2^{p-n} = \int _0^1 dx x^2 (2 g_1^{p-n} + 3 g_2^{p-n}). \end{aligned}$$
(7)

At \(Q^2=1 \ \mathrm{GeV}^2\), we have \(a_2^{p-n}=0.031 \pm 0.010\) and \(d_2^{p-n}=0.008 \pm 0.0036\) [5]. We will neglect \(Q^2\)-dependence of these two quantities [as was done also in Ref. [5]], and will take the central values, i.e., \(a_2^{p-n} + 4 d_2^{p-n}=0.063\). On the other hand, the coefficient \(f_2^{p-n}(Q^2)\) will be a parameter of the fit, and its \(Q^2\)-dependence will not be neglected, its evolution is known [48, 49] in pQCD

$$\begin{aligned} f_2^{p-n}(Q^2)=f_2^{p-n}(Q_\mathrm{in}^2) \left( \frac{a(Q^2)}{a(Q_\mathrm{in}^2)}\right) ^{\gamma _0/8\beta _0}, \end{aligned}$$
(8)

where \(\nu \equiv \gamma _0/(8\beta _0) =32/81\) when \(N_f=3\), and the reference scale will be taken \(Q_\mathrm{in}^2 = 1 \ \mathrm{GeV}^2\). In QCD with holomorphic coupling \({\mathcal {A}}(Q^2)\), the power \(a^{\gamma _0/8\beta _0}\) gets replaced by \({\mathcal {A}}_{\gamma _0/8\beta _0}\) (which is in general not equal to the simple power \({\mathcal {A}}^{\gamma _0/8\beta _0}\), as mentioned above, cf. also Appendix B,

$$\begin{aligned} f_2^{p-n}(Q^2)=f_2^{p-n}(Q_\mathrm{in}^2) \left( \frac{{\mathcal {A}}_{\gamma _0/8\beta _0}(Q^2)}{{\mathcal {A}}_{\gamma _0/8\beta _0}(Q_\mathrm{in}^2)}\right) , \end{aligned}$$
(9)

In addition, in some of the fits we will also include the \(D=4\) term in the theoretical OPE expression (2) \(\mu _6/(Q^2)^2\), where we will consider the coefficient \(\mu _6\) as \(Q^2\)-independent. Thus, our theoretical expression for BSR will be the truncated (at \(D=4\), or \(D=2\)) OPE expression

$$\begin{aligned}&\Gamma _1^{p-n,\mathrm{OPE[4]}}(Q^2; k, f_2^{p-n}(1); \mu _6) = {\Big |} \frac{g_A}{g_V} {\Big |} \frac{1}{6}(1 - \mathcal{D}_\mathrm{BS}(Q^2))\nonumber \\&\quad +\, \frac{M_N^2}{Q^2} \frac{1}{9} \left( a_2^{p-n} +4 d_2^{p-n} + 4 f_2^{p-n}(Q^2) \right) +\frac{\mu _6}{(Q^2)^2}, \end{aligned}$$
(10)

where the leading-twist truncated expressions are given in Eqs. (4) and (5), and the \(Q^2\)-dependent part of the \(D=2\) term (twist-4) in Eqs. (8) and (9), for the pQCD and \({\mathcal {A}}\)QCD version of QCD, respectively. We will regard the renormalization scale parameter \(k \equiv \mu ^2/Q^2\) and the higher-twist coefficients \(f_2^{p-n}(Q^2_\mathrm{in})\) (with \(Q^2_\mathrm{in}=1 \ \mathrm{GeV}^2\)) and \(\mu _6\) as the free parameters to be determined in the fitting procedure.

In our approach, we will include in the leading-twist part of the OPE all the known terms (i.e., up to order \(\sim a^4 \sim {\mathcal {A}}_4\)). The order of the leading-twist terms in general affects the fitted higher-twist contributions (cf. [50,51,52,53,54,55,56,57] for the fit of truncated OPE to structure functions).

2.3 Elastic contribution

The OPE evaluation is, in principle, for inclusive observables; in the case of BSR, this means that the OPE fit should be applied to the experimental values of the sum of the inelastic and elastic contribution.

The elastic contribution to BSR can be expressed [58, 59] in terms of the proton and neutron electromagnetic form factors \(F_1\) and \(F_2\)

$$\begin{aligned} \Gamma _1^{p-n}(Q^2)_\mathrm{el.}= & {} \frac{1}{2} F_1^{p}(Q^2) \left( F_1^{p}(Q^2) + F_2^{p}(Q^2) \right) \nonumber \\&-\, \frac{1}{2} F_1^{n}(Q^2) \left( F_1^{n}(Q^2) + F_2^{n}(Q^2) \right) . \end{aligned}$$
(11)

The most recent parametrization of these form factors was performed in [60] from light-front holographic QCD (LFH QCD). Namely, these form factors can be expressed in terms of the inverse power expressions

$$\begin{aligned} \mathcal{F}_{\tau }\equiv & {} \left( 1 + \frac{Q^2}{M_{\rho }^2} \right) ^{-1} \times \cdots \times \left( 1 + \frac{Q^2}{(2 \tau -3 ) M_{\rho }^2} \right) ^{-1}, \nonumber \\&\quad (\tau =2,3,\ldots ), \end{aligned}$$
(12)

which are products of \(\tau -1\) poles along the vector meson Regge radial trajectory in terms of the \(\rho \)-vector mass \(M_{\rho }=0.755\) GeV and its radial excitations. We have

$$\begin{aligned} F_1^p(Q^2)= & {} \mathcal{F}_{\tau =3}(Q^2), \nonumber \\ F_2^p(Q^2)= & {} \chi _p \left[ (1 - \gamma _p) \mathcal{F}_{\tau =4}(Q^2) + \gamma _p\mathcal{F}_{\tau =6}(Q^2) \right] , \nonumber \\\end{aligned}$$
(13a)
$$\begin{aligned} F_1^n(Q^2)= & {} -\frac{1}{3} r \left[ \mathcal{F}_{\tau =3}(Q^2) - \mathcal{F}_{\tau =4}(Q^2) \right] , \nonumber \\ F_2^n(Q^2)= & {} \chi _n \left[ (1 - \gamma _n) \mathcal{F}_{\tau =4}(Q^2) + \gamma _n\mathcal{F}_{\tau =6}(Q^2) \right] . \nonumber \\ \end{aligned}$$
(13b)

Here: \(\chi _p = \mu _p-1 = 1.793\) and \(\chi _n = \mu _n=-1.913\) are the anomalous moments of p and n, respectively; \(\gamma _{n,p}\) are the higher Fock probabilities for the \(L=0 \rightarrow L=1\) spin-flip electromagnetic form factors, and r is a phenomenological factor. The values of these three parameters are obtained by fitting to polarization data for the form factors [60]: \(\gamma _p=0.27\); \(\gamma _n=0.38\); \(r=2.08\).

As a consequence, the elastic contribution (11) can be represented as a combination of inverse powers \(( 1 + Q^2/M_{\rho }^2/(2 m -1) )\), and for large \(Q^2\) this can be expanded in inverse powers of \(Q^2\)

$$\begin{aligned} \Gamma _1^{p-n}(Q^2)_\mathrm{el.}= & {} 2.3368 \left( \frac{M_{\rho }^2}{Q^2}\right) ^4 \nonumber \\&+\, 13.8851 \left( \frac{M_{\rho }^2}{Q^2}\right) ^5 + \mathcal{O}(1/Q^{12}). \end{aligned}$$
(14)

This means that at high \(Q^2\) the elastic contribution starts with the dimension \(D=8\) term \(\sim (1/Q^2)^4\). Theoretically, it is not included in the truncated OPE expression (10) in pQCD.Footnote 3\(^{,}\)Footnote 4 Further, in 3\(\delta \) and 2\(\delta \) \({\mathcal {A}}\)QCD, the higher dimensional terms up to (and including) the \(D=8\) term are not included in the leading-twist contribution (5) because in these approaches \({\mathcal {A}}(Q^2) - a(Q^2) \sim (\Lambda ^2/Q^2)^5\) as explained in Appendix C.2 Eq. (C7). Therefore, the truncated OPE expression (10) does not contain the dimension \(D=8\) term \(\sim (1/Q^2)^4\) also in 3\(\delta \) and 2\(\delta \) \({\mathcal {A}}\)QCD approaches. The only exception is the (F)APT where \({\mathcal {A}}(Q^2) - a(Q^2) \sim (\Lambda ^2/Q^2)^1\), cf. Appendix C.1, and where the truncated OPE (10) could contain, in principle, all the elastic contributions, including the \(\sim (1/Q^2)^4\) term.

Therefore, it looks more natural to first fit the truncated OPE expression (10), with pQCD and (3\(\delta \) and 2\(\delta \)) \({\mathcal {A}}\)QCD, to the BSR experimental results [3,4,5, 7, 8] which are obtained for the inelastic contribution; and after such a fit, add the elastic contribution (11) [with the parametrization (12)–(13b)]. We note that the theoretical expression obtained for the total BSR in this way is the sum of the expressions (10) and (11), which is again an OPE expression as it should be for such an inclusive spacelike observable as the total BSR.

The other approach would be to fit the truncated OPE expression (10) with the experimental points for the total BSR, i.e., fit with the experimental points for inelastic BSR with the elastic contribution added to them. Such an approach seems less natural, at least in pQCD and 3\(\delta \) and 2\(\delta \) \({\mathcal {A}}\)QCD, because the truncated expression (10) in these cases in principle does not contain the leading elastic contribution \(\sim 1/(Q^2)^4\), cf. Eq. (C7) in Appendix C.2.

Nonetheless, below we will apply, for completeness, both approaches in our numerical fitting procedures.

3 Numerical fits

In the numerical fits, we will consider the following parameters to be fitted in the truncated OPE expression (10): (i) the renormalization scale parameter \(k \equiv \mu ^2/Q^2\) of the (truncated) leading-twist contribution (4) [for pQCD] or (5) [for \({\mathcal {A}}\)QCD]; (ii) the parameter \(f_2^{p-n}(1 \ \mathrm{GeV}^2)\) appearing in the \(D=2\) term of Eq. (10) [cf. Eqs. (8) and (9)]; (iii) and in some fits the parameter \(\mu _6\) in the \(D=4\) term of Eq. (10) will be included. We will take for the experimental data points for the inelastic contribution to BSR the data from [3,4,5, 7, 8];Footnote 5 they are in the momentum interval \(0.054 \ \mathrm{GeV}^2 \le Q_j^2 < 5 \ \mathrm{GeV}^2\). Among these data points, we exclude from the fit the four points with \(Q_j^2 \ge 3 \ \mathrm{GeV}^2\) (three from [5], at \(Q_j^2=3.223\), 3.871 and \(4.739 \ \mathrm{GeV}^2\); and one from [7] at \(Q^2=3 \ \mathrm{GeV}^2\)) because they tend to decrease the quality parameter \(\chi ^2/\mathrm{(d.o.f.)}\) of the fit significantly. Nonetheless, we will show both quality parameters \(\chi ^2/\mathrm{(d.o.f.)}\) obtained from the mentioned fits, i.e., the one with the four points excluded, and included.Footnote 6 The number of active flavors will be kept all the time at \(N_f=3\). The fits will be performed by the least squares method, taking into account the statistical uncertainties \(\sigma _{j,\mathrm{stat}}\) in the points \(Q_j^2\), we will consider them independent of each other. These uncertainties \(\sigma _{j,\mathrm{stat}}\) are in general significantly smaller than the systematic uncertainties \(\sigma _{j,\mathrm{sys}}\). The latter are strongly correlated, and we will consider them as completely correlated. The uncertainties in the values of the extracted parameters \(k \equiv \mu ^2/Q^2\), \(f_2^{p-n}(Q^2_\mathrm{in})\) and \(\mu _6\) are then due to statistical (small) and systematic (larger) uncertainties of the data. We refer to Appendix D on how we obtained the uncertainties of the extracted values of the fit parameters.

Fig. 1
figure 1

Fits of the OPE expression (10) truncated at \(D=2\) term, i.e., \(\mu _6=0\), to the experimental data for \(\Gamma _1^{p-n}(Q^2)_\mathrm{inel.}\), in four different QCD variants, where in the fits: a \(Q^2 \ge 0.66~\mathrm{GeV}^2\); b \(Q^2 \ge 0.268~\mathrm{GeV}^2\). See the text for details. The respective lower bound of the fitting interval, \(Q^2_\mathrm{min}=0.66\) or \(0.268~\mathrm{GeV}^2\), is included as the thin dotted vertical line

Using specific values of the renormalization scale parameter \(k \equiv \mu ^2/Q^2\) may allow us to incorporate in our evaluations (5) at least a part of the contributions of higher orders of the series. It is expected that \(k \sim 1\), and usually it is taken in the literature in the range \(1/2< k < 2\), sometimes \(1/4< k < 4\). In the considered work, after replacing the powers of the pQCD coupling by their analytic counterparts, cf. Eqs. (4)–(5), a spacelike observable depends usually weakly on the contributions of higher orders. Still, in order not to miss the possibly relevant influence of higher orders, we decided to increase the range of possible k values to: \(1/16< k < 16\). This choice still avoids very large or very small renormalization scales where the corresponding coefficients \(d_n(k)\) at the powers \(a(k Q^2)^{n+1}\), or at their analytic counterparts \({\mathcal {A}}_{n+1}(k Q^2)\), contain powers of the large terms \(\sim \ln k\) [cf. Eq. (A3)] which may destroy the convergence of the series already at low n. We will see that the results of the fitting will rarely give the extreme values \(k=16\) or \(k=1/16 \approx 0.063\), mostly in the cases of (\(\overline{\mathrm{MS}}\)) pQCD and (F)APT.

We present below the results of the fits of the inelastic contributions to the truncated OPE expression (10), first in Sect. 3.1 with \(\mu _6=0\), and then in Sect. 3.2 for \(\mu _6 \ne 0\) included as a fit parameter; the elastic contribution is added afterwards. In Sect. 3.3 we present the corresponding fits for the case when the higher-twist term in the OPE has a mass parameter. In Sect. 3.4 we include the fits for two different ansätze for BSR at very low \(Q^2\). Finally, in Sect. 3.5 we present the fits of the truncated OPE expression (10), with \(\mu _6 \ne 0\) included as a fit parameter, applied to the total BSR, i.e., to the sum of the inelastic data points (at \(Q_j^2\)) and the corresponding elastic contribution \(\Gamma _1^{p-n}(Q_j^2)_\mathrm{el.}\).

In each case, the fits are performed for four variants of QCD: in pQCD (in \(\overline{\mathrm{MS}}\)), in (F)APT (in \(\overline{\mathrm{MS}}\)), and in 2\(\delta \) [33] and 3\(\delta \) \({\mathcal {A}}\)QCD [35]. Each of these fits is performed by excluding among the data points those with \(Q^2 < Q^2_\mathrm{min}\), where \(Q^2_\mathrm{min}=0.268\) or \(0.66 \ \mathrm{GeV}^2\) (and sometimes also: \(Q^2_\mathrm{min}=0.47 \ \mathrm{GeV}^2\)). As mentioned earlier, the four points at high \(Q^2 \ge 3 \ \mathrm{GeV}^2\) are excluded from the fit as well.

It is reasonable to assume that the number of active quark flavors in the considered interval \(0.054 \ \mathrm{GeV}^2< Q^2 < 5 \ \mathrm{GeV}^2\) is \(N_f=3\), cf. Appendix E. All the fits will be performed in \(\overline{\mathrm{MS}}\) pQCD approach, in the Fractional Analytic Perturbation Theory [(F)APT], and in 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD approach, cf. Appendix C where also the values of the parameters of these three QCD variants are presented. In \(\overline{\mathrm{MS}}\) pQCD and in 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD the (underlying) pQCD running coupling \(a(Q^2)\) is determined by the requirement \(\pi a(M_Z^2;\overline{\mathrm{MS}}) = 0.1185\) [39, 61], and in (F)APT we use \({\overline{\Lambda }}_3=0.45\) GeV (at the end of Sect. 3.2 we also comment on the case \({\overline{\Lambda }}_3=0.40\) GeV); we refer to Appendix C for more details.

Table 1 The values of the extracted fit parameters \(k = \mu ^2/Q^2\) and \(f_2^{p-n}(Q^2_\mathrm{in})\) (\(Q^2_\mathrm{in} = 1 \ \mathrm{GeV}^2\)), obtained by fitting the OPE expression (10), truncated at \(D=2\) (\(\mu _6=0\)), for the considered four QCD variants, and for the minimal \(Q^2\) values of the fit being 0.66, 0.47 and \(0.268~ \mathrm{GeV}^2\). The uncertainties of the extracted \(f_2^{p-n}\) values are separated to those coming from the statistical (small) and systematic (large) uncertainties of the experimental data (cf. Appendix D). The resulting fit quality parameters \(\chi ^2/\mathrm{d.o.f.}\) are given as well (see the text for explanation)

3.1 Fits with \(\mu _6=0\)

In Fig. 1a, b, we present the curves in the mentioned four QCD variants, for \(Q^2_\mathrm{min}=0.66\) and \(0.268 \ \mathrm{GeV}^2\), respectively. The corresponding results and four fit quality parameters \(\chi ^2/\mathrm{d.o.f}\) are given in Table 1, for \(Q^2_\mathrm{min} = 0.66\), 0.47 and \(0.268 \ \mathrm{GeV}^2\). As mentioned earlier, the fit is performed for the data in the momentum interval \(Q^2_\mathrm{min} \le Q_j^2 \le 3 \ \mathrm{GeV}^2\) (with the COMPASS data point [7] excluded), and the corresponding quality parameter for that interval is denoted as \(\chi ^2/\mathrm{d.o.f}\). In addition, \(\chi ^2_\mathrm{ext}/\mathrm{d.o.f.}\) is the quality parameter for the wider interval \(Q^2_\mathrm{min} \le Q_j^2 < 5 \ \mathrm{GeV}^2\); \(\chi ^2_{0.268}/\mathrm{d.o.f.}\) is the parameter when \(0.268 \ \mathrm{GeV}^2 \le Q_j^2 \le 3 \ \mathrm{GeV}^2\); and \(\chi ^2_\mathrm{all}/\mathrm{d.o.f.}\) is the parameter when all the experimental points [3,4,5, 7, 8] are included, \(0.054 \ \mathrm{GeV}^2 \le Q_j^2 < 5 \ \mathrm{GeV}^2\). In these quantities \(\chi ^2\), the central values of k and \(f_2^{p-n}(Q^2_\mathrm{in})\) (\(Q^2_\mathrm{in} = 1 \ \mathrm{GeV}^2\)) parameters obtained from the mentioned fit interval \(Q^2_\mathrm{min} \le Q_j^2 \le 3 \ \mathrm{GeV}^2\) are used; we take here “d.o.f” as \((N-p)\) where N is the number of fitted data points entering the considered \(\chi ^2\) and p is the number of parameters of the fit (\(p=2\) here)

$$\begin{aligned}&\chi ^2(Q_{K+1}^2 \le Q^2 \le Q_{K+N}^2)/\mathrm{d.o.f.} \nonumber \\&\quad =\frac{1}{(N\!-\!p)}\!\sum _{j=K+1}^{K+N} \frac{1}{\sigma ^2_{j,\mathrm{stat}}} \left( \Gamma _1^{p-n,OPE[4]}(Q_j^2; k, f_2^{p-n}(1); \mu _6=0)\right. \nonumber \\&\qquad \left. - \Gamma _1^{p-n}(Q_j^2)_\mathrm{exp} \right) ^2. \end{aligned}$$
(15)

The values of these quantities are always large, in the best cases between 1 and 10. This is so because the statistical uncertainties of the newer JLAB data [5] are very small, \(\sigma _{j,\mathrm{stat}} \lesssim 10^{-3}\), and simultaneously, our fit function (truncated OPE) is different from the ideal function which we do not know. Nonetheless, they decrease when analytic QCD variants are employed, especially 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD.

We wish to point out that the approach of (\(\overline{\mathrm{MS}}\)) QCD in the case of \(Q_\mathrm{min}^2=0.268 \ \mathrm{GeV}^2\) is, in principle, not applicable. This is so because the corresponding coupling \(a(Q^2)\) has a Landau branching point at \(Q^2_\mathrm{branch}=0.371 \ \mathrm{GeV}^2\), which makes the \(D=2\) running coefficient \(f_2^{p-n}(Q^2)\), Eq. (8), undefined at \(Q^2 \le 0.371 \ \mathrm{GeV}^2\). Nonetheless, in order to be able to present a curve, we applied in the fitting case \(Q_\mathrm{min}^2=0.268 \ \mathrm{GeV}^2\) in the \(\overline{\mathrm{MS}}\) pQCD approach a restriction on the leading-twist renormalization scale \(\mu ^2 = k Q^2\), namely \(k > 1.383\); and this not just in the leading-twist contribution, but we also made an ad hoc replacement in the \(D=2\) running coefficient \(f_2^{p-n}(Q^2)\), Eq. (8): \(f_2^{p-n}(Q^2) \mapsto f_2^{p-n}(k Q^2)\). We applied this also in the case when \(\mu _6 \ne 0\) (Sect. 3.2, for \(\overline{\mathrm{MS}}\) pQCD approach with \(Q_\mathrm{min}^2=0.268 \ \mathrm{GeV}^2\)). In other approaches (APT and \({\mathcal {A}}\)QCD’s) this is not necessary, as there are no Landau singularities. Further, we note that in the last column in Table 1 we have \(\chi ^2/\mathrm{d.o.f.} = \infty \) in the case of \(\overline{\mathrm{MS}}\) pQCD for \(Q^2_\mathrm{min}=0.66\) and \(0.47 \ \mathrm{GeV}^2\). This is so because at the lowest available experimental point, \(Q_{j=1}^2=0.054 \ \mathrm{GeV}^2\), the coupling is \(a(k Q_1^2; \overline{\mathrm{MS}}) = \infty \) because \(k Q_1^2 < Q^2_\mathrm{branch}=0.371 \ \mathrm{GeV}^2\) and thus we hit Landau singularities there.

We can deduce from Fig. 1 and Table 1: (a) the best results in the considered approach (\(\mu _6=0\)) are obtained in 3\(\delta \) \({\mathcal {A}}\)QCD; (b) the quality of extrapolation of the obtained fitted curves from the fitting interval, \(Q^2_\mathrm{min} \le Q^2 \le 3 \ \mathrm{GeV}^2\), to the entire interval, \(0.054 \ \mathrm{GeV}^2 \le Q^2 < 5 \ \mathrm{GeV}^2\), does not improve significantly when the fitted interval is extended (i.e., when \(Q^2_\mathrm{min}\) is lowered), cf. also the last column in Table 1. This indicates that the behavior of the curves in the extrapolated regions, \(0.054 \ \mathrm{GeV}^2< Q^2 < 0.268 \ \mathrm{GeV}^2\) and \(3 \ \mathrm{GeV}^2 < Q^2 \le 5 \ \mathrm{GeV}^2\), is of similar quality in the cases of different values of \(Q^2_\mathrm{min}(\mathrm{fit})\), and gives the dominant part of \(\chi ^2_\mathrm{all}/\mathrm{d.o.f.}\). The same can be observed for the values of \(f_2^{p-n}(1)\): they do not depend much on the value of \(Q^2_\mathrm{min}(\mathrm{fit})\), but only on the QCD variant used in the fit. This appears to be related with the fact that only one parameter beyond the leading-twist contribution is used here (\(\mu _4 \leftrightarrow f_2^{p-n}\)), representing a truncated OPE ansatz with a significantly restricted freedom.

3.2 Fits with \(\mu _6 \ne 0\)

When we include \(\mu _6\) in the fit as the third parameter, the resulting curves are presented in Fig. 2a, b, for \(Q^2_\mathrm{min}=0.66\) and \(0.47 \ \mathrm{GeV}^2\), respectively. Further, when \(Q^2_\mathrm{min}= 0.268 \ \mathrm{GeV}^2\), the results are shown in Fig. 3a, b, at the higher \(Q^2\) and the lower \(Q^2 < 1 \ \mathrm{GeV}^2\) momenta, respectively. The corresponding results are given in Table 2, for \(Q^2_\mathrm{min} = 0.66\), 0.47 and \(0.268 \ \mathrm{GeV}^2\), with the same notations as in Table 1. The various versions of \(\chi ^2/\mathrm{d.o.f.}\) are those as explained in the previous Sect. 3.1, except that now in the relation (15) the factor in front of the sum is \(1/(N-3)\) (\(p=3\), d.o.f. is \(N-3\)).

Fig. 2
figure 2

Fits of the OPE expression (10) truncated at \(D=4\) term (i.e., \(\mu _4 \ne 0\)), to the experimental data for \(\Gamma _1^{p-n}(Q^2)_\mathrm{inel.}\), done in four different QCD variants, where in the fits: a left plot, \(Q^2 \ge 0.66~\mathrm{GeV}^2\); b right plot, \(Q^2 \ge 0.47~\mathrm{GeV}^2\). See the text for details. The respective lower bound of the fitting interval, \(Q^2_\mathrm{min}=0.66\) and \(0.47~\mathrm{GeV}^2\), is included as the thin dotted vertical line

Fig. 3
figure 3

As Fig. 2, but for \(Q^2_\mathrm{min} = 0.268~\mathrm{GeV}^2\): a for larger \(Q^2\); b for \(Q^2 < 1~\mathrm{GeV}^2\). The lower bound of the fitting interval, \(Q^2_\mathrm{min}=0.268~\mathrm{GeV}^2\), is included as the thin dotted vertical line

Table 2 The values of the extracted fit parameters \(k = \mu ^2/Q^2\), \(f_2^{p-n}(1 \ \mathrm{GeV}^2)\) and \(\mu _6\) (in \(\mathrm{GeV}^4\)), obtained by fitting the OPE expression (10), truncated at \(D=4\) (\(\mu _6 \ne 0\)). The notations are as in Table 1. See the text in Sect. 3.1 for explanation of the various \(\chi ^2/\mathrm{d.o.f.}\)’s

Comparing Table 2 and Figs. 2 and 3 with Table 1 and the Fig. 1 of the previous Sect. 3.1 where \(\mu _6=0\) was kept, we can see that in the cases of \(Q^2(\mathrm{fit}) \ge 0.66 \ \mathrm{GeV}^2\) and \(Q^2(\mathrm{fit}) \ge 0.47 \ \mathrm{GeV}^2\) the \(\mu _6=0\) fits give in general better \(\chi ^2_\mathrm{all}/\mathrm{d.o.f.}\) (the last columns of Tables 1 and 2). This means that the extrapolation down to the lowest experimental point \(Q^2=Q_{j=1}^2=0.054 \ \mathrm{GeV^2}\) is better when \(\mu _6=0\) (with the exception of \(\overline{\mathrm{MS}}\) pQCD case where problems with Landau singularities appear). This occurs because the inclusion of the \(\mu _6/(Q^2)^2\) term in the truncated OPE makes this expression less stable at very low \(Q^2\).

On the other hand, when \(Q^2(\mathrm{fit}) \ge 0.268 \ \mathrm{GeV}^2\), some of the fits (2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD) give better extrapolation when \(\mu _6 \ne 0\). Table 2 and Fig. 3 also show that when \(\mu _6 \ne 0\) and \(Q^2(\mathrm{fit}) \ge 0.268 \ \mathrm{GeV}^2\), the best extrapolation to low \(Q^2\) is obtained in 3\(\delta \) \({\mathcal {A}}\)QCD, followed by 2\(\delta \) \({\mathcal {A}}\)QCD. As Fig. 3b suggests, the fitted curve in pQCD \(\overline{\mathrm{MS}}\) extrapolated to low \(Q^2\) appears to be almost as good; in this case, however, we should keep in mind that the renormalization scale is \(\mu ^2 = k Q^2\) (\(k =3.00\)) and that this scale was used also in \(f^{p-n}_2(Q^2)\), i.e., the ad hoc replacement \(f^{p-n}_2(Q^2) \mapsto f^{p-n}_2(k Q^2)\) was performed in order to avoid the Landau singularities in the \(D=2\) term at \(Q^2 \ge 0.268 \ \mathrm{GeV}^2\) (cf. also the discussion about that point in Sect. 3.1). Despite this replacement, in \(\chi ^2_\mathrm{all}/\mathrm{d.o.f.}\) Landau singularities are hit, because the three lowest experimental points give \(k Q_{j}^2 < Q^2_\mathrm{branch}=0.371 \ \mathrm{GeV}^2\) (\(j=1,2,3\)) and are thus in the Landau singularity region.Footnote 7

In the case of (F)APT, in contrast to 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD, the coupling \({\mathcal {A}}(Q^2)\) differs from the underlying pQCD coupling \(a(Q^2)\) nonnegligibly at high \(|Q^2| > 1 \ \mathrm{GeV}^2\) [the index in Eq. (B1) is \(N=1\) in (F)APT; \(N=5\) in 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD]. This implies that (F)APT has a certain ambiguity when “normalizing” the strength of \({\mathcal {A}}(Q^2)\). As mentioned, we fixed the strength of the coupling \({\mathcal {A}}(Q^2)\) in (F)APT to the value \({\overline{\Lambda }}_{N_f=3}=0.45\) GeV, since in such a case (F)APT reproduces approximately the QCD phenomenology at high energies (cf. also [17, 18]). A question appears whether the results of our fits in (F)APT depend significantly on this value. We repeated the analysis in (F)APT with the value \({\overline{\Lambda }}_{N_f=3}=0.40\) GeV, and it turned out that the results of the fits did not change significantly. For example, when \(Q^2_\mathrm{min}=0.268 \ \mathrm{GeV}^2\) and \(\mu _6=0\), we obtained \(k=2.86\) and the central value \(f_2^{p-n}(1.)= -0.137\), and for the three quality parameters \(\chi ^2_\mathrm{ext}/\mathrm{d.o.f.}\), \(\chi ^2_{0.268}/\mathrm{d.o.f.}\) and \(\chi ^2_\mathrm{all}/\mathrm{d.o.f}\) the values 120., 54.7 and 878., respectively (to be compared with the corresponding values in Table 1, the third line from below). When \(\mu _6 \ne 0\) in the fit, we obtained \(k=2.62\), \(f_2^{p-n}(1.)=-0.187\), \(\mu _6=0.013 \ \mathrm{GeV}^4\) and for the mentioned three quality parameters the values 42.3, 7.94 and \(4.43 \times 10^3\), respectively (to be compared with the corresponding values in Table 2, the third line from below).

When we add the parametrized elastic contribution of BSR, Eqs. (11)–(13), to the experimental points and to the theoretical curves of Fig. 3, we obtain the results presented in Fig. 4a, b. In comparison with Fig. 3, the values of BSR are shifted to significantly higher values at low \(Q^2\). With this approach, the quality of fits (\(\chi ^2/\mathrm{d.o.f}\)) does not change, as we consider the elastic contribution as known (and parametrized) and added here simultaneously to the (inelastic) theoretical fitting curves and to the data points. This means that the results of Table 2 remain unchanged under this subsequent addition of \(\Gamma _1^{p-n}(Q^2)_\mathrm{el.}\).

Fig. 4
figure 4

As Fig. 3, but shifted upwards by the parametrized elastic contribution (11) [cf. also Eqs. (12)–(13)]

We can observe in the results of Tables 1 and 2 that the values of the higher-twist parameters, \(f_2^{p-n}(1)\) and \(\mu _6\), are in the analytic variants of QCD smaller than in pQCD, this reduction being especially strong in the 3\(\delta \) QCD variant. It has been noted in the literature that in pQCD OPE there is a duality between the order of truncation of the leading-twist series and the higher-twist contribution [28,29,30,31, 50,51,52,53,54,55,56,57]: higher-twist contribution often significantly decreases with the inclusion of higher orders in the leading-twist part. This effect and ambiguity become stronger in the ranges where the perturbation theory becomes questionable (for example, at the large and low values of the Bjorken variable x, as it was shown in Refs. [62, 63], respectively). It has been observed that the higher-twist contribution is smaller, but also more stable (under the inclusion of more terms in the leading-twist), in QCD variants with infrared modifications of the coupling (various modifications lead to quite similar results [64, 65]). The latter probably incorporate a part of the higher-twist contributions (which are rather cumbersome [66]) into (formally) the leading-twist contribution for small x range at moderately small \(Q^2\) values (\(\lesssim 1 \ \mathrm{GeV}^2\)) (see Ref. [67] and more recent studies [68,69,70] of the precise combined H1 and ZEUS data [71] for the DIS structure function \(F_2\)).

Following the above observations in Tables 1 and 2, we can conclude that the applications of the 2\(\delta \) and especially 3\(\delta \) \({\mathcal {A}}\)QCD are very appropriate frameworks for the BSR studies because they appear to resum effectively a large part of the perturbative contribution into the leading-twist part (5). In this context, we wish to recall that 3\(\delta \) \({\mathcal {A}}\)QCD is significantly different from the other two \({\mathcal {A}}\)QCD variants [(F)APT and 2\(\delta \) \({\mathcal {A}}\)QCD] in the infrared region, because its coupling is not just finite there but goes to zero, \({\mathcal {A}}(Q^2) \sim Q^2 \rightarrow 0\), as motivated by large-volume lattice calculations, cf. [72,73,74,75,76] and Appendix C.2. Further, we wish to point out that the 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD couplings \({\mathcal {A}}\) (and thus \({\widetilde{\mathcal {A}}}_n\) and \({\mathcal {A}}_n\)) are at large \(Q^2\) indistinguishable from their underlying pQCD couplings a (\({\widetilde{a}}_n\), \(a^n\)), cf. Eq. (C7), in contrast to (F)APT which satisfies the relation Eq. (B1) with \(N=1\). Therefore, theoretically, neither the higher-twist contribution of order \(1/(Q^2)^N\) with \(N \le 4\) (\(D \le 8\)), nor a part of it, is incorporated in the leading-twist contribution (5) in the 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD, in contrast to (F)APT. This indicates that the higher-twist terms extracted here with 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD (in truncated OPE) represent an effective form for the true higher-twist contribution with dimension \(D \le 8\) [and a part of the other (\(D \ge 10\)) presumably small contribution]. In pQCD this is definitely not so, because of the mentioned duality there between the order of truncation of the leading-twist series and the extracted higher-twist contribution. As a consequence, the extracted effective higher-twist contribution in pQCD represents a sum of the true higher-twist contribution and a significant part of the perturbative (leading-twist) contribution; this effective higher-twist contribution appears to be in general larger than the true higher-twist contribution.

Fig. 5
figure 5

As Fig. 3, but for the fit the OPE expression (16) with “massive” higher-twist term was used

Table 3 As in Table 2, but now the OPE form is “massive”, Eq. (16). The fits were made only for \(Q^2_\mathrm{min}(\mathrm{fit})=0.268 \ \mathrm{GeV}^2\). The extracted values of the squared mass \(M^2\) are in \(\mathrm{GeV}^2\)

3.3 Fits with “massive” OPE

For comparison, we performed a similar fit, but now with a “massive” higher-twist term instead of the truncated OPE expression (10)

$$\begin{aligned}&\Gamma _1^{p-n,\mathrm{mOPE[4]}}(Q^2; k, f_2^{p-n}(1); M^2) \nonumber \\&\quad = {\Big |}\frac{g_A}{g_V} {\Big |} \frac{1}{6}(1 - \mathcal{D}_\mathrm{BS}(Q^2)) \nonumber \\&\qquad +\, \frac{M_N^2}{(Q^2 + M^2)} \frac{1}{9} \left( a_2^{p-n} + 4 d_2^{p-n} + 4 f_2^{p-n}(Q^2) \right) ,\nonumber \\ \end{aligned}$$
(16)

where the squared mass \(M^2\) in the denominator of the higher-twist partFootnote 8 is taken to be constant (not running), and is expected to be \(0 < M^2 \lesssim 1 \ \mathrm{GeV}^2\). Now, instead of \(f^{p-n}_2(1)\) and \(\mu _6\), the fit parameters are \(f_2^{p-n}(1)\) and \(M^2\). The resulting curves, for \(Q^2_\mathrm{min}=0.268 \ \mathrm{GeV}^2\), are given in Fig. 5a, b, at the higher \(Q^2\) and the lower \(Q^2 < 1 \ \mathrm{GeV}^2\) momenta, respectively. The corresponding results are given in Table 3. These curves are analogous to those in the previous Fig. 3a, b in which the truncated OPE (10) was used. Numerically, the behavior at low \(Q^2\) in the “massive” case is significantly influenced by the \(Q^2\)-dependence of \(f_2^{p-n}(Q^2)\). In the \(\overline{\mathrm{MS}}\) pQCD case, as in the \(\overline{\mathrm{MS}}\) pQCD cases of the analyses in all the Sections, we replaced in the higher-twist running parameter \(f_2^{p-n}(Q^2)\) [cf. Eq. (8)] the scale \(Q^2\) in an ad hoc way by the renormalization scale \(k Q^2\) used in the leading-twist part (\(k=16\) resulted here), in order to artificially avoid the problem of Landau singularities in the pQCD coupling \(a(Q^2)\). Comparing Fig. 5 and Table 3 with the corresponding “nonmassive” case Fig. 3 and Table 2, we see that the results and extrapolations in the case of 3\(\delta \) \({\mathcal {A}}\)QCD are now comparably good in the “massive” and the \( \mu _4 \& \mu _6\) approaches. Stated differently, the corresponding \(\chi ^2_{ \{... \}}/\mathrm{d.o.f.}\) values in Tables 2 and 3 (with \(Q^2_\mathrm{fit}=0.268 \ \mathrm{GeV}^2\)) are very similar. In the case of 2\(\delta \) \({\mathcal {A}}\)QCD and (F)APT, the extrapolations are better in the “massive” than in the \( \mu _4 \& \mu _6\) approach, i.e., \(\chi ^2_\mathrm{all}/\mathrm{d.o.f.}\) is significantly reduced in the “massive” case. One reason for this lies perhaps in the fact that the massive higher-twist term is under control at very low \(Q^2\), unlike the separate \(\mu _4(Q^2)/Q^2\) and \(\mu _6/(Q^2)^2\) terms. Further, the extracted values of \(f_2^{p-n}(1)\) are in general similar in the \(\mu _6=0\), \( \mu _4 \& \mu _6\) and the “massive” approaches, although the uncertainties of the extracted parameters are quite high in the “massive” approach.Footnote 9

The results of Table 3 show that the QCD variants with infrared-finite analytic coupling, and especially 3\(\delta \) \({\mathcal {A}}\)QCD, give smaller values of higher-twist parameters \(f_2^{p-n}(1)\) and \(M^2\) than pQCD. On the one hand, at low \(Q^2\), the smaller values of \(M^2\) compensate partially the decreased value of \(f_2^{p-n}\) in the higher-twist contribution. On the other hand, smaller values of \(M^2\) and \(f_2^{p-n}(1)\) mean that at higher values of \(Q^2\) the higher-twist contribution is significantly reduced; this can be seen also by expanding the massive higher-twist term of Eq. (16) in powers of \(M^2/Q^2\). Such effect is in full agreement with one observed in Refs. [28,29,30,31]; the effect can be considered as a stabilization of the higher-twist contribution. We also recall that 3\(\delta \) \({\mathcal {A}}\)QCD is significantly different from (F)APT and 2\(\delta \) \({\mathcal {A}}\)QCD in the infrared region, since its coupling goes to zero there, \({\mathcal {A}}(Q^2) \sim Q^2 \rightarrow 0\).

Table 4 As in Table 3, but now the squared mass in the higher-twist term in Eq. (16) is running according to Eq. (17)
Fig. 6
figure 6

As Fig. 3, but now with the low-\(Q^2\) expressions (18) and (19) included. b is a zoomed-in version of a, for \(Q^2 < 1 \ \mathrm{GeV}^2\). The lower bound of the fitting interval, \(Q^2_\mathrm{min}=0.268~\mathrm{GeV}^2\), is included as the thin dotted vertical line

It is possible to choose for the higher-twist term a massive form with a running mass, in the spirit of a dynamical effective gluon mass of the gluon propagator at low \(Q^2\) [82]. Such masses appear in the literature often in definitions of QCD couplings at low \(Q^2\), and are responsible for the freezing (finiteness) of the coupling at \(Q^2 \rightarrow 0\). The couplings in (F)APT and 2\(\delta \) \({\mathcal {A}}\)QCD have a freezing which could be described also via a running effective gluon mass. Our view is that the OPE higher-twist terms represent a new contribution not contained in the QCD coupling itself. The squared mass \(M^2\) in such terms, Eq. (16), is considered constant, in the spirit of the approach of Ref. [77] (cf. also [78, 79]), where the basic component (delta function) in the spectral function of the higher-twist contribution gives such a mass term. Nonetheless, we repeated the aforementioned analysis for the case of a running squared mass \(M^2(Q^2)\) representative of a dynamical effective gluon mass, chosen with a simple parametrization of Ref. [83]

$$\begin{aligned} M^2(Q^2) = \frac{m_0^2}{1 + (Q^2/\mathcal{M})^{1+p}} \quad (\mathcal{M}=0.5 \ \mathrm{GeV}, p=0.1), \nonumber \\ \end{aligned}$$
(17)

where we chose for the parameters \(\mathcal{M}\) and p values within the expected regions [83]. The adjustable squared mass scale was taken (instead of \(m_0^2\)) to be \(M^2(1 \ \mathrm{GeV}^2)\). The same analysis then gave the results presented in Table 4. We can see that the results are qualitatively similar to those with the constant squared mass \(M^2\), Table 3, except that the values of \(M^2(1 \ \mathrm{GeV}^2)\) are now lower. Nonetheless, a reasonable definition of the average squared mass value in the present analysis, for the considered fit interval \(0.268 \ \mathrm{GeV}^2< Q^2 < 3 \ \mathrm{GeV}^2\) may be \(\langle M^2(Q^2) \rangle = (1/2) \times ( M^2(0.268) + M^2(3) ) \approx 1.52 \times M^2(1 \ \mathrm{GeV}^2)\). Or, if we regard that the squared mass is most relevant only in the low-\(Q^2\) part \(0.268 \ \mathrm{GeV}^2< Q^2 < 1 \ \mathrm{GeV}^2\) of the fit interval, a reasonable definition of the average squared mass would be \(\langle M^2(Q^2) \rangle = (1/2) \times ( M^2(0.268) + M^2(1) ) \approx 1.85 \times M^2(1 \ \mathrm{GeV}^2)\).

3.4 Testing low-\(Q^2\) regime ansätze

At low \(Q^2\), the inelastic contribution to BSR behaves as \(\sim Q^2\), according to Gerasimov–Drell–Hearn sum rule [84, 85] as pointed out and used in [28,29,30,31, 86,87,88,89]. Based on this, an expansion [4] motivated by chiral perturbation theory (\(\chi \)PT) can be constructed

$$\begin{aligned} \Gamma _1^{p-n}(Q^2)_\mathrm{inel.}= & {} \frac{\chi _n^2 - \chi _p^2}{8 M_N^2} Q^2 + A (Q^2)^2+ B (Q^2)^3 \nonumber \\&\quad (Q^2 \lesssim 0.5 \ \mathrm{GeV}^2), \end{aligned}$$
(18)

where, according to Gerasimov–Drell–Hearn sum rule [84, 85], \(\chi _n\) and \(\chi _p\) are anomalous magnetic moments of nucleons [which appear also in the elastic BSR contributions, cf. Eq. (13)]; the parameters A and B are determined in the fit. When we fit with this expression the inelastic BSR data [3,4,5] for \(Q^2 \le Q^2_\mathrm{max}=0.2 \ \mathrm{GeV}^2\), we obtain \(A=0.765 \ \mathrm{GeV}^{-2}\) and \(B= 0.678 \ \mathrm{GeV}^{-4}\), with \(\chi ^2(Q^2 \le Q^2_\mathrm{max})/\mathrm{d.o.f.} = 0.720\). On the other hand, if we take \(Q^2_\mathrm{max}=0.5 \ \mathrm{GeV}^2\) in the fitting, we obtain \(A=0.744 \ \mathrm{GeV}^{-2}\) and \(B= -1.033 \ \mathrm{GeV}^{-4}\), \(\chi ^2(Q^2 \le Q^2_\mathrm{max})/\mathrm{d.o.f.} = 1.313\).

Another possible ansatz for the inelastic contribution to BSR at low \(Q^2\) is the form of the light-front holographic (LFH) effective charge \({\mathcal {A}}^\mathrm{(LFH)}\) in the BSR (\(g_1\)) scheme [90, 91] [\({\mathcal {A}}(0)_{g_1}=1\)]

$$\begin{aligned}&\Gamma _1^{p-n}(Q^2)_\mathrm{inel.}\nonumber \\&\quad = \left| \frac{g_A}{g_V} \right| \frac{1}{6} \left[ 1 - {\mathcal {A}}^\mathrm{(LFH)}(Q^2) \right] \nonumber \\&\quad = \left| \frac{g_A}{g_V} \right| \frac{1}{6} \left[ 1 - \exp \left( - \frac{Q^2}{2 \kappa ^2} \right) \right] ,\quad (Q^2 \lesssim 1 \mathrm{GeV}^2). \end{aligned}$$
(19)

When we fit with this expression the inelastic BSR data [3,4,5] for \(Q^2 < Q^2_\mathrm{max}=0.5 \ \mathrm{GeV}^2\), we obtain \(\kappa =0.479\) GeV.Footnote 10

Fig. 7
figure 7

As Fig. 3, but now the truncated (at \(D=4\)) OPE expression (10) is fitted to the data for the total contribution \(\Gamma _1^{p-n}(Q_j^2)_\mathrm{inel.+el.}\). The lower bound of the fitting interval, \(Q^2_\mathrm{min}=0.268~\mathrm{GeV}^2\), is included as the thin dotted vertical line

In Fig. 6a, b, we present these low-\(Q^2\) expressions. The \(\overline{\mathrm{MS}}\) pQCD and 3\(\delta \) \({\mathcal {A}}\)QCD curves (obtained from fit with \(Q^2 \ge Q^2_\mathrm{min} = 0.268 \ \mathrm{GeV}^2\)) are also included for comparison. In Fig. 6b we can see that the theoretical fitted curves of the presented QCD variants connect smoothly with the nonperturbative low-\(Q^2\) curves (\(\chi \)PT and LFH, both fitted to BSR up to \(Q^2=0.5 \ \mathrm{GeV}^2\)) at the values of \(Q^2\) around \(0.3 \ \mathrm{GeV}^2\). Nonetheless, we recall that the apparent success of the (\(\overline{\mathrm{MS}}\)) pQCD curve, down to about \(0.3 \ \mathrm{GeV}^2\), was achieved due to the ad hoc change of scale in \(f^{p-n}_2(Q^2)\) to \(k Q^2\) (with \(k=3\)), to avoid the Landau singularities in the \(D=2\) term at \(Q^2 \ge 0.268 \ \mathrm{GeV}^2\) (cf. also the discussion about that point in Sect. 3.2). We further notice that the 3\(\delta \) QCD curve agrees well with the mentioned \(\chi \)PT curve in a broader interval, \(0.17 \ \mathrm{GeV}^2< Q^2 < 0.3 \ \mathrm{GeV}^2\). Similar analyses with the goal of connecting the curves of pQCD (or of specific QCD variants) with nonperturbative curves at low \(Q^2\) were performed in some of the references [28,29,30,31], in [92], and was discussed also in [93].

In Refs. [94, 95], BSR at low \(Q^2 < 0.3 \ \mathrm{GeV}^2\) was calculated with baryon chiral perturbation theory (B\(\chi \)PT) up to NLO. The authors of [94, 95] did not give the values of their parameters for BSR. However, careful visual comparison of their obtained \(Q^2\)-dependence [cf. curves and bands in their Fig. 6(b) in Ref. [95]] with our \(\chi \)PT curve (18) with \(Q^2_\mathrm{fit} \le 0.5 \ \mathrm{GeV}^2\) [the long-dashed curve in our Fig. 6b] shows very good agreement between them.

Table 5 As in Table 2, but now the fit is performed on the sum which includes the elastic contribution, i.e., on \(\Gamma _1^{p-n}(\mathrm{inel.+el.})\)

3.5 Fitting to total (inelastic \(+\) elastic) BSR data

We perform also the fitting of the truncated (at \(D=4\)) OPE expression (10) to the data for the total BSR \(\Gamma _1^{p-n}(Q^2)_\mathrm{inel.+el.}\). As argued at the end of Sect. 2.3 [after Eq. (14)], such a fit has problematic aspects. These data are obtained by adding to the experimental data \(\Gamma _1^{p-n}(Q_j^2)_\mathrm{inel.}\) of Refs. [3,4,5] the parametrized elastic contribution Eqs. (11)–(13) of Ref. [60]. The uncertainties of the parametrized elastic form factors are considered to be less than \(10 \%\) [60]. Since the elastic contribution (11) is quadratic in the form factors (and is numerically dominated by the proton contribution), the relative uncertainties of the elastic contribution are less than \(5 \%\). Since we do not have more information about these uncertainties, we will neglect them in this analysis. This means that in the total BSR, we will consider that the statistical and the systematic uncertainties \(\sigma _{j, \mathrm{stat}}\) and \(\sigma _{j, \mathrm{sys}}\) are those of the inelastic contribution. Otherwise, the fitting is performed as in the previous Sect. 3.2.

The resulting curves are presented in Fig. 7a, b, for \(Q^2 \ge Q^2_\mathrm{min}\) with \(Q^2_\mathrm{min} = 0.268 \ \mathrm{GeV}^2\). The obtained results are given in Table 5, for fits with various \(Q^2_\mathrm{min} = 0.66\), 047 and \(0.268 \ \mathrm{GeV}^2\). Comparing the obtained results in Fig. 7 and Table 5 with the corresponding results in Fig. 4 and Table 2 (where the elastic part was not included in the fit procedure), we see that the inclusion of the elastic contribution in the fit procedure significantly deteriorates (increases) the values of the various fit quality parameters \(\chi ^2\), especially when the fit is performed in the larger interval \((Q^2_\mathrm{min}=) 0.268 \ \mathrm{GeV}^2 \le Q^2 < 3 \mathrm{GeV}^2\). In particular, the differences in quality are clearly visible when comparing Fig. 7b with Fig. 4b at low \(Q^2\). Only when \(Q^2_\mathrm{min}\) is relatively high, \(Q^2_\mathrm{min}=0.66 \ \mathrm{GeV}^2\), are some of the \(\chi ^2\) parameters comparable in the two cases (but not the extrapolation quality parameters \(\chi ^2_{0.268}\) and \(\chi ^2_\mathrm{all}\)). Further, the inclusion of the elastic contribution in the fit in general does not change significantly the extracted (mostly negative) values of \(f_2^{p-n}(1 \mathrm{GeV}^2)\), but increases significantly the (positive) values of \(\mu _6\).

For the fit analysis with the “massive” truncated OPE, Eq. (16), the results are presented in Table 6 for \(Q^2_\mathrm{min}=0.268 \ \mathrm{GeV}^2\). In general, the results are worse than in the corresponding “massive” OPE case fitted to the inelastic contribution (cf. Table 3 in Sect. 3.2). Namely, the extracted values of the parameter \(M^2\) become negative (\(M^2 \sim -0.1 \ \mathrm{GeV}^2\)) and the fit quality values \(\chi ^2/\mathrm{d.o.f.}\) are in general significantly higher than those in Table 3. The uncertainties of the extracted values of \(f_2^{p-n}(1)\) and \(M^2\) become reduced (in comparison to the case when the elastic part is not included in the fit); this is so because the elastic parts increase significantly BSR \(\Gamma _1^{p-n}(Q^2)\) at low \(Q^2\) where they are represented mostly by the “massive” higher-twist term, and hence the relative uncertainties of BSR at low \(Q^2\) become smaller.

In addition to the massive case with constant squared mass \(M^2\), we include also the case of \(Q^2\)-dependent squared mass Eq. (17) in the analysis with the elastic contribution included in the fit. The results for this case are presented in Table 7. These results are qualitatively similar to those with constant squared mass, Table 6; and the comparison of Table 7 with its counterpart Table 4 without the elastic contribution is similar to the above comparison when the squared mass is constant. We recall that an average mass in the \(Q^2\)-independent case can be regarded to be \(\langle M^2(Q^2) \rangle = (1/2) \times ( M^2(0.268) + M^2(1) ) \approx 1.85 \times M^2(1 \ \mathrm{GeV}^2)\).

Table 6 As in Table 3, i.e., the OPE form is “massive”, Eq. (16), but now the elastic part of BSR is included in the fit
Table 7 As in Table 6, but with \(Q^2\)-dependent mass Eq. (17) in the higher-twist part of Eq. (16)

4 Summary

Experimental results for the polarized Bjorken sum rule (BSR) \(\Gamma _1^{p-n}(Q^2)\) were fitted, for various ranges of \(Q^2\), with OPE theoretical expressions using QCD couplings obtained in four different approaches: perturbative QCD (pQCD) in \(\overline{\mathrm{MS}}\) scheme; (Fractional) Analytic Perturbation Theory [(F)APT]; Two-delta \({\mathcal {A}}\)QCD (2\(\delta \)); and Three-delta lattice-motivated \({\mathcal {A}}\)QCD (3\(\delta \)). The QCD running coupling \({\mathcal {A}}(Q^2)\) in the latter three QCD variants does not have Landau singularities, in contrast to the pQCD coupling \(a(Q^2)\) [\(\equiv \alpha _s(Q^2)/\pi \)].

In the fit of the inelastic experimental BSR results, up to two higher-twist terms [\(\sim 1/Q^2, 1/(Q^2)^2\)] were added to the theoretical leading-twist contribution. The elastic contributions, which are \(\sim 1/(Q^2)^n\) with typically \(n \ge 4\) at \(Q^2 > 1 \ \mathrm{GeV}^2\), were then added by using the parametrization obtained from the literature. The fits were performed for the ranges \(Q^2_\mathrm{min} \le Q^2 \le 3 \ \mathrm{GeV}^2\), where \(Q^2_\mathrm{min} = 0.66\), 0.47 and \(0.268 \ \mathrm{GeV}^2\). In general, the best curves were obtained when 2\(\delta \) or 3\(\delta \)-couplings were used. When only \(D=2\) (\(\sim 1/Q^2\)) higher-twist term was included in the fit, the quality of the fitted curves, in the range of the fit and in the extrapolated ranges of \(Q^2\), in general did not depend significantly on \(Q^2_\mathrm{min}\) of the fit. On the other hand, when both \(D=2\) and \(D=4\) terms were included, the quality in the extrapolated ranges of \(Q^2\) was in general better for the lowest \(Q^2_\mathrm{min}\) value (\(0.268 \ \mathrm{GeV}^2\)), i.e., when the \(Q^2\)-range of the fit was the largest. Comparably good results were obtained when “massive” higher-twist term was used in the OPE and the QCD coupling was either from (F)APT or 2\(\delta \) or 3\(\delta \) \({\mathcal {A}}\)QCD.

When the range of fit had \(Q^2_\mathrm{min}=0.268 \ \mathrm{GeV}^2\), the pQCD \(\overline{\mathrm{MS}}\) coupling approach worked and gave acceptable results only if the renormalization scale of the coupling was maintained everywhere at sufficiently high values, and the coefficient \(f_2^{p-n}(Q^2)\) [\(\sim a(Q^2)^{\gamma _0/8 \beta _0}\)] at the \(D=2\) term had an (ad hoc) increased scale \(Q^2 \mapsto k Q^2\), in order to avoid the problem of the Landau singularities.

When the fit procedure was performed by fitting the theoretical OPE, truncated at \(D=4\) (\(\sim 1/(Q^2)^2\)) terms, to the sum of (experimental) inelastic and (parametrized) elastic BSR, the quality of the results turned out to be significantly worse in all the cases of the theoretical curves, something expected by the arguments presented at the end of Sect. 2.3. Namely, the elastic contribution is dominated by terms which behave at high values of \(Q^2\) as \(\sim 1/(Q^2)^{(D/2)}\) where usually \(D \ge 8\), and these terms are not contained in the theoretical expressions for BSR which are usually OPE series truncated at \(1/(Q^2)^2\).

The results of this work can be interpreted as an additional indication of the following important property: the evaluation of the (truncated) leading-twist contribution of spacelike low-\(Q^2\) QCD observables such as inelastic BSR, in QCD variants 2\(\delta \) and in particular 3\(\delta \) \({\mathcal {A}}\)QCD [both have infrared finite and holomorphic coupling \({\mathcal {A}}(Q^2)\)], appear to resum effectively a large part of the perturbative contribution of the observables, and leads to reduced extracted values of the higher-twist terms (\(D=2, 4\)) in the truncated OPE. This property was noted earlier, for different observables, in Refs. [34, 35, 96]. In this context, it appears to be important that in 2\(\delta \) and 3\(\delta \) \({\mathcal {A}}\)QCD the coupling practically merges with the underlying pQCD coupling \(a(Q^2)\) at higher values of \(Q^2 \gg \Lambda ^2_\mathrm{QCD}\). This property is not shared by the (F)APT holomorphic coupling where the leading-twist series contains parts of the higher-twist contribution of as low dimensionality as \(D=2\). The extracted parameters in the higher-twist contribution, including those in the “massive” OPE, are especially reduced in 3\(\delta \) \({\mathcal {A}}\)QCD. This suggests the possibility that the true higher-twist contribution is small, including the (sum of) terms of high dimension; and that the (truncated) OPE with 3\(\delta \) \({\mathcal {A}}\)QCD leading-twist gives, through fitting, an extracted value which is a good approximation to this true value of the higher-twist contribution. Numerically, the significantly reduced extracted value in (truncated) OPE with 3\(\delta \) \({\mathcal {A}}\)QCD is probably partly related with the fact that 3\(\delta \) \({\mathcal {A}}\)QCD differs from both 2\(\delta \) and (F)APT \({\mathcal {A}}\)QCD variants in that its coupling goes to zero in the deep infrared regime, \({\mathcal {A}}^{(3 \delta )}(Q^2) \sim Q^2 \rightarrow 0\). The latter property, we recall, is suggested by the large-volume lattice calculations of the dressing functions of the Landau-gauge gluon and ghost propagators at low \(Q^2\) values.