1 Introduction

Measurements using hadronic final states in \(e^{+}e^{-}\) annihilation have provided detailed experimental tests of Quantum Chromodynamics (QCD), the theory of the strong interaction in the Standard Model. These measurements were based on comparisons of moments and differential distributions of event shapes or jet rates to perturbative predictions. As new data are not foreseen in the near future, the progress in such measurements depends wholly on improvements in the theoretical (and phenomenological) description of these observables. Moreover, in multiple QCD analyses of the LEP data in the past, it was shown that the experimental uncertainties play a relatively small role in comparison to the theory-related uncertainties.

This situation raises some important questions. First of all, would the increasingly precise perturbative QCD (pQCD) calculations amended with resummation techniques be able to improve the precision of the results in QCD studies without any new data? And if not, what would be the limiting factors for the precision of QCD studies in the future and what should be done to eliminate them? To answer these questions we perform a QCD analysis with state-of-the-art pQCD calculations extended with estimations of higher-order corrections not known at present.

Fully differential pQCD calculations for the production of three partonic jets in \(e^{+}e^{-}\) hadronic annihilation are available to \({{{\mathcal {O}}}}(\alpha _{S}^{3})\) accuracy [1,2,3,4,5,6], which corresponds to next-to-next-to-leading order (NNLO) in QCD perturbation theory for this process. Four- and five-jet production [7,8,9,10,11,12], as well as the total cross section [13] are also known including terms at \({{{\mathcal {O}}}}(\alpha _{S}^{3})\),Footnote 1 therefore it is possible to make predictions for any infrared-safe observable at this level of accuracy. Although higher-order corrections are presently not known, it is in principle possible to estimate such corrections from data and therefore to obtain “predictions” at \({{{\mathcal {O}}}}(\alpha _{S}^{4})\). This approach is obviously limited to cases of observables for which only a small number of coefficients of the perturbative expansion should be estimated, such as event shape moments. In this paper we present an implementation of this approach with the aim of assessing the impact of these terms on possible future extractions of the strong coupling with exact predictions at \({{{\mathcal {O}}}}(\alpha _{S}^{4})\).

When confronting calculations based on QCD perturbation theory (of any order) with data, it must be kept in mind that although in \(e^+e^-\) annihilation strong interactions occur only in the final state, nevertheless, the observed quantities are affected by hadronization and power corrections. These corrections must either be extracted from Monte Carlo predictions or computed using analytic models. Below, we consider both of these approaches for describing non-perturbative effects and perform simultaneous fits of \(\alpha _{S}(M_Z)\) and the \({{{\mathcal {O}}}}(\alpha _{S}^{4})\) perturbative coefficients to event shape moments (together with model parameters for analytic hadronization models) for thrustFootnote 2 [16, 17] and the C-parameter [18, 19].

Anticipating some of our findings, we observe a clear discrepancy between results obtained with fits using Monte Carlo and analytic hadronization models, even after the inclusion of higher orders in pQCD and extending the analytic models to \({{{\mathcal {O}}}}(\alpha _{S}^{4})\). This implies that efforts to significantly improve the overall precision of \(\alpha _{S}\) extractions in the future must address not only the computation of higher-order pQCD corrections, but also the refined modeling of non-perturbative effects.

We expect that the presented analysis will provide valuable input for the planing of \(\alpha _{S}(M_Z)\) measurements and data taking at future \(e^{+}e^{-}\) facilities.

2 Theory predictions

The n-th moment of an event shape variable O is defined by

$$\begin{aligned} \langle O^{n} \rangle = \frac{1}{\sigma _{\mathrm {tot}}} \int _{O_{\mathrm {min}}}^{O_{\mathrm {max}}} O^n \frac{\mathrm {d}\sigma (O)}{\mathrm {d}O}\mathrm {d}O, \end{aligned}$$

where \(\sigma _{\mathrm {tot}}\) stands for the total hadronic cross section and \([O_{\mathrm {min}},O_{\mathrm {max}}]\) is the kinematically allowed range of the observable O.

The fixed-order prediction for the n-th moment of O at a reference renormalization scale \(\mu = \mu _0\), normalized to the LO cross section \(\sigma _0\) for \(e^+e^- \rightarrow \text{ hadrons }\) reads:

$$\begin{aligned} \begin{aligned} \frac{1}{\sigma _0} \int _{O_{\mathrm {min}}}^{O_{\mathrm {max}}} O^n \frac{\mathrm {d}\sigma (O)}{\mathrm {d}O} \mathrm {d}O&=\frac{\alpha _{S}(\mu _0)}{2\pi }A^{{\langle O^{n} \rangle }}_0 +\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^2 B^{{\langle O^{n} \rangle }}_0 \\ +\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^3 C^{{\langle O^{n} \rangle }}_0&+\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^4 D^{{\langle O^{n} \rangle }}_0 +{{{\mathcal {O}}}}(\alpha _{S}^5). \end{aligned} \end{aligned}$$

Throughout the paper we employ the \(\overline{\mathrm {MS}}\) renormalization scheme and \(\alpha _{S}\) (without a superscript) always denotes the strong coupling in this scheme. The coefficients \(A^{\langle O^{n} \rangle }_0\), \(B^{\langle O^{n} \rangle }_0\) and \(C^{\langle O^{n} \rangle }_0\) for moments of standard event shapes have been known for some time [2, 4]. In this paper, we use the CoLoRFulNNLO [6, 20, 21] approach to recompute these coefficients with high numerical precision, see Table 1.

Table 1 LO, NLO and NNLO contributions to the moments of event shapes. For the details on the analytic calculation see Appendix B

This allows us to extend the extraction of the strong coupling constant from these observables to \(\hbox {N}^{3}\)LO with a simultaneous extraction of the perturbative coefficients \(D^{\langle O^{n} \rangle }_0\) from the data.

However, the experimentally measured event shape moments are normalized to the total hadronic cross section, and the perturbative expansion of \(\langle O^{n} \rangle \) is given by

$$\begin{aligned} \begin{aligned} \langle O^{n} \rangle&=\frac{\alpha _{S}(\mu _0)}{2\pi }{\bar{A}}^{\langle O^{n} \rangle }_0 +\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^2 {\bar{B}}^{\langle O^{n} \rangle }_0\\&\quad +\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^3 {\bar{C}}^{\langle O^{n} \rangle }_0 +\left( \frac{\alpha _{S}(\mu _0)}{2\pi }\right) ^4 {\bar{D}}^{\langle O^{n} \rangle }_0 +{{{\mathcal {O}}}}(\alpha _{S}^5). \end{aligned} \end{aligned}$$

The relations between \({\bar{A}}^{\langle O^{n} \rangle }_0\), \({\bar{B}}^{\langle O^{n} \rangle }_0\), \({\bar{C}}^{\langle O^{n} \rangle }_0\), \({\bar{D}}^{\langle O^{n} \rangle }_0\) and \(A^{\langle O^{n} \rangle }_0\), \(B^{\langle O^{n} \rangle }_0\), \(C^{\langle O^{n} \rangle }_0\), \(D^{\langle O^{n} \rangle }_0\) are straightforward to obtain using

$$\begin{aligned} \begin{aligned} \frac{\sigma _0}{\sigma _{\mathrm {tot}}}&= 1 - \frac{\alpha _{S}}{2\pi } A_{\mathrm {tot}} + \left( \frac{\alpha _{S}}{2\pi }\right) ^2 \left( A_{\mathrm {tot}}^2 - B_{\mathrm {tot}}\right) \\ {}&\quad - \left( \frac{\alpha _{S}}{2\pi }\right) ^3 \left( A_{\mathrm {tot}}^3 - 2A_{\mathrm {tot}}B_{\mathrm {tot}} + C_{\mathrm {tot}}\right) + {\mathcal {O}}(\alpha _{S}^4), \end{aligned} \end{aligned}$$

and we find

$$\begin{aligned} {\bar{A}}^{\langle O^{n} \rangle }_0&= A^{\langle O^{n} \rangle }_0, \\ {\bar{B}}^{\langle O^{n} \rangle }_0&= B^{\langle O^{n} \rangle }_0 - A_{\mathrm {tot}} A^{\langle O^{n} \rangle }_0, \\ {\bar{C}}^{\langle O^{n} \rangle }_0&= C^{\langle O^{n} \rangle }_0 - A_{\mathrm {tot}} B^{\langle O^{n} \rangle }_0 + \left( A_{\mathrm {tot}}^2 - B_{\mathrm {tot}}\right) A^{\langle O^{n} \rangle }_0, \\ {\bar{D}}^{\langle O^{n} \rangle }_0&= D^{\langle O^{n} \rangle }_0 - A_{\mathrm {tot}} C^{\langle O^{n} \rangle }_0 + \left( A_{\mathrm {tot}}^2 - B_{\mathrm {tot}}\right) B^{\langle O^{n} \rangle }_0 \\&- \left( A_{\mathrm {tot}}^3 - 2A_{\mathrm {tot}}B_{\mathrm {tot}} + C_{\mathrm {tot}}\right) A^{\langle O^{n} \rangle }_0. \end{aligned}$$
Table 2 Available measurements and data used in the analysis. Here we use \(\tau \equiv (1-T)\)

The coefficients \(A_{\mathrm {tot}}\), \(B_{\mathrm {tot}}\) and \(C_{\mathrm {tot}}\) are listed in Appendix A.

Finally, to perform a simultaneous fit of multiple data points at different center-of-mass energies, we use the four-loop running of \(\alpha _{S}(\mu )\)

$$\begin{aligned} \mu ^2\frac{d}{d \mu ^2} \frac{\alpha _S(\mu )}{4\pi } = - \left( \frac{\alpha _S(\mu )}{4\pi }\right) ^2 \sum _{n} \beta _n \left( \frac{\alpha _S(\mu )}{4\pi }\right) ^n, \end{aligned}$$
(1)

with \(\beta _0 = (11 C_A - 2 N_F)/3\), \(\beta _1 = (34 C_A^2 - 10 C_A N_F - 6 C_F N_F)/3\) and \(\beta _2 = (2857 C_A^3 - 1415 C_A^2 N_F - 615 C_A C_F N_F + 54 C_F^2 N_F + 79 C_A N_F^2 +\) \(+ 66 C_F N_F^2)/54\). We are using the customary normalization of \(T_R = 1/2\) for the color charge operators, thus in QCD we have \(C_A = N_c = 3\) and \(C_F = (N_c^2-1)/(2N_c) = 4/3\), while \(N_F\) denotes the number of light quark flavors. The corresponding dependence of the perturbative coefficients \({\bar{A}}^{\langle O^{n} \rangle }\), \({\bar{B}}^{\langle O^{n} \rangle }\), \({\bar{C}}^{\langle O^{n} \rangle }\) and \({\bar{D}}^{(n)}\) on scale read:

$$\begin{aligned} {\bar{A}}^{\langle O^{n} \rangle }= & {} {\bar{A}}^{\langle O^{n} \rangle }_0, \nonumber \\ {\bar{B}}^{\langle O^{n} \rangle }= & {} {\bar{B}}_0^{\langle O^{n} \rangle } + \frac{1}{2} {\bar{A}}^{\langle O^{n} \rangle }_0 \beta _0 L, \nonumber \\ {\bar{C}}^{\langle O^{n} \rangle }= & {} {\bar{C}}_0^{\langle O^{n} \rangle } + {\bar{B}}^{\langle O^{n} \rangle }_0 \beta _0 L + \frac{1}{4} {\bar{A}}^{\langle O^{n} \rangle }_0(\beta _1 + \beta _0^2 L)L, \nonumber \\ {\bar{D}}^{\langle O^{n} \rangle }= & {} {\bar{D}}_0^{\langle O^{n} \rangle } + \frac{3}{2} {\bar{C}}_0^{\langle O^{n} \rangle }\beta _{0}L + \frac{1}{2} {\bar{B}}_0^{\langle O^{n} \rangle }\left( \beta _{1}+\frac{3}{2} \beta _{0}^2L\right) L \nonumber \\&+ \frac{1}{8} {\bar{A}}_0^{\langle O^{n} \rangle }\left( \beta _{2}+\frac{5}{2}\beta _{1}\beta _{0}L+\beta _{0}^3L^2\right) L, \end{aligned}$$
(2)

where we have \(L=\ln (\mu ^{2}/\mu _{0}^{2})\).

We take into account the effect of non-vanishing b-quark mass on the predictions for the \(A^{\langle O^{n} \rangle }\) and \(B^{\langle O^{n} \rangle }\) coefficient by subtracting the fraction of b-quark events, \(r_b(Q)\), from the massless result and adding back the corresponding massive prediction obtained with the Zbb4 [22] program,

$$\begin{aligned} A^{\langle O^{n} \rangle }&= A^{\langle O^{n} \rangle }_{m_b=0}(1-r_b(Q)) + r_b(Q) A^{\langle O^{n} \rangle }_{m_b\ne 0}, \\ B^{\langle O^{n} \rangle }&= B^{\langle O^{n} \rangle }_{m_b=0}(1-r_b(Q)) + r_b(Q) B^{\langle O^{n} \rangle }_{m_b\ne 0}. \end{aligned}$$

3 Data sets

For the performed analysis, one minus thrust \(\tau \equiv (1-T)\) and the C-parameter (C) were selected. The selection of these particular observables is motivated by the abundance of available measurements. More specifically, in this analysis we considered data sets from the ALEPH, AMY, DELPHI, HRS, JADE, L3, MARK, MARKII, OPAL and TASSO experiments, see Table 2 for details.

As the theory predictions for all the measured event shape moments were calculated in the CoLoRFulNNLO  framework and the hadronization corrections can be obtained consistently for all the event shape moments, the most complete analysis of the data might include simultaneously all the measured moments.

However, as most of the moments of the event shapes are quite strongly correlated [23], a simultaneous analysis of all available data would require taking these correlations into account. Unfortunately, very few data sets contain this crucial information and therefore, we have limited our analysis only to the first moments, i.e. averages of the event shapes. From the two available sets of measurements with the same data in the range \(\sqrt{s}=133{-}183{\,\text {Ge}\text {V}}\), available from Ref. [24] and Ref. [25] the measurements from Ref. [25] were used in the analysis.

4 Modeling of non-perturbative corrections

As discussed in the Introduction, the modeling of non-perturbative corrections is essential in order to perform a meaningful comparison of theoretical predictions with data. One option for obtaining the hadronization corrections is to extract them from Monte Carlo simulations. Recent examples of this approach include the studies of the energy-energy correlation [36] and the two-jet rate [37].

Some previous extractions of the strong coupling from event shape moments [23, 38] and event shape distributions [39] have used an analytic hadronization model based on the dispersive approach to power corrections [40,41,42]. An important ingredient of this model is the relation between the strong coupling defined in the \(\overline{\mathrm {MS}}\) scheme and the effective soft coupling \(\alpha _{S}^{CMW}\) in the Catani–Marchesini–Webber (CMW) scheme. As the extension of \(\alpha _{S}^{CMW}\) beyond NLL accuracy is believed not to be unique [43, 44], the coefficients in this relation are “scheme-dependent”. However, in one particular proposal, the relation between \(\alpha _{S}\) and \(\alpha _{S}^{CMW}\) has recently been computed up to \({\mathcal {O}}(\alpha _{S}^{4})\) accuracy [44], which allows us to implement a consistent analytic model of hadronization corrections at this order in the perturbative expansion.

Below, we pursue both options and use Monte Carlo tools as well as the analytic approach to estimate the non-perturbative corrections.

4.1 Monte Carlo hadronization models

In this work we use the Monte Carlo event generation setups similar to those in previous comparable studies [36]. We made use of the Herwig7.2.0 [45] and Sherpa2.2.8 [46] Monte Carlo event generators (MCEGs) with similar setups for perturbative calculations, but different hadronization models. The QCD matrix elements for the \(e^+e^- \rightarrow Z/\gamma \rightarrow 2,3,4,5\) parton processes were generated using MadGraph5 [47] and the OpenLoops [48] one-loop library. The 2-parton final state process was computed at NLO accuracy in perturbative QCD. The generated events were hadronized using the Lund hadronization model [49] or the cluster hadronization model [50]. In the following, the setup labelled as \(H^L\) denotes predictions computed with Herwig7.2.0 employing the Lund hadronization model [49], \(H^C\) denotes Herwig7.2.0 predictions obtained with the cluster hadronization model [50], and finally \(S^C\) denotes results obtained using Sherpa2.2.8 with the cluster hadronization model [51].

For the study, predictions of event shape moments were calculated from MC generated events at hadron and parton levels (\( \langle O^{n} \rangle _{\mathrm {MC\ hadrons}}\) and (\( \langle O^{n} \rangle _{\mathrm {MC\ partons}}\)). To take into account that the presence of a shower cut-off \(Q_0\approx {{{\mathcal {O}}}}(1{\,\text {Ge}\text {V}})\) in Monte Carlo programs affects the event shape distributions (e.g. see Refs. [52, 53]) both parton and hadron level MC predictions were calculated with several different values of the parton shower cut-offs and extrapolated to \(Q_0\rightarrow 0{\,\text {Ge}\text {V}}\).

Fig. 1
figure 1

Data and predictions by MCEGs extrapolated to \(Q_0=0{\,\text {Ge}\text {V}}\). The NNLO result was computed using \(\alpha _{S}(M_Z) = 0.118\)

Figure 1 shows the final results obtained with the various MCEG setups after extrapolation to \(Q_0 = 0{\,\text {Ge}\text {V}}\), together with the experimental measurements. The hadron and parton level MC predictions seen in Fig. 1 provide reasonable descriptions of the data as well as the NNLO perturbative results for a wide range of center-of-mass energies. However, the MC predictions at lowest \(\sqrt{s}\) show non-physical behavior, i.e. \(\langle O^{n} \rangle \) increases with \(\sqrt{s}\) for the parton level results. In order to analyze only the data that can be adequately described by the Monte Carlo modeling, we exclude measurements with \(\sqrt{s}<29{\,\text {Ge}\text {V}}\) from the analysis. In practice this criterion is much weaker than a requirement that MC matches the data well or that the subleading power corrections to the analytic hadronization models are small. However, this choice, in addition to retaining as much of the data as is reasonable, serves to highlight the discrepancies between the Monte Carlo and analytic hadronization models in regions where hadronization effects are most pronounced, i.e. at low energies.

Finally, the correction of theory predictions for hadronization was implemented in the analysis as follows,

$$\begin{aligned} \langle O^{n} \rangle _{\mathrm {corrected}} = \langle O^{n} \rangle _{\mathrm {theory}}\times \frac{ \langle O^{n} \rangle _{\mathrm {MC\ hadrons},\,Q_0=0{\,\text {Ge}\text {V}}}}{ \langle O^{n} \rangle _{\mathrm {MC\ partons},\,Q_0=0{\,\text {Ge}\text {V}}}}. \end{aligned}$$

The hadronization correction factors for different center-of-mass energies, observables and MC setups are shown in Fig. 2.

Fig. 2
figure 2

Hadronization corrections extracted from MC generated samples after extrapolation to \(Q_0=0{\,\text {Ge}\text {V}}\) and the hadronization corrections from the \({{{\mathcal {A}}}}_{0}\) scheme calculated as ratios of the hadron and parton level predictions using \(\alpha _S(M_Z)=0.118\), \(\alpha _0(\mu _I)=0.5\) and \({\mathcal {M}}=1.49\). The hadronization corrections from the \({{{\mathcal {A}}}}_{T}\)- and cusp-schemes are not shown, but these very closely follow the hadronization corrections from the \({{{\mathcal {A}}}}_{0}\)-scheme

4.2 Analytic hadronization models

The dispersive model of analytic hadronization corrections for event shapes in \(e^{+}e^{-}\) annihilation has been worked out in detail in Refs. [40,41,42]. In this model, hadronization corrections simply shift the perturbative event shape distributions,

$$\begin{aligned} \frac{\mathrm {d}\sigma _{\mathrm {hadrons}}(O)}{\mathrm {d}O} = \frac{\mathrm {d}\sigma _{\mathrm {partons}}(O - a_O {{\mathcal {P}}})}{\mathrm {d}O}, \end{aligned}$$
(3)

where the power correction \({{\mathcal {P}}}\) is universal for all event shapes, while the \(a_O\) are specific, known constants, e.g. \(a_{1-T}=2\) and \(a_{C}=3\pi \) for \(1-T\) and the C-parameter. Inserting Eq. (3) into the definition of the moments, one obtains the non-perturbative predictions for event shape moments. In particular, the effect of hadronization corrections on the average is additive,

$$\begin{aligned} \langle O^1 \rangle _{\mathrm {hadrons}} = \langle O^1 \rangle _{\mathrm {partons}} + a_O \mathcal {P}, \end{aligned}$$

where \(\langle O^1 \rangle _{\mathrm {partons}}\) is the value obtained in fixed-order perturbation theory as described in Sect. 2. Some deviations from this simple model were discussed very recently in Ref. [54].

Finally, we must compute the power correction \({{\mathcal {P}}}\) at \({\mathcal {O}}(\alpha _{S}^4)\) accuracy. The perturbative ingredients of this calculation are the running of the strong coupling in the \(\overline{\mathrm {MS}}\) scheme and the relation between the coupling defined in the \(\overline{\mathrm {MS}}\) and the CMW schemes. This relation takes the following generic form

$$\begin{aligned} \alpha _{S}^{CMW}&= \alpha _{S} \left[ 1 + \frac{\alpha _{S}}{2\pi } K \right. + \left( \frac{\alpha _{S}}{2\pi }\right) ^{2} L \nonumber \\&\quad \left. + \left( \frac{\alpha _{S}}{2\pi }\right) ^{3} M + {\mathcal {O}}(\alpha _{S}^4)\right] . \end{aligned}$$
(4)

The value of the K coefficient has been known to coincide with the one-loop cusp anomalous dimension for a long time and hence it may be tempting to assume that the cusp anomalous dimension provides a sensible definition of the CMW coupling also beyond \({\mathcal {O}}(\alpha _{S}^2)\). However, this assumption turns out to be incorrectFootnote 3 and as mentioned above, it is believed that there is no unique extension of \(\alpha _{S}^{CMW}\) beyond NLL accuracy. Nevertheless, recently several proposals have been made for the definition of the effective soft coupling in the literature [43, 44]. In particular, Ref. [44] introduces the effective soft-gluon coupling \({{\mathcal {A}}}_{i}^{CMW}\) as

$$\begin{aligned} {{\mathcal {A}}}_{i}^{CMW}(\alpha _{S}) = C_i \frac{\alpha _{S}^{CMW}}{\pi } = C_i \frac{\alpha _{S}}{\pi }\left( 1 + \frac{\alpha _{S}}{2\pi } K + \cdots \right) , \end{aligned}$$

(here i denotes the type of radiating parton, so \(C_q = C_F\) and \(C_g = C_A\)) and proposes two different prescriptions for defining this coupling beyond NLL accuracy, denoted by \({\mathcal {A}}_{0,i}\) and \({\mathcal {A}}_{T,i}\).Footnote 4 We will refer to these cases as the “\({\mathcal {A}}_{0}\)-scheme” and the “\({\mathcal {A}}_{T}\)-scheme” below. We note that the complete expression for the M coefficient is currently not known in the \({\mathcal {A}}_{T}\)-scheme, hence in our analysis we approximate this coefficient with its value in the \({\mathcal {A}}_{0}\)-scheme and set \(M_T = M_0\). In order to facilitate the comparison of our results with previous work [38], we also define the “cusp-scheme”, in which we simply set the K, L and M coefficients of Eq. (4) equal to the appropriate cusp anomalous dimension. In the following, we will denote the results obtained in the \({\mathcal {A}}_{0}\)-scheme by \(A^0\), in the \({\mathcal {A}}_{T}\)-scheme by \(A^T\) and in the cusp-scheme by \(A^{\mathrm {cusp}}\). The explicit expressions for the K, L and M coefficients in all three schemes are presented in Appendix C.

Finally, the power correction takes the following form up to N\({}^3\)LO,

$$\begin{aligned} {{\mathcal {P}}}(\alpha _{S},Q,\alpha _{0})&=\frac{4C_F}{\pi ^2}\, {\mathcal {M}} \times \frac{\mu _I}{Q}\times \,\bigg \lbrace \alpha _0(\mu _I)-\bigg [ \alpha _{S}(\mu _R) \nonumber \\&\quad +\left( K + \beta _0 \left( 1 + \ln \frac{\mu _R}{\mu _I}\right) \right) \frac{\alpha _{S}^2(\mu _R)}{2 \pi } \nonumber \\&\quad + \left( 2 L + \left( 4 \beta _0 \left( \beta _0 + K\right) + \beta _1\right) \left( 1 + \ln \frac{\mu _R}{\mu _I}\right) \right. \nonumber \\&\quad \left. + 2 \beta _0^2 \ln ^2\frac{\mu _R}{\mu _I}\right) \frac{\alpha _{S}^3(\mu _R)}{8 \pi ^2} \nonumber \\&\quad + \left( 4 M + \big (2 \beta _0 \left( 12 \beta _0 (\beta _0 + K) + 5 \beta _1\right) + \beta _2 \right. \nonumber \\&\quad \left. + 4 \beta _1 K + 12 \beta _0 L \big ) \left( 1 + \ln \frac{\mu _R}{\mu _I}\right) \right. \nonumber \\&\quad + \beta _0 (12 \beta _0 (\beta _0 + K) + 5 \beta _1) \ln ^2\frac{\mu _R}{\mu _I} \nonumber \\&\quad \left. +4 \beta _0^3 \ln ^3\frac{\mu _R}{\mu _I} \right) \frac{\alpha _{S}^4(\mu _R)}{32 \pi ^3} \bigg ]\bigg \rbrace , \end{aligned}$$
(5)

where \(\mu _I\) is the scale at which the perturbative and non-perturbative couplings are matched in the dispersive model and \({\mathcal {M}}\) is the so-called Milan factor [42] with an estimated value of \({\mathcal {M}}_\mathrm{est.}\pm \delta {\mathcal {M}}_\mathrm{est.}=1.49\pm 0.30\) [38, 42]. \(\alpha _0(\mu _I)\) corresponds to the first moment of the effective coupling below the scale \(\mu _I\)

$$\begin{aligned} \alpha _0(\mu _I) = \frac{1}{\mu _I}\int _0^{\mu _I} {\mathrm d}\mu \, \alpha ^{CMW}(\mu ), \end{aligned}$$

and it is a non-perturbative parameter of the model. Following the usual choice, we will set \(\mu _I = 2{\,\text {Ge}\text {V}}\). We note that the value of \(\alpha _0(\mu _I)\) is in principle scheme-dependent, i.e. it depends on the precise relation between the strong coupling in the \(\overline{\mathrm {MS}}\) and CMW schemes. In contrast, \(\alpha _{S}(\mu _R)\) always refers to the value of the strong coupling in the \(\overline{\mathrm {MS}}\) scheme, at scale \(\mu _R\).

Before moving on, let us make two comments regarding Eq. (5). First, it can be argued (see e.g. the discussion in Ref. [39]) that the L and M coefficients that appear in \({{\mathcal {P}}}(\alpha _{S},Q,\alpha _{0})\) do not coincide precisely with those in Eq. (4), but receive modifications due to the non-inclusive nature of the observables we are studying. As such, L and M in Eq. (5) might be different for different observables. Second, beyond NLO, there is some doubt that the non-inclusive corrections parametrized by the Milan factor \({\mathcal {M}}\) can still be captured by a simple overall multiplicative factor in \({{\mathcal {P}}}(\alpha _{S},Q,\alpha _{0})\). Nevertheless, here we take the pragmatic viewpoint that Eq. (5) provides a reasonable model for non-perturbative corrections. In particular, as in other analyses [38, 39], the Milan factor appears as a multiplicative constant. However, the applicability of this approach should be investigated in detail in future studies, either when new data or more precise estimations of the Milan factor become available.

5 Fit procedure and systematic uncertainties

The values of \(\alpha _{S}\) were determined in the optimization procedures using the MINUIT2 [55, 56] program and the minimized function

$$\begin{aligned} \chi ^2(\alpha _{S})=\sum ^\mathrm{all\ data\ sets}_{i}\chi ^2_{i}(\alpha _{S}), \end{aligned}$$
(6)

where for data set i we have \(\chi ^2_{i}(\alpha _{S})=(\mathbf {D}-\mathbf {P}(\alpha _{S}))V^{-1}(\mathbf {D}-\mathbf {P}(\alpha _{S}))^{T}\), with \(\mathbf {D}\) standing for the vector of data points, \(\mathbf {P}(\alpha _{S})\) for the vector of calculated predictions and V for the covariance matrix of \(\mathbf {D}\). In this analysis the covariance matrix V was diagonal with the values of the diagonal elements calculated by adding the statistical and systematic uncertainties in quadrature for every measurement.

For all the fits with MC hadronization models the central results for \(\alpha _{S}(M_Z)\) (as well as the \(D^{\langle O^n \rangle }\) coefficients in the N\({}^3\)LO fits) were extracted with the \(H^{L}\) setup. The uncertainty on the fit result was estimated using the \(\chi ^2+1\) criterion as implemented in MINUIT2 (exp.). The systematic effects related to the modeling of hadronization with MCEGs were estimated as the difference of results obtained with \(H^{L}\) and \(H^{C}\) setups (hadr.). To estimate the systematic effects related to the choice of renormalization scale, the latter was varied by a factor of two in both directions (scale). The scale variation at N\({}^3\)LO was performed with the perturbative coefficients \(D^{\langle O^n \rangle }\) fixed to their values obtained in the nominal fit. The uncertainty was estimated as half of the difference between the maximal and minimal obtained values among the three results.

When using the analytic hadronization model (with any of the three schemes discussed above), in addition to \(\alpha _{S}(M_Z)\) and \(D^{\langle O^n \rangle }\), the quantities \(\alpha _{0}(\mu _I)\) and \({\mathcal {M}}\) were also treated as fit parameters to be extracted from data. Although the value of the Milan factor is in principle fixed by a theoretical calculation, including it as a constrained parameter in the fit provides a way of taking into account its uncertainty. For the fits with a constrained Milan factor a term \(({\mathcal {M}}-{\mathcal {M}}_\mathrm{est.})^2/(\delta {\mathcal {M}}_\mathrm{est.})^2\) is added to Eq. (6).

As previously, the \(\chi ^2+1\) criterion was used to estimate the fit uncertainty (exp.), while the systematic effects of missing higher-order terms were estimated using the same renormalization scale variation procedure as for MC hadronization models (scale.). In particular, when varying the scale to estimate the related \(\alpha _{S}(M_Z)\) (\(\alpha _{0}(\mu _I)\)) uncertainties, the \(D^{\langle O^n \rangle }\) coefficients, \(\alpha _{0}(\mu _I)\) (\(\alpha _{S}(M_Z)\)) and \({\mathcal {M}}\) were fixed to the values obtained in the nominal fits.

6 Results and discussion

The results of the NNLO and N\({}^3\)LO fits are presented in Tables 3 and 4, while the predictions of the \(\hbox {N}^{3}\)LO fits for individual energy points are shown in Fig. 3.

Table 3 Results of the extraction analyses using the \(\langle (1-T)^1 \rangle \) observable
Table 4 Results of the extraction analyses using the \(\langle C^1 \rangle \) observable

The presented NNLO results for \(\alpha _{S}(M_Z)\) obtained with both MC and analytic hadronization models are in good agreement between the fits to \(\langle (1-T)^1 \rangle \) and \(\langle C^1 \rangle \), which can be viewed as a check of the internal consistency of the \(\alpha _{S}(M_{Z})\) extraction method at NNLO. However, similarly to previous studies [38] which used less data,Footnote 5 a large discrepancy between the results obtained with the MC hadronization model and the analytic hadronization models are seen.Footnote 6

Turning to \(\alpha _0(2{\,\text {Ge}\text {V}})\) still at NNLO, we recall that this parameter is scheme-dependent, so the fitted values in the three schemes should not be directly compared to each other. Nevertheless, we see that the choice of scheme has only a small numerical impact on the extracted values of \(\alpha _0(2{\,\text {Ge}\text {V}})\). The values of the Milan parameter \({\mathcal {M}}\), constrained in fits, are seen to be unaffected by the choice of scheme and agree with the theoretical prediction within the somewhat large fit uncertainty. Furthermore, the extracted values of both \(\alpha _0(2{\,\text {Ge}\text {V}})\) and \({\mathcal {M}}\) obtained form the \(\langle (1-T)^1 \rangle \) and \(\langle C^1 \rangle \) observables agree well with each other.

Turning to the N\({}^{3}\)LO results, we see that the overall picture is quite similar to the one at NNLO: the fits for \(\alpha _{S}(M_Z)\) are in good agreement between the two observables for both MC and analytic hadronization models. The extracted values of \(\alpha _0(2{\,\text {Ge}\text {V}})\) and \({\mathcal {M}}\) are also consistent between the determinations based on \(\langle (1-T)^1 \rangle \) and \(\langle C^1 \rangle \). However, for all of these quantities we find rather large uncertainties, primarily related to the insufficient amount and quality of data and the extraction method itself. Nevertheless, these uncertainties are not very much larger than those from some classical \(\alpha _{S}(M_{Z})\) extraction analyses in the past [57]. Moreover, the obtained values of both \(D^{\langle (1-T)^1 \rangle }\) and \(D^{\langle C^1 \rangle }\) are in reasonable agreement between fits using MC and analytic hadronization models. This demonstrates the viability of the extraction of the higher-order coefficients \(D^{\langle O^n \rangle }\), once a large amount of precise and consistent data will be available, e.g. from CEPC [58] or FCC-ee [59]. For this extraction the precise high-energy data would be especially valuable. Finally, in Fig. 4 we present the extracted values of \(\alpha _{S}(M_{Z})\) and \(\alpha _{0}(2{\,\text {Ge}\text {V}})\) at NNLO and N\({}^3\)LO accuracy in the \({\mathcal {A}}_0\)-scheme. The results at each perturbative order are quite consistent across the two observables and the fits for \(\langle (1-T)^1 \rangle \) and \(\langle C^1 \rangle \) have rather similar precision. However, the fits at N\({}^3\)LO clearly prefer larger values for both \(\alpha _{S}(M_{Z})\) and \(\alpha _{0}(2{\,\text {Ge}\text {V}})\).

Fig. 3
figure 3

Data and fits obtained with different hadronization models. All available data points from Table 2 are shown

Fig. 4
figure 4

The values of \(\alpha _{S}(M_{Z})\) and \(\alpha _{0}(2{\,\text {Ge}\text {V}})\) obtained from the NNLO and \(\hbox {N}^{3}\)LO fits with analytic hadronization model in the \({\mathcal {A}}^0\) scheme. The contours correspond to 1-, 2- and 3 standard deviations obtained in the fit. Systematic uncertainties are not included

At the same time, the discrepancy between results obtained with the MC hadronization model and the analytic hadronization model remains in place at N\({}^{3}\)LO accuracy. This suggests that the discrepancy pattern has a fundamental origin and would hold even in future analyses, regardless of the availability of the exact N\({}^{3}\)LO predictions. Consequently, the improvement of the hadronization modeling and a better understanding of hadronization itself is more important for increasing the precision of \(\alpha _{S}(M_{Z})\) extractions than the calculation of perturbative corrections beyond NNLO. In order to achieve this better understanding and improved modeling of hadronization, in the future it would be important to perform dedicated studies using observables strongly affected by hadronization, e.g. measurements of the hadronic final state in future \(e^{+}e^{-}\) experiments at \(\sqrt{s}\approx 20{-}50{\,\text {Ge}\text {V}}\) performed with radiative events or in dedicated collider runs.

7 Conclusions

The aim of the present analysis was to assess the factors that will determine the precision of QCD analyses of \(e^{+}e^{-}\) data after the recent and foreseen rapid developments of QCD calculation techniques and the appearance of even more precise theoretical predictions. To do this, we have performed an extraction of the strong coupling \(\alpha _{S}(M_{Z})\) from averages of event shapes \(\langle (1-T)^1 \rangle \) and \(\langle C^1 \rangle \) and found that the results obtained using NNLO predictions and analytic hadronization corrections based on the dispersive model are consistent with the last PDG average \(\alpha (M_Z)_\mathrm{PDG 2020} = 0.1179 \pm 0.0010\).

Furthermore, we considered a method for extracting \(\alpha _{S}(M_{Z})\) at N\({}^3\)LO precision in perturbative QCD, employing exact NNLO predictions and estimations of the N\({}^3\)LO corrections from the data. The method produced results which are compatible with the current world average within the somewhat large uncertainties, e.g.

$$\begin{aligned} \alpha _{S}(M_{Z})^{N^{3}LO+A^0}=0.12911\pm 0.00177 {\text{( } exp.)}\pm 0.0123 {\text{( } scale)} \end{aligned}$$

from the \(\langle (1-T)^1 \rangle \) data. The obtained precision can be increased with more high-quality data from future experiments. For the extraction, Monte Carlo and analytic hadronization models were used, the latter being extended to N\({}^3\)LO for the first time. The comparison of the results for these models suggests that extractions of \(\alpha _{S}(M_{Z})\) in future analyses will be strongly affected by the modeling of hadronization effects even when the exact higher-order corrections will be included. However, the improvements in the modeling of high-energy physics phenomena by MCEGs in the recent decades were closely tied to the experimental measurements performed at the LEP, HERA and LHC colliders and therefore had limited impact on the description of phenomena at lower energies. As a consequence, the advances in modeling of particle collisions at lower energies and understanding of hadronization can be expected only with the availability of new measurements in the corresponding energy ranges.