1 Introduction

Event shape observables characterize the distribution of final-state particles produced in high-energy particle interactions. Event shapes have been measured extensively in \(e^{+}e^{-}\) collisions [1,2,3,4,5,6,7,8,9,10,11,12] and in \(ep\) collisions [13,14,15,16,17]; such observables are calculable to high precision using perturbative Quantum Chromodynamics (pQCD) [18,19,20,21,22]. Event shapes are incisive probes of QCD, notably to constrain the strong coupling constant \(\alpha _{s}\) [9, 23,24,25,26,27,28,29]. The description of hadronic final states in Monte Carlo event generators has likewise benefitted substantially from event shape measurements [7, 18, 30,31,32,33].

Jets arise from energetic partons (quarks and gluons) produced in hard interactions. The partons are initially highly virtual, decaying in a partonic shower that is experimentally observable as a correlated spray of hadrons. Jets provide a laboratory for testing QCD [34]. However, the precision of jet measurements at hadron colliders is limited by the contribution of non-perturbative (NP) processes and by the presence of the underlying event, which consists of final-state particles that do not originate from the hard-scattering process that produced the jet being studied. This limitation is addressed by the application of jet grooming algorithms [35,36,37,38,39,40,41,42], which systematically removes particles likely to originate in NP processes and the underlying event, in a way that is theoretically and experimentally well-controlled [43].

In jet grooming algorithms, typically the Cambridge–Aachen [44] sequential recombination algorithm is applied, which combines jet constituents (particle four-vectors) with small angular distance into jets in the first recombination steps, and those with wide angular distance in later steps. The recombination sequence is then inspected in reverse, starting with the softest branch. For each step in this inspection the kinematics of its contributing branches are compared to a specified condition; for instance, the ratio \(z_{\textrm{g}} \) of the transverse momentum (\(p_\textrm{T}\)) of the softer branch to the summed \(p_\textrm{T}\) of both branches is required to satisfy \(z_{\textrm{g}} >z_{\textrm{cut}} \), for a given value of \(z_{\textrm{cut}} \). If this condition is not satisfied, the softer branch is discarded from the event (hence the term “grooming”) and the comparison continues to the next recombination step along the harder branch until the condition is satisfied. If no recombination step satisfies the condition, the event is discarded. The choice of condition differentiates jet grooming algorithms [43].

The technique of jet grooming has been applied extensively in proton-proton (\(pp \)) collisions at the large hadron collider and at the relativistic heavy ion collider, for instance to search for the decay of boosted heavy particles, discriminate jets initiated by quarks or gluons, or measure jet substructure [43, 45,46,47,48,49,50,51,52,53,54,55,56,57,58]. Jet grooming has also been used to study the modification of jets propagating in the Quark-Gluon Plasma generated in heavy-ion collisions [49, 59,60,61,62,63].

Jets have also been measured extensively in \(e^{+}e^{-}\) [9, 64,65,66,67,68,69] and lepton-proton deep inelastic scattering (DIS) [70,71,72,73,74,75,76,77,78,79,80,81,82,83,84,85,86,87,88]. The underlying event background in such collision systems is smaller than in \(pp \) collisions, enabling more accurate measurement of soft components of jets. In addition, the kinematic properties of the partonic scattering are known experimentally, providing more precise comparison to QCD calculations. To date, grooming has not been applied to data from \(ep\) DIS.

It was recently proposed to apply grooming algorithms not only to jet measurements at hadron colliders, but also to full events in ep DIS [89], leveraging the similarities between DIS at high virtualities and single jets. From a theoretical standpoint, groomed observables in \(ep\) DIS are free of non-global logarithms [22, 89,90,91] and have reduced hadronization effects relative to ungroomed observables [38, 90]. In addition, the magnitude of the non-perturbative component of such observables is controllable experimentally by varying the strength of the grooming parameter \(z_{\textrm{cut}} \)[38, 89, 90].

In this paper, the H1 Collaboration at HERA reports the first measurement of groomed event shapes in \(e^{-}p\) and \(e^{+}p\) neutral-current (NC) deep inelastic scattering (electrons and positrons are referred to generically as “electrons” in the following). The data were recorded during the years 2003 to 2007 with electron and proton beam energies of \(E_e=27.6\) GeV and \(E_p=920\) GeV, respectively, corresponding to \(\sqrt{s} =319\) GeV. The recorded dataset has integrated luminosity \({\mathcal {L}}_\textrm{int} =351~\textrm{pb}^{-1} \). The analysis is based on events with exchanged-boson virtuality \(Q^2 > 150\) GeV\(^2\) and inelasticity \( 0.2< y < 0.7\). We report measurements of the production cross section of groomed event 1-jettiness and groomed event invariant mass for several choices of grooming parameter \(z_{\textrm{cut}} \). Monte Carlo model calculations and analytic calculations based on Soft Collinear Effective Theory (SCET) [89] and NNLO+NLL\(^{\prime }\) pQCD [18] are compared to the measurements.

These data provide new, differential constraints on the detailed structure of DIS-induced final states, which are valuable for the tuning of MC event generators. Improvement of such event generators is important, for instance, for the physics program of the future electron-ion collider [92].

The paper is organized as follows: Sect. 2 describes the experimental apparatus and dataset; Sect. 3 presents theoretical calculations that are compared to the data; Sect. 4 outlines the analysis procedure and event selection; Sect. 5 presents the corrections applied to the data; Sect. 6 presents the uncertainties; Sect. 7 reports results of the analysis and the comparison of theoretical predictions to the measurements; and Sect. 8 provides a summary.

2 Experimental setup

The H1 experiment is a general purpose particle detector with full azimuthal coverage around the electron-proton interaction region [93, 93,94,95,96,97]. The detector is described in Refs. [97,98,99,100]. H1 employs a right-handed coordinate system in which the proton beam direction defines the positive z direction. The nominal ep interaction point is located at \(z=0\).

The liquid argon calorimeter (LAr) [101, 102], which provides a trigger for high-\(Q^2\) neutral current DIS, subtends polar angles of \(4^\circ<\theta < 154^\circ \) and provides an energy resolution of \(\sigma (E) /E \simeq (11\%/\sqrt{E/\textrm{GeV} }) \oplus 1\%\) for electrons and \(\sigma (E)/E \simeq (55\%/ \sqrt{E/\textrm{GeV} }) \oplus 3\%\) for charged pions. The central tracking system, comprising gaseous drift and proportional chambers and a silicon vertex detector, covers the polar range \(15^\circ<\theta <165^\circ \) and has transverse momentum resolution for charged particles \(\sigma (p_\textrm{T})/{p_\textrm{T}}\simeq 0.2\% \cdot p_\textrm{T}/\textrm{GeV} \oplus 1.5\%\). A lead-scintillating fiber calorimeter (SpaCal) [94, 103], consisting of both electromagnetic and hadronic sections, covers the backward direction (\(153^\circ< \theta < 177^\circ \)). The SpaCal has electromagnetic energy resolution of \(\sigma (E)/{E} \simeq (7.5\%/\sqrt{E/\textrm{GeV} })\oplus 2\%\).

Events are triggered by requiring a high-energy cluster in the electromagnetic portion of the LAr. The trigger and event selection used in the analysis closely follows Ref. [87]. The efficiency of the trigger is greater than \(99\%\) in the phase space of this analysis. Events are further selected online by requiring the scattered electron to have an energy greater than 11 GeV and to fall within the LAr fiducial volume.

In the offline analysis, an energy flow algorithm combines information from the tracking detectors and calorimeters to generate a set of four-vectors [104,105,106]. These four-vectors are used to define the scattered electron and the hadronic final state (HFS). Isolated energy deposits with high energy in the backward and central sections of the electromagnetic calorimeters are typically the result of QED radiation of a photon off the electron. If the energy deposit is closer to the scattered electron than the electron beam direction, the photon is likely to come from final-state radiation and its four-vector is recombined with the four-vector of the scattered electron. If the photon is closer to the electron beam (\(-z\)) direction, it is likely the result of initial-state radiation (ISR) and is removed from the event. The HFS is defined as all the particles remaining after this procedure, excluding the scattered electron.

The kinematic variables describing the event, \(Q^2\), y, and Bjorken x (denoted \(x_\textrm{Bj} \)), are reconstructed using the \(I\Sigma \) method as described in Ref. [107]. Bjorken \(x_\textrm{Bj} =Q^{2}/(2P\cdot q)\), where P and q refer to the incoming proton and exchanged boson four-vectors, respectively. The inelasticity \(y=P\cdot q / P\cdot k\), where k is the incoming electron four-vector. These kinematic variables are used to reconstruct the boost to the Breit frame, as well as the vector defining the current hemisphere of the Breit frame, \(q_{J}=xP+q\). The definitions of the Breit frame and \(q_J\) are described in more detail in Sect. 4.

Events are further selected based on the following quality assurance cuts placed on quantities measured in the laboratory frame:

  • The measured z-location of the event vertex is constrained to be within 35 cm of the nominal vertex z-location.

  • The total longitudinal momentum balance (\(E-p_z\)) of the event, evaluated by summing the \(E-p_z\) of all measured particles, is required to satisfy \(50< E-p_z < 60\) GeV. This requirement predominantly serves to reduce the contribution from events with significant QED initial-state radiation and events likely to come from photoproduction or beam-gas background. It has the additional benefit of rejecting events in which the hadronic final state is poorly reconstructed.

  • The total transverse momentum (\(p_\textrm{T}\)) of the hadronic final state is required to approximately balance that of the scattered electron \(p_\textrm{T}\), \(0.6< {p_\textrm{T,HFS}}/{p_{\textrm{T},e}} < 1.6\) and \({p_\textrm{T,HFS}}-{p_{\textrm{T},e}} < 5\) GeV. These requirements ensure that the hadronic final state and scattered electron are well-measured.

  • The vector defining the current hemisphere of the Breit frame, \(q_{J}\), is required to have a polar angle \(7^{\circ }<\theta _{q_J}<175^{\circ }\) in the lab frame in order to suppress non-collision backgrounds and to ensure the hadronic final state is properly contained within the detector.

  • Events with \(\theta _{q_J}>149^{\circ }\) and the velocity of the Lorentz boost to the Breit frame \(\beta > 0.9\) are rejected to suppress contamination from initial-state QED radiation. Events in which the difference between the polar angles of the HFS and the current hemisphere vector satisfies \(\theta _{q_J} - \theta _{\textrm{HFS}} > 90^{\circ }\) are also rejected for the same reason.

  • In the kinematic region studied here, the hadronic final state and the scattered electron are typically produced at similar polar angles. Events events with \(Q^2 < 700\) GeV\(^2\) and \(|\Delta \eta |>0.3\) are poorly measured and are rejected, where pseudorapidity in the laboratory frame \(\eta =-\textrm{ln}(\textrm{tan}\frac{\theta }{2})\), and \(\Delta \eta \) is the difference between the hadronic final state and the scattered electron.

After the kinematic phase space selection and the above event criteria are enforced, the analysed dataset consists of 189,106 events.

3 Simulations and theoretical calculations

Calculations based on the following Monte Carlo event generators and QCD calculations are compared to the data:

  • Djangoh 1.4 [108,109,110] uses Born-level matrix elements for NC DIS and dijet production, combined with the color dipole model from Ariadne [111] for higher-order emissions. Djangoh includes an interface to HERACLES [109] for higher-order QED effects at the lepton vertex. The proton parton distribution function (PDF) used by Djangoh is CTEQ6L [112]. Hadronization is simulated with the Lund hadronization model [113, 114], using parameters tuned to data by the ALEPH Collaboration [9].

  • Rapgap 3.1 [115] implements Born-level matrix elements for NC DIS and dijet production and uses the leading logarithmic approximation for parton showering. It is using the CTEQ6L PDF set and Lund hadronization implemented in Pythia [116].

  • The MC event generator Pythia 8.307 [117, 118] is used with two different parton-shower models: the default dipole-like \(p_\perp \)-ordered shower and the Dire [119,120,121] parton shower, which is an improved dipole shower with additional handling of collinear enhancements. Both implementations use the Pythia 8.3 default for hadronisation [118], which is based on the Lund string model. The parton showers both use 0.118 for value of the strong coupling constant at the mass of the Z boson, and both variations considered here use MMHT2014 as the hard PDF [122].

  • A recent prediction from Ref. [123] generalizes the POWHEG method to DIS, including handling of initial-state radiation off the lepton beam. This prediction, denoted Pythia+POWHEG, includes predictions in NLO QCD matched to parton showers using the POWHEG method [124, 125], which are then interfaced to Pythia 8.303. The default Pythia shower and hadronization schemes are applied. The Frixione–Kunszt–Signer (FKS) subtraction technique [126] is used to parameterize the radiation phase space.

  • The MC event generator Herwig 7.2 [127] is also studied in three variants. For the default prediction, Herwig 7.2 implements leading-order matrix elements that are supplemented with an angular-ordered parton shower [128] and the cluster hadronization model [129, 130]. The second variant makes use of the MC@NLO method that implements NLO matrix element corrections. In addition, a matching with the default angular-ordered parton shower is performed [131]. The third variant also makes use of NLO matrix elements but applies the dipole merging technique and a dipole parton shower [131]. The PDF used for all three of the Herwig variations is MMHT2014 [122]. The events generated with Herwig are further processed with Rivet [132].

  • Predictions are obtained with Sherpa 2 [133, 134], where Comix [135] generates matrix elements for up to three final-state jets. The CKKW merging formalism [136] is used to augment these jets with dipole showers [137, 138], and the final-state partons are hadronized with cluster hadronization as implemented in AHADIC++ [139]. As an alternative prediction, the hadronization step is performed with the Lund string fragmentation model [140]. Both predictions use the Sherpa 2 default PDF, which is CT10 [141].

  • A set of predictions for the groomed 1-jettiness is provided by the Sherpa authors using a pre-release version of Sherpa 3. Sherpa 3 features a new cluster hadronization model, described in Ref. [142]. Matrix elements at NLO are obtained from OpenLoops [143], and the resulting partons are showered via the Sherpa dipole shower [138] based on the truncated shower method described in Refs. [144, 145]. Sherpa 3 additionally features intrinsic \(k_{\textrm{T}}\) of the partons within the proton, while the predictions from Sherpa 2 have no intrinsic \(k_{\textrm{T}}\) included. The model for partonic intrinsic \(k_{\textrm{T}}\) used in the Sherpa 3 prediction is a Gaussian form with mean of 0 GeV and \(\sigma =0.75\) GeV. The prediction has an associated uncertainty, defined as the extrema of a 7-point scale variation.

  • Another prediction at NNLO+NLL\(^{\prime }\) [18] is provided for the groomed 1-jettiness. The predictions are computed using the CAESAR formalism [146] for NLL resummation as implemented in the Sherpa [133] framework [147, 148] and extended to cover the case of soft-drop groomed event shapes in Refs. [29, 149]. The resummed results are matched to fixed order predictions at NNLO accuracy, derived with the techniques implemented in Sherpa in Ref. [150]. Via a multiplicative matching, the calculation achieves NNLO+NLL\(^\prime \) accuracy. A description of the full setup in the DIS case can be found in Ref. [18]. Non-perturbative corrections are applied via a transfer matrix approach [151] from Sherpa tuned to data from the LEP and HERA experiments [18, 142]. The same techniques for calculating the groomed invariant mass are used here for the first time. Care is taken to evaluate logarithms of the form \(\ln (M^2/Q^2)\) to avoid ambiguities arising from the normalisation of the observable. The calculation ultimately achieves the same NNLO+NLL\(^\prime \) accuracy as before.

  • Predictions for the groomed invariant mass are provided using the formalism of SCET [89]. Predictions are constructed at NNLL for the shape of the groomed invariant mass spectrum in the single-jet limit, corresponding to small values of the groomed invariant mass \(M_{\text {Gr.}}^{2}\) \(\ll \) \(Q^{2}\). The perturbative predictions are convoluted with a shape function to account for non-perturbative effects. The predictions are provided at two values of the shape function mean parameter, \(\Omega _{\textrm{NP}}\). No attempt is made to match the NNLL calculation to a fixed-order prediction.

4 Observables

4.1 Breit frame

The reported measurements are carried out in the Breit frame of reference [152]. The Breit frame is the reference frame in which

$$\begin{aligned} 2x_\textrm{Bj} \vec {P}+\vec {q} = 0, \end{aligned}$$
(1)

where \(\vec {P}\) is the three-momentum of the incoming proton beam and \(\vec {q}\) is the three-momentum of the exchanged boson. As for the laboratory frame of reference, we choose the positive z axis of the Breit frame to be the direction of the incoming proton. In the quark-parton model, the Breit frame is the frame of reference in which the struck parton is initially aligned with the z axis with \(p_z = Q/2\) and leaves along the same axis with \(p_z = -Q/2\) after the momentum transfer from the space-like virtual boson. The maximum available longitudinal momentum in the Breit frame is therefore Q/2, and at Born level the scattered parton has transverse momentum \(p_\textrm{T} =0\).

The direction of the proton remnant is defined as \(\eta _{\mathrm {_{Breit}}} = +\infty \), while the direction of the struck parton is defined as \(\eta _{\mathrm {_{Breit}}} = -\infty \). The region \(\eta _{\mathrm {_{Breit}}} > 0\) is denoted the remnant hemisphere (RH), and the \(\eta _{\mathrm {_{Breit}}} < 0\) region is denoted the current hemisphere (CH). Events with large \(p_\textrm{T}\) in the Breit frame therefore correspond to multi-jet topologies, e.g. QCD Compton scattering in which the quark recoils with significant \(p_\textrm{T}\) from a hard gluon emission. For the kinematic selection used in this analysis, namely \(Q^2 > 150\) GeV\(^2\) and inelasticity \( 0.2< y < 0.7\), the scattered electron is typically at mid-rapidity (\(\eta _{\mathrm {_{Breit}}} \sim 0\)) in the Breit frame. This phase space also corresponds to the region in which the magnitude of the Lorentz boost from the lab frame to the Breit frame is small, thus minimizing the event-to-event change in the detector acceptance in the Breit frame [153].

4.2 Jet and event clustering in the Breit frame: Centauro algorithm

In the HERA convention, the leading-order quark-parton model process \(eq\rightarrow eq\) produces a jet at \(\eta _{\mathrm {_{Breit}}} =-\infty \) in the Breit frame. Such jets will not be captured by longitudinally-invariant \(k_\textrm{T}\)-type sequential recombination clustering algorithms applied in that frame [153, 154], since those algorithms cluster based on the transverse component of particle momenta that is largely boosted away by the transformation to the Breit frame. Additionally, the distance \(d_{ij}\) between particles becomes large in the direction of the struck parton due to the factor \(R^2=\sqrt{\Delta \eta _{\textrm{Breit}}^2+\Delta \phi ^2}\) in the denominator of the distance metric for those algorithms.

Centauro [154] is a sequential-recombination jet algorithm that overcomes these limitations by means of an asymmetric clustering measure, which preferentially clusters a jet from radiation in the current hemisphere of the Breit frame. Centauro thus clusters the Born-level configuration into a jet more readily than other laboratory and Breit frame algorithms. The Centauro distance measure is

$$\begin{aligned} d_{ij} = (\Delta \bar{\eta }_{ij})^2+2\bar{\eta }_i\bar{\eta }_j(1-\cos (\Delta \phi _{ij})), \end{aligned}$$
(2)

where \(\Delta \phi _{ij}\) is the azimuthal angle difference measured in the Breit frame between pairs of objects that are candidates for clustering, and \(\Delta \bar{\eta }_{ij}=\bar{\eta }_i-\bar{\eta }_j\) is the difference in an angular variables measuring along the longitudinal direction of the Breit frame. The angular variable is defined as

$$\begin{aligned} \bar{\eta }_i = -\frac{2p_{\textrm{T},i}}{n\cdot p_{i}}, \end{aligned}$$
(3)

where \(p_i\) is the four-momentum of particle i, \(p_{\textrm{T},i}\) is its transverse momentum in the Breit frame, and n is the four-vector of the beam proton, normalized to unity energy.

The Centauro algorithm can be used as a traditional jet-finding algorithm to reconstruct jets, but it can also be applied to the entire DIS event (equivalent to setting the jet radius \(R\rightarrow \infty \)) to generate a clustering tree in which the last clustered radiation is farthest from the nominal struck parton direction in the Breit frame. As discussed in Ref. [89], the natural quantity for comparison of branches in this tree is the Lorentz-invariant momentum fraction \(z_{i} \),

$$\begin{aligned} z_i=\frac{P\cdot p_i}{P\cdot q}, \end{aligned}$$

which in the Breit frame represents the fraction of the virtual boson momentum carried by the object i. Branches of the tree with low \(z_{i} \) are either soft or at wide angles with respect to the virtual boson. Branches with high \(z_{i} \) are likely to be fragments of the struck parton.

4.3 Event grooming

Event grooming follows the procedure described in Ref. [89], as follows. All four-vectors in the event are clustered into a tree by the Centauro algorithm; the tree is iteratively declustered in order reverse to the initial clustering; and at each declustering step the values of \(z_{i} \) of the branches are compared to the grooming condition,

$$\begin{aligned} \frac{\text {min(}z_{i},z_{j})}{z_{i} +z_{j}} > z_{\textrm{cut}}. \end{aligned}$$
(4)

If the grooming condition is not met, the branch with smaller \(z_{i} \) is removed and the remaining branch is again subdivided and compared to the grooming condition. The procedure continues until the grooming condition is met. Events in which the algorithm queries the full clustering tree without the grooming condition being met are removed from the analysis and do not contribute to the measured event shape cross section. The fraction of events which do not pass the grooming condition naturally increases with \(z_{\textrm{cut}}\). This approach is a version of the modified MassDrop Tagger (mMDT) grooming algorithm with \(\beta =0\) [90], adapted for DIS with \(z_{i} \) playing the role of \(p_{\textrm{T}_i}\) in standard mMDT.

Fig. 1
figure 1

Normalized distribution of particles measured in data and simulated by Djangoh and Rapgap, as a function of pseudorapidity in the laboratory frame. Left: particle-level; right: detector-level. Upper: ungroomed (\(z_{\textrm{cut}} \)=0); lower: groomed with \(z_{\textrm{cut}} \)=0.2

Figure 1 shows single-particle pseudorapidity distributions for groomed and ungroomed events at both particle- and detector-level. “Particle-level” here refers to the quantities produced by the event generator, where the final state comprises particles with proper lifetime greater than 8 ns. “Detector-level” refers to the four-vectors reconstructed after the particles from the generator are passed through a GEANT [155] detector simulation program, as well as several reconstruction algorithms. The complex shape of the ungroomed detector-level pseudorapidity distribution in the top right panel of Fig. 1 can be attributed to the transition between the barrel and forward sections of the LAr, as well as the contribution of secondary interactions in the detector and beamline.

Figure 1 also compares the detector-level simulated distributions to raw data, with good agreement found. For the data, the fraction of accepted events that pass the grooming cut is 99.3% for \(z_{\textrm{cut}} \)= 0.05, 98.4% for \(z_{\textrm{cut}} \)= 0.1, and 92.7% for \(z_{\textrm{cut}} \)= 0.2.

4.4 Groomed event shapes

Classical global event shape observables incorporate a summation over all particles in an event, including those which are produced at small angles with respect to the beam. Monte Carlo simulations show that, for high-\(Q^{2}\) \(ep\) DIS events such as those in this analysis, 30–40% of generated particles fall in the forward region \(\eta _{\mathrm {_{Lab}}} >3.5\), beyond the detector acceptance. In previous event shape analyses at HERA [13,14,15,16,17], the impact of the missing forward acceptance was reduced by limiting the measurement to particles in the current hemisphere, which are better contained by the detector. In that case, however, theoretical calculations of event shapes must also include radiation that is nominally emitted in the remnant hemisphere but enters the current hemisphere at higher order, generating non-global logarithms that can compromise theoretical precision [89, 156]. These limitations motivate consideration of methods that can ameliorate both the experimental and theoretical challenges typically associated with event shape observables.

It can be seen from Fig. 1 that the application of grooming alleviates the need for a redefinition of event shape observables, since the grooming procedure grooms away the particles produced outside the detector acceptance. This follows from the fact that at high \(Q^{2}\), the exchanged virtual boson typically points toward the central region of the detector. Since the grooming tends to remove particles that are anti-collinear and at wide angles with respect to the exchanged boson, the particles that survive the grooming are generally well-contained within the central region of the detector. For the hardest grooming cut considered here, \(z_{\textrm{cut}}\ = 0.2\), only 0.5% of particles surviving the grooming at particle-level are beyond the forward acceptance of the detector. QCD initial-state radiation, beam remnants, and wide-angle soft radiation are largely groomed away. The remaining particles therefore consist predominantly of fragments of the struck parton, which are collimated in the virtual boson direction. Groomed event shape observables are calculated from these surviving particles.

This paper reports two groomed event shape observables: \(M_{\text {Gr.}}^{2}\), the groomed invariant mass, and \(\tau ^{b}_{1,{\text {Gr.}}}\), the groomed 1-jettiness. The observable \(M_{\text {Gr.}}^{2}\) is defined as

$$\begin{aligned} M_{\text {Gr.}}^{2}= \left( \sum _{i\in \textrm{groomed}(z_{\textrm{cut}})}p_i\right) ^{2}, \end{aligned}$$
(5)

where the sum over i runs over all particles in the hadronic final state that survive the grooming for the specified value of \(z_{\textrm{cut}} \), and \(p_i\) is the four-momentum of particle i. For comparison with the predictions in Ref. [89], the groomed invariant mass is expressed as the natural logarithm of \(M_{\text {Gr.}}^{2}\)normalized to \(Q^2_\mathrm {min.}\), the minimum value of \(Q^{2}\) in the event population; i.e.

$$\begin{aligned} \text {GIM}\equiv \textrm{ln}(M_{\text {Gr.}}^{2}/Q^2_\mathrm {min.}). \end{aligned}$$
(6)

Measurements of the single- and double-differential GIM cross sections are reported in the present paper. The double-differential cross sections are presented as functions of \(Q^{2}\). For the single-differential measurement, as well as the normalized double-differential measurement presented in Fig. 5, \(Q^2_\mathrm {min.}\) is fixed at 150 \(\textrm{GeV}^2\). For the double-differential cross section measurement presented in Fig. 6, \(Q^2_\mathrm {min.}\) is set to the lower edge of each \(Q^{2}\) bin.

The 1-jettiness event shape observable \(\tau _1\) and its variants, \(\tau ^{a}_{1}\), \(\tau ^{b}_{1}\), and \(\tau ^{c}_{1}\), are defined in Ref. [156]. The 1-jettiness variant \(\tau ^{b}_{1}\) is chosen for this analysis because it aligns best with the grooming procedure: both the Centauro clustering algorithm and the grooming procedure are defined in the Breit frame, which is also the natural frame for \(\tau ^{b}_{1}\). In contrast, \(\tau ^{a}_{1}\) uses a jet found in the lab frame, and \(\tau ^{c}_{1}\) uses the center-of-mass frame. The groomed observable \(\tau ^{b}_{1,{\text {Gr.}}}\) is defined as

$$\begin{aligned} \tau ^{b}_{1,{\text {Gr.}}}=\frac{2}{Q^2} \sum _{i\in \textrm{groomed}(z_{\textrm{cut}})}\text {min(}q_B\cdot p_i,q_J\cdot p_i), \end{aligned}$$
(7)

where \(q_{B}=xP\) and \(q_{J}=xP+q\), and the sum likewise runs over all the hadronic final state particles that survive the grooming procedure for the chosen value of \(z_{\textrm{cut}} \).

Equation 7 projects each particle 4-vector onto both \(q_J\), the virtual boson 4-vector, and \(q_B\), the beam 4-vector, and selects the axis to which it is best aligned. Particles in the current hemisphere are better aligned with \(q_J\), while particles in the remnant hemisphere are better aligned with \(q_B\). The observable \(\tau ^{b}_{1,{\text {Gr.}}}\) takes values between 0 and 1, with values near 0 corresponding to collimated events resembling a single jet and values near 1 corresponding to multi-jet events. If momentum conservation in the Breit frame is assumed, \(\tau _1^b\) is formally equivalent to the DIS thrust normalized by Q [156]. The sum is normalized by the value of \(Q^{2}\) measured in the corresponding event.

Since radiation in the proton-going direction is removed by grooming, groomed event shapes are more tightly correlated with the struck parton direction than standard DIS event shapes [20, 22, 157, 158]. At leading order, groomed events can therefore be considered as jets; grooming effectively defines a jet without imposing a jet radius cutoff for the clustering.

4.5 Choice of \(z_{\textrm{cut}} \) value

In the analytic SCET calculations, the value of \(z_{\textrm{cut}} \) should be chosen to respect the factorization of the calculation [89]. Two regions are defined in the calculation: \( 1 \gg z_{\textrm{cut}} \gg M_{\text {Gr.}}^{2}/Q^2 \), and \(1 \gg z_{\textrm{cut}} \simeq M_{\text {Gr.}}^{2}/Q^2 \). The reach in \(Q^{2}\) is fixed primarily by the integrated luminosity and center-of-mass energy, and the reach to low masses is limited by the detector resolution for small angles in the Breit frame. Values of \(z_{\textrm{cut}} \) greater than 0.3 not only begin to violate the condition \(z_{\textrm{cut}} \ll 1\), they also result in a large fraction of events with small groomed invariant mass that are challenging to reconstruct precisely. Therefore, in this analysis we report event shape distributions for \(z_{\textrm{cut}} \) values of 0.05, 0.1, and 0.2.

5 Corrections

The corrected differential cross section in a bin of event shape observable e is defined as

$$\begin{aligned} \frac{d\sigma _i}{de} = \frac{\sum _j A^+_{ij}(n_\text {data,j} - n_\text {Bkg,j})}{{\mathcal {L}}_\textrm{int} \cdot \Delta }\cdot c_{\text {QED,i}}, \end{aligned}$$
(8)

where the indices i and j represent particle-level and detector-level quantities, respectively; \(\Delta \) is the bin width; \(A^+_{ij}\) is the regularized inverse of the detector response matrix; \({\mathcal {L}}_\textrm{int} \) is the analysed integrated luminosity; \(n_\text {data,j}\) is the number of events measured in bin j; \(n_\text {Bkg,j}\) is the number of estimated background events in bin j; and \(c_{\textrm{QED}}\) is the QED correction factor.

The data are corrected to the non-radiative particle-level in the phase space of \(Q^{2} > 150~\textrm{GeV}^2\) and \(0.2<y<0.7\). Detector effects are corrected by regularized unfolding using the TUnfold package [159], and QED effects are corrected bin-by-bin. No hadronization corrections are applied to the data.

5.1 Regularized unfolding

TUnfold utilizes a least-squares fit technique with Tikhonov regularization. The so-called “curvature” mode of TUnfold is utilized, which regularizes the second derivative of the output distribution. The regularization is performed at values of the regularization parameter \(\tau \) that minimize the influence of the unfolding on the final result while maintaining good closure.

The detector response matrix for unfolding is calculated as the average of the respective matrices built using events generated by Rapgap [115] and Djangoh [108,109,110]. The simulated datasets correspond to an integrated luminosity of \({\mathcal {L}}_\textrm{int} = 40\) fb\(^{-1}\) for both Djangoh and Rapgap. The generated events are then passed through the H1 detector simulation implemented in GEANT3 [155] and augmented with a fast calorimeter simulation [160, 161]. The same reconstruction algorithms that are used for data are applied to the output of the simulation. The response matrix has three bins in the reconstructed observable for each bin of measured data.

The following models are used to evaluate the number of background events measured in each bin, \(n_\text {Bkg,j}\):

  • NC DIS is simulated by Djangoh [108] and Rapgap [115]. For \(60< Q^2 < 150~\textrm{GeV}^2\), Djangoh and Rapgap are used, while for \(4< Q^2 < 60~\textrm{GeV}^2\), only Djangoh is used. The contribution of these backgrounds to the measured event sample is around 5% and is dominated by migration to higher \(Q^{2}\) of events near to the kinematic boundary, i.e. \(Q^2 < 150\) \(\textrm{GeV}^2\).

  • Events with \(Q^2 < 4\) GeV\(^2\), including photoproduction, are simulated by Pythia 6.2 [116, 162].

  • QED Compton scattering is simulated by COMPTON [163].

  • Di-lepton production is simulated by GRAPE [164].

  • Deeply virtual Compton scattering is simulated by MILOU [165].

  • Charged-current DIS is simulated by Djangoh.

The contributions of all sources of background other than NC DIS are negligible.

Closure of the unfolding procedure is tested by unfolding the detector-level distribution as determined via Rapgap with the response matrix generated by Djangoh, and vice versa. The output distribution of the unfolding procedure is compared to the corresponding particle-level distribution to determine whether the procedure is returning results close to the truth. Typical values of \(\tau \) are \(1\cdot 10^{-5}\) and \(4\cdot 10^{-4}\) for the single- and double-differential distributions, respectively.

5.2 QED radiation and corrections

The radiation of photons off the electron affects the cross section in several ways. Initial-state emission of a real photon off the electron distorts the measurement of \(Q^2\), y, and x, occasionally producing an energetic cluster in the SpaCal. Final-state radiation is typically emitted collinear to the scattered electron and thus produces one energetic cluster in the calorimeter, but occasionally the photon will be produced at a larger angle with respect to the electron and will be resolved. In both cases, these photons must be removed from the hadronic final state since they tend to lie at mid-rapidity in the Breit frame and therefore can significantly disturb the grooming procedure. Additionally, virtual corrections to the NC DIS process can change the overall normalization and shape of the inclusive cross section.

QED effects are included in Djangoh and Rapgap via an interface to the HERACLES program [109]. HERACLES simulates the first-order electroweak corrections to both \(e^+p\) and \(e^-p\) DIS, including virtual corrections and real photon emission from the lepton. The data are corrected for these effects by applying a bin-by-bin factor, \(c_{\textrm{QED}}\), which is defined as the ratio between the non-radiative and radiative particle-level distributions. The data, which are a mixture of \(e^+p\) and \(e^-p\) collisions, are corrected to the \(e^-p\) cross section. This effect is also encapsulated in \(c_{\textrm{QED}}\). The value of \(c_{\textrm{QED}}\) is similar for both observables and has a uniform value of about 1.15. The magnitude of the QED correction is a result of the cut on \(E-p_z\) being applied on the radiative particle-level. A non-radiative particle-level event always has \(E-p_z = 2E_e\), whereas the radiative particle-level event has been defined with the ISR photon excluded, such that \(E-p_z = 2(E_e-E_{\gamma })\), where \(E_{\gamma }\) is the energy of the photon radiated by the electron in the initial state. The result is that in the radiative particle-level, the cross section is decreased by the likelihood that \(2(E_e-E_{\gamma }) < 50~\textrm{GeV} \), which is around 15% in the kinematics of this measurement. The values of \(c_{\textrm{QED}}\) are presented in the data tables in the appendix. In the highest \(Q^2\) bin of the double-differential measurement, \(c_{\textrm{QED}}\) has values around 20%, due to the difference between the \(e^-p\) and \(e^+p\) cross sections at high \(Q^2\).

6 Uncertainties

The following components of the analysis contribute to the systematic uncertainty of the reported cross sections. All sources of uncertainty are evaluated with both Rapgap and Djangoh. The average of the uncertainties as determined using the two models is used as the uncertainty on the data. The total systematic uncertainty is defined as the sum in quadrature of the individual systematic uncertainties arising from the sources described below.

6.1 Alignment

The polar angle alignment of the tracking detectors with the liquid argon calorimeter has a precision of 1 mrad [166]. This precision results in an uncertainty in the measured position for all HFS objects and for the scattered electron. The HFS and electron polar angle uncertainties are considered separately, and each is passed through the unfolding procedure. The resulting uncertainties in the final distributions are typically \(\sim 1\%\). The values reported in the data tables in Sect. 9 are signed quantities corresponding to the difference between the nominal angles and the angles after the systematic shift of the polar angle of all simulated objects upwards by 1 mrad.

6.2 Energy scales

The measured energy of the scattered electron has a precision of 0.5% in the backward and central regions of the detector and a precision of \(1\%\) in the forward region [166]. The uncertainty in final distributions due to this precision is determined by varying the scattered electron energy and passing the modified events through the unfolding procedure. The resulting uncertainty has a value of less than \(\sim 3\%\).

Independent cluster energy calibrations are used to describe clusters inside and outside of high-\(p_\textrm{T}\) jets [87, 167]. The uncertainty in energy scale of particles inside of high-\(p_\textrm{T}\) jets is denoted as the jet energy scale uncertainty (JES), and the uncertainty in energy scale of particles outside of jets is denoted as the residual cluster energy scale uncertainty (RCES). These energy scales are independently varied by a factor of \(1\%\), and the resulting distributions are passed through the unfolding procedure. The difference in unfolded distributions between the variations and nominal scale values provides the corresponding uncertainty, which is typically 1–2%.

The value of these uncertainties as reported in the data tables is the signed difference between the unfolded result with the nominal energy scale and the energy scale shifted upwards, i.e. a negative value corresponds to the case where the value in a given bin was increased after the systematic shift.

6.3 Integrated luminosity and normalization effects

The uncertainty in integrated luminosity is 2.7% [168], which is applied to the final cross sections. This uncertainty additionally accounts for several other small normalization uncertainties, including trigger efficiency, the QED correction, the calorimeter noise suppression algorithm, and electron identification.

6.4 Unfolding

Three sources of uncertainty from the unfolding procedure are studied. In almost all bins of the measurement, unfolding-related uncertainties dominate over detector-related uncertainties.

Model dependence: The uncertainty from the model dependence of the unfolding is estimated as half the difference between the spectra unfolded using the migration matrices from Rapgap and Djangoh, respectively. This is the dominant systematic uncertainty in many bins of the measurement, with typical values of 5–10% for the single-differential.

Regularization: The uncertainty associated with regularization is determined by varying the regularization parameter by a factor of two larger and smaller than its nominal value. For the single-differential cross sections, the regularization uncertainty is similar in size to the model dependence uncertainty.

Statistics: The statistical uncertainty is determined by a resampling procedure, in which the input data to the unfolding are varied according to the statistical precision associated with the number of events in each bin and then unfolded. For each observable, this procedure was repeated one thousand times. In each bin of the measurement, the standard deviation of the one thousand replicas is taken as an uncertainty on the value of the bin. This source of uncertainty is typically sub-leading, excepting a few bins with limited statistics in the double-differential distributions.

7 Results

In this section we present cross sections of the normalized groomed invariant mass GIM Eq. (6) and groomed 1-jettiness \(\tau ^{b}_{1_{\text {Gr.}}}\) Eq. (7), fully corrected for detector and QED effects as described in Sect. 5. The analysis phase space is defined by \(0.2< y < 0.7\) and \(Q^2 > 150~\) \(\textrm{GeV}^2\). Section 3 describes the MC models and analytic pQCD calculations that are compared to the data. The data tables are provided in Sect. 9.

7.1 Single-differential cross sections

Fig. 2
figure 2

Differential cross section of groomed invariant mass GIM \(=\textrm{ln}(M_{\text {Gr.}}^{2}/150~\textrm{GeV}^2) \) in \(ep\) DIS at \(\sqrt{s} =319~ \textrm{GeV}\), for \(z_{\textrm{cut}} =0.05\), 0.1 and, 0.2. The value of \(Q^2_\mathrm {min.}\)is set to 150 \(\textrm{GeV}^2\). The phase space is restricted to \(Q^{2} >150\) \(\textrm{GeV}^2\) and \(0.2< y < 0.7\). Uncertainty bars on the data show the quadrature sum of the statistical error and systematic uncertainty. The Monte Carlo event generators and pQCD calculations that are compared to the data are described in Sect. 3

Fig. 3
figure 3

Differential cross section of groomed 1-jettiness \(\tau ^{b}_{1_{\text {Gr.}}}\) in \(ep\) DIS at \(\sqrt{s} =319~ \textrm{GeV}\), for \(z_{\textrm{cut}} =0.05\), 0.1 and, 0.2. The phase space is restricted to \(Q^{2} >150\) \(\textrm{GeV}^2\) and \(0.2< y < 0.7\). Uncertainty bars on the data show the quadrature sum of the statistical error and systematic uncertainty. The Monte Carlo event generators and pQCD calculations that are compared to the data are described in Sect. 3

Figures 2 and 3 show the single-differential GIM and \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections, respectively, for \(z_{\textrm{cut}} =0.05\), 0.1, and 0.2. The numerical values of the data points are provided in Tables 2, 3, 4, 5, 6 and 7. The GIM distributions exhibit peaks around \(\textrm{ln}(M_{\text {Gr.}}^{2}/150~\textrm{GeV}^2) \sim -2\), corresponding to masses of around 20 GeV. The \(\tau ^{b}_{1_{\text {Gr.}}}\) distributions peak at small \(\tau ^{b}_{1_{\text {Gr.}}}\), around 0.05. These values of GIM and \(\tau ^{b}_{1_{\text {Gr.}}}\) are referred to as the “peak” region and roughly correspond to events wherein the groomed final state is a single jet. The region \(\textrm{ln}(M_{\text {Gr.}}^{2}/150~\textrm{GeV}^2) \gtrsim 1\) and \(\tau ^{b}_{1_{\text {Gr.}}} \gtrsim 0.5\) is referred to henceforth as the “tail” or “fixed-order” region and typically corresponds to events with multiple jets or sub-jets that survived grooming. The tail region is sensitive to matrix elements, PDFs, and the color connection between the struck parton and the beam remnant. The figures show that most of the MC generators underpredict the large mass and large \(\tau ^{b}_{1_{\text {Gr.}}} \) region of the groomed event shape observables. The level of disagreement between the models and the data in this region does not appear to be a strong function of \(z_{\textrm{cut}}\).

Sherpa 3 better describes the first bin of the groomed \(\tau ^{b}_{1_{\text {Gr.}}}\) distribution compared to Sherpa 2. This could arise from either the improved hadronization model or the addition of intrinsic \(k_\textrm{T}\) to the initial-state partons. With the improvement of the first bin, Sherpa 3 successfully describes the data within uncertainties across the whole \(\tau ^{b}_{1_{\text {Gr.}}}\) distribution. The 7-point scale variation produces uncertainties around 10% in the peak region and 30% in the tail region.

The NNLO+NLL\(^\prime \) prediction provides a reasonable description of the single-jet peak region at low \(z_{\textrm{cut}} \), but the description is poorer at higher \(z_{\textrm{cut}} \). This may indicate the need for higher resummed accuracy at higher \(z_{\textrm{cut}} \). The prediction underestimates the cross section in the tail region, where the fixed order calculation is expected to provide an accurate description of the data.

7.2 Comparison to SCET predictions

Fig. 4
figure 4

Differential cross section of groomed invariant mass GIM \(=\textrm{ln}(M_{\text {Gr.}}^{2}/150~\textrm{GeV}^2) \) in \(ep\) DIS at \(\sqrt{s} =319~ \textrm{GeV}\), for \(z_{\textrm{cut}} =0.05\), 0.1 and, 0.2. The value of \(Q^2_\mathrm {min.}\)is set to 150 \(\textrm{GeV}^2\). The phase space is restricted to \(Q^{2} >150\) \(\textrm{GeV}^2\) and \(0.2< y < 0.7\). Uncertainty bars on the data show the quadrature sum of the statistical error and systematic uncertainty. SCET calculations [89] are compared to the data, with value of the shape function mean \(\Omega _{\textrm{NP}}\) of 1.1 GeV and 1.5 GeV. The calculations are normalized to the integral of the data in the region \(\textrm{GIM}<-1\). The shaded bands on the predictions correspond to the associated scale uncertainty

Figure 4 shows the measured GIM single-differential cross section, with SCET calculations in comparison [89]. The predictions are normalized to the data in the range \(\textrm{GIM}<-1\) by equating their integrals. Two values of the mean of the non-perturbative shape function, \(\Omega _{\textrm{NP}} = 1.1\) GeV and 1.5 GeV, are used in the prediction. The shape function encapsulates the non-perturbative contribution to the observable resulting from hadronization, which becomes increasingly important at low values of GIM. The prediction has associated scale uncertainties, which are determined by varying all scales in the perturbative prediction by a factor of 2. Note, however, that the uncertainty of the shape function is not evaluated, so that the total theory uncertainty is underestimated at the smaller values of \(z_{\textrm{cut}} \), where the shape function makes a more significant contribution to the total distribution.

The level of agreement of the calculation with data is limited for \(z_{\textrm{cut}} =0.05\) and 0.1, with better agreement for \(z_{\textrm{cut}} =0.2\). This accords with the expectation that the SCET approximation is valid for \(1\gg z_{\textrm{cut}} \sim M_{\text {Gr.}}^{2}/Q^2 \) [89], which is not respected for \(z_{\textrm{cut}} =0.05\) and 0.1. The data likewise prefer \(\Omega _\textrm{NP}=1.5\) GeV, which is expected since the calculation generates on average smaller mass than observed in the data, and the shape function, which accounts for non-perturbative effects, increases the mass relative to the partonic calculation. The value \(\Omega _{\textrm{NP}}=1.5\) GeV is larger than the naive expectation, \(\Omega _{\textrm{NP}}\sim 1\) GeV [89]. The high value of \(\Omega _{\textrm{NP}}\) may compensate for the effect of gluon jets, which are not included in the calculation.

Fig. 5
figure 5

Number of events as a function of GIM for \(z_{\textrm{cut}} \)= 0.2 in several \(Q^{2}\) bins, with integral normalized to unity in the region \(\textrm{GIM}<-2\). \(Q^2_\mathrm {min.}\)Vertical dotted lines show bin boundaries of bins used in the normalization. The data points are offset horizontally for clarity. The bins have been scaled by their width

The SCET calculation predicts that the shape of the groomed invariant mass distribution is independent of \(Q^{2}\) in the low mass limit, defined by the relation

$$\begin{aligned} \frac{M^2_{\text {Gr.}}}{Q^2} \ll z_{\textrm{cut}} \ll 1. \end{aligned}$$
(9)

Figure 5, which tests this prediction, shows the GIM distribution for \(z_{\textrm{cut}} \)= 0.2 in five bins of \(Q^{2}\). The factor \(Q^2_\mathrm {min.}\)is taken to be 150 \(\textrm{GeV}^2\) for all \(Q^{2}\) bins. The integrals of all \(Q^{2}\) distributions are normalized in the region \(\textrm{GIM}<-2\). In this region, the \(x_\textrm{Bj} \) and \(Q^{2}\)-dependence of the cross section occurs only in the component of the event that has been groomed away; the groomed distribution is therefore expected to be invariant with respect to \(x_\textrm{Bj} \) and \(Q^{2}\). The shape of the distributions shown in Fig. 5 is observed to be independent of \(Q^{2}\), in agreement with this prediction.

7.3 Double-differential cross sections

Fig. 6
figure 6

Double-differential groomed invariant mass distributions for \(z_{\textrm{cut}} = 0.05\), 0.1, and 0.2, for five bins in \(Q^{2}\). The value of \(Q^2_\mathrm {min.}\)is taken to be the lowest value of \(Q^{2}\) in the bin, e.g. 1122 \(\textrm{GeV}^2\) for the highest \(Q^{2}\) bin. Uncertainty bars on the data show the quadrature sum of the statistical and systematic uncertainties. The data points are horizontally offset from the bin center for visibility. The lines represent predictions from Sherpa 2 using the cluster hadronization model. The five distributions at different values of \(Q^{2}\) are vertically offset by the factor given in parentheses

Fig. 7
figure 7

Double-differential \(\tau ^{b}_{1_{\text {Gr.}}}\) cross section for \(z_{\textrm{cut}} = 0.05\), 0.1, and 0.2, presented in five bins in \(Q^{2}\). Further details are given in the caption of Fig. 6

Figures 6 and 7 show double-differential cross sections of GIM and \(\tau ^{b}_{1_{\text {Gr.}}}\), alongside calculations from Sherpa 2 with the cluster hadronization model for comparison. The data are presented in five bins of \(Q^2\) and at three values of \(z_{\textrm{cut}} \). The binning used in these figures is presented in Table 1. Bins with very small event counts were merged with the neighboring bins. The numerical values of the data are provided in Tables 8, 9, 10, 11, 12, 13, 14, 15, 16, and 17.

A reasonable agreement between the predictions of Sherpa 2 and the data is found in the majority of bins, with some tension observed at very low \(\tau ^{b}_{1_{\text {Gr.}}}\) and GIM. The description of the peak region of the \(\tau ^{b}_{1_{\text {Gr.}}}\) distribution can be seen to improve with \(Q^{2}\).

At higher values of \(Q^{2}\), the mean values of the event shape distributions decrease, in accordance with the expectation from QCD. These measurements may provide new constraints on the strong coupling constant \(\alpha _{\textrm{s}}\).

8 Summary

This paper presents the first measurement of groomed event shape observables in deep inelastic ep collisions. Measurements of the invariant mass and 1-jettiness \(\tau ^{b}_{1_{\text {Gr.}}}\) of the groomed hadronic final state are reported for \(e^{-}p\) collisions at \(\sqrt{s} =319\) GeV, for events selected with \(Q^{2} > 150\) GeV\(^2\) and \( 0.2< y < 0.7\). Events are clustered using the Centauro jet algorithm, and results are presented for values of the grooming parameter \(z_{\textrm{cut}} =0.05\), 0.1, and 0.2. Cross sections are reported single- and double-differentially.

Event grooming suppresses non-perturbative contributions to event shape distributions in a theoretically well-controlled way. Comparisons of Monte Carlo models and analytical pQCD calculations to these data therefore provide significant new tests of their implementation of both perturbative and non-perturbative processes.

Two of the models that are commonly compared to HERA data, Rapgap and Djangoh, can describe the data in the fixed-order tail regions of the groomed event shape distributions but underestimate the single-jet peak regions. More recently developed models, Pythia 8, Herwig 7, and Sherpa 2, underestimate the fixed-order tail region. The agreement of the models with the data in the low-mass or low-\(\tau ^{b}_{1_{\text {Gr.}}}\) region improves for higher \(z_{\textrm{cut}} \). Sherpa 3 accurately describes the full distribution of the groomed 1-jettiness within the experimental and theoretical uncertainties.

The numerical predictions of the SCET calculation fail for the lower \(z_{\textrm{cut}} \) values. Comparison of the calculation with data indicates overall a preference for a larger value of the non-perturbative shape function \(\Omega _{\textrm{NP}}\), suggesting that hadronization and other non-perturbative effects are significant in the single-jet limit. The prediction of Ref. [89], that the shape of the low-mass region is independent of the hard scale \(Q^2\), is found to hold in spite of the disagreement between the numerical predictions and the data.

Event shapes are sensitive to many aspects of QCD final states, making them valuable input for tuning MC event generators. However, such models contain many parameters and determining their optimum values is challenging, since a given effect can often be correctly described by tuning multiple different parameters. The introduction of grooming suppresses the contribution of certain event components, such as the proton beam fragmentation, and causes others, such as the contribution from soft radiation, to scale with \(z_{\textrm{cut}} \).

An accurate description of DIS in MC event generators is crucial for the scientific program of the upcoming Electron-Ion Collider [92]. Future facilities currently under discussion, the LHeC [169, 170] and FCC-eh [171], likewise will require precise MC modeling of the DIS hadronic final state to achieve their physics goals. The groomed event shape distributions reported here provide new, differential constraints for the tuning of MC models and offer the possibility for extracting PDFs and fundamental QCD parameters such as \(\alpha _s\).

9 Tables

Numerical data are provided for the single-differential cross sections as a function of the groomed invariant mass for grooming parameter \(z_{\textrm{cut}} = 0.05\) in Table 2, for \(z_{\textrm{cut}} = 0.1\) in Table 3, and for \(z_{\textrm{cut}} = 0.2\) in Table 4. Similarly, numerical cross-section data are presented as a function of the groomed 1-jettiness for grooming parameter \(z_{\textrm{cut}} = 0.05\) in Table 5, for \(z_{\textrm{cut}} = 0.1\) in Table 6, and for \(z_{\textrm{cut}} = 0.2\) in Table 7.

Table 1 Binnings used in the analysis. The reduced binnings are used only in the low-statistics regions of the double-differential result
Table 2 Single-differential groomed invariant mass cross sections for \(z_{\textrm{cut}} = 0.05\). The statistical error represents the outcome of the replica method described in Sect. 6. The sources of systematic uncertainty are described in detail in Sect. 6. The total uncertainty on the data is the quadrature sum of the statistical and systematic errors. \(c_{\textrm{QED}}\) is the QED correction factor as derived from HERACLES; the radiative cross section can be recovered by dividing the given data by the corresponding value of \(c_{\textrm{QED}}\)
Table 3 Same as Table 2 for the GIM cross section at \(z_{\textrm{cut}} = 0.1\)
Table 4 Same as Table 2 for the GIM cross section at \(z_{\textrm{cut}} = 0.2\)
Table 5 Same as Table 2 for the \(\tau ^{b}_{1_{\text {Gr.}}}\) cross section at \(z_{\textrm{cut}} = 0.05\)
Table 6 Same as Table 2 for the \(\tau ^{b}_{1_{\text {Gr.}}}\) cross section at \(z_{\textrm{cut}} = 0.1\)

Numerical data on double-differential measurements as a function of the groomed invariant mass, \(z_{\textrm{cut}} \) and \(Q^2\) are shown in Table 8 for \(150<Q^{2} <200\) \(\textrm{GeV}^2\), Table 9 for \(200<Q^{2} <282\) \(\textrm{GeV}^2\), Table 10 for \(282<Q^{2} <447\) \(\textrm{GeV}^2\), Table 11 for \(447<Q^{2} <1122\) \(\textrm{GeV}^2\), and Table 12 for \(1122<Q^{2} <20000\) \(\textrm{GeV}^2\). Note that \(Q^2_{\textrm{min}}\) and hence the definition of the GIM variable is different for each \(Q^2\) interval.

Table 7 Same as Table 2 for the \(\tau ^{b}_{1_{\text {Gr.}}}\) cross section at \(z_{\textrm{cut}} = 0.2\)
Table 8 Groomed invariant mass cross section in the range \(150<Q^{2} <200\) \(\textrm{GeV}^2\). \(Q^2_{\mathrm {min.}}\) is set to 150 GeV\(^{2}\). The statistical error represents the outcome of the replica method described in Sect. 6. The systematic uncertainty is the quadrature sum of all the sources listed in Sect. 6. The total uncertainty on the data is the quadrature sum of the statistical and systematic errors given here. \(c_{\textrm{QED}}\) is the QED correction factor as derived from HERACLES; the radiative cross section can be recovered by dividing the given data by the corresponding value of \(c_{\textrm{QED}}\). The lowest bins in groomed invariant mass were consolidated due to low statistics. The binning follows the reduced low GIM binning given in Table 1
Table 9 Same as Table 8 for groomed invariant mass cross sections in the range \(200<Q^{2} <282\) \(\textrm{GeV}^2\). \(Q^2_{\mathrm {min.}}\) is set to 200 GeV\(^{2}\). The binning follows the standard GIM binning given in Table 1
Table 10 Same as Table 8 for groomed invariant mass cross sections in the range \(282<Q^{2} <447\) \(\textrm{GeV}^2\). \(Q^2_{\mathrm {min.}}\) is set to 282 GeV\(^{2}\). The binning follows the standard GIM binning given in Table 1
Table 11 Same as Table 8 for groomed invariant mass cross sections in the range \(447<Q^{2} <1122\) \(\textrm{GeV}^2\). \(Q^2_{\mathrm {min.}}\) is set to 447 GeV\(^{2}\). The binning follows the reduced high GIM binning given in Table 1
Table 12 Same as Table 8 for groomed invariant mass cross sections in the range \(1122<Q^{2} <20000\) \(\textrm{GeV}^2\). \(Q^2_{\mathrm {min.}}\) is set to 1122 GeV\(^{2}\). The binning follows the reduced high GIM binning given in Table 1
Table 13 Same as Table 8 for \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections in the range \(150<Q^{2} <200\) \(\textrm{GeV}^2\). The binning follows the standard \(\tau ^{b}_{1_{\text {Gr.}}}\) binning given in Table 1
Table 14 Same as Table 8 for \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections in the range \(200<Q^{2} <282\) \(\textrm{GeV}^2\). The binning follows the standard \(\tau ^{b}_{1_{\text {Gr.}}}\) binning given in Table 1
Table 15 Same as Table 8 for \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections in the range \(282<Q^{2} <447\) \(\textrm{GeV}^2\). The binning follows the standard \(\tau ^{b}_{1_{\text {Gr.}}}\) binning given in Table 1
Table 16 Same as Table 8 for \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections in the range \(447<Q^{2} <1122\) \(\textrm{GeV}^2\). The binning follows the standard \(\tau ^{b}_{1_{\text {Gr.}}}\) binning given in Table 1
Table 17 Same as Table 8 for \(\tau ^{b}_{1_{\text {Gr.}}}\) cross sections in the range \(1122<Q^{2} <20,000\) \(\textrm{GeV}^2\). The binning follows the reduced \(\tau ^{b}_{1_{\text {Gr.}}}\) binning given in Table 1