Dependence of inclusive jet production on the anti-kT distance parameter in pp collisions at s\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$ \sqrt{\mathrm{s}} $$\end{document} = 13 TeV

The dependence of inclusive jet production in proton-proton collisions with a center-of-mass energy of 13 TeV on the distance parameter R of the anti-kT algorithm is studied using data corresponding to integrated luminosities up to 35.9 fb−1 collected by the CMS experiment in 2016. The ratios of the inclusive cross sections as functions of transverse momentum pT and rapidity y, for R in the range 0.1 to 1.2 to those using R = 0.4 are presented in the region 84 < pT< 1588 GeV and |y|< 2.0. The results are compared to calculations at leading and next-to-leading order in the strong coupling constant using different parton shower models. The variation of the ratio of cross sections with R is well described by calculations including a parton shower model, but not by a leading-order quantum chromodynamics calculation including nonperturbative effects. The agreement between the data and the theoretical predictions for the ratios of cross sections is significantly improved when next-to-leading order calculations with nonperturbative effects are used.


Introduction
Quantum chromodynamics (QCD) is a gauge theory describing the strong interaction between partons (quarks and gluons). Jets are reconstructed using hadron particles produced by the fragmentation of partons in collisions [1]. Thus jets approximate the original partons created in short-distance scatterings. The production cross sections for high transverse momentum (p T ) partons can be calculated using perturbative QCD (pQCD). Specifically, predictions for hadron production in proton-proton collisions require models for parton showering [2][3][4] and nonperturbative (NP) effects such as hadronization [5] and underlying event (UE) [6]. When the fixed-order prediction in pQCD is not adequate, higher-order terms must be included using resummation methods [7][8][9].
The results of measurements of inclusive jet production cross sections for proton-proton collisions are typically presented using the anti-k T jet algorithm [10] characterized by a distance parameter R, which is a measure of the jet size in rapidity-azimuth plane. Anti-k T -1 -JHEP12(2020)082 jets with distance parameter R are referred to as AKn jets, where R = 0.1n. The CMS collaboration [11] has reported measurements at center-of-mass energy ( √ s) of 7 TeV [12] and 8 TeV [13] using AK5 and AK7 jets. The CMS results at √ s = 13 TeV for AK4 and AK7 jets are reported in ref. [14]. After the application of a correction for NP and electroweak effects, the results for AK7 jets are well described by next-to-leading order (NLO) calculations based on the nlojet++ [15] program used in the fastnlo software package [16]. The prediction from the powheg [17] generator, which also computes matrix elements at NLO and is used with parton showering simulated with pythia8 [18] or herwig++ [19], describes results well for both AK7 and AK4 jets. However, the ATLAS collaboration has measured the production cross sections for both AK4 and AK6 jets and finds a discrepancy between the measured results and the powheg prediction [20]. The ATLAS collaboration has also compared the measurements of inclusive jet production at √ s = 13 TeV with the next-to-next-to-leading order (NNLO) prediction in pQCD [21].
The measurement of a jet production cross section as a function of the distance parameter is sensitive to the details of the theoretical modeling of the perturbative and NP processes involved in the evolution of the partons. The measurement of the ratio of cross sections with two jet sizes was first performed by the ALICE collaboration with AK2 and AK4 jets [22]. A similar study was also produced by the CMS collaboration with AK5 and AK7 jets [23]. We explore this topic further in the present paper by extending the measurement to various values of jet size. Recently, ALICE collaboration has also measured both the absolute cross sections of inclusive jet production and the ratio of cross sections for R = 0.1-0.6 in 20 < p T < 140 GeV [24]. Dependence of inclusive jet production on the distance parameter is also studied in detail in ref. [25].
Quarks and gluons radiate secondary gluons that can be emitted outside of the catchment area of the jet definition, which is the region in rapidity-azimuth plane contributing to the jet. This lost p T is calculated using a QCD splitting function, with the leading-order (LO) result [26][27][28] in the small-R approximation (R 1) for quark-initiated jets and for gluon-initiated jets.
Here C F = 4 3 and C A (= 3) are the Casimir factors for quarks and gluons respectively, T R = 1 2 is the SU(3) quantum number, and n f is the number of active quark flavors. Larger values of R capture a larger fraction of the radiation.
Properties of jets are also modified by hadronization, an NP process describing the transition of partons into hadrons. As described in ref. [29], some theoretical models parameterize the effect of hadronization by taking α S (µ) = µ l δ(µ − µ l ), where µ l is commensurate with the Landau pole, yielding -2 -JHEP12(2020)082 in the small-R limit, where C = C F (C A ) for quark (gluon) initiated jets, and A(µ l ) is related to the scale appearing in the calculations of hadronization. Losses are again minimized at larger values of R.
The algorithm defining the jets can also select particles from the underlying event, which in general involves low momentum transfer. These particles typically have low p T . The energy density (Λ UE per unit y) from these sources is approximately uniform over the jet area, and their contribution to the jet p T is approximately given [28,30] by for small R values. Since, as discussed above, the contributions of various perturbative and NP effects depend on the jet size, and because radiation and hadronization are different for jets initiated by quarks and by gluons, comparisons of jets with different cone sizes yield information about these processes, and can be used to improve theoretical calculations.
In this paper, we present measurements of the ratio of the cross section for inclusive anti-k T jets with distance parameters of R = 0.1, 0.2, . . . , 1.2 to that of AK4 jets. The results are compared with predictions from different Monte Carlo (MC) generators, involving matrix element calculations at different orders and utilizing different parton shower and hadronization models. Predictions for cross section ratios have also been obtained using a pQCD calculation at NLO that uses the following convention where R is the anti-k T jet distance parameter, and R = 0.4 is taken as the reference jet size. The terms in eq. (1.5) are differential cross sections for three-jet production and are calculated at fixed-order using nlojet++ with terms up to α 4 S [31,32]. Measurements are restricted to p T < 1588 GeV because of the large experimental uncertainty in the calibration of high energy jets, which was not optimized for the cross section ratios. JHEP12(2020)082 of 1 < p T < 10 GeV and |η| < 1.4, the track resolutions are typically 1.5% in p T and 25-90 (45-150) µm in the transverse (longitudinal) impact parameter [33]. In the barrel section of the ECAL, an energy resolution of about 1% is achieved for unconverted or lateconverting photons that have energies in the range of tens of GeV. The remaining barrel photons have a resolution of about 1.3% up to |η| = 1, rising to about 2.5% at |η| = 1.4. In the endcaps, the resolution of unconverted or late-converting photons is about 2.5%, while the remaining endcap photons have a resolution between 3 and 4% [34]. In the region |η| < 1.74, the HCAL cells have widths of 0.087 in η and 0.087 radians in azimuth (φ). In the η-φ plane, and for |η| < 1.48, the HCAL cells map on to 5 × 5 arrays of ECAL crystals to form calorimeter towers projecting radially outwards from close to the nominal interaction point. For |η| > 1.74, the coverage of the towers increases progressively to a maximum of 0.174 in ∆η and ∆φ [35]. Within each tower, the energy deposits in ECAL and HCAL cells are summed to define the calorimeter tower energies, subsequently used to provide the energies and directions of hadronic jets. When combining information from the entire detector, the jet energy resolution amounts typically to 15% at 30 GeV, 10% at 100 GeV, and 5% at 1 TeV at |η| < 0.5, while at |η| = 2.0 the jet energy resolution increases by 1-2% at low p T [36].
Events of interest are selected using a two-tiered trigger system [37]. The first level, composed of custom hardware processors, uses information from the calorimeters and muon detectors to select events at a rate of around 100 kHz. The second level, known as the high-level trigger (HLT), consists of a farm of processors running a version of the full event reconstruction software optimized for fast processing that reduces the event rate to around 1 kHz before data storage.

Jet reconstruction
The CMS particle-flow algorithm [38] reconstructs and identifies each individual particle in an event with an optimized combination of information from the various elements of the CMS detector. The energy of photons is obtained from the ECAL measurement. The energy of electrons is determined from a combination of the electron momentum at the primary interaction vertex as determined by the tracker, the energy of the corresponding ECAL cluster, and the energy sum of all bremsstrahlung photons spatially compatible with originating from the electron track. The energy of muons is obtained from the curvature of the corresponding track as determined using the tracker and the muon system. The energy of charged hadrons is determined from a combination of their momentum measured in the tracker and the matching ECAL and HCAL energy deposits, corrected for zero-suppression effects and for the response function of the calorimeters to hadronic showers. Finally, the energy of neutral hadrons is obtained from the corresponding corrected ECAL and HCAL energy deposits.
For each event, hadronic jets are clustered from these reconstructed particles (particleflow candidates) using the infrared-and collinear-safe anti-k T algorithm [10], as implemented in the FastJet package [39]. The jet momentum is determined as the vectorial sum of all particle momenta in the jet, and is found from simulation to be, on average, -4 -JHEP12(2020)082 within 5 to 10% of the momentum of the particle-level jets reconstructed using stable particles (lifetime >30 ps) excluding neutrinos, for jet p T > 50 GeV and rapidity |y| < 2.5. Additional proton-proton interactions within the same or nearby bunch crossings (pileup) can contribute additional tracks and calorimetric energy depositions, increasing the apparent jet momentum. To mitigate this effect, tracks identified as originating from pileup vertices are discarded and an offset correction [40] is applied to correct for remaining contributions [41]. Additional selection criteria are applied to each jet to remove jets potentially dominated by instrumental effects or reconstruction failures [42]. These criteria consist of the following conditions: the energy fraction of the jet carried by neutral hadrons and photons should be less than 90%, the jet should have at least two constituents, and at least one of those should be a charged hadron. This set of criteria is more than 99% efficient for genuine jets.
The missing transverse momentum vector ( p miss T ) is defined as the negative vector sum of the p T of all reconstructed particle-flow objects in an event; its magnitude is denoted using p miss T . A set of algorithms is used to reject events with anomalous high-p miss T arising from a variety of reconstruction failures, detector malfunctions, or noncollision backgrounds; these algorithms are designed to identify more than 85-90% of the spurious high-p miss T events with a misidentification rate of less than 0.1% [43]. Jet energy corrections are derived using simulated pythia inclusive jet samples discussed in section 4.2 so that the average measured transverse momentum of jets is the same as the corresponding particle-level jets. This methodology is used to derive nominal jet energy correction factors only for AK4 and AK8 jets. The nominal corrections for AK4 jets are used for AK1 to AK6 jets. For larger jet sizes (R > 0.6) the nominal correction factors derived for AK8 jets are applied. To account for the differences in the distance parameter, an extra correction factor (C R ) is determined in case of each distance parameter for the average pileup condition based purely on simulation and applied to the corresponding jets. A detailed discussion on the derivation of C R is made later in this section. Measurements of the momentum balance in dijet, photon+jet, Z+jet, and multijet events are used to determine any residual differences between the jet energy scale in data and simulation, and appropriate corrections are made [41]. The in situ techniques are based on the missing transverse momentum projection fraction method, which is fundamentally insensitive to jet size, and on particle-flow reconstruction, whose reliance on tracking and particle-flow hadron calibration further reduces differences in energy response between jets of different radii. Residual corrections are derived using only AK4 jets and applied to jets of all the sizes.
The factor C R is derived in the following way. In simulated pythia inclusive jet samples, the detector-and particle-level jets are required to be geometrically matched satisfying ∆R < (0.5×jet size); the ratio of the average detector-level jet p T to the particlelevel jet p T is calculated as a function of the particle-level jet p T for all the jet sizes and then used as an extra correction factor C R for both data and simulation. The C R factors are also derived using simulated herwig++ inclusive jet samples, so the jet energy response is calibrated to unity for all the jet sizes in herwig++ samples as well. The difference in C R factors derived using pythia and herwig++ inclusive jet samples is used to estimate -5 -JHEP12(2020)082 the systematic uncertainties in C R separately for each of the distance parameters. The C R factors are important to ensure that the jet energy resolution in simulation is properly corrected to match the data without changing the jet energy scale, and that pythia and herwig++ are on equal footing with respect to jet energy scale and resolution when unfolding the data in section 5.1. The value of the C R factor ranges from 0.95 to 1.10 depending on the energy, rapidity, and size of the reconstructed jets; this correction is significant only for very small and very large jet sizes. However, C R corrects the jet energy response for different jet sizes at the level of simulation only and no dedicated residual correction is derived for data as a function of jet size. Nevertheless, it has been checked that the difference in average p T between AK8 and AK4 jets pointing in the same direction in data is smaller than the difference of the same between pythia and herwig++ simulations.

Collision data
Proton-proton collision data collected by the CMS experiment during 2016, corresponding to an integrated luminosity of 35.9 fb −1 , are used for this analysis. The data sample is collected using single-jet triggers, which select events containing at least one AK8 jet, formed from particle-flow candidates, with p T exceeding one of the threshold values listed in table 1. Absolute trigger efficiency is measured with a tag-and-probe procedure [44] using the events having a back-to-back dijet topology, where the tag jet is matched to a single-jet trigger, and the efficiency is measured using the probe jet.
Because of limited bandwidth and storage space, only a fraction of the events satisfying the triggering condition with lower thresholds are recorded. For this reason, in each jet-p T bin, only the trigger that has the highest effective integrated luminosity and is also more than 99% efficient is used.
Offline, events are required to contain at least one jet with p T above that value for which the trigger is 99% efficient. These values are also used to define the p T bins for the measurement.
Similarly to other CMS publications [12,13], we require that p miss where the index i runs over all particle-flow candidates in the event and i p T i denotes the scalar sum of transverse momenta; this rejects calorimeter noise and a part of electroweak backgrounds from the production of W(→ lepton) + jets, Z(→ lepton) + jets, and top quarks when the top quarks decay to final states with leptons.

Simulated samples
The data are compared to predictions from several different MC generators, listed below. The pythia v8.212 [18] generator computes matrix elements only for 2 → 2 Feynman diagrams at LO; the missing orders in the perturbation series are approximated using p Tordered dipole showering. The pythia generator employs the empirical Lund string model to hadronize the partons. The NNPDF2.3 [45] LO parton distribution function (PDF) set -6 -JHEP12(2020)082  Table 1. Trigger p T thresholds and effective integrated luminosity of the HLT triggers based on AK8 jets. These triggers were not active during the initial part of data taking in 2016, thus the maximum integrated luminosity is less than 35.9 fb −1 .
is used to describe the momentum fractions carried by the partons within the incoming protons; our UE model is the CUETP8M1 tune [46] (CMS Underlying Event Tune for pythia8 based on Monash [47]), which was derived by tuning the model parameters using minimum bias data collected by the CMS collaboration. The herwig++ v2.7. 1 [19] generator also calculates only 2 → 2 scatterings, but has a different fragmentation and hadronization model than pythia. It employs angular-ordered showers to radiate the partons and a cluster model to produce the hadrons. The NNPDF3.0 LO PDF set is used, and the UE modeling is described by the CUETHppS1 tune [46].
The MadGraph (MadGraph5_amc@nlo V5 2.2.2) [48] generator provides calculations of matrix elements with up to four outgoing partons in the final state at LO. The partons are showered and hadronized with pythia combined with MadGraph, using the MLM merging scheme [49]. The NNPDF3.0 NLO PDF set and the CUETP8M1 UE tune are used here as well.
The powheg v2 [17,50] generator computes the dijet production cross section at NLO in pQCD. Successive parton showering, hadronization, and UE modeling is performed either using pythia with the CUETP8M1 tune (referred to as PH+P8 in the figures) or herwig++ with the CUETHppS1 and EE5C [51] tunes (referred to as PH+Herwig in the figures). The NNPDF3.0 NLO PDF set is used for powheg as well, and the value of the h damp parameter in powheg is 250 GeV.
The herwig 7.1.1 [52] generator, used with the NNPDF3.1 NNLO PDF set with α S (m Z ) = 0.118, also evaluates the matrix elements for dijet production at NLO, and is matched to the herwig7 parton shower using the FxFx [53] jet merging method. The CH2 tune is used to model UE. This prediction is referred to as herwig7.
Fixed-order predictions for dijet production at NLO are computed using nlojet++ within the framework of the fastnlo package. To account for the effects of hadronization, an additional correction factor is used, which will be discussed in section 5.2. This prediction is referred to as NLO⊗NP in the figures. Predictions from nlojet++ are obtained using the CT14NLO PDF set.

JHEP12(2020)082
Recently, a prediction for single-inclusive jet production using joint resummation in the threshold energy in the small-R limit has been computed at next-to-leading logarithmic (NLL) accuracy in the framework of Soft Collinear Effective Theory in refs. [9,54]; the CT14NLO PDF set is also used for this prediction, which is referred to as (NLO+NLL). This prediction is compared with the measurements reported in this paper.

Measurement of cross sections and cross section ratios
The inclusive jet cross section is calculated as 1) where N jets is the number of jets in a p T and y bin, L int is the integrated luminosity of the data set, is the product of trigger and event selection efficiencies, and ∆p T and ∆y are the bin widths in p T and y. The widths of p T bins are proportional to the jet energy resolution and increase with jet p T . The ratios of cross sections for the different jet sizes with respect to AK4 jets is calculated as the bin-by-bin quotient of the cross sections of AKn (n= 1, 2, . . . , 12) and AK4 jets respectively; in the ratios, all the terms in eq. (5.1) except N jets and cancel.

Unfolding
To correct for detector inefficiencies and resolution, a number of methods available in the RooUnfold package [55] are used to unfold the jet p T spectra. The nominal choice of unfolding technique in this paper is the D'Agostini unfolding [56] with early stopping. Up to 5-8 iterations are used depending on jet size and rapidity region. An alternative method is singular value decomposition (SVD) [57]. A third method is called bin-by-bin [58], which multiplies the particle-level spectra by the ratio between the detector-level spectra in data and simulation.
The SVD and bin-by-bin techniques are used to cross-check the result of unfolding with the D'Agostini unfolding. As an additional cross-check, unfolding is also performed using a χ 2 minimization without regularization using the TUnfold package [59].
Response matrices between p T spectra of detector-level and generator-level jets are obtained by one-to-one matching of the nearest detector-and particle-level jets, excluding matches with ∆R > (0.5 × jet size), where ∆R denotes the distance between detectorand particle-level jets in the rapidity-azimuth plane. This criterion leads to almost 100% matching efficiency between the detector-level and the particle-level jets. Response matrices are constructed, for all rapidity and jet sizes, from the CMS detector simulation based on Geant4 [60] using simulated samples from three MC event generators, pythia, herwig++, and MadGraph. For the particle-level results, response matrices based on the pythia simulation are used for the unfolding. The response matrix for AK4 jets in the first rapidity region for the pythia sample is shown in figure 1. The response matrix is diagonal, which shows that unfolding works well. Response matrix constructed from a simulation of a sample generated using pythia, for AK4 jets in the |y| < 0.5 bin (left). A correlation matrix generated after data is unfolded by the D'Agostini unfolding using pythia simulation for AK4 jets (right).
For both the D'Agostini and SVD unfolding techniques, the nearest neighbor p T bins are correlated, and the next-to-nearest bins are anti-correlated (right plot in figure 1 for AK4 jets with the D'Agostini unfolding). Next-to-next-to-nearest bins are again correlated.
Several cross-checks are made regarding the unfolding. To investigate possible bias due to the choice of MC generator used to construct the response matrices, event samples are generated using three different generators: pythia, herwig++, and MadGraph, followed by the detector simulation whose output is scaled and smeared independently for each generator to match the energy scale and resolution of jets in data. Detector-level distributions from each of the samples are unfolded using these three response matrices, and the unfolded distributions are compared to the corresponding particle-level distributions. No evidence for significant bias is observed. Similarly, the data are unfolded using response matrices from these three simulated samples; the differences among the unfolded spectra are within systematic uncertainties corresponding to the correction factor C R . The same conclusion holds when comparing the unfolded distributions obtained using different unfolding techniques, such as D'Agostini, SVD, bin-by-bin, and χ 2 minimization.

Nonperturbative corrections for fixed-order calculations
Fixed-order NLO calculations yield predictions for the partonic fields, but in experimental measurements, jets are composed of hadrons. To evolve the parton-level prediction to the hadron level, NP corrections are calculated and applied. Although generators such as pythia and herwig come with MC-based phenomenological simulation of these processes, nlojet++ does not. The impact of NP on the nlojet++ prediction is approximated as a multiplicative correction factor as follows. The NP correction is the ratio of an observable from a generator, which includes NP effects with hadronization and multiple parton interaction (MPI) processes switched on, to the same observable obtained from the same generator without NP effects, i.e., by switching off hadronization and MPI processes.
Simulated powheg+pythia (CUETP8M1 tune) and powheg+herwig++ (EE5C and CUETHppS1 tunes) samples are used to compute NP factors for all the jet sizes in all the p T and rapidity bins. The average NP correction obtained from the powheg+pythia and powheg+herwig++ (EE5C) samples is defined as the final NP correction, and the envelope of the differences is taken as its uncertainty. Figure 2 depicts the NP corrections for the cross section ratio of the AK2 and AK8 jets with respect to the AK4 jets. Hadronization corrections are larger for smaller jet sizes, and MPI introduces a larger correction for large-R jets. Because both hadronization and MPI are important for low-p T jets, the NP correction is also significant in the low-p T portion of phase space; in the high-p T region, the NP correction factor approaches 1. For AK4 jets, the corrections for hadronization and MPI almost cancel, and the resulting NP correction is close to unity throughout the p T range. At around p T = 85 GeV, the correction goes down to 0.8 for AK2 jets, and it goes up to 1.25 for AK8 jets.

Experimental uncertainties in the measurement
Multiple sources of uncertainty affect the precision of the measurement: statistical, jet energy scale (JES) uncertainties, jet energy resolution (JER) uncertainties, and uncertainties in the pileup condition. We also include systematic uncertainties corresponding to the use of JES corrections derived for one R along with the R-dependent C R factor on jets formed using another R.
To estimate the statistical uncertainty in data, the jackknife resampling [61] method is used. In this technique, ten different data samples, each containing 90% events of the full data sample, are constructed such that the removed 10% of the events are complementary for each subsample. These subsamples are chosen in such a way that they correspond to very similar phase space regions. The statistical uncertainty is the standard deviation of the ten distributions multiplied by √ 9 = 3. The resulting statistical uncertainty is roughly <1% for jet p T < 1 TeV, and increases at high jet p T . A similar procedure is followed to estimate the statistical uncertainty due to the response matrices used for unfolding. Here also, ten subsets of the simulated sample are considered, each with a nonoverlapping 10% of events removed. The distributions in data are unfolded using each subsample, and the standard deviation of ten unfolded distributions, multiplied by a factor of 3, is the -10 -JHEP12(2020)082 statistical uncertainty due to the response matrices; in this case the statistical uncertainty is roughly 0.5-1.0% for the cross section ratio throughout the p T range.
The jet energy scale corrections have a number of uncertainties corresponding to the techniques used and the amount of pileup. The JES has an uncertainty of about 1-2% in the central region [62]. The uncertainty is larger in the forward region and at low jet p T . To evaluate the uncertainty in the measurement of the cross section ratio, the JES is varied upwards and downwards by the uncertainties corresponding to different sources. The difference in the unfolded cross section ratios using the nominal and varied JES is the uncertainty. Twenty-seven different sources of JES uncertainty are considered individually and added in quadrature. The uncertainty because of JES is very similar for all the jet sizes, except for the pileup component. The uncertainties mostly cancel out in the ratio, but there is a small residual, which is about 0.5-1.0% for |y| ≤ 2.0 up to 1 TeV of jet p T and goes up to 2% for very high jet p T .
To estimate the uncertainty in the ratio of cross sections with respect to that of AK4 jets because of using JES corrections derived for one value of R with jets from other values of R and then applying the C R factor, the standard calibration factors from the AK8 jets are applied to AK1 to AK6 jets, and, for jets of other sizes, the calibration factors for the AK4 jets are used. The C R factors for jets of all sizes are derived for this scenario. The systematic uncertainties in the inclusive jet cross section ratios are evaluated using the difference between the results obtained by these two procedures. The uncertainty coming from the C R correction is more significant for larger jet sizes.
The C R calibration factors are derived using both pythia and herwig++ simulations as a function of jet p T in different rapidity bins for all jet sizes. The difference in the resulting C R corrections is an 'R-dependent' uncertainty, and it is defined such that it vanishes for AK4 jets, which is used as the reference.
The JER and its associated uncertainty are obtained from a dijet balance technique [62]. The JER in data is worse than in simulation. To match the JER in data and simulation, a spreading is added to the jets in simulation. Here also, as in the case for the JES, cross section ratios are obtained using upward and downward variations of the energy resolution factors for simulation while unfolding the data. The difference with respect to the nominal unfolding is used as an estimate of the uncertainty. The uncertainty due to JER is more important for large-R jets at low p T . The uncertainty also grows in regions of larger rapidities.
To match the pileup conditions in data and in MC simulation, pileup profile weighting is performed for the simulated samples. The weighting factors depend on the total inelastic cross section; we vary its nominal value of 67.5 mb [63] up and down by its uncertainty of 2.6% when reconstructing the response matrices, and take the difference in the unfolded data as the uncertainty. This source of systematic uncertainty is larger at low p T for large jet sizes, although its absolute value is small.
The uncertainties from different experimental sources are added in quadrature, and the total uncertainty is shown in figure 3 for the cross section ratios of the AK2 and AK8 jets with respect to the AK4 jets. In the cross section ratio, many of the systematic uncertainties almost cancel, so the final uncertainty is small. The statistical component of the uncertainty is also shown in the same figure.
The experimental systematic uncertainty at low p T and large R is dominated by the pileup uncertainty. The JER uncertainty is also larger there because of additional spreading caused by pileup. At intermediate p T , the uncertainty is dominated by the C R uncertainty; at high p T the JES dominates the experimental uncertainty because the cross sections fall very steeply and event counts are small at high jet p T . The sizes of the statistical -12 -JHEP12(2020)082 uncertainties are similar to those of the total systematic uncertainties and are dominated by data at high p T and by the uncertainty in the response matrix because of the number of MC events at intermediate p T . At low p T , the data have similar statistical uncertainties as the simulated sample, since the corresponding triggers are prescaled.
Another source of uncertainty, which is relevant only for jets with R > 0.8, is the uncertainty in the trigger efficiency correction. The AK8 single-jet triggers are not fully efficient for larger jet sizes near the trigger turn-on points for AK8 jets; an efficiency correction is applied for those jet sizes following eq. (5.1). The difference in the absolute value of the trigger efficiency from the curve used to model the variation of trigger efficiency as a function of jet p T is the uncertainty. The size of this uncertainty is 0.5-1.0% throughout the p T range.

Theoretical uncertainties
Apart from the systematic uncertainties due to experimental sources, theoretical calculations and generators have uncertainties in their predictions for the cross section ratio. For the fixed-order predictions, the contributing factors include the choice of renormalization and factorization scales (scale), the PDF uncertainty (PDF), the uncertainty from α S , and the uncertainty due to the NP corrections (NP correction).
In the matrix element computation, the coupling (α S for QCD) is evaluated at an energy scale known as the renormalization scale (µ R ). Another scale is chosen to compute the PDF, in order to resum initial-state radiation below that scale, called the factorization scale (µ F ). For the fixed-order calculations, both are set equal to the p T of individual jets. The scale uncertainty is evaluated using the following combinations of factors for (µ R , µ F ): (2,1), (1,2), (0.5, 1), (1, 0.5), (2, 2), (0.5, 0.5). The envelope of the variations is the scale uncertainty in the prediction. Scale variations in the numerator and the denominator of the ratio of cross sections are fully correlated since the underlying parton configuration is the same. This is one of the largest sources of theoretical uncertainties.
The PDFs are determined using data from several experiments. The PDFs therefore have uncertainties from the experimental measurements, modeling, and parameterization assumptions. The resulting uncertainty is calculated according to the prescription of CT14 [64] at the 90% confidence level and then scaled to the 68.3% confidence level. The PDF uncertainty is independent of jet size within statistical uncertainties, and thus cancels in the ratios.
The cross section measurement for inclusive jets depends on the value of α S . In the nlojet++ prediction, its value (0.118) is varied by ±0.001. The uncertainty is taken as the difference between the results with varied and nominal values of α S and this difference is scaled to correspond to ∆α S 0.0015, as recommended in ref. [65]. For the jet cross section ratio, the uncertainty due to the α S variation in the numerator and denominator cancels.
As mentioned in section 5.2, the envelope of the differences between the NP correction factors obtained using different parton showering algorithms to determine the NP correction is the uncertainty in the NP correction. The uncertainty is significant only at low p T .  Figure 4. Comparison of the ratio of the differential cross sections of jets of different sizes with respect to that of AK4 jets from data and from NLO predictions using powheg+pythia (CUETP8M1 tune) in the region |y| < 0.5. Colored symbols indicate data and colored lines represent prediction from simulation. Offsets by the amount written in the parentheses have been added to the corresponding data points to separate the results for different jet sizes.
All these uncertainties are added in quadrature, and are collectively referred as the theoretical uncertainty in what follows.
The correlation between the experimental and the theoretical uncertainties is not studied.

Comparison of ratio of cross sections
The ratios of cross sections with respect to the AK4 jets are shown in figure 4 in the central region (|y| < 0.5) for all the jet sizes using unfolded data and the prediction from the NLO MC generator powheg with pythia parton showering; they are offset by fixed quantities for clarity.
The NLO powheg generator, interfaced with the parton showering model, describes the data well at moderate values of jet size, but there is a deviation at low p T for very large values of jet size.
The ratios of the cross sections of inclusive AK2 and AK8 jets with respect to those of AK4 jets are computed at LO and NLO in pQCD, following eq. (1.5), with nlojet++ for the most central region (|y| < 0.5). The comparison with data is shown in figure 5. Both the LO and NLO predictions are systematically below data for AK8 jets and above data for AK2 jets. The NP correction is essential to describe the trend in data below medium jet p T -14 -JHEP12 (2020) Figure 5. Comparison of the ratios of differential cross sections for the AK2 (upper) and AK8 (lower) jets with respect to that of AK4 jets from data and pQCD predictions using nlojet++ in the region |y| < 0.5. Black symbols indicate data and colored lines represent pQCD predictions. Statistical uncertainties are shown as vertical bars for the data and the NLO⊗NP prediction. The yellowish olive region around data represents the experimental systematic uncertainty whereas the region shaded in light blue color around NLO⊗NP prediction shows the theoretical uncertainty in the prediction.
values. Also, the NLO calculation improves data-theory agreement significantly over LO, bringing data and theoretical prediction into agreement within statistical and systematic uncertainties at p T > 1000 GeV for both AK2 and AK8 jets. Resummed calculations bring the theoretical prediction even closer to the data, especially for AK8 jets. The uncertainty -15 -JHEP12(2020)082 corresponding to the resummed calculations is within 5% for cross section ratio, and is not shown here to avoid congestion in the figure.

Variation of the ratio of cross sections with jet size
The cross section is determined as a function of p T for both data and theoretical predictions. The numbers are then divided by the cross section for the AK4 jets in the same p T and rapidity window separately for data and each theoretical prediction, and presented in figure 6, in three ranges of p T for the most central (|y| < 0.5) and the most forward (1.5 < |y| < 2.0) regions as a function of jet size. Almost all the MC simulations involving resummation via parton shower can describe the trend with jet size seen in data, whereas the LO calculation exhibits different behavior. Prediction from NLO calculation, as shown in |y| < 0.5, improves significantly the description of cross section ratio, as observed in data, for small jet sizes, and lies between the LO prediction and data for large jet sizes. Analytic calculations with joint resummation, available for jet sizes up to 0.8, provide an advancement with respect to fixed-order predictions, and lead to a better agreement with data. Similar behavior is observed in all the rapidity regions reported.

Summary
A measurement has been made of the ratio of cross sections of inclusive anti-k T jets of multiple sizes with respect to jets with the distance parameter R = 0.4; this is the first such result from the CMS collaboration. Because of cancellation of many experimental and theoretical systematic uncertainties for the ratio, it is more sensitive to perturbative and nonperturbative effects than the absolute cross section measurement; the experimental systematic uncertainty in the cross section ratio is of similar size as the statistical uncertainty, whereas the theoretical uncertainty is dominated by the choice of the renormalization and factorization scales.
From the ratio measurement, we observe that the nonperturbative correction is important in describing the data at low transverse momentum. Thus, the modeling of nonperturbative effects, such as hadronization and the underlying event has a significant impact on the description of the data in different regions of phase space.
Finally, the variation of the ratio of cross sections with jet size R emphasizes the importance of the inclusion of parton showering algorithms to capture the effects of higherorder terms in the perturbation series by the resummation approach, which are absent in the case of fixed-order computation. This is also demonstrated by the analytic calculations using joint resummation in threshold for single jet production, and jet size. Therefore, this study shows the importance of final-state radiation modeled in Monte Carlo simulation to describe the data, and also implies that the differences between various parton showering and hadronization models are significant.
JHEP12 (2020) Figure 6. Comparison of the ratio of cross sections of inclusive jets of various sizes with respect to AK4 jets, as a function of jet size in different regions of jet p T in data, and for multiple theoretical predictions in rapidity bins |y| < 0.5 (left column) and 1.5 < |y| < 2.0 (right column) at particle level. When the dijet production cross section ratio is presented using pure NLO predictions for two jet sizes, the ratio becomes LO at α S ; this is quoted as LO⊗NP in the figure. Points corresponding to a particular prediction are connected via lines to guide the eye. Experimental uncertainties in the ratio of cross sections are shown with bands around the data points, whereas theoretical uncertainties are shown with the bands around the fixed-order predictions.

JHEP12(2020)082
other CMS institutes for their contributions to the success of the CMS effort. In addition, we gratefully acknowledge the computing centers and personnel of the Worldwide LHC Computing Grid for delivering so effectively the computing infrastructure essential to our analyses. Finally, we acknowledge the enduring support for the construction and operation of the LHC and the CMS detector provided by the following funding agencies: BMBWF and FWF (

JHEP12(2020)082
Thalis and Aristeia programs cofinanced by EU-ESF and the Greek NSRF; the Rachadapisek Sompot Fund for Postdoctoral Fellowship, Chulalongkorn University and the Chulalongkorn Academic into Its 2nd Century Project Advancement Project (Thailand); the Kavli Foundation; the Nvidia Corporation; the SuperMicro Corporation; the Welch Foundation, contract C-1845; and the Weston Havens Foundation (U.S.A.).
Open Access. This article is distributed under the terms of the Creative Commons Attribution License (CC-BY 4.0), which permits any use, distribution and reproduction in any medium, provided the original author(s) and source are credited.