1 Introduction

Collimated streams of particles (jets) can be produced in highly energetic parton-parton interactions in proton-proton (s) collisions, and their properties are described by the theory of strong interactions, quantum chromodynamics (QCD). In the lowest order perturbative QCD (pQCD), two jets with high transverse momenta \(p_{{\mathrm{T}}}\) are produced “back-to-back” in the transverse plane. Higher order corrections lead to deviations from this configuration. Experimentally, this can be investigated by the measurement of the azimuthal separation, \(\varDelta \phi _{12} =|\phi _\text {jet1}-\phi _\text {jet2} |\), between the two leading \(p_{{\mathrm{T}}}\) jets in the transverse plane. Within the framework of pQCD, a final state with three or more partons is required for significant deviations from \(\varDelta \phi _{12} =180^\circ \). However, when deviations of \(\varDelta \phi _{12}\) from \(180^\circ \) are small, a pQCD calculation at a fixed order in the strong coupling \(\alpha _S\) becomes unstable and a resummation of soft parton emissions to all orders in \(\alpha _S\) has to be performed. This resummation is approximated through the use of parton showers in Monte Carlo (MC) event generators.

Azimuthal correlations in inclusive 2-jet events have been measured previously by the D0 Collaboration in \({\text {p}} {\bar{\mathrm{p}}} \) collisions at a center-of-mass energy of \(\sqrt{s}=1.96\,\text {Te}\text {V} \) [1, 2], in \({\text {p}} {\text {p}} \) collisions by the ATLAS Collaboration at \(\sqrt{s}=7\,\text {Te}\text {V} \) [3], and by the CMS Collaboration at \(\sqrt{s}=7\), 8, and \(13\,\text {Te}\text {V} \) [4,5,6], but none of the measurements considered in detail the region close to the back-to-back configuration. A detailed study of azimuthal correlations close to the back-to-back configuration allows a more precise test of different resummation strategies, and it is a first step towards an improved understanding of the effects of soft initial and final state gluons [7, 8]. The leading- and next-to-leading-logarithm contributions to the dijet azimuthal angular correlation have been investigated in [9,10,11]. The effects of applying a transverse momentum dependent parton showering to the dijet azimuthal angular correlation were studied in [12].

In this article measurements are reported of the normalized inclusive 2-jet distribution as a function of the azimuthal separation \(\varDelta \phi _{12} \) between the two leading \(p_{{\mathrm{T}}}\) jets (jets 1 and 2),

$$\begin{aligned} \frac{1}{\sigma _{p_{{\mathrm{T}}} ^{\mathrm{max}}}} \frac{d\sigma }{d\varDelta \phi _{12}}, \end{aligned}$$
(1)

in several intervals of the leading jet \(p_{{\mathrm{T}}}\) (\(p_{{\mathrm{T}}} ^{\mathrm{max}}\)) within the rapidity range \(|y |<2.5\). The total dijet cross section \(\sigma _{p_{{\mathrm{T}}} ^{\mathrm{max}}}\) is measured within each range of \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) integrated over the full range in \(\varDelta \phi _{12}\). The binning of the measurement presented here is much finer than that of Ref. [6]. We consider \(\varDelta \phi _{12} \) in the range \(170^{\circ } < \varDelta \phi _{12} \le 180^{\circ }\).

The inclusive 3-jet distributions, differential in \(\varDelta \phi _{12}\) and \(p_{{\mathrm{T}}} ^{\mathrm{max}}\), with the \(p_{{\mathrm{T}}}\) of third highest \(p_{{\mathrm{T}}}\) jet typically being 1-2 orders of magnitude smaller than \(p_{{\mathrm{T}}} ^{\mathrm{max}}\), are also suitable to test resummation effects arising from the presence of multiple scales in the interaction. Measurements of the inclusive 3-jet distribution normalized to \(\sigma _{p_{{\mathrm{T}}} ^{\mathrm{max}}}\) are also presented, for several ranges of \(p_{{\mathrm{T}}} ^{\mathrm{max}}\), and within \(|y |<2.5\).

The measurements are performed using data collected from p p collisions at \(\sqrt{s}=13\,\text {Te}\text {V} \) during 2016 with the CMS experiment at the CERN LHC, corresponding to an integrated luminosity of 35.9\(\,\text {fb}^{-1}\).

2 The CMS detector

The central feature of the CMS detector is a superconducting solenoid, 13 m in length and 6 m in inner diameter, providing an axial magnetic field of 3.8 T. Within the solenoid volume are a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter (ECAL) and a brass and scintillator hadron calorimeter (HCAL), each composed of a barrel and two endcap sections. Charged-particle trajectories are measured by the tracker with full azimuthal coverage within pseudorapidities \(|\eta |<2.5\). The ECAL, which is equipped with a preshower detector in the endcaps, and the HCAL cover the region \(|\eta |<3.0\). Forward calorimeters extend the pseudorapidity coverage provided by the barrel and endcap detectors to the region \(3.0<|\eta |<5.2\). Finally, muons are measured up to \(|\eta |<2.4\) by gas-ionization detectors embedded in the steel flux-return yoke outside the solenoid. A detailed description of the CMS detector together with a definition of the coordinate system used and the relevant kinematic variables can be found in Ref. [13].

3 Theoretical predictions

Simulations from leading-order (LO) and next-to-LO (NLO) MC event generators are investigated. Among the LO event generators, both pythia 8 [14] (version 8.219) and herwig++ [15] (version 2.7.1) are used for predictions because they feature different parton showering (PS) algorithms for soft and collinear parton radiation at leading-log accuracy. In pythia 8 the PS emissions cover a region of phase space ordered in x (fraction of the proton momentum carried by the parton) and the \(p_{{\mathrm{T}}}\) of the emitted parton, whereas in herwig++ the parton emissions are ordered in x and the angle of the radiated parton (angular ordering). The Lund string model [16] is used for hadronization in pythia 8  [14], whereas in herwig++ the cluster fragmentation model [17] is applied. Multiparton interactions (MPI) are simulated in pythia 8 (tune CUETP8M1 [18] with the parton distribution function (PDF) set NNPDF2.3LO [19, 20]) and in herwig++ (tune CUETHppS1 [18] with the PDF set CTEQ6L1 [21]) with parameters tuned to measurements in p p collisions at the LHC and \({\text {p}} {\bar{\mathrm{p}}} \) collisions at the Tevatron.

The \({\textsc {MadGraph}} {}5\_\text {a}{\textsc {mc@nlo}} \) [22] version 2.3.3 event generator (labelled as MadGraph in the following) interfaced with pythia 8 with tune CUETP8M1 is also used in the analysis. Processes with up to 4 final-state partons at LO accuracy are calculated using the NNPDF2.3LO PDF set. The \(k_{{\mathrm{T}}}\)-MLM matching procedure [23] is used with a matching scale of 10\(\,\text {Ge}\text {V}\).

Among the NLO event generators, predictions obtained using the powheg box library [24,25,26] (version 2) with the PDF set NNPDF3.0NLO [27] are considered. The event generators pythia 8 (tune CUETP8M1) and herwig++ (tune CUETHppS1) are used to simulate PS, hadronization, and MPI. The powheg generator in dijet mode [28], referred to as ph-2j, provides an NLO dijet calculation, which is accurate to LO for the azimuthal correlation between the leading jets. The powheg generator in three-jet mode [29] (using the MiNLO scheme [30, 31]), referred to as ph-3j, provides an NLO \(2\rightarrow 3\) calculation. For the ph-2j matrix elements (ME), a minimum \(p_{{\mathrm{T}}}\) of 100 GeV is required on the partons in the Born process, while for the ph-3j ME the minimum is lowered to 10 GeV to ensure coverage of the full phase space. These thresholds are applied to optimize the generation of events in the phase space of interest. The matching between the powheg matrix element calculations and the pythia 8 underlying event (UE) [18] simulation is performed by using the shower-veto procedure (UserHook option 2 [14]). The matching between the powheg matrix element calculations and the herwig++ UE [18] is performed by using a truncated shower [24].

Events generated by pythia 8 (tune CUETP8M1), herwig++ (tune CUETHppS1), and MadGraph interfaced with pythia 8 (tune CUETP8M1) are passed through a full detector simulation based on Geant4 [32]. The simulated events events are reconstructed with standard CMS programs.

Table 1 summarizes the theoretical predictions used in the present analysis.

Table 1 Monte Carlo event generators, parton densities, and underlying event tunes used for comparison with measurements

4 Jet reconstruction and event selection

The measurements are based on data samples collected with single-jet high-level triggers [33, 34]. The five single-jet triggers require at least one jet in the event with \(p_{{\mathrm{T}}} > 140\), 200, 320, 400, or \(450 \,\text {Ge}\text {V} \) within the full rapidity coverage of the CMS calorimetry. Table 2 shows the various \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions accessed by the various triggers and the integrated luminosity for each trigger in the analysis. Each trigger is fully efficient for jets in the corresponding \(p_{{\mathrm{T}}}\) range in Table 2.

Table 2 The integrated luminosity for each trigger sample in the analysis, and trigger used for each \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) range

Particles are reconstructed and identified using a particle-flow (PF) algorithm [35], which utilizes an optimized combination of information from the various elements of the CMS detector. Jets are reconstructed by clustering the four-vectors of the PF candidates with the infrared- and collinear-safe anti-\(k_{{\mathrm{T}}}\) clustering algorithm [36] with a distance parameter \(R=0.4\). The clustering is performed with the FastJet package [37]. To reduce the contribution to the reconstructed jets from additional p p interactions within the same bunch crossing (pileup), the charged-hadron subtraction technique [38] is used to remove tracks identified as originating from pileup vertices. The average number of pileup interactions per single bunch crossing observed in the data is about 27. The pileup contribution from neutral hadrons is corrected using a jet-area-based correction technique [39].

For this analysis, jets with rapidity \(|y |< 5.0\) are reconstructed. For both the inclusive 2- and 3-jet samples, the events are selected by requiring the two highest \(p_{{\mathrm{T}}}\) jets to have \(|y |< 2.5\) and \(p_{{\mathrm{T}}} > 100\,\text {Ge}\text {V} \). For the inclusive 3-jet events a third jet with \(p_{{\mathrm{T}}} >30\,\text {Ge}\text {V} \) and \(|y |< 2.5\) is required. Contributions from pileup are negligible because the pileup removal algorithm has an efficiency of \(\sim \)99% for jets with \(30<p_{{\mathrm{T}}} < 50\,\text {Ge}\text {V} \) and \(|y |< 2.5\) [40].

5 Measurements of the normalized inclusive 2- and 3-jet distributions

The normalized inclusive 2- and 3-jet distributions as a function of \(\varDelta \phi _{12}\) are corrected for detector resolution. We achieve this by unfolding the observables to the level of stable final-state particles. In this way, a direct comparison of these measurements to results from other experiments and to QCD predictions is possible. Particles are considered stable if their mean decay length is larger than 1 cm.

The unfolding procedure is based on the D’Agostini algorithm [41], which is implemented in the RooUnfold package [42], by using a response matrix that maps the generated jets onto the jets reconstructed by the CMS detector. The regularization (number of iterations) of the unfolding procedure is chosen by comparing the difference in \(\chi ^2\) between data and MC at detector level to that between data and MC at particle level. The consistency of the unfolding procedure is checked against the alternative TUnfold package [43, 44], which uses a least square minimization with Tikhonov regularization. Both methods provide equivalent results. The unfolding is performed in \(\varDelta \phi _{12}\). The response matrices are obtained using simulated events from the pythia 8 event generator with the tune CUETP8M1. The difference between the unfolded distributions and the distributions at detector level range from \(\sim \)1% for the low \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions up to \(\sim \)5% for the high \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions.

The sources of systematic uncertainties arise primarily from the jet energy scale calibration (JES), the jet energy resolution (JER), the \(\varDelta \phi _{12}\) resolution, and the model dependence of the unfolding matrix. The effect of migrations between \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions is very small because of the normalization of the cross sections in each \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) range and therefore is neglected.

The \(\varDelta \phi _{12}\) resolution is \({\sim }0.5^{\circ }\), as obtained from fully simulated event samples from pythia 8 and MadGraph. A bin size of \(1^{\circ }\) is a compromise between the ability to study the back-to-back region and the impact of the unfolding correction of \(\sim \)2%. In Ref. [6] the study is focused on a different \(\varDelta \phi _{12}\) region, and a coarser bin size is chosen to account for the smaller size of the data sample.

Alternative response matrices are obtained by using the \(\varDelta \phi _{12}\) resolution determined from fully simulated events. This resolution is varied by ±10%, an amount that is motivated by the observed difference between data and simulation. The resulting uncertainty is estimated to be below 1%.

An additional systematic uncertainty is caused by the dependence of the response matrix on the choice of the MC generator. Alternative response matrices are built using the herwig++ and MadGraph + pythia 8 event generators. Because this analysis uses a finer binning compared with that of Ref. [6], the sensitivity to the uncertainty in the unfolding is increased. The observed effect from bin migration is less than 2%.

The JER and shifts in the JES can cause events to migrate between the \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The JES uncertainties on the energy measurement are estimated to be 1–2% [38]. The resulting JES uncertainties in the normalized inclusive 2-jet distributions due to bin migrations are less than 2%, whereas for the normalized inclusive 3-jet distributions they are less than 3%. The effect of the JER uncertainties [38] is estimated by varying the JER parameters by one standard deviation up and down and comparing the results before and after the changes. The JER-induced uncertainties are less than 0.2% for the inclusive 2-jet \(\varDelta \phi _{12}\) measurement and below 0.4% for the normalized inclusive 3-jet measurement.

6 Comparison to theoretical predictions

In this section the measurements are compared with different theoretical predictions introduced in Section 3. In all figures displaying ratios, the solid band indicates the total experimental uncertainty and the error bars represent the statistical uncertainties from the simulation. In the figures displaying the normalized distributions, the error bars on the data represent the total experimental uncertainty and the error bars on the predictions represent the statistical uncertainty of the simulation. The uncertainties are often so small that the bars are not visible.

Fig. 1
figure 1

Normalized inclusive 2-jet distributions as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\) for different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The data are represented by the markers and the theory by histograms. Overlaid with the data are predictions from the herwig++ event generator (solid lines) and pythia 8 (dotted lines). The total experimental uncertainty is depicted as error bars on the measurement

The unfolded normalized inclusive 2-jet distribution as a function of \(\varDelta \phi _{12}\) is shown in Fig. 1, and compared with the predictions from herwig++ (solid lines ) and pythia 8 (dotted lines) for different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The distributions are strongly peaked at \(180^{\circ }\) and become steeper with increasing \(p_{{\mathrm{T}}} ^{\mathrm{max}}\). The ratio of the pythia 8, herwig++, and MadGraph + pythia 8 event generator predictions to data are depicted in Fig. 2 for the inclusive 2-jet distributions in the nine \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) ranges. Among the event generators, pythia 8 and herwig++ show the largest deviations from the measurements for the \(p_{{\mathrm{T}}} ^{\mathrm{max}} < 800\) GeV regions in the inclusive 2-jet case, and the MadGraph + pythia 8 event generator gives the best description in the same regions. The three generators show large deviations from the measurements in the \(p_{{\mathrm{T}}} ^{\mathrm{max}} > 800\) GeV regions. The nonperturbative corrections are estimated to be small (below 1.5%) by comparing the predictions from pythia 8 without the simulation of multi-parton interactions and hadronization (dashed blue curve) to the predictions from pythia 8 when these effects are included (solid blue curve). The nonperturbative correction factors are available in HepData.

Fig. 2
figure 2

Ratios of the normalized inclusive 2-jet distributions for the pythia 8, herwig++, and MadGraph + pythia 8 predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all the \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid band indicates the total experimental uncertainty and the error bars on the MC points represent the statistical uncertainty of the simulated data

The ratios of the NLO predictions to data for the unfolded normalized inclusive 2-jet distributions for the different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions are shown in Fig. 3. The NLO calculations considered are ph-2j + pythia 8, ph-2j + herwig++, and ph-3j + pythia 8. Among these NLO predictions ph-3j + pythia 8 agrees better with the data. The ph-2j + herwig++ prediction is similar to the one of ph-3j + pythia 8, except for the lowest \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) region.

Fig. 3
figure 3

Ratios of the normalized inclusive 2-jet distributions for the ph-2j + pythia 8, ph-3j + pythia 8, and ph-2j + herwig++ predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all the \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid band indicates the total experimental uncertainty and the error bars on the MC points represent the statistical uncertainty of the simulated data. The ph-3j prediction is not shown for the highest bin in \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) because of the large statistical fluctuations

Fig. 4
figure 4

Normalized inclusive 3-jet distributions as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\) for different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The data are represented by the markers and the theory by histograms. Overlaid with the data are predictions from the herwig++ event generator (solid lines) and pythia 8 (dotted lines). The total experimental uncertainty is depicted as error bars on the measurement

In Fig. 4 the unfolded normalized inclusive 3-jet distribution as a function of \(\varDelta \phi _{12}\) are compared with the predictions from herwig++ (solid lines) and pythia 8 (dotted lines) for different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The ratios of the normalized inclusive 3-jet distributions for the pythia 8, herwig++, and MadGraph + pythia 8 predictions to data are shown in Fig. 5 for the different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. In contrast to the 2-jet case, MadGraph + pythia 8 shows the largest deviations from the measurements close to \(180^{\circ }\), whereas pythia 8 and herwig++ give a good description of the data.

Fig. 5
figure 5

Ratios of the normalized inclusive 3-jet distributions for the pythia 8, herwig++, and MadGraph + pythia 8 predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all the \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid band indicates the total experimental uncertainty and the error bars on the MC points represent the statistical uncertainty of the simulated data

Fig. 6
figure 6

Ratios of the normalized inclusive 3-jet distributions for the ph-2j + pythia 8, ph-3j + pythia 8, and ph-2j + herwig++ predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid band indicates the total experimental uncertainty and the error bars on the MC points represent the statistical uncertainty of the simulated data. The ph-3j prediction is not shown for the highest bin in \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) because of the large statistical fluctuations

The ratios of the NLO predictions from ph-2j + pythia 8, ph-2j + herwig++, and ph-3j + pythia 8 to data for the normalized inclusive 3-jet distributions are shown in Fig. 6. All the considered NLO+PS predictions fail to describe the measurements close to \(180^\circ \). The predictions from ph-3j and MadGraph (Fig. 5) behave very differently, in contrast to their similar trend in the inclusive 2-jet case.

Since pythia 8, ph-2j + pythia 8, ph-3j + pythia 8, and MadGraph + pythia 8 use the same parton shower, the observed differences in the predictions can be attributed to the treatment of the additional partons present in the powheg and MadGraph ME.

In general we observe that the \(\varDelta \phi _{12}\) region close to \(180^\circ \) is not well described by the predictions. The predictions agree better with the measurements for increasing \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) and moving further away from the back-to-back region in \(\varDelta \phi _{12}\), where the contribution of resummation effects becomes smaller [10]. The fact that none of the generators is able to describe the 2- and 3-jet measurements simultaneously suggests that the observed differences (of the order of 10%) are related to the way soft partons are simulated within the PS. The observed differences between \(p_{{\mathrm{T}}}\) and angular ordered PS for the LO generators pythia 8 and herwig++ are small (Figs. 2, 5) compared to the MadGraph predictions, which can be attributed to the presence of higher order ME.

Fig. 7
figure 7

Ratios of the normalized inclusive 2-jet distributions for the ph-2j + pythia 8 predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid beige band indicates the total experimental uncertainty and the hatched band represents the total theoretical uncertainty

Fig. 8
figure 8

Ratios of the normalized inclusive 3-jet distributions for the ph-2j + pythia 8 predictions to data as a function of the azimuthal separation of the two leading jets \(\varDelta \phi _{12}\), for all \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid beige band indicates the total experimental uncertainty, the hatched band represents the total theoretical uncertainty

The theoretical calculations have an intrinsic uncertainty arising from the freedom of choice of the renormalization and factorization scales (\(\mu _r\) and \(\mu _f\)), the choice of the PDF and \(\alpha _S (m_{{{Z}}})\), and the modeling of nonperturbative effects and PS. The total theoretical uncertainty is the quadratic sum of the uncertainties from the scale, PDF, \(\alpha _S\), and PS variations. Despite the better agreement of ph-3j, the ph-2j event generator is used instead for the estimation of the scale, PDF, and \(\alpha _S\) uncertainties, because of the larger event sample. For the estimation of the PS uncertainty pythia 8 is utilized. The following four sources of theoretical uncertainties are analyzed:

  • The uncertainties due to the renormalization and factorization scales of the hard process are evaluated by varying the default choice \(\mu _r = \mu _f = p_{{\mathrm{T}}} \) of the underlying Born configuration between \(p_{{\mathrm{T}}}\)/2 and 2\(p_{{\mathrm{T}}}\). The envelope of the following seven combinations is considered: \((\mu _r/p_{{\mathrm{T}}}, \mu _f/p_{{\mathrm{T}}}) = (0.5, 0.5)\), (0.5, 1), (1, 0.5), (1, 1), (1, 2), (2, 1), and (2, 2).

  • The PDF uncertainties are evaluated according to the prescriptions for the NNPDF3.0 NLO PDF set. There are 100 replicas of the NNPDF3.0 NLO PDF set. For each replica the cross section is calculated and the uncertainty is taken as the envelope from all the replicas.

  • The uncertainty due to the value of the strong coupling \(\alpha _S\) is obtained by a variation of \(\alpha _S (m_{{{Z}}})\) by ±0.001, as recommended in Ref. [45].

  • The uncertainty due to PS is evaluated with the pythia 8 event generator by varying the default renormalization scale choice \(\mu _r = p_{{\mathrm{T}}} \) of the branching in initial state (ISR) and final state radiation (FSR) between \(\mu _r/2\) and 2\(\mu _r \). The envelope of the following nine combinations is considered: (ISR \(\mu _r/p_{{\mathrm{T}}} \) , FSR \(\mu _r/p_{{\mathrm{T}}} \)) \(= (0.5,0.5)\), (0.5, 1), (0.5, 2), (1, 0.5), (1, 1), (1, 2), (2, 0.5), (2, 1), and (2, 2).

The nonperturbative contributions (MPI and hadronization) are included in the calculations above. The uncertainty from these contributions are estimated from the different choices of the UE tune and found to be negligible.

The uncertainty from PS dominates for the normalized inclusive 2-jet distributions. It is one order of magnitude larger than the rest of the sources near \(\varDelta \phi _{12} =180^{\circ }\). On the other hand, for the normalized inclusive 3-jet distributions, the main contributions come from PS and PDF uncertainties. The predictions from ph-2j + pythia 8 and ph-2j + herwig++ (Fig. 3) show the differences from using different PS models together with different matching procedures.

Figure 7 (8) show the ratios of the ph-2j predictions to data for the normalized inclusive 2(3)-jet distributions for the different \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) regions. The solid beige band indicates the total experimental uncertainty, and the hatched band represents the total theoretical uncertainty.

For the inclusive 2-jet distributions, the theoretical uncertainty is larger than the experimental one in the region close to \(\varDelta \phi _{12} =180^{\circ }\) (Fig. 7). This is because the contribution from PS dominates in this region, and its uncertainty is large. For the inclusive 3-jet distributions (Fig. 8), the theoretical uncertainty is smaller in the region close to \(180^\circ \). In this case, the region close to \(180^\circ \) is not filled by the partons from the PS, but by the third parton from ph-2j, leading to a smaller PS uncertainty.

7 Summary

Measurements of the normalized inclusive 2- and 3-jet distributions as a function of the azimuthal separation \(\varDelta \phi _{12}\) between the two jets with the highest transverse momentum \(p_{{\mathrm{T}}}\), in the collinear back-to-back region, are presented for several \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) ranges of the leading jet. The measurements are performed using data collected with the CMS experiment at the LHC, corresponding to an integrated luminosity of 35.9\(\,\text {fb}^{-1}\) of p p collisions at a center-of-mass energy of \(13\,\text {Te}\text {V} \).

The measured \(\varDelta \phi _{12}\) distributions generally agree with predictions from pythia 8, herwig++, MadGraph + pythia 8, ph-2j + herwig++, and powheg (ph-2j and ph-3j) matched to pythia 8. Discrepancies between the measurement and theoretical predictions are as large as 15%, mainly in the region \(177^\circ< \varDelta \phi _{12} < 180^\circ \). The predictions agree better with the measurements for larger \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) and smaller \(\varDelta \phi _{12}\), where the contribution of resummation effects becomes smaller. The 2- and 3-jet measurements are not simultaneously described by any of models.

The tree-level multijet event generator MadGraph in combination with pythia 8 for showering, hadronization, and multiparton interactions, shows deviations from the measured \(\varDelta \phi _{12}\) for the inclusive 2-jet case, and even larger deviations for the 3-jet case. The pythia 8 and herwig++ predictions show deviations (up to 10%) for the 2-jet inclusive distributions, whereas their predictions are in reasonable agreement with the inclusive 3-jet distributions.

The next-to-leading-order ph-2j + pythia 8 prediction does not describe the data and a different trend compared to pythia 8 and herwig++ towards \(\varDelta \phi _{12} =180^{\circ }\) is observed. The ph-3j + pythia 8 predictions agree with the measurements except for the last bin in the low \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) intervals. The ph-2j + herwig++ prediction agrees well with the measurement in the highest \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) ranges. For the inclusive 3-jet case, ph-2j + pythia 8 performs similarly to pythia 8 and herwig++ in the whole \(\varDelta \phi _{12}\) range for high \(p_{{\mathrm{T}}} ^{\mathrm{max}}\) intervals. MadGraph + pythia 8, ph-3j + pythia 8, and ph-2j + herwig++ show deviations from the measurements of up to 15%.

The measurement of correlations for collinear back–to–back dijet configurations probes the multiple scales involved in the event and, therefore, the differences observed between predictions and the measurements illustrate the importance of improving the models of soft parton radiation accompanying the hard process.