1 Introduction

An important feature of the color interaction in quantum chromodynamics (QCD) is that the outgoing partons produced in the hard interaction continue to interfere with each other during their fragmentation phase. This phenomenon, called color coherence, manifests itself by the relative abundance of soft radiation in the region between the color connected final-state partons and the suppression of soft radiation elsewhere.

Color coherence phenomena were initially observed in \(\mathrm {e}^+\mathrm {e}^-\) collisions by several experiments at PETRA, PEP and LEP [18]. These experiments showed the coherence effect in \(\mathrm {e}^+\mathrm {e}^- \rightarrow \text {q}\bar{\mathrm{q}}\text {g}\) three-jet events through the suppression of particle production in the region between the quark and antiquark jets.

In hadron collisions, in addition to the color connection between the final-state partons, the color connection between the outgoing partons and the incoming partons must be considered. The Tevatron experiments CDF and D0 have both reported evidence for color coherence effects in measurements of the spatial correlations between neighboring jets [9, 10]. These correlations were not well reproduced by Monte Carlo (MC) simulations that use incoherent parton shower models. However, the data were successfully described by simulations that include color coherence effects through the ordering of the parton emission angles [11].

The technique originally developed by the Tevatron experiments is used to study color coherence effects in pp collisions at \(\sqrt{s}=7\) \(\,\text {TeV}\) with the Compact Muon Solenoid (CMS) detector. Events with at least three jets (called three-jet events) are selected, and these jets are ordered by their transverse momenta \(p_{\mathrm {T1}}>p_{\mathrm {T2}}>p_{\mathrm {T3}}\) with respect to the beam direction. We measure the angular correlation between the second and third jet to probe the effects of color coherence.

The CMS detector has a right-handed coordinate system with its origin at the center of the detector. The \(z\) axis points along the direction of the counterclockwise beam, \(\phi \) is the azimuthal angle in the transverse plane perpendicular to the beam, and \(\theta \) is the polar angle relative to the \(z\) axis. The pseudorapidity of the \(i\)th jet is denoted by \(\eta _i=-\ln [\tan (\theta _i/2)]\) and its azimuthal angle by \(\phi _i\).

The measured observable \(\beta \) [10] is defined as the azimuthal angle of the third jet with respect to the second jet in (\(\eta ,\phi \)) space as shown in Fig. 1. Implicitly, this can be expressed by

$$\begin{aligned} \tan \beta =\frac{|\varDelta \phi _{23} |}{\varDelta \eta _{23}}, \end{aligned}$$
(1)

where \(\varDelta \phi _{23}=\phi _3-\phi _2\) (defined so that \(-\pi \le \varDelta \phi _{23}\le \pi \)), \(\varDelta \eta _{23} = {{\mathrm{sign}}}(\eta _2) \cdot (\eta _3 - \eta _2)\), and \(0 \le \beta \le \pi \). The absolute value of \(\varDelta \phi _{23}\) in Eq. 1 and the sign of the pseudorapidity of the second jet, \({{\mathrm{sign}}}(\eta _2)\), in the definition of \(\varDelta \eta _{23}\) are introduced to map symmetric configurations around \(\varDelta \phi _{23} = 0\) or \(\eta = 0\) onto the same \(\beta \) value. For \(\varDelta \phi _{23} = 0\), \(\beta \) is defined to be zero or \(\pi \) depending on the sign of \(\varDelta \eta _{23}\) being positive or negative. In the case of \(\varDelta \eta _{23} = 0\), which cannot happen simultaneously with \(\varDelta \phi _{23} = 0\), \(\beta \) is defined to equal \(\pi /2\).

Fig. 1
figure 1

Visualization of the observable \(\beta \) in (\(\eta ,\phi \)) space using a simulated three-jet event. The sizes of the rectangular boxes are proportional to the particle energies

In a naive leading-order model the two partons are produced back-to-back in the transverse plane. One of the two partons may radiate a third parton. In the absence of color coherence effects there is no preferred direction of emission of this third parton around the radiating parton. In contrast, when color coherence effects are present, the third parton will tend to lie in the event plane defined by the emitting parton and the beam axis. Therefore, in the presence of color coherence, the third jet population along the event plane (in particular near \(\beta \approx 0\)) will be enhanced and out of the plane (\(\beta \approx \pi /2\)) will be suppressed. The color coherence effects are expected to become stronger in the region between the second jet and the remnant when the angle between them becomes smaller. Therefore the study of the \(\beta \) variable is performed in two situations: when the second jet is rather central (\(|\eta _2 | \le 0.8\)) and when the second jet is more forward (\(0.8 < |\eta _2 | \le 2.5\)).

The aims of this paper are

  • To measure the \(\beta \) distributions, normalized to the total number of events in each region, as a function of \(\beta \) separately in the central (\(|\eta _2 | \le 0.8\)) and forward region (\(0.8<|\eta _2 | \le 2.5\)):

    $$\begin{aligned} F_{\eta _2, i}(\beta ) = \frac{N_{\eta , i}}{N_{\eta }}, \end{aligned}$$
    (2)

    where \(N_{\eta }\) is the total number of events in the \(\eta _2\) region, \(N_{\eta , i}\) the number of events in the given \(i\)th \(\beta \) bin of the \(\eta _2\) region. The choice of this normalization significantly reduces the impact of experimental systematic uncertainties such as the uncertainty in the luminosity.

  • To gauge the sensitivity of the variable \(\beta \) to color coherence effects.

  • To compare our measurements to the predictions of MC event generators with various implementations of color coherence.

2 The CMS detector

A detailed description of the CMS experiment can be found elsewhere [12]; so here we describe the detector systems most relevant to the present analysis. The central feature of the CMS apparatus is a superconducting solenoid of 6 m internal diameter, providing a magnetic field of 3.8 T. Within the field volume, a silicon pixel and strip tracker, a lead tungstate crystal electromagnetic calorimeter and a brass/scintillator hadron calorimeter (HCAL) are installed. The central tracking system provides coverage up to \(|\eta |= 2.5\) in pseudorapidity and the calorimeters up to \(|\eta |=3.0\). An iron and quartz-fiber Cherenkov forward hadron calorimeter (HF) covers the pseudorapidity range \(3.0<|\eta |<5.0\).

3 Event selection

The CMS detector records events using a two-level trigger system consisting of a hardware-based level-1 (L1) trigger and a software-based high-level trigger (HLT). For this study, single jet triggers that reconstruct jets from calorimeter energy deposits at L1 and HLT are used to select events based on different \(p_{\mathrm {T}} \) jet thresholds. Five different triggers with \(p_{\mathrm {T}} \) thresholds of 30, 50, 70, 100, and 140\(\,\text {GeV}\) are used to select the events. The triggers were prescaled during the 2010 run when the associated rate exceeded the allocated band width except the highest-threshold one. Therefore, the events are split into five different bins in \(p_{\mathrm {T1}}\) with each bin containing the events collected during a period when the appropriate trigger was not prescaled. Each bin starts at \(p_{\mathrm {T}\text {min}}\) defined in such a way that the associated trigger efficiency exceeds 99 %. Table 2 lists the binning in \(p_{\mathrm {T1}}\), and, for each bin, it gives the associated trigger, the number of selected events, and the integrated luminosity for the period during which the given trigger was not prescaled.

Jets are reconstructed with the anti-\(k_{\mathrm {T}}\) algorithm [13], which is implemented in the FastJet package [14] using a distance parameter \(R = 0.5\), from a list of particle candidates reconstructed using the particle-flow (PF) algorithm. This PF algorithm [15] reconstructs all particle candidates in each event using an optimized combination of information from all CMS subdetector systems: muons, electrons (with associated bremsstrahlung photons), photons (unconverted and converted), and charged/neutral hadrons. The four-vectors of the

neutral particles are computed by assuming that they come from the primary vertex, which is defined as the vertex with the highest sum of transverse momenta of all reconstructed tracks pointing to it. The reconstructed jet energy \(E\) is defined as the scalar sum of the energies of the constituents, and the jet momentum \({{\varvec{p}}}\) is the vector sum of the momenta of the constituents. The jet transverse momentum \(p_{\mathrm {T}}\) is the component of \({{\varvec{p}}}\) perpendicular to the beam. The \(E\) and \({{\varvec{p}}}\) values of a reconstructed jet are further corrected for the response of the detector, which is obtained from MC simulations, test beam results, and pp collision data [16, 17]. The corrections account for the presence of multiple pp collisions in the same or adjacent bunch crossings (pileup interactions) using the jet area method [18].

Events are required to have a primary vertex reconstructed within 24 cm of the detector center along the beam line [19]. Additional selection criteria are applied to each event to remove any spurious jet-like features originating from isolated noise patterns in certain HCAL regions [20]. Events having at least three jets with \(p_{\mathrm {T}} >30\) \(\,\text {GeV}\) are selected. The pseudorapidity of the two leading jets must be within \(|\eta _{1} |,|\eta _{2} |\le 2.5\), while for the third jet no constraints are applied in order to avoid a bias in the \(\beta \) measurement.

To further reduce the background from misidentified jets, i.e., jets resulting from noise in the electromagnetic, hadron and/or hadron forward calorimeters, a set of tight identification criteria are applied: each jet should contain at least two particles, one of which is a charged hadron, and the jet energy fraction carried by neutral hadrons, photons, muons, and electrons should be less than 90 %. With these criteria the contamination of the sample with misidentified jets is suppressed to a level less than 1 % [15].

The dijet invariant mass of the two leading jets, \(M_{12}\), is required to exceed 220 \(\,\text {GeV}\) to ensure a back-to-back configuration. With this requirement more than 98 % of the events have \(|\varDelta \phi _{12}-\pi |<1\). Finally the distance in the (\(\eta ,\phi \)) space between the second and third jets is constrained to be \(0.5<\varDelta R_{23} = \sqrt{(\varDelta \eta _{23})^2 + (\varDelta \phi _{23})^2}<1.5\) in order to ensure a three-jet topology where the third jet is closer to the second jet.

The selections used in the analysis are summarized in Table 1. The numbers of events passing the selection criteria in each \(p_{\mathrm {T1}}\) bin are summarized in Table 2. The measured \(\varDelta \eta _{23}\) and \(\varDelta \phi _{23}\) distributions are compared to various MC models in Figs. 2 and 3. In general a reasonable agreement is observed with the different models. A study of the amount of energy collected by the HF detector indicated that there is no diffractive component in the data sample.

Fig. 2
figure 2

Observed \(\varDelta \eta _{23}\) distributions, corrected for detector effects, compared to MC predictions by pythia  6, pythia  8, herwig++, and MadGraph  + pythia  6. The MC samples are normalized to the total number of events in data

Fig. 3
figure 3

Observed \(\varDelta \phi _{23}\) distributions, corrected for detector effects, compared to MC predictions by pythia  6, pythia  8, herwig++, and MadGraph  + pythia  6. The MC samples are normalized to the total number of events in data

Table 1 Summary of the event selection
Table 2 The binning in \(p_{\mathrm {T1}}\) and, for each bin, the associated trigger, the integrated luminosity for the period during which the given trigger was not prescaled, and the number of selected events. The selection criteria are described in Table 1

4 Monte Carlo models

The reconstructed jets are compared to the predictions of four different Monte Carlo generators that simulate jet production in pp collisions at \(\sqrt{s} = 7\,\text {TeV} \). The numbers of events for all generator samples is much higher than the number of collected data events so the statistical uncertainties in the MC predictions are not visible in the figures.

The pythia [21] (version 6.422) event generator uses leading-order (LO) matrix elements to generate the 2 \(\rightarrow \) 2 hard process in perturbative QCD (pQCD) and the parton shower (PS) model to simulate higher-order processes [2224]. The PS model gives a good description of parton emission when the emitted partons are close in phase space. Events are generated with the Z2 tune for the underlying event. This Z2 tune is identical to the Z1 tune described in Ref. [25], except that Z2 uses the CTEQ6L1 [26] parton distribution functions (PDFs) of the proton in which the parton showers are ordered in \(p_{\mathrm {T}} \). The hadronization is simulated using the Lund string model [27, 28]. The older D6T tune [2931], where parton showers are ordered in \(Q^2\), is considered for comparison. The D6T tune was designed to describe the lower-energy results of UA5 and CDF. The color coherence effects are implemented in pythia  6 by means of an angular ordering algorithm where the effects can be switched on and off via the steering parameters MSTP(67) and MSTJ(50), which control the initial-state and the final-state showers, respectively.

The pythia  8 [32] (version 8.145) event generator, used with tune 4C [33], orders the parton showers in \(p_{\mathrm {T}} \) and models the underlying event using the multiple-parton interaction model from pythia  6 including initial- and final-state QCD radiation. The color coherence effects are implemented in a similar manner as for the \(p_{\mathrm {T}} \)-ordered showers in pythia  6.

The herwig++ [11, 34] (version 2.4.2) event generator takes LO matrix elements and simulates parton showers using the coherent branching algorithm with angular ordering of showers. The cluster hadronization model [35] is used in the formation of hadrons from the quarks and gluons produced in the parton shower. The underlying event is simulated using the eikonal multiple partonic scattering model [36]. The color coherence effects are implemented by the angular ordering of emissions in the parton shower using the coherent branching algorithm [37].

The MadGraph  4 [38] (version 2.24) event generator is interfaced with pythia  6 for the parton showering and the hadronization using the D6T tune and uses fixed-order matrix element calculations for the multiparton topologies. From two to four partons are considered in the final state. The color coherence for the hard jets at leading order comes from the exact QCD color amplitudes in the model. The \(k_{\mathrm {T}}\) MLM matching scheme [39] applied with a matching parameter of 60\(\,\text {GeV}\) avoids double-counting between the partons from MadGraph and the PS.

5 Measurement of the normalized \(\beta \) distribution and systematic uncertainties

The measurement of the \(\beta \) distribution is performed in two regions defined by the pseudorapidity of the second jet: the central region \(|\eta _2 | \le 0.8\) and the forward region \(0.8<|\eta _2 | \le 2.5\). The angular correlation effects considered in this analysis appear to have a reduced sensitivity to the transverse momentum of the leading jet \(p_{\mathrm {T1}}\). Consequently different \(p_{\mathrm {T1}}\) bins are merged into one single bin.

The \(\beta \) distribution in a given \(\eta _2\) region is obtained as a sum of the events weighted by the luminosity collected by the trigger used in the associated \(p_{\mathrm {T1}}\) bin. In case of MC samples the \(\beta \) distribution is obtained by summing together the events weighted by their generation level weight in a given \(\eta _2\) region. The normalized \(\beta \) distribution is then obtained by dividing the weighted number of events in a given bin of \(\beta \) by the total weighted number of events in the given \(\eta _2\) region.

In order to correct for the smearing effects induced by the detector resolution, an unfolding procedure is performed using the response matrices obtained from MC event generators. For this purpose the events generated with the MC programs (pythia  6, pythia  8, MadGraph  + pythia  6, and herwig++) are processed through a full CMS detector simulation package based on geant 4 [40].

Particle-level jets are built from the four-vectors of the MC generated particles with hadronization, but without detector effects. These jets are obtained using the same jet algorithm as for the reconstructed events. The resolutions in \(\varDelta \eta _{23}\) and \(\varDelta \phi _{23}\) are found to be of the order of 0.005 to 0.01, depending on the transverse momentum and pseudorapidity of the jets.

An iterative Bayesian unfolding technique [41] implemented in the RooUnfold package [42] is used to derive the unfolding corrections to the measured \(\beta \) distributions from the detector effects. The response matrix used to unfold the data is built using herwig++. The impact of the unfolding on the normalized distributions is typically of the order of 1 %.

Most of the systematic effects cancel out in the normalized \(\beta \) distribution, but the residual influence of several sources of systematic uncertainty has been considered:

  • The jet energy scale uncertainty is evaluated varying the jet response by 2.5–5 %, depending on the \(\eta \) and \(p_{\mathrm {T}} \) of the jets [43]. The impact of this source of systematic uncertainties is below \(1~\%\).

  • The jet energy and angular resolutions are accounted for by varying them by \({\pm }10~\%\) [44] and rebuilding the response matrices for the unfolding accordingly. The observed impact from both sources is in the range of 0.4–0.6 %.

  • The uncertainty due to the unfolding procedure is estimated by the dependence of the response matrix on the choice of MC generator, Alternative response matrices are built using alternative generators: pythia  6, pythia  8 and MadGraph  + pythia  6. The observed effect is of the order of 0.5 %.

The measurement is found to be insensitive to the number of pileup interactions within statistical fluctuations. In the data corresponding to this analysis the average number of pileup events per bunch crossing was around two. The total systematic uncertainties for each bin are about 2 %, and a list of the major uncertainties is summarized in Table 3. Each systematic source was found to be fully correlated between \(\beta \) and \(\eta _2\) bins [43, 44]. However, the various systematic sources are uncorrelated among themselves.

Table 3 Typical systematic and statistical uncertainties in the normalized \(\beta \) spectrum and the statistical errors

6 Results

The unfolded \(\beta \) distributions are shown in Fig. 4 together with the predictions from the various MC models for the central (\(|\eta _2 | \le 0.8\)) and forward (\(0.8 < |\eta _2 | \le 2.5\)) regions.

The values of the unfolded \(\beta \) distributions and their uncertainties are presented in Tables 4 and 5.

Fig. 4
figure 4

Observed \(\beta \) distributions for the data, corrected for detector effects, and for the MC generators (pythia  6, pythia  8, herwig++, and MadGraph  + pythia  6) in the central (\(|\eta _{2} | \le 0.8\)) and forward (\(0.8 < |\eta _2 | \le 2.5\)) regions. The error bars show the statistical uncertainties, while the yellow shaded bands correspond to the combined systematic uncertainty

Table 4 The unfolded \(\beta \) distributions and their uncertainties for the central region \(|\eta _2 | \le 0.8\). All uncertainties are symmetric and given in percent (\(\%\))
Table 5 The unfolded \(\beta \) distributions and their uncertainties for the forward region \(0.8<|\eta _2 | \le 2.5\). All uncertainties are symmetric and given in percent (\(\%\))

The ratios of the various MC predictions to the measured \(\beta \) distributions are shown in Fig. 5. The data exhibit a clear enhancement of events compared to the pythia and MadGraph generators near the event plane (\(\beta = 0\)) and a suppression in the transverse plane (\(\beta = \pi /2\)). The \(\chi ^2\) comparisons of data with MC simulation, taking into account the statistical and systematic correlations between different data points, are shown separately for the central and forward regions in Table 6. The number of degrees of freedom (NDF) is 17, which is the number of bins minus one to account for the constraint imposed by the normalization.

Fig. 5
figure 5

The ratio of the various MC predictions to the measured \(\beta \) distribution. The error bars show the statistical uncertainty of the data. The yellow band represents the systematic uncertainty, while the green band represents the total uncertainty

Table 6 Values of \(\chi ^2\) for comparisons of the \(\beta \) distribution for the data with the predictions of various MC generators. The number of degrees of freedom for both regions is 17

None of the models used in the analysis describes the data satisfactorily. Even though pythia  6 was adjusted with the Tevatron data, it fails to describe the LHC data since the \(\chi ^2\)/NDF is large. No significant difference is observed between the tunes D6T and Z2. The pythia  8 tune 4C generator describes the data better than pythia  6 over the entire phase space, but the disagreement in the forward region is not negligible. The herwig++ event generator describes the data better than the other MC generators in the central region, but the agreement is poor in the forward region. Finally, when MadGraph is used with the exact \(2 \rightarrow 3\) matrix element calculations at LO, the global description of the data is improved with respect to pythia  6 alone.

The impact of the color coherence effects is studied by switching them on and off for the first emission in the initial- and final-state showers in pythia  6. One can observe in Fig. 6 that the agreement between the data and the simulation deteriorates when the color coherence effects in the MC events are suppressed. More quantitatively, the \(\chi ^2\) divided by the number of degrees of freedom increases up to 7.7 in the central region and 11.5 in the forward region. The first emission in the initial- and final-state showers contributes roughly the same order. Using pythia, it has been verified that the impact of the non-perturbative component of the QCD calculation (hadronization and underlying event) is negligible for this analysis. One conclusion from this pythia study, as shown Fig. 6, is that the data clearly support larger color coherence effects than in present MC implementations.

Fig. 6
figure 6

The MC predictions for the \(\beta \) distribution from pythia  6, with and without color coherence effects in the first branching of the initial- and final-state showers, compared to the measurement. The error bars show the uncorrelated statistical uncertainty of the data. The yellow band represents the systematic uncertainty, while the green band represents the total uncertainty

7 Summary

Color coherence effects in multijet events have been studied in a sample of pp collisions corresponding to an integrated luminosity of 36 pb\(^{-1}\) , collected with the CMS detector at \(\sqrt{s} = 7\,\text {TeV} \). Distributions of the variable \(\beta \), which was previously used in similar analyses at the Tevatron, are used to measure the angular correlation between the second and third jets in transverse-momentum order, in the pseudorapidity and azimuthal angle space. The measurements, unfolded for detector effects, are compared to the predictions of the MC event generators pythia  6, pythia  8, herwig++, and MadGraph  + pythia  6 in the central and forward rapidity regions. We have shown that the variable \(\beta \) is sensitive to color coherence effects, and insensitive to the hadronization and underlying event. It is necessary to implement the color coherence effects in MC simulations to better describe the data. Although the MC models in the analysis include this effect by default, none of them describes the data satisfactorily for all \(\beta \) values. The pythia  6 expectations predict weaker color coherence effects than those observed, while pythia  8 exhibits a better agreement with the data. The MadGraph MC generator, which uses the exact \(2 \rightarrow 3\) matrix element calculations at LO matched to pythia  6 for parton showering, improves the agreement with data with respect to pythia  6 alone, while herwig++ describes the data in the central region better than the other MC generators but shows discrepancies in the forward region.