1 Introduction

The theoretical description of multijet production in hadronic collisions is one of the key ingredients for the interpretation of the data from high-energy hadron colliders, the 1.96 TeV proton–antiproton Tevatron collider at Fermilab, and the proton–proton Large Hadron Collider (LHC) at CERN. Final states with multijets, possibly associated with electroweak gauge bosons, are in fact the dominant signature of the decay of heavy particles produced at high energy, whether in the Standard Model (top quarks and Higgs bosons), or in theories beyond the Standard Model (BSM), such as supersymmetry. The identification of these particles, and the study of their properties, requires an accurate modelling of the Standard Model (SM) sources of multijets. Great progress was achieved towards this goal in the past decade. On one side, the calculation of inclusive, parton-level, cross-sections to next-to-leading-order (NLO) in QCD has produced results for processes as complex as W+4 jets [1]. On the other, algorithms have been developed and implemented in numerical codes to provide a complete description of the hadronic final states emerging from processes with up to 6 jets, merging the exact leading-order (LO) calculation of the partonic matrix elements (ME) with the evolution, provided by so-called shower Monte Carlo (MC) codes, of the partonic shower (PS) and the subsequent hadronization of the partons in to physical hadrons.

The development of theoretical tools has been accompanied by experimental measurements, which provide the necessary validation test-bed for these calculations. Parton-level NLO calculations provide a first-principle description of inclusive final states: they have an intrinsic high degree of precision, due to the reduced dependence on the unphysical choice of a renormalization and factorization scale and, furthermore, are not subject to modelling uncertainties related to the details of the non-perturbative phase of the final state evolution. Calculations based on the merging of LO matrix elements, shower evolution and hadronization, on the other hand, while affected by the larger scale-setting uncertainty due to the LO approximation, provide a fully exclusive description of the final states, and are therefore more suitable for the experimental analyses. Their ultimate goal is not only to give reliable estimates of the inclusive jet rates and energy distributions, but also to reproduce properties of the final states such as the jet inner structure and the distribution of softer particles produced outside of the jets, including those resulting from the evolution of the fragments of the original colliding hadrons.Footnote 1 These properties, which depend on the details of the non-perturbative dynamics, can only be described through the phenomenological models embedded in the shower MC codes. The parameters of these phenomenological models need to be tuned using experimental data of some suitable observables. The factorization assumption built into any description of large-Q processes justifies the use of these same parameters in the prediction of different observables, and provides the basis for the predictive power of such tools. This assumption however must be validated with a direct comparison with data. Elements that need to be probed include the scaling with beam energy of the UE parameters, the universality of the parameters controlling the shower evolution and hadronization, and the overall independence of all parameters on the type of hard process. Deviations from the expected universality would highlight faults in the underlying modelling of effects beyond perturbative physics, or could be due to the insufficient precision of the perturbative description, in case NLO effects were to modify significantly the LO predictions. Differences compatible with the theoretical systematics of the LO approximation could however be reabsorbed by modifying the perturbative parameters that govern the LO systematics, for example the renormalization and factorization scales, or the matching variables used in the matrix-element/shower merging algorithm.

It is therefore important to understand the correlations between the effects of changing the soft and UE parameters on one side, and the perturbative parameters on the other. In this paper we present studies which demonstrate that, in the tuning of ME-PS matched predictions, it is vital that there is consistency in the treatment of α S  in both the ME and PS components. While this is a general issue for all shower MCs, we consider as an explicit example the merging of LO matrix elements with the Pythia 6 shower MC [2], as implemented in the framework of the AlpGen code [3], one of the reference tools for experimental multijet studies at the Tevatron and at the LHC. The most recent versions of Pythia (6.425) and AlpGen (2.14) codes were used for producing the results.

On the Pythia 6 side, we consider several different tune variations of the interleaved p T-ordered parton-shower model [4], focusing on the so-called “Perugia” set of tunes of [5, 6]. These range from the Perugia 0 tune (from 2009) to the Perugia 2011 updates that have been developed as part of this work, including systematic up/down variations of the shower activity (see the Appendix and [5, 6] for details). We also compare to the “DW” tune [7] of the virtuality-ordered shower model [8, 9]. For Herwig [10], we include the “Jimmy” underlying-event model [11], with default parameters. We emphasize that the qualitative conclusions presented in this paper carry over to other shower models, including the ones implemented in Pythia 8 [12, 13] and Herwig++ [14], but the quantitative aspects should still be considered limited to the particular tunes and shower models studied here. We rely on Fastjet [15] for jet clustering and have further used the Rivet-based [16] mcplots web site [17] for some of our comparisons.

The paper is organized as follows: in Sect. 2 we describe in detail the theoretical nature of the α S  consistency problem, and give a practical example of how it can be manifest in the prediction of high p T observables. In Sect. 3 we show how a simple prescription can be applied to stabilize ME-PS tunings against this problem, propose a new tune for AlpGen + Pythia 6 matched predictions, and demonstrate the behavior of this tune under tuning variations. In Sect. 4 we show that this new AlpGen + Pythia 6 tune is able to reproduce (within statistical errors) the Tevatron and LHC vector boson plus jets data. In addition we also the tune predictions to the jet shape measurements at the Tevatron and LHC. Finally we conclude in Sect. 5.

2 The importance of consistent α S treatment in ME-PS matched predictions

In this section we demonstrate that consistent treatment of α S  in ME-PS matched predictions is important in order to achieve the desired accuracy in the prediction of high p T observables. We first present the theoretical arguments behind this, and then go on to show and explain that without adopting this approach one can observe undesirable and counter-intuitive effects on experimental observables.

2.1 Theoretical background

The philosophy behind matching prescriptions such as the MLM one [18, 19] employed by AlpGen is to separate phase space cleanly into two distinct regions; a short-distance one, which is supposed to be described by matrix elements, and a long-distance one described by parton showers. In the long-distance region, real and virtual corrections, with the latter represented by Sudakov factors, are both generated by the shower and are intimately related by unitarity (for pedagogical reviews, see, e.g., [20, 21]). On the short-distance side, the real corrections are generated by the matrix elements while the virtual ones are still generated by the shower.

Much effort has gone into ensuring that the behavior across the boundary between the two regions be as smooth as possible. CKKW showed [22] that it is possible to remove any dependence on this “matching scale” at NLL precision by careful choices of all ingredients in the matching; technical details of the implementation are important, and the dependence on the unphysical matching scale may be larger than NLL unless the implementation matches the theoretical algorithm precisely [2325].

Especially when two different computer codes are used for matrix elements and showering, respectively (as when AlpGen or MadGraph [26] is combined with Pythia 6 or Herwig), inconsistent parameter sets between the two codes can jeopardize the consistency of the calculation and lead to unexpected results, as will be illustrated in the following sections.

To give a very simple theoretical example, suppose a matched matrix-element generator (MG) uses a different definition of α s than the parton-shower generator (SG). Suppressing parton luminosity factors to avoid clutter, the real corrections, integrated over the hard part of phase space, for some arbitrary final state F, will then have the form

$$ \sigma_{F+1}^{ \mathrm{incl}} = \int_{Q_F^2}^s \mathrm{d}\varPhi_{F+1}\alpha_s^{ \mathrm{MG}}|M_{F+1}|^2, $$
(1)

where we have factored out the coupling corresponding to the “+1” parton and suppressed the dependence on any other couplings that may be present in |M F+1|2. The virtual corrections at the same order, generated by the shower off F, will have the form

(2)

with P i (z) the DGLAP splitting kernels (or equivalent radiation functions in dipole or antenna shower approaches). If the two codes use the same definitions for the strong coupling, \(\alpha_{s}^{ \mathrm{SG}}=\alpha_{s}^{ \mathrm {MG}}\), then the fact that P(z)/Q 2 captures the leading singularities of |M F+1|2 guarantees that the difference between the two expressions can at most be a non-singular term. Integrated over phase space, such a term merely leads to a finite \(\mathcal{O}(\alpha_{s})\) change to the total cross section, which is within the expected precision. Indeed, it is a central ingredient in both the MLM and (L)-CKKW matching prescriptions that a reweighting of the matched matrix elements be performed in order to ensure that the scales appearing in α s match smoothly between the hard and soft regions. Thus, we may assume that the choice of renormalization scale after matching is μp T on both sides of the matching scale, where p T is a scale characterizing the momentum transfer at each emission vertex, as established by [27, 28] and encoded in the CKKW formalism [22].

In the case of the CKKW approach as implemented in the Sherpa MC framework [29], this prescription can be controlled exactly, since the matrix element and the shower evolution are part of the same computer code and hence naturally use the same α s definition. This is also true in Lönnblad’s variant [23] of the algorithm, used in Ariadne [30]. In the case of codes like AlpGen or Madgraph, on the other hand, an issue emerges. These codes are designed to generate parton-level event samples to be used with an arbitrary shower MC. Different shower MCs however use slightly different scales for the parton branchings, as a result of different approaches to the shower evolution, and may use different values of Λ QCD, as a result of the tuning of the showers and/or underlying events. A possible mismatch therefore arises in the values of α s used by the matrix-element calculation and those used by the shower.

If there is a mismatch in Λ QCD or α s (M Z ), then this will effectively generate a real-virtual difference whose leading singularities are proportional to

$$ \alpha_s^2b_0\ln \biggl( \frac{\varLambda^2_{ \mathrm{MG}}}{\varLambda^2_{ \mathrm{SG}}} \biggr) \frac{ \mathrm{d}Q^2}{Q^2}\sum_i P_i(z)|M_{F}|^2, $$
(3)

which is of next-to-leading logarithmic order (unless Λ MGΛ SG, in which case it vanishes). Similarly, even if both matrix-element and shower codes are using the same Λ QCD, but they use different running orders, then there will be an \({\mathcal{O}}(\alpha_{s}^{3} \ln(p_{\mathrm{T}}^{2}/\varLambda^{2}))\) mismatch, which may also become large if p TΛ.

To be more concrete, let us consider a specific example. Compare (A) a matched MG+SG calculation which uses the same Λ QCD value on both sides of the matching to (B) a calculation in which the value used on the MG side is reduced to half its previous value but the SG one remains the same, as summarized by the two first columns of Table 1. Going from case A to B, the following changes result:

  1. 1.

    The number of (F+1) states added by the MG decreases, due to the lowering of the Λ QCD value on the MG side, while the number of surviving F states remains constant, since the shower Sudakov is not modified. The total estimated cross section therefore decreases.

  2. 2.

    At the differential level, the smaller number of (F+1) states combined with the unchanging number of F states implies smaller absolute jet cross sections and smaller fractions σ jet/σ tot.

Similarly we may consider what happens if (C) we reduce the Λ QCD value on the SG side instead, as summarized in the last column of Table 1. Going from case A to C, the following changes result:

  1. 1.

    Since the SG generates fewer branchings, the lesser amount of radiation will increase the fraction of F states surviving the Sudakov veto, and will lead to narrower jets, increasing the fraction of reconstructed jets at any given fixed p T value. Given that the total number of (F+1) states generated at the parton level remains constant (Λ QCD remains the same on the MG side) the total cross section increases.

  2. 2.

    Since both the total cross section and the number of reconstructed additional jets also increase, jet fractions can either increase or decrease.

In particular, note the somewhat counter-intuitive effect that decreasing the shower α s value actually increases the jet rates and hardens their p T spectrum in a matched calculation, while it normally decreases them in a standalone shower calculation.

Table 1 The three cases, A, B, and C discussed in the text, for an arbitrary reference Λ value

Since, as was discussed above, inconsistencies among the choices on the two sides can lead to differences at the NLL level, it is obviously important to ensure that they are consistent within a reasonable margin. This is particularly true in the context of event-generator tuning, in which specifically the NLL components of the shower description are sought to be optimized with respect to measured data, and hence changes at this level could effectively destroy the tuning.

Finally, we remind the reader that a change in Λ QCD can be interpreted as a change in the opposite direction of the renormalization scale argument (for constant Λ QCD), modulo small flavor threshold effects that we shall ignore here. This is easy to realize from the definition of the coupling,

$$ \alpha_s\bigl(k \mu^2\bigr)\stackrel{ \mathrm{1-loop}}{=}\frac{1}{b_0\ln (k\mu^2/\varLambda^2 )} = \frac{1}{b_0\ln (\mu^2/(\frac{1}{k}\varLambda^2) )}. $$
(4)

Thus, we may write renormalization scale variations (e.g., by a factor of 2 in each direction) either by applying a prefactor directly on the renormalization scale argument of α s or by applying the inverse of that factor to Λ QCD while keeping the renormalization scale argument unchanged. Due to the technical structure of the codes, the former is more convenient in AlpGen (via the ktfac setting) whilst the latter is more convenient for Pythia 6.

2.2 Examples of the interplay between tunes and matching

In this section we give several examples of how the issues in ME-PS matching described in Sect. 2 can affect high-p T observables using AlpGen interfaced to Pythia 6 with DW [7], Perugia 0 (P0) [5, 31] and Perugia 2010 (P2010) tunes [6]. Details about these tunes, and others mentioned below, are collected in the Appendix, Sect. A.2.

In Fig. 1 we show the ratio of predictions for the transverse energy (E T ) spectrum of the leading jet (that jet with the highest E T per event) in W+jet final states at the Tevatron, obtained by the merging of AlpGen with different shower codes; Herwig, Pythia 6 virtuality-ordered shower (DW), and p T-ordered shower (Perugia 0). The differences between Herwig and Herwig plus Jimmy at small E T can be explained by the different amounts of energy that, in the various cases, are deposited by the UE in the jet cones. In particular, as shown in Fig. 2, these differences can accommodate the slight shape discrepancy between data and AlpGen + Herwig that was noted, at small E T , in the CDF study [32]. It is difficult, however, to attribute to the UE energy the significant differences seen in Fig. 1 at large E T .

Fig. 1
figure 1

Ratio of predictions for the leading-jet E T spectrum in W+jets final states at the Tevatron, obtained with AlpGen plus various MC codes and tunes. The leading jet observable is defined at the particle-level as in the CDF W+jets analysis [32]

Fig. 2
figure 2

Comparison of CDF data [32] with the leading-jet E T spectrum predicted by AlpGen plus various MC codes and tunes

In order to investigate the source of the differences in the predictions, systematic parameter variations of the perturbative and non-perturbative model components of Pythia 6 have been studied using the Perugia family of Pythia 6 tunes with Perugia 0 as the central tune and Perugia Hard and Perugia Soft as the systematic variation tunes. The Perugia Soft and Perugia Hard tunes both use the same Parton Density Function (PDF) as Perugia 0, CTEQ5L [33], but differ in the values of Pythia 6 parameters controlling both perturbative and non-perturbative activity levels. In comparison to Perugia Soft, the Perugia Hard tune has more perturbative (initial and final state radiation) activity but less non-perturbative (multiple interactions, beam remnant and hadronization) activity. Perugia Soft on the other hand has less perturbative but more non-perturbative activity than the Perugia 0 tune. In order to investigate the interplay of the tuning variations with the MLM matching, the effect of the change of the tune on the physics observables in both the Pythia 6 standalone case and AlpGen + Pythia 6 case is presented.

In Fig. 3 the distribution of jet multiplicity (N jet) in W+jets events is compared for events generated with the Perugia 0, Perugia Hard and Perugia Soft tunes. The N jet observable is defined at the particle-level according to the definition used in the ATLAS measurement of the W+jets cross-section at \(\sqrt{s}=7~\mathrm{TeV}\) [34]. Jets are clustered from stable particles using the anti-Kt jet algorithm [35] with the radius parameter R=0.4, and considered in case they satisfy the following kinematic cuts: p T>20 GeV and |η|<2.8. Comparisons are performed for both the Pythia 6 standalone (left) and AlpGen + Pythia 6 (right) cases. For the Pythia 6 standalone case we observe that Perugia Hard tune yields more high-p T jets than the Perugia 0 tune while Perugia Soft yields less final state jets correspondingly. For the AlpGen + Pythia 6 case an opposite trend is observed: Perugia Hard tune yields less high-p T jets and Perugia Soft tune yields more high-p T jets. In order to determine which modelling components of the Perugia Soft and Perugia Hard tunes cause this behavior, we considered the effect of varying individual sets of parameters of the Perugia Soft and Perugia Hard tunes in AlpGen + Pythia 6 predictions. Parameters were grouped according to the modelling aspect they control into Initial State Radiation (ISR), Final State Radiation including the FSR from the ISR partons (FISR), the Underlying Event (UE) and Color Reconnections (CR) blocks.Footnote 2

Fig. 3
figure 3

Jet (p T>20 GeV and |η|<2.8) multiplicity distribution in W+jets electron channel events in pp collisions at 7 TeV. Distributions are shown for the samples generated with Pythia 6 standalone (left) and with AlpGen + Pythia 6 (right). For each case the distributions obtained when using Perugia 0 (P0), Perugia Hard (Phard) and Perugia Soft (Psoft) tunes are shown. All distributions are scaled so that the value of the first bin agrees with the ATLAS measurement [34]

Dedicated samples where only parameters of an individual block were varied in the ranges used in Perugia Soft and Perugia Hard tunes on top of the Perugia 0 tune were produced. The results of the study, in terms of the cross-section contribution of each AlpGen sub-sample (after MLM matching), are given in Table 2. As was already noted in Fig. 3, the cross-section for multijet production in the Perugia Hard case decreases with respect to Perugia 0, and vice versa for Perugia Soft. From Table 2, we see that the parameter blocks that produce this effect are the ISR and FISR blocks, while the impact of the CR and UE block variations on the cross-sections is negligible. In addition to the simultaneous variations of parameters in the blocks, we have also performed individual parameter variations for each of the parameters in order to check that potential correlations between the parameters do not affect the conclusions. Studies have also been performed for the Hadronization and Beam Remnant blocks of [6]. The variations of these parameters also had a negligible effect on the kinematic distributions and cross-section values.

Table 2 Impact of different variations of the Pythia 6 Perugia 0 tunes on the cross sections of AlpGen W+jets sub-samples with different matrix element parton multiplicities, and the total inclusive W cross section. For the studies sub-samples with up to four additional partons from the matrix element were generated. The matching is performed inclusively for the highest parton multiplicity sub-sample and exclusively for other sub-samples. The tabulated cross-sections were extracted after the MLM matching and parton shower. The errors shown are statistical only. The parameter settings of the various setups are discussed in the text

In Fig. 4 we demonstrate that the increased parton shower activity can indeed lead to the reduced cross-section (and softer jet spectra) due to the increased rates at which the AlpGen + Pythia 6 events are vetoed during the MLM matching. In the figure the distributions of the events that pass or fail (ISVETO=0 or ISVETO≠0) the MLM matching criterion are shown for the exclusive sub-sample of AlpGen + Pythia 6 Perugia 2010 W+jets events with exactly three additional partons from the matrix element in the final state.Footnote 3 Each of the distributions is normalized to unit area. The distributions are shown as a function of the largest p T shower emission from the initial state radiation (left) and as a function of the largest p T multiple proton–proton interaction.Footnote 4 In the left hand side figure we see that the events are rejected with higher probability, the larger the p T of the hardest ISR branching in the event. Therefore, a Pythia 6 standalone tune which increases the ISR activity can, somewhat counter-intuitively, reduce the rate for multijet and hard emissions. In the right hand side we demonstrate that the events are accepted and rejected independently of the transverse momentum of the hardest multiple interaction in the event (which is the desired behavior of the matching application used with the parton shower code).

Fig. 4
figure 4

Distribution of the probabilities for the event acceptance (ISVETO=0) or rejection (ISVETO≠0) during the MLM matching step, as a function of the largest p T shower emission from the initial state radiation (left) and the largest p T multiple proton–proton interaction in the event (right). The events were generated using exclusive sub-sample of AlpGen + Pythia 6 Perugia 2010 W+jets events with exactly three additional partons from the matrix element in the final state for pp collisions at 7 TeV and AlpGen + Pythia 6 Perugia 2010 tune

To conclude, the origin of the differences observed in the predictions of tunes with different ISR/FSR activity matched to AlpGen is rather due to the mismatch between the jet-emission probability predicted by the matrix elements and by the shower. This comes from the mismatch in the value of α S discussed earlier, arising from different values of Λ QCD or from the use of a different evolution variable in the shower. If the value of α S in the shower increases, the emission rate of additional jets during the shower evolution will increase. Since the matching algorithm rejects events with extra jets generated by the shower, to replace them with events where the jet is accounted for by a higher-order matrix element calculation, a larger value of α S in the shower leads to a higher rejection rate. Unless this change in α S is accompanied by a similar change in the matrix element calculation, the additional rejection is not compensated by the relative increase in rate for the higher-order parton-level contributions, leading to the effects reported in this section.

This important interplay between MC parameters, which are typically tuned to “soft” observables such as UE or the small-p T DY spectrum, and the performance of the matching algorithms for “hard” observables, calls for particular attention when adopting new UE tunes in the framework of multijet studies with matrix-element matching. Along the same lines, it should be kept in mind that, tuning a stand-alone shower MC to better model multijet final states, will force it to emulate effects present in the multiparton matrix elements. Using such tunes with matrix-element matching therefore requires ad-hoc modifications of the matching algorithm, or of its parameters.

3 Stabilizing ME-PS matched tunings

In this section we discuss how to overcome the problems discussed in the previous section with a simple prescription, and outline a tuning strategy that should allow to consistently optimize, in the context of the Pythia 6 shower MC, the description of both the UE and the high-E T properties of final states.

3.1 A new AlpGen + Pythia 6 α S  consistent tune

As it was explained in Sect. 2.1, and practically demonstrated in Sect. 2.2, it is highly desirable to have a consistent treatment of α S on either side of the ME and PS boundary. In the Appendix, Sect. A.1 the relevant settings for a new α S  consistent AlpGen + Pythia 6 tune are described in detail. In this tune, the α S  consistency is essentially ensured by setting the effective value of Λ QCD to be the same throughout the Pythia 6 parton shower algorithms and in the AlpGen matrix elements. Note that AlpGen will still use the \(\varLambda_{\overline{ \mathrm{MS}}}\) value given by the PDF set, for all powers of α S associated with the lowest-order process.Footnote 5 That is, the ‘tuned’ Λ QCD value is only used for the additional emissions, analogously to the CMW Λ value used in parton showers [28]. A consistent choice for Λ QCD of

$$ \varLambda^{(5)}_{ \mathrm{QCD}} \sim0.26, $$
(5)

is made, where the superscript indicates the number of flavors. This choice is informed by comprehensive Professor tunings [36, 37] of the p T-ordered shower in Pythia 6 [4] to event shapes and other LEP data. Note that the settings for Pythia 6 are those of the central Perugia 2011 (P2011) tune [6], which was inspired by these studies. We will refer to this new tune of AlpGen + Pythia 6 as the Perugia 2011 “matched” tune.

3.2 Tests of the consistent α S  approach: behavior under scale variations

In this section we study the behavior of the new AlpGen + Pythia 6 Perugia 2011 “matched” tune under Λ QCD variations to demonstrate that, with a consistent treatment of α S , the expected behavior of ME-PS matched predictions under variations of tuning parameters is restored. W+jets events selected with the same criteria applied for Fig. 3 are used. Figure 5 shows the jet multiplicity (left) and leading jet transverse momentum (right) distributions for the Perugia 2011 “matched” tune and four variant tune samples generated with different Λ QCD values. Two samples, labeled as “Λ Alp. ↑” and “Λ Alp. ↓”, have Λ QCD respectively increased and decreased by a factor of 2 only in the ME calculation. This is achieved by setting respectively the AlpGen parameter ktfac to 1/2 and 2. The increase (decrease) of the Λ QCD value in AlpGen results in more (less) jets and a harder (softer) leading jet spectrum as shown in Fig. 5. The two samples labeled as “Λ PS ↑,Λ Alp. ↑” and “Λ PS ↓,Λ Alp. ↓” correspond to a consistent variation of Λ QCD both in the ME and PS, with Λ QCD respectively increased and decreased by a factor of 2. The impact of these variations is qualitatively similar to the case where Λ QCD is only varied in the ME, restoring the expected behavior of ME-PS matched prediction under variation of Λ QCD. However, the samples with Λ QCD varied simultaneously in the ME and in the PS exhibit a smaller deviation from the nominal sample. The mitigation of the impact of a Λ QCD coherent change in a ME-PS matched sample compared to the same change only in the ME calculation is due to the interplay between the radiation produced by PS and the matching algorithm, as detailed in Sect. 2.1.

Fig. 5
figure 5

Comparison of AlpGen + Pythia 6 (p T>20 GeV) jet multiplicity (left) and leading jet transverse momentum (right) distributions in W+jets electron channel events. The samples are generated using different AlpGen + Pythia 6 parameter setups described in the text

The AlpGen parameter xlclu sets the value of Λ QCD used by α S , and allows it to be adapted to variations of it in Pythia. The ktfac parameter, which allows to rescale the scale of α S , can be used to establish the range of the systematical uncertainty, or to tune the description of specific observables. Its variation, in the range 0.5<ktfac<2, accounts for a possible mismatch in the exact form of the scale used at the branching vertices by Pythia and by AlpGen.

4 Comparisons with data

In this section we demonstrate that the new Λ QCD-consistent Perugia 2011 “matched” tuning of AlpGen + Pythia 6 introduced in Sect. 3.1 compares well with recent Tevatron and LHC measurements, and that, with the arrival of improved precision measurements, there should be room for further tuning of these predictions.

4.1 Z/W+jets production

The figures that follow show comparisons of AlpGen+Pythia 6 Monte Carlo predictions to measurements of published Z+jets and W+jets processes from CDF [32, 38, 39] and W+jets from ATLAS [40].Footnote 6 These cross-section measurements are corrected for all known detector effects to particle level and compared to Monte Carlo predictions. The Monte Carlo predictions of V+jet production cross-sections are formed by clustering the stable final state particles (τ>10 ps) following parton shower and hadronization of the unweighted events. This clustering is done using the same jet algorithm as the measurement, as implemented in the Fastjet [15] package. All stable final state particles are used, with the exception of the leptons that result from the decay of the signal W or Z boson. The decay leptons are corrected for the final state QED radiation such that their 4-momentum is equivalent to that before radiation. After the events have been clustered, the restrictions on the allowed phase space of the jets and of the W/Z boson decay products are applied to be consistent with the measurement to which we are comparing. The prediction of the final AlpGen + Pythia 6 cross-sections contains contributions from V+0, 1, 2, 3 parton samples (showered with exclusive MLM matching), and V+4 parton samples (showered with inclusive MLM matching).

Two different AlpGen + Pythia 6 generations are compared to the data; the new AlpGen + Pythia 6 Perugia 2011 “matched” tune introduced in Sect. 3.1 (labeled “Alp.+Pyt. P2011”), and an AlpGen + Pythia 6 prediction using the default settings of AlpGen and the Pythia 6 DW tune (labeled “Alp.+Pyt. DW”). The ratio of the matched predictions to the data are shown and compared. Additionally, the results of variations of ktfac by factors of 0.5 and 2.0 in the AlpGen + Pythia 6 Perugia 2011 “matched” prediction are shown as solid lines. The hatched regions show the total error (statistical plus systematic) propagated to the theory/data ratio from the data measurements. The error bars on the points show the statistical error on the theoretical prediction.

In Fig. 6 we show the ratio of the predicted theory cross-sections to the data for the CDF Z+jets measurement [39]. In this measurement jets are defined by the CDF midpoint algorithm [46], with R cone=0.7 and are required to have \(p^{\mathrm{jet}}_{\mathrm{T}} > 30~\mathrm{GeV}\) and |y jet|<2.1. In Fig. 7 we show the ratio of the predicted theory cross-sections to the data for the CDF W+jets measurement [32]. In this measurement jets are defined by the CDF JetClu algorithm [47], with R cone=0.4 and are required to have p T>20 GeV and |η|<2.5. In Fig. 8 we show the ratio of the predicted theory cross-sections to the data for the ATLAS W+jets measurements [40]. In this measurement jets are defined by the anti-Kt algorithm [35], with a radius parameter R=0.4 and are required to have p T>30 GeV and |η|<4.4.

Fig. 6
figure 6

(Top) The ratio of predicted theory and CDF measured data cross-sections for the production of a Zee boson in association with at least N jet jets [39]. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥1 jet bin. (Bottom) The ratio of predicted theory and CDF measured data cross-sections for the production of a Zee boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side) as a function of jet E T . In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines

Fig. 7
figure 7

(Top) The ratio of predicted theory and CDF measured data cross-sections for the production of a W boson in association with at least N jet jets [32]. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥1 jet bin. (Bottom) The ratio of predicted theory and CDF measured data cross-sections for the production of events containing a W boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side), as a function of the leading jet E T (left hand side), and the sub-leading jet E T (right hand side). In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines

Fig. 8
figure 8

(Top) The ratio of predicted theory and ATLAS published cross-sections [40] for the production of a W boson in association with at least N jet jets. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥0 jet bin. (Bottom) The ratio of predicted theory and ATLAS data cross-sections for the production of events containing a W boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side), as a function of the leading jet p T (left hand side), and the sub-leading jet p T (right hand side). In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines

The AlpGen + Pythia 6 Perugia 2011 “matched” prediction compares well with the measured cross sections both as a function of the inclusive jet multiplicity and jet p T. In particular, the prediction correctly describes the low p T region of the differential cross section and, as will be shown in the next section, the jet sub-structure, without presenting any significant disagreement with data at high p T. This shows that it is possible to tune separately the long- and short-distance contributions of the prediction to obtain a satisfactory description of the observables in the whole experimental accessible phase space. Remarkably, the prediction describes the data both at \(\sqrt{s}=1.96~\mathrm{TeV}\) and \(\sqrt{s}=7~\mathrm{TeV}\). This illustrates that the scaling properties of the long-distance contribution with the centre of mass energy of the collision does not produce unexpected effects in the high p T region of the cross section. The variation of ktfac by factors of 0.5 and 2.0 has little effect on the shapes of the differential cross-sections, while for the inclusive N jet cross-sections it produces variations that bracket the default prediction. The ktfac parameter can therefore be used to explore the sensitivity of the prediction to a variation of the renormalization and factorization scale, other than allow tuning on data. With the statistics of the currently available measurements there is no much room for optimizing the parameters of the AlpGen + Pythia 6 Perugia 2011 “matched” prediction in a tune that better describes the measurements. However, with the arrival of more precise higher-statistics LHC measurements, this should be possible in the future.

Even though not explicitly shown here, in the relevant publications one can find similarly good agreement between the available measurements [32, 40, 48] and predictions based on AlpGen + Herwig.

4.2 Jets shapes

Finally we test the ability of the new AlpGen + Pythia 6 Perugia 2011 “matched” tune and the systematics variations to describe the jet shapes at the LHC and the Tevatron.

For the LHC, the jet shapes measured in inclusive jet production by the ATLAS collaboration [49] are taken as reference. For this measurement the jets are reconstructed using the anti-kt algorithm with the distance parameter R=0.6, the transverse momentum range 30 GeV<p T<600 GeV and rapidity in the region |y|<2.8. The jet shapes are expected to be sensitive to both perturbative (parton shower) and non-perturbative (fragmentation and underlying event) modelling aspects. We perform the data comparisons for both the AlpGen + Pythia 6 and Pythia 6 standalone cases. The samples are generated using different Pythia 6 standalone and AlpGen + Pythia 6 parameter settings as follows: for Pythia 6 standalone the P2011 and the associated systematics tunes Perugia 2011 radHi and Perugia 2011 radLo are compared. The same tunes are also used for the generating the AlpGen + Pythia 6 distributions whereby the Λ QCD values are always set to the same values in AlpGen (using ktfac) and Pythia 6 (i.e. the Perugia 2011 “matched” central settings and the systematic variations around the central settings). The setups are compared to the integral jet shape distributions as measured in the data. The integral jet shape is defined as the average fraction of the jet p T that lies inside a cone of radius r concentric with the jet cone [49]:

$$ \varPsi(r) = \frac{1}{N_{\mathrm{jet}}}\sum_{i=1}^{N_{\mathrm{jet}}}\frac{p_{\mathrm{T}} (0,r)}{p_{\mathrm{T}} (0,R)},\quad 0\leq r \leq R. $$
(6)

The sum is performed over all the N jet jets in the kinematic region of interest.

In Fig. 9 the integral jet shape distributions are compared to the ATLAS data for the jets in the transverse momentum ranges of 40–60 GeV (top) and 260–310 GeV (bottom) in the whole measured rapidity range (|y|< 2.8). We observe that both Pythia 6 standalone (left) and AlpGen + Pythia 6 (right) with P2011 provide reasonably good description of the jet shapes. Due to MLM matching the jets in the AlpGen + Pythia 6 case tend to be more narrow than in the Pythia 6 standalone case.

Fig. 9
figure 9

Comparison of the integral jet shapes as measured by ATLAS [49] with the predictions of the Pythia 6 standalone (left) and AlpGen + Pythia 6 (right) using P2011, Perugia 2011 radHi and Perugia 2011 radLo tunes. The comparisons are performed for the jets with |y|<2.8 and p T ranges of 40–60 GeV (top) and 260–310 GeV (bottom)

For the Tevatron, the shapes of jets produced in association with a Z boson as measured by CDF [50] are used. In this measurement jets are defined by the CDF midpoint algorithm [46], with R cone=0.7 and are required to have \(p^{\mathrm{jet}}_{\mathrm{T}} > 30~\mathrm{GeV}\) and |y jet|<2.1. Figure 10 shows good agreement between AlpGen + Pythia 6 and the measurement for both the DW and P2011 tunes.

Fig. 10
figure 10

A comparison of the predicted and CDF measured jet shape in Z+jets events, showing the fraction of the total energy of a jet of radius R contained within a radius r, Ψ(r), as a function of r/R. “Alp.+Pyt. DW” uses the default AlpGen parameters and the DW Pythia tune, “Alp.+Pyt. P2011” is described in Sect. 3.1 and it includes a consistent choice of Λ QCD in AlpGen and Pythia

The comparisons in Figs. 9 and 10 (as well as comparisons to the jet shapes in other kinematic regions and comparisons to further LHC measurements) reveal no major short-comings of the AlpGen + Pythia 6 Perugia 2011 “matched” tune. This tune has been developed by tuning Pythia 6 standalone, whereby the effective value of Λ QCD was set to be the same throughout the Pythia 6 parton shower in the anticipation of using it with the AlpGen matrix elements using the same effective Λ QCD value. The agreement with the measured jet shapes data could therefore potentially be improved by performing a dedicated tuning of AlpGen + Pythia 6.

5 Conclusions and outlook

We have shown that, in the context of tuning ME-PS matched predictions, it is vital that the tuning adopted ensures a consistent treatment of α S  on either side of the “matching boundary”. In the case of AlpGen + Pythia 6 matched predictions, we have outlined a simple prescription to ensure this. This can be easily generalized, and applied to the case of matching AlpGen with other shower MCs, such as Herwig. We have then given an example of such a tune that compares well to Tevatron and LHC measurements of vector boson plus multijet final states. In addition, we have shown how consistent variations around a central ME-PS matched tune can be performed, so as to define a systematic uncertainty on that prediction.

The observations made in this paper add some further insight on the more general issue of tuning MC event generators in presence of matching. It is clear that the overall tuning strategy goes well beyond what we have discussed. Further parameters of the matching procedure, such as the momentum scale at which one sets the separation/matching between matrix-element- and shower-generated jets, can play a role in the overall tuning. The matching scale dependence of the predictions, in the case of W+jets production, has been studied in detail, for all the existing tools, in Ref. [51]. This parameter does not affect the structure of jets at scales well above the matching scale, and even at smaller scales its variations lead to very small changes, consistently with the expectation that, being an unphysical parameter, the choice of matching scale should not affect the results. The renormalization scale dependence of the predictions has also been studied in [51]. Notice that a change in renormalization scale is equivalent to a change in Λ QCD, and therefore the conclusions of that work are consistent with our findings here: the main impact of a scale change is in the jet multiplicity distribution, and in the hardness of the jet p T spectra. Notice also that this implicitly underscores the potential relevance of the choice of PDF sets in the event generation: LO sets tend in fact to have a value of Λ QCD larger than NLO sets, with harder p T and multiplicity spectra.

On the shower MC side, it is well known that strong constraints arise from the precise measurements of hadronic final states in Z 0 decays. In particular, event-shape and jet-rate observables are directly sensitive to the treatment of α S  in the shower, and enter with a dominant weight in the tuning of the quarks’ shower evolution [20, 36]. In this perspective, our proposal amounts to a recipe to allow the ME generator to adapt the choice of the α S  argument, used in the CKKW α S  reweighting, to what is most suitable to match the shower evolution as tuned to Z 0-decay data. The scale parameter (which directly affects jets’ spectra) and Λ QCD (which more directly affects jets’ fragmentation properties) provide therefore two complementary knobs to allow, in principle, to reproduce the jet spectra in hadronic collisions, while preserving the good agreement with the jet fragmentation properties determined at LEP. The resulting description of the jet fragmentation properties at the LHC, where, contrary to Z 0 decays, jets are dominantly initiated by gluons, would therefore emerge as a prediction of the framework.

The results shown in this paper are consistent, within the limited statistical and systematic precision, with the available data. More compelling tests will soon be possible at the LHC, with the higher-statistics information on both spectra and jet properties (shapes, fragmentation functions, etc.), necessary to allow for tuning exercises more complete and compelling than the one presented here. We nevertheless hope that the results of this work will prove valuable in defining a new set of consistent ME-PS tunes for the precise future study of LHC multijet final states.