Importance of a consistent choice of αS in the matching of AlpGen and Pythia

We consider the impact of varying αs choices (and scales) on each side of the so-called “matching scale” in MLM-matched matrix-element + parton-shower predictions of collider observables. We explain how inconsistent prescriptions can lead to counter-intuitive results and present a few explicit examples, focusing mostly on W/Z+jets processes. We give a specific prescription for how to improve the consistency of the matching and also address how to perform consistent tune variations (e.g., of the renormalization scale) around a central choice. Comparisons to several collider processes are included to illustrate the properties of the resulting improved matching, relying on AlpGen + Pythia 6, with the latter using the so-called Perugia 2011 tunes, developed as part of this effort. Our observations, nevertheless, apply to the large class of tools where matrix-element generators are merged with independent codes for the parton-shower evolution.


Introduction
The theoretical description of multijet production in hadronic collisions is one of the key ingredients for the interpretation of the data from high-energy hadron colliders, the 1.96 TeV proton-antiproton Tevatron collider at Fermilab, and the proton-proton Large Hadron Collider (LHC) at CERN. Final states with multijets, possibly associated with electroweak gauge bosons, are in fact the dominant signature of the decay of heavy particles produced at high energy, whether in the Standard Model (top quarks and Higgs bosons), or in theories beyond the Standard Model (BSM), such as supersymmetry. The identification of these a e-mail: michelangelo.mangano@cern.ch particles, and the study of their properties, requires an accurate modelling of the Standard Model (SM) sources of multijets. Great progress was achieved towards this goal in the past decade. On one side, the calculation of inclusive, parton-level, cross-sections to next-to-leading-order (NLO) in QCD has produced results for processes as complex as W + 4 jets [1]. On the other, algorithms have been developed and implemented in numerical codes to provide a complete description of the hadronic final states emerging from processes with up to 6 jets, merging the exact leading-order (LO) calculation of the partonic matrix elements (ME) with the evolution, provided by so-called shower Monte Carlo (MC) codes, of the partonic shower (PS) and the subsequent hadronization of the partons in to physical hadrons.
The development of theoretical tools has been accompanied by experimental measurements, which provide the necessary validation test-bed for these calculations. Partonlevel NLO calculations provide a first-principle description of inclusive final states: they have an intrinsic high degree of precision, due to the reduced dependence on the unphysical choice of a renormalization and factorization scale and, furthermore, are not subject to modelling uncertainties related to the details of the non-perturbative phase of the final state evolution. Calculations based on the merging of LO matrix elements, shower evolution and hadronization, on the other hand, while affected by the larger scale-setting uncertainty due to the LO approximation, provide a fully exclusive description of the final states, and are therefore more suitable for the experimental analyses. Their ultimate goal is not only to give reliable estimates of the inclusive jet rates and energy distributions, but also to reproduce properties of the final states such as the jet inner structure and the distribution of softer particles produced outside of the jets, including those resulting from the evolution of the frag-ments of the original colliding hadrons. 1 These properties, which depend on the details of the non-perturbative dynamics, can only be described through the phenomenological models embedded in the shower MC codes. The parameters of these phenomenological models need to be tuned using experimental data of some suitable observables. The factorization assumption built into any description of large-Q processes justifies the use of these same parameters in the prediction of different observables, and provides the basis for the predictive power of such tools. This assumption however must be validated with a direct comparison with data. Elements that need to be probed include the scaling with beam energy of the UE parameters, the universality of the parameters controlling the shower evolution and hadronization, and the overall independence of all parameters on the type of hard process. Deviations from the expected universality would highlight faults in the underlying modelling of effects beyond perturbative physics, or could be due to the insufficient precision of the perturbative description, in case NLO effects were to modify significantly the LO predictions. Differences compatible with the theoretical systematics of the LO approximation could however be reabsorbed by modifying the perturbative parameters that govern the LO systematics, for example the renormalization and factorization scales, or the matching variables used in the matrixelement/shower merging algorithm.
It is therefore important to understand the correlations between the effects of changing the soft and UE parameters on one side, and the perturbative parameters on the other. In this paper we present studies which demonstrate that, in the tuning of ME-PS matched predictions, it is vital that there is consistency in the treatment of α S in both the ME and PS components. While this is a general issue for all shower MCs, we consider as an explicit example the merging of LO matrix elements with the PYTHIA 6 shower MC [2], as implemented in the framework of the AlpGen code [3], one of the reference tools for experimental multijet studies at the Tevatron and at the LHC. The most recent versions of PYTHIA (6.425) and AlpGen (2.14) codes were used for producing the results.
On the PYTHIA 6 side, we consider several different tune variations of the interleaved p T -ordered parton-shower model [4], focusing on the so-called "Perugia" set of tunes of [5,6]. These range from the Perugia 0 tune (from 2009) to the Perugia 2011 updates that have been developed as part of this work, including systematic up/down variations of the shower activity (see the Appendix and [5,6] for details). We also compare to the "DW" tune [7] of the virtualityordered shower model [8,9]. For Herwig [10], we include the "Jimmy" underlying-event model [11], with default parameters. We emphasize that the qualitative conclusions presented in this paper carry over to other shower models, including the ones implemented in PYTHIA 8 [12,13] and Herwig++ [14], but the quantitative aspects should still be considered limited to the particular tunes and shower models studied here. We rely on Fastjet [15] for jet clustering and have further used the Rivet-based [16] mcplots web site [17] for some of our comparisons.
The paper is organized as follows: in Sect. 2 we describe in detail the theoretical nature of the α S consistency problem, and give a practical example of how it can be manifest in the prediction of high p T observables. In Sect. 3 we show how a simple prescription can be applied to stabilize ME-PS tunings against this problem, propose a new tune for AlpGen + PYTHIA 6 matched predictions, and demonstrate the behavior of this tune under tuning variations. In Sect. 4 we show that this new AlpGen + PYTHIA 6 tune is able to reproduce (within statistical errors) the Tevatron and LHC vector boson plus jets data. In addition we also the tune predictions to the jet shape measurements at the Tevatron and LHC. Finally we conclude in Sect. 5.

The importance of consistent α S treatment in ME-PS matched predictions
In this section we demonstrate that consistent treatment of α S in ME-PS matched predictions is important in order to achieve the desired accuracy in the prediction of high p T observables. We first present the theoretical arguments behind this, and then go on to show and explain that without adopting this approach one can observe undesirable and counterintuitive effects on experimental observables.

Theoretical background
The philosophy behind matching prescriptions such as the MLM one [18,19] employed by AlpGen is to separate phase space cleanly into two distinct regions; a short-distance one, which is supposed to be described by matrix elements, and a long-distance one described by parton showers. In the longdistance region, real and virtual corrections, with the latter represented by Sudakov factors, are both generated by the shower and are intimately related by unitarity (for pedagogical reviews, see, e.g., [20,21]). On the short-distance side, the real corrections are generated by the matrix elements while the virtual ones are still generated by the shower. Much effort has gone into ensuring that the behavior across the boundary between the two regions be as smooth as possible. CKKW showed [22] that it is possible to remove any dependence on this "matching scale" at NLL precision by careful choices of all ingredients in the matching; technical details of the implementation are important, and the dependence on the unphysical matching scale may be larger than NLL unless the implementation matches the theoretical algorithm precisely [23][24][25].
Especially when two different computer codes are used for matrix elements and showering, respectively (as when AlpGen or MadGraph [26] is combined with PYTHIA 6 or Herwig), inconsistent parameter sets between the two codes can jeopardize the consistency of the calculation and lead to unexpected results, as will be illustrated in the following sections.
To give a very simple theoretical example, suppose a matched matrix-element generator (MG) uses a different definition of α s than the parton-shower generator (SG). Suppressing parton luminosity factors to avoid clutter, the real corrections, integrated over the hard part of phase space, for some arbitrary final state F , will then have the form where we have factored out the coupling corresponding to the "+1" parton and suppressed the dependence on any other couplings that may be present in |M F +1 | 2 . The virtual corrections at the same order, generated by the shower off F , will have the form with P i (z) the DGLAP splitting kernels (or equivalent radiation functions in dipole or antenna shower approaches). If the two codes use the same definitions for the strong coupling, α SG s = α MG s , then the fact that P (z)/Q 2 captures the leading singularities of |M F +1 | 2 guarantees that the difference between the two expressions can at most be a nonsingular term. Integrated over phase space, such a term merely leads to a finite O(α s ) change to the total cross section, which is within the expected precision. Indeed, it is a central ingredient in both the MLM and (L)-CKKW matching prescriptions that a reweighting of the matched matrix elements be performed in order to ensure that the scales appearing in α s match smoothly between the hard and soft regions. Thus, we may assume that the choice of renormalization scale after matching is μ ∼ p T on both sides of the matching scale, where p T is a scale characterizing the momentum transfer at each emission vertex, as established by [27,28] and encoded in the CKKW formalism [22].
In the case of the CKKW approach as implemented in the Sherpa MC framework [29], this prescription can be controlled exactly, since the matrix element and the shower evolution are part of the same computer code and hence naturally use the same α s definition. This is also true in Lönnblad's variant [23] of the algorithm, used in Ariadne [30]. In the case of codes like AlpGen or Madgraph, on the other hand, an issue emerges. These codes are designed to generate parton-level event samples to be used with an arbitrary shower MC. Different shower MCs however use slightly different scales for the parton branchings, as a result of different approaches to the shower evolution, and may use different values of Λ QCD , as a result of the tuning of the showers and/or underlying events. A possible mismatch therefore arises in the values of α s used by the matrix-element calculation and those used by the shower.
If there is a mismatch in Λ QCD or α s (M Z ), then this will effectively generate a real-virtual difference whose leading singularities are proportional to which is of next-to-leading logarithmic order (unless Λ MG ∼ Λ SG , in which case it vanishes). Similarly, even if both matrix-element and shower codes are using the same Λ QCD , but they use different running orders, then there will be an O(α 3 s ln(p 2 T /Λ 2 )) mismatch, which may also become large if p T Λ.
To be more concrete, let us consider a specific example. Compare (A) a matched MG + SG calculation which uses the same Λ QCD value on both sides of the matching to (B) a calculation in which the value used on the MG side is reduced to half its previous value but the SG one remains the same, as summarized by the two first columns of Table 1. Going from case A to B, the following changes result: 1. The number of (F + 1) states added by the MG decreases, due to the lowering of the Λ QCD value on the MG side, while the number of surviving F states remains constant, since the shower Sudakov is not modified. The total estimated cross section therefore decreases. 2. At the differential level, the smaller number of (F + 1) states combined with the unchanging number of F states implies smaller absolute jet cross sections and smaller fractions σ jet /σ tot .
Similarly we may consider what happens if (C) we reduce the Λ QCD value on the SG side instead, as summarized in the last column of Table 1. Going from case A to C, the following changes result: 1. Since the SG generates fewer branchings, the lesser amount of radiation will increase the fraction of F states surviving the Sudakov veto, and will lead to narrower jets, increasing the fraction of reconstructed jets at any given fixed p T value. Given that the total number of (F + 1) states generated at the parton level remains constant (Λ QCD remains the same on the MG side) the total cross section increases. 2. Since both the total cross section and the number of reconstructed additional jets also increase, jet fractions can either increase or decrease.
In particular, note the somewhat counter-intuitive effect that decreasing the shower α s value actually increases the jet rates and hardens their p T spectrum in a matched calculation, while it normally decreases them in a standalone shower calculation. Since, as was discussed above, inconsistencies among the choices on the two sides can lead to differences at the NLL level, it is obviously important to ensure that they are consistent within a reasonable margin. This is particularly true in the context of event-generator tuning, in which specifically the NLL components of the shower description are sought to be optimized with respect to measured data, and hence changes at this level could effectively destroy the tuning.
Finally, we remind the reader that a change in Λ QCD can be interpreted as a change in the opposite direction of the renormalization scale argument (for constant Λ QCD ), modulo small flavor threshold effects that we shall ignore here. This is easy to realize from the definition of the coupling, Thus, we may write renormalization scale variations (e.g., by a factor of 2 in each direction) either by applying a prefactor directly on the renormalization scale argument of α s or by applying the inverse of that factor to Λ QCD while keeping the renormalization scale argument unchanged. Due to the technical structure of the codes, the former is more convenient in AlpGen (via the ktfac setting) whilst the latter is more convenient for PYTHIA 6.

Examples of the interplay between tunes and matching
In this section we give several examples of how the issues in ME-PS matching described in Sect. 2 can affect highp T observables using AlpGen interfaced to PYTHIA 6 with DW [7], Perugia 0 (P0) [5,31] and Perugia 2010 (P2010) tunes [6]. Details about these tunes, and others mentioned below, are collected in the Appendix, Sect. A.2.
In Fig. 1 we show the ratio of predictions for the transverse energy (E T ) spectrum of the leading jet (that jet with the highest E T per event) in W +jet final states at the Tevatron, obtained by the merging of AlpGen with different shower codes; Herwig, PYTHIA 6 virtuality-ordered shower Fig. 1 Ratio of predictions for the leading-jet E T spectrum in W +jets final states at the Tevatron, obtained with AlpGen plus various MC codes and tunes. The leading jet observable is defined at the particle-level as in the CDF W +jets analysis [32] (DW), and p T -ordered shower (Perugia 0). The differences between Herwig and Herwig plus Jimmy at small E T can be explained by the different amounts of energy that, in the various cases, are deposited by the UE in the jet cones. In particular, as shown in Fig. 2, these differences can accommodate the slight shape discrepancy between data and AlpGen + Herwig that was noted, at small E T , in the CDF study [32]. It is difficult, however, to attribute to the UE energy the significant differences seen in Fig. 1 at large E T .
In order to investigate the source of the differences in the predictions, systematic parameter variations of the perturbative and non-perturbative model components of PYTHIA 6 have been studied using the Perugia family of PYTHIA 6 tunes with Perugia 0 as the central tune and Perugia Hard and Perugia Soft as the systematic variation tunes. The Perugia Soft and Perugia Hard tunes both use the same Parton Density Function (PDF) as Perugia 0, CTEQ5L [33], but differ in the values of PYTHIA 6 parameters controlling both perturbative and non-perturbative activity levels. In comparison to Perugia Soft, the Perugia Hard tune has more perturbative (initial and final state radiation) activity but less non-perturbative (multiple interactions, beam remnant and hadronization) activity. Perugia Soft on the other hand has less perturbative but more non-perturbative activity than the Perugia 0 tune. In order to investigate the interplay of the tuning variations with the MLM matching, the effect of the change of the tune on the physics observables in both the PYTHIA 6 standalone case and AlpGen + PYTHIA 6 case is presented.
In Fig. 3 the distribution of jet multiplicity (N jet ) in W +jets events is compared for events generated with the Perugia 0, Perugia Hard and Perugia Soft tunes. The N jet observable is defined at the particle-level according to the definition used in the ATLAS measurement of the W +jets crosssection at √ s = 7 TeV [34]. Jets are clustered from stable particles using the anti-Kt jet algorithm [35] with the radius parameter R = 0.4, and considered in case they satisfy   [34] the following kinematic cuts: p T > 20 GeV and |η| < 2.8. Comparisons are performed for both the PYTHIA 6 standalone (left) and AlpGen + PYTHIA 6 (right) cases. For the PYTHIA 6 standalone case we observe that Perugia Hard tune yields more high-p T jets than the Perugia 0 tune while Perugia Soft yields less final state jets correspondingly. For the AlpGen + PYTHIA 6 case an opposite trend is observed: Perugia Hard tune yields less high-p T jets and Perugia Soft tune yields more high-p T jets. In order to determine which modelling components of the Perugia Soft and Perugia Hard  2 Dedicated samples where only parameters of an individual block were varied in the ranges used in Perugia Soft and Perugia Hard tunes on top of the Perugia 0 tune were produced. The results of the study, in terms of the cross-section contribution of each AlpGen sub-sample (after MLM matching), are given in Table 2. As was already noted in Fig. 3, the cross-section for multijet production in the Perugia Hard case decreases with respect to Perugia 0, and vice versa for Perugia Soft. From Table 2, we see that the parameter blocks that produce this effect are the ISR and FISR blocks, while the impact of the CR and UE block variations on the cross-sections is negligible. In addition to the simultaneous variations of parameters in the blocks, we have also performed individual parameter variations for each of the parameters in order to check that potential correlations between the parameters do not affect the conclusions. Studies have also been performed for the Hadronization and Beam Remnant blocks of [6]. The variations of these parameters also had a negligible effect on the kinematic distributions and cross-section values. 2 The parameter blocks organization is similar to the one introduced in [6] and are listed in Sect. A.2.
In Fig. 4 we demonstrate that the increased parton shower activity can indeed lead to the reduced cross-section (and softer jet spectra) due to the increased rates at which the AlpGen + PYTHIA 6 events are vetoed during the MLM matching. In the figure the distributions of the events that pass or fail (ISVETO = 0 or ISVETO = 0) the MLM matching criterion are shown for the exclusive sub-sample of Alp-Gen + PYTHIA 6 Perugia 2010 W +jets events with exactly three additional partons from the matrix element in the final state. 3 Each of the distributions is normalized to unit area. The distributions are shown as a function of the largest p T shower emission from the initial state radiation (left) and as a function of the largest p T multiple proton-proton interaction. 4 In the left hand side figure we see that the events are rejected with higher probability, the larger the p T of the hardest ISR branching in the event. Therefore, a PYTHIA 6 standalone tune which increases the ISR activity can, somewhat counter-intuitively, reduce the rate for multijet and hard emissions. In the right hand side we demonstrate that the events are accepted and rejected independently of the transverse momentum of the hardest multiple interaction in the event (which is the desired behavior of the matching application used with the parton shower code).
To conclude, the origin of the differences observed in the predictions of tunes with different ISR/FSR activity matched to AlpGen is rather due to the mismatch between the jetemission probability predicted by the matrix elements and by the shower. This comes from the mismatch in the value of α S discussed earlier, arising from different values of Λ QCD or from the use of a different evolution variable in the shower. If the value of α S in the shower increases, the emission rate of additional jets during the shower evolution will increase. Since the matching algorithm rejects events with extra jets generated by the shower, to replace them with events where the jet is accounted for by a higher-order matrix element calculation, a larger value of α S in the shower leads to a higher rejection rate. Unless this change in α S is accompanied by a similar change in the matrix element calculation, the additional rejection is not compensated by the relative increase in rate for the higher-order parton-level contributions, leading to the effects reported in this section.
This important interplay between MC parameters, which are typically tuned to "soft" observables such as UE or the small-p T DY spectrum, and the performance of the matching algorithms for "hard" observables, calls for particular attention when adopting new UE tunes in the framework of multijet studies with matrix-element matching. Along the same lines, it should be kept in mind that, tuning a standalone shower MC to better model multijet final states, will force it to emulate effects present in the multiparton matrix elements. Using such tunes with matrix-element matching therefore requires ad-hoc modifications of the matching algorithm, or of its parameters.

Stabilizing ME-PS matched tunings
In this section we discuss how to overcome the problems discussed in the previous section with a simple prescription, and outline a tuning strategy that should allow to consistently optimize, in the context of the PYTHIA 6 shower MC, the description of both the UE and the high-E T properties of final states.

A new AlpGen + PYTHIA 6 α S consistent tune
As it was explained in Sect. 2.1, and practically demonstrated in Sect. 2.2, it is highly desirable to have a consistent treatment of α S on either side of the ME and PS boundary. In the Appendix, Sect. A.1 the relevant settings for a new α S consistent AlpGen + PYTHIA 6 tune are described in detail. In this tune, the α S consistency is essentially ensured by setting the effective value of Λ QCD to be the same throughout the PYTHIA 6 parton shower algorithms and in the AlpGen matrix elements. Note that AlpGen will still use the Λ MS value given by the PDF set, for all powers of α S associated with the lowest-order process. 5 That is, the 'tuned' Λ QCD value is only used for the additional emissions, analogously to the CMW Λ value used in parton showers [28]. A consistent choice for Λ QCD of is made, where the superscript indicates the number of flavors. This choice is informed by comprehensive Professor tunings [36,37] of the p T -ordered shower in PYTHIA 6 [4] to event shapes and other LEP data. Note that the settings for PYTHIA 6 are those of the central Perugia 2011 (P2011) tune [6], which was inspired by these studies. We will refer to this new tune of AlpGen + PYTHIA 6 as the Perugia 2011 "matched" tune. In this section we study the behavior of the new Alp-Gen + PYTHIA 6 Perugia 2011 "matched" tune under Λ QCD variations to demonstrate that, with a consistent treatment of α S , the expected behavior of ME-PS matched predictions under variations of tuning parameters is restored. W +jets events selected with the same criteria applied for Fig. 3 are used. Figure 5 shows the jet multiplicity (left) and leading jet transverse momentum (right) distributions for the Perugia 2011 "matched" tune and four variant tune samples generated with different Λ QCD values. Two samples, labeled as "Λ Alp. ↑" and "Λ Alp. ↓", have Λ QCD respectively increased and decreased by a factor of 2 only in the ME calculation. This is achieved by setting respectively the AlpGen parameter ktfac to 1/2 and 2. The increase (decrease) of the Λ QCD value in AlpGen results in more (less) jets and a harder (softer) leading jet spectrum as shown in Fig. 5. The two samples labeled as "Λ PS ↑, Λ Alp. ↑" and "Λ PS ↓, Λ Alp. ↓" correspond to a consistent variation of Λ QCD both in the ME and PS, with Λ QCD respectively increased and decreased by a factor of 2. The impact of these variations is qualitatively similar to the case where Λ QCD is only varied in the ME, restoring the expected behavior of ME-PS matched prediction under variation of Λ QCD . However, the samples with Λ QCD varied simultaneously in the ME and in the PS exhibit a smaller deviation from the nominal sample. The mitigation of the impact of a Λ QCD coherent change in a ME-PS matched sample compared to the same change only in the ME calculation is due to the interplay between the radiation produced by PS and the matching algorithm, as detailed in Sect. 2.1. The AlpGen parameter xlclu sets the value of Λ QCD used by α S , and allows it to be adapted to variations of it in PYTHIA. The ktfac parameter, which allows to rescale the scale of α S , can be used to establish the range of the systematical uncertainty, or to tune the description of specific observables. Its variation, in the range 0.5 < ktfac < 2, accounts for a possible mismatch in the exact form of the scale used at the branching vertices by PYTHIA and by Alp-Gen.

Comparisons with data
In this section we demonstrate that the new Λ QCD -consistent Perugia 2011 "matched" tuning of AlpGen + PYTHIA 6 introduced in Sect. 3.1 compares well with recent Tevatron and LHC measurements, and that, with the arrival of improved precision measurements, there should be room for further tuning of these predictions.

Z/W +jets production
The figures that follow show comparisons of AlpGen + PYTHIA 6 Monte Carlo predictions to measurements of published Z+jets and W +jets processes from CDF [32,38,39] Fig. 6 (Top) The ratio of predicted theory and CDF measured data cross-sections for the production of a Z → ee boson in association with at least N jet jets [39]. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥1 jet bin. (Bottom) The ratio of predicted theory and CDF measured data cross-sections for the production of a Z → ee boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side) as a function of jet E T . In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines and W +jets from ATLAS [40]. 6 These cross-section measurements are corrected for all known detector effects to particle level and compared to Monte Carlo predictions. The Monte Carlo predictions of V +jet production crosssections are formed by clustering the stable final state particles (τ > 10 ps) following parton shower and hadronization of the unweighted events. This clustering is done using the same jet algorithm as the measurement, as implemented in the Fastjet [15] package. All stable final state particles are used, with the exception of the leptons that result from the decay of the signal W or Z boson. The decay leptons are corrected for the final state QED radiation such that their 4momentum is equivalent to that before radiation. After the events have been clustered, the restrictions on the allowed phase space of the jets and of the W /Z boson decay products are applied to be consistent with the measurement to which we are comparing. The prediction of the final Alp-Gen + PYTHIA 6 cross-sections contains contributions from V + 0, 1, 2, 3 parton samples (showered with exclusive 6 Measurements of these processes have also been published by D0 [41][42][43], at the Tevatron, and by CMS [44], at the LHC. ATLAS has also published a measurement of Z+jets at the LHC [45]. MLM matching), and V + 4 parton samples (showered with inclusive MLM matching).
Two different AlpGen + PYTHIA 6 generations are compared to the data; the new AlpGen + PYTHIA 6 Perugia 2011 "matched" tune introduced in Sect. 3.1 (labeled "Alp.+Pyt. P2011"), and an AlpGen + PYTHIA 6 prediction using the default settings of AlpGen and the PYTHIA 6 DW tune (labeled "Alp.+Pyt. DW"). The ratio of the matched predictions to the data are shown and compared. Additionally, the results of variations of ktfac by factors of 0.5 and 2.0 in the AlpGen + PYTHIA 6 Perugia 2011 "matched" prediction are shown as solid lines. The hatched regions show the total error (statistical plus systematic) propagated to the theory/data ratio from the data measurements. The error bars on the points show the statistical error on the theoretical prediction.
In Fig. 6 we show the ratio of the predicted theory crosssections to the data for the CDF Z+jets measurement [39]. In this measurement jets are defined by the CDF midpoint algorithm [46], with R cone = 0.7 and are required to have p jet T > 30 GeV and |y jet | < 2.1. In Fig. 7 we show the ratio of the predicted theory cross-sections to the data for the CDF W +jets measurement [32]. In this measurement Fig. 7 (Top) The ratio of predicted theory and CDF measured data cross-sections for the production of a W → eν boson in association with at least N jet jets [32]. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥1 jet bin. (Bottom) The ratio of predicted theory and CDF measured data cross-sections for the production of events containing a W → eν boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side), as a function of the leading jet E T (left hand side), and the sub-leading jet E T (right hand side). In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines jets are defined by the CDF JetClu algorithm [47], with R cone = 0.4 and are required to have p T > 20 GeV and |η| < 2.5. In Fig. 8 we show the ratio of the predicted theory cross-sections to the data for the ATLAS W +jets measurements [40]. In this measurement jets are defined by the anti-Kt algorithm [35], with a radius parameter R = 0.4 and are required to have p T > 30 GeV and |η| < 4.4.
The AlpGen + PYTHIA 6 Perugia 2011 "matched" prediction compares well with the measured cross sections both as a function of the inclusive jet multiplicity and jet p T . In particular, the prediction correctly describes the low p T region of the differential cross section and, as will be shown in the next section, the jet sub-structure, without presenting any significant disagreement with data at high p T . This shows that it is possible to tune separately the long-and short-distance contributions of the prediction to obtain a satisfactory description of the observables in the whole experimental accessible phase space. Remarkably, the prediction describes the data both at √ s = 1.96 TeV and √ s = 7 TeV. This illustrates that the scaling properties of the long-distance contribution with the centre of mass energy of the collision does not produce unexpected effects in the high p T region of the cross sec-tion. The variation of ktfac by factors of 0.5 and 2.0 has little effect on the shapes of the differential crosssections, while for the inclusive N jet cross-sections it produces variations that bracket the default prediction. The ktfac parameter can therefore be used to explore the sensitivity of the prediction to a variation of the renormalization and factorization scale, other than allow tuning on data. With the statistics of the currently available measurements there is no much room for optimizing the parameters of the AlpGen + PYTHIA 6 Perugia 2011 "matched" prediction in a tune that better describes the measurements. However, with the arrival of more precise higher-statistics LHC measurements, this should be possible in the future.
Even though not explicitly shown here, in the relevant publications one can find similarly good agreement between the available measurements [32,40,48] and predictions based on AlpGen + Herwig.

Jets shapes
Finally we test the ability of the new AlpGen + PYTHIA 6 Perugia 2011 "matched" tune and the systematics varia- Fig. 8 (Top) The ratio of predicted theory and ATLAS published cross-sections [40] for the production of a W → eν boson in association with at least N jet jets. In the left hand figure the theory predictions are not normalized to the data. In the right hand figure the theory predictions are normalized such that they equal the data measurement in the ≥0 jet bin. (Bottom) The ratio of predicted theory and ATLAS data cross-sections for the production of events containing a W → eν boson in association with at least 1 jets (left hand side) and at least 2 jets (right hand side), as a function of the leading jet p T (left hand side), and the sub-leading jet p T (right hand side). In the left hand plot, the theory prediction is normalized such that the predicted rate for ≥1 jet production is equal to that measured in the data. In the right hand plot, the theory prediction is normalized such that the predicted rate for ≥2 jet production is equal to that measured in the data. The results of the ktfac variations are shown as solid lines tions to describe the jet shapes at the LHC and the Tevatron.
For the LHC, the jet shapes measured in inclusive jet production by the ATLAS collaboration [49] are taken as reference. For this measurement the jets are reconstructed using the anti-kt algorithm with the distance parameter R = 0.6, the transverse momentum range 30 GeV < p T < 600 GeV and rapidity in the region |y| < 2.8. The jet shapes are expected to be sensitive to both perturbative (parton shower) and non-perturbative (fragmentation and underlying event) modelling aspects. We perform the data comparisons for both the AlpGen + PYTHIA 6 and PYTHIA 6 standalone cases. The samples are generated using different PYTHIA 6 standalone and AlpGen + PYTHIA 6 parameter settings as follows: for PYTHIA 6 standalone the P2011 and the associated systematics tunes Perugia 2011 radHi and Perugia 2011 radLo are compared. The same tunes are also used for the generating the AlpGen + PYTHIA 6 distributions whereby the Λ QCD values are always set to the same values in AlpGen (using ktfac) and PYTHIA 6 (i.e. the Perugia 2011 "matched" central settings and the systematic variations around the central settings). The setups are compared to the integral jet shape distributions as measured in the data. The integral jet shape is defined as the average fraction of the jet p T that lies inside a cone of radius r concentric with the jet cone [49]: The sum is performed over all the N jet jets in the kinematic region of interest. In Fig. 9 the integral jet shape distributions are compared to the ATLAS data for the jets in the transverse momentum ranges of 40-60 GeV (top) and 260-310 GeV (bottom) in the whole measured rapidity range (|y| < 2.8). We observe that both PYTHIA 6 standalone (left) and AlpGen + PYTHIA 6 (right) with P2011 provide reasonably good description of the jet shapes. Due to MLM matching the jets in the AlpGen + PYTHIA 6 case tend to be more narrow than in the PYTHIA 6 standalone case.
For the Tevatron, the shapes of jets produced in association with a Z boson as measured by CDF [50] are used. In this measurement jets are defined by the CDF midpoint algorithm [46], with R cone = 0.7 and are required to have  Figure 10 shows good agreement between AlpGen + PYTHIA 6 and the measurement for both the DW and P2011 tunes.
The comparisons in Figs. 9 and 10 (as well as comparisons to the jet shapes in other kinematic regions and comparisons to further LHC measurements) reveal no major short-comings of the AlpGen + PYTHIA 6 Perugia 2011 "matched" tune. This tune has been developed by tuning PYTHIA 6 standalone, whereby the effective value of Λ QCD was set to be the same throughout the PYTHIA 6 parton shower in the anticipation of using it with the AlpGen matrix elements using the same effective Λ QCD value. The agreement with the measured jet shapes data could therefore potentially be improved by performing a dedicated tuning of AlpGen + PYTHIA 6.

Conclusions and outlook
We have shown that, in the context of tuning ME-PS matched predictions, it is vital that the tuning adopted ensures a consistent treatment of α S on either side of the Fig. 10 A comparison of the predicted and CDF measured jet shape in Z+jets events, showing the fraction of the total energy of a jet of radius R contained within a radius r, Ψ (r), as a function of r/R. "Alp.+Pyt. DW" uses the default AlpGen parameters and the DW PYTHIA tune, "Alp.+Pyt. P2011" is described in Sect. 3.1 and it includes a consistent choice of Λ QCD in AlpGen and PYTHIA "matching boundary". In the case of AlpGen + PYTHIA 6 matched predictions, we have outlined a simple prescription to ensure this. This can be easily generalized, and applied to the case of matching AlpGen with other shower MCs, such as Herwig. We have then given an example of such a tune that compares well to Tevatron and LHC measurements of vector boson plus multijet final states. In addition, we have shown how consistent variations around a central ME-PS matched tune can be performed, so as to define a systematic uncertainty on that prediction.
The observations made in this paper add some further insight on the more general issue of tuning MC event generators in presence of matching. It is clear that the overall tuning strategy goes well beyond what we have discussed. Further parameters of the matching procedure, such as the momentum scale at which one sets the separation/matching between matrix-element-and shower-generated jets, can play a role in the overall tuning. The matching scale dependence of the predictions, in the case of W +jets production, has been studied in detail, for all the existing tools, in Ref. [51]. This parameter does not affect the structure of jets at scales well above the matching scale, and even at smaller scales its variations lead to very small changes, consistently with the expectation that, being an unphysical parameter, the choice of matching scale should not affect the results. The renormalization scale dependence of the predictions has also been studied in [51]. Notice that a change in renormalization scale is equivalent to a change in Λ QCD , and therefore the conclusions of that work are consistent with our findings here: the main impact of a scale change is in the jet multiplicity distribution, and in the hardness of the jet p T spectra. Notice also that this implicitly underscores the potential relevance of the choice of PDF sets in the event generation: LO sets tend in fact to have a value of Λ QCD larger than NLO sets, with harder p T and multiplicity spectra.
On the shower MC side, it is well known that strong constraints arise from the precise measurements of hadronic fi-nal states in Z 0 decays. In particular, event-shape and jetrate observables are directly sensitive to the treatment of α S in the shower, and enter with a dominant weight in the tuning of the quarks' shower evolution [20,36]. In this perspective, our proposal amounts to a recipe to allow the ME generator to adapt the choice of the α S argument, used in the CKKW α S reweighting, to what is most suitable to match the shower evolution as tuned to Z 0 -decay data. The scale parameter (which directly affects jets' spectra) and Λ QCD (which more directly affects jets' fragmentation properties) provide therefore two complementary knobs to allow, in principle, to reproduce the jet spectra in hadronic collisions, while preserving the good agreement with the jet fragmentation properties determined at LEP. The resulting description of the jet fragmentation properties at the LHC, where, contrary to Z 0 decays, jets are dominantly initiated by gluons, would therefore emerge as a prediction of the framework.
The results shown in this paper are consistent, within the limited statistical and systematic precision, with the available data. More compelling tests will soon be possible at the LHC, with the higher-statistics information on both spectra and jet properties (shapes, fragmentation functions, etc.), necessary to allow for tuning exercises more complete and compelling than the one presented here. We nevertheless hope that the results of this work will prove valuable in defining a new set of consistent ME-PS tunes for the precise future study of LHC multijet final states.
V2.14 of the code, and PDF set CTEQ5L [33]. For the Tevatron (LHC) studies, partons are generated in the phase-space regions defined by: Matching is performed with parameters (as defined in [51]): The relevant parameters controlling the magnitude, renormalization scale, and running order for α S are set to these values: 7 • The prefactor rescaling the value of the renormalization scale in the CKKW scale-setting procedure: ktfac = 1.0. • Λ QCD (5-flavor): xlclu = 0.26.

A.1.2 PYTHIA parameters
To the extent that the explanations given here are necessarily somewhat brief, we recommend the interested reader to follow up on the definitions of the parameters below in the program's comprehensive manual [2].
The strong coupling in PYTHIA 6 there are several different ways of specifying the parameters controlling α S . In order for the user to have explicit control of all of them, we use the option MSTP(3)=1, which allows to specify formally independent Λ QCD values for each of the different algorithmic components in PYTHIA. As a special case, we may then choose to set all those values equal, or at least set those equal that correspond to initial-and final-state radiation. These are: • PARP(61) for Λ QCD for ISR.
• PARP(72) for Λ QCD for FSR outside resonance decays (e.g., FSR off hard jets from the matrix element and/or from ISR).
The number of flavors with which to interpret Λ QCD must also be specified. Since a value of n f = 5 is hard-coded in at least one sub-algorithm of PYTHIA, we advise to always translate Λ QCD values to 5-flavor ones and, correspondingly, set the n f parameter MSTU(112)=5. 7 The parameters xlclu and lpclu were introduced in connection with this work, and are implemented in AlpGen starting from v2.14.
The default behavior, if these parameters are not set, is to assign the values inherited from the PDF (as was the case for AlpGen versions before 2.14).
Further, in order to avoid that any of these values are modified by the code, we set MSTP(64)=2, and PARP(64)=1.0. The former forces the code to keep Λ QCD unmodified for ISR. In particular, the translation from "MSbar" to "CMW", which is applied for MSTP(64)=3, is not performed. This is equivalent to interpreting the effective Λ QCD value as already being in a scheme similar to CMW. The latter, PARP(64)=1.0, sets the prefactor for the renormalization scale used for ISR equal to unity, i.e., the renormalization scale will just be p T . Any re-interpretation of Λ QCD , for instance to translate between different effective scheme definitions or to introduce multiplicative factors on the effective renormalization scale, for scale variation purposes, should then be imposed directly on the three Λ QCD values above. This is the prescription followed in the socalled Perugia 2011 tunes which were developed as part of this effort, with parameters as listed in [6].
Finally, one needs to settle on an effective value for Λ QCD . According to comprehensive Professor tunings [36,37] of the p T -ordered shower in PYTHIA [4] to event shapes and other LEP data, one needs values of order where the superscript indicates the number of flavors. We interpret this as an effective value, derived directly from data using a "PYTHIA scheme" that is defined numerically by PYTHIA's shower algorithm. It is not necessarily directly comparable to MS determinations. 8 The strong coupling in Pythia 8 Although we restrict the numerical studies in this paper to PYTHIA 6, for completeness we also include the case of PYTHIA 8 [52], for which the corresponding relevant parameters are Radiation phase space The size of the allowed phase space for radiation in the shower generator may also affect the matched result. In the p T -ordered shower in PYTHIA 6, the switch MSTP(72) controls the starting scale for final-state 8 It is probably closest to the so-called CMW scheme [28]. For completeness, a reasonable derived guess for a corresponding MS value would then be ∼0. 16, since the CMW prescription yields effective values that are approximately 1.6 times larger than the MS one. radiation off jets that are produced by initial-state radiation and/or are color-connected to the beam. Naively, the FSR off such a parton should start at the scale at which it was created, which is obtained with MSTP(72)=2, the recommended option. Using the other available options is strongly discouraged, as these lead to a quite bad agreement with NLL resummations, underscored by Banfi et al. in [53]. The Perugia 2011 tunes all use MSTP(72)=2. Further, in the Perugia 2011 tunes, we start both the ISR and FSR evolutions at p Tevol = SCALUP, with the PYTHIA evolution variable p Tevol defined in [4] and SCALUP the scale parameter defined in the Les Houches Accord for event generators [54,55]. Technically, this is achieved by setting PARP(71) = PARP(67) = 1.0, where the former controls the scale factor applied to the starting scale for FSR and the latter sets the one for ISR. Note that these parameters could still be varied somewhat around their central values, since the p Tevol variable used by PYTHIA is not 100 % identical to the p T definition that might be used to place cuts in a matrix-element generator, but we have not judged this difference essential at the current level of precision.

A.2 PYTHIA tunes
In this work, we have used the Perugia 0, Perugia Hard, Perugia Soft, Perugia 2010, Perugia 2011, Perugia 2011 radHi, Perugia 2011 radLo [6], and the DW [7] tunes of PYTHIA 6.4 [2]. All use the CTEQ5L PDF set [33]. For a complete description, see the indicated references. The salient features of the tunes are as follows.
Tune DW [7] is a tune of the Q 2 -ordered shower. It is based on the Tevatron "Tune A", which had great success in describing the underlying event measured at the Tevatron. Contrary to Tune A, however, DW also included the Drell-Yan p T spectrum, for which Tune A predicted a far too soft spectrum. Tune DW therefore has a significantly lower renormalization scale for ISR (and thus a larger value of α s ), and 2 GeV of so-called "primordial k ⊥ ", as compared to 1 GeV in Tune A. The energy scaling of the underlying event was based on comparisons between the underlyingevent level at the Tevatron between 630 and 1800 GeV.
The Perugia tunes [6] are all tunes of the p T -ordered shower. Unlike DW, which was developed by tuning to the underlying event in jet events, the Perugia tunes primarily used minimum-bias data as drivers, relying on the universality of PYTHIA's MPI modelling to extrapolate to the underlying event. In addition, a comprehensive update of the LEP fragmentation parameters was included in all tunes.
The first set, the Perugia 0 family, used LEP event shapes and fragmentation data, Tevatron minimum-bias data, and the Tevatron Drell-Yan p T spectrum. Again, the scaling from Tevatron data at 630 GeV was used to determine the scaling with CM energy, with some additional constraints from older UA5 data also included. A "Hard" and "Soft" variation attempted to vary the shower radiation up and down, respectively. Both Perugia 0 and the "Hard" variation use the so-called "CMW" scheme for Λ QCD for ISR, while the soft retained the unmodified MSbar value, in all cases taking the numerical value from the PDF set used. For the "Hard" variation, the renormalization scale for ISR was 0.5p T , for Perugia 0 p T , and for the "Soft" variation, √ 2p T . In addition, the "Hard" variation had higherthan-nominal values for FSR, and had a slightly harder hadronization spectrum, while the converse was true for the "Soft" one. None of these early tunes used the recommended MSTP(72)=2 setting, and hence predicted rather narrow ISR jets. The PYTHIA tune numbers are 320, 321, and 322, for Perugia 0, "Soft", and "Hard", respectively. Table 3 lists the parameter settings of the Perugia family that were used for the block variations in Sect. 2.2.
In Perugia 2010, jet shapes were included among the tuning constraints. The amount of FSR outside resonance de-cays (previously controlled by the Λ QCD value read from the PDF set) was adjusted to agree with the level inside them (constrained by fits to LEP event shapes), combined with the recommended MSTP(72)=2. The Λ QCD value for ISR was still read from the PDF set, and translating from MSbar to CMW, as in Perugia 0. A few fragmentation parameters were slightly revised, since some of the previous ones had only been constrained using the Q 2 -ordered shower, and a new color-reconnection model was introduced. No "Hard" and "Soft" variations were produced for this tune. The PYTHIA tune number is 327 for Perugia 2010.
In Perugia 2011, it was possible to include some early lessons from LHC at 7 TeV. Based on observed strangeness and baryon production rates, a few of the fragmentation parameters were again revised. The universal effective Λ QCD choice advocated in this paper was introduced. Variations labeled "radHi" and "radLo" were defined as well, expressing a factor 2 variation in the Λ QCD values used for ISR and FSR. The PYTHIA tune numbers for Perugia 2011, radHi, and radLo, are 350, 351, and 352, respectively.
Tabulated values of the parameters of all of the Perugia tunes can be found in the appendices of [6].