Not only real experimental data but also simulated data are necessary for modern data analyses because experiments, that is, detectors, etc. are getting complex so that we need the help of Monte Carlo (MC) simulation to understand the experimental data. A Monte Carlo method is a technique to simulate high-energy physics interactions and detector responses by applying random samplings to probability distribution modelling experiments. In collider experiments, MC simulated events (MC events or MC samples in short) are produced event-by-event; in the case of proton-proton colliders, each event corresponds to a bunch crossing, where several pp collisions (pile-up) may occur. MC simulation is also useful to design new experiments and detectors.

7.1 Overview

We outline the production of MC events with MC simulation, where three steps are considered: event generation, detector simulation and reconstruction as shown in Fig. 7.1.

In the event generation step, we produce events, for example, two photons from Higgs bosons in proton-proton collisions (\(pp\rightarrow gg\rightarrow H\rightarrow \gamma \gamma \)), or two quarks from Z bosons in electron-positron collisions (\(e^+e^-\rightarrow Z\rightarrow q\bar{q}\)). Unstable particles, whose lifetime is short enough not to reach detectors, are decayed according to branching fractions which are obtained from experimental measurements or theoretical predictions. The output of this step is a list of particles with various information, for instance, energy, momentum, production and decay positions, status and relation between particles, i.e., parent and children.

In the detector simulation step, we simulate our detector responses to the stable particles produced in the previous step. For example, in the case of electrons, they lose energy by interacting with detectors: produce electron-hole pairs in pixel and silicon detectors, ionise particles in gas detectors and produce EM showers in calorimeters. These energy deposits are converted to charges if necessary. The format of outputs is the same as that of the real data from detectors because the next step should be applied to both real data and MC simulation. The detector simulation should ideally be as precise as possible but it depends on the requirements of physics achievements and the technical limitations, for example, modelling of detectors, computing resources, etc. The pile-up effect in the pp collisions can be taken into account after the detector simulation, for example, we prepare several events of the inelastic interactions and mix them following the number of collisions per bunch crossing.

In the reconstruction step, we reconstruct events from the output of the detector simulation and identify particles. Reconstructed objects in each event are still candidates of particles. For instance, electron candidates mean that they are reconstructed from EM calorimeter clusters matched with charged tracks. They come from electrons (called true electrons) or fake electrons (\(\pi ^\pm \), \(\tau \), etc.); note that the fraction of true electrons is not so high. Then, identification programs are applied in order to select, for example, true electrons from electron candidates as much as possible. In other words, fake electrons are rejected as much as possible. As a result, the fraction of true electrons in the selected electron candidates becomes high. Momentum and energy are also calculated for each object including calibrations if possible. The output of this step is used in the data analysis.

Fig. 7.1
figure 1

The flow of the production of MC events (full detector simulation) and real data for data analysis. The Monte Carlo method is used in the event generation and detector simulation steps but not in the reconstruction step. The reconstruction step should work for both MC events (solid) and real data (dashed)

7.2 Event Generation

An MC event is produced with several steps, where each step uses different programs. We explain the outline of how to produce an event using a concrete example of the \(t\bar{t}\) process in pp colliders with several keywords often used for MC production: matrix-element event generator, parton density function, parton shower, fragmentation, harmonisation, underlying event, etc. The detail of theoretical aspects can be found in books, for example, [1]. Then, we give concrete computing programs used in the ATLAS experiment.

7.2.1 Production of \(t\bar{t}\) Process

Let us consider the production of \(t\bar{t}\) events in pp collisions: \(t\bar{t}, t\rightarrow W^+b, \bar{t}\rightarrow W^-\bar{b}\) where one of W bosons decay into quarks and the other into leptons. Figure 7.2 shows two Feynman diagrams of this production: \(gg\rightarrow t\bar{t}, t\rightarrow W^+b, W^+\rightarrow u\bar{d}, \bar{t}\rightarrow W^-\bar{b}, W^-\rightarrow \ell ^-\bar{\nu }\). This is so-called a hard-process part. Matrix-element event generators (ME generators) are used for the production of the hard-process part. ME generators can produce events by considering all the diagrams which have the same final state. ME generators also perform the simulation of a gluon (g) from a proton in the proton-proton collision. The momentum fraction of gluons is described by parton density functions (PDF), which are obtained based on QCD and experimental measurements. Gluon and quark PDFs depend on the energy of the interaction and we need to define an energy scale to evaluate PDFs, for example, as top mass for \(t\bar{t}\) production. This scale is one of the important parameters in the MC production, which is called a factorisation scale \(\mu _F\).

Fig. 7.2
figure 2

Feynman diagrams of \(t\bar{t}\) process in the pp collisions: \(gg\rightarrow t\bar{t}, t\rightarrow W^+b, W^+\rightarrow u\bar{d}, \bar{t}\rightarrow W^-\bar{b}, W^-\rightarrow \ell ^-\bar{\nu }\) with s-channel (left) and t-channel (right)

However, the production of a hard-process using ME generators is not the end of the story but a starting point of the event generation. To produce MC events, several different steps (parton shower, fragmentation/hadronisation, etc.) have to be performed as shown in Fig. 7.3. Lots of quarks and gluons with relatively small \(p_{\textrm{T}}\) are emitted using a method of parton shower, that is, soft and collinear emissions. A Sudakov form factor, which is a probability not to emit a parton until a target energy scale, is calculated to perform the MC method. There are two different types of showers for such emissions: initial state radiation (ISR) and final state radiation (FSR); the theoretical idea behind them is similar, however, ISR is complicated due to so-called backward evolution, which is “tracing the showers backwards in time” (see, for example, [2]). As shown in Fig. 7.3, the use of ISR and FSR depends on when the emissions happen; ISR (FSR) is for radiations before (after) the hard-process. ISR gluons and quarks are radiated from the initial gluons of \(gg\rightarrow t\bar{t}\) but FSR gluons are radiated from quarks (u and/or \(\bar{d}\)) of \(W^+\rightarrow u\bar{d}\).

Fig. 7.3
figure 3

Details of MC event production of \(t\bar{t}\) from the proton-proton collisions: parton showers (ISR, FSR), fragmentation, hadronisation, photon radiation and underlying event. The hard-process shown with a dashed box is the same as the left plot of Fig. 7.2. Redrawn from Fig. 1 of [3]

Quarks and gluons cannot be observed due to the colour confinement (see Sect. 6.4), so that they need to be combined to compose hadrons. This procedure is called hadronisation. For example, to produce a B meson (\(B^0(\bar{b}d), B^+(\bar{b}u)\)), u or d quarks are produced from parton shower/fragmentation and one of them is combined with a \(\bar{b}\) quark. New additional quarks and gluons are created from the existing quarks, gluons or vacuums, which are gluon fields in this case. This step is called fragmentation but is sometimes included in the step of hadronisation. Photons can be radiated from charged leptons and quarks, which are called QED radiative corrections. The remaining parts of protons not used in the hard-process are called “underlying event” and must be treated properly with parton showers, hadronisation and fragmentation. Finally, the decay of mesons, baryons and leptons is performed until particles produced from decays become “stable”. Kinematics (energy and momentum) of all the particles are determined under conservation rules (energy, momentum, spin/polarisation, etc.) with the MC method.

Feynman diagrams shown in Fig. 7.2 are the leading order (LO) of \(gg\rightarrow t\bar{t}\) process and there is no additional gluons and quarks. However, we can use ME generators to produce more high \(p_{\textrm{T}}\) gluons and quarks. For example, when we consider one additional strong coupling, additional gluon or quark can be emitted, which should be treated by the ME generators instead of the parton shower. This is a part of the contribution of next-to-leading order (NLO). Gluons or quarks produced by ME generators and by parton shower are properly treated to avoid a double counting, which is briefly explained in the next section.

7.2.2 Event Generators

Many event generators are available on the market. Several main generators used in ATLAS for Run 1 and Run 2 data analysis are listed in alphabetical order: Alpgen [4], Herwig [5,6,7], MadGraph (MadGraph5_aMC@NLO) [8], MC@NLO [9], Photos [10], Powheg [11], Pythia [2, 12], Sherpa [3], Tauola [13], etc. Since different generators have different features, each generator has its own pros and cons.

Herwig, Pythia and Sherpa are multi-purpose event generators. They can do all the steps explained before including parton showers, fragmentation, hadronisation and decay. Not only LO but also NLO calculations for ME are available for a part of processes in these generators. However, other ME generators like MadGraph, Powheg, etc. are often used for NLO in ATLAS. Herwig and Pythia can be used for simulating parton showers, fragmentation, hadronisation and decay in these ME event generators. The theoretical idea of the parton shower is different among generators: angular ordering for Herwig and \(p_{\textrm{T}}\) (or \(k_T\)) ordering for Pythia. Models for fragmentation and hadronisation are also different: cluster model for Herwig and string model for Pythia. Their difference is often used as systematic uncertainties from the parton shower and fragmentation/hadronisation models. Herwig has three major versions; Herwig, Herwig++ and Herwig 7. Pythia has two major versions: Pythia 6 and Pythia 8. Herwig++, Herwig 7 and Pythia 8 were often used in ATLAS in Run 2 compared to Herwig and Pythia 6 since new features and techniques of event generations were only implemented in Herwig++, Herwig 7 and Pythia 8.

Alpgen, MadGraph and Sherpa are multi-leg generators. They can produce events with ME including multi-partons. The multi-partons are so-called additional partons or additional jets, so that we often call such physics processes X+jets, for example, W+jets, Z+jets, etc. The idea behind such additional jets with ME (ME jets) is that the modelling of jets produced with parton showers (PS) might not work well because the parton shower is based on soft and collinear approximation. Figure 7.4 shows one of the diagrams for W+2-jets and these additional quark and gluon associated with a W boson can be produced by either ME or PS. Calculations based on ME is in principle correct (see also the next paragraph, however); when the existence and behaviour of additional jets are critical in data analysis, the use of ME generators is recommended to describe additional jets, in particular, high \(p_{\textrm{T}}\) jets, but we should be aware that the implementation of loop corrections, etc. may depend on each generator. For example, in the SUSY searches, typical SUSY events from gluinos and squarks have several jets in the final state (e.g., \(gg \rightarrow \tilde{g}\tilde{g} \rightarrow qq\tilde{\chi _1}^0 qq\tilde{\chi _1}^0\)), and one of dominant background processes is \(Z \rightarrow \nu \bar{\nu }\)+jets. The “+jets” of the Z process must be modelled well to predict background, so that Sherpa is used for Z plus up to 4-jets with ME in ATLAS [14].

Fig. 7.4
figure 4

Feynman diagram of W+jets: 2 jets (a quark and a gluon) are associated with a W boson

There is one important thing to be considered, which is called a jet-parton matching. Even if multi-leg generators are used to produce additional jets with ME, the parton shower has to be applied to produce additional jets, in particular, low \(p_{\textrm{T}}\) jets. Practically we assume that ME takes care of high \(p_{\textrm{T}}\) jets and PS does low \(p_{\textrm{T}}\) jets because ME jets are well modelled in high \(p_{\textrm{T}}\) region. To ensure this assumption, reconstructed high \(p_{\textrm{T}}\) jets must match to partons produced by ME. In addition, jets in some phase spaces are produced by both ME and PS, which leads to double counting of events. For this purpose, MLM prescription [15], CKKW-matching procedure [16, 17], etc. are applied and events are discarded if they cannot satisfy their requirement. To explain MLM prescription in Alpgen, let us consider the production of W+up to 2-jets with a ME generator and jets with \(p_\textrm{T}>20\) GeV are used for data analysis. First, events are produced with W with 0 ME-jet, W with 1 ME-jet and W with 2 ME-jets, separately, where additional partons with, for example, \(p_\textrm{T}>15\) GeV are produced by a ME generator. Note that the parton shower is also applied, so that some high \(p_{\textrm{T}}\) jets might be produced by PS. Then, all the reconstructed jets with \(p_\textrm{T}>15\) GeV are checked if they matched to ME-parton and we count such jets. For W with 0 (1) ME-jet, we require such jets should be exactly 0 (1). For W with 2 ME-jets, we require such jets should be 2 or more. Then, we merge the remaining events to make a W+up to 2-jets events.

MadGraph, MC@NLO, Powheg and Sherpa are used as NLO generators for some specific processes. Additional one parton and also loop diagrams up to the next-to-leading order are properly taken into account.

Photos generates QED radiative corrections for charged leptons and quarks. Tauola is a program to simulate tau-decay including polarisation properly. They are optionally used in Pythia and Herwig.Footnote 1

7.2.2.1 Cross Section

MC events are produced by using event generators, which can provide their cross sections including branching fractions. However, in many cases, we don’t use cross sections provided by event generators but values obtained from dedicated programs, because such programs can perform more higher order calculations than event generators. NLO event generators are available for most important physics processes on the marker but NNLO event generators are limited. On the other hand, the dedicated programs can calculate cross sections up to NNLO (or higher for some processes).

There is an important parameter, i.e., a renormalisation scale \(\mu _R\), which is the scale at which the strong coupling is evaluated in order to calculate cross sections. The value of cross sections does not depend on the choice of \(\mu _R\); however, since we cannot perform complete calculations including all the orders of the strong coupling, the calculated cross sections might depend on \(\mu _R\). In the high-energy region (\(\gg O(100~\textrm{MeV})\)), a perturbative method works well in QCD like QED and we can calculate cross sections, for instance, up to next-to-next-to-next-to-leading order (NNNLO, N3LO or N\(^3\)LO) for the Higgs gluon fusion production.

Including the higher order calculations, cross sections can be properly predicted and they are well consistent with the experimental results. Table 7.1 shows the results of W, Z, \(t\bar{t}\), Higgs, ZZ and \(t\bar{t}W\) production cross section measurements as concrete examples with theoretical predictions.

Table 7.1 Cross section measurements at the ATLAS experiment (\(\sqrt{s}=13\) TeV) with theoretical predictions. The measurements are given with statistical and systematic uncertainties. The predictions are given with combined uncertainties including PDF, \(\alpha _S\), scales, etc. For the Higgs production cross section, five processes are included: gluon fusion (NNNLO), VBF (approximate NNLO), VH (NNLO/NLO), \(t\bar{t}H+tH\) (NLO) and \(b\bar{b}H\) (NNLO/NLO)

7.3 Detector Simulation

Particles produced in the event generators are detected through the interactions with several detector components (materials). To reach the detector volume including the beam pipe, particles has a long enough lifetime, that is, they are “stable particles” from a viewpoint of the detector simulation. Such stable particles are electrons (\(e^\pm \)), photons, \(\pi ^\pm \), kaons (\(K^\pm \), \(K^0_S\), \(K^0_L\)), \(\mu ^\pm \), protons (p/\(\bar{p}\)), neutrons and neutrinos in case of the SM. In addition, in the SUSY and other new physics models beyond the SM, some particles, for example, the lightest neutralino is stable in the SUSY models with R-parity conservation. Some of the stable particles, for example, \(\pi ^\pm \), \(K^\pm \), \(K^0_S\) and \(\mu ^\pm \) can decay according to their lifetime in the detector volume, which is done in the detector simulation step, not by the event generator.

Geant4 program [24] is a detector simulation toolkit widely used in the experimental particle physics. We build each detector component and define its interactions based on its real detector in order to emulate how particles interact and how much energy of particles is lost; energy loss and multiple scattering (charged tracks in tracking volumes); electromagnetic interaction/shower; hadronic interaction/shower; etc. This is based on the best knowledge of the particle interaction with materials. Geant4 traces particles step-by-step and simulates their interaction. This is a reason why MC simulation with Geant4 is called full simulation.

There are different types of MC simulations: “fast simulation” and “parametric simulation”.Footnote 2 In the parametric simulation, the detector response is described by expected resolution functions for each stable particle. Momentum and energy are smeared with the resolution functions. The effects of reconstruction and identification programs, that is, their efficiencies are replaced with weights or MC methods following their expected performance. “Fast simulation” is sometimes the same as the parametric simulation but this term is also used in the case that a part of the detector simulation step, for example, calorimeter response, is replaced with a faster algorithm to emulate detector response. In terms of the modelling of the real data, in general, the full simulation is better than the fast and parametric simulations. However, from the point of view of execution time, the full simulation is much slower; for example, in some extreme cases, several minutes per event with the full simulation but less than a few seconds with the parametric simulation. If we need lots of events to reduce the MC statistical uncertainties, the use of fast or parametric simulations is one of the options. In addition, it takes a much longer time to develop computing programs with Geant4.