Measurements of inclusive and differential fiducial cross-sections of $t\bar{t}\gamma$ production in leptonic final states at $\sqrt{s}$ = 13 TeV in ATLAS

Inclusive and differential cross-sections for the production of a top-quark pair in association with a photon are measured with proton-proton collision data corresponding to an integrated luminosity of 36.1 fb$^{-1}$, collected by the ATLAS detector at the LHC in 2015 and 2016 at a centre-of-mass energy of 13 TeV. The measurements are performed in single-lepton and dilepton final states in a fiducial volume. Events with exactly one photon, one or two leptons, a channel-dependent minimum number of jets, and at least one $b$-jet are selected. Neural network algorithms are used to separate the signal from the backgrounds. The fiducial cross-sections are measured to be 521 $\pm$ 9(stat.) $\pm$ 41(sys.) fb and 69 $\pm$ 3(stat.) $\pm$ 4(sys.) fb for the single-lepton and dilepton channels, respectively. The differential cross-sections are measured as a function of photon transverse momentum, photon absolute pseudorapidity, and angular distance between the photon and its closest lepton in both channels, as well as azimuthal opening angle and absolute pseudorapidity difference between the two leptons in the dilepton channel. All measurements are in agreement with the theoretical predictions.


Introduction
Measurements of top-quark properties play an important role in testing the Standard Model (SM) and its possible extensions. Studies of the production and kinematic properties of a top-quark pair in association with a photon (ttγ) probe the tγ electroweak coupling. For instance, deviations in the transverse momentum (p T ) spectrum of the photon from the SM prediction could point to new physics through anomalous dipole moments of the top quark [1][2][3]. A precision measurement of the ttγ production cross-section could effectively constrain some of the Wilson coefficients in top-quark effective field theories [4]. Furthermore, differential distributions of photon production in tt events could provide insight on the tt production mechanism, in particular about the tt spin correlation and the charge asymmetry [5].
Evidence for the production of a top-quark pair in association with an energetic, isolated photon was found in proton-antiproton (pp) collisions at the Tevatron collider at a centre-of-mass energy of √ s = 1.96 TeV by the CDF Collaboration [6]. Observation of the ttγ process was reported by the ATLAS Collaboration in proton-proton (pp) collisions at √ s = 7 TeV [7]. Recently, both the ATLAS and CMS Collaborations measured the ttγ cross-section at √ s = 8 TeV [8,9]. In the ATLAS measurement, the differential crosssections with respect to the transverse momentum p T and absolute pseudorapidity |η|1 of the photon were reported. In the CMS measurement, the ratio of the ttγ fiducial cross-section to the tt total cross-section was measured.
This paper describes a measurement of the ttγ production cross-section in final states with one or two leptons, electron or muons, referred to as the single-lepton or dilepton channel, based on a data set recorded at the LHC in 2015 and 2016 at a centre-of-mass energy of √ s = 13 TeV and corresponding to an integrated luminosity of 36.1 fb −1 . The photon can originate not only from a top quark, but also from its charged decay products, including a charged fermion (quark or lepton) from the decay of the W-boson. In addition, it can be radiated from an incoming charged parton. In this analysis, no attempt is made to separate these different sources of photons, but criteria are applied to suppress those radiated from top-quark decay products. In each channel, the fiducial cross-section is measured with a likelihood fit to the output of a neural network trained to differentiate between signal and background events. In both channels, differential cross-sections, normalized to unity, are measured in the same fiducial region without performing the likelihood fit, as a function of the photon p T , the photon |η|, and the distance ∆R between the photon and its closest lepton. The distance ∆R between two objects is defined as the quadratic sum of their pseudorapidity difference ∆η and azimuthal opening angle ∆φ. In the dilepton channel, the normalized differential cross-sections are also measured as a function of the absolute pseudorapidity difference |∆η| and ∆φ between the two leptons. The measured cross-sections are compared to predictions from leading order (LO) generators. The predictions for the inclusive cross-sections are corrected by next-to-leading order (NLO) k-factors [10] in the strong interaction, calculated at parton level. This paper is organized as follows. The ATLAS detector is briefly introduced in Section 2. The data and simulation samples used are listed in Section 3. The derivation of the NLO correction to the signal cross-section is described in Section 4. The object and event selection, the neural-network algorithms, and the definition of the fiducial region are presented in Section 5. The estimation of the backgrounds are introduced in Section 6. The strategies to extract the fiducial and differential cross-sections are described in Section 7. The evaluation of the systematics uncertainties are discussed in Section 8. Section 9 gives the final results, and Section 10 presents the conclusion.

ATLAS detector
The ATLAS detector [11] consists of three main components. The innermost component is the Inner Detector (ID), which is used for tracking charged particles. It surrounds the beam pipe and is located inside a superconducting solenoid, operating with a magnetic field of 2 T. An additional silicon pixel layer, the insertable B-layer, was added between 3 and 4 cm from the beam line to improve b-hadron tagging [12,13] for Run 2. The calorimeter outside the ID is divided into two subsystems. The inner subsystem is the electromagnetic calorimeter (ECAL) and the second is the hadronic calorimeter (HCAL). The outermost layer is the third main component of the ATLAS detector: the muon spectrometer (MS), which is within a magnetic field provided by air-core toroid magnets with a bending integral of about 2.5 Tm in the barrel and up to 6 Tm in the end-caps. The ID provides tracking information from silicon pixel and silicon microstrip detectors in the pseudorapidity range |η| < 2.5 and from a transition radiation tracker (TRT) covering |η| < 2.0. The magnetic field of the superconducting solenoid bends charged particles for the momentum measurement. The ECAL uses lead absorbers and liquid argon (LAr) as active medium and is divided into barrel (|η| < 1.475) and end-cap (1.375 < |η| < 3.2) regions. The HCAL is composed of a steel/scintillating-tile calorimeter, segmented into three barrel structures within |η| < 1.7, and two copper/LAr hadronic endcap calorimeters, that cover the region 1.5 < |η| < 3.2. The solid angle coverage is completed with forward copper/LAr and tungsten/LAr calorimeter modules, optimised for electromagnetic and hadronic measurements respectively, and covering the region 3.1 < |η| < 4.9. The MS measures the deflection of muon tracks within |η| < 2.7 using multiple layers of high-precision tracking chambers in toroidal fields of approximately 0.5 T and 1 T in the central and end-cap regions, respectively. The MS is instrumented with separate trigger chambers covering |η| <2.4.
Data are selected from inclusive pp interactions using a two-level trigger system [14]. A hardware-based trigger uses custom hardware and coarser-granularity detector data to initially reduce the trigger rate to approximately 100 kHz from the original 40 MHz LHC proton bunch crossing rate. Next, a software-based high-level trigger, which has access to the full detector granularity, is applied to further reduce the event rate to 1 kHz.

Data and simulation samples
The data used for this analysis were recorded by the ATLAS detector in 2015 and 2016 at a centre-of-mass energy of 13 TeV, corresponding to an integrated luminosity of 36.1 fb −1 . Only the data-taking periods in which all detector systems were operating normally are considered. Candidate events were collected using single-lepton triggers, designed to select events with at least one isolated high-p T electron or muon.
The signal and background processes were modelled using Monte Carlo (MC) generators and passed through a detector simulation using G 4 [15,16]. The simulated events were reconstructed with the same software algorithms as data. To account for overlapping pp collisions (pile-up), multiple interactions were simulated with the soft QCD processes of P v8.186 [17] using the set of tuned parameters called A2 [18] and the MSTW2008LO parton distribution functions (PDF) set [19].
The ttγ signal sample was simulated as a 2 → 7 process for the semileptonic and dileptonic decay channels of the tt system at LO by M G 5_aMC@NLO v2.33 [20] (denoted as MG5_ MC in the following) interfaced with P v8.212 [21], using the A14 set of tuned parameters [22] and the NNPDF2.3LO PDF set [23]. The photon could be radiated from an initial charged parton, an intermediate top quark, or any of the charged final state particles. The top-quark mass, top-quark decay width, W-boson decay width, and fine structure constant were set to 172.5 GeV, 1.320 GeV, 2.085 GeV, and 1/137, respectively. The five-flavour scheme was used where all the quark masses are set to zero, except for the top quark. The renormalization and the factorization scales were set dynamically, corresponding to half the sum of transverse energies over all the particles generated from the matrix element calculation, where the transverse energy of a particle of mass m is defined as E T = m 2 + p 2 T . The photon was requested to have p T > 15 GeV and |η| < 5.0. At least one lepton with p T > 15 GeV was required, with all the leptons satisfying |η| < 5.0. The ∆R between the photon and any of the charged particles among the seven final-state particles was required to be greater than 0.2. The resulting total cross-section of the sample is calculated to be 4.62 pb. The NLO k-factors, introduced in Section 4, are applied to correct the fiducial cross-sections and acceptances to NLO.
The inclusive tt sample [24]  To assess the effects of initial-and final-state radiation (ISR and FSR), alternative signal samples were produced with the relevant P 8 A14 Var3c tune parameters [22] varied to increase or decrease the parton radiation. The effect of the choice of parton shower algorithm for the signal is evaluated with a sample generated using Herwig v7.0.1 [34] instead of P v8.212. An alternative tt sample was generated to enhance the parton shower radiation, with the renormalization and factorization scales varied down by a factor of two, the high radiation variation of the A14 Var3c tune parameter and the h damp value increased by a factor of two. A corresponding tt sample with reduced parton shower radiation was generated with the renormalization and factorization scales multiplied by a factor of two and the low radiation variation of the A14 Var3c tune parameter. The uncertainty arising from the choice of tt generator is evaluated using a S v2.2 sample. An alternative Zγ sample, generated by MG5_ MC v2.33 interfaced with P v8.212, is used to evaluate the modelling uncertainty of the Zγ background estimate.
The tt and V+jets samples contain events already accounted for by the ttγ and V γ samples. Based on truth information, the overlap is removed by vetoing the events where the selected photon originates from the hard interaction in the tt and V+jets samples. 4 Next-to-leading order k-factor for ttγ Calculations at NLO precision in QCD are available for the ttγ process at a centre-of-mass energy of √ s = 14 TeV [10], extending results performed using the approximation of stable top quarks [35]. A dedicated calculation at √ s = 13 TeV has been performed for both single-lepton and dilepton channels, by the authors of Ref. [10]. The renormalization and factorization scales are both set to the top-quark mass, while the rest of the parameters are set to the same values used by the MG5_ MC ttγ MC sample, as described in Section 3. These calculations are used to derive corrections at parton level to the LO ttγ MC sample.
The NLO calculation is performed at parton level in a phase space very close to the fiducial region defined in Section 5.4. The lepton (at least one lepton) is required to have p T > 25 GeV for the single-lepton (dilepton) channel, and all leptons must have |η| < 2.5. The photon is required to have p T > 20 GeV and |η| < 2.37. Jets are reconstructed from quarks and gluons using the anti-k t algorithm [36] with a radius parameter of R = 0.4, and they are required to have p T > 25 GeV and |η| < 2.5. At least four (two) jets are required for the single-lepton (dilepton) channel. All jets are required to be separated from the photon by ∆R(γ, jet) > 0.4. Leptons are required to be separated from the photon by ∆R(γ, ) > 1.0. The leptons in the dilepton channel are required to be separated from the jets by ∆R(jet, ) > 0.4.
The LO cross-sections are calculated using the MG5_ MC LO sample at particle level in the same phase space as above to derive the NLO k-factors for the single-lepton and dilepton channels. Since the kinematic properties of all the objects used in the NLO theoretical calculation are taken from parton level, the photons, leptons, and jets of the LO MC sample must be defined carefully to correspond to those at the parton level. This is achieved by requiring the photon and the leptons to be produced from the matrix element rather than from the parton shower and requiring the leptons have any QED radiation from the parton shower stage added back. The anti-k t algorithm with R = 0.4 is used for jet clustering, using all the final state particles, excluding the above photon, leptons, and their corresponding neutrinos. The calculated NLO cross-sections are 120 fb and 31 fb for the single-lepton and dilepton channels, while the calculated LO cross-sections are 92 fb and 21 fb for the same channels, respectively, resulting in NLO k-factors of 1.30 and 1.44. Statistical uncertainties are negligible. Systematic uncertainties of the k-factors receive contributions from two sources. For the NLO theoretical cross-section, the relative uncertainty due to the QCD scale and PDF choices are 14% (13%) for the single-lepton (dilepton) channel, with the QCD scale uncertainty dominating. For the LO MC cross-section, the non-perturbative effects in the parton shower model are studied by turning off the multiple parton interaction and hadronization of P 8 separately, resulting in an uncertainty of 8% (4%) for the single-lepton (dilepton) channel. In addition, the jet cone size is varied from 0.4 to 0.3 or 0.5 separately to evaluate the impact of additional QCD radiation on the reconstruction of the particle-level jet. The resulting uncertainties are 11% (6%) for the single-lepton (dilepton) channel. Summing up the components in quadrature, the total relative uncertainty on the k-factor is 20% (15%) for the single-lepton (dilepton) channel.

Object and event selection
The object and event selection at the detector level are introduced in Section 5.1 and Section 5.2 respectively. The neural-network algorithms used in the analysis are described in Section 5.3. In Section 5.4, the fiducial region at particle level is defined.

Object selection
Electron candidates are reconstructed from energy deposits in the central region of the ECAL associated with reconstructed tracks from the ID [37] and are required to have a p T > 25 GeV and an absolute calorimeter cluster pseudorapidity |η cluster | < 2.47, excluding the transition region between the barrel and endcap calorimeters (|η cluster | [1.37, 1.52]). "Tight" likelihood-based identification criteria are applied, which correspond to an efficiency between 80% and 90% for electrons in different p T and η ranges measured in Z → ee events [37]. Muon candidates are reconstructed by an algorithm that combines the track segments in the various layers of the MS with the tracks in the ID [38] and are required to have a p T > 25 GeV and |η| < 2.5. "Medium" cut-based identification criteria are required, which correspond to an average efficiency around 96% in tt events for muons in different p T and η ranges [38]. Isolation criteria are applied to both the electron and muon candidates using calorimeter-and track-based information to obtain 90% efficiency for leptons with p T = 25 GeV, rising to 99% efficiency at p T = 60 GeV in Z → events. The transverse impact parameter divided by its estimated uncertainty |d 0 |/σ(d 0 ) is required to be lower than five for electron candidates and three for muon candidates. The longitudinal impact parameter must satisfy |z 0 sin(θ)| < 0.5 mm for both. The lepton reconstruction and identification efficiencies in simulation are corrected with data-driven scale factors [37,38] in order to match the corresponding values in data.
Photon candidates are reconstructed from energy deposits in the central region of the ECAL [39] and classified as unconverted if there is no matching track or reconstructed conversion vertex or as converted if there is a matching reconstructed conversion vertex or a matching track consistent with originating from a photon conversion. They must have a p T > 20 GeV and |η cluster | < 2.37, excluding the transition region between the barrel and endcap. "Tight" cut-based identification criteria, based on discriminating variables and corresponding to an efficiency around 85% at 40 GeV, are applied [39]. Cut-based p T -dependent isolation criteria are applied using calorimeter-and track-based information and correspond to an efficiency between 75% and 90% for prompt photons (photons not from hadrons decays) in Z → γ events. The photon reconstruction and identification efficiencies in simulation are corrected with data-driven scale factors [39] in order to match the corresponding values in data.
Jets are reconstructed using the anti-k t algorithm with a radius parameter of R = 0.4 from topological clusters of energy deposits in the calorimeter [40]. The jet energy scale and jet energy resolution are calibrated using energy-and η-dependent calibration schemes resulting from simulation and in situ corrections based on data [41]. The jets are required to have a p T > 25 GeV and |η| < 2.5. Jets likely to originate from pile-up are suppressed by using the output of a multivariate jet-vertex-tagger (JVT) [42]. Scale factors are used to correct the selection efficiency in simulation to match data. Jets containing b-hadrons (b-jets) are identified with a b-tagging algorithm using a multivariate discriminant that combines information about secondary vertices and track impact parameters (MV2c10) [43,44]. The operating point used corresponds to an overall 77% b-tagging efficiency in tt events, with a corresponding rejection of c-jets (light-jets) by a factor of 6 (134). Efficiencies to tag b-, c-, and light-jets in the simulation are scaled by p T -and η-dependent factors [43] to match the efficiencies in data. The transverse energy carried by the neutrinos is accounted for in the reconstructed missing transverse momentum E miss T [45], which is computed as the transverse component of the negative vector sum of all the selected electrons, muons, photons, and jets, as well as ID tracks associated with the primary vertex but not with any of the above objects, which is called track-based soft term.
An overlap removal procedure is applied to avoid the same calorimeter energy deposit or the same track being reconstructed as two different objects. Electrons sharing their track with a muon candidate are removed. Jets within a ∆R = 0.2 cone of an electron are removed. After that, electrons within a ∆R = 0.4 cone of a remaining jet are removed. When a muon and a jet are close, the jet is removed if it has no more than two associated tracks and is within ∆R < 0.2 of the muon, otherwise the muon is removed if it is within ∆R < 0.4 of the jet and the jet has more than two associated tracks. Photons within a ∆R = 0.4 cone of a remaining electron or muon are removed. Finally, the jets within a ∆R = 0.4 cone of a remaining photon are removed.

Event selection
The events must have at least one primary vertex with at least two associated tracks, each with p T > 400 MeV. Primary vertices are formed from reconstructed tracks spatially compatible with the interaction region. The primary vertex with the highest sum of p 2 T over all associated tracks is chosen. Events are categorized into the single-lepton channel if their final state contains one electron or one muon, and into the dilepton channel if they contain two electrons, two muons, or one electron and one muon, with each pair required to be of opposite charge. The lepton (at least one of the leptons) must be matched to a fired single-lepton trigger for the single-lepton (dilepton) channel. The p T of the electron (muon) that fired the trigger has to be larger than 27 (27.5) GeV in order to match the higher lepton p T trigger threshold in 2016. The selected events must have at least four (two) jets in the single-lepton (dilepton) channel, at least one of which is b-tagged, and exactly one photon. A Z-boson veto is applied in the single electron channel by excluding events with invariant mass of the system of the electron and the photon around the Z-boson mass (|m(e, γ) − m(Z)| < 5 GeV), where m(Z) = 91.188 GeV. In the dilepton channel when the two leptons have the same flavour, events are excluded if the dilepton invariant mass or the invariant mass of the system of the two leptons and the photon is between 85 and 95 GeV, and E miss T is required to be larger than 30 GeV. The dilepton invariant mass is required to be higher than 15 GeV to suppress events from J/ψ, Υ and γ * decays. Finally, to suppress photons radiated from lepton(s), the ∆R between the selected photon and lepton(s) must be greater than 1.0. The event selection is summarized in Table 1.
There are four types of backgrounds to the selected ttγ candidates, three of which are events with a misidentified object. The contribution from events with the selected photon being a misidentified jet or a non-prompt photon from hadron decays, referred to as hadronic-fake background, is estimated following the method outlined in Section 6.1. The contribution from events with the selected photon being a misidentified electron, referred to as electron-fake background, is estimated following the method outlined in Section 6.2. The contribution from events with the selected lepton being a misidentified jet or non-prompt lepton from heavy-flavour decays, referred to as fake-lepton background, is estimated following the method outlined in Section 6.3. Finally, the contribution from events with a prompt photon (excluding the ttγ signal and the fake-lepton background with prompt photon radiation), referred to as prompt-photon background, is estimated following the method outlined in Section 6.4. In the single-lepton channel, the main backgrounds are from events with a hadronic-fake or electron-fake photon and W γ production, while in the dilepton channel, Zγ production and events with a hadronic-fake photon are the dominant backgrounds.
A total number of 11 662 and 902 candidate events are selected for the single-lepton and dilepton channels, respectively, with expected numbers of 6490 ± 420 and 720 ± 34 signal events, where the corresponding NLO k-factors are applied and the uncertainties include the simulation statistical uncertainty and all systematic uncertainties introduced in Section 8. Table 2 summarizes the observed data and the expected event yields for the signal and background processes. Figures 1 and 2 show the distributions of the observables before the likelihood fit (pre-fit) that are used for the measurement of the normalized differential cross-sections in the single-lepton and dilepton channels, respectively. All data-driven corrections and systematic uncertainties are included in the table and plots, and the signals are scaled by the NLO k-factors. The observed data and the expected event yields for the signal and backgrounds in the single-lepton and dilepton channels. All data-driven corrections and systematic uncertainties are included. The signals are scaled by the NLO k-factors. The fake-lepton background in the dilepton channel is negligible, represented by a "-". The Zγ (W γ) background in the single-lepton (dilepton) channel is included in "Other prompt". The uncertainty of the W γ background in the single-lepton channel is not given since the normalization of this background is a free parameter in the likelihood fit.

Multivariate analysis
To discriminate the ttγ signal from backgrounds, a neural-network algorithm, called the event-level discriminator (ELD), is trained separately for the single-lepton and dilepton channels. Given the significant contribution of hadronic-fake photons in the single-lepton channel, a dedicated neural network, referred to as the prompt-photon tagger (PPT) in the following, is trained to discriminate between prompt photons and hadronic-fake photons. The PPT is used as one of the inputs to the ELD in the single-lepton channel.
Both neural-network algorithms are feedforward binary classifiers that have been trained using Keras [46] and evaluated using [47]. Theano [48] is used as backend. The inputs variables are normalized to have a standard deviation of 1 and a mean of 0. To reduce the risk of over-training, regularization methods such as dropout [49] and batch normalization [50] layers are used. Additionally, k-fold cross-validation is performed.
Six variables which characterize the photon candidate shower shape in the ECAL and the energy leakage fraction into the HCAL are used in the PPT: R had , R η , R φ , w η 2 , w s3 , and F side . These are the standard discriminating variables used in ATLAS for photon identification and their definitions are given in the Appendix. Prompt photons from simulated QCD-Compton processes and hadronic-fake photons from simulated dijet events are used as signal and background photons in the training and testing of the PPT. Photons are required to pass the Tight identification and have p T > 25 GeV and |η clu | < 2.37, excluding the calorimeter transition region. The PPT shape of the prompt photons in simulation is corrected to match data in photon p T -η bins. The correction factors for each bin are extracted from the ratio between the PPT distribution in data and that of simulation, using photons in a Z → γ control region. The control region is defined by requiring exactly one photon and two opposite-sign leptons, with the invariant mass of the lepton pair between 60 and 100 GeV. The resulting correction factors range from 0.5 to 2.0 and are in general around one. PPT systematic uncertainties are evaluated separately for prompt photons and fake photons and are discussed in Section 8.3. The PPT distribution after event selection in the single-lepton channel is shown in Figure 3. The shape difference between data and prediction of the PPT is caused by  in the dilepton channel after event selection and before likelihood fit. All data-driven corrections and systematic uncertainties are included. Overflow events are included in the last bin. In particular, events with |∆η( , )| > 2.5 are included in the last bin of (d).
the shape difference between data and simulation of the input discriminating variables and is covered by the assigned systematic uncertainties. Simulated signal and background events passing the event selection are used for the training and testing of the ELD, except for the fake-lepton background in the single-lepton channel which is taken from data as described in Section 6.3. In the dilepton channel, the selection criteria on the E miss T , the invariant masses, and the jet multiplicity are removed to increase the sample size for training. The training takes 15 (7) variables as input for the single-lepton (dilepton) channel, which are summarized in Table 3. The b-tagging related variables are important for the ELD training in both channels, because of the significant amount of background without any heavy flavor jets. The use of the PPT as input to ELD improves the discrimination power against hadronic-fake background.
Variables like the dilepton invariant mass and missing transverse energy are useful for the ELD training in the dilepton channel, due to the dominant background of Zγ. The distributions of the ELD after event selection are shown in Figure 4 for the single-lepton and dilepton channels. The shapes of the ELD are compared between signal and total background in Figure 5 for the single-lepton and dilepton channels. In the single-lepton channel, the kinematic properties and jet flavour compositions are similar between the ttγ signal and the dominating background, which is tt production with a hadronic-fake or electron-fake photon. In the dilepton channel, this is not the case since Zγ production is dominant. Thus the ELD is more discriminating in the dilepton channel than in the single-lepton channel. The ELD is used in the likelihood fit to data to extract the fiducial cross-sections.

Fiducial region definition
The fiducial region of the analysis is defined at particle level in a way that mimics the event selection in Section 5.2. Leptons must have p T > 25 GeV and |η| < 2.5 and must not originate from hadron decays. Table 3: Input variables for the event-level discriminator for the single-lepton and dilepton channels. For events without the 5th jet, the p T ( j 5 ) is set to zero.

Variable Description
Single lepton Dilepton PPT Prompt-photon tagger output H T Scalar sum of the p T of the leptons and jets m(γ, ) Invariant mass of the system of the photon and the lepton E miss T Missing transverse energy m T W Reconstructed transverse mass of the leptonically decaying W-boson Invariant mass of the system of the two leptons Photons not from hadron decays and in a ∆R = 0.1 cone around a lepton are added to the lepton before the lepton selection. Photons are required to have p T > 20 GeV and |η| < 2.37 and must not originate from hadron decays or be used for lepton dressing. The photon isolation computed from the ratio of the scalar sum of the transverse momentum of all stable2 charged particles around the photon over its transverse momentum must be smaller than 0.1. Jets are clustered using the anti-k t algorithm with R = 0.4 using all final state particles excluding non-interacting particles and muons that are not from hadron decay. Jets must have p T > 25 GeV and |η| < 2.5. A ghost matching method [51] is used to determine the flavour of the jets, with those matched to b-hadrons tagged as b-jets. A simple overlap removal is performed: jets within ∆R < 0.4 of a selected lepton or photon are removed. For events in the single-lepton (dilepton) channel, exactly one photon and exactly one lepton (two leptons) are required. At least four (two) jets are required with at least one of them b-tagged. Events are rejected if there is any lepton and photon pair satisfying ∆R(γ, ) < 1.0. The acceptance for the generated signal events to pass the fiducial selection of the single-lepton (dilepton) channel is 8.2% (0.96%).

Hadronic-fake background
The hadronic-fake background is an important background in this analysis. Its main source is the tt process, where one of the final state jets is reconstructed and identified as a photon. In addition, there are small contributions from W+jets and single top processes to the single-lepton channel and from Z+jets events to the dilepton channel.
The hadronic-fake background is estimated using all the simulation samples by requiring that the selected photon is a hadron or from a hadronic decay at generator level. A data-driven method, called the ABCD method, is applied to derive a set of scale factors, based on the ratio of hadronic-fake background estimated by the method over the one from simulation. This set of scale factors is derived in the single-lepton channel and applied to both the single-lepton and dilepton channels to calibrate the simulation to match data.
In the ABCD method, the isolation selection and part of the Tight identification criteria of the photon, which are assumed to be uncorrelated, are inverted to define three regions enriched with hadronic-fake photons. These regions are orthogonal to one another, and to the signal region. Region A uses photons that pass the isolation selection defined in Section 5.1 but fail at least two out of the four identification requirements on the discriminating variables F side , w s3 , ∆E, and E ratio (defined in the Appendix), while passing all other Tight identification criteria. These four variables describe the shower shape in the first layer of ECAL and are chosen for their small correlation with the photon isolation but strong discrimination power between prompt and hadronic-fake photons. Region B uses photons that fail the identification criteria as in region A and do not pass the isolation selection. Additionally, the sum of the p T of all tracks within ∆R = 0.2 around the photon is required to be larger than 3 GeV to further suppress the prompt-photon contribution. Region C selects photons that fail the isolation requirements as in region B but pass the Tight identification. Region D is the signal region.
The hadronic-fake background in the signal region can be expressed as: where N h-fake A, data , N h-fake B, data and N h-fake C, data are the numbers of hadronic-fake events in regions A, B, and C, estimated by subtracting the events with prompt photons and other backgrounds from the number of data events in these regions, and N h-fake A, MC , N h-fake B, MC , N h-fake C, MC and N h-fake D, MC are the numbers of hadronic-fake events predicted by simulation in regions A, B, C, and D. The factor θ MC corrects for possible bias caused by residual correlation between the isolation variables and the four discriminating variables used to define the regions.
The ABCD method is applied to photons in different p T -η bins, separately for converted and unconverted photons. The resulting scale factors range from 0.8 to 3.2, with typical values around 1.5 and large statistical and systematic uncertainties of more than 0.5. These scale factors are applied to the MC-based hadronic-fake background prediction.

Electron-fake background
Dilepton events where an electron is mis-identified as a photon contribute to the electron-fake background in the single-lepton channel. Its main source is the tt dileptonic decay process. When the lepton is an electron, there is also some contribution from the Z → ee process. A data-driven method is applied to derive a set of scale factors for correcting the electron-to-photon fake rate in simulation to match data.
The electron-to-photon fake rate is measured with a tag-and-probe method using two control regions (CR), exploiting the Z → ee process. For the first CR, events with an electron-photon pair with an invariant mass in the range [40,140] GeV and an opening angle greater than 2.62 rad are selected. The electron is called the tag electron and the photon is called the probe photon. Non-Z-boson backgrounds are subtracted with a sideband fit of the invariant mass distribution. The fitted signal contains Z → eeγ contributions with one of the electrons not reconstructed or identified which is subtracted using simulation. For the second CR, events with an electron-positron pair satisfying the same requirements as in the first CR are selected and the same procedure is applied. The fake rate is calculated as the ratio of the number of probe photons over the number of probe electrons. To avoid a trigger bias, the tag electron in both CRs must match the single-lepton trigger. A set of p T -η binned fake-rate scale factors is defined by taking the ratio between the fake rate in data and in the simulation. The values of the scale factors range from 0.8 to 2.1 and are in general consistent with one within their uncertainties.
The electron-fake background in the single-lepton channel is validated in a control region selected by replacing the photon in the signal region event selection with an electron. This region is dominated by tt events and Z+jets events in the single electron channel with negligible contribution from other processes. The ratio of data over prediction in this region is 0.98 ± 0.01, where the uncertainty is due to the sample size. This overall correction is applied to the electron-fake background predicted by simulation in the single-lepton channel signal region, in addition to the fake-rate scale factors.
The electron-fake background in the dilepton channel is very small. Simulation is used to predict its contribution. No dedicated control region is selected to validate this background.

Fake-lepton background
In the single-lepton channel, the fake-lepton background is dominated by multi-jet processes with an additional photon which could either be a prompt or a fake photon. It is estimated directly from data by using a matrix method [52]. The number of background events in the signal region is evaluated by applying efficiency factors (fake lepton and real lepton efficiencies) to the number of events satisfying a tight (identical to the signal selection) as well as a looser lepton selection. The fake-lepton efficiency is measured using data in control regions dominated by multi-jet background with the real lepton contribution subtracted using simulation. The real lepton efficiency is extracted from a tag-and-probe technique using leptons from Z boson decays. The efficiencies are parametrized as a function of the lepton η and m T W (the lepton p T and m T W ) when the lepton is an electron (muon).
In the dilepton channel, the contamination of background processes with at least one fake lepton is estimated by selecting same-sign dilepton events in data, after subtracting events with two same-sign prompt leptons, using simulation. The fake-lepton background in the dilepton channel is found to be negligible.

Prompt-photon background
All background processes to tt production are also background to ttγ production when accompanied by prompt-photon radiation. These processes include W γ, Zγ, and associated production of a photon in single top, diboson, and ttV productions. In the single-lepton channel, W γ is the dominant prompt-photon background, and Zγ and the others are grouped together as "Other prompt." In the dilepton channel, Zγ is dominant, with all others grouped as "Other prompt." Background from tt production with a photon produced in an additional pp interaction in the same bunch crossing has been studied and is found to be negligible.
Validation regions are selected to check the modelling of W γ in the single-lepton channel and Zγ in the dilepton channel. The Zγ validation region is selected by requiring exactly one b-tagged jet and the invariant mass of the system of the two leptons in a mass window of [60,100] GeV. The W γ validation region is selected with the same event selection as for the signal region of the single-lepton channel, with the following modifications: the number of jets must be either two or three; exactly one b-jet is required; the E miss T is required to be larger than 40 GeV; the ELD value must be smaller than 0.04; and the invariant mass of the system of the lepton and photon is required to be smaller than 80 GeV if the lepton is an electron. The modelling of W γ is also checked in a light-flavour validation region requiring zero b-jet and without the ELD cut.
The normalization of the W γ background is treated as a free parameter of the likelihood fit in the single-lepton channel, since this background is well separated from the ttγ signal by the ELD and the uncertainty of its theoretical prediction is large. The shape of W γ is taken from simulation and checked in the validation region to ensure good modelling. The normalization and shape of the Zγ background in the dilepton channel as well as other prompt backgrounds in both channels are predicted by simulation.

Analysis strategy
The analysis is performed in two parts, one being the measurement of the fiducial cross-section and the other the measurement of normalized differential cross-sections. Both parts share the same strategy for the estimation of backgrounds and systematic uncertainties. In the fiducial cross-section measurement, the ELD is fitted and the post-fit background yields and systematic uncertainties are used. In the normalized differential cross-section measurements, no fit is performed, except for the determination of the W γ contribution in the single-lepton channel, where a systematics-free ELD fit is performed.

Fiducial cross-section
The fiducial cross-section is extracted using a profile likelihood fit to the ELD distribution. The parameter of interest, the fiducial cross-section σ fid , is related to the number of signal events in bin i of the ELD as where L is the integrated luminosity, C is the correction factor for the signal efficiency and for migration into the fiducial region, and f ELD i is the fraction of signal events falling into bin i of the ELD. The correction factor C is defined as N s,sel.
MC /N s,fid MC , where N s,sel. MC is the simulated number of signal events passing the event selection described in Section 5.2 and N s,fid MC is the corresponding number of signal events generated in the fiducial region defined in Section 5.4. The value of C is 0.36 (0.30) for the single-lepton (dilepton) channel, with negligible statistical uncertainty.
A likelihood function is defined from the product over all bins of the ELD distribution: where N obs i , N s i , and N b i are the observed number of events in data, the predicted number of signal events, and the estimated number of background events in bin i of the ELD, which form a Poisson term P in that bin. Nuisance parameter θ t is to parameterize a systematic uncertainty t, which is constrainted by a Gaussian G(0|θ t , 1), and the collection of all the systematic uncertainties is denoted as ì θ. For systematic uncertainties related to the finite number of MC events, the Gaussian terms in the likelihood are replaced by Poisson terms. Each systematic uncertainty affects N s i and N b i in each bin of the ELD. The cross-section is measured by profiling the nuisance parameters and maximizing this likelihood.

Normalized differential cross-sections
An unfolding procedure is applied to the observed detector-level distribution of a given observable, with backgrounds subtracted, to derive the true distribution of the signal at particle level, from which the differential cross-section as a function of the observable is calculated. The differential cross-section is normalized to unity.
The differential cross-section is given by The indices j and k indicate the bin of the observable at detector and particle levels, respectively. The variables N obs j and N b j are the number of observed events and of estimated background events in bin j at detector level, respectively. The efficiency k is the fraction of signal events generated at particle level in bin k of the fiducial region to be reconstructed and selected at detector level and have the objects, that are used to define the observable to be unfolded, matched between reconstruction and particle-levels with ∆R < 0.1. The migration matrix M k j expresses the probability for an event in bin k at particle level to end up in bin j at detector level, calculated from events passing both the fiducial region selection and the event selection, as well as the above matching procedure. The outside-migration fraction f out, j is the fraction of signal events generated outside the fiducial region but reconstructed and selected in bin j at detector level or events failing the above matching. The signal MC sample is used to determine k , f out, j , and M k j , the values of which are illustrated in Figure 6, using the photon p T in the single-lepton channel as an example. The normalization and the corresponding uncertainty of the W γ contribution in the single-lepton channel are taken from the likelihood fit introduced in Section 7.1 but without systematic uncertainties included. The normalized differential cross-section is where the sum is over all the bins of the observable.
The inversion of the migration matrix M k j is performed using the iterative Bayesian method [53] implemented in the R U package [54]. The method relies on the Bayesian probability formula to invert the migration matrix, starting from a given prior of the particle-level distribution and iteratively updating it with the posterior distribution. The binning choices of the unfolded observables take into account the detector resolution and the expected statistical uncertainty, with the latter being the dominating factor. Three iterations are chosen which give a good convergence of the unfolded distribution and a statistically stable result. Tests are performed, using simulation, to verify that the unfolding procedure does not bias the results. The results are cross-checked with other unfolding methods, which give consistent results. The chosen observables to unfold are the photon p T and |η| and the ∆R between the photon and the closest lepton for both single-lepton and dilepton channels and the ∆φ and |∆η| between the two leptons for the dilepton channel. The kinematic properties of the photon are sensitive to the tγ coupling, while the dilepton ∆φ is sensitive to the tt spin correlation. The normalized differential cross-sections are measured, since the overall signal normalization is given by the measured fiducial cross-section.

Systematic uncertainties
Signal and background modelling and experimental uncertainties in the analysis are described in this section, as well as the PPT systematic uncertainty. They affect the normalization of signal and background and/or the shape of their corresponding distributions, such as the ELD and the observables to be unfolded. Each of the signal and background modelling uncertainties is correlated between different channels for the relevant signal or background process. Each of the experimental uncertainties is correlated between signal and simulated backgrounds and between different channels. The PPT systematic uncertainty is separately studied for the prompt, electron-fake, and hadronic-fake photons.

Signal modelling uncertainties
The signal modelling uncertainties include the uncertainties due to the choice of the QCD scales, the parton shower, the amount of ISR and FSR, and the PDF set. Their effects on the corrections defined in Section 7 (both for the fiducial and normalized differential cross-section measurements) as well as on the shape of the ELD distributions are evaluated.
To study the QCD scale uncertainty, the renormalization and factorization scales are varied up and down by a factor of two from their nominal choices independently or simultaneously. The largest variation of the corrections or the shapes is assigned as the uncertainty. To evaluate the parton shower uncertainty, P 8 and H 7 both interfaced to MG5_ MC are compared. The ISR/FSR uncertainty is studied by comparing the variations of the A14 tune parameters of P 8 with its nominal values. The PDF uncertainty is evaluated using the standard deviation of the distribution formed by the 100 eigenvector set of the NNPDF set [23].

Background modelling uncertainties
The systematic uncertainties on the hadronic-fake background due to background subtraction in the hadronic-fake control regions A, B, and C are estimated by varying up and down the signal by 100%, the other MC-based backgrounds by 50%, and the other data-driven backgrounds by their estimated uncertainties, separately. The statistical uncertainties in the three data control regions are also considered. Systematic uncertainties arising from the correction factor θ MC , the extrapolation of the hadronic-fake scale factors, and the shapes of the distributions of the ELD and the observables to be unfolded are estimated using tt samples rather than all of the simulated hadronic-fake samples, since tt is the dominant source of the hadronic-fake background. The uncertainty due to the rate of additional QCD radiation is estimated by comparing the samples with enhanced/reduced parton shower radiation as described in Section 3 with the nominal sample, and the uncertainty due to the modelling of the generator and parton shower is estimated by comparing P +P 8 with S .
The systematic uncertainties on the electron-fake background mainly come from the sideband fit when measuring the fake rate, which is estimated by varying the fit parameters within their uncertainties. The uncertainty due to Z → eeγ subtraction is also considered by replacing the Zγ MC sample by the Z+jets sample, where the photon radiation is described by the parton shower. Uncertainties of the shapes of the distributions are evaluated using tt systematic variations MC samples as for the hadronic-fake background.
In the dilepton channel where the electron-fake background is very small, a 50% uncertainty is assumed to cover a possible mis-modelling in the estimate.
For evaluating the systematic uncertainty of the fake-lepton background in the single-lepton channel, several alternative parameterizations of the real and fake efficiencies of the matrix method are studied and two predicting larger and smaller yields are selected as up and down variations. The lepton η, b-jet multiplicity, and m T W (the lepton η, minimum ∆R between the lepton and the closest jet, and the jet p T ) parameterization is used as up (down) variation when the lepton is an electron. The jet p T and b-jet multiplicity (the lepton p T , η, and minimum ∆R between the lepton and the closest jet) parameterization is used as up (down) variation when the lepton is a muon. The resulting uncertainties are around 50%, and given that this background contribution is relatively small, no additional systematic uncertainties are considered.
The uncertainty on the V γ background shape is studied by varying the renormalization and factorization scales up and down by a factor of two from their nominal values independently or simultaneously and then choosing the maximum shape distortions as the QCD scale uncertainties. For the Zγ background in the dilepton channel, S is compared with MG5_ MC+P 8 to evaluate the shape uncertainty due to the choice of generator and parton shower. No shape modelling uncertainty is assigned to the other small prompt backgrounds. Apart from the W γ background in the single-lepton channel whose normalization is a free parameter in the likelihood fit, a normalization uncertainty of 50% is assigned to each source of the prompt-photon background, included in Table 2 as "Zγ" and "Other prompt".

Experimental uncertainties
Experimental systematic uncertainties affect the normalization and shape of the simulated signal and background samples. For MC-based backgrounds calibrated to data using data-driven techniques, only the shape variations are considered.
The photon identification and isolation efficiencies as well as the efficiencies of the lepton reconstruction, identification, isolation, and trigger in the MC samples are all corrected with data-driven scale factors as mentioned in Section 5.1. These scale factors, which are p T and η dependent, are varied to study their impacts on the final results. Similarly, the data-based corrections to the lepton and photon momentum scale and resolution in simulation are varied within their uncertainties [38,55].
The PPT systematic uncertainty is evaluated separately for prompt photons and hadronic-fake photons. The data-driven PPT scale factors as mentioned in Section 5.1 are turned on and off to assign a PPT systematic uncertainty for the prompt photon. The resulting uncertainty is also assigned to the electron-fake photon PPT distribution, since its shape and shape difference between data and simulation are similar to that of the prompt photons. The maximum PPT shape difference between data and prediction in the hadronic-fake control region C of Section 6.1, with the expected signal contamination in this region varied by ±50%, is used to estimate the hadronic-fake PPT uncertainty. The hadronic-fake photons in region C are non-isolated while those of the signal region are isolated. To account for a possible underestimation of the systematic uncertainty caused by this difference, the shape differences between data and prediction in the isolated hadronic-fake control region A of Section 6.1 are considered as an additional PPT systematic uncertainty. The PPT shape uncertainties are estimated in photon p T and η bins.
The jet energy scale (JES) uncertainty is derived using a combination of simulations, test beam data and in situ measurements [56][57][58]. Additional contributions from jet flavour composition, η-intercalibration, punch-through, single-particle response, calorimeter response to different jet flavours, and pile-up are taken into account, resulting in 21 uncorrelated JES uncertainty subcomponents. The jet energy resolution (JER) in simulation is smeared up by the measured JER uncertainty [59]. The uncertainty associated with the JVT cut is obtained by varying the efficiency correction factors. The b-tagging weights used for jet flavour tagging are corrected by data-driven scale factors, which are separately measured for b-jets, c-jets, and light-flavour jets [44,60]. The three sets of scales factors are varied by their measured uncertainties.
The uncertainties associated with energy scales and resolutions of photons, leptons and jets are propagated to the E miss T . Additional uncertainties originate from the modelling of its soft term [61]. The uncertainty in the combined 2015+2016 integrated luminosity is 2.1%. It is derived, following a methodology similar to that detailed in Ref. [62], and using the LUCID-2 detector for the baseline luminosity measurements [63], from calibration of the luminosity scale using x-y beam-separation scans.
The uncertainty associated to the modelling of pile-up in the simulation is assessed by varying the reweighting of the pile-up in the simulation within its uncertainties.

Systematic uncertainties for unfolding
Systematic uncertainties for unfolding arise from the detector response description, signal modelling, and background modelling. The systematic uncertainties due to background modelling and the detector response are evaluated by varying the input detector-level distributions, unfolding them with corrections based on the nominal signal sample, and comparing the resulting unfolded distributions to the nominal one.
The systematic uncertainties due to signal modelling are evaluated by varying the signal corrections, with which the nominal input detector-level distributions are unfolded, and comparing the resulting unfolded distributions to the nominal one. The statistical uncertainties of the signal and background MC samples are also considered. The covariance matrix C i j for each of these systematic uncertainties is estimated as σ i × σ j , where σ i and σ j are the symmetrized uncertainties for bin i and bin j of the unfolded distribution.
The covariance matrix for the statistical uncertainty of data is calculated by the unfolding algorithm [54].

Fiducial cross-sections
The fiducial cross-section is extracted via a binned maximum likelihood fit to the ELD distribution in data as described in section 7.1. The measured cross-sections are σ SL fid = 521 ± 9(stat.) ± 41(sys.) fb and σ DL fid = 69 ± 3(stat.) ± 4(sys.) fb , for the single-lepton and dilepton channels, respectively, and agree well within uncertainties with the corresponding predicted cross-sections of 495 ± 99 fb and 63 ± 9 fb. The ELD distributions after the fit (post-fit) are shown in Figure 7 for the single-lepton and dilepton channels. The corresponding event yields are summarized in Table 4, including all the systematic uncertainties. Compared to pre-fit (Table 2), the signal event yields are higher, while some of the background event yields are lower. Some of the systematic uncertainties are moderately constrained, e.g. the parton shower uncertainty of the ttγ and tt modelling and the PPT shape uncertainty of prompt photons.
The fiducial cross-sections in each of the individual channels (e+jets, µ+jets, ee, eµ, and µµ) as well as a combined single-lepton and dilepton cross-section are also measured. The former are measured by fitting the ELD distribution in each individual channel separately, while the latter is measured by fitting them simultaneously, sharing the same signal strength parameter µ = σ ttγ /σ NLO ttγ , which scales coherently the fiducial cross-sections of each channel. A comparison of all measurements with the predictions is shown in Figure 8.
All the systematic uncertainties introduced in Section 8 are grouped into a smaller set of classes and summarized in Table 5 for the single-lepton and dilepton channels. The effect of each group of uncertainties is calculated from the quadratic difference between the relative uncertainty in the measured fiducial cross-section with this group of uncertainties included or excluded from the fit with corresponding nuisance parameters fixed to their fitted values. In the single-lepton channel, the jet-related and background modelling systematic uncertainties are dominant, followed by the PPT and signal modelling systematic uncertainties. In the dilepton channel, the data statistical uncertainty is the leading contribution, followed by the signal and background modelling systematic uncertainties. The luminosity and pile-up uncertainties are also important in this channel.  The observed data and post-fit event yields for the signal and backgrounds in the single-lepton and dilepton channels. All data-driven corrections and systematic uncertainties are included. The fake-lepton background in the dilepton channel is negligible, represented by a "-". The Zγ (W γ) background in the single-lepton (dilepton) channel is included in "Other prompt."  Table 5: Summary of the effects of the groups of systematic uncertainties on the fiducial cross-section in the single-lepton and dilepton channels. Due to rounding effects and small correlations between the different sources of uncertainty, the total systematic uncertainty is different from the sum in quadrature of the individual sources.

Normalized differential cross-sections
The normalized differential cross-sections are shown in Figures 9 and 10, for the single-lepton and dilepton channels, respectively. They are compared to the nominal ttγ sample (MG5_ MC+P 8) and the samples with variations of the P 8 A14 tune parameters and the alternative parton shower model of MG5_ MC+Herwig 7. In addition, a comparison with the nominal tt P +P 8 MC sample where prompt-photon radiation is modelled in the parton shower is included. All ttγ samples predict very similar shapes and describe the data well. A small deviation from the prediction is observed in the dilepton ∆φ distribution, where the leptons in the prediction are more back-to-back than in data. The deviation of data from the prediction is 1.5 standard deviations, based on the χ 2 calculated according to the procedure described in what follows. The P +P 8 tt sample gives an improved agreement with data compared to the nominal and varied ttγ samples, although the overall agreement is still poor. It can also be seen from Figures 9(a) and 10(a) that the photons generated by P 8 have a softer p T spectrum than in data.
The systematic uncertainties of the unfolded distributions are decomposed into the signal modelling uncertainty, experimental uncertainty, and background modelling uncertainty in both channels. In the single-lepton (dilepton) channel, the background modelling uncertainty is split into tt (Zγ) and the others. These decomposed uncertainties are illustrated in Figures 11 and 12 for the single-lepton and dilepton channels, respectively. For the single-lepton channel, the systematic uncertainty is dominated by the tt modelling, which is used to model the shapes of the hadronic-fake and electron-fake backgrounds. For the dilepton channel, the systematic uncertainty is dominated by the Zγ modelling, mostly from the comparison between S and MG5_ MC+P 8 generators. Because the unfolding is performed with the distributions before the fit, the background modelling uncertainties are not constrained as in the case of fiducial cross-section measurement where a fit to ELD distribution is performed, and thus they have a much larger impact on the result.
The differences between the unfolded and the predicted distributions are quantified by the chi-squared per degree of freedom χ 2 /ndf, where the χ 2 is where σ norm data and σ norm pred. are the unfolded and predicted normalized differential cross-sections, C jk is the covariance matrix of σ norm data , and j and k are the binning indices of the distribution. For normalized differential cross-sections, the last bin of the above formula is removed from the χ 2 calculation and ndf is reduced by one since this bin gives redundant information. The total correlation matrix is shown in Table 6, taking the ∆φ( , ) in the dilepton channel as an example. There is moderate correlation, either positive or negative, between different bins of the unfolded ∆φ( , ) distribution. The calculated χ 2 /ndf values and their corresponding p-values are summarized in Tables 7 and 8, quantifying the compatibility between data and each of the predictions.          Table 8: χ 2 /ndf values and p-values between the measured normalized differential cross-sections and predictions from several generators in the dilepton channel.

Conclusions
Fiducial cross-sections of top-quark pair production in association with a photon are measured in the single-lepton and dilepton decay channels of the top-quark pair using 36.1 fb −1 of 13 TeV pp collision data collected in 2015 and 2016 by the ATLAS detector at the LHC. The normalized differential cross-sections are measured as a function of the photon p T and |η|, and the ∆R between the photon and the closest lepton for both channels, and the |∆η| and ∆φ between the two leptons for the dilepton channel.
In both channels, the measured fiducial cross-sections agree well with the NLO SM predictions within uncertainties. The measured normalized differential cross-sections also agree well with the LO ttγ prediction and the NLO tt prediction, where the photon comes from the parton shower. The largest disagreement between data and LO ttγ prediction is observed in the distribution of the azimuthal opening angle between the two leptons in the dilepton channel, which is sensitive to tt spin correlation, while NLO tt sample provides an improved agreement with data in this variable.
The definitions of the discriminating variables used in ATLAS photon identification are given in Table 9. Name Description Hadronic leakage R had or R had 1 Transverse energy leakage in the hadronic calorimeter normalized to transverse energy of the photon candidate in the ECAL. In the region 0.8 ≤ |η| ≤ 1.37, the entire energy of the photon candidate in the HCAL is used (R had ), while in the region |η| < 0.8 and |η| > 1.37 the energy of the first layer of the HCAL is used (R had1 ). Energy ratios and width in the second layer of ECAL R η Energy ratio of 3 × 7 to 7 × 7 cells in the η × φ plane. R φ Energy ratio of 3 × 3 to 3 × 7 cells in the η × φ plane. w η 2 Lateral width of the shower, using a window of η × φ = 3 × 5 cells.

Energy ratios and widths in the first (strip) layer of ECAL w s3
Shower width along η, using 3 strips around the largest energy deposit. w s tot Shower width along η, using 20 × 2 strip cells in the η × φ plane. F side Energy outside the 3 central strips but within 7 strips, normalized to the energy within the 3 central strips. E ratio Ratio between difference of the first and second energy maximum divided by their sum (E ratio = 1 if there is no second maximum).

∆E
Difference between the second energy maximum and the minimum found between first and second maximum (∆E = 0 if there is no second maximum).
The shapes of the PPT of the prompt, hadronic-fake, and electron-fake photons are compared in Figure 13.   , from one to five, represents the b-tagging efficiencies of > 85%, 77-85%, 70-77%, 60-70%, and < 60%. All data-driven corrections and systematic uncertainties are included. Figure 15 shows the post-fit distributions of four important variables for the training of the ELD in the dilepton channel.    Figure 17 shows the distributions of the efficiency, outside fraction, and migration matrix for the dilepton azimuthal angle in the dilepton channel, which are the inputs for unfolding, as described in Section 7.2.   [6] CDF Collaboration, Evidence for ttγ production and measurement of σ ttγ /σ tt , Phys. Rev. D 84 (2011)